>PF07520#Virulence protein SrfB Length = 1041 Score = 29.2 bits (65), Expect = 0.037 Identities = 12/74 (16%), Positives = 26/74 (35%) Query: 310 HLRYSQWRHRCMSSNSKDAYKDLVRAVDNWHVEIFNYFDKRLTNAYTESINSIIRQVERM 369 W + + ++ DL V +W E+F F + + S ++ E Sbjct: 184 DPGAMSWFLQRLEADEDGNAVDLQLWVSDWLKEMFLDFKRAERPGRSISEENLPHMFEHW 243 Query: 370 GRGYSFDALRAKIL 383 R S+ + + + Sbjct: 244 ARYLSYLQVIQRAV 257
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 26.7 bits (59), Expect = 0.049 Identities = 10/84 (11%), Positives = 25/84 (29%), Gaps = 12/84 (14%) Query: 62 LLIGIAFVLLEVYFVKNKRMNKWILPSIILI--------ASIALSI---PFSVTFDASLI 110 +A + V+ W +P +++ +A ++ V F L+ Sbjct: 872 APALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLL 931 Query: 111 -LFQVVVLMGLYIEDLINHNQNRE 133 + + I + +E Sbjct: 932 TTIGLSAKNAILIVEFAKDLMEKE 955
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 121 bits (304), Expect = 2e-35 Identities = 70/257 (27%), Positives = 125/257 (48%), Gaps = 17/257 (6%) Query: 3 RTVLVTGSGRGLGSYIVKALSEKGFNVI-INYNNSKEES-EKLKKEIGSQAIAIQADITD 60 + +TG+ +G+G + + L+ +G ++ ++YN K E K A A AD+ D Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 61 REAVEQLVKKGTEHFGQIDVVVNNALVNFKFDPTTQKAFKDLTYKDYEQQLDGTLKAAFN 120 A++++ + G ID++VN A V + L+ +++E FN Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGV-LRPGLIHS-----LSDEEWEATFSVNSTGVFN 122 Query: 121 VSQSVIPQFLERKDGAIISIGTNLYQNPVVPYHEYTTAKAALIGFTRNVAAELGQHGIRA 180 S+SV ++R+ G+I+++G+N P Y ++KAA + FT+ + EL ++ IR Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182 Query: 181 NVVSGGLLKTT---------DASAVTTPEVFDLIAQSTPLRKVTTPQDVANMVVYLCSEA 231 N+VS G +T + + + PL+K+ P D+A+ V++L S Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242 Query: 232 ADGITGQNITVDGGLTM 248 A IT N+ VDGG T+ Sbjct: 243 AGHITMHNLCVDGGATL 259
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 27.8 bits (62), Expect = 0.028 Identities = 13/30 (43%), Positives = 15/30 (50%) Query: 1 MKVFVFGGNEGAGEHVLKKLAAKGHEAVTI 30 MK V G G HV K+L GH+ V I Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGI 30
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 31.2 bits (70), Expect = 0.004 Identities = 16/76 (21%), Positives = 26/76 (34%) Query: 84 SNETPQNEVTETAQQEDAPQVTEESNEQTQQVAPNTEQQESAPQVTEETQQQPEQNTQQS 143 E P E + +Q T + + NT + P V E+ +P+ ++S Sbjct: 1167 DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRS 1226 Query: 144 EDVQAAAPEQNTESSE 159 E T SS Sbjct: 1227 VRSVPHNVEPATTSSN 1242 Score = 29.6 bits (66), Expect = 0.014 Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 7/87 (8%) Query: 86 ETPQNEVTETAQQ----EDAPQVTEESNEQTQQVAPNTEQQESAPQVTEETQQQPEQNTQ 141 P T + E++ Q ++ + Q T Q +V +E + + NTQ Sbjct: 1025 VPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNR---EVAKEAKSNVKANTQ 1081 Query: 142 QSEDVQAAAPEQNTESSEATGGSTKEQ 168 +E Q+ + + T+++E +T E+ Sbjct: 1082 TNEVAQSGSETKETQTTETKETATVEK 1108
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.0 bits (67), Expect = 0.008 Identities = 18/93 (19%), Positives = 33/93 (35%), Gaps = 18/93 (19%) Query: 35 VILKGASGSGKTTLLSIIGGLLGRSGGEVSL--NGENYLDIKEKA------LTSMRLKEI 86 V+L+G G GK+TL++ + GL S + ++Y I +T+ R + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRADA 658 Query: 87 ----GFIFQSSHLI--PYMKVID----QLTFIG 109 F Y + + Q+ Sbjct: 659 EAVKAFFSSRKDRYRGAYGRYVQDHPRQVVIWC 691
>adhesinb#Adhesin B signature. Length = 310 Score = 29.8 bits (67), Expect = 0.009 Identities = 29/172 (16%), Positives = 62/172 (36%), Gaps = 17/172 (9%) Query: 62 ESGPDEWWNNVVESYEMLKDKGYEKISIGGVSLGGILSLKAAYSLEDINSVVAMSVPQG- 120 E+G + W+ +VE+ + ++K Y +S G ++ L+ + +++ G Sbjct: 94 ETGGNAWFTKLVENAKKKENKDYYAVSEG----VDVIYLEGQSEKGKEDPHAWLNLENGI 149 Query: 121 KDIEDLNKRVVSYIENFMEFVGRSDEEIDEKLKELDEKPMASLPDFEALIDEIHSRLGDI 180 +++ KR+ E ++ + EKL LD++ + I + Sbjct: 150 IYAQNIAKRLSEKDPANKETYEKNLKAYVEKLSALDKEAKEKFNNIPGEKKMI------V 203 Query: 181 SVPLAVKYGGKDAALYEESADHIYEEVASEAKDMKVYPNTGHLMTKGKDKKL 232 + KY K + I E +K L+ K + K+ Sbjct: 204 TSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIK------TLVEKLRKTKV 249
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 107 bits (269), Expect = 7e-30 Identities = 76/261 (29%), Positives = 118/261 (45%), Gaps = 15/261 (5%) Query: 42 VGSEKLKNRKALVTGGDSGIGRAAAIAYAKEGADVAISYLPDEGSDAQEVKAVIEKA-GQ 100 + ++ ++ + A +TG GIG A A A +GA +A D + E KA + Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAV---DYNPEKLEKVVSSLKAEAR 57 Query: 101 KAVLLPGDLRDERFARELVHEAAEKLGGLDILVLNAAIQQFEKDIKNLSTEQLTDTFTVN 160 A P D+RD E+ ++G +DILV A + + I +LS E+ TF+VN Sbjct: 58 HAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGL-IHSLSDEEWEATFSVN 116 Query: 161 IFSNVWMLQEALDHLP--EGGSVVVTTSVQAFQPSGHLSDYAMTKSSQVAFVLAMTQQLA 218 + ++ GS+V S A P ++ YA +K++ V F + +LA Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176 Query: 219 EKGIRINAVSPGPVWTVLQVA-----GGQPQE---SIPEFGQKEPLKRAGQPVELADTYV 270 E IR N VSPG T +Q + G Q S+ F PLK+ +P ++AD + Sbjct: 177 EYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVL 236 Query: 271 LLASDSASYITGQVYGITGGT 291 L S A +IT + GG Sbjct: 237 FLVSGQAGHITMHNLCVDGGA 257
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 47.9 bits (114), Expect = 4e-08 Identities = 67/347 (19%), Positives = 125/347 (36%), Gaps = 24/347 (6%) Query: 60 AAFMGHFVEAKGPRISGLVSTLFFASGMAVAGLAVQLESLILLYFGYGVLGGIGLGIGY- 118 A +G + G R LVS A A+ A L L + G+ G G G Sbjct: 60 APVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAY 119 Query: 119 ---ITPVSTLVKWFPDRRGMATGLAIMGFGFAAMLASPAMEWLIVNVSIAGTFYILAVIY 175 IT + F G + A GFG M+A P + L+ S F+ A + Sbjct: 120 IADITDGDERARHF----GFMS--ACFGFG---MVAGPVLGGLMGGFSPHAPFFAAAALN 170 Query: 176 FVVMIASSLYLERPPEGYEPEGMNLDEKVTAKKDIVQLTANEAVRTRRFYFLWSMLFLNV 235 + + L PE ++ E L + A + + L ++ F+ Sbjct: 171 GLNFLTGCFLL---PESHKGERRPLRRE--ALNPLASFRWARGMTV--VAALMAVFFIMQ 223 Query: 236 TCGIAILAVASPMAQEIAGLSAGAAAVMVGIMGVFNGGGRLVWAS-ISDYIGRPNLYSLF 294 G A+ ++ A + + G+ + + + ++ +G L Sbjct: 224 LVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLG 283 Query: 295 FIIQIALFLLLPSVSHALVFQAMLFVIISCYGGGFSAIPAYIGDIFGTKQLGAIHGYILT 354 I ++LL + + + V+++ G G A+ A + ++ G + G + Sbjct: 284 MIADGTGYILLAFATRGWMA-FPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAA 342 Query: 355 AWAAAGLVGPFISSTVYEAT-QSYTLTLYIFGALFIAALAISILIRG 400 + +VGP + + +Y A+ ++ +I GA L + L RG Sbjct: 343 LTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALY-LLCLPALRRG 388
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 29.8 bits (67), Expect = 0.041 Identities = 11/33 (33%), Positives = 17/33 (51%) Query: 698 IQEGEPIVIYNVNGVFQGFARIADIKAGNIGIQ 730 + EG+ I +YN + + F I DI I +Q Sbjct: 199 MLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231
>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein signature. Length = 104 Score = 31.4 bits (71), Expect = 0.002 Identities = 16/39 (41%), Positives = 23/39 (58%) Query: 181 LFLIVIIRSVTLPGAMEGIKFFLTPDFSLISSEGILYAL 219 FL+ I SV LPG+M + FFL S I S+ ++ A+ Sbjct: 13 AFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAM 51
>PF06580#Sensor histidine kinase Length = 349 Score = 36.0 bits (83), Expect = 4e-04 Identities = 18/103 (17%), Positives = 39/103 (37%), Gaps = 21/103 (20%) Query: 486 IILNLIANGINYTHEGGTIEVSLRENIYEIRLIVTDDGIGIPEESLGRIFERFYRVDKAR 545 ++ N I +GI +GG I + ++ + L V + G + + Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNT--------------- 307 Query: 546 SRHSGGTGLGLAIVKHLIESHKG---RIEIESAEDEGTTITVI 585 TG GL V+ ++ G +I++ + + + +I Sbjct: 308 ---KESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 102 bits (256), Expect = 2e-27 Identities = 35/138 (25%), Positives = 70/138 (50%), Gaps = 2/138 (1%) Query: 3 SILVVDDEPSIVTLLKFNLEQSGYSVLTAEDGNTGLDLALTEQPDLIVLDLMLPGMDGMD 62 +ILV DD+ +I T+L L ++GY V + T DL+V D+++P + D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 63 VCKTLRQEKMNTPILMLTAKDEEFDKILGLELGADDYMTKPFSPREVVARVKAIL--RRS 120 + +++ + + P+L+++A++ I E GA DY+ KPF E++ + L + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 121 QVEALAEKAAEEVFSIGD 138 + L + + + + +G Sbjct: 125 RPSKLEDDSQDGMPLVGR 142
>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature. Length = 1104 Score = 32.4 bits (73), Expect = 0.003 Identities = 17/92 (18%), Positives = 42/92 (45%), Gaps = 15/92 (16%) Query: 170 MYE----DINTILKLLKRYEEDEYKDYLTQLGNIKSFDEE----VNILIDESESLSLLLI 221 MY N + +K + YKDY+ + + +++ ++ L++ ++L + L+ Sbjct: 593 MYNNNMGMFNKMTNYIKNNDVSGYKDYIASMSSDYGLNDKYQDYMDSLLNNIDNLDVPLV 652 Query: 222 DIDNFKVVNDEHSYKSGDAL---IKQMANLLD 250 + + H K + + IK+++N+ D Sbjct: 653 SDEYV----NGHEAKDINEITNDIKEVSNIKD 680
>SECA#SecA protein signature. Length = 901 Score = 1082 bits (2799), Expect = 0.0 Identities = 412/904 (45%), Positives = 568/904 (62%), Gaps = 71/904 (7%) Query: 1 MGILDKVF-DGNKRELRSLRKIAEKVEDYKETMAGLDDASLQGKTDEFKEMLAGAEDDKA 59 + +L KVF N R LR +RK+ + + M L D L+GKT EF+ L E Sbjct: 3 IKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEV--- 59 Query: 60 EEKMLDQILPEAFAVVREASKRTLGLEPYPVQIMGGAALHKGDISEMKTGEGKTLTATMP 119 L+ ++PEAFAVVREASKR G+ + VQ++GG L++ I+EM+TGEGKTLTAT+P Sbjct: 60 ----LENLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLP 115 Query: 120 VYLNALTGKGVHVITVNEYLSATQMEEMSVLYNFLKLTVGLNLNAKNSEEKREAYAADIT 179 YLNALTGKGVHV+TVN+YL+ E L+ FL LTVG+NL + KREAYAADIT Sbjct: 116 AYLNALTGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADIT 175 Query: 180 YTTNNELGFDYLRDNMVTYKKDRVLRGLNYAIIDEVDSILIDEARTPLIISGRANQTNTQ 239 Y TNNE GFDYLRDNM ++RV R L+YA++DEVDSILIDEARTPLIISG A ++ Sbjct: 176 YGTNNEYGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEM 235 Query: 240 YIQANQFVKMLKE-----------DEDFTYDIKTKNIQLNDDGMEKAEKWF-------KV 281 Y + N+ + L + F+ D K++ + L + G+ E+ + Sbjct: 236 YKRVNKIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEG 295 Query: 282 DNLYDVKHVNLLHHINQALKAHFSMQRDTDYVVEEDKIVIVDQFTGRKMKGRRFSDGLHQ 341 ++LY ++ L+HH+ AL+AH RD DY+V++ +++IVD+ TGR M+GRR+SDGLHQ Sbjct: 296 ESLYSPANIMLMHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQ 355 Query: 342 AIEAKEGVEIQNESRTMASITFQNFFRQYNKLSGMTGTAKTEEEEFINIYNMKVTVIPTN 401 A+EAKEGV+IQNE++T+ASITFQN+FR Y KL+GMTGTA TE EF +IY + V+PTN Sbjct: 356 AVEAKEGVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTN 415 Query: 402 LPIAREDRTDKIYSTKDIKFKNVVDEVVERHRNGQPVLIGTVAVETSEYIANLLSKKGIR 461 P+ R+D D +Y T+ K + +++++ ER GQPVL+GT+++E SE ++N L+K GI+ Sbjct: 416 RPMIRKDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIK 475 Query: 462 HNVLNAKNHEREADIIMSAGKKGAVTIATNMAGRGTDIKLG------------------- 502 HNVLNAK H EA I+ AG AVTIATNMAGRGTDI LG Sbjct: 476 HNVLNAKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIE 535 Query: 503 ----------EGVKEAGGLAVIGTERHESRRIDDQLRGRAGRQGDVGVSTFYLSLEDDLM 552 + V EAGGL +IGTERHESRRID+QLRGR+GRQGD G S FYLS+ED LM Sbjct: 536 KIKADWQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALM 595 Query: 553 RRFGSERMQGMMGRLGMQEEE-ITSKMISKAVESSQKRVEGNNFDSRKKLLEYDDVLRRQ 611 R F S+R+ GMM +LGM+ E I ++KA+ ++Q++VE NFD RK+LLEYDDV Q Sbjct: 596 RIFASDRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQ 655 Query: 612 REIIYDERNDIIDQDDVRDQLMGMIEASVERTVNYYILDDD--ELIDYDQFIKTIEDMYL 669 R IY +RN+++D DV + + + E + T++ YI E+ D + +++ + Sbjct: 656 RRAIYSQRNELLDVSDVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFD 715 Query: 670 SDESIE--VPDVRGRENDEIIALILEKVNAELERKEEKLTSEKMRLFERMMMLRTIDQKW 727 D I + + + IL + +RKEE + +E MR FE+ +ML+T+D W Sbjct: 716 LDLPIAEWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLW 775 Query: 728 VEHIDSMDQLRTGIHLRSYGQINPLREYQNEGLQMFEDMLVAIEDDTAKYVLKTELKSDE 787 EH+ +MD LR GIHLR Y Q +P +EY+ E MF ML +++ + + K +++ E Sbjct: 776 KEHLAAMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPE 835 Query: 788 EI---------KREQVIKQNEMQTGDGKEKVKKGPVKK--EIKVGRNDPCPCGSGKKYKN 836 E+ + E++ + ++ D + E KVGRNDPCPCGSGKKYK Sbjct: 836 EVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCGSGKKYKQ 895 Query: 837 CHGQ 840 CHG+ Sbjct: 896 CHGR 899
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 40.8 bits (95), Expect = 8e-06 Identities = 25/116 (21%), Positives = 41/116 (35%), Gaps = 19/116 (16%) Query: 141 VEEAPQTEEAPAAE--EAPQAEEETEDQNTAQAAEVQE------APAVVEEDNSADEQAA 192 VE+ QT + QA+ + N + A V E APA E + + Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENS 1044 Query: 193 EQQAAEQQAAEQRAAEQERIEQREAEQA-----------EAEQEKQEAQAAAPQQT 237 +Q++ + EQ A E + A++A E Q E + +T Sbjct: 1045 KQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTET 1100 Score = 39.7 bits (92), Expect = 2e-05 Identities = 30/128 (23%), Positives = 45/128 (35%), Gaps = 10/128 (7%) Query: 137 AAASVEEAPQTEEAPA--AEEAPQAEEETEDQNTAQAAEVQEAPAVVEEDNSADEQAAEQ 194 A V+EAP APA +E E ++ ++ Q+A ++ ++A Sbjct: 1016 EIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSN 1075 Query: 195 QAAEQQAAEQRAAEQERIE-QREAEQAEAEQEKQEAQAA-------APQQTSNVSGGNAV 246 A Q E + E E Q + A EK+E P+ TS VS Sbjct: 1076 VKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQ 1135 Query: 247 SVAQSVAA 254 S A Sbjct: 1136 SETVQPQA 1143 Score = 37.7 bits (87), Expect = 8e-05 Identities = 24/136 (17%), Positives = 48/136 (35%), Gaps = 11/136 (8%) Query: 124 AGKTLIVSADAAPAAASVEEAPQTEEAPAAEEAPQAEEETEDQNTAQAAEVQEAPAVVEE 183 A + + A S E +T+ E A +EE + + + QE P V + Sbjct: 1072 AKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEE---KAKVETEKTQEVPKVTSQ 1128 Query: 184 DNSADEQAAEQQAAEQQAAEQRAAEQERIEQREAEQAEAEQEKQEAQAAAPQQTSNVSGG 243 + EQ+ Q + A E ++ +++ P + ++ + Sbjct: 1129 VSPKQEQSETVQPQAEPARENDPTVN-------IKEPQSQTNTTADT-EQPAKETSSNVE 1180 Query: 244 NAVSVAQSVAAGKSYV 259 V+ + +V G S V Sbjct: 1181 QPVTESTTVNTGNSVV 1196 Score = 30.4 bits (68), Expect = 0.015 Identities = 16/125 (12%), Positives = 33/125 (26%), Gaps = 4/125 (3%) Query: 130 VSADAAPAAASVEEAPQTEEAPAAEEAPQAEEETEDQNTAQAAEVQEA----PAVVEEDN 185 V++ +P E E + +E + Q A Q A V + Sbjct: 1125 VTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVT 1184 Query: 186 SADEQAAEQQAAEQQAAEQRAAEQERIEQREAEQAEAEQEKQEAQAAAPQQTSNVSGGNA 245 + E A Q + + + + + + S + Sbjct: 1185 ESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDR 1244 Query: 246 VSVAQ 250 +VA Sbjct: 1245 STVAL 1249
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 28.2 bits (63), Expect = 0.042 Identities = 10/31 (32%), Positives = 18/31 (58%) Query: 148 VLITGESGVGKSETALELVKNGHRLVADDNV 178 L+TG +G + L++ GH++V DN+ Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNL 33
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 37.5 bits (87), Expect = 5e-05 Identities = 11/27 (40%), Positives = 15/27 (55%) Query: 1 MNVLITGGTGFIGGKLAEILKEEHDHV 27 M L+TG GFIG +++ L E V Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQV 27
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 30.4 bits (68), Expect = 0.027 Identities = 15/145 (10%), Positives = 44/145 (30%), Gaps = 1/145 (0%) Query: 135 ETMKYQAPIQMAGVFADLLKKADGTVDPSEVEDMEETQEFLEKFEEVMELVKKRNAELKK 194 + +A L+KA D + + + + L+ Sbjct: 177 KIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEG 236 Query: 195 ADEEVKSLKEDITSRYNSKIVGKESSSEKIPESIKSNMADLSKYYPRYLELKNKHEEEDS 254 A + I ++ E+ ++ ++++ M + + L+ + ++ Sbjct: 237 AMNFSTADSAKI-KTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEA 295 Query: 255 EDEESEEELSDEEKEDKEKKRDDEE 279 E + E + + +RD + Sbjct: 296 EKADLEHQSQVLNANRQSLRRDLDA 320
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 45.2 bits (107), Expect = 3e-08 Identities = 26/119 (21%), Positives = 49/119 (41%), Gaps = 8/119 (6%) Query: 5 FIILGIFLCIVFYYDAIKQIIPNWLNVSGAVVGVGYHSLSAGVDGFIQSFGGGLVCGIIL 64 ++L L + + D K ++P+ L + G+ ++ L G + G + ++L Sbjct: 137 ALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFN-LLGGFVSLGDAVIGAMAGYLVL 195 Query: 65 LVLY-VFK------AIGAGDVKLFFAIGTITGILFGLYSIMYSIICAGIIGLLYLLFTR 116 LY FK +G GD KL A+G G ++ S + +G+ +L Sbjct: 196 WSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILLRN 254
>SECYTRNLCASE#Preprotein translocase SecY subunit signature. Length = 437 Score = 35.1 bits (81), Expect = 6e-04 Identities = 21/77 (27%), Positives = 33/77 (42%), Gaps = 9/77 (11%) Query: 190 IKNSGVSIKGDMQYGDTDKYKIKKIKTMPPLNRKEKLYSSLIAIV--LLAVIWSQMPLSS 247 +K G I G T +Y + + + + LY LIA+V + V + S Sbjct: 347 MKKYGGFIPGIRAGRPTAEY-LSYV--LNRITWPGSLYLGLIALVPTMALVGF---GASQ 400 Query: 248 NILLG-TAFLLSVGVAV 263 N G T+ L+ VGV + Sbjct: 401 NFPFGGTSILIIVGVGL 417
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.8 bits (69), Expect = 0.004 Identities = 23/115 (20%), Positives = 44/115 (38%), Gaps = 2/115 (1%) Query: 90 PKSEIREMIENGELKEQSDETAEPSKPNPDETVSGEGEPDETAGGSGSEAISGTGVESSE 149 P + E +Q +T E ++ + ET + E + A + V S Sbjct: 1030 PATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG 1089 Query: 150 SESTSVEPGRPDESSPVHKPEP--TEQKQTQKQPQHKRRNQSRNRKKPNQQKQRQ 202 SE+ + E++ V K E E ++TQ+ P+ + + + Q Q + Sbjct: 1090 SETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAE 1144
>adhesinmafb#Neisseria meningitidis: adhesin MafB signature. Length = 467 Score = 27.3 bits (60), Expect = 0.019 Identities = 14/71 (19%), Positives = 26/71 (36%), Gaps = 4/71 (5%) Query: 48 VNQFLSKGQIVKAKILSVDKHGKLNLTLKENEYFKSEEKKRDRRSVLEQIRETEKYGFES 107 +N F+S G+ + + ++ N E K L + EK E+ Sbjct: 236 LNPFISAGEALGIGDILYGTRYAIDKAAMRNIAPLPAEGKFAVIGGLGSVAGFEKNTREA 295 Query: 108 IRQKMPEWIEE 118 + + WI+E Sbjct: 296 VDR----WIQE 302
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 42.7 bits (100), Expect = 6e-06 Identities = 40/127 (31%), Positives = 62/127 (48%), Gaps = 15/127 (11%) Query: 409 TELDQINRRVMQLEIEEQALKSEDDSVSRNRLEELQKELSEAREAQQALTQRVEKEKAQI 468 +LD QLE E Q L+ E + +S + L+++L +REA++ L +K + Q Sbjct: 316 RDLDASREAKKQLEAEHQKLE-EQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQN 374 Query: 469 Q-----------KVTGKREELDRVRKELEEAENNYE-LEKA-AELRHGRLPSLEKELAEL 515 + + RE +V K LEEA + LEK EL + + EKE AEL Sbjct: 375 KISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLT-EKEKAEL 433 Query: 516 EAQLQEE 522 +A+L+ E Sbjct: 434 QAKLEAE 440 Score = 37.7 bits (87), Expect = 2e-04 Identities = 18/113 (15%), Positives = 41/113 (36%), Gaps = 2/113 (1%) Query: 409 TELDQINRRVMQLEIEEQALKSEDDSVSRNRLEELQKELSEAREAQQALTQRVEKEKAQI 468 + ++ + S L ++ +A + + A+I Sbjct: 189 EARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKI 248 Query: 469 QKVTGKREELDRVRKELEEAENNYELEKAAELRHGRLPSLEKELAELEAQLQE 521 + + ++ L+ + ELE+A A+ ++ +LE E A LEA+ + Sbjct: 249 KTLEAEKAALEARQAELEKALEGAMNFSTADSA--KIKTLEAEKAALEAEKAD 299 Score = 33.9 bits (77), Expect = 0.003 Identities = 23/118 (19%), Positives = 48/118 (40%), Gaps = 17/118 (14%) Query: 403 EMGSNPTELDQINRRVMQLEIEEQALKSEDDSVSRNRLEELQKELSEAREAQQALTQRVE 462 ++ ++ + LE E+ L+ + V + L+++L +REA++ L Sbjct: 275 FSTADSAKIKTLEAEKAALEAEKADLEHQSQ-VLNANRQSLRRDLDASREAKKQL----- 328 Query: 463 KEKAQIQKVTGKREELDRVRKELEEAENNYELEKAAELRHGRLPSLEKELAELEAQLQ 520 +A+ QK+ + + + R+ L + K LE E +LE Q + Sbjct: 329 --EAEHQKLEEQNKISEASRQSLRRDLDASREAKKQ---------LEAEHQKLEEQNK 375 Score = 32.3 bits (73), Expect = 0.009 Identities = 31/106 (29%), Positives = 59/106 (55%), Gaps = 3/106 (2%) Query: 420 QLEIEEQALKSEDDSVSRNRLEELQKELSEAREAQQALTQRVEKEKAQIQKVTGKREELD 479 QLE E Q L+ E + +S + L+++L +REA++ + + +E+ +++ + +EL+ Sbjct: 362 QLEAEHQKLE-EQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELE 420 Query: 480 RVRK--ELEEAENNYELEKAAELRHGRLPSLEKELAELEAQLQEES 523 +K E E+AE +LE A+ +L +ELA+L A +S Sbjct: 421 ESKKLTEKEKAELQAKLEAEAKALKEKLAKQAEELAKLRAGKASDS 466
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 29.9 bits (67), Expect = 0.007 Identities = 8/34 (23%), Positives = 18/34 (52%) Query: 85 DPFSMEQTPMNINNKDIESFLDEAAKDNGETVPV 118 P + E+ + DI+ F++ +K+ +TV + Sbjct: 23 RPAAAEEFSASFKGTDIQEFINTVSKNLNKTVII 56
>PF06580#Sensor histidine kinase Length = 349 Score = 30.2 bits (68), Expect = 0.024 Identities = 21/136 (15%), Positives = 41/136 (30%), Gaps = 12/136 (8%) Query: 103 NTLALTLSIFGGILLLSLLFAFMMSWAGIFDDVFLLVIIISTISLGVVVP---TLKETNL 159 N G + F F + + I IS + L + +K Sbjct: 9 NKYYWYCQGIGWGVYTLTGFGFASLYGSPKLHSMIFNIAISLMGLVLTHAYRSFIKRQGW 68 Query: 160 ITTQMGQIILLVAVIADLVTMIMLALYSQLYADSS-------QPIWLMGILVVFAVLFYF 212 + MGQIIL V ++ M+ + ++ + + + ++F V+ Sbjct: 69 LKLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLALSIIFNVVVVT 128 Query: 213 LG--RVMHHAQFLKQL 226 + F K Sbjct: 129 FMWSLLYFGWHFFKNY 144
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 70.5 bits (172), Expect = 2e-16 Identities = 65/258 (25%), Positives = 100/258 (38%), Gaps = 17/258 (6%) Query: 3 LEGKTYVIMGVANKRSIAWGAARALDQMGAKLVFTILNERFRRELEKLLGELEGDHDIVV 62 +EGK I G A + I AR L GA + N ++ L + E H Sbjct: 6 IEGKIAFITGAA--QGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSL-KAEARHAEAF 62 Query: 63 ECDVQDDAQIESAFREIGEKTGGIDGLLHAIAFAGKDELKGGYSETTREGFKNALDISTY 122 DV+D A I+ I + G ID L++ AG G + E ++ +++ Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNV---AGVLRP-GLIHSLSDEEWEATFSVNST 118 Query: 123 SLTVVAKHAKKIM--NEGGSIVTMTYLGGERAMPNYNVMGVAKAALDSSVRYLAYDLGED 180 + ++ K M GSIVT+ + +KAA + L +L E Sbjct: 119 GVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEY 178 Query: 181 GFRVNAVSAGPIRT-----LSSSAVGEFKSILKEIEE---KAPLRRNVDQLEVGNTVAFL 232 R N VS G T L + G + I +E PL++ ++ + V FL Sbjct: 179 NIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFL 238 Query: 233 LSDLASGITGEVVHVDSG 250 +S A IT + VD G Sbjct: 239 VSGQAGHITMHNLCVDGG 256
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 56.6 bits (136), Expect = 3e-12 Identities = 15/59 (25%), Positives = 30/59 (50%) Query: 12 RQYEIFAAAMAEFGEHGFKKASTNRIVKRAGMSKGMLYYYFDNKQSIFDDALDFALDHI 70 + I A+ F + G S I K AG+++G +Y++F +K +F + + + +I Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70
>SECYTRNLCASE#Preprotein translocase SecY subunit signature. Length = 437 Score = 27.4 bits (61), Expect = 0.014 Identities = 10/45 (22%), Positives = 21/45 (46%) Query: 61 GNRLVMLIFYMVFVFLPAILISVFQNNILLLGSIFVFTIFVYFIV 105 GN + +L+F + P+ L ++ + L G I T+ ++ Sbjct: 187 GNGMSILMFISIAATFPSALWAIKKQGTLAGGWIEFGTVIAVGLI 231
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 49.1 bits (117), Expect = 2e-08 Identities = 40/173 (23%), Positives = 68/173 (39%), Gaps = 6/173 (3%) Query: 251 IMLQGLGVGMLLPVLPTYITSELSLNYFQYTFFILIVFGLVGFSMTVLSRALDTNSVRL- 309 + L +G+G+++PVLP + + N + IL+ L + L S R Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILL--ALYALMQFACAPVLGALSDRFG 71 Query: 310 TFAVICGGFLIYAVGIMWFSTLETIWLIFAIASFIGLSYGIMLPAWNKYLAGTIMQDKSA 369 V+ AV +T +W+++ I + G Y+A D+ A Sbjct: 72 RRPVLLVSLAGAAVDYAIMATAPFLWVLY-IGRIVAGITGATGAVAGAYIADITDGDERA 130 Query: 370 ESWGVISSVQGIGAMIGPALGGLTADLFGTVDATLLASGLIFVLLFVYYAVLF 422 +G +S+ G G + GP LGGL + A A+ + L F+ L Sbjct: 131 RHFGFMSACFGFGMVAGPVLGGLMGGF--SPHAPFFAAAALNGLNFLTGCFLL 181 Score = 34.4 bits (79), Expect = 7e-04 Identities = 20/103 (19%), Positives = 44/103 (42%), Gaps = 2/103 (1%) Query: 313 VICGGFLIYAVGIMWFSTLETIWLIFAIASFIGLSYGIMLPAWNKYLAGTIMQDKSAESW 372 + G + G + + W+ F I + S GI +PA L+ + +++ + Sbjct: 279 ALMLGMIADGTGYILLAFATRGWMAFPIMVLLA-SGGIGMPALQAMLSRQVDEERQGQLQ 337 Query: 373 GVISSVQGIGAMIGPALGG-LTADLFGTVDATLLASGLIFVLL 414 G ++++ + +++GP L + A T + +G LL Sbjct: 338 GSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLL 380
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 218 bits (556), Expect = 1e-65 Identities = 117/455 (25%), Positives = 207/455 (45%), Gaps = 64/455 (14%) Query: 15 IISHPDAGKTTLTEKLLLFGGAIREAGTV-KGKKSNKFATSDWMKVEQERGISVTSSVMQ 73 +++H DAGKTTLTE LL GAI E G+V KG +D +E++RGI++ + + Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGT-----TRTDNTLLERQRGITIQTGITS 62 Query: 74 FDFDGYKINILDTPGHEDFSEDTYRTLMAVDSAVMVIDAAKGIEPQTLKLFKVCKMRGIP 133 F ++ K+NI+DTPGH DF + YR+L +D A+++I A G++ QT LF + GIP Sbjct: 63 FQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122 Query: 134 IFTFINKLDRMGKEPFELLEEIESTLEIETYPMTWPVGMGQSFFGIINRKDRTINPYREE 193 FINK+D+ G + + ++I+ L E V + N E Sbjct: 123 TIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ-KVELYP--------NMCVTNFTESE 173 Query: 194 EKLQLTDDYGLKENHPIEADEAFQTAVEEFMLVEEAGDDFDKEKIST--------GDLTP 245 + + IE ++ +E++M +G + ++ L P Sbjct: 174 QWDTV-----------IEGNDDL---LEKYM----SGKSLEALELEQEESIRFHNCSLFP 215 Query: 246 VFFGSALSTFGIEEFLGTYVDFAPMPTSRQTKEDTEIEPLDDAFTGFIFKIQANMDPRHR 305 V+ GSA + GI+ + + T R E G +FKI+ + R Sbjct: 216 VYHGSAKNNIGIDNLIEVITNKFYSSTHRGQSE----------LCGKVFKIE--YSEK-R 262 Query: 306 DRLAFMRIVSGKFTRGMDATLARTGRKSKVSRATMFMADDTETVNEAYAGDIIGLYDTG- 364 RLA++R+ SG D+ K K++ + + +++AY+G+I+ L + Sbjct: 263 QRLAYIRLYSGVLHLR-DSVRISEKEKIKITEMYTSINGELCKIDKAYSGEIVILQNEFL 321 Query: 365 --TYQIGDTLYGPGAKKVEFEALPQFTPELFMKVSAKNVMKQKHFYKGIEQLVQEG-TIQ 421 +GDT P +++E P P L V +++ + ++ ++ Sbjct: 322 KLNSVLGDTKLLPQRERIEN---PL--PLLQTTVEPSKPQQREMLLDALLEISDSDPLLR 376 Query: 422 YYKTMHTNQPILGAVGQLQFEVFEHRMKNEYNTDV 456 YY T++ IL +G++Q EV ++ +Y+ ++ Sbjct: 377 YYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEI 411
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 26.4 bits (58), Expect = 0.034 Identities = 15/42 (35%), Positives = 19/42 (45%), Gaps = 5/42 (11%) Query: 37 QLYIIEMIAEEPGITQKTLVERFKKK-----QTSVSRAITRL 73 + I E+I TQ LV+ KK Q +VSR I L Sbjct: 7 HIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKEL 48
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 31.8 bits (72), Expect = 0.010 Identities = 18/94 (19%), Positives = 38/94 (40%), Gaps = 8/94 (8%) Query: 242 RHMVDNSLSRTKSNYEFS----ITDRVKVLEKLQDILKVEDDEEVRTLIIDRMR-AQAGF 296 R + DN+ + +YE S +T R V+++L I++ D+ R+++ + A A Sbjct: 147 RQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSVVTVPLSWASAAD 206 Query: 297 MNRYLKE-HMEDYGQVASEVAHFTDEYFPYARMN 329 + + + E + + R N Sbjct: 207 VVKLVTELNKDTSKSALPGSM--VANVVADERTN 238
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 42.2 bits (99), Expect = 1e-07 Identities = 26/117 (22%), Positives = 45/117 (38%), Gaps = 9/117 (7%) Query: 8 TKELYEQCLDIRKRVFVEEQNVPLDREIDEHEDFATHILLRDDTPLGTVRYRPLSKETVK 67 T+E + K F + ++ +D E E A + ++ +G ++ R Sbjct: 38 TEERFS------KPYFKQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYAL 91 Query: 68 VERMAVMPEARGLKLGRKLMDFVHEHAKHYGYEKARLGAQTH---AASFYEKLGYKI 121 +E +AV + R +G L+ E AK + L Q A FY K + I Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148
>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature. Length = 224 Score = 28.2 bits (63), Expect = 0.038 Identities = 15/76 (19%), Positives = 35/76 (46%), Gaps = 9/76 (11%) Query: 218 IQYGEMKEFGDAL-PIVERYNEFINQYKLLTKEEQLQYKEKMMEK--QKEKSRRAKPKKD 274 + + ++ + ++ Y +++ +Y + E +Q+ E K E++ K KD Sbjct: 80 VTFNDISSLSKHVDEGLDGYRDYLIKY---SDRELVQFFENAQLKRQYGEETETVKRDKD 136 Query: 275 ---KAPLSSLIPAFIL 287 K + +L+PA+ L Sbjct: 137 EIEKPSIFALLPAYAL 152
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 28.6 bits (64), Expect = 0.042 Identities = 15/50 (30%), Positives = 22/50 (44%), Gaps = 3/50 (6%) Query: 284 NVSMPQYVVDYTKEILEKLEGDKVTVFGLTYKGDVDDIRESPAFDIYELL 333 VSM VD T + G V ++G K +DD+ + YEL+ Sbjct: 298 TVSMDMLAVDLTP-CPQAGIGTPVELWGKEIK--IDDVAAAAGTVGYELM 344
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.7 bits (66), Expect = 0.022 Identities = 28/156 (17%), Positives = 55/156 (35%), Gaps = 19/156 (12%) Query: 12 VRDFDVSHYVWIYEKGNK----PILQSVQSMRKTGELKDFQKQILRVLSHRKVG-NKDFY 66 + + H + E G K +L+ + K+ + +H +G KD Y Sbjct: 577 GKYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSY 636 Query: 67 --YLEHEGHELGWAEL----KTSI----VVYSKPREHVRLDLDKFMQEQEKQ-IFVVSKN 115 +EL E+ + +S ++ R +++Q+ +Q + + N Sbjct: 637 EQIAGIVAYELS--EMTAFRRADAEAVKAFFSSRKDRYRGAYGRYVQDHPRQVVIWCTTN 694 Query: 116 NLRLLKDQMLDSRFIMVK-DGVEYEALFKKHRLQGW 150 + L D + RF V G +K R Q + Sbjct: 695 KRQYLFDITGNRRFWPVLVPGRANLVWLQKFRGQLF 730
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 33.6 bits (77), Expect = 4e-04 Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 8/66 (12%) Query: 134 KMTLDAARLNGTEGEEGSVEAGKYADFVVLNDNPLGYDVELTDDLVEMTIVNGKIVYGSR 193 K T++ A +G E GS+E GK AD V+ NP + V+ +M ++ G I Sbjct: 408 KYTINPAIAHGLSHEIGSLEVGKRADLVLW--NPAFFGVK-----PDMVLLGGTIAAAP- 459 Query: 194 SGDQGA 199 GD A Sbjct: 460 MGDPNA 465
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 62.2 bits (151), Expect = 1e-12 Identities = 58/184 (31%), Positives = 76/184 (41%), Gaps = 8/184 (4%) Query: 25 LGLLAIMGPLNIDMYLPSFPGIARDLGTSPSLVQVSLTACLLGLAFGQVVIGPLSDAQGR 84 L +L+ LN + S P IA D P+ TA +L + G V G LSD G Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78 Query: 85 KRPLLIATSLFVVSSLLCAVAPNIY-VLIAARFLQGFTASAGVVLSRAVVRDVFSGRELS 143 KR LL + S++ V + + +LI ARF+QG A+A L VV Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138 Query: 144 KFFSLLMVINAVAPMAAPIAGGAILLLPFASWHTIFLFLAVLGIMIVIIVAVSLRETLPP 203 K F L+ I A+ P GG I H I +L MI II L + L Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIA-------HYIHWSYLLLIPMITIITVPFLMKLLKK 191 Query: 204 AQRI 207 RI Sbjct: 192 EVRI 195
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 44.3 bits (105), Expect = 5e-07 Identities = 32/124 (25%), Positives = 49/124 (39%), Gaps = 34/124 (27%) Query: 5 IKNGDIYAPEHVGKKSVLLNGRIIIKIGDIDEEQLGRLFDVEVIDAEGMIVSPGIIDPHV 64 +K+G I A G + II+ G EVI EG IV+ G +D H+ Sbjct: 90 LKDGRIAAIGKAGNPDMQPGVTIIVGPG------------TEVIAGEGKIVTAGGMDSHI 137 Query: 65 HLIGGGGEGGFATRTPELQLSNIIKAGVTTVVG-----CLGTDGTT-----RHMTSLLAK 114 H I P+ Q+ + +G+T ++G GT TT H+ ++ Sbjct: 138 HFI-----------CPQ-QIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMIEA 185 Query: 115 ARAL 118 A A Sbjct: 186 ADAF 189
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 151 bits (383), Expect = 4e-42 Identities = 79/388 (20%), Positives = 146/388 (37%), Gaps = 40/388 (10%) Query: 90 MQKTSKFMVVNDNPA--ATLETI---EDLENVLPDH--DFLPYMAHEPMPENFDFIIT-- 140 M +V +D+ A L + + + ++A D ++T Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD----GDLVVTDV 55 Query: 141 --PGEANLVPTKAYQTFDIGARVVSIE---TVMELKEIFELEMKDSLLMQYYIKTMVHLT 195 P E + V+ + T M + E D L + + ++ + Sbjct: 56 VMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115 Query: 196 AKRSENTPVSIADQNKN-RTFSGISTESPQMQSTIRIASQMAKTSNIIHITGETGTGKQM 254 + + + + + S MQ R+ +++ +T + ITGE+GTGK++ Sbjct: 116 GRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKEL 175 Query: 255 LAEMIHNDSAYHDMPFYIYSGADKDPQSIDNELFG-------GEGEKHQGILREVNRGTV 307 +A +H+ + PF + A I++ELFG G + G + GT+ Sbjct: 176 VARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTL 235 Query: 308 YIKNIDSIPYQLQNKLANYFDANA----GSS-----DVRIVTSSIDDLWELYKGDIISQK 358 ++ I +P Q +L G DVRIV ++ DL + + + Sbjct: 236 FLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFRED 295 Query: 359 LYSYLSSYILKVPSISERKEDIPVLIDDFKNHFNRTEMQ---FSERVMNAFVRYDWPGNV 415 LY L+ L++P + +R EDIP L+ F + + F + + + WPGNV Sbjct: 296 LYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNV 355 Query: 416 RELYNLISYCVCLNQ-KYVEIDSLPIFF 442 REL NL+ L + + + Sbjct: 356 RELENLVRRLTALYPQDVITREIIENEL 383
>PF06580#Sensor histidine kinase Length = 349 Score = 29.1 bits (65), Expect = 0.018 Identities = 15/71 (21%), Positives = 22/71 (30%) Query: 37 IVWEVLTPVISIMIYWFVFGTLRQRAPIEMGGTEVPFFYWLAIGFIVWTFFFQGSIEASK 96 I+ VL + I + WFV T R + V F LA+ I Sbjct: 76 IILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLALSIIFNVVVVTFMWSLLY 135 Query: 97 SIYRRLKMLSK 107 + K + Sbjct: 136 FGWHFFKNYKQ 146
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 37.1 bits (86), Expect = 6e-06 Identities = 27/121 (22%), Positives = 50/121 (41%), Gaps = 16/121 (13%) Query: 14 KVITYGTFDLLHMGHINILRRAKERGDYLVVAVSSDEFNKLKHKEAYYSYEDR-KAILEA 72 I G+FD + GH++I+ R D + VAV + +K+ +S ++R + I +A Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRN-----PNKQPMFSVQERLEQIAKA 56 Query: 73 IKYVDEVIPEHNWGQKVKDVQKHDIDVFVMG----DDWKGEFDF------LKEYCEVVYL 122 I ++ + G V ++ + G D++ E L E V+L Sbjct: 57 IAHLPNAQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFL 116 Query: 123 A 123 Sbjct: 117 T 117
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 87.1 bits (216), Expect = 8e-21 Identities = 56/299 (18%), Positives = 103/299 (34%), Gaps = 45/299 (15%) Query: 282 TILVTGAGGSIGSELVRQISKFQPRQVVLLGHGENSIYTILEEMS--GIKGNIEYIPIIA 339 LVTGA G IG + +++ + QVV + N Y + + + + + Sbjct: 2 KYLVTGAAGFIGFHVSKRLLE-AGHQVVGI-DNLNDYYDVSLKQARLELLAQPGFQFHKI 59 Query: 340 DVQDRKRIFKIFDKYKPNIVYHAAAHKHVPLMEYNPKEAVKNNIIGTKNTAEAAIEYKAE 399 D+ DR+ + +F V+ + V NP +N+ G N E K + Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119 Query: 400 KFVLIST---------------DKAVNPPNVMGATKRMAEMVVQVLNGESEQTTLVAVRF 444 + S+ D +P ++ ATK+ E++ + +RF Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYS-HLYGLPATGLRF 178 Query: 445 GNVLGSRGS---VIPKFRKQIEAGGPITVTDE-RMTRYFMTI------------------ 482 V G G + KF K + G I V + +M R F I Sbjct: 179 FTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHAD 238 Query: 483 PEASRLVIQAGTLANGGEVFVLDMGQPVKIVDLARNMIRLSGYSETEIQIQFSGIRPGE 541 + + V+ + PV+++D + + G E + ++PG+ Sbjct: 239 TQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG---IEAKKNMLPLQPGD 294
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 69.0 bits (169), Expect = 3e-15 Identities = 49/265 (18%), Positives = 97/265 (36%), Gaps = 32/265 (12%) Query: 8 LITGGTGSFGNAVLDRFLETDIKEIRIFSRDEKKQDDMRKKYRNEKI-----KFHLGDVR 62 L+TG G G V R LE + + I + ++ D K+ R E + +FH D+ Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYY-DVSLKQARLELLAQPGFQFHKIDLA 62 Query: 63 DKDSVKN--SMHGVDYIFHAAALKQVPSCEFFPMEAVKTNVVGTENVIDAAIEKNVEKVI 120 D++ + + + + +F + V P +N+ G N+++ ++ ++ Sbjct: 63 DREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHLL 122 Query: 121 CLST---------------DKAAYPINAMGISKAMMEKVLVAKSKTVSSEDTLICGTRYG 165 S+ D +P++ +K E + S T G R+ Sbjct: 123 YASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPAT---GLRFF 179 Query: 166 NVMASRGS---VIPLFIQQIKEGKDITV-TDPNMTRFLMSLEEAVELVVFAFENAKSGDI 221 V G + F + + EGK I V M R +++ E ++ + D Sbjct: 180 TVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHAD- 238 Query: 222 MVQKSPSSTIKDLAQALKELFNADN 246 Q + + + A ++N N Sbjct: 239 -TQWTVETGTPAASIAPYRVYNIGN 262
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 57.1 bits (138), Expect = 3e-11 Identities = 47/229 (20%), Positives = 79/229 (34%), Gaps = 57/229 (24%) Query: 1 MNILITGANGFVGKNLSAELEQNTNYIV----------------------------YKI- 31 M L+TGA GF+G ++S L + + +V +KI Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 32 --TRETTEETFEEYCKKADFVFHL---AGVNR-PKNEKEFMTGNLDFTVKLVNELKKHDN 85 RE + F + VF V +N + NL + ++ E +H+ Sbjct: 61 LADREGMTDLFASG--HFERVFISPHRLAVRYSLENPHAYADSNLTGFLNIL-EGCRHNK 117 Query: 86 FAPVLITSSIQ----------AELD------NPYGKSKKAGEDIVFEYGGNNKVKTFVYR 129 +L SS + D + Y +KKA E + Y + R Sbjct: 118 IQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLR 177 Query: 130 LPNLFGKWCRPNYNSVVATFSHNIANGLPIRI-DNPDAKIKLLYIDDLI 177 ++G W RP+ + F+ + G I + + K YIDD+ Sbjct: 178 FFTVYGPWGRPDM--ALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIA 224
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 81.8 bits (202), Expect = 6e-19 Identities = 56/313 (17%), Positives = 100/313 (31%), Gaps = 72/313 (23%) Query: 282 TILVTGAGGSIGSEIVRQIAKFQPRKILLLGHGENSIYTILEEVLDNKTDSIS------- 334 LVTGA G IG + ++ LL G + +DN D Sbjct: 2 KYLVTGAAGFIGFHVSKR----------LLEAGHQVV------GIDNLNDYYDVSLKQAR 45 Query: 335 --------YVPIIADVQNRKRMFKVFEKYRPDIVYHAAAHKHVPMMEYNPQEAVKNNVIG 386 + D+ +R+ M +F + V+ + V NP +N+ G Sbjct: 46 LELLAQPGFQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTG 105 Query: 387 TKNTAEAACHFKAKKFVMIST---------------DKAVNPPNVMGATKRMAEMIVQAL 431 N E H K + + S+ D +P ++ ATK+ E++ Sbjct: 106 FLNILEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTY 165 Query: 432 DKGCEHTTLVAVRFGNVLGSRGS---VVPKFKKQIQLGGPVTV-TDPRMTRYFMTI---- 483 +RF V G G + KF K + G + V +M R F I Sbjct: 166 SH-LYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIA 224 Query: 484 --------------PEASRLVIQASTLAEGGEVFVLDMGEPVKIVDLAKNMIRLCGFAEE 529 + + + V+ + PV+++D + + G Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGI--- 281 Query: 530 DIGIEFVGIRPGE 542 + + ++PG+ Sbjct: 282 EAKKNMLPLQPGD 294
>STREPTOPAIN#Streptopain (C10) cysteine protease family signature. Length = 398 Score = 30.0 bits (67), Expect = 0.017 Identities = 29/148 (19%), Positives = 48/148 (32%), Gaps = 24/148 (16%) Query: 162 DAAESLGAVYKGRMSGTFGKFGVYSFNGNKIITTSGGGMIISDEEIM----------IKK 211 S Y F +N N I+ T G S+ + M I Sbjct: 215 TYTLSSNNPYFNHPKNLFAAISTRQYNWNNILPTYSGR--ESNVQKMAISELMADVGISV 272 Query: 212 ALKKATQSKETAAHYQH----ENVGYNYRLSNICAGIGRGQ--MEVLEERIRQKRAIFEQ 265 + S + EN GYN + I G Q +++ + Q + ++ Q Sbjct: 273 DMDYGPSSGSAGSSRVQRALKENFGYNQSVHQINRGDFSKQDWEAQIDKELSQNQPVYYQ 332 Query: 266 YVYGLGDVDGLGFM---PEANDSFHTRW 290 G+G V G F+ + + +H W Sbjct: 333 ---GVGKVGGHAFVIDGADGRNFYHVNW 357
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 533 bits (1376), Expect = 0.0 Identities = 200/331 (60%), Positives = 250/331 (75%) Query: 3 KILVTGSAGFIGSHLSARLLQEGYTVAGIDNLNDYYDVGLKKDRLELLLQNRVKSYEADI 62 K LVTG+AGFIG H+S RLL+ G+ V GIDNLNDYYDV LK+ RLELL Q + ++ D+ Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 63 SDTGSVMEIFESEKPDIVINLAAQAGVRYSLENPHAYITSNINGFTNILEACRHQKVEQL 122 +D + ++F S + V + VRYSLENPHAY SN+ GF NILE CRH K++ L Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121 Query: 123 IYASSSSVYGANTSKPFSTSDNIDHPLSLYAATKKANELMAHTYSHLYRLPTTGLRFFTV 182 +YASSSSVYG N PFST D++DHP+SLYAATKKANELMAHTYSHLY LP TGLRFFTV Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFTV 181 Query: 183 YGPWGRPDMALFKFTKAILEDRPIDVYNNGDMLRDFTYVDDIVESIHRLVKLTPKPDPEW 242 YGPWGRPDMALFKFTKA+LE + IDVYN G M RDFTY+DDI E+I RL + P D +W Sbjct: 182 YGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQW 241 Query: 243 SGDNPNPSSSNAPYRIYNIGNNAPVRLMAFIEAIENRLGKKGEKNFMPLQPGDVPETYAD 302 + + P++S APYR+YNIGN++PV LM +I+A+E+ LG + +KN +PLQPGDV ET AD Sbjct: 242 TVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLETSAD 301 Query: 303 VEDLFRTTGFRPSTDIQDGVNHFIDWYLGYY 333 + L+ GF P T ++DGV +F++WY +Y Sbjct: 302 TKALYEVIGFTPETTVKDGVKNFVNWYRDFY 332
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 27.2 bits (60), Expect = 0.006 Identities = 6/41 (14%), Positives = 17/41 (41%), Gaps = 4/41 (9%) Query: 13 FMVLLGAALMIIGF----FTKDIKMWFIAFAIALLVRYYAA 49 +++ +G + + F F + WF+ I ++ + Sbjct: 324 YVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSF 364
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.4 bits (68), Expect = 0.005 Identities = 16/53 (30%), Positives = 27/53 (50%), Gaps = 1/53 (1%) Query: 122 ADGSKKESEETAEEQTEEEQPEEQTEEEQPE-EQTDEVPAEESEGAVEDAAVE 173 S E++ET +T+E E+ E+ + E E+T EVP S+ + + E Sbjct: 1085 VAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSE 1137
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 55.0 bits (132), Expect = 1e-11 Identities = 36/198 (18%), Positives = 73/198 (36%), Gaps = 14/198 (7%) Query: 1 MEKQDLRKIKTRKAIDQAFTALIAEKGFEAMTIKDIAEEAIINRGTFYMHYEDKYALLES 60 K +TR+ I L +++G + ++ +IA+ A + RG Y H++DK L Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61 Query: 61 YENTLLDGLYEILSRNIEEEHHKLSIGMPRKIATDTFNY-ISENADKIIALF-----NNQ 114 + E+ + + + R+I ++E +++ Sbjct: 62 IWELSESNIGELELEYQAKF-PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 115 GENQFEHKVRAHMLNYYRIHSDQL----IDKNRLRVDID-YLLAYITNAHI-GLIRNW-L 167 GE + + ++ +Q I+ L D+ A I +I GL+ NW Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180 Query: 168 EHGRRETSEELADILEML 185 + +E D + +L Sbjct: 181 APQSFDLKKEARDYVAIL 198
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 54.8 bits (132), Expect = 8e-11 Identities = 22/122 (18%), Positives = 47/122 (38%), Gaps = 3/122 (2%) Query: 2 SILIIDDDLESSVRITNILKQSIHSDIKILEARSATEGLKMVKEDRPFIVVTELSLSDST 61 +IL+ DDD + L ++ + +A + + +VVT++ + D Sbjct: 5 TILVADDDAAIRTVLNQALSRA---GYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 62 GLEVGKKILSEFNDIFVIAISQLKMFELVQESINSGFSGFHLKPVIKSEFLSTIERLILS 121 ++ +I D+ V+ +S F ++ G + KP +E + I R + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 122 RT 123 Sbjct: 122 PK 123
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 29.3 bits (66), Expect = 0.035 Identities = 23/155 (14%), Positives = 56/155 (36%), Gaps = 14/155 (9%) Query: 55 DKMEKGIITRLKTAMPAIFILFAVGIII--GTWIYSGTVPLLIYYGLQIISPTYFLVTAF 112 D +KG + + K + I+ +++ + + L++ Q P ++ Sbjct: 16 DARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYV 75 Query: 113 VIVAVVSVATGTAWGSTATAGVALMGIAAELDVSLAMAAGAVISGGVFGDKLSPLSDTTN 172 V ++ ALM IA+ + + G +ISG + ++ Sbjct: 76 VDNVLLEFFYLC---FPLLTVAALMAIASHV-----VQYGFLISGEAIKPDIKKINPIEG 127 Query: 173 LAPLVVEVNLYEHIKHMLWTTVPASIVGLIIWFFV 207 + +L E +K + + ++ ++IW + Sbjct: 128 AKRIFSIKSLVEFLK----SILKVVLLSILIWIII 158
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 36.3 bits (84), Expect = 2e-04 Identities = 14/26 (53%), Positives = 19/26 (73%) Query: 339 ITLNPAEAVNMDHEIGSIREGKKADI 364 T+NPA A + HEIGS+ GK+AD+ Sbjct: 409 YTINPAIAHGLSHEIGSLEVGKRADL 434
>PF05272#Virulence-associated E family protein Length = 892 Score = 32.4 bits (73), Expect = 0.002 Identities = 13/69 (18%), Positives = 28/69 (40%), Gaps = 7/69 (10%) Query: 36 LGIVGRSGSGKSTILKSIYGTYMPEEGAIMYHSKENGPVNIL-----EINDYELIRLRKT 90 + + G G GKST++ ++ G + + ++ I E++ E+ R+ Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELS--EMTAFRRA 656 Query: 91 EIGYVSQFL 99 + V F Sbjct: 657 DAEAVKAFF 665
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 29.6 bits (66), Expect = 0.001 Identities = 12/46 (26%), Positives = 25/46 (54%) Query: 42 GKEFVSVNPMKRFGEPEEVGNLVTFLLSNEATFSNAAVIPIDGGQS 87 + F + P+K+ +P ++ + V FL+S +A + +DGG + Sbjct: 213 LETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITMHNLCVDGGAT 258
>adhesinb#Adhesin B signature. Length = 310 Score = 168 bits (426), Expect = 4e-50 Identities = 68/316 (21%), Positives = 124/316 (39%), Gaps = 12/316 (3%) Query: 1 MFRRSLWFLSAMSVIILTACGAASPEESEGSGKIEVYTTVFALQSLTEQIAGDNAEVHSI 60 M + L ++ + L AC + GS K+ V T + +T+ IAGD +HSI Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSI 60 Query: 61 YPNGTDIHSYEPTQKDMLSYAESDLFITTNKELDAVSGKIADVLNEDIEILEAVGDTGHL 120 P G D H YEP +D+ +++DL L+ + +E + + + Sbjct: 61 VPVGQDPHEYEPLPEDVKKTSQADLIFYNGINLE---TGGNAWFTKLVENAKKKENKDYY 117 Query: 121 LEDTHSHDHGEGDDHDHSHGEIDPHVWLDPVLSIDMAEAIKDKLSTLDPDNAEAYEENFE 180 + E DPH WL+ I A+ I +LS DP N E YE+N + Sbjct: 118 AVSEGVDVI-YLEGQSEKGKE-DPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLK 175 Query: 181 TVKADLEELD----ASLESVTEDSKVKNVYISHESIGYLANRYGFTQHGVSGMNNE-EPT 235 L LD ++ + K+ + S Y + Y + +N E E T Sbjct: 176 AYVEKLSALDKEAKEKFNNIPGEKKM--IVTSEGCFKYFSKAYNVPSAYIWEINTEEEGT 233 Query: 236 QKEVIDMVEGLKADGSKYILTEQNISNKVTDIIKDAGGVEQLGFHNLSVLMDEDNPDTDY 295 ++ +VE L+ + E ++ ++ + + + ++ Y Sbjct: 234 PDQIKTLVEKLRKTKVPSLFVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSY 293 Query: 296 QTLMRHNIEVLDRALN 311 ++M++N+E + L+ Sbjct: 294 YSMMKYNLEKIAEGLS 309
>PF06438#Heme acquisition protein HasAp Length = 205 Score = 28.4 bits (63), Expect = 0.029 Identities = 22/107 (20%), Positives = 39/107 (36%), Gaps = 12/107 (11%) Query: 80 SGKGLSGDLTLYAIAHAVSTNEVNAAMGKICATPTAGSAGVVPGVLFAMKEKHDVSREDM 139 +G SG L + + S +++ + G G V V++ + + + Sbjct: 99 TGGASSGGYALDSQEVSFSNLGLDSPIA-------QGRDGTVHKVVYGLMSGDSSALQGQ 151 Query: 140 IKFLFTSGAFGFVVANNASISGAAGGCQAEVGSASAMAAAALVEMAG 186 I L + + + AAG V A+ AAAA V + G Sbjct: 152 IDALLKAVDPSLSINSTFDQLAAAG-----VAHATPAAAAAEVGVVG 193
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 142 bits (359), Expect = 9e-44 Identities = 82/252 (32%), Positives = 127/252 (50%), Gaps = 10/252 (3%) Query: 3 KIALVTGASRGIGKSIALSLGKEYTVIVNYSGSREKAEGVADEINSEGGTAEAYQCHVQN 62 KIA +TGA++GIG+++A +L + I + EK E V + +E AEA+ V++ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 63 YDDVKAMIKYITDTYGSIDLVVNNAGVTKDNLLMRMKEDEWNQVIDVNLKGAFNVIQSVS 122 + + I G ID++VN AGV + L+ + ++EW VN G FN +SVS Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 123 RPMIRQKGGRIINISSIVGSLGNPGQTNYVASKAGIDGITKSVARELAPKGITVNAVAPG 182 + M+ ++ G I+ + S + Y +SKA TK + ELA I N V+PG Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188 Query: 183 FIESDMTDVL--SDDIKEQMLG--------QIPLNHFGTVDDISETVKFLASGSAKYITG 232 E+DM L ++ EQ++ IPL DI++ V FL SG A +IT Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248 Query: 233 QTIHVNGGMYMG 244 + V+GG +G Sbjct: 249 HNLCVDGGATLG 260
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 24.8 bits (54), Expect = 0.038 Identities = 9/42 (21%), Positives = 18/42 (42%), Gaps = 2/42 (4%) Query: 33 GADSLDIAELVMELEDEFEMEIPDEEAEKINTVGDALNYIDK 74 GA++LD A+ + E + P + K+ D ++ Sbjct: 296 GANALDTAKAIKAKLAELQPFFP--QGMKVLYPYDTTPFVQL 335
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 53.9 bits (129), Expect = 2e-09 Identities = 56/381 (14%), Positives = 121/381 (31%), Gaps = 32/381 (8%) Query: 148 DEVLKARPEQRRNLIEETAGVMKYKLRKKESEKRLEDTAQNLSRVNDIIQELESRVNKLE 207 +++ + + E K L + L ND + E S + Sbjct: 42 AVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKL 101 Query: 208 RESANAKEYLALKEEISRSDIEVTAYDINALMTILRTEEEAYEEIEKKAEDCRAKLQQME 267 R++ + A K + + + M + + +E + A+ +E Sbjct: 102 RKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLE 161 Query: 268 QKMSELGSARDRHDSKNRELNSRLV-------ELSRRLENTGGRIELYKERKNNKGQLVE 320 + + + +K + L + EL + LE + Sbjct: 162 KALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKA 221 Query: 321 ELKVRLSEQQARKETLAAKADEVDRTAASLNETALMLKKSLSDTDEQKKYLTKDRGDEIE 380 L R ++ + E + +L L+ ++ ++ + + Sbjct: 222 ALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSA 281 Query: 381 KLKDSYYDLMVEKTTLENDQRREESEKSRLDGSLRQKEERLAALRNDYDTEKSEH----- 435 K+K L EK LE ++ E + L+ + + L A R ++EH Sbjct: 282 KIKT----LEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEE 337 Query: 436 ----------------DALVDKKEKTKSELAHAREKYLDEKRNLAELNQKYDAEREKLHK 479 DA + K++ ++E E+ + + L + DA RE + Sbjct: 338 QNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQ 397 Query: 480 ANRFIEQQSSKLEMLKNMQNE 500 + +E+ +SKL L+ + E Sbjct: 398 VEKALEEANSKLAALEKLNKE 418 Score = 44.3 bits (104), Expect = 3e-06 Identities = 38/210 (18%), Positives = 72/210 (34%), Gaps = 10/210 (4%) Query: 674 ESQRDMAETEEKLAEYKNKLEHMKDTVKKLGGEVAGQMEQLSKLESTGETLAEKHEQTES 733 + K+ + + + L + G M + + +TL + E+ Sbjct: 131 GAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 190 Query: 734 AADRLGYQLEAKAETMAVLEEELKSLGHVNEE-----RDFEGLIKEAEDKLQKLDEHIRM 788 L LE ++K+L D E ++ A + I+ Sbjct: 191 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 250 Query: 789 MSASDKDKKQKLALLTDEAHEIEREYTAVRERISHNSAEKERLGSELHDVQEAIEETEAQ 848 + A + + A L TA +I AEK L +E D++ + A Sbjct: 251 
LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNAN 310 Query: 849 QRLVAEDLGGMDLDALEKEEQELTAEVEKL 878 ++ + DLDA + +++L AE +KL Sbjct: 311 RQSLRR-----DLDASREAKKQLEAEHQKL 335 Score = 39.7 bits (92), Expect = 7e-05 Identities = 50/248 (20%), Positives = 94/248 (37%), Gaps = 13/248 (5%) Query: 669 KNSIIESQRDMAETEEKLAEYKNKLEHMKDTVKKLGGEVAGQMEQLSKLESTGETLAEKH 728 + A+ E+ L N +K L E A + ++LE E Sbjct: 217 EAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFS 276 Query: 729 EQTESAADRLGYQLEAKAETMAVLEEELKSL-----GHVNEERDFEGLIKEAEDKLQKLD 783 + L + A A LE + + L + K+ E + QKL+ Sbjct: 277 TADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLE 336 Query: 784 EHIRMMSASDKDKKQKLALLTDEAHEIEREYTAVRERISHNSAEKERLGSELHDVQEAIE 843 E ++ AS + ++ L + ++E E+ + E+ + A ++ L +L +EA + Sbjct: 337 EQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKK 396 Query: 844 ETEAQQRLVAEDLGGMDLDALEKEEQELTAEVEKLYAETDEVSGLQHEIRKEYNSIVEER 903 + E + L ALEK +EL E E + LQ ++ E ++ E+ Sbjct: 397 QVEKAL-----EEANSKLAALEKLNKELE---ESKKLTEKEKAELQAKLEAEAKALKEKL 448 Query: 904 DRTSKELE 911 + ++EL Sbjct: 449 AKQAEELA 456 Score = 32.7 bits (74), Expect = 0.009 Identities = 28/248 (11%), Positives = 75/248 (30%), Gaps = 5/248 (2%) Query: 764 EERDFEGLIKEAEDKLQKLDEHIRMMSASDKDKKQKLALLTDEAHEIEREYTAVRERISH 823 + D K +D +L E + + + L+ + E+E + + + Sbjct: 72 KNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEG 131 Query: 824 NSAEKERLGSELHDVQEAIEETEAQQRLVAEDLGGMDLDALEKEEQELTAEVEKLYAETD 883 +++ ++ A++ + + L + +A+++ L AE Sbjct: 132 AMNFSTADSAKIKTLEAEKAALAARKADLEKAL-----EGAMNFSTADSAKIKTLEAEKA 186 Query: 884 EVSGLQHEIRKEYNSIVEERDRTSKELEECQETLRNHTGKKEKLDVKIEQKIEYLSENYK 943 + Q E+ K + S +++ + +K L+ +E + + + + Sbjct: 187 ALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSA 246 Query: 944 MTYEKAREEYDDFSDIDQKRMKISLNKKSIEELGPVNLGAIEEFDRVNERYQFLKSQEAD 1003 E+ + + + E + L+ Q Sbjct: 247 KIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQV 306 Query: 1004 LLEARSTL 1011 L R +L Sbjct: 307 LNANRQSL 314
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 30.6 bits (69), Expect = 0.017 Identities = 42/173 (24%), Positives = 75/173 (43%), Gaps = 23/173 (13%) Query: 58 LVTLTSLLSITAPFLVGYIVDNYFVQQRFDGLFRILMILLATYVLLSATQYIAAFLMV-- 115 L+ + + IT PFL+ + ++ FD ILM + + +L T Y +FL+V Sbjct: 171 LLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSV 230 Query: 116 ---GLSQRTVYKLRD-----RLFSHMQKLPIRFFDKRQHGEL---MSRMTNDIETISQTL 164 + + + K+ D L ++ + G + +S + ++ + Q L Sbjct: 231 LSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQ-L 289 Query: 165 NTSFIQFTTSVVTLIGTVSVMIY------LSPLLTLLTVTIIPVLILAVGFIT 211 +T+ I SV+ GT+SV+I+ L L V I V L+V F+T Sbjct: 290 STAEI---GSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLT 339
>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature. Length = 1104 Score = 27.4 bits (60), Expect = 0.048 Identities = 13/45 (28%), Positives = 21/45 (46%), Gaps = 1/45 (2%) Query: 17 ITLDTGGIGHLINVPNPFRFEAALDSEVTIFTELIVREDSHTLYG 61 + D GGI ++ N+ F +E + + EL E +H L G Sbjct: 467 FSTDNGGI-YIENIGTFFTYERTPEESIYTLEELFRHEFTHYLQG 510
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 31.2 bits (71), Expect = 0.008 Identities = 17/42 (40%), Positives = 24/42 (57%), Gaps = 5/42 (11%) Query: 302 NNPVQRYSAEDIFKMATINGARAYNLQETMGKIKEGYKADLV 343 N V+RY A+ TIN A A+ L +G ++ G +ADLV Sbjct: 399 NFRVKRYIAK-----YTINPAIAHGLSHEIGSLEVGKRADLV 435
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 81.8 bits (202), Expect = 3e-20 Identities = 28/121 (23%), Positives = 58/121 (47%), Gaps = 4/121 (3%) Query: 1 MNGYNILIVEDEVSVSKGLKKVLEGEGANVSVNETGEGVVEQLADAH--LILMDIMLPFD 58 M G IL+ +D+ ++ L + L G +V + + +A L++ D+++P + Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 59 DGLSISKEIL-HRVDIPIIFLTAMNDIDSKLDGLKSGE-DYITKPFHPLELISRLNNVIS 116 + + I R D+P++ ++A N + + + G DY+ KPF ELI + ++ Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 117 R 117 Sbjct: 121 E 121
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 30.3 bits (68), Expect = 0.002 Identities = 14/58 (24%), Positives = 23/58 (39%) Query: 72 YGRFVWVCDLVTDTNKRSKGYGEKLLGFVHDWAAEKGYESVALSSGLQRTEAHRFYEN 129 + + + D+ + R KG G LL +WA E + + L + A FY Sbjct: 86 WNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAK 143
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 33.4 bits (76), Expect = 2e-04 Identities = 15/84 (17%), Positives = 36/84 (42%), Gaps = 2/84 (2%) Query: 54 INLQNKEVFGIYNQEELIGFLDLLFHYPDDSTCMIGYLVIDQRYRKQGLGQKIYNEVVTY 113 + + K F Y + IG + + ++ + I + + + YRK+G+G + ++ + + Sbjct: 60 VEEEGKAAFLYYLENNCIGRIKIRSNWNGYAL--IEDIAVAKDYRKKGVGTALLHKAIEW 117 Query: 114 LSKRDISKVRLGVIKDNIPAVKMW 137 + + L NI A + Sbjct: 118 AKENHFCGLMLETQDINISACHFY 141
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.7 bits (79), Expect = 7e-04 Identities = 14/66 (21%), Positives = 29/66 (43%), Gaps = 7/66 (10%) Query: 43 MILYGPPGIGKTSIASAIAGSTSYKFRTLNAVTNTKKDMQIVADEGKMSGSVILLLDEIH 102 ++L G GIGK+++ + + G + T + K + +++G V L E+ Sbjct: 599 VVLEGTGGIGKSTLINTLVG-LDFFSDTHFDIGTGKDSYE------QIAGIVAYELSEMT 651 Query: 103 RLDKAK 108 +A Sbjct: 652 AFRRAD 657
>PF03309#Bvg accessory factor Length = 271 Score = 29.0 bits (65), Expect = 0.016 Identities = 15/64 (23%), Positives = 24/64 (37%), Gaps = 3/64 (4%) Query: 74 EIDEGARMIGAVNTV-AVKDGVFKGYNTDISGYMNAFTARF-GEQKRKVLIIGAGGAAKA 131 E+ +IG NTV ++ G G+ + G +N G V ++ G A Sbjct: 174 ELTRPRSVIGK-NTVECMQAGAVFGFAGLVDGLVNRIRDDVDGFSGADVAVVATGHTAPL 232 Query: 132 VQRA 135 V Sbjct: 233 VLPD 236
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 42.1 bits (99), Expect = 2e-07 Identities = 17/72 (23%), Positives = 33/72 (45%), Gaps = 4/72 (5%) Query: 3 IGVFGGTFDPVHIGHIHAVAEAKIALNLDKVIIIPARQSPLKSSSPTKDKHRLNMLHHAV 62 ++ G+FDP+ GH+ + D+V + R +P K + + RL + A+ Sbjct: 2 NAIYPGSFDPITFGHLDIIERG--CRLFDQVYVAVLR-NPNKQPMFSVQE-RLEQIAKAI 57 Query: 63 EGYGFIEIDTFE 74 ++D+FE Sbjct: 58 AHLPNAQVDSFE 69
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 28.8 bits (64), Expect = 0.002 Identities = 9/30 (30%), Positives = 17/30 (56%) Query: 26 VIEHTEVQDSLKGQGAGSQLVDTMVEFAKQ 55 +IE V + +G G+ L+ +E+AK+ Sbjct: 91 LIEDIAVAKDYRKKGVGTALLHKAIEWAKE 120
>PRPHPHLPASEC#Prokaryotic zinc-dependent phospholipase C signature. Length = 398 Score = 37.7 bits (87), Expect = 2e-04 Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 6/74 (8%) Query: 214 YYGIETVTDSVKFTWQDDKGSNRSFDKDKIDSIISKIGNEDLKITDIQ----KKAKKTFA 269 Y+GI+T D W+ D N F D+ K+ +E+LKI DIQ +K K T Sbjct: 305 YFGIKT-KDGKTQEWEMDNPGN-DFMTGSKDTYTFKLKDENLKIDDIQNMWIRKRKYTAF 362 Query: 270 PALYDLTELQRDAN 283 P Y ++ AN Sbjct: 363 PDAYKPENIKIIAN 376
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 88.3 bits (219), Expect = 2e-22 Identities = 34/139 (24%), Positives = 69/139 (49%), Gaps = 6/139 (4%) Query: 3 HILVVEDEINLARFIELELVHEGYTVTLSDNGTDGLEKALDNEYECILLDLMLPELNGLE 62 ILV +D+ + + L GY V ++ N + + ++ D+++P+ N + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 63 VCRRIRKV-KDVPIVIITAKGETYDKVVGLDYGADDYIVKPFEIEELLARIRVIM----- 116 + RI+K D+P+++++A+ + + GA DY+ KPF++ EL+ I + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 117 RRSANSEEKQEILELYGIS 135 R S ++ Q+ + L G S Sbjct: 125 RPSKLEDDSQDGMPLVGRS 143
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 40.0 bits (93), Expect = 2e-05 Identities = 25/105 (23%), Positives = 42/105 (40%), Gaps = 2/105 (1%) Query: 94 SEESKQEENKAEESKSEKKSAEDKKEEPASSEESESGDNDERIVATPSARRLAREKGIDL 153 + S+ E AE SK E K+ E K E+ A+ +++ + + + A E Sbjct: 1031 ATPSETTETVAENSKQESKTVE-KNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG 1089 Query: 154 SEINASDPRGLVRSQDVDNHSKQPAKAETPKQEAPKSKSSDKPEK 198 SE + + V+ K + E QE PK S P++ Sbjct: 1090 SETKETQTTETKETATVEKEEKAKVETEK-TQEVPKVTSQVSPKQ 1133 Score = 32.7 bits (74), Expect = 0.004 Identities = 42/227 (18%), Positives = 77/227 (33%), Gaps = 28/227 (12%) Query: 11 ESITEGTIASWLKQKGDSVEKGENILELETDKVNVEVISEEA-GVITELKAEEGDTVEVG 69 S T T+A KQ+ +VEK E ET N EV E V + E Sbjct: 1033 PSETTETVAENSKQESKTVEKNEQDAT-ETTAQNREVAKEAKSNVKANTQTNE------- 1084 Query: 70 QVIAIVDENGEGGGSSDSSSGENKSEESKQEENKAEESKSE---KKSAEDKKEEPASSEE 126 V ++G + ++ + + K+E+ K E K++ K +++ ++ S Sbjct: 1085 -----VAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETV 1139 Query: 127 SESGDNDERIVATPSARRLAREKG-----IDLSEINASDPRGLVRSQDVDNHSKQPAKAE 181 + T + + + ++ +S+ V N + E Sbjct: 1140 QPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVN--TGNSVVE 1197 Query: 182 TPKQEAPKSK----SSDKPEKPVVREKMSRRRKTIAKKLLEVSQNTA 224 P+ P + +S+ KP R + S R + S N Sbjct: 1198 NPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDR 1244
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 28.3 bits (63), Expect = 0.010 Identities = 9/22 (40%), Positives = 13/22 (59%) Query: 102 SYTADASELEPGSYEVVIHVNG 123 S + EL PG+Y V I++N Sbjct: 65 SRFENGQELPPGTYRVDIYLNN 86
>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature. Length = 533 Score = 31.5 bits (71), Expect = 0.026 Identities = 30/122 (24%), Positives = 46/122 (37%), Gaps = 32/122 (26%) Query: 255 PDTLKIEGLADGLQETLATGKAIGPFAELLERLGVDMDKFNGGLSDAIANGTEQNFVMQT 314 P++ IE + +QE + LE L D F+G +++A N NFVM Sbjct: 292 PNSASIEQIQSKIQE----------LGDTLEEL---RDSFDGYINNAFVNQIHLNFVMPP 338 Query: 315 LADNGLANVNQKFRENNKELVESRQSQQSFQQAMADLGTTLAPIATRITQGITGIVEKFN 374 A + +Q Q QQA A +A A R+ G I + + Sbjct: 339 QA-------------------QQQQGQGQQQQAQATAQEAVAAAAVRLLNGSDQIAQLYK 379 Query: 375 NL 376 +L Sbjct: 380 DL 381
>PF01540#Adhesin lipoprotein Length = 475 Score = 25.9 bits (56), Expect = 0.019 Identities = 11/46 (23%), Positives = 25/46 (54%) Query: 2 QTLEEKDARIEAQAKKIRELRDEITQLQGEKRDLTDALNLTSREME 47 Q +++ + +I + KI+E E+ +L + + D + LT ++E Sbjct: 107 QKVDQANKKIADENLKIKEGAKELLKLSEKIQSFADTIALTITKLE 152
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 28.8 bits (64), Expect = 0.007 Identities = 12/58 (20%), Positives = 21/58 (36%), Gaps = 1/58 (1%) Query: 82 LIVKPEFSGNGFAKFAFKEAVKYAFEVLNMHKVYLYVDTENEKAVRIYEKQGFKNEGV 139 + V ++ G +A+++A E + + L N A Y K F V Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKEN-HFCGLMLETQDINISACHFYAKHHFIIGAV 151
>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature. Length = 1104 Score = 28.5 bits (63), Expect = 0.010 Identities = 13/52 (25%), Positives = 23/52 (44%) Query: 36 KGHTLVIPRKPVENIYDLDEKTGAHIMKVITEVANAIKTAFNPAGLNVVQNN 87 KG + P + ++Y D ++G + + V + N +K A V NN Sbjct: 947 KGEKTLEPGRYYLSVYTYDNQSGTYTVNVKGNLKNEVKETAKDAIKEVENNN 998
>FLAGELLIN#Flagellin signature. Length = 507 Score = 26.5 bits (58), Expect = 0.039 Identities = 7/44 (15%), Positives = 18/44 (40%) Query: 77 DEVKTMISNYKADISPNIERIQKDVENLQNRGEDIQESVGKIQD 120 D + + ++ + R + NL N ++ + +I+D Sbjct: 425 DSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIED 468
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 64.9 bits (158), Expect = 2e-14 Identities = 29/120 (24%), Positives = 49/120 (40%), Gaps = 4/120 (3%) Query: 3 KIIMTDDHHIVREGMKFLLSTTEDIRVIEDFGTGAETLEFLSENHRDTDLVLLDLVMPEM 62 I++ DD +R + L + V A +++ DLV+ D+VMP+ Sbjct: 5 TILVADDDAAIRTVLNQAL-SRAGYDV-RITSNAATLWRWIAAGD--GDLVVTDVVMPDE 60 Query: 63 DGIEVTRRIKAEYPGIKVLVLSSYTSEEYIRPVFAAQADGYIIKEMAAEELIESIKNVIE 122 + ++ RIK P + VLV+S+ + A Y+ K ELI I + Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120
>PF06580#Sensor histidine kinase Length = 349 Score = 42.9 bits (101), Expect = 1e-06 Identities = 37/225 (16%), Positives = 94/225 (41%), Gaps = 26/225 (11%) Query: 155 AFQIGSTLKRIELTAQEQENMIIRERQRLARDLHDSVN-QMLFSIGITSHAAKTLKDKEK 213 + K+ E+ + +M +E Q +A L +N +F+ + + A L+D K Sbjct: 137 GWHFFKNYKQAEIDQWKMASMA-QEAQLMA--LKAQINPHFMFNA-LNNIRALILEDPTK 192 Query: 214 LSDAFDSIENTSKHAMREMKALIWQLKPIGLEKGIIDAIEKYADLLGLELEVKVTGFYDV 273 + S+ ++++R A + + E + ++ Y L ++ E ++ + Sbjct: 193 AREMLTSLSELMRYSLRYSNA---RQVSLADE---LTVVDSYLQLASIQFEDRLQFENQI 246 Query: 274 PDHIEVGLYRV----MQEGLNNVRKHSGSTKAE-----IAILSKSDELNIQIKDDGIGFE 324 + +V +Q + N KH + + + + + +++++ G Sbjct: 247 NP--AIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLAL 304 Query: 325 QKEKSGYSYGLGNMKDRVRKLGG---ILEIKSKKGEGTSIKVSVP 366 + K GL N+++R++ L G +++ K+G+ ++ V +P Sbjct: 305 KNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348
>PF06917#Periplasmic pectate lyase Length = 555 Score = 28.7 bits (64), Expect = 0.039 Identities = 21/82 (25%), Positives = 40/82 (48%), Gaps = 6/82 (7%) Query: 108 LLNDRLDKLEQFIK-SLDPDIET-KSMVDTGV-LSDREVARRSGLGYIGKNGFMINPNLG 164 +L +D L+ + + + D + T + + + G +S + R GY G G +I+P Sbjct: 336 VLQWVIDGLKNYYRFAYDVESNTLRPLWNDGQDMSGYVLPRD---GYYGVKGTVISPFPL 392 Query: 165 TYSYLGEMITSYPFPPDEELID 186 YL ++ ++ DEEL+D Sbjct: 393 DVDYLLPLVRAWRLSEDEELLD 414
>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature. Length = 1147 Score = 31.6 bits (71), Expect = 0.008 Identities = 26/114 (22%), Positives = 52/114 (45%), Gaps = 19/114 (16%) Query: 152 GNYDKYREIKEHEIKRQQDEYKQYTAKRKHLEKAITHKV-NRSSNINRPKNKQDSDFRQT 210 GNYD E+K+ Q + ++ KR+HLEK + K+ ++S N N+ + K ++ ++ Sbjct: 602 GNYD--------EVKKAQKDLEKSLRKREHLEKEVEKKLESKSGNKNKMEAKAQANSQKD 653 Query: 211 GAKPYFNKKKKK----------MEQVASSMKTRLEQLEVKEKPFEEKSIHFNTG 254 NK+ + ++ + + +LE + K F++ F G Sbjct: 654 EIFALINKEANRDARAIAYAQNLKGIKRELSDKLENVNKNLKDFDKSFDEFKNG 707
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 274 bits (701), Expect = 3e-92 Identities = 115/295 (38%), Positives = 169/295 (57%), Gaps = 10/295 (3%) Query: 86 KDIPVYAYQQDVPYGIDKVQAPLAHQNGDKGAGVKLAVIDTGIDADHEDLD---VHGGYS 142 + I ++P G++ +QAP +G GVK+AV+DTG DADH DL + G Sbjct: 11 QVIKQEQQVNEIPRGVEMIQAPAVWNQT-RGRGVKVAVLDTGCDADHPDLKARIIGGRNF 69 Query: 143 VFTSGVDADPYYDGSGHGTHVAGTAAALDNNVGVVGVAPEADLYAVKVLNSSGSGSSSGV 202 D + + D +GHGTHVAGT AA +N GVVGVAPEADL +KVLN GSG + Sbjct: 70 TDDDEGDPEIFKDYNGHGTHVAGTIAATENENGVVGVAPEADLLIIKVLNKQGSGQYDWI 129 Query: 203 VQGVEWAVQNGMDVINMSLGSSAHSQAIQDVVDAAYYEHDILVVAAAGNEGNASGTGDTV 262 +QG+ +A++ +D+I+MSLG + + V A ILV+ AAGNEG+ D + Sbjct: 130 IQGIYYAIEQKVDIISMSLGGPEDVPELHEAVKKA-VASQILVMCAAGNEGDGDDRTDEL 188 Query: 263 GYPAQYDSAFAVAATDENNQRASFSSTGPAVDISAPGVNILSTVPGNGYSSLNGTSMASP 322 GYP Y+ +V A + + + FS++ VD+ APG +ILSTVPG Y++ +GTSMA+P Sbjct: 189 GYPGCYNEVISVGAINFDRHASEFSNSNNEVDLVAPGEDILSTVPGGKYATFSGTSMATP 248 Query: 323 HVAGAGAVIRSSFPGTG-----AAEVRSLMQGASKYIGSDTNWYGSGLLQINSAV 372 HVAGA A+I+ + E+ + + + +G+ G+GLL + + Sbjct: 249 HVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNSPKMEGNGLLYLTAVE 303
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 26.7 bits (59), Expect = 0.030 Identities = 12/68 (17%), Positives = 28/68 (41%), Gaps = 6/68 (8%) Query: 1 MKKKSNKIQRIRGFALALVFIGMGIMYLGVFFREYQIIFSL-FLIVGLLPIL----LSFV 55 + N+ + + +VF+ + +Y + ++ + IVG+L Sbjct: 865 ERLSGNQAPALVAISFVVVFLCLAALYES-WSIPVSVMLVVPLGIVGVLLAATLFNQKND 923 Query: 56 IYFWVGMI 63 +YF VG++ Sbjct: 924 VYFMVGLL 931
>BONTOXILYSIN#Bontoxilysin signature. Length = 1196 Score = 29.9 bits (67), Expect = 0.020 Identities = 22/157 (14%), Positives = 52/157 (33%), Gaps = 32/157 (20%) Query: 186 KRSEKEYGSIQKQITESEKLLKYQQDELKYHRSNREEMRLMNRLEREIQFKKLYFIHVGN 245 + SI Q + +++++ + +L + ++L+ Sbjct: 682 ELICMAKQSILAQESLVKQIVQNKFTDLSKASIPPDTLKLIRETT--------------- 726 Query: 246 LIYLPENIAMDFSDVEKEAMNEIARYLNTDFSSLKMTNPSIRSLRRQIKGMDEEKEAFKI 305 E +D S+ + +MN + +LN + + + + I M++ I Sbjct: 727 -----EKTFIDLSNESQISMNRVDNFLNKASICVFVED----IYPKFISYMEKYINNINI 777 Query: 306 HVLYEII----II----YQMIVQHEKTTGEVKKVDIK 334 I I +I + T + K +DI+ Sbjct: 778 KTREFIQRCTNINDNEKSILINSYTFKTIDFKFLDIQ 814
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 86.4 bits (214), Expect = 9e-22 Identities = 34/130 (26%), Positives = 65/130 (50%), Gaps = 1/130 (0%) Query: 2 TRVLIIEDNASIAEIERDYLEVNDIGSDIVLNGRDGLRMVYTGGYDLIVLDIMLPDIDGF 61 +L+ +D+A+I + L I N R + G DL+V D+++PD + F Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 EILKKIRDE-VDVPILMVTAKVSDIDIVRGLNLGADDYITKPFSPNELVARVKSHVTRYQ 120 ++L +I+ D+P+L+++A+ + + ++ GA DY+ KPF EL+ + + + Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 121 RIMEKNSSDG 130 R K D Sbjct: 124 RRPSKLEDDS 133
>PF06580#Sensor histidine kinase Length = 349 Score = 40.6 bits (95), Expect = 5e-06 Identities = 31/199 (15%), Positives = 76/199 (38%), Gaps = 44/199 (22%) Query: 175 LRTIAAKTRE----LDKLIDELSLFSNLNMEESPLEKECIELDQFLSHIIDEAKLELE-- 228 L I A E +++ LS ++ S + + L L+ + ++ L+L Sbjct: 179 LNNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQ--VSLADELTVV--DSYLQLASI 234 Query: 229 --DEKIEWSYEHPTEI-DIVIPADRMKLSRVFTNLLNNSVKY---RCRENHVIDIRLSRT 282 ++++++ + I D+ +P + L+ N +K+ + + I ++ ++ Sbjct: 235 QFEDRLQFENQINPAIMDVQVP------PMLVQTLVENGIKHGIAQLPQGGKILLKGTKD 288 Query: 283 GNDAVVDIKDNGRGIDREVLPKIFEPFYREESSRNKKTGGSGLGLSIV-ENIVRSHGGQ- 340 +++++ G + +G GL V E + +G + Sbjct: 289 NGTVTLEVENTGSLALKNT------------------KESTGTGLQNVRERLQMLYGTEA 330 Query: 341 -IDIKSEQGEWTMATVKLP 358 I + +QG+ A V +P Sbjct: 331 QIKLSEKQGKVN-AMVLIP 348
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 24.8 bits (54), Expect = 0.038 Identities = 15/55 (27%), Positives = 29/55 (52%), Gaps = 11/55 (20%) Query: 14 EYIEPSTLTLIYD-YIAIIGVALMVFLF--------IPIMHMPVMASLLISLALL 59 +++ S ++ + AI+ V L+++LF IP + +PV LL + A+L Sbjct: 331 PFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPV--VLLGTFAIL 383
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 31.3 bits (71), Expect = 0.012 Identities = 17/60 (28%), Positives = 28/60 (46%), Gaps = 1/60 (1%) Query: 120 SGESVSRVVNDTAIIKDLITSHFPQLIGGIMSVVGSVVILFVLDWRMSLIMFISVPISIV 179 G V + T ++ I L IM V V+ LF+ + R +LI I+VP+ ++ Sbjct: 319 QGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVF-LVMYLFLQNMRATLIPTIAVPVVLL 377
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 30.7 bits (69), Expect = 0.005 Identities = 15/58 (25%), Positives = 30/58 (51%), Gaps = 8/58 (13%) Query: 116 LYNATFEDKELAKTIADAVKDYNPKLKLM--------GLSNQNLVKAGEEAGLEVRHE 165 L++A K+ K A+ ++ +L L+ G S +LV+ +E G+EV+++ Sbjct: 24 LHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQGSSLNDLVRTADELGIEVQYD 81
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 29.0 bits (65), Expect = 0.007 Identities = 6/23 (26%), Positives = 14/23 (60%) Query: 116 GLVEEILAENGDTVEYDQPLITI 138 +V+EI+ + G++V L+ + Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKL 127
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 28.2 bits (63), Expect = 0.048 Identities = 13/31 (41%), Positives = 16/31 (51%), Gaps = 1/31 (3%) Query: 1 MKFAIVGT-GVIGSGWITRILAHGHDVVATD 30 MK+ + G G IG R+L GH VV D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGID 31
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 57.3 bits (138), Expect = 1e-12 Identities = 17/84 (20%), Positives = 36/84 (42%) Query: 7 KMQLLEAAADIVNEHGSDYLTLDAVAERAGVSKGGLIYHFKNKDALIRGLVEHANQLYRD 66 + +L+ A + ++ G +L +A+ AGV++G + +HFK+K L + E + + Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72 Query: 67 NVDRHIEPEDDSNGRWLRAFIEAT 90 + LR + Sbjct: 73 LELEYQAKFPGDPLSVLREILIHV 96
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 29.1 bits (65), Expect = 0.006 Identities = 30/112 (26%), Positives = 49/112 (43%), Gaps = 12/112 (10%) Query: 29 KDYSKEYIEDDVKQMDKNFFIERAKFTNCYVFVNENIGEIIGVGSIGSYWGSETESSLFT 88 K Y K+Y +DD MD ++ E K +++ EN IG I S W + + Sbjct: 44 KPYFKQYEDDD---MDVSYVEEEGKA--AFLYYLEN--NCIGRIKIRSNWNGY--ALIED 94 Query: 89 IFVSPDYQGMGIGKKIME--TLESDEYFLRSKRVEIPA-SITALTFYQKMGY 137 I V+ DY+ G+G ++ + E +E +I+A FY K + Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 28.4 bits (63), Expect = 0.014 Identities = 6/22 (27%), Positives = 12/22 (54%) Query: 159 TLDDIKAATNISRATLYRHLES 180 +L +I A ++R +Y H + Sbjct: 33 SLGEIAKAAGVTRGAIYWHFKD 54
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 29.8 bits (67), Expect = 0.023 Identities = 26/126 (20%), Positives = 49/126 (38%), Gaps = 11/126 (8%) Query: 11 WLTAPGANIMAGIVVALALIPEAIAFSIIAGVDPMVGLYASFLIAVIISIVGGRPAMISG 70 LT P + G++ L ++ ++I M G + + ++ G+ M Sbjct: 160 QLTLPL--LWGGLLFNLLGGFVSLGDAVIGA---MAGYLVLWSLYWAFKLLTGKEGM--- 211 Query: 71 ATGAIALLVVPLVSEHGVEYLLAATILMGIIQIIFGVLKVGKLMKFIPNSVMIGFVNALA 130 G LL L + G + L +L ++ G+ + L++ S I F LA Sbjct: 212 GYGDFKLLAA-LGAWLGWQALPIVLLLSSLVGAFMGIGLI--LLRNHHQSKPIPFGPYLA 268 Query: 131 IMIFMA 136 I ++A Sbjct: 269 IAGWIA 274
>PF01206#SirA family protein Length = 76 Score = 59.8 bits (145), Expect = 8e-14 Identities = 19/68 (27%), Positives = 41/68 (60%) Query: 126 VEASGLQCPGPLLKVNEVMGELEPGQQMEITVTDFGFCTDVEAWARKTGHSILKNEKSED 185 ++A+GL CP P+LK + + + G+ + + TD G D E+++++TGH +L+ ++ + Sbjct: 8 LDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKEEDG 67 Query: 186 KVMVVLQK 193 L++ Sbjct: 68 TYHFRLKR 75
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 92.2 bits (229), Expect = 9e-24 Identities = 30/121 (24%), Positives = 57/121 (47%), Gaps = 1/121 (0%) Query: 2 KQKILVVEDDHMIRNLIKINLENNNYDVVEAADGAEAKNVFLDAHPCLVILDLMLPKVSG 61 ILV +DD IR ++ L YDV ++ A LV+ D+++P + Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 62 EEFFEWVREQERNEVSFIMLSAKSRVSDKVKGLKMGADDYITKPFEPDELVAHVEAVLRR 121 + +++ R ++ +++SA++ +K + GA DY+ KPF+ EL+ + L Sbjct: 63 FDLLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 122 T 122 Sbjct: 122 P 122
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 33.2 bits (76), Expect = 5e-04 Identities = 23/127 (18%), Positives = 35/127 (27%), Gaps = 18/127 (14%) Query: 1 MKVGIIGANGNIGLRLGKILSSRGVDTLGF---------VRKEEQAEKLKSIGVNPKTAD 51 MK + GA G IG + K L G +G K+ + E L G D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 52 IIETSTEDYTTLLEGTDVLVFTAGAGGAGV-------ETTRKIDGEGVSKMIEAAEDAGV 104 + E T L V + G ++E + Sbjct: 61 L--ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118 Query: 105 KRFILVS 111 + + S Sbjct: 119 QHLLYAS 125
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 28.8 bits (64), Expect = 0.006 Identities = 16/65 (24%), Positives = 29/65 (44%) Query: 47 AWDEARMVGIIRSSGDQNFTQYISDLIVHPEYKTKGLASKLMNTYINEVSEVDEIFLMMD 106 + E +G I+ + N I D+ V +Y+ KG+ + L++ I E LM++ Sbjct: 70 YYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLE 129 Query: 107 AAPGN 111 N Sbjct: 130 TQDIN 134
>TACYTOLYSIN#Bacterial thiol-activated pore-forming cytolysin signature. Length = 574 Score = 36.9 bits (85), Expect = 1e-04 Identities = 38/171 (22%), Positives = 65/171 (38%), Gaps = 11/171 (6%) Query: 4 LLSATLVSALFLAACSDGEENTEESTEEATAEEAATEESSEESTEEESTEEESGEGAESD 63 LL+A L+ + A +D + +TE T E ESSE +TE+ + + + Sbjct: 19 LLTAALIVGNLVTANADSNKQNTANTETTTTNEQPKPESSELTTEKAGQKMDDMLNSNDM 78 Query: 64 ADAASDEGSSEDLAEAEIDEEDMKSAYDLGEDKADMIDSATETDQSVEDVLQAPSEVTSY 123 A E E + E ED K + + D E + + + EV + Sbjct: 79 IKLAPKEMPLESAEKEEKKSEDNKKSEE---------DHTEEINDKIYSLNYNELEVLAK 129 Query: 124 QQETAIMIEVTEGEQVLDEAFTGNRAQIDETEGTLEVASDYIDESFNVTYP 174 ET EG + D+ R + + ++++ ID + TYP Sbjct: 130 NGETIENFVPKEGVKKADKFIVIERKKKNINTTPVDISI--IDSVTDRTYP 178
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 35.0 bits (80), Expect = 4e-04 Identities = 27/118 (22%), Positives = 45/118 (38%), Gaps = 5/118 (4%) Query: 24 EESTEEETMEESSEEETVEEESAGTTDEESEESSEEGSGNATTVEAEEDSLEESDEAESA 83 T+E E+ E TVE+E E+E++ E + +E S +AE A Sbjct: 1089 GSETKETQTTETKETATVEKE--EKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA 1146 Query: 84 SSDDRTEDLTEGEMDEVNPDDAYDIDEDKVRMVENATETD---NTVDDVLKAPSEITS 138 +D T ++ E + D ++ VE NT + V++ P T Sbjct: 1147 RENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTP 1204
>AUTOINDCRSYN#Autoinducer synthesis protein signature. Length = 216 Score = 29.4 bits (66), Expect = 0.004 Identities = 11/66 (16%), Positives = 22/66 (33%), Gaps = 10/66 (15%) Query: 5 ENFESLDLHILEEIYRLRVSVFI--------VEQECAYQEIDGKDPVSTHIYKTDDSGIS 56 N L E++ LR F + + D + T+++ D+ + Sbjct: 7 VNHTLLSETKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNNNT--TYLFGIKDNTVI 64 Query: 57 AYLRIV 62 LR + Sbjct: 65 CSLRFI 70
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 136 bits (345), Expect = 2e-37 Identities = 89/402 (22%), Positives = 184/402 (45%), Gaps = 14/402 (3%) Query: 12 RLIFILLSGAFVALLSNTFLNVALPSIKDDFGITTSTVQWVSTAYMLVSGIVIPTTAFLM 71 +++ L +F ++L+ LNV+LP I +DF ++ WV+TA+ML I L Sbjct: 14 QILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73 Query: 72 QRFSAKKLFIAAMLLFLTGTLVAGLSPT-FMVLILGRMIQASGSAILMPLLMNVMITSFP 130 + K+L + +++ G+++ + + F +LI+ R IQ +G+A L+M V+ P Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIP 133 Query: 131 PAQRGTAMGLFSLVMFFAPAIGPTLSGFIVQHYSWHMLFFMMVPILLIVLSIGWLKLPQT 190 RG A GL ++ +GP + G I + W L ++P++ I+ +KL + Sbjct: 134 KENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKK 191 Query: 191 DTHYTTKIDIPSVILSTLGFGGILYGFSAAGNSGWMRPDVILTLFIGFTAVFFYIRKQIH 250 + DI +IL ++G + ++ I L + + +++ Sbjct: 192 EVRIKGHFDIKGIILMSVGIVFFMLFTTSY---------SISFLIVSVLSFLIFVKHIRK 242 Query: 251 MNDPMLNFKVYKFPMFTLASLLIGTMNMALFSGMILMPIYLQDIQGISPLDTG-ILLLPG 309 + DP ++ + K F + L G + + + ++P ++D+ +S + G +++ PG Sbjct: 243 VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302 Query: 310 ALIMGFMSPISGKLFDMFGPKVLAITGLTLTVSTTFFFSQLEVDTSYTFLIMLYSIRAFG 369 + + I G L D GP + G+T +S +F + ++T+ F+ ++ G Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTF-LSVSFLTASFLLETTSWFMTIIIVFVLGG 361 Query: 370 MTLVMTPVMTNGMNQLTPELTPHGSSINSMLNQVSGAIGTAL 411 ++ T + T + L + G S+ + + +S G A+ Sbjct: 362 LSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 130 bits (328), Expect = 3e-35 Identities = 90/403 (22%), Positives = 183/403 (45%), Gaps = 14/403 (3%) Query: 27 AFAAILNQTLLATAIPHIMADLELEADVAQWLQSVFMLVNGIMIPVTAFLISKFSTRALF 86 +F ++LN+ +L ++P I D W+ + FML I V L + + L Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82 Query: 87 FTALSLFGLGTLVCGISPN-FPILMAGRVLQAAGAGIIMPLMQTILFLVYPKSERGKAMG 145 + + G+++ + + F +L+ R +Q AGA L+ ++ PK RGKA G Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142 Query: 146 MFGLVISFAPAIGPTLSGWFIDIYPWRGLFYMLLPIVIIDLIVAYFILRNVTEQTNPKLD 205 + G +++ +GP + G W L L+P++ I + L + D Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVRIKGHFD 200 Query: 206 MLSIILSTLGFGGLLYGFSVAGNSGWLSSAVIISLAVGAVALFIFIRRQNSLEQPILEFG 265 + IIL ++G + +S I L V ++ IF++ + P ++ G Sbjct: 201 IKGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPG 251 Query: 266 VFKDKIFTLTTALGMIVFMAMIGGAVILPILMQNMLGFSALASG-MMLFPGAVIMGVMSP 324 + K+ F + G I+F + G ++P +M+++ S G +++FPG + + + Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311 Query: 325 ITGRLFDRYGARWLAIIGLGIVAVTSLMFTNLDTETTFTYLAVVNAFRMLGVSMVMMPVT 384 I G L DR G ++ IG+ ++V+ L + L ETT ++ ++ F + G+S ++ Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLL-ETTSWFMTIIIVFVLGGLSFTKTVIS 370 Query: 385 TAGLNQMSKKLVPHGTAMNNTMRQIAGAVGTALLVSIMTNTML 427 T + + ++ G ++ N ++ G A++ +++ +L Sbjct: 371 TIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLL 413
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 119 bits (299), Expect = 2e-31 Identities = 81/419 (19%), Positives = 189/419 (45%), Gaps = 14/419 (3%) Query: 11 QTEIKKLPLMLVLLSGAFAAILNQTLLATAIPHIMADLNLEADVAQWLQSVFMLVNGIMI 70 Q+ ++ +++ L +F ++LN+ +L ++P I D N W+ + FML I Sbjct: 7 QSNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGT 66 Query: 71 PVTAFLIGKFSTRSLFFTALILFGIGTLVCGLAPN-FTILLLGRILQASGAGIIMPLMQT 129 V L + + L +I+ G+++ + + F++L++ R +Q +GA L+ Sbjct: 67 AVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMV 126 Query: 130 ILFLIYPREKRGTAMGFFGLVISFAPAIGPTLSGWFVEIYPWRGLFYIILPIVIIDLIIA 189 ++ P+E RG A G G +++ +GP + G W + +++P++ II Sbjct: 127 VVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMI---TIIT 181 Query: 190 YFVLKNVTEQTNPKVDVFSIILSTLGFGGLLYGFSIAGSSGWLSPTVLISLGVGAITLTL 249 L + ++ F I L G+++ S V + ++ + Sbjct: 182 VPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSV------LSFLI 235 Query: 250 FIKRQFRLEQPILEFRVFRDPIFTLATIIGMVAFMTMIGGAIILPIFMQNMLGFTAFESG 309 F+K ++ P ++ + ++ F + + G + F T+ G ++P M+++ + E G Sbjct: 236 FVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIG 295 Query: 310 -LMMLPGALLMGIMSPVTGRMFDKFGARWLVIPGLGIVTVTTFMFAVLDTETTFTYLAVV 368 +++ PG + + I + G + D+ G +++ G+ ++V+ + L ETT ++ ++ Sbjct: 296 SVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLL-ETTSWFMTII 354 Query: 369 NAVRMLGISMVMMPSTTAGLNQLTNKLVPHGTAMNNTMRQVAGAVGTALFVSVMTITMI 427 + G+S +T + L + G ++ N ++ G A+ +++I ++ Sbjct: 355 IVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLL 413
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 29.9 bits (67), Expect = 0.018 Identities = 20/85 (23%), Positives = 31/85 (36%), Gaps = 6/85 (7%) Query: 123 GVKHQVAFNYRKTPAVALAKKYIEDGEIGRILSFRGTYLQDWSANPDSPLSWRFQES--- 179 + VA+ + + +K + G +G IL+FR L L+ F E+ Sbjct: 252 PSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQ-LALAFAEAFNT 310 Query: 180 --SAGSGALGDIGTHVIDLAHYLVA 202 AG A GD G + V Sbjct: 311 QHKAGFDANGDAGEDFFAIGKPAVL 335
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 62.8 bits (152), Expect = 1e-12 Identities = 50/265 (18%), Positives = 89/265 (33%), Gaps = 12/265 (4%) Query: 26 TTISADETDDIEEQSAQTQQESEETESQLNEPAESESTEEATGEQSQEASPELETDLTES 85 T I+ + S + E + P + +T T E E S + + ++ Sbjct: 995 TNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKN 1054 Query: 86 QLEPVEPEKNDAEAAKEEAINDRQNHDDEGEAEGTSPVNDFLPGNETENQE--ESSEEND 143 + + E + E AKE N + N T G+ET+ + E+ E Sbjct: 1055 EQDATETTAQNREVAKEAKSNVKAN---------TQTNEVAQSGSETKETQTTETKETAT 1105 Query: 144 VEASSEEQTSEESTEETAGEESTEQPALEGPSSEEQQEEPSLETPTDDISEEEPNEEQTG 203 VE + + E T+E S P E + + Q EP+ E ++ +EP + Sbjct: 1106 VEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARE-NDPTVNIKEPQSQTNT 1164 Query: 204 EPAVDEPENEEPKEENTGETSGEESTEQTPTVENPKGESSSEPSTEELSKENTSEQTKEE 263 ++P E T VENP+ + + S+ + + + Sbjct: 1165 TADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHR 1224 Query: 264 TVEMPQEESTEEAAGSAADDGKEAS 288 + E A S+ D A Sbjct: 1225 RSVRSVPHNVEPATTSSNDRSTVAL 1249 Score = 37.4 bits (86), Expect = 1e-04 Identities = 42/245 (17%), Positives = 78/245 (31%), Gaps = 7/245 (2%) Query: 134 NQEESSEENDVEASSEEQTSEESTEETAGEESTEQPAL--EGPSSEEQQEEPSLETPTDD 191 N E V+ ++ + + + + E+ A E P PS T T Sbjct: 982 NPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTET-- 1039 Query: 192 ISEEEPNEEQTGEPAVDEPENEEPKEENTGETSGEESTEQTPTVENPKGESSSEP-STEE 250 ++E E +T E +E + E +N +S + T N +S SE T+ Sbjct: 1040 VAENSKQESKTVE--KNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097 Query: 251 LSKENTSEQTKEETVEMPQEESTEEAAGSAADDGKEASVGEKQGKREIYRYDYDVLEGIN 310 + T+ KEE ++ E++ E ++ K+ Q + E R + + Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157 Query: 311 LTPGGSEEQLKLLDKRVNRLMTSKIVDVEDMSEEEIMEIEEEVKKEEGITQESDREELPN 370 + + + V +E TQ + E N Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSN 1217 Query: 371 TGENN 375 +N Sbjct: 1218 KPKNR 1222
>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature. Length = 171 Score = 35.8 bits (82), Expect = 4e-06 Identities = 12/65 (18%), Positives = 35/65 (53%), Gaps = 3/65 (4%) Query: 13 VGPTSMVVIAVVALIIFGPKKLPQFGRAMGSTLREFKDATKGLATDDDEE---EEKEKNQ 69 +G + ++++ ++ L++ GP++LP + + +R + + + +E +E + + Sbjct: 4 IGFSELLLVFIIGLVVLGPQRLPVAVKTVAGWIRALRSLATTVQNELTQELKLQEFQDSL 63 Query: 70 KKIES 74 KK+E Sbjct: 64 KKVEK 68
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 42.4 bits (99), Expect = 2e-06 Identities = 21/78 (26%), Positives = 33/78 (42%), Gaps = 2/78 (2%) Query: 134 APAVEAPQVEEPAQPAQQEQSNNEAAEQQAAQQQEQA--AAEQAAQKEAAAKQEREAAQQ 191 APA + E A+ ++QE E EQ A + Q A++A A Q E AQ Sbjct: 1029 APATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQS 1088 Query: 192 QKQEQQAQQQQQTQTENQ 209 + ++ Q + +T Sbjct: 1089 GSETKETQTTETKETATV 1106 Score = 38.9 bits (90), Expect = 2e-05 Identities = 20/96 (20%), Positives = 40/96 (41%), Gaps = 2/96 (2%) Query: 122 KTLTVSGENAEQAPAVEAPQVEEPAQPAQQ--EQSNNEAAEQQAAQQQEQAAAEQAAQKE 179 +T EN++Q ++ + Q E + + +A Q + A + KE Sbjct: 1035 ETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKE 1094 Query: 180 AAAKQEREAAQQQKQEQQAQQQQQTQTENQSTSSSS 215 + +E A +K+E+ + ++TQ + TS S Sbjct: 1095 TQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVS 1130 Score = 35.0 bits (80), Expect = 3e-04 Identities = 22/109 (20%), Positives = 42/109 (38%), Gaps = 5/109 (4%) Query: 115 SDMIFAGKTLTVSGENAEQAPAVEAPQVEEPAQPAQQEQSNNEAAEQQAAQQQEQAAAEQ 174 S++ +T V+ +E + +E A ++E++ E + Q + + + Sbjct: 1074 SNVKANTQTNEVAQSGSETKETQTT-ETKETATVEKEEKAKVETEKTQEVPKVTSQVSPK 1132 Query: 175 AAQKEAAAKQEREAAQQQK----QEQQAQQQQQTQTENQSTSSSSNSGQ 219 Q E Q A + +E Q+Q TE + +SSN Q Sbjct: 1133 QEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQ 1181 Score = 30.8 bits (69), Expect = 0.008 Identities = 18/90 (20%), Positives = 37/90 (41%), Gaps = 5/90 (5%) Query: 131 AEQAPAVEAPQVEEPAQPAQQEQSNNEAAEQQAAQQQEQAAAEQAAQKEAAAKQEREAAQ 190 E E P+V P Q++ E + QA +E KE ++ A Sbjct: 1114 VETEKTQEVPKVTSQVSPKQEQS---ETVQPQAEPARENDPTVNI--KEPQSQTNTTADT 1168 Query: 191 QQKQEQQAQQQQQTQTENQSTSSSSNSGQN 220 +Q ++ + +Q TE+ + ++ ++ +N Sbjct: 1169 EQPAKETSSNVEQPVTESTTVNTGNSVVEN 1198 Score = 30.4 bits (68), Expect = 0.011 Identities = 18/92 (19%), Positives = 32/92 (34%), Gaps = 7/92 (7%) Query: 131 AEQAPAVEAPQVEEPAQPAQQEQSNNEAAEQQAAQQQEQAAAEQAAQKEAAA---KQERE 187 Q + P+ P +N E A A A A + E A KQE + Sbjct: 994 TTNITTPNNIQADVPSVP----SNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESK 1049 Query: 188 AAQQQKQEQQAQQQQQTQTENQSTSSSSNSGQ 219 ++ +Q+ Q + ++ S+ + Q Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKANTQ 1081
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 30.6 bits (69), Expect = 0.013 Identities = 24/127 (18%), Positives = 56/127 (44%), Gaps = 11/127 (8%) Query: 168 ALGAISGALIFNPALDEMELFGEMLVSGRGGLFAVMMSAFLMAMLEQQIRKVVPNSLDLI 227 +G + A +FN D + G + G A+++ F ++E++ + VV +L + Sbjct: 908 IVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAV 967 Query: 228 VTSTITVFVVGLITVIGLQPV------GAVLSEGIIVSINWVLEVGGIFAGAVLAAVFLP 281 + + L ++G+ P+ G+ + + + +GG+ + +LA F+P Sbjct: 968 RMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGV-----MGGMVSATLLAIFFVP 1022 Query: 282 LVLVGLH 288 + V + Sbjct: 1023 VFFVVIR 1029
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 48.1 bits (114), Expect = 7e-08 Identities = 35/157 (22%), Positives = 62/157 (39%), Gaps = 9/157 (5%) Query: 298 TTTTKAENDAPEVDTGELESVIAEAEAVSEADRIPSLQSA-----LKNAKAVVEDDETTQ 352 TT + D P V + E IA + P+ S +N+K + E + Sbjct: 998 TTPNNIQADVPSVPSNNEE--IARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNE 1055 Query: 353 QQADEVEAALASALEENEDELKAAEEESSEEGSSEESTEEKTNEASTEEETTEEPAAEET 412 Q A E A +E + +KA + + S E+ E +T E +E A+ Sbjct: 1056 QDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVE 1115 Query: 413 GESTEEESPEEEMTEESAEESEAESETASAQAEASND 449 E T+E + ++ S ++ ++E+ A+ ND Sbjct: 1116 TEKTQEVP--KVTSQVSPKQEQSETVQPQAEPAREND 1150 Score = 32.7 bits (74), Expect = 0.003 Identities = 28/161 (17%), Positives = 55/161 (34%), Gaps = 9/161 (5%) Query: 289 EPLLTSYFRTTTTKAENDAPEVDTG-ELESVIAEAEAVSEADRIPSLQSALKNAKAVVED 347 P +TS ++E P+ + E + + E S+ + + K + VE Sbjct: 1122 VPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQ 1181 Query: 348 DETTQQQADEVEAALASALEENEDELKAAEEESSEEGSSEESTEEKTNEASTEEETTEEP 407 T + +++ EN + A + + S + + + EP Sbjct: 1182 PVTESTTVNT-----GNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEP 1236 Query: 408 AAEETGESTEEESPEEEMTEESAEESEAESETASAQAEASN 448 A + + + + T +A S+A A AQ A N Sbjct: 1237 ATTSSNDRSTVALCDLTSTNTNAVLSDA---RAKAQFVALN 1274
>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein signature. Length = 104 Score = 30.3 bits (68), Expect = 0.004 Identities = 14/38 (36%), Positives = 24/38 (63%) Query: 182 FILLIARSLTLPGAMEGVEFLLMPDFSAITSEAILFAL 219 F+L I S+ LPG+M + F L+ S+I S+ ++ A+ Sbjct: 14 FLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAM 51
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 28.0 bits (62), Expect = 0.014 Identities = 13/53 (24%), Positives = 28/53 (52%), Gaps = 6/53 (11%) Query: 87 ISVAPSYQNKGIGSQMIVTALKRAEEMGYESVIVLGHD------KYYPRFGFR 133 I+VA Y+ KG+G+ ++ A++ A+E + +++ D +Y + F Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147
>PF09025#YopR Core Length = 143 Score = 30.4 bits (68), Expect = 0.010 Identities = 14/83 (16%), Positives = 26/83 (31%), Gaps = 1/83 (1%) Query: 527 IMKEKNKPTINNNLGFPHAVHTLEKIKIKIA-ILDESLKDYEDLKLIILMAIPENNVNEA 585 P L E++ + A L D +LK ++ +P + Sbjct: 34 QALGGEPPAAGRRLAGLENGALGERLLQRFAQPLQGLEADRLELKAMLRAELPLGRQQQT 93 Query: 586 VLIRLYEEILSLATNDYLMDRIR 608 L++L + +YL R Sbjct: 94 FLLQLLGAVEHAPGGEYLAQLAR 116
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 116 bits (292), Expect = 1e-33 Identities = 65/271 (23%), Positives = 119/271 (43%), Gaps = 21/271 (7%) Query: 3 NWLNIDKKVVVITGGSSGIGRRILESLLENGAIVYNADMKDNPIDHNNYHY--------- 53 N I+ K+ ITG + GIG + +L GA + D ++ Sbjct: 2 NAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA 61 Query: 54 LKTDVTQEENVKNTVEQIVNEQKQIDVLINNAGINLPRVLVDVRGEKPEYEINMKDLDFM 113 DV + +I E ID+L+N AG+ P + ++ ++ + Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRP---------GLIHSLSDEEWEAT 112 Query: 114 FAVNLKGPVLFSREVSRQFVEQQHGVIINVSSEAGQEGSQGQSIYSATKAALIGFTRSWA 173 F+VN G SR VS+ ++++ G I+ V S + Y+++KAA + FT+ Sbjct: 113 FSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLG 172 Query: 174 KELGEHNIRVVAIAPGILEETGLRTAAYEEALAYSRNTTVEGLNSDYSKSIPIGRVGELT 233 EL E+NIR ++PG E + +E ++G + IP+ ++ + + Sbjct: 173 LELAEYNIRCNIVSPGSTETDMQWSLWADE---NGAEQVIKGSLETFKTGIPLKKLAKPS 229 Query: 234 EVADLVCYLASEKSSYITGTTINISGGKSRG 264 ++AD V +L S ++ +IT + + GG + G Sbjct: 230 DIADAVLFLVSGQAGHITMHNLCVDGGATLG 260
>PF08280#M protein trans-acting positive regulator Length = 530 Score = 31.7 bits (72), Expect = 0.011 Identities = 30/153 (19%), Positives = 56/153 (36%), Gaps = 15/153 (9%) Query: 10 LFDELLKNPSVTSKELEEKYKLTRRQFGYSFNKINDFLVSKNLPKIERGRQGNFIIEQTV 69 L K S+ E+ EK LT Q + ++N F I++ I Q Sbjct: 49 LVVLFFKTSSLPITEVAEKTGLTFLQLNHYCEELNAFFPDSLSMTIQKRM----ISCQ-- 102 Query: 70 ISNLSDEDEFQIKESNVYSEKQRFFMILLMLIGSKEELSLNHFAIELDVSRNTILNDLKH 129 ++ S E +Y+ ++ ++ L FA +S ++ + Sbjct: 103 FTHPSKETYLYQ----LYASSNVLQLLAFLIKNGSHSRPLTDFARSHFLSNSSAYRMREA 158 Query: 130 VRKIASEFYLSIKYSRIKGYVIEGEEFYIRKLL 162 + + F L + ++I GEE+ IR L+ Sbjct: 159 LIPLLRNFELKLSKNKIV-----GEEYRIRYLI 186
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 31.0 bits (70), Expect = 0.003 Identities = 25/132 (18%), Positives = 56/132 (42%), Gaps = 14/132 (10%) Query: 79 LVVTSFLSLFGVGFDPASLNSAAVVIVTFSICYSVFQSEIIRGALHSLDKDQIEAAQSLG 138 L+ ++ + FD ++ +A + I +SI +V + +R L + + Sbjct: 191 LLTVGLFAVLQLKFDLTTV-AALLTITGYSINDTVVVFDRLRENLIKYKTMPL--RDVMN 247 Query: 139 YSTSQTLRKVIIPQVMTEALPDTMNAFLIIIKALSLAFLVTVVDIFAQARLVGAQTFSYL 198 S ++TL + ++ + T L+ + + L + V+ F A + G T +Y Sbjct: 248 LSVNETLSRTVMTGMTT----------LLALVPM-LIWGGDVIRGFVFAMVWGVFTGTYS 296 Query: 199 EAFVAAALVYWV 210 +VA +V ++ Sbjct: 297 SVYVAKNIVLFI 308
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 28.9 bits (64), Expect = 0.032 Identities = 12/51 (23%), Positives = 24/51 (47%), Gaps = 4/51 (7%) Query: 346 GASVSYAMHLIKSLSDFETMPDSKISKKLCEFGVTLSRRTVNKYKNEILSQ 396 G + A ++K ++ + +P+ +F TV +Y+N+ LSQ Sbjct: 85 GKGLGLAEMMVKQMTPEQPLPEESTPAAPMKF----PLETVVRYQNQALSQ 131
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 361 bits (928), Expect = e-123 Identities = 121/342 (35%), Positives = 187/342 (54%), Gaps = 28/342 (8%) Query: 139 FRNIIYKSSVMEHVKNQIEKAAKTNANILITGETGVGKELFAKSIHDTSA-VKGEFIPIN 197 ++ +S+ M+ + + + +T+ ++ITGE+G GKEL A+++HD G F+ IN Sbjct: 136 GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAIN 195 Query: 198 CGAIPSHLFESELFGYEKGAFTGANREGNKGKIELAEGGTLFLDEMGDMPLDMQVKFLRV 257 AIP L ESELFG+EKGAFTGA G+ E AEGGTLFLDE+GDMP+D Q + LRV Sbjct: 196 MAAIPRDLIESELFGHEKGAFTGAQTRS-TGRFEQAEGGTLFLDEIGDMPMDAQTRLLRV 254 Query: 258 LQEKQYFKLGGNKEKSADFRLVSATNRKIADLLASDDFRSDLLYRINVVNIHIPPLRERP 317 LQ+ +Y +GG +D R+V+ATN+ + + FR DL YR+NVV + +PPLR+R Sbjct: 255 LQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRA 314 Query: 318 DDIESLFFYYLYSLSEKYGTSVKYANQQLINHLKAYHWPGNVRELINVIERLVIFSNEEA 377 +DI L +++ +EK G VK +Q+ + +KA+ WPGNVREL N++ RL ++ Sbjct: 315 EDIPDLVRHFV-QQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDV 373 Query: 378 LNNEIFDQY---------LTEVGEDAKQATLPSVTEELE----------------LKDYV 412 + EI + + + + ++ EE + Sbjct: 374 ITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVL 433 Query: 413 EKIEADYIRHVLEENGQNVERASKALGISRPTLYAKVKRFGL 454 ++E I L N +A+ LG++R TL K++ G+ Sbjct: 434 AEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475
>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature. Length = 1147 Score = 30.1 bits (67), Expect = 0.030 Identities = 28/123 (22%), Positives = 58/123 (47%), Gaps = 9/123 (7%) Query: 381 NMTDIEEAQKLWEQGLE--ELGTDSITLELLSYDDDQRKAMAEYMKNQWENNLPGLTVAI 438 N ++++AQK E+ L E + +L S ++ K A+ N ++ + L I Sbjct: 603 NYDEVKKAQKDLEKSLRKREHLEKEVEKKLESKSGNKNKMEAKAQANSQKDEIFAL---I 659 Query: 439 NQQPNKQKLDLEGKQDYDMSFSGWRNDISDPVEFLNVHLSDGPYNWQDFANEEYDELVKK 498 N++ N+ + Y + G + ++SD +E +N +L D ++ +F N + + K Sbjct: 660 NKEANRDARAIA----YAQNLKGIKRELSDKLENVNKNLKDFDKSFDEFKNGKNKDFSKA 715 Query: 499 AQT 501 +T Sbjct: 716 EET 718
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 94.3 bits (234), Expect = 6e-25 Identities = 69/275 (25%), Positives = 122/275 (44%), Gaps = 24/275 (8%) Query: 4 NFEGLSGKTAVITGGSGVLCQEMAKELARQGMKVAILNRNKENGQKIADEIANNEGTAIA 63 N +G+ GK A ITG + + + +A+ LA QG +A ++ N E +K+ + A A Sbjct: 2 NAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA 61 Query: 64 VTCDVLDEESVQKAYVTVKEQLGECDLLINGAGGNHPDAITDKETFEKGDIENDSLKSFF 123 DV D ++ + ++ ++G D+L+N AG P I Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHS------------------ 103 Query: 124 DLELKGFDHVFRLNLVGSLIPTQVFGKEMTN-RGGTVINISSMSAPSPMTKVPAYSAAKA 182 L + ++ F +N G ++ K M + R G+++ + S A P T + AY+++KA Sbjct: 104 -LSDEEWEATFSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKA 162 Query: 183 GIDNLTQWLAVHFADAGIRVNAIAPGFFLTKQNRNLLLKEDGS---FSERAEKIISHTPQ 239 T+ L + A+ IR N ++PG T +L E+G+ E + P Sbjct: 163 AAVMFTKCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPL 222 Query: 240 RRFGDPEDLLGTLLWLADDNTSKFVTGITVPVDGG 274 ++ P D+ +L+L +T + VDGG Sbjct: 223 KKLAKPSDIADAVLFLVSGQAGH-ITMHNLCVDGG 256
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 31.7 bits (72), Expect = 0.008 Identities = 14/90 (15%), Positives = 31/90 (34%) Query: 141 AQFVAFGVLFEVVLGVSFSIGVIVGGIITIIYTMLGGFFAVALTDFIQGLLMAFALFILP 200 L +++G+S ++ I F+ AL+ + +L+ F P Sbjct: 30 VSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFP 89 Query: 201 ILAIIEIGGFNRMGTLLGESMGTEFLQPFF 230 +L + + G + E ++P Sbjct: 90 LLTVAALMAIASHVVQYGFLISGEAIKPDI 119
>PF01206#SirA family protein Length = 76 Score = 60.5 bits (147), Expect = 4e-14 Identities = 19/68 (27%), Positives = 44/68 (64%) Query: 126 IEASGLQCPGPLLRVNETMGQLDPGQQMEITVTDFGFCTDVEAWAKKTGNTVLKNEKKED 185 ++A+GL CP P+L+ +T+ ++ G+ + + TD G D E+++K+TG+ +L+ ++++ Sbjct: 8 LDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKEEDG 67 Query: 186 KVVVVLEK 193 L++ Sbjct: 68 TYHFRLKR 75
>PF05043#Transcriptional activator Length = 493 Score = 37.2 bits (86), Expect = 2e-04 Identities = 22/132 (16%), Positives = 52/132 (39%), Gaps = 8/132 (6%) Query: 100 ELLLTRGHVKSEDLADALFISRSTLQSDLKAVKGIL-AQYDLEIESKPNYGMRATGTEMN 158 E + ++E + +IS S+L + + ++ Q+ E+ P + G E + Sbjct: 93 EFIFFNEGCQAESICKEFYISSSSLYRIISQINKVIKRQFQFEVSLTPV---QIIGNERD 149 Query: 159 LRFCLSQYVFDRRVYKSEPQSVYFDSDELGAVHNTVAEALDDNQLVMTDIAINNLVIHIA 218 +R+ +QY ++ + P F++ + + + M L + + Sbjct: 150 IRYFFAQYFSEKYYFLEWP----FENFSSEPLSQLLELVYKETSFPMNLSTHRMLKLLLV 205 Query: 219 IALRRIRDGYSV 230 L RI+ G+ + Sbjct: 206 TNLYRIKFGHFM 217
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 26.8 bits (59), Expect = 0.034 Identities = 20/112 (17%), Positives = 43/112 (38%), Gaps = 20/112 (17%) Query: 55 KRFYKEFGKISDLGQYM------IFIENEDSELVGTV---TAWHG--TVKDRLHGRLHWF 103 K ++K++ Y+ F+ ++ +G + + W+G ++D Sbjct: 44 KPYFKQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIED--------I 95 Query: 104 NVVPDFQGRGLGVPLLSKGMAMLQENHEEAF-LKVDVNNKMMVRLFISMGWK 154 V D++ +G+G LL K + +ENH L+ N + + Sbjct: 96 AVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147
>PERTACTIN#Pertactin signature. Length = 922 Score = 29.3 bits (65), Expect = 0.028 Identities = 36/121 (29%), Positives = 51/121 (42%), Gaps = 13/121 (10%) Query: 86 GSGSVGMTQAANANPDGYTATMVIAELAMYEHLGT-SPLTPEDFKPVALINYDPAALTVP 144 G+G V + + AN +T+V L H+GT PL PED P ++ D + VP Sbjct: 159 GAGGVRVERGANVTVQ--RSTIVDGGL----HIGTLQPLQPEDLPPSRVVLGDTSVTAVP 212 Query: 145 ADAPYDTVGEFIEYAKE---HPGEVSVGNAGPGSIWHVAAANLENAADIELNHVPHEGAA 201 A V F+ A E G ++ G A + A +L+ A I P GA Sbjct: 213 ASGAPAAV--FVFGANELTVDGGHITGGRAAGVAAMDGAIVHLQRAT-IRRGDAPAGGAV 269 Query: 202 P 202 P Sbjct: 270 P 270
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 51.8 bits (124), Expect = 2e-09 Identities = 66/400 (16%), Positives = 138/400 (34%), Gaps = 66/400 (16%) Query: 18 IIFFYHFIVMFS------MYVSIVTIGNFAIENFNASASTAGLVASIFIVGVLAGRAISG 71 I+ + + FS + VS+ I N +FN ++ V + F++ G A+ G Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIAN----DFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 72 YQVNRLGARKIMYIGTVLFFLTYGLYFIDGGLV-LLIAARFLNGFATGLISTALNTLATI 130 ++LG ++++ G ++ + F+ LLI ARF+ G + + Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130 Query: 131 SVPENRRGEGISYFSLSFVLGSAVGPFLGFLLLEIMS----FNTMLILVLIAVFIVALMT 186 +P+ RG+ +G VGP +G ++ + +I ++ F++ L+ Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLK 190 Query: 187 PMVRLNNITRDYK----------------------------------------------- 199 VR+ D K Sbjct: 191 KEVRIKG-HFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249 Query: 200 PEAGKFRMIDRDALPMGFSVLFMGLAYASILSFLNLYAIEVNLVTAASFFFLVYSAVVML 259 P GK L G + + S++ ++ +++ S + V++ Sbjct: 250 PGLGKNIPFMIGVL-CGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVII 308 Query: 260 TRPLTGKMMDQKGANIVLYPTFIFMAIGFYVLG--NSTTGFIMLLAGALIGLGFGNFQSI 317 + G ++D++G VL F+++ F TT + M + + G +++ Sbjct: 309 FGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368 Query: 318 AQTVCVNLADRDNVGLATSTYFIMLEVGLGFGPFFLGFLV 357 T+ + + G S + G G +G L+ Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408 Score = 34.1 bits (78), Expect = 0.001 Identities = 23/112 (20%), Positives = 46/112 (41%), Gaps = 4/112 (3%) Query: 69 ISGYQVNRLGARKIMYIGTVLFFLT-YGLYFIDGG--LVLLIAARFLNGFATGLISTALN 125 I G V+R G ++ IG ++ F+ + I F+ G + T ++ Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLS-FTKTVIS 370 Query: 126 TLATISVPENRRGEGISYFSLSFVLGSAVGPFLGFLLLEIMSFNTMLILVLI 177 T+ + S+ + G G+S + + L G + LL I + L+ + + Sbjct: 371 TIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEV 422 Score = 28.7 bits (64), Expect = 0.050 Identities = 23/131 (17%), Positives = 50/131 (38%), Gaps = 1/131 (0%) Query: 253 YSAVVMLTRPLTGKMMDQKG-ANIVLYPTFIFMAIGFYVLGNSTTGFIMLLAGALIGLGF 311 + + + GK+ DQ G ++L+ I + ++++A + G G Sbjct: 58 FMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGA 117 Query: 312 GNFQSIAQTVCVNLADRDNVGLATSTYFIMLEVGLGFGPFFLGFLVPSLGYGGLYQSLVI 371 F ++ V ++N G A ++ +G G GP G + + + L +I Sbjct: 118 AAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMI 177 Query: 372 SILVGLVIFYF 382 +I+ + Sbjct: 178 TIITVPFLMKL 188
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 59.5 bits (144), Expect = 8e-12 Identities = 77/372 (20%), Positives = 138/372 (37%), Gaps = 51/372 (13%) Query: 36 MPIFTEEFGVSATLSSLSMTITTLTLALSMLVFGSISESLGRKNIMVVSMFAASLLCILT 95 +P +F ++ T LT ++ V+G +S+ LG K +++ + ++ Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 96 ALSPNFY-VLIALRALQGVVLAGVPSIAMAYISEEIHPRSLAGAMGLYISGNALGAVFGR 154 + +F+ +LI R +QG A P++ M ++ I + A GL S A+G G Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 155 VFSGVAADYIGWHGAMLGIGIISIIATVIFWKSLRPPRNFVAQNFN-------------F 201 G+ A YI W +L I +I+II TV F L + +F+ F Sbjct: 157 AIGGMIAHYIHW-SYLLLIPMITII-TVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFF 214 Query: 202 MQLTRS---------------LLHHM-------------KNPVLVCFFFVGFLLLGANLS 233 M T S + H+ KN + G ++ G Sbjct: 215 MLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAG 274 Query: 234 IYSYVTFVFLDVPYSLSQSIVSWIFLI--FIIGIFSSMITGRFVAKFGKVKFIYIALGIT 291 S V ++ DV + LS + + + + + I I G V + G + + I + Sbjct: 275 FVSMVPYMMKDV-HQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFL 333 Query: 292 LLGVFLL-FIP--NLLIMVLGLSLFTYGFFASHSVVSGLVGENAVSNKAQAS-SLYLFFY 347 + F+ M + + G + +V+S +V + +A A SL F Sbjct: 334 SVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTS 393 Query: 348 YTGSSIGGTAAG 359 + G G Sbjct: 394 FLSEGTGIAIVG 405
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 56.6 bits (136), Expect = 2e-11 Identities = 49/208 (23%), Positives = 80/208 (38%), Gaps = 14/208 (6%) Query: 4 KTLAITGATSGIGRATVQELADDFDEIILLARNEVKAEILKKELKEMNRALKVKVIECNL 63 K ITGA GIG A + LA I + N K E + LK R + ++ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARH--AEAFPADV 66 Query: 64 ASLISVEKAALHTQENYEKIDCLINNAGV--VSLSRQETADGHELMMGTNYLGHYLLTHY 121 ++++ + ID L+N AGV L + + E N G + + Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 122 LMPILMKSEAPQIVIVSSNAYGFTTLKSDYFKGKGNVMNLYGRSKLAVLYFMQELHEQFS 181 + +M + IV V SN G M Y SK A + F + L + + Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTS----------MAAYASSKAAAVMFTKCLGLELA 176 Query: 182 DQGVRVTAVHPGAVSTNLGRTKQNEKFG 209 + +R V PG+ T++ + ++ G Sbjct: 177 EYNIRCNIVSPGSTETDMQWSLWADENG 204
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 45.5 bits (108), Expect = 5e-08 Identities = 23/126 (18%), Positives = 46/126 (36%), Gaps = 17/126 (13%) Query: 1 MRVLVVGANGQIGHQVAEKLKNKGHDPVA---------MVRKEEQVSQFKDKGIETVLGD 51 M+ LV GA G IG V+++L GH V + K+ ++ G + D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 52 LE--KDFSHAFEN--VDSVVFAAGSGGSTGA----DKTIIIDQEGAIETVDNAKRAGVKH 103 L + + F + + V + + + G + ++ + ++H Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 104 FVIISS 109 + SS Sbjct: 121 LLYASS 126
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 29.0 bits (65), Expect = 0.024 Identities = 10/61 (16%), Positives = 27/61 (44%), Gaps = 1/61 (1%) Query: 218 LATGLAYYLFASGLKNVKSSTAVTLSLAEPLTASLLGVFLVGEILDMWSWAGLIMLLMGI 277 ++ + + A+ ++ +V L + + LL L + D++ GL+ +G+ Sbjct: 878 ISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLT-TIGL 936 Query: 278 A 278 + Sbjct: 937 S 937
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 37.9 bits (88), Expect = 1e-04 Identities = 34/185 (18%), Positives = 59/185 (31%), Gaps = 13/185 (7%) Query: 494 EESEKLLNLESLLHNRVIGQKDAIGSI---SKAVRRARAGLKNPKRPIGSFIFLGPTGVG 550 + L +++ + S A++ L + + + G +G G Sbjct: 113 GIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTG 172 Query: 551 KTELARALSEAMFGEEDAMIRVDMSEYMEKHSVSRLVGSPPG-YVGYDDGGQLTEKVRRK 609 K +ARAL + + ++M+ S L G G + G + + Sbjct: 173 KELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRST--GRFEQA 230 Query: 610 PYSLILFDEIEKAHPDVFNMLLQVLDDG---RLTDSNGRTVDFRNTIIVMTSNIG-AQEL 665 + DEI D LL+VL G + D R IV +N Q + Sbjct: 231 EGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVR---IVAATNKDLKQSI 287 Query: 666 KDQKF 670 F Sbjct: 288 NQGLF 292 Score = 36.3 bits (84), Expect = 5e-04 Identities = 17/82 (20%), Positives = 31/82 (37%), Gaps = 3/82 (3%) Query: 145 PDSLQQKDNMKKDHNTPTLDSLARDLTQIARDDMLDPVIGRSSEITRVIEVLSRRTKNN- 203 P + + + D P++GRS+ + + VL+R + + Sbjct: 103 PKPFDLTELIGIIGRALAEPKRRPSKLEDDSQD-GMPLVGRSAAMQEIYRVLARLMQTDL 161 Query: 204 PVLI-GEPGVGKTAIAEGLAQQ 224 ++I GE G GK +A L Sbjct: 162 TLMITGESGTGKELVARALHDY 183
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.8 bits (69), Expect = 0.005 Identities = 23/93 (24%), Positives = 39/93 (41%), Gaps = 5/93 (5%) Query: 91 EAVETEEAPQVTEEAQPQQQVQEAPQV-----TEEAPQVTEEQPAQNTQQSEDVQAAAPE 145 E +T+E P+VT + P+Q+ E Q E P V ++P T + D + A E Sbjct: 1115 ETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKE 1174 Query: 146 QSTQSTGGSTKAQFLAAGGTEAMWQNIVMPEST 178 S+ T++ + G + P +T Sbjct: 1175 TSSNVEQPVTESTTVNTGNSVVENPENTTPATT 1207
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 45.1 bits (106), Expect = 6e-07 Identities = 22/177 (12%), Positives = 57/177 (32%), Gaps = 4/177 (2%) Query: 214 ARVKKEAEIAESENRRETEIQQAKDNEDISNEQYKREMNIAESRKEKDIKDAKILAETEK 273 ++ + S N + +A + +AE+ K++ K E + Sbjct: 1001 NNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEK--NEQDA 1058 Query: 274 ENAAARAAGQLEEEERRLEVERQRLEIREQEKQNELKLRQMERENDVQ--LEKQQVEVRR 331 A+ +E + ++ Q E+ + + + +E EK +VE + Sbjct: 1059 TETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEK 1118 Query: 332 QQAEADYYAQTKDAEARAESRMAEGKAEAEVIREKSMAEAEAIERRAKAMAEHKDVI 388 Q +Q + ++E+ + + E ++ E ++ + Sbjct: 1119 TQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKET 1175 Score = 32.0 bits (72), Expect = 0.007 Identities = 36/274 (13%), Positives = 84/274 (30%), Gaps = 21/274 (7%) Query: 210 RPQIARVKKEAEIAESENRRETEIQQAKDNEDISNEQYKREMNIAESRKEKD----IKDA 265 P A + E +++E++ + + + RE+ K + A Sbjct: 1027 PPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVA 1086 Query: 266 KILAETEKENAAARAAGQLEEEERRLEVERQRLEIREQEKQNELKLRQMEREN-----DV 320 + +ET++ E+E + +VE ++ + + +++ +Q + E + Sbjct: 1087 QSGSETKETQTTETKETATVEKEEKAKVETEKTQ-EVPKVTSQVSPKQEQSETVQPQAEP 1145 Query: 321 QLEKQQVEVRRQQAEADYYAQTKDAEARAESRMAEGKAEAEVIREKSMAEAEAIERRAKA 380 E ++ + A+ S E + E E A Sbjct: 1146 ARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPA 1205 Query: 381 MAEHKDVIILEKLIEIMPEFAKAVSDSMSNVESIRVLDSGSGDQLQSLPNTV-TGTMAKL 439 + + ++V NVE S + +L + T T A L Sbjct: 1206 TTQPTVNSESSNKPKN--RHRRSVRSVPHNVEPATT--SSNDRSTVALCDLTSTNTNAVL 1261 Query: 440 QESMGQM------TGFDLENFLGNLSSSEEADFS 467 ++ + G + + L + E ++ Sbjct: 1262 SDARAKAQFVALNVGKAVSQHISQLEMNNEGQYN 1295
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 35.2 bits (81), Expect = 3e-04 Identities = 40/165 (24%), Positives = 66/165 (40%), Gaps = 25/165 (15%) Query: 15 GKVIIGH----EKVIELVFVSMLQKGHILFESVPGTGKTMLSKAV---AKAIGGSFKRIQ 67 G ++G +++ ++ M ++ GTGK ++++A+ K G F I Sbjct: 136 GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAIN 195 Query: 68 ---FTPDVLPSDITG------LNIYNPKTQEFELRRGPVDTDILLADEINRATPRTQSAL 118 D++ S++ G T FE G L DEI Q+ L Sbjct: 196 MAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGT----LFLDEIGDMPMDAQTRL 251 Query: 119 LEVMEEKQVTIDGERIPVSEPF-IVLATQNPIES--KQGTF--DL 158 L V+++ + T G R P+ IV AT ++ QG F DL Sbjct: 252 LRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDL 296
>PF06580#Sensor histidine kinase Length = 349 Score = 41.8 bits (98), Expect = 2e-06 Identities = 20/108 (18%), Positives = 45/108 (41%), Gaps = 15/108 (13%) Query: 244 RFDTEIETALYR------IIQESVFNAMKYA-----NVDAVDVTLMTREDYLEVIVEDEG 292 +F+ +I A+ ++Q V N +K+ + + + + VE+ G Sbjct: 241 QFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300 Query: 293 EGFDMHSSPQGSGLGLFGMRERAEAIGG---TLSIKSIVGRGTKITLI 337 + ++ + +G GL +RER + + G + + G+ + LI Sbjct: 301 SLA-LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 63.3 bits (154), Expect = 6e-14 Identities = 22/115 (19%), Positives = 53/115 (46%), Gaps = 3/115 (2%) Query: 2 KIVIADDHSVVRSGFSMIINYQKDMEVVATAGDGLEAYRMVQKYEPDIILMDISMPPGES 61 I++ADD + +R+ + ++ + +V + +R + + D+++ D+ MP E+ Sbjct: 5 TILVADDDAAIRTVLNQALS-RAGYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMP-DEN 61 Query: 62 GLIATGKISQDFPDTKIIILTMYDDEEYLFHSLKNGAKGYVLKSAPDAELLDAIR 116 +I + PD +++++ + + + GA Y+ K EL+ I Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116
>FLAGELLIN#Flagellin signature. Length = 507 Score = 31.6 bits (71), Expect = 0.001 Identities = 14/44 (31%), Positives = 23/44 (52%), Gaps = 3/44 (6%) Query: 22 MHEITKWFERMRELEAGGGAGTQ---ELGALMDHVAQSKEIINR 62 ++EI +R+REL GT +L ++ D + Q E I+R Sbjct: 81 LNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDR 124
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 42.2 bits (99), Expect = 3e-06 Identities = 70/384 (18%), Positives = 136/384 (35%), Gaps = 71/384 (18%) Query: 24 IPFISEDVNIPAEQVAIITAVPVILGSVLRIPLGYYANVIGARKVFIASFILLLFPIYYI 83 +P I+ D N P + ++ S+ G ++ +G +++ + I+ F Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 84 SNTSSHIDLLIGG-FFLGIGGAMF-SVGVTSLPKYYPKEKHGLINGIYG-VGNIGTAIAS 140 S LLI F G G A F ++ + + +Y PKE G G+ G + +G + Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 141 FSAPVLAVQIGWQNTIRLLLVVLIVFIVINIFFGDRQEKLVKQPLFGQIKGII------- 193 ++A I W + L+ ++ I+ + + ++E +K IKGII Sbjct: 157 AIGGMIAHYIHWSYLL-LIPMITIITVPFLMKL-LKKEVRIKGHF--DIKGIILMSVGIV 212 Query: 194 -------NNEK----LWVISLWYF------------------------------ITFGSF 212 + + V+S F I FG+ Sbjct: 213 FFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTV 272 Query: 213 VAFTVFLPNFLITNYGIDNVDAGIRTAGFIALAT----FIRPLGGFIGDKFDPL----IA 264 F +P + + + + G + I T +GG + D+ PL I Sbjct: 273 AGFVSMVPYMMKDVHQLSTAEIG---SVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIG 329 Query: 265 LIFTFLGITIGGIILAFSPTFMLFSIGCLL--VAATAGIGNGLVFKLVPQYFNKQAGIAN 322 + F + +L + FM I +L ++ T + + +V + Q ++AG Sbjct: 330 VTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQ---QEAGAGM 386 Query: 323 GFVSMMGGLGGFFPPLILTLIHAI 346 ++ L I+ + +I Sbjct: 387 SLLNFTSFLSEGTGIAIVGGLLSI 410
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 29.2 bits (65), Expect = 0.026 Identities = 32/110 (29%), Positives = 52/110 (47%), Gaps = 14/110 (12%) Query: 148 AVLQYYLA--LQAGWFPIAGWNGFIYSILPAIALASTPMAFIA---KLTRSSMLEETNSE 202 + QY +A G A G I S A+ LA +P++F++ K R++ +EE + Sbjct: 286 GISQYIIAQRAAQGLSTSAAAAGLIAS---AVTLAISPLSFLSIADKFKRANKIEEYSQR 342 Query: 203 YVKMAKAKGISRWAVVFKH--ALRNALLPVVTYLAPLTAGI---ITGSFV 247 + K+ G S A K A+ +L + T LA +++GI T S V Sbjct: 343 FKKLG-YDGDSLLAAFHKETGAIDASLTTISTVLASVSSGISAAATTSLV 391
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 55.6 bits (134), Expect = 1e-10 Identities = 69/371 (18%), Positives = 131/371 (35%), Gaps = 34/371 (9%) Query: 4 PSRNIIIAVFMVGTFAIGMTEY--VVTGLLTQFAADLDVAIATTGLLLSVYAISVTIFGP 61 P+R +I+ + V A+G+ V+ GLL DV A G+LL++YA+ P Sbjct: 3 PNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVT-AHYGILLALYALMQFACAP 61 Query: 62 IVRLATLKFSPKLLLIILVSIFLISNIVAATAPNFEVLLFSRLLSASMHAPFFGLTMSLA 121 ++ + +F + +L++ ++ + + ATAP VL R+++ A +A Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA 121 Query: 122 MAISPPHKKTASIAAVNGGLTIAIMLGVPFGSFVGAALDWRLVFWIIAVLGLTTLIGIIL 181 I+ ++ ++ ++ G G +G F+ A L + Sbjct: 122 -DITDGDERARHFGFMSACFGFGMVAGPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCF 179 Query: 182 TTP-------NYRPKDIPKISKELSVIKNKNVLMTIFVIVFGFSGV----------FTAY 224 P ++ + V+ + + F V F Sbjct: 180 LLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGED 239 Query: 225 TFMEPMLRQITGFGTAGITISMFLFG-LGAVAGNFTAGTVQPSLLTSRIIM-TMGALGIV 282 F + I IS+ FG L ++A G V L R +M M A G Sbjct: 240 RF---------HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTG 290 Query: 283 LFIFTFMLQMPVLAYAASLLFGMGTFGTTPILNSKIIFAAKEAPALSGTLAASVFNLANS 342 + F + + A+ +L G G + +E A++ +L + Sbjct: 291 YILLAFATRGWM-AFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSI 349 Query: 343 IGATLGSALLN 353 +G L +A+ Sbjct: 350 VGPLLFTAIYA 360
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 33.6 bits (77), Expect = 0.001 Identities = 40/287 (13%), Positives = 95/287 (33%), Gaps = 18/287 (6%) Query: 37 DTDAAAISMLISAIGIGKLFGLSFAGKLSDSLGRKPMVITAGILYVIFLIAVPFSPTYGI 96 + A +L++ + + G LSD GR+P+++ + + + +P + Sbjct: 39 NDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWV 98 Query: 97 AFAFALLAGMGNSILDTSTYPALIEGFPKRASSATVLVKAFMSIGATILPLMITFFIAKE 156 + ++AG+ T A+ + + + F + A M+ + Sbjct: 99 LYIGRIVAGI------TGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGG 152 Query: 157 LFYGYT----FFIMAFVFLINAVHLTTVKFPKANMVVVEPGTDNNGENKKKAEPHFAVKP 212 L G++ FF A + +N + P+++ + ++ P + + Sbjct: 153 LMGGFSPHAPFFAAAALNGLNFL-TGCFLLPESH------KGERRPLRREALNPLASFRW 205 Query: 213 RFWREGVTVIFIGFTSVSLF-MIIQTWMATFAEEIIGMDESTAINLLSYYSFGGFITVIL 271 V + F + L + F E+ D +T L+ + + + Sbjct: 206 ARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAM 265 Query: 272 LATMLDRVFRPVTILILYPLIAIAALSALLFVTNYYVLVVIAFILGL 318 + + L+L + L F T ++ I +L Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLAS 312
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 32.9 bits (75), Expect = 0.002 Identities = 48/285 (16%), Positives = 99/285 (34%), Gaps = 14/285 (4%) Query: 41 DMPSLLGIVLIVITVPRLIMMTYGGILADNYKKSTIMFGTNSAQAV--LLLCITLLVWND 98 D+ + GI+L + + + G L+D + + ++ + + AV ++ +W Sbjct: 40 DVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLW-- 97 Query: 99 AMTLMALLSFAGLFGMLDAFFGPASTSLLPKIVDRPQLQKANAYFQGVDQVSFILGPVLA 158 L AG+ G G + + + I D + + + + GPVL Sbjct: 98 --VLYIGRIVAGITGAT----GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG 151 Query: 159 GMIMEVFDVSISFFVAFILVSLSALIILPPFIKEAAVENKVKQSQVENLKEGFNYVRQSN 218 G++ FF A L L+ L + E + + + N F + R Sbjct: 152 GLMGGF-SPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMT 210 Query: 219 FLLIGMLILITLNFFVFGTLHIAIPLLVDVYGGTPINLSYMEMSLSIGMVLGTLILGRYI 278 + M + + + + D + + + I L ++ + Sbjct: 211 VVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPV 270 Query: 279 IAKKG--RMSLYGLLATVIFYIIFSFM-DNLTLLPIMLLFIGFAM 320 A+ G R + G++A YI+ +F PIM+L + Sbjct: 271 AARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGI 315
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 79.3 bits (195), Expect = 1e-20 Identities = 29/166 (17%), Positives = 62/166 (37%), Gaps = 6/166 (3%) Query: 1 MDTKEKILDVGRQLFASYGYEGTTMTMIAGGVEIKKPSLYAHYTSKEQIFKDVLDKEVAD 60 +T++ ILDV +LF+ G T++ IA + + ++Y H+ K +F ++ + ++ Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69 Query: 61 YITFLHEAAAADDSSIKEKLYRLLVEHALDDEASMNFYYRFIKY-----QPAGLEEYIVG 115 E A L R ++ H L+ + ++ + G + Sbjct: 70 IGELELEYQAKFPGDPLSVL-REILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128 Query: 116 SFAEMESETEKIFEMILNQGKEQGEIDKSLSNTQIYRMYFLLVDGL 161 + + E+ E L E + L + + + GL Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL 174
>PF06580#Sensor histidine kinase Length = 349 Score = 54.1 bits (130), Expect = 3e-10 Identities = 60/360 (16%), Positives = 133/360 (36%), Gaps = 63/360 (17%) Query: 27 FYFIFRSISLWEIVVGIVITILFFAVYW--LTFNSRGALIYIGLSLEFIINIAMTVLFGY 84 F ++ S L ++ I I+++ + +F R + + + + + V+ G Sbjct: 30 FASLYGSPKLHSMIFNIAISLMGLVLTHAYRSFIKRQGWLKLNMGQIILRVLPACVVIGM 89 Query: 85 VYFALFIAFYVGNIRSKAGFISMYVIHLVLTVGAIIFAFFINYVLFLSHLPFLIMTILGV 144 V+F + + L+ + AF + L + ++ + + Sbjct: 90 VWFVANTSIW----------------RLLAFINTKPVAFTLPLALSIIFNVVVVTFMWSL 133 Query: 145 ILIPLNKYNRLKQEALEVKLEDANQRIAELAIVEER---HRIARDLHDTLGQKLSMIGLK 201 + + + KQ ++ + + A+L ++ + H + + L Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHF----MFNAL---------- 179 Query: 202 GELSRKLMDTDTEKAKKELQDIQNTARHALKEVREMISDMKNVNLKEELAHVKMILETAG 261 R L+ D KA++ L + R++L+ S+ + V+L +EL V L+ A Sbjct: 180 -NNIRALILEDPTKAREMLTSLSELMRYSLRY-----SNARQVSLADELTVVDSYLQLAS 233 Query: 262 IRH------DIQVETEFKDIPMLTESVLSMSLKEAVTNVVKH---SKAKQCSVMLT--ET 310 I+ + Q+ D+ V M ++ V N +KH + ++L + Sbjct: 234 IQFEDRLQFENQINPAIMDVQ-----VPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKD 288 Query: 311 NKDIFLKISDDGR---NPQALEFGNGLQGMRERLTFVNGE---FEVFHSENGFEINITVP 364 N + L++ + G G GLQ +RERL + G ++ + + +P Sbjct: 289 NGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 52.1 bits (125), Expect = 2e-10 Identities = 27/161 (16%), Positives = 57/161 (35%), Gaps = 8/161 (4%) Query: 2 IRIVIAEDQNLLLGALG-ALLDLEEDITVVGKAANGEEVLELVRETRPDICLMDIEMPVM 60 I++A+D + L AL D+ + N + + D+ + D+ MP Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITS---NAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 TGLDAAEQLKAED--CKVIILTTFARPGYFERAKKANVRGYLLKDSPSETLANSIRQIMK 118 D ++K V++++ +A + YL K L I + + Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 119 GKRIYSPELIDIAFESENPL--TPREMEIIQLLGEGKKTKA 157 + +L D + + + + EI ++L +T Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDL 161
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.2 bits (70), Expect = 0.004 Identities = 11/20 (55%), Positives = 13/20 (65%) Query: 38 LLGPSGAGKTTLVRELAGLD 57 L G G GK+TL+ L GLD Sbjct: 601 LEGTGGIGKSTLINTLVGLD 620
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 60.7 bits (147), Expect = 1e-12 Identities = 45/180 (25%), Positives = 86/180 (47%), Gaps = 9/180 (5%) Query: 163 FLTAGVSFIRERTTGTLERLLSTPIRKWEIVMGYLIGFALFTVLQSAIIAWYAIYILDML 222 F T +F R T E +L T +R +IV+G + A L A I A L Sbjct: 84 FETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAA----L 139 Query: 223 MVGVFIDVLWIILALALTAL---TLGILVSSFANNEFQMIQFIPIVVVPQIFFSG-LFNL 278 ++ +L+ + +ALT L +LG++V++ A + I + +V+ P +F SG +F + Sbjct: 140 GYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPV 199 Query: 279 DTISDLLSWIGPLTPLYYAAESLRDVMIRGYGWSDIYMNLLILLLFSLIFIVLNILVLRK 338 D + + PL ++ + +R +M+ D+ ++ L ++ +I L+ +LR+ Sbjct: 200 DQLPIVFQTAARFLPLSHSIDLIRPIMLGHPV-VDVCQHVGALCIYIVIPFFLSTALLRR 258
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 34.9 bits (80), Expect = 5e-04 Identities = 24/98 (24%), Positives = 42/98 (42%), Gaps = 11/98 (11%) Query: 66 GGVIFGHIGDRVGRKKTLIITLSLMGIATACIGFLPTYAQIGIAAPILLMLLRLIQGLGI 125 G ++G + D++G K+ L+ + + + ++ + I A R IQG G Sbjct: 65 GTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMA-------RFIQGAGA 117 Query: 126 GGEWGGALLLATEYAPKEQR----GFFGSVPQMGITIG 159 +++ Y PKE R G GS+ MG +G Sbjct: 118 AAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVG 155
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 34.1 bits (78), Expect = 7e-05 Identities = 16/75 (21%), Positives = 29/75 (38%), Gaps = 2/75 (2%) Query: 45 LIGGLDAGMTVDKMLYLSTIFVKEKYRGHGVGRRLMYEMEKQAAEIGADLIRLD--SFSW 102 IG + + + I V + YR GVG L+++ + A E + L+ + Sbjct: 76 CIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINI 135 Query: 103 EGVGFYEKLGYEVIG 117 FY K + + Sbjct: 136 SACHFYAKHHFIIGA 150
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 27.9 bits (62), Expect = 0.034 Identities = 36/206 (17%), Positives = 72/206 (34%), Gaps = 13/206 (6%) Query: 6 AGLVLAMMVTAGCSSDEDELLDFYNAFQKTVEVEKEIETVSEEFDSLESEKGELQESLEN 65 G VL + G +D + + + +I + S E + L K + +N Sbjct: 120 KGDVLLKLTALGAEADTLKTQSSL-LQARLEQTRYQILSRSIELNKLPELKLPDEPYFQN 178 Query: 66 ASREELPEISAQLVENTDARIEQLDAEVAVMGDSRSRMETSRQYIEEISNGSNREKAESL 125 S EE+ +++ + E Q ++ R + NR + S Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQ-------KELNLDKKRAERLTVLARINRYENLSR 231 Query: 126 VEAM---DVRYKAHGDMIGSYKAVLESEREIFEYLGEEDVSQDEVDERLNSLSEEYQQVE 182 VE D H + AVLE E + E + E V + ++++ + + ++ + Sbjct: 232 VEKSRLDDFSSLLH-KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQ 290 Query: 183 ENAAAFGEET-EKVNEIKKEIEDVIQ 207 F E +K+ + I + Sbjct: 291 LVTQLFKNEILDKLRQTTDNIGLLTL 316
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 33.3 bits (76), Expect = 0.002 Identities = 15/42 (35%), Positives = 23/42 (54%), Gaps = 1/42 (2%) Query: 36 LAEVQNDKAVVEIPSPVDGTVKKLHV-EEGTVTTVGETIVTI 76 LA+ + + I +PV V++L V EG V T ET++ I Sbjct: 318 LAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVI 359
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 31.4 bits (71), Expect = 0.005 Identities = 31/152 (20%), Positives = 59/152 (38%), Gaps = 15/152 (9%) Query: 16 IAEKIFGWLAWLALLAVTGFILFFALVMVNDPAFIESFRQQMQNSL-SQMDTGGVSTEQM 74 + F L + G F ALVMV +I + L + G Sbjct: 98 VGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPA 157 Query: 75 TDQMLSMLNSSWMIALYLAVPLILGIFGLLTMRR---RI-----LAGFLLLIAGILTAPM 126 M++ W L + + I+ + L+ + + RI + G +L+ GI+ Sbjct: 158 IGGMIAH-YIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVF--F 214 Query: 127 VIFVITGLIPLFFVIAAI--LLFVRKDRVITH 156 ++F + I F +++ + L+FV+ R +T Sbjct: 215 MLFTTSYSI-SFLIVSVLSFLIFVKHIRKVTD 245
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 62.0 bits (150), Expect = 5e-14 Identities = 25/162 (15%), Positives = 46/162 (28%), Gaps = 5/162 (3%) Query: 1 MGLRETNKERRRSSIIKTAKAFFVEKGFNAVHMQEIADAEGIGIATLFRYFPKKEQLILA 60 + + R I+ A F ++G ++ + EIA A G+ ++ +F K L Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61 Query: 61 AAISIMESEADAFKNILNH----PDKTAYEKIEDCFDYMKGIHISPSANTAKFNDAFQVY 116 + + P E + + F+ V Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121 Query: 117 IDNTTEPVENLLPYFEARRKIVDHFLQIIEQGKLDGTLHPER 158 + + L E+ +I IE L L R Sbjct: 122 EMAVVQQAQRNLC-LESYDRIEQTLKHCIEAKMLPADLMTRR 162
>OUTRSURFACE#Outer surface protein signature. Length = 273 Score = 30.3 bits (68), Expect = 0.013 Identities = 15/49 (30%), Positives = 19/49 (38%) Query: 93 DDEDTGEIETFTRDGKPISVDTFNSSRKTSGKEKRNGAGKDDSSVKKRA 141 DD E F DGK + +S KTS E N G+ + R Sbjct: 92 DDLSKTTFELFKEDGKTLVSRKVSSKDKTSTDEMFNEKGELSAKTMTRE 140
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 49.4 bits (118), Expect = 1e-08 Identities = 34/154 (22%), Positives = 64/154 (41%), Gaps = 20/154 (12%) Query: 202 GGVVIDIGADLTQFGYYERGALKYAGSLPVGG----NHITNDLSEAFN--TPFEVAEKVK 255 G +V+DIG T+ + Y+ S+ +GG I N + + AE++K Sbjct: 160 GSMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIK 219 Query: 256 HQYGHAFFDLASDEDIVKLPQRD---GEP-DIEVTPKDLADIIELRLEEMLLDVFTELQ- 310 H+ G A+ E +++ R+ G P + ++ + ++ L ++ V L+ Sbjct: 220 HEIGSAYPGDEVRE--IEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQ 277 Query: 311 -----EAGITRVSGGFVVTGGTVNLLGVKELLQD 339 + I G V+TGG L + LL + Sbjct: 278 CPPELASDI--SERGMVLTGGGALLRNLDRLLME 309
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 39.0 bits (91), Expect = 7e-06 Identities = 31/182 (17%), Positives = 61/182 (33%), Gaps = 24/182 (13%) Query: 2 IKDNYTNIRQEIGDEATIIAVTK---Y-HTVEETLEAYEAGVRDFGENRPEGFLEKRKAL 57 +K N + +RQ A + +V K Y H +E A A F E + R+ Sbjct: 14 LKQNLSIVRQ-AATHARVWSVVKANAYGHGIERIWSAIGA-TDGFALLNLEEAITLRERG 71 Query: 58 PADANVHFIGTLQSRKVKQIAD--------DLYYLHSLDRESIAKKIEQYSNHTVKCFIQ 109 + G ++ ++ + L +L + ++ I Sbjct: 72 WKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLD----------IY 121 Query: 110 VNVSGEDSKHGLIPEEVPEFLETLSEYEKIEVIGLMTMAPHTEDRSLISEVFKRLSELKQ 169 + V+ ++ G P+ V + L + + LM+ E IS R+ + + Sbjct: 122 LKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGISGAMARIEQAAE 181 Query: 170 KL 171 L Sbjct: 182 GL 183
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 27.7 bits (61), Expect = 0.038 Identities = 29/132 (21%), Positives = 48/132 (36%), Gaps = 16/132 (12%) Query: 13 VEYEEDVPAETSEPQ---KKQSEQNPKVTSFEQSARRRPEPVKADKPTDKKQPKDQNQNV 69 VE EE ET + Q K S+ +PK E + + + EP + + PT + N Sbjct: 1106 VEKEEKAKVETEKTQEVPKVTSQVSPKQEQSE-TVQPQAEPARENDPTVNIKEPQSQTNT 1164 Query: 70 RGKKMAEGIRGPERRKSAAKRQ--EKDRSTNRNEKETKLMTAETSNTKVCLFEPRVFSET 127 P + S+ Q + + N + T T +P V SE+ Sbjct: 1165 TADTEQ-----PAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATT----QPTVNSES 1215 Query: 128 QDIADELKHERA 139 + +H R+ Sbjct: 1216 SNKPKN-RHRRS 1226
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 183 bits (466), Expect = 1e-62 Identities = 87/148 (58%), Positives = 117/148 (79%) Query: 3 NKSIRQIKIREIISNGKVETQEDLVEKLNVYNFNVTQATVSRDIKELQLIKVPTPSGSYI 62 NK R IKIREII+ ++ETQ++LV+ L +NVTQATVSRDIKEL L+KVPT +GSY Sbjct: 2 NKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKELHLVKVPTNNGSYK 61 Query: 63 YSMPKDRKFHPLEKLGRYLMDSFVKLDYTGNLLVLKTLPGNAQSIGAIIDQLEWEEVIGT 122 YS+P D++F+PL KL R LMD+FVK+D +L+VLKT+PGNAQ+IGA++D L+WEE++GT Sbjct: 62 YSLPADQRFNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEEIMGT 121 Query: 123 ICGDDTCLLICRDEEAQLEIKDRIFNLI 150 ICGDDT L+ICR + ++ +I L+ Sbjct: 122 ICGDDTILIICRTHDDTKVVQKKILELL 149
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 30.0 bits (67), Expect = 0.029 Identities = 30/244 (12%), Positives = 75/244 (30%), Gaps = 9/244 (3%) Query: 155 EEKYKLYKESYKKFKALEEKIQDLEYKDRNRMQQLELYRHQYDELSSMGLVHGEEEQLEE 214 EK +E + LE+ ++ +++ + L++ E+ LE Sbjct: 109 SEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAA--RKADLEKALEG 166 Query: 215 EISYFNNYEKIHDTLSIMRTQLDSEYSPQVMLYEIHKSIETMSKFDETYTAFTETILESY 274 +++ TL + L++ + + ++ + Sbjct: 167 AMNFSTADSAKIKTLEAEKAALEARQ--AELEKALEGAMNFSTADSAKIKTLEAEKAALA 224 Query: 275 HLLNELDSKVSGDLSNVDYDEGTYNEKQLRLASINNLKRKYNKTTEELIDLRESLNEDIM 334 +L+ + G ++ D + A++ + + K E ++ + + I Sbjct: 225 ARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIK 284 Query: 335 QL----ENIAQSFEKLESEKSAALDEMEKLASFLQNYRVERKHFLENRIKKELHDLDMPD 390 L + LE + + L L R +K LE +K + + Sbjct: 285 TLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQ-LEAEHQKLEEQNKISE 343 Query: 391 ADFE 394 A + Sbjct: 344 ASRQ 347 Score = 29.6 bits (66), Expect = 0.037 Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 12/152 (7%) Query: 247 YEIHKSIETMSKFDETYTAFTETILESYHLLNELDSKVSGDLSNVDYDEGTYNE------ 300 + + TE + + L + D +S S + E + Sbjct: 71 LKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALE 130 Query: 301 -----KQLRLASINNLKRKYNKTTEELIDLRESLNEDIMQLENIAQSFEKLESEKSAALD 355 A I L+ + DL ++L + + + LE+EK+A Sbjct: 131 GAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 190 Query: 356 EMEKLASFLQNYRVERKHFLENRIKKELHDLD 387 +L L+ +IK + Sbjct: 191 RQAELEKALEGAM-NFSTADSAKIKTLEAEKA 221
>PF06580#Sensor histidine kinase Length = 349 Score = 25.6 bits (56), Expect = 0.033 Identities = 12/82 (14%), Positives = 31/82 (37%), Gaps = 3/82 (3%) Query: 5 IALVILIVPVFLAGLGIKYMRDSMFGVVNDPFTLTVVQFVVGLGLTIFGVWFIGGYILHR 64 + ++I V+ + + FTL + ++ + + +W + + H Sbjct: 81 LPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLALSIIFNVVVVTFMWSLLYFGWHF 140 Query: 65 ERKNKRVSERFIEQSRQSRKAQ 86 + K + I+Q + + AQ Sbjct: 141 FKNYK---QAEIDQWKMASMAQ 159
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 173 bits (439), Expect = 1e-52 Identities = 75/341 (21%), Positives = 136/341 (39%), Gaps = 36/341 (10%) Query: 4 NILVLNLGSTSTKVAIYNNLNS------LAEE--------TLRHPSSETV--KPMPEQIE 47 ILV+N GS+S K + + + LAE T + K M + + Sbjct: 2 KILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKD 61 Query: 48 YRLKAILGFLTEQNFDPSTIDIVSARGGTLKPIEGGTYNINDQMVTD--------LLESR 99 + + + + A G + + GG Y + ++TD +E Sbjct: 62 AIKLVLDALVNSDYGVIKDMSEIDAVGH--RVVHGGEYFTSSVLITDDVLKAITDCIELA 119 Query: 100 YGRHASNMSGLIADRFRNKYDCKAVITDPVVVDELVDEVRMTGL------KGIERKSIFH 153 + +N+ G+ A + D + D + + K RK FH Sbjct: 120 PLHNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFH 179 Query: 154 ALNQKAVARKYAESVHKDYEDINVIICHMGGGITAGAHRRGRVIDVNDGLSG-EGPMSPN 212 + K V+++ AE ++K E + +I CH+G G + A + G+ ID + G + EG Sbjct: 180 GTSHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGT 239 Query: 213 RTGSLPNGAFAKYVIDHQLDYDSAYELITKKGGFMSLAG-TQDALELEKQAL-SGDGSAI 270 R+GS+ + + + + ++ KK G ++G + D +LE A +GD A Sbjct: 240 RSGSIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQ 299 Query: 271 AIYEAMAVQIAKEIAARAAILKGETEQIIFTGGLAYSEYLI 311 A ++ K I + AA + G + I+FT G+ + I Sbjct: 300 LALNVFAYRVKKTIGSYAAAMGG-VDVIVFTAGIGENGPEI 339
>PF06580#Sensor histidine kinase Length = 349 Score = 35.6 bits (82), Expect = 2e-04 Identities = 21/148 (14%), Positives = 49/148 (33%), Gaps = 25/148 (16%) Query: 178 ETVDLEQIVNSIIRKFRIICMQKGIGFDVTLHAAEVQTDLKWCTFVLEQVISNSVKY--- 234 V L + + ++ +Q D++ +++ ++ N +K+ Sbjct: 214 RQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIA 273 Query: 235 --TEEADIAITSDLIEGWITLEISDEGRGIRKEDLPRIFEAGFTSTSDHGDAQSTGMGLY 292 + I + G +TLE+ + G K +STG GL Sbjct: 274 QLPQGGKILLKGTKDNGTVTLEVENTGSLALKN-----------------TKESTGTGLQ 316 Query: 293 LAHEAAEAMH---IQMRIESEYGRGTTT 317 E + ++ Q+++ + G+ Sbjct: 317 NVRERLQMLYGTEAQIKLSEKQGKVNAM 344
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 81.0 bits (200), Expect = 6e-20 Identities = 31/137 (22%), Positives = 60/137 (43%), Gaps = 2/137 (1%) Query: 2 SKILIIEDDETLFSELKMRLEDWDYTVFGVTDFSNVLNDFITVGPDLVIIDITLPKYDGF 61 + IL+ +DD + + L L Y V ++ + + DLV+ D+ +P + F Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 YWCQRIRNI-SSLPIIFLSSRDHPTDMVMSMQMGSDDYIQKPFNFDVLVAKI-QALLRRT 119 RI+ LP++ +S+++ + + + G+ DY+ KPF+ L+ I +AL Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 120 YQYQNQNIDVVKFRDAV 136 + D V Sbjct: 124 RRPSKLEDDSQDGMPLV 140
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 64.5 bits (157), Expect = 2e-13 Identities = 74/384 (19%), Positives = 140/384 (36%), Gaps = 28/384 (7%) Query: 2 NKKLLITFTVGVFLLGMMELIISGILELMSDDLGISH---AMTGQLITVYAVSFAVFGPL 58 N+ L++ + V L + +I +L + DL S+ A G L+ +YA+ P+ Sbjct: 4 NRPLIVILST-VALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPV 62 Query: 59 LVKATEKIRPKPVIIASLILFVIGNVIFGLSSTFLMLSLGRIVTAVAAAVFIVK---IMD 115 L +++ +PV++ SL + I + +L +GRIV + A V I D Sbjct: 63 LGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIAD 122 Query: 116 MTVLLSRPEIRGKMIALVYMGFSAANVFGIPIGTLIGQQFGWRIIFWLVIVIAILVGI-G 174 +T R G M A G A G +G L+G F F+ + L + G Sbjct: 123 ITDGDERARHFGFMSACFGFGMVA----GPVLGGLMG-GFSPHAPFFAAAALNGLNFLTG 177 Query: 175 ILSLVPDKKGEDLGEPLPDKILDRRNVFLYIGVTMAVLIGNYIVIGYISP-------LMT 227 L KGE + +A L+ + ++ + + Sbjct: 178 CFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG 237 Query: 228 SNGFTLKSVSIALLIAGAGGM---TGTYIGGLLVDRIGSKRTIIYMLILFMISMAILPLL 284 + F + +I + +A G + I G + R+G +R ++ +I +L Sbjct: 238 EDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297 Query: 285 YGSPALFYTNLFFWSVFQWSTSPSVQSGLVENVQGSAAMVFSWNMSGL-NLGIGIGAVIG 343 F + P++Q+ L V +++ L +L +G ++ Sbjct: 298 TRGWMAFPIMVLL--ASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLF 355 Query: 344 GVYISNFDISYAPWLSVFIVGLGL 367 + ++ W +I G L Sbjct: 356 TAIYAASITTWNGW--AWIAGAAL 377
>PF05775#Enterobacteria AfaD invasin protein Length = 142 Score = 30.3 bits (68), Expect = 0.009 Identities = 12/37 (32%), Positives = 15/37 (40%), Gaps = 3/37 (8%) Query: 247 VAIGFIPQIFCYGVMGGLGIWFGVRLAEPNPGVYIVQ 283 +A G +I C G +W R G YIVQ Sbjct: 44 LATG---RIICQDTHSGFRVWINARQEGGGAGKYIVQ 77
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 42.3 bits (99), Expect = 3e-07 Identities = 13/45 (28%), Positives = 25/45 (55%) Query: 6 RKEETKNNLLDAFWELYKEKPLTKITVKEITDKAGYNRGTFYTYF 50 +ET+ ++LD L+ ++ ++ ++ EI AG RG Y +F Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHF 52
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 611 bits (1577), Expect = 0.0 Identities = 243/1037 (23%), Positives = 458/1037 (44%), Gaps = 34/1037 (3%) Query: 3 LSDFSIRRPNFTIVVMIILLLLGAVSLTRLPLQLMPNIEPPIAAVATTYQGAGPEEVMED 62 +++F IRRP F V+ IIL++ GA+++ +LP+ P I PP +V+ Y GA + V + Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 63 VTVPIESELSSLSGLTNISSQSQES-SSVVILEFGYDTKIDDVENDIMRAVESA--DLPD 119 VT IE ++ + L +SS S + S + L F T D + + ++ A LP Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 120 EAGDPSFLKFDISMMPSIQMAVTSSGD--SVAEYQDQVDDLIT-ELENIEGVASITENGS 176 E S + S + + D V + L + GV + G+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 177 VTEEIQVNLDTEALEQYNMSQSDIAGIIEANNISIP----NATVTDTEDRTSISTRTVSE 232 +++ LD + L +Y ++ D+ ++ N I T + + S + Sbjct: 181 -QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 233 IDGVESLQELVLAELPDDGGTITLDDVAEVSIEEQSSNTLTRMNQEEALSIDVMLASDAN 292 E ++ L D G + L DVA V + ++ N + R+N + A + + LA+ AN Sbjct: 240 FKNPEEFGKVTLRVNSD-GSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298 Query: 293 ASNVNKEFNAVLDEKLDEEEFSNLTVETLYDEGEYIDIAINSVYTSLISGAVLAMIVLFA 352 A + K A L +L + V YD ++ ++I+ V +L +L +V++ Sbjct: 299 ALDTAKAIKAKL-AELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYL 357 Query: 353 FLRNLKAPLIIGISIPFSVITTFALLFFTDISINMMTLGGLALGIGMLVDNAVVVIENIY 412 FL+N++A LI I++P ++ TFA+L SIN +T+ G+ L IG+LVD+A+VV+EN+ Sbjct: 358 FLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVE 417 Query: 413 RHLSMGK-KPKQAASEGTKEVASAIIASTLTTAAVFLPVVFVSGLVGQLFTPFAITVAFS 471 R + K PK+A + ++ A++ + +AVF+P+ F G G ++ F+IT+ + Sbjct: 418 RVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSA 477 Query: 472 LLGSLFIALTVVPMLASRILTAPDENMEKIRS------ERSYMRMLRKFTR---WSLNHR 522 + S+ +AL + P L + +L + + ++ + +T L Sbjct: 478 MALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGST 537 Query: 523 
VLVLILTTLLLIVSALGIYNQGINLMPESDEGALTIEIEKEQGTIFEDTFDTVENIENEL 582 L++ L++ + + +PE D+G I+ G E T ++ + + Sbjct: 538 GRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYY 597 Query: 583 KDYPEVDTYLSNIGSSSPMMSMSEEPNKASITATLVDPADRSVTTNEF---INDIEDEIE 639 + + + + + + N +L +R+ N I+ + E+ Sbjct: 598 LKNEKANVES--VFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELG 655 Query: 640 KIDDSAEINIVPMSQSGMG---GEPNTLMLNVSDDSADRLAESEQTIIQALEDDEKIESV 696 KI D I + +G G L+ Q + A + + SV Sbjct: 656 KIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSV 715 Query: 697 ESSREEMVQELQVQVDRAAARENGLQPAQVGSALYEASNGVQATTVENNNEFLSIVVKYP 756 + E + +++VD+ A+ G+ + + + A G + + V+ Sbjct: 716 RPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQAD 775 Query: 757 DDVLSSMENFRDIQIANSEGEYVALSEVAELEEVDMLPMITRDSMEETSELTVTYASDMS 816 E+ + + ++ GE V S V P + R + + E+ A S Sbjct: 776 AKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTS 835 Query: 817 LNEAGTYVENIIEDADFSDDTHYSIGGDLEMLTDAMPQMLLALILGVLFIYLVMVAQFES 876 +A +EN+ Y G + Q + + + ++L + A +ES Sbjct: 836 SGDAMALMENLASKLP--AGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYES 893 Query: 877 FKHPFIVIMAVPLSIIGVMLALVITNNPLSIVSFVGIIMLLGIVVNNSILLVDYTNQQKE 936 + P V++ VPL I+GV+LA + N + VG++ +G+ N+IL+V++ E Sbjct: 894 WSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLME 953 Query: 937 K-GYPTLEALELSVQHRFRPIVITALTTALGMLPLALGIGEGGEMVASMGIVVIGGLTSS 995 K G +EA ++V+ R RPI++T+L LG+LPLA+ G G ++GI V+GG+ S+ Sbjct: 954 KEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSA 1013 Query: 996 TIFTLFIIPIFYSYVDK 1012 T+ +F +P+F+ + + Sbjct: 1014 TLLAIFFVPVFFVVIRR 1030 Score = 106 bits (265), Expect = 4e-25 Identities = 80/531 (15%), Positives = 195/531 (36%), Gaps = 53/531 (9%) Query: 516 RWSLNHRVLVLILTTLLLIVSALGIYNQGINLMPESDEGALTI----------EIEKEQG 565 + + + +L +L++ AL I + P A+++ ++ Sbjct: 3 NFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVT 62 Query: 566 TIFEDTFDTVENIENELKDYPEVDTYLSNIGSSSPMMSMSEEPNKASITATLVDPADRSV 625 + E + ++N+ M S S+ +IT T D + Sbjct: 63 
QVIEQNMNGIDNLMY--------------------MSSTSDSAGSVTITLTFQSGTDPDI 102 Query: 626 TTNEFINDIEDEIEKIDDSAEINIVPMSQSGMGGEPNTLMLNVSDDSADRLAE----SEQ 681 + N ++ + + + + +S + VSD+ + Sbjct: 103 AQVQVQNKLQLATPLLPQEVQQQGISVEKSSSS--YLMVAGFVSDNPGTTQDDISDYVAS 160 Query: 682 TIIQALEDDEKIESVESSREEMVQELQVQVDRAAARENGLQPAQVGSALYEASNGVQA-- 739 + L + V+ + +++ +D + L P V + L ++ + A Sbjct: 161 NVKDTLSRLNGVGDVQLFGAQ--YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQ 218 Query: 740 ----TTVENNNEFLSIVVKYPDDVLSSMENFRDIQI-ANSEGEYVALSEVAELEE-VDML 793 + SI+ + + E F + + NS+G V L +VA +E + Sbjct: 219 LGGTPALPGQQLNASIIAQ---TRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENY 275 Query: 794 PMITRDSMEETSELTVTYASDMSLNEAGTYVENIIED--ADFSDDTHYSIGGDL-EMLTD 850 +I R + + + L + A+ + + ++ + + F D + Sbjct: 276 NVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQL 335 Query: 851 AMPQMLLALILGVLFIYLVMVAQFESFKHPFIVIMAVPLSIIGVMLALVITNNPLSIVSF 910 ++ +++ L ++ ++LVM ++ + I +AVP+ ++G L ++ ++ Sbjct: 336 SIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTM 395 Query: 911 VGIIMLLGIVVNNSILLVD-YTNQQKEKGYPTLEALELSVQHRFRPIVITALTTALGMLP 969 G+++ +G++V+++I++V+ E P EA E S+ +V A+ + +P Sbjct: 396 FGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIP 455 Query: 970 LALGIGEGGEMVASMGIVVIGGLTSSTIFTLFIIPIFYSYVDKETRKMHKK 1020 +A G G + I ++ + S + L + P + + K H + Sbjct: 456 MAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHE 506
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 125 bits (315), Expect = 5e-37 Identities = 71/257 (27%), Positives = 111/257 (43%), Gaps = 15/257 (5%) Query: 3 KVAVITGSGGGLGKGIAERLAKDGFKVVVNDINAEAVNSTVEEIKAGGYEVIGVQGDVSK 62 K+A ITG+ G+G+ +A LA G + D N E + V +KA DV Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 63 KEHQFLLVQRAVEVFGRLDVFVNNAGIDVVTPFLDVDEAQLNKAFSINVNGVVFGTQAAA 122 + R G +D+ VN AG+ + + + FS+N GV +++ + Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 123 EQFKKQESKGKIINACSIAGHESYEMLSTYSATKHAVKSFTHSSAKELAPYNIRVNAYCP 182 + + S G I+ S ++ Y+++K A FT ELA YNIR N P Sbjct: 129 KYMMDRRS-GSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 183 GVAGTAM----W--DRIDEEMVKYYDHMEPGDAFKEFSGNILLGRPQEPEDVANLVSFLA 236 G T M W + E+++K G + F I L + +P D+A+ V FL Sbjct: 188 GSTETDMQWSLWADENGAEQVIK-------GSL-ETFKTGIPLKKLAKPSDIADAVLFLV 239 Query: 237 SDDSDYITGQAIVTDGG 253 S + +IT + DGG Sbjct: 240 SGQAGHITMHNLCVDGG 256
>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein signature. Length = 104 Score = 26.4 bits (58), Expect = 0.034 Identities = 14/43 (32%), Positives = 19/43 (44%), Gaps = 5/43 (11%) Query: 1 MKEYQFQATIEPDDHRSVKIINDMER--VVGHIEKDVLRKCEE 41 M E F I S ++I M+ V GH K+V CE+ Sbjct: 28 MSEMHFFLLIGISSIHSDRVILAMKDYLVGGHSRKEV---CEK 67