>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 61.2 bits (148), Expect = 1e-12 Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 34/165 (20%) Query: 121 IVTNNHVINGASKVDIRLS------------DGTKVPGEIVGADTFSDIAVVKISSEKVT 168 ++TN HV++ L +G +I D+A+VK S + Sbjct: 114 LLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQN 173 Query: 169 -------TVAEFGDSSKLTVGETAIAIGSPLG-SEYANTVTQGIVSSLNRNVSLKSEDGQ 220 A ++++ V + G P ++G ++ Sbjct: 174 KHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMWESKGKITY------------- 220 Query: 221 AISTKAIQTDTAINPGNSGGPLINIQGQVIGITSSKIATNGGTSV 265 + +A+Q D + GNSG P+ N + +VIGI + +V Sbjct: 221 -LKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWGGVPNEFNGAV 264
>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature. Length = 533 Score = 27.6 bits (61), Expect = 0.023 Identities = 16/46 (34%), Positives = 21/46 (45%), Gaps = 1/46 (2%) Query: 14 KYLKDGIAEYSKRISRFAKFEMIELSDEKTPDKASESENQ-KILEI 58 K L D I + I FA I + D P+ AS + Q KI E+ Sbjct: 262 KVLSDKIIQIYSDIKPFADIAGINVPDTGLPNSASIEQIQSKIQEL 307
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 47.7 bits (113), Expect = 3e-09 Identities = 21/104 (20%), Positives = 43/104 (41%), Gaps = 8/104 (7%) Query: 6 KRLKTKRTIENAMVQLLMEQPFDKISTVKLVEKAGISRSSFYTHYKDKYDMIEHYQSKLF 65 + +T++ I + ++L +Q S ++ + AG++R + Y H+KDK D+ Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67 Query: 66 HTF-EYIFQKHAHHK-------RDAILEVFEYLESEPLLAALLS 101 E + A R+ ++ V E +E L+ Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLME 111
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.6 bits (71), Expect = 0.009 Identities = 11/30 (36%), Positives = 14/30 (46%) Query: 32 LIGANGAGKSTFLKILAGDIEPTTGHISLG 61 L G G GKST + L G + H +G Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIG 630
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 48.9 bits (116), Expect = 2e-08 Identities = 47/230 (20%), Positives = 83/230 (36%), Gaps = 9/230 (3%) Query: 27 AETTDDKIAAQDNKISNLTAQQQEAQKQVDQIQEQVSAIQAEQSNLQAENDRLQAESKKL 86 D ++ + +KI L A++ + +K ++ +A A+ L+AE L A L Sbjct: 101 LRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADL 160 Query: 87 EGEITEL---SKNIVSRNQSL--EKQARSAQTNGAVTSYINTIVNSKSITEAISRVAAMS 141 E + S ++ ++L EK A A+ + + S + + I + A Sbjct: 161 EKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEK 220 Query: 142 EIVSANNKMLEQQKADKKAISEKQVANNDAINTVIA----NQQKLADDAQALTTKQAELK 197 ++A LE+ S A + A Q +L + Sbjct: 221 AALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADS 280 Query: 198 AAELSLAAEKATAEGEKASLLEQKAAAEAEARAAAVAEAAYKEKRASQQQ 247 A +L AEKA E EKA L Q A ++ A +E + + Sbjct: 281 AKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEA 330 Score = 37.7 bits (87), Expect = 6e-05 Identities = 33/229 (14%), Positives = 81/229 (35%), Gaps = 6/229 (2%) Query: 31 DDKIAAQDNKISNLTAQQQEAQKQVDQIQEQVSAIQAEQSNLQAENDRLQAESKKLEGEI 90 + + A + + ++ ++ + +A+ A +++L+ + S +I Sbjct: 189 EARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKI 248 Query: 91 TELSKNIVSRNQSLEKQARSAQTNGAVTSYINTIVNSKSITEAISRVAAMSEIVSANNKM 150 L + + ++ + ++ + I + AA+ + Sbjct: 249 KTLEAEKAALEARQAELEKALEGAMNFSTADS-----AKIKTLEAEKAALEAEKADLEHQ 303 Query: 151 LEQQKADKKAISEKQVANNDAINTVIANQQKLADDAQALTTKQAELKAAELSLAAEKATA 210 + A+++++ A+ +A + A QKL + + + L+ + K Sbjct: 304 SQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQL 363 Query: 211 EGEKASLLEQKAAAEAEARAAAVAEAAYKEKRASQQQSVLASANTNLTA 259 E E L EQ +EA R + + + Q + L AN+ L A Sbjct: 364 EAEHQKLEEQNKISEAS-RQSLRRDLDASREAKKQVEKALEEANSKLAA 411 Score = 29.6 bits (66), Expect = 0.025 Identities = 56/258 (21%), Positives = 105/258 (40%), Gaps = 19/258 (7%) Query: 25 AHAETTDDKIAAQDNKISNLTAQQQEAQKQVDQIQEQVSAIQAEQSNLQAENDRLQAESK 84 + KI + + + L A+Q E +K ++ +A A+ L+AE L+AE Sbjct: 239 
NFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKA 298 Query: 85 KLEGEITELSKNIVSRNQSLEKQARSAQTNGAVTSYINTIVNSKSITEAISRVAAMSEIV 144 LE + L+ N S + L+ + + + + + I+EA SR + ++ Sbjct: 299 DLEHQSQVLNANRQSLRRDLDASREAKK---QLEAEHQKLEEQNKISEA-SRQSLRRDLD 354 Query: 145 SANNKMLEQQKADKKAISEKQVANNDAINTVIANQQKLADDAQALTTKQAELKAAELSLA 204 ++ + + +K + +++ + ++ L +A + L+ A LA Sbjct: 355 ASREAKKQLEAEHQKLEEQNKISEASRQSL----RRDLDASREAKKQVEKALEEANSKLA 410 Query: 205 A-EKATAEGE--KASLLEQKAAAEAEARAAAVAEAAYKEKRASQQQSVLASANTNLTAQV 261 A EK E E K ++KA +A+ A A A KEK A Q + + L A Sbjct: 411 ALEKLNKELEESKKLTEKEKAELQAKLEAEAKAL---KEKLAKQAEEL-----AKLRAGK 462 Query: 262 QAVSESAAAPVRAKVRPT 279 + S++ A K P Sbjct: 463 ASDSQTPDAKPGNKAVPG 480
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 251 bits (642), Expect = 5e-82 Identities = 91/314 (28%), Positives = 153/314 (48%), Gaps = 20/314 (6%) Query: 1 MKKISLLLA-SLCALFLVACSNQ---KQADGKLNIVTTFYPVYEFTKQVAGDTANVELLI 56 MKK+ LL L A+ LVAC++ + KL +V T + + TK +AGD ++ ++ Sbjct: 1 MKKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIV 60 Query: 57 GAGTEPHEYEPSAKAVAKIQDADTFVYENENMET----WVPKLLDTLDKKKVKTIKATGD 112 G +PHEYEP + V K +AD Y N+ET W KL++ K + K D Sbjct: 61 PIGQDPHEYEPLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENK------D 114 Query: 113 MLLLPGGEEEEGDHDHGEEGHHHEFDPHVWLSPVRAIKLVEHIRDSLSADYPDKKETFEK 172 + G + E+G DPH WL+ I ++I LSA P+ KE +EK Sbjct: 115 YFAVSDGVDVIYLEGQNEKGKE---DPHAWLNLENGIIFAKNIAKQLSAKDPNNKEFYEK 171 Query: 173 NAAAYIEKLQSLDKAYAEGLSQ--AKQKSFVTQHAAFNYLALDYGLKQVAISGLSPDAEP 230 N Y +KL LDK + ++ A++K VT AF Y + YG+ I ++ + E Sbjct: 172 NLKEYTDKLDKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINTEEEG 231 Query: 231 SAARLAELTEYVKKNKIAYIYFEENASQALANTLSKEAGVKTDVLNPLESLTEEDTKAGE 290 + ++ L E +++ K+ ++ E + T+S++ + +S+ E+ + G+ Sbjct: 232 TPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKE-GD 290 Query: 291 NYISVMEKNLKALK 304 +Y S+M+ NL + Sbjct: 291 SYYSMMKYNLDKIA 304
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 37.3 bits (86), Expect = 2e-04 Identities = 28/134 (20%), Positives = 50/134 (37%), Gaps = 19/134 (14%) Query: 279 FIPWTDLGVTIF-DDFNAWLTGLPVIGNIVGSSTSALGTWYFPEGAMLFAFMGILIGVIY 337 I T+ GVTIF + L GNI+G +G G +L F L + Sbjct: 99 LIGLTERGVTIFAPQLDKLLQKYQKAGNILGGGAENIGDNLGKAGGILSTFQNFLGTALS 158 Query: 338 GLKEDKIISSFMNG----------AADLLSVALIVAIARGIQVIMNDGMITDTILNWGK- 386 +K D++I +G A+ L L+ +A + + + G Sbjct: 159 SMKIDELIKKQKSGGNVSSSELAKASIELINQLVDTVASLNNNV---NSFSQQLNTLGSV 215 Query: 387 ----EGLSGLSSQV 396 + L+G+ +++ Sbjct: 216 LSNTKHLNGVGNKL 229
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 406 bits (1046), Expect = e-146 Identities = 139/312 (44%), Positives = 204/312 (65%), Gaps = 5/312 (1%) Query: 4 RKIVVALGGNAIL--SSDPSAKAQQEALVETAKHLVKLIKNGDDLIITHGNGPQVGNLLL 61 +++V+ALGGNA+ S + + + +TA+ + ++I G +++ITHGNGPQVG+LLL Sbjct: 3 KRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLLL 62 Query: 62 QHLASDSEKN-PAFPLDSLVAMTEGSIGFWLKNALQNALLDEGIEKNVASVVTQVVVDKN 120 A + PA P+D AM++G IG+ ++ AL+N L G+EK V +++TQ +VDKN Sbjct: 63 HMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDKN 122 Query: 121 DPAFVNLSKPIGPFYSEEEAKAEAEKSGATFKEDAGRGWRKVVASPKPVDIKEIETIRTL 180 DPAF N +KP+GPFY EE AK A + G KED+GRGWR+VV SP P E ETI+ L Sbjct: 123 DPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKL 182 Query: 181 LNNGQVVVAAGGGGIPVVKENNGHLTGVEAVIDKDFASQRLAELVDADLFIVLTGVDYVF 240 + G +V+A+GGGG+PV+ E +G + GVEAVIDKD A ++LAE V+AD+F++LT V+ Sbjct: 183 VERGVIVIASGGGGVPVILE-DGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAA 241 Query: 241 VNYNKPNQEKLEHVNVAQLEEYIKQDQFAPGSMLPKVEAAIAFVNGRPEGKAVITSLENL 300 + Y ++ L V V +L +Y ++ F GSM PKV AAI F+ +A+I LE Sbjct: 242 LYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIE-WGGERAIIAHLEKA 300 Query: 301 GALIESESGTII 312 +E ++GT + Sbjct: 301 VEALEGKTGTQV 312
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 136 bits (343), Expect = 3e-38 Identities = 116/372 (31%), Positives = 187/372 (50%), Gaps = 20/372 (5%) Query: 49 DEGYKSYIEEVAKAYEKEAGVKVTLKTGDALGGLDKLSLDNQSGNVPDVMMAPYDRVGSL 108 D+GY + EV K +EK+ G+KVT++ D L +K +G+ PD++ +DR G Sbjct: 40 DKGYNG-LAEVGKKFEKDTGIKVTVEHPDKLE--EKFPQVAATGDGPDIIFWAHDRFGGY 96 Query: 109 GSDGQLSEVKLSDGAKTDDTTKSLVTAA--NGKVYGAPAVIESLVMYYNKDLVKDAPKTF 166 G L+E+ D A D A NGK+ P +E+L + YNKDL+ + PKT+ Sbjct: 97 AQSGLLAEIT-PDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTW 155 Query: 167 ADLENLAKDSKYAFAGEDGKTTAFLADWTNFYYTYGLLAGNGAYVFG-QNGK-DAKDIGL 224 ++ L K+ K GK+ A + + Y+T+ L+A +G Y F +NGK D KD+G+ Sbjct: 156 EEIPALDKELK-----AKGKS-ALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGV 209 Query: 225 ANDGSIAGINYAKSWYEKWPKGMQ-DTEGAGNLIQTQFQEGKTAAIIDGPWKAQAFKDAK 283 N G+ AG+ + + K M DT+ + + + F +G+TA I+GPW +K Sbjct: 210 DNAGAKAGLTFLVDLIKN--KHMNADTDYS--IAEAAFNKGETAMTINGPWAWSNIDTSK 265 Query: 284 VNYGVATIPTLPNGKEYAAFGGGKAWVIPQAVKNLEASQKFVDFLVATEQQKVLYDKTNE 343 VNYGV +PT G+ F G + I A N E +++F++ + T++ +K Sbjct: 266 VNYGVTVLPTF-KGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKP 324 Query: 344 IPANTEARSYAEGKNDELTTAVIKQFKNTQPLPNISQMSAVWDPAKNMLFDAVSGQKDAK 403 + A E D A ++ + + +PNI QMSA W + + +A SG++ Sbjct: 325 LGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVD 384 Query: 404 TAANDAVTLIKE 415 A DA T I + Sbjct: 385 EALKDAQTRITK 396
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 30.9 bits (70), Expect = 0.007 Identities = 31/159 (19%), Positives = 51/159 (32%), Gaps = 9/159 (5%) Query: 152 LPFLAYAILGIFSVQYFFYLCVEYSNATTATILQFISPVFILFYNRLVYQKRASKSAVFY 211 PF A A L + +L E + + F A+ AVF+ Sbjct: 161 APFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFF 220 Query: 212 V--LVAMLGVCLMATKG-DLSQLSMTPLALITGLLSAMGVMFNVILPQPFAKRYGFVPTV 268 + LV + L G D T + + + + ++ P A R G + Sbjct: 221 IMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRAL 280 Query: 269 GWGMILAGLFSNVLSPVYQLSFTLDIWSILICLIIAFFG 307 GMI G +L L+F W +++ G Sbjct: 281 MLGMIADGT-GYIL-----LAFATRGWMAFPIMVLLASG 313
>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE) signature. Length = 372 Score = 27.8 bits (61), Expect = 0.022 Identities = 29/133 (21%), Positives = 53/133 (39%), Gaps = 6/133 (4%) Query: 2 RRNLIDSLIQYMLIIEVNNSGSSCRLREFGEKIKRLRLAKKISRSEFCGDESELSIRQLI 61 RR ++ I+ L+ +++ + +SC EFG+ ++RL K + ++ + LS Sbjct: 215 RRLVVLDFIEGSLLTDIDANDASCSRLEFGQLLRRLTQLKMLRSADLLFVSTLLSYSFTK 274 Query: 62 RIENGESRPTLTKLKYIAERLEVEDYKLMPSYIELDKEYLELKYFLMRTPTYEDETIAQK 121 ES L L + + EV+ L+ I L+ L K + Sbjct: 275 AFNAEESSWLLLMLSLLQQPHEVD--SLLADIIGLNALLLSHKEHASFLQIF----YQVC 328 Query: 122 KESVFDKIFEEYY 134 K +EEY+ Sbjct: 329 KAIPSSLFYEEYW 341
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.2 bits (70), Expect = 0.005 Identities = 30/165 (18%), Positives = 55/165 (33%), Gaps = 35/165 (21%) Query: 31 CVALIGPNGAGKTTLLDCLLGDKLVTSGQVSIQGLPVTSSKLDYTRAYLPQENIIVQ--- 87 V L G G GK+TL++ L+G + I + K Y + + + Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDI-----GTGKDSYEQI---AGIVAYELSE 649 Query: 88 -----KLKVKELIAFFQR---IYPNPLSNQEIDQLLQFV----KQQKEQLAEKLSGGQKR 135 + + + AFF Y D Q V +++ L + G +R Sbjct: 650 MTAFRRADAEAVKAFFSSRKDRYRGAYGRYVQDHPRQVVIWCTTNKRQYLFD--ITGNRR 707 Query: 136 LFSFILTLIGRPKIVFLDEPTASMDTSTRQRFWEIVQELKAQGVT 180 + + + GR +V+L + R + + L G Sbjct: 708 F--WPVLVPGRANLVWLQK--------FRGQLFAEALHLYLAGER 742
>PF06580#Sensor histidine kinase Length = 349 Score = 38.3 bits (89), Expect = 3e-05 Identities = 66/376 (17%), Positives = 127/376 (33%), Gaps = 67/376 (17%) Query: 1 MLERLKSIHYMFWISLIFMVFPILTVVTGWLSAWHLLIDILFVVAYLGVLTTKSQRLSWL 60 L L M+F I + G + AY + +R WL Sbjct: 24 TLTGFGFASLYGSPKLHSMIFNIAISLMGLV----------LTHAYRSFI----KRQGWL 69 Query: 61 YWGILLTYVVGNTAFVAVNYIWFFFFLSNLLSYHFSVGGLKSLHVWTFLLAQVLVVGQLL 120 + + A V + +WF S F + T +A L + + Sbjct: 70 KLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAF---------INTKPVAFTLPLALSI 120 Query: 121 IFQRIEVEFLFYLLVILAFVDLMTFGLVRIRIVEDLKEAQAKQNAQINLLLAENERNRIG 180 IF + V F++ LL + F + ++ K A Q AQ+ L + +I Sbjct: 121 IFNVVVVTFMWSLL----YFGWHFFKNYKQAEIDQWKMASMAQEAQLMAL-----KAQIN 171 Query: 181 QDLHDSLGHTFAMLSVKTDLALQLFQMEAYPQVEKELKEIHQISKDSMNEVRTIVENLKS 240 + + + +E + + L + ++ + S+ + Sbjct: 172 PHF---MFNALNNIRALI--------LEDPTKAREMLTSLSELMRYSLRYSNA-----RQ 215 Query: 241 RTLTSELETVKKMLEIAGI----EVETDNQLDTASLTQELESMASMILLELVTNIIKHAK 296 +L EL V L++A I ++ +NQ++ A + ++ M++ LV N IKH Sbjct: 216 VSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQV---PPMLVQTLVENGIKHGI 272 Query: 297 ASKA-----YLKLERTEKELILTVSDDGCGFAFLKGDE----LHTVRDRV---FPFSGEV 344 A LK + + L V + G + L VR+R+ + ++ Sbjct: 273 AQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQI 332 Query: 345 SVISQKHPTEVQVRLP 360 + ++ V +P Sbjct: 333 KLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 72.6 bits (178), Expect = 3e-17 Identities = 25/122 (20%), Positives = 51/122 (41%), Gaps = 2/122 (1%) Query: 2 KVLVAEDQSMLRDAMCQLLTLQPDVESVLQAKNGQEAIQLLEKESVDIAILDVEMPVKTG 61 +LVA+D + +R + Q L+ V N + + D+ + DV MP + Sbjct: 5 TILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 62 LEVLEWIRSEKLETKVVVVTTFKRAGYFERAVKAGVDAYVLKERSIADLMQTLHTVLEGR 121 ++L I+ + + V+V++ +A + G Y+ K + +L+ + L Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122 Query: 122 KE 123 K Sbjct: 123 KR 124
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 29.3 bits (65), Expect = 0.025 Identities = 21/79 (26%), Positives = 36/79 (45%), Gaps = 2/79 (2%) Query: 205 NGK--VRLVGYKETLKKAGITYSEGLVFESKYSYDDGYALAERLISSNATAAVVTGDELA 262 NGK ++ VG KAG+T+ L+ + D Y++AE + TA + G Sbjct: 199 NGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAW 258 Query: 263 AGVLNGLADKGVSVPEDFE 281 + + + GV+V F+ Sbjct: 259 SNIDTSKVNYGVTVLPTFK 277
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 32.7 bits (74), Expect = 9e-04 Identities = 37/153 (24%), Positives = 66/153 (43%), Gaps = 7/153 (4%) Query: 67 AQSQASKQLATEKESAKNAIEKAAKNKQDEIKGAPLSDKEKAELLARVEAEKQAALKEI- 125 A +A KQ+ E A + + K ++ + L++KEKAEL A++EAE +A +++ Sbjct: 390 ASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEKLA 449 Query: 126 ENAKTMEDVKEAETIGVQAIAMVTVPKRPVAPNAAPKTTSAPQATAGTMQDVTYQSPAGK 185 + A+ + ++ + Q P A P APQA Q+ + Sbjct: 450 KQAEELAKLRAGKASDSQT------PDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKR 503 Query: 186 QLPNTGSASSAALASLGLVVATSGFALLGRKTR 218 QLP+TG ++ + L V + K + Sbjct: 504 QLPSTGETANPFFTAAALTVMATAGVAAVVKRK 536
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 28.4 bits (63), Expect = 0.027 Identities = 8/31 (25%), Positives = 14/31 (45%) Query: 11 LAADYANFEREIKRLEATGAEYAHIDIMDSH 41 A A+ +I RL GA + +++D Sbjct: 171 YAKQIASLNDQISRLTGVGAGASPNNLLDQR 201
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 130 bits (328), Expect = 1e-35 Identities = 68/344 (19%), Positives = 133/344 (38%), Gaps = 73/344 (21%) Query: 137 DKDNLESSIVRKYEWDIDKVTGGGESYKLYSKSNSK-VSIAILDSGVDLQNTGLLKNLSN 195 + + V + ++ + ++ +++++ + V +A+LD+G D + L Sbjct: 10 YQVIKQEQQVNEIPRGVEMI----QAPAVWNQTRGRGVKVAVLDTGCDADHPDL------ 59 Query: 196 HSKNYVPNKGYLGKEEGEEGIISDIQDRLGHGTAVVAQIVGDDN---INGVNPHVNINVY 252 + + + +EG+ +D GHGT V I +N + GV P ++ + Sbjct: 60 -KARIIGGRNFTDDDEGDP---EIFKDYNGHGTHVAGTIAATENENGVVGVAPEADLLII 115 Query: 253 RIFGKS-SASPDWIVKAIFDAVDDGNDIINLSTGQYLMIDGEYEDGTNDFETFLKYKKAI 311 ++ K S DWI++ I+ A++ DII++S G G D +A+ Sbjct: 116 KVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSLG-----------GPEDVPEL---HEAV 161 Query: 312 DYANQKGVIIVAALGNDSLNVSNQSDLLKLISSRKKVRKPGLVVDVPSYFSSTISVGGID 371 A ++++ A GN+ + + P ++ ISVG I+ Sbjct: 162 KKAVASQILVMCAAGNEGDGDDRTDE-----------------LGYPGCYNEVISVGAIN 204 Query: 372 RLGNLSDFSNKGDSDAIYAPAGSTLSLSELGLNNFINAEKYKEDWIFSATLGGYTYLYGN 431 + S+FSN + + AP LS + G Y G Sbjct: 205 FDRHASEFSNSNNEVDLVAPGEDILS---------------------TVPGGKYATFSGT 243 Query: 432 SFAAPKVSGAIAMIIDKYKLKDQP--YNYMFVKKILEETLPVKN 473 S A P V+GA+A+I + ++++ T+P+ N Sbjct: 244 SMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGN 287
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.8 bits (67), Expect = 0.037 Identities = 10/19 (52%), Positives = 14/19 (73%) Query: 498 TVAIVGESGSGKSTLAKIL 516 T+ I GESG+GK +A+ L Sbjct: 162 TLMITGESGTGKELVARAL 180
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 93.6 bits (232), Expect = 5e-25 Identities = 60/235 (25%), Positives = 98/235 (41%), Gaps = 13/235 (5%) Query: 3 KNVVITGATSGIGEAIARAYLEQGEDVVLTGRRIDRLEILKSEFAVSFPNQTVWTFPLDV 62 K ITGA GIGEA+AR QG + + ++ K ++ + FP DV Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHI--AAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66 Query: 63 TDMVMVKTVCSDILETIGRIDILVNNAGLALDLAPYQDYEELDMLTMLDTNVKGLMAVTH 122 D + + + I +G IDILVN AG+ L + + N G+ + Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 123 CFLPAMIKVNQGHIINMGSTAGIYAYAGAAVYSATKAAVKTFSDGLRIDTIATDIKVTTI 182 M+ G I+ +GS A Y+++KAA F+ L ++ +I+ + Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 183 QPGIVETDFST---VRFHGDKER----AASVYQGI---EALQAQDIADTVVYVTS 227 PG ETD +G ++ + GI + + DIAD V+++ S Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 53.8 bits (129), Expect = 2e-10 Identities = 54/267 (20%), Positives = 92/267 (34%), Gaps = 44/267 (16%) Query: 57 PEKIVTFDLGAADTIRALGFEKNIVGMPTKTVPTYLK-----DLVGTVKNVGSMKEPDLE 111 P +IV + + + ALG IV Y L +V +VG EP+LE Sbjct: 35 PNRIVALEWLPVELLLALG----IVPYGVADTINYRLWVSEPPLPDSVIDVGLRTEPNLE 90 Query: 112 AIAALEPDLIIASPRTQKFVDKFKEIAPTVLFQASKDDYWTSTKANIESLASAFGETSTQ 171 + ++P ++ S + IAP F S A S Sbjct: 91 LLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFS-----------DGKQPLAMARKSLT 139 Query: 172 K----------AKEELAKLDKSIQEVATKNESSDKKALAI--LLNEGKMAAFGAKSRFSF 219 + A+ LA+ + I+ + + + L + L++ M FG S F Sbjct: 140 EMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQE 199 Query: 220 LYQTLKFKPTDTKFEDSRHGQE-VSFESVKEI-NPDILFVINRTLAIGGDNSSN-DGVLE 276 + P + E + G VS + + + D+ L DNS + D ++ Sbjct: 200 ILDEYGI-PNAWQGETNFWGSTAVSIDRLAAYKDVDV-------LCFDHDNSKDMDALMA 251 Query: 277 NALIAETPAAKNGKIIQLTPDLWYLSG 303 L P + G+ Q P +W+ Sbjct: 252 TPLWQAMPFVRAGR-FQRVPAVWFYGA 277
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 37.5 bits (87), Expect = 9e-05 Identities = 30/139 (21%), Positives = 53/139 (38%), Gaps = 25/139 (17%) Query: 1 MALPTIAIVGRPNVGKSTLFNRI-----AGERISIV------------EDVEGVTRDRIY 43 M + I ++ + GK+TL + A + V E G+T Sbjct: 1 MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI 60 Query: 44 ATGEWLNRSFSMIDTGGIDDVDAPFMEQIKHQAEIAMEEADVIVFVVSGKEGITDADEYV 103 + +W N ++IDT G D A + ++ D + ++S K+G+ + Sbjct: 61 TSFQWENTKVNIIDTPGHMDFLA--------EVYRSLSVLDGAILLISAKDGVQAQTRIL 112 Query: 104 ARKLYKTHKPVILAVNKVD 122 L K P I +NK+D Sbjct: 113 FHALRKMGIPTIFFINKID 131
>SECA#SecA protein signature. Length = 901 Score = 1053 bits (2724), Expect = 0.0 Identities = 390/904 (43%), Positives = 560/904 (61%), Gaps = 71/904 (7%) Query: 1 MANILKTIIENDKG-EIRRLEKMADKVFKYEDQMAALTDDQLKAKTVEFKERYQNGESLD 59 + +L + + +RR+ K+ + + E +M L+D++LK KT EF+ R + GE L+ Sbjct: 2 LIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLE 61 Query: 60 SLLYEAFAVVREGAKRVLGLFPYKVQVMGGIVLHHGDVPEMRTGEGKTLTATMPVYLNAL 119 +L+ EAFAVVRE +KRV G+ + VQ++GG+VL+ + EMRTGEGKTLTAT+P YLNAL Sbjct: 62 NLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNAL 121 Query: 120 SGKGVHVVTVNEYLSERDATEMGELYSWLGLSVGINLATKSPMEKKEAYECDITYSTNSE 179 +GKGVHVVTVN+YL++RDA L+ +LGL+VGINL K+EAY DITY TN+E Sbjct: 122 TGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNNE 181 Query: 180 IGFDYLRDNMVVRAENMVQRPLNYALVDEVDSILIDEARTPLIVSGANAVETSQLYHMAD 239 GFDYLRDNM E VQR L+YALVDEVDSILIDEARTPLI+SG + ++Y + Sbjct: 182 YGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSS-EMYKRVN 240 Query: 240 HYVKSLNKD------------DYIIDVQSKTIGLSDSGIDRAESYF-------KLENLYD 280 + L + + +D +S+ + L++ G+ E + E+LY Sbjct: 241 KIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYS 300 Query: 281 IENVALTHFIDNALRANYIMLLDIDYVVSEEQEILIVDQFTGRTMEGRRYSDGLHQAIEA 340 N+ L H + ALRA+ + D+DY+V ++ E++IVD+ TGRTM+GRR+SDGLHQA+EA Sbjct: 301 PANIMLMHHVTAALRAHALFTRDVDYIV-KDGEVIIVDEHTGRTMQGRRWSDGLHQAVEA 359 Query: 341 KEGVPIQDETKTSASITYQNLFRMYKKLSGMTGTGKTEEEEFREIYNIRVIPIPTNRPVQ 400 KEGV IQ+E +T ASIT+QN FR+Y+KL+GMTGT TE EF IY + + +PTNRP+ Sbjct: 360 KEGVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMI 419 Query: 401 RIDHSDLLYASIESKFKAVVEDVKARYQKGQPVLVGTVAVETSDYISKKLVAAGVPHEVL 460 R D DL+Y + K +A++ED+K R KGQPVLVGT+++E S+ +S +L AG+ H VL Sbjct: 420 RKDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVL 479 Query: 461 NAKNHYREAQIIMNAGQRGAVTIATNMAGRGTDIKLG----------------------- 497 NAK H EA I+ AG AVTIATNMAGRGTDI LG Sbjct: 480 NAKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKA 539 Query: 498 
------EGVRELGGLCVIGTERHESRRIDNQLRGRSGRQGDPGESQFYLSLEDDLMKRFG 551 + V E GGL +IGTERHESRRIDNQLRGRSGRQGD G S+FYLS+ED LM+ F Sbjct: 540 DWQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFA 599 Query: 552 SERLKGIFERLNMSE-EAIESRMLTRQVEAAQKRVEGNNHDTRKQVLQYDDVMREQREII 610 S+R+ G+ +L M EAIE +T+ + AQ++VE N D RKQ+L+YDDV +QR I Sbjct: 600 SDRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAI 659 Query: 611 YAQRYDVITADRDLAPEIQAMIKRTIERVVDGHARAKQDEK---LEAILNFAKYNLLPED 667 Y+QR +++ D++ I ++ + + +D + + E+ + + K + + Sbjct: 660 YSQRNELLDVS-DVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDL 718 Query: 668 SIT--MEDLSGLSDKAIKEELFQRALKVYDSQVSKLRDEEAVKEFQKVLILRVVDNKWTD 725 I ++ L ++ ++E + ++++VY + + E ++ F+K ++L+ +D+ W + Sbjct: 719 PIAEWLDKEPELHEETLRERILAQSIEVYQRKEEVVG-AEMMRHFEKGVMLQTLDSLWKE 777 Query: 726 HIDALDQLRNAVGLRGYAQNNPVVEYQAEGFRMFNDMIGSIEFDVTRLMMKAQIH----- 780 H+ A+D LR + LRGYAQ +P EY+ E F MF M+ S++++V + K Q+ Sbjct: 778 HLAAMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEEV 837 Query: 781 ----EQERPQAERHISTTATRNIAAHQASMP---EDLDLNQIGRNELCPCGSGKKFKNCH 833 +Q R +AER + A+ ++GRN+ CPCGSGKK+K CH Sbjct: 838 EELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCGSGKKYKQCH 897 Query: 834 GKRQ 837 G+ Q Sbjct: 898 GRLQ 901
>ENTSNTHTASED#Enterobactin synthetase component D signature. Length = 234 Score = 26.9 bits (59), Expect = 0.017 Identities = 18/73 (24%), Positives = 31/73 (42%), Gaps = 5/73 (6%) Query: 6 GIDIEELASIESAVTRHEGFAKRVLTAQEMERFTSLKGRRQIEYLAGRWSAKEAFSKAMG 65 GIDIE++ S + A ++ + E + + L +SAKE+ KA Sbjct: 105 GIDIEKIMSQHT----ATELAPSIIDSDERQILQA-SLLPFPLALTLAFSAKESVYKAFS 159 Query: 66 TGISKLGFQDLEV 78 ++ GF +V Sbjct: 160 DRVTLPGFNSAKV 172
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 349 bits (898), Expect = e-121 Identities = 128/365 (35%), Positives = 185/365 (50%), Gaps = 17/365 (4%) Query: 14 RPTKALIHLGAIRQNIQQMGAHIPQGTLKLAVVKANAYGHGAVAVAKAIQDDVDGFCVSN 73 RP +A + L A++QN+ + + +VVKANAYGHG + AI DGF + N Sbjct: 3 RPIQASLDLQALKQNLSIVRQAATHARV-WSVVKANAYGHGIERIWSAI-GATDGFALLN 60 Query: 74 IDEAIELRQAGLSKPILIL-GVSEIEAVALAKEYDFTLTVAGLEWIQALLDKEVDLTGLT 132 ++EAI LR+ G PIL+L G + + + ++ T V ++AL + + L Sbjct: 61 LEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAP-LD 119 Query: 133 VHLKIDSGMGRIGFREASEVEQAQDLLQQHGVCVEGIFTHFATADEESDDYFNAQLERFK 192 ++LK++SGM R+GF+ + Q L V + +HFA A+ D + + R + Sbjct: 120 IYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEHP--DGISGAMARIE 177 Query: 193 TILASMKEVPELVHASNSATTLWHVETIFNAVRMGDAMYGLNPSGAVLDL-PYDLIPALT 251 + + SNSA TLWH E F+ VR G +YG +PSG D+ L P +T Sbjct: 178 QAA---EGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMT 234 Query: 252 LESALVHVKTVPAGACMGYGATYQADSEQVIATVPIGYADGWTRDMQN-FSVLVDGQACP 310 L S ++ V+T+ AG +GYG Y A EQ I V GYADG+ R VLVDG Sbjct: 235 LSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTM 294 Query: 311 IVGRVSMDQITIRLPKL--YPLGTKVTLIGSNGDKEITATQVATYRVTINYEVVCLLSDR 368 VG VSMD + + L +GT V L G KEI VA T+ YE++C L+ R Sbjct: 295 TVGTVSMDMLAVDLTPCPQAGIGTPVELWG----KEIKIDDVAAAAGTVGYELMCALALR 350 Query: 369 IPREY 373 +P Sbjct: 351 VPVVT 355
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 38.1 bits (88), Expect = 2e-04 Identities = 19/133 (14%), Positives = 42/133 (31%), Gaps = 15/133 (11%) Query: 21 QERKCRYSIRKLSVGAVSMIVGAVVFGTSPVLAQEGASEQPLANETQLSGESSTLTDTEK 80 YS+RKL G S+ V V G L T +T + T+ Sbjct: 4 NNTNRHYSLRKLKTGTASVAVALTVLGAG------------LVVNTNEVSAVATRSQTDT 51 Query: 81 SQPSSETELSGNKQEQERKDKQEEKIPRDYYARD--LENVETVIEKEDVETNASNGQRVD 138 + E + E + + + A + + + + ++ + Sbjct: 52 LEKVQE-RADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSE 110 Query: 139 LSSELDKLKKLEN 151 +S++ +L+ + Sbjct: 111 KASKIQELEARKA 123
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 32.8 bits (74), Expect = 0.003 Identities = 60/268 (22%), Positives = 105/268 (39%), Gaps = 21/268 (7%) Query: 72 TKIKIETFSWNDFYTKWTTGLANGNVPDISTALPNQVMEMVNSDALVPLNDSIKRIGQDK 131 T IK+ + K+ A G+ PDI ++ S L + + QDK Sbjct: 57 TGIKVTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPD--KAFQDK 114 Query: 132 FNETALNEAKIGDDYYSVPLYSHAQVMWVRTDLLKEHNIEVPKTWDQLYEASKKLKEAG- 190 + + + P+ A + DLL PKTW+++ K+LK G Sbjct: 115 LYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPNP----PKTWEEIPALDKELKAKGK 170 Query: 191 ---VYGLSVPFGTNDLMATRFLNFYVRSGGGSLLTKDLKADLTSQLAQDGIKYWVKLYKE 247 ++ L P+ T L+A + + G KD+ D + A+ G+ + V L K Sbjct: 171 SALMFNLQEPYFTWPLIAADG-GYAFKYENGKYDIKDVGVD--NAGAKAGLTFLVDLIKN 227 Query: 248 ISPQDSLNFNVLQQATLFYQGKTAFDFNSGFHIGGINANSPQLIDSIDAYPIPKIKESDK 307 ++++ + A F +G+TA N + I+ + + P K + S Sbjct: 228 KHMNADTDYSIAEAA--FNKGETAMTINGPWAWSNIDTSKVNY--GVTVLPTFKGQPSKP 283 Query: 308 DQGIETSNIPMVVWKNSKHPEVAKAFLE 335 G+ ++ I S + E+AK FLE Sbjct: 284 FVGVLSAGINAA----SPNKELAKEFLE 307
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 32.9 bits (75), Expect = 0.003 Identities = 38/146 (26%), Positives = 63/146 (43%), Gaps = 14/146 (9%) Query: 72 LSLLLCVGLCIGLAKRDKGTAAL-AGVTGYLVMTATIKALVKLFMAEGSAIDTGVIGALV 130 L+ LL + + L D L +T L+ + L+ F++ G A+ + G LV Sbjct: 135 LAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGYLV 194 Query: 131 VGIV--AVYLHNR-----YNNIQLPSALGFFGGSRFVPIVTSFSSILIGFVFFVIWPPFQ 183 + + A L Y + +L +ALG + G + +PIV SS L+G + + Sbjct: 195 LWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSS-LVGAFMGIGLILLR 253 Query: 184 QLLVST----GGYISQAGPIGTFLYG 205 S G Y++ AG I L+G Sbjct: 254 NHHQSKPIPFGPYLAIAGWIA-LLWG 278
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 29.7 bits (66), Expect = 0.026 Identities = 31/104 (29%), Positives = 45/104 (43%), Gaps = 6/104 (5%) Query: 71 FEKANPDIKVKLETIDFKSGPEKITTAIEAGTAPDVLFDAPGRIIQYGKNGKLAELNDLF 130 FEK + IKV +E D EK G PD++F A R Y ++G LAE+ Sbjct: 53 FEK-DTGIKVTVEHPD--KLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEIT--- 106 Query: 131 TDEFVKDVNNENIVQASKAGDKAYMYPISSAPFYMAMNKKMLED 174 D+ +D A + K YPI+ + NK +L + Sbjct: 107 PDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPN 150
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 448 bits (1155), Expect = e-162 Identities = 309/309 (100%), Positives = 309/309 (100%) Query: 1 MKKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIV 60 MKKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIV Sbjct: 1 MKKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIV 60 Query: 61 PIGQDPHEYEPLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVSD 120 PIGQDPHEYEPLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVSD Sbjct: 61 PIGQDPHEYEPLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVSD 120 Query: 121 GVDVIYLEGQNEKGKEDPHAWLNLENGIIFAKNIAKQLSAKDPNNKEFYEKNLKEYTDKL 180 GVDVIYLEGQNEKGKEDPHAWLNLENGIIFAKNIAKQLSAKDPNNKEFYEKNLKEYTDKL Sbjct: 121 GVDVIYLEGQNEKGKEDPHAWLNLENGIIFAKNIAKQLSAKDPNNKEFYEKNLKEYTDKL 180 Query: 181 DKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINTEEEGTPEQIKTLV 240 DKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINTEEEGTPEQIKTLV Sbjct: 181 DKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINTEEEGTPEQIKTLV 240 Query: 241 EKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKEGDSYYSMMKYNL 300 EKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKEGDSYYSMMKYNL Sbjct: 241 EKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKEGDSYYSMMKYNL 300 Query: 301 DKIAEGLAK 309 DKIAEGLAK Sbjct: 301 DKIAEGLAK 309
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 77.9 bits (192), Expect = 7e-19 Identities = 31/142 (21%), Positives = 66/142 (46%), Gaps = 3/142 (2%) Query: 3 KILLIEDDQVIRQQIGKMLSEWGFEVVLVEDFMEVLSLFVQSEPHLVLMDIGLPLFNGYH 62 IL+ +DD IR + + LS G++V + + + + LV+ D+ +P N + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 63 WCQEIRKI-SKVPIMFLSSRDQAMDIVMAINMGADDFVTKPFDQQVLLAKVQGLL--RRS 119 I+K +P++ +S+++ M + A GA D++ KPFD L+ + L + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 120 YEFGRDESLLEYAGVILNTKSM 141 ++ + ++ + +M Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAM 146
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 27.4 bits (61), Expect = 0.041 Identities = 16/80 (20%), Positives = 31/80 (38%), Gaps = 10/80 (12%) Query: 1 MKLAVIAANGQVGKAIVEEAVKRGHEVTAI--------VRSENKSQAESIIKKDLFELTK 52 MK V A G +G + + ++ GH+V I V + ++ + F+ K Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLE--LLAQPGFQFHK 58 Query: 53 DDLTGFDAVISAFGAYTPDT 72 DL + + F + + Sbjct: 59 IDLADREGMTDLFASGHFER 78
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 181 bits (461), Expect = 9e-57 Identities = 88/350 (25%), Positives = 150/350 (42%), Gaps = 48/350 (13%) Query: 4 KILVTGGAGFIGTHTVIELIQAGHQVVVVDNLVNSNRKSLEV--VERITGVEIPFYEADI 61 K LVTG AGFIG H L++AGHQVV +DNL + SL+ +E + F++ D+ Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 62 RDTDTLRDIFKQEEPTGVIHFAGLKAVGESTRIPLAYYDNNIAGTVSLLKAMEENNCKNI 121 D + + D+F V AV S P AY D+N+ G +++L+ N +++ Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121 Query: 122 IFSSSATVYGDPHTVPILE----DFPLSVTNPYGRTKLMLEEI---LTDIYKADSEWNVV 174 +++SS++VYG +P D P+S Y TK E + + +Y Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVS---LYAATKKANELMAHTYSHLYGLP----AT 174 Query: 175 LLRYFNPIGAHESGDLGENPNGIPNNLLPYVTQVAVGKLEQVQVFGDDYDTEDGTGVRDY 234 LR+F G P G P+ L T+ A+ + + + V+ G RD+ Sbjct: 175 GLRFFTVYG----------PWGRPDMALFKFTK-AMLEGKSIDVYN------YGKMKRDF 217 Query: 235 IHVVDLAKGHVAALKKIQKGSG---------------LNVYNLGTGKGYSVLEIIQNMEK 279 ++ D+A+ + I VYN+G +++ IQ +E Sbjct: 218 TYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALED 277 Query: 280 AVGCPIPYRIVERRPGDIAACYSDPAKAKAELGWEAELDITQMCEDAWRW 329 A+G ++ +PGD+ +D +G+ E + ++ W Sbjct: 278 ALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNW 327
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.013 Identities = 16/94 (17%), Positives = 32/94 (34%), Gaps = 16/94 (17%) Query: 8 IDGPASSGKSTVAKIIAKDFGFTYLDTGAMYRAATYMALKNQLGVE----------EVEA 57 ++G GKST+ + F+ +Y + + E + EA Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRADAEA 660 Query: 58 LLALL----DQHPISFGRSET--GDQLVFVGDVD 85 + A D++ ++GR Q+V + Sbjct: 661 VKAFFSSRKDRYRGAYGRYVQDHPRQVVIWCTTN 694
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 29.2 bits (65), Expect = 6e-04 Identities = 8/28 (28%), Positives = 13/28 (46%) Query: 32 KKDKFLSILTSLAGIALVLAAVWLGWPK 59 ++ F+ L + LVL W+ W K Sbjct: 450 QQQSFIDQLLAAGRWLLVLVVAWILWRK 477
>PF05043#Transcriptional activator Length = 493 Score = 300 bits (769), Expect = 1e-98 Identities = 197/488 (40%), Positives = 296/488 (60%), Gaps = 2/488 (0%) Query: 9 MRNLLSTKVQRQLRLMETLIQNRNWMKLHELAEKLGCTERILKSDLNELRIAFPSINIQS 68 MR+LLS K RQL L+E L +++ W ELAE L CTER +K DL+ ++ AFP + S Sbjct: 1 MRDLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIFHS 60 Query: 69 SVNGIMIDLEVNTSVEDIYQYFLANSQSFQLLEYMFFNEGLPIYRTIENLYFSSANLYRL 128 S NGI I ++ +E +Y +F +S F +LE++FFNEG + Y SS++LYR+ Sbjct: 61 STNGIRIINTDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSSSLYRI 120 Query: 129 GRNITKVLSSQFQIELSFTPSEIRGNEIDIRYFFAQYFSERYYFLDWPFPDLPEEDLTEF 188 I KV+ QFQ E+S TP +I GNE DIRYFFAQYFSE+YYFL+WPF + E L++ Sbjct: 121 ISQINKVIKRQFQFEVSLTPVQIIGNERDIRYFFAQYFSEKYYFLEWPFENFSSEPLSQL 180 Query: 189 ADFFYKITNYPMRFSIYRMYKLMIAISIHRVKNGHFIDLPNH-FYKEYYPLLKSIPNFQE 247 + YK T++PM S +RM KL++ +++R+K GHF+++ F + L + Sbjct: 181 LELVYKETSFPMNLSTHRMLKLLLVTNLYRIKFGHFMEVDKDSFNDQSLDFLMQAEGIEG 240 Query: 248 TLAYFSKHFGLEMTPDTIAQIFISFLQNDIFLDPQEFFNSLEDNSQARYSYQLLSQILEG 307 F + + + + + Q+F+S+ Q F+D F ++ +S SY LLS ++ Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKDSYVEKSYHLLSDFIDQ 300 Query: 308 LSKQYKITFTNHDELIWHLHNTAFFERQEIFSTPILFEQKALTIKKFEVYFPDFMGSARQ 367 +S +Y+I N D LIWHLHNTA RQE+F+ ILF+QK TI+ F+ FP F+ ++ Sbjct: 301 ISVKYQIEIENKDNLIWHLHNTAHLYRQELFTEFILFDQKGNTIRNFQNIFPKFVSDVKK 360 Query: 368 ELAQYRQAIGQHDHPEQLEHLMYTILTHAENLSTQLLENRPPIKVLIISNFDHAISLTFV 427 EL+ Y + + + HL YT +TH ++L LL+N+P +KVL++SNFD + Sbjct: 361 ELSHYLETLEVCSSSMMVNHLSYTFITHTKHLVINLLQNQPKLKVLVMSNFDQYHAKFVA 420 Query: 428 DMLSYYCNNRFTFDIWDELKTSPEILNQTDYDIIVSNFYIPGI-TKKFICRNHLSIMNLV 486 + LSYYC+N F ++W EL+ S E L + YDII+SNF IP I K+ I N+++ ++L+ Sbjct: 421 ETLSYYCSNNFELEVWTELELSKESLEDSPYDIIISNFIIPPIENKRLIYSNNINTVSLI 480 Query: 487 NHLNTLSN 494 LN + Sbjct: 481 YLLNAMMF 488
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 51.5 bits (123), Expect = 1e-08 Identities = 28/89 (31%), Positives = 33/89 (37%), Gaps = 4/89 (4%) Query: 2429 VTPSNDKPVPPTPNVPTPEVPVK-PVPAQPTPNVPTPEVPVQPTPAVSTPEVPVKPVPAV 2487 VT + P V P PV P P P E PV P+ KPV V Sbjct: 47 VTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKV 106 Query: 2488 PEQP---VVPTPAQPATPVNANPVAPTTG 2513 EQP V P ++PA+P A T Sbjct: 107 QEQPKRDVKPVESRPASPFENTAPARLTS 135 Score = 34.6 bits (79), Expect = 0.004 Identities = 17/52 (32%), Positives = 20/52 (38%), Gaps = 1/52 (1%) Query: 2447 EVPVKPVPAQPTPNVPTPEVPVQPTPAVSTPEVPVK-PVPAVPEQPVVPTPA 2497 +V P PAQP ++P AV P PV P P P P A Sbjct: 34 QVIELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEA 85 Score = 31.1 bits (70), Expect = 0.042 Identities = 21/97 (21%), Positives = 30/97 (30%), Gaps = 4/97 (4%) Query: 2425 QDKPVTPSNDKPVPPTPNVPTPEVPVKPVPA---QPTPNV-PTPEVPVQPTPAVSTPEVP 2480 + P + PV P P+ KPV QP +V P P P + + Sbjct: 75 PEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLT 134 Query: 2481 VKPVPAVPEQPVVPTPAQPATPVNANPVAPTTGKENR 2517 A +PV + P P P + R Sbjct: 135 SSTATAATSKPVTSVASGPRALSRNQPQYPARAQALR 171
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 29.1 bits (65), Expect = 0.028 Identities = 8/21 (38%), Positives = 10/21 (47%) Query: 31 DILSLTLGEPDFTTPKNIQDA 51 L L L PDF+T + D Sbjct: 191 VNLVLQLRNPDFSTAVRVADV 211
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 31.7 bits (72), Expect = 0.006 Identities = 16/55 (29%), Positives = 24/55 (43%), Gaps = 9/55 (16%) Query: 306 IVNDTVI--IDDFA-----HHPTEIIATLDAARQKYPSKEIVAVFQPHTFTRTIA 353 ++ D V+ I D H+P I + A Q P +VAVF F +T+ Sbjct: 103 LITDDVLKAITDCIELAPLHNPANIEG-IKACTQIMPDVPMVAVFDT-AFHQTMP 155
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 37.2 bits (86), Expect = 9e-06 Identities = 32/136 (23%), Positives = 60/136 (44%), Gaps = 22/136 (16%) Query: 25 SFPAEKQQLSHILEESIRKCADTFLLARDENQLLGYI-LSSPQSDNPQCLKVHSLVIESD 83 + + +S++ EE L EN +G I + S + + + + D Sbjct: 49 QYEDDDMDVSYVEEE-----GKAAFLYYLENNCIGRIKIRSNWNGY---ALIEDIAVAKD 100 Query: 84 HQRQGLGTLLLAALKEVAVELDYKGIRLESPDELLS---YFEMNGF----VDEEATLLY- 135 ++++G+GT LL E A E + G+ LE+ D +S ++ + F VD T+LY Sbjct: 101 YRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAVD---TMLYS 157 Query: 136 --ATSQGYSMIWFNPF 149 T+ ++ W+ F Sbjct: 158 NFPTANEIAIFWYYKF 173
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 31.1 bits (70), Expect = 0.008 Identities = 29/129 (22%), Positives = 41/129 (31%), Gaps = 8/129 (6%) Query: 50 LMADSLSTVEEIMRKAPTVPTHPSQGVPASPADEIQRETPGVPSHPSQDV--PSSPAEES 107 ++A L T + + P P P +PAD E P P + V P E Sbjct: 28 VVAGLLYTSVHQVIELPA-PAQPISVTMVAPADL---EPPQAVQPPPEPVVEPEPEPEPI 83 Query: 108 GSRPGPGPVRPKKLEREYNETPTRVAVSYTTAEKKAEQAGPETPTPATETVDIIRDTSRR 167 P PV +K + P V K+ + P E R TS Sbjct: 84 PEPPKEAPVVIEKPKP--KPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSST 141 Query: 168 SRREGAKPA 176 + +KP Sbjct: 142 ATAATSKPV 150
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 36.8 bits (85), Expect = 1e-05 Identities = 20/76 (26%), Positives = 35/76 (46%), Gaps = 3/76 (3%) Query: 76 IAETFGNWLEIEYLFVKEELRGQGIGSKLLQQAESEAKNRNCCFAFVNTYQFQAP--DFY 133 I + + IE + V ++ R +G+G+ LL +A AK + C + T FY Sbjct: 82 IRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFY 141 Query: 134 QKHGYKEVFSLQDYLY 149 KH + + ++ LY Sbjct: 142 AKHHFI-IGAVDTMLY 156
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 28.7 bits (64), Expect = 0.026 Identities = 11/30 (36%), Positives = 17/30 (56%) Query: 1 MKKWMLVLVSLMTALFLVACGKNSSETSGD 30 MKK +LV ++A+ LVAC +T+ Sbjct: 1 MKKLGTLLVLFLSAIILVACASGKKDTTSG 30
>PERTACTIN#Pertactin signature. Length = 922 Score = 37.0 bits (85), Expect = 4e-05 Identities = 23/88 (26%), Positives = 32/88 (36%), Gaps = 3/88 (3%) Query: 96 TVRYDRLSTPEKPIPQPNPEHPSVPTPNPELPNQETPTPDKPTPEPGTPKTETPVNPDPE 155 T RY + + P P P P+ Q P P +P P P+ P PE Sbjct: 547 TYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPE 606 Query: 156 VPTYETGKREELPNTGTEANATLASAGI 183 P + EL ANA + + G+ Sbjct: 607 APAPQPPAGREL---SAAANAAVNTGGV 631
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 80.7 bits (199), Expect = 2e-18 Identities = 53/153 (34%), Positives = 81/153 (52%), Gaps = 10/153 (6%) Query: 13 VNIGTIGHVDHGKTTLTAAI---TTVLARRLPSSVNQPKDYASIDAAPEERERGITINTA 69 +NIG + HVD GKTTLT ++ + + SV+ K D ER+RGITI T Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITE--LGSVD--KGTTRTDNTLLERQRGITIQTG 59 Query: 70 HVEYETEKRHYAHIDAPGHADYVKNMITGAAQMDGAILVVASTDGPMPQTREHILLSRQV 129 ++ E ID PGH D++ + + +DGAIL++++ DG QTR R++ Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119 Query: 130 GVKHLIVFMNKVDLVDDEELLELVEMEIRDLLS 162 G+ I F+NK+D + L V +I++ LS Sbjct: 120 GIP-TIFFINKIDQNGID--LSTVYQDIKEKLS 149
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 34.0 bits (78), Expect = 0.001 Identities = 19/94 (20%), Positives = 33/94 (35%), Gaps = 20/94 (21%) Query: 164 RIAVVGG-GYIGVELAEAFERLGKEVVLVDIVDTVLNGYYDKDFTQMMAKNLEDHNIRLA 222 + V G G+IG +++ G +VV +D LN YYD +L+ + L Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGID----NLNDYYD--------VSLKQARLELL 49 Query: 223 LGQTVKAIEGD----GKVERLITDKESFDVDMVI 252 + + D + L + V Sbjct: 50 AQPGFQFHKIDLADREGMTDLFASGH---FERVF 80
>INTIMIN#Intimin signature. Length = 939 Score = 30.4 bits (68), Expect = 0.010 Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 5/48 (10%) Query: 69 ELWPRYADERYFLSKSHKDFVDRNLFITIRDKKTTCIKPYQQDLDLPH 116 ++ P+Y +E LS S D V RN I + KK + L++PH Sbjct: 418 QIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILS-----LNIPH 460
>ANTHRAXTOXNA#Anthrax toxin LF subunit signature. Length = 800 Score = 30.1 bits (67), Expect = 0.019 Identities = 15/59 (25%), Positives = 31/59 (52%), Gaps = 1/59 (1%) Query: 170 GISKKTSNSIKEVYPDYTSKLQTIYNGYDFQTILEKSQEKIDIEIAPQSICTIGRIEEN 228 GIS + K + P++ + ++++ + D +L + K +E+ +SI I I+EN Sbjct: 176 GISLDIISKDKSLDPEFLNLIKSLSDDSDSSDLLFSQKFKEKLELNNKSI-DINFIKEN 233
>BACINVASINB#Salmonella/Shigella invasin protein B signature. Length = 593 Score = 28.2 bits (62), Expect = 0.035 Identities = 15/63 (23%), Positives = 35/63 (55%), Gaps = 4/63 (6%) Query: 87 EDLSDLPDMEELAQMSPDEFIKTLEKSIADKTKDDIEAIQSLEQVEAKEEEQEQAEQEAE 146 ++LS++ + L M FI+ + K+ + ++D+ +L++ E E++ AE + E Sbjct: 248 DNLSNVARLTMLMAM----FIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEFQEE 303 Query: 147 SKK 149 ++K Sbjct: 304 TRK 306
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 37.4 bits (87), Expect = 1e-04 Identities = 24/65 (36%), Positives = 34/65 (52%), Gaps = 9/65 (13%) Query: 337 RTAALLQKMK---------SGDASQFPIETALKVLTIEGAKALGMENQIGSLEVGKQADF 387 RT KMK +GD F ++ + TI A A G+ ++IGSLEVGK+AD Sbjct: 375 RTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLSHEIGSLEVGKRADL 434 Query: 388 LVIQP 392 ++ P Sbjct: 435 VLWNP 439
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 35.2 bits (81), Expect = 5e-04 Identities = 18/148 (12%), Positives = 53/148 (35%), Gaps = 25/148 (16%) Query: 38 DRMRQELALAEQKAMNEQQTKLAQKDQEIAQLQSQIQNF--DTEKELAKKEVEQ------ 89 + +R + EQ + + Q QK+ + + +++ + VE+ Sbjct: 183 EVLRLTSLIKEQFSTWQNQ--KYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDF 240 Query: 90 --------TSHQALLAKDKEVQALENQLATLRL---EHENQLQKTLSDLEKERNQVKNQL 138 + A+L ++ + N+L + + E+++ + + KN++ Sbjct: 241 SSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI 300 Query: 139 LLQEKENELSLASVKQNYEAQLKAASEQ 166 L + ++ ++ +L E+ Sbjct: 301 LDKLRQTTDNIGL----LTLELAKNEER 324
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 37.9 bits (88), Expect = 6e-05 Identities = 27/141 (19%), Positives = 62/141 (43%), Gaps = 13/141 (9%) Query: 52 SVIGVLFNLFGGVIADSFKR----KKIIIVANILCGIACIILSFISQEQWMVFAIVITNI 107 + G+L +L +I ++ +++ I G I+L+F ++ WM F I++ Sbjct: 253 AAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATR-GWMAFPIMV--- 308 Query: 108 ILAFMSAFSGPSYKAFTKEIVKKDSISQLNSLLEITSTIIKVTIPMVAILLYKLLGIHGV 167 +LA P+ +A V ++ QL L +++ + P++ +Y + Sbjct: 309 LLAS-GGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA----ASI 363 Query: 168 LLLDGFSFLIAASLISFIVPV 188 +G++++ A+L +P Sbjct: 364 TTWNGWAWIAGAALYLLCLPA 384
>PF01540#Adhesin lipoprotein Length = 475 Score = 26.2 bits (57), Expect = 0.026 Identities = 15/58 (25%), Positives = 31/58 (53%), Gaps = 8/58 (13%) Query: 52 INTDTYDQLVFELRRIGNNINQIARAINQSHLISQDQLQELSKGVGELIKEVDKEFQV 109 I + +L E ++I N + ++ + N++ ELSK V + I E++K+F++ Sbjct: 351 IKAEDDKKLAEENQKIKNGVEELKKINNEA--------FELSKTVNKTIAELEKKFKI 400
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 34.9 bits (80), Expect = 8e-04 Identities = 19/127 (14%), Positives = 44/127 (34%), Gaps = 6/127 (4%) Query: 390 QEKINMKVDTSEIEKEIDNY-QKELRKSHSTKFKLIEEIDNLDVEDKHYKRRKQDLDDRL 448 + V S +++E D + +LR + + L + + D L ++ Sbjct: 50 GGWVGNGVYVSGVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQM 109 Query: 449 YRMYDKIDELESSLIDAKAKKQTIEAEKLTGDNIYKVLIYFDKLYKVMNDVERRQLISAL 508 + + L S+ D A++ I + + D+ + + + I A Sbjct: 110 QDFFTSLQTLVSNAEDPAARQALIGKSEGLVNQFKT----TDQYLRDQDK-QVNIAIGAS 164 Query: 509 ISEIQVY 515 + +I Y Sbjct: 165 VDQINNY 171
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 60.3 bits (146), Expect = 2e-12 Identities = 50/263 (19%), Positives = 102/263 (38%), Gaps = 36/263 (13%) Query: 55 PERVATIAWGNHDVALALGIVPVGFSK-ANYGVSADKGVLPWTEEKIKELNGKANLFDDL 113 P R+ + W ++ LALGIVP G + NY + + LP + + ++ + Sbjct: 35 PNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLP---DSVIDVGLRTE----- 86 Query: 114 DGLNFEAISNSKPDVIL--AGYSGITKEDYDTLSKIAPVAAYK----SKPWQTLWRDMIK 167 N E ++ KP ++ AGY + L++IAP + +P + + + Sbjct: 87 --PNLELLTEMKPSFMVWSAGY----GPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTE 140 Query: 168 IDSKALGMEKEGDELIKNTEARISKELEKHPEIKGKIKGKKVLFTMINAADTSKFWIYTS 227 + + L ++ + + E I P + +L T+I D ++ Sbjct: 141 M-ADLLNLQSAAETHLAQYEDFI---RSMKPRFVKRGARPLLLTTLI---DPRHMLVFGP 193 Query: 228 KDPRANYLTDLGLVFPESLKEFESEDSF--AKEISAEEANKINDADVI-ITYGDDKTLEA 284 L + G+ ++ E +F + +S + D DV+ + + K ++A Sbjct: 194 NSLFQEILDEYGIP-----NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDA 248 Query: 285 LQKDPLLGKINAIKNGAVAVIPD 307 L PL + ++ G +P Sbjct: 249 LMATPLWQAMPFVRAGRFQRVPA 271
>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature. Length = 136 Score = 93.0 bits (231), Expect = 1e-27 Identities = 48/133 (36%), Positives = 72/133 (54%), Gaps = 11/133 (8%) Query: 1 MLKNLKSFLLRGNVIDLAVGVVIASAFGAIVTSLVNDIITPLILN-------PALKAAKV 53 ++K + F +RGNV+DLAVGV+I +AFG IV+SLV DII P + Sbjct: 3 IIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVTLR 62 Query: 54 ERIAQLSWHGVGYGNFLSAIINFIFVGTALFFIIKGIEKAQKLTGIKKEKTAEKKPTELE 113 + + + YG F+ + +F+ V A+F IK I K + K+E A PT+ E Sbjct: 63 DAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRK---KEEPAAAPAPTKEE 119 Query: 114 V-LQEIKALLEKK 125 V L EI+ LL+++ Sbjct: 120 VLLTEIRDLLKEQ 132
>PERTACTIN#Pertactin signature. Length = 922 Score = 31.6 bits (71), Expect = 0.015 Identities = 26/111 (23%), Positives = 40/111 (36%), Gaps = 23/111 (20%) Query: 340 RYR-----SNHWVPDSRPEQPSPQSTPEPSPSPQPAPNPQPAPSNPIDEKLVKEAVRKVG 394 RYR + W P+P+ P+P P P P P P P P + Sbjct: 549 RYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQ---- 604 Query: 395 DGYVFEENGVPRYIPAKDLSAETAAGIDSK---------LAKQESLSHKLG 436 E P+ ++LSA A +++ A+ +LS +LG Sbjct: 605 -----PEAPAPQPPAGRELSAAANAAVNTGGVGLASTLWYAESNALSKRLG 650
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 228 bits (582), Expect = 1e-75 Identities = 86/315 (27%), Positives = 152/315 (48%), Gaps = 19/315 (6%) Query: 7 MKKQNLFLVLLSVFLLCLGAC-GQKESQTGKGMKIVTSFYPIYAMVKEVSGDLNDVR-MI 64 MKK LVL ++ + G+K++ +G+ +K+V + I + K ++GD D+ ++ Sbjct: 1 MKKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIV 60 Query: 65 QSSSGIHSFEPSANDIAAIYDADVFVYHSHTLES----WAGSLDPNLKKSKVKVLEASEG 120 H +EP D+ +AD+ Y+ LE+ W L N KK++ K A Sbjct: 61 PIGQDPHEYEPLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVS- 119 Query: 121 MTLERVPGLEDVEAGDGVDEKTLYDPHTWLDPEKAGEEAQIIADKLSEVDSEHKETYQKN 180 G++ + +EK DPH WL+ E A+ IA +LS D +KE Y+KN Sbjct: 120 ------DGVDVIYLEGQ-NEKGKEDPHAWLNLENGIIFAKNIAKQLSAKDPNNKEFYEKN 172 Query: 181 AQAFIKKAQELTKKFQPKFEK--ATQKTFVTQHTAFSYLAKRFGLNQLGIAGISPEQEPS 238 + + K +L K+ + KF K A +K VT AF Y +K +G+ I I+ E+E + Sbjct: 173 LKEYTDKLDKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINTEEEGT 232 Query: 239 PRQLTEIQEFVKTYKVKTIFTESNASSKVAETLVKSTGV---GLKTLNPLESDPQNDKTY 295 P Q+ + E ++ KV ++F ES+ + +T+ + T + + + + +Y Sbjct: 233 PEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKEGDSY 292 Query: 296 LENLEENMSILAEEL 310 ++ N+ +AE L Sbjct: 293 YSMMKYNLDKIAEGL 307
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 27.1 bits (60), Expect = 0.046 Identities = 17/66 (25%), Positives = 28/66 (42%), Gaps = 6/66 (9%) Query: 7 MKKVMFAGLSLLSLVVLMACGEEETKKTQAAQQPKQQTTVQQIS-----VGKDVPDFTLQ 61 MKK+ + LS ++L+AC K T + Q+ K T I+ + D D Sbjct: 1 MKKLGTLLVLFLSAIILVACA-SGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSI 59 Query: 62 SMDGKE 67 G++ Sbjct: 60 VPIGQD 65
>PF05272#Virulence-associated E family protein Length = 892 Score = 32.7 bits (74), Expect = 0.001 Identities = 15/57 (26%), Positives = 22/57 (38%), Gaps = 6/57 (10%) Query: 30 KGEVVVIL-GPSGCGKSTLLRCLNGLESIQGGDILLDGQSIVENKKDFHLVRQKIGM 85 K + V+L G G GKSTL+ L GL+ I K + + + Sbjct: 594 KFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHF-----DIGTGKDSYEQIAGIVAY 645
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 181 bits (461), Expect = 2e-51 Identities = 99/469 (21%), Positives = 193/469 (41%), Gaps = 72/469 (15%) Query: 15 IRNIAIIAHVDHGKTTLVDELLKQSETLD--ARTELAERAMDSNDIEKERGITILAKNTA 72 I NI ++AHVD GKTTL + LL S + + D+ +E++RGITI T+ Sbjct: 3 IINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS 62 Query: 73 VAYNGTRINIMDTPGHADFGGEVERIMKMVDGVVLVVDAYEGTMPQTRFVLKKALEQDLV 132 + T++NI+DTPGH DF EV R + ++DG +L++ A +G QTR + + + Sbjct: 63 FQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122 Query: 133 PIVVVNKIDKPSARPAEVVDEVLELF---------IELGADDDQLDFP--VVYASAING- 180 I +NKID+ + V ++ E +EL + +F + + I G Sbjct: 123 TIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGN 182 Query: 181 ----TSSLSDDPAD------------QEATMAPIF--------------DTIIDHIPAPV 210 +S + ++ P++ + I + + Sbjct: 183 DDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSST 242 Query: 211 DNSDEPLQFQVSLLDYNDFVGRIGIGRVFRGTVKVGDQVTLSKLDGTTKNFRVTKLFGFF 270 L +V ++Y++ R+ R++ G + + D V +S + ++T+++ Sbjct: 243 HRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRIS----EKEKIKITEMYTSI 298 Query: 271 GLERREIQEAKAGDLIAVSGMEDIFVGETITPTDAVEALPILHIDEPTLQMTFLVNNSPF 330 E +I +A +G+++ + E + + + T + + P LQ T Sbjct: 299 NGELCKIDKAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPLLQTT-------- 349 Query: 331 AGKEGKWVTSRKVEER------LQAELQTDVSLRVDPTDSPDKWTVSGRGELHLSILIET 384 V K ++R L +D LR + + +S G++ + + Sbjct: 350 -------VEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCAL 402 Query: 385 MRRE-GYELQVSRPEVIVKEIDGVKCEPFERVQIDTPEEYQGSVIQSLS 432 ++ + E+++ P VI E K E +++ P + S+ S+S Sbjct: 403 LQEKYHVEIEIKEPTVIYMERPLKKAEYTIHIEVP-PNPFWASIGLSVS 450 Score = 41.0 bits (96), Expect = 1e-05 Identities = 16/77 (20%), Positives = 28/77 (36%), Gaps = 1/77 (1%) Query: 410 EPFERVQIDTPEEYQGSVIQSLSERKGEMLDMISTGNGQTRLVFLVPARGLIGYSTEFLS 469 EP+ +I P+EY + ++D N + L +PAR + Y ++ Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVD-TQLKNNEVILSGEIPARCIQEYRSDLTF 595 Query: 470 MTRGYGIMNHTFDQYLP 486 T G + Y Sbjct: 596 FTNGRSVCLTELKGYHV 612
>AEROLYSIN#Aerolysin signature. Length = 493 Score = 29.6 bits (66), Expect = 0.013 Identities = 9/25 (36%), Positives = 16/25 (64%) Query: 211 FTLNPDLAESNYRPLNQKELQIIKN 235 F+L + YRP+N++E Q +K+ Sbjct: 35 FSLGQGVCGDKYRPVNREEAQSVKS 59
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 80.5 bits (198), Expect = 3e-20 Identities = 48/182 (26%), Positives = 87/182 (47%), Gaps = 6/182 (3%) Query: 4 ILITGASGGLAQEMVKLLPND--QLILLGRNKEKLAQLYGNYS----HAELIEIDITDDS 57 ITGA+ G+ + + + L + + + N EKL ++ + HAE D+ D + Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70 Query: 58 ALEALVTDLYLRYGKIDVLINNAGYGIFEGFDQIADKDIHQMFEVNTFALMNLSRHLAAR 117 A++ + + G ID+L+N AG ++D++ F VN+ + N SR ++ Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130 Query: 118 MKESSKGHIINIVSMAGLIATGKSSLYSATKFAAIGFSNALRLELMPYGVYVTTVNPGPI 177 M + G I+ + S + + Y+++K AA+ F+ L LEL Y + V+PG Sbjct: 131 MMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGST 190 Query: 178 RT 179 T Sbjct: 191 ET 192
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 93.4 bits (232), Expect = 3e-22 Identities = 50/233 (21%), Positives = 87/233 (37%), Gaps = 57/233 (24%) Query: 215 LKSINAPF-GKNFDGRGMVISNIDTGTDYRHKAMRIDDDAKASMRFKKEDLKGTDKNYWL 273 ++ I AP GRG+ ++ +DTG D H DLK Sbjct: 26 VEMIQAPAVWNQTRGRGVKVAVLDTGCDADH-----------------PDLKA------- 61 Query: 274 SDKIPHAFNYYNGGKITVEKYDDGRDYFDPHGMHIAGILAGNDTEQDIKNFNGIDGIAPN 333 +I N+ + + E + D + HG H+AG +A + N NG+ G+AP Sbjct: 62 --RIIGGRNFTDDDEGDPEIFKDY----NGHGTHVAGTIAATE------NENGVVGVAPE 109 Query: 334 AQIFSYKMYSDAGSGFAGDETMFHAIEDSIKHNVDVVSVSSGFTGTGLVGEKYWQAIRAL 393 A + K+ + GSG + I +I+ VD++S+S G +A++ Sbjct: 110 ADLLIIKVLNKQGSGQYDW--IIQGIYYAIEQKVDIISMSLGGPEDVPELH---EAVKKA 164 Query: 394 RKAGIPMVVATGNYATSASSSSWDLVANNHLKMTDTGNVTRTAAHEDAIAVAS 446 + I ++ A GN T + + + I+V + Sbjct: 165 VASQILVMCAAGNEGDGDDR---------------TDELGYPGCYNEVISVGA 202 Score = 59.5 bits (144), Expect = 5e-11 Identities = 36/139 (25%), Positives = 54/139 (38%), Gaps = 32/139 (23%) Query: 666 PDVSAPGKNIKSTLNVINGKSTYGYMSGTSMATPIVAASTVLIRPKLKEMLERPVLKNLK 725 D+ APG++I ST+ Y SGTSMATP VA + LI+ ER Sbjct: 219 VDLVAPGEDILSTVP----GGKYATFSGTSMATPHVAGALALIKQLANASFER------- 267 Query: 726 GDDKIDLTSLT-KIALQNTARPMMDATSWKEKSQYFASPRQQGAGLINVANALRNEVVAT 784 DLT L P+ + SP+ +G GL+ + ++ Sbjct: 268 -----DLTEPELYAQLIKRTIPLGN------------SPKMEGNGLLYLTAVEE---LSR 307 Query: 785 FKNTDSKGLVNSYGSISLK 803 +T + S S+ +K Sbjct: 308 IFDTQRVAGILSTASLKVK 326
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 28.3 bits (63), Expect = 0.044 Identities = 16/82 (19%), Positives = 29/82 (35%), Gaps = 4/82 (4%) Query: 163 IATASIAFWTKQSGAMIYIFYMFNDFAKYPI--SIYNSLLR-WLISFIVPFAFTAYYPAS 219 +++ + SG + + ++Y S + +VP A+ Sbjct: 856 YDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAA 915 Query: 220 YFLQEK-DVFFNVGGLMLISLV 240 +K DV+F VG L I L Sbjct: 916 TLFNQKNDVYFMVGLLTTIGLS 937
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 38.8 bits (90), Expect = 3e-06 Identities = 28/135 (20%), Positives = 51/135 (37%), Gaps = 10/135 (7%) Query: 11 EVLAKIAKQAFRETFAYDNTEEQLQE-YFEEAYSLKTLSTELGNPDSETYFIMHEEEIAG 69 V ++ + Y TEE+ + YF++ + + + E G Sbjct: 21 VVFGRMIPAFENGVWTY--TEERFSKPYFKQYEDDDMDVSYVEEEGKAAFLYYLENNCIG 78 Query: 70 FLKVNWGSAQTERELEDAFEIQRLYVLQKFQGFGLGKQLFEFALELATKNSFSWAWLGVW 129 +K+ S L I+ + V + ++ G+G L A+E A +N F L Sbjct: 79 RIKIR--SNWNGYAL-----IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQ 131 Query: 130 EHNTKAQAFYNRYGF 144 + N A FY ++ F Sbjct: 132 DINISACHFYAKHHF 146
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 85.7 bits (212), Expect = 3e-19 Identities = 45/139 (32%), Positives = 63/139 (45%), Gaps = 18/139 (12%) Query: 439 IMGHVDHGKTTLLDTLRNSRVATGEAG------------------GITQHIGAYQIVENG 480 ++ HVD GKTTL ++L + A E G GIT G Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67 Query: 481 KKITFLDTPGHAAFTSMRARGASVTDITILVVAADDGVMPQTIEAINHSKAANVPIIVAI 540 K+ +DTPGH F + R SV D IL+++A DGV QT + + +P I I Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127 Query: 541 NKIDKPGANPERVIGELAE 559 NKID+ G + V ++ E Sbjct: 128 NKIDQNGIDLSTVYQDIKE 146
>PF06580#Sensor histidine kinase Length = 349 Score = 30.2 bits (68), Expect = 0.007 Identities = 15/77 (19%), Positives = 32/77 (41%), Gaps = 4/77 (5%) Query: 15 SYLFFVFGLSQLTLIVQNYWQFSSQIGNFVWIQNILSLLFSGVMIWILVKTGHGYLFRIP 74 + ++ + + F+S G+ I ++ S +M +L H Y I Sbjct: 9 NKYYWYCQGIGWGVYTLTGFGFASLYGSPKLHSMIFNIAIS-LMGLVLT---HAYRSFIK 64 Query: 75 RKKWLWYSILTVLVVVL 91 R+ WL ++ +++ VL Sbjct: 65 RQGWLKLNMGQIILRVL 81
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 148 bits (375), Expect = 7e-42 Identities = 72/367 (19%), Positives = 136/367 (37%), Gaps = 66/367 (17%) Query: 2 SKIIGIDLGTTNSAVAVLEGTESKIIANPEGNRTTPSVV-------SFKNGEIIVGDAAK 54 S + IDLGT N+ + V I+ N PSVV VG AK Sbjct: 10 SNDLSIDLGTANTLIYVKGQ---GIVLN------EPSVVAIRQDRAGSPKSVAAVGHDAK 60 Query: 55 RQAVTNPDTVISIKSKMGTSEKVSANGKEYTPQEISAMILQYLKGYAEDYLGEKVTKAVI 114 + P + +I+ K + +++ ++ + + + ++ Sbjct: 61 QMLGRTPGNIAAIRPM-----KDGVIADFFVTEKMLQHFIKQVHS---NSFMRPSPRVLV 112 Query: 115 TVPAYFNDAQRQATKDAGKIAGLEVERIVNEPTAAALAYGLDKTDKEEKILVFDLGGGTF 174 VP +R+A +++ + AG ++ EP AAA+ GL + +V D+GGGT Sbjct: 113 CVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGL-PVSEATGSMVVDIGGGTT 171 Query: 175 DVSILELGDGVFDVLSTAGDNKLGGDDFDQKIIDHLVAEFKKENGIDLSTDKMAMQRLKD 234 +V+++ L V + ++GGD FD+ II+++ + G + Sbjct: 172 EVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYGSLIG-------------EA 213 Query: 235 AAEKAKKDLS----GVTSTQISLPFITAGEAGPLHLEMTLTRAKFDDL----------TR 280 AE+ K ++ G +I + E P + + + L Sbjct: 214 TAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLN-SNEILEALQEPLTGIVSAVM 272 Query: 281 DLVERTKVPVRQALSDAGLSLSEIDEVILVGGSTRIPAVVEAVKAETGKEPNKSVNPDEV 340 +E+ + +S+ G ++L GG + + + ETG + +P Sbjct: 273 VALEQCPPELASDISERG--------MVLTGGGALLRNLDRLLMEETGIPVVVAEDPLTC 324 Query: 341 VAMGAAI 347 VA G Sbjct: 325 VARGGGK 331
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 27.2 bits (60), Expect = 0.004 Identities = 8/22 (36%), Positives = 17/22 (77%) Query: 36 IIDWVLLIVFAIQISYIFWRLS 57 I+ ++L+++F Q++ IFWR+ Sbjct: 17 ILFYLLMLLFCQQLAMIFWRIG 38
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.3 bits (71), Expect = 0.008 Identities = 19/79 (24%), Positives = 31/79 (39%), Gaps = 13/79 (16%) Query: 358 FDMLLRIKEEYPQHPVIYLTENGT------ALKE------VKPEGENDIIDDSKRIRYIE 405 FD+L RIK+ P PV+ ++ T A ++ KP ++I R Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122 Query: 406 QHLHKVLE-ARDRGVNIQG 423 + LE G+ + G Sbjct: 123 KRRPSKLEDDSQDGMPLVG 141
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 29.2 bits (65), Expect = 0.043 Identities = 9/35 (25%), Positives = 18/35 (51%) Query: 74 NQSTVAIISLVACFGIAYRLSEGYGTDGPSAGIIA 108 + + ++ + IA R ++G T +AG+IA Sbjct: 277 TKVLGNVGKGISQYIIAQRAAQGLSTSAAAAGLIA 311
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 230 bits (587), Expect = 4e-70 Identities = 108/451 (23%), Positives = 206/451 (45%), Gaps = 41/451 (9%) Query: 9 KRRTFAIISHPDAGKTTITEQLLYFGGEIREAGTVKGKKTGTFAKSDWMDIEKQRGISVT 68 K +++H DAGKTT+TE LLY G I E G+V GT ++D +E+QRGI++ Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVD---KGT-TRTDNTLLERQRGITIQ 57 Query: 69 SSVMQFDYDGKRVNILDTPGHEDFSEDTYRTLMAVDAAVMVVDSAKGIEAQTKKLFEVVK 128 + + F ++ +VNI+DTPGH DF + YR+L +D A++++ + G++AQT+ LF ++ Sbjct: 58 TGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALR 117 Query: 129 HRGIPVFTFMNKLDRDGREPLDLLQELEEILGIASYPMNWPIGMGKAFEGLYDLYNQRLE 188 GIP F+NK+D++G + + Q+++E L + + N + Sbjct: 118 KMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIK----------QKVELYPNMCVT 167 Query: 189 LYKGDERFASLEDGDKLFGSNPFYEQVKDDIELLNEAGNEFSEEAILAGELTPVFFGSAL 248 + E++ ++ +G+ + E+ L + L PV+ GSA Sbjct: 168 NFTESEQWDTVIEGN-----DDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAK 222 Query: 249 TNFGVQTFLEIFLKFAPEPHGHKKTDGEIVDPYDKDFSGFVFKIQANMDPRHRDRIAFVR 308 N G+ +E+ + G VFKI+ + R R+A++R Sbjct: 223 NNIGIDNLIEVITNKFYSS----------THRGQSELCGKVFKIE--YSEK-RQRLAYIR 269 Query: 309 IVSGEFERGMSVNLPRTGKGAKLSNVTQFMAESRENVINAVAGDIIGVYDTG---TYQVG 365 + SG SV + K K++ + + + A +G+I+ + + +G Sbjct: 270 LYSGVLHLRDSVRISEKEK-IKITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSVLG 328 Query: 366 DTLTVGKNKFEFEPLPTFTPEIFMKVSAKNVMKQKSFHKGIEQLVQEG-AVQLYKNYQTG 424 DT + + + PLP + V +++ + ++ ++ Y + T Sbjct: 329 DTKLLPQRERIENPLPL----LQTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATH 384 Query: 425 EYMLGAVGQLQFEVFKHRMEGEYNAEVVMSP 455 E +L +G++Q EV ++ +Y+ E+ + Sbjct: 385 EIILSFLGKVQMEVTCALLQEKYHVEIEIKE 415
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 127 bits (319), Expect = 8e-38 Identities = 78/254 (30%), Positives = 134/254 (52%), Gaps = 13/254 (5%) Query: 3 LEHKNIFITGSSRGIGLAIAHKFAQAGANIV-LNSRGAISEELLAEFSNYGIKVVPISGD 61 +E K FITG+++GIG A+A A GA+I ++ E++++ D Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 62 VSDFADAKRMIDQAIAELGSVDVLVNNAGITQDTLMLKMTEADFEKVLKVNLTGAFNMTQ 121 V D A + + E+G +D+LVN AG+ + L+ +++ ++E VN TG FN ++ Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 122 SVLKPMMKAREGAIINMSSVVGLMGNIGQANYAASKAGLIGFTKSVAREVASRNIRVNVI 181 SV K MM R G+I+ + S + A YA+SKA + FTK + E+A NIR N++ Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 182 APGMIESDMTAIL------SDKIKEATLAQ----IPMKEFGQAEQVADLTVFLAGQD--Y 229 +PG E+DM L ++++ + +L IP+K+ + +AD +FL + Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245 Query: 230 LTGQVIAIDGGLSM 243 +T + +DGG ++ Sbjct: 246 ITMHNLCVDGGATL 259
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 132 bits (334), Expect = 5e-38 Identities = 80/346 (23%), Positives = 138/346 (39%), Gaps = 42/346 (12%) Query: 6 NIIVTGGAGFIGSNFVHYVYENFPDVHVTVLDKLT--YAGN--RANIEEILGNRVELVVG 61 +VTG AGFIG + + E V +D L Y + +A +E + + Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEA-GH-QVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59 Query: 62 DIADAELVDKLAA--QADAIVHYAAESHNDNSLNDPSPFIHTNFIGTYTLLEAARKYDIR 119 D+AD E + L A + + SL +P + +N G +LE R I+ Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119 Query: 120 FHHV--STDEVYGDLPLREDLPGHGEGPGEKFTAETKYNPSSPYSSTKAASDLIVKAWVR 177 H + S+ VYG L +P + + +P S Y++TK A++L+ + Sbjct: 120 -HLLYASSSSVYG---LNRKMPFSTDDSVD--------HPVSLYAATKKANELMAHTYSH 167 Query: 178 SFGVKATISNCSNNYGPYQHIEKFIPRQITNILSGIKPKLYGEGKNVRDWIHTND----- 232 +G+ AT YGP+ + + + +L G +Y GK RD+ + +D Sbjct: 168 LYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAI 227 Query: 233 --------HSSGVWTILTKGQI-----GETYLIGADGEKNNKEVLELILKEMGQAADAYD 279 H+ WT+ T Y IG + ++ + +G A + Sbjct: 228 IRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKK-N 286 Query: 280 HVTDRAGHDLRYAIDASKLRDELGWKPEFTNFEAGLKATIKWYTDN 325 + + G L + D L + +G+ PE T + G+K + WY D Sbjct: 287 MLPLQPGDVLETSADTKALYEVIGFTPE-TTVKDGVKNFVNWYRDF 331
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 28.8 bits (64), Expect = 0.006 Identities = 20/75 (26%), Positives = 32/75 (42%), Gaps = 7/75 (9%) Query: 48 LAYDGAEVIGFLTVQETLFE-AEVLQIAVKGAYQGQGIASAL------FAQLPTDKEIFL 100 L Y IG + ++ A + IAV Y+ +G+ +AL +A+ + L Sbjct: 69 LYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLML 128 Query: 101 EVRQSNQRAQAFYKK 115 E + N A FY K Sbjct: 129 ETQDINISACHFYAK 143
>BACINVASINB#Salmonella/Shigella invasin protein B signature. Length = 593 Score = 24.7 bits (53), Expect = 0.042 Identities = 11/43 (25%), Positives = 22/43 (51%) Query: 31 SELEGRITARQLVEENRPEYNIEYIELLSDKLLDYEKETGAFE 73 S+LE R+ Q + E++ E I+ + L + ++ T +E Sbjct: 102 SQLESRLAVWQAMIESQKEMGIQVSKEFQTALGEAQEATDLYE 144
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 64.7 bits (157), Expect = 6e-13 Identities = 57/324 (17%), Positives = 113/324 (34%), Gaps = 23/324 (7%) Query: 11 LASVAILGAGFVASQPTVVRAEESPVASQSKAEKDYDAAKKDAKNAKKAVEDAQKALDDA 70 ++ +LGAG V + T + + + EK + A K + Sbjct: 23 AVALTVLGAGLVVN--TNEVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNN 80 Query: 71 KAAQKKYDEDQKKTEEKAALEKAASEEMDKAVAAVQQAYLAYQQATDKAAKDAADKMIDE 130 KA + DE ++ + + + + + +Q+ A + +KA + A Sbjct: 81 KALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQE-LEARKADLEKALEGA-----MN 134 Query: 131 AKKREEEAKTKFNTVRAMVVPEPEQLAETKKKSEEAKQKAPELTKKLEEAKAKLEEAEKK 190 + +A + L + + + K LE KA LE + + Sbjct: 135 FSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAE 194 Query: 191 ATEAKQKVDAEEVAPQAKIAELENQVHRLEQELKEIDESESEDYAKEGFRAPLQSKLDAK 250 +A + A AKI LE + L +++++ + L+A+ Sbjct: 195 LEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAE 254 Query: 251 KAKL---------------SKLEELSDKIDELDAEIAKLEDQLKAAEENNNVEDYFKEGL 295 KA L + S KI L+AE A LE + E + V + ++ L Sbjct: 255 KAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSL 314 Query: 296 EKTIAAKKAELEKTEADLKKAVNE 319 + + A + ++ EA+ +K + Sbjct: 315 RRDLDASREAKKQLEAEHQKLEEQ 338 Score = 57.4 bits (138), Expect = 1e-10 Identities = 64/319 (20%), Positives = 110/319 (34%), Gaps = 12/319 (3%) Query: 42 AEKDYDAAKKDAKNAKKAVEDAQKALDDAKAAQKKYDEDQKKTEEKAALEKAASEEMDKA 101 + D + A + A N A K L+ KAA + + +K E A A K Sbjct: 156 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKT 215 Query: 102 VAAVQQAYLAYQQATDKAAKDAADKMIDEAKKREEEAKTKFNTVRAMVVPEPEQLAETKK 161 + A + A A K +K ++ A K T+ A + AE +K Sbjct: 216 LEAEKAAL--------AARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEK 267 Query: 162 KSEEAKQKAPELTKKLEEAKAKLEEAEKKATEAKQKVDAEEVAPQAKIAELENQVHRLEQ 221 E A + + K++ +A+ E + + + + Q+ +L+ +Q Sbjct: 268 ALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQ 327 Query: 222 ELKEIDESESEDYAKEGFRAPLQSKLDAKKAKLSKLEELSDKIDEL----DAEIAKLEDQ 277 +A L Sbjct: 328 LEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRD 387 Query: 278 LKAAEENNNVEDYFKEGLEKTIAAKKAELEKTEADLKKAVNEPEKPAPAPETPAPEAPAE 337 L A+ E + E +AA + ++ E K E + E A + Sbjct: 388 LDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEK 447 Query: 338 QPKPAPAPQPAPAPKPEKP 356 K A A K Sbjct: 448 LAKQAEELAKLRAGKASDS 466 Score = 57.4 bits (138), Expect = 1e-10 Identities = 65/393 (16%), Positives = 135/393 (34%), Gaps = 30/393 (7%) Query: 40 SKAEKDYDAAKKDAKNAKKAVEDAQKALDDAKAAQKKYDEDQKKTEEKAALEKAASEEM- 98 S+ + + +KA+E A A K + ++ + A + A E Sbjct: 109 SEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAM 168 Query: 99 -DKAVAAVQQAYLAYQQATDKAAKDAADKMIDEAKKREEEAKTKFNTVRAMVVPEPEQLA 157 + + L ++A +A + +K ++ A K T+ A + A Sbjct: 169 NFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKA 228 Query: 158 ETKKKSEEAKQKAPELTKKLEEAKAKLEEAEKKATEAKQKVDAEEVAPQAKIAELENQVH 217 + +K E A + + K++ +A+ E + E ++ ++ A A+++ Sbjct: 229 DLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEA 288 Query: 218 RLEQELKEIDESESEDYAKEGFRAPLQSKLDAKKAKLSKLE------------------E 259 E + E + R L+ LDA + +LE Sbjct: 289 EKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQS 348 Query: 260 LSDKIDELDAEIAKLEDQLKAAEENNNVEDYFKEGLEKTIAAKKAELEKTEAD------- 312 L +D +LE + + EE N + + ++ L + + A + ++ E Sbjct: 349 LRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSK 408 Query: 313 ---LKKAVNEPEKPAPAPETPAPEAPAEQPKPAPAPQPAPAPKPEKPAEQPKPEKTDDQQ 369 L+K E E+ E E A+ A A + A + E+ A+ + +D Q Sbjct: 409 LAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEKLAKQAEELAKLRAGKASDSQT 468 Query: 370 AEEDYARRSEEEYNRLTQQQPPKAEKPAPAPKT 402 + ++ + Q + AP +T Sbjct: 469 PDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKET 501
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 26.9 bits (59), Expect = 0.023 Identities = 18/65 (27%), Positives = 23/65 (35%) Query: 34 GAITGAAYAALAAAGGGGLQLVLASYGLRSALVAGIVKGLGVLGIHIGNAFANTVIRSIA 93 G ITG A +AA G + L A+ A G V G V G + F + Sbjct: 233 GHITGGRAAGVAAMQGAVVHLQRATIRRGDAPAGGAVPGGAVPGGAVPGGFGPGGFGPVL 292 Query: 94 SAGIG 98 G Sbjct: 293 DGWYG 297
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 105 bits (264), Expect = 1e-26 Identities = 79/441 (17%), Positives = 158/441 (35%), Gaps = 34/441 (7%) Query: 17 DKRPPAFAFILIISTAIILSGALVGAAYIPKNYIVKANGNSVITG-TEFLSAISSGKVVT 75 + ++ L A + + + ANG +G ++ + I + V Sbjct: 50 ETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKE 109 Query: 76 LHKSEGDMVNAGDVIISLSSGQ-----EGLQASSLNKQLVKLRAKEAIFQ----KFEQSL 126 + EG+ V GDV++ L++ Q+S L +L + R + K + Sbjct: 110 IIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELK 169 Query: 127 NEKYNRMSNSGEEQEYYGKVEYYLSQLNSENYNNGTQYSKIQDEYTKLNKITAERNQLDA 186 N EE+ V S + + Q + + L+K AER + A Sbjct: 170 LPDEPYFQNVSEEE-----VLRLTSLIKEQFSTWQNQKYQKE---LNLDKKRAERLTVLA 221 Query: 187 DLQTLQNELIQLQQQGDSPSLSDTTS-ADDKAKLETKILEITTKIEALKTNITSKNSEID 245 + +N + + L D +S +A + +LE K +E+ Sbjct: 222 RINRYENLSRVEKSR-----LDDFSSLLHKQAIAKHAVLEQENKYVEA-------VNELR 269 Query: 246 SQQSNIKDMNRTYNDPTSQAYNIYAQLVSELGTARSNNNKSITELEANLGVATGQDKAHS 305 +S ++ + + + +E+ +I L L + +A Sbjct: 270 VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASV 329 Query: 306 ILAPNEGTLHYLVPLKQGMSIQQGQTIAEVSGKEKGYYVEAFVLASDISRVSKGAKVDVA 365 I AP + L +G + +T+ + ++ V A V DI ++ G + Sbjct: 330 IRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIK 389 Query: 366 ITGVNSQKYGTLKGQVRQIDSGTISQETKEGNISLYKVMIELETLTLKHGSETVVLQKDM 425 + +YG L G+V+ I+ I + + G + + V+I +E L G++ + L M Sbjct: 390 VEAFPYTRYGYLVGKVKNINLDAIEDQ-RLGLV--FNVIISIEENCLSTGNKNIPLSSGM 446 Query: 426 PVEVRIVYDKETYLDWILEML 446 V I + + ++L L Sbjct: 447 AVTAEIKTGMRSVISYLLSPL 467
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 30.1 bits (68), Expect = 0.004 Identities = 7/25 (28%), Positives = 12/25 (48%) Query: 24 DTKVEDVDQEINRFHQHLQLLKAQI 48 T + DV EI + L+ K ++ Sbjct: 31 KTSITDVSTEIEKLTAALEKSKEEL 55
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 75.3 bits (185), Expect = 4e-17 Identities = 65/349 (18%), Positives = 144/349 (41%), Gaps = 21/349 (6%) Query: 29 SQVFRLRRKKLATAKQKNIIT-LFNNLFSSGFHLVETISFLDRSALLDKQ--CVTQMRVG 85 S LRRK + ++T L ++ L E + + + + + +R Sbjct: 54 STGLSLRRKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSK 113 Query: 86 LSQGKSFSEMMESL-GCSSAIVTQLSLA-EVHGNLHLSLGKIEEYLDNLAKVKKKLIEVA 143 + +G S ++ M+ G + + A E G+L L ++ +Y + +++ ++ + Sbjct: 114 VMEGHSLADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAM 173 Query: 144 TYPLILLGFLLLIMLGLRNYLLPQLDSSNI--------ATQIIGNLPQIFLGMVGLVSVL 195 YP +L + ++ L + ++P++ I +T+++ + + G +L Sbjct: 174 IYPCVLTVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSD-AVRTFGPWMLL 232 Query: 196 ALLALTF-----YKRSSKMSVFS-ILARLPFIGIFVQTYLTAYYAREWGNMISQGMELTQ 249 ALLA ++ + F L LP IG + TA YAR + + + L Q Sbjct: 233 ALLAGFMAFRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQ 292 Query: 250 IFQMMQE-QGSQLFKEVGQDLAQTLKNGREFSQTIGTYPFFRKELSLIIEYGEVKSKLGS 308 ++ + + + ++ G + + F + +I GE +L S Sbjct: 293 AMRISGDVMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDS 352 Query: 309 ELEIYAEKTWEAFFTRVNRTMNLVQPLVFIFVALIIVLLYAAMLMPMYQ 357 LE A+ F +++ + L +PL+ + +A +++ + A+L P+ Q Sbjct: 353 MLERAADNQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQ 401
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 43.3 bits (102), Expect = 1e-08 Identities = 20/57 (35%), Positives = 36/57 (63%) Query: 14 KAFTLVEMLVVLLIISVLFLLFVPNLTKQKEAVNDKGKAAVVKVVESQAELYSLEKN 70 + FTL+E++VV++II VL L VPNL KE + + + + +E+ ++Y L+ + Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNH 64
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 26.4 bits (58), Expect = 0.040 Identities = 17/69 (24%), Positives = 27/69 (39%), Gaps = 10/69 (14%) Query: 29 KAFTMLESLLVLGLVSILALGLSGSVQSTFSAVEEQIFFMEFEELYRETQKRSVASQQKT 88 + FT+LE +L+L L+ + A G V F A + + L R + Q+ Sbjct: 4 RGFTLLEMMLILLLMGVSA----GMVLLAFPASRDDS---AAQTLARFEAQLRFVQQRGL 56 Query: 89 SLNLDGQMI 97 GQ Sbjct: 57 ---QTGQFF 62
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 495 bits (1277), Expect = e-178 Identities = 195/398 (48%), Positives = 275/398 (69%), Gaps = 6/398 (1%) Query: 3 KTIAINAGSSSLKWQLYLMPEEKVLAKGLIERIGLKDSISTVKFDGRSEQQILDIENHIQ 62 K + IN GSSSLK+QL + VLAKGL ERIG+ DS+ T +G + D+++H Sbjct: 2 KILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKD 61 Query: 63 AVKILLDDLI--RFDIIKAYDEITGVGHRVVAGGEYFKESTVVEGDVLEKVEELSLLAPL 120 A+K++LD L+ + +IK EI VGHRVV GGEYF S ++ DVL+ + + LAPL Sbjct: 62 AIKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIELAPL 121 Query: 121 HNPANAAGVRAFKELLPDITSVVVFDTSFHTSMPEKAYRYPLPTKYYTENKVRKYGAHGT 180 HNPAN G++A +++PD+ V VFDT+FH +MP+ AY YP+P +YYT+ K+RKYG HGT Sbjct: 122 HNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFHGT 181 Query: 181 SHQFVAGEAAKLLGRPLEDLKLITCHIGNGGSITAVKAGKSVDTSMGFTPLGGIMMGTRT 240 SH++V+ AA++L +P+E LK+ITCH+GNG SI AVK GKS+DTSMGFTPL G+ MGTR+ Sbjct: 182 SHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGTRS 241 Query: 241 GDIDPAIIPYLMQYTEDFNTPEDISRVLNRESGLLGVSANSSDMRDI-EAAVAEGNHEAS 299 G IDP+II YLM+ + E++ +LN++SG+ G+S SSD RD+ +AA G+ A Sbjct: 242 GSIDPSIISYLMEKEN--ISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQ 299 Query: 300 LAYEMYVDRIQKHIGQYLAVLNGADAIVFTAGVGENAESFRRDVISGISWFGCDVDDEKN 359 LA ++ R++K IG Y A + G D IVFTAG+GEN R ++ G+ + G +D EKN Sbjct: 300 LALNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKN 359 Query: 360 -VFGVTGDISTEAAKIRVLVIPTDEELVIARDVERLKK 396 V G IST +K+ V+V+PT+EE +IA+D E++ + Sbjct: 360 KVRGEEAIISTADSKVNVMVVPTNEEYMIAKDTEKIVE 397
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 139 bits (352), Expect = 5e-40 Identities = 58/232 (25%), Positives = 116/232 (50%), Gaps = 20/232 (8%) Query: 39 FWSKLVYFFAEIIRFLSFDI-SIGVGIILFTVLIRTVLLPVFQVQMVASRKMQEAQPRIK 97 + + ++++++ + + G II+ T ++R ++ P+ + Q + KM+ QP+I+ Sbjct: 332 WLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQ 391 Query: 98 ALREQYPGRDMESRTKLEQEMRKVFKEMGVRQSDSLWPILIQMPVILALFQALSR-VDFL 156 A+RE+ + ++ QEM ++K V +P+LIQMP+ LAL+ L V+ Sbjct: 392 AMRERLGD----DKQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVELR 447 Query: 157 KTGHFLWI-NLGSVDTTLVLPILAAVFTFLSTWLSNKALSERNGATTAMMYGIPVLIFIF 215 + LWI +L + D +LPIL V F +S +++ +M +PV+ +F Sbjct: 448 QAPFALWIHDLSAQDPYYILPILMGVTMFFIQKMSPTTVTDPMQQ--KIMTFMPVIFTVF 505 Query: 216 AVYAPGGVALYWTVSNAYQVLQTYFLNNPFKIIAEREAVVQAQKDLENRKRK 267 ++ P G+ LY+ VSN ++Q + + ++ L +R++K Sbjct: 506 FLWFPSGLVLYYIVSNLVTIIQQQLIYRGLE-----------KRGLHSREKK 546
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 106 bits (265), Expect = 2e-27 Identities = 69/357 (19%), Positives = 142/357 (39%), Gaps = 9/357 (2%) Query: 10 LRIAWFGNFLTGASISLVVPFMPIFVENLGVGSQQVAFYAGLAISVSAISAALFSPIWGI 69 L + L I L++P +P + +L V S V + G+ +++ A+ +P+ G Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDL-VHSNDVTAHYGILLALYALMQFACAPVLGA 65 Query: 70 LADKYGRKPMMIRAGLAMTITMGGLAFVPNIYWLIFLRLLNGVFAGFVPNATALIASQVP 129 L+D++GR+P+++ + + +A P ++ L R++ G+ A A IA Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITD 125 Query: 130 KEKSGSALGTLSTGVVAGTLTGPFIGGFIAELFGIRTVFLLVGSFLFLAAILTICFIKED 189 ++ G +S G + GP +GG + F F + L + + E Sbjct: 126 GDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPES 184 Query: 190 FQPVAKEKAIPTKELFTSVKYPYL---LLNLFLTSFVIQFSAQSIGPILALYVRDLGQTE 246 + + S ++ + L F++Q Q + ++ D + Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWD 244 Query: 247 NLLFVSGLIVSSMG-FSSMMSAGVMGKLGDKVGNHRLLVVAQFYSVIIYLLCANASSPLQ 305 G+ +++ G S+ A + G + ++G R L++ Y+L A A+ Sbjct: 245 ATTI--GISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWM 302 Query: 306 LGLYRFLFGLGTGALIPGVNALLSKMTPKAGISRVFAFNQVFFYLGGVVGPMAGSAV 362 L G G +P + A+LS+ + ++ L +VGP+ +A+ Sbjct: 303 AFPIMVLLASG-GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAI 358 Score = 57.5 bits (139), Expect = 3e-11 Identities = 44/178 (24%), Positives = 76/178 (42%), Gaps = 2/178 (1%) Query: 214 LLNLFLTSFVIQFSAQSIGPILALYVRDLGQTENLLFVSGLIVSSMGFSSMMSAGVMGKL 273 L+ + T + I P+L +RDL + ++ G++++ A V+G L Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 274 GDKVGNHRLLVVAQFYSVIIYLLCANASSPLQLGLYRFLFGLGTGALIPGVNALLSKMTP 333 D+ G +L+V+ + + Y + A A L + R + G+ TGA A ++ +T Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADITD 125 Query: 334 KAGISRVFAFNQVFFYLGGVVGPMAGSAVAGQFGYHAVFYATSLCVAFSCLFNLIQFR 391 +R F F F G V GP+ G + G F HA F+A + + L Sbjct: 126 GDERARHFGFMSACFGFGMVAGPVLG-GLMGGFSPHAPFFAAAALNGLNFLTGCFLLP 182
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 36.4 bits (84), Expect = 1e-04 Identities = 43/207 (20%), Positives = 80/207 (38%), Gaps = 34/207 (16%) Query: 3 FKSGFVAILGRPNVGKSTFLNHVMGQKIAIMSDKAQTTRNKIMGIYTTDKEQIVFIDTPG 62 + SG + LG + G + N ++ ++ I T+ + + ++ IDTPG Sbjct: 25 YNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS-------FQWENTKVNIIDTPG 77 Query: 63 IHKPKTALGDFMVESAYSTLREVDTVLFMVPADEARGKGDDMIIERLKAAKVPVILVVNK 122 H DF+ E Y +L +D + ++ A + ++ L+ +P I +NK Sbjct: 78 -HM------DFLAE-VYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFINK 129 Query: 123 IDKVHPDQLLSQIDDFRNQMDFKEIVPISALQGNNVSRLVDILSENLDEGFQYFPSDQIT 182 ID+ + ID D KE + + V ++ N E Q+ D + Sbjct: 130 IDQ-------NGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQW---DTVI 179 Query: 183 DHPERFLVSEMVREKVL---HLTREEI 206 + + L EK + L E+ Sbjct: 180 EGNDDLL------EKYMSGKSLEALEL 200
>FbpA_PF05833#Fibronectin-binding protein Length = 577 Score = 682 bits (1762), Expect = 0.0 Identities = 196/577 (33%), Positives = 323/577 (55%), Gaps = 31/577 (5%) Query: 10 MSFDGFFLHHIVEELRSELVNGRIQKINQPFEQELVLQIRSNRQSHRLLLSAHPVFGRIQ 69 M+ DG FL+ I++EL++ ++NG+I K+NQP + E++L IR R S +LL+S+ + RI Sbjct: 1 MALDGIFLYSIIDELKNTIINGKIDKVNQPEKDEIILNIRKGRLSFKLLISSSSNYPRIH 60 Query: 70 LTQTTFENPAQPSTFIMVLRKYLQGALIESIEQVENDRIVEMTVSNKNEIGDHIQATLII 129 LT T NP + F MVLRKY+ A I I Q+ DRIV + + +E+G + +LII Sbjct: 61 LTDLTKPNPIKAPMFCMVLRKYISNAKIVDIHQINQDRIVVIDFESTDELGFNSIYSLII 120 Query: 130 EIMGKHSNILLVDKSSHKILEVIKHVGFSQNSYRTLLPGSTYIAPPSTESLNPFTIKDEK 189 EIMG+HSN+ L+ K + I++ IKH+ N+YR++ PG Y+ PP + LNPF + Sbjct: 121 EIMGRHSNMTLIRKRDNIIMDSIKHITPDINTYRSIYPGIEYVYPPKSPKLNPFDFSYDM 180 Query: 190 LFEILQ--TQELTAKNLQSLFQGLGRDTANELERILVSEKL---------------SAFR 232 + + + +L +F G+ + ++E+ L + + F+ Sbjct: 181 IENFTKENSLQLNDNIFSKIFTGVSKTLSSEICFRLKNNSIDLSLSNLKEIVEVCKDLFK 240 Query: 233 NFFNQETKPCLTETSFSPVPFA--------NQAGEPFANLSDLLDTYYKNKAERDRVKQQ 284 + + + + S V F + + + S LL+ +Y K + DR+K + Sbjct: 241 EIQSNKFEFNCYTKNNSFVGFYCLNLMSKEDYKKIQYDSSSKLLENFYYAKDKSDRLKSK 300 Query: 285 ASELIRRVENELQKNRHKLKKQERELLATDNAEEFRQKGELLTTFLHQVPNDQDQVILDN 344 +S+L + V N + + K K L ++ + F+ GELLT ++ + + L N Sbjct: 301 SSDLQKIVMNNINRCTKKDKILNNTLKKCEDKDIFKLYGELLTANIYALKKGLSHIELAN 360 Query: 345 YYTNQ--PIMIALDKALTPNQNAQRYFKRYQKLKEAVKYLTDLIEETKATILYLESVETV 402 YY+ + I LD+ TP+QN Q Y+K+Y KLK++ + + + + + + YL SV T Sbjct: 361 YYSENYDTVKITLDENKTPSQNVQSYYKKYNKLKKSEEAANEQLLQNEEELNYLYSVLTN 420 Query: 403 LNQA-GLEEIAEIREELIQTGFIRRRQ--REKIQKRKKLEQYLASDGKTIIYVGRNNLQN 459 +N A +EI EI++ELI+TG+I+ ++ + K K K +++ DG IYVG+NN+QN Sbjct: 421 INNADNYDEIEEIKKELIETGYIKFKKIYKSKKSKTSKPMHFISKDGID-IYVGKNNIQN 479 Query: 460 EELTFKMARKEELWFHAKDIPGSHVVISGNLDPSDAVKTDAAELAAYFSQGRLSNLVQVD 519 + LT K A K ++WFH K+IPGSHV++ +D ++ +AA LAAY+S+ + S+ V VD Sbjct: 480 DYLTLKFANKHDIWFHTKNIPGSHVIVKNIMDIPESTLLEAANLAAYYSKSQNSSNVPVD 539 Query: 520 
MIEVKKLNKPTGGKPGFVTYTGQKTLRVTPDSKKIAS 556 EVK + KP G KPG V Y+ +T+ VTP + + + Sbjct: 540 YTEVKNVKKPNGAKPGMVIYSTNQTIYVTPTNPNLKN 576
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 30.5 bits (68), Expect = 0.026 Identities = 24/123 (19%), Positives = 48/123 (39%), Gaps = 27/123 (21%) Query: 620 LLAHSALESNWGRSKIAKDK----NNFFGI----------TAYDTTPYLSA--------- 656 +LA +ALES WG+ +I ++ N FG+ T TT Y + Sbjct: 174 ILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKF 233 Query: 657 KTFDDVDKGILGATKWIKENYIDRGRTFLGNKASGM----NVEYASDPYWGEKIASVMMK 712 + + + + + N T + G + YA+DP++ K+ +++ + Sbjct: 234 RVYSSYLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQ 293 Query: 713 INE 715 + Sbjct: 294 MKS 296
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 34.0 bits (78), Expect = 0.002 Identities = 12/70 (17%), Positives = 28/70 (40%), Gaps = 1/70 (1%) Query: 37 LIFAAFKLGAAGITLYNLIRLLVGSLAYLAIFGLLIYLFFFKWIRKQEGLL-SGFFTIFA 95 + + K+ I ++ +I+ + +L L G+ I +Q ++ + F + + Sbjct: 140 FLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQILRQLMVICTVGFVVIS 199 Query: 96 GLLLIFEAYL 105 FE Y Sbjct: 200 IADYAFEYYQ 209
>LCRVANTIGEN#Low calcium response V antigen signature. Length = 326 Score = 28.5 bits (63), Expect = 0.031 Identities = 21/78 (26%), Positives = 31/78 (39%) Query: 80 FVQVAEDTRINVKIKADQETEINGTGPTVEPVQLEELKAILSSLTAEDTVVFAGSSAKNL 139 VQ+ +D I++ IK D + V +E LK IL+ ED ++ G L Sbjct: 35 LVQLVKDKNIDISIKYDPRKDSEVFANRVITDDIELLKKILAYFLPEDAILKGGHYDNQL 94 Query: 140 GNVIYKDLISLTRQTGAQ 157 N I + L Q Sbjct: 95 QNGIKRVKEFLESSPNTQ 112
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 35.6 bits (82), Expect = 6e-05 Identities = 22/98 (22%), Positives = 44/98 (44%), Gaps = 12/98 (12%) Query: 1 MLKTERKQLILEELNQHHVVSLEKLVSLLE-----TSESTVRRDLDELEAENKLRRVHG- 54 M K +R I E + + + + ++LV +L+ +++TV RD+ EL +V Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKELHL----VKVPTN 56 Query: 55 GAELPHSLQEEETIQ--EKSVKNLQEKKLLAQKAASLI 90 +SL ++ K ++L + + A+ LI Sbjct: 57 NGSYKYSLPADQRFNPLSKLKRSLMDAFVKIDSASHLI 94
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 29.8 bits (67), Expect = 0.018 Identities = 31/150 (20%), Positives = 52/150 (34%), Gaps = 21/150 (14%) Query: 1 MKKIFLTLL----TVSLLGGVSTAVAQDFTIAAKHA------IAVEANTGKILYEKDATQ 50 M+ I L ++ T+ L S + ++ I ++ +G+ L A + Sbjct: 1 MRYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADE 60 Query: 51 PVEIASITKLITVYLVYEALENGSITLSTPVDISDYPYQLTTNSEASNIPME----ARNY 106 + S K++ V ++ G L + L S P+ A Sbjct: 61 RFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYR--QQDLVDYS-----PVSEKHLADGM 113 Query: 107 TVEELLEATLVSSANSAAIALAEKIAGSEK 136 TV EL A + S NSAA L + G Sbjct: 114 TVGELCAAAITMSDNSAANLLLATVGGPAG 143
>PF06580#Sensor histidine kinase Length = 349 Score = 34.8 bits (80), Expect = 6e-04 Identities = 13/76 (17%), Positives = 30/76 (39%), Gaps = 9/76 (11%) Query: 314 FRFENRIHRTIVTDQLLLKQL---MTI--LFDNAVKY----TEEDGEIDFLISATDRNLY 364 +FE+R+ + ++ M + L +N +K+ + G+I + + + Sbjct: 234 IQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVT 293 Query: 365 LLVSDNGIGISTEDKK 380 L V + G K+ Sbjct: 294 LEVENTGSLALKNTKE 309
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 85.7 bits (212), Expect = 2e-21 Identities = 34/118 (28%), Positives = 55/118 (46%), Gaps = 1/118 (0%) Query: 24 IKILLVEDDLGLSNSVFDFLDD-FADVMQVFDGEEGLYEAESGVYDLILLDLMLPEKNGF 82 IL+ +DD + + L DV + +G DL++ D+++P++N F Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 83 QVLKELREKGITTPVLIMTAKESLDDKGHGFELGADDYLTKPFYLEELKMRIQALLKR 140 +L +++ PVL+M+A+ + E GA DYL KPF L EL I L Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 72.5 bits (178), Expect = 2e-16 Identities = 50/224 (22%), Positives = 91/224 (40%), Gaps = 30/224 (13%) Query: 26 VGMIRGEYLLRELNQNILLQSCQEFVKDYLDTICSFYLGKEVWYRFTEL-TNTEANCLVG 84 +G+ R E+L + +Q L + +E + Y + + GK V R ++ + E + L Sbjct: 293 IGLYRTEFLYMDRDQ---LPTEEEQFEAYKEVVQRMD-GKPVVIRTLDIGGDKELSYL-- 346 Query: 85 TKEFFDEGHPLFGYRGTRCLLACLDEF--QAEAHVVTEVYQTNPNLSVIFPFVNDADQLK 142 + E +P G+R R L D F Q A + Y NL V+FP + ++L+ Sbjct: 347 --QLPKELNPFLGFRAIRLCLEKQDIFRTQLRALLRASTY---GNLKVMFPMIATLEELR 401 Query: 143 QAITVLRQYGFTG-----------KVGTMIELPSAYFDLSSILETGISKIVVGMNDLTSF 191 QA ++++ +VG M+E+PS + + + +G NDL + Sbjct: 402 QAKAIMQEEKDKLLSEGVDVSDSIEVGIMVEIPSTAVAANLFAKE-VDFFSIGTNDLIQY 460 Query: 192 VFATMRN----SQWHDMESPIMLDMLRDMQDKARKNKINFAVAG 231 A R S + P +L ++ + A + G Sbjct: 461 TMAADRMNERVSYLYQPYHPAILRLVDMVIKAAHSEGKWVGMCG 504
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 96.7 bits (240), Expect = 2e-26 Identities = 66/252 (26%), Positives = 106/252 (42%), Gaps = 24/252 (9%) Query: 3 KRVLITGVSSGIGLAQARLFLEKGYQVYGVDQGEKPLL-----EGDFRFLQRDLTLDL-- 55 K ITG + GIG A AR +G + VD + L D+ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 56 -----EPIFDWCPQV---DVLCNTAGVLDDYKPLLEQTAQDIQEIFEINYIIPVELTRYY 107 E ++ D+L N AGVL + + ++ + F +N +R Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLR-PGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 108 LTQMLENKKGIIINMCSIASSLAGGGGHAYTSSKHALAGFTKQLALDYAEAGIQVFGIAP 167 M++ + G I+ + S + + AY SSK A FTK L L+ AE I+ ++P Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 168 GAVKTAMT--------AADFEPGGLADWVASETPIKRWIEPEEIAELSLFLASGKASAMQ 219 G+ +T M A+ G + + P+K+ +P +IA+ LFL SG+A + Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247 Query: 220 GQILTIDGGWSL 231 L +DGG +L Sbjct: 248 MHNLCVDGGATL 259
>PF03309#Bvg accessory factor Length = 271 Score = 35.1 bits (81), Expect = 2e-04 Identities = 25/126 (19%), Positives = 45/126 (35%), Gaps = 14/126 (11%) Query: 11 IIGIDLGGTSIKFAILTTAGEIQ---GKWSIKTNILDEGSHIVDDMIESIQHRLDLLGLA 67 ++ ID+ T +++ +G+ +W I+T D++ +I L+G Sbjct: 2 LLAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTA----DELALTI---DGLIGDD 54 Query: 68 AADFQGIGMGSPGVVDRDKGTVIGAYNLNWKTLQPIKQKIEKALGIPFFIDNDANVAALG 127 A G S V V W + + + GIP +DN V A Sbjct: 55 AERLTGASGLS--TVPSVLHEVRVMLEQYWPNVPHVLIEPGVRTGIPLLVDNPKEVGA-- 110 Query: 128 ERWMGA 133 +R + Sbjct: 111 DRIVNC 116
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 55.1 bits (132), Expect = 3e-09 Identities = 50/265 (18%), Positives = 76/265 (28%), Gaps = 16/265 (6%) Query: 194 NQVVETEEAPKEEAPKTEESPKEEPKSEVKPTDDTLPKVEEGKEDSAEPAPVEEVGGEVE 253 NQ V+T + + P +E D P A P+ E E Sbjct: 989 NQTVDTTNITTPNNIQADV-PSVPSNNEEIARVDEAP---VPPPAPATPSETTETVAE-N 1043 Query: 254 SKPEEKVAVKPESQPSDKP------AEESKVEQAGEPVAPRKDEQAPVEPENQPEAPEEE 307 SK E K K E ++ A+E+K + E Q +E Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103 Query: 308 KAVEETPKQEESTPDTKAEETVE----PKEETKTAKGTQEEGKEGQAPVQEVNPEYKVTT 363 VE+ K + T T+ V PK+E Q E P + E + T Sbjct: 1104 ATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIK-EPQSQT 1162 Query: 364 GTVEKSTESELDFTTEVVPDDTKYVDEEVVERQGSKGVQVTKTTYETVEVVETDKVLSTT 423 T + + + ++ V T+ T T + E+ Sbjct: 1163 NTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNR 1222 Query: 424 TEVKTPVVPKVVKKGTKPVETREEV 448 VP V+ T R V Sbjct: 1223 HRRSVRSVPHNVEPATTSSNDRSTV 1247 Score = 51.2 bits (122), Expect = 4e-08 Identities = 45/241 (18%), Positives = 73/241 (30%), Gaps = 25/241 (10%) Query: 172 EVTVVEVETPQSTTNQEQARTENQVVETEEAPKEEAPKTEESPKEEPKSEVKPTDDTLPK 231 EV ET ++ Q E VE EE K E KT+E PK S+V P + Sbjct: 1084 EVAQSGSETKET---QTTETKETATVEKEEKAKVETEKTQEVPKVT--SQVSPKQEQSET 1138 Query: 232 VEEGKEDSAEPAPVEEVGGEVESKPEEKVAVKPESQPSDKPAEESKVEQAGEPVAPRKDE 291 V+ E + E P +P+SQ + E ++ V E Sbjct: 1139 VQPQAEPARENDPTVN-------------IKEPQSQTNTTADTEQPAKETSSNVEQPVTE 1185 Query: 292 QAPVEPENQ-PEAPEEEKAVEETPKQEEST---PDTKAEETVEPKEETKTAKGTQEEGKE 347 V N E PE P + P + +V T + Sbjct: 1186 STTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRS 1245 Query: 348 GQAPVQEVNPEYKVTTGTVEKSTESELDFTTEVVPDDTKYVDEEVVERQGSKGVQVTKTT 407 A + + + V ++++ + + +G V V+ T+ Sbjct: 1246 TVALCDLTSTNTNAVLSDARAKAQFVALNVGKAV---SQHISQLEMNNEGQYNVWVSNTS 1302 Query: 408 Y 408 Sbjct: 1303 M 1303 Score = 50.8 bits (121), Expect = 6e-08 Identities = 42/242 (17%), Positives = 86/242 (35%), 
Gaps = 26/242 (10%) Query: 158 GLDTVLEETSAKPGEVTVVEVETPQSTTNQEQARTENQVVE--TEEAPKEEAPKTEESPK 215 + ++ T+ +V + S N+E AR + V P E E+ K Sbjct: 987 KRNQTVDTTNITTPNNIQADVPSVPSN-NEEIARVDEAPVPPPAPATPSETTETVAENSK 1045 Query: 216 EEPKSEVKPTDDTLPKVEEGKEDSAEPAP----------VEEVGGEVESKPEEKVAVKPE 265 +E K+ K D + +E + E V + G E + + + Sbjct: 1046 QESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKET-QTTETKETA 1104 Query: 266 SQPSDKPAEESKVEQAGEP-----VAPRKDEQAPVEPENQPEAPEEEKAVEETPKQEEST 320 + ++ A+ + P V+P++++ V+P+ +P + + P+ + +T Sbjct: 1105 TVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNT 1164 Query: 321 PDTKAEETVEPKEETKTA--KGTQEEGKEGQAPVQEVNPEYKVTTGTVEKSTESELDFTT 378 +T +P +ET + + E NPE T T + + SE Sbjct: 1165 T----ADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPE-NTTPATTQPTVNSESSNKP 1219 Query: 379 EV 380 + Sbjct: 1220 KN 1221 Score = 37.4 bits (86), Expect = 8e-04 Identities = 30/134 (22%), Positives = 45/134 (33%), Gaps = 11/134 (8%) Query: 163 LEETSAKPGEVTVVEVETPQSTTNQEQARTENQVVET----EEAPKEEAPKTEESPKEEP 218 E+T P + V + QS T Q QA + T E + E P +E Sbjct: 1116 TEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKET 1175 Query: 219 KSEVKP------TDDTLPKVEEGKEDSAEPAPVEEVGGEVESKPEEKVAVKPESQPSDKP 272 S V+ T +T V E E++ V E +KP+ + S P + Sbjct: 1176 SSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVE 1235 Query: 273 AEESKVEQAGEPVA 286 + VA Sbjct: 1236 PATTSSNDR-STVA 1248 Score = 37.0 bits (85), Expect = 0.001 Identities = 42/264 (15%), Positives = 73/264 (27%), Gaps = 14/264 (5%) Query: 245 VEEVGGEVESKPEEKVAVKPESQPSDKPAEESKVEQAGEPVAPRKDEQAPVEPENQPEAP 304 VE+ V++ PS E PV P E E Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENS 1044 Query: 305 EEEKAVEETPKQEESTPDTKAEETVEPKEETKTAKGTQEEGKEGQAPVQEVNPEYKVTTG 364 ++E E +Q+ + + E + + A E + + +E T Sbjct: 1045 KQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETA 1104 Query: 365 TVEKS---------TESELDFTTEVVPDDTKYVDEEVVERQGSKGVQVT----KTTYETV 411 TVEK T+ T++V P + + + + Sbjct: 1105 
TVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNT 1164 Query: 412 EVVETDKVLSTTTEVKTPVVP-KVVKKGTKPVETREEVIPFATKEQEDDTLKRGTRQVAQ 470 T++ V+ PV V G VE E P T+ + + + Sbjct: 1165 TADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHR 1224 Query: 471 EGVNGKKQITETYKTIRGEKTNEA 494 V E T +++ A Sbjct: 1225 RSVRSVPHNVEPATTSSNDRSTVA 1248
>PF06580#Sensor histidine kinase Length = 349 Score = 199 bits (508), Expect = 3e-61 Identities = 58/202 (28%), Positives = 100/202 (49%), Gaps = 9/202 (4%) Query: 357 QEETTRQYQLQALSSQINPHFLYNTLDTIIWMAEFHDSQRVVQVTKSLATYFRLAL-NQG 415 ++ QL AL +QINPHF++N L+ I + D + ++ SL+ R +L Sbjct: 154 MASMAQEAQLMALKAQINPHFMFNALNNIRALIL-EDPTKAREMLTSLSELMRYSLRYSN 212 Query: 416 KDLICLSDEINHVRQYLFIQKQRYGDKLEYEINENVAFDNLVLPKLVLQPLVENALYHGI 475 + L+DE+ V YL + ++ D+L++E N A ++ +P +++Q LVEN + HGI Sbjct: 213 ARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGI 272 Query: 476 KEKEGQGHIKLSVQKQDSGLVIRIEDDGVGFQDAGDSSQSQLKRGGVGLQNVDQRLKLHF 535 + G I L K + + + +E+ G S G GLQNV +RL++ + Sbjct: 273 AQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKES------TGTGLQNVRERLQMLY 326 Query: 536 GANYQMKIDSRPQKGTKVEIYI 557 G Q+K+ + K + I Sbjct: 327 GTEAQIKLSEKQGKVN-AMVLI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 95.3 bits (237), Expect = 1e-24 Identities = 35/129 (27%), Positives = 65/129 (50%), Gaps = 6/129 (4%) Query: 10 TILIVEDEYLVRQGLTKLVNVAAYDMEIIGQAENGRQAWELIQKQVPDIILTDINMPHLN 69 TIL+ +D+ +R L + ++ A YD+ N W I D+++TD+ MP N Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVR---ITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 70 GIQLASLVRETYPQVHLVFLTGYDDFDYALSAVKLGVDDYLLKPFSRQDIEEMLGKIKQK 129 L +++ P + ++ ++ + F A+ A + G DYL KPF D+ E++G I + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF---DLTELIGIIGRA 118 Query: 130 LDKEEKEEQ 138 L + ++ Sbjct: 119 LAEPKRRPS 127
>adhesinb#Adhesin B signature. Length = 310 Score = 27.1 bits (60), Expect = 0.049 Identities = 13/33 (39%), Positives = 17/33 (51%), Gaps = 1/33 (3%) Query: 10 MKKWQTCVLGAGSLLCLTACS-GKSVTSEHQTK 41 MKK + VL + + L ACS KS T +K Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSK 33
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 31.3 bits (71), Expect = 0.008 Identities = 20/72 (27%), Positives = 31/72 (43%), Gaps = 2/72 (2%) Query: 100 GNLAIYIFASIILVAYLGKYIQYEAWRWIHRLVYLAYILGLFHIYMIMGNRLLTFNLLSF 159 GN A + A +V +L YE+W I V L LG+ + + + + F Sbjct: 869 GNQAPALVAISFVVVFLCLAALYESWS-IPVSVMLVVPLGIVGVLLAATLFNQKND-VYF 926 Query: 160 LVGSYALLGLLA 171 +VG +GL A Sbjct: 927 MVGLLTTIGLSA 938
>PF06580#Sensor histidine kinase Length = 349 Score = 30.6 bits (69), Expect = 0.011 Identities = 29/166 (17%), Positives = 61/166 (36%), Gaps = 30/166 (18%) Query: 288 ILSLSSV--QELRDDRETIDLLQMTQNLVKDYALLAKER-------ELQIDNSLTHQQAY 338 + SLS + LR L +V Y LA + E QI+ ++ Q Sbjct: 197 LTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQ-- 254 Query: 339 LNPSVMKLILSNLISNAIKHSVPGGLVRIGEREGELFIENSCSSEEQEKLAQSFSDNASR 398 V +++ L+ N IKH + + G++ + +++ + + S Sbjct: 255 ----VPPMLVQTLVENGIKHGIAQ-----LPQGGKILL---KGTKDNGTVTLEVENTGSL 302 Query: 399 KVK----GSGMGLFVVKSLLEH---EKLAYRFEMEENRLTFFIDFP 437 +K +G GL V+ L+ + + ++ ++ + P Sbjct: 303 ALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 85.3 bits (211), Expect = 2e-21 Identities = 29/104 (27%), Positives = 51/104 (49%), Gaps = 1/104 (0%) Query: 2 KILIVEDEEMIREGVSDYLTDCGYETIEAADGQEALEQFSSYEVALVLLDIQMPKLNGLE 61 IL+ +D+ IR ++ L+ GY+ ++ ++ + LV+ D+ MP N + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VLAEIRKT-SQVPVLMLTAFQDEEYKMSAFASLADGYLEKPFSL 104 +L I+K +PVL+++A + A A YL KPF L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDL 108
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.6 bits (71), Expect = 0.002 Identities = 18/41 (43%), Positives = 23/41 (56%), Gaps = 4/41 (9%) Query: 28 FEPG-KF-YSII--GESGAGKSTLLSLLAGLDSPVEGSILF 64 EPG KF YS++ G G GKSTL++ L GLD + Sbjct: 589 MEPGCKFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDI 629
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 34.8 bits (80), Expect = 4e-04 Identities = 14/56 (25%), Positives = 26/56 (46%), Gaps = 4/56 (7%) Query: 218 LHQMILDQDQIQEIILSLWENSAVLTKTAQQLYLHRNSLQYKIDKWEELTGLQLKE 273 L+ +L + + I+ +L K A L L+RN+L+ KI + G+ + Sbjct: 428 LYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL----GVSVYR 479
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 31.8 bits (72), Expect = 0.001 Identities = 17/73 (23%), Positives = 31/73 (42%), Gaps = 17/73 (23%) Query: 30 YRDPYLSNMLNFDPNMP-------AFFLYYEKGELVGLLTV------YADDQDVEVTILV 76 + PY + D ++ A FLYY + +G + + YA +D+ V Sbjct: 42 FSKPYFKQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVA--- 98 Query: 77 HPGHRRQGIARAL 89 +R++G+ AL Sbjct: 99 -KDYRKKGVGTAL 110 Score = 29.9 bits (67), Expect = 0.007 Identities = 15/67 (22%), Positives = 29/67 (43%), Gaps = 3/67 (4%) Query: 212 VDLSTNTN---YLYGLAISEPERGKGYGSYLAKSLVNQLIEQNDKEFQIAVEDSNVGAKR 268 + + +N N + +A+++ R KG G+ L + E + + +D N+ A Sbjct: 80 IKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACH 139 Query: 269 LYEKIGF 275 Y K F Sbjct: 140 FYAKHHF 146
>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature. Length = 1291 Score = 33.5 bits (76), Expect = 0.004 Identities = 11/54 (20%), Positives = 18/54 (33%) Query: 665 PSTESSSSSSDSSTSQSSSTTPSTNNSTTTNPNNNTQQSNTTPDQQNQNPQPAQ 718 P + S ++ + ++ N+NTQ N Q QP Q Sbjct: 326 PPEGGYKDKPNDKPSNTTQNNAKNDKQESSQNNSNTQVINPPNSAQKTEIQPTQ 379
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 32.3 bits (73), Expect = 0.021 Identities = 27/210 (12%), Positives = 55/210 (26%), Gaps = 33/210 (15%) Query: 7 EKRCKYSIRKFSLGVASVMI-----GATFFGTSPVLADSVQSGSTANLPA---------- 51 YS+RK G ASV + GA + ++ T L Sbjct: 5 NTNRHYSLRKLKTGTASVAVALTVLGAGLVVNTNEVSAVATRSQTDTLEKVQERADKFEI 64 Query: 52 -------------DLATALATAKENDGHDFEAPKVGEDQGSPEVTDGPKTEEELLALEKE 98 AL + + K + +++ +EL A + + Sbjct: 65 ENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKAD 124 Query: 99 -----KPAEEKPKEDKPAAAKPETPKTVTPEWQTVEKKEQQGTVTIREEKGVRYNQLSST 153 + A D E K + +K +G + + L + Sbjct: 125 LEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAE 184 Query: 154 AQNDNAGKPALFEKKGLTVDANGNATVDLT 183 A + L + ++ + + + Sbjct: 185 KAALEARQAELEKALEGAMNFSTADSAKIK 214
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 62.9 bits (153), Expect = 2e-13 Identities = 39/189 (20%), Positives = 71/189 (37%), Gaps = 35/189 (18%) Query: 2 ILITGANGQLGTELRYLLDERNEEYVAVD------------------------VAEMDIT 37 L+TGA G +G + L E + V +D ++D+ Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLA 62 Query: 38 DAEMVEKVFEEVKPTLVYHCAAYTAV-DAAEDEGKELDFAINVTGTKNVAKASEKHG-AT 95 D E + +F V+ AV + E+ D N+TG N+ + + Sbjct: 63 DREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYAD--SNLTGFLNILEGCRHNKIQH 120 Query: 96 LVYISTDYVFDGKKPVGQEWEVDDRPD-PQTEYGRTKRMGEELVEKHVSNFYIIRTAW-- 152 L+Y S+ V+ + + + DD D P + Y TK+ E + + + + T Sbjct: 121 LLYASSSSVYGLNRKM--PFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRF 178 Query: 153 --VFGNYGK 159 V+G +G+ Sbjct: 179 FTVYGPWGR 187
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 82.1 bits (203), Expect = 4e-19 Identities = 54/306 (17%), Positives = 99/306 (32%), Gaps = 60/306 (19%) Query: 294 TILVTGAGGSIGSEICRQ----------VSRFNPERIVLLGHGENSIYLVYHELIRKFQG 343 LVTGA G IG + ++ + N V L EL+ + Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQAR-------LELLAQ--- 51 Query: 344 IDYVPVIADIQDYDRLLQVFEQYKPAIVYHAAAHKHVPMMERNPKEAFKNNIRGTYNVAK 403 + D+ D + + +F V+ + V NP +N+ G N+ + Sbjct: 52 PGFQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILE 111 Query: 404 AVDEAKVSKMVMIST---------------DKAVNPPNVMGATKRVAELIVTGFNQRSQS 448 K+ ++ S+ D +P ++ ATK+ EL+ ++ Sbjct: 112 GCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL 171 Query: 449 TYCAVRFGNVLGSRGS---VIPVFERQIAEGGPVTV-TDFRMTRYFMTI----------- 493 +RF V G G + F + + EG + V +M R F I Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231 Query: 494 -------PEASRLVIHAGAYAKDGEVFILDMGKPVKIYDLAKKMVLLSGHTESEIPIVEV 546 + + A V+ + PV++ D + L E + Sbjct: 232 DVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQ---ALEDALGIEAKKNML 288 Query: 547 GIRPGE 552 ++PG+ Sbjct: 289 PLQPGD 294
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 42.5 bits (100), Expect = 2e-06 Identities = 62/361 (17%), Positives = 118/361 (32%), Gaps = 19/361 (5%) Query: 6 LFFVPGIILIGVSLRTPFTVLPIILGNISQGLEVEVSSLGVLTSLPLLMFTLFSPFSTQL 65 + + +G+ L P VLP +L ++ +V + G+L +L LM +P L Sbjct: 10 ILSTVALDAVGIGLIMP--VLPGLLRDLVHSNDV-TAHYGILLALYALMQFACAPVLGAL 66 Query: 66 AQKIGLEHLFTYSLFFLTIGSLIRLI--NLPLLYLGTLMVGASVAVINVLLPSLI----- 118 + + G + SL + I L +LY+G ++ G + A V + I Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIADITD 125 Query: 119 QANQPKKIGFLTTLYVTSMGIATALASYLAVPITQASSWKGLILLLTLLCLATFLVWLP- 177 + + GF++ + M L + A + L FL+ Sbjct: 126 GDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESH 185 Query: 178 --NHRYNHRLAPQTKQKSQIKVMRNKQVWAIIIFSGFQSLIFYTVMTWLPTMSIHAGLSS 235 R R A + + +F Q + W+ + Sbjct: 186 KGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDA 245 Query: 236 HEAGLLTSILSLISIPFSMTIPSLTTSLSTRNRQLMLTLVSLAGVIGISMLFFPINNFIY 295 G+ + ++ I + R LML + +A G +L F Sbjct: 246 TTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGM--IADGTGYILLAFATR---G 300 Query: 296 WLAIHLLIGTATSALFPYLMVNFSLKTSAPEKTAQLSGLSQTGGYILAAFGPTLFGYSFD 355 W+A +++ A+ + + + E+ QL G + + GP LF + Sbjct: 301 WMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA 360 Query: 356 L 356 Sbjct: 361 A 361
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 27.5 bits (61), Expect = 0.041 Identities = 14/70 (20%), Positives = 25/70 (35%) Query: 105 FAILVAALTVILAFFAVSILGIIGGFLFLVESFTVLAQAKSAFILIFGSGLLAIGASSLV 164 F+I A + + L + G + S +LI + GA++ Sbjct: 62 FSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFP 121 Query: 165 LLGISYVARF 174 L + VAR+ Sbjct: 122 ALVMVVVARY 131
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 42.7 bits (100), Expect = 2e-06 Identities = 35/154 (22%), Positives = 51/154 (33%), Gaps = 11/154 (7%) Query: 16 SKNKPEEQAQEVADKAEETIADLDTPIEKNTQLEEEVSQAEVELESQQEEKIETPEDSEA 75 S K Q EVA ET + T K T E+ +A+VE E QE T + S Sbjct: 1074 SNVKANTQTNEVAQSGSETK-ETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPK 1132 Query: 76 RTKIEEKKASNSTEEEPDLSKETEKVTIAEESQEALPQQKATTKEPLLISKSLESPYIPD 135 + + E + E D +E Q T + S ++E P Sbjct: 1133 QEQSETVQPQAEPAREND------PTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTES 1186 Query: 136 QAPKSRDKWKEQVLDFWSWLVEAIKSPTSKLETS 169 + + E + A PT E+S Sbjct: 1187 TTVNTGNSVVENPEN----TTPATTQPTVNSESS 1216 Score = 35.8 bits (82), Expect = 3e-04 Identities = 30/109 (27%), Positives = 47/109 (43%), Gaps = 8/109 (7%) Query: 13 KTTSKNKPEEQAQEVADKAEETIADLDTPIEKNTQLEEEVSQAEVELESQQEEKIETPED 72 KT KN E+ A E + E + + ++ NTQ E E+Q E ET Sbjct: 1049 KTVEKN--EQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETA-- 1104 Query: 73 SEARTKIEEKKASNSTEEEPDLSKETEKVTIAEESQEALPQQKATTKEP 121 T +E+KA TE+ ++ K T +V+ +E E + Q +E Sbjct: 1105 ----TVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREN 1149 Score = 35.4 bits (81), Expect = 4e-04 Identities = 23/126 (18%), Positives = 55/126 (43%), Gaps = 2/126 (1%) Query: 18 NKPEEQAQEVADKAEETIADLDTPIEKNTQLEEEVSQAEVELESQQEEKIETP-EDSEAR 76 E +++ + E+ D +N ++ +E +++ V+ +Q E ++ E E + Sbjct: 1038 ETVAENSKQESKTVEKNEQDATETTAQNREVAKE-AKSNVKANTQTNEVAQSGSETKETQ 1096 Query: 77 TKIEEKKASNSTEEEPDLSKETEKVTIAEESQEALPQQKATTKEPLLISKSLESPYIPDQ 136 T ++ A+ EE+ + E + SQ + Q+++ T +P P + + Sbjct: 1097 TTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIK 1156 Query: 137 APKSRD 142 P+S+ Sbjct: 1157 EPQSQT 1162 Score = 31.6 bits (71), Expect = 0.006 Identities = 30/128 (23%), Positives = 47/128 (36%), Gaps = 15/128 (11%) Query: 20 PEEQAQEVADKAEETIADLDTPIEKNTQ--LEEEVSQAEVELESQQEEKIETPEDSEART 77 P E + VA E +EKN Q E EV E++ K T + A++ Sbjct: 1033 PSETTETVA----ENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQS 1088 
Query: 78 KIEEKKASNSTEEEPDLSKETEKVTIAEESQEALP--QQKATTKEPLLISKSLESPYIPD 135 E K+ + +KET V EE + Q+ + K +S + Sbjct: 1089 GSETKET------QTTETKETATVE-KEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQP 1141 Query: 136 QAPKSRDK 143 QA +R+ Sbjct: 1142 QAEPAREN 1149
>ARGDEIMINASE#Bacterial arginine deiminase signature. Length = 409 Score = 31.0 bits (70), Expect = 0.010 Identities = 14/90 (15%), Positives = 35/90 (38%), Gaps = 6/90 (6%) Query: 146 DGLALGKGVVVAETVEQAVEAAHEMLLDNKFGDSGA--RVVIEEFLEGEEF----SLFAF 199 D L L KG++V E+ + E L + F + + ++ + + + ++F Sbjct: 220 DELVLNKGLLVIGISERTEAKSVEKLAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQ 279 Query: 200 VNGDKFYIMPTAQDHKRAYDGDKGPNTGGM 229 ++ F + + Y P++ + Sbjct: 280 IDYSVFTSFTSDDMYFSIYVLTYNPSSSKI 309
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 29.8 bits (67), Expect = 0.005 Identities = 9/35 (25%), Positives = 14/35 (40%), Gaps = 1/35 (2%) Query: 105 YLPEFPGAHGIEDAWNAGVGQSGVTIHWVDSGVDT 139 +P WN G+ GV + +D+G D Sbjct: 21 EIPRGVEMIQAPAVWNQTRGR-GVKVAVLDTGCDA 54
>BINARYTOXINA#Clostridial binary toxin A signature. Length = 454 Score = 30.0 bits (67), Expect = 0.015 Identities = 11/33 (33%), Positives = 15/33 (45%) Query: 193 YSLVRRVFADYTGEEVLPELEGKKLKEVLLEPT 225 YS R+ F DY E E E K L+ + + Sbjct: 93 YSQTRQYFYDYQIESNPREKEYKNLRNAISKNK 125
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 28.4 bits (63), Expect = 0.035 Identities = 18/74 (24%), Positives = 30/74 (40%), Gaps = 4/74 (5%) Query: 136 KNDDLDDPFINDEHVKFLQIADDQQIAYLKEEARRINE----LLKVWFAEIGLKLIDFKL 191 K D L I+ V F + +D + + I + WF + + + ++ Sbjct: 868 KEDKLSLADIDFRDVAFKREGNDLIMYKGEGNVLSIGHKNGITFRNWFEKESGDISNHEI 927 Query: 192 EFGFDKDGKIILAD 205 E FDK G+II D Sbjct: 928 EQIFDKSGRIITPD 941
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 64.5 bits (157), Expect = 3e-13 Identities = 68/444 (15%), Positives = 146/444 (32%), Gaps = 60/444 (13%) Query: 27 MALLLVFLLGFATVAEKEMSLSTRATVEPSRILANIQSTSN---NRILVNHLEENKLVKK 83 M L++ + + + + E+ + + S I+ N I+V +E + V+K Sbjct: 65 MGFLVIAFI-LSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIV---KEGESVRK 120 Query: 84 GDLLVQYQEGAEGVQAESYASQLDMLKDQKKQLEYLQKSLQEGENHFPEEDKFGYQATFR 143 GD+L++ A G +A++ +Q +L+ + +Q Y S N PE Sbjct: 121 GDVLLKLT--ALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQN 178 Query: 144 DYISQAGSLRASTSQQNETIASQNAAASQT----QAEIGNLISQTEAKIRDYQTAKSAIE 199 + L + +Q T +Q +AE ++++ + KS ++ Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLD 238 Query: 200 TGTSLAGQNLAYSLYQSYKSQGEENPQTKVQAVAQVEAQISQLESSLATYRVQYAGSGTQ 259 +SL + + + + ++ +S L + + Sbjct: 239 DFSSLLHKQAI----------AKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSA--- 285 Query: 260 QAYASGLSSQLESLKSQHLAKVGQELSLLAQKILEAESGKKVQGNLLDKGKITASEDGVL 319 + ++ ++ L + + LL ++ + E I A + Sbjct: 286 KEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNE-------ERQQASVIRAPVSVKV 338 Query: 320 HLNPETSDSSMVAEGTLLAQLYPS---LEREGKAKLTAYLSSKDVARIKVGDSVR----- 371 ++ +V L + P LE + +KD+ I VG + Sbjct: 339 QQLKVHTEGGVVTTAETLMVIVPEDDTLEVTAL------VQNKDIGFINVGQNAIIKVEA 392 Query: 372 --YTTTHDAGNQLFLDSTITSIDATATKTEKGNFF-----KIEAETNLTSEQAEKLRYGV 424 YT L + +I+ A + ++ IE T + L G+ Sbjct: 393 FPYTRYGY------LVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGM 446 Query: 425 EGRLQMITGKKSYLRYYLDQFLNK 448 ++ TG +S + Y L Sbjct: 447 AVTAEIKTGMRSVISYLLSPLEES 470
>ANTHRAXTOXNA#Anthrax toxin LF subunit signature. Length = 800 Score = 30.9 bits (69), Expect = 0.024 Identities = 34/209 (16%), Positives = 76/209 (36%), Gaps = 26/209 (12%) Query: 306 NLFFMTLLALPIYTVIIFAFMKPFEKMNRDTMEANAVLSSSIIEDINGIETIKSLTSESQ 365 N F ++ ++V++FA + +E NA+ DI + +E + Sbjct: 4 NKFIPNKFSIISFSVLLFAIS------SSQAIEVNAMNEHYTESDIKRNHKTEKNKTEKE 57 Query: 366 RYQKIDKEFVDYLKKSFTYSRAESQQKALKKVAHLLLNVGILWMGAVLVMDGKMSLGQLI 425 +++ V + T + + Q LKK+ +L + G + D + Sbjct: 58 KFKDSINNLVKTEFTNETLDKIQQTQDLLKKIPKDVLEIYSELGGEIYFTDID------L 111 Query: 426 TYNTLLVYFTNPLENIINLQTKLQTAQVANNRLNEVYLVASEFEEKKTV---EDLSLMKG 482 + L + +N +N + + ++ + E K + +D ++ Sbjct: 112 VEHKELQDLSEEEKNSMNSRGE-------KVPFASRFVFEKKRETPKLIINIKDYAI--N 162 Query: 483 DMTFKQVHYKYGYG--RDVLSDINLTVPQ 509 K+V+Y+ G G D++S P+ Sbjct: 163 SEQSKEVYYEIGKGISLDIISKDKSLDPE 191
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 27.4 bits (61), Expect = 0.005 Identities = 14/32 (43%), Positives = 19/32 (59%), Gaps = 2/32 (6%) Query: 39 VDLMEFILTLEDEFSIEISDEEIDQLQNVGDV 70 V+LM++I LED IE + + LQ GDV Sbjct: 266 VELMDYIQALEDALGIEA-KKNMLPLQP-GDV 295