>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 142 bits (361), Expect = 7e-40 Identities = 83/387 (21%), Positives = 149/387 (38%), Gaps = 84/387 (21%) Query: 5 IGIDLGTTNSCVAIMDGTTPRVLENAEGDRTTPSIIAYTQDGET------LVGQPAKRQA 58 + IDLGT N+ + + + E PS++A QD VG AK+ Sbjct: 13 LSIDLGTANTLIYVKGQGIV-LNE--------PSVVAIRQDRAGSPKSVAAVGHDAKQML 63 Query: 59 VTNPQNTLFAIKRLIGRRFQDEEVQRDVSIMPFKIIAADNGDAWVEVKGQKMAPPQISAE 118 P N + AI+ + +D I F + + Sbjct: 64 GRTPGN-IAAIRPM-----------KDGVIADFFVTEK------------------MLQH 93 Query: 119 VLKKMKKTAEDYLGEPVTEAVITVPAYFNDAQRQATKDAGRIAGLEVKRIINEPTAAALA 178 +K++ + P ++ VP +R+A +++ + AG +I EP AAA+ Sbjct: 94 FIKQVHS---NSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIG 150 Query: 179 YGL--DKGTGNRTIAVYDLGGGTFDISIIEIDEVDGEKTFEVLATNGDTHLGGEDFDSRL 236 GL + TG+ V D+GGGT ++++I ++ V + +GG+ FD + Sbjct: 151 AGLPVSEATGS---MVVDIGGGTTEVAVISLNGV---------VYSSSVRIGGDRFDEAI 198 Query: 237 INYLVEEFKKDQGIDLRNDPLAMQRLKEAAEKAKIELSSA----QQTDVNLPYITADATG 292 INY+ + G + AE+ K E+ SA + ++ + Sbjct: 199 INYVRRNYGSLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGV 245 Query: 293 PKHMNIKVTRAKLESLVEDLVNRSIEPLKVALQD-AGLSVSDIDD--VILVGGQTRMPMV 349 P+ + + LE+L E + + + VAL+ SDI + ++L GG + + Sbjct: 246 PRGFTLN-SNEILEALQEP-LTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNL 303 Query: 350 QKKVAEFFGKEPRKDVNPDEAVAIGAA 376 + + E G +P VA G Sbjct: 304 DRLLMEETGIPVVVAEDPLTCVARGGG 330
>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature. Length = 52 Score = 61.4 bits (149), Expect = 4e-17 Identities = 18/46 (39%), Positives = 30/46 (65%) Query: 23 HKAMIVALIVICITAVVAALVTRKDLCEVHIRTGQTEVAVFTAYES 68 +++ ++++C+T ++ +TRK LCE+ R G EVA F AYES Sbjct: 5 RSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYES 50
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 176 bits (447), Expect = 2e-52 Identities = 59/257 (22%), Positives = 105/257 (40%), Gaps = 4/257 (1%) Query: 2 AASPDIAKTRHQINLSNSTSFSKDGYSSNNTGITGIAGEHDQLNYGI---YVNQQQQNND 58 D + S S S +G +N G+ G E + L+Y + Y N+ Sbjct: 605 WLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSG 664 Query: 59 TSLGTNLSWRTPIAIIDGSYSHSKNAWQSGGSISSGLVVWPGGINITNQLSDTFAILDAP 118 ++ L++R + YSHS + Q +S G++ G+ + L+DT ++ AP Sbjct: 665 STGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAP 724 Query: 119 GLEGAHINGQKYNRTNSKGQVVYDLIIPHRENHLVLDIANSESETELQGNRQIIAPYRGA 178 G + A + Q RT+ +G V +REN + LD +L + P RGA Sbjct: 725 GAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGA 784 Query: 179 VSYVQFTTDQRKPWYIQALRPDGSPLTFGYDVLDLQENNIGVVGQGSRLFIRVDEIPTGI 238 + +F + L + PL FG V + G+V ++++ + + Sbjct: 785 IVRAEFKARVGIKLLMT-LTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKV 843 Query: 239 KVALNDEQNLFCTITFQ 255 +V +E+N C +Q Sbjct: 844 QVKWGEEENAHCVANYQ 860
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 91.1 bits (226), Expect = 9e-25 Identities = 27/128 (21%), Positives = 52/128 (40%), Gaps = 14/128 (10%) Query: 1 MAAWRYASQDYRTFSDHLYENDKHYHQSDYDDFYDIG------------RKNSLSANIMQ 48 + +RY++ Y F+D Y Y+ D + ++ L + Q Sbjct: 476 LVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQ 535 Query: 49 PLSNNLGNVSLSALWRNYWGRSGNAKDYQFSYSNNWQHISYTFSASQSYDENNKEEER-F 107 L + LS + YWG S + +Q + ++ I++T S S + + K ++ Sbjct: 536 QL-GRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQML 594 Query: 108 NLFISIPF 115 L ++IPF Sbjct: 595 ALNVNIPF 602
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 29.4 bits (66), Expect = 0.041 Identities = 19/103 (18%), Positives = 39/103 (37%), Gaps = 18/103 (17%) Query: 300 ILIADKQSVGERAVKGICGQVDGSVV------PGFIGLEAGQS-AFGDIYAWFGRVLGWP 352 + I++K+ + + + ++G + G I + + + G P Sbjct: 281 VRISEKEKIK---ITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSV---LGDTKLLP 334 Query: 353 L-EQLAAQHPELKAQINASQKQ----LLPALTEAWAKNPSLDH 390 E++ P L+ + S+ Q LL AL E +P L + Sbjct: 335 QRERIENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRY 377
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 227 bits (581), Expect = 5e-73 Identities = 94/405 (23%), Positives = 182/405 (44%), Gaps = 13/405 (3%) Query: 6 LWRWHGITGDGNAQDGMLWAESRALLLMALQQQMVTPLSLKRIAINSAQ----------- 54 + + + G G A+S L+++ + PLS+ + + Sbjct: 3 QYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRK 62 Query: 55 WRGDKS--AEVIHQLATLLKAGLTLSEGLALLAEQHPSKQWQALLQSLAHDLEQGIAFSN 112 R S A + QLATL+ A + L E L +A+Q L+ ++ + +G + ++ Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122 Query: 113 ALLPWSEVFPPLYQAMIRTGELTGKLDECCFELARQQKAQRQLTDKVKSALRYPIIILAM 172 A+ + F LY AM+ GE +G LD LA + ++Q+ +++ A+ YP ++ + Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182 Query: 173 AIMVVVAMLHFVLPEFAAIYKTFNTPLPALTQGIMTLADFSGEWSWLLVLFGFLLAIANK 232 AI VV +L V+P+ + LP T+ +M ++D + ++L +A + Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242 Query: 233 LLMRRPTWLIVRQKLLLRIPIMGSLMRGQKLTQIFTILALTQSAGITFLQGVESVRETMR 292 +++R+ + + LL +P++G + RG + L++ ++ + LQ + + M Sbjct: 243 VMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVMS 302 Query: 293 CPYWVQLLTQIQHDISNGHPIWLALKNTGEFSPLCLQLVRTGEASGSLDLMLDNLAHHHR 352 Y L+ + G + AL+ T F P+ ++ +GE SG LD ML+ A + Sbjct: 303 NDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQD 362 Query: 353 DNTMALADNLAALLEPTLLIITGGIIGTLVVAMYLPIFHLGDAMS 397 + L EP L++ ++ +V+A+ PI L MS Sbjct: 363 REFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLMS 407
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 49.1 bits (117), Expect = 2e-10 Identities = 26/79 (32%), Positives = 43/79 (54%), Gaps = 1/79 (1%) Query: 1 MDKQRGFTLIELMVVIGIIAILSAIGIPAYQNYLRKAALTDMLQTFVPYRTAVELCALEH 60 DKQRGFTL+E+MVVI II +L+++ +P KA + V A+++ L++ Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN 63 Query: 61 GGLDTCD-GGSNGIPSPTT 78 T + G + + +PT Sbjct: 64 HHYPTTNQGLESLVEAPTL 82
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 34.3 bits (78), Expect = 4e-04 Identities = 12/55 (21%), Positives = 28/55 (50%), Gaps = 4/55 (7%) Query: 186 NDYYRKVKELRAKNQITLPVILKNERQINVFLRT----EDIDLINVINEETLLQQ 236 + ++ EL A N T+ +K ++N+ +R D + I V +E+++++ Sbjct: 589 QNIKNQLAELNATNIYTVLDKIKLNAKMNILIRDKRFHYDRNNIAVGADESVVKE 643
>PF06580#Sensor histidine kinase Length = 349 Score = 30.6 bits (69), Expect = 0.016 Identities = 14/95 (14%), Positives = 30/95 (31%), Gaps = 9/95 (9%) Query: 18 RPAMPRFKVSAFWLLILAWIFL-LVWIWWKGPMWTLYEEQWLKPLANRWLATAAWG---- 72 + + ++ A + + +VW +W L KP+A + Sbjct: 66 QGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLALSIIFNVV 125 Query: 73 IIALVW----LTVRVMKRLQQLEKMQKQQREEAVD 103 ++ +W K +Q E Q + A + Sbjct: 126 VVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQE 160
>ENTSNTHTASED#Enterobactin synthetase component D signature. Length = 234 Score = 26.5 bits (58), Expect = 0.013 Identities = 6/23 (26%), Positives = 10/23 (43%) Query: 45 AVYKDHPLQGSWKGYRDAHVEPD 67 +VYK + + G+ A V Sbjct: 153 SVYKAFSDRVTLPGFNSAKVTSL 175
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 435 bits (1120), Expect = e-157 Identities = 141/315 (44%), Positives = 203/315 (64%), Gaps = 3/315 (0%) Query: 1 MKELVVVAIGGNSIIKDNASQSIEHQAEAVKAVADMVLEMLASDYDIVLTHGNGPQVGLD 60 M + VV+A+GGN++ + S E + V+ A + E++A Y++V+THGNGPQVG Sbjct: 1 MGKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSL 60 Query: 61 LRRAEIAHEREGLPLTPLANCVADTQGGIGYLIQQALNNRLARHG-EKKAVTVVTQVEVD 119 L + G+P P+ A +QG IGY+IQQAL N L + G EKK VT++TQ VD Sbjct: 61 LLHMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVD 120 Query: 120 KNDPGFAHPTKPIGAFFSESQRDKLQKANPDWCFVEDAGRGYRRVVASPEPKRIVEAPAI 179 KNDP F +PTKP+G F+ E +L + W ED+GRG+RRVV SP+PK VEA I Sbjct: 121 KNDPAFQNPTKPVGPFYDEETAKRLAREK-GWIVKEDSGRGWRRVVPSPDPKGHVEAETI 179 Query: 180 KALIQQGFVVIGAGGGGIPVVRTEAGDYQSVDAVIDKDLSTALLAREIHADILVITTDVE 239 K L+++G +VI +GGGG+PV+ E G+ + V+AVIDKDL+ LA E++ADI +I TDV Sbjct: 180 KKLVERGVIVIASGGGGVPVIL-EDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVN 238 Query: 240 KVCIHFGKPQQQALDRVDIATMTRYMQEGHFSPGSMLPKIIASLTFLEQGGKEVIITTPE 299 +++G ++Q L V + + +Y +EGHF GSM PK++A++ F+E GG+ II E Sbjct: 239 GAALYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHLE 298 Query: 300 CLPAALRGETGTHII 314 AL G+TGT ++ Sbjct: 299 KAVEALEGKTGTQVL 313
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 57.6 bits (139), Expect = 4e-11 Identities = 48/195 (24%), Positives = 82/195 (42%), Gaps = 4/195 (2%) Query: 1 MSTRTPSSSSSRLMLTIGLCFLVALMEGLDLQAAGIAAGGIAQAFALDKMQMGWIFSAGI 60 M+T S+ + I LC L L+ ++ IA F W+ +A + Sbjct: 1 MNTSYSQSNLRHNQILIWLCILS-FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFM 59 Query: 61 LGLLPGALVGGMLADRYGRKRILIGSVALFGLFSLATAIAWD-FPSLVFARLMTGVGLGA 119 L G V G L+D+ G KR+L+ + + S+ + F L+ AR + G G A Sbjct: 60 LTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG-AA 118 Query: 120 ALPNLIA-LTSEAAGPRFRGTAVSLMYCGVPIGAALAATLGFAGANLAWQTVFWVGGVVP 178 A P L+ + + RG A L+ V +G + +G A+ + + ++ Sbjct: 119 AFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMIT 178 Query: 179 LILVPLLMRWLPESA 193 +I VP LM+ L + Sbjct: 179 IITVPFLMKLLKKEV 193
>TRNSINTIMINR#Translocated intimin receptor (Tir) signature. Length = 549 Score = 28.2 bits (62), Expect = 0.018 Identities = 14/56 (25%), Positives = 30/56 (53%), Gaps = 2/56 (3%) Query: 11 LKAGLVTSKKAAKVERTAKKSRVQAREARAAVEENKKAQLERDKQLSEQQKQAALA 66 + +G + ++ + AK++ AR+ AVE N +AQ + Q + +Q++ L+ Sbjct: 308 IPSGELKDDIVEQIAQQAKEAGEVARQQ--AVESNAQAQQRYEDQHARRQEELQLS 361
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 28.7 bits (64), Expect = 0.015 Identities = 13/31 (41%), Positives = 20/31 (64%) Query: 12 QVIIDETAGEVVIGANTRICHGAVIQGPVVI 42 +V+I+E G +VIGA+ RI AV G + + Sbjct: 262 KVVINERTGTIVIGADVRISRVAVSYGTLTV 292
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 29.7 bits (66), Expect = 0.019 Identities = 19/69 (27%), Positives = 30/69 (43%) Query: 265 DIVRELRERTELPIGAYQVSGEYAMIKFAALAGAIDEEKVVLESLGSIKRAGADLIFSYF 324 + EL + +L + QV G A F +D E L I+ A +IF+ Sbjct: 466 NQFLELEKTKQLRLDTDQVYGNIATYNFENGRVRVDTGSNWSEVLPQIQETTARIIFNGK 525 Query: 325 ALDLAEKKI 333 L+L E++I Sbjct: 526 DLNLVERRI 534
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 39.4 bits (92), Expect = 3e-05 Identities = 71/347 (20%), Positives = 135/347 (38%), Gaps = 20/347 (5%) Query: 62 KFLWSPLMDRYTPPFFGRRRGWLLATQILLLVAIAAMGFLEPGTQLRWMAALAVVIAFCS 121 +F +P++ + F RR LL + V A M W+ + ++A + Sbjct: 56 QFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAIMAT----APFLWVLYIGRIVAGIT 109 Query: 122 ASQDIVFDAWKTDVLPAEERGAGAAISVLGYRLGMLVSGGLALWLADKWLGWQGMYWLMA 181 + V A+ D+ +ER + GM+ L + ++ A Sbjct: 110 GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG--FSPHAPFFAAA 167 Query: 182 AL-LIPCIIATLLAPEP--TDTIPVPKTLEQAVVAPLRDFFGRNNAWLILLLIVLYKLGD 238 AL + + L PE + P+ + + + A L+ + ++ +G Sbjct: 168 ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQ 227 Query: 239 AFAMSLTTTFLIRGVGFDAGEVGVVNKTLGLLATIVGALYGGILMQRLSLFRALLIFGIL 298 A +L F +DA +G+ G+L ++ A+ G + RL RAL+ G++ Sbjct: 228 VPA-ALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM-LGMI 285 Query: 299 QGASNAGYWLLSITDKHLYSMGAAVFFENLCGGMGTSAFVALLMTLCNKSFSATQFALLS 358 A GY LL+ + + V GG+G A A+L ++ L+ Sbjct: 286 --ADGTGYILLAFATRGWMAFPIMVLL--ASGGIGMPALQAMLSRQVDEERQGQLQGSLA 341 Query: 359 ALSAVGRVYVGPVAGWFVEAHGWSTF--YLFSVAAAVPGLILLLVCR 403 AL+++ + VGP+ + A +T+ + + AA+ L L + R Sbjct: 342 ALTSLTSI-VGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387
>PF06291#Lambda prophage Bor protein Length = 102 Score = 26.5 bits (58), Expect = 0.027 Identities = 11/34 (32%), Positives = 18/34 (52%) Query: 3 KKILFPLVALFMLAGCAKPPTTIEVSPTITLPQQ 36 KK+LF ++ GCA+ T+ PT P++ Sbjct: 7 KKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKE 40
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.0 bits (65), Expect = 0.043 Identities = 16/73 (21%), Positives = 29/73 (39%), Gaps = 13/73 (17%) Query: 60 ERSALPTPHEIRNHLDDYVIGQEQAKKVLAVAVYNHYKRLRNGDTSNGVELGKSNILLIG 119 E P+ E + ++G+ A + +Y RL D +++ G Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQ----EIYRVLARLMQTD---------LTLMITG 167 Query: 120 PTGSGKTLLAETL 132 +G+GK L+A L Sbjct: 168 ESGTGKELVARAL 180
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 34.3 bits (78), Expect = 0.002 Identities = 34/133 (25%), Positives = 68/133 (51%), Gaps = 15/133 (11%) Query: 191 ERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMDDAPD- 249 LE A +E + +L R +++ ++ S+ +Q++A ++L E + + Sbjct: 291 AALEAEKADLEHQSQVLNAN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEA 344 Query: 250 ENEALKRKIDAAKMPKEAKEKAEAELQKLKMMSPMS-AEATVVRGYIDWMVQVPWNARSK 308 ++L+R +DA++ EAK++ EAE QKL+ + +S A +R +D + A+ + Sbjct: 345 SRQSLRRDLDASR---EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE----AKKQ 397 Query: 309 VKKDLRQAQEILD 321 V+K L +A L Sbjct: 398 VEKALEEANSKLA 410
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 117 bits (294), Expect = 3e-38 Identities = 49/88 (55%), Positives = 67/88 (76%) Query: 2 NKSQLIDKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTFAVKERAARTGR 61 NK LI K+A +++K + A+DA+ ++V+ L +G+ V L+GFG F V+ERAAR GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 62 NPQTGKEITIAAAKVPSFRAGKALKDAV 89 NPQTG+EI I A+KVP+F+AGKALKDAV Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 31.0 bits (70), Expect = 0.007 Identities = 33/141 (23%), Positives = 56/141 (39%), Gaps = 25/141 (17%) Query: 247 IWLPLGLVIGLLAAMFVLRILRRIQSPHHRLQDAIENRDICVHYQPIVSLANGKIVGAEA 306 W+ L L+ G +A +LR R+ + + P++ G+I Sbjct: 228 PWMLLALLAGFMAFRVMLR------QEKRRVS-----FHRRLLHLPLI----GRIARGLN 272 Query: 307 LARWPQTDGSWLSPDSFIPLAQQTGLS-EPLTLLIIRSAFEDMGDWLRQHPQQHISINLE 365 AR+ +T + S +PL Q +S + ++ R D +R+ H + LE Sbjct: 273 TARYARTLSILNA--SAVPLLQAMRISGDVMSNDYARHRLSLATDAVREGVSLHKA--LE 328 Query: 366 STVLTSEKIPQLLREMINQSG 386 T L P ++R MI SG Sbjct: 329 QTAL----FPPMMRHMI-ASG 344
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1367 bits (3541), Expect = 0.0 Identities = 801/1033 (77%), Positives = 915/1033 (88%), Gaps = 1/1033 (0%) Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLAILKLPVAQYPTIAPPAVTISASYPGADAKTVQDT 60 M NFFI RPIFAWV+AII+M+AG LAIL+LPVAQYPTIAPPAV++SA+YPGADA+TVQDT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDNLMYMSSNSDSTGTVQITLTFESGTDADIAQVQVQNKLQLAMPLLPQ 120 VTQVIEQNMNGIDNLMYMSS SDS G+V ITLTF+SGTD DIAQVQVQNKLQLA PLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGVSVEKSSSSFLMVVGVINTDGTMTQEDISDYVAANMKDAISRTSGVGDVQLFGS 180 EVQQQG+SVEKSSSS+LMV G ++ + TQ+DISDYVA+N+KD +SR +GVGDVQLFG+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWMNPNELNKFQLTPVDVITAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240 QYAMRIW++ + LNK++LTPVDVI +K QN Q+AAGQLGGTP + GQQLNASIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 TSTEEFGKILLKVNQDGSRVLLRDVAKIELGGENYDIIAEFNGQPASGLGIKLATGANAL 300 + EEFGK+ L+VN DGS V L+DVA++ELGGENY++IA NG+PA+GLGIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTAAAIRAELAKMEPFFPSGLKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360 DTA AI+A+LA+++PFFP G+K++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 AEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFVPMAFFGGSTGAIYRQFSITIVSAMAL 480 E+ LPPKEAT KSM QIQGALVGIAMVLSAVF+PMAFFGGSTGAIYRQFSITIVSAMAL Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATMLKPIAKGDHGEGKKGFFGWFNRMFEKSTHHYTDSVGGILRSTGR 540 SVLVALILTPALCAT+LKP++ H E K GFFGWFN F+ S +HYT+SVG IL STGR Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 541 YLVLYLIIVVGMAYLFVRLPSSFLPDEDQGVFMTMVQLPAGATQERTQKVLNEVTHYYLT 600 YL++Y +IV GM LF+RLPSSFLP+EDQGVF+TM+QLPAGATQERTQKVL++VT YYL Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 601 KEKNNVESVFAVNGFGFAGRGQNTGIAFVSLKDWADRPGEENKVEAITMRATRAFSQIKD 660 EK NVESVF VNGF F+G+ QN G+AFVSLK W +R G+EN EA+ RA +I+D Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659 Query: 661 AMVFAFNLPAIVELGTATGFDFELIDQAGLGHEKLTQARNQLLAEAAKHPDMLTSVRPNG 720 V FN+PAIVELGTATGFDFELIDQAGLGH+ LTQARNQLL AA+HP L SVRPNG Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719 Query: 721 LEDTPQFKIDIDQEKAQALGVSINDINTSLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYR 780 LEDT QFK+++DQEKAQALGVS++DIN ++ A GG+YVNDFIDRGRVKK+YV ++AK+R Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779 Query: 781 MLPDDIGDWYVRAADGQMVPFSAFSSSRWEYGSPRLERYNGLPSMEILGQAAPGKSTGEA 840 MLP+D+ YVR+A+G+MVPFSAF++S W YGSPRLERYNGLPSMEI G+AAPG S+G+A Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839 Query: 841 MELMEQLASKLPTGVGYDWTGMSYQERLSGNQAPSLYAISLIVVFLCLAALYESWSIPFS 900 M LME LASKLP G+GYDWTGMSYQERLSGNQAP+L AIS +VVFLCLAALYESWSIP S Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899 Query: 901 VMLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGL 960 VMLVVPLG++G LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+ Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959 Query: 961 IEATLDAVRMRLRPILMTSLAFILGVMPLVISTGAGSGAQNAVGTGVMGGMVTATVLAIF 1020 +EATL AVRMRLRPILMTSLAFILGV+PL IS GAGSGAQNAVG GVMGGMV+AT+LAIF Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019 Query: 1021 FVPVFFVVVRRRF 1033 FVPVFFVV+RR F Sbjct: 1020 FVPVFFVVIRRCF 1032
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 41.7 bits (98), Expect = 4e-06 Identities = 32/212 (15%), Positives = 71/212 (33%), Gaps = 23/212 (10%) Query: 100 TYQATYDSAKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTA 159 + Y A +L + + Q+ Q +++ ++ L +Q + Sbjct: 256 EQENKYVEAVNELR--VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313 Query: 160 AKAAVETARINLAYTKVTSPISGRIGKSNV-TEGALVQNGQATALATVQQLDPIYLDVTQ 218 + + + +P+S ++ + V TEG +V + T + V + D + + Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALV 372 Query: 219 SSNDFLRLKQELA----------NGTLKQENGKAKVSLITSDGIKFPQDGTLEFSDVTVD 268 + D + KV I D I+ + G + ++++ Sbjct: 373 QNKDIGFINVGQNAIIKVEAFPYTRYGYLV---GKVKNINLDAIEDQRLGLVFNVIISIE 429 Query: 269 QTTGSITLRAIFPNPDHTLLPGMFVRARLEEG 300 + S + I L GM V A ++ G Sbjct: 430 ENCLSTGNKNIP------LSSGMAVTAEIKTG 455 Score = 34.4 bits (79), Expect = 8e-04 Identities = 24/125 (19%), Positives = 43/125 (34%), Gaps = 13/125 (10%) Query: 49 PLQITTELPGR-TSAYRIAEVRPQVSGIILKRNFKEGSDIEAGVSLYQIDPATYQATYDS 107 ++I G+ T + R E++P + I+ + KEG + G L ++ +A Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA---- 134 Query: 108 AKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTAAKAAVETA 167 D K Q++ A+L RYQ L E ++ Sbjct: 135 ---DTLKTQSSLLQARLEQTRYQILS-----RSIELNKLPELKLPDEPYFQNVSEEEVLR 186 Query: 168 RINLA 172 +L Sbjct: 187 LTSLI 191
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 222 bits (567), Expect = 5e-76 Identities = 215/215 (100%), Positives = 215/215 (100%) Query: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180 Query: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.7 bits (72), Expect = 0.017 Identities = 19/125 (15%), Positives = 40/125 (32%), Gaps = 6/125 (4%) Query: 28 QNTAFARASSNGDLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDKIDRIKEE 87 N RA L + + L L+ + A L++ ++ E Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSR--LDDFSSLLHKQAIAKHAVLEQENKYVEA 264 Query: 88 TVQLRQKVAEAPEKMRQATAALTALSDVDND--EETRKIL--STLSLRQLETRVAQALDD 143 +LR ++ + + +A V E L +T ++ L +A+ + Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEER 324 Query: 144 LQNAQ 148 Q + Sbjct: 325 QQASV 329
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 42.4 bits (99), Expect = 6e-06 Identities = 42/249 (16%), Positives = 79/249 (31%), Gaps = 27/249 (10%) Query: 402 PLPETTSQVLAARQ--QLQCVQGATKAKKSESAAATRARPVNNAALERLASVTDRVQARP 459 P E +Q + +Q + S + AR + A + A T Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEE--IARV-DEAPVPPPAPATPSETTET 1039 Query: 460 VPSALEKASAKKEAYRWKATTPVMQQKE--------VVATPKALKKA---LEHEKTPELA 508 V ++ S E AT Q +E V A + + A E ++T Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099 Query: 509 AKLAA---------EAIERDPWAAQVSQLSLPKLVEQVALNAWKE-ESDNAVCLHLRSSQ 558 K A E+ +V+ PK + + E +N ++++ Q Sbjct: 1100 TKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQ 1159 Query: 559 RHLNNRGAQQKLAEALS-MLKGSTVELTIVEDDNPAVRTPLEWRQAIYEEKLAQARESII 617 N ++ A+ S ++ E T V N V P A + + + Sbjct: 1160 SQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKP 1219 Query: 618 ADNNIQTLR 626 + + +++R Sbjct: 1220 KNRHRRSVR 1228
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 27.8 bits (62), Expect = 0.046 Identities = 11/43 (25%), Positives = 18/43 (41%) Query: 38 GQLAAVAIVTCDGNVYSAGDSDYRFALESISKVCTLALALEDV 80 G++ + + G +A +D RF + S KV L V Sbjct: 38 GRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARV 80
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 77.8 bits (191), Expect = 5e-19 Identities = 49/212 (23%), Positives = 81/212 (38%), Gaps = 7/212 (3%) Query: 16 KSVLITGCSSGIGLESALELKRQGFHVLAGCRKPDDVERMNS----MGFT--GVLIDLDS 69 K ITG + GIG A L QG H+ A P+ +E++ S D+ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 70 PESVDRAADEVIALTDNCLYGIFNNAGFGMYGPLSTISRAQMEQQFSANFFGAHQLTMRL 129 ++D + + + N AG G + ++S + E FS N G + + Sbjct: 69 SAAIDEITARIEREMGP-IDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 130 LPAMLPHGEGRIVMTSSVMGLISTPGRGAYAASKYALEAWSDALRMELRHSGIKVSLIEP 189 M+ G IV S + AYA+SK A ++ L +EL I+ +++ P Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 190 GPIRTRFTDNVNQTQSDKPVENPGIAARFTLG 221 G T ++ ++ G F G Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTG 219
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.014 Identities = 12/20 (60%), Positives = 13/20 (65%) Query: 41 LVGESGSGKSTLLAILAGLD 60 L G G GKSTL+ L GLD Sbjct: 601 LEGTGGIGKSTLINTLVGLD 620
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.0 bits (70), Expect = 0.011 Identities = 17/150 (11%), Positives = 44/150 (29%), Gaps = 8/150 (5%) Query: 299 RSQLNYSEENLKQARAALERLYTALRGTEKTVAPAGGEAFEARFIEAMDDDFNTP----- 353 + ++ +L QAR R R E P E F +++ Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK 192 Query: 354 EAYSVLFDMAREVNRLKAEDMAAANAMASHLRKLSAVLGLLEQEPEAFLQSGAQADDSEV 413 E +S + + + A + + + + + + + + F + Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS---LLHKQAI 249 Query: 414 AEIEALIQQRLDARKAKDWAAADAARDRLN 443 A+ L Q+ + + +++ Sbjct: 250 AKHAVLEQENKYVEAVNELRVYKSQLEQIE 279
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 387 bits (995), Expect = e-138 Identities = 126/310 (40%), Positives = 176/310 (56%), Gaps = 16/310 (5%) Query: 2 KTLVVALGGNALLQRGEALTAENQYRNIASAVPALARL-ARSYRLAIVHGNGPQVGLLAL 60 K +V+ALGGNAL QRG+ + E N+ +A + AR Y + I HGNGPQVG L L Sbjct: 3 KRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLLL 62 Query: 61 QNLAWKE---VEPYPLDVLVAESQGMIGYMLAQSLSAQPQM----PPVTTVLTRIEVSPD 113 A + + P+DV A SQG IGYM+ Q+L + + V T++T+ V + Sbjct: 63 HMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDKN 122 Query: 114 DPAFLQPEKFIGPVYQPEEQEALEAAYGWQMKRD-GKYLRRVVASPQPRKILDSEAIELL 172 DPAF P K +GP Y E + L GW +K D G+ RRVV SP P+ +++E I+ L Sbjct: 123 DPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKL 182 Query: 173 LKEGHVVICSGGGGVPVTEDG---AGSEAVIDKDLAAALLAEQINADGLVILTDADAVYE 229 ++ G +VI SGGGGVPV + G EAVIDKDLA LAE++NAD +ILTD + Sbjct: 183 VERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAAL 242 Query: 230 NWGTPQQRAIRHATPDELAPFAKAD----GSMGPKVTAVSGYVRSRGKPAWIGALSRIEE 285 +GT +++ +R +EL + + GSMGPKV A ++ G+ A I L + E Sbjct: 243 YYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHLEKAVE 302 Query: 286 TLAGEAGTCI 295 L G+ GT + Sbjct: 303 ALEGKTGTQV 312
>PF07201#Hypersensitivity response secretion protein HrpJ Length = 293 Score = 31.7 bits (72), Expect = 0.014 Identities = 17/72 (23%), Positives = 25/72 (34%), Gaps = 1/72 (1%) Query: 629 PAAVSDLRAALELEPNNSNIQAALGYALWDSGDIAQSREMLEQAHKRLPDDPALIRQLAY 688 VS+L + L N ++ Y S + ++ +ML L P L Sbjct: 100 KQNVSELLSLL-SNSPNISLSQLKAYLEGKSEEPSEQFKMLCGLRDALKGRPELAHLSHL 158 Query: 689 VNQRLDDMPATQ 700 V Q L M Q Sbjct: 159 VEQALVSMAEEQ 170
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 36.3 bits (84), Expect = 2e-04 Identities = 82/394 (20%), Positives = 145/394 (36%), Gaps = 38/394 (9%) Query: 27 FISIVSLGLLGVAVPVQIQMMTHSTWQV---GLSVTLTGGAMFVGLMVGGVLADRYERKK 83 + V +GL+ +P ++ + HS G+ + L F V G L+DR+ R+ Sbjct: 15 ALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRP 74 Query: 84 VILLARGTCGIGFIGLCLNALL--PEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGR 141 V+L + G ++ + P L +Y+ + G + G A A + Sbjct: 75 VLL-------VSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA-GAYIADITDG 126 Query: 142 ENLMQAGAITMLTVRLGSVISPMIGGLLLATGGVAWNYGLAAAGTFITLLPLLSLPALPP 201 + + G V P++GGL+ GG + + AA L L LP Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLM---GGFSPHAPFFAAAALNGLNFLTGCFLLPE 183 Query: 202 PPQPREHPLK----SLLAGFRFLLASPLVGGIALLGGLLTMAS----AVRVLYPALADNW 253 + PL+ + LA FR+ +V + + ++ + A+ V++ D + Sbjct: 184 SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG--EDRF 241 Query: 254 QMSAAQIGFLYAAIP-LGAAIGALTSGKLAHSARPGLLMLLSTLGS---FLAIGLFGLMP 309 A IG AA L + A+ +G +A ++L + ++ + Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301 Query: 310 MWILGVVCLALFGWLSAVSSLLQYTMLQTQTPEAMLGRINGLWTAQNVTGDAIGAALLGG 369 M +V LA G ML Q E G++ G A +G L Sbjct: 302 MAFPIMVLLASGGIGMPALQ----AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357 Query: 370 LGAMMTPVASASASGFGLLIIGVLLLLVLVELRR 403 + A + + +G+ + L LL L LRR Sbjct: 358 IYA----ASITTWNGWAWIAGAALYLLCLPALRR 387
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 63.4 bits (154), Expect = 1e-13 Identities = 61/289 (21%), Positives = 105/289 (36%), Gaps = 35/289 (12%) Query: 40 HTLESQPQRIVSTSVTLTGSLLAIDAPVIASGATTPNNRVADDQGFLRQWSKVAKERKLQ 99 H P RIV+ LLA+ VAD + R W E L Sbjct: 29 HAAAIDPNRIVALEWLPVELLLALGIVPYG---------VADTINY-RLW---VSEPPLP 75 Query: 100 RLYIG-----EPSAEAVAAQMPDLILISATGGDSALALYDQLSTIAPTLIINYDDKS--- 151 I EP+ E + P ++ SA G S + L+ IAP N+ D Sbjct: 76 DSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPL 131 Query: 152 --WQSLLTQLGEITGHEKQAAERIAQFDKQLAAAKEQIKLPPQPVTAIVYTAAAHSANLW 209 + LT++ ++ + A +AQ++ + + K + + ++ Sbjct: 132 AMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVF 191 Query: 210 TPESAQGQMLEQLGFTLAKLPAGLNASQSQGKRHDIIQLGGENLAAGLNGESLFLFAGDQ 269 P S ++L++ G NA Q + + + LAA + + L + Sbjct: 192 GPNSLFQEILDEYGIP--------NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNS 243 Query: 270 KDADAIYANPLLAHLPAVQNKQVYTLGTETFRLDYYSAMQVLDRLNSLL 318 KD DA+ A PL +P V+ + + F SAM + L++ + Sbjct: 244 KDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAI 292
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 440 bits (1134), Expect = e-159 Identities = 147/299 (49%), Positives = 194/299 (64%), Gaps = 18/299 (6%) Query: 1 MAIPKLQAYALPESHDIPQNKVDWAFEPQRAALLIHDMQDYFVSFWGENCPMMEQVIANI 60 MAIP +Q Y +P + D+PQNKV W +P RA LLIHDMQ+YFV + + ++ ANI Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60 Query: 61 AALRDYCKQHNIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQKVVDRLTPDADDTV 120 L++ C Q IPV YTAQP Q+ +DRALL D WGPGL P ++K++ L P+ DD V Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120 Query: 121 LVKWRYSAFHRSPLEQMLKESGRNQLIITGVYAHIGCMTTATDAFMRDIKPFMVADALAD 180 L KWRYSAF R+ L +M+++ GR+QLIITG+YAHIGC+ TA +AFM DIK F V DA+AD Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180 Query: 181 FSRDEHLMSLKYVAGRSGRVVMTEGLL------PAPVPARKA-----------ALREVIL 223 FS ++H M+L+Y AGR VMT+ LL PA V A +R+ I Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240 Query: 224 PLLDESDEPFDDD-NLIDYGLDSVRMMALAARWRKVHGDIDFVMLAKNPTIDAWWKLLS 281 LL E+ E D +L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W KLL+ Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 360 bits (925), Expect = e-129 Identities = 110/258 (42%), Positives = 150/258 (58%), Gaps = 20/258 (7%) Query: 5 GKNVWVTGAGKGIGYATALAFVEAGAKVTGFD---------------QAFAQEQYPFATE 49 GK ++TGA +GIG A A GA + D +A E +P Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63 Query: 50 VMDVADAAQVAQVCQRLLAETERLDVLVNAAGILRMGATDQLSKEDWQQTFAVNVGGAFN 109 DV D+A + ++ R+ E +D+LVN AG+LR G LS E+W+ TF+VN G FN Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122 Query: 110 LFQQTMNQFRRQRGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALSVGLELAVSGVRC 169 + +R G+IVTV S+ A PR M+AY +SKAA +GLELA +RC Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182 Query: 170 NVVSPGSTDTDMQRTLWVSDDAEEQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASDL 229 N+VSPGST+TDMQ +LW ++ EQ I+G E FK GIPL K+A+P +IA+ +LFL S Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242 Query: 230 ASHITLQDIVVDGGSTLG 247 A HIT+ ++ VDGG+TLG Sbjct: 243 AGHITMHNLCVDGGATLG 260
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 94.5 bits (235), Expect = 1e-24 Identities = 36/125 (28%), Positives = 59/125 (47%), Gaps = 1/125 (0%) Query: 2 TNVLIVEDEQAIRRFLRTALEGDGMRVFEAETLQRGLLEAATRKPDLIILDLGLPDGDGI 61 +L+ +D+ AIR L AL G V A DL++ D+ +PD + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 EFIRDLRQWSP-VPVIVLSARSEESDKIAALDAGADDYLSKPFGIGELQARLRVALRRHS 120 + + +++ P +PV+V+SA++ I A + GA DYL KPF + EL + AL Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 121 ATAAP 125 + Sbjct: 124 RRPSK 128
>PF06580#Sensor histidine kinase Length = 349 Score = 31.4 bits (71), Expect = 0.014 Identities = 10/48 (20%), Positives = 21/48 (43%), Gaps = 4/48 (8%) Query: 785 LLENAVKYAGAQAE----IGIDAHFEGENLQLDVWDNGPGLPPGQEQT 828 L+EN +K+ AQ I + + + L+V + G +++ Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKES 310
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 32.1 bits (73), Expect = 0.002 Identities = 19/82 (23%), Positives = 35/82 (42%), Gaps = 12/82 (14%) Query: 55 RVARLRKNACLKYQATPEGLRYPASRGL----RAEQMRELLNGHYIIHR-----KNLLIT 105 + + A + + P L + G+ R+ M+E+ ++ R L+IT Sbjct: 110 ELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYR---VLARLMQTDLTLMIT 166 Query: 106 GPTGCGKSWIANALGEQACRQK 127 G +G GK +A AL + R+ Sbjct: 167 GESGTGKELVARALHDYGKRRN 188
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 56.6 bits (136), Expect = 1e-10 Identities = 30/188 (15%), Positives = 58/188 (30%), Gaps = 6/188 (3%) Query: 99 EQERLKQLEKERLAAQEQKKQAEEAAKQAELKQKQAEEAAAKAAADAKAKAEADAKAAEE 158 E E+ Q QA+ + E A A ++ E Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVP----SNNEEIARVDEAPVPPPAPATPSETTET 1039 Query: 159 AAK--KAAADAKKKAEAEAAKAAAEAQKKAEAAAAALKKKAEAAEAAAAEARKKAAAEKA 216 A+ K + +K E +A + A+ ++ A+ A + +K + E A + + K Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099 Query: 217 AADKKAAEKAAADKAAADKKAAAEKAAADKKAAAAKAAAEKAAADKKAAAAKAAAEKAAA 276 + EK K +K K + ++ + A+ K Sbjct: 1100 TKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQ 1159 Query: 277 AKAAAEAD 284 ++ AD Sbjct: 1160 SQTNTTAD 1167 Score = 52.4 bits (125), Expect = 2e-09 Identities = 33/217 (15%), Positives = 69/217 (31%), Gaps = 1/217 (0%) Query: 66 RMQSQESSAKRSDEQRKIKEQQAAEELREKQAAEQERLKQLEKERLAAQEQKKQAEEAAK 125 R ++E+ + + + Q+ E +E Q E + +EKE A E +K E Sbjct: 1066 REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKV 1125 Query: 126 QAELKQKQAEEAAAKAAADAKAKAEADAKAAEEAAKKAAADAKKKAEAEAAKAAAEAQKK 185 +++ KQ + + A+ + + +E + A + A+ + E Sbjct: 1126 TSQVSPKQEQSETVQPQAEPARENDP-TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVT 1184 Query: 186 AEAAAAALKKKAEAAEAAAAEARKKAAAEKAAADKKAAEKAAADKAAADKKAAAEKAAAD 245 E E + +++ K + + + + A + Sbjct: 1185 ESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDR 1244 Query: 246 KKAAAAKAAAEKAAADKKAAAAKAAAEKAAAAKAAAE 282 A + A A AKA KA ++ Sbjct: 1245 STVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQ 1281 Score = 51.6 bits (123), Expect = 4e-09 Identities = 25/195 (12%), Positives = 67/195 (34%), Gaps = 5/195 (2%) Query: 68 QSQESSAKRSDEQRKIKEQQAAEELREKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQA 127 Q+ S ++E+ ++ +E ++ + +K E+ A + Sbjct: 1004 QADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKN--EQDATET 1061 Query: 128 ELKQKQ-AEEAAAKAAADAKAKAEADAKAAEEAAKKAAADAKKKAEAEAAKAAAEAQKKA 186 + ++ A+EA + A+ + A + + + + E E KA E +K Sbjct: 1062 TAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKE-EKAKVETEKTQ 1120 Query: 187 EAAAAALKKKAEAAEAAAAEARKKAAAEKAAADKKAAEKAAADKAAADKKAAAEKAAADK 246 E + + ++ + + + A E E + AD + A++ +++ Sbjct: 1121 EVPKVTSQVSPKQEQSETVQPQAEPARENDPT-VNIKEPQSQTNTTADTEQPAKETSSNV 1179 Query: 247 KAAAAKAAAEKAAAD 261 + ++ Sbjct: 1180 EQPVTESTTVNTGNS 1194 Score = 51.2 bits (122), Expect = 5e-09 Identities = 28/193 (14%), Positives = 61/193 (31%), Gaps = 5/193 (2%) Query: 87 QAAEELREKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAELKQKQAEEAAAKAAADAK 146 QA E R+ + A + E A+ KQ + K DA Sbjct: 1004 QADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN----SKQESKTVEKNEQDAT 1059 Query: 147 AKAEADAKAAEEAAKKAAADAKKKAEAEAAKAAAEAQKKAEAAAAALKKKAEAAEAAAAE 206 + + A+EA A+ + A++ E Q E A +K E A+ + Sbjct: 1060 ETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT-ETKETATVEKEEKAKVETEK 1118 Query: 207 ARKKAAAEKAAADKKAAEKAAADKAAADKKAAAEKAAADKKAAAAKAAAEKAAADKKAAA 266 ++ + K+ + +A ++ + ++ A + A + ++ Sbjct: 1119 TQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSN 1178 Query: 267 AKAAAEKAAAAKA 279 + ++ Sbjct: 1179 VEQPVTESTTVNT 1191 Score = 51.2 bits (122), Expect = 6e-09 Identities = 27/242 (11%), Positives = 81/242 (33%), Gaps = 23/242 (9%) Query: 51 DAVMVDSGAVVEQYKRMQSQESSAKRSDEQRKIKEQQAAE-ELREKQAAEQER------L 103 D V A + ++ ++K+ + + EQ A E + ++ A++ + Sbjct: 1021 DEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANT 1080 Query: 104 KQLEKERLAAQEQKKQAEEAAKQAELKQKQAEEAAAKAAADAKAKAEADAKAAEEAAKKA 163 + E + ++ ++ Q K+ +K+ KAK E + +E K Sbjct: 1081 QTNEVAQSGSETKETQ-TTETKETATVEKE-----------EKAKVETEKT--QEVPKVT 1126 Query: 164 AADAKKKAEAEAAKAAAEAQKKAEAAAAALKKKAEAAEAAAAE--ARKKAAAEKAAADKK 221 + + K+ ++E + AE ++ + + +++ A E A++ ++ + + Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTES 1186 Query: 222 AAEKAAADKAAADKKAAAEKAAADKKAAAAKAAAEKAAADKKAAAAKAAAEKAAAAKAAA 281 + + ++ + ++ ++ + Sbjct: 1187 TTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRST 1246 Query: 282 EA 283 A Sbjct: 1247 VA 1248 Score = 42.0 bits (98), Expect = 4e-06 Identities = 31/228 (13%), Positives = 64/228 (28%), Gaps = 4/228 (1%) Query: 59 AVVEQYKRMQSQESSAKRSDEQRKIKEQQAAEELREKQAAEQERLKQLEKERLAAQEQKK 118 A + + QS + + + K E EK E E+ +++ K +++ Sbjct: 1078 ANTQTNEVAQSGSETKETQTTETKETATVEKE---EKAKVETEKTQEVPKVTSQVSPKQE 1134 Query: 119 QAEEAAKQAELKQKQAEEAAAKAAADAKAKAEADAKAAEEAAKKAAADAKKKAEAEAAKA 178 Q+E QAE ++ K + A+E + + + Sbjct: 1135 QSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNS 1194 Query: 179 AAEAQKKAEAAAAALKKKAEAAEAAAAEARKKAAAEKAAADKKAAEKAAADKAAADKKAA 238 E + A +E++ R+ + + A + A Sbjct: 1195 VVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSV-PHNVEPATTSSNDRSTVALCDLT 1253 Query: 239 AEKAAADKKAAAAKAAAEKAAADKKAAAAKAAAEKAAAAKAAAEADDI 286 + A A AKA K + + E + + Sbjct: 1254 STNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQYNVWVSNT 1301
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 116 bits (292), Expect = 5e-34 Identities = 35/119 (29%), Positives = 54/119 (45%), Gaps = 4/119 (3%) Query: 55 EEQARLQMQQLQQNNIVYFDLDKYDIRSDFAQMLDAHANFLRSN--PSYKVTVEGHADER 112 +Q + + V F+ +K ++ + LD + L + V V G+ D Sbjct: 205 APAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRI 264 Query: 113 GTPEYNISLGERRANAVKMYLQGKGVSADQISIVSYGKEKPAVLGHDEAAYSKNRRAVL 171 G+ YN L ERRA +V YL KG+ AD+IS G+ P V G+ K R A++ Sbjct: 265 GSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNP-VTGN-TCDNVKQRAALI 321
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 29.0 bits (65), Expect = 0.025 Identities = 14/72 (19%), Positives = 28/72 (38%), Gaps = 4/72 (5%) Query: 24 AFAQAPISSVGSGSVEDRVTQLERISNAHSQLLTQLQQQLS---DNQSDIDSLRGQIQEN 80 F I +G+ + D +++ H L Q L + + + S+R E+ Sbjct: 664 PFNMPAIVELGTATGFD-FELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLED 722 Query: 81 QYQLNQVVERQK 92 Q V+++K Sbjct: 723 TAQFKLEVDQEK 734
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.0 bits (67), Expect = 0.010 Identities = 9/18 (50%), Positives = 12/18 (66%) Query: 31 LVLLGPSGAGKSSLLRVL 48 +VL G G GKS+L+ L Sbjct: 599 VVLEGTGGIGKSTLINTL 616
>INTIMIN#Intimin signature. Length = 939 Score = 30.0 bits (67), Expect = 0.006 Identities = 34/169 (20%), Positives = 58/169 (34%), Gaps = 4/169 (2%) Query: 13 ITVVCATSSVMAADDNAITDGKVTFNGKVIAPACTLVAATKDSVVTLPNVSATKL--QTN 70 +V T+ + A N GK T K P +V+A + + N +A QT Sbjct: 598 FNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTK 657 Query: 71 GAVSGVKTDVPIALEGCDVTVTKNATFTFSGTADGVQPTAFANQATTDAATNVALQM--Y 128 +++ +K D A+ +T Q F + + Y Sbjct: 658 ASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGY 717 Query: 129 LPDGSTSVTPGTETSNIQLADSAEQTVTFKVDYIATGKATSGNVNAVTN 177 TS TPG + +++D A +V++ T GN+ V Sbjct: 718 AKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGT 766
>CLENTEROTOXN#Clostridium enterotoxin signature. Length = 319 Score = 31.9 bits (72), Expect = 0.004 Identities = 13/48 (27%), Positives = 22/48 (45%) Query: 295 VGVVVTDSQNNIISPAGGTLPLSIPDDADSIARMNVYPVSTTGVPPET 342 + V TD + I+ A T L++ D +S N+Y ++ P T Sbjct: 188 LTVPSTDIEKEILDLAAATERLNLTDALNSNPAGNLYDWRSSNSYPWT 235
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 97.6 bits (243), Expect = 7e-25 Identities = 67/305 (21%), Positives = 125/305 (40%), Gaps = 7/305 (2%) Query: 10 IASPFWGGLADRKGRKLMLLRSALGMGIVMVLMGLAQNIWQFLILRALLGLLGGFVPNAN 69 +P G L+DR GR+ +LL S G + +M A +W I R + G+ G A Sbjct: 58 ACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG 117 Query: 70 ALIATQVPRNKSGWALGTLSTGGVSGALLGPMAGGLLADSYGLRPVFFITASVLILCFFV 129 A IA ++ G +S G + GP+ GGL+ + FF A++ L F Sbjct: 118 AYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLT 176 Query: 130 TLFCIREKFQPISKKEMLHMREVVTSLKNP---KLVLSLFVTTLIIQVATGSIAPILTLY 186 F + E + + + S + +V +L I+Q+ A + ++ Sbjct: 177 GCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIF 236 Query: 187 VRELAGNVSNVAFISGMIASVPGVAALLSAPRLGKLGDRIGPEKILITALIFSVLLLIPM 246 + + I +A+ + +L A G + R+G + L+ +I I + Sbjct: 237 GEDRFH--WDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILL 294 Query: 247 SYVQTPLQLGILRFLLGAADGALLPAVQTLLVYNSSNQIAGRIFSYNQSFRDIGNVTGPL 306 ++ T + +L A+ G +PA+Q +L + G++ + + ++ GPL Sbjct: 295 AFA-TRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPL 353 Query: 307 MGAAI 311 + AI Sbjct: 354 LFTAI 358 Score = 45.6 bits (108), Expect = 2e-07 Identities = 38/180 (21%), Positives = 73/180 (40%), Gaps = 3/180 (1%) Query: 156 LKNPKLVLSLFVTTLIIQVATGSIAPILTLYVRELAGNVSNVAFISGMIASVPGVAALLS 215 +K + ++ + T + V G I P+L +R+L + ++V G++ ++ + Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHS-NDVTAHYGILLALYALMQFAC 59 Query: 216 APRLGKLGDRIGPEKILITALIFSVLLLIPMSYVQTPLQLGILRFLLGAADGALLPAVQT 275 AP LG L DR G +L+ +L + + M+ L I R ++ GA Sbjct: 60 APVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGR-IVAGITGATGAVAGA 118 Query: 276 LLVYNSSNQIAGRIFSYNQSFRDIGNVTGPLMGAAISANYGFRAVFLVTAGVVLFNAVYS 335 + + R F + + G V GP++G + + A F A + N + Sbjct: 119 YIADITDGDERARHFGFMSACFGFGMVAGPVLG-GLMGGFSPHAPFFAAAALNGLNFLTG 177
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 41.9 bits (98), Expect = 4e-06 Identities = 17/49 (34%), Positives = 29/49 (59%) Query: 353 TLTNGALEASNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILNTLVNLR 401 L+N S V+L +E N+ Q+ Y +NAQ ++T + I + L+N+R Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546 Score = 37.2 bits (86), Expect = 9e-05 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 4/56 (7%) Query: 6 AVSGLNAAATNLDVIGNNIANSATYGFKSGTASFAD----MFAGSKVGLGVKVAGI 57 A+SGLNAA L+ NNI++ G+ T A + AG VG GV V+G+ Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62
>FLGLRINGFLGH#Flagellar L-ring protein signature. Length = 232 Score = 349 bits (897), Expect = e-126 Identities = 232/232 (100%), Positives = 232/232 (100%) Query: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY Sbjct: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60 Query: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120 Query: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF Sbjct: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180 Query: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM Sbjct: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 426 bits (1096), Expect = e-151 Identities = 156/363 (42%), Positives = 212/363 (58%), Gaps = 9/363 (2%) Query: 4 FLSALILLLVTTAAQAERIRDLTSVQGVRQNSLIGYGLVVGLDGTGDQTTQTPFTTQTLN 63 F + L A RI+D+ S+Q R N LIGYGLVVGL GTGD +PFT Q++ Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72 Query: 64 NMLSQLGITVPTGTNMQLKNVAAVMVTESLPPFGRQGQTIDVVVSSMGNAKSLRGGTLLM 123 ML LGIT G + KN+AAVMVT +LPPF G +DV VSS+G+A SLRGG L+M Sbjct: 73 AMLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIM 131 Query: 124 TPLKGVDSQVYALAQGNILVGGAGASAGGSSVQVNQLNGGRITNGAVIERELPSQFGVGN 183 T L G D Q+YA+AQG ++V G A +++ R+ NGA+IERELPS+F Sbjct: 132 TSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSV 191 Query: 184 TLNLQLNDEDFSMAQQIADTINRVR----GYGSATALDARTIQVRVPSGNSSQVRFLADI 239 L LQL + DFS A ++AD +N G A D++ I V+ P + R +A+I Sbjct: 192 NLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV-ADLTRLMAEI 250 Query: 240 QNMQVNVTPQDAKVVINSRTGSVVMNREVTLDSCAVAQGNLSVTVNRQANVSQPDTPFGG 299 +N+ V T AKVVIN RTG++V+ +V + AV+ G L+V V V QP PF Sbjct: 251 ENLTVE-TDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-APFSR 308 Query: 300 GQTVVTPQTQIDLRQSGGSLQSVRSSASLNNVVRALNALGATPMDLMSILQSMQSAGCLR 359 GQT V PQT I Q G + ++ L +V LN++G +++ILQ ++SAG L+ Sbjct: 309 GQTAVQPQTDIMAMQEGSKV-AIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQ 367 Query: 360 AKL 362 A+L Sbjct: 368 AEL 370
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 511 bits (1318), Expect = 0.0 Identities = 313/313 (100%), Positives = 313/313 (100%) Query: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG Sbjct: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60 Query: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET Sbjct: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120 Query: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL Sbjct: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180 Query: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL Sbjct: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240 Query: 241 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK Sbjct: 241 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300 Query: 301 VSKTYSMNIDNLF 313 VSKTYSMNIDNLF Sbjct: 301 VSKTYSMNIDNLF 313
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 681 bits (1758), Expect = 0.0 Identities = 540/546 (98%), Positives = 543/546 (99%) Query: 2 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 61 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60 Query: 62 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 121 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120 Query: 122 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 181 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180 Query: 182 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITVANGYSLVQGSTA 241 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNIT+ANGYSLVQGSTA Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240 Query: 242 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQELDQTRNTLGQL 301 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQ+LDQTRNTLGQL Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300 Query: 302 ALAFAEAFNTQHKAGFDANGDAGEDFFTIGKPAVLQNTKNKGDVAIGATVTDASVVLATD 361 ALAFAEAFNTQHKAGFDANGDAGEDFF IGKPAVLQNTKNKGDVAIGATVTDAS VLATD Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 360 Query: 362 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 421 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV Sbjct: 361 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 420 Query: 422 NMDVLITDEAKIAMASEKDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN 481 NMDVLITDEAKIAMASE+DAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN Sbjct: 421 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN 480 Query: 482 KTATLKTSSTTQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 541 KTATLKTSS TQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD Sbjct: 481 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 540 Query: 542 ALINIR 547 ALINIR Sbjct: 541 ALINIR 546
>FLAGELLIN#Flagellin signature. Length = 507 Score = 45.4 bits (107), Expect = 2e-07 Identities = 41/226 (18%), Positives = 80/226 (35%), Gaps = 9/226 (3%) Query: 7 MMYQQNMRGITNSQAEWMKYGEQMSTGKRVVNPSDDPIAASQAVVLSQAQAQNSQYTLAR 66 ++ Q N+ +S + + E++S+G R+ + DD + A + +Q + Sbjct: 11 LLTQNNLNKSQSSLSSAI---ERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNA 67 Query: 67 TFATQKVSLEESVLSQVTTAIQNAQEKIVYASNGTLSDDDRASLATDIQGLRDQLLNLAN 126 E L+++ +Q +E V A+NGT SD D S+ +IQ +++ ++N Sbjct: 68 NDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSN 127 Query: 127 TTDGNGRYIFAGYKTETAPFSEANGDYVGGTESIKQQVDASRSMVIGHTGDKIFDSITSN 186 T NG + + +G E+I + +G G + + Sbjct: 128 QTQFNGVKVLSQDNQMKIQVGANDG------ETITIDLQKIDVKSLGLDGFNVNGPKEAT 181 Query: 187 AVAEPDGSASETNLFAMLDSAIAALKTPVADSEADKETAAAALDKT 232 + T A + + TA DK Sbjct: 182 VGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKV 227
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 68.2 bits (166), Expect = 2e-13 Identities = 49/261 (18%), Positives = 87/261 (33%), Gaps = 26/261 (9%) Query: 551 VAPAPKAATATPAAPAQPGLLSRFFGALKALFSGGEEAKPTEQP-TPKAEAKPERQQDRR 609 T P + S E A+ E P P A A P Sbjct: 991 TVDTTNITTPNNIQADVPSVPSN----------NEEIARVDEAPVPPPAPATPSET---- 1036 Query: 610 KPRQSNRRDRNERRDTRSERTEGSDNREENRRNRRQAQQQTAETRESRQQAEV------T 663 + N ++++++ D E +NR A++ + + + Q EV T Sbjct: 1037 ----TETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSET 1092 Query: 664 EKARTTDEQQAPRRERSRRRNDDKRQAQQEAKALNVEEQSVQETEQEERVRPVQPRRKQR 723 ++ +TT+ ++ E+ + + + Q+ K + + QE + + + R Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPK-VTSQVSPKQEQSETVQPQAEPARENDP 1151 Query: 724 QLNQKVRYEQSVAEEAVVAPVVEETAAAEPIVQEAPAPRTELVKVPLPVVAQTAPEQQEE 783 +N K Q+ P E ++ E V E+ T V P A Q Sbjct: 1152 TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTV 1211 Query: 784 NNADNRDNGGMPRRSRRSPRH 804 N+ + RRS RS H Sbjct: 1212 NSESSNKPKNRHRRSVRSVPH 1232 Score = 60.8 bits (147), Expect = 2e-11 Identities = 48/288 (16%), Positives = 88/288 (30%), Gaps = 36/288 (12%) Query: 513 PSEEEFAERKRPEQPALATFAMPDVPPAPT-PAEPAATVVAPAPKAATATPAAPAQPGLL 571 P E+ + DVP P+ E A AP P A ATP+ + Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE---- 1038 Query: 572 SRFFGALKALFSGGEEAKPTEQPTPKAEAKPERQQDRRKPRQSNRRDRNERRDTRSER-- 629 A E +K + K E Q+ + + + ++ Sbjct: 1039 ------TVA-----ENSKQESKTVEKNEQDATE-----TTAQNREVAKEAKSNVKANTQT 1082 Query: 630 TEGSDNREENRRNRRQAQQQTAETRESRQQAEVTEKARTTDEQQAPRRERSRRRNDDKRQ 689 E + + E + + ++TA + + TEK + + + + + + Q Sbjct: 1083 NEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQ 1142 Query: 690 AQ---QEAKALNVEEQSVQETEQEERVRPVQPRRKQRQLNQKVRYEQSV--AEEAVVAPV 744 A+ + +N++E Q + +P + + Q V +V V P Sbjct: 1143 AEPARENDPTVNIKEPQSQTNTTADTEQPA--KETSSNVEQPVTESTTVNTGNSVVENPE 1200 Query: 745 VEETAAAEPIVQEAPA------PRTELVKVPLPVVAQTAPEQQEENNA 786 A +P V + R + VP V T A Sbjct: 1201 NTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVA 1248
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.007 Identities = 12/31 (38%), Positives = 18/31 (58%), Gaps = 4/31 (12%) Query: 27 LKPG----KILTLLGPNGAGKSTLVRVVLGL 53 ++PG + L G G GKSTL+ ++GL Sbjct: 589 MEPGCKFDYSVVLEGTGGIGKSTLINTLVGL 619
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 274 bits (702), Expect = 1e-93 Identities = 63/304 (20%), Positives = 114/304 (37%), Gaps = 25/304 (8%) Query: 4 KKTLLFAALSAALWGGATQA---------ADAAVVASLKPVGFIASAIADGVTETEVLLP 54 KK L + A VVA+ + I IA + ++P Sbjct: 2 KKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIVP 61 Query: 55 DGASEHDYSLRPSDVKRLQNADLVVWVGPEMEAFMQKPVSKLPGAKQVTIAQLEDVKPLL 114 G H+Y P DVK+ ADL+ + G +E +KL + T E+ Sbjct: 62 IGQDPHEYEPLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKT----ENKDYFA 117 Query: 115 MKSIHGDDDDHDHAEKSDEDHHHGDFNMHLWLSPEIARATAVAIHGKLVELMPQSRAKLD 174 + EK ED H WL+ E A I +L P ++ + Sbjct: 118 VSDGVDVIYLEGQNEKGKEDPH-------AWLNLENGIIFAKNIAKQLSAKDPNNKEFYE 170 Query: 175 ANLKDFEAQLASTETQVGNELA--PLKGKGYFVFHDAYGYFEKQFGLTPLGHFTVNPEIQ 232 NLK++ +L + + ++ P + K A+ YF K +G+ + +N E + Sbjct: 171 KNLKEYTDKLDKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINTEEE 230 Query: 233 PGAQRLHEIRTQLVEQKATCVFAEPQFRPAVVESVARGTSVRMGT---LDPLGTNIKLGK 289 +++ + +L + K +F E +++V++ T++ + D + K G Sbjct: 231 GTPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKEGD 290 Query: 290 TSYS 293 + YS Sbjct: 291 SYYS 294
>ARGDEIMINASE#Bacterial arginine deiminase signature. Length = 409 Score = 25.9 bits (57), Expect = 0.036 Identities = 10/37 (27%), Positives = 18/37 (48%), Gaps = 5/37 (13%) Query: 43 EVMLTCRPGNALYVINPSTLVQYPLNDI-----AQKE 74 + +L RPG L + P + + +DI A++E Sbjct: 18 KKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQE 54
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 30.0 bits (67), Expect = 6e-04 Identities = 9/37 (24%), Positives = 17/37 (45%), Gaps = 5/37 (13%) Query: 4 LSWIIFGLIAGILAKWIMPG-----KDGGGFFMTILL 35 + I+ G I+G++ W+ K ++ ILL Sbjct: 163 AAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILL 199
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 27.7 bits (61), Expect = 0.022 Identities = 18/61 (29%), Positives = 26/61 (42%) Query: 49 QGLSIGIIILTIGVMAPIASGTLPPSTLIHSFLNWKSLVAIAVGVIVSWLGGRGVTLMGS 108 Q +I L IG + + LPPS ++ N ++ A VS LG +TL G Sbjct: 174 QRSAIVDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTLDGG 233 Query: 109 Q 109 Sbjct: 234 H 234
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 100 bits (249), Expect = 2e-27 Identities = 70/244 (28%), Positives = 114/244 (46%), Gaps = 16/244 (6%) Query: 2 IVLVTGATAGFGECITRRFIQQGHKVIATGRRQERLQELKDELGDNLYIAQ---LDVRNR 58 I +TGA G GE + R QG + A E+L+++ L A+ DVR+ Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69 Query: 59 AAIEEMLASLPAEWCNIDILVNNAGLALGMEPAHKASVEDWETMIDTNNKGLVYMTRAVL 118 AAI+E+ A + E IDILVN AG+ L H S E+WE N+ G+ +R+V Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 119 PGMVERNHGHIINIGSTAGSWPYAGGNVYGATKAFVRQFSLNLRTDLHGTAVRVTDIEPG 178 M++R G I+ +GS P Y ++KA F+ L +L +R + PG Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188 Query: 179 LVGGTEFSNVRFKGDDGKAE------KTYQNTVALT----PEDVSEAV-WWVSTLPAHVN 227 T+ + ++G + +T++ + L P D+++AV + VS H+ Sbjct: 189 ST-ETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247 Query: 228 INTL 231 ++ L Sbjct: 248 MHNL 251
>PF05775#Enterobacteria AfaD invasin protein Length = 142 Score = 29.5 bits (66), Expect = 0.007 Identities = 11/53 (20%), Positives = 17/53 (32%), Gaps = 7/53 (13%) Query: 106 THRLLLVELEGEKWIADVGFGGQTLTAPI-------RLVSDLVQTTPHGEYRL 151 L + + G W + V G Q + I + D Q G+Y Sbjct: 82 PQHNLRIRISGNGWSSFVEKGIQGVFNTIKEDASIFYIEVDGNQQVQPGKYLF 134
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 582 bits (1501), Expect = 0.0 Identities = 312/388 (80%), Positives = 337/388 (86%), Gaps = 16/388 (4%) Query: 1 MKSKVLALLIPALLAAGAAHAAEVYNKDGNKLDLYGKVDGLHYFSDNSAKDGDQSYARLG 60 MK KVLAL+IPALLAAGAAHAAE+YNKDGNKLDLYGKVDGLHYFSD+S+KDGDQ+Y R+G Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60 Query: 61 FKGETQINDQLTGYGQWEYNIQANNTESSKNQSWTRLAFAGLKFADYGSFDYGRNYGVMY 120 FKGETQINDQLTGYGQWEYN+QAN TE SWTRLAFAGLKF DYGSFDYGRNYGV+Y Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120 Query: 121 DIEGWTDMLPEFGGDSYTNADNFMTGRANGVATYRNTDFFGLVNGLNFAVQYQGNNEGAS 180 D+EGWTDMLPEFGGDSYT ADN+MTGRANGVATYRNTDFFGLV+GLNFA+QYQG NE S Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180 Query: 181 N-----GQEGTNNGRDVRHENGDGWGLSTTYDLGMGFSAGAAYTSSDRTNDQVNH--TAA 233 G NNG D+R++NGDG+G+STTYD+GMGFSAGAAYT+SDRTN+QVN T A Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGGTIA 240 Query: 234 GGDKADAWTAGLKYDANNIYLATMYSETRNMTPFGDS----DYAVANKTQNFEVTAQYQF 289 GGDKADAWTAGLKYDANNIYLATMYSETRNMTP+G + D VANKTQNFEVTAQYQF Sbjct: 241 GGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQF 300 Query: 290 DFGLRPAVSFLMSKGRDLHAADGADNPAGVDDKDLVKYADVGATYYFNKNMSTYVDYKIN 349 DFGLRPAVSFLMSKG+DL N DDKDLVKYADVGATYYFNKN STYVDYKIN Sbjct: 301 DFGLRPAVSFLMSKGKDL-----TYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKIN 355 Query: 350 LLDEDDSFYAANGISTDDIVALGLVYQF 377 LLD+DD FY GISTDDIVALG+VYQF Sbjct: 356 LLDDDDPFYKDAGISTDDIVALGMVYQF 383
>adhesinb#Adhesin B signature. Length = 310 Score = 331 bits (849), Expect = e-116 Identities = 90/296 (30%), Positives = 163/296 (55%), Gaps = 7/296 (2%) Query: 9 MLLGCLALTCSIAFQASATEKFKVITTFTIIADMAKNVAGDAAEVSSITKPGAEIHEYQP 68 +G A + + + + K V+ T +IIAD+ KN+AGD + SI G + HEY+P Sbjct: 13 AFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSIVPVGQDPHEYEP 72 Query: 69 TPGDIKRAQGAQLILANGMNLEL----WFQRFYQHLNGVPE---VIVSSGVTPVGITEGP 121 P D+K+ A LI NG+NLE WF + ++ VS GV + + Sbjct: 73 LPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVSEGVDVIYLEGQS 132 Query: 122 YEGKPNPHAWMSPDNALIYVDNIRDALIKYDPANAQTYQRNADTYKAKITQTLAPLRKQI 181 +GK +PHAW++ +N +IY NI L + DPAN +TY++N Y K++ +++ Sbjct: 133 EKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEKLSALDKEAKEKF 192 Query: 182 TELPENQRWMVTSEGAFSYLARDLGLKELYLWPINADQQGTPQQVRKVVDIVKKNHIPAV 241 +P ++ +VTSEG F Y ++ + Y+W IN +++GTP Q++ +V+ ++K +P++ Sbjct: 193 NNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEKLRKTKVPSL 252 Query: 242 FSESTISDKPARQVARETGAHYGGVLYVDSLSTENGPVPTYIDLLKVTTSTLVQGI 297 F ES++ D+P + V+++T ++ DS++ + +Y ++K + +G+ Sbjct: 253 FVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYNLEKIAEGL 308
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 147 bits (372), Expect = 2e-47 Identities = 67/201 (33%), Positives = 103/201 (51%), Gaps = 32/201 (15%) Query: 1 MRK-VCAAILSAAICLAVSGAPAWASEHQSTLSAGYLQTHTDMPGSDDLKGINVKYRYEF 59 M+K C + L+A LA + + A+ ST++ GY Q+ + + G N+KYRYE Sbjct: 1 MKKIACLSALAA--VLAFTAGTSVAA--TSTVTGGYAQSDAQGQMNK-MGGFNLKYRYEE 55 Query: 60 TDT-LGLITSFSYANAEDEQKTHYSDTRWHEDSVRNRWFSVMAGPSVRVNEWFSAYAMAG 118 ++ LG+I SF+Y T S T D +N+++ + AGP+ R+N+W S Y + G Sbjct: 56 DNSPLGVIGSFTY--------TEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVG 107 Query: 119 VAYSRVSTFSGDYLRVTDNKGKTHDVLTGSDGGRHSNTSLAWGAGVQFNPTESVAIDLAY 178 V Y + T T+ HD S+ ++GAG+QFNP E+VA+D +Y Sbjct: 108 VGYGKFQT--------TEYPTYKHD---------TSDYGFSYGAGLQFNPMENVALDFSY 150 Query: 179 EGSGSGDWRTDGFIVGVGYKF 199 E S +I GVGY+F Sbjct: 151 EQSRIRSVDVGTWIAGVGYRF 171
>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature. Length = 1104 Score = 30.1 bits (67), Expect = 0.033 Identities = 21/124 (16%), Positives = 41/124 (33%), Gaps = 20/124 (16%) Query: 161 DQLGVTAVDAHTLKIQLDKPLPWFVNLTANFAFFPVQKANVESGKEWTKPGNLIGNGAYV 220 D G ++ +K+ DKP+ + E + K N++ G Sbjct: 841 DNNGGINTESKKIKVVEDKPVE--------VINESEPNNDFEKANQIAK-SNMLVKGTLS 891 Query: 221 LKDRVVNEKLVVVPNTHYWDNAKTVLQKVTFLPINQESAATKRYLAGDID--ITESFPKN 278 +D + +Y+D AK K+T +N Y GD++ + + + Sbjct: 892 EEDYS---------DKYYFDVAKKGNVKITLNNLNSVGITWTLYKEGDLNNYVLYATGND 942 Query: 279 MYQK 282 Sbjct: 943 GTVL 946
>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature. Length = 171 Score = 31.4 bits (71), Expect = 0.001 Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 7/66 (10%) Query: 37 TKEHLLPHFL-EHLGNNHLDI------GVGTGFYLTHVPESSLISLMDLNEASLNAASTR 89 T EHL F+ HL + ++I G TGFY++ + S + D A++ Sbjct: 54 TLEHLYAGFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKV 113 Query: 90 AGESKI 95 ++KI Sbjct: 114 ENQNKI 119
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 256 bits (654), Expect = 1e-88 Identities = 237/239 (99%), Positives = 237/239 (99%) Query: 1 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQA 60 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMV PADLEPPQA Sbjct: 1 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQA 60 Query: 61 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR 120 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR Sbjct: 61 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR 120 Query: 121 PASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 180 PASPFENTAPAR TSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF Sbjct: 121 PASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 180 Query: 181 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ 239 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ Sbjct: 181 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ 239
>PHPHLIPASEA1#Bacterial phospholipase A1 protein signature. Length = 289 Score = 32.6 bits (74), Expect = 0.001 Identities = 20/100 (20%), Positives = 35/100 (35%), Gaps = 8/100 (8%) Query: 53 FVSPEPQEMPDICKTEALFELEREYYPALKSQRLRLDVAYDAVKNFEETSKPSEYDIAYE 112 +V + PDI K ++L+ Y L L Y N+ +E ++Y Sbjct: 197 YVVGNTDDNPDITKYMGYYQLKIGY--HLGDAVLSAKGQY----NWNTGYGGAELGLSYP 250 Query: 113 IKSNPFIYYEGNFNDGFGTAIEDVPKVLQSIPEGFRLIDV 152 I + G+G ++ D + G L D+ Sbjct: 251 I--TKHVRLYTQVYSGYGESLIDYNFNQTRVGVGVMLNDL 288
>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature. Length = 533 Score = 24.9 bits (54), Expect = 0.032 Identities = 11/21 (52%), Positives = 12/21 (57%) Query: 40 NKYSVSPEKYLKAAIAILYSD 60 NK S SP K L I +YSD Sbjct: 254 NKPSASPVKVLSDKIIQIYSD 274
>adhesinmafb#Neisseria meningitidis: adhesin MafB signature. Length = 467 Score = 31.2 bits (70), Expect = 4e-04 Identities = 16/57 (28%), Positives = 20/57 (35%), Gaps = 2/57 (3%) Query: 41 GPMPAVDSNDPGAAGFTGSTVIAEFESLEAAQAWADADPYVAAGVYEHVSVKPFKKV 97 P+PA G GS E + EA W +P A V +V KV Sbjct: 268 APLPA--EGKFAVIGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.0 bits (70), Expect = 0.008 Identities = 9/16 (56%), Positives = 11/16 (68%) Query: 55 VVGESGCGKSTFARAI 70 + GESG GK ARA+ Sbjct: 165 ITGESGTGKELVARAL 180
>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature. Length = 103 Score = 117 bits (294), Expect = 5e-38 Identities = 103/103 (100%), Positives = 103/103 (100%) Query: 2 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 61 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL Sbjct: 1 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 60 Query: 62 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 104 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV Sbjct: 61 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 756 bits (1953), Expect = 0.0 Identities = 478/555 (86%), Positives = 514/555 (92%), Gaps = 5/555 (0%) Query: 3 ATAAQTKSLEWLNRLRANPKIPLIVAGSAAVAVMVALILWAKAPDYRTLFSNLSDQDGGA 62 +TA Q K LEWLNRLRANP+IPLIVAGSAAVA++VA++LWAK PDYRTLFSNLSDQDGGA Sbjct: 5 STATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGA 64 Query: 63 IVSQLTQMNIPYCFSEASGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 122 IV+QLTQMNIPY F+ SGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ Sbjct: 65 IVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 124 Query: 123 FSEQVNYQRALEGELSRTIETIGPVKGARVHLAMPKPSLFVREQKSPSASVTVNLLPGRA 182 FSEQVNYQRALEGEL+RTIET+GPVK ARVHLAMPKPSLFVREQKSPSASVTV L PGRA Sbjct: 125 FSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRA 184 Query: 183 LDEGQISAIVHLVSSAVAGLPPGNVTLVDQGGHLLTQSNTSGRDLNDAQLKYASDVEGRI 242 LDEGQISA+VHLVSSAVAGLPPGNVTLVDQ GHLLTQSNTSGRDLNDAQLK+A+DVE RI Sbjct: 185 LDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRI 244 Query: 243 QRRIEAILSPIVGNGNIHAQVTAQLDFASKEQTEEQYRPNGDESHAALRSRQLNESEQSG 302 QRRIEAILSPIVGNGN+HAQVTAQLDFA+KEQTEE Y PNGD S A LRSRQLN SEQ G Sbjct: 245 QRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVG 304 Query: 303 SGYPGGVPGALSNQPAPANNAPISTPPANQNNRQQ--QASTTSNS---GPRSTQRNETSN 357 +GYPGGVPGALSNQPAP N API+TPP NQ N Q Q ST++NS GPRSTQRNETSN Sbjct: 305 AGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSN 364 Query: 358 YEVDRTIRHTKMNVGDVQRLSVAVVVNYKTLPDGKPLPLSNEQMKQIEDLTREAMGFSEK 417 YEVDRTIRHTKMNVGD++RLSVAVVVNYKTL DGKPLPL+ +QMKQIEDLTREAMGFS+K Sbjct: 365 YEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDK 424 Query: 418 RGDSLNVVNSPFNSSDESGGELPFWQQQAFIDQLLAAGRWLLVLLVAWLLWRKAVRPQLT 477 RGD+LNVVNSPF++ D +GGELPFWQQQ+FIDQLLAAGRWLLVL+VAW+LWRKAVRPQLT Sbjct: 425 RGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLT 484 Query: 478 RRAEAMKAVQQQAQAREEVEDAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR 537 RR E KA Q+QAQ R+E E+AVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR Sbjct: 485 RRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR 544 Query: 538 VVALVIRQWINNDHE 552 VVALVIRQW++NDHE Sbjct: 545 VVALVIRQWMSNDHE 559
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 341 bits (875), Expect = e-119 Identities = 117/329 (35%), Positives = 196/329 (59%), Gaps = 2/329 (0%) Query: 1 MSNLTGTDKSVILLMTIGEDRAAEVFKHLSQREVQTLSAAMANVTQISNKQLTYVLAEFE 60 +S LTG K+ ILL++IG + +++VFK+LSQ E+++L+ +A + I+++ VL EF+ Sbjct: 12 VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK 71 Query: 61 QEAEQFAALNINANDYLRSVLVKALGEERAASLLEDILETRDTASGIETLNFMEPQSAAD 120 + + DY R +L K+LG ++A ++ + L + + E + +P + + Sbjct: 72 ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINN-LGSALQSRPFEFVRRADPANILN 130 Query: 121 LIRDEHPQIIATILVHLKRAQAADILALFDERLRHDVMLRIATFGGVQPAALAELTEVLN 180 I+ EHPQ IA IL +L +A+ IL+ ++ +V RIA P + E+ VL Sbjct: 131 FIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLE 190 Query: 181 GLLDGQ-NLKRSKMGGVRTAAEIINLMKTQQEEAVITAVREFDGELAQKIIDEMFLFENL 239 L + + GGV EIIN+ + E+ +I ++ E D ELA++I +MF+FE++ Sbjct: 191 KKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDI 250 Query: 240 VDVDDRSIQRLLQEVDSESLLIALKGAEQPLREKFLRNMSQRAADILRDDLANRGPVRLS 299 V +DDRSIQR+L+E+D + L ALK + P++EK +NMS+RAA +L++D+ GP R Sbjct: 251 VLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRK 310 Query: 300 QVENEQKAILLIVRRLAETGEMVIGSGED 328 VE Q+ I+ ++R+L E GE+VI G + Sbjct: 311 DVEESQQKIVSLIRKLEEQGEIVISRGGE 339
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 373 bits (959), Expect = e-135 Identities = 226/228 (99%), Positives = 228/228 (100%) Query: 1 MSDNLPWKTWTPDDLAPPQAEFVPMVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60 MSDNLPWKTWTPDDLAPPQAEFVP+VEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI Sbjct: 1 MSDNLPWKTWTPDDLAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60 Query: 61 AEGRQQGHEQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120 AEGRQQGH+QGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL Sbjct: 61 AEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120 Query: 121 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT Sbjct: 121 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180 Query: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV Sbjct: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228
>FLGFLIJ#Flagellar FliJ protein signature. Length = 147 Score = 202 bits (515), Expect = 2e-70 Identities = 146/147 (99%), Positives = 147/147 (100%) Query: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60 Query: 61 MTSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120 +TSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120 Query: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147 AALLAENRLDQKKMDEFAQRAAMRKPE Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147
>FLGHOOKFLIK#Flagellar hook-length control protein signature. Length = 375 Score = 461 bits (1186), Expect = e-165 Identities = 361/375 (96%), Positives = 365/375 (97%) Query: 1 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALASETTTDKAAPQVLVATDKPTTK 60 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALA ETTTDKAAPQ+LVATDKPTTK Sbjct: 1 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK 60 Query: 61 GELLISDIVSDAQQADLLIPVDETLPVINVEQSTSTPLTTAHTMTLAAVADKNTTKDEKA 120 GE LISDIVSDAQQA+LLIPVDET PVIN EQSTSTPLTTA TM LAAVADKNTTKDEKA Sbjct: 61 GEPLISDIVSDAQQANLLIPVDETPPVINDEQSTSTPLTTAQTMALAAVADKNTTKDEKA 120 Query: 121 DDLNEDLTASLSALFAMLPGFDNTPKVTDAPSTVLPAEKPTLFTKLTSAQLTTAQPDDAP 180 DDLNED+TASLSALFAMLPGFDNTPKVTDAPSTVLP EKPTLFTKLTS QLTTAQPDDAP Sbjct: 121 DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAP 180 Query: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLLTVAAPVLSAPLGSHEW 240 GTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPL TVAAPVLSAPLGSHEW Sbjct: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEW 240 Query: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAA 300 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAA Sbjct: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAA 300 Query: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTVNHEPLAGEEDDTLPVPVS 360 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRT NHEPLAGE+DDTLPVPVS Sbjct: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS 360 Query: 361 LQGRVTGNSGVDIFA 375 LQGRVTGNSGVDIFA Sbjct: 361 LQGRVTGNSGVDIFA 375
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 380 bits (977), Expect = e-134 Identities = 85/324 (26%), Positives = 148/324 (45%), Gaps = 10/324 (3%) Query: 5 ILSQAEIDALLNGDS--EVKDEPTASVSGESDIRPYDPNTQRRVVRERLQALEIINERFA 62 +LSQ EID LL S + E +S I YD + +E+++ L +++E FA Sbjct: 4 VLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETFA 63 Query: 63 RHFRMGLFNLLRRSPDITVGAIRIQPYHEFARNLPVPTNLNLIHLKPLRGTGLVVFSPSL 122 R L LR + V ++ Y EF R++P P+ L +I + PL+G ++ PS+ Sbjct: 64 RLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPSI 123 Query: 123 VFIAVDNLFGGNGRFPTKVEGREFTHTEQRVINRMLKLALEGYSDAWKAINPLEVEYVRS 182 F +D LFGG G+ KV+ R+ T E V+ ++ L ++W + L + Sbjct: 124 TFSIIDRLFGGTGQ-AAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQI 181 Query: 183 EMQVKFTNITTSPNDIVVNTPFHVEIGNLTGEFNICLPFSMIEPLRELLVNPPLENS--R 240 E +F I P+++VV ++G G N C+P+ IEP+ L + +S R Sbjct: 182 ETNPQFAQI-VPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240 Query: 241 NEDQNWRDNLVRQVQHSQLELVANFADISLRLSQILKLKPGDVLPIEKP---DGIIAHVD 297 + + L ++ +++VA + L + IL L+ GD++ + D + + Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300 Query: 298 GVPVLTSQYGTLNGQYALRIEHLI 321 Q G + + A +I I Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 210 bits (537), Expect = 5e-74 Identities = 124/137 (90%), Positives = 134/137 (97%) Query: 1 MSDMNNPADDNNGAMDDLWAEALSEQKSTSSKSAADAVFQQFGGGDVSGTLQDIDLIMDI 60 MSDMNNP+D+N GA+DDLWA+AL+EQK+T++KSAADAVFQQ GGGDVSG +QDIDLIMDI Sbjct: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDI 60 Query: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLISQGEVVVVADKYGV 120 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLI+QGEVVVVADKYGV Sbjct: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120 Query: 121 RITDIITPSERMRRLSR 137 RITDIITPSERMRRLSR Sbjct: 121 RITDIITPSERMRRLSR 137
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 331 bits (851), Expect = e-118 Identities = 243/245 (99%), Positives = 244/245 (99%) Query: 1 MRRLLSVAPVLLWLVTPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60 MRRLLSVAPVLLWL+TPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60 Query: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSSVIDKIYVDAYQPFSEEK 120 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMS VIDKIYVDAYQPFSEEK Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120 Query: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK Sbjct: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180 Query: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA Sbjct: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240 Query: 241 QSFYS 245 QSFYS Sbjct: 241 QSFYS 245
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 67.1 bits (164), Expect = 1e-18 Identities = 22/78 (28%), Positives = 42/78 (53%) Query: 4 ESVMMMGTEAMKVALALAAPLLLVALVTGLIISILQAATQINEMTLSFIPKIIAVFIAII 63 + ++ G +A+ + L L+ +VA + GL++ + Q TQ+ E TL F K++ V + + Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61 Query: 64 IAGPWMLNLLLDYVRTLF 81 + W +LL Y R + Sbjct: 62 LLSGWYGEVLLSYGRQVI 79
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 201 bits (514), Expect = 1e-66 Identities = 256/261 (98%), Positives = 260/261 (99%) Query: 1 MMQVTSDQWLSWLSLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFVIAPSLPA 60 M+QVTS+QWLSWL+LYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITF IAPSLPA Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60 Query: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL Sbjct: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120 Query: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180 Query: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC Sbjct: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240 Query: 241 EHLFSEMFNLLADIISELPLI 261 EHLFSE+FNLLADIISELPLI Sbjct: 241 EHLFSEIFNLLADIISELPLI 261
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.043 Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 15/62 (24%) Query: 320 AKYILTPVLWKYLYRYAKKHQARGNGFGYGMVYPNNPQSVTRTLSARYYKDGAEILIDRG 379 A+Y + PVLW Y+ R+ K + G+ VY +R +DG+E RG Sbjct: 166 ARYQVGPVLWGYVVRFIK---SDGDKLTLPYVY------------SRSQRDGSEAWKWRG 210 Query: 380 WD 381 WD Sbjct: 211 WD 212
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 36.0 bits (83), Expect = 7e-05 Identities = 23/93 (24%), Positives = 37/93 (39%), Gaps = 9/93 (9%) Query: 23 AAQKLAADDDVDMLVILTACYFHDIVSLAKNHPQRQRSSILAAEETRRLLREEFEQFPA- 81 A +KLA + + D+ +ILT + +L + Q + EE R+ E F A Sbjct: 218 AGEKLAEEVNADIFMILTDV---NGAALYYGTEKEQWLREVKVEELRKYYEE--GHFKAG 272 Query: 82 ---EKIEAVCHAIAAHSFSAQIAPLTTEAKIVQ 111 K+ A I A IA L + ++ Sbjct: 273 SMGPKVLAAIRFIEWGGERAIIAHLEKAVEALE 305
>PF06580#Sensor histidine kinase Length = 349 Score = 32.5 bits (74), Expect = 0.003 Identities = 38/181 (20%), Positives = 63/181 (34%), Gaps = 37/181 (20%) Query: 290 ENILFLARADKNNVLVKLDSLS----------------LNKEVENLLDYL--EYLSDEKE 331 NI L D L SLS L E+ + YL + E Sbjct: 180 NNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDR 239 Query: 332 ICFKVECNQQIFADKI---LLQRMLSNLIVNAIRYSPEKSRIHITSFLDTNGYLNIDVAS 388 + F+ + N I ++ L+Q ++ N I + I P+ +I + D NG + ++V + Sbjct: 240 LQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKD-NGTVTLEVEN 298 Query: 389 PGTKIHEPEKLFRRFWRGDNSRHSVGQGLGLSLVKA-IAELHGGSATYHYLNKHNVFRIM 447 G+ + K G GL V+ + L+G A K M Sbjct: 299 TGSLALKNTKE--------------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM 344 Query: 448 L 448 + Sbjct: 345 V 345
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 82.6 bits (204), Expect = 2e-20 Identities = 30/117 (25%), Positives = 60/117 (51%), Gaps = 1/117 (0%) Query: 2 KILLIEDNQRTQEWVTQGLSEAGYVIDAVSDGRDGLYLALKDDYALIILDIMLPGMDGWQ 61 IL+ +D+ + + Q LS AGY + S+ D L++ D+++P + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 ILQTLRTA-KQTPVICLTARDSVDDRVRGLDSGANDYLVKPFSFSELLARVRAQLRQ 117 +L ++ A PV+ ++A+++ ++ + GA DYL KPF +EL+ + L + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 46.0 bits (109), Expect = 2e-07 Identities = 43/366 (11%), Positives = 102/366 (27%), Gaps = 87/366 (23%) Query: 5 YKSRWVIVVVIAAIAAFWFWQGRNDSQSAAPG-----ATKQAQQSPAGGRRG---MRSG- 55 + ++ IA G+ + + A G + + ++ G Sbjct: 57 PRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGE 116 Query: 56 ------PLA---PVQAATAVEQAVPRYLTGLGTIIAANTVTVRSRVDG--QLMALHFQEG 104 L + A + L ++ ++ +L Sbjct: 117 SVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYF 176 Query: 105 QQVKAGDLLAEI------------DPSQFKVALAQTQGQLA-------KDKATLANARRD 145 Q V ++L Q ++ L + + + + + + Sbjct: 177 QNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSR 236 Query: 146 LARYQQLAKTNLVSRQELDAQQALVSETEGTIKADEASVA-------------------- 185 L + L +++ + Q+ E ++ ++ + Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296 Query: 186 -----------------------SAQLQLDWSRITAPVDGRV-GLKQVDVGNQISSGDTT 221 + + S I APV +V LK G +++ +T Sbjct: 297 KNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL 356 Query: 222 GIVVITQTHPIDLVFTLPESDIATVVQAQKAGKTLVVEAWDRTNSKKL-SEGTLLSLDNQ 280 +V++ + +++ + DI + Q A + VEA+ T L + ++LD Sbjct: 357 -MVIVPEDDTLEVTALVQNKDIGFINVGQNA--IIKVEAFPYTRYGYLVGKVKNINLDAI 413 Query: 281 IDATTG 286 D G Sbjct: 414 EDQRLG 419
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 916 bits (2368), Expect = 0.0 Identities = 297/1036 (28%), Positives = 511/1036 (49%), Gaps = 29/1036 (2%) Query: 13 SRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAV 72 + FI RP+ +L + +++AG + LPV+ P + P + V YPGA + V Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 73 TAPLERQFGQMSGLKQMSSQS-SGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131 T +E+ + L MSS S S G+ ITL FQ D+A+ +VQ + AT LLP + Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 132 LPNPPVYSKVNPADPPIMTLAVTSTAMPMTQVE--DMVETRVAQKISQISGVGLVTLSGG 189 + + S + +M S TQ + D V + V +S+++GVG V L G Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 190 QRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGP------SRAVTLSANDQ 243 Q A+R+ L+A + LT V + N A G L G ++ A + Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 244 MQSAEEYRQLII-AYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANI 302 ++ EE+ ++ + +G+ +RL DVA VE G EN + A N + A + ++ GAN Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299 Query: 303 ISTADSIRQMLPQLTESLPKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFL 362 + TA +I+ L +L P+ +KV D T ++ S+ + L AI LV +++YLFL Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359 Query: 363 RNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMALTIATGFVVDDAIVVIENISRY 422 +N+ AT+IP +AVP+ L+GTFA++ +SIN LT+ + +A G +VDDAIVV+EN+ R Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 423 I-EKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAIL 481 + E P A K +I ++ + L AV IP+ F G G ++R+F+IT+ A+ Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 482 ISAVVSLTLTPMMCARML---SQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWL 538 +S +V+L LTP +CA +L S E + F FD + Y + K+L Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 539 TLSVALSTLLLSVLLWVFIPKGFFPVQDNGIIQGTLQAPQSSSFTNMAQRQRQVADVILQ 598 L + + V+L++ +P F P +D G+ +Q P ++ + QV D L+ Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 599 DPA--VQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDR---VQKVIARLQTAVDKVPG 653 + V+S+ + G + + N+ ++LKP +ER+ + VI R + + K+ Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR- 658 Query: 654 VDLFLQPTQDLTIDTQVSRTQYQFTLQ---ATSLDALSTWVPQLMEKLQQLP-QLSDVSS 709 D F+ P I + T + F L DAL+ QL+ Q P L V Sbjct: 659 -DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717 Query: 710 DWQDQGLVAYVNVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTE 769 + + + VD++ A LG+S++D++ + A G ++ + ++ ++ + + Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777 Query: 770 ITPGLAALDTIRLTSSDGGVVPLSSIAKVEQRFAPLSINHLDQFPVTTISFNVPDNYSLG 829 +D + + S++G +VP S+ + + + P I S G Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837 Query: 830 DAVQEIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIVLGILYESFI 889 DA+ + + LP I + G + + + L+ + V +++ L LYES+ Sbjct: 838 DAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895 Query: 890 HPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQ 949 P++++ +P VG LLA + + DV ++G++ IG+ KNAI++++FA ++ Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955 Query: 950 GMSPRDAIYQACLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQV 1009 G +A A +R RPILMT+LA +LG LPL +S G G+ + +GIG++GG++ + + Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015 Query: 1010 LTLFTTPVIYLLFDRL 1025 L +F PV +++ R Sbjct: 1016 LAIFFVPVFFVVIRRC 1031
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 923 bits (2388), Expect = 0.0 Identities = 288/1035 (27%), Positives = 508/1035 (49%), Gaps = 36/1035 (3%) Query: 6 LFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVAT 65 FI RP+ +L++ + + G L LPVA P + P + VSA+ PGA +T+ +V Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63 Query: 66 PLERSLGRIAGVSEMTSSS-SLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMP 124 +E+++ I + M+S+S S GS I L F D + A VQ + A LLP + Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123 Query: 125 SRPTYRKANPSDAPIMILTLTSDT--YSQGELYDFASTQLAPTISQIDGVGDVDVGGSSL 182 + S + +M+ SD +Q ++ D+ ++ + T+S+++GVGDV + G+ Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182 Query: 183 PAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQG------ALEDGTHRWQIQTNDELK 236 A+R+ L+ L ++ DV + N + G AL I K Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 237 TAAEYQPLIIHYN-NGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295 E+ + + N +G VRL DVA V ++ N KPA L I+ AN + Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301 Query: 296 TVDSIRAKLPELQETIPAAIDLQIAQDRSPTIRASLEEVEQTLIISVALVILVVFLFLRS 355 T +I+AKL ELQ P + + D +P ++ S+ EV +TL ++ LV LV++LFL++ Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361 Query: 356 GRATIIPAVSVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENIARHL- 414 RAT+IP ++VPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R + Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421 Query: 415 EAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGIS 474 E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481 Query: 475 LLVSLTLTPMMCGWMLKASKPREQKRLRGFG----RMLVALQQGYGKSLKWVLNHTRLVG 530 +LV+L LTP +C +LK + GF Y S+ +L T Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541 Query: 531 VVLLGTIALNIWLYISIPKTFFPEQDTGVLMGGIQADQSISFQ----AMRGKLQDFMKII 586 ++ +A + L++ +P +F PE+D GV + IQ + + + ++K Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601 Query: 587 RD-DPAVDNVTGFT-GGSRVNSGMMFITLKPRDERS---ETAQQIIDRLRVKLAKEPGAN 641 + +V V GF+ G N+GM F++LKP +ER+ +A+ +I R +++L K Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661 Query: 642 LFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATL-----PELADVNSD 696 + + I G + ++ L D + + R +L + L V + Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718 Query: 697 QQDNGAEMNLVYDRDTMARLGIDVQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRY 756 ++ A+ L D++ LG+ + N ++ A G ++ K+ ++ D ++ Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778 Query: 757 TQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSAASTISFNLPTGKSLSD 816 ++K++V + G+ +P S F + + I G S D Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838 Query: 817 ASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVH 876 A A ++ ++L P+ + + G + + + N L+ + V++ L LYES+ Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896 Query: 877 PLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGN 936 P++++ +P VG LLA LFN + ++G++ IG+ KNAI++V+FA + Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956 Query: 937 LTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRHPLGITIVGGLVMSQLL 996 EA A +R RPI+MT+LA + G LPL +S G GS ++ +GI ++GG+V + LL Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016 Query: 997 TLYTTPVVYLFFDRL 1011 ++ PV ++ R Sbjct: 1017 AIFFVPVFFVVIRRC 1031 Score = 79.5 bits (196), Expect = 4e-17 Identities = 77/446 (17%), Positives = 161/446 (36%), Gaps = 26/446 (5%) Query: 592 VDNVTGFTGGS-RVNSGMMFITLKPRDERSETAQQIIDRLRVKLAKEPGANLFLMAVQDI 650 +DN+ + S S + +T + + Q+ ++L++ P + Q I Sbjct: 72 IDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE----VQQQGI 127 Query: 651 RVGGRQSNASYQYTLLSDDLAALREW-----EPKIRKKLATLPELADVNSDQQDNGAE-- 703 V S+ +SD+ ++ ++ L+ L + DV GA+ Sbjct: 128 SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL----FGAQYA 183 Query: 704 MNLVYDRDTMARLGID----VQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQD 759 M + D D + + + + + + T P Q + R+ Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNP 243 Query: 760 ISALEKMFVINNEGKAIPLSYFAK--WQPANAPLSVNHQGLSAASTISFNLPTGKSLSDA 817 + +N++G + L A+ N + G AA +L D Sbjct: 244 EEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL-DT 302 Query: 818 SAAIDRAMTQL--GVPSTVRGSFA-GTAQVFQETMNSQVILIIAAIATVYIVLGILYESY 874 + AI + +L P ++ + T Q +++ V + AI V++V+ + ++ Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362 Query: 875 VHPLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRH 934 L +P +G L F + + + G++L IG++ +AI++V+ Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422 Query: 935 GNLTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRHPLGITIVGGLVMSQ 994 L P+EA ++ ++ + +P+ GG + ITIV + +S Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482 Query: 995 LLTLYTTPVVYLFFDRLRLRFSRKPK 1020 L+ L TP + + + K Sbjct: 483 LVALILTPALCATLLKPVSAEHHENK 508
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 124 bits (314), Expect = 2e-33 Identities = 96/429 (22%), Positives = 188/429 (43%), Gaps = 23/429 (5%) Query: 20 FMQSLDTTIVNTALPSMAQSLGESPLHMHMVIVSYVLTVAVMLPASGWLADKVGVRNIFF 79 F L+ ++N +LP +A + P + V +++LT ++ G L+D++G++ + Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 80 TAIVLFTLGSLFCALSGTLNELL-LARALQGVGGAMMVPVGRLTVMKIVPREQYMAAMTF 138 I++ GS+ + + LL +AR +QG G A + + V + +P+E A Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 139 VTLPGQVGPLLGPALGGLLVEYASWHWIFLINIPVGIIGAIATLL-LMPNYTMQTRRFDL 197 + +G +GPA+GG++ Y HW +L+ IP+ I + L+ L+ FD+ Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201 Query: 198 SGFLLLAVGMAVLTLALDGSKGTGLSPLAIAGLVAVGVVALVLYLLHARNNNRALFSLKL 257 G +L++VG+ L + + V V++ ++++ H R L Sbjct: 202 KGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252 Query: 258 FRTRTFSLGLAGSFAGRIGSGMLPFMTPVFLQIGLGFSPFHAG-LMMIPMVLGSMGMKRI 316 + F +G+ M P ++ S G +++ P + + I Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312 Query: 317 VVQVVNRFGYRRVLVATTLGLSLITLLFMTTALL----GWYYVLPFVLFLQGMVNSTRFS 372 +V+R G VL +G++ +++ F+T + L W+ + V L G+ S + Sbjct: 313 GGILVDRRGPLYVL---NIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL--SFTKT 367 Query: 373 SMNTLTLKDLPDNLASSGNSLLSMIMQLSMSIGVTIAGLLLGLFGSQHVSVDSGTTQTVF 432 ++T+ L A +G SLL+ LS G+ I G LL + + Q+ + Sbjct: 368 VISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTY 427 Query: 433 MYTWLSMAF 441 +Y+ L + F Sbjct: 428 LYSNLLLLF 436
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 31.0 bits (70), Expect = 0.009 Identities = 27/95 (28%), Positives = 35/95 (36%), Gaps = 20/95 (21%) Query: 164 RQTSWLIVALATLLAALATFLLA------RGLLAPVKRLVDGTHKLAAGDFTTRVTPTSE 217 RQ + L+ A L AL L+A V+ V H LA + P S Sbjct: 75 RQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD---AMKCFPGSF 131 Query: 218 DEL-----------GKLAQDFNQLASTLEKNQQMR 241 + L G L N+LA E+ QQMR Sbjct: 132 ERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMR 166
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 75.6 bits (186), Expect = 6e-18 Identities = 28/136 (20%), Positives = 65/136 (47%), Gaps = 1/136 (0%) Query: 11 PRILIVEDEPKLGQLLIDYLRAASYAPTLISHGDQVLPYVRQTPPDLILLDLMLPGTDGL 70 IL+ +D+ + +L L A Y + S+ + ++ DL++ D+++P + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 71 TLCREIR-RFSDIPIVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTILRRCK 129 L I+ D+P+++++A+ + + E GA DY+ KP+ E++ + L K Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 130 PQRELQQQDAESPLII 145 + + D++ + + Sbjct: 124 RRPSKLEDDSQDGMPL 139
>LIPOLPP20#LPP20 lipoprotein precursor signature. Length = 175 Score = 26.6 bits (58), Expect = 0.026 Identities = 13/38 (34%), Positives = 24/38 (63%), Gaps = 1/38 (2%) Query: 18 EGEMKKIAAISLISIFLISGCAVHNDETSIGKFGLAYK 55 + ++KKI +S+++ +I GC+ H ++ I K AYK Sbjct: 2 KNQVKKILGMSVVAAMVIVGCS-HAPKSGISKSNKAYK 38
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 33.9 bits (77), Expect = 6e-04 Identities = 22/92 (23%), Positives = 36/92 (39%), Gaps = 2/92 (2%) Query: 156 AQGCENKNVIIIGAGT-IGLLAIQCAVALGAKSVTAIDISSEKLALAKSFGAMQTFNSSE 214 A+G E K I GA IG + + GA + A+D + EKL S + ++ Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAH-IAAVDYNPEKLEKVVSSLKAEARHAEA 61 Query: 215 MSAPQMQSVLRELRFNQLILETAGVPQTVELA 246 A S + ++ E + V +A Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVA 93
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 36.0 bits (83), Expect = 2e-04 Identities = 54/286 (18%), Positives = 94/286 (32%), Gaps = 17/286 (5%) Query: 29 LSKSGFSAGEIGWSYACTAIAAILSPILVGSITDRFFSAQKVLAVLMFAGALLMYFAAQQ 88 L S G A A+ ++G+++DRF ++ + ++ AGA + Y Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAI--- 89 Query: 89 TTFAGFFPLLLAYSLTYMPTIALTNSIAFANVPDVERDFPRIRVMGTIG-WIASGLACGF 147 A F +L + T A T ++A A + D+ R R G + G+ G Sbjct: 90 MATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAG- 147 Query: 148 LPQILGY-ADISPTNIPLLITAGSSALLGVFAFFLPDTPPKSTGKMDIKVMLGLDALILL 206 P + G SP + P A + L + FL K + + L A Sbjct: 148 -PVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRW 205 Query: 207 RDKN------FLVFFFCSFLFAMPLAFYYIFANGYLTEVGMKNATGWMTLGQFSEIFFML 260 VFF + +P A + IF G + + Sbjct: 206 ARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAM 265 Query: 261 ALPFFTKRFGIKKVLLLGLVTAAIRYGFFIYGSTDEYFTYALLFLG 306 R G ++ L+LG++ Y + + ++ L Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLA 311 Score = 34.0 bits (78), Expect = 0.001 Identities = 32/153 (20%), Positives = 53/153 (34%), Gaps = 20/153 (13%) Query: 253 FSEIFFMLALPFFTKRFGIKKVLLLGLVTAAIRYGFFIYGSTDEYFTYALLFLGILLHGV 312 + L + RFG + VLL+ L AA+ Y +L++G ++ G+ Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAP-----FLWVLYIGRIVAGI 108 Query: 313 SYDFYYVTAYIYVDKKAPVHMRTAAQGLITLCCQGFGSLLGYRLGGVMMEKMFAYQEPVN 372 + V D R G ++ C GFG + G LGG+M F+ P Sbjct: 109 TGATGAVAGAYIAD-ITDGDERARHFGFMS-ACFGFGMVAGPVLGGLMGG--FSPHAP-- 162 Query: 373 GLTFNWSGMWTFGAVMIAIIAVLFMIFFRESDN 405 + A + + + ES Sbjct: 163 ---------FFAAAALNGLNFLTGCFLLPESHK 186
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 28.9 bits (64), Expect = 0.036 Identities = 18/79 (22%), Positives = 34/79 (43%), Gaps = 8/79 (10%) Query: 93 NITLSNNQ---TSFTSGYSVTVTPAASNAKVNVSAGGGGSVMINGVATLSSA-----SSS 144 NI LS N+ T T + T++ S ++ + S G + + + + S+S Sbjct: 297 NIILSKNEDQSTQNTDSQTRTISKNTSTSRTHTSEVHGNAEVHASFFDIGGSVSAGFSNS 356 Query: 145 TRGSAAVQFLLCLLGGKSW 163 + A+ L L G ++W Sbjct: 357 NSSTVAIDHSLSLAGERTW 375
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 44.0 bits (104), Expect = 3e-07 Identities = 43/195 (22%), Positives = 77/195 (39%), Gaps = 18/195 (9%) Query: 4 MPKFRVSLFSLALMLAVPLAPQAVAKTAAATTASQPEIASGSAMI-VDLNTNKVIYSNHP 62 M R+ + SL + +PLA A + S+ +++ MI +DL + + + + Sbjct: 1 MRYIRLCIISL--LATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRA 58 Query: 63 DLVRPIASISKLMTAMVVLDARLPLDEKLKVDISQTPEMKGVYSRV---RLNSEISRKDM 119 D P+ S K++ VL DE+L+ I + YS V L ++ ++ Sbjct: 59 DERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGEL 118 Query: 120 LLLALMSSENRAAASLAHHYPGGYKAFIKAMNAKAKSLGMNNTRFV--EPTGLS-----V 172 A+ S+N +AA+L GG + A + +G N TR E Sbjct: 119 CAAAITMSDN-SAANLLLATVGG----PAGLTAFLRQIGDNVTRLDRWETELNEALPGDA 173 Query: 173 HNVSTARDLTKLLIA 187 + +T + L Sbjct: 174 RDTTTPASMAATLRK 188
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 28.6 bits (64), Expect = 0.018 Identities = 5/33 (15%), Positives = 16/33 (48%), Gaps = 2/33 (6%) Query: 164 WLHNLDQHLKHW-VWLILVVVL-VVGVRWWLKR 194 L + ++ + W++L ++ + R L++ Sbjct: 215 VLMGMSDAVRTFGPWMLLALLAGFMAFRVMLRQ 247
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 112 bits (282), Expect = 3e-32 Identities = 70/253 (27%), Positives = 115/253 (45%), Gaps = 12/253 (4%) Query: 3 QVAIITASDSGIGKECALLLAQQGFDIGITWHSDEEGAKDTAREVVSHGVRAEIVQLELG 62 ++A IT + GIG+ A LA QG I ++ E+ K + AE ++ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKA-EARHAEAFPADVR 67 Query: 63 KLPEGAQALEKLIQRLGRIDVLVNNAGAMTKAPFLDMAFDEWRKIFTVDVDGAFLCSQIA 122 + ++ + +G ID+LVN AG + ++ +EW F+V+ G F S+ Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 123 ARQMVKQGQGGRIINITSVHEHTPLPDASAYTAAKHALGGLTKTMALELVRHKILVNAVA 182 ++ M+ + + G I+ + S P +AY ++K A TK + LEL + I N V+ Sbjct: 128 SKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 183 PGAIATPM-------NGMDDSDVKPDAEP---SIPLRRFGATHEIASLVAWLCSEGANYT 232 PG+ T M + +K E IPL++ +IA V +L S A + Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246 Query: 233 TGQSLIVDGGFML 245 T +L VDGG L Sbjct: 247 TMHNLCVDGGATL 259
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 29.0 bits (65), Expect = 0.024 Identities = 32/127 (25%), Positives = 53/127 (41%), Gaps = 5/127 (3%) Query: 122 GAKAMREAVPAHLPVSVKVRLGWDSGEK-KFEIADAVQQAGATELVVHGRTKEQGY-RAE 179 G EA+ ++ + +G + E+ K EI A E+ V GR +G R Sbjct: 190 GGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGF 249 Query: 180 HIDWQAIGE-IRQRLNIPVIANGEIWDWQSAQQCIAISGCDAVMIGRGALNIPNLSRVVK 238 ++ I E +++ L V A + + IS V+ G GAL + NL R++ Sbjct: 250 TLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL-LRNLDRLL- 307 Query: 239 YNEPRMP 245 E +P Sbjct: 308 MEETGIP 314
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.4 bits (68), Expect = 0.027 Identities = 19/70 (27%), Positives = 28/70 (40%), Gaps = 6/70 (8%) Query: 503 LHVSTPASEYSQGQ-DLF---NPQRRHYWVTAADNDTLAITTPKKTLVLNNNGKYRTYNL 558 L V+ E + + LF QR H V+ +T+ + K L N NG+Y YN Sbjct: 926 LQVADKTGEPNHNELTLFDASKAQRDHLNVSLV-GNTVDLGAWKYKLR-NVNGRYDLYNP 983 Query: 559 RGERVKDEKP 568 E+ Sbjct: 984 EVEKRNQTVD 993
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 64.1 bits (156), Expect = 3e-14 Identities = 22/113 (19%), Positives = 47/113 (41%), Gaps = 2/113 (1%) Query: 9 VMIVDDHPLMRRGVRQLLELDPGFEVVAEAGDGASAIDLANRLDIDVILLDLNMKGMSGL 68 +++ DD +R + Q L G++V + A+ D D+++ D+ M + Sbjct: 6 ILVADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 69 DTLNALRRDGVTAQIIILTVSDASSDVFALIDAGADGYLLKDSDPEVLLEAIR 121 D L +++ +++++ + + GA YL K D L+ I Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 45.8 bits (108), Expect = 8e-07 Identities = 55/319 (17%), Positives = 114/319 (35%) Query: 154 ARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTLETNA 213 + S S AA A + S+A T+A +A+++ AAAE+ A A + L+ Sbjct: 39 GKGGSKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAAEAQAKAKANRDALTQRLKDIV 98 Query: 214 AASQQSAATSASTATTKASEAATSARDASASKEAAKSSETNASSSASSAASSATAAANSA 273 + + A+ +AT A + + AK+ E + ++ + A Sbjct: 99 NEALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRK 158 Query: 274 KAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAA 333 + + R + A + AA S+ A A + SA Q+ ++ Sbjct: 159 EIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIAQKKLSAAQSEVVKMDGEIKTLNSR 218 Query: 334 SSASTATTKAGEATEQATAAARSASAAKTSETNAKASETRAESSKTAAASSASSAASSAS 393 S+S A T + ++AK E + + ++ A Sbjct: 219 LSSSIHARDAEMKTLAGKRNELAQASAKYKELDELVKKLSPRANDPLQNRPFFEATRRRV 278 Query: 394 SASASKDEATRQASAAKGSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAED 453 A ++E +Q +A++ + T+ + + + +++ + AE K+A++ Sbjct: 279 GAGKIREEKQKQVTASETRINRINADITQIQKAISQVSNNRNAGIARVHEAEENLKKAQN 338 Query: 454 IASAVALEDASTTKKGIVQ 472 ++DA Q Sbjct: 339 NLLNSQIKDAVDATVSFYQ 357 Score = 31.6 bits (71), Expect = 0.017 Identities = 47/239 (19%), Positives = 91/239 (38%), Gaps = 22/239 (9%) Query: 315 AGQASASATAAGKSAESAASSASTATTKAGEATEQATAAARSASAAKTSETNAKASETRA 374 +G KS SAA A+ + A QA AAR+ +AA ++ +A Sbjct: 32 SGSGGGGGKGGSKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAA--------EAQAKA 83 Query: 375 ESSKTAAASSASSAASSASSASASKDEATRQASAAKGSATTASTKATEAAGSATAAAQSK 434 ++++ A + A +AS+ + + + A+ A +A A+++ Sbjct: 84 KANRDALTQRLKDIVNEALRHNASRTPSATELA-------HANNAAMQAEDERLRLAKAE 136 Query: 435 STAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSESLAATPKAVKAA 494 A A AE A + AE + E A T ++ ++L+ A +L+ KAV+ A Sbjct: 137 EKARKEAEAAEKAFQEAEQRRKEIEREKAETERQ--LKLAEAEEKRLAALSEEAKAVEIA 194 Query: 495 YDL-----ANGKYTAQDATTAQKGIIQLSSATNSTSETLAATPKAVKAANDNAEKRLQK 548 + + T + A ++ +TLA + A+ ++ + Sbjct: 195 QKKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYKELDEL 253
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 81.9 bits (202), Expect = 2e-22 Identities = 41/128 (32%), Positives = 68/128 (53%), Gaps = 15/128 (11%) Query: 1 MRK-VCAAILSAAICLAVSGAPAWASEHQSTLSAGYLHARTNVPGSDDLNGINVKYRYEF 59 M+K C + L+A LA + + A+ ST++ GY + + + G N+KYRYE Sbjct: 1 MKKIACLSALAA--VLAFTAGTSVAA--TSTVTGGYAQSDAQGQMNK-MGGFNLKYRYEE 55 Query: 60 TDT-LGLVTSFSYAGDKNRQLTRYSDTRWHEDSVRNRWFSVMAGPSVRVNEWFSAYAMAG 118 ++ LG++ SF+Y T S T D +N+++ + AGP+ R+N+W S Y + G Sbjct: 56 DNSPLGVIGSFTY--------TEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVG 107 Query: 119 VAYSRVST 126 V Y + T Sbjct: 108 VGYGKFQT 115
>SURFACELAYER#Lactobacillus surface layer protein signature. Length = 439 Score = 35.8 bits (82), Expect = 0.001 Identities = 35/143 (24%), Positives = 46/143 (32%), Gaps = 30/143 (20%) Query: 993 SVNANAGTLNNVTVNENCTIKGMLEATQV----RGDF---------VKAVSKSFPKQAGT 1039 + + L NVT + +K L+A ++ G F VKA S K A Sbjct: 235 AAQYDKKQLTNVTFDTETAVKDALKAQKIEVSSVGYFKAPHTFTVNVKATSNKNGKSATL 294 Query: 1040 WGNTETPNGTVTVTISDDHNFDRQIIIPPIIFNGIAYDDPGSGNNPGGTRYTGYGFEVRK 1099 PN V S I+ N YD + G R Sbjct: 295 PVTVTVPNVADPVVPSQSKT---------IMHNAYFYDKDA--------KRVGTDKVTRY 337 Query: 1100 NGVLIASRETKGAIPGSYSAVID 1122 N V +A TK A SY VI+ Sbjct: 338 NTVTVAMNTTKLANGISYYEVIE 360
>PF06291#Lambda prophage Bor protein Length = 102 Score = 27.7 bits (61), Expect = 0.014 Identities = 13/40 (32%), Positives = 19/40 (47%), Gaps = 5/40 (12%) Query: 122 MTGILFSLGASMVLGGVAQML-----APKARTPRTQTTDN 156 M +LFS +M++ G AQ P A TP+ T + Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHH 45
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 39.7 bits (92), Expect = 4e-05 Identities = 56/377 (14%), Positives = 124/377 (32%), Gaps = 36/377 (9%) Query: 236 SGLTAMARQFHNVTAEQIAYVAQLQRSGDEAGALQAANEAATKGFDDQTRRLKENMGTLE 295 S R+ +E+ + + +L+ + + + + L+ L Sbjct: 95 SNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALA 154 Query: 296 TWADRTARAFKSMWDAVLDI-GRPDTAQEMLIKAEAAFKKADDIWNLRKDDYFVNDEARA 354 +A + + + T + EA + + + + Sbjct: 155 ARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIK 214 Query: 355 RYWDDREK-KRLERDAAQKRVDQQRQQDKNAQQQSDTEASRLKYTEEAQKAYERLQTPLE 413 ++ + D + ++ + EA + + + L+ + Sbjct: 215 TLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMN 274 Query: 414 KYTARQEELNKALKDGKILQADYNTLMAAAKKDYEATLKKPKQSGVKVYAGDRQEDSAHA 473 TA ++ + L+A+ L + A + + R D++ Sbjct: 275 FSTADSAKIKTLEAEKAALEAEKADLEHQ-SQVLNANRQSLR----------RDLDASRE 323 Query: 474 ALLTLQAELRTLEKHAGANEKISQQ-RRDL-------WKAESQFAVLEEAAQRRQLSAQE 525 A L+AE + LE+ +E Q RRDL + E++ LEE + + S Q Sbjct: 324 AKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQS 383 Query: 526 KS--LLAHKDETLEYKRQLAALGDKVTYQEHLNALAQQADKFAQQQRAKRAAIDAKNRGL 583 L A ++ + ++ L K+ E LN +++ K ++++A Sbjct: 384 LRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKA------------ 431 Query: 584 TDRQAAREATEQRLKEQ 600 + QA EA + LKE+ Sbjct: 432 -ELQAKLEAEAKALKEK 447
>INTIMIN#Intimin signature. Length = 939 Score = 28.5 bits (63), Expect = 0.026 Identities = 28/202 (13%), Positives = 60/202 (29%), Gaps = 29/202 (14%) Query: 66 DWTATGQGQKSAGDTSFT----LAWMPGEQGQQALLAWFNEGDTRAYKIRFPNGTVDVFR 121 G G+ + S + + AL + A I + Sbjct: 611 SANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSAL-------NANAV-IFVDQTKASITE 662 Query: 122 GWVSSIGKAVTAKEVITRTVKVTNVGRPSMAEDRSTVTATTGMTVTP--------ASASV 173 ++ IT TVKV +P ++ + T ++ + A ++ Sbjct: 663 IKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTL 722 Query: 174 VKGQSTTLTVAFQPDGA-------TDKSFRAVSADKTKATVSVSGMTITVKG--VAAGKV 224 V+ + + F ++ D + +G+ + + G+V Sbjct: 723 TSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQV 782 Query: 225 NIPVVSGNGEFAAVAEINVTAS 246 N+ GNG++ + AS Sbjct: 783 NLKASGGNGKYTWRSANPAIAS 804
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 508 bits (1310), Expect = 0.0 Identities = 241/388 (62%), Positives = 280/388 (72%), Gaps = 33/388 (8%) Query: 1 MKKLTVAISAVAASVLMAMSAQAAEIYNKDSNKLDLYGKVNAKHYFSSNDADDGDTTYAR 60 MK+ +A+ V ++L A +A AAEIYNKD NKLDLYGKV+ HYFS + + DGD TY R Sbjct: 1 MKRKVLAL--VIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMR 58 Query: 61 LGFKGETQINDQLTGFGQWEYEFKGNRAESQGSSKDKTRLAFAGLKFGDYGSIDYGRNYG 120 +GFKGETQINDQLTG+GQWEY + N E +G++ TRLAFAGLKFGDYGS DYGRNYG Sbjct: 59 VGFKGETQINDQLTGYGQWEYNVQANTTEGEGANS-WTRLAFAGLKFGDYGSFDYGRNYG 117 Query: 121 VAYDIGAWTDVLPEFGGDTWTQTDVFMTGRTTGVATYRNNDFFGLVDGLNFAAQYQGKND 180 V YD+ WTD+LPEFGGD++T D +MTGR GVATYRN DFFGLVDGLNFA QYQGKN+ Sbjct: 118 VLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNE 177 Query: 181 R----------------TDVTEANGDGFGFSTTYEY-EGFGVGATYAKSDRTNNQVIYGN 223 D+ NGDGFG STTY+ GF GA Y SDRTN QV G Sbjct: 178 SQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGG 237 Query: 224 NSLNASGQNAEVWAAGLKYDANNIYLATTYSETQNMTVFG------NNHIANKAQNFEVV 277 A G A+ W AGLKYDANNIYLAT YSET+NMT +G + +ANK QNFEV Sbjct: 238 T--IAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVT 295 Query: 278 AQYQFDFGLRPSVAYLQSKGKDLG----AWGDQDLVEYIDVGATYYFNKNMSTFVDYKIN 333 AQYQFDFGLRP+V++L SKGKDL D+DLV+Y DVGATYYFNKN ST+VDYKIN Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKIN 355 Query: 334 LIDKSD-FTKASGVATDDIVAVGMVYQF 360 L+D D F K +G++TDDIVA+GMVYQF Sbjct: 356 LLDDDDPFYKDAGISTDDIVALGMVYQF 383
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 121 bits (306), Expect = 4e-32 Identities = 92/404 (22%), Positives = 167/404 (41%), Gaps = 17/404 (4%) Query: 19 VTIALSLATFMQMLDSTISNVAIPTISGFLGASTDEGTWVITSFGVANAIAIPVTGRLAQ 78 + I L + +F +L+ + NV++P I+ WV T+F + +I V G+L+ Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74 Query: 79 RIGELRLFLLSVTFFSLSSLMCSLS-TNLDVLIFFRVVQGLMAGPLIPLSQSLLLRNYPP 137 ++G RL L + S++ + + +LI R +QG A L ++ R P Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134 Query: 138 EKRTFALALWSMTVIIAPICGPILGGYICDNFSWGWIFLINVPMGIIVLTLCLTLLKGRE 197 E R A L V + GP +GG I W +L+ +PM I+ L L +E Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKE 192 Query: 198 TETSPVKMNLPGLTLLVLGVGGLQIMLDKGRDLDWFNSSTIIILTVVSVISLISLVIWES 257 ++ G+ L+ +G+ + ML F +S I +VSV+S + V Sbjct: 193 VRIKG-HFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIR 241 Query: 258 TSENPILDLSLFKSRNFTIGIVSITCAYLFYSGAIVLMPQLLQETMGYNAIWAGLAYAPI 317 +P +D L K+ F IG++ + +G + ++P ++++ + G Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301 Query: 318 GIMPLLIS-PLIGRYGNKIDMRLLVTFSFLMYAVCYYWRSVTFMPTIDFTGIILPQFFQG 376 G M ++I + G ++ ++ +V + S T F II+ G Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361 Query: 377 FAVACFFLPLTTISFSGLPDNKFANASSMSNFFRTLSGSVGTSL 420 + ++TI S L + S+ NF LS G ++ Sbjct: 362 LSFTK--TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 79.5 bits (196), Expect = 2e-18 Identities = 63/419 (15%), Positives = 124/419 (29%), Gaps = 96/419 (22%) Query: 8 KKQSNRKKYFSLLVIVLFIAFSGAYAYWSMELEDMISTDDAYVT-GNADPISAQVSGSVT 66 + +R+ I+ F+ + + ++E + + + G + I + V Sbjct: 50 ETPVSRRPRLVAYFIMGFLVIAFILSVLG-QVEIVATANGKLTHSGRSKEIKPIENSIVK 108 Query: 67 VVNHKDTNYVRQGDILVSLDKTDATIALNKA----------------------------- 97 + K+ VR+GD+L+ L A K Sbjct: 109 EIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPEL 168 Query: 98 -----------------------KNNLANIVRQTNKLYLQDKQYSAEVASARIQ---YQQ 131 K + Q + L + AE + + Y+ Sbjct: 169 KLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYEN 228 Query: 132 SLEDYNRRV----PLAKQGVISKE----------TLEHTKDTLISSKAALNAAIQAYKAN 177 R+ L + I+K + S + + I + K Sbjct: 229 LSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEE 288 Query: 178 KALVMN-------TPLNR-QPQVVEAADATKEAWLALKRTDIRSPVTGYIAQRSVQ-VGE 228 LV L + + + + + IR+PV+ + Q V G Sbjct: 289 YQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGG 348 Query: 229 TVSPGQSLMAVVPARQ-MWVNANFKETQLTDVRIGQSVNIISDLYGENVVFHGRVTGINM 287 V+ ++LM +VP + V A + + + +GQ+ I + F G + Sbjct: 349 VVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVE------AFPYTRYGYLV 402 Query: 288 GTGNAFSLLPAQNATGNWIKIVQRVPVEVSLDPKELMEH----PLRIGLSMTATIDTKD 342 G + + +V V +S++ L PL G+++TA I T Sbjct: 403 GK---VKNINLDAIEDQRLGLVFNVI--ISIEENCLSTGNKNIPLSSGMAVTAEIKTGM 456
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 46.7 bits (111), Expect = 2e-08 Identities = 22/148 (14%), Positives = 53/148 (35%), Gaps = 31/148 (20%) Query: 4 IIIDDHPLAIAAIRNLLIKNDIEILAELTEGGSAVQRVETLKPDIVIIDVDIPGVNGILV 63 ++ DD + L + ++ + + + + D+V+ DV +P N + Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRIT-SNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 64 LETLRKRQYSGIIIIVSAKNDHFYGKHCADAGANGFVSKKEGMNNIIAAIEAAKNGYCYF 123 L ++K + ++++SA+N + AI+A++ G + Sbjct: 66 LPRIKKARPDLPVLVMSAQNT------------------------FMTAIKASEKGAYDY 101 Query: 124 ---PFSLNRFVGSLTSDQQKLDSLSKQE 148 PF L + + L ++ Sbjct: 102 LPKPFDLTE---LIGIIGRALAEPKRRP 126
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 79.5 bits (196), Expect = 2e-17 Identities = 30/105 (28%), Positives = 51/105 (48%) Query: 960 SILIADDHPTNRLLLKRQLNLLGYDVDEATDGVQALHKVSMQHYDLLITDVNMPNMDGFE 1019 +IL+ADD R +L + L+ GYDV ++ ++ DL++TDV MP+ + F+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 1020 LTRKLREQNSSLPIWGLTANAQANEREKGLSCGMNLCLFKPLTLD 1064 L ++++ LP+ ++A K G L KP L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 31.5 bits (71), Expect = 6e-04 Identities = 15/102 (14%), Positives = 38/102 (37%), Gaps = 4/102 (3%) Query: 24 LRPWNDPEMDIERKMNHDVSLFLVAEVNGEVVG--TVMGGYDGHRGSAYYLGVHPEFRGR 81 + + D +MD+ + FL + +G + ++G + V ++R + Sbjct: 47 FKQYEDDDMDVSYVEEEGKAAFL-YYLENNCIGRIKIRSNWNG-YALIEDIAVAKDYRKK 104 Query: 82 GIANALLNRLEKKLIARGCPKIQINVPEDNDMVLGMYERLGY 123 G+ ALL++ + + + + N Y + + Sbjct: 105 GVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 50.5 bits (121), Expect = 2e-09 Identities = 33/116 (28%), Positives = 50/116 (43%), Gaps = 9/116 (7%) Query: 63 VRDGIVWDFFGAVTIVRRHLD-TLEQQFGRRFSHAATSFPPGTDP---RISINVLESAGL 118 ++DG++ DFF +++ + F R P G R + AG Sbjct: 76 MKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGA 135 Query: 119 EVSHVLDEPTAVA---DLLQLDNAG--VVDIGGGTTGIAIVKKGKVTYSADEATGG 169 +++EP A A L + G VVDIGGGTT +A++ V YS+ GG Sbjct: 136 REVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGG 191
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 108 bits (271), Expect = 4e-31 Identities = 40/151 (26%), Positives = 66/151 (43%), Gaps = 21/151 (13%) Query: 18 GCQSPQGKFTPEQVAAMQSYGFTESAGDWSLGLSDAILFAKNDYKLLPESQQQIQTMAAK 77 G +P P +Q+ FT L +LF N L PE Q + + ++ Sbjct: 194 GEAAPVVAPAPAPAPEVQTKHFT---------LKSDVLFNFNKATLKPEGQAALDQLYSQ 244 Query: 78 LASTGLTHARMD--GHTDNYGEDSYNEGLSLKRANVVADAWAIGGQIPRSNLTTQGLGKK 135 L++ + G+TD G D+YN+GLS +RA V D + I IP ++ +G+G+ Sbjct: 245 LSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVD-YLISKGIPADKISARGMGES 303 Query: 136 YPIASNKTAQGR---------AENRRVAVVI 157 P+ N + A +RRV + + Sbjct: 304 NPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 26.3 bits (58), Expect = 0.032 Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 11/87 (12%) Query: 4 KTLTAAAAVLLMLTAGCSTLERVVYRPDINQGNYLTANDVSKIRV--GMTQQQVAYALGT 61 K + AVL + AG LER ++ Q + + + VS+ + GMT ++ A Sbjct: 69 KVV-LCGAVLARVDAGDEQLERKIH---YRQQDLVDYSPVSEKHLADGMTVGELCAA--A 122 Query: 62 PLMSDPFGTNTWFYVFRQQPGHEGVTQ 88 MSD N + G G+T Sbjct: 123 ITMSDNSAANL---LLATVGGPAGLTA 146
>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature. Length = 1147 Score = 27.0 bits (59), Expect = 0.012 Identities = 19/75 (25%), Positives = 37/75 (49%), Gaps = 8/75 (10%) Query: 12 IDGNQAKVD--VCGIQRDVDLTLVGSCDENGQPRVGQWVLVHVGFAMSVINEAEARDTLD 69 I GNQ + D G+ D L ++NG+P G W+ + + F + ++ ++ D + Sbjct: 171 IIGNQIRTDQKFMGV-FDESLKERQEAEKNGEPTGGDWLDIFLSF---IFDKKQSSDVKE 226 Query: 70 ALQN--MFDVEPDVG 82 A+ + V+PD+ Sbjct: 227 AINQEPVPHVQPDIA 241
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 35.6 bits (82), Expect = 4e-04 Identities = 45/314 (14%), Positives = 112/314 (35%), Gaps = 36/314 (11%) Query: 93 LGSLVLGWISDHIGRQKIFTFSFLLITLASFLQFFATTP-EHLIGLRILIGIGLGGDYSV 151 +G+ V G +SD +G +++ F ++ S + F + LI R + G G ++ Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL 123 Query: 152 GHTLLAEFSPRRHRGILLGAFSVVWT----VGYVLASIAGHHFISENPEAWRWLLASAAL 207 ++A + P+ +RG G + VG + + H+ W +LL + Sbjct: 124 VMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI------HWSYLLLIPMI 177 Query: 208 PALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVVTATHKHIKTLF-- 265 + + L + R +G F I+ +L + + + L Sbjct: 178 TIITVPFLMKLLKKEVR---IKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFL 234 Query: 266 -SSRYWRRTA--------FNSVFFVCLVIPWFVIYT----WLPTIAQTIGLEDALTASLM 312 ++ R+ ++ F+ V+ +I+ ++ + + L+ + + Sbjct: 235 IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEI 294 Query: 313 LNALLIVGALLGLV-------LTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLF 365 + ++ G + ++ L L L+ + + + L +S + + Sbjct: 295 GSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTII 354 Query: 366 VLFSTTISAVSNLV 379 ++F + + V Sbjct: 355 IVFVLGGLSFTKTV 368
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 29.0 bits (65), Expect = 0.035 Identities = 21/103 (20%), Positives = 45/103 (43%), Gaps = 8/103 (7%) Query: 48 GLIMSTFGIAAIILYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQVA 107 G++++ + + G ++D+F R ++ ++ + +MAT P LWV+ ++ Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105 Query: 108 FAITTILMLWSVSIKAASLLGD---HSEQGKIMGWMEGLRGVG 147 IT + A + + D E+ + G+M G G Sbjct: 106 AGITG-----ATGAVAGAYIADITDGDERARHFGFMSACFGFG 143
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 55.7 bits (134), Expect = 2e-10 Identities = 39/167 (23%), Positives = 69/167 (41%), Gaps = 1/167 (0%) Query: 38 LDIGVIAGALPFITDHFVLTSRLQEWVVSSMMLGAAIGALFNGWLSFRLGRKYSLMAGAI 97 L+ V+ +LP I + F WV ++ ML +IG G LS +LG K L+ G I Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87 Query: 98 LFVLGSIGSAFATS-VEMLIAARVVLGIAVGIASYTAPLYLSEMASENVRGKMISMYQLM 156 + GS+ S +LI AR + G + ++ + RGK + + Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSI 147 Query: 157 VTLGIVLAFLSDTAFSYSGNWRAMLGVLALPAVLLIILVVFLPNSPR 203 V +G + ++ +W +L + + + + L+ L R Sbjct: 148 VAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVR 194
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 112 bits (280), Expect = 6e-32 Identities = 72/257 (28%), Positives = 130/257 (50%), Gaps = 11/257 (4%) Query: 3 LSAFSLEGKVAVVTGCDTGLGQGMALGLAQAGCDIVGI--NIVEPTETIKQVTALGRRFL 60 ++A +EGK+A +TG G+G+ +A LA G I + N + + + + A R Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60 Query: 61 SLTADLRKIDGIPALLDRAVAEFGHIDILVNNAGLIRREDALEFSEKDWDDVMNLNIKSV 120 + AD+R I + R E G IDILVN AG++R S+++W+ ++N V Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120 Query: 121 FFMSQAAAKHFIAQRNGGKIINITSMLSFQGGIRVPSYTASKSGVMGVTRLMANEWAKHN 180 F S++ +K+ + +R G I+ + S + + +Y +SK+ + T+ + E A++N Sbjct: 121 FNASRSVSKYMMDRR-SGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179 Query: 181 INVNAIAPGYMATNNTQQLRADEQRSAEILD--------RIPAGRWGLPSDLMGPIVFLA 232 I N ++PG T+ L ADE + +++ IP + PSD+ ++FL Sbjct: 180 IRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239 Query: 233 SSASDYVNGYTIAVDGG 249 S + ++ + + VDGG Sbjct: 240 SGQAGHITMHNLCVDGG 256
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 71.5 bits (175), Expect = 2e-18 Identities = 28/164 (17%), Positives = 65/164 (39%), Gaps = 9/164 (5%) Query: 1 MSTETIEIFNNSDEWANQLKHALSKGENLALLHGLTPDILDRIYAYAFDYHEKGNITDAE 60 M ET + + E+ ++ L G +A+L+ ++ D L+++Y+ AF+ ++ G DA Sbjct: 1 MQQETTD----TQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAH 56 Query: 61 IYYKFLCIYAFENHEYLKDFASVCQPKKKYQQAYDLYKLSYNYSPYDDYSVIYRMGQCQI 120 ++ LC+ + + + Q +Y A Y + + +C + Sbjct: 57 KVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDI-KEPRFPFHAAECLL 115 Query: 121 GAKNIDNAMQCFYH----IINNCEDDSVKSKAQAYIELLNDNSE 160 + A + I + E + ++ + +E + E Sbjct: 116 QKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKE 159
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 28.5 bits (63), Expect = 0.011 Identities = 36/163 (22%), Positives = 66/163 (40%), Gaps = 35/163 (21%) Query: 14 AMILSNNVFADEGHGIVKFKGEVISAPCSIKPGDEDLTVNLGEVADTVLKSDQKSLAE-- 71 A+++S +V A + + FKG++I C++ ++ VN G++ L + + Sbjct: 15 AVLMSQHVHAADN---LTFKGKLIIPACTV----QNAEVNWGDIEIQNLVQSGGNQKDFT 67 Query: 72 -----PFTIHLQDCMLSQGGTTYSKAKVTFTTANTMTGQTDLLKNTKETEIGGATGVGVR 126 P+++ ++ G T + V T+ + G L N+ + IG A Sbjct: 68 VDMNCPYSLGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGNA------ 121 Query: 127 ILDSQSGEVTLGTPVV---ITFNNTNS----YQELNFKARMES 162 VTLG+ V IT Y +L +K M+S Sbjct: 122 --------VTLGSQVTPGKITGTAPARKITLYAKLGYKGNMQS 156
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 285 bits (729), Expect = 2e-80 Identities = 136/571 (23%), Positives = 223/571 (39%), Gaps = 122/571 (21%) Query: 33 KKVILGIILSSIYGSYGETAFA-AMLDINNIWTRDYLDLAQNRGEFRPGATNVQLMMKDG 91 KK L I ++ +Y T + A L +++ + + D A+N+G+F GATNV + K+ Sbjct: 4 KKFKLNFIALTV--AYALTPYTEAALVRDDVDYQIFRDFAENKGKFSVGATNVLVKDKNN 61 Query: 92 KIFH--FPE-LPVPDFSAVS-NKGATTSIGGAYSVTATH--------------------N 127 K P +P+ DFS V +K T I Y V H N Sbjct: 62 KDLGTALPNGIPMIDFSVVDVDKRIATLINPQYVVGVKHVSNGVSELHFGNLNGNMNNGN 121 Query: 128 GTQHHAITTQSWDQTAYKASNRVSS----------------GDFSVHRLNKFVVETTGVT 171 H ++++ + + + + D+ + RL+KFV T Sbjct: 122 AKAHRDVSSEENRYFSVEKNEYPTKLNGKTVTTEDQTQKRREDYYMPRLDKFV------T 175 Query: 172 ESADFSLSPEDAMKRYGVNYNGKEQ-IIGFRAGAGTTSTILNGKQY-------------- 216 E A S + YN + + R G+G+ G Y Sbjct: 176 EVAPIEASTASS---DAGTYNDQNKYPAFVRLGSGSQFIYKKGDNYSLILNNHEVGGNNL 232 Query: 217 -LFGQNYNPDLLSASLFNLDWKNKSYIYT--------------NRTPFKNSPIFGDSGSG 261 L G Y ++ + + ++ +N I ++ P N + GDSGS Sbjct: 233 KLVGDAYTY-GIAGTPYKVNHENNGLIGFGNSKEEHSDPKGILSQDPLTNYAVLGDSGSP 291 Query: 262 SYLYDKEQQKWVFHGVTSTVGFLSSTNIAWTNYSLFNNILVNNLKKNFTNTMQLDGKKQE 321 ++YD+E+ KW+F G + +W ++++ + ++ + + K Sbjct: 292 LFVYDREKGKWLFLGSYD--FWAGYNKKSWQEWNIYKSQFTKDVLNKDSAGSLIGSKTDY 349 Query: 322 LSSIIKD-------------------------KDLSVSGGGELTLKQDTDLGIGGLIFDK 356 S K ++ G G LTL + D G GGL F+ Sbjct: 350 SWSSNGKTSTITGGEKSLNVDLADGKDKPNHGKSVTFEGSGTLTLNNNIDQGAGGLFFEG 409 Query: 357 NQTYKVYGKDKSYKGAGIDIDNNTTVEWNVKGVAGDNLHKIGSGTLDVKIAQGN--NLKI 414 + K + ++KGAG+ + TV W V D L KIG GTL V+ N +LK+ Sbjct: 410 DYEVKGTSDNTTWKGAGVSVAEGKTVTWKVHNPQYDRLAKIGKGTLIVEGTGDNKGSLKV 469 Query: 415 GNGTVIL------SAEKAFNKIYMAGGKGTVKINAKDALSESGNGEIYFTRNGGTLDLNG 468 G+GTVIL S + AF + + G+ T+ +N + + IYF GG LDLNG Sbjct: 470 GDGTVILKQQTNGSGQHAFASVGIVSGRSTLVLNDDKQVDPNS---IYFGFRGGRLDLNG 526 Query: 469 YDQSFQKIAATDAGTTVTNSNVKQ-STLSLT 498 +F I D G + N N+ S +++T Sbjct: 527 NSLTFDHIRNIDDGARLVNHNMTNASNITIT 557
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 30.2 bits (68), Expect = 0.017 Identities = 36/239 (15%), Positives = 76/239 (31%), Gaps = 18/239 (7%) Query: 174 SHMQLYIGAALSAILVLFTLTLPHIPVAKQQANQSWTTLLGLDAFALFKNKRMAIFFIFS 233 H + AAL+ + L L + + L+ A F+ R Sbjct: 159 PHAPFFAAAALNGLNFLTGCFLLPES---HKGERRPLRREALNPLASFRWARGMTVVAAL 215 Query: 234 MLLGAELQITNMFGNTFLHSFDKDPMFASSFIVQHASIIMSISQISETLF-ILTIPFFLS 292 M + +Q+ F +D + + I ++ I +L + + Sbjct: 216 MAVFFIMQLVGQVPAALWVIFGEDRFHWDATTI---GISLAAFGILHSLAQAMITGPVAA 272 Query: 293 RYGIKNVMMISIVAWILRFALFAYGDPTPFGTVLLVLSMIVYGCAFDFFNISGSVFVEKE 352 R G + +M+ ++A + L A+ + ++V + + + ++ Sbjct: 273 RLGERRALMLGMIADGTGYILLAF-----ATRGWMAFPIMVLLASGGIGMPALQAMLSRQ 327 Query: 353 VSPAIRASAQGMFLMMTNGFGCILGGIVSGKVVEMYTQNGITDWQ-TVWLIFAGYSVVL 410 V + QG +T+ L IV + IT W W+ A ++ Sbjct: 328 VDEERQGQLQGSLAALTS-----LTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLC 381
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 75.3 bits (185), Expect = 6e-16 Identities = 75/434 (17%), Positives = 142/434 (32%), Gaps = 32/434 (7%) Query: 291 NSRVDAYRNEQLLGSFYLNSGSQFIDTSSFPPGSYSVALKVYENNQLTRTELVPFTKTGG 350 ++V +N + + + G I+ S + + + E + T+ VP++ Sbjct: 308 TAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPL 367 Query: 351 LT-DGNAQWFLQAGKTTSQVS-DDESSAYQLGVRLPLHPQYELYAGLANADDVSAFELGN 408 L +G+ ++ + AG+ S + ++ +Q + L + +Y G AD AF G Sbjct: 368 LQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGI 427 Query: 409 NWTADLGGVGNLAISASVFRNDDGGKGDMQQANWS-NPGWPTLGF------YRTNSDG-- 459 G ++ ++ + D + D Q + N G YR ++ G Sbjct: 428 GKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYF 487 Query: 460 -DACTTDSRESYNALSCYESISATVSQNFVGWNMMLGYTRTQNNTDDSLRWDKQQSFENN 518 A TT SR + + + + V F + + R + + + + + + Sbjct: 488 NFADTTYSRMNGYNIETQDGV-IQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLS 546 Query: 519 YLRQTT--AQSISETVQLSASRAFVMRDWILSTSVGVFHRNDNGGDNDDNGLYLSFS--L 574 QT ++ E Q + AF ++ ++ + D L L+ + Sbjct: 547 GSHQTYWGTSNVDEQFQAGLNTAFED----INWTLSYSLTKNAWQKGRDQMLALNVNIPF 602 Query: 575 SDTPTMDSNNNSHSTNVSTDYRYSEQDGDQTSWQLSHTFYNDSFSHKEL--GVTVGGLNT 632 S DS + + S + + T D+ + G GG Sbjct: 603 SHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGN 662 Query: 633 DTINSAVNGRWDGQYGNVYATVSDSYDRKNHDHLSAFTGTYSSTLAVSCYGVNLGASGTD 692 + G YGN S S + L S + GV LG D Sbjct: 663 SGSTGYATLNYRGGYGNANIGYSHS---DDIKQLYY---GVSGGVLAHANGVTLGQPLND 716 Query: 693 DLLGAVLVDVKGFS 706 VLV G Sbjct: 717 ---TVVLVKAPGAK 727
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 533 bits (1376), Expect = 0.0 Identities = 173/397 (43%), Positives = 253/397 (63%), Gaps = 11/397 (2%) Query: 11 VLVINCGSSSIKFSVLDASDCEVLMSGIADGINSENAFLSVN-GGEPAP--LAHHSYEGA 67 +LVINCGSSS+K+ ++++ D VL G+A+ I ++ L+ N GE ++ A Sbjct: 3 ILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKDA 62 Query: 68 LKAIAFELEKRNLN-----DSVALIGHRIAHGGSIFTESAIITDEVIDNIRRVSPLAPLH 122 +K + L + + +GHR+ HGG FT S +ITD+V+ I LAPLH Sbjct: 63 IKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIELAPLH 122 Query: 123 NYANLSGIESAQQLFPGVTQVAVFDTSFHQTMAPEAYLYGLPWKYYEELGVRRYGFHGTS 182 N AN+ GI++ Q+ P V VAVFDT+FHQTM AYLY +P++YY + +R+YGFHGTS Sbjct: 123 NPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFHGTS 182 Query: 183 HRYVSQRAHSLLNLAEDDSGLVVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSG 242 H+YVSQRA +LN + ++ HLGNG+SI AV+NG+S+DTSMG TPLEGL MGTRSG Sbjct: 183 HKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGTRSG 242 Query: 243 DVDFGAMSWVASQTNQSLGDLERVVNKESGLLGISGLSSDLR-VLEKAWHEGHERAQLAI 301 +D +S++ + N S ++ ++NK+SG+ GISG+SSD R + + A+ G +RAQLA+ Sbjct: 243 SIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQLAL 302 Query: 302 KTFVHRIARHIAGHAASLRRLDGIIFTGGIGENSSLIRRLVMEHLAVLGVVIDTEMNNRS 361 F +R+ + I +AA++ +D I+FT GIGEN IR +++ L LG +D E N Sbjct: 303 NVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKNKVR 362 Query: 362 NSFGERIVSSENARVICAVIPTNEEKMIALDAIHLGK 398 E I+S+ +++V V+PTNEE MIA D + + Sbjct: 363 GE--EAIISTADSKVNVMVVPTNEEYMIAKDTEKIVE 397
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1292 bits (3346), Expect = 0.0 Identities = 724/1032 (70%), Positives = 846/1032 (81%), Gaps = 1/1032 (0%) Query: 1 MANYFIDRPVFAWVLAIIMMLAGGLAIMNLPVAQYPQIAPPTITVSATYPGADAQTVEDS 60 MAN+FI RP+FAWVLAII+M+AG LAI+ LPVAQYP IAPP ++VSA YPGADAQTV+D+ Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGLDGLMYMSSTSDAAGNASITLTFETGTSPDIAQVQVQNKLQLAMPSLPE 120 VTQVIEQNMNG+D LMYMSSTSD+AG+ +ITLTF++GT PDIAQVQVQNKLQLA P LP+ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 AVQQQGISVDKSSSNILMVAAFISDNGSLNQYDIADYVASNIKDPLSRTAGVGSVQLFGS 180 VQQQGISV+KSSS+ LMVA F+SDN Q DI+DYVASN+KD LSR GVG VQLFG+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 EYAMRIWLDPQKLNKYNLVPSDVISQIKVQNNQISGGQLGGMPQAADQQLNASIIVQTRL 240 +YAMRIWLD LNKY L P DVI+Q+KVQN+QI+ GQLGG P QQLNASII QTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 QTPEEFGKILLKVQQDGSQVLLRDVARVELGAEDYSTVARYNGKPAAGIAIKLATGANAL 300 + PEEFGK+ L+V DGS V L+DVARVELG E+Y+ +AR NGKPAAG+ IKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTSRAVKEELNRLSAYFPASLKTVYPYDTTPFIEISIQEVFKTLVEAIILVFLVMYLFLQ 360 DT++A+K +L L +FP +K +YPYDTTPF+++SI EV KTL EAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NFRATIIPTIAVPVVILGTFAILSAVGFTINTLTMFGMVLAIGLLVDDAIVVVENVERVI 420 N RAT+IPTIAVPVV+LGTFAIL+A G++INTLTMFGMVLAIGLLVDDAIVVVENVERV+ Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 AEDKLPPKEATHKSMGQIQRALVGIAVVLSAVFMPMAFMSGATGEIYRQFSITLISSMLL 480 EDKLPPKEAT KSM QIQ ALVGIA+VLSAVF+PMAF G+TG IYRQFSIT++S+M L Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVFVAMSLTPALCATILKAAPEGGHK-PNALFARFNTLFEKSTQHYTDSTRSLLRCTGRY 539 SV VA+ LTPALCAT+LK H+ F FNT F+ S HYT+S +L TGRY Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 540 MVVYLLICAGMAVLFLRTPTSFLPEEDQGVFMTTAQLPSGATMVNTTKVLQQVTDYYLTK 599 +++Y LI AGM VLFLR P+SFLPEEDQGVF+T QLP+GAT T KVL QVTDYYL Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 600 EKDNVQSVFTVGGFGFSGQGQNNGLAFISLKPWSERVGEENSVTAIIQRAMIALSSINKA 659 EK NV+SVFTV GF FSGQ QN G+AF+SLKPW ER G+ENS A+I RA + L I Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 660 VVFPFNLPAVAELGTASGFDMELLDNGNLGHEKLTQARNELLSLAAQSPDQVTGVRPNGL 719 V PFN+PA+ ELGTA+GFD EL+D LGH+ LTQARN+LL +AAQ P + VRPNGL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 720 EDTPMFKVNVNAAKAEAMGVALSDINQTISTAFGSSYVNDFLNQGRVKKVYVQAGTPFRM 779 EDT FK+ V+ KA+A+GV+LSDINQTISTA G +YVNDF+++GRVKK+YVQA FRM Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 780 LPDNINQWYVRNASGTMAPLSAYSSTEWTYGSPRLERYNGIPSMEILGEAAAGKSTGDAM 839 LP+++++ YVR+A+G M P SA++++ W YGSPRLERYNG+PSMEI GEAA G S+GDAM Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 Query: 840 KFMADLVAKLPAGVGYSWTGLSYQEALSSNQAPALYAISLVVVFLALAALYESWSIPFSV 899 M +L +KLPAG+GY WTG+SYQE LS NQAPAL AIS VVVFL LAALYESWSIP SV Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 Query: 900 MLVVPLGVVGALLATDLRGLSNDVYFQVGLLTTIGLSAKNAILIVEFAVEMMQKEGKTPI 959 MLVVPLG+VG LLA L NDVYF VGLLTTIGLSAKNAILIVEFA ++M+KEGK + Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 Query: 960 EAIIEAARMRLRPILMTSLAFILGVLPLVISHGAGSGAQNAVGTGVMGGMFAATVLAIYF 1019 EA + A RMRLRPILMTSLAFILGVLPL IS+GAGSGAQNAVG GVMGGM +AT+LAI+F Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 Query: 1020 VPVFFVVVEHLF 1031 VPVFFVV+ F Sbjct: 1021 VPVFFVVIRRCF 1032
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 51.0 bits (122), Expect = 4e-09 Identities = 41/218 (18%), Positives = 70/218 (32%), Gaps = 33/218 (15%) Query: 97 LQAELNSAKGSLAKALSTASNARITFNRQASLLKTNYVSR-QDYDT-ARTQLNEAEANVT 154 + + A L S + K Y Q + +L + N+ Sbjct: 257 QENKYVEAVNELRVYKSQLEQIE----SEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312 Query: 155 VAKAAVEQATINLQYANVTSPITGVSGKSSV-TVGALVTANQADSLVTVQRLDPIYVDLT 213 + + + Q + + +P++ + V T G +VT + +V V D + V Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET-LMVIVPEDDTLEVTAL 371 Query: 214 QSVQDFLRMKEEVASGQIKQVQGSTPVQLNLE--NGKRY-SQTGTLK--FSDPTVDETTG 268 +D I + + +E RY G +K D D+ G Sbjct: 372 VQNKD------------IGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLG 419 Query: 269 SVT--LRAI------FPNPNGDLLPGMYVTALVDEGSR 298 V + +I N N L GM VTA + G R Sbjct: 420 LVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMR 457 Score = 32.1 bits (73), Expect = 0.004 Identities = 25/118 (21%), Positives = 47/118 (39%), Gaps = 7/118 (5%) Query: 53 PGRTVPY-EVAEIRPQVGGIIIKRNFI-EGDKVNQGDSLYQIDPAPLQAELNSAKGSLAK 110 G+ EI+P I+ K + EG+ V +GD L ++ +A+ + SL + Sbjct: 87 NGKLTHSGRSKEIKPIENSIV-KEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQ 145 Query: 111 ALSTASNARITFNRQASLLKTNYVSRQDYDTARTQLNEAEANVTVAKAAVEQATINLQ 168 A + +I +R L K + D + N +E V + +++ Q Sbjct: 146 ARLEQTRYQIL-SRSIELNKLPELKLPDEPYFQ---NVSEEEVLRLTSLIKEQFSTWQ 199
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 46.7 bits (111), Expect = 9e-08 Identities = 80/374 (21%), Positives = 132/374 (35%), Gaps = 39/374 (10%) Query: 20 FSAGLLGIGQNGLLVVLPVLVIQTNLSLSV---WAALLMLGSMLFLPSSPWWGKQISRTG 76 + L +G ++ VLP L+ S V + LL L +++ +P G R G Sbjct: 12 STVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFG 71 Query: 77 SKPVVLWALGGYGISFTLLGLGSVLMATSAITTAVGLGILIIARIAYGLTVSAMVPACQV 136 +PV+L +L G + + ++ L +L I RI G+T + A Sbjct: 72 RRPVLLVSLAGAAVDYAIMATAPFLW------------VLYIGRIVAGITGATGAVAGAY 119 Query: 137 WALQRAGEGNRMAALATISSGLSCGRLFGPLCAAAMLAIHPLAPLGLLMAAPVLALLMLL 196 A R +S+ G + GP+ M P AP A L L Sbjct: 120 IA-DITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGC 178 Query: 197 RL------PGTPPQPTPECKSVSLKRDCLPYLLCAILLAAAVSMMQLGLSPA-LTRQFVT 249 L P ++ R + A L+A M +G PA L F Sbjct: 179 FLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGE 238 Query: 250 DTTAIS-QQVAWLLGLSAVAALIAQ---FGVLRPQRLTPVALLLSAGVLMSGGLAIMLSE 305 D + L + +AQ G + + AL+L +G + + + Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298 Query: 306 QLWLFYPGCAVLSFGAALATPAYQLLLNDKLADGAGAGWLATSHTLGYGLCALLVPLVSK 365 + W+ +P +L+ G + PA Q +L+ + D G L L L S Sbjct: 299 RGWMAFPIMVLLASG-GIGMPALQAMLS-RQVDEERQGQLQ----------GSLAALTSL 346 Query: 366 TGVAIALIMAALFA 379 T + L+ A++A Sbjct: 347 TSIVGPLLFTAIYA 360
>PF04183#IucA / IucC family Length = 580 Score = 338 bits (867), Expect = e-111 Identities = 104/480 (21%), Positives = 178/480 (37%), Gaps = 46/480 (9%) Query: 37 ELLIPLDEQKSLHFRVAYFSPTQHHRF-----AFPARLVTASGSYPVDFTTLSRLIIDKL 91 E + + Q + + P RF + + A D L++ ++ +L Sbjct: 24 EQVFHAESQGDDRYCIN--LPGAQWRFIAERGIWGWLWIDAQTLRCADEPVLAQTLLMQL 81 Query: 92 RHQLFLPVPLCETFHQRVLESHVHTQQAIDARHDWAALREKALNFGEAEQALLTGHAFHP 151 + L + Q + + + Q + AR +A LN + Q LL+GH Sbjct: 82 KQVLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSASDLINLNA-DRLQCLLSGHPKFV 140 Query: 152 APKSHEPFNRREAERYLPDMAPHFPLRWFSVDKTQIAGES-LHLNLQQRLTRFAAENAPQ 210 K + + ERY P+ A F L W +V + + +++ Q LT A PQ Sbjct: 141 FNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRCDNEMDIHQLLT---AAMDPQ 197 Query: 211 LLNELS--------DNQWLF-PLHPWQGEYLLQQGWCQALVAKGLIKDLGEAGTSWLPTT 261 S D+ WL P+HPWQ + + + A+G + LGE G WL Sbjct: 198 EFARFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADF-AEGRMVSLGEFGDQWLAQQ 256 Query: 262 SSRSLYCATSRD--MIKFSLSVRLTNSIRTLSVKEVKRGMRLARLAQ----TDGWQMLQ- 314 S R+L A+ R IK L++ T+ R + + + G +R Q TD + Sbjct: 257 SLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWLQQVFATDATLVQSG 316 Query: 315 ---VRFPTFRVMQEDGWAGLLDLNGNIMQESLFALRENLLVDQPKSQTNVLVSLTQAAPD 371 + P + +G+A L + REN ++ VL++ + Sbjct: 317 AVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLKPDESPVLMATLMECDE 376 Query: 372 GGDSLLVSAVKRLSDRLGITVQQAAHAWVDAYCQQVLKPLFTAEADYGLVLLAHQQNILV 431 L + + DR G+ A W+ + V+ PL+ YG+ L+AH QNI + Sbjct: 377 NNQPLAGAYI----DRSGLD----AETWLTQLFRVVVVPLYHLLCRYGVALIAHGQNITL 428 Query: 432 QMLGDLPVGFIYRDCQGSAFMPHATDWLDSIGEAQAENIFTHEQLLRYFPYYLLVNSTFA 491 M +P + +D QG M + + E + L++ Sbjct: 429 AMKEGVPQRVLLKDFQGD--MRLVKEEFPEMDSLPQE----VRDVTSRLSADYLIHDLQT 482
>PF04183#IucA / IucC family Length = 580 Score = 813 bits (2101), Expect = 0.0 Identities = 563/580 (97%), Positives = 569/580 (98%) Query: 1 MNHKDWDFVNRRLVAKMLSEMEYEQVFHAESQGDDHYCINLPGAQWRFIAERGIWGWLWI 60 MNHKDWD VNRRLVAKMLSE+EYEQVFHAESQGDD YCINLPGAQWRFIAERGIWGWLWI Sbjct: 1 MNHKDWDLVNRRLVAKMLSELEYEQVFHAESQGDDRYCINLPGAQWRFIAERGIWGWLWI 60 Query: 61 DAQTLRCTDEPVLAQTLLMQLKPVLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSASD 120 DAQTLRC DEPVLAQTLLMQLK VLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSASD Sbjct: 61 DAQTLRCADEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSASD 120 Query: 121 LINLDADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYTNTFRLHWLAVKREHMIWRC 180 LINL+ADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEY NTFRLHWLAVKREHMIWRC Sbjct: 121 LINLNADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRC 180 Query: 181 DNDLDIQQLLTAAMDPQEFTRFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADFVEG 240 DN++DI QLLTAAMDPQEF RFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADF EG Sbjct: 181 DNEMDIHQLLTAAMDPQEFARFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADFAEG 240 Query: 241 RMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASR 300 RMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASR Sbjct: 241 RMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASR 300 Query: 301 WLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLK 360 WLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLK Sbjct: 301 WLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLK 360 Query: 361 PDESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPLYHLLCRYGVALI 420 PDESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPLYHLLCRYGVALI Sbjct: 361 PDESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPLYHLLCRYGVALI 420 Query: 421 AHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEAFPEMDSLPQEVRDATSRLSADYLIHDL 480 AHGQNITLAMKEGVPQRVLLKDFQGDMRLVKE FPEMDSLPQEVRD TSRLSADYLIHDL Sbjct: 421 AHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMDSLPQEVRDVTSRLSADYLIHDL 480 Query: 481 QTGHFVTVLRFISPLMVRLGVPERRFYQLLAAVLSDYMNKHPQMAERFALFSLFRPQIIR 540 QTGHFVTVLRFISPLMVRLGVPERRFYQLLAAVLSDYM KHPQM+ERFALFSLFRPQIIR Sbjct: 481 QTGHFVTVLRFISPLMVRLGVPERRFYQLLAAVLSDYMKKHPQMSERFALFSLFRPQIIR 540 Query: 541 VVLNPVKLTWPDLDGGSRMLPNYLENLQNPLWLVTQEYES 580 VVLNPVKLTWPDLDGGSRMLPNYLE+LQNPLWLVTQEYES Sbjct: 541 VVLNPVKLTWPDLDGGSRMLPNYLEDLQNPLWLVTQEYES 580
>PF04619#Dr-family adhesin Length = 160 Score = 28.4 bits (63), Expect = 0.017 Identities = 12/60 (20%), Positives = 22/60 (36%), Gaps = 4/60 (6%) Query: 29 VGAKYGHKMIEFDAKLSKDGEIFLLHDDNLERTSNGWGVAGELNWQD----LLRVDAGSW 84 +G ++ D + G+ FL+ D+N ++ W + D GSW Sbjct: 70 LGCDARQVALKADTDNFEQGKFFLISDNNRDKLYVNIRPTDNSAWTTDNGVFYKNDVGSW 129
>PF05272#Virulence-associated E family protein Length = 892 Score = 32.4 bits (73), Expect = 0.003 Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 7/43 (16%) Query: 33 IVMVGPSGCGKSTLLRMVAGLERVTEGDIWINDQRVTEMEPKD 75 +V+ G G GKSTL+ + GL+ + +D KD Sbjct: 599 VVLEGTGGIGKSTLINTLVGLD-------FFSDTHFDIGTGKD 634
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 51.2 bits (122), Expect = 8e-09 Identities = 36/181 (19%), Positives = 60/181 (33%), Gaps = 13/181 (7%) Query: 19 EQTPEKETEVQNEQPVVEEI---VQAQEPVKASEQAVEEQPQAHTEAEAETFAADVVEVT 75 TP + TE E E Q+ + + Q E +A + +A T EV Sbjct: 1030 PATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTN---EVA 1086 Query: 76 EQVVESEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEEVSPEEWQAEAETV 135 + E+++ Q + V +E V E+ + +VSP++ Q+E Sbjct: 1087 QSGSETKETQTTE--TKETATVEKEEKAKVETEKTQ---EVPKVTSQVSPKQEQSETVQP 1141 Query: 136 EIVKAAEEEAAK--EEITDEELEAQALAAEAAEEAVMVVPPAEEEQPVEEIAQEQEKPTK 193 + A E + +E + A E + V P E V E P Sbjct: 1142 QAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPEN 1201 Query: 194 E 194 Sbjct: 1202 T 1202 Score = 47.8 bits (113), Expect = 9e-08 Identities = 47/213 (22%), Positives = 74/213 (34%), Gaps = 31/213 (14%) Query: 20 QTPEK-ETEVQNEQPVVEEIVQAQE----------PVKASEQAVEEQPQAHTEAE----- 63 TP + +V + EEI + E P + +E E Q E Sbjct: 998 TTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQD 1057 Query: 64 AETFAADVVEVTEQVVESEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEEV 123 A A EV ++ + KA + VAQ +ET + E V EE Sbjct: 1058 ATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKET------QTTETKETATVEKEEK 1111 Query: 124 SPEEWQAEAETVEIVKAAEEEAAKEEITDEELEAQALAAE------AAEEAVMVVPPAEE 177 + E +T E+ K + + K+E ++ A E E A+ Sbjct: 1112 AKVE---TEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADT 1168 Query: 178 EQPVEEIAQEQEKPTKEGFFARLKRSLLKTKEN 210 EQP +E + E+P E S+++ EN Sbjct: 1169 EQPAKETSSNVEQPVTESTTVNTGNSVVENPEN 1201 Score = 44.3 bits (104), Expect = 1e-06 Identities = 26/159 (16%), Positives = 48/159 (30%), Gaps = 7/159 (4%) Query: 17 QKEQTPEKETEVQNEQPVVEEIVQAQEPVKASE------QAVEEQPQAHTEAEAETFAAD 70 Q +T E T + E+ VE + P S+ Q+ QPQA E + Sbjct: 1096 QTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155 Query: 71 VVEVTEQVVESEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEEVSPEEWQA 130 ++ ++ QP E + E V E+ V + PE+ P Sbjct: 1156 KEPQSQTNTTADTEQPAKETSSNVEQPVTES-TTVNTGNSVVENPENTTPATTQPTVNSE 1214 Query: 131 EAETVEIVKAAEEEAAKEEITDEELEAQALAAEAAEEAV 169 + + + + + + A + Sbjct: 1215 SSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLT 1253 Score = 37.4 bits (86), Expect = 2e-04 Identities = 34/193 (17%), Positives = 62/193 (32%), Gaps = 22/193 (11%) Query: 17 QKEQTPEKETEVQNEQPVVEEIVQAQEPVKASEQAVEEQPQAHTEAEAETFAADVVEVTE 76 +E E ++ V+ E+ Q+ K ++ ++ + E + Sbjct: 1065 NREVAKEAKSNVKAN-TQTNEVAQSGSETKETQTTETKETATVEKEE------------K 1111 Query: 77 QVVESEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEEVSPEEWQAEAETVE 136 VE+EK Q +V +Q P E+ E P+ A E P E ++ Sbjct: 1112 AKVETEKTQEVPKVTSQVSP---------KQEQSETVQPQAEPARENDPTVNIKEPQSQT 1162 Query: 137 IVKAAEEEAAKEEITDEELEAQALAAEAAEEAVMVVPPAEEEQPVEEIAQEQEKPTKEGF 196 A E+ AKE ++ E +V+ P + + + Sbjct: 1163 NTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNR 1222 Query: 197 FARLKRSLLKTKE 209 R RS+ E Sbjct: 1223 HRRSVRSVPHNVE 1235
>SHIGARICIN#Ribosome inactivating protein family signature. Length = 289 Score = 25.9 bits (57), Expect = 0.039 Identities = 6/21 (28%), Positives = 13/21 (61%) Query: 7 FFIVIIGLIVVAASFRFMQQR 27 +V+I AA ++F++Q+ Sbjct: 173 ALMVLIQSTSEAARYKFIEQQ 193
>PF01206#SirA family protein Length = 76 Score = 105 bits (265), Expect = 3e-34 Identities = 24/72 (33%), Positives = 41/72 (56%) Query: 9 DHTLDALGLRCPEPVMMVRKTVRNMQPGETLLIIADDPATTRDIPGFCTFMEHELVAKET 68 D +LDA GL CP P++ +KT+ M GE L ++A DP + +D F HEL+ ++ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 69 DGLPYRYLIRKG 80 + Y + +++ Sbjct: 65 EDGTYHFRLKRA 76
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 51.7 bits (124), Expect = 2e-09 Identities = 80/398 (20%), Positives = 147/398 (36%), Gaps = 32/398 (8%) Query: 13 LRLNLRIVSIVMFNFASYLTIGLPLAVLPGYVHDVM--GFSAFWAGLVISLQYFATLLSR 70 ++ N ++ I+ + IGL + VLPG + D++ G++++L Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60 Query: 71 PHAGRYADLLGPKKIVVFGLCGCFLSGLGYLTAGLTASLPVISLLLLCLGRVILGI-GQS 129 P G +D G + +++ L G + + Y L V L +GR++ GI G + Sbjct: 61 PVLGALSDRFGRRPVLLVSLAG---AAVDYAIMATAPFLWV-----LYIGRIVAGITGAT 112 Query: 130 FAGTGSTLWGVGVVGSL--HIGRVISWNGIVNYGAMAMGAPLGVVFYHWGGLQALALIIM 187 A G+ + + H G + + G +G +G H A AL + Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGL 172 Query: 188 GVALVAILLAIPRPTVK--ASKGKPLPFRAVLGRVWLYGMALALA-----SAGFGVIATF 240 LL + + P + + +A +A V A Sbjct: 173 NFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAAL 232 Query: 241 ITLFYDAK-GWDGAAFALTLFSCAFVGT---RLLFPNGINRIGGLNVAMICFSVEIIGLL 296 +F + + WD ++L + + + ++ R+G M+ + G + Sbjct: 233 WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYI 292 Query: 297 LVGVATMPWMAKVG-VLLAGAGFSLVFPALGVVAVKAVPQQNQGAALATYTVFMDLSLGV 355 L+ AT WMA VLLA G + PAL + + V ++ QG + L+ + Sbjct: 293 LLAFATRGWMAFPIMVLLASGGIGM--PALQAMLSRQVDEERQGQLQGSLAALTSLT-SI 349 Query: 356 TGPLAGLVMSWAGVPV----IYLAAAGLVAIALLLTWR 389 GPL + A + ++A A L + L R Sbjct: 350 VGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.0 bits (65), Expect = 0.020 Identities = 10/34 (29%), Positives = 19/34 (55%) Query: 25 QAVLNNVSLTLKSGETVALLGRSGCGKSTLARLL 58 Q + ++ +++ T+ + G SG GK +AR L Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 49.9 bits (119), Expect = 4e-09 Identities = 42/171 (24%), Positives = 74/171 (43%), Gaps = 7/171 (4%) Query: 201 REREHGTVEHLLVMPITPFEIMMAKI-WSMGLVVLVVSGLSLVLMAKGVLGVPIEGSIPL 259 R T E +L + +I++ ++ W+ L +G+ +V A G + L Sbjct: 93 RMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGY----TQWLSLL 148 Query: 260 FMLGV-ALSLFATTSIGIFMGTIARSMPQLGLLVILVLLPLQMLSGGSTPRESMPQMVQD 318 + L V AL+ A S+G+ + +A S LV+ P+ LSG P + +P + Q Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208 Query: 319 IMLTMPTTHFVSLAQAILYRGAGFEIVWPQFLTLMAIGGAFF-TIALLRFR 368 +P +H + L + I+ ++ + I FF + ALLR R Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRRR 259
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.044 Identities = 9/26 (34%), Positives = 14/26 (53%) Query: 20 ARCMVGLIGPDGVGKSSLLSLISGAR 45 V L G G+GKS+L++ + G Sbjct: 595 FDYSVVLEGTGGIGKSTLINTLVGLD 620
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 85.3 bits (211), Expect = 2e-20 Identities = 72/408 (17%), Positives = 141/408 (34%), Gaps = 81/408 (19%) Query: 6 RHLAWWVVGLLVVAAVVAWWLLRPAGVP-EGFAVSNGRIEATEVDIASKIAGRIDTILVK 64 R +A++++G LV+A +++ G +GR + I + I+VK Sbjct: 58 RLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKE----IKPIENSIVKEIIVK 113 Query: 65 EGQFVREGEVLAKMDTRV----------------LQEQRLEAI----------------- 91 EG+ VR+G+VL K+ L++ R + + Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173 Query: 92 -------------------AQIKEAQSAVAAAQALLEQRQSETRAAQSLVNQRQSELDSV 132 Q Q+ + L+++++E + +N+ ++ Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233 Query: 133 AKRHTRSRSLAQRGAISAQQLDDDRAAAESARAALESAKAQVSASKAAIEAARTNIIQ-- 190 R SL + AI+ + + A L K+Q+ ++ I +A+ Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293 Query: 191 -----------AQTRVEAAQATERRIAADID--DSELKAPRDGRV-QYRVAEPGEVLAAG 236 QT T + S ++AP +V Q +V G V+ Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353 Query: 237 GRVLNMVDLSDVY-MTFFLPTEQAGTLKLGGEARLILDAAPDLRIPATISFVASVAQFTP 295 ++ +V D +T + + G + +G A + ++A P R V V Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYG---YLVGKVKNINL 410 Query: 296 KTVETSDERLKLMFRVKARIPPELLQQHLEYV--KTGLPGVAWVRVNE 341 +E D+RL L+F V I L + + +G+ A ++ Sbjct: 411 DAIE--DQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGM 456
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 246 bits (630), Expect = 3e-87 Identities = 77/154 (50%), Positives = 111/154 (72%) Query: 5 AIYPGTFDPITNGHIDIVTRATQMFDHVILAIAASPSKKPMFTLEERVALTQQATAHLGN 64 AIYPG+FDPIT GH+DI+ R ++FD V +A+ +P+K+PMF+++ER+ +A AHL N Sbjct: 3 AIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLPN 62 Query: 65 VEVVGFSDLMANFARNQHATVLIRGLRAVADFEYEMQLAHMNRHLMPELESVFLMPSKEW 124 +V F L N+AR + A ++RGLR ++DFE E+Q+A+ N+ L +LE+VFL S E+ Sbjct: 63 AQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTSTEY 122 Query: 125 SFISSSLVKEVARHQGDVTHFLPENVHQALVAKL 158 SF+SSSLVKEVAR G+V HF+P +V AL + Sbjct: 123 SFLSSSLVKEVARFGGNVEHFVPSHVAAALYDQF 156
>PF03895#Serum resistance protein DsrA. Length = 79 Score = 64.8 bits (158), Expect = 6e-15 Identities = 19/79 (24%), Positives = 36/79 (45%), Gaps = 2/79 (2%) Query: 1539 ESKLSGGIASAMAMTGLPQAYTPGASMASIGGGTYNGESAVALGV-SMVSANGRWVYKLQ 1597 +L G+A+ A++ L Q G + S G Y ++A+A+GV S ++ + Sbjct: 2 SKELQTGLANQSALSMLVQPNGVGKTSVSAAVGGYRDKTALAIGVGSRITDRFTAKAGVA 61 Query: 1598 GSTNSQGEYSAALGAGIQW 1616 +T + G S G ++ Sbjct: 62 FNTYN-GGMSYGASVGYEF 79
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 65.2 bits (159), Expect = 1e-13 Identities = 56/314 (17%), Positives = 103/314 (32%), Gaps = 82/314 (26%) Query: 75 ITPQVTGIVTEVTDKNNQLIQKGEVLFKLDPVR------------YQARVD--RLQA--- 117 I P IV E+ K + ++KG+VL KL + QAR++ R Q Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR 158 Query: 118 ------------------------DLMTATHNIK----TLRAQLTEAQANTTQVSAERDR 149 +++ T IK T + Q + + N + AER Sbjct: 159 SIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLT 218 Query: 150 LFKNYQRY----------LKGSQAAVNPFS---------ERDIDDARQNF---LAQDALV 187 + RY L + ++ + E +A +Q + Sbjct: 219 VLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQI 278 Query: 188 KGSVAE----QAQIQSQLDSMVNGE----QSQIVSLRAQLTEAKYNLEQTVIRAPSNGYV 239 + + + + + + I L +L + + + +VIRAP + V Sbjct: 279 ESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKV 338 Query: 240 TQVLIR-PGTYAAALPLRPVMVFIPEQKRQIV-AQFRQNSLLRLKPGDDAEVVFNALPGQ 297 Q+ + G +MV +PE V A + + + G +A + A P Sbjct: 339 QQLKVHTEGGVVT--TAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYT 396 Query: 298 VFH---GKLTSILP 308 + GK+ +I Sbjct: 397 RYGYLVGKVKNINL 410
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 27.3 bits (60), Expect = 0.028 Identities = 20/101 (19%), Positives = 37/101 (36%), Gaps = 18/101 (17%) Query: 31 AAIEKRQKEIADGLASAERAHKDLDLAKASATDQLKKAKAEAQVIIEQ--ANKRRSQILD 88 +EK +++ + A K+ + T + A++ ++ Q K + + Sbjct: 1049 KTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEK 1108 Query: 89 EAKAEAEQERTKIVA----------------QAQAEIEAER 113 E KA+ E E+T+ V Q QAE E Sbjct: 1109 EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREN 1149
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 29.2 bits (65), Expect = 0.046 Identities = 23/80 (28%), Positives = 31/80 (38%), Gaps = 10/80 (12%) Query: 367 LGDAEIGDNVNIGAGTITCNYDGANKFKTIIGDDVFVGSDTQLVAPVTVGKGATIAAGTT 426 LGD + D V + AG+ N G DV T G AT A T Sbjct: 616 LGDGD--DKVFLSAGSA--NIYAGK------GHDVVYYDKTDTGYLTIDGTKATEAGNYT 665 Query: 427 VTRNVGENALAISRVPQTQK 446 VTR +G + + V + Q+ Sbjct: 666 VTRVLGGDVKVLQEVVKEQE 685
>SECA#SecA protein signature. Length = 901 Score = 31.0 bits (70), Expect = 0.002 Identities = 11/71 (15%), Positives = 30/71 (42%) Query: 14 AKARRKTREELDQEARDRKRQKKRRGHAPGSRAAGGNTTSGSKGQNAPKDPRIGSKTPIP 73 +K + + EE+++ + R+ + +R ++ + + + ++G P P Sbjct: 827 SKVQVRMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCP 886 Query: 74 LGVAEKVTKQH 84 G +K + H Sbjct: 887 CGSGKKYKQCH 897
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 601 bits (1550), Expect = 0.0 Identities = 206/478 (43%), Positives = 300/478 (62%), Gaps = 11/478 (2%) Query: 1 MQRGIVWVVDDDSSIRWVLERALAGAGLTCTTFENGAEVLEALASKTPDVLLSDIRMPGM 60 M + V DDD++IR VL +AL+ AG N A + +A+ D++++D+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 DGLALLKQIKQRHPMLPVIIMTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVERAIS 120 + LL +IK+ P LPV++M+A + A+ A ++GA+DYLPKPFD+ E + ++ RA++ Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 121 HYQEQQQPRNIQLNGPTTDIIGEAPAMQDVFRIIGRLSRSSISVLINGESGTGKELVAHA 180 + + ++G + AMQ+++R++ RL ++ ++++I GESGTGKELVA A Sbjct: 121 EPKRRPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179 Query: 181 LHRHSPRAKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTIRQGRFEQADGGTLFLDE 240 LH + R PF+A+NMAAIP+DLIESELFGHEKGAFTGA T GRFEQA+GGTLFLDE Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239 Query: 241 IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLEQRVQEGKFREDLFHR 300 IGDMP+D QTRLLRVL G++ VGG P++ DVRI+AAT+++L+Q + +G FREDL++R Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299 Query: 301 LNVIRVHLPPLRERREDIPRLARHFLQVAARELGVEAKLLHPETEAALTRLAWPGNVRQL 360 LNV+ + LPPLR+R EDIP L RHF+Q A +E G++ K E + WPGNVR+L Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVREL 358 Query: 361 ENTCRWLTVMAAGQEVLIQDLPGELFESTVAESTSQMQPDSWATLLAQWADRALRS---- 416 EN R LT + + + + EL + S + ++Q + +R Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418 Query: 417 -----GHQNLLSEAQPELERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME 469 L E+E L+ AL T+G++ +AA LLG RNTL +K++ELG+ Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>PF06580#Sensor histidine kinase Length = 349 Score = 28.3 bits (63), Expect = 0.042 Identities = 34/190 (17%), Positives = 72/190 (37%), Gaps = 41/190 (21%) Query: 171 IIEQADRLRNLVDRL---LGPQLPGTRVTE-SIHKVAERV---VTLVSMELPDNVRLIRD 223 I+E + R ++ L + L + + S+ V + L S++ D ++ Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245 Query: 224 YDPSLPELAHDPDQIEQVLLN-IVRNALQ---ALGPEGGEIILRTRTAFQLTLHGERYRL 279 +P++ ++ Q+ +L+ +V N ++ A P+GG+I+L+ Sbjct: 246 INPAIMDV-----QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGT------KDNGTVT- 293 Query: 280 AARIDVEDNGPGIPPHLQDTLFYPMVSGREGGTGLGLSIARNLIDQHSGK---IEFTSWP 336 ++VE+ G + ++ TG GL R + G I+ + Sbjct: 294 ---LEVENTGSLALKNTKE------------STGTGLQNVRERLQMLYGTEAQIKLSEKQ 338 Query: 337 GHTEFSVYLP 346 G V +P Sbjct: 339 GKVNAMVLIP 348
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 149 bits (377), Expect = 2e-40 Identities = 81/404 (20%), Positives = 149/404 (36%), Gaps = 79/404 (19%) Query: 1 MDSNDLEKERGITILAKNTAIKWNDYRINIVDTPGHADFGGEVERVMSMVDSVLLVVDAF 60 D+ LE++RGITI T+ +W + ++NI+DTPGH DF EV R +S++D +L++ A Sbjct: 43 TDNTLLERQRGITIQTGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAK 102 Query: 61 DGPMPQTRFVTKKAFAYGLKPIVVINKVDRPGARPDWVVDQVFD-------------LFV 107 DG QTR + G+ I INK+D+ G V + + L+ Sbjct: 103 DGVQAQTRILFHALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYP 162 Query: 108 NLDATDEQLD-----------------------------------------FPIVYASAL 126 N+ T+ FP+ + SA Sbjct: 163 NMCVTNFTESEQWDTVIEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAK 222 Query: 127 NGIAGLDHEDMAEDMTPLYQAIVDHVPAPDVDLDGPFQMQISQLDYNSYVGVIGIGRIKR 186 N I G+D+ L + I + + ++ +++Y+ + R+ Sbjct: 223 NNI-GIDN---------LIEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYS 272 Query: 187 GKVKPNQQVTIIDSEGKTRNAKVGKVLGHLGLERIETDLAEAGDIVAITGLGELNISDTV 246 G + V I + E K+ ++ + E + D A +G+IV + L ++ + Sbjct: 273 GVLHLRDSVRISEKEKI----KITEMYTSINGELCKIDKAYSGEIVILQNEF-LKLNSVL 327 Query: 247 CDTQNVEALPALSVDEPTVSMFFCVNTSPFCGKEGKFVTSRQILDRLNKELVHNVALRVE 306 DT+ + + P + + + D L LR Sbjct: 328 GDTKLLPQRERIENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPL---------LRYY 378 Query: 307 ETEDADAFRVSGRGELHLSVLIENMRRE-GFELAVSRPKVIFRE 349 +S G++ + V ++ + E+ + P VI+ E Sbjct: 379 VDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIYME 422 Score = 32.9 bits (75), Expect = 0.004 Identities = 13/75 (17%), Positives = 29/75 (38%), Gaps = 1/75 (1%) Query: 356 EPYENVTLDVEEQHQGSVMQALGERKGDLKNMNPDGKGRVRLDYVIPSRGLIGFRSEFMT 415 EPY + + +++ + ++ + V L IP+R + +RS+ Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKN-NEVILSGEIPARCIQEYRSDLTF 595 Query: 416 MTSGTGLLYSTFSHY 430 T+G + + Y Sbjct: 596 FTNGRSVCLTELKGY 610
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 42.0 bits (98), Expect = 2e-06 Identities = 32/155 (20%), Positives = 64/155 (41%), Gaps = 5/155 (3%) Query: 114 LTPEQRQLLEQMQADMRQQPTQLVEVPWNEQTPEQRQQTLQRQRQAQQLAEQQRLAQQSR 173 + +QAD+ P+ E+ ++ P + +AE + Q+S+ Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSK--QESK 1049 Query: 174 TTEQSWQQQT-RTSQAAPVQAQPRQSKPASTQQPYQDLLQTPAHTTAQSKPQQAAPVARA 232 T E++ Q T T+Q V + + + A+TQ + T ++ ++ A V + Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKE 1109 Query: 233 ADAPKPTAEKKDERRWMVQCGSFRGAEQAETVRAQ 267 A T + ++ + Q + EQ+ETV+ Q Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSPKQ--EQSETVQPQ 1142
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 29.9 bits (67), Expect = 0.006 Identities = 6/21 (28%), Positives = 11/21 (52%) Query: 179 FGNLDDPNSEISQLLRQKPTY 199 GNL++P ++ L+ P Sbjct: 75 TGNLEEPAVMVNYFLQGDPYQ 95
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 32.6 bits (74), Expect = 3e-04 Identities = 20/84 (23%), Positives = 32/84 (38%), Gaps = 5/84 (5%) Query: 50 HLALLDGEVVGMIGLHLQFHLHHVNWIGEIQELVVMPQARGLNVGSKLLAWAEEEARQAG 109 L L+ +G I + + N I+++ V R VG+ LL A E A++ Sbjct: 68 FLYYLENNCIGRIKIRSNW-----NGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENH 122 Query: 110 AEMTELSTNVKRHDAHRFYLREGY 133 L T A FY + + Sbjct: 123 FCGLMLETQDINISACHFYAKHHF 146
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.7 bits (66), Expect = 0.012 Identities = 17/70 (24%), Positives = 25/70 (35%), Gaps = 8/70 (11%) Query: 36 CVVLHGHSGSGKSTLLRSLYANYLPDEGQIQIKHGDEWVDLVTAPARKVVEI------RK 89 VVL G G GKSTL+ +L + I G + + + E+ R+ Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIV--AYELSEMTAFRR 655 Query: 90 TTVGWVSQFL 99 V F Sbjct: 656 ADAEAVKAFF 665
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 69.9 bits (171), Expect = 5e-16 Identities = 31/109 (28%), Positives = 50/109 (45%), Gaps = 4/109 (3%) Query: 4 VLIIDDDAMVAELNRRYVAQIPGFQCCGTASTLEKAKEIIFNSDTPIDLILLDIYMQKEN 63 +L+ DDDA + + + +++ G+ S I + DL++ D+ M EN Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVR-ITSNAATLWRWI--AAGDGDLVVTDVVMPDEN 61 Query: 64 GLDLLPVLHNARCKSDVIVISSAADAATIKDSLHYGVVDYLIKPFQASR 112 DLLP + AR V+V+S+ T + G DYL KPF + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTE 110
>PF06580#Sensor histidine kinase Length = 349 Score = 41.0 bits (96), Expect = 8e-06 Identities = 21/99 (21%), Positives = 38/99 (38%), Gaps = 18/99 (18%) Query: 442 LIENALE-ALGP-EPGGEISVTLHYRHGWLHCEVNDDGPGIAPDKIDHIFDKGVSTKGSE 499 L+EN ++ + GG+I + +G + EV + G + Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN------------TKES 310 Query: 500 RGVGLALVKQQVENLGG---SIAVESEPGIFTQFFVQIP 535 G GL V+++++ L G I + + G V IP Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 26.4 bits (58), Expect = 0.012 Identities = 9/28 (32%), Positives = 16/28 (57%) Query: 32 LAIIEHTDVDESLKGQGIGKQLVAKVVE 59 A+IE V + + +G+G L+ K +E Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIE 116
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 30.2 bits (68), Expect = 0.022 Identities = 36/190 (18%), Positives = 66/190 (34%), Gaps = 14/190 (7%) Query: 44 NHAISLFSAYA-SLVYVTPILGGWLADRLLGNRTAVIAGALLMTLGHVVLGIDTNSTFSL 102 H L + YA P+LG +DR G R ++ + + ++ + L Sbjct: 43 AHYGILLALYALMQFACAPVLGAL-SDRF-GRRPVLLVSLAGAAVDYAIMAT-APFLWVL 99 Query: 103 YLALAIIICGYGLFKSNISCLLGELYDEND-HRRDGGFSLLYAAGNIGSIAAPIACGLAA 161 Y+ + G+ + + + D D R F + A G +A P+ GL Sbjct: 100 YIGRIV----AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMG 155 Query: 162 QWYGWHVGFALAGGGMFIGLLIFLSGHRHFQSTRSMDKKALTSVKF-ALPVWSWLVVMLC 220 + H F A + L FL+G + +++ L L + W M Sbjct: 156 G-FSPHAPFFAAA---ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTV 211 Query: 221 LAPVFFTLLL 230 +A + + Sbjct: 212 VAALMAVFFI 221
>SECA#SecA protein signature. Length = 901 Score = 32.2 bits (73), Expect = 0.005 Identities = 26/144 (18%), Positives = 54/144 (37%), Gaps = 6/144 (4%) Query: 282 HVIDAADVRVQENIEAVNTVLEEIDAHEIPTLLVMNKIDMLEDFEPRIDRDEENK-PIRV 340 ++D +DV N + IDA+ P L ++ + + R+ D + PI Sbjct: 665 ELLDVSDVSETINSIREDVFKATIDAYIPPQSL--EEMWDIPGLQERLKNDFDLDLPIAE 722 Query: 341 WLSAQTGAGIPQLFQALTERLSGEVAQHTLRLPPQEGRLRSRFYQLQAIEKEWMEEDGSV 400 WL + L + + + + + + R + LQ ++ W E ++ Sbjct: 723 WLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAAM 782 Query: 401 SLQVRMPIVDWRRLCKQEPALIDY 424 +R I R +++P +Y Sbjct: 783 D-YLRQGIH-LRGYAQKDP-KQEY 803
>cloacin#Cloacin signature. Length = 551 Score = 31.6 bits (71), Expect = 0.006 Identities = 25/81 (30%), Positives = 30/81 (37%), Gaps = 10/81 (12%) Query: 17 GSSKPGGNSEGNGNKGGRDQGPPDLDDIFRKLSKKLGGLGGGKGTGSGGGSSSQGP---- 72 S G +SE N GG G G GGG GTG G S+ P Sbjct: 33 ASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTG-GNLSAVAAPVAFG 91 Query: 73 -----RPQLGGRVVTIAAAAI 88 P GG V+I+A A+ Sbjct: 92 FPALSTPGAGGLAVSISAGAL 112
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 61.1 bits (148), Expect = 6e-13 Identities = 43/240 (17%), Positives = 90/240 (37%), Gaps = 13/240 (5%) Query: 38 TPQRIVVLELSFADALAAVDVSPIGIADDNDAKRILPEVRAHLKPWQSVGTRAQPSLEAI 97 P RIV LE + L A+ + P G+AD + + + E VG R +P+LE + Sbjct: 34 DPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSE-PPLPDSVIDVGLRTEPNLELL 92 Query: 98 AALKPDLIIADSSRHAGVYIALQQIASVLLLKSR--NETYAENLQSAAIIGEMVGKKREM 155 +KP ++ S+ + L +IA + A +S + +++ + Sbjct: 93 TEMKPSFMVW-SAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNLQSAA 151 Query: 156 QARLEQHKERMAQWASQLPKGTR---VAFGTSREQQFNLHTQETWTGSVLASLGLNVPAA 212 + L Q+++ + + K + + + + +L G +P A Sbjct: 152 ETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYG--IPNA 209 Query: 213 MAGAS----MPSIGLEQLLAVNPAWLLVAHYREESIVKRWQQDPLWQMLTAAQKQQVASV 268 G + ++ +++L A +L + + PLWQ + + + V Sbjct: 210 WQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATPLWQAMPFVRAGRFQRV 269
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 32.9 bits (75), Expect = 0.004 Identities = 19/89 (21%), Positives = 29/89 (32%), Gaps = 9/89 (10%) Query: 546 GSFGTVQYSQIGKAVQSGNVEPEKARTWELGTRYDDGALTAEMGLFLINFNNQYDSNQTN 605 G F + NV EK + L + YD+ AL A + Q D+ Sbjct: 187 GFFVQYGGAYKRHHQVQENVNIEKYQIHRLVSGYDNDALYASV------AVQQQDAKLVE 240 Query: 606 DTVTARGKTRHTGLETQARYDLGTLTPTL 634 + T + Y G +TP + Sbjct: 241 E---NYSHNSQTEVAATLAYRFGNVTPRV 266
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 29.4 bits (66), Expect = 0.016 Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 10/69 (14%) Query: 187 FISGTGFATDYRRLSGHALKGSEIIRLVEESDPVAELALRRYELRLAKSLAHVVNILDP- 245 +G ++D+R L A + D A+LAL + R+ K++ + Sbjct: 273 VYGISGISSDFRDLEDAAF---------KNGDKRAQLALNVFAYRVKKTIGSYAAAMGGV 323 Query: 246 DVIVLGGGM 254 DVIV G+ Sbjct: 324 DVIVFTAGI 332
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 52.5 bits (126), Expect = 1e-09 Identities = 74/356 (20%), Positives = 126/356 (35%), Gaps = 35/356 (9%) Query: 5 ILSLALGTFGLGMAEFGIMGVLTELAHNVGISIPAAGH---MISYYALGVVVGAPIIALF 61 + ++AL G+G+ IM VL L ++ S H +++ YAL AP++ Sbjct: 11 LSTVALDAVGIGL----IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 62 SSRYSLKHILLFLVALCVIGNAMFTLSSSYLMLAIGRLVSGFPHGAFFGVGAIVLSKIIK 121 S R+ + +LL +A + A+ + +L IGR+V+G GA + I Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIAD--IT 124 Query: 122 PGKVTAAVAGMVSGMTVANLLGIPLGTYLSQEFSWRYTFLLIAVFNIAVMASVYFWVPDI 181 G A G +S ++ P+ L FS F A N + F +P+ Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184 Query: 182 RDEAKGNLREQ----------FHFLRSPAPWLI--FAATMFGNAGVFAWFSYVKPYMMFI 229 + LR + + A + F + G W + Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG------E 238 Query: 230 SVFSETAMTFIMMLVGLGM---VLGNMLSGRISGRYSPLRIAAVTDFIIVLALLMLFFCG 286 F A T + L G+ + M++G ++ R R + ++L F Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298 Query: 287 GMKTTSLIFAFICCAGLFALSAPLQILLLQNAKGGELLGAAGGQIAF--NLGSAVG 340 I + G+ LQ +L + E G G +A +L S VG Sbjct: 299 RGWMAFPIMVLLASGGIG--MPALQAMLSRQV-DEERQGQLQGSLAALTSLTSIVG 351
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 39.4 bits (92), Expect = 6e-05 Identities = 34/199 (17%), Positives = 71/199 (35%), Gaps = 14/199 (7%) Query: 671 QQEAQSWQQRQNELTALQNRIQQLTPILETLPQSDDLPHSEETVALDNWRQVHEQCLALH 730 + + Q + Q R Q L+ +E + E + +V + Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK 192 Query: 731 SQQQTLQQQDVLAAQSLQKAQAQFDTAL--------QASVFDDQQAFLAALMDEQTLTQL 782 Q T Q Q +L K +A+ T L + V + ++L+ +Q + Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIA-- 250 Query: 783 EQLKQNLENQRRQAQTLVTQTAETLAQHQQHRPDGLALTVTVEQIQQEL-AQTHQKLREN 841 K + Q + V + +Q +Q + L+ + + Q + KLR+ Sbjct: 251 ---KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQT 307 Query: 842 TTSQGEIRQQLKQDADNRQ 860 T + G + +L ++ + +Q Sbjct: 308 TDNIGLLTLELAKNEERQQ 326 Score = 38.7 bits (90), Expect = 1e-04 Identities = 25/204 (12%), Positives = 59/204 (28%), Gaps = 18/204 (8%) Query: 487 EARIKTLEAQRAQLQAGQPCPLCGSTSHPAVEAYQALEPGVNQSRLLALENEVKKLGEEG 546 EA ++ Q + Q ++E + E + +E + L Sbjct: 133 EADTLKTQSSLLQARLEQ---TRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLT- 188 Query: 547 AALRGQLDALTKQLQRDENEAQSLRQDEQALTQQWQAVTTSLNITLQPQDDIQPWLDAQD 606 + ++ Q Q + E R + + + + DD L Q Sbjct: 189 SLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQA 248 Query: 607 -------EHERQL-RLLSQRHELQGQIAAHNQQIIQYQQQIEQRQQQLLTALTGYALTLP 658 E E + +++ + Q+ +I+ +++ + Q L Sbjct: 249 IAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF------KNEILD 302 Query: 659 QEDEEESWLATRQQEAQSWQQRQN 682 + + + E ++RQ Sbjct: 303 KLRQTTDNIGLLTLELAKNEERQQ 326 Score = 33.3 bits (76), Expect = 0.006 Identities = 16/150 (10%), Positives = 42/150 (28%), Gaps = 5/150 (3%) Query: 731 SQQQTLQQQDVLAAQSLQKAQAQFDTA----LQASVFDDQQAFLAALMDEQTLTQLEQLK 786 + Q + A + Q + L D+ F +E+ L +K Sbjct: 134 ADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVS-EEEVLRLTSLIK 192 Query: 787 QNLENQRRQAQTLVTQTAETLAQHQQHRPDGLALTVTVEQIQQELAQTHQKLRENTTSQG 846 + + Q + A+ + L L + ++ Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKH 252 Query: 847 EIRQQLKQDADNRQQQQTLLQQIAQMTQQV 876 + +Q + + + + Q+ Q+ ++ Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEI 282
>FRAGILYSIN#Fragilysin metallopeptidase (M10C) enterotoxin signature. Length = 405 Score = 29.7 bits (66), Expect = 0.022 Identities = 13/70 (18%), Positives = 23/70 (32%), Gaps = 4/70 (5%) Query: 149 KQQHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGASKSDAVRDIYIGTLDAFP 208 K+ ++ I ++Y + + + I T D + + I A Sbjct: 135 KEAQMMNEIAEFYAAPFKKTRAINEKEAFECI-YDSRTRSA--GKD-IVSVKINIDKAKK 190 Query: 209 AQNFPPADYI 218 N P DYI Sbjct: 191 ILNLPECDYI 200
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 93.4 bits (232), Expect = 3e-24 Identities = 33/149 (22%), Positives = 61/149 (40%), Gaps = 9/149 (6%) Query: 4 RILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDCAVNQLNEPWPDLILLDWMLPGGSGIQ 63 ILV +D+A IR ++ L + G+ + + DL++ D ++P + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 64 FIKHLKRESMTRDIPVVMLTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRI 123 + +K+ D+PV++++A+ ++ E GA DY+ KPF EL+ I + Sbjct: 65 LLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA-- 120 Query: 124 SPMAVEEVIEMQGLSLDPTSHRVMAGEEP 152 E L D + G Sbjct: 121 -----EPKRRPSKLEDDSQDGMPLVGRSA 144
>PF06580#Sensor histidine kinase Length = 349 Score = 34.1 bits (78), Expect = 0.001 Identities = 19/105 (18%), Positives = 33/105 (31%), Gaps = 26/105 (24%) Query: 325 LVYNAVNH----TPEGTHITVRWQRVPHGAEFSVEDNGPGIAPEHIPRLTERFYRVDKAR 380 LV N + H P+G I ++ + VE+ G Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN---------------- 306 Query: 381 SRQTGGSGLGLAIVKHAVNH---HESRLNIESTVGKGTRFSFVIP 422 +G GL V+ + E+++ + GK +IP Sbjct: 307 --TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 47.2 bits (112), Expect = 3e-08 Identities = 36/146 (24%), Positives = 63/146 (43%), Gaps = 5/146 (3%) Query: 197 AREREQGTLDQLLVSPLTTWQIFIGKAVPALIVATFQATIVLAIGIWAYQIPFAGSLALF 256 R Q T + +L + L I +G+ A A IG+ A + + L+L Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGA---GIGVVAAALGYTQWLSLL 148 Query: 257 YFTMVI--YGLSLVGFGLLISSLCSTQQQAFIGVFVFMMPAILLSGYVSPVENMPVWLQN 314 Y VI GL+ G+++++L + + + P + LSG V PV+ +P+ Q Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208 Query: 315 LTWINPIRHFTDITKQIYLKDASLDI 340 P+ H D+ + I L +D+ Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDV 234
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.022 Identities = 17/86 (19%), Positives = 24/86 (27%), Gaps = 13/86 (15%) Query: 293 TPRFEDAFIDLLGGAGTSESPLGAILHTVEGTPGETVIEAKELTKKFGDFAATDHVNFAV 352 PR E + +LG P + + + K + Sbjct: 547 VPRLEKWLVHVLGKTPDDYKP-------------RRLRYLQLVGKYILMGHVARVMEPGC 593 Query: 353 KHGEIFGLLGPNGAGKSTTFKMMCGL 378 K L G G GKST + GL Sbjct: 594 KFDYSVVLEGTGGIGKSTLINTLVGL 619 Score = 29.7 bits (66), Expect = 0.045 Identities = 11/23 (47%), Positives = 13/23 (56%) Query: 34 YVTGLVGPDGAGKTTLMRMLAGL 56 Y L G G GK+TL+ L GL Sbjct: 597 YSVVLEGTGGIGKSTLINTLVGL 619
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 62.5 bits (152), Expect = 6e-13 Identities = 42/259 (16%), Positives = 91/259 (35%), Gaps = 25/259 (9%) Query: 83 ALMQAKAGVSVAQAQYDLMLAGYRDEEIAQAAAAVKQAQAAYDYAQNFYNRQQGLWKSRT 142 Q + + +A+ +LA E + + + + L + Sbjct: 201 QKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENK 260 Query: 143 ISA--NDLENARSSRDQAQATLKSAQDKLRQYRSGNREQ---DIAQAKASLEQAQAQLAQ 197 N+L +S +Q ++ + SA+++ + + + + Q ++ +LA+ Sbjct: 261 YVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAK 320 Query: 198 AELNLQDSTLIAPSDGTLLTRAV-EPGTVLNEGGTVFTVSLT-RPVWVRAYVDERYLDQA 255 E Q S + AP + V G V+ T+ + + V A V + + Sbjct: 321 NEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFI 380 Query: 256 QPGRKVLLYTDGRPDKPYH---GQIGFVSPTAEFTPKTVETPDLRTDLVYRLRIVVT--- 309 G+ ++ + P Y G++ ++ A D R LV+ + I + Sbjct: 381 NVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--------IEDQRLGLVFNVIISIEENC 432 Query: 310 ----DADDALRQGMPVTVQ 324 + + L GM VT + Sbjct: 433 LSTGNKNIPLSSGMAVTAE 451
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 73.9 bits (181), Expect = 3e-18 Identities = 34/214 (15%), Positives = 77/214 (35%), Gaps = 17/214 (7%) Query: 13 KGEQAKKQLIAAALAQFGEYGMNATT-REIAAQAGQNIAAITYYFGSKEDLYLACAQWIA 71 + ++ ++ ++ AL F + G+++T+ EIA AG AI ++F K DL+ + Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67 Query: 72 DFIGEQFRPHAEEAERLFAQPQPDRAAIRELLLRACRNMIKLLTQDDTVNLSKFISREQL 131 IGE E + P + +RE+L+ + + + + + F E + Sbjct: 68 SNIGELEL---EYQAKFPGDP---LSVLREILIHVLESTVTEERRRLLMEII-FHKCEFV 120 Query: 132 SPTAAYHLVHEQVISPLHSHLTRLIAAWTGSDANDTRMILHTHALIGEILAFRLGKETIL 191 A + + + + + +A L T + + G Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKH--CIEAKMLPADLMTRRAAIIMRGYISG----- 173 Query: 192 LRTGWTAFDEEKTELINQTVTCHIDLILQGLSQR 225 L W + + + ++ ++L+ Sbjct: 174 LMENWLFAPQSFD--LKKEARDYVAILLEMYLLC 205
>SECA#SecA protein signature. Length = 901 Score = 29.8 bits (67), Expect = 0.025 Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 4/67 (5%) Query: 246 QQVLVFTRTKHGANHLAEQLNKDGIRSAAIHG-NKSQGARTRALADFKSGDIRVLVATDI 304 Q VLV T + + ++ +L K GI+ ++ + A A A + + V +AT++ Sbjct: 450 QPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNM 506 Query: 305 AARGLDI 311 A RG DI Sbjct: 507 AGRGTDI 513
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 30.3 bits (68), Expect = 0.009 Identities = 21/54 (38%), Positives = 27/54 (50%), Gaps = 9/54 (16%) Query: 2 RRVFWLVAAALLLAGCAGEKGIVEKEGYQLDTRRQAQAAYPRIKVLVIHYTADD 55 R+V LV ALL AG A I K+G +LD Y ++ L HY +DD Sbjct: 3 RKVLALVIPALLAAGAAHAAEIYNKDGNKLDL-------YGKVDGL--HYFSDD 47
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 74.0 bits (182), Expect = 6e-17 Identities = 70/363 (19%), Positives = 123/363 (33%), Gaps = 65/363 (17%) Query: 13 MKVLVTGATSGLGRNAVEFLCQKGISVRA---------TGRNEAMGKLLEKMGAEFVPAD 63 MK LVTGA +G + + L + G V +A +LL + G +F D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 64 LTELVSSQAKVMLAGIDTLWHCS-------SFTSPWGTQQAFDLANVRATRRLGEWAVAW 116 L + + ++ S +P A+ +N+ + E Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPH----AYADSNLTGFLNILEGCRHN 116 Query: 117 GVRNFIHISSPSLYFDYHHHRDIKEDFRPHRFANEFARSKAASEEVINMLSQANPQTRFT 176 +++ ++ SS S+Y + D + +A +K A+E + + S T Sbjct: 117 KIQHLLYASSSSVYGL-NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSH-LYGLPAT 174 Query: 177 ILRPQSLFGPHDK--VFIPRLAHMMHHYGSILLPHGGSALVDMTYYENTVHAMWLASQEA 234 LR +++GP + + + + M SI + + G D TY ++ A+ Sbjct: 175 GLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVI 234 Query: 235 CDKLPS--------------GRVYNITNGEHRTLRSIVQKLIDELNIDCRIRSVPYPMLD 280 RVYNI N L +Q L D L I+ + +P D Sbjct: 235 PHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGD 294 Query: 281 MIARSMERLGRKSAKEPPLTHYGVSKLNFDFTLDITRAQEELGYQPIITLDEGIEKTAAW 340 + T D E +G+ P T+ +G++ W Sbjct: 295 V----------------LETS-----------ADTKALYEVIGFTPETTVKDGVKNFVNW 327 Query: 341 LRD 343 RD Sbjct: 328 YRD 330
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 55.9 bits (135), Expect = 1e-10 Identities = 29/125 (23%), Positives = 52/125 (41%), Gaps = 17/125 (13%) Query: 4 RILVLGASGYIGQHLVRTLSQQGHQILA---------AARHVDRLAKLQLANVSCHKVDL 54 + LV GA+G+IG H+ + L + GHQ++ + RL L HK+DL Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 55 SWPDNLPALLQD--IDTVYFLVH------SMGEGGDFIAQERQVALNVRDALREVPVKQL 106 + + + L + V+ H S+ + LN+ + R ++ L Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121 Query: 107 IFLSS 111 ++ SS Sbjct: 122 LYASS 126
>CHLAMIDIAOMP#Chlamydia major outer membrane protein signature. Length = 393 Score = 28.4 bits (63), Expect = 0.046 Identities = 19/67 (28%), Positives = 28/67 (41%), Gaps = 8/67 (11%) Query: 137 GVNGDAVDPKSVTSWADL------WKPEYKGSLLLTDDAREVFQMALRKLGYSGNTTDPK 190 G GD DP T+W D + ++ +L D + FQM + +GN T P Sbjct: 42 GFGGDPCDP--CTTWCDAISMRMGYYGDFVFDRVLKTDVNKEFQMGDKPTSTTGNATAPT 99 Query: 191 EIEAAYN 197 + A N Sbjct: 100 TLTAREN 106
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.0 bits (67), Expect = 0.017 Identities = 10/36 (27%), Positives = 19/36 (52%), Gaps = 1/36 (2%) Query: 46 LTLLGPSGCGKTTVLRLIAGLE-TVDSGRIMLDNED 80 + L G G GK+T++ + GL+ D+ + +D Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKD 634
>PF06580#Sensor histidine kinase Length = 349 Score = 28.7 bits (64), Expect = 0.048 Identities = 11/69 (15%), Positives = 22/69 (31%), Gaps = 20/69 (28%) Query: 389 NACKYCLE------FVEISARQTDEHLYIVVEDDGPGIPLSKREVIFDRGQRVDTLRPGQ 442 N K+ + + + + + + + VE+ G + +E Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE--------------ST 311 Query: 443 GVGLAVARE 451 G GL RE Sbjct: 312 GTGLQNVRE 320
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 86.8 bits (215), Expect = 5e-22 Identities = 31/124 (25%), Positives = 62/124 (50%) Query: 2 RVLVVEDNALLRHHLKVQIQDAGHQVDDAEDAKEADYYLNEHLPDIAIVDLGLPDEDGLS 61 +LV +D+A +R L + AG+ V +A ++ D+ + D+ +PDE+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 LIRRWRSNDVSLPILVLTARESWQDKVEVLSAGADDYVTKPFHIEEVMARMQALMRRNSG 121 L+ R + LP+LV++A+ ++ ++ GA DY+ KPF + E++ + + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 122 LASQ 125 S+ Sbjct: 125 RPSK 128
>PF05844#YopD protein Length = 295 Score = 33.1 bits (75), Expect = 0.001 Identities = 12/28 (42%), Positives = 22/28 (78%), Gaps = 2/28 (7%) Query: 76 MDLLALLYRLMAKSRQMGMFSLERDIEN 103 ++LL +L+R+ K+R++G+ L+RD EN Sbjct: 74 VELLLILFRIAQKARELGV--LQRDNEN 99
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.6 bits (71), Expect = 0.005 Identities = 23/93 (24%), Positives = 35/93 (37%), Gaps = 11/93 (11%) Query: 46 LISISSPKELIQIAEYFRTPLATAVTGGDRISNSESPIPGGGDDYTQSQGEVNKQPNIED 105 L +SSP A P + G + ++ PGGGDD GE ++D Sbjct: 384 LADVSSPTAAAGGAGGGEPPKKRDPSAG---AGTDPGGPGGGDD-----GEDPFGEWLDD 435 Query: 106 LKKRM---EQSRLRKLRGDLDQLIESDPKLRAL 135 R+ + L+ R L + + S P L Sbjct: 436 EVARLRLRGRWLLKPRRAALIEALRSAPALAGC 468
>PF06580#Sensor histidine kinase Length = 349 Score = 42.5 bits (100), Expect = 4e-06 Identities = 23/151 (15%), Positives = 49/151 (32%), Gaps = 52/151 (34%) Query: 361 ELDKSLIERIIDPLT--HLVRNSLDHGIELPEKRLAAGKNSVGNLILSAEHQGGNICIEV 418 +++ ++++ + P+ LV N + HGI G ++L G + +EV Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEV 296 Query: 419 TDDGAGLNRERILAKAASQGLTVSENMSDDEVAMLIFAPGFSMAEQVTDVSGRGVGMDVV 478 + G+ + G G+ V Sbjct: 297 ENTGSLALKNTK--------------------------------------ESTGTGLQNV 318 Query: 479 KRNIQEMGG---HVEIQSKQGTGTTIRILLP 506 + +Q + G +++ KQG +L+P Sbjct: 319 RERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 65.2 bits (159), Expect = 9e-14 Identities = 35/188 (18%), Positives = 72/188 (38%), Gaps = 23/188 (12%) Query: 1 MSKIRVLSVDDSALMRQIMTEIINSHSDMEMVATAPDPLVARDLIKKFNPDVLTLDVEMP 60 M+ +L DD A +R ++ + ++ V + I + D++ DV MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 RMDGLDFLEKLMRLRPMPVVMVSSLTGKGS-EVTLRALELGAIDFVTKPQLGIREGMLAY 119 + D L ++ + RP V+V ++ + + ++A E GA D++ KP + E + Sbjct: 59 DENAFDLLPRIKKARPDLPVLV--MSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGII 115 Query: 120 SEMIAEKVRTAAKASLAAHKPLSAPTTLKAGPLLSSEKLIAIGASTGGTEAIRHVLQPLP 179 +AE R +K + + +G S E R + + + Sbjct: 116 GRALAEPKRRPSKLEDDSQDGMP-----------------LVGRSAAMQEIYRVLARLMQ 158 Query: 180 LSSPALLI 187 ++ Sbjct: 159 TDLTLMIT 166
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 89.5 bits (222), Expect = 4e-24 Identities = 30/105 (28%), Positives = 51/105 (48%), Gaps = 3/105 (2%) Query: 7 KFLVVDDFSTMRRIVRNLLKELGFNNVEEAEDGVDALNKLQAGGYGFVISDWNMPNMDGL 66 LV DD + +R ++ L G++ V + + AG V++D MP+ + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYD-VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 67 ELLKTIRADGAMSALPVLMVTAEAKKENIIAAAQAGASGYVVKPF 111 +LL I+ LPVL+++A+ I A++ GA Y+ KPF Sbjct: 64 DLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 31.0 bits (70), Expect = 0.011 Identities = 35/166 (21%), Positives = 60/166 (36%), Gaps = 22/166 (13%) Query: 258 IMSLLYLATFGSFIGFSAGFAMLSKTQFPDVQILQYAFFGPFIGALARSA---GGALSDR 314 I+S + L+ + I A A L K + + FFG F S ++ Sbjct: 474 IVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKI 533 Query: 315 LGGTRVTLVNFILMAIFSGLLFLTLPTD----GQGGSFMAFFAVFLALFLTAGLGSGSTF 370 LG T L+ + L+ +LFL LP+ G F+ L +G+T Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTM----------IQLPAGATQ 583 Query: 371 QMISVIFRKLTMDRVKAEGGSDER-----AMREAATDTAAALGFIS 411 + + ++T +K E + E + A + F+S Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVS 629
>PF06580#Sensor histidine kinase Length = 349 Score = 53.3 bits (128), Expect = 1e-09 Identities = 36/172 (20%), Positives = 73/172 (42%), Gaps = 23/172 (13%) Query: 424 PESSRELLSQIRNELNASWAQLRELLTTFRLQLTEPGLRPALEASCEEYSAKFGFPVKLD 483 P +RE+L+ + + S + +LT +++ + S +F ++ + Sbjct: 190 PTKAREMLTSLSELMRYSLRYSNARQVSLADELT------VVDSYLQLASIQFEDRLQFE 243 Query: 484 YQLPPRL----VPSHQAIHLLQIAREALSNALKH-----SQASEVVVTVAQNDNQVKLTV 534 Q+ P + VP L+Q E N +KH Q ++++ +++ V L V Sbjct: 244 NQINPAIMDVQVPPM----LVQTLVE---NGIKHGIAQLPQGGKILLKGTKDNGTVTLEV 296 Query: 535 QDNGCGVPENAIRSNHYGMIIMRDRAQSLRG-DCRVRRRESGGTEVVVTFIP 585 ++ G +N S G+ +R+R Q L G + +++ E G + IP Sbjct: 297 ENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 73.7 bits (181), Expect = 2e-17 Identities = 32/117 (27%), Positives = 56/117 (47%), Gaps = 2/117 (1%) Query: 7 ATILLIDDHPMLRTGVKQLISMAPDITVVGEASNGEQGIELAESLDPDLILLDLNMPGMN 66 ATIL+ DD +RT + Q +S A + SN + D DL++ D+ MP N Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 67 GLETLDKLREKSLSGRIVVFSVSNHEEDVVTALKRGADGYLLKDMEPEDLLKALHQA 123 + L ++++ ++V S N + A ++GA YL K + +L+ + +A Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118
>INTIMIN#Intimin signature. Length = 939 Score = 258 bits (660), Expect = 8e-80 Identities = 119/378 (31%), Positives = 196/378 (51%), Gaps = 21/378 (5%) Query: 32 GEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDN 91 G+ AK ALG + Q + +++WL +G A V+++ N F GS + +P D+ Sbjct: 184 GDYAKDTALGIAGN----QASSQLQAWLQHYGTAEVNLQSGNN--FDGSSLDFLLPFYDS 237 Query: 92 DRYLTWSQLGLTQQDDGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWG 151 ++ L + Q+G D +N+G GQR+ ++GYN F D + R G G E W Sbjct: 238 EKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWR 297 Query: 152 EYLRLSANFYQPFAAWHE--QTATQEQRMARGYDLTARMRMPFYQHLNTSVRVEQYFGER 209 +Y + S N Y + WHE ++R A G+D+ +P Y L + EQY+G+ Sbjct: 298 DYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDN 357 Query: 210 VDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKK 269 V LFNS NP A ++G+NYTP+PLVT+ ++ G EN + Y+F P + Sbjct: 358 VALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQ 417 Query: 270 QLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQI 329 Q+ V E ++L GSRYD QRNN LEY+++ L++ + + T ++L + Sbjct: 418 QIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNI-PHDINGTERSTQKIQLIV 476 Query: 330 RSRYGIRQLIWQGDTQILS-----LTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVV 384 +S+YG+ +++W D+ + S G+Q SA+ + I+P + +G SN ++++ Sbjct: 477 KSKYGLDRIVWD-DSALRSQGGQIQHSGSQ--SAQDYQAILPAYV--QGGSNVYKVTARA 531 Query: 385 EDNQGQRVSSNEITLTLV 402 D G SSN + LT+ Sbjct: 532 YDRNGN--SSNNVLLTIT 547
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 51.7 bits (124), Expect = 3e-09 Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 20/129 (15%) Query: 132 AMMLH-IRQQAQAQLPEAITQAVIGRPINFQGLGGDEANAQAQEILERAAKRAGFRDVVF 190 M+ H I+Q + ++ P+ + E A + +A+ AG R+V Sbjct: 89 KMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQV---ERRA-----IRESAQGAGAREVFL 140 Query: 191 QYEPIAAGLDYEATLQEEKRVLVVDIGGGTTDCSLLLMGPQWRSRLDREASLLGHSGCRI 250 EP+AA + + E +VVDIGGGTT+ +++ + ++ S RI Sbjct: 141 IEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN-----------GVVYSSSVRI 189 Query: 251 GGNDLDIAL 259 GG+ D A+ Sbjct: 190 GGDRFDEAI 198 Score = 37.0 bits (86), Expect = 1e-04 Identities = 32/137 (23%), Positives = 56/137 (40%), Gaps = 23/137 (16%) Query: 332 RLSYRLV---RSAEECKIALSSV--AETRASLPFISNELAT------LISQRGLESALSQ 380 R +Y + +AE K + S + + LA ++ + AL + Sbjct: 203 RRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQE 262 Query: 381 PLARILEQVQLALDNAQEKPDV--------IYLTGGSARSPQIKKALAEQLPGIPIAGGD 432 PL I+ V +AL+ Q P++ + LTGG A + + L E+ GIP+ + Sbjct: 263 PLTGIVSAVMVALE--QCPPELASDISERGMVLTGGGALLRNLDRLLMEET-GIPVVVAE 319 Query: 433 D-FGSVTAGLARWAEVV 448 D V G + E++ Sbjct: 320 DPLTCVARGGGKALEMI 336
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 533 bits (1375), Expect = 0.0 Identities = 253/384 (65%), Positives = 294/384 (76%), Gaps = 17/384 (4%) Query: 1 MKVKVLSLLVPALLVAGAANAAEVYNKDGNKLDLYGKVDGLHYFSDNKSEDGDQTYVRLG 60 MK KVL+L++PALL AGAA+AAE+YNKDGNKLDLYGKVDGLHYFSD+ S+DGDQTY+R+G Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60 Query: 61 FKGVTQVTDQLTGYGQWEYQIQGNTSEDNKENSWTRVAFAGLKFQDVGSFDYGRNYGVVY 120 FKG TQ+ DQLTGYGQWEY +Q NT+E NSWTR+AFAGLKF D GSFDYGRNYGV+Y Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120 Query: 121 DVTSWTDVLPEFGGDTYG-SDNFMQQRGNGFATYRNTDFFGLVDGLNFAVQYQGKNGSVS 179 DV WTD+LPEFGGD+Y +DN+M R NG ATYRNTDFFGLVDGLNFA+QYQGKN S S Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180 Query: 180 ------GEGMTNNGRGALRQNGDGVGGSITYDY-EGFGIGGAISSSKRTDDQN-SPLYIG 231 G NNG NGDG G S TYD GF G A ++S RT++Q + I Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGGTIA 240 Query: 232 NGDRAETYTGGLKYDANNIYLAAQYTQTYNATRVGSL------GWANKAQNFEAVAQYQF 285 GD+A+ +T GLKYDANNIYLA Y++T N T G G ANK QNFE AQYQF Sbjct: 241 GGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQF 300 Query: 286 DFGLRPSVAYLQSKGKNLGTIAGRNYDDEDILKYVDVGATYYFNKNMSTYVDYKINLLD- 344 DFGLRP+V++L SKGK+L N DD+D++KY DVGATYYFNKN STYVDYKINLLD Sbjct: 301 DFGLRPAVSFLMSKGKDLTY-NNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLLDD 359 Query: 345 DNQFTRDAGINTDNIVALGLVYQF 368 D+ F +DAGI+TD+IVALG+VYQF Sbjct: 360 DDPFYKDAGISTDDIVALGMVYQF 383
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 47.9 bits (114), Expect = 9e-09 Identities = 26/145 (17%), Positives = 60/145 (41%), Gaps = 20/145 (13%) Query: 1 MNNMNVIIADDHPIVLFGIRKSLEQIEWVNVVGEFEDSTALINNLPKLDAHVLITDLSMP 60 M +++ADD + + ++L + + + ++ L + D +++TD+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 GDKYGDGITLIKYIKRHFPSLSIIVLTMNNNPAILSAVLDLDIEGIVLKQGA------PT 114 + L+ IK+ P L ++V++ N +A+ ++GA P Sbjct: 59 D---ENAFDLLPRIKKARPDLPVLVMSAQNTFM--TAIKA-------SEKGAYDYLPKPF 106 Query: 115 DLPKALAALQKGKKFTPESVSRLLE 139 DL + + + + S+L + Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLED 131
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 82.2 bits (203), Expect = 3e-18 Identities = 29/106 (27%), Positives = 48/106 (45%) Query: 827 ILVVDDHPINRRLLADQLGSLGYQCKTANDGVDALNVLSKNHIDIVLSDVNMPNMDGYRL 886 ILV DD R +L L GY + ++ ++ D+V++DV MP+ + + L Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 887 TQRIRQLGLTLPVIGVTANALAEEKQRCLESGMDSCLSKPVTLDVI 932 RI++ LPV+ ++A + E G L KP L + Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 562 bits (1450), Expect = 0.0 Identities = 181/484 (37%), Positives = 270/484 (55%), Gaps = 35/484 (7%) Query: 1 MTAINRILIVDDEDNVRRMLSTAFALQGFETHCANNGRTALHLFADIHPDVVLMDIRMPE 60 MT IL+ DD+ +R +L+ A + G++ +N T A D+V+ D+ MP+ Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59 Query: 61 MDGIKALKEMRSHETRTPVILMTAYAEVETAVEALRCGAFDYVIKPFDLDELNLIVQRAL 120 + L ++ PV++M+A TA++A GA+DY+ KPFDL EL I+ RAL Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119 Query: 121 QLQSMKKEIRHLHQALSASWQWGH-ILTNSPAMMDICKDTAKIALSQASVLISGESGTGK 179 + L Q G ++ S AM +I + A++ + +++I+GESGTGK Sbjct: 120 AEP------KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGK 173 Query: 180 ELIARAIHYNSRRAKGPFIKVNCAALPESLLESELFGHEKGAFTGAQTLRQGLFERANEG 239 EL+ARA+H +R GPF+ +N AA+P L+ESELFGHEKGAFTGAQT G FE+A G Sbjct: 174 ELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGG 233 Query: 240 TLLLDEIGEMPLVLQAKLLRILQEREFERIGGHQTIKVDIRIIAATNRDLQAMVKEGTFR 299 TL LDEIG+MP+ Q +LLR+LQ+ E+ +GG I+ D+RI+AATN+DL+ + +G FR Sbjct: 234 TLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFR 293 Query: 300 EDLFYRLNVIHLILPPLRDRREDISLLANHFLQKFSSENQRDIIDIDPMAMSLLTAWSWP 359 EDL+YRLNV+ L LPPLRDR EDI L HF+Q+ E + D A+ L+ A WP Sbjct: 294 EDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLD-VKRFDQEALELMKAHPWP 352 Query: 360 GNIRELSNVIERAVVMNSGPIIFSEDLPPQIRQPV---------CNAGEVKTASVGERN- 409 GN+REL N++ R + +I E + ++R + +G + + E N Sbjct: 353 GNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENM 412 Query: 410 ----------------LKEEIKRVEKRIIMEVLEQQEGNRTRTALMLGISRRALMYKLQE 453 + +E +I+ L GN+ + A +LG++R L K++E Sbjct: 413 RQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472 Query: 454 YGID 457 G+ Sbjct: 473 LGVS 476
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 407 bits (1048), Expect = e-148 Identities = 250/251 (99%), Positives = 250/251 (99%) Query: 1 MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR 60 MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR Sbjct: 1 MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR 60 Query: 61 DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM 120 DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM Sbjct: 61 DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM 120 Query: 121 ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADGLYPVLSWLTWPM 180 ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMAD LYPVLSWLTWPM Sbjct: 121 ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPVLSWLTWPM 180 Query: 181 SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA 240 SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA Sbjct: 181 SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA 240 Query: 241 IQDDLKDIDSE 251 IQDDLKDIDSE Sbjct: 241 IQDDLKDIDSE 251
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 45.3 bits (107), Expect = 3e-07 Identities = 32/165 (19%), Positives = 70/165 (42%), Gaps = 2/165 (1%) Query: 34 LDTIARNFSLSASSAGFIVTAAQLGYAAGLLFLVPLGDMFERRRLIVSMTLLAAGGMLIT 93 L IA +F+ +S ++ TA L ++ G L D +RL++ ++ G +I Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 94 ASSQSLA-MMILGTALTGLFSVVAQILVPLA-ATLASPDKRGKVVGTIMSGLLLGILLAR 151 S ++I+ + G + LV + A + RGK G I S + +G + Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 152 TVAGLLANLGGWRTVFWVASVLMALMALALWRGLPQMKSETHLNY 196 + G++A+ W + + + + + + +++ + H + Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.018 Identities = 23/94 (24%), Positives = 36/94 (38%), Gaps = 12/94 (12%) Query: 23 PYQEILLTRLCMHMQSKLLENRNKMLKAQGINETLFMALITLESQENHSIQPSELSCALG 82 P QE+ L + + L R A+G + + T + ++L ALG Sbjct: 756 PEQELRLVETGVQGRLWALLTREGAPAAEGAAQKGYSVNTTFVTI-------ADLVQALG 808 Query: 83 -----SSRTNATRIADELEKRGWIERRESDNDRR 111 SS ++ D L + GW RE+ RR Sbjct: 809 ADPGKSSPMLEGQVRDWLNENGWEYLRETSGQRR 842
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 78.7 bits (194), Expect = 5e-18 Identities = 64/412 (15%), Positives = 120/412 (29%), Gaps = 97/412 (23%) Query: 25 LLLTLLFIIIAVAIGIYWFLVLRHFEETDDA----YVAGNQIQIMSQVSGSVTKVWADNT 80 L FI+ + I VL E A +G +I + V ++ Sbjct: 57 PRLVAYFIMGFLVIAFILS-VLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEG 115 Query: 81 DFVKEGDVLVTLDPTDARQAFEKA------------------------------------ 104 + V++GDVL+ L A K Sbjct: 116 ESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPY 175 Query: 105 ----------------KTALASSVRQTHQLMINSKQLQANIEVQKIALAKA-------QS 141 K ++ Q +Q +N + +A + + +S Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235 Query: 142 DYNRRVPLGNANLIGREELQHARDAVTSAQAQLDVAIQQYNANQAMILGTKLEDQPAVQQ 201 + L + I + + + A +L V Q ++ IL K E Q Q Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295 Query: 202 AATEVRN------------------AWLALERTRIVSPMTGYVSRRAVQ-PGAQISPTTP 242 E+ + + + I +P++ V + V G ++ Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355 Query: 243 LMAVVPA-TNMWVDANFKETQIANMRIGQPVTITTDIYGDDVKY---TGKVVGLDMGTGS 298 LM +VP + V A + I + +GQ I + + +Y GKV + + Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF-PYTRYGYLVGKVKNI-----N 409 Query: 299 AFSLLPAQNATGNWIKVVQRLPVRIELDQKQLEQYPLRIGLSTLVSVNTTNR 350 ++ G V+ + + PL G++ + T R Sbjct: 410 LDAIE--DQRLGLVFNVIISIEENCLST--GNKNIPLSSGMAVTAEIKTGMR 457
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 132 bits (334), Expect = 7e-36 Identities = 98/405 (24%), Positives = 169/405 (41%), Gaps = 23/405 (5%) Query: 17 IALSLATFMQVLDSTIANVAIPTIAGNLGSSLSQGTWVITSFGVANAISIPLTGWLAKRV 76 I L + +F VL+ + NV++P IA + + WV T+F + +I + G L+ ++ Sbjct: 17 IWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQL 76 Query: 77 GEVKLFLWSTIAFAIASWACGVS-SSLNMLIFFRVIQGIVAGPLIPLSQSLLLNNYPPAK 135 G +L L+ I S V S ++LI R IQG A L ++ P Sbjct: 77 GIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKEN 136 Query: 136 RSIALALWSMTVIVAPICGPILGGYISDNYHWGWIFFINVPIGVAVVLMTLQTLRGRETR 195 R A L V + GP +GG I+ HW + + +P+ + + L L +E R Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVR 194 Query: 196 TKRRRIDAVGLALLVIGIGSLQIMLDRGKELDWFSSQEIIILTVVAVVAICFLIVWELTD 255 K D G+ L+ +GI + ML F++ I +V+V++ + Sbjct: 195 IK-GHFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKV 243 Query: 256 DNPIVDLSLFKSRNFTIGCLCISLAYMLYFGAIVLLPQLLQEVYGYTATWAGLASAPVGI 315 +P VD L K+ F IG LC + + G + ++P ++++V+ + G G Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303 Query: 316 IPVILS-PIIGRFAHKLDMRRLVTFSFIMYAVCFYWRAYTFEPGMDFGASAWPQFIQGF- 373 + VI+ I G + ++ +V F ++ S + I F Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL-----LETTSWFMTIIIVFV 358 Query: 374 --AVACFFMPLTTITLSGLPPERLAAASSLSNFTRTLAGSIGTSI 416 ++ ++TI S L + A SL NFT L+ G +I Sbjct: 359 LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403
>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature. Length = 171 Score = 292 bits (750), Expect = e-105 Identities = 131/170 (77%), Positives = 148/170 (87%) Query: 2 PLLDSFTVDHTRMEAPAVRVAKTMNTPHGDAITVFDLRFCVPNKEVMPERGIHTLEHLFA 61 PLLDSFTVDHTRM APAVRVAKTM TP GD ITVFDLRF PNK+++ E+GIHTLEHL+A Sbjct: 1 PLLDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYA 60 Query: 62 GFMRNHLNGNGVEIIDISPMGCRTGFYMSLIGTPDEQRVADAWKAAMEDVLKVQDQNQIP 121 GFMRNHLNG+ VEIIDISPMGCRTGFYMSLIGTP EQ+VADAW AAMEDVLKV++QN+IP Sbjct: 61 GFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIP 120 Query: 122 ELNVYQCGTYQMHSLQEAQDIARSILERDVRINSNEELALPKEKLQELHI 171 ELN YQCGT MHSL EA+ IA++ILE V +N N+ELALP+ L+EL I Sbjct: 121 ELNEYQCGTAAMHSLDEAKQIAKNILEVGVAVNKNDELALPESMLRELRI 170
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 775 bits (2003), Expect = 0.0 Identities = 320/849 (37%), Positives = 469/849 (55%), Gaps = 48/849 (5%) Query: 31 SGMLCTTANAEEYYFDPIMLETTKSGMQTTDLSRFSKKYAQLPGTYQVDIWLNKKKVSQK 90 + ++ E YF+P L DLSRF PGTY+VDI+LN ++ + Sbjct: 35 AFAAQAPLSSAELYFNPRFLAD--DPQAVADLSRFENGQELPPGTYRVDIYLNNGYMATR 92 Query: 91 KITFTAN-AEQLLQPQFTVEQLRELGIKVDEIPALAEKDDDSVINSLEQIIPGTAAEFDF 149 +TF +EQ + P T QL +G+ + + DD+ + L +I A+ D Sbjct: 93 DVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVP-LTSMIHDATAQLDV 151 Query: 150 NHQRLNLSIPQIALYRDARGYVSPSRWDDGIPTLFTNYSFTGSDNRYRQGNRSQRQYLNM 209 QRLNL+IPQ + ARGY+ P WD GI NY+F+G+ + R G S YLN+ Sbjct: 152 GQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAYLNL 211 Query: 210 QNGANFGPWRLRNYSTWTRNDQTSS------WNTISSYLQRDIKALKSQLLLGESATSGS 263 Q+G N G WRLR+ +TW+ N SS W I+++L+RDI L+S+L LG+ T G Sbjct: 212 QSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGYTQGD 271 Query: 264 IFSSYTFTGVQLASDDNMLPNSQRGFAPTVRGIANSSAIVTIRQNGYVIYQSNVPAGAFE 323 IF F G QLASDDNMLP+SQRGFAP + GIA +A VTI+QNGY IY S VP G F Sbjct: 272 IFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFT 331 Query: 324 INDLYPSSNSGDLEVTIEESDGTQRRFIQPYSSLPMMQRPGHLKYSATAGRYRADANSDS 383 IND+Y + NSGDL+VTI+E+DG+ + F PYSS+P++QR GH +YS TAG YR+ Sbjct: 332 INDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSGNAQQE 391 Query: 384 KEPEFAEATAIYGLNNTFTLYGGLLGSEDYYALGIGIGGTLGALGALSMDINRADTQFDN 443 K P F ++T ++GL +T+YGG ++ Y A GIG +GALGALS+D+ +A++ + Sbjct: 392 K-PRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANSTLPD 450 Query: 444 QHSFHGYQWRTQYIKDIPETNTNIAVSYYRYTNDGYFSFDEA------------------ 485 G R Y K + E+ TNI + YRY+ GYF+F + Sbjct: 451 DSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQ 510 Query: 486 ----NTRNWNYNSRQKSEIQFNISQTIFDGVSLYASGSQQDYWGNNDKNRNISVGVSGQQ 541 T +N ++ ++Q ++Q + +LY SGS Q YWG ++ + G++ Sbjct: 511 VKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAF 570 Query: 542 WGIGYSLNYQYSRYTDQN-NDRALSLNLSIPLERWLPRSR--------VSYQMTSQKDRP 592 I ++L+Y ++ Q D+ L+LN++IP WL SY M+ + Sbjct: 571 EDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGR 630 Query: 593 TQHEMRLNGSLLDDGRLSYSLEQSLDDDNNHNS----SLNASYRSPYGTFSAGYSYGNDS 648 + + G+LL+D LSYS++ + NS +YR YG + GYS+ +D Sbjct: 631 MTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDI 690 Query: 649 SQYNYGVTGGVVIHTHGVTLSQYLGNAFALIDANGASGVRIQNYPGIATDPFGYAVVPYL 708 Q YGV+GGV+ H +GVTL Q L + L+ A GA +++N G+ TD GYAV+PY Sbjct: 691 KQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYA 750 Query: 709 TTYQENRLSVDTTQLPDNVDLEQTTQFVVPNRGAMVAARFNANIGYRVLVIVSDRNGKPL 768 T Y+ENR+++DT L DNVDL+ VVP RGA+V A F A +G ++L+ + N KPL Sbjct: 751 TEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTL-THNNKPL 809 Query: 769 PFGALASNDETGQQSIVDEGGILYLSGISSKSQSWTVRWGNQADQQCQFAFSTPDSEPTT 828 PFGA+ +++ + IV + G +YLSG+ + V+WG + + C + P Sbjct: 810 PFGAMVTSESSQSSGIVADNGQVYLSGMPLAGK-VQVKWGEEENAHCVANYQLPPESQQQ 868 Query: 829 SVLQGTAQC 837 + Q +A+C Sbjct: 869 LLTQLSAEC 877
>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature. Length = 167 Score = 28.9 bits (64), Expect = 0.018 Identities = 41/160 (25%), Positives = 67/160 (41%), Gaps = 21/160 (13%) Query: 162 VKLSIQGNLTAPQSCKINQGDVIKVNFGFINGQKFTTRNAMPDGFTPVDFDITYDCGDTS 221 V+++I+GN+ P C IN G I V+FG IN + V +I+ C S Sbjct: 21 VQINIRGNVYIP-PCTINNGQNIVVDFGNINPEHVDNSRG------EVTKNISISCPYKS 73 Query: 222 KIKNSLQMRIDGTTGVVDQYNLVARRRSSDNVPDVGIRIENLGGGVANIPFQNG------ 275 SL +++ G T V Q N++A N+ GI + G + NG Sbjct: 74 ---GSLWIKVTGNTMGVGQNNVLA-----TNITHFGIALYQGKGMSTPLTLGNGSGNGYR 125 Query: 276 ILPVDPSGHGTVNMRAWPVNLVGGELETGKFQGTATITVM 315 + + T + P G L G F+ TA+++++ Sbjct: 126 VTAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMI 165
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 29.7 bits (66), Expect = 0.042 Identities = 11/72 (15%), Positives = 24/72 (33%), Gaps = 4/72 (5%) Query: 487 AGVNGGSGIALTGTPITPRATTDSGMTTNNPTLQTTPTDDQFTNNGGRVDAVYIVATPGE 546 + V+G + + + I + + ++ T D + G R A + + Sbjct: 330 SEVHGNAEVHASFFDIGGSVSAGFSNSNSS----TVAIDHSLSLAGERTWAETMGLNTAD 385 Query: 547 IAFIKPMIAMRN 558 A + I N Sbjct: 386 TARLNANIRYVN 397
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 27.6 bits (61), Expect = 0.036 Identities = 26/111 (23%), Positives = 44/111 (39%), Gaps = 22/111 (19%) Query: 42 NKILCCGNGTSAANAQHFAASMINRFETERPSLPAIALNTDNVVLTAIA-------NDRL 94 K+L GN + A T + IA + V AI+ D+ Sbjct: 277 TKVL--GNVGKGISQYIIAQRAAQGLSTSAAAAGLIA----SAVTLAISPLSFLSIADKF 330 Query: 95 HD----EVYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVA 141 E Y+++ + LG+ GD LLA + A++A++T T++A Sbjct: 331 KRANKIEEYSQRFKKLGYDGDSLLAAFHKETG-----AIDASLTTISTVLA 376
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 29.0 bits (65), Expect = 0.014 Identities = 8/22 (36%), Positives = 13/22 (59%) Query: 19 VLITGATGLVGGHLLRMLINEP 40 L+TGA G +G H+ + L+ Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG 24
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 72.3 bits (177), Expect = 5e-16 Identities = 32/184 (17%), Positives = 63/184 (34%), Gaps = 38/184 (20%) Query: 90 GLGSGVIINASKGYVLTNNHVINQAQKISIQL------------NDGREFDAKLIGSDDQ 137 + SGV++ K +LTN HV++ L +G ++ + Sbjct: 102 FIASGVVVG--KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159 Query: 138 SDIALLQIQN-------PSKLTQIAIADSDKLRVGDFAVAVGNPFGLGQTATSGIVSALG 190 D+A+++ + ++++ + +V G P V+ + Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKP-------VATMW 212 Query: 191 RSGLNLEGLEN-FIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSVGIGFAIPSN 249 S + L+ +Q D S GNSG + N E+IGI+ G+ Sbjct: 213 ESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWG---------GVPNEFNGA 263 Query: 250 MART 253 + Sbjct: 264 VFIN 267
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 52.7 bits (126), Expect = 8e-10 Identities = 31/160 (19%), Positives = 59/160 (36%), Gaps = 26/160 (16%) Query: 77 RTLGSGVIMDQRGYIITNKHVINDADQIIVALQ------------DGRVFEALLVGSDSL 124 + SGV++ + ++TNKHV++ AL+ +G + Sbjct: 101 TFIASGVVVG-KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159 Query: 125 TDLAVLKI-------NATGGLPTIPINARRVPHIGDVVLAIGNPYNLGQTITQGIISATG 177 DLA++K + + ++ + + G P + T + G Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--ESKG 216 Query: 178 RIGLNPTGRQNFLQTDASINHGNSGGALVNSLGELMGINT 217 +I + +Q D S GNSG + N E++GI+ Sbjct: 217 KI---TYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW 253
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 28.1 bits (62), Expect = 0.045 Identities = 37/167 (22%), Positives = 61/167 (36%), Gaps = 27/167 (16%) Query: 3 VAVLGAAGGIGQALALLLKTQLPSGSELSLYDIAPVTPGVAVDLSHIPTAVKIKGFSGED 62 + GAA GIG+A+A L G+ ++ D P V S A + F + Sbjct: 11 AFITGAAQGIGEAVARTL---ASQGAHIAAVDYNP-EKLEKVVSSLKAEARHAEAFPADV 66 Query: 63 ATPA------------LEGADVVLISAGVARK------PGMDRSDLFNVNAGIVKNLVQQ 104 A + D+++ AGV R + F+VN+ V N + Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 105 VAKTCPK----ACIGIITNPVNTT-VAIAAEVLKKAGVYDKNKLFGV 146 V+K + + + +NP ++AA KA K G+ Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGL 173
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 169 bits (430), Expect = 4e-57 Identities = 44/141 (31%), Positives = 71/141 (50%), Gaps = 5/141 (3%) Query: 15 KALLKEEKFSSQGEIVAALQEQGFDNINQSKVSRMLTKFGAVRTRNAKMEMVYCLPAELG 74 + ++ + +Q E+V L++ G+ N+ Q+ VSR + + V+ Y LPA+ Sbjct: 11 REIITANEIETQDELVDILKKDGY-NVTQATVSRDIKELHLVKVPTNNGSYKYSLPADQR 69 Query: 75 VPTTSSPLKNLV---LDIDYNDAVVVIHTSPGAAQLIARLLDSLGKAEGILGTIAGDDTI 131 S ++L+ + ID ++V+ T PG AQ I L+D+L E I+GTI GDDTI Sbjct: 70 FNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEE-IMGTICGDDTI 128 Query: 132 FTTPANGFTVKDLYEAILELF 152 K + + ILEL Sbjct: 129 LIICRTHDDTKVVQKKILELL 149
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 53.3 bits (128), Expect = 4e-10 Identities = 28/163 (17%), Positives = 59/163 (36%), Gaps = 16/163 (9%) Query: 6 RKFSRTAITVVLVILAFIAIFNAWVYYTE----SPWTRDARFSADVVAIAPDVSGLITQV 61 SR V I+ F+ I + + S I P + ++ ++ Sbjct: 51 TPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEI 110 Query: 62 NVHDNQLVKKGQILFTIDQPR-------YQKALEEAQADVAYYQVLAQEKRQEAGRRNRL 114 V + + V+KG +L + Q +L +A+ + YQ+L++ E + L Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRS--IELNKLPEL 168 Query: 115 GVQAMSREEIDQANNVL---QTVLHQLAKAQATRDLAKLDLER 154 + + VL + Q + Q + +L+L++ Sbjct: 169 KLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDK 211 Score = 47.5 bits (113), Expect = 3e-08 Identities = 28/147 (19%), Positives = 53/147 (36%), Gaps = 15/147 (10%) Query: 100 LAQEKRQEAGRRNRLGVQ-AMSREEIDQANNVLQT-VLHQLAKAQAT-------RDLAKL 150 E R + ++ + ++EE + + +L +L + + Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEE 323 Query: 151 DLERTVIRAPADGWVTNLNVYT-GEFITRGSTAVALVKQNSFY-VLAYMEETKLEGGRPG 208 + +VIRAP V L V+T G +T T + +V ++ V A ++ + G Sbjct: 324 RQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVG 383 Query: 209 YRAEIT----PLGSNKVLKGTVDSVAA 231 A I P L G V ++ Sbjct: 384 QNAIIKVEAFPYTRYGYLVGKVKNINL 410
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 28.7 bits (64), Expect = 0.023 Identities = 14/62 (22%), Positives = 29/62 (46%), Gaps = 1/62 (1%) Query: 164 ASSVEDLVTQTLEFTIEEVNADRNV-SNNAKNRQIVLNLYEKGIFDIKDAINQVADRLNI 222 A +V+D VTQ +E + ++ + S + + + L + D A QV ++L + Sbjct: 54 AQTVQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQL 113 Query: 223 SK 224 + Sbjct: 114 AT 115
>INFPOTNTIATR#Macrophage infectivity potentiator signature. Length = 233 Score = 132 bits (334), Expect = 5e-40 Identities = 79/226 (34%), Positives = 124/226 (54%), Gaps = 9/226 (3%) Query: 28 AAKPATTADSKAAFKNDDQKSAYALGASLGRYMENSLKEQEKLGIKLDKDQLIAGVQDAF 87 A A A + D K +Y++GA LG K + GI ++ D L G+QD Sbjct: 14 AMSTAMAATDATSLTTDKDKLSYSIGADLG-------KNFKNQGIDINPDVLAKGMQDGM 66 Query: 88 A-DKSKLSDQEIEQTLQAFEARVKSSAQAKMEKDAADNEAKGKEYREKFAKEKGVKTSST 146 + + L++++++ L F+ + + A+ K A +N+AKG + + G+ + Sbjct: 67 SGAQLILTEEQMKDVLSKFQKDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPS 126 Query: 147 GLVYQVVEAGKGEAPKDSDTVVVNYKGTLIDGKEFDNSYTRGEPLSFRLDGVIPGWTEGL 206 GL Y++++AG G P SDTV V Y GTLIDG FD++ G+P +F++ VIPGWTE L Sbjct: 127 GLQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEAL 186 Query: 207 KNIKKGGKIKLVIPPELAYGKAGVPG-IPPNSTLVFDVELLDVKPA 251 + + G ++ +P +LAYG V G I PN TL+F + L+ VK A Sbjct: 187 QLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISVKKA 232
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 30.7 bits (69), Expect = 0.021 Identities = 13/69 (18%), Positives = 29/69 (42%), Gaps = 6/69 (8%) Query: 261 TAIDPFKGLLLG---LFFISVGMSLNLGVLYTHL-LWVVISVVVLVAVKILVLYLLARLY 316 A+ P L + L+FIS + L +++ + W +++ V+ ++ L Sbjct: 318 AAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKA-- 375 Query: 317 GVRSSERMQ 325 S +M+ Sbjct: 376 QYTSMAKMR 384
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 31.9 bits (72), Expect = 0.001 Identities = 32/135 (23%), Positives = 51/135 (37%), Gaps = 16/135 (11%) Query: 12 YAHPESQDSVANRVLLKPATQLSNVTVHDLYAHYPDFFIDIPREQALLREHEVIVFQH-- 69 Y P + D N+V P + + +HD+ ++ D F L + + Sbjct: 9 YQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCV 68 Query: 70 ----PLYTYSCPALLKEWLDRVLSRGFASGPGGNQLAGKYWRNVITTGEPESA------Y 119 P+ + P DR L F GPG N +G Y +IT PE + Sbjct: 69 QLGIPVVYTAQPGSQNP-DDRALLTDFW-GPGLN--SGPYEEKIITELAPEDDDLVLTKW 124 Query: 120 RYDALNRYPMSDVLR 134 RY A R + +++R Sbjct: 125 RYSAFKRTNLLEMMR 139
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 32.7 bits (74), Expect = 0.005 Identities = 28/152 (18%), Positives = 54/152 (35%), Gaps = 22/152 (14%) Query: 504 KVEPFDGDLEDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPLRKE 563 + D + ++ E + + ++ R+ +R R + L E Sbjct: 272 AMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAE 331 Query: 564 IARLEKEME---------------------KLNAQLAQAEEKLGDSELYDQSRKAELTAC 602 +LE++ + +L A+ + EE+ SE QS + +L A Sbjct: 332 HQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDAS 391 Query: 603 LQQQASAKSGLEECEMAWLEAQEQLEQMLLEG 634 + + + LEE L A E+L + L E Sbjct: 392 REAKKQVEKALEEANSK-LAALEKLNKELEES 422 Score = 32.0 bits (72), Expect = 0.008 Identities = 13/125 (10%), Positives = 39/125 (31%), Gaps = 7/125 (5%) Query: 513 EDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPLRKEIARLEKEME 572 + + ++ + E A A + D ++ + +++ Sbjct: 127 KALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFST-------ADSAKIK 179 Query: 573 KLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEECEMAWLEAQEQLEQMLL 632 L A+ A E + + E + TA + + ++ + ++ LE + Sbjct: 180 TLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMN 239 Query: 633 EGQSN 637 ++ Sbjct: 240 FSTAD 244
>PF07299#Fibronectin-binding protein (FBP) Length = 219 Score = 31.8 bits (72), Expect = 0.002 Identities = 10/46 (21%), Positives = 21/46 (45%), Gaps = 2/46 (4%) Query: 71 PEANDFGLLEQTFIEYGQSGKGKSRKYLHTYDEAVPWNQVPGTFTP 116 P+ + + E ++ KG SRK++ ++ + + GTF Sbjct: 112 PDMEELDMKELSY--LSWIDKGSSRKFIIAKNDKNKFVGLQGTFQS 155
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 38.3 bits (89), Expect = 4e-05 Identities = 35/208 (16%), Positives = 71/208 (34%), Gaps = 13/208 (6%) Query: 33 IIVEFLPVSLLTP----MAQDLGISEGVAGQSVTVTAFVAMFASLFITQTIQATDR--RN 86 + ++ + + L+ P + +DL S V + A A+ +DR R Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRR 73 Query: 87 VVILFAVLL-TLSCLLVSFANSFSLLLIGRACLGLALGGFWAMSASLTMRLVPPRTVPKA 145 V+L ++ + +++ A +L IGR G+ G A++ + + + Sbjct: 74 PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARH 132 Query: 146 LSVIFGAVSIALVIAAPLGSFLGELIGWRNVFNAAAVMG----VLCIFWIIKSLPSLPGE 201 + +V LG +G F AAA + + F + +S Sbjct: 133 FGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRP 191 Query: 202 PSHQKQNTFRLLQRPGVMAGMIAIFMSF 229 + N + M + A+ F Sbjct: 192 LRREALNPLASFRWARGMTVVAALMAVF 219
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 37.8 bits (88), Expect = 1e-04 Identities = 30/105 (28%), Positives = 43/105 (40%), Gaps = 17/105 (16%) Query: 22 AVSRGDAVADYIIDNVSILDLINGGEISGPIVIKGRYIAGVG-AEYAD---------APA 71 V+R D +I N ILD + G + I +K IA +G A D P Sbjct: 60 QVTREGGAVDTVITNALILD--HWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPG 117 Query: 72 LQRIDARGATAVPGFIDAHLHIESSMMTPVTFETATLPRGLTTVI 116 + I G G +D+H+H + P E A L GLT ++ Sbjct: 118 TEVIAGEGKIVTAGGMDSHIH----FICPQQIEEA-LMSGLTCML 157
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 34.1 bits (78), Expect = 0.001 Identities = 28/168 (16%), Positives = 61/168 (36%), Gaps = 17/168 (10%) Query: 49 FNIAQNDMISTYGLSMTQLGMIGLGFSITYGVGKTLVSYYADGKNTKQFLPFMLILSAIC 108 N++ D+ + + + F +T+ +G + +D K+ L F +I++ C Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN--C 90 Query: 109 MLGFSASMGSGSVSLFLMIAFYALSGFFQSTGGSCSYSTI----TKWTPRRKRGTFLGFW 164 +G SL +M + F Q G + + + ++ P+ RG G Sbjct: 91 FGSVIGFVGHSFFSLLIM------ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144 Query: 165 NISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRY 212 +G + A+Y+ + + + P + I+ L Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSY---LLLIP--MITIITVPFLMK 187
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 40.6 bits (95), Expect = 1e-05 Identities = 65/408 (15%), Positives = 137/408 (33%), Gaps = 60/408 (14%) Query: 30 RHILLTIWLGYALFY--FTRKSFNAAVPEILANGVLSRSDIGLLATLFYITYGVSKFVSG 87 RH + IWL F+ N ++P+I + + + T F +T+ + V G Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 88 IVSDRSNARYFMGIGLIATGIINILFGFSTSLWAFAVLWVLNAFFQGWGS---PVCARLL 144 +SD+ + + G+I +++ S F L ++ F QG G+ P ++ Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHS---FFSLLIMARFIQGAGAAAFPALVMVV 127 Query: 145 TAWY-SRTERGGWWALWNTAHNVGGALIPIVMAAAALHYGWRAGMMIAGCMAIVVGIFLC 203 A Y + RG + L + +G + P + A + W ++ M ++ + Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SYLLLIPMITIITVPF- 184 Query: 204 WRLRDRPQALGLPAVGEWRHDALEIAQQQEGAGLTRKEILTKYVLLNPYIWLLSFCYVLV 263 + L +I G L I+ + Y VL Sbjct: 185 --------LMKLLKKEVRIKGHFDIK----GIILMSVGIVFFMLFTTSYSISFLIVSVLS 232 Query: 264 YVV-----RAAINDWGNLYMSETLGVDLVTANTAVTMFELGGFI-----------GALVA 307 +++ R + + + + + + + + + GF+ A Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292 Query: 308 GWGSDKLFNGNRGPMNLIFAAGILL-SVGSLWLMPFASYVMQATCFFTIGFFVFGPQMLI 366 GS +F G + + GIL+ G L+++ + + F T F + + Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFL-SVSFLTASFLLETTSWFM 351 Query: 367 ---------GMAAAECS---------HKEAAGAATGFVGLFAYLGASL 396 G++ + ++ AGA + ++L Sbjct: 352 TIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399
>PF06580#Sensor histidine kinase Length = 349 Score = 39.8 bits (93), Expect = 2e-05 Identities = 28/142 (19%), Positives = 56/142 (39%), Gaps = 11/142 (7%) Query: 366 LRPRQLDDLTLEQAIRSLMREMELEGRGIVSHLEWRIDESALSENQRVTLFRVCQEGLNN 425 LR ++L + + ++L L++ + + +V + Q + N Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPM-LVQTLVEN 266 Query: 426 IVKHA-----DASAVTLQGWQQDERLMLVIEDDGSGLPPGSGQ-QGFGLTGMRERVTALG 479 +KH + L+G + + + L +E+ GS + + G GL +RER+ L Sbjct: 267 GIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLY 326 Query: 480 G---TLHISCLHG-TRVSVSLP 497 G + +S G V +P Sbjct: 327 GTEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 61.4 bits (149), Expect = 2e-13 Identities = 29/174 (16%), Positives = 59/174 (33%), Gaps = 20/174 (11%) Query: 2 ITVALIDDHLIVRSGFAQLLGLEPDLQVVAEFGSGREALAGLPGRGVQVCICDISMPDIS 61 T+ + DD +R+ Q L V + + + + D+ MPD + Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 62 GLELLSQLPK---GMATIMLSVHDSPALVEQALNAGARGFLSKRCSPDELIAAVHTVATG 118 +LL ++ K + +++S ++ +A GA +L K ELI + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---- 117 Query: 119 GCYLTPDIAIKLASGRQDPLTKRERQVAEKLAQG---MAVKEIAAELGLSPKTV 169 A+ R L + + + + + A L + T+ Sbjct: 118 --------ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 59.9 bits (145), Expect = 7e-12 Identities = 41/184 (22%), Positives = 81/184 (44%), Gaps = 1/184 (0%) Query: 7 RNVNLLLMLVLLVAVGQMAQTIYIPAIADMARDLNVREGAVQSVMGAYLLTYGVSQLFYG 66 R+ +L+ L +L + + + ++ D+A D N + V A++LT+ + YG Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 67 PISDRVGRRPVILVGMSIFMLATLVA-VTTSSLTVLIAASAMQGMGTGVGGVMARTLPRD 125 +SD++G + ++L G+ I +++ V S ++LI A +QG G + + Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130 Query: 126 LYERTQLRHANSLLNMGILVSPLLAPLIGGLLDTMWNWRACYLFLLVLCAGVTFSMARWM 185 + A L+ + + + P IGG++ +W L ++ V F M Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLK 190 Query: 186 PETR 189 E R Sbjct: 191 KEVR 194
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 38.8 bits (90), Expect = 3e-06 Identities = 21/92 (22%), Positives = 34/92 (36%), Gaps = 16/92 (17%) Query: 86 VACIDGDVVGHLTIDVQQHPRRSHVADFGICVDSRWKNRGVASALMREMIE------MCD 139 + ++ + +G + I + + D + D R K GV +AL+ + IE C Sbjct: 69 LYYLENNCIGRIKIR-SNWNGYALIEDIAVAKDYRKK--GVGTALLHKAIEWAKENHFCG 125 Query: 140 NWLRVDRIELTVFVDNAPAIKVYKKYGFEIEG 171 L I N A Y K+ F I Sbjct: 126 LMLETQDI-------NISACHFYAKHHFIIGA 150
>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature. Length = 293 Score = 31.6 bits (71), Expect = 0.006 Identities = 27/82 (32%), Positives = 37/82 (45%), Gaps = 17/82 (20%) Query: 275 RTPISGDYRGYQVYSMPPPSSGGIHIVQILNI--LENFDMKKYGF-GSADAMQIMAEAEK 331 R P+ G+ R + SMPPP G H +I N+ + FD G+ G A I++E EK Sbjct: 77 RRPM-GETRNVSLISMPPPWRPGEHADRITNLKFFKQFD----GYVGGQTAWGILSELEK 131 Query: 332 YAYADRSEYLGDPDFVKVPWQA 353 Y P F WQ+ Sbjct: 132 GRY---------PTFSYQDWQS 144
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 31.7 bits (72), Expect = 0.004 Identities = 27/109 (24%), Positives = 42/109 (38%), Gaps = 24/109 (22%) Query: 215 VITAENGIVFRENLLFTHRGLSGPAVLQISSYWQPGEFVSINLLPDVDLETFL--NEQRN 272 ++ E I RE RG GP +L + ++ + + + L T + N Q Sbjct: 58 LLNLEEAITLRE------RGWKGP-ILMLEGFFHAQD---LEIYDQHRLTTCVHSNWQLK 107 Query: 273 AHPNQSLKNTLAVHL------------PKRLVERLQQLGQIPNVSLKQL 309 A N LK L ++L P R++ QQL + NV L Sbjct: 108 ALQNARLKAPLDIYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTL 156
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 112 bits (282), Expect = 7e-32 Identities = 41/122 (33%), Positives = 63/122 (51%), Gaps = 11/122 (9%) Query: 108 LNMPNNVTFDSSSATLKPAGANTLTGVAMVLTEY--PKTAVNVIGYTDSTGGHDLNMRLS 165 + ++V F+ + ATLKP G L + L+ +V V+GYTD G N LS Sbjct: 215 FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLS 274 Query: 166 QQRADSVASALITQGVDASRIRTQGLGPANPIASNSTAEGK---------AQNRRVEITL 216 ++RA SV LI++G+ A +I +G+G +NP+ N+ K A +RRVEI + Sbjct: 275 ERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334 Query: 217 SP 218 Sbjct: 335 KG 336
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 34.9 bits (80), Expect = 5e-05 Identities = 16/52 (30%), Positives = 22/52 (42%), Gaps = 5/52 (9%) Query: 76 VAPKAVRRGIGKALMQYV-----QQRYPHLMLEVYQKNQPAIDFYRAQGFHI 122 VA ++G+G AL+ + + LMLE N A FY F I Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 27.5 bits (61), Expect = 0.048 Identities = 19/90 (21%), Positives = 37/90 (41%), Gaps = 13/90 (14%) Query: 121 SMYNEFGDSTTTQTDPLWHASVSTLGWRVDSRLGDLRPWAQISYNQQFGENIWKAQSGLS 180 S+ + D+ + H S + + + R G++ P ++SY F + Sbjct: 228 SVAVQQQDAKLV-EENYSHNSQTEVAATLAYRFGNVTP--RVSYAHGFKGSF-------- 276 Query: 181 RMTATNQNGNWLDVTVGADMLLNQNIAAYA 210 ATN N ++ V VGA+ ++ +A Sbjct: 277 --DATNYNNDYDQVVVGAEYDFSKRTSALV 304
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 42.5 bits (100), Expect = 2e-06 Identities = 47/275 (17%), Positives = 94/275 (34%), Gaps = 32/275 (11%) Query: 44 PVSQVAFSFGLLSLGLAIS----SSVAGKLQERFGVKRVTMASGILLGLGFFLTAHSNNL 99 + V +G+L A+ + V G L +RFG + V + S + + + A + L Sbjct: 37 HSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL 96 Query: 100 MMLWLS---AGVLVGLADGAGYLL----TLSNCVKWFPERKGLISAFAIGSYGLGSLGFK 152 +L++ AG+ AG + + F + LG Sbjct: 97 WVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGG---- 152 Query: 153 FIDTQLLETVGLEKTFVIWGAIALVMIVFGATLMKDAPKQEVKTSNGVVEKDYTLAESMR 212 L+ F A+ + + G L+ ++ K E + R Sbjct: 153 -----LMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWAR 207 Query: 213 --KPQYWMLAVMFLTACMSG----LYVIGVAKDIAQSLAHLDVVSAANAVTVISIAN-LS 265 ++AV F+ + L+VI + H D + ++ I + L+ Sbjct: 208 GMTVVAALMAVFFIMQLVGQVPAALWVI-----FGEDRFHWDATTIGISLAAFGILHSLA 262 Query: 266 GRLVLGILSDKIARIRVITIGQVISLVGMAALLFA 300 ++ G ++ ++ R + +G + G L FA Sbjct: 263 QAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297 Score = 36.0 bits (83), Expect = 2e-04 Identities = 37/155 (23%), Positives = 64/155 (41%), Gaps = 9/155 (5%) Query: 241 AQSLAHLDVVSAANAVTVISIANLSGRLVLGILSDKIARIRVITIGQVISLVGMAALLFA 300 AH ++ A A+ + A + G L SD+ R V+ + + V A + A Sbjct: 39 NDVTAHYGILLALYALMQFACAPVLGAL-----SDRFGRRPVLLVSLAGAAVDYAIMATA 93 Query: 301 PLNAVTFFAAIACVAFNFGGTITVFPSLVSEFFGLNNLAKNYGVIYLGFGIGSIFGSIIA 360 P V + I VA G T V + +++ + A+++G + FG G + G ++ Sbjct: 94 PFLWVLYIGRI--VAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG 151 Query: 361 SLFGGF--YVTFYVIFALLILSLALSTTIRQPEQK 393 L GGF + F+ AL L+ + K Sbjct: 152 GLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHK 186
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 120 bits (302), Expect = 2e-39 Identities = 50/89 (56%), Positives = 66/89 (74%) Query: 2 NKTQLIDVIAEKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTFKVNHRAERTGR 61 NK LI +AE EL+K + AA+++ +A++ L +G+ VQL+GFG F+V RA R GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 62 NPQTGKEIKIAAANVPAFVSGKALKDAVK 90 NPQTG+EIKI A+ VPAF +GKALKDAVK Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91
>PF06580#Sensor histidine kinase Length = 349 Score = 36.8 bits (85), Expect = 2e-04 Identities = 49/262 (18%), Positives = 104/262 (39%), Gaps = 43/262 (16%) Query: 197 ILFALATVLLA-SVLSFFW-YRRYLRSRQLLQDEMKRKEKLVALGHLAAGV-AHEIRNPL 253 I+F + V S+L F W + + + ++ Q +M + L L A + H + N L Sbjct: 120 IIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNAL 179 Query: 254 SSIKGLAKYFAERAPAGGEAHQLAQVM---AKEADRLNRVVSELLELVKPTHLALQAVDL 310 ++I+ L +A L+++M + ++ +++ L +V ++L L ++ Sbjct: 180 NNIRALILEDPTKAREM--LTSLSELMRYSLRYSNARQVSLADELTVVD-SYLQLASIQF 236 Query: 311 NTLINHSLQLVSQDANSREIQLRFTANDTLPEIQADPDRLTQVLL-NLYLNAIQAIGQHG 369 + Q+ + ++Q+ P L Q L+ N + I + Q G Sbjct: 237 EDRLQFENQI---NPAIMDVQV--------------PPMLVQTLVENGIKHGIAQLPQGG 279 Query: 370 VISVTASESGAGVKISVTDSGKGIAADQLDAIFTPYFTTKAEGTGLGLAVVHNIVEQHGG 429 I + ++ V + V ++G + E TG GL V ++ G Sbjct: 280 KILLKGTKDNGTVTLEVENTGSLALKNT------------KESTGTGLQNVRERLQMLYG 327 Query: 430 ---TIQVASQEGKGATFTLWLP 448 I+++ ++GK + +P Sbjct: 328 TEAQIKLSEKQGKV-NAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 525 bits (1353), Expect = 0.0 Identities = 187/468 (39%), Positives = 257/468 (54%), Gaps = 35/468 (7%) Query: 8 ILVVDDDISHCTILQALLRGWGYNVALANSGRQALEQVREQVFDLVLCDVRMAEMDGIAT 67 ILV DDD + T+L L GY+V + ++ + DLV+ DV M + + Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 68 LKEIKALNPAIPVLIMTAYSSVETAVEALKTGALDYLIKPLDFDNLQATLEKALAHTHSI 127 L IK P +PVL+M+A ++ TA++A + GA DYL KP D L + +ALA Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125 Query: 128 DAETPAVSASQFGMVGKSPAMQHLLSEIALVAPSEATVLIHGDSGTGKELVARAIHASSA 187 ++ S +VG+S AMQ + +A + ++ T++I G+SGTGKELVARA+H Sbjct: 126 PSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGK 185 Query: 188 RSEKPLVTLNCAALNESLLESELFGHEKGAFTGADRRREGRFVEADGGTLFLDEIGDISP 247 R P V +N AA+ L+ESELFGHEKGAFTGA R GRF +A+GGTLFLDEIGD+ Sbjct: 186 RRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPM 245 Query: 248 MMQVRLLRAIQEREVQRVGSNQTISVDVRLIAATHRDLAAEVNAGRFRQDLYYRLNVVAI 307 Q RLLR +Q+ E VG I DVR++AAT++DL +N G FR+DLYYRLNVV + Sbjct: 246 DAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPL 305 Query: 308 EVPSLRQRREDIPLLAVHFLQRFAERNRKAVKGFTPQAMDLLIHYDWPGNIRELENAVER 367 +P LR R EDIP L HF+Q+ + VK F +A++L+ + WPGN+RELEN V R Sbjct: 306 RLPPLRDRAEDIPDLVRHFVQQAEKEGLD-VKRFDQEALELMKAHPWPGNVRELENLVRR 364 Query: 368 AVVLLTGEYISERELPLAI------------AGTPIPLGQSQDI---------------- 399 L + I+ + + A L SQ + Sbjct: 365 LTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALP 424 Query: 400 ------QPLVEVEKEVILAALEKTGGNKTEAARQLGITRKTLLAKLSR 441 + L E+E +ILAAL T GN+ +AA LG+ R TL K+ Sbjct: 425 PSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 30.7 bits (69), Expect = 0.001 Identities = 15/54 (27%), Positives = 20/54 (37%), Gaps = 5/54 (9%) Query: 78 IDPDVCGCGVGRMLVEHALSMAPE-----LTTNVNEQNEQAVGFYKKVGFKVTG 126 + D GVG L+ A+ A E L + N A FY K F + Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 32.3 bits (73), Expect = 0.003 Identities = 14/58 (24%), Positives = 23/58 (39%) Query: 289 ETSTPDLELARRFAQAIHAKYPGKLLAYNCSPSFNWQKNLDDKTIASFQQQLSDMGYK 346 ET+ PD+ L A P L Y + N D +T + + QL+++ Sbjct: 544 ETTKPDMTLKEALKIAFGFNEPNGNLQYQGKDITEFDFNFDQQTSQNIKNQLAELNAT 601
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 29.3 bits (65), Expect = 0.017 Identities = 20/95 (21%), Positives = 38/95 (40%), Gaps = 3/95 (3%) Query: 20 AAGTVKVFSNGSSEAKTLTGAEHLIDLVGQPRLANSWWPGAVISEELATAAALRQQQALL 79 A + + + LT + L D+V + N+ + A AA++ + L Sbjct: 73 AKAAAEAQAKAKANRDALT--QRLKDIVNEALRHNASRTPSATELAHANNAAMQAEDERL 130 Query: 80 TRLAEQGADSSTDDAAAINALRQQIQALEVTGRQK 114 RLA+ + + AA A ++ Q + R+K Sbjct: 131 -RLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREK 164
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 35.6 bits (82), Expect = 3e-04 Identities = 20/87 (22%), Positives = 42/87 (48%), Gaps = 3/87 (3%) Query: 279 VIGVMLSIFQQFVGINVVLYYAPEVFKTLGASTDIALLQTIIVGVINLTFTVLAIMT--- 335 +I ++ ++ VGI +++ P + + L S D+ I++ + L A + Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 336 VDKFGRKPLQIIGALGMAIGMFSLGTA 362 D+FGR+P+ ++ G A+ + TA Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATA 93
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 753 bits (1946), Expect = 0.0 Identities = 395/396 (99%), Positives = 395/396 (99%) Query: 1 MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK 60 MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK Sbjct: 1 MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK 60 Query: 61 VTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTW 120 VTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTW Sbjct: 61 VTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTW 120 Query: 121 DAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEP 180 DAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEP Sbjct: 121 DAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEP 180 Query: 181 YFTWPLIAADGGYAFKYENGKYDIKDVAVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAE 240 YFTWPLIAADGGYAFKYENGKYDIKDV VDNAGAKAGLTFLVDLIKNKHMNADTDYSIAE Sbjct: 181 YFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAE 240 Query: 241 AAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKE 300 AAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKE Sbjct: 241 AAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKE 300 Query: 301 LAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIP 360 LAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIP Sbjct: 301 LAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIP 360 Query: 361 QMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK 396 QMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK Sbjct: 361 QMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK 396
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.7 bits (79), Expect = 6e-04 Identities = 13/35 (37%), Positives = 18/35 (51%) Query: 32 VVFVGPSGCGKSTLLRMIAGLETITSGDLFIGEKR 66 VV G G GKSTL+ + GL+ + IG + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGK 633
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.020 Identities = 12/22 (54%), Positives = 13/22 (59%) Query: 32 MVALLGPSGSGKSTLLRHLSGL 53 V L G G GKSTL+ L GL Sbjct: 598 SVVLEGTGGIGKSTLINTLVGL 619
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 42.9 bits (101), Expect = 2e-06 Identities = 57/290 (19%), Positives = 105/290 (36%), Gaps = 55/290 (18%) Query: 85 FFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYDTIGIWAPILLLICKMAQGFSVGGE 144 G L D++GR+ +L +++ ++ + P +W +L I ++ G + G Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF-----LW---VLYIGRIVAGIT-GAT 112 Query: 145 YTGASIFVAEYSPDRKR----GFMGSWLDFGSIAGFVLGAGVVVLISTIVGEANFLDWGW 200 A ++A+ + +R GFM + FG +AG VLG G++ S Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG-GLMGGFSP------------ 159 Query: 201 RIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDREGLQDGPKVSFKEIATKYWRS 260 PFF A L + L K E+ P SF+ W Sbjct: 160 HAPFFAAAALNGLNFLTGCFLLPESH------KGERRPLRREALNPLASFR------WAR 207 Query: 261 LLTCIGLVIATNVTYYML----LTYMPSYLSHNLHYS-EDHGVLIIIAIMIGMLFVQPVM 315 +T + ++A ++ + H+ G+ + ++ L + Sbjct: 208 GMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMIT 267 Query: 316 GLLSDRFGRRPFVLLG----SVALFVLA--------IPAFILINSNVIGL 353 G ++ R G R ++LG +LA P +L+ S IG+ Sbjct: 268 GPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGM 317 Score = 39.4 bits (92), Expect = 3e-05 Identities = 39/164 (23%), Positives = 73/164 (44%), Gaps = 16/164 (9%) Query: 286 LSHNLHYSEDHGVLI-IIAIMIGMLFVQPVMGLLSDRFGRRPFVLLGSVALFVLAIPAFI 344 L H+ + +G+L+ + A+M PV+G LSDRFGRRP +L+ L A+ I Sbjct: 35 LVHSNDVTAHYGILLALYALM--QFACAPVLGALSDRFGRRPVLLVS---LAGAAVDYAI 89 Query: 345 LINSNVIGLIFAGLLMLAVILNCFTGVMASTLPAMFPTHIR---YSALAAAFNISVLVAG 401 + + + +++ G ++A I V + + + R + ++A F +VAG Sbjct: 90 MATAPFLWVLYIG-RIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFG-MVAG 147 Query: 402 LTPTLAAWLVESSQNLMMPAYYLMVVAVIGLITG-VTMKETANR 444 P L + S + P + + + +TG + E+ Sbjct: 148 --PVLGGLMGGFSPH--APFFAAAALNGLNFLTGCFLLPESHKG 187
>PF06580#Sensor histidine kinase Length = 349 Score = 36.4 bits (84), Expect = 1e-04 Identities = 40/182 (21%), Positives = 80/182 (43%), Gaps = 34/182 (18%) Query: 181 ARLDQMMESVSQLLQLARAGQSFSSGNYQHVKLLEDV-ILPSYDELSTML--DQRQQTLL 237 + +M+ S+S+L++ S N + V L +++ ++ SY +L+++ D+ Q Sbjct: 191 TKAREMLTSLSELMR-----YSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245 Query: 238 LPESAADITVQGDATLLRMLLRNLVENAHRY----SPQGSNIMIKLQEDGGAV-MAVEDE 292 + + D+ V ML++ LVEN ++ PQG I++K +D G V + VE+ Sbjct: 246 INPAIMDVQV------PPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299 Query: 293 GPGIDESKCGELSKAFVRMDSRYGGIGLGLSIV-SRITQLHHGQFFLQNRQETSGTRAWV 351 G + + G GL V R+ L+ + ++ ++ A V Sbjct: 300 GSLA--------------LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345 Query: 352 RL 353 + Sbjct: 346 LI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 92.2 bits (229), Expect = 7e-24 Identities = 41/121 (33%), Positives = 60/121 (49%) Query: 2 KILIVEDDTLLLQGLILAAQTEGYTCDGVTTARMAEQSLEAGHYSLVVLDLGLPDEDGLH 61 IL+ +DD + L A GY + A + + AG LVV D+ +PDE+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 FLARIRQKKYTLPVLILTARDTLTDKIAGLDVGADDYLVKPFALEELHARIRALLRRHNN 121 L RI++ + LPVL+++A++T I + GA DYL KPF L EL I L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 122 Q 122 + Sbjct: 125 R 125
>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature. Length = 1291 Score = 29.2 bits (65), Expect = 0.014 Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 4/45 (8%) Query: 145 PLLVSHGIALGCLVSTILGLPAWAERRLRLRNCSISRVDYQESLW 189 P +V GIA G V T+ GL W ++ N D + +W Sbjct: 42 PAIVG-GIATGAAVGTVSGLLGWGLKQAEEAN---KTPDKPDKVW 82
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 87.2 bits (216), Expect = 5e-22 Identities = 34/139 (24%), Positives = 60/139 (43%) Query: 1 MQRETVWLVEDEQGIADTLVYMLQQEGFAVEVFERGLPVLDKARQQVPDVMILDVGLPDI 60 M T+ + +D+ I L L + G+ V + + D+++ DV +PD Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 SGFELCRQLLALHPALPVLFLTARSEEVDRLLGLEIGADDYVAKPFSPREVCARVRTLLR 120 + F+L ++ P LPVL ++A++ + + E GA DY+ KPF E+ + L Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 121 RVKKFSSPSPVIRIGHFEL 139 K+ S L Sbjct: 121 EPKRRPSKLEDDSQDGMPL 139
>PF06580#Sensor histidine kinase Length = 349 Score = 33.3 bits (76), Expect = 0.002 Identities = 42/182 (23%), Positives = 73/182 (40%), Gaps = 40/182 (21%) Query: 312 LRQARLENRQEVVLTAVDVAALFR---RVSEARTVQLAE--KNITLHVT--------PTE 358 +R LE+ + ++ L R R S AR V LA+ + ++ + Sbjct: 182 IRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQ 241 Query: 359 VNVAAEPALLDQALGNLL-----DNA----IDFTPESGRITLSAEVDQEHVTLKVLDTGS 409 PA++D + +L +N I P+ G+I L D VTL+V +TGS Sbjct: 242 FENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGS 301 Query: 410 GIPDYALSRIFERFYSLPRANGQKSSGLGLAFVSE-VARLFNGEVTLR-NVQEGGVLASL 467 N ++S+G GL V E + L+ E ++ + ++G V A + Sbjct: 302 LALK----------------NTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345 Query: 468 RL 469 + Sbjct: 346 LI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 81.8 bits (202), Expect = 4e-20 Identities = 30/122 (24%), Positives = 60/122 (49%), Gaps = 1/122 (0%) Query: 1 MQTPHILIVEDELVTRNTLKSIFEAEGYDVFEATDGAEMHQILSEYDINLVIMDINLPGK 60 M IL+ +D+ R L GYDV ++ A + + ++ D +LV+ D+ +P + Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 NGLLLARELRE-QANVALMFLTGRDNEVDKILGLEIGADDYITKPFNPRELTIRARNLLS 119 N L +++ + ++ ++ ++ ++ + I E GA DY+ KPF+ EL L+ Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 120 RT 121 Sbjct: 121 EP 122