>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature. Length = 171 Score = 209 bits (534), Expect = 1e-72 Identities = 59/144 (40%), Positives = 88/144 (61%), Gaps = 7/144 (4%) Query: 6 VESFNLDHTKVKAPYVRLAGRKDGENGDVILKYDVRFKQPNKEHMEMKSLHSLEHLTAEL 65 ++SF +DHT++ AP VR+A GD I +D+RF PNK+ + K +H+LEHL A Sbjct: 3 LDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYAGF 62 Query: 66 IRNHAD----YVVDWSPMGCQTGFYLTVINHDNYDDILSVLEATMKDVLVA---TEVPAS 118 +RNH + ++D SPMGC+TGFY+++I + + A M+DVL ++P Sbjct: 63 MRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIPEL 122 Query: 119 NEVQCGWAASHTLEGAQQLATEFL 142 NE QCG AA H+L+ A+Q+A L Sbjct: 123 NEYQCGTAAMHSLDEAKQIAKNIL 146
>OMADHESIN#Yersinia outer membrane adhesin signature. Length = 455 Score = 31.4 bits (70), Expect = 0.006 Identities = 26/71 (36%), Positives = 35/71 (49%), Gaps = 6/71 (8%) Query: 78 IATDYKTAETKAVGSGVAGAGAGVAVVAIGP-SVAMG-LATTFGVAST----GTAISALS 131 I + A+ AV G GV VAIGP S A+G A T+G AST G AI A + Sbjct: 75 IGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARA 134 Query: 132 GAAATNASLAW 142 + T ++ + Sbjct: 135 STSDTGVAVGF 145
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 29.4 bits (66), Expect = 0.028 Identities = 13/99 (13%), Positives = 36/99 (36%), Gaps = 23/99 (23%) Query: 100 LQDLQRRRDELQREKTNWLHQIAEQASKIPGLGHLFDDYSVLQAEKKV-QLMEDFRDFIH 158 + + + + E E + S++ + +L A+++ + + F+ Sbjct: 254 VLEQENKYVEAVNE-------LRVYKSQLEQIES-----EILSAKEEYQLVTQLFK---- 297 Query: 159 RHDSDFSELNGLIEQVLLGLEELGKQRSFNGDKQGYSPI 197 +E+ + Q + L + + N ++Q S I Sbjct: 298 ------NEILDKLRQTTDNIGLLTLELAKNEERQQASVI 330
>PF06917#Periplasmic pectate lyase Length = 555 Score = 26.4 bits (58), Expect = 0.032 Identities = 14/44 (31%), Positives = 19/44 (43%) Query: 18 FIANENKLIIWNGYFETILDNLLDCDVEKKGIVKEYFNHEGWYD 61 + NE+ L W G+ LD L K V E +H +YD Sbjct: 106 GVHNESGLFYWGGHRFLNLDTLKTEGPASKDQVHELKHHLPYYD 149
>PF05043#Transcriptional activator Length = 493 Score = 33.4 bits (76), Expect = 0.002 Identities = 31/139 (22%), Positives = 60/139 (43%), Gaps = 13/139 (9%) Query: 1 MDTLLMKKDLAKLTLFRELVYNQPKELSLDYFSELLNISKRSTLRTVEELAHDLEKDFED 60 M LL KK +L L EL++ + +ELLN ++R+ V++ ++ F D Sbjct: 1 MRDLLSKKSHRQLELL-ELLFEHKRWFHRSELAELLNCTERA----VKDDLSHVKSAFPD 55 Query: 61 MEIKKNKYSYSIMNNSLMNNEYFIVSLQLFY--LKNSIQFNIIYSLLTKYFDSMTQLSEY 118 + +S N + N +++ K+S F+I+ + + + Sbjct: 56 L------IFHSSTNGIRIINTDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKE 109 Query: 119 LYISTPHLYRQMPEIKRFL 137 YIS+ LYR + +I + + Sbjct: 110 FYISSSSLYRIISQINKVI 128
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 33.4 bits (76), Expect = 6e-04 Identities = 23/78 (29%), Positives = 40/78 (51%), Gaps = 2/78 (2%) Query: 176 QVIMIGGLLF-SPITYPTDRLPSLLVRFFEILPFVPSSNLIRSMFYDQGIVNI-YNIIVI 233 Q ++I +LF S +P D+LP + LP S +LIR + +V++ ++ + Sbjct: 182 QTLVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGAL 241 Query: 234 CFWLVLNMLLSLVSLSRR 251 C ++V+ LS L RR Sbjct: 242 CIYIVIPFFLSTALLRRR 259
>PF05043#Transcriptional activator Length = 493 Score = 52.6 bits (126), Expect = 2e-09 Identities = 32/169 (18%), Positives = 66/169 (39%), Gaps = 7/169 (4%) Query: 1 MKRLLDPNFIPILSLLKQLNKDYPSRSITFFSEQLKLDRRTILKTIHTLQLDISRNHWEN 60 M+ LL L LL+ L + + +E L R + + ++ + + Sbjct: 1 MRDLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIFHS 60 Query: 61 MLTIEIIDKSVYTTISPFFSIEVFFSHYMSESFAVRLFLSLFKYPSDSIDEICEYLYVSK 120 + + IE+ + H+ S + +F + IC+ Y+S Sbjct: 61 ------STNGIRIINTDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISS 114 Query: 121 ATFYRRIKYSKEVLDDFNLSLDFTDSKNKLVGSETQIRYFFSTLFWEVF 169 ++ YR I +V+ + + + +++G+E IRYFF+ F E + Sbjct: 115 SSLYRIISQINKVIKR-QFQFEVSLTPVQIIGNERDIRYFFAQYFSEKY 162
>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature. Length = 591 Score = 28.2 bits (62), Expect = 0.025 Identities = 26/110 (23%), Positives = 41/110 (37%), Gaps = 14/110 (12%) Query: 121 LKGKKVPTNILKASISIPKGKISTTADGDMQAPAVALPLTLS--KAAAPVMTAKKNNGLG 178 L G T L +PKG + + G ++ LPL +S + AP + ++G G Sbjct: 4 LNGFSSATLALITPPFLPKGGKALSQSGPDGLASITLPLPISAERGFAPALALHYSSGGG 63 Query: 179 IWANDFNGLAGTVTIKVPTNAYIDQYS------------ANITWSLQDAP 216 T++I T+ + QY+ T S DAP Sbjct: 64 NGPFGVGWSCATMSIARSTSHGVPQYNDSDEFLGPDGEVLVQTLSTGDAP 113
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 28.4 bits (63), Expect = 0.024 Identities = 16/36 (44%), Positives = 22/36 (61%), Gaps = 4/36 (11%) Query: 48 EDPMGPLDPLNPDNPN----PPSPVDPMDPENPGTG 79 +P P +P NPDNPN P +P +P +P+NP G Sbjct: 290 NNPDNPDNPNNPDNPNNPDEPNNPDNPNNPDNPDNG 325
>PF05043#Transcriptional activator Length = 493 Score = 33.8 bits (77), Expect = 0.001 Identities = 17/95 (17%), Positives = 43/95 (45%), Gaps = 7/95 (7%) Query: 80 TSILEIRQYFLEESIAFKLLINLYQKQFIKLKNFSLTYYYSPSVVY---KKVNELKVKLK 136 + I + +F + S F +L ++ + + ++ +Y S S +Y ++N++ + Sbjct: 73 SDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSSSLYRIISQINKVIKRQF 132 Query: 137 GYGLTIKSEHGSIYLGGEEEKKRYFYSEIFYYVYG 171 + +++ + G E RYF+++ F Y Sbjct: 133 QFEVSLTPVQ----IIGNERDIRYFFAQYFSEKYY 163
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 42.4 bits (99), Expect = 4e-06 Identities = 46/228 (20%), Positives = 77/228 (33%), Gaps = 25/228 (10%) Query: 87 QQENQELTVKIDQREDQLEKQARVVQVNGDTQNYIDFVLEAKSMSDIIGRVDVVAQMVSA 146 + E + TV QA V V + E + D S Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNN--------EEIARVDEAPVPPPAPATPSE 1035 Query: 147 NRAMVKQQADDKAQVVKQEKEVAKKSDEQKVLAADLAKTQEKLQTQKLEKESIVAQIAAD 206 V + + +++ V++ ++ A ++ Q A AK+ K TQ E VAQ ++ Sbjct: 1036 TTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNE----VAQSGSE 1091 Query: 207 TATAEGDKNKFLAQKAAAEKEAEDLR----IAKVAADKKASED--------AEAARV-VQ 253 T + + K A EK + + KV + ++ AE AR Sbjct: 1092 TKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDP 1151 Query: 254 LANAKAAEAAKNTTVAAAPPAENPQGGGTPPVSTGAYGRPTNAPVSSS 301 N K ++ NTT PA+ PV+ N+ V + Sbjct: 1152 TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENP 1199
>ARGDEIMINASE#Bacterial arginine deiminase signature. Length = 409 Score = 513 bits (1324), Expect = 0.0 Identities = 190/410 (46%), Positives = 271/410 (66%), Gaps = 8/410 (1%) Query: 3 NPIHIMSEIGKLKTVLLKRPGEEVENLTPDIMGRLLFDDIPYLPIIQEEHDYFAKALTDN 62 NPI+I SEIG+LK VLL RPGEE+ENLTP IM LFDDIPYL + ++EH+ FA L +N Sbjct: 6 NPINIFSEIGRLKKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQEHEVFASILKNN 65 Query: 63 GTEVLYLEKLTAEAI-DAGGIREVFIDRMLSESEISSPKIASALREYLLSMETFPMVTKI 121 E+ Y+E L +E + + + FI + + E+EI + + L++Y S+ M++K+ Sbjct: 66 LVEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTINLLKDYFSSLTIDNMISKM 125 Query: 122 MAGVRTRDIDVTTSNLVDISNKEHYPFFMDPMPNLYFTRDPAASLGNGLTINSMHYTARR 181 ++GV T ++ TS+L D+ N F +DPMPN+ FTRDP AS+GNG+TIN M R+ Sbjct: 126 ISGVVTEELKNYTSSLDDLVN-GANLFIIDPMPNVLFTRDPFASIGNGVTINKMFTKVRQ 184 Query: 182 RESMFMEIIIQYHPRFANKGVEVWLDRDHPESIEGGDELVLNERVVAIGISQRTSAKAIE 241 RE++F E I +YHP + + V +WL+R S+EGGDELVLN+ ++ IGIS+RT AK++E Sbjct: 185 RETIFAEYIFKYHPVYK-ENVPIWLNRWEEASLEGGDELVLNKGLLVIGISERTEAKSVE 243 Query: 242 ALAKALFSRNSNFEKVVAIKIPNVRAMMHLDTVFTMVDYDKFTIHPGIQADGGKVDTYII 301 LA +LF ++F+ ++A +IP R+ MHLDTVFT +DY FT Y++ Sbjct: 244 KLAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQIDYSVFTSFTSD---DMYFSIYVL 300 Query: 302 EPSTVPGEIRMTERN-DLQEVLREVLNVPELILIPCGNGDEIVAPREQWNDGSNTLAIAP 360 + +I + + +++VL L ++ +I C GD I REQWNDG+N LAIAP Sbjct: 301 TYNPSSSKIHIKKEKARIKDVLSFYLG-RKIDIIKCAGGDLIHGAREQWNDGANVLAIAP 359 Query: 361 GVVVTYNRNYVSNELLRSYGVKVIEVISSELSRGRGGPRCMSMPLIREDL 410 G ++ Y+RN+V+N+L G+KV + SSELSRGRGGPRCMSMPLIRED+ Sbjct: 360 GEIIAYSRNHVTNKLFEENGIKVHRIPSSELSRGRGGPRCMSMPLIREDI 409
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 393 bits (1011), Expect = e-140 Identities = 136/313 (43%), Positives = 194/313 (61%), Gaps = 4/313 (1%) Query: 3 KRKIVVALGGNAIL--STDASDKAQKEALKATAAYLVEIIKQGNELIISHGNGPQVGNLV 60 +++V+ALGGNA+ S + + ++ TA + EII +G E++I+HGNGPQVG+L+ Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61 Query: 61 LQQQAAAS-KSNPAMPLDTCVAMTQGSIGYWLQNALENAFKKEGIEKSVISVVSQVVVDQ 119 L A + PA P+D AM+QG IGY +Q AL+N +K G+EK V+++++Q +VD+ Sbjct: 62 LHMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDK 121 Query: 120 NDVAFIHPTKPIGPFLTQSEAHEQMLLSDDTYQEDAGRGWRKVVPSPKPVSILEYPIINQ 179 ND AF +PTKP+GPF + A +ED+GRGWR+VVPSP P +E I + Sbjct: 122 NDPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKK 181 Query: 180 LVENGVVTISVGGGGIPVIEAENEFVGVEAVIDKDFASQKLAELVEADLLVILTGVEQVY 239 LVE GV+ I+ GGGG+PVI + E GVEAVIDKD A +KLAE V AD+ +ILT V Sbjct: 182 LVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAA 241 Query: 240 INYNQPNQKALTTVTTKELYQYIQENQFAPGSMLPKIEAAISFVEHNPKGKAVITSLENL 299 + Y ++ L V +EL +Y +E F GSM PK+ AAI F+E + +A+I LE Sbjct: 242 LYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGE-RAIIAHLEKA 300 Query: 300 GNFNTENAGTTIV 312 GT ++ Sbjct: 301 VEALEGKTGTQVL 313
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 42.7 bits (100), Expect = 6e-06 Identities = 52/313 (16%), Positives = 95/313 (30%), Gaps = 35/313 (11%) Query: 77 PFQVTQKTNPNPSQTAE------SEYKARLQELETNFKEKQLHFEQEMLEKEKEAQQNRQ 130 T T PN Q +E AR+ E E E Q+++ Sbjct: 991 TVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKT 1050 Query: 131 DLERNLKEKLEQQYNEQVATIKEEKEQNREQLQKQFNEQIAEIEHLSLEKRIEIETKYQE 190 EK EQ E A +E ++ + + N Q E+ E ET+ E Sbjct: 1051 ------VEKNEQDATETTAQNREVAKE--AKSNVKANTQTNEVAQSGSE---TKETQTTE 1099 Query: 191 KLKMLEGIYAEQQDLSEAKYAEKEANLEKAYQDKQAQFDKQQQAKIAEIEAQKRKQDEEY 250 E + + E++A +E + + Q K + E + + + Sbjct: 1100 T--------KETATVEK----EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAR 1147 Query: 251 ERKYLEKIEKLDLFYKERQTVLEKDIAEKEKEIIRNAETLAEEKLDRAAREAEQLLSEIK 310 E I++ + QT D + KE N E E + E Sbjct: 1148 ENDPTVNIKE-----PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENT 1202 Query: 311 AKVALQQQNLEHSTIETEEKNSKAIED-AKKMASNIINSANGEAKKKLEKAELQAKTALE 369 Q S+ + + ++ +++ + +S + + L Sbjct: 1203 TPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLS 1262 Query: 370 SARSDSQLLIENT 382 AR+ +Q + N Sbjct: 1263 DARAKAQFVALNV 1275 Score = 35.0 bits (80), Expect = 0.001 Identities = 33/265 (12%), Positives = 73/265 (27%), Gaps = 8/265 (3%) Query: 115 EQEMLEKEKEAQQNRQDLERNLKEKLEQQYNEQVATIKEEKEQNREQLQKQFNEQIAEIE 174 E E + + NE++A + E + Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043 Query: 175 HLSLEKRIEIETKYQEKLKMLEGIYAEQQDLSEAKYAEKEANLEKAYQDKQAQFDKQQQA 234 K +E + + A+ + S K + E A + + + + Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAK-EAKSNVKANTQTN--EVAQSGSETKETQTTET 1100 Query: 235 KIAEIEAQKRKQDEEYERKYLEKIEKLDLFYKERQTVLEKDIAEKEKEIIRNAETLAEEK 294 K ++ K E E+ + K+ Q+ + AE +E + +E Sbjct: 1101 KETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARE--NDPTVNIKEP 1158 Query: 295 LDRAAREAEQLLSEIKAKVALQQQNLEHSTIET---EEKNSKAIEDAKKMASNIINSANG 351 + A+ + ++Q E +T+ T +N + A + S+N Sbjct: 1159 QSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNK 1218 Query: 352 EAKKKLEKAELQAKTALESARSDSQ 376 + + S + Sbjct: 1219 PKNRHRRSVRSVPHNVEPATTSSND 1243
>PF05043#Transcriptional activator Length = 493 Score = 62.3 bits (151), Expect = 2e-12 Identities = 80/423 (18%), Positives = 165/423 (39%), Gaps = 53/423 (12%) Query: 1 MEELLSSSEQRELKIIHLLYQEEKLWTVEQLANYLQCSIDTCYRYIDRIKQIFYDHGNEF 60 M +LLS R+L+++ LL++ ++ + +LA L C+ + +K F D Sbjct: 1 MRDLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPD----- 55 Query: 61 ELISKKTKGVLLKKTEHASLSKYESIYIEETIDFKLLSELFHSTYLTTEKLADHLFISKS 120 + T G+ + T+ + + + + + F +L +F + E + +IS S Sbjct: 56 LIFHSSTNGIRIINTDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSS 115 Query: 121 TLYRKLKKIAILLRKN-GIHLNISTLQLTGNEVWIREFFYLVYWSTSDSGFWPFE----- 174 +LYR + +I ++++ ++++ +Q+ GNE IR FF + WPFE Sbjct: 116 SLYRIISQINKVIKRQFQFEVSLTPVQIIGNERDIRYFFAQYFSEKYYFLEWPFENFSSE 175 Query: 175 -------------SVPKHVLTHRVENIISSQN------SYFSTIEKLKLTYR-----MAI 210 S P ++ THR+ ++ N +F ++K + M Sbjct: 176 PLSQLLELVYKETSFPMNLSTHRMLKLLLVTNLYRIKFGHFMEVDKDSFNDQSLDFLMQA 235 Query: 211 SFIRVQQKNFITH--------SIGDSFIDPFKEEYFEFITLNLMKTVPSNYQKNEKDYLS 262 I ++F + + F+ F++ +F +L + +Y + LS Sbjct: 236 EGIEGVAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKDSYVEKSYHLLS 295 Query: 263 LIFCTYPYLDKSDLNFCGIVSWHSINNTIPYQLTDKLLSSLSTIYPTQNLLTNKKLFYQL 322 + ++ + WH N Y+ L T + + N +Q Sbjct: 296 DFIDQISVKYQIEIENKDNLIWHLHNTAHLYR------QELFTEFILFDQKGNTIRNFQN 349 Query: 323 LCISIYATYFQASFSKTSEFLKL---STLLKQTHTCFYLNTKHALATICQEEPFKRILLQ 379 + + + S E L++ S ++ F +TKH + + Q +P ++L+ Sbjct: 350 IFPKFVSD-VKKELSHYLETLEVCSSSMMVNHLSYTFITHTKHLVINLLQNQPKLKVLVM 408 Query: 380 PNF 382 NF Sbjct: 409 SNF 411
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 32.3 bits (73), Expect = 0.002 Identities = 15/41 (36%), Positives = 22/41 (53%) Query: 37 NSISDVTFTTNTDPTNPVNPTDPTKPVLPVDPLDPADPHEP 77 +I D+ F + P NP NP +P P P +P +P +P P Sbjct: 276 QNIEDIHFANDDQPNNPDNPDNPNNPDNPNNPDEPNNPDNP 316 Score = 29.6 bits (66), Expect = 0.012 Identities = 13/42 (30%), Positives = 21/42 (50%) Query: 36 MNSISDVTFTTNTDPTNPVNPTDPTKPVLPVDPLDPADPHEP 77 ++ +D +P NP NP +P P P +P +P +P P Sbjct: 281 IHFANDDQPNNPDNPDNPNNPDNPNNPDEPNNPDNPNNPDNP 322
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 40.0 bits (93), Expect = 1e-05 Identities = 34/243 (13%), Positives = 81/243 (33%), Gaps = 6/243 (2%) Query: 27 ADDYSDKINSQNEKIKEIETQEKDVTTKLEGVTKEIVVAEEKARVLVEQSQATHAEMEKL 86 ++ + KI+E+E ++ D+ LEG K + L + A A L Sbjct: 101 LRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADL 160 Query: 87 TKEVDSLNAKIEKRTAQLEKQARAVQVSASSEGYVDFIL------SADSLSDVVGRVDVV 140 K ++ +A+++ + + ++ L S + + Sbjct: 161 EKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEK 220 Query: 141 AQMVSANRELVKAQAEDKATVESNKKKTETKLTEQHEVAGQLEKLKGELEGKKLEQESVV 200 A + + +L KA ++ K +T E+ + + +L+ LEG + Sbjct: 221 AALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADS 280 Query: 201 ATLAASKASAEGERDGFIAQKEEADRKAADLKAAEEAAKKAPVLQTSTEDKKAPVTTTNQ 260 A + +A + ++ A+ ++ + + E + + N+ Sbjct: 281 AKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNK 340 Query: 261 ESN 263 S Sbjct: 341 ISE 343 Score = 39.7 bits (92), Expect = 2e-05 Identities = 50/253 (19%), Positives = 90/253 (35%), Gaps = 13/253 (5%) Query: 23 LTALADDYSDKINSQNEKIKEIETQEKDVTTKLEGVTKEIVVAEEKARVLVEQSQATHAE 82 L A + + ++ + K++ + E E + L QSQ +A Sbjct: 251 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNAN 310 Query: 83 MEKLTKEVDSLNAKIEKRTAQLEKQARAVQVS-ASSEGYVDFILSADSLSDVVGRVDVVA 141 + L +++D+ ++ A+ +K ++S AS + L D + + + A Sbjct: 311 RQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQS-----LRRDLDASREAKKQLEA 365 Query: 142 QMVSANRELVKAQAE------DKATVESNKKKTETKLTEQHEVAGQLEKLKGELEGKKLE 195 + + ++A D KK+ E L E + LEKL ELE K Sbjct: 366 EHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKL 425 Query: 196 QESVVATLAAS-KASAEGERDGFIAQKEEADRKAADLKAAEEAAKKAPVLQTSTEDKKAP 254 E A L A +A A+ ++ Q EE + A + + P + +AP Sbjct: 426 TEKEKAELQAKLEAEAKALKEKLAKQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAP 485 Query: 255 VTTTNQESNPPAP 267 T N Sbjct: 486 QAGTKPNQNKAPM 498
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.007 Identities = 9/19 (47%), Positives = 11/19 (57%) Query: 16 IVGKNGTGKSTFLMVLAGL 34 + G G GKST + L GL Sbjct: 601 LEGTGGIGKSTLINTLVGL 619
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 75.3 bits (185), Expect = 2e-18 Identities = 34/177 (19%), Positives = 64/177 (36%), Gaps = 15/177 (8%) Query: 3 GRIVIVDDEPITRMDIRDILEAGGYDVVGEASDGFEAIELCKSQHPDLVIMDIQMPLLDG 62 I++ DD+ R + L GYDV S+ + DLV+ D+ MP + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 63 LKAGKKIASENLAGGIILLSAFSDPTNTERAKNFGALGYLVKPLDEKSLIPTVEMSIAKG 122 +I ++++SA + +A GA YL KP D Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDL-------------- 108 Query: 123 KETQKLEEQLNKLTKKLEERKIIERAKGILMIENKITEEDAYQMIRTLSMDKRSPMI 179 E + + K+ + + G+ ++ ++ Y+++ L + MI Sbjct: 109 TELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMI 165
>PF06580#Sensor histidine kinase Length = 349 Score = 44.9 bits (106), Expect = 4e-07 Identities = 39/216 (18%), Positives = 79/216 (36%), Gaps = 28/216 (12%) Query: 265 KTEIKNKEAEIISKSVAIREIHHRVK-----NNLQSVVSLLRIQARRCESQEAKTALNES 319 + EI + +++ + + ++ N L ++ +L+ +A+ L S Sbjct: 146 QAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDP-----TKAREML-TS 199 Query: 320 VSRILAISATHELLSKQVEDGIQLKTVLESV-VY-NIQRC-FLDRNHITVVSDVSPDIVI 376 +S ++ S L + L L V Y + F DR + + ++P I Sbjct: 200 LSELMRYS-----LRYSNARQVSLADELTVVDSYLQLASIQFEDR--LQFENQINPAI-- 250 Query: 377 DSDRTVAIALIVNELLQNSYDHAFGN-EQVGLIKLTAQAEEKVITISVIDDGTGFDVKKV 435 ++V L++N H Q G I L + +T+ V + G+ Sbjct: 251 --MDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK 308 Query: 436 STTSLGLQIVNSYVKD--KLRGKIKIKSKEETGTST 469 +T GLQ V ++ +IK+ K+ + Sbjct: 309 ESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM 344
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 40.0 bits (93), Expect = 4e-05 Identities = 17/76 (22%), Positives = 33/76 (43%) Query: 636 PRGKEAKEEYPREDGASFEEAKKALEAKEAEKMKEEKAELEARKKAEEEATVEEEKGAEE 695 + +E +E A+ + + A E ++ + + + A + EE+A VE EK E Sbjct: 1063 AQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEV 1122 Query: 696 ASKEIKLEKKEEDSHE 711 ++ K+E S Sbjct: 1123 PKVTSQVSPKQEQSET 1138 Score = 30.4 bits (68), Expect = 0.029 Identities = 13/60 (21%), Positives = 27/60 (45%) Query: 648 EDGASFEEAKKALEAKEAEKMKEEKAELEARKKAEEEATVEEEKGAEEASKEIKLEKKEE 707 + G+ +E + + A KEEKA++E K E + +E S+ ++ + + Sbjct: 1087 QSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA 1146
>adhesinb#Adhesin B signature. Length = 310 Score = 158 bits (401), Expect = 3e-47 Identities = 47/193 (24%), Positives = 93/193 (48%), Gaps = 3/193 (1%) Query: 241 DHHDADEGSDTDEHDHEGHSHAFDPHIWLDPVIAQQQVQTIKDGLVKADDVNKDSYEKNA 300 D++ EG D + + DPH WL+ Q I L + D NK++YEKN Sbjct: 115 DYYAVSEGVDVIYLEGQSEKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNL 174 Query: 301 ASYIEKLKKLDKDFENELKD--TKNRTFVTQHTAFAYLANRYNLEQVAISGLSPDLEPSP 358 +Y+EKL LDK+ + + + + + VT F Y + YN+ I ++ + E +P Sbjct: 175 KAYVEKLSALDKEAKEKFNNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTP 234 Query: 359 AKLAELSDFVKENNISVIYFENSASPKISKTLASGTGAVLEVLSPIEGVSQSDQDKGIDY 418 ++ L + +++ + ++ E+S + KT++ T + + V++ ++G Y Sbjct: 235 DQIKTLVEKLRKTKVPSLFVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAE-KGEEGDSY 293 Query: 419 IKVMEANLKALKK 431 +M+ NL+ + + Sbjct: 294 YSMMKYNLEKIAE 306 Score = 100 bits (250), Expect = 8e-26 Identities = 36/154 (23%), Positives = 71/154 (46%), Gaps = 17/154 (11%) Query: 5 RKLKLLLPLLVVILVVVGCTQPGKTTAPQEKTKLQVVTTFFPMYDFTRNVTKEHADVTML 64 +K + L+ LL+ + + C+ K++ +KL VV T + D T+N+ + ++ + Sbjct: 2 KKCRFLVLLLLAFVGLAACSS-QKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSI 60 Query: 65 MKAGVEPHDYEPSAKDIAKIADADVFIYNSEYMET----WVPSVLKNIDSKKTT-VIDAS 119 + G +PH+YEP +D+ K + AD+ YN +ET W +++N K+ S Sbjct: 61 VPVGQDPHEYEPLPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVS 120 Query: 120 KNIPLLAGSDEHSEEDSEHHHDTDEHEEEPHFHL 153 + + ++ + + D PH L Sbjct: 121 EGVDVI----YLEGQSEKGKED-------PHAWL 143
>cloacin#Cloacin signature. Length = 551 Score = 45.1 bits (106), Expect = 5e-07 Identities = 28/82 (34%), Positives = 35/82 (42%), Gaps = 4/82 (4%) Query: 445 SGGGG-GNRGGASRGKGGYRGGSGGGERRGGAQRG---GKDNRRSGGGAGSGGSWNKDAK 500 SGG G G+ GA G GG G GGA G +N GGG+GSG W + Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61 Query: 501 RGSNAGGSSSSEARKGGRGNSS 522 G+ G +S G S+ Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSA 83 Score = 37.8 bits (87), Expect = 1e-04 Identities = 24/60 (40%), Positives = 27/60 (45%), Gaps = 9/60 (15%) Query: 443 GGSGGGGGNRGGASRGKG--------GYRGGSGGGERRGGAQRGGKDNRRSGGGAGSGGS 494 GG G G GGAS G G G GSG G G N SGGG+G+GG+ Sbjct: 22 GGPTGLGVG-GGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80 Score = 36.2 bits (83), Expect = 3e-04 Identities = 21/72 (29%), Positives = 24/72 (33%), Gaps = 5/72 (6%) Query: 464 GGSGGGERRGGAQRGGKDNRRSGGGAGSGGS-----WNKDAKRGSNAGGSSSSEARKGGR 518 GG G G G G N G GG+ W+ + GS G Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62 Query: 519 GNSSDNRRSGGG 530 GN N SGGG Sbjct: 63 GNGGGNGNSGGG 74
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 356 bits (916), Expect = e-124 Identities = 113/373 (30%), Positives = 181/373 (48%), Gaps = 25/373 (6%) Query: 7 RNTYALIDRNAIFNNIKNQMNLLEHDTEVYAVVKADGYGHGALEVATIAREAGVQGFCVA 66 R A +D A+ N+ H V++VVKA+ YGHG + + GF + Sbjct: 3 RPIQASLDLQALKQNLSIVRQAATH-ARVWSVVKANAYGHGIERIWSAIGATD--GFALL 59 Query: 67 LIDEALELRQAGFKEPILIM-GLVEAKYAKLLLDQRISVAVGYLKWLEEAEFYLKREKSF 125 ++EA+ LR+ G+K PIL++ G A+ ++ R++ V L+ + Sbjct: 60 NLEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARL----- 114 Query: 126 SENRKLDIHLAIDTGMGRVGFRTSAELAEVESYLTHSTSFNCQGVFTHFATADSKDTKQF 185 LDI+L +++GM R+GF+ + V L + + +HFA A+ D Sbjct: 115 --KAPLDIYLKVNSGMNRLGFQPD-RVLTVWQQLRAMANVGEMTLMSHFAEAEHPDG--I 169 Query: 186 HQQVEKFQKLVEGMTVKPTYIHSANSATSLWHQKYQKRIVRLGIAMYGLNPSGRELDL-P 244 + + ++ EG +NSA +LWH + VR GI +YG +PSG+ D+ Sbjct: 170 SGAMARIEQAAEG---LECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIAN 226 Query: 245 IKLKPAMSVETTLVQVKQMSAGETVSYGATYKAKEGEWIGTLPIGYADGWRRSLQGQT-V 303 L+P M++ + ++ V+ + AGE V YG Y A++ + IG + GYADG+ R T V Sbjct: 227 TGLRPVMTLSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPV 286 Query: 304 LVEGERCEIVGRICMDQCMIRLNK--EVPIGTKVVLIGKDQNDEISAQEIAEYLDTINYE 361 LV+G R VG + MD + L + IGT V L GK EI ++A T+ YE Sbjct: 287 LVDGVRTMTVGTVSMDMLAVDLTPCPQAGIGTPVELWGK----EIKIDDVAAAAGTVGYE 342 Query: 362 IVCGFTQRLPRVY 374 ++C R+P V Sbjct: 343 LMCALALRVPVVT 355
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 77.6 bits (191), Expect = 8e-18 Identities = 69/390 (17%), Positives = 137/390 (35%), Gaps = 27/390 (6%) Query: 27 LKQPKPAWAVAFACVIAFMGIGLVDPILKSISEQLHAT---PAETSLLFTSYMLVTGIVM 83 +K +P + + +GIGL+ P+L + L + A +L Y L+ Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60 Query: 84 LFSGYISSRIGAKKTILYGLVIIIIFAVLGGFSNSVGELVGFRAGWGLGNALFISTALSA 143 G +S R G + +L L + + + + L R G+ A + A + Sbjct: 61 PVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATG-AVAGAY 119 Query: 144 IVGVSVGGTEQSIIMY-EAAMGLGMSVGPLLGGLLGSISWRAPFFGVATLMAVAFISVTI 202 I ++ G + A G GM GP+LGGL+G S APFF A L + F++ Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCF 179 Query: 203 LL------------EKIPKPTKKVAFWDGLRALKHKGLLIMGITALFYNFGFFTLLAYSP 250 LL + P + G+ + L+ + L Sbjct: 180 LLPESHKGERRPLRREALNPLASFRWARGMTVV--AALMAVFFIMQLVGQVPAALWVIFG 237 Query: 251 FRMESYSAMQVGFVFFGWGLCLAISSVFLAPKLQEKFGTKNMMYAALLLFALDLMIMGFG 310 + A +G +G+ +++ + + + G + + ++ +++ F Sbjct: 238 EDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297 Query: 311 ANNSTIISICIIIA--GLFQGVNNTLITTAVMEVSPVDRSIASSAYSFIRFTGGALAPWL 368 I +++A G+ +++ +V + + + + + P L Sbjct: 298 TRGWMAFPIMVLLASGGIGMPALQAMLSR---QVDEERQGQLQGSLAALTSLTSIVGPLL 354 Query: 369 AGKLADWYNPHVTFWFAAVAVACGALVLFI 398 + Y +T W +A AL L Sbjct: 355 FTAI---YAASITTWNGWAWIAGAALYLLC 381
>BACSURFANTGN#Yersinia/Haemophilus virulence surface antigen signature. Length = 322 Score = 27.4 bits (60), Expect = 0.017 Identities = 16/53 (30%), Positives = 26/53 (49%), Gaps = 4/53 (7%) Query: 3 HHIDIYVKDLEKQSNFWSWFLGELGY---QEFQKWETGISWKKADFYYVLSIG 52 H I YV + + F F GE + ++F+KW T W + ++Y L +G Sbjct: 258 HAIAAYVNEKSGVTFFDPNF-GEFHFSDKEKFRKWFTNSFWGNSMYHYPLGVG 309
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 123 bits (309), Expect = 1e-32 Identities = 88/399 (22%), Positives = 174/399 (43%), Gaps = 15/399 (3%) Query: 21 TFMTAVEGTIVSTAMPTIVGSLEGM-AIMNWVFSIYLLTNAMMTPVYGKLSDMIGRKPIF 79 +F + + +++ ++P I A NWV + ++LT ++ T VYGKLSD +G K + Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82 Query: 80 IIGAIIFVIGSSLCGLAQTMDQLILF-RAIQGIGAGAIMPVSFTIIADIYPYEKRAKVMG 138 + G II GS + + + L++ R IQG GA A + ++A P E R K G Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142 Query: 139 MNGAAWGIAGIFGPLLGGFIVDQLSWHWIFYINVPVGIITIILIALFLHEDFSFEKKPID 198 + G+ + GP +GG I + HW + + +P+ I + + L + K D Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYI--HWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFD 200 Query: 199 FLGCFSLMAALLFLLYGFQIVGDTGEFSASMAGVFALAVGMFALFIFAEKRAIDPIIPLS 258 G + ++F + ++V F +F+ ++ DP + Sbjct: 201 IKGIILMSVGIVFFMLFTTS---------YSISFLIVSVLSFLIFVKHIRKVTDPFVDPG 251 Query: 259 LFNNRTFVIQNIVAALVSGFLIGIDVYIPMWMQGLLGMK-AAMGGFAITPMSLTWIIGSF 317 L N F+I + ++ G + G +P M+ + + A +G I P +++ II + Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311 Query: 318 IAGRVILKHPVKSILSSSLVIVGISGLMMVLAPMTTPFAFFLLVTAIIGIGMGITITTTT 377 I G ++ + +L+ + + +S L TT + +++ ++G +T Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVIST 371 Query: 378 VTAQSVVPQDQIGVATSSNTLFRILGQTVMVSVYGIVLN 416 + + S+ Q + G S L + +++ G +L+ Sbjct: 372 IVSSSLKQQ-EAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 43.7 bits (103), Expect = 9e-07 Identities = 53/364 (14%), Positives = 109/364 (29%), Gaps = 13/364 (3%) Query: 3 RDLWIVAIGMVLLYTGLSFIWPFNMLYMTENLGMSDTAAGTALLVN--SGIGIIGSVIGG 60 R L ++ + L G+ I P + + + +D A +L+ + + + + G Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64 Query: 61 IIFDRVSGYVSLAIGTGILVLTTGSLFLFHGHPAF--IYNIWAVSVAMGMVFAGLYTAAG 118 + DR L + L + P +Y V+ G A Sbjct: 65 ALSDRFGRRPVLLVS---LAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA 121 Query: 119 LTHPSGGRTG-FNTIYVAQNIGVAVGPFLAGFLAKDGLGNVYTGSFAFALIYALFFFVYF 177 R F + G+ GP L G + + + A + L Sbjct: 122 DITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181 Query: 178 RKIDWHSNKVTSETKHKQKGTVRGKATKIGLISFGLLLLTYLFCQLPHVQWQSNLSTYMT 237 + + + + G+ L+ + QL + + Sbjct: 182 PE---SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGE 238 Query: 238 SQKGVTTAQYGNLWSINGTLILIGQVLIIPLVARFKEKLSLQIYIGIGLFFCSFLFAMQA 297 + G + G L + Q +I VA + + +G+ ++ A Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRA-LMLGMIADGTGYILLAFA 297 Query: 298 ESYGGFLLGMILLTLGEMFAWPAIPAIAYKLAPVGQAGLYQGLVNGTATAARMIAPIFGA 357 M+LL G + PA+ A+ + + G QG + + ++ P+ Sbjct: 298 TRGWMAFPIMVLLASGGIGM-PALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFT 356 Query: 358 VVVA 361 + A Sbjct: 357 AIYA 360
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 49.2 bits (117), Expect = 7e-09 Identities = 39/185 (21%), Positives = 72/185 (38%), Gaps = 25/185 (13%) Query: 57 NPERVVVFDMGMLDTIDALGESDAVVGVA----------KDSLPKYLSKFDSDKVESAGG 106 +P R+V + ++ + ALG GVA + LP D V G Sbjct: 34 DPNRIVALEWLPVELLLALGIVP--YGVADTINYRLWVSEPPLP--------DSVIDVGL 83 Query: 107 IKEPDFEKINALKPDLIIISGRQSDSLDELKKIAPTLSLE--IDSKDLWESINKNVSTIG 164 EP+ E + +KP ++ S S + L +IAP + L + K+++ + Sbjct: 84 RTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMA-RKSLTEMA 142 Query: 165 TIFDKSDEAKKKLDALSEKIDVLNKKNTGSDMK--TLTVLLNEGSLSAYGKGSRFAILND 222 + + A+ L + I + + + LT L++ + +G S F + D Sbjct: 143 DLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILD 202 Query: 223 VFGFP 227 +G P Sbjct: 203 EYGIP 207
>SECA#SecA protein signature. Length = 901 Score = 30.6 bits (69), Expect = 0.023 Identities = 27/99 (27%), Positives = 41/99 (41%), Gaps = 10/99 (10%) Query: 261 AQKRVVNEICSDLRQSLHMHRLLQGDV-----GSGKTIVAAIALFATVNAGFQGALMVPT 315 A KRV D+ Q L L + + G GKT+ A + A +NA + V T Sbjct: 74 ASKRVFGMRHFDV-QLLGGMVLNERCIAEMRTGEGKTLTATLP--AYLNALTGKGVHVVT 130 Query: 316 --GILAEQHMESLDQLFDPLEVKVALLTGATKTKERREI 352 LA++ E+ LF+ L + V + +RE Sbjct: 131 VNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREA 169
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 51.2 bits (122), Expect = 2e-08 Identities = 50/318 (15%), Positives = 98/318 (30%), Gaps = 32/318 (10%) Query: 165 AGVLKYKTRKKKAEQKLFETE-DNLNRVQDIVYELEDQIEPLREQSSIAKDYVMQKEQLS 223 G KYK R L+ E + N+ D I +Q + S Sbjct: 964 LGAWKYKLRNVNGRYDLYNPEVEKRNQTVD-----TTNITTPNN---------IQADVPS 1009 Query: 224 EVEIALTVVEVEMLKEKWLANKNQAETLATEISEARQELQTAETTVADLREKRQKMDAQL 283 + V+ A +ET T ++QE +T E D E + Sbjct: 1010 VPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVA 1069 Query: 284 DESQARLVELVKTYEQTEAQKKVLSERSKNTKENREQFEQSKAKLEVKIQELDQQLADLT 343 E+++ + +T E ++ + ++ TKE ++ KAK+E E Q++ +T Sbjct: 1070 KEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVET---EKTQEVPKVT 1126 Query: 344 KDLTEKQAHERELRGTLAAAEKEQKMFNQNSSVTVESLRDDYVDLMQKQTTLRNEQGYLE 403 ++ KQ ++ A + N + Q QT + Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDPTVN--------------IKEPQSQTNTTADTEQPA 1172 Query: 404 KTFLQASQKNMKSDATVRALESDSALAKAKVQEKQLELSTVQKNLATKLLGHQEIQADLQ 463 K ++ + TV S + + + K + +++ Sbjct: 1173 KETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPH 1232 Query: 464 KNRYDLETNEQKMYEALR 481 ++ + AL Sbjct: 1233 NVEPATTSSNDRSTVALC 1250
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 28.6 bits (64), Expect = 0.032 Identities = 10/49 (20%), Positives = 20/49 (40%), Gaps = 17/49 (34%) Query: 106 LVSWLLGNIASIDELKKEASNINVVSAKNNLLDVVVPLHWIVADQTGSS 154 L+ W++ + + L+ V+P W +AD+TG+ Sbjct: 203 LLQWMVDDRVA-----------------GPLIRSVLPAGWFIADKTGAG 234
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 32.9 bits (75), Expect = 0.002 Identities = 8/34 (23%), Positives = 16/34 (47%) Query: 282 IVALLIFLLIASCCIVIIGLLIRRKKKIEREKKL 315 AL I LL++S +G+ + + + K + Sbjct: 228 WQALPIVLLLSSLVGAFMGIGLILLRNHHQSKPI 261
>PF05043#Transcriptional activator Length = 493 Score = 51.5 bits (123), Expect = 4e-09 Identities = 63/344 (18%), Positives = 141/344 (40%), Gaps = 19/344 (5%) Query: 1 MNFFFNRRTLRKILAFYFIVENNGNVDLNNVSEYLKCTVRTTKDVLDELENDVKQWDSEV 60 M ++++ R++ + E+ + ++E L CT R KD L + K ++ Sbjct: 1 MRDLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHV----KSAFPDL 56 Query: 61 YLQQKSDGYLYVYIPSDLSIHGIYLYYLEDNINFNWMKQYFFENTVEINEYALDHYVSYS 120 ++G + + D I +Y ++ + + +F+ ++ FF + + Y+S S Sbjct: 57 IFHSSTNG-IRIINTDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSS 115 Query: 121 TMYKNMKVIDRNILSRYKLQFNTNVNATILGDEKQKRLFFFDFFWYSYSGLKWPFYNVEK 180 ++Y+ + I++ I +++ + + I+G+E+ R FF +F Y L+WPF N Sbjct: 116 SLYRIISQINKVIKRQFQFEVSLTPV-QIIGNERDIRYFFAQYFSEKYYFLEWPFENFSS 174 Query: 181 KKFDIFFKYIEKIRVRNIGLSEKEILRYYFAIIFHRIEIGETC--DESILKNQLLDDSKH 238 + + + K + LS +L+ +RI+ G D+ +Q LD Sbjct: 175 EPLSQLLELVYKETSFPMNLSTHRMLKLLLVTNLYRIKFGHFMEVDKDSFNDQSLDFLMQ 234 Query: 239 YTILKESLFPIYKSIFPKLMKKDIESEIAFLYLTIFGLEFHLEDNAIVSEMLVYVQTHDI 298 ++ + +S + E + L+++ + + + E L Sbjct: 235 AEGIEG----VAQSFESEYNISLDEEVVCQLFVS------YFQKMFFIDESLFMKCVKKD 284 Query: 299 NVVEYTNFWMKEFFLYFDIKLNAREYSILYSNLIHIHSRASLFE 342 + VE + + +F +K E + + H+H+ A L+ Sbjct: 285 SYVEKSYHLLSDFIDQISVKYQI-EIENKDNLIWHLHNTAHLYR 327
>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature. Length = 538 Score = 27.7 bits (61), Expect = 0.016 Identities = 22/81 (27%), Positives = 42/81 (51%), Gaps = 6/81 (7%) Query: 53 KNYEERKNVRIDEVKQQLAGLVDTFEVSILIGDPASEIIKHVKKNNYDLLIMGSRGLNIL 112 K+ ++R ++ E+K+++ +T + S L +SE +K + ++M S + I Sbjct: 440 KSGKDRTGMQDAEIKREIIRKHETGQFSQLNSKLSSE-----EKRLFSTILMNSGNMEI- 493 Query: 113 QEFVMGSVSHKVMKYVPIPVL 133 QE G +KVMK +P+ L Sbjct: 494 QEMNTGVPGNKVMKKLPLSSL 514
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 53.9 bits (129), Expect = 4e-11 Identities = 48/191 (25%), Positives = 84/191 (43%), Gaps = 21/191 (10%) Query: 3 DKSHSVLLVVDVQKAFNDVSWGERSNQTAE--SHIAELITLFRQNEIDVIHIKHQSN-NP 59 D + +VLL+ D+Q F D ++ ++ E ++I +L Q I V++ + NP Sbjct: 27 DPNRAVLLIHDMQNYFVD-AFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQNP 85 Query: 60 E-----SLFY-------PEHITSEFKSEATPLKNELILTKTVNSAFIGTNLEEILHEKGI 107 + + F+ P + +E P ++L+LTK SAF TNL E++ ++G Sbjct: 86 DDRALLTDFWGPGLNSGPYE--EKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGR 143 Query: 108 TKLYIVGLTTPHCISTTTRMAANLGFKCYLVEDATASFELIGH-TGIKYSANEVQELTVV 166 +L I G+ T A K + V DA A F L H ++Y+A V Sbjct: 144 DQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCA--FTV 201 Query: 167 TLNEEFAEILS 177 + ++ + Sbjct: 202 MTDSLLDQLQN 212
>TETREPRESSOR#Tetracycline repressor protein signature. Length = 218 Score = 28.3 bits (63), Expect = 0.009 Identities = 13/38 (34%), Positives = 23/38 (60%) Query: 26 LKQHRINEAQLKILTELKIEALSLKKLALSLTTDKSTL 63 L + + +A L++L E I+ L+ +KLA L ++ TL Sbjct: 4 LNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTL 41
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 31.6 bits (71), Expect = 0.011 Identities = 36/245 (14%), Positives = 80/245 (32%), Gaps = 31/245 (12%) Query: 300 LQSGKQIVYEQAMSAQNEVLEEEKRISSLKTDIESEKSYISRLQQEKESLRNKYISIRDN 359 L ++ ++A +A+ E E+R + +IE EK+ R + E+ + ++ + Sbjct: 132 LAKAEEKARKEAEAAEKAFQEAEQR----RKEIEREKAETERQLKLAEAEEKRLAALSE- 186 Query: 360 NFPDFDEHKTTCQFCNQDLPVEQQATIKETYQKEREAFNLNRASELEQINEQGTSLSKEE 419 E K VE Q E + + +++ + E Sbjct: 187 ------EAKA----------VEIAQKKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEM 230 Query: 420 EVHEELLQDLKNQAADTTELDKRIKVRDDLKSQHTAIIKQIEAIQNNATPFTETEQYKKK 479 + +L +A ELD+ +K + EA + E+ +K+ Sbjct: 231 KTLAGKRNELAQASAKYKELDELVKKLSPRANDPLQNRPFFEATRRRVGAGKIREEKQKQ 290 Query: 480 ITEIEAIKQEIITIQTGDDKELQSQKEVISNIKLSINQLNEQLYEYVLSEKQEARKQELI 539 +T E I T K + + +++ +E+ + Q + Sbjct: 291 VTASETRINRINADITQIQKAISQVSNNRNAGIARVHE----------AEENLKKAQNNL 340 Query: 540 EEEKL 544 ++ Sbjct: 341 LNSQI 345
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 34.2 bits (78), Expect = 7e-05 Identities = 20/80 (25%), Positives = 35/80 (43%), Gaps = 14/80 (17%) Query: 5 RGFTLIETILILSII-----MVLFGLPTVIANKTYEKVQKMLFFEAFQSHLLATQNYALL 59 RGFTL+E +LIL ++ MVL P + + + + F++ L Q L Sbjct: 4 RGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLAR------FEAQLRFVQQRGLQ 57 Query: 60 ANKKTSLTIFKSGTVRYQVL 79 + +++ R+Q L Sbjct: 58 TGQFFGVSVHPD---RWQFL 74
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 46.4 bits (110), Expect = 9e-10 Identities = 21/69 (30%), Positives = 41/69 (59%), Gaps = 2/69 (2%) Query: 16 KGFTLVEMILVLFVISVLLILVIPNVVQQKKKIDNQGTEALMTVIETQIELFLLE--KEP 73 +GFTL+E+++V+ +I VL LV+PN++ K+K D Q + + +E ++++ L+ P Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHYP 67 Query: 74 GIEVSFAAL 82 +L Sbjct: 68 TTNQGLESL 76
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 74.5 bits (183), Expect = 7e-17 Identities = 57/344 (16%), Positives = 140/344 (40%), Gaps = 11/344 (3%) Query: 16 KKIKETQQSFFLTKLAVLVAEGFSLKESLLFLKIML--PKQAVWLNQALNQLEEGQEFFQ 73 ++ + + +LA LVA L+E+L + P + + +++ EG Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122 Query: 74 VLNQLG--FSERISSQVYLAQIHGQFSQVLADSGAFLEANGKRKKKLKQLLQYPMLLVIF 131 + F + V + G VL + E + + +++Q + YP +L + Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182 Query: 132 MFGILFGIRLLLLPHFNDLVQQNGS----FTSLVSGIAIGLIYYFPYVFMGLLFSLLIIK 187 ++ + +++P + T ++ G++ + + P++ + LL + + Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242 Query: 188 ISLTNYFKKQTAIAKLNFIVALPFFGNLIKLYYTYYFSYEWAQLVKSGYSMLRIIEVMKA 247 + L +++ ++ ++ LP G + + T ++ + L S +L+ + + Sbjct: 243 VMLR---QEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGD 299 Query: 248 KETTKIMQEVANEMEKGMKNGIGLHVVMKQLPFLKTELGAIIFHGELTSQLASELNLYGQ 307 + + + ++ G+ LH ++Q + +I GE + +L S L Sbjct: 300 VMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAAD 359 Query: 308 ICQNEFVQKIEKFMGWIQPLVFILVAFFILCIYLALLLPMFTMM 351 EF ++ +G +PL+ + +A +L I LA+L P+ + Sbjct: 360 NQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLN 403
>PF06580#Sensor histidine kinase Length = 349 Score = 29.1 bits (65), Expect = 0.035 Identities = 19/102 (18%), Positives = 34/102 (33%), Gaps = 25/102 (24%) Query: 364 LLANAIK--FTQKY--GEISISLNKVGNNVEIKVKDNGIGINEEEVEHIFDRFYMADPAR 419 L+ N IK Q G+I + K V ++V++ G + E Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE------------- 309 Query: 420 SSSQGGQGIGLAIVKSIVEAHSG---SIRVESNIGTGSCFFI 458 G GL V+ ++ G I++ G + + Sbjct: 310 -----STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVL 346
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 92.2 bits (229), Expect = 9e-24 Identities = 30/117 (25%), Positives = 56/117 (47%), Gaps = 1/117 (0%) Query: 2 KILIVDDEPKILEIVDAYLISKNYSVYKATSGKEALEKYHFISPDLVILDLMLPDISGLD 61 IL+ DD+ I +++ L Y V ++ DLV+ D+++PD + D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VCETIRKE-TETPIIMLTAKSGEEDILKGLALGADDYIVKPFSPKELVARVETVLRR 117 + I+K + P+++++A++ +K GA DY+ KPF EL+ + L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 30.6 bits (69), Expect = 0.012 Identities = 17/157 (10%), Positives = 45/157 (28%), Gaps = 5/157 (3%) Query: 64 GMAIPQKETKTYLDPTLGNLNELYVTEGQSIDIGTPLISYQDDKIQEQINEQARGIERVK 123 G +K + E+ V EG+S+ G L+ + + + + + Sbjct: 88 GKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQAR 147 Query: 124 TSIANTQERNGEAQKKKETAISNITTTKAMINQLEQSSEPEDLAKVQQYTQEIAKYEGIV 183 Q + + K + + SE E L ++ + ++ Sbjct: 148 LEQTRYQILSRSIELNKLPELKLPDEPY-----FQNVSEEEVLRLTSLIKEQFSTWQNQK 202 Query: 184 EGQEAQIETLETALQDAQADLTERQAAVDQLQQKVTS 220 +E ++ A + + + ++ Sbjct: 203 YQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDD 239
>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature. Length = 522 Score = 33.2 bits (75), Expect = 0.004 Identities = 27/101 (26%), Positives = 48/101 (47%), Gaps = 3/101 (2%) Query: 466 PRRMNKMYKQAEKTRKDDKKARK--NDAKNKKKEEQEKNNKKDGGSNKNGLNEQKNRNNK 523 P+ + + K EK ++ ++A+K D + K+KEE+ KN N Q NNK Sbjct: 138 PKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANLENLTNAMSNPQNLSNNK 197 Query: 524 DKSD-SRKPKENENKDNNHAPDLNKRNKNDTNKPNSNENKK 563 + S+ ++ +ENE D+ ++ + + K NKK Sbjct: 198 NLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKK 238
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 29.1 bits (65), Expect = 0.029 Identities = 17/49 (34%), Positives = 20/49 (40%), Gaps = 6/49 (12%) Query: 256 LDSNQMLIFSPNKLFNHYISNVLPELGEKNMIQ--TTFLDFAQSRISGL 302 +D ML+F PN LF +L E G N Q T F I L Sbjct: 183 IDPRHMLVFGPNSLF----QEILDEYGIPNAWQGETNFWGSTAVSIDRL 227
>PF05043#Transcriptional activator Length = 493 Score = 60.0 bits (145), Expect = 9e-12 Identities = 50/228 (21%), Positives = 92/228 (40%), Gaps = 14/228 (6%) Query: 1 MNVFLERISLRKMYLLSLLDSEKRGFSIKELEQKLGHNSKTITKMVQSLKIELAPWQNSI 60 M L + S R++ LL LL KR F EL + L + + + +K Sbjct: 1 MRDLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAF-----PD 55 Query: 61 TLVTNNDRTLSLKKKASFSLETINLYYLKESFIFKACDAIFNEEFIDIATFSSANYISYS 120 + ++ + + +E + ++ K S F + IF E + YIS S Sbjct: 56 LIFHSSTNGIRIINTDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSS 115 Query: 121 TLYGRLNEIKPLLEH-YSIEFKANNMASFEGEEKQIRYFFYHFYWSTHWGMEWPFQKIDK 179 +LY +++I +++ + E + G E+ IRYFF ++ ++ +EWPF+ Sbjct: 116 SLYRIISQINKVIKRQFQFEVSLTPV-QIIGNERDIRYFFAQYFSEKYYFLEWPFENFSS 174 Query: 180 N---QFCEIIKRIEGLRKTTTLYISEQESIAFWLGVITTRINLGHTIE 224 Q E++ + + +S + L RI GH +E Sbjct: 175 EPLSQLLELVYKETSFP----MNLSTHRMLKLLLVTNLYRIKFGHFME 218
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 53.1 bits (127), Expect = 2e-10 Identities = 60/251 (23%), Positives = 94/251 (37%), Gaps = 35/251 (13%) Query: 18 IAWGCAKAMNDCGATVI---YTYQNDRVKKQLEKLVGTEANLVECDVATDEQVEEAFNQI 74 I A+ + GA + Y + K A DV ++E +I Sbjct: 20 IGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAAIDEITARI 79 Query: 75 HKTYGTIDGLVHSIAFARKEELGGNVFDSTREGFAIAHDISSYSLLLVSRYASKIM--NP 132 + G ID LV+ R G + + E + ++S + SR SK M Sbjct: 80 EREMGPIDILVNVAGVLRP----GLIHSLSDEEWEATFSVNSTGVFNASRSVSKYMMDRR 135 Query: 133 GGSIITMTYIGSERA------IANYNIMGLAKASLEAAVRYLALDLAKQDIRVNAISSGA 186 GSI+T +GS A +A Y +KA+ + L L+LA+ +IR N +S G+ Sbjct: 136 SGSIVT---VGSNPAGVPRTSMAAY---ASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189 Query: 187 IKTL-----------AASGIKGFNALLDEQAARTPSGKQVTTEEVGNTAAFLMSDMSRGI 235 +T A IKG L+ P K ++ + FL+S + I Sbjct: 190 TETDMQWSLWADENGAEQVIKGS---LETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246 Query: 236 VGEIIYVDKGT 246 + VD G Sbjct: 247 TMHNLCVDGGA 257
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.2 bits (70), Expect = 0.004 Identities = 11/46 (23%), Positives = 17/46 (36%), Gaps = 12/46 (26%) Query: 36 GRNGIGKSTLLRSLAQMEPIQKGEILWEGNWLTKADVSFVNLSDYY 81 G GIGKSTL+ +L ++ + + D Y Sbjct: 603 GTGGIGKSTLINTLVGLD------------FFSDTHFDIGTGKDSY 636
>ENTEROTOXINA#Heat-labile enterotoxin A chain signature. Length = 258 Score = 29.2 bits (65), Expect = 0.024 Identities = 9/39 (23%), Positives = 18/39 (46%) Query: 183 GIPPNIMEWSEKSWLKYHIKYCVDNKKYFVYPNCSLSTN 221 G PP+ W E+ W+ + + C ++ + C+ T Sbjct: 184 GFPPDHQAWREEPWIHHAPQGCGNSSRTITGDTCNEETQ 222
>BINARYTOXINA#Clostridial binary toxin A signature. Length = 454 Score = 28.5 bits (63), Expect = 0.010 Identities = 25/114 (21%), Positives = 50/114 (43%), Gaps = 7/114 (6%) Query: 6 PDVDVEATKKNARRVLRQYSRLEREAGKNYSQRLTVEISDMPRGSASIKSTPIEDMVTKK 65 + ++ KK A RV + LE+EA + Y + + +IS+ + IE Sbjct: 54 KENAIQWEKKEAERVEKNLDTLEKEALELYKKD-SEQISNYSQTRQYFYDYQIES----- 107 Query: 66 VTAEKKVWEILEAIYLLPRLSKEILWYSYIDKDHWSVTKIARALDYSDKAIEKY 119 EK+ + AI ++ K I Y + + ++ K R + ++ ++EK+ Sbjct: 108 NPREKEYKNLRNAI-SKNKIDKPINVYYFESPEKFAFNKEIRTENQNEISLEKF 160
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 66.2 bits (161), Expect = 1e-15 Identities = 30/204 (14%), Positives = 75/204 (36%), Gaps = 20/204 (9%) Query: 6 KKTDLRILRSKKMIFEAFVKLVKLKGYEAVTIQDIATEAMINRATFYAHFKDKNDLYDEV 65 +KT +++ I + ++L +G + ++ +IA A + R Y HFKDK+DL+ E+ Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62 Query: 66 FSYALDTFTKI---LDSELLENGNQIQINKLEHMITEIFYVIRENRIFYLILTEGNSANS 122 + + ++ ++ + + L H++ R+ I+ Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLEST-VTEERRRLLMEIIFHKCEFVG 121 Query: 123 LKKKVHTLIEQRYAEIFNQLK-----------ITEN-DVEVPIDFIIDYMSSIFISMVHW 170 V E +++++ + + + Y+S + + Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMEN---- 177 Query: 171 WLTSKSDFPPEQMAHLLIKLVGNG 194 WL + F ++ A + ++ Sbjct: 178 WLFAPQSFDLKKEARDYVAILLEM 201
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 59.8 bits (145), Expect = 2e-12 Identities = 53/315 (16%), Positives = 97/315 (30%), Gaps = 55/315 (17%) Query: 1 MKVLITGGNGQLGTELTRLLDEANIDYITTDA------------------------KSMD 36 MK L+TG G +G +++ L EA + D +D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 37 ITDEKIVNQIIQKIKPNIIYHCAAYTAVDKAEDEGKSLNQLINVDGTRYVAKAAEKIG-A 95 + D + + + ++ AV + + + N+ G + + Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADS-NLTGFLNILEGCRHNKIQ 119 Query: 96 TIVYISTDYVFEGNKKENYTVDDSPN-PRNEYGRAKYEGELEIQKYASKYYI----IRTS 150 ++Y S+ V+ N+K ++ DDS + P + Y K EL Y+ Y + +R Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179 Query: 151 WVYGEFG----ANFVYTMQRLAKSNPVL----------TVVSD--------QLGRPTWTR 188 VYG +G A F +T L + + T + D Q P Sbjct: 180 TVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADT 239 Query: 189 NLAEFMLFITEKKADYGIYHFSNDETCSWYEFASEILKNTDTKIMPISSEEFPQKAKRPQ 248 A Y +Y+ N ++ + + P Sbjct: 240 QWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLETS 299 Query: 249 HSILDLRKTKELGFK 263 L + +GF Sbjct: 300 ADTKALY--EVIGFT 312
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 174 bits (444), Expect = 2e-54 Identities = 73/341 (21%), Positives = 137/341 (40%), Gaps = 38/341 (11%) Query: 1 MNVLVTGGAGFIGSNYVHYMLENHPDYNIINLDLLTYAGNIHNLDDV---------IDNP 51 M LVTG AGFIG + +LE + ++ +D N+++ DV + P Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEA--GHQVVGID------NLNDYYDVSLKQARLELLAQP 52 Query: 52 NHVFVEGNICNRELVRNLVKTYGITHFVNFAAESHVDRSILNPEIFVETNIQGTLALLDV 111 F + ++ +RE + +L + V S+ NP + ++N+ G L +L+ Sbjct: 53 GFQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEG 112 Query: 112 AKELSIEKYLQVSTDEVYGSLGAEGYFTEETPLA-PNSPYSASKTGADLLVRAYYETYDM 170 + I+ L S+ VYG L + F+ + + P S Y+A+K +L+ Y Y + Sbjct: 113 CRHNKIQHLLYASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL 171 Query: 171 NVNITRCSNNYGPYHFPEKLIPLMISNGMDNKELPIYGDGLNIRDWLHVQDHCQAIDLVL 230 R YGP+ P+ + ++ K + +Y G RD+ ++ D +AI + Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231 Query: 231 HKGRK------------------GEVYNVGGHNERTNNEIVDIVIEKLGLSRDLIKYVDD 272 VYN+G + + + + + LG+ + Sbjct: 232 DVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK-KNMLPL 290 Query: 273 RLGHDKRYAIDPTKLETELGWKPKYTFDTGIVETIEWYQAN 313 + G + D L +G+ P+ T G+ + WY+ Sbjct: 291 QPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 31.7 bits (72), Expect = 0.008 Identities = 37/222 (16%), Positives = 75/222 (33%), Gaps = 31/222 (13%) Query: 1 MIIFVLIVFLWSLIGEIWITYLLTIIIAFSFGIANMIKINLRFEP------IYPEELKMA 54 +F+ + F G I+ + +TI+ A + + + L P + P + Sbjct: 450 SAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVL----VALILTPALCATLLKPVSAEHH 505 Query: 55 GNPGDLFSFF------SLEQYNLSVGKIVILILLSITVVVALIFISHKLVKKIFKVSVKY 108 N G F +F S+ Y SVGKI+ + + ++ L ++ + Sbjct: 506 ENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPE 565 Query: 109 LDRKILLIRGILLVISSFLL-------LTVYNFNQPGNEVKKIVDKYAI-WSDNSQNSTY 160 D+ + L L ++ +T Y V+ + +S +QN+ Sbjct: 566 EDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAG- 624 Query: 161 HENGFVIGFIYNFPVAVISKPSNYSEESIKKIMDKYMVRADA 202 + F+ P + N +E I + + D Sbjct: 625 ------MAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 50.1 bits (119), Expect = 3e-08 Identities = 42/156 (26%), Positives = 71/156 (45%), Gaps = 19/156 (12%) Query: 129 EFIMQVSENAKLLASQNDLYASVMIAQSILESAHGTSVLGKI---PVNNLFGIK--GRYN 183 F+ Q+S A+L + Q+ + +++AQ+ LES G + + P NLFG+K G + Sbjct: 151 AFLAQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWK 210 Query: 184 NQFFEKESLEQLPDGTWVTKKSEFRKYESWEKSQLDYVEKIKKGPNANAGDNSWNPSYYA 243 E + E +G K++FR Y S+ ++ DYV + + P A + Sbjct: 211 GPVTEITTTE-YENGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRYAAVTTA------- 262 Query: 244 GAWRSNTSSYRDATAALVGKYASDKTYDSKLNQIIE 279 S+ + A A YA+D Y KL +I+ Sbjct: 263 ------ASAEQGAQALQDAGYATDPHYARKLTNMIQ 292
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 163 bits (414), Expect = 6e-50 Identities = 81/344 (23%), Positives = 144/344 (41%), Gaps = 42/344 (12%) Query: 1 MTVLVLGGAGYIGSHAVDQLITKGYDVAVVDNLKTGHKESLSDK---------ARFYQGD 51 M LV G AG+IG H +L+ G+ V +DNL + SL +F++ D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 52 IRDKAFMEDVFTKENIEGVIHFAASSLVGESMEIPLDYFNNNVYGTQVVLEVMEKYNVKS 111 + D+ M D+F + E V V S+E P Y ++N+ G +LE ++ Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 112 IIFSSSAATYGEPKVIPI-EETAATNPESTYGETKLMMEKMLKWCDKAYGMRFVALRYFN 170 ++++SS++ YG + +P + + +P S Y TK E M YG+ LR+F Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180 Query: 171 VAGAKLDGTIGEDHNPESHLLPIILQTALGQREKFTIYGEDYETPDGTCIRDYVHVVDLI 230 V G P+ L G+ +Y G RD+ ++ D+ Sbjct: 181 VYGPWGR--------PDMALFKFTKAMLEGKS--IDVYN------YGKMKRDFTYIDDIA 224 Query: 231 DAHILALEYLQAGNSSNT---------------FNLGSSTGFSVKQMLEAAREVTGKEIP 275 +A I + + ++ T +N+G+S+ + ++A + G E Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK 284 Query: 276 ATVVSRRAGDPSTLIAASDKAREVLGWKPQYTDVNKIIESAWNW 319 ++ + GD A + EV+G+ P+ T V +++ NW Sbjct: 285 KNMLPLQPGDVLETSADTKALYEVIGFTPE-TTVKDGVKNFVNW 327
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.6 bits (71), Expect = 0.013 Identities = 17/61 (27%), Positives = 24/61 (39%), Gaps = 6/61 (9%) Query: 355 KLDAV-ALVGPNGIGKSTLLKSLVD-----DIPLIQGEKRFGANVEVGYYDQEQANLNST 408 K D L G GIGKSTL+ +LV D G + G E + + + Sbjct: 594 KFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAF 653 Query: 409 K 409 + Sbjct: 654 R 654
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 38.4 bits (89), Expect = 4e-06 Identities = 21/84 (25%), Positives = 37/84 (44%), Gaps = 5/84 (5%) Query: 68 ELLGFIGYRQLFDE-VELTNIAVHPKVQGQGLSQAFL---VQWIKKLHQARVVHLEVRKS 123 +G I R ++ + +IAV + +G+ A L ++W K+ H ++ LE + Sbjct: 75 NCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLM-LETQDI 133 Query: 124 NQVAIHVYKKVGFKLINIRQDYYD 147 N A H Y K F + + Y Sbjct: 134 NISACHFYAKHHFIIGAVDTMLYS 157
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 40.3 bits (94), Expect = 1e-06 Identities = 26/101 (25%), Positives = 45/101 (44%), Gaps = 6/101 (5%) Query: 82 MKNNRNALYLVLRVYDVAIGFI---GSWFVEGEAHVTNIAIIPNYRRYGLASFLMEQMRH 138 ++ A +L + + IG I +W G A + +IA+ +YR+ G+ + L+ + Sbjct: 60 VEEEGKAAFLY-YLENNCIGRIKIRSNW--NGYALIEDIAVAKDYRKKGVGTALLHKAIE 116 Query: 139 LAEADHNQLFSLEVRMSNTGAQELYRKLGFKDGKIKKAYYS 179 A+ +H LE + N A Y K F G + YS Sbjct: 117 WAKENHFCGLMLETQDINISACHFYAKHHFIIGAVDTMLYS 157
>PF05043#Transcriptional activator Length = 493 Score = 51.1 bits (122), Expect = 6e-09 Identities = 38/183 (20%), Positives = 85/183 (46%), Gaps = 6/183 (3%) Query: 6 DKIIYRKIQLLKVLDASNDYLKTVDLANFLDLSLKTVQKELESLIEDLKNSSYAVKLEKV 65 K +R+++LL++L + +LA L+ + + V+ +L + + Sbjct: 6 SKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHV-KSAFPDLIFHSSTNG 64 Query: 66 GNLYRFIKKSSVNMDLIYLDFKRESIYFYLMRKAVFRKTIK-EKKEIDYFYSASHLYKNK 124 + +++++Y F + S +F ++ F + + E +++ S+S LY+ Sbjct: 65 IRIINT---DDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSSSLYRII 121 Query: 125 RIFKTYLMN-YGLELDLSTLSIEGNEINIRFLYFQFFWENYRGVEWPFDTIDRQELIREI 183 + + E+ L+ + I GNE +IR+ + Q+F E Y +EWPF+ + L + + Sbjct: 122 SQINKVIKRQFQFEVSLTPVQIIGNERDIRYFFAQYFSEKYYFLEWPFENFSSEPLSQLL 181 Query: 184 EQL 186 E + Sbjct: 182 ELV 184
>PF05043#Transcriptional activator Length = 493 Score = 54.9 bits (132), Expect = 4e-10 Identities = 41/222 (18%), Positives = 92/222 (41%), Gaps = 15/222 (6%) Query: 1 MRNLLNREDGKKVSLFRFIEECTQQTASF--SAIMHELEISEFVLLRTAENLSKDIELND 58 MR+LL+++ +++ L +E + F S + L +E + + Sbjct: 1 MRDLLSKKSHRQLEL---LELLFEHKRWFHRSELAELLNCTERAVKDDLS------HVKS 51 Query: 59 LTPHFSLTISRTHKTITLKKDSDASISMLYIIYIKNSLSYNILVDILNGKFVSMTDFGEF 118 P + S T+ + D + + + K+S ++IL I + + Sbjct: 52 AFPDL-IFHSSTNGIRIINTDDSDIEMVYHHFF-KHSTHFSILEFIFFNEGCQAESICKE 109 Query: 119 NFVSYSAVHKKIQEVKKELAN-YQVRLS-SKYELVGDEIKIRMFFYHLYYPRFNQLNFPF 176 ++S S++++ I ++ K + +Q +S + +++G+E IR FF + ++ L +PF Sbjct: 110 FYISSSSLYRIISQINKVIKRQFQFEVSLTPVQIIGNERDIRYFFAQYFSEKYYFLEWPF 169 Query: 177 EAKYEKLAQQFIRLLENHLDHSIQETLKTKINFFLSVSLKRI 218 E + Q + L+ + + + L +L RI Sbjct: 170 ENFSSEPLSQLLELVYKETSFPMNLSTHRMLKLLLVTNLYRI 211
>PF05043#Transcriptional activator Length = 493 Score = 31.5 bits (71), Expect = 0.001 Identities = 14/61 (22%), Positives = 27/61 (44%), Gaps = 5/61 (8%) Query: 49 EQLFALGYSACFNSALEL--VMGQEKVSGKSQVTATVELLSDPSDNGFKLAVELDVAIEG 106 E++ + + F + + + V S V + LLSD D +++V+ + IE Sbjct: 255 EEVVCQLFVSYFQKMFFIDESLFMKCVKKDSYVEKSYHLLSDFID---QISVKYQIEIEN 311 Query: 107 K 107 K Sbjct: 312 K 312
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 49.2 bits (117), Expect = 1e-09 Identities = 30/185 (16%), Positives = 63/185 (34%), Gaps = 29/185 (15%) Query: 6 QSLLSKKWIIDSLLYLLKTKPYSEITITEITKKAGVARLTFYRNFESKDQILI------- 58 ++ +++ I+D L L + S ++ EI K AGV R Y +F+ K + Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67 Query: 59 TRSNYLFQEYLDEIRQN--GKITSIQQALLQCFNNW-----------QRDSQVMELLIKN 105 + L EY + + + I +L+ + V E+ + Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127 Query: 106 DLIYLIEQPFYTFLEKIIEEI-------PDLDNLDNTQKIFVLGRVTRTMLDWISTKSTK 158 + Y +E+ ++ DL I + G ++ M +W+ + Sbjct: 128 QAQRNLCLESYDRIEQTLKHCIEAKMLPADLM--TRRAAIIMRGYISGLMENWLFAPQSF 185 Query: 159 SSTEI 163 + Sbjct: 186 DLKKE 190
>INTIMIN#Intimin signature. Length = 939 Score = 33.9 bits (77), Expect = 0.004 Identities = 31/128 (24%), Positives = 48/128 (37%), Gaps = 6/128 (4%) Query: 557 YPDRSTVGNSIGKILVEESLSTGNKVQYEYEVNFEVAAVPESIEVSPKIATIKTGETQQL 616 Y + + GK LV +S EV F + + + T G+ + Sbjct: 717 YAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIE-IVGTGVKGKLPTV 775 Query: 617 SAKV----LPENSVNTDLKWTSSNEELATVD-EEGIVFGKRRGEVEISVETTNGLTDRAT 671 + L + N W S+N +A+VD G V K +G ISV +++ T T Sbjct: 776 WLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVISSDNQTATYT 835 Query: 672 IQIVNIEI 679 I N I Sbjct: 836 IATPNSLI 843
>PF05043#Transcriptional activator Length = 493 Score = 43.0 bits (101), Expect = 2e-06 Identities = 21/108 (19%), Positives = 52/108 (48%), Gaps = 8/108 (7%) Query: 87 QTVISYICQKTLEFKILELFLYGNLKKVHQFLLEHGIGYTTYYRVLRKISGLLQK-YGIS 145 + V + + + F ILE + + E I ++ YR++ +I+ ++++ + Sbjct: 76 EMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSSSLYRIISQINKVIKRQFQFE 135 Query: 146 INTNSLELVGKESEIRLFYFQFLWTLCEGFG---WPFKNSDEKKIIER 190 ++ ++++G E +IR F+ Q+ E + WPF+N + + + Sbjct: 136 VSLTPVQIIGNERDIRYFFAQY---FSEKYYFLEWPFENFS-SEPLSQ 179
>PF05043#Transcriptional activator Length = 493 Score = 59.6 bits (144), Expect = 1e-11 Identities = 51/235 (21%), Positives = 104/235 (44%), Gaps = 12/235 (5%) Query: 6 FLEKQDIRKIGLFKFLEASYQQRATFSDISENLNISDFILLNTVDELTRDIEANQLTDCF 65 L K+ R++ L + L +++ S+++E LN ++ + + + + + D Sbjct: 4 LLSKKSHRQLELLELL-FEHKRWFHRSELAELLNCTERAVKDDLSHVK-----SAFPDLI 57 Query: 66 KLEKTEKHIILKKSGKASVQVLAWLYLKKSHSFKILDEIYRGTFTNIASYSESNFVSYTS 125 T I+ V + K S F IL+ I+ S + ++S +S Sbjct: 58 FHSSTNGIRIINTDDSDIEMVYHHFF-KHSTHFSILEFIFFNEGCQAESICKEFYISSSS 116 Query: 126 VYNRIQELKKILR-SYEIELS-SRFKLIGDEMKIRMYFYQVYYERFNRIEFPFEPKKKEV 183 +Y I ++ K+++ ++ E+S + ++IG+E IR +F Q + E++ +E+PFE E Sbjct: 117 LYRIISQINKVIKRQFQFEVSLTPVQIIGNERDIRYFFAQYFSEKYYFLEWPFENFSSEP 176 Query: 184 NMLFIQQLEEQLQHSFSEVDKAKLDFFLAVNLNRIQQGTDLHSSTVERKKINVQL 238 ++ + ++ + L L NL RI+ G H V++ N Q Sbjct: 177 LSQLLELVYKETSFPMNLSTHRMLKLLLVTNLYRIKFG---HFMEVDKDSFNDQS 228
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 42.0 bits (98), Expect = 3e-06 Identities = 24/124 (19%), Positives = 43/124 (34%), Gaps = 3/124 (2%) Query: 197 GLERGETAAQIITEIQAVEKRKKEREEQRKKEEEARIARELEQEQLRAKREE-EARLAAE 255 G E ET E VEK +K + E K +E ++ ++ +Q +++ + +A A E Sbjct: 1089 GSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARE 1148 Query: 256 RAELEKANEQQETYLEDDVLTEENYTEDPFVDVPEPIEEEVVVPVKEEEPVRTAIIEITG 315 E Q + E ++ +V +P+ E V Sbjct: 1149 NDPTVNIKEPQS--QTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPAT 1206 Query: 316 TNEQ 319 T Sbjct: 1207 TQPT 1210 Score = 32.7 bits (74), Expect = 0.002 Identities = 22/108 (20%), Positives = 40/108 (37%), Gaps = 4/108 (3%) Query: 204 AAQIITEIQAVEKRKKEREEQRKKEEEARIARELEQEQLRAKREEEARLAAERAELEKAN 263 AQ +E + + + + +KEE+A++ E QE + + + +A Sbjct: 1085 VAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAE 1144 Query: 264 EQQE---TYLEDDVLTEENYTEDPFVDVPEPIEEEVVVPVKEEEPVRT 308 +E T + ++ N T D + V PV E V T Sbjct: 1145 PARENDPTVNIKEPQSQTNTTADT-EQPAKETSSNVEQPVTESTTVNT 1191
>cloacin#Cloacin signature. Length = 551 Score = 30.8 bits (69), Expect = 0.007 Identities = 20/63 (31%), Positives = 31/63 (49%), Gaps = 1/63 (1%) Query: 102 QILYVKAGNLKEKVNILPLNADLLAKQKQRADEEQRSLAETRAKKEAEDKLAAEAKAEED 161 Q+ +KA + VN D AK+K AD S E+R KKE + + +AE ++ Sbjct: 391 QMAGLKAQRAQTDVNNKQAAFDAAAKEKSDADAALSSAMESRKKKEDKKR-SAENNLNDE 449 Query: 162 RIK 164 + K Sbjct: 450 KNK 452
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 46.7 bits (111), Expect = 5e-08 Identities = 16/58 (27%), Positives = 29/58 (50%) Query: 210 SALEESELLSELKKIFSDQAEWRELVKALWETQGNISMAAKSLYVHRNTLQYRIDRFN 267 ++ ++ S L + E+ ++ AL T+GN AA L ++RNTL+ +I Sbjct: 417 ASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELG 474
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 37.9 bits (88), Expect = 2e-04 Identities = 36/173 (20%), Positives = 63/173 (36%), Gaps = 18/173 (10%) Query: 502 EQKESERLLNLEKVLHSRVVGQEDAVSAVSRAMRR-ARSGLKDPNRPIGSFMFLGPTGVG 560 E K L + +VG+ A+ + R + R ++ L + M G +G G Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDL--------TLMITGESGTG 172 Query: 561 KTELAKALAESMFGSEDALIRVDMSEYMEKYSTSRLIGSPPG-YVGYDEGGQLTEKIRQK 619 K +A+AL + + ++M+ S L G G + G + Q Sbjct: 173 KELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRST--GRFEQA 230 Query: 620 PYSVVLLDEVEKAHPDVFNILLQVLDDGHLT---DAKGRKVDFKNTILIMTSN 669 + LDE+ D LL+VL G T + D + ++ +N Sbjct: 231 EGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVR---IVAATN 280
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 42.3 bits (99), Expect = 3e-07 Identities = 15/58 (25%), Positives = 25/58 (43%) Query: 7 TKKVIAHSLKELMQLTAFQKISIRDIMGHADIRRQTFYYHFQDKYELLAWIYNQEASE 64 T++ I L S+ +I A + R Y+HF+DK +L + I+ S Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69
>PF05272#Virulence-associated E family protein Length = 892 Score = 42.0 bits (98), Expect = 3e-07 Identities = 12/55 (21%), Positives = 21/55 (38%), Gaps = 2/55 (3%) Query: 38 GPNGSGKSTILRLLAGVLLNTDGVISLENQDDNYVKWTKQNSIYVSSGERGLMSK 92 G G GKST++ L G+ +D + D+Y + + E + Sbjct: 603 GTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQI--AGIVAYELSEMTAFRR 655
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 27.5 bits (61), Expect = 0.032 Identities = 9/18 (50%), Positives = 14/18 (77%) Query: 106 VKIGDAVKKGDPLVKIDR 123 VK G++V+KGD L+K+ Sbjct: 112 VKEGESVRKGDVLLKLTA 129
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 29.0 bits (65), Expect = 0.023 Identities = 25/145 (17%), Positives = 45/145 (31%), Gaps = 25/145 (17%) Query: 10 NAVLVVDGTTEKVAIGKGIGFNKKKNDLVFDYDIEQLFIMENEQENFQQLLSQIDESYFF 69 + G VAI K + + N + I + E E L + E Sbjct: 6 TGIAASSG----VAIAKAF-IHLEPNVDIEKTSITDV---STEIEKLTAALEKSKEE--- 54 Query: 70 ASERIIEHAEHALKEKLNEHIHIALADHIAFAMDRLKNGIVVRNKLRKEIEVLYAEEFLI 129 + + + + A H+ D L I+ E + Sbjct: 55 -----LRAIKDQTEASMGADKAEIFAAHLLVLDDPE---------LVDGIKGKIENEQMN 100 Query: 130 AEWAIEYLSTQFGSVFTLDEAAYIA 154 AE+A++ +S F S+F + Y+ Sbjct: 101 AEYALKEVSDMFVSMFESMDNEYMK 125
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 94.9 bits (236), Expect = 1e-24 Identities = 29/117 (24%), Positives = 63/117 (53%), Gaps = 1/117 (0%) Query: 3 KILVVDDEKPISDIVKFNLTKEGYEVSTAYDGEEALKMVPEVEPDLIILDLMLPKIDGLE 62 ILV DD+ I ++ L++ GY+V + + + + DL++ D+++P + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 63 VCREVRKNY-DMPIIMVTAKDSEIDKVLGLELGADDYVTKPFSNRELVARVKANLRR 118 + ++K D+P+++++A+++ + + E GA DY+ KPF EL+ + L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>PF06580#Sensor histidine kinase Length = 349 Score = 30.2 bits (68), Expect = 0.026 Identities = 35/213 (16%), Positives = 64/213 (30%), Gaps = 55/213 (25%) Query: 408 IAPRFL--------AVTQEETDRMIRMITDLLNLSRMDAGKDTFELEYVNINELFSHVLN 459 I P F+ A+ E+ + M+T L L R V++ + + V + Sbjct: 170 INPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLRYS--NARQVSLADELTVVDS 227 Query: 460 RFDMMLQSADKPVKPFVIKRDFTKRDLSVEVDADKMIQ-------VLDNIMNNAIKY--- 509 + F R L E + I ++ ++ N IK+ Sbjct: 228 YLQLA-------------SIQFEDR-LQFENQINPAIMDVQVPPMLVQTLVENGIKHGIA 273 Query: 510 -SPSGGTITCRLMETHNNIVISIADEGLGVPKKDIPHVFDRFFRVDKARARSMGGTGLGL 568 P GG I + + + + + + + G K TG GL Sbjct: 274 QLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE------------------STGTGL 315 Query: 569 AISKEVVQKHGGKIWLESIENK--GSTFFISLP 599 +E +Q G + K + +P Sbjct: 316 QNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 54.2 bits (130), Expect = 5e-10 Identities = 31/161 (19%), Positives = 57/161 (35%), Gaps = 34/161 (21%) Query: 250 IVTNNHVIDGSDAIEVILK------------DGTKVEAKLIGADQWTDLAVLSIPADKVK 297 ++TN HV+D + LK +G ++ DLA++ ++ Sbjct: 114 LLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQN 173 Query: 298 -------TVATFGNSDDIKVGEPAIAIGSPLGTNFATSVTQGIVSAKDRSVAMDIDGDGV 350 AT N+ + +V + G P AT + + + G+ Sbjct: 174 KHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATM-------WESKGKITYLKGEA- 225 Query: 351 EDWDMTAIQTDAAINPGNSGGALINLAGQVIGINSMKISQD 391 +Q D + GNSG + N +VIGI+ + + Sbjct: 226 -------MQYDLSTTGGNSGSPVFNEKNEVIGIHWGGVPNE 259
>PF06580#Sensor histidine kinase Length = 349 Score = 37.5 bits (87), Expect = 8e-05 Identities = 18/94 (19%), Positives = 45/94 (47%), Gaps = 4/94 (4%) Query: 335 ILAIVIGNVIDNAIQASIRICPEDRHINIVIKQFNNDLLVEVSNNFNPEELSTRHHRKNE 394 + +++ +++N I+ I P+ I + + N + +EV N L+ ++ +++ Sbjct: 255 VPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT---GSLALKNTKEST 311 Query: 395 GFGMKNIDGLLQQI-GGIYRHWTEESKHFVTVVI 427 G G++N+ LQ + G + E + V ++ Sbjct: 312 GTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 62.9 bits (153), Expect = 1e-13 Identities = 20/120 (16%), Positives = 49/120 (40%), Gaps = 5/120 (4%) Query: 2 KVAICDDNPSLTEVINTMLTDYDPNMFETFTFYNPHNLINQLDIEKFDFFILDIEMDEMS 61 + + DD+ ++ V+N L+ ++ N L + D + D+ M + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAG---YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 62 GIDLAKNIRERGILSPIVFLTSYKEYMEEV--FQVQTFDYLLKPPTKDRMKQVLDKLNQH 119 DL I++ P++ +++ +M + + +DYL KP + ++ + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 25.8 bits (56), Expect = 0.015 Identities = 9/32 (28%), Positives = 14/32 (43%) Query: 6 SGPTTPPIVPENPATNEIDIAQVQGNLPTTGE 37 G P P N+ + + + LP+TGE Sbjct: 479 PGKGQAPQAGTKPNQNKAPMKETKRQLPSTGE 510
>PF06580#Sensor histidine kinase Length = 349 Score = 31.0 bits (70), Expect = 0.007 Identities = 34/179 (18%), Positives = 77/179 (43%), Gaps = 27/179 (15%) Query: 173 AIIIQESGRLTTLSSTILHLSKVENHEI-ISEKRAIQLDEQLR--QTILLLEPKWQKKRI 229 A+I+++ + + + LS++ + + S R + L ++L + L L + R+ Sbjct: 184 ALILEDPTKAREM---LTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRL 240 Query: 230 IWELELDDSILNSD--EDLLQQMWINLLDNAIKFSPENGVVKVKLMNLTDTVIVKITDQG 287 +E +++ +I++ L+Q + N + + I P+ G + +K TV +++ + G Sbjct: 241 QFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300 Query: 288 SGMSSETQQRLFDKFYQGDASHSKEGNGLGMSLVKNILRICDGE---IGLKSSLGNGSS 343 S ++KE G G+ V+ L++ G I L G ++ Sbjct: 301 SLA----------------LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNA 343
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 85.3 bits (211), Expect = 2e-21 Identities = 32/120 (26%), Positives = 58/120 (48%), Gaps = 3/120 (2%) Query: 3 TILIVEDDPHTRNLMEIILKNNGFQTVTATNGIEALDVLDKRMISLIILDIMIPEMDGYQ 62 TIL+ +DD R ++ L G+ +N + L++ D+++P+ + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 63 LTQNLREADFQLPILMVTAKETPSEKKKGFLVGTDDYMTKPVDEEEMIL---RILALLRR 119 L +++A LP+L+++A+ T K G DY+ KP D E+I R LA +R Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124
>PF05043#Transcriptional activator Length = 493 Score = 59.6 bits (144), Expect = 1e-11 Identities = 55/268 (20%), Positives = 102/268 (38%), Gaps = 30/268 (11%) Query: 1 MKKLLDKPFHLILKLLEHFYKKTPQETINYYSDFLNVDRRTILKIITDLERDIADCQWEN 60 M+ LL K H L+LLE ++ + ++ LN R + ++ ++ D Sbjct: 1 MRDLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPD----- 55 Query: 61 QLTLEVTETKIIATFSTNFSLENFYRYYMERSLCVELVQSIFKEAEISLDQIIENFFVSR 120 + T I + +E Y ++ + S +++ IF + I + F++S Sbjct: 56 LIFHSSTNGIRIINTDDS-DIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISS 114 Query: 121 TTFYRRITPLKEVL-AEFDLELDFTKKQFLIGEEKQIRYFFSVFFWEIFRSTGEYKHPDL 179 ++ YR I+ + +V+ +F E+ T Q +IG E+ IRYFF+ +F E + + Sbjct: 115 SSLYRIISQINKVIKRQFQFEVSLTPVQ-IIGNERDIRYFFAQYFSEKYY----FLEWPF 169 Query: 180 KDTEYLNRIKQDLNLSIPH---------FLYFQLYLNISLTRISQGYLVSEVIPYPIKEI 230 ++ + Q L L +L L +L RI G+ + Sbjct: 170 ENFS-SEPLSQLLELVYKETSFPMNLSTHRMLKLLLVTNLYRIKFGHFME----VDKDSF 224 Query: 231 NYSYAQFKNLTAPYFNKLMPAQQSLEIH 258 N F + QS E Sbjct: 225 NDQSLDF----LMQAEGIEGVAQSFESE 248
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 99 bits (249), Expect = 2e-26 Identities = 44/137 (32%), Positives = 70/137 (51%), Gaps = 2/137 (1%) Query: 2 KILVVDDDKEIVELLSIYIKNEGYEVEKAYNGKEAMTKIVTNPDIDLMVLDVMMPKMDGI 61 ILV DDD I +L+ + GY+V N + + D DL+V DV+MP + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLW-RWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 EVVKELRKE-SQMPVLMLSAKTTDMDKIQGLITGADDYVAKPFNPLEVMARIKSLLRRSN 120 +++ ++K +PVL++SA+ T M I+ GA DY+ KPF+ E++ I L Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 121 YQVTNDEPDILEIGPLV 137 + + E D + PLV Sbjct: 124 RRPSKLEDDSQDGMPLV 140
>PF06580#Sensor histidine kinase Length = 349 Score = 32.5 bits (74), Expect = 0.002 Identities = 10/44 (22%), Positives = 19/44 (43%), Gaps = 4/44 (9%) Query: 272 LISNALKYGVGGKK----ITIEAQKVGKEVIIAVNNDGPIIPEE 311 L+ N +K+G+ I ++ K V + V N G + + Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN 306
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 40.9 bits (96), Expect = 6e-06 Identities = 35/168 (20%), Positives = 56/168 (33%), Gaps = 23/168 (13%) Query: 45 FAIEPKTGKVLLNQNGDAQLGIASMTKMITEYLVLEAIKEGKLTWDQKLSIDDYSYNVSQ 104 ++ +G+ L D + + S K++ VL + G ++K+ Q Sbjct: 43 IEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHY-------RQ 95 Query: 105 NNELSNVPLRK---DSQYTVKELFEAMAIYSANAAAITLATAVSGSEPAFVDAMREKVKS 161 + + P+ + TV EL A S N+AA L V G +R Sbjct: 96 QDLVDYSPVSEKHLADGMTVGELCAAAITMSDNSAANLLLATVGGPA-GLTAFLR----Q 150 Query: 162 WGAKDFYLVNATGLTNSDLHGNIYPGSADTDENTMTARDMAIVAQHLL 209 G L N L G+ +T T MA + LL Sbjct: 151 IGDNVTRLDRWETELNEALPGDA--------RDTTTPASMAATLRKLL 190
>PF05043#Transcriptional activator Length = 493 Score = 31.8 bits (72), Expect = 0.004 Identities = 21/133 (15%), Positives = 47/133 (35%), Gaps = 18/133 (13%) Query: 10 ILEFLLMNKANVTNAVLASELDVSERTVRRDLHEVEAILDTFQLKLSKENSQLSIIGTEI 69 +LE L +K + LA L+ +ER V+ DL V++ + S N I + Sbjct: 15 LLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDL-IFHSSTNGIRIINTDDS 73 Query: 70 NRQNFKWQLLDLA-------------HNEFTPLERQNFI----LKTLLRETEPLKLMALA 112 + + + + + ++ +I L ++ + + Sbjct: 74 DIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSSSLYRIISQINKVIKRQFQ 133 Query: 113 TDLSVTISTISSD 125 ++S+T I + Sbjct: 134 FEVSLTPVQIIGN 146
>PF05043#Transcriptional activator Length = 493 Score = 38.0 bits (88), Expect = 1e-05 Identities = 33/147 (22%), Positives = 63/147 (42%), Gaps = 6/147 (4%) Query: 6 KTNMQLKLMNGFTFYQKLIFKGGLLNDVPLQFKQRTLFIINRLPEIIQRTFKKYFPHEDE 65 K N+ L N Y++ +F +L D + I + +++ Y + Sbjct: 312 KDNLIWHLHNTAHLYRQELFTEFILFDQKGNTIRNFQNIFPKFVSDVKKELSHYLETLEV 371 Query: 66 ----WMANYFIYIIITHWDTFIPNMLKKAPIIHVGIVVETDLEHALYLKNKLAYYYP--F 119 M N+ Y ITH + N+L+ P + V ++ D HA ++ L+YY F Sbjct: 372 CSSSMMVNHLSYTFITHTKHLVINLLQNQPKLKVLVMSNFDQYHAKFVAETLSYYCSNNF 431 Query: 120 NLDAMLIPDITTERIDNKKLDIILTTF 146 L+ +++ E +++ DII++ F Sbjct: 432 ELEVWTELELSKESLEDSPYDIIISNF 458
>PF05043#Transcriptional activator Length = 493 Score = 72.7 bits (178), Expect = 5e-17 Identities = 53/199 (26%), Positives = 92/199 (46%), Gaps = 2/199 (1%) Query: 24 IDIYTLADELFMSVRNLKKYIDDLNVLINPISIYFIDTNSVNIHYPDSLNYQHIYKSIYV 83 LA+ L + R +K + + P I+ TN + I D + + +Y + Sbjct: 26 FHRSELAELLNCTERAVKDDLSHVKSAF-PDLIFHSSTNGIRIINTDDSDIEMVYHHFFK 84 Query: 84 NNLNYSLLELLFLEENNTLETLEEHFFLSESTLRRTISFINQRL-APFDIIIDTKNFNII 142 ++ ++S+LE +F E E++ + F++S S+L R IS IN+ + F + II Sbjct: 85 HSTHFSILEFIFFNEGCQAESICKEFYISSSSLYRIISQINKVIKRQFQFEVSLTPVQII 144 Query: 143 GDEKNIIQFFVSYFQEKYTFQDIKLGNSLVQFLDYIYSDFTKFLNFPTNFPTKNRFIFWV 202 G+E++I FF YF EKY F + N + L + K +FP N T + Sbjct: 145 GNERDIRYFFAQYFSEKYYFLEWPFENFSSEPLSQLLELVYKETSFPMNLSTHRMLKLLL 204 Query: 203 GVGLKRIERNHSLPINNNS 221 L RI+ H + ++ +S Sbjct: 205 VTNLYRIKFGHFMEVDKDS 223
>BACINVASINB#Salmonella/Shigella invasin protein B signature. Length = 593 Score = 30.1 bits (67), Expect = 0.047 Identities = 28/113 (24%), Positives = 46/113 (40%), Gaps = 21/113 (18%) Query: 575 DVSKAGEHPIELTVADKAGNK---SEMIHSTLKIL-------EANKQLESKKQLELKSKD 624 D KA + + VA KAG+ ++ S + + A ++L S+ QL L Sbjct: 32 DFLKAADKAFKDVVATKAGDLKAGTKSGESAINTVGLKPPTDAAREKLSSEGQLTLLLGK 91 Query: 625 LIQKTTIEKNEFLLKQLAA--TAWEINAEAEKIDLTEQIIIQNSDEITENPGE 675 L+ + L QL + W+ E++K ++ IQ S E GE Sbjct: 92 LMTLL----GDVSLSQLESRLAVWQAMIESQK-----EMGIQVSKEFQTALGE 135
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 41.2 bits (96), Expect = 3e-06 Identities = 45/216 (20%), Positives = 74/216 (34%), Gaps = 41/216 (18%) Query: 32 ATPQQNKTQVVKKETKKDTVANSKKTA---KEKKESLESEKKNESSKESSKDSSKE--AD 86 A Q N+ ETK+ +K+TA KE+K +E+EK E K +S+ S K+ ++ Sbjct: 1078 ANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSE 1137 Query: 87 TEKKIADKNVEKATSTVIESADD----------------------VQEEVANSTTNNETK 124 T + A+ E + I+ V E +T N+ + Sbjct: 1138 TVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVE 1197 Query: 125 NPVVEHKEEPNAPPTTPS------------QPEPTQPTPENKPTPEPKKEVTFSIKGTAT 172 NP + S + P P + + + T T Sbjct: 1198 NPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNT 1257 Query: 173 NS--SSYFISPQKVEMKEGQSVMDVLSDYCRNNGIQ 206 N+ S Q V + G++V +S NN Q Sbjct: 1258 NAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQ 1293 Score = 40.0 bits (93), Expect = 8e-06 Identities = 27/152 (17%), Positives = 59/152 (38%), Gaps = 9/152 (5%) Query: 32 ATPQQNKTQVVKKETKKDTVANSKKTAKEKKESLESE----KKNESSKESSKDSSKEADT 87 + Q++KT ++ +T A +++ AKE K ++++ + +S E+ + + E Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102 Query: 88 EKKIADKNVEKATSTVIESADDVQEEVANSTTNNETKNPVVEHKEEPNA-----PPTTPS 142 + + K + + V +V+ +ET P E E + P + + Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQT 1162 Query: 143 QPEPTQPTPENKPTPEPKKEVTFSIKGTATNS 174 P + + ++ VT S NS Sbjct: 1163 NTTADTEQPAKETSSNVEQPVTESTTVNTGNS 1194 Score = 31.2 bits (70), Expect = 0.005 Identities = 32/163 (19%), Positives = 55/163 (33%), Gaps = 8/163 (4%) Query: 25 NNKYAHVATPQQNKTQVVKKETKKDTVANSKKTAKEKKESLESEKKNESSKESSKDSSKE 84 N + VA + ETK+ ++ AK + E K E K +S+ S K+ Sbjct: 1079 NTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETE-----KTQEVPKVTSQVSPKQ 1133 Query: 85 --ADTEKKIADKNVEKATSTVIESADDVQEEVANSTTNNETKNPVVEHKEEPNAPPTTPS 142 ++T + A+ E + I+ A++ + + VE + T + Sbjct: 1134 EQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGN 1193 Query: 143 QPEPTQPTPENKPTPEPKKEVTFSIKGTATNSSSYFISPQKVE 185 P T +P S K + S P VE Sbjct: 1194 SVVEN-PENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVE 1235 Score = 30.4 bits (68), Expect = 0.009 Identities = 35/187 (18%), Positives = 61/187 (32%), Gaps = 26/187 (13%) Query: 24 LNNKYAHVATPQQNKTQVVKKETKKDT-------VANSKKTAKEKKESLESEKKNESSKE 76 NN A V + N ++ + + ++ A+ K+ ++ +KNE Sbjct: 1000 PNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDAT 1059 Query: 77 SSKDSSKEADTEKKIADKNVEKATSTVIESADDVQEEVANSTTNNETKNPVVEHKEEPN- 135 + ++E E K NV+ T T E + T ETK KEE Sbjct: 1060 ETTAQNREVAKEAK---SNVKANTQT--NEVAQSGSETKETQTT-ETKETATVEKEEKAK 1113 Query: 136 ------------APPTTPSQPEPTQPTPENKPTPEPKKEVTFSIKGTATNSSSYFISPQK 183 +P Q + P+ +P E V + TN+++ P K Sbjct: 1114 VETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAK 1173 Query: 184 VEMKEGQ 190 + Sbjct: 1174 ETSSNVE 1180
>adhesinb#Adhesin B signature. Length = 310 Score = 27.5 bits (61), Expect = 0.015 Identities = 15/58 (25%), Positives = 24/58 (41%), Gaps = 1/58 (1%) Query: 1 MKKIIRLVLSIGVIALIVGCGNAAVTKEDSSSSKTKDVTVTVILKENHKEFDQKKIEV 58 MKK LVL + + C + + ++ SSK V I+ + K KI + Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQK-SSTETGSSKLNVVATNSIIADITKNIAGDKINL 57
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 37.3 bits (86), Expect = 7e-05 Identities = 26/129 (20%), Positives = 43/129 (33%), Gaps = 8/129 (6%) Query: 306 GGASIYDFVNNPVPQLIETPVDPDP---EVTDPEVTDPETTDPEVTDPETTEPEVTNPDP 362 GA + + V Q+IE P P + P + EPE P+P Sbjct: 25 HGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADL-EPPQAVQPPPEPVVEPE---PEP 80 Query: 363 GVTEPETTNPEKEKTPDPTPTEKPTVVEPVAVKEADKPANLIKAPISNTAENTKSDETNN 422 PE P P KP V++ + +++ ++ ENT + Sbjct: 81 EPI-PEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTS 139 Query: 423 FPKTGESSE 431 T +S+ Sbjct: 140 STATAATSK 148
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 43.9 bits (103), Expect = 1e-07 Identities = 45/181 (24%), Positives = 63/181 (34%), Gaps = 29/181 (16%) Query: 2 KKALIIGGNGTIGSAVSNALNDSYEIITA------------------GRTHGDVKVDITS 43 K A I G IG AV+ L I A R D+ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 44 VESI----TRLFEIVGEVDAVIITAGQAHFGALKDMTPQD--NLISVNSKLLGQVNTVLI 97 +I R+ +G +D ++ AG G + ++ ++ SVNS G N Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNS--TGVFNASRS 126 Query: 98 GTNYVKDH--GSFTLVTGIMMDDPILAGASAALANGGVKAFAKSAALEL-PRGIRINTVS 154 + Y+ D GS V P + A+ A + F K LEL IR N VS Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 155 P 155 P Sbjct: 187 P 187
>SECGEXPORT#Protein-export SecG membrane protein signature. Length = 110 Score = 41.1 bits (96), Expect = 2e-08 Identities = 25/76 (32%), Positives = 42/76 (55%), Gaps = 4/76 (5%) Query: 1 MYNALLLAMLVISVLLIIVITMQPTKTNSASSALTGGAE-QLFGKQKARGFEAVLQRVTV 59 MY ALL+ L++++ L+ +I +Q K ++ GA LFG + F + R+T Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNF---MTRMTA 57 Query: 60 ILGIAFFVIALVLAYV 75 +L FF+I+LVL + Sbjct: 58 LLATLFFIISLVLGNI 73
>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature. Length = 547 Score = 30.0 bits (67), Expect = 0.010 Identities = 18/57 (31%), Positives = 28/57 (49%), Gaps = 3/57 (5%) Query: 74 TQNALAF--LREKGYQEIAVFGLSLGGIFATKALEEEGLLAAGTLCSPLFLNENNHV 128 T N + F L G +E F ++L + A A E +L++ TL P+ EN H+ Sbjct: 491 TGNTVVFDSLPRLGSKETVEFSVTLKAVSAGDA-RGEAILSSDTLTVPVSDTENTHI 546
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 41.7 bits (98), Expect = 2e-06 Identities = 23/124 (18%), Positives = 47/124 (37%), Gaps = 13/124 (10%) Query: 2 HIAICEDETVQQEELYNLLQAYQSFFPTPLAIEVFSNAEDLIEVCHYSGNRFDLIFLDIA 61 I + +D+ + L L + + SNA L + DL+ D+ Sbjct: 5 TILVADDDAAIRTVLNQALS------RAGYDVRITSNAATLWR--WIAAGDGDLVVTDVV 56 Query: 62 LPKLNGIEAAKIIRQMDAEVELVFLT--SMLDYSLEGYHVKALRYLLKPIKAHQLDELLK 119 +P N + I++ ++ ++ ++ + +++ A YL KP L EL+ Sbjct: 57 MPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF---DLTELIG 113 Query: 120 TIKN 123 I Sbjct: 114 IIGR 117
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 28.0 bits (62), Expect = 0.023 Identities = 13/42 (30%), Positives = 16/42 (38%) Query: 106 SKPAESSSAVSSSSAPVEETPAPVEPTPPVEETPPPVEEPPV 147 PA+ S + A +E A P PV E P E P Sbjct: 39 PAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPE 80
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 30.2 bits (68), Expect = 0.017 Identities = 20/76 (26%), Positives = 33/76 (43%), Gaps = 10/76 (13%) Query: 48 WLGKEFNIIDT-GGIDIGDEPFLEQIKQQAEIAMDEADVIIFITSGRESVTDADENVAKM 106 W + NIIDT G +D FL ++ ++ D I + S ++ V + Sbjct: 65 WENTKVNIIDTPGHMD-----FLAEV----YRSLSVLDGAILLISAKDGVQAQTRILFHA 115 Query: 107 LYRTKKPVLLAVNKID 122 L + P + +NKID Sbjct: 116 LRKMGIPTIFFINKID 131
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 133 bits (337), Expect = 9e-45 Identities = 71/91 (78%), Positives = 78/91 (85%) Query: 1 MANKAELIESVATSTGLTKKDATAAVDAVFETIQTTLSSGEKVQLIGFGNFEVRERAARK 60 MANK +LI VA +T LTKKD+ AAVDAVF + + L+ GEKVQLIGFGNFEVRERAARK Sbjct: 1 MANKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARK 60 Query: 61 GRNPQTGEEIQIAASKVPAFKPGKALKDAVK 91 GRNPQTGEEI+I ASKVPAFK GKALKDAVK Sbjct: 61 GRNPQTGEEIKIKASKVPAFKAGKALKDAVK 91
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 38.8 bits (90), Expect = 1e-05 Identities = 32/132 (24%), Positives = 51/132 (38%), Gaps = 7/132 (5%) Query: 202 ETTDLLFELGFTYLQNKEYRRASETLFKLKELDPSYTSLYPYLAKSLEEENQLDRATEVI 261 +T + L+ L F Q+ +Y A + L LD + + L + Q D A Sbjct: 34 DTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSY 93 Query: 262 REGLRADQYNPELFYYAADLFLKLGDEEQGEYYYQESLELNPDN-------ETVQLALIN 314 G D P ++AA+ L+ G+ + E + EL D V L Sbjct: 94 SYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEA 153 Query: 315 LYLKQERFNEAV 326 + LK+E +E V Sbjct: 154 IKLKKEMEHECV 165
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.0 bits (65), Expect = 0.008 Identities = 9/23 (39%), Positives = 16/23 (69%) Query: 25 NYTQAAQLLGITQPALTQQIKKL 47 N +AA LLG+ + L ++I++L Sbjct: 451 NQIKAADLLGLNRNTLRKKIREL 473
>PF06580#Sensor histidine kinase Length = 349 Score = 212 bits (541), Expect = 1e-65 Identities = 66/216 (30%), Positives = 116/216 (53%), Gaps = 13/216 (6%) Query: 362 QLELGEAEIQSKLLKDAEIKSLQAQVNPHFFFNAMNTISALMRQNAEQARTLLLQLSTYF 421 Q E+ + ++ + ++A++ +L+AQ+NPHF FNA+N I AL+ ++ +AR +L LS Sbjct: 146 QAEIDQWKMA-SMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELM 204 Query: 422 RANLQGARQVLIPLTAELKHVEAYLSLEQARFPQRFQVTFNIYPNLETLLLPPFLLQVLV 481 R +L+ + + L EL V++YL L +F R Q I P + + +PP L+Q LV Sbjct: 205 RYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLV 264 Query: 482 ENAIRHAFGNRKTDNQIVVQLEQKENFVLVQVSDNGIGIPLDRVEKVGKEVIESEKGTGT 541 EN I+H +I+++ + V ++V + G + +++ TGT Sbjct: 265 ENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG-----------SLALKNTKESTGT 313 Query: 542 ALENLNKRLVGLFGLEASLQFSQNKTGGTTVLLKIP 577 L+N+ +RL L+G EA ++ S K G ++ IP Sbjct: 314 GLQNVRERLQMLYGTEAQIKLS-EKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 75.3 bits (185), Expect = 9e-18 Identities = 33/162 (20%), Positives = 61/162 (37%), Gaps = 13/162 (8%) Query: 2 HILIVDDEPLARDELAYLVETHPNVISVDKAESIEEALEKMVNQKPDLVFLDIHLTDESG 61 IL+ DD+ R L + V + + DLV D+ + DE+ Sbjct: 5 TILVADDDAAIRTVLNQALS--RAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 62 FDLADKFKKMNHPPKIVFATAYD--EYALKAFEVDAIDYILKPFEEERVRQAVEKSHSAI 119 FDL + KK ++ +A + A+KA E A DY+ KPF+ + + + A+ Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---AL 119 Query: 120 SHLSTGEETTNLAREINGKI-----AIQAD-ERIFVIALSDI 155 + + + A+Q + + +D+ Sbjct: 120 AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDL 161
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 175 bits (446), Expect = 2e-49 Identities = 99/436 (22%), Positives = 173/436 (39%), Gaps = 83/436 (19%) Query: 12 RIRNFSIIAHIDHGKSTLADRILEK---TDTVANRDMQAQLLDSMDLERERGITIKLNAV 68 +I N ++AH+D GK+TL + +L + + D D+ LER+RGITI+ Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 69 ELTYTAKDGIDYTFHLIDTPGHVDFTYEVSRSLAACEGAILVVDAAQGIEAQTLANVYLA 128 + + ++IDTPGH+DF EV RSL+ +GAIL++ A G++AQT + Sbjct: 62 SFQWE-----NTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116 Query: 129 LDNDLEIIPVINKIDLPAADPERVRGEIEDVIG--------------------------- 161 + I INKID D V +I++ + Sbjct: 117 RKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176 Query: 162 --IDASDAVLA--------------------------------SAKAGIGIEDILEQIVE 187 I+ +D +L SAK IGI++++E I Sbjct: 177 TVIEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITN 236 Query: 188 KVPAPTGDLDAPLQALIFDSVYDSYRGVILNVRIMSGVVKSGDKIQMMSNGATFEVADVG 247 K + T + L +F Y R + +R+ SGV+ D +++ Sbjct: 237 KFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKIKITEMYT 296 Query: 248 IFSPKPIKRDFLMVGDVGYITASIKTVQDTRVGDTITLANNPATEALPGYRKMNPMVYCG 307 + + K D G++ + + +GDT L E P++ Sbjct: 297 SINGELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQRERIENPL------PLLQTT 349 Query: 308 LYPIDSSRYNELREALERLQLNDAALQFE--AETSQALGFGFRCGFLGLLHMDVIQERLE 365 + P + L +AL + +D L++ + T + + FLG + M+V L+ Sbjct: 350 VEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEII-----LSFLGKVQMEVTCALLQ 404 Query: 366 REFDLDLITTAPSVIY 381 ++ +++ P+VIY Sbjct: 405 EKYHVEIEIKEPTVIY 420 Score = 39.5 bits (92), Expect = 4e-05 Identities = 18/80 (22%), Positives = 30/80 (37%), Gaps = 2/80 (2%) Query: 410 EPYVKASIMVPNDYVGAVMEIAQRKRGEFITMDYLDEFRVNVIYEIPLSEIVYDFFDKLK 469 EPY+ I P +Y+ A + + + V + EIP I ++ L Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNN-EVILSGEIPARCI-QEYRSDLT 594 Query: 470 SSTKGYASLDYDLIGYRPSK 489 T G + +L GY + Sbjct: 595 FFTNGRSVCLTELKGYHVTT 614
>cloacin#Cloacin signature. Length = 551 Score = 30.5 bits (68), Expect = 0.014 Identities = 13/32 (40%), Positives = 14/32 (43%) Query: 69 GHASTDPNFGGGGFGGGGFGGFGGSSAGFGGG 100 G S GG G G GG G G +G GG Sbjct: 49 GSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 146 bits (369), Expect = 5e-41 Identities = 81/362 (22%), Positives = 141/362 (38%), Gaps = 55/362 (15%) Query: 2 SKIIGIDLGTTNSAVAVLEGGEAKIIANPEGNRTTPSVV-------SFKNGEIQVGEVAK 54 S + IDLGT N+ + V G I+ N PSVV VG AK Sbjct: 10 SNDLSIDLGTANTLIYVKGQG---IVLN------EPSVVAIRQDRAGSPKSVAAVGHDAK 60 Query: 55 RQAVTNPNTISSVKRHIGEAGFSIEMEGKKYTAQEISAMILQY-LKGFAEEYLGEKVEKA 113 + P I++++ M+ ++ +LQ+ +K + Sbjct: 61 QMLGRTPGNIAAIR----------PMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRV 110 Query: 114 VITVPAYFNDAQRQATKDAGKIAGLEVERIVNEPTAAALAYGLDKTDKDEKVLVFDLGGG 173 ++ VP +R+A +++ + AG ++ EP AAA+ GL + +V D+GGG Sbjct: 111 LVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGL-PVSEATGSMVVDIGGG 169 Query: 174 TFDVSILELGDGVFDVLATAGDNKLGGDDFDNKIIDYMVAEFKKENAIDLSKDKMAVQRL 233 T +V+++ L V + ++GGD FD II+Y+ + Sbjct: 170 TTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYGSLIG------------- 211 Query: 234 KDAAEKAKKDLS----GVTSTQISLPFITAGEAGPLHLEMNLTRAKFDELTHDLVDRTKV 289 + AE+ K ++ G +I + E P +N + + L + + Sbjct: 212 EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLN-SNEILEAL-QEPLTGIVS 269 Query: 290 PVRQALKD-AGLTASDIDE--VILVGGSTRIPAVVEAVKKETNQEPNKSVNPDEVVAMGA 346 V AL+ ASDI E ++L GG + + + +ET + +P VA G Sbjct: 270 AVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVARGG 329 Query: 347 AI 348 Sbjct: 330 GK 331
>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature. Length = 1147 Score = 29.3 bits (65), Expect = 0.026 Identities = 15/45 (33%), Positives = 23/45 (51%) Query: 232 FVRGRMNILDFNEAMDVEKFKSIYNLMDGNTSLTSLINQSHEGIE 276 F G M +LD D++ L+ N +L+S++ SH GIE Sbjct: 267 FTLGDMEMLDVEGVADIDPNYKFNQLLIHNNALSSVLMGSHNGIE 311
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 134 bits (337), Expect = 2e-40 Identities = 75/251 (29%), Positives = 117/251 (46%), Gaps = 14/251 (5%) Query: 4 LKNKVALVTGGTSGIGEKITDCFIAEGATVVVCDINQEALNKAKGKENVVTK-----KLD 58 ++ K+A +TG GIGE + ++GA + D N E L K + D Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 59 ISSEAEWTQVVQEVIEQFGKIDILANNAGISSDKGLETTTVEEWELQHKINALGPFLGMK 118 + A ++ + + G IDIL N AG+ + + + EEWE +N+ G F + Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 119 AVVPYMKKAGQGSIVNTASYTALVG-AGINGYTGSKGSIRAVSKAAAADLGYFNIRVNSV 177 +V YM GSIV S A V + Y SK + +K +L +NIR N V Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 178 YPGVIETPMSAAVSEYKDAMAQLIQAT--------PLGRIGKPEEVANAILFLASDEASF 229 PG ET M ++ ++ Q+I+ + PL ++ KP ++A+A+LFL S +A Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245 Query: 230 INGAELVIDGG 240 I L +DGG Sbjct: 246 ITMHNLCVDGG 256
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 79.9 bits (197), Expect = 5e-18 Identities = 30/139 (21%), Positives = 57/139 (41%), Gaps = 3/139 (2%) Query: 3 RMLLVDDEYMILAGLQKLIPWQELGIEIVGTAKNGQEALDFVRNNVVDIVISDVTMPLLS 62 +L+ DD+ I L + + G ++ N ++ D+V++DV MP + Sbjct: 5 TILVADDDAAIRTVLNQAL--SRAGYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 63 GIEFIRQAQSEDIYFHFLILSGYQEFDYVKEGLRMGADNYLIKPVDKVELIETLEKIIKE 122 + + + + L++S F + GA +YL KP D ELI + + + E Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 123 LNSEAEQLQTQSVLFDYLL 141 +L+ S L+ Sbjct: 122 PKRRPSKLEDDSQDGMPLV 140
>PF06580#Sensor histidine kinase Length = 349 Score = 188 bits (480), Expect = 5e-57 Identities = 51/209 (24%), Positives = 99/209 (47%), Gaps = 16/209 (7%) Query: 354 DIYTLEIKQKDAHMRALQSQINPHFLYNTLEYIRMYAVSEGAEELADVVYTFATLLRNN- 412 D + + ++A + AL++QINPHF++N L IR + E + +++ + + L+R + Sbjct: 150 DQWKMASMAQEAQLMALKAQINPHFMFNALNNIRA-LILEDPTKAREMLTSLSELMRYSL 208 Query: 413 -TSQEKTTTLKKELEFCEKYVYLYQMRYPGNIAYSFAIDSAIENLVIPKFSIQPLIENYF 471 S + +L EL + Y+ L +++ + + I+ AI ++ +P +Q L+EN Sbjct: 209 RYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGI 268 Query: 472 VHGIDYMRIDNVISVKANIEEDKITILIRDNGKGMSSEKIKDLNQSLMESHSKFGGSIGI 531 HGI + I +K + +T+ + + G SL ++K G+ Sbjct: 269 KHGIAQLPQGGKILLKGTKDNGTVTLEVENTG-------------SLALKNTKESTGTGL 315 Query: 532 LNVNERLRSYFGESYQMCIQETQAHGVTI 560 NV ERL+ +G Q+ + E Q + Sbjct: 316 QNVRERLQMLYGTEAQIKLSEKQGKVNAM 344
>PF06580#Sensor histidine kinase Length = 349 Score = 31.8 bits (72), Expect = 0.006 Identities = 18/105 (17%), Positives = 37/105 (35%), Gaps = 22/105 (20%) Query: 359 IILNLLSNALKFTESGGKIQVKVKIKDQFAVLMVEDSGCGIDSTDLEHIFDRFYMADDSR 418 ++ N + + + GGKI +K + L VE++G + Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG------------------SLAL 304 Query: 419 KQNNEGQGIGLAIVKSIVKA---HDGTVSVSSSVNLGTRFIIQLP 460 K E G GL V+ ++ + + +S ++ +P Sbjct: 305 KNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 93.4 bits (232), Expect = 3e-24 Identities = 33/163 (20%), Positives = 68/163 (41%), Gaps = 10/163 (6%) Query: 2 KILIVDDEPKILDVIEAYLVVNGHLVYRAETGSQALEKYRVVGPDLIILDWMLPDSSGME 61 IL+ DD+ I V+ L G+ V + DL++ D ++PD + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VCQEIR-LESSVPIIMLTAKAGEKNIISGLKMGADDYVVKPFSPKELVMRVETVLRRTGY 120 + I+ +P+++++A+ I + GA DY+ KPF EL+ + L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 121 QAKTQKIKFNDGKLVIDV---------LGKQVFQSNQLVCLTG 154 + + DG ++ + ++ Q++ + +TG Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITG 167
>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature. Length = 136 Score = 27.9 bits (62), Expect = 0.001 Identities = 9/32 (28%), Positives = 19/32 (59%) Query: 28 GMGMFSSSIIDFLLLILLIFVVVKVVQGIRSK 59 G+F ++ DFL++ IF+ +K++ + K Sbjct: 74 HYGVFIQNVFDFLIVAFAIFMAIKLINKLNRK 105
>PF06580#Sensor histidine kinase Length = 349 Score = 34.1 bits (78), Expect = 0.001 Identities = 28/170 (16%), Positives = 63/170 (37%), Gaps = 38/170 (22%) Query: 427 LSKLEQKQVPLEQELIEVQ-----EAVRSSFRL-VKHKADEKNMNLLLNDADPIYLLGDS 480 L +QV L EL V +++ RL +++ + M++ + P L+ Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQV----PPMLV--- 260 Query: 481 GRLKQIITNLLTNAVSYTEAGGKVEVFVEQSETEATIKISDNGMGIPEAELDRIFERFYR 540 + ++ N + + ++ GGK+ + + T+++ + G + Sbjct: 261 ---QTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK------------ 305 Query: 541 VDKARSRNSGGTGLGLSIVKYLVENFNG---TIQVESKLGLGTTFTIILP 587 TG GL V+ ++ G I++ K G +++P Sbjct: 306 ------NTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 92.6 bits (230), Expect = 7e-24 Identities = 34/151 (22%), Positives = 78/151 (51%), Gaps = 4/151 (2%) Query: 3 KVLIVDDEESILTLLAFNLEKAGYEVQTAMDGLIGYQLALENQYDFIILDLMMPSMDGME 62 +L+ DD+ +I T+L L +AGY+V+ + ++ D ++ D++MP + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 63 VCKKLRQEKIETPIMILTAKDDELEKIIGLELGADDYMTKPFSPREVLARMKAIMRRIKP 122 + ++++ + + P+++++A++ + I E GA DY+ KPF E++ + R Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI----IGRALA 120 Query: 123 ESKEKINESHEASPEEEVVIGELQIFPELYE 153 E K + ++ + S + ++G E+Y Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYR 151
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 38.1 bits (88), Expect = 6e-05 Identities = 40/193 (20%), Positives = 64/193 (33%), Gaps = 16/193 (8%) Query: 81 TLKTLTTEVTALNEKIAQREDKLKEQARTIQVNGDTQNYIDFVLSAKSFGDVVGRVDVVS 140 L+ + N ++ +R + I + Q V S S + + RVD Sbjct: 970 KLRNVNGRYDLYNPEVEKRNQTVD--TTNITTPNNIQAD---VPSVPSNNEEIARVD--E 1022 Query: 141 QMVSANQDLVKEQKSDKEEVASKQKETETKSQEQALLAAKLEATKADLEQQKLEKEAIVA 200 V + ++ SKQ+ + EQ + + ++ K +A Sbjct: 1023 APVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVA-KEAKSNVKANTQ 1081 Query: 201 TLASEQSGAESEKASFLAKKE--DAEKAAKAIATANAAPVVAVQTSTTAP------AATP 252 T QSG+E+++ KE EK KA V TS +P P Sbjct: 1082 TNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQP 1141 Query: 253 VAENNAPAAPPVN 265 AE P VN Sbjct: 1142 QAEPARENDPTVN 1154 Score = 34.7 bits (79), Expect = 7e-04 Identities = 41/228 (17%), Positives = 74/228 (32%), Gaps = 25/228 (10%) Query: 46 KKDAAQTEIGTITDTIAKNEENSVKLVAEMKETQATLKTLTTEVTALNEKIAQREDKLKE 105 A + + + IA+ +E V A ++ TTE A N K + + E Sbjct: 1002 NIQADVPSVPSNNEEIARVDEAPVPPPAPATPSE------TTETVAENSKQESKTVEKNE 1055 Query: 106 Q------ARTIQVNGDTQNYIDFVLSAKSFGDVVGRVDVVSQMVSANQDLVKEQKSDKEE 159 Q A+ +V + ++ + + V++++ K E Sbjct: 1056 QDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVE 1115 Query: 160 VASKQKETETKSQEQALLAAKLEATKADLEQQKLEKEAIVATLASEQSGAESEKASFLAK 219 Q+ + SQ ++ K E ++ Q + +E E + A Sbjct: 1116 TEKTQEVPKVTSQ----VSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTA----- 1166 Query: 220 KEDAEKAAKAIATANAAPVVAVQTSTT--APAATPVAENNAPAAPPVN 265 D E+ AK ++ PV T T + P A P VN Sbjct: 1167 --DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVN 1212
>SECA#SecA protein signature. Length = 901 Score = 1116 bits (2887), Expect = 0.0 Identities = 424/904 (46%), Positives = 583/904 (64%), Gaps = 73/904 (8%) Query: 1 MANFLKQLIEN-DKKEIKSLEKVADKIDAYGDRMAALSDEELQAKTPEFKKRYQAGETLD 59 + L ++ + + + ++ + KV + I+A M LSDEEL+ KT EF+ R + GE L+ Sbjct: 2 LIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLE 61 Query: 60 DLLPEAFAVVREAAKRVLGLYPYRVQLMGGMTLHKGNIPEMKTGEGKTLTATMPVYLNAL 119 +L+PEAFAVVREA+KRV G+ + VQL+GGM L++ I EM+TGEGKTLTAT+P YLNAL Sbjct: 62 NLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNAL 121 Query: 120 AGEGVHVVTVNEYLASRDATEMGELYTFLGLTVGLNLNSKSSEEKREAYNADITYSTNNE 179 G+GVHVVTVN+YLA RDA L+ FLGLTVG+NL + KREAY ADITY TNNE Sbjct: 122 TGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNNE 181 Query: 180 LGFDYLRDNMVVYREQMVQRPLNYAIVDEVDSILIDEARTPLIISGQAEKSTALYTRADF 239 GFDYLRDNM E+ VQR L+YA+VDEVDSILIDEARTPLIISG AE S+ +Y R + Sbjct: 182 YGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVNK 241 Query: 240 FVKSLT-----------AEADYTIDVQSKTIALTEEGMVKAEKTF-------KVENLYDI 281 + L E +++D +S+ + LTE G+V E+ + E+LY Sbjct: 242 IIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYSP 301 Query: 282 DNTALIHHIDQALRANYIMLRDIDYVVQEGEVLIVDQFTGRIMDGRRYSDGLHQAIEAKE 341 N L+HH+ ALRA+ + RD+DY+V++GEV+IVD+ TGR M GRR+SDGLHQA+EAKE Sbjct: 302 ANIMLMHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAKE 361 Query: 342 GVEIENESKTMANVTFQNFFRMYKKLSGMTGTAKTEQEEFREIYNIQVVEIPTNKPIIRD 401 GV+I+NE++T+A++TFQN+FR+Y+KL+GMTGTA TE EF IY + V +PTN+P+IR Sbjct: 362 GVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIRK 421 Query: 402 DRPDLLYPTLESKFNAVVEDIKTRHANGQPILVGTVAVETSELLSDLLTKAKIHHEVLNA 461 D PDL+Y T K A++EDIK R A GQP+LVGT+++E SEL+S+ LTKA I H VLNA Sbjct: 422 DLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLNA 481 Query: 462 KNHFKEAEIIMSAGQKGAVTIATNMAGRGTDIKLGA------------------------ 497 K H EA I+ AG AVTIATNMAGRGTDI LG Sbjct: 482 KFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKADW 541 Query: 498 -----GVIEAGGLCVIGTERHESRRIDNQLRGRAGRQGDPGVTQFYLSLEDELMKRFGSE 552 V+EAGGL +IGTERHESRRIDNQLRGR+GRQGD G ++FYLS+ED LM+ F S+ Sbjct: 542 QVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFASD 601 Query: 553 RIQAILERLRVQEEDAVIQSKMISRQVESAQKRVEGNNYDTRKNVLEYDDVMREQREIMY 612 R+ ++ +L ++ +A I+ +++ + +AQ++VE N+D RK +LEYDDV +QR +Y Sbjct: 602 RVSGMMRKLGMKPGEA-IEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIY 660 Query: 613 GQRLEVIMATESLKKITMAMIQRTVNRMVS--VNTQGNKEEWNLQGIHDFATSAIVHEDS 670 QR E ++ + + ++ + + + Q +E W++ G+ + + + Sbjct: 661 SQRNE-LLDVSDVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLD-- 717 Query: 671 LTVQDLENKTPEEIEALLMKRV----EDIYTTKEQQF-NEQMLEFEKVVILRVVDSKWTD 725 L + + +K PE E L +R+ ++Y KE+ E M FEK V+L+ +DS W + Sbjct: 718 LPIAEWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKE 777 Query: 726 HIDTMDQLRQGIGLRAYAQTNPLVEYQAEGFKLFEEMIAAIEYDVTRLLMKSEIR----- 780 H+ MD LRQGI LR YAQ +P EY+ E F +F M+ +++Y+V L K ++R Sbjct: 778 HLAAMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEEV 837 Query: 781 ------QNLQREQVAQGSPARSTGDGDVVEAAKHKPVKNDD-KIGRNDPCPCGSGKKYKN 833 + ++ E++AQ D AA + + K+GRNDPCPCGSGKKYK Sbjct: 838 EELEQQRRMEAERLAQMQQLSHQDDDS--AAAAALAAQTGERKVGRNDPCPCGSGKKYKQ 895 Query: 834 CHGK 837 CHG+ Sbjct: 896 CHGR 899
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 63.4 bits (154), Expect = 1e-13 Identities = 40/193 (20%), Positives = 74/193 (38%), Gaps = 9/193 (4%) Query: 39 TQEKKEQRVIATTVASAEIMAKLDYPLVGIPTTSK-----ELPKQYKDVVEVGSPMGPDL 93 R++A E++ L G+ T P V++VG P+L Sbjct: 30 AAAIDPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVIDVGLRTEPNL 89 Query: 94 EIMRTLKPDLVLSTSTLQTDLEDGLTAAKLKT-NFLDFRS-IASMEKEIKTLGEQLNRQS 151 E++ +KP ++ ++ E A + NF D + +A K + + + LN QS Sbjct: 90 ELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNLQS 149 Query: 152 EAEQLTQSIDQKVGKAQTK-VKQDKKPKVLILMGIPGSYLVVTEKAYIGDLVRLAGGENV 210 AE + + + + VK+ +P +L + P LV + +++ G N Sbjct: 150 AAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGIPNA 209 Query: 211 ITGQEQEYLASNT 223 G E + S Sbjct: 210 WQG-ETNFWGSTA 221
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 33.1 bits (75), Expect = 0.004 Identities = 17/75 (22%), Positives = 30/75 (40%) Query: 424 LTTDLNAHVAYQVDMGGGKLYNGQANFRLLFDPTQAVAIPSNEFPSAPEPEKPVTPVDPE 483 + T L + + + GK+ G +RL + ++ + P AP+P P P+ Sbjct: 528 VQTPLGSAATFTLANKDGKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQ 587 Query: 484 KPTDPEKPINPVKPA 498 P + P PA Sbjct: 588 PPQPQPEAPAPQPPA 602
>AEROLYSIN#Aerolysin signature. Length = 493 Score = 27.7 bits (61), Expect = 0.035 Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 2/33 (6%) Query: 116 SAPIAAKIKVDIE--NSDLSYHHEYKINLSFDL 146 + P +KI V IE +D+SY +E+K ++S+DL Sbjct: 307 TVPARSKIPVKIELYKADISYPYEFKADVSYDL 339
>INTIMIN#Intimin signature. Length = 939 Score = 31.6 bits (71), Expect = 0.024 Identities = 58/313 (18%), Positives = 95/313 (30%), Gaps = 41/313 (13%) Query: 603 STGVKVTTIVKANNQTSEASTIVKNGEIAKTTISKIDNATTF----------VSGTGEPN 652 S V +T V +N Q + + A T +K D V+ P Sbjct: 539 SNNVLLTITVLSNGQVVDQVGVTDFT--ADKTSAKADGTEAITYTATVKKNGVAQANVPV 596 Query: 653 GAITLSANGTI-LASGKVDSAGKYSFTIAKQAVGVTVLAKVTLNGKESEVSTIVTKASEK 711 +S + S + +GK + T+ G V++ T S + A Sbjct: 597 SFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEM----TSALNANA--- 649 Query: 712 VVAPVIHDYYITDINAKGTIGGSAKQVAI-YVNGVKKRTAAVTNGSFTIYT--GDLGLTV 768 V+ IT+I A T + Q AI Y V K V+N T T G L + Sbjct: 650 VIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNST 709 Query: 769 A---GQSFQIAGLFDGVEGPKTT--KIVEARNQLIAPTINDYY-----TTDANVSGTITG 818 + L G ++ + + AP + + + + GT Sbjct: 710 EKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVK 769 Query: 819 SAKQVAIFIDGVQKRTAAVNNGKYVIYTGDLGLTTLGKKFQVAGIDGIMVGPKTEATVK- 877 G A+ NGKY + + + ++ + K T+ Sbjct: 770 GKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKE-----KGTTTISV 824 Query: 878 --SKQQLVAPTIN 888 S Q TI Sbjct: 825 ISSDNQTATYTIA 837 Score = 31.6 bits (71), Expect = 0.025 Identities = 50/345 (14%), Positives = 103/345 (29%), Gaps = 52/345 (15%) Query: 506 VFTVEDGKNIDTSKPGNYTIQYTVKNSNGNEAQAVTELIVEEKKIVQTTISDLDTTSTTV 565 + + +G+ +D ++T T ++G EA T + + +++ + V Sbjct: 546 ITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGV----AQANVPVSFNIV 601 Query: 566 SGLGEPNGLIEVKANQQVIATGTVGSDGKYTIQMPKQSTGVKVTTIVKANNQTS---EAS 622 SG + + GK T+ + G V + A ++ A Sbjct: 602 SGTAVLS-----------ANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAV 650 Query: 623 TIVKNGEIAKTTISKIDNATTFVSGTGEPNGAITLSANGTILASGKV------------- 669 V + + T I K D T +G + + +++ +V Sbjct: 651 IFVDQTKASITEI-KADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNST 709 Query: 670 ---DSAGKYSFTIAKQAVGVTVLAKVTLNGKESEVSTIVTKASEKVVAPVIHDYYITDIN 726 D+ G T+ G K ++ + S+V+ V + + D +I Sbjct: 710 EKTDTNGYAKVTLTSTTPG-----KSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIV 764 Query: 727 AKGTIGGSAKQVAIYVNGVKKRTAAVTNGSFTIYTGDLG----------LTVAGQSFQIA 776 G G Y G A+ NG +T + + +T+ + Sbjct: 765 GTGVKGKLPTVWLQY--GQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTI 822 Query: 777 GLFDGVEGPKTTKIVEARNQLIAPTINDYYTTDANVSGTITGSAK 821 + T I + ++ DA + G Sbjct: 823 SVISSDNQTATYTIATPNSLIVPNMSKRVTYNDAVNTCKNFGGKL 867
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 208 bits (530), Expect = 2e-67 Identities = 86/345 (24%), Positives = 143/345 (41%), Gaps = 52/345 (15%) Query: 3 TILITGGAGFIGSHLV------NHYGTYAKVVVVDNLSMGH-------RENILPSENVVF 49 L+TG AGFIG H+ H +VV +DNL+ + R +L F Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGH-----QVVGIDNLNDYYDVSLKQARLELLAQPGFQF 56 Query: 50 IKEDIGNKKLLNQLFKEFTFDYVFHLAAVANVAESIEFPWSTHLINQDATLLLLEQVKKQ 109 K D+ +++ + LF F+ VF V S+E P + N L +LE + Sbjct: 57 HKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN 116 Query: 110 KQLLKRFVFASSASVYGDSPHAVQTEGDTV-QPLSPYALDKYASEQFTLMYHRLYGVKTT 168 K ++ ++ASS+SVYG + + D+V P+S YA K A+E Y LYG+ T Sbjct: 117 K--IQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPAT 174 Query: 169 AVRFFNVYGENQNPNSPYSGVLSLLNQGLKTNENSEYFEFNKYGDGEQTRDFVYVQDVIQ 228 +RFF VYG P+ + +G + Y G+ RDF Y+ D+ + Sbjct: 175 GLRFFTVYGPWGRPDMALFKFTKAMLEGK---------SIDVYNYGKMKRDFTYIDDIAE 225 Query: 229 ALLLVSE-----------------KEKAIGEVYNIGTGAKTSLNQLLVLSQSLSQKKLCI 271 A++ + + A VYNIG + L + + + Sbjct: 226 AIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKK 285 Query: 272 QTKLARKGDIMNSLANITKIKE-LGYQPLYSIDKGIVCYWKRTIE 315 + GD++ + A+ + E +G+ P ++ G+ K + Sbjct: 286 NMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGV----KNFVN 326
>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature. Length = 136 Score = 122 bits (308), Expect = 4e-39 Identities = 60/136 (44%), Positives = 87/136 (63%), Gaps = 11/136 (8%) Query: 4 NMLAEFKEFALRGSVLDLAVGVVIGGAFSAIVTSLVTNIITPIIVALTGGSNISDLSIKI 63 +++ EF+EFA+RG+V+DLAVGV+IG AF IV+SLV +II P + L GG + ++ + Sbjct: 2 SIIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVTL 61 Query: 64 LNAK-------LMYGAFLQSIIDFLIIAFSIFMFIKVINTFVAKMKKPVEEVEEEVEINA 116 +A+ + YG F+Q++ DFLI+AF+IFM IK+IN K+ + EE Sbjct: 62 RDAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLIN----KLNRKKEEPAAAPAPTK 117 Query: 117 TEEYLKEIRDLLAQQN 132 E L EIRDLL +QN Sbjct: 118 EEVLLTEIRDLLKEQN 133
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 34.3 bits (78), Expect = 0.002 Identities = 39/226 (17%), Positives = 77/226 (34%), Gaps = 8/226 (3%) Query: 203 DLEKNLPQIDTYTQEILDLQTKMPDIKAKLTKANEFVEYLPQ--VNQMTAKISEVNSLMP 260 DLEK L ++ + KA L +E + +N TA +++ +L Sbjct: 124 DLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEA 183 Query: 261 QLDQTGSLILDLQKNIPQIQNAGRQIAQIDQDFDGIAATLNQGIDEANQALTVIQEVQTI 320 + + +L+K + N + + + A L + +AL T Sbjct: 184 EKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTA 243 Query: 321 MPDVIELGQDASQTVESTKEVIAKIQSALEPVKQAIDTGLTILKSVASSIGKLADNLSST 380 I+ + + A+++ ALE +K++ + L + Sbjct: 244 DSAKIK---TLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADL 300 Query: 381 VVTEDNKAAIIASLTSLSTHLDTANTMLSGLIDNLEKLQEASGSNE 426 E + A+ SL LD + L +KL+E + +E Sbjct: 301 ---EHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISE 343
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 29.5 bits (66), Expect = 0.019 Identities = 13/76 (17%), Positives = 25/76 (32%) Query: 141 HIRNGESLIDTQLASGFESSNGFRDAFSKTMGDVPQRSKKIQVLSSAWIETKLGSMLAIS 200 + NG SL+ A + D T+ V + I++ LG +L Sbjct: 227 TMANGYSLVQGSTARQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFR 286 Query: 201 DEHHLFLLEFVDRVGL 216 + + ++ L Sbjct: 287 SQDLDQTRNTLGQLAL 302
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 131 bits (330), Expect = 4e-35 Identities = 61/239 (25%), Positives = 96/239 (40%), Gaps = 56/239 (23%) Query: 196 QLANINQVWEEYQLKGEGMVVSIIDTGIDPSHKDLRLSDTTKEKISLEEIQINIAEMGHG 255 ++ VW + +G G+ V+++DTG D H D Sbjct: 27 EMIQAPAVWNQT--RGRGVKVAVLDTGCDADHPD-------------------------- 58 Query: 256 KAFTRKIPYGYNYADNNTTIIDENPTTNMHGMHVAGIAAANGIGADSTTAVLGVAPEAQL 315 +I G N+ D++ + N HG HVAG AA V+GVAPEA L Sbjct: 59 --LKARIIGGRNFTDDDEGDPEIFKDYNGHGTHVAGTIAATENE----NGVVGVAPEADL 112 Query: 316 LAMKVFS-NSSGAVALNDDVIAAIEDSVKLGADILNMSLGSVAGNRDSNDPVQISIREAA 374 L +KV + SG D +I I +++ DI++MSLG + + ++++A Sbjct: 113 LIIKVLNKQGSGQY---DWIIQGIYYAIEQKVDIISMSLGGP----EDVPELHEAVKKAV 165 Query: 375 EVGVLSVIAAGNSGLSTSNDTNVAPQNKFGTIDTGTLGSPGVTDEGLTVASLESSVQIS 433 +L + AAGN G T LG PG +E ++V ++ S Sbjct: 166 ASQILVMCAAGNEGDGDDR--------------TDELGYPGCYNEVISVGAINFDRHAS 210 Score = 75.7 bits (186), Expect = 2e-16 Identities = 43/133 (32%), Positives = 58/133 (43%), Gaps = 20/133 (15%) Query: 587 GKISDFSSWGPTPSLEFKPEITAPGGQIYSTANQNSYQTNSGTSMAAPFVAGTEALIYQA 646 S+FS+ + ++ APG I ST Y T SGTSMA P VAG ALI Q Sbjct: 207 RHASEFSNSNN------EVDLVAPGEDILSTVPGGKYATFSGTSMATPHVAGALALIKQL 260 Query: 647 LKAEQSP-LTGLNLIEFAKASLLNTAIPVMDQKHSDVIISPRRQGAGLLQADQAIK-NKV 704 A LT L A L+ IP+ + SP+ +G GLL + +++ Sbjct: 261 ANASFERDLTEPEL----YAQLIKRTIPLGN--------SPKMEGNGLLYLTAVEELSRI 308 Query: 705 YLTDAKTGKASIA 717 + T G S A Sbjct: 309 FDTQRVAGILSTA 321
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 59.9 bits (145), Expect = 6e-12 Identities = 35/159 (22%), Positives = 71/159 (44%), Gaps = 1/159 (0%) Query: 21 MDIMFLTFALTSIIADLNVSGAAAGLISSITNVGMLLGGVTFGILADRFGRIKIFTYTIL 80 ++ M L +L I D N A+ +++ + +G +G L+D+ G ++ + I+ Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87 Query: 81 IFAFATGAMYFASNIYLVYLF-RFLSGIGAGGEYGIGMAIVAEAFPKEKLGKMTSIVAIT 139 I F + + + + + + RF+ G GA + M +VA PKE GK ++ Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSI 147 Query: 140 GQVGSIIAAIIAAIIIPRFGWNALFLFGLLPVVLTFFIR 178 +G + I +I W+ L L ++ ++ F+ Sbjct: 148 VAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLM 186
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 141 bits (357), Expect = 1e-37 Identities = 61/240 (25%), Positives = 117/240 (48%), Gaps = 22/240 (9%) Query: 74 DNPSLQMMIEQA-RAAILYPPKGLNMLFYGETGVGKSMFANLIYEYACSVKKTTKHYPFI 132 + ++Q + R L ++ GE+G GK + A +++Y ++ PF+ Sbjct: 142 RSAAMQEIYRVLARLM----QTDLTLMITGESGTGKELVARALHDYG-----KRRNGPFV 192 Query: 133 HFNCSDYANNPQLLMGQLFGVAKNAYTGATEEKKGLIEEANGGMLFLDEIHRLPPEGQEM 192 N + A L+ +LFG K A+TGA G E+A GG LFLDEI +P + Q Sbjct: 193 AINMA--AIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTR 250 Query: 193 FFTFIDRGLYRRLGESNSESSAEVQILAATTEAPESALLKTFQR-----RI-PMVIKIPN 246 + +G Y +G + ++V+I+AAT + + ++ + R R+ + +++P Sbjct: 251 LLRVLQQGEYTTVG-GRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPP 309 Query: 247 LLERTIEERASLITLFFNQEADRLGIPIK-VSQNSMRALLSYNCPNNIGQLKNDIQLACA 305 L +R E + F Q+A++ G+ +K Q ++ + ++ P N+ +L+N ++ A Sbjct: 310 LRDRA--EDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTA 367
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 30.0 bits (67), Expect = 0.008 Identities = 16/42 (38%), Positives = 24/42 (57%) Query: 67 DYTLDELRALTLAERYQQFPLYDRSWELEKIPTLEEVLKLLQ 108 D LD +R +TL E++++ EL + PT+EE KLL Sbjct: 258 DRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 43.2 bits (101), Expect = 1e-06 Identities = 86/375 (22%), Positives = 140/375 (37%), Gaps = 41/375 (10%) Query: 75 VEVKAEFQGTYEESLPKFQSVGGTKDAPTIVQVQEIGTKMMIDSGFIEPMQKFIDADNYD 134 ++V E EE KF V T D P I+ SG + I D Sbjct: 59 IKVTVEHPDKLEE---KFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAE----ITPDKAF 111 Query: 135 TSDLEENIANYYKVDGKFYSMPFNSSTPVMYYNKEAFKKAGLDPENPPQTFEEIEKAGLA 194 L + + +GK + P + YNK+ NPP+T+EEI Sbjct: 112 QDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLP-------NPPKTWEEIPALDKE 164 Query: 195 IKKSNPAMKGFALQA--YGWLYEELLANQGSLLMNNDNGRSKTPTKVAYDNAAGRSIFEW 252 +K + F LQ + W L+A G +NG+ V DNA ++ + Sbjct: 165 LKAKGKSALMFNLQEPYFTW---PLIAADGGYAFKYENGKYDI-KDVGVDNAGAKAGLTF 220 Query: 253 AEQMIKDETFANYGTNADNMVAGFINGDVAMFLQSSASAGQVIDGAKFEVGEAYLP-YPE 311 +IK++ N T+ A F G+ AM + + A ID +K G LP + Sbjct: 221 LVDLIKNKHM-NADTDYSIAEAAFNKGETAMTI-NGPWAWSNIDTSKVNYGVTVLPTFKG 278 Query: 312 KAEREGVVIGGASLWMSKGKETAEQEAAWDFLK-YLATPEVQAEWHVATGYFAINSKAYD 370 + + V + A + + +E A +FL+ YL T E G A+N Sbjct: 279 QPSKPFVGVLSAGI----NAASPNKELAKEFLENYLLTDE---------GLEAVNKDKPL 325 Query: 371 EAIVAEAYKKKPQLKVAVEQLQATKTSAATQGALMNMLPEERKIMETALEQVYNGAEIEP 430 A+ ++Y++ ++A + A A +G +M +P+ V N A Sbjct: 326 GAVALKSYEE----ELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQ 381 Query: 431 TFKAAVEQVNQAIEQ 445 T A++ I + Sbjct: 382 TVDEALKDAQTRITK 396
>PF05272#Virulence-associated E family protein Length = 892 Score = 35.8 bits (82), Expect = 3e-04 Identities = 15/55 (27%), Positives = 20/55 (36%), Gaps = 9/55 (16%) Query: 34 VLVGPSGCGKSTMLRMIAGLEDISDGTLKIDGEVVNHLPPKERDLAMVFQNYALY 88 VL G G GKST++ + GL+ SD I +D Y Sbjct: 600 VLEGTGGIGKSTLINTLVGLDFFSDTHFDI---------GTGKDSYEQIAGIVAY 645
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 33.3 bits (76), Expect = 0.002 Identities = 63/364 (17%), Positives = 132/364 (36%), Gaps = 29/364 (7%) Query: 40 LVSSGYEIQLVSVIFSLYGLFVAIFSWLTSFFVNIFSVRKVMIAGLIIYLVSAVILIAGI 99 LV S ++ +LY L + + + F R V++ L V I+ Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAP 94 Query: 100 YFELLPVIAVAYTLRGASYPLFAYAFLIWITLRSEFKNLGKATSWFWFSFNLGLTIISPL 159 + +L + + + GA+ + ++ G + F G+ + P+ Sbjct: 95 FLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFG----FMSACFGFGM-VAGPV 149 Query: 160 LASLLLKFSNSINIL---AVGMVMALLGSFL---SLKVNRDHLPTFNNNKSILYEMQEGI 213 L L+ FS A+ + L G FL S K R L N + G+ Sbjct: 150 LGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGM 209 Query: 214 MILFEYPRLAIGLVVKAINNIGQFGFVIMMPIFLVNHGYSLSQWGIIWATTYVVNSFA-G 272 ++ +A+ +++ + + +VI + + GI A +++S A Sbjct: 210 TVV--AALMAVFFIMQLVGQVPAALWVIFGEDRF---HWDATTIGISLAAFGILHSLAQA 264 Query: 273 ILFGNLGDYYGWRKIVCYFSGTLTALSCFLIGSVVFYFPGNFFLLMLAFIVFSFGIAAFG 332 ++ G + G R+ + + G ++ F ++ ++ + G Sbjct: 265 MITGPVAARLGERRALML------GMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMP 318 Query: 333 PLSALIPAMALEKKTTALSVLNLG-SGLSNFLGPVLVTVLF----QKFDGFFVLAVFAVL 387 L A++ E++ L + L++ +GP+L T ++ ++G + A L Sbjct: 319 ALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNG-WAWIAGAAL 377 Query: 388 YLLA 391 YLL Sbjct: 378 YLLC 381
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 36.7 bits (85), Expect = 1e-04 Identities = 34/139 (24%), Positives = 57/139 (41%), Gaps = 7/139 (5%) Query: 242 GYFLTLYTVIQMLCSFIIPTLMDQFGKMKQWMFFSSGLVFIGAGMIALAPSVVFFVLGII 301 G L LY ++Q C+ ++ L D+FG+ + + S + ++A AP + +G I Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGR-RPVLLVSLAGAAVDYAIMATAPFLWVLYIGRI 104 Query: 302 LAAI-GLGGLFPIAVLLPIRNTKTAEETSLWTSMIQSFGYILGGFMPVLMGAVKDTTGST 360 +A I G G A + I + + S FG + G + LMG S Sbjct: 105 VAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGF-----SP 159 Query: 361 IAPFLIMVLLSSVLLILSF 379 APF L+ + + Sbjct: 160 HAPFFAAAALNGLNFLTGC 178
>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature. Length = 538 Score = 26.2 bits (57), Expect = 0.025 Identities = 13/38 (34%), Positives = 22/38 (57%) Query: 55 INSEIVGHGLLSPVIIEQKESSIIEDAVMALAPLAVKK 92 ++ +IV LL+P + E S+++D V AL L K+ Sbjct: 272 VDLKIVSTSLLTPTSLTGGEESMLKDQVNALKGLNSKR 309
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 32.3 bits (73), Expect = 0.008 Identities = 15/58 (25%), Positives = 20/58 (34%) Query: 55 APEEEAITVPPAETPITPPIPETEQEPLKQDVATPTITDAPTIADPQPEAAPEKKTIP 112 A E V P P+ P PE E P A I P+P+ + + P Sbjct: 53 ADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQP 110
>PF05043#Transcriptional activator Length = 493 Score = 63.8 bits (155), Expect = 6e-13 Identities = 42/180 (23%), Positives = 85/180 (47%), Gaps = 7/180 (3%) Query: 3 LDSLLSTSTLREITLIKLLNQRYPNWISKEEISSQLYISNRTLKSTVIVINSFFKEKNYE 62 + LLS + R++ L++LL + W + E++ L + R +K + + S F + Sbjct: 1 MRDLLSKKSHRQLELLELLFEHKR-WFHRSELAELLNCTERAVKDDLSHVKSAFPD---- 55 Query: 63 QYIETDAALGYKLSASTNAISNLIYLERLKSSTSFNLLLSIYNKKFISSKHFTDYYFISL 122 I + G ++ + ++ ++Y K ST F++L I+ + ++ ++IS Sbjct: 56 -LIFHSSTNGIRIINTDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISS 114 Query: 123 TSFYKDVRTINLILDK-FGIQFNSREGTLSGESSQIRYFFCKFFWFSYGMSEWPFQQVDE 181 +S Y+ + IN ++ + F + + + G IRYFF ++F Y EWPF+ Sbjct: 115 SSLYRIISQINKVIKRQFQFEVSLTPVQIIGNERDIRYFFAQYFSEKYYFLEWPFENFSS 174