>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 27.8 bits (62), Expect = 0.034 Identities = 11/32 (34%), Positives = 17/32 (53%) Query: 302 IGVFIMMFLYGGWLVWVVLGFTAMYMILRLAT 333 +GV + +FL GW V+L + + L LA Sbjct: 54 LGVCLCLFLLSGWYGEVLLSYGRQVIFLALAK 85
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 108 bits (271), Expect = 7e-28 Identities = 81/430 (18%), Positives = 159/430 (36%), Gaps = 62/430 (14%) Query: 33 FIAALCAIFLVLLITLIIYGTYTRRINVNGEVISQPHPINIFSPQQGFITKKWVEVGDIV 92 +A FLV+ L + G NG++ I + + + V+ G+ V Sbjct: 59 LVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESV 118 Query: 93 RKGQHLYQIDV--SRTTFSGNVSLNSLEAINNQLSQIDSIINNTQKNKELTLLN------ 144 RKG L ++ + S + QI S K EL L + Sbjct: 119 RKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQN 178 Query: 145 ------------LRQQLAQYQKAHKKSQELVDNAGKGMDDMRRTMASYGTYQRQGLITKD 192 +++Q + +Q + + +D + + Y + + K Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRY---ENLSRVEKS 235 Query: 193 QLTNQRSLF----------YQQQNAFQSLNTQLIQESLQIAKLESEIS-------TRASD 235 +L + SL +Q+N + +L Q+ ++ESEI Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295 Query: 236 FDNDISQYLFQKGDLKRQLAE-----VDASGMLLINSPSDGKIENMSV-TQGQMVNVNDS 289 F N+I L Q D L + +I +P K++ + V T+G +V ++ Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355 Query: 290 LVQLTPSDNPYYCLVLWVPNNSVPYINTGDKVNIRYDAFPFEKFGQFPGRIISISNVPVS 349 L+ + P D+ L V N + +IN G I+ +AFP+ ++G G++ +I+ + Sbjct: 356 LMVIVPEDDTLEVTAL-VQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIE 414 Query: 350 QQEIASYNIAPRLPNGGLIEPYYKVIVALDDIHFRYQSKPLMLSNGLKANVTLFLEKRPL 409 Q + + VI+++++ +K + LS+G+ + R + Sbjct: 415 DQRLG---------------LVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSV 459 Query: 410 YQWMLSPFYD 419 ++LSP + Sbjct: 460 ISYLLSPLEE 469
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 28.1 bits (62), Expect = 0.047 Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%) Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70 S IA+ G++R + + + KS+ + + + I + + Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81 Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115 P + ++ + +L I + V+ Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127
>CABNDNGRPT#NodO calcium binding signature. Length = 479 Score = 53.1 bits (127), Expect = 1e-08 Identities = 39/161 (24%), Positives = 63/161 (39%), Gaps = 22/161 (13%) Query: 2138 DVAALFDLGGGDDVAKGYHKKKNIFTIGSGFKQYQGGENADTFILTSAVASKSHILSGGE 2197 D+AA+ L G + + G + + D + T + + + Sbjct: 250 DIAAIQRLYGANMTTRT----------GDSVYGFNSNTDRDFYTATDSSKALIFSVWDAG 299 Query: 2198 GNDTVALGEVLGNEIDSIIDISNGYYSQVNGGVEKQV-ALLYDFENILGHENVNDTIIGN 2256 G DT + G + I+++ G +S V G A EN +G ND ++GN Sbjct: 300 GTDTF---DFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTIENAIGGSG-NDILVGN 355 Query: 2257 DVDNYLNGMGGDDKIWGNGGNDLLALQSGLAQGGTGLDSYH 2297 DN L G G+D ++G G D L GG G D++ Sbjct: 356 SADNILQGGAGNDVLYGGAGADTLY-------GGAGRDTFV 389 Score = 45.0 bits (106), Expect = 4e-06 Identities = 31/137 (22%), Positives = 47/137 (34%), Gaps = 21/137 (15%) Query: 2631 SSGNDEVVITSATFLPGNYIDTGDGNDAIIYIRGHEGT-MLKGGGGDDTYYYSAGSGAIN 2689 SGND +V SA N + G GND + G G L GG G DT+ Y +G + Sbjct: 346 GSGNDILVGNSA----DNILQGGAGNDVLY---GGAGADTLYGGAGRDTFVYGSGQDSTV 398 Query: 2690 IADTSGLDHLY-----------LDKHILLHTLSAERRENNLVLNIADNTSGRIIFVDWYL 2738 A D + + + ++L S I + + Sbjct: 399 AAYDWIADFQKGIDKIDLSAFRNEGQLSFVQDQFTGKGQEVMLQWDAANS--ITNLWLHE 456 Query: 2739 ADENKVEFIWVEDSQIT 2755 A + V+F+ Q Sbjct: 457 AGHSSVDFLVRIVGQAA 473
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 29.3 bits (65), Expect = 0.004 Identities = 16/48 (33%), Positives = 23/48 (47%), Gaps = 8/48 (16%) Query: 31 KKTFRQLLGLLSGFNIVFWCTDNFSAY-------EMLPDEKHIRSKLY 71 ++ F Q+ G +I+FW D F Y E+ PD K + KLY Sbjct: 70 EEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPD-KAFQDKLY 116
>DPTHRIATOXIN#Diphtheria toxin signature. Length = 567 Score = 30.5 bits (68), Expect = 0.035 Identities = 19/54 (35%), Positives = 24/54 (44%), Gaps = 9/54 (16%) Query: 622 GIGKTETALALADSLFGGEKSLITINLSEYQEAHTVSQLKGSPPGYVGYGQGGV 675 GIG +A A AD + KS + N S Y G+ PGYV Q G+ Sbjct: 23 GIGAPPSAHAGADDVVDSSKSFVMENFSSYH---------GTKPGYVDSIQKGI 67
>AUTOINDCRSYN#Autoinducer synthesis protein signature. Length = 216 Score = 254 bits (651), Expect = 2e-89 Identities = 90/175 (51%), Positives = 121/175 (69%) Query: 1 MEFDEYDNSDTRYLLGIYQGQLICSVRFIELHLPNMITHTFNALFDDVALPKRGYIESSR 60 MEFD+YDN++T YL GI +ICS+RFIE PNMIT TF F ++ +P+ Y+ESSR Sbjct: 42 MEFDQYDNNNTTYLFGIKDNTVICSLRFIETKYPNMITGTFFPYFKEINIPEGNYLESSR 101 Query: 61 FFVDKTRAKLLFGNHYPISYLFFLSIINYSRHNGYTGIYTIVSRAMLTILKRSGWQVEVI 120 FFVDK+RAK + GN YPIS + FLS+INYS+ GY GIYTIVS MLTILKRSGW + V+ Sbjct: 102 FFVDKSRAKDILGNEYPISSMLFLSMINYSKDKGYDGIYTIVSHPMLTILKRSGWGIRVV 161 Query: 121 KEAHITEKERIYLLHLPIDRDNQARLLLQVNQRLQDPCSVLSTWPISLPVMPESA 175 ++ ++ER+YL+ LP+D +NQ L ++N+ + L WP+ +P A Sbjct: 162 EQGLSEKEERVYLVFLPVDDENQEALARRINRSGTFMSNELKQWPLRVPAAIAQA 216
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 36.1 bits (83), Expect = 1e-04 Identities = 25/116 (21%), Positives = 41/116 (35%), Gaps = 17/116 (14%) Query: 171 FQRSSAVLTPFFSRLLGELAPAFNEM---DNKIIITGHTDASRYRDQLLYNNWNLSGERA 227 F + A L P L +L + + D +++ G+TD D N LS RA Sbjct: 223 FNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTD-RIGSDAY---NQGLSERRA 278 Query: 228 LMAHKALVNGGLDEGRVLQI----------NAMADQMLLDPTDPLAAKNRRIEIMV 273 L++ G+ ++ N + A +RR+EI V Sbjct: 279 QSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334
>FLGHOOKFLIK#Flagellar hook-length control protein signature. Length = 375 Score = 29.8 bits (66), Expect = 0.017 Identities = 27/114 (23%), Positives = 47/114 (41%), Gaps = 11/114 (9%) Query: 253 QHATIRLDPPDMGKIDISIHFEGGKLQVNINANQGEVYRALQ-----------QSSAELR 301 Q A +RL P D+G++ IS+ + + Q+ + + V AL+ +S +L Sbjct: 257 QSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPVLRTQLAESGIQLG 316 Query: 302 QTLIGQNSTEVNVQVSANSQQQQQQPRHSNHHGQADILAAQHFESQAEINADDG 355 Q+ I S Q ++ QQ Q+ H G+ D Q + + G Sbjct: 317 QSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVSLQGRVTGNSG 370
>FLAGELLIN#Flagellin signature. Length = 507 Score = 106 bits (266), Expect = 2e-27 Identities = 66/329 (20%), Positives = 120/329 (36%), Gaps = 8/329 (2%) Query: 5 IHTNASAKTAINSLSNAGLANAKSSQRLSTGFRINSPADNAAGLQITNRMEKFLNSAGQA 64 I+TN+ + N+L+ + + + + +RLS+G RINS D+AAG I NR + QA Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63 Query: 65 KQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKELQ 124 +N + I++ Q +G L E L +++L+ QA N TNS +D ++IQ E + +E+ Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123 Query: 125 NALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLDLNSVIAELTESVTKQAPKITG 184 N T++N K+ + +M Q G + ++ +DL + + Sbjct: 124 RVSNQTQFNGVKVLSQDNQM----KIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179 Query: 185 KSSSATGSLEKQAYDLDKAVTDTKSLVAGAEGVQKTLEHDFAASGNKAVAEIKIPEYKDA 244 + S K D + +K Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAAN----GQ 235 Query: 245 LGKTVPEVVIALGAVITSANSNQMKDAVAALKTTHDAAVKAEATFQAKNSTGGGVMNMQL 304 L E A+ T+ ++ +A A ++ T Sbjct: 236 LTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDG 295 Query: 305 ADKDLAMKADKKLSDVIDAYGAFRATLGA 333 K +K++ + A A + A Sbjct: 296 NGKVSTTINGEKVTLTVADITAGAANVDA 324 Score = 44.6 bits (105), Expect = 5e-07 Identities = 46/302 (15%), Positives = 88/302 (29%), Gaps = 10/302 (3%) Query: 64 AKQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKEL 123 +++ S + D + K + A + D+ + +L + Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDD 240 Query: 124 QNALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLDLNSVIAELTESVTKQAPKIT 183 + G K + E T++ K++ Sbjct: 241 AENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVS 300 Query: 184 GKSSSATGSLEKQAYDLDKAVTDTKSLVAGAEGVQKTLEHDFAASGNKAVAEIKIPEYKD 243 + +L A D +L + + F K+ + + Sbjct: 301 TTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDL-E 359 Query: 244 ALGKTVPEVVIALGAVITSANSNQMKDAVAALKTTHDAAVKAEATFQAKNSTGGGVMNMQ 303 A E I + +AN+ K +A D +T +++ Sbjct: 360 ANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAA-------- 411 Query: 304 LADKDLAMKADKKLSDVIDAYGAFRATLGANQNRLQSSSNNLDNMISNTAQALGSIKDTD 363 A K + + A R++LGA QNR S+ NL N ++N A I+D D Sbjct: 412 -AAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDAD 470 Query: 364 FA 365 +A Sbjct: 471 YA 472
>FLAGELLIN#Flagellin signature. Length = 507 Score = 98.2 bits (244), Expect = 2e-24 Identities = 67/327 (20%), Positives = 119/327 (36%), Gaps = 6/327 (1%) Query: 5 IHTNASAKTAINSLSNAGLSNAKSSQRLSTGFRINSPADNAAGLQITNRMEKFLNSAGQA 64 I+TN+ + N+L+ + S + + +RLS+G RINS D+AAG I NR + QA Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63 Query: 65 KQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKELQ 124 +N + I++ Q +G L E L +++L+ QA N TNS +D ++IQ E + +E+ Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123 Query: 125 NALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLDLNSVIAELTESVTKKATPVKA 184 N T++N K+ + +M Q G + ++ +DL + + K Sbjct: 124 RVSNQTQFNGVKVLSQDNQM----KIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179 Query: 185 DVAGSTLEKEADVLDKATKAAKKAKEAAEGVQKTLETDFAVAGNKASAKITIPEYKDALG 244 G K + + K+ + L Sbjct: 180 ATVGDL--KSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLT 237 Query: 245 KTVPEVVINSGTAITPANSTQMKDAVAALKATHDAAVKAEATFQAKNSTGGGVMNMQLAD 304 E T ++ +A A A ++ T Sbjct: 238 TDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNG 297 Query: 305 KDLAMKADKKLSDVIDAYGAFRATLGA 331 K +K++ + A A + A Sbjct: 298 KVSTTINGEKVTLTVADITAGAANVDA 324 Score = 61.2 bits (148), Expect = 3e-12 Identities = 57/336 (16%), Positives = 111/336 (33%), Gaps = 10/336 (2%) Query: 64 AKQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKEL 123 +++ S + D + K + A + D+ + +L + Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDD 240 Query: 124 QNALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLDLNSVIAELTESVTKKATPVK 183 + G K + E T++ V Sbjct: 241 AENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVS 300 Query: 184 ADVAGSTLE-KEADVLDKATKAAKKAKEAAEGVQKTLETDFAVAGNKASAKITIPEYKDA 242 + G + AD+ A ++++ V ++ +K + +A Sbjct: 301 TTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEA 360 Query: 243 LGKTVPEVVINSGTAITPANSTQMKDAVAALKATHDAAVKAEATFQAKNSTGGGVMNMQL 302 E I A AN+ K +A D +T +++ Sbjct: 361 NNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAA--------- 411 Query: 303 ADKDLAMKADKKLSDVIDAYGAFRATLGANQNRLQSSSNNLDNMISNTAQALGSIKDTDF 362 A K + + A R++LGA QNR S+ NL N ++N A I+D D+ Sbjct: 412 AAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADY 471 Query: 363 ADEMKNHAQSEMLMQSSVMMLKKANAATQLISTLLQ 398 A E+ N +++++L Q+ +L +AN Q + +LL+ Sbjct: 472 ATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507
>FLAGELLIN#Flagellin signature. Length = 507 Score = 105 bits (263), Expect = 7e-27 Identities = 56/204 (27%), Positives = 96/204 (47%), Gaps = 4/204 (1%) Query: 5 IHTNGSAKTAINSLSKAGLANAKSSQRLSTGFRINSPADNAAGLQITNRMEKFLNSAGQA 64 I+TN + N+L+K+ + + + +RLS+G RINS D+AAG I NR + QA Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63 Query: 65 KQNIQESIAMLQIADGGLAESVKTLNTMKKLATQAANDTNSAADREAIQKEFTELGQELQ 124 +N + I++ Q +G L E L +++L+ QA N TNS +D ++IQ E + +E+ Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123 Query: 125 NALNNTEYNAEKLFADGGKMRKELNFQSGTDANSSLKLDLNKVIEELTESVTEERKKVTG 184 N T++N K+ + +M Q G + ++ +DL K+ + Sbjct: 124 RVSNQTQFNGVKVLSQDNQM----KIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179 Query: 185 TSASATGSLEKQAFDLNEATTKAN 208 + S K + AN Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGAN 203 Score = 60.8 bits (147), Expect = 3e-12 Identities = 51/334 (15%), Positives = 102/334 (30%), Gaps = 7/334 (2%) Query: 64 AKQNIQESIAMLQIADGGLAESVKTLNTMKKLATQAANDTNSAADREAIQKEFTELGQEL 123 +++ S + D + K + A + D+ + +L + Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDD 240 Query: 124 QNALNNTEYNAEKLFADGGKMRKELNFQSGTDANSSLKLDLNKVIEELTESVTEERKKVT 183 + G K + T++ + KV+ Sbjct: 241 AENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVS 300 Query: 184 GTSASATGSLEKQAFDLNEATTKANTALKEAEILQEKITTNLTKTFPASVDIPGYINAKG 243 T +L A A L+ ++ + + + N Sbjct: 301 TTINGEKVTL-TVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTK------NESA 353 Query: 244 VPVAHEIIPSGTPINTGHIGKIQTAVAALRATHDTAAKTEDEFQAEHSTGGGVMNLLLRN 303 E + + + + A A KT + + Sbjct: 354 KLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAA 413 Query: 304 KDRAMEADKKLSDVIDAYGAFRATLGANQNRLQSSSNNLDNMISNTTQALGSIKDTDFAD 363 K + + A R++LGA QNR S+ NL N ++N A I+D D+A Sbjct: 414 KKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYAT 473 Query: 364 EMKNHAQSEMLMQSSVMMLKKANAATQLISTLLQ 397 E+ N +++++L Q+ +L +AN Q + +LL+ Sbjct: 474 EVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507
>FLAGELLIN#Flagellin signature. Length = 507 Score = 100 bits (250), Expect = 3e-25 Identities = 64/328 (19%), Positives = 120/328 (36%), Gaps = 10/328 (3%) Query: 5 IHTNASAKTAINSLSNEGLANAKSSQRLSTGFRINSPADNAAGLQITNRMEKFLNSAGQA 64 I+TN+ + N+L+ + + + +RLS+G RINS D+AAG I NR + QA Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63 Query: 65 KQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKELQ 124 +N + I++ Q +G L E L +++L+ QA N TNS +D ++IQ E + +E+ Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123 Query: 125 NALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLDLNSV------IAELTESVTKP 178 N T++N K+ + +M Q G + ++ +DL + + + K Sbjct: 124 RVSNQTQFNGVKVLSQDNQM----KIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179 Query: 179 GLKANSGGTAEEKELARLEGLAKDAKSTAATTKSAETTLLVDDATGKGGKGGNASIDIII 238 + + + + + + + T K Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239 Query: 239 PAHKDTTGKDVAEKKIASGTAITPANITSMADAKAYWDKQEIETPKAVNEYVVKHSADSG 298 A +T K +GTA A ++ K ++ Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKV 299 Query: 299 VMNMQLADKDLAMKADKKLSDVIDAYGA 326 + L + + +DA Sbjct: 300 STTINGEKVTLTVADITAGAANVDAATL 327 Score = 63.9 bits (155), Expect = 4e-13 Identities = 56/338 (16%), Positives = 105/338 (31%), Gaps = 12/338 (3%) Query: 64 AKQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKEL 123 +++ S + D + K + A + D+ + +L + Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDD 240 Query: 124 QNALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLDLNSVIAELTESVTKPGLKAN 183 + G K + E T++ K + Sbjct: 241 AENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVS 300 Query: 184 SGGTAEEKELARLEGLAKDAKSTAATTKSAETTLLVDDATGKGGKGGNASIDIIIPAHKD 243 + E+ L + A A AAT +S++ + Sbjct: 301 TTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESA------- 353 Query: 244 TTGKDVAEKKIASGTAITPANITSMADAKAYWDKQEIETPKAVNEYVVKHSADSGVMNMQ 303 A + + IT A+A +T + + Sbjct: 354 KLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAA 413 Query: 304 LADKDLAM-KADKKLSDVIDAYGAFRATLGANQNRLQSSSNNLDNMISNTAQALGSIKDT 362 + D LS V R++LGA QNR S+ NL N ++N A I+D Sbjct: 414 KKSTANPLASIDSALSKVDAV----RSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDA 469 Query: 363 DFADEMKNHAQSEMLMQSSVMMLKKANAATQLISTLLQ 400 D+A E+ N +++++L Q+ +L +AN Q + +LL+ Sbjct: 470 DYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 158 bits (402), Expect = 4e-45 Identities = 93/324 (28%), Positives = 157/324 (48%), Gaps = 8/324 (2%) Query: 4 IRTAFSGMQATQAHLNATSMNIANMHTPGYSRQRAEQSAIGADGQGGVNAGNGVNVDGIR 63 I A SG+ A QA LN S NI++ + GY+RQ + + G GNGV V G++ Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGVQ 63 Query: 64 RLSQQYVVMQEWRANSQQQYYDAGEQYLNAVELMVSNESTSLATGLNNFFSSLSAATQLP 123 R ++ Q A +Q A + ++ ++ M+S ++SLAT + +FF+SL Sbjct: 64 REYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVSNA 123 Query: 124 DSPPMRQQIIESANAMALRFNNVNNFIVQQKKSIGQQRDITVKEINSLTRSIADYNQQIL 183 + P RQ +I + + +F + ++ Q K + +V +IN+ + IA N QI Sbjct: 124 EDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQIS 183 Query: 184 K--NRSDGNNINDLLDKQELQIKKLSGLIETQVNQAEDGTYRISVKQGQPLVNGAVAAEL 241 + G + N+LLD+++ + +L+ ++ +V+ + GTY I++ G LV G+ A +L Sbjct: 184 RLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTARQL 243 Query: 242 AVDTSSVDTKITLHFSGATQGMNMSC------GGQLGGINDYELTTLKKLQDSTQEMAKT 295 A SS D T N+ G LGGI + L + +++ ++A Sbjct: 244 AAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLALA 303 Query: 296 VADKFNDQLGKGTDFTGAPGQDLF 319 A+ FN Q G D G G+D F Sbjct: 304 FAEAFNTQHKAGFDANGDAGEDFF 327 Score = 61.9 bits (150), Expect = 2e-12 Identities = 47/182 (25%), Positives = 79/182 (43%), Gaps = 8/182 (4%) Query: 275 NDYELTTLKKLQDSTQEMAKTVADKFNDQLGKGTDFTGAPG-QDLFVFNPSDPNGMLQLS 333 N +++T L +T A+ G FTG P D F P + ++ + Sbjct: 368 NQWQVTRLA---SNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVS-DAIVNMD 423 Query: 334 AITAEQLALAAHGK-PAG--DNSNLFELLDIRKTPVTGMKNVPLDDAATALVGYIAITSN 390 + ++ +A + AG DN N LLD++ T +DA +LV I + Sbjct: 424 VLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTA 483 Query: 391 RNHSELENAENTLNQATRYHESFSGVNNDEEAMNLMEYQRAYQSNMKVIATGDKLFSDLL 450 + N + Q + +S SGVN DEE NL +Q+ Y +N +V+ T + +F L+ Sbjct: 484 TLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALI 543 Query: 451 AL 452 + Sbjct: 544 NI 545
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 45.5 bits (107), Expect = 4e-09 Identities = 19/79 (24%), Positives = 41/79 (51%), Gaps = 4/79 (5%) Query: 18 GDLQPQDLEQAAVQFEAVFMRTLLQQMRKAAEVLAADDDPFNSKQQRMMRDFYDDKLAST 77 G+ ++ A Q E +F++ +L+ MR A D F+S+ R+ YD ++A Sbjct: 26 GEDPAANIRPVARQVEGMFVQMMLKSMRDAL----PKDGLFSSEHTRLYTSMYDQQIAQQ 81 Query: 78 LASQRSSGIANLLIQQLGS 96 + + + G+A ++++Q+ Sbjct: 82 MTAGKGLGLAEMMVKQMTP 100
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 330 bits (848), Expect = e-113 Identities = 146/359 (40%), Positives = 210/359 (58%), Gaps = 12/359 (3%) Query: 39 LVLPTASAQP--LGSLVDIQGVRGNQLVGYSLVVGLDGSGDK-NQVKFTGQSMANMLRQF 95 L P A A + + +Q R NQL+GY LVVGL G+GD FT QSM ML+ Sbjct: 19 LSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMRAMLQNL 78 Query: 96 GVQLPEKMDPKVKNVAAVAISATLPPGYGRGQSIDITVSSIGDAKSLRGGTLLLTQLRGA 155 G+ KN+AAV ++A LPP G +D+TVSS+GDA SLRGG L++T L GA Sbjct: 79 GITTQGG-QSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIMTSLSGA 137 Query: 156 DGEVYALAQGNVVVGGIKAEGDSGSSVTVNTPTVGRIPNGASIERQIPSDFQTNNQVVLN 215 DG++YA+AQG ++V G A+GD +++T T R+PNGA IER++PS F+ + +VL Sbjct: 138 DGQIYAVAQGALIVNGFSAQGD-AATLTQGVTTSARVPNGAIIERELPSKFKDSVNLVLQ 196 Query: 216 LKRPSFKSANNVALALNR----AFGANTATAQSATNVMVNAPQDAGARVAFMSLLEDVQI 271 L+ P F +A VA +N +G A + + + V P+ A M+ +E++ + Sbjct: 197 LRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRVA-DLTRLMAEIENLTV 255 Query: 272 NAGEQSPRVVFNARTGTVVIGEGVMVRAAAVSHGNLTVNIREQKNVSQPNPLGGGKTVTT 331 + +VV N RTGT+VIG V + AVS+G LTV + E V QP P G+T Sbjct: 256 ET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFSRGQTAVQ 314 Query: 332 PESDIEVTKGKNQMVMVPAGTRLRSIVNTINSLGASPDDIMAILQALYEAGALDAELVV 390 P++DI + +++ +V G LR++V +NS+G D I+AILQ + AGAL AELV+ Sbjct: 315 PQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQAELVL 372
>FLGLRINGFLGH#Flagellar L-ring protein signature. Length = 232 Score = 153 bits (389), Expect = 8e-49 Identities = 74/221 (33%), Positives = 109/221 (49%), Gaps = 13/221 (5%) Query: 4 FLILTPMVLALCGCESPALLVQKDDAEFAPPANLIQPATVTEGGGLFQPANS-----WSL 58 + I + +VL+L GC A A P P G +FQ A L Sbjct: 9 YAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVA---NGSIFQSAQPINYGYQPL 65 Query: 59 LQDRRAYRIGDILTVILDESTQSSKQAKTNFGKKNDMSLGVPEVLGKKLNKFGGSI---- 114 +DRR IGD LT++L E+ +SK + N + + G V FG + Sbjct: 66 FEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNARADVE 125 Query: 115 -SGKRDFDGSATSAQQNMLRGSITVAVHQVLPNGVLVIRGEKWLTLNQGDEYMRVTGLVR 173 SG F+G + N G++TV V QVL NG L + GEK + +NQG E++R +G+V Sbjct: 126 ASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVN 185 Query: 174 ADDVARDNSVSSQRIANARISYAGRGALSDANSAGWLTRFF 214 ++ N+V S ++A+ARI Y G G +++A + GWL RFF Sbjct: 186 PRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFF 226
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 41.9 bits (98), Expect = 2e-06 Identities = 11/42 (26%), Positives = 20/42 (47%) Query: 213 QLEQGALEGSNVQVVEEMVDMITVQRAYEMNAKMVSAADDML 254 QL S V + EE ++ Q+ Y NA+++ A+ + Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIF 539 Score = 40.7 bits (95), Expect = 3e-06 Identities = 20/78 (25%), Positives = 35/78 (44%), Gaps = 14/78 (17%) Query: 2 NSALWVSKTGLAAQDAKMGAISNNLANVNTDGFKRDRVVFADLFYQNQRTPGAPLDQNNT 61 +S + + +GL A A + SNN+++ N G+ R + A N+T Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA--------------QANST 46 Query: 62 TPSGIQFGSGVQIVGTQK 79 +G G+GV + G Q+ Sbjct: 47 LGAGGWVGNGVYVSGVQR 64
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 39.9 bits (93), Expect = 1e-05 Identities = 20/60 (33%), Positives = 27/60 (45%), Gaps = 5/60 (8%) Query: 2 SFSIANTALNAHTEQLNTISNNIANSATKGFKASR----TEFSSMYAQSQ-PLGVAVSGV 56 + A + LNA LNT SNNI++ G+ S++ A GV VSGV Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62 Score = 34.2 bits (78), Expect = 8e-04 Identities = 10/42 (23%), Positives = 22/42 (52%) Query: 371 LENSNVDITAELVGLMTAQRNYQASTKIISTNDSMMNALFQV 412 S V++ E L Q+ Y A+ +++ T +++ +AL + Sbjct: 504 QSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 29.9 bits (67), Expect = 0.004 Identities = 6/37 (16%), Positives = 19/37 (51%) Query: 102 VNVVSEMADMMSASRSFETNVEVLNSVKSMQQSVLKL 138 VN+ E ++ + + N +VL + ++ +++ + Sbjct: 509 VNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 45.5 bits (108), Expect = 3e-10 Identities = 25/74 (33%), Positives = 37/74 (50%) Query: 14 GLHLVLMISIVAIVPSLLIGLLVSIFQATTQINEQTLSFLPRLVMTMLVLIFAGKWMMIK 73 L+LVL++S + + +IGLLV +FQ TQ+ EQTL F +L+ L L W Sbjct: 11 ALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLSGWYGEV 70 Query: 74 LSDFTVSIFQQAAQ 87 L + + A Sbjct: 71 LLSYGRQVIFLALA 84
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 105 bits (263), Expect = 3e-29 Identities = 72/237 (30%), Positives = 128/237 (54%), Gaps = 3/237 (1%) Query: 19 LPFVRILSFLHFCPVIRHKAFTRKAKIGTALLLAILITPMISQPVVSGELLSIENLLLAG 78 P +R+L+ + P++ ++ ++ K+G A+++ I P + V + S L LA Sbjct: 18 WPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDV--PVFSFFALWLAV 75 Query: 79 EQILWGWLFGSMLHLVLAALEAAGQILSMNMGLGMAMMNDPTSGASTAVISQIIFTFSVL 138 +QIL G G + AA+ AG+I+ + MGL A DP S + V+++I+ ++L Sbjct: 76 QQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLALL 135 Query: 139 IFFTLNGHLLFVTILLKSFSSWPIG-EAINDFSLRSLALSLGWILSSATLLALPTTFIML 197 +F T NGHL +++L+ +F + PIG E +N + +L + I + +LALP ++L Sbjct: 136 LFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALPLITLLL 195 Query: 198 IVQGSFGLLNRISPTLNLFSLGFPIGMLFGLLCLLLLAINIPDHYLHLTNEILTQFE 254 + + GLLNR++P L++F +GFP+ + G+ + L I HL +EI Sbjct: 196 TLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFNLLA 252
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 298 bits (764), Expect = e-101 Identities = 97/344 (28%), Positives = 173/344 (50%) Query: 5 SGEKSEKPTAGKLSKARKKGDIPRSKDVTMAAGLVTSFILLSLFLPYYKALVSQSFVSVA 64 SGEK+E+PT K+ ARKKG + +SK+V A +V +L YY S+ + A Sbjct: 2 SGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPA 61 Query: 65 QLASQLDDQGALEQFLLANLFIFAKFLATLIPIPLFSMLATLIPGGWNFTPVKLIPDLKK 124 + + Q L F L L ++ + ++ G+ + + PD+KK Sbjct: 62 EQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKK 121 Query: 125 LSPLAGIKRIFSASNGTEVLKMLAKCSIVLYTLYLVVHSSLDDLLHLQTLPLEEAITQGF 184 ++P+ G KRIFS + E LK + K ++ +++++ +L LL L T +E Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLG 181 Query: 185 AQYHHILLYFIAIVVVFAAIDIPLSHHLFTKKMKMTKQEVKQEHKNNDGNPEIKSRVRQL 244 +++ VV + D ++ + K++KM+K E+K+E+K +G+PEIKS+ RQ Sbjct: 182 QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQF 241 Query: 245 QRQYAIGQINKTVPSADVIITNPTHFSVALKYAPEKASAPYIVAKGKDDIALYIRSIAQK 304 ++ + + V + V++ NPTH ++ + Y + P + K D +R IA++ Sbjct: 242 HQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEE 301 Query: 305 HKIEIVEFPPLARAIYHTTKVNQQIPAQLYRAIAQVLTYVMQIK 348 + I++ PLARA+Y V+ IPA+ A A+VL ++ + Sbjct: 302 EGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQN 345
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 49.6 bits (118), Expect = 7e-09 Identities = 21/97 (21%), Positives = 40/97 (41%), Gaps = 12/97 (12%) Query: 129 QTEPNIKAVAKMRPDLIIISATGDDSTLELYDQLSAIAPTLVINYDDKS-----WQELTL 183 +TEPN++ + +M+P ++ SA S + L+ IAP N+ D ++ Sbjct: 84 RTEPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPLAMARKSLT 139 Query: 184 QLGQATGHEGDAEQVI---DKFARRLNEVKQKITLPP 217 ++ + AE + + F R + K P Sbjct: 140 EMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARP 176
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 671 bits (1732), Expect = 0.0 Identities = 229/875 (26%), Positives = 374/875 (42%), Gaps = 67/875 (7%) Query: 2 RIIKKIPIAMTTSLIMLSGAVSA--------IDFNTDAMDANDKQNIDLSHFTNVGYIMP 53 I+K +A + ++ A +A + FN + + + DLS F N + P Sbjct: 16 LHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPP 75 Query: 54 GEYRLEINVNNHRIPEQVIAFYARDDEPNSSEVCLPEAVVEQFGLKPDVLQKITFWHEGQ 113 G YR++I +NN + + + F D E CL A + GL + + + Sbjct: 76 GTYRVDIYLNNGYMATRDVTFNTGDSE-QGIVPCLTRAQLASMGLNTASVSGMNLLADDA 134 Query: 114 CADLREL-AGLTTEVDLATSTLAINVPQDWMEYSDSNWVPSSQWDEGIPGSLLDYNVNSL 172 C L + T ++D+ L + +PQ +M ++P WD GI LL+YN + Sbjct: 135 CVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGN 194 Query: 173 FSKPKESGSTRNISLNGTSGLNAGPWRLRGDYQGNYSHNSGEQNSSTSTFDWSRIYMYRA 232 + + G++ LN SGLN G WRLR + +Y+ + S + ++ R Sbjct: 195 SVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNK-WQHINTWLERD 253 Query: 233 IKSLAATLSVGENYFASSLFDTFRYAGASLSSDERMLPPNLRGYAPEVSGIARTNAKVTV 292 I L + L++G+ Y +FD + GA L+SD+ MLP + RG+AP + GIAR A+VT+ Sbjct: 254 IIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTI 313 Query: 293 SQQGRILYQTTVASGPFRIQELSD-SVSGRLDVSVEEQDGTVQTFQVETAAVPYLTRPGA 351 Q G +Y +TV GPF I ++ SG L V+++E DG+ Q F V ++VP L R G Sbjct: 314 KQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGH 373 Query: 352 IRYKTSVGQPSTLNHGTEGPVFASGEFSWGVSNRWSLFGGAIGSGDYNAVSVGVGRDLYA 411 RY + G+ + N E P F G+ W+++GG + Y A + G+G+++ A Sbjct: 374 TRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGA 433 Query: 412 FGAISTDITQTRASGLPNQETQSGKSLRVRYAKRFDELNSDISLAGNRFFEREFMSMNQY 471 GA+S D+TQ ++ LP+ G+S+R Y K +E ++I L G R+ + + Sbjct: 434 LGALSVDMTQANST-LPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADT 492 Query: 472 LGTRYFDNDL--------------------GRNKEMYTVTASKNFPDIQTNINFSYSYQN 511 +R ++ + +T ++ T + S S+Q Sbjct: 493 TYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTST-LYLSGSHQT 551 Query: 512 YWDQP-TSNSYSATVSHAFDAFSLKDMTVNLSASRSKNNGV--NDDVLYLSFSVPLGNQ- 567 YW + A ++ AF +D+ LS S +KN D +L L+ ++P + Sbjct: 552 YWGTSNVDEQFQAGLNTAF-----EDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWL 606 Query: 568 ----------QTLSYSGQH-NGQGNNQTVNYSNSSAIDS--SYRLSAGVNNSNDNGARGQ 614 + SYS H + D+ SY + G D + Sbjct: 607 RSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGST 666 Query: 615 FSGFYIHRSSIAETSLNVAYAQDDFTSTGVSMRGGATVTAKGAALHGPGMSGGTRLMVNT 674 +R ++ DD + GG A G L P T ++V Sbjct: 667 GYATLNYRGGYGNANIG-YSHSDDIKQLYYGVSGGVLAHANGVTLGQP--LNDTVVLVKA 723 Query: 675 DDIAGVPLEERNI-RSNRFGIAVLNNINSYYRTDTRIDINQLADDVEVKQSAVEFALTEG 733 +E + R++ G AVL Y +D N LAD+V++ + T G Sbjct: 724 PGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRG 783 Query: 734 AIGYRRFAMMKGEKVLATISLTDSSHPPFGSLVISAKGQELGIVSDDGFTYLSGVEPGET 793 AI F G K+L T++ ++ PFG++V S Q GIV+D+G YLSG+ Sbjct: 784 AIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGK 842 Query: 794 LDVVW--SGAKQCQV--AIPAVIQPQA--QILLPC 822 + V W C +P Q Q Q+ C Sbjct: 843 VQVKWGEEENAHCVANYQLPPESQQQLLTQLSAEC 877
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 35.9 bits (83), Expect = 8e-06 Identities = 13/78 (16%), Positives = 33/78 (42%), Gaps = 1/78 (1%) Query: 15 FIGYIIFHFNVMGGGDVKLITVLLLALTAEQSLNFIIYTAVMGGVVMVVGLLINRVDIQK 74 + ++ MG GD KL+ L L + ++ ++++G + + +L+ K Sbjct: 200 WAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILLRNHHQSK 259 Query: 75 RGVPYAVAITAGFLSSVL 92 +P+ + ++L Sbjct: 260 -PIPFGPYLAIAGWIALL 276
>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature. Length = 1104 Score = 27.8 bits (61), Expect = 0.022 Identities = 14/54 (25%), Positives = 21/54 (38%), Gaps = 2/54 (3%) Query: 62 DAENVLSYQQLFEHNFNRQVTVLGSLINTAPSAELTVNFSHSVADLINGNSEEN 115 D Y +F H N T +N P A + + S V + IN + E+ Sbjct: 747 DGNGNYVYDVVF-HGMN-TDTNTDVHVNKEPKAVIKSDSSVIVEEEINFDGTES 798
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 28.4 bits (63), Expect = 0.013 Identities = 11/24 (45%), Positives = 18/24 (75%) Query: 134 LKARTLIQVLEPIKARGALETDLL 157 LKA +I +L+ IK+ GAL+ +L+ Sbjct: 348 LKADGIIAILQGIKSAGALQAELV 371
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 29.5 bits (66), Expect = 0.008 Identities = 17/89 (19%), Positives = 25/89 (28%) Query: 39 LGLAYLAQGDLTAARKNLEKAVEADPQDYRTQLGMAFYAQRIGENSAAEQRYQQAMKLAP 98 L G A K + D D R LG+ Q +G+ A Y + Sbjct: 42 LAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDI 101 Query: 99 GNGTVLNNYGAFLCSLGQYVSAQQQFSAA 127 + L G+ A+ A Sbjct: 102 KEPRFPFHAAECLLQKGELAEAESGLFLA 130
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 149 bits (376), Expect = 2e-38 Identities = 119/438 (27%), Positives = 183/438 (41%), Gaps = 44/438 (10%) Query: 1048 LTMASLNGTGNFNLGSVMQSDSVAPLNVSGDANGDFIIAMNSSGQAPTNLN----VVNTN 1103 LT+ +L G+G F + L V DA+G + + +SG P + N V Sbjct: 473 LTVNTLAGSGLFRMNVFADLGLSDKLVVMQDASGQHRLWVRNSGSEPASANTLLLVQTPL 532 Query: 1104 GGDARFALAN--GPVALGNYMTNLAKDANGNFVLTADKSAMTPGTAGIL----------- 1150 G A F LAN G V +G Y LA + NG + L K+ P A Sbjct: 533 GSAATFTLANKDGKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQ 592 Query: 1151 -------------------AVANTTPV-----IFNAELSSIQQRLDKQSTETNQSGMWGS 1186 A NT V ++ AE +++ +RL + + G WG Sbjct: 593 PEAPAPQPPAGRELSAAANAAVNTGGVGLASTLWYAESNALSKRLGELRLNPDAGGAWGR 652 Query: 1187 YLNNNFAVKGRAAN-FDQKLNGMTLGGDKATALADGVLSVGGFASYSSSDIKTDYQSKGK 1245 + RA FDQK+ G LG D A A+A G +GG A Y+ D G Sbjct: 653 GFAQRQQLDNRAGRRFDQKVAGFELGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGH 712 Query: 1246 VDSHSFGAYAQYLANSGYYMNAVVKNNQFSQDVNITSINGSA-SGVSNFSGMGIALKAGK 1304 DS G YA Y+A+SG+Y++A ++ ++ D + +G A G G+G +L+AG+ Sbjct: 713 TDSVHVGGYATYIADSGFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGR 772 Query: 1305 HFNFNEA-YVSPYVAMSAFSSGKSNISLSNGMEAQSSSTRSAMGTLGVNAGYRFVMNNGA 1363 F + ++ P ++ F +G +NG+ + S +G LG+ G R + G Sbjct: 773 RFTHADGWFLEPQAELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAGGR 832 Query: 1364 ELKPYAIFAVDHEFAKNNQVTVNQEVFDNNLSGTRVNTGAGMNVNITPNLSVGSEVKLSS 1423 +++PY +V EF V N L GTR G GM + S+ + + S Sbjct: 833 QVQPYIKASVLQEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYASYEYSK 892 Query: 1424 GKDIKTPVTINLNVGYSF 1441 G + P T + YS+ Sbjct: 893 GPKLAMPWTFHAGYRYSW 910
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 43.7 bits (103), Expect = 1e-06 Identities = 42/185 (22%), Positives = 66/185 (35%), Gaps = 47/185 (25%) Query: 180 EPAPSPDNHLDLHDIIGQSQA----KRALEIAAAGGHNLLLLGPPGTGKTMLATRLTGLL 235 P+ D+ D ++G+S A R L L++ G GTGK ++A R Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVA-RALHDY 183 Query: 236 PPLTDQE--ALEAAAIT-GLLHSNALPTQWRCRAFRAPHHSASMAALIG-------GGSI 285 + A+ AAI L+ S L G G Sbjct: 184 GKRRNGPFVAINMAAIPRDLIES----------------------ELFGHEKGAFTGAQT 221 Query: 286 PRPGEISLAHNGVLFLDEL----PEFERRVLDSLREPLESGEIIISRAAAKICFPAKVQL 341 G A G LFLDE+ + + R+L L++ GE + + + V++ Sbjct: 222 RSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQ----GE--YTTVGGRTPIRSDVRI 275 Query: 342 IAAMN 346 +AA N Sbjct: 276 VAATN 280
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 28.6 bits (64), Expect = 0.011 Identities = 11/45 (24%), Positives = 16/45 (35%), Gaps = 3/45 (6%) Query: 2 RLPGA---VMKAKSKKIICALLLLGSILLGYFFWLSLRPVEIVAI 43 LP + S++ + L+ F L VEIVA Sbjct: 41 FLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVAT 85
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 30.2 bits (68), Expect = 0.003 Identities = 10/36 (27%), Positives = 13/36 (36%) Query: 1 MKAKSKKTLYALLLIGSVLLGYFFWLSLRPVEIVAV 36 S++ I L+ F L VEIVA Sbjct: 50 ETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVAT 85
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 94.5 bits (234), Expect = 1e-22 Identities = 96/354 (27%), Positives = 143/354 (40%), Gaps = 55/354 (15%) Query: 6 QQQRVNADLETAKITEPQRVENARLTAEAAEKAARDRRISEEIAATEAKRQRMENERLAE 65 N L T I+ Q N A+A+ +AA + E+ AA EAKR+ E R Sbjct: 184 LTAAYNVKLFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAA-EAKRKAEEQARQQA 242 Query: 66 QERQRVEGTKQQVSEASCAQQASAWQNRFTLPALQPSGSAQYSFAASGMSAVGE-AAELH 124 R N + +PA +GS + A G+ V + AA L Sbjct: 243 AIRA---------------------ANTYAMPA---NGSVVATAAGRGLIQVAQGAASLA 278 Query: 125 NSFLAAQEQLSAIATISASGSVAAMIALGIYQTKVGESSERPPGWNVSPKFVGSISLSAM 184 + A L + SA +A A Y ++ + + S ++ + + + Sbjct: 279 QAISDAIAVLGRVLA-SAPSVMAVGFASLTYSSRT--AEQWQDQTPDSVRYALGMDAAKL 335 Query: 185 GLPATESL----ASQGEMALPVRMRIIDAKDWIGCTEIYAVKTGVAGVLPK-VKVGAAQY 239 GLP + +L + G + LP MR+ + G T +V + +PK V V A Y Sbjct: 336 GLPPSVNLNAVAKASGTVDLP--MRLTNEAR--GNTTTLSVVSTDGVSVPKAVPVRMAAY 391 Query: 240 DESTGVYTFTTDST----PPRTLIFTPAQPPGAETRPILAPPGSTPATLQHTGEM---II 292 + +TG+Y T ST PP L +TPA PPG + P +TP + + Sbjct: 392 NATTGLYEVTVPSTTAEAPPLILTWTPASPPGNQN-----PSSTTPVVPKPVPVYEGATL 446 Query: 293 KPVITPTILPLPQLYARDFHDYIIWFPADSGLEPVYVYLNSPY---GKTTAKGK 343 PV T P + D II FPADSG++P+YV P G T KG+ Sbjct: 447 TPV-KATPETYPGVITLP-EDLIIGFPADSGIKPIYVMFRDPRDVPGAATGKGQ 498
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 762 bits (1970), Expect = 0.0 Identities = 252/900 (28%), Positives = 399/900 (44%), Gaps = 79/900 (8%) Query: 15 RRKALTLCITLILHIDTAFGQEEP---QNFEFDESLFLGTKYASG-LTQLNKKNSITAGN 70 RK + L + AF + P F+ A L++ + G Sbjct: 18 IRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGT 77 Query: 71 YDAVDVLVNNKLFKRMSVQFIKDANSSEVYPCLSDELLTAAGVELGRENSTPPKEPHVTE 130 Y VD+ +NN V F + + PCL+ L + G+ + Sbjct: 78 Y-RVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSG---------- 126 Query: 131 ANTPITETHAPTNQCLPLSTRVKGASFRFDQAKLRLELSIPQALLQKRPRGYIERAEWQE 190 + C+PL++ + A+ + D + RL L+IPQA + R RGYI W Sbjct: 127 ------MNLLADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDP 180 Query: 191 GEKLAFINYSANAYRSDTRGQQKRTSDFGFIGLKSGINLGLWQVRQQSNVRYASN--DSG 248 G +NY+ + R S + ++ L+SG+N+G W++R + Y S+ SG Sbjct: 181 GINAGLLNYNFSGNSVQNRIGG--NSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSG 238 Query: 249 SDTQWNSIRTYVQRPIPQLDSQLTLGETFTDSTLFGSMSFLGAKMATDQRMWPVSMRGFS 308 S +W I T+++R I L S+LTLG+ +T +F ++F GA++A+D M P S RGF+ Sbjct: 239 SKNKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFA 298 Query: 309 PEVRGVASTNARVIIRQNGREIYETNVAPGPFVINDLFSTSSQGDLNVEVIEANGSRSTF 368 P + G+A A+V I+QNG +IY + V PGPF IND+++ + GDL V + EA+GS F Sbjct: 299 PVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIF 358 Query: 369 TVPFSAVPDSMRPGVSRYNAVIGESRDFTN--IDNYFTDFTYERGLTNQLTANSGVRLAK 426 TVP+S+VP R G +RY+ GE R F T GL T G +LA Sbjct: 359 TVPYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLAD 418 Query: 427 DYTALLAGGVLGT-PVGALGLNATYSHAKVENDKTQDGWRMQATYSQTFNQTGTTFSLAG 485 Y A G +GAL ++ T +++ + +D DG ++ Y+++ N++GT L G Sbjct: 419 RYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVG 478 Query: 486 YRYSTKGYRDLNDVFGVRSMQKNGGTWD-------------SSTYKQRSQFTTTINQDLG 532 YRYST GY + D R N T D + Y +R + T+ Q LG Sbjct: 479 YRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLG 538 Query: 533 NWGQLYASASTSDYYNDTARDTQLQLGYSNSYQQISYNLAVSRQRSVYTSTLYNWDSPDT 592 LY S S Y+ + D Q Q G + +++ I++ L+ S ++ + Sbjct: 539 RTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQ----------- 587 Query: 593 DETATTTRYGNTENIATFTVSIPL--------NIGSNNQYLSMSASRNPKSGNNYQTSLS 644 + + V+IP + S S S + + Sbjct: 588 ---------KGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVY 638 Query: 645 GTAGERNSFNYALNAGYDDSNFGSSSNNWGANVQKQFPNATVNGSYSRGNNYTQYGAGAR 704 GT E N+ +Y++ GY G+S + A + + N YS ++ Q G Sbjct: 639 GTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVS 698 Query: 705 GAAVIHRQGVTLGPYLGETFGLIEANGAQGARI--------DSNGFALVPALTPYNYNTI 756 G + H GVTLG L +T L++A GA+ A++ D G+A++P T Y N + Sbjct: 699 GGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRV 758 Query: 757 GLDTKGINRNTELKENQGRVVPYAGAAVKVKFETLTGYAVLI--QAEGEGLPLGADVYNS 814 LDT + N +L VVP GA V+ +F+ G +L+ + LP GA V + Sbjct: 759 ALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSE 818 Query: 815 KDELVGMVGQGNQIYARIADNKGTLDVRWGESSGDQCQLPYAFNRQDTEQDIIHITASCR 874 + G+V Q+Y G + V+WGE C Y + +Q + ++A CR Sbjct: 819 SSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 32.3 bits (73), Expect = 0.003 Identities = 11/42 (26%), Positives = 13/42 (30%) Query: 161 VVVEDEPAAPTPVPTPVSTPAPTAPPVAKPINLSKVSLTKEK 202 VVE EP P P P KP K ++ Sbjct: 67 PVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQE 108
>PF07824#Type III secretion chaperone Length = 120 Score = 32.6 bits (74), Expect = 3e-04 Identities = 15/85 (17%), Positives = 34/85 (40%), Gaps = 13/85 (15%) Query: 85 EGDDESLKIKLPLI--PADVDKIVFVVTIHDAQARRQSFGQVANAFIRLVNDDNGVEIAR 142 E + +S+ + P P +++ +++ ++++ + +D+ G IAR Sbjct: 35 EKEGDSINLLCPFCALPENINDLIYALSLN-----------YSEKICLATDDEGGSLIAR 83 Query: 143 YDLSEDASTETAMLFGELYRHNAEW 167 DL+ E + E Y W Sbjct: 84 LDLTGINEFEDIYVNTEYYISRVRW 108
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 358 bits (921), Expect = e-113 Identities = 181/876 (20%), Positives = 319/876 (36%), Gaps = 97/876 (11%) Query: 1 MVARCINLQCIAFLFSFFPTLAFPVTEKG-EVVFDIETLERLGYSAELAKFFSGQDRFLP 59 + R L A E+ F+ L + F P Sbjct: 16 LHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPP 75 Query: 60 GQHDVTIIINASKTYRIAATFDSE-----GKLCMDKALLMALKLR-------NTESDGSC 107 G + V I +N TF++ C+ +A L ++ L N +D +C Sbjct: 76 GTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDAC 135 Query: 108 ENMEARWPGMVVKLFPGQFRVEITLPQEAFDPEMEG----SEYQQGGHALLLNYNIFGQR 163 + + +L GQ R+ +T+PQ G + G +A LLNYN G Sbjct: 136 VPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNS 195 Query: 164 VESNNS-RFNLVQGQFEPGINFKNWVLRNRGSYSYNQGVSQ------YYNQETSALRAVE 216 V++ + + G+N W LR+ ++SYN S + + T R + Sbjct: 196 VQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDII 255 Query: 217 SLKSVVQLGEFGLVGNTFSGLPVTGIQLYSDNAQRDDTQ--LIVPIEGIANTNATIEIRQ 274 L+S + LG+ G+ F G+ G QL SD+ D+Q I GIA A + I+Q Sbjct: 256 PLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQ 315 Query: 275 RGRVIYRTIVAPGPFSLSNISNFSSGVNTDVSIIEEDGTQQNFTV-TSALDINAEQQASI 333 G IY + V PGPF++++I + + V+I E DG+ Q FTV S++ + + + Sbjct: 316 NGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTR 375 Query: 334 YQLAVGRYRDMFTGEDRPSPLLLSGEMS--FNPAATFYMTSAGLLSSGYQNIRVQNLYSG 391 Y + G YR + P + T Y L+ Y+ N G Sbjct: 376 YSITAGEYRS--GNAQQEKPRFFQSTLLHGLPAGWTIY--GGTQLADRYRAF---NFGIG 428 Query: 392 WDQAWF---SAAASYANTKDAGQGYQFSVQNQMTINGNFGVSWSSV------YGSANYWL 442 + S + AN+ + N + S +++ Y ++ Y+ Sbjct: 429 KNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFN 488 Query: 443 PDDALSSSNNLNDL------------------MFGKLKNATSVAVSWVHPRWGAFSYALS 484 D S N ++ + + + V+ R + S Sbjct: 489 FADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGS 548 Query: 485 NNMYYQASGR-TYHIFSISEQFGRATTILS-----SQLSSQGQNSLYVGINMPLG----- 533 + Y+ S ++ F LS + L + +N+P Sbjct: 549 HQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRS 608 Query: 534 -------NGTLSGRVQR-NNGNVALGSTYQGRWGDNKDYSVGISGD-------NRQRRIN 578 + + S + NG + + G ++ + S + N Sbjct: 609 DSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGY 668 Query: 579 GSMNIRTAYSQLTGGVSQATNNSRSAYLSSRGSVAYVNNTFATSSSSVGDTFAVVNIPNQ 638 ++N R Y G S + ++ + Y G V + T + DT +V P Sbjct: 669 ATLNYRGGYGNANIGYSHS-DDIKQLYYGVSGGVL-AHANGVTLGQPLNDTVVLVKAPGA 726 Query: 639 PGLRVSSPSSGIAITDYAGIALLPLVRPYTASKVQISTQTLPLNIRLNNTSADLLMTRGS 698 +V + + TD+ G A+LP Y ++V + T TL N+ L+N A+++ TRG+ Sbjct: 727 KDAKVENQT--GVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGA 784 Query: 699 VATHHFETTETRQLLLTIRGSDGEMLPIGANVLDEKGNFLGTIIGDGNFMLENKAIGVTL 758 + F+ +LL+T+ + + LP GA V E G + +G L + + Sbjct: 785 IVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKV 843 Query: 759 RVKANNRDE--CRVNYREPEKFDPDVLYEVADAVCQ 792 +VK + C NY+ P + +L ++ A C+ Sbjct: 844 QVKWGEEENAHCVANYQLPPESQQQLLTQL-SAECR 878
>AUTOINDCRSYN#Autoinducer synthesis protein signature. Length = 216 Score = 27.9 bits (62), Expect = 0.007 Identities = 15/46 (32%), Positives = 23/46 (50%), Gaps = 1/46 (2%) Query: 60 EGKLEQEYEVQLLFKSNTDH-QQALLTYIKQHHPYQTPELLVLPVR 104 +G E+E V L+F D Q+AL I + + + EL P+R Sbjct: 163 QGLSEKEERVYLVFLPVDDENQEALARRINRSGTFMSNELKQWPLR 208
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 48.1 bits (114), Expect = 3e-09 Identities = 34/173 (19%), Positives = 60/173 (34%), Gaps = 11/173 (6%) Query: 3 REQVLSNALNLLEQQGLANTTLEMLAKALSVEVSDLTRFWPDREALLYDCLRYHSQQIDT 62 R+ +L AL L QQG+++T+L +AKA V + + D+ L + I Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72 Query: 63 WRRQLQLDETLSPQQKLLARY-QTLSEQVQNQRYPGCLFIAACSFYPDTEH----PIHQL 117 + Q P L L V +R + F+ + Q Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRL---LMEIIFHKCEFVGEMAVVQQA 129 Query: 118 AEQQKQASLHYTKALLQEMDAD---DADMVAQQMELILEGCLSKLLIKRQLAD 167 S + L+ AD++ ++ +I+ G +S L+ A Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAP 182
>PF07520#Virulence protein SrfB Length = 1041 Score = 29.6 bits (66), Expect = 0.027 Identities = 16/83 (19%), Positives = 31/83 (37%), Gaps = 5/83 (6%) Query: 282 ILLPVIEEYNRP---QATRRFARIAQAMGVDTQDMSDE-QASHQAIAAIRQLSLQVGIPA 337 ++ VI P + + A + D Q + RQ S++V +P Sbjct: 639 LVHRVISAIVLPRLQDSIAQAGGQFVAERMRELFGGDIGGQEQQTVQRRRQFSIRVLVPL 698 Query: 338 GFSAL-GIEESDIEGWLDKALAD 359 + L E+++ +D +AD Sbjct: 699 AEAILSACEDAEEADRIDIPVAD 721
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 449 bits (1156), Expect = e-161 Identities = 147/357 (41%), Positives = 217/357 (60%), Gaps = 4/357 (1%) Query: 2 KAATAVIDRHALRHNLQQIRRLAPQSRLVAVVKANAYGHGLLAAAHTLQDADCYGVARIS 61 + A +D AL+ NL +R+ A +R+ +VVKANAYGHG+ + D + + + Sbjct: 3 RPIQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLNLE 62 Query: 62 EALMLRAGGIVKPILLLEGFFDAEDLPVLVANHIETAVHSLEQLVALEAATLSAPINVWM 121 EA+ LR G PIL+LEGFF A+DL + + + T VHS QL AL+ A L AP+++++ Sbjct: 63 EAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDIYL 122 Query: 122 KLDTGMHRLGVRPDQAEAFYQRLSACRNVIQPVNIMSHFSRADEPEVAATQQQLACFDAF 181 K+++GM+RLG +PD+ +Q+L A NV + + +MSHF+ A+ P+ +A + Sbjct: 123 KVNSGMNRLGFQPDRVLTVWQQLRAMANVGE-MTLMSHFAEAEHPD--GISGAMARIEQA 179 Query: 182 AAGKPGKQSIAASGGILRWPQAHRDWVRPGIVLYGVSPF-DAPYGRDFGLLPAMTLKSSL 240 A G ++S++ S L P+AH DWVRPGI+LYG SP + GL P MTL S + Sbjct: 180 AEGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMTLSSEI 239 Query: 241 IAVREHKAGESVGYGGTWVSERDTRLGVIAIGYGDGYPRSAPSGTPVWLNGREVSIVGRV 300 I V+ KAGE VGYGG + + + R+G++A GY DGYPR AP+GTPV ++G VG V Sbjct: 240 IGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTMTVGTV 299 Query: 301 SMDMISIDLGPESTDKVGDEALMWGAELPVERVAACTGISAYELITNLTSRVAMEYL 357 SMDM+++DL P +G +WG E+ ++ VAA G YEL+ L RV + + Sbjct: 300 SMDMLAVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRVPVVTV 356
>PERTACTIN#Pertactin signature. Length = 922 Score = 30.5 bits (68), Expect = 0.026 Identities = 35/109 (32%), Positives = 47/109 (43%), Gaps = 22/109 (20%) Query: 102 SPQWHSRVVLPKGSRVTLSDSSLNNRLANFSTGRTLKIQPLVIENAECAST-PPAYLPLS 160 +PQ + + +G+RVT+S SL+ N VIE A PP PLS Sbjct: 309 APQLGAAIRAGRGARVTVSGGSLSAPHGN------------VIETGGGARRFPPPASPLS 356 Query: 161 VASQLQAGQAHLRLRLTTQGVASLSELDFAPMNLTLAGGIIQSNQLITT 209 + LQAG QG A L + P+ LTLAGG ++ T Sbjct: 357 I--TLQAGA-------RAQGRALLYRVLPEPVKLTLAGGAQGQGDIVAT 396
>PF07520#Virulence protein SrfB Length = 1041 Score = 29.9 bits (67), Expect = 0.016 Identities = 19/106 (17%), Positives = 30/106 (28%), Gaps = 11/106 (10%) Query: 2 HNHNNRLVITPGEPAGVGPDLAITLAQQDWPVELVVCADPALLLARASQLNLPLQLREYQ 61 + E G + + L DW E+ + A R+ + E+ Sbjct: 184 DPGAMSWFLQRLEADEDGNAVDLQLWVSDWLKEMFLDFKRAERPGRSISEENLPHMFEHW 243 Query: 62 ADQPAIAQQAGSLTILPVKTAVNVVPGK-----------LDVGNSH 96 A + Q P N V + LD+GNS Sbjct: 244 ARYLSYLQVIQRAVAPPKMRFANTVAPRDAVAPVEVDLVLDIGNSR 289
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 143 bits (363), Expect = 4e-40 Identities = 81/387 (20%), Positives = 149/387 (38%), Gaps = 84/387 (21%) Query: 5 IGIDLGTTNSCVAIMDGTKARVLENSEGDRTTPSIIAYTQDGET------LVGQPAKRQA 58 + IDLGT N+ + + + VL PS++A QD VG AK+ Sbjct: 13 LSIDLGTANTLIYVKG--QGIVLNE-------PSVVAIRQDRAGSPKSVAAVGHDAKQML 63 Query: 59 VTNPQNTLFAIKRLIGRRFQDEEAQRDKDIMPYKIIAADNGDAWLEVKGQKMAPPQISAE 118 P N + AI+ + +D I + + + Sbjct: 64 GRTPGN-IAAIRPM-----------KDGVIADFFVTEK------------------MLQH 93 Query: 119 VLKKMKKTAEDYLGEPVTEAVITVPAYFNDAQRQATKDAGRIAGLEVKRIINEPTAAALA 178 +K++ + P ++ VP +R+A +++ + AG +I EP AAA+ Sbjct: 94 FIKQVHS---NSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIG 150 Query: 179 YGL--DKEVGNRTIAVYDLGGGTFDISIIEIDEVDGEKTFEVLATNGDTHLGGEDFDSRL 236 GL + G+ V D+GGGT ++++I ++ V + +GG+ FD + Sbjct: 151 AGLPVSEATGS---MVVDIGGGTTEVAVISLNGV---------VYSSSVRIGGDRFDEAI 198 Query: 237 INYLVEEFKKDQGMDLRTDPLAMQRLKEAAEKAKIELSSA----QQTDVNLPYITADGSG 292 INY+ + G + AE+ K E+ SA + ++ + Sbjct: 199 INYVRRNYGSLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGV 245 Query: 293 PKHMNIKVTRAKLESLVEDLVNRSIEPLKVALQD-AGLSVSDIQD--VILVGGQTRMPMV 349 P+ + + LE+L E + + + VAL+ SDI + ++L GG + + Sbjct: 246 PRGFTLN-SNEILEALQEP-LTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNL 303 Query: 350 QKKVADFFGKEPRKDVNPDEAVAIGAA 376 + + + G +P VA G Sbjct: 304 DRLLMEETGIPVVVAEDPLTCVARGGG 330
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 38.3 bits (89), Expect = 5e-05 Identities = 39/280 (13%), Positives = 99/280 (35%), Gaps = 37/280 (13%) Query: 95 LGGIIMAHFGDLVGRKKMFTLSILLMALPTLAIGMLPTYATIGITAPLLLLLMRVLQGAA 154 +G + D +G K++ I++ ++ + ++ ++ L++ R +QGA Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSL-------LIMARFIQGAG 116 Query: 155 IGGEVPGAWVFVAEHVPRKRIGIACGTLTAGLTAGILLGSLVATVMNTTLGHQAIL---- 210 V VA ++P++ G A G + + + G +G + ++ + +L Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM 176 Query: 211 ---------------EGGWRIPFFLGGIFGLFA----------MYLRRWLQETPIFKEMQ 245 E + F + GI + Y +L + + + Sbjct: 177 ITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIF 236 Query: 246 ARKTLAEELPLKSVVVNHKKEVVVSMLLTWLLSAGIVVVILMTPTYLQKQFNVPP-ELAL 304 + P + ++ +L ++ + + M P ++ + E+ Sbjct: 237 VKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGS 296 Query: 305 QANSLAIIALVIGCVVAGLAIDRFGASKTFIVGSLMLAMS 344 ++++I + G+ +DR G +G L++S Sbjct: 297 VIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVS 336
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 579 bits (1494), Expect = 0.0 Identities = 193/587 (32%), Positives = 309/587 (52%), Gaps = 22/587 (3%) Query: 73 PTLLRARSVSPGTACGKLLSLIRADLNA--LGDLPVAQGIEREQQMLADGVAQLGKAWES 130 + + S G A K + +++ V+ IE+ L +L Sbjct: 2 HHKITGIAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAI--- 58 Query: 131 LLVANSSTAANSSTTENNSTTENNSTTRAIREVHRSLLRDGTFRQRLLSHIVAGESCATA 190 ++ + + I H +L D + I + A Sbjct: 59 ---------------KDQTEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEY 103 Query: 191 IVATAA-YFSQQLALAANTYLRERELDIRDVSFQLLQQIYGEQRFPSQQALSEDSLCIAD 249 + + F N Y++ER DIRDVS ++L + G + S ++E+++ IA+ Sbjct: 104 ALKEVSDMFVSMFESMDNEYMKERAADIRDVSKRVLGHLIGVET-GSLATIAEETVIIAE 162 Query: 250 ELTPSQFLALDKRYLKGLLLGRGGSTSHTVILARSFNIPTLVGVDAAALQPYINQSLQID 309 +LTPS L+K+++KG GG TSH+ I++RS IP +VG + + +D Sbjct: 163 DLTPSDTAQLNKQFVKGFATDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVD 222 Query: 310 GELGLVVCLLDEPVRRYYRQEQWLHDQLREQQSRYQNMPGRTLDGVRMVVAANITHAVEV 369 G G+V+ E + Y +++ ++ +++ ++ P T DG + +AANI +V Sbjct: 223 GIEGIVIVNPTEEEVKAYEEKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDV 282 Query: 370 EGAFNQGAESIGLFRTEILYMDRAAAPSEEELYTLYAQALGAAKGKPIIIRTIDIGGDKP 429 +G G E IGL+RTE LYMDR P+EEE + Y + + GKP++IRT+DIGGDK Sbjct: 283 DGVLANGGEGIGLYRTEFLYMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKE 342 Query: 430 VSYLNIPAESNPFLGYRAVRIYHEFLSLFHTQLRAILRASMHGPLKIMIPMISSMEEILW 489 +SYL +P E NPFLG+RA+R+ E +F TQLRA+LRAS +G LK+M PMI+++EE+ Sbjct: 343 LSYLQLPKELNPFLGFRAIRLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQ 402 Query: 490 VKDQLAEVKQSLRISHLQFDETVPLGMMLEVPSVMFIIDQCCEEMDFLSIGSNDLTQYLL 549 K + E K L + +++ +G+M+E+PS + +E+DF SIG+NDL QY + Sbjct: 403 AKAIMQEEKDKLLSEGVDVSDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTM 462 Query: 550 AVDRDNAKVSEHYHCLPPALLRALDYAVCEVHRHGKWIGLCGELAAKDSVLPLLVAMGLD 609 A DR N +VS Y PA+LR +D + H GKW+G+CGE+A + +PLL+ +GLD Sbjct: 463 AADRMNERVSYLYQPYHPAILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLD 522 Query: 610 EISMSASFIGATKARLAKLDRGECRLLLNRVMACRTSREVEYLLVQY 656 E SMSA+ I +++L KL + E + + + T+ EVE L+ + Sbjct: 523 EFSMSATSILPARSQLLKLSKEELKPFAQKALMLDTAEEVEQLVKKT 569
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 30.2 bits (68), Expect = 0.018 Identities = 5/71 (7%), Positives = 26/71 (36%), Gaps = 4/71 (5%) Query: 14 LQEQANALAHIQALNFES-IDLPTAQRQLEELQARLDRLTHPQSDIAIAKAALDEAEARQ 72 + + + + + + + + + E L +S + ++ + A+ Sbjct: 233 EKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVY---KSQLEQIESEILSAKEEY 289 Query: 73 KELERQYQQEV 83 + + + ++ E+ Sbjct: 290 QLVTQLFKNEI 300
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 26.9 bits (59), Expect = 0.005 Identities = 18/54 (33%), Positives = 24/54 (44%), Gaps = 2/54 (3%) Query: 11 ELPSYITGANSIRLNHSVPRSVDSTDKTSRSLMALTGITDSGDVPTSRLLAYCS 64 ELP+ G ++ R V + T RSL+ GI D + TSR YC Sbjct: 136 ELPAVGGG--RPKMLRWPLRFVIGSSDTQRSLLGRIGIGDVLLIRTSRAEVYCY 187
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 400 bits (1030), Expect = e-143 Identities = 143/262 (54%), Positives = 182/262 (69%), Gaps = 1/262 (0%) Query: 39 IDTKRVVALEWLPVELLLALGVTPFGVADIHNYRLWVGEPALPADVINVGQRTEPNLELL 98 ID R+VALEWLPVELLLALG+ P+GVAD NYRLWV EP LP VI+VG RTEPNLELL Sbjct: 33 IDPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVIDVGLRTEPNLELL 92 Query: 99 QQMAPSLILLSQGYGPSPEKLAPIAPTMSFAFNEQGSSPLAVGKNSLQTLGQRLGLETAA 158 +M PS ++ S GYGPSPE LA IAP F F++ G PLA+ + SL + L L++AA Sbjct: 93 TEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSD-GKQPLAMARKSLTEMADLLNLQSAA 151 Query: 159 QQHLADFDHFMLAARARLSGDTQTPLLMFSLLDPRHALIIGNGSLFQDVLSTLNIENAWQ 218 + HLA ++ F+ + + R PLL+ +L+DPRH L+ G SLFQ++L I NAWQ Sbjct: 152 ETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGIPNAWQ 211 Query: 219 GETNFWGSAVVGIERLATIKTARAVCFGHGNNEMLQQVARTPLWQSLSFVRENQLRLLPP 278 GETNFWGS V I+RLA K +CF H N++ + + TPLWQ++ FVR + + +P Sbjct: 212 GETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATPLWQAMPFVRAGRFQRVPA 271 Query: 279 VWFYGATLSAMRFVRLLEQAWG 300 VWFYGATLSAM FVR+L+ A G Sbjct: 272 VWFYGATLSAMHFVRVLDNAIG 293
>PF01540#Adhesin lipoprotein Length = 475 Score = 27.0 bits (59), Expect = 0.043 Identities = 18/88 (20%), Positives = 31/88 (35%), Gaps = 11/88 (12%) Query: 66 AQLSHFKLILEAWRNQLRDEVDRTVSHMQDEAANFPDPVDRAAQEEEFSL-----ELRNR 120 ++L FK +W ++ E + E A D+ EE + EL Sbjct: 196 SELESFKEFNTSWLEKIVSEWEEVKKAWSKELAEIKAEDDKKLAEENQKIKEGAKELLKL 255 Query: 121 DRE-RKLIKKIEKTLKKVE-----DDDF 142 + + I T+ K+E D+ F Sbjct: 256 SEKIQSFADTIALTITKLERKFQIDEKF 283
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 34.0 bits (78), Expect = 0.001 Identities = 18/83 (21%), Positives = 34/83 (40%), Gaps = 2/83 (2%) Query: 26 DTVEAEQSLITVEGDKASMEVPSPQAGVVKEIKIAVGDKVATGSLIMVFDATGAAAAPVK 85 + V +T G S E+ + +VKEI + G+ V G +++ A GA A +K Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138 Query: 86 AEEKPAAPAQVAAPAASAAKNVE 108 + ++++E Sbjct: 139 TQSSLLQARLEQTRYQILSRSIE 161 Score = 30.6 bits (69), Expect = 0.017 Identities = 10/49 (20%), Positives = 21/49 (42%), Gaps = 1/49 (2%) Query: 26 DTVEAEQSLITVEGDKASMEVPSPQAGVVKEIKI-AVGDKVATGSLIMV 73 + L E + + + +P + V+++K+ G V T +MV Sbjct: 310 NIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358 Score = 29.8 bits (67), Expect = 0.026 Identities = 12/44 (27%), Positives = 21/44 (47%), Gaps = 1/44 (2%) Query: 133 EQSLITVEGDKASMEVPAPFAGIVKEIKIST-GDKVKTGSLIMV 175 L E + + + AP + V+++K+ T G V T +MV Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 41.0 bits (96), Expect = 3e-07 Identities = 21/66 (31%), Positives = 38/66 (57%) Query: 10 QKGFTLIELMVAVAIIAVLSGIGIPSYQRYIQKAALTDMLQAIVPYKMAVELCALEQSNL 69 Q+GFTL+E+MV + II VL+ + +P+ +KA + IV + A+++ L+ + Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHY 66 Query: 70 DSCNAG 75 + N G Sbjct: 67 PTTNQG 72
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 274 bits (702), Expect = 5e-91 Identities = 107/405 (26%), Positives = 204/405 (50%), Gaps = 13/405 (3%) Query: 6 LFNWTALNKTGELQTGMLLATERNSVYEHIIQHGLQPLGV-----KGGRRLSARYWQGER 60 +++ AL+ G+ G A + + + GL PL V + S + Sbjct: 3 QYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRK 62 Query: 61 -------LVAMTRQLATLLQAGLPLVNSLQLLAKEADDSAWRCLLDEISQQVAQGQSLSE 113 L +TRQLATL+ A +PL +L +AK+++ L+ + +V +G SL++ Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122 Query: 114 VMEQYPHVFPRLYPPVVAVGELTGNLEQCCTQLVHHQERQQNLHKKVIKALKYPVVVCIV 173 M+ +P F RLY +VA GE +G+L+ +L + E++Q + ++ +A+ YP V+ +V Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182 Query: 174 ALVVSVIMLVMVLPEFAQIYQSFDTPLPGLTASLLWLSTFLTFYGPYLALIIAIVCIGYF 233 A+ V I+L +V+P+ + + LP T L+ +S + +GP++ L + + + Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242 Query: 234 YTLRKKSRWQQWEQTILLSIPLVSTLIRGSCLSQIFQTLAITQQAGLPLSAGLDAAARSI 293 LR++ R + + LL +PL+ + RG ++ +TL+I + +PL + + + Sbjct: 243 VMLRQEKRRVSFHRR-LLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVM 301 Query: 294 HNYNYQQALRCIQKQISQGIPLYTTLNQHPLFPAICQQLIRVGEESGSQDVLLEKLACWH 353 N + L + +G+ L+ L Q LFP + + +I GE SG D +LE+ A Sbjct: 302 SNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQ 361 Query: 354 QQQTQNLADNVTQMLEPLLMLIIGSIVGVLVIAMYLPIFQLGDVI 398 ++ + + EPLL++ + ++V +V+A+ PI QL ++ Sbjct: 362 DREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 51.8 bits (124), Expect = 2e-09 Identities = 61/417 (14%), Positives = 113/417 (27%), Gaps = 96/417 (23%) Query: 7 SGRKRQLALIVAGVIIIAAAISGWLSVRQTTLNPLSEDAELGASVVH------IASSVPG 60 S R R +A + G ++IA +S + A + H I Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVL--------GQVEIVATANGKLTHSGRSKEIKPIENS 105 Query: 61 RIISINVEENSKVRRGDLLFSIEPDLYRLQVEQAQAELKMAEAT---------------- 104 + I V+E VR+GD+L + + Q+ L A Sbjct: 106 IVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKL 165 Query: 105 -----------HDTQQRTVVAERSN--AAITNEQIVRAQANLKLATQT------------ 139 + + V+ S + Q + Q L L + Sbjct: 166 PELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINR 225 Query: 140 -----------LARLQPLRPKGYVTAQQVDDAATAKHDAEVSLKQALKQSVAAEALVSST 188 L L K + V + +A L+ Q E+ + S Sbjct: 226 YENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSA 285 Query: 189 -------------------ASSEALVVARRAALAIAERELANTQIHAPNDGRVVGLTV-S 228 + + LA E + I AP +V L V + Sbjct: 286 KEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHT 345 Query: 229 AGEFVAPDQAIFTLINTEH-WHASAFFRETELKHIKVGDCATVYVMADRQRAIQGRVEGI 287 G V + + ++ + +A + ++ I VG A + V A G + G Sbjct: 346 EGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRY-GYLVGK 404 Query: 288 GWGVSSEDMLNIPRGLPYVPKSLNWVRVVQRFPVRISLEKPPEDLMRIGATAVVIVR 344 ++ + + + GL + V+ + G ++ Sbjct: 405 VKNINLDAIEDQRLGLVF--------NVIISIEENCLSTGNKNIPLSSGMAVTAEIK 453
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.6 bits (71), Expect = 0.007 Identities = 11/35 (31%), Positives = 17/35 (48%) Query: 34 MVIVGPSGCAKSTMLRMIAGLEEISSGELTIADRK 68 +V+ G G KST++ + GL+ S I K Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGK 633
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.8 bits (69), Expect = 0.006 Identities = 13/30 (43%), Positives = 15/30 (50%) Query: 44 VGRSGCGKSTLLRLLAGLEAASDGTLLSGN 73 G G GKSTL+ L GL+ SD G Sbjct: 602 EGTGGIGKSTLINTLVGLDFFSDTHFDIGT 631
>PF05043#Transcriptional activator Length = 493 Score = 27.6 bits (61), Expect = 0.006 Identities = 6/24 (25%), Positives = 13/24 (54%) Query: 20 GVSARELCRKHAISDATFYTLRKK 43 G A +C++ IS ++ Y + + Sbjct: 100 GCQAESICKEFYISSSSLYRIISQ 123
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 80.2 bits (198), Expect = 3e-19 Identities = 58/352 (16%), Positives = 122/352 (34%), Gaps = 59/352 (16%) Query: 5 RVFIAGHRGMVGSAIVRQLENRND--------------------IELIIRDR---TELDL 41 + + G G +G + ++L +EL+ + ++DL Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 42 MSQSAVQKFFATEKIDEIYLAAAKVGGIQANNNYPAEFIYQNLMIECNIIHAAHLAGIQK 101 + + FA+ + ++++ ++ + N P + NL NI+ IQ Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLEN-PHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 102 LLFLGASCIYPKLAAQPMTEEALLTGVLEPTNEP---YAIAKIAGIKLCESYNRQYGRDY 158 LL+ +S +Y P + + + + P YA K A + +Y+ YG Sbjct: 121 LLYASSSSVYGLNRKMPFSTD-------DSVDHPVSLYAATKKANELMAHTYSHLYGLPA 173 Query: 159 RSVMPTNLYGENDNFHPENSHVIPALLRRFHEAKIRNDKEMVVWGTGKPMREFLHVDDMA 218 + +YG P + + K + V+ GK R+F ++DD+A Sbjct: 174 TGLRFFTVYGPWGR---------PDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIA 224 Query: 219 AASVHVMELSDQIYQTNTQPMLSH------------INVGTGVDCTIRELAETMAKVVGF 266 A ++ L D I +TQ + N+G + + + + +G Sbjct: 225 EA---IIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGI 281 Query: 267 TGNLVFDSTKPDGTPRKLMDVSRLAK-LGWCYQISLEVGLTMTYQWFLAHQN 317 +P D L + +G+ + +++ G+ W+ Sbjct: 282 EAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYK 333
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 29.3 bits (65), Expect = 0.021 Identities = 33/153 (21%), Positives = 61/153 (39%), Gaps = 20/153 (13%) Query: 130 SQRDNINSRLLHIVDEATNPWGIKITRIEIRDVRPP--TELISAMNAQMKAERTKRADIL 187 + RD + RL IV+EA + R P TEL A NA M+AE + Sbjct: 85 ANRDALTQRLKDIVNEA----------LRHNASRTPSATELAHANNAAMQAEDERLRLAK 134 Query: 188 EAEGVRQAAILRAEGEKQSQILKAEGERQSA-------FLQAEARERAAEAEAQATKMVS 240 E R+ A + ++++ + E ER+ A +AE + AA +E ++ Sbjct: 135 AEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIA 194 Query: 241 EAIAAGDIQAINYFVAQKYTDALQHIGSANNSK 273 + + + + T + S+ +++ Sbjct: 195 QKKLSAAQSEVVKMDGEIKT-LNSRLSSSIHAR 226
>PF06057#Type IV secretory pathway VirJ component Length = 243 Score = 29.4 bits (66), Expect = 0.013 Identities = 15/68 (22%), Positives = 25/68 (36%), Gaps = 12/68 (17%) Query: 20 QSMSVPV-----LFYFWSERSQHCLQLTPTLDKLAAEYAGQFILARVDCDAQPMVASQFG 74 Q PV L Y+W ++ +T + +Y +F +V ++ FG Sbjct: 75 QQQGWPVVGWSSLKYYWKQKDPK--DVTQDTLAIIDKYQAEFGTQKV-----ILIGYSFG 127 Query: 75 LRSIPAVY 82 IP V Sbjct: 128 AEVIPFVL 135
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 80.9 bits (199), Expect = 3e-20 Identities = 51/191 (26%), Positives = 85/191 (44%), Gaps = 7/191 (3%) Query: 3 KAVLITGCSSGIGLVAAQDLKNRGYRVLAACRKPDDVAKMVQ-LGLEG-----IELDLDD 56 K ITG + GIG A+ L ++G + A P+ + K+V L E D+ D Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 57 SASVERAAAQVIELTGGRLYGLFNNGGFGLYGSLHTISRQQLEKQFSTNLFGTHQLTQLL 116 SA+++ A++ G + L N G G +H++S ++ E FS N G ++ + Sbjct: 69 SAAIDEITARIEREMG-PIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 117 LPAMLPHGEGRIIQTSSVMGLVSTAGRGAYAASKYALEAWSDALRMELQSSGIHVSLIEP 176 M+ G I+ S V AYA+SK A ++ L +EL I +++ P Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 177 GPISTHFTQNV 187 G T ++ Sbjct: 188 GSTETDMQWSL 198
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 26.5 bits (58), Expect = 0.010 Identities = 5/41 (12%), Positives = 18/41 (43%), Gaps = 6/41 (14%) Query: 4 LSWIIFGLIAGILAKWIMP------GEDGGGFIMTIILGII 38 + I+ G I+G++ W+ ++ ++ ++ + Sbjct: 163 AAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILLEMYL 203
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 46.7 bits (111), Expect = 4e-08 Identities = 33/196 (16%), Positives = 67/196 (34%), Gaps = 24/196 (12%) Query: 5 IRFALLSFLLLSTGISVAPLAIARGSAVEVKGTAPLELASGSAM---VVDLQTNKVIYAN 61 +R+ L + L + +A S ++ E + +DL + + + A Sbjct: 1 MRYIRLCIISLLATLPLA----VHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAW 56 Query: 62 NADKVVPIASITKLMTAMVVLD----AKLPLDEILSVDIDQTKELKGVFSRVRVNSEISR 117 AD+ P+ S K++ VL L+ + + V S + ++ Sbjct: 57 RADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPV-SEKHLADGMTV 115 Query: 118 KDMLLLTLMSSENRAAASLAHHY--PGGYNAFIKAMNAKAKSL-----GMNSTHYVEPTG 170 ++ + S+N AA L P G AF++ + L +N + Sbjct: 116 GELCAAAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDAR- 174 Query: 171 LSINNVSTARDLAKLL 186 + +T +A L Sbjct: 175 ----DTTTPASMAATL 186
>BINARYTOXINA#Clostridial binary toxin A signature. Length = 454 Score = 25.0 bits (54), Expect = 0.013 Identities = 13/45 (28%), Positives = 22/45 (48%) Query: 5 QLLLMKISLAQHFSSRPFIKGNVARMVNHATSIGIFKVDSYRPSK 49 ++ + K S + S+ P G ++NH + I KVDSY+ Sbjct: 398 RINIPKDSPGAYLSAIPGYAGEYEVLLNHGSKFKINKVDSYKDGT 442
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 60.1 bits (145), Expect = 7e-12 Identities = 29/193 (15%), Positives = 63/193 (32%), Gaps = 4/193 (2%) Query: 64 YNRQQQQQTDAKRAEQQRQKKAEQQAEELQQKQAAEQQRLKELEKERLQAQEDAK---LA 120 YN + +++ Q E R+ E ++ Sbjct: 981 YNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETV 1040 Query: 121 AEEQKKQVAEQQKQIAEQQKQAAEQQKIAAAAVAKAKEEQKQAETAAAQAKAEADKIVKA 180 AE K++ +K + + A+ +++A A + K + E A + ++ + + + Sbjct: 1041 AENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTET 1100 Query: 181 QAEAQKKAEAEAKKEAA-VAAAAKKQADADAKKAVEVAEKAAADAAEKKAAADAEKKAAA 239 + A + E +AK E K + K+ + A+ A + K+ + Sbjct: 1101 KETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQS 1160 Query: 240 AKKVAAAAEAKKK 252 A E K Sbjct: 1161 QTNTTADTEQPAK 1173 Score = 52.4 bits (125), Expect = 2e-09 Identities = 22/199 (11%), Positives = 68/199 (34%), Gaps = 5/199 (2%) Query: 67 QQQQQTDAKRAEQQRQKKAEQQAEELQQKQAAEQQRLKELEKERLQAQ-EDAKLAAEEQK 125 +Q+ ++ EQ + Q E ++ ++ + + E + ++ ++ + ++ Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103 Query: 126 KQVAEQQKQIAEQQKQAAEQQKIAAAAVAKAKEEQKQAETAAAQAKAEADKIVKAQAEAQ 185 V +++K E +K + + + + + E Q + A+ I + Q++ Sbjct: 1104 ATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTN 1163 Query: 186 KKAEAEAKKE----AAVAAAAKKQADADAKKAVEVAEKAAADAAEKKAAADAEKKAAAAK 241 A+ E + + VE E + +++ K Sbjct: 1164 TTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRH 1223 Query: 242 KVAAAAEAKKKAAAEAAAS 260 + + + A +++ Sbjct: 1224 RRSVRSVPHNVEPATTSSN 1242 Score = 44.7 bits (105), Expect = 5e-07 Identities = 32/218 (14%), Positives = 65/218 (29%), Gaps = 10/218 (4%) Query: 47 GEVIDAVMVDPGAVTEQYNRQQQQQTDAKRAEQQRQKKAEQQAEELQQKQAAEQQRLKEL 106 EV + A T+ Q + + ++ A + EE + + + Q Sbjct: 1066 REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQ----- 1120 Query: 107 EKERLQAQEDAKLAAEEQKKQVAEQQKQ--IAEQQKQAAEQQKIAAAAVAKAKEEQKQAE 164 E ++ +Q K E + AE ++ K+ Q A AKE E Sbjct: 1121 EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE 1180 Query: 165 TAAAQAKAEADKIVKAQAEAQKKAEAEAKKEAAVAAAAKKQADADAKKAVEVAEKA-AAD 223 ++ + E + + + ++ K + + V A Sbjct: 1181 QPVTESTTV--NTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPAT 1238 Query: 224 AAEKKAAADAEKKAAAAKKVAAAAEAKKKAAAEAAAST 261 + + A + A ++A+ KA A Sbjct: 1239 TSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVG 1276
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 43.7 bits (103), Expect = 1e-06 Identities = 65/297 (21%), Positives = 108/297 (36%), Gaps = 38/297 (12%) Query: 35 PFFPIWLHDI--NNLSKTDTGIVFGSISLFALAFQPIMGPLSDKLGLRKTLMWIIVGLLV 92 P P L D+ +N GI+ +L A P++G LSD+ G R L +V L Sbjct: 26 PVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVL---LVSLAG 82 Query: 93 LFAPFFIYVFSPLLKYNIFIGAIVGGCYLGFVFTGGSHAI-EAYIEKVSRHSNFEYGRVR 151 + I +P L + ++IG IV G TG + A+ AYI ++ R R Sbjct: 83 AAVDYAIMATAPFL-WVLYIGRIVAG------ITGATGAVAGAYIADITDGDE----RAR 131 Query: 152 MFG----CIGWALCATVV--GILYTVNNQLIFWMASGCALILAVLLFFARPDRQSTAFVV 205 FG C G+ + A V G++ + F+ A+ + + F P+ Sbjct: 132 HFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKG---- 187 Query: 206 DTLGANKAVFNLKNALAL---LRKRELWFFVMYIVGVACIYDVFDQQFANFFTSFFATK- 261 + + N + + V +I+ + Q A + F + Sbjct: 188 ERRPLRREALNPLASFRWARGMTVVAALMAVFFIM------QLVGQVPAALWVIFGEDRF 241 Query: 262 QQGTEIFGFVTTGGEILNATV-MFFAPVIIARIGSKNALLLAGTIMSVRILGSAFAT 317 G IL++ + AR+G + AL+L + AFAT Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298
>ENTSNTHTASED#Enterobactin synthetase component D signature. Length = 234 Score = 99.3 bits (247), Expect = 2e-27 Identities = 70/196 (35%), Positives = 96/196 (48%), Gaps = 22/196 (11%) Query: 24 PFHG-LLAKCDFEVNEYR--DELFAAYGIPFPGSLNKAVIKRRAEYLAGRFVARQVLNLL 80 PF G L DF+ + +R D L+ +P L A KR+AE+LAGR A L + Sbjct: 9 PFAGHRLHIVDFDASSFREHDLLW----LPHHDRLRSAGRKRKAEHLAGRIAAVHALREV 64 Query: 81 DIRDYPLATGMDRAPQWPTNLIGSISHNNQRALCAAQMIEPRGVESSTLHGIGLDIESHI 140 +R P G R P WP L GSISH CA + + IG+DIE + Sbjct: 65 GVRTVP-GMGDKRQPLWPDGLFGSISH------CAT-----TALAVISRQRIGIDIEKIM 112 Query: 141 AEEKAQEIWSGIISDEEYSLLQQGPLPFNQALTLVFSAKESLFKAVYPQSGRYFDFIEAR 200 ++ A E+ II +E +LQ LPF ALTL FSAKES++KA + F A+ Sbjct: 113 SQHTATELAPSIIDSDERQILQASLLPFPLALTLAFSAKESVYKA-FSDRVTLPGFNSAK 171 Query: 201 LLSYSLVSGNFELQLL 216 + S + + + L LL Sbjct: 172 VTSLT--ATHISLHLL 185
>AUTOINDCRSYN#Autoinducer synthesis protein signature. Length = 216 Score = 301 bits (773), Expect = e-107 Identities = 114/211 (54%), Positives = 156/211 (73%) Query: 5 MLKVFNVNFDRMSENKLDEIFTLRKITFKDRLDWKVTCIDGKESDQYDDENTNYLLGTID 64 ML++F+VN +SE K E+FTLRK TFKDRL+W V C DG E DQYD+ NT YL G D Sbjct: 1 MLEIFDVNHTLLSETKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNNNTTYLFGIKD 60 Query: 65 DTLVCSVRFVEMQYPTMITGPFAPYFRDLDLPIDGFIESSRFFVEKALARDKLGNNGSLS 124 +T++CS+RF+E +YP MITG F PYF+++++P ++ESSRFFV+K+ A+D LGN +S Sbjct: 61 NTVICSLRFIETKYPNMITGTFFPYFKEINIPEGNYLESSRFFVDKSRAKDILGNEYPIS 120 Query: 125 AILFLSMVNYARNRGYKGILTVVSRGMYTILKRSGWGITVINQGESEKNEVIYLLHLSID 184 ++LFLSM+NY++++GY GI T+VS M TILKRSGWGI V+ QG SEK E +YL+ L +D Sbjct: 121 SMLFLSMINYSKDKGYDGIYTIVSHPMLTILKRSGWGIRVVEQGLSEKEERVYLVFLPVD 180 Query: 185 SNSQQQLIRKIQRVHNIDTHTLASWPLVVPS 215 +Q+ L R+I R ++ L WPL VP+ Sbjct: 181 DENQEALARRINRSGTFMSNELKQWPLRVPA 211
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 29.1 bits (65), Expect = 0.017 Identities = 19/76 (25%), Positives = 29/76 (38%), Gaps = 5/76 (6%) Query: 176 GYEKNRTTGGDNNIGGDGYGFRPYYRYQVSDR-LSVNTDVKMLVEDKDARGADNGRFQFY 234 GY ++ G N +GG F YRY+ + L V + + A D + Q+Y Sbjct: 31 GYAQSDAQGQMNKMGG----FNLKYRYEEDNSPLGVIGSFTYTEKSRTASSGDYNKNQYY 86 Query: 235 EALVNVNYRVADNVHA 250 YR+ D Sbjct: 87 GITAGPAYRINDWASI 102
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.3 bits (78), Expect = 7e-04 Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 1/49 (2%) Query: 32 IVLVGPSGCGKSTLLRMIAGLEDVNSGEIKI-EDKDVTQTNAGARGVSM 79 +VL G G GKSTL+ + GL+ + I KD + AG + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYEL 647
>PF05860#haemagglutination activity domain. Length = 117 Score = 82.9 bits (205), Expect = 1e-20 Identities = 21/141 (14%), Positives = 41/141 (29%), Gaps = 24/141 (17%) Query: 68 AAIVADASAPGNQQPTIINSANGTPQVNIQAPSSGGVSRNVYSQFDVDGRGVILNNGHGV 127 A I D + P N + I + T + + + + + +F V G N Sbjct: 1 AQITPDTTLPIN---SNITTEGNTRIIERGTQAGSNLFHS-FQEFSVPTSGTAFFN---- 52 Query: 128 NQTELGGFIDGNPWLARGEASIILNEVNSRDPSKLNGYIEVAGRKAQVVIANSAGITCEG 187 I++ V S ++G I A + + N GI Sbjct: 53 ---------------NPTNIQNIISRVTGGSVSNIDGLIRANAT-ANLFLINPNGIIFGQ 96 Query: 188 CGFINANRVTLTTGQAQLNNG 208 ++ + + +L Sbjct: 97 NARLDIGGSFVGSTANRLKFA 117
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 29.2 bits (65), Expect = 0.024 Identities = 11/60 (18%), Positives = 23/60 (38%), Gaps = 4/60 (6%) Query: 40 NDYFVSMKEALEQAANDIGAKVYIADAGHDVSKQINDVED---MLQKKIDILLINPTDSV 96 D+F S++ L A D A+ + + Q + K+++I + D + Sbjct: 110 QDFFTSLQT-LVSNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQI 168
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 144 bits (363), Expect = 3e-44 Identities = 84/260 (32%), Positives = 137/260 (52%), Gaps = 8/260 (3%) Query: 3 NLFSLENRKVLITGSAQGIGFLLAKGLAEFGAEIIINDITAERAEKAVAELRASGFIAHA 62 N +E + ITG+AQGIG +A+ LA GA I D E+ EK V+ L+A A A Sbjct: 2 NAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA 61 Query: 63 AAFNVTNHDEVNEAIEKIESHIGAIDVLINNAGIQRRHAFTEFPEKDWDDVIAVNQKSVF 122 +V + ++E +IE +G ID+L+N AG+ R +++W+ +VN VF Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVF 121 Query: 123 LVSQAVARYMVKRQRGKIINICSMQSELGRDTITPYAASKGAVKMLTRGMCVELARYNIQ 182 S++V++YM+ R+ G I+ + S + + R ++ YA+SK A M T+ + +ELA YNI+ Sbjct: 122 NASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181 Query: 183 VNGIAPGYFKTDMTKALVDDQ--------AFTDWLCKRTPAARWGDPEELIGAAVYLSSK 234 N ++PG +TDM +L D+ + P + P ++ A ++L S Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241 Query: 235 ASDFVNGHLLFVDGGMLVAV 254 + + H L VDGG + V Sbjct: 242 QAGHITMHNLCVDGGATLGV 261
>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature. Length = 538 Score = 28.9 bits (64), Expect = 0.011 Identities = 17/45 (37%), Positives = 23/45 (51%) Query: 102 LLSVLIYAISSVSDQGISGEMVDAKAVGISLFGPYVLAVELASML 146 L+S +Y+ + Q +SG+ VD K V SL P L SML Sbjct: 251 LVSAALYSRPELLSQALSGKTVDLKIVSTSLLTPTSLTGGEESML 295
>PF03627#PapG Length = 336 Score = 26.1 bits (57), Expect = 0.024 Identities = 10/46 (21%), Positives = 20/46 (43%) Query: 5 SNNSRAHCSKPFLYRQNQWHFNQAISEYRLPAPLSAQDLTDSVNHI 50 + ++ C KP + F+ I + LPA L D + ++ + Sbjct: 131 AFDAGNLCQKPGETTRLTEKFDDIIFKVALPADLPLGDYSVTIPYT 176
>OMADHESIN#Yersinia outer membrane adhesin signature. Length = 455 Score = 112 bits (281), Expect = 9e-30 Identities = 101/341 (29%), Positives = 171/341 (50%), Gaps = 23/341 (6%) Query: 40 TAVGNNNSLGGSTNGVVVGNGGSLSNSINGVVIG-NGSVSDGDGVSVGGGTSTNG----G 94 +AV + +GV +G S S++ GV +G N + V++G + Sbjct: 113 SAVTYGAASTAQKDGVAIGARASTSDT--GVAVGFNSKADAKNSVAIGHSSHVAANHGYS 170 Query: 95 IAIGSGSNATRSDEMNIG----DRQITGVKAGVADTDAANVGQL-----------VAKAG 139 IAIG S R + ++IG +RQ+T + AG DTDA NV QL ++ Sbjct: 171 IAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSA 230 Query: 140 ETLNSANIYVDNQATETLNNANIYTDNKATETINNANTYTDNKSSETLNSANSYTDNKSS 199 E L +AN Y DN+++ L AN YTD+K+ ET+ NA +S + LN A +++++ + Sbjct: 231 ELLANANAYADNKSSSVLGIANNYTDSKSAETLENARKEAFAQSKDVLNMAKAHSNSVAR 290 Query: 200 ETLNSANTYTDSKTAEIFNTTKTYMDGKSKETLNNTYDYVDSKVSSIVYDVNSYTDKTVN 259 TL +A + +S T + + + KS E L + Y DSK S + NSYTD TV+ Sbjct: 291 TTLETAEEHANSVARTTLETAEEHANKKSAEALASANVYADSKSSHTLKTANSYTDVTVS 350 Query: 260 TAFETSLSDAKSYVDDKYNQLSDKVNKNFNKTNAGISGAMAMSGIPQKFGYEK-SFGMAI 318 + + ++ ++ Y D K+ QL ++++K + + G++ + A++ + Q +G K +F + Sbjct: 351 NSTKKAIRESNQYTDHKFRQLDNRLDKLDTRVDKGLASSAALNSLFQPYGVGKVNFTAGV 410 Query: 319 GAYRGQSALAVGGDWNINHKTITRVNVSADTEGGVGVAAGF 359 G YR ALA+G + +N + V+ V A F Sbjct: 411 GGYRSSQALAIGSGYRVNENVALKAGVAYAGSSDVMYNASF 451
>INTIMIN#Intimin signature. Length = 939 Score = 432 bits (1113), Expect = e-142 Identities = 128/415 (30%), Positives = 198/415 (47%), Gaps = 31/415 (7%) Query: 7 FGKDNLQRNPYAVTAGINYTPVPLLTVGVDQRMGKSSKHETQWNLQMNYRLGESFQSQLS 66 F D LQ NP A T G+NYTP+PL+T+G+D R G ++++ +++Q Y+ + + Q+ Sbjct: 361 FNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQQIE 420 Query: 67 PSAVAGTRLLAESRYNLVDRNNNIVLEYQKQQVVKLTLSPATISGLPGQVYQVNAQVQGA 126 P V R L+ SRY+LV RNNNI+LEY+KQ ++ L + P I+G ++ V+ Sbjct: 421 PQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNI-PHDINGTERSTQKIQLIVKSK 479 Query: 127 SAVREIVWSDAELIAAGGTLT---PLSTTQFNLVLPPYKRTAQVSRVTDDLTANFYSLSA 183 + IVW D+ L + GG + S + +LP Y + +N Y ++A Sbjct: 480 YGLDRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGG----------SNVYKVTA 529 Query: 184 LAVDHQGNRSNSFTLSVTVQQPQLTLTAAVIGD------GAPASGKTAITVEFTVADFEG 237 A D GN SN+ L++TV + + D A A G AIT TV G Sbjct: 530 RAYDRNGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKK-NG 588 Query: 238 KPLAGQEVVITTNNG-ALPNKITEKTDANGVARIALTNTTDGVTVVTAEVEGQRQSVDTH 296 A V +G A+ + + T+ +G A + L + G VV+A+ +++ + Sbjct: 589 VAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNAN 648 Query: 297 ---FVKGTIAADKSTLAAVPTSIIADGLMASTITLELKDTYGD-PQAGANVAFDTTLGNM 352 FV T A + + A T+ +A+G IT +K GD P + V F TTLG + Sbjct: 649 AVIFVDQT-KASITEIKADKTTAVANG--QDAITYTVKVMKGDKPVSNQEVTFTTTLGKL 705 Query: 353 GVITDHN--DGTYSAPLTSTTLGVATVTVKVDGAAFSVPSVTVNFTADPIPDAGR 405 T+ +G LTSTT G + V+ +V A V + V F D G Sbjct: 706 SNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGN 760 Score = 150 bits (381), Expect = 6e-40 Identities = 70/363 (19%), Positives = 128/363 (35%), Gaps = 27/363 (7%) Query: 309 LAAVPTSIIADGLMASTITLELKDTYGDPQAGANVAFDTTLG----NMGVITDHNDGTYS 364 A TS ADG A T T +K G QA V+F+ G + + G + Sbjct: 563 FTADKTSAKADGTEAITYTATVKKN-GVAQANVPVSFNIVSGTAVLSANSANTNGSGKAT 621 Query: 365 APLTSTTLGVATVTVKVDGAAFSVPSVTVNFTADPIPDAGRSSFTVSTPDILADGTMSST 424 L S G V+ K ++ + V F + S + A Sbjct: 622 VTLKSDKPGQVVVSAKTAEMTSALNANAVIF----VDQTKASITEIKADKTTAVANGQDA 677 Query: 425 LSFVPVDKNGHFISGMQGLSFTQNGVPVSISPITEQPDSY-TATVVGNTAGDVTITPQVD 483 +++ G Q ++FT +S S + Y T+ T G ++ +V Sbjct: 678 ITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVS 737 Query: 484 TLILSTLQKKISLFPVPTLTGILVNGQNFATDKGFPKTIFKNATFQLQMDNDVANNTQYE 543 + + ++ F T+ + P + L+ N +Y Sbjct: 738 DVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKAS---GGNGKYT 794 Query: 544 WSSSFTPNVSVN-DQGQVTITYQTYSEVAVTAKSKKFPSYSVSYRFYPNRWIYDGGTSLV 602 W S+ SV+ GQVT+ + + ++V + + +Y+++ PN I + V Sbjct: 795 WRSANPAIASVDASSGQVTLKEKGTTTISVISSDNQTATYTIA---TPNSLIVPNMSKRV 851 Query: 603 SSIEASRQCQGSDMSAVLESSRATNGTRAPDGTLWGEWGSLTAYS--SDWQSGEYWVKRT 660 + +A C+ + L SS+ ++ WG+ Y Q+ WV++T Sbjct: 852 TYNDAVNTCK--NFGGKLPSSQNEL------ENVFKAWGAANKYEYYKSSQTIISWVQQT 903 Query: 661 STD 663 + D Sbjct: 904 AQD 906
>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE chaperone signature. Length = 130 Score = 28.9 bits (64), Expect = 0.008 Identities = 15/34 (44%), Positives = 21/34 (61%), Gaps = 2/34 (5%) Query: 40 LKNQDPTNPMENNELTTQLAQINTVSGIEKLNTT 73 L N+ P N ++NN L TQL + V G E+L T+ Sbjct: 89 LWNRQPLNSLDNNSLYTQLEML--VQGAERLQTS 120
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 45.3 bits (107), Expect = 3e-07 Identities = 22/87 (25%), Positives = 42/87 (48%), Gaps = 8/87 (9%) Query: 6 AVSGMNAASSNLDVIGNNIANSATSGFKAGSVSFAD----MFAGSQTGMGVKVAGITQDF 61 A+SG+NAA + L+ NNI++ +G+ + A + AG G GV V+G+ +++ Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGVQREY 66 Query: 62 NDGTATTTNRRLDLAISQNGFFRMQDS 88 + +L A +Q+ + Sbjct: 67 DA----FITNQLRAAQTQSSGLTARYE 89 Score = 40.7 bits (95), Expect = 9e-06 Identities = 15/49 (30%), Positives = 28/49 (57%) Query: 380 TLTSGALESSNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILQTLVSLR 428 L++ S V+L +E N+ Q+ Y +NAQ ++T + I L+++R Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 41.5 bits (97), Expect = 2e-06 Identities = 11/41 (26%), Positives = 22/41 (53%) Query: 192 ETSNVNVAEELVNMIQTQRAYEINSKAVSTSDQMLQKLAQL 232 S VN+ EE N+ + Q+ Y N++ + T++ + L + Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 391 bits (1007), Expect = e-138 Identities = 155/366 (42%), Positives = 217/366 (59%), Gaps = 9/366 (2%) Query: 5 SLVTLLMVLLSLVWLPASAERIRDLVTVQGVRDNALIGYGLVVGLDGSGDQTMQTPFTTQ 64 +LV + LS A RI+D+ ++Q RDN LIGYGLVVGL G+GD +PFT Q Sbjct: 10 ALVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQ 69 Query: 65 SLSNMLSQLGITVPPGTNMQLKNVAAVMVTAKLPAFSRAGQTIDVVVSSMGNAKSIRGGT 124 S+ ML LGIT G + KN+AAVMVTA LP F+ G +DV VSS+G+A S+RGG Sbjct: 70 SMRAMLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGN 128 Query: 125 LLMTPLKGVDNQVYALAQGNVLVGGAGAAAGGSSVQVNQLAGGRISNGATIERELPTTFG 184 L+MT L G D Q+YA+AQG ++V G A +++ R+ NGA IERELP+ F Sbjct: 129 LIMTSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFK 188 Query: 185 TDGIINLQLNSEDFTLAQQVSDAINR----QRGFGSATAIDARTIQVLVPRGGSSQVRFL 240 + LQL + DF+ A +V+D +N + G A D++ I V PR + R + Sbjct: 189 DSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPR-VADLTRLM 247 Query: 241 ADIQNIPINVDPGDAKVIINSRTGSVVMNRNVVLDSCAVAQGNLSVVVDKQNIVSQPDTP 300 A+I+N+ + D AKV+IN RTG++V+ +V + AV+ G L+V V + V QP P Sbjct: 248 AEIENLTVETD-TPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-AP 305 Query: 301 FGGGQTVVTPNTQISVQQQGGVLQRVNASPNLNNVVRALNSLGATPIDLMSILQAMESAG 360 F GQT V P T I Q+G + V P+L +V LNS+G +++ILQ ++SAG Sbjct: 306 FSRGQTAVQPQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQGIKSAG 364 Query: 361 CLRAKL 366 L+A+L Sbjct: 365 ALQAEL 370
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 310 bits (796), Expect = e-108 Identities = 179/316 (56%), Positives = 232/316 (73%), Gaps = 6/316 (1%) Query: 1 MSDLLAMSGAAYDARSLEALKRDAARDPEGNLKQVAQQVEGMFVQMMLKSMRAALPQDGV 60 +SD ++ AA+DA+SL LK A DP N++ VA+QVEGMFVQMMLKSMR ALP+DG+ Sbjct: 2 ISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDGL 61 Query: 61 MNSEQTKLYTSLYDQQIAQQMSA-KGLGLADMMVEQLS-GSTSASETAGTVPMMLDNEVL 118 +SE T+LYTS+YDQQIAQQM+A KGLGLA+MMV+Q++ E+ PM E + Sbjct: 62 FSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETV 121 Query: 119 QSMPAQALAQVMRRAIPTPPSSSMAAISPGNGNFVARMSIPAQIASQQSGIPHQLIMAQA 178 QAL+Q++++A+P S+ S F+A++S+PAQ+ASQQSG+PH LI+AQA Sbjct: 122 VRYQNQALSQLVQKAVPRNYDDSLPGDSK---AFLAQLSLPAQLASQQSGVPHHLILAQA 178 Query: 179 ALESGWGQREIPTADGKSSYNVFGIKAGSSWNGPVSEITATEYEQGVAKKTKARFRVYGS 238 ALESGWGQR+I +G+ SYN+FG+KA +W GPV+EIT TEYE G AKK KA+FRVY S Sbjct: 179 ALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSS 238 Query: 239 YVEAVSDYVKLLTQNPRYAHVAAAQSPEQGAHALQKAGYATDPQYAQKLVSVIQQMRSTG 298 Y+EA+SDYV LLT+NPRYA V A S EQGA ALQ AGYATDP YA+KL ++IQQM+S Sbjct: 239 YLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSIS 298 Query: 299 EQAVKAYGGSDLSQLF 314 ++ K Y ++ LF Sbjct: 299 DKVSKTY-SMNIDNLF 313
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 436 bits (1123), Expect = e-150 Identities = 315/552 (57%), Positives = 398/552 (72%), Gaps = 9/552 (1%) Query: 3 NSLMNTAMSGLNAAQYALSTVSNNITNFQVAGYNRQNTVFAQNGGTITSAGFIGNGVTVT 62 +SL+N AMSGLNAAQ AL+T SNNI+++ VAGY RQ T+ AQ T+ + G++GNGV V+ Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60 Query: 63 GVNREYNAFITNQLRASQTQSSGLATYYQQISQIDNLLSNASNNLSTTMQDFFSNLQNLV 122 GV REY+AFITNQLRA+QTQSSGL Y+Q+S+IDN+LS ++++L+T MQDFF++LQ LV Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120 Query: 123 SNADDDAARKTVLGKAEGLVNQFQNADKYLRDMDDGVNQKITDSATQINNYAEQIAKLND 182 SNA+D AAR+ ++GK+EGLVNQF+ D+YLRD D VN I S QINNYA+QIA LND Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180 Query: 183 QITRLRG-SSGSEPNALLDQRDQLVTELNQIMAVTVTQQDGDAYNVSFAGGLSLVQGPNA 241 QI+RL G +G+ PN LLDQRDQLV+ELNQI+ V V+ QDG YN++ A G SLVQG A Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240 Query: 242 YKVEAIPSSADATRLTLGYKRGNGEATEVDESRITTGSLGGTLKFRSEALDSARNQLGQL 301 ++ A+PSSAD +R T+ Y G E+ E + TGSLGG L FRS+ LD RN LGQL Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300 Query: 302 ALVMADSFNTQHNAGFDINGDEGEDFFSFADPTVLKNAKNQGNASITVEYKDTSKVKASD 361 AL A++FNTQH AGFD NGD GEDFF+ P VL+N KN+G+ +I D S V A+D Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 360 Query: 362 YTVEFDGTDWQVTRLSDNTKVQTTPGVNADGDPTLEFEGVAIKIDNGTPGPQAKDKFTIK 421 Y + FD WQVTRL+ NT TP D + + F+G+ + P D FT+K Sbjct: 361 YKISFDNNQWQVTRLASNTTFTVTP----DANGKVAFDGLELTFTG---TPAVNDSFTLK 413 Query: 422 TVSNVAANLQVAITDSSKIAAAGSADGGISDNTNAQALLDLQSKKLVEGK-TTLSGAYAG 480 VS+ N+ V ITD +KIA A D G SDN N QALLDLQS G + + AYA Sbjct: 414 PVSDAIVNMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYAS 473 Query: 481 LVSNVGNQTATAKTNSTAQANIVTQLTTEQQSISGVNLDEEYGDLQRFQQYYLANAQVLQ 540 LVS++GN+TAT KT+S Q N+VTQL+ +QQSISGVNLDEEYG+LQRFQQYYLANAQVLQ Sbjct: 474 LVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQ 533 Query: 541 AASTLFNALLSI 552 A+ +F+AL++I Sbjct: 534 TANAIFDALINI 545
>FLAGELLIN#Flagellin signature. Length = 507 Score = 40.4 bits (94), Expect = 4e-06 Identities = 35/137 (25%), Positives = 63/137 (45%), Gaps = 7/137 (5%) Query: 4 STSMLYQQNMQGITNAQSLWMQTGQQLSTGKRVVNPSDDPMAASQAVMVSQAESENSQYT 63 S S+L Q N+ ++ S ++LS+G R+ + DD AA QA+ + Sbjct: 8 SLSLLTQNNLNKSQSSLS---SAIERLSSGLRINSAKDD--AAGQAIANRFTSNIKGLTQ 62 Query: 64 LARSFARQSSSLETT--VLAQTTSTIQSIQSLVISAKNDTLSDDDRASYATQLQGLKDQL 121 +R+ S +TT L + + +Q ++ L + A N T SD D S ++Q +++ Sbjct: 63 ASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEI 122 Query: 122 LNQANTTDGNGRYIFAG 138 +N T NG + + Sbjct: 123 DRVSNQTQFNGVKVLSQ 139
>FLGFLIJ#Flagellar FliJ protein signature. Length = 147 Score = 112 bits (281), Expect = 9e-35 Identities = 82/144 (56%), Positives = 102/144 (70%) Query: 1 MKSQSPLVTLCDLAQKAVEQASTQLGHVRQSYQNAEQQLTMLLTYQDEYRERLNDTLCNG 60 M L TL DLA+K VE A+ LG +R+ Q AE+QL ML+ YQ+EYR LN + G Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60 Query: 61 MASSSWQNYQQFIQTLEQAIDQHRKQLAQWSIKVEQAVKYWQEKQQRLNAFETLQERAET 120 + S+ W NYQQFIQTLE+AI QHR+QL QW+ KV+ A+ W+EK+QRL A++TLQER T Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120 Query: 121 TQRQQENRLDQKLMDEFAQRASQR 144 ENRLDQK MDEFAQRA+ R Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMR 144
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 221 bits (563), Expect = 5e-75 Identities = 128/233 (54%), Positives = 167/233 (71%), Gaps = 7/233 (3%) Query: 6 NALPWQPWSLKDFASQSEAPLSESMPDISLLFPNEPMEATAAVDEQQVLVNLQLEAEKQG 65 + LPW+ W+ D A P +E +P + P E + A +Q L LQ++A +QG Sbjct: 3 DNLPWKTWTPDDLAP----PQAEFVPIVE---PEETIIEEAEPSLEQQLAQLQMQAHEQG 55 Query: 66 RQQGFAKGLQEGLDKGYQTGLEEGHQQALADAQQQLAPMTAHWQVMVTDFQNTLDTLDSV 125 Q G A+G Q+G +GYQ GL +G +Q LA+A+ Q AP+ A Q +V++FQ TLD LDSV Sbjct: 56 YQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSV 115 Query: 126 IASRLVQIALAAAKQIIGQPAICDGTALLAQIQQMIQQEPMFAGKTQLRVNPDDLAIVEQ 185 IASRL+Q+AL AA+Q+IGQ D +AL+ QIQQ++QQEP+F+GK QLRV+PDDL V+ Sbjct: 116 IASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDD 175 Query: 186 RLGSTLSLHGWRLLGDSQIHAGGCKVSAEEGDLDASLATRWHELCRLAAPGEL 238 LG+TLSLHGWRL GD +H GGCKVSA+EGDLDAS+ATRW ELCRLAAPG + Sbjct: 176 MLGATLSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 314 bits (806), Expect = e-108 Identities = 113/327 (34%), Positives = 192/327 (58%), Gaps = 2/327 (0%) Query: 2 SLTGTEKSAIMLMTLGEDHAAEVFKHLSSREVQQLSTTMASMRQVSHQQLVDVLAEFEDD 61 +LTG +K+AI+L+++G + +++VFK+LS E++ L+ +A + ++ + +VL EF++ Sbjct: 14 ALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKEL 73 Query: 62 AEQYAALSVNASDYLRSVLIKALGEERASSLLEDILESRETTSGMETLNFMEPQMAADLI 121 + DY R +L K+LG ++A ++ + L S + E + +P + I Sbjct: 74 MMAQEFIQKGGIDYARELLEKSLGTQKAVDIINN-LGSALQSRPFEFVRRADPANILNFI 132 Query: 122 RDEHPQIIATILVHLKRAQAADILALFDERLRNDVMLRIATFGGVQPAALAELTEVLNNL 181 + EHPQ IA IL +L +A+ IL+ ++ +V RIA P + E+ VL Sbjct: 133 QQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKK 192 Query: 182 LDGQ-NLKRSKMGGIRTAAEIINLMKTQQEETVMDAVREYDGELAQKIIDEMFLFENLVS 240 L + + GG+ EIIN+ + E+ +++++ E D ELA++I +MF+FE++V Sbjct: 193 LASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVL 252 Query: 241 VDDRSIQRLLQEIDNESLLIALKGADQALRERFLSNMSLRAAEILRDDLATRGPVRMSLV 300 +DDRSIQR+L+EID + L ALK D ++E+ NMS RAA +L++D+ GP R V Sbjct: 253 LDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDV 312 Query: 301 ENEQKSILLIVRRLAESGEIVIGGGED 327 E Q+ I+ ++R+L E GEIVI G + Sbjct: 313 EESQQKIVSLIRKLEEQGEIVISRGGE 339
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 577 bits (1488), Expect = 0.0 Identities = 354/552 (64%), Positives = 443/552 (80%), Gaps = 9/552 (1%) Query: 19 LARLRANPKIPLLIAAAAAIAIIVALMLWAKSPDYRVLYSNLSDRDGGDIVTQLTQLNIP 78 L RLRANP+IPL++A +AA+AI+VA++LWAK+PDYR L+SNLSD+DGG IV QLTQ+NIP Sbjct: 16 LNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIP 75 Query: 79 YRFADNGGALLIPAEKVHETRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQINYQRAL 138 YRFA+ GA+ +PA+KVHE RLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQ+NYQRAL Sbjct: 76 YRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRAL 135 Query: 139 EGELSRTIGTLGPVLNVRVHLAMPKPSLFVREQKSPTASVTLALQPGRALDDGQINAIVY 198 EGEL+RTI TLGPV + RVHLAMPKPSLFVREQKSP+ASVT+ L+PGRALD+GQI+A+V+ Sbjct: 136 EGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVH 195 Query: 199 MVSSSVAGLPPGNVTVVDQTGRLLTQSDSAGRDLNASQLKFTSEVENRYQRRIENILAPM 258 +VSS+VAGLPPGNVT+VDQ+G LLTQS+++GRDLN +QLKF ++VE+R QRRIE IL+P+ Sbjct: 196 LVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRIQRRIEAILSPI 255 Query: 259 VGNGNVHAQVTAQVDFASREQTDEEYKPNQAANQGAVRSQQVSTSEQLGGTNVGGVPGAL 318 VGNGNVHAQVTAQ+DFA++EQT+E Y PN A++ +RS+Q++ SEQ+G GGVPGAL Sbjct: 256 VGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGAL 315 Query: 319 SNQPPVAPIAPIEIPQPAGAAANNAAPANTAATANANTTATAAKASSSNSRHDQTTNFEV 378 SNQP API P A N +T+ +N+ A +++ ++T+N+EV Sbjct: 316 SNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNS--------AGPRSTQRNETSNYEV 367 Query: 379 DRTIRHTQQQAGMVQRLSVAVVVNYTSDKAGKPIALSKDQLAQVESLTREAMGFSTVRGD 438 DRTIRHT+ G ++RLSVAVVVNY + GKP+ L+ DQ+ Q+E LTREAMGFS RGD Sbjct: 368 DRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDKRGD 427 Query: 439 TLNVVNTPFTASDDTRGSSLPFWQQQSFFDQLLNAGRYLLILLVAWILWRKLLRPMLAKK 498 TLNVVN+PF+A D+T G LPFWQQQSF DQLL AGR+LL+L+VAWILWRK +RP L ++ Sbjct: 428 TLNVVNSPFSAVDNT-GGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRR 486 Query: 499 QVADKAAASVNNIVQTAQAAETVKQSKEELALRKKNQQRVSAEVQAQRIRELADKDPRVV 558 KAA + Q + A V+ SK+E +++ QR+ AEV +QRIRE++D DPRVV Sbjct: 487 VEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVV 546 Query: 559 ALVIRQWMSNDQ 570 ALVIRQWMSND Sbjct: 547 ALVIRQWMSNDH 558
>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature. Length = 103 Score = 80.5 bits (198), Expect = 2e-23 Identities = 59/102 (57%), Positives = 73/102 (71%) Query: 2 SVQGIEGVLQQLQVTALQASGSAKTLPAEAGFASELKAAIGKISENQQVARTSAQNFELG 61 ++QGIEGV+ QLQ TA+ A FA +L AA+ +IS+ Q ART A+ F LG Sbjct: 2 AIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLG 61 Query: 62 VPGVGLNDVMVNAQKSSVSLQLGIQVRNKLVAAYQEVMNMGV 103 PGV LNDVM + QK+SVS+Q+GIQVRNKLVAAYQEVM+M V Sbjct: 62 EPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 29.0 bits (65), Expect = 0.049 Identities = 20/121 (16%), Positives = 38/121 (31%), Gaps = 11/121 (9%) Query: 32 PLTTQQTSYKSKLTAYGVLQSALAKLETASTALKKADTLNSTAVSGSNSAFSATTDSAAS 91 P + +Y A V + +E + ++ST+ S + + T S Sbjct: 41 PAVSVSANY-PGADAQTVQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSG-- 97 Query: 92 AGTYSIEVTNLAKAQSLLSADVPSATDKLGSSDATRTITITQPGQKEPMKISLTSEQTSL 151 T+ AQ + + AT L + I++ + M S+ Sbjct: 98 --------TDPDIAQVQVQNKLQLATPLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGT 149 Query: 152 T 152 T Sbjct: 150 T 150
>FLAGELLIN#Flagellin signature. Length = 507 Score = 142 bits (359), Expect = 6e-43 Identities = 133/156 (85%), Positives = 144/156 (92%) Query: 3 VINTNSLSLLTQNNLNKSQSSLGTAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQ 62 VINTNSLSLLTQNNLNKSQSSL +AIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQ Sbjct: 3 VINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQ 62 Query: 63 AARNANDGISIAQTTEGSLNEINNNLQRVRELTVQAQNGSNSSSDLDSIQDEISLRLAEI 122 A+RNANDGISIAQTTEG+LNEINNNLQRVREL+VQA NG+NS SDL SIQDEI RL EI Sbjct: 63 ASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEI 122 Query: 123 DRVSDQTQFNGKKVLAENTTMSIQVGANDGETIDVN 158 DRVS+QTQFNG KVL+++ M IQVGANDGETI ++ Sbjct: 123 DRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITID 158
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 137 bits (346), Expect = 1e-43 Identities = 58/185 (31%), Positives = 97/185 (52%), Gaps = 16/185 (8%) Query: 1 MKNKTTLAAFITAILLSSSAAYAAGDRTISLGYAQGDVRLGDGNRKDIRLDDDLKGINVK 60 MK L+A + ++ + AA T++ GYAQ D + G N + G N+K Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAATS-TVTGGYAQSDAQ-GQMN--------KMGGFNLK 50 Query: 61 YLHKLSE-MFGAIGSFTYTDLNYDYLNNNVKIGDASFDYYSLMVGPSVHFNEFFSMYALL 119 Y ++ G IGSFTYT+ ++ YY + GP+ N++ S+Y ++ Sbjct: 51 YRYEEDNSPLGVIGSFTYTE--KSRTASSGDYNKN--QYYGITAGPAYRINDWASIYGVV 106 Query: 120 GIGHGNAKASVL-GYGKKEEQDSLAYGVGMQFNPLNNIAIDASYEYTKLKDANIGTWVLG 178 G+G+G + + Y +YG G+QFNP+ N+A+D SYE ++++ ++GTW+ G Sbjct: 107 GVGYGKFQTTEYPTYKHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWIAG 166 Query: 179 IGYRF 183 +GYRF Sbjct: 167 VGYRF 171
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 71.8 bits (176), Expect = 7e-17 Identities = 25/115 (21%), Positives = 46/115 (40%), Gaps = 2/115 (1%) Query: 2 ISVLLVDDHELVRAGIRRILDDIKGIKVAGEMQCGEDAVKWCRSHVVDIVLMDMNMPGIG 61 ++L+ DD +R + + L G V +W + D+V+ D+ MP Sbjct: 4 ATILVADDDAAIRTVLNQALSR-AGYDVRIT-SNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 62 GLEATRKILRFSPDTKVIMLTIHTENPLPAKVMQAGAGGYLSKGAAPQDVITAIR 116 + +I + PD V++++ K + GA YL K ++I I Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116
>ENTEROTOXINB#Heat labile enterotoxin B chain signature. Length = 124 Score = 26.2 bits (57), Expect = 0.038 Identities = 18/68 (26%), Positives = 31/68 (45%), Gaps = 1/68 (1%) Query: 18 MEGISEATLYNWRNQAKSEGEPVPGAEKNSEQWPAEARLAVIVETATLSETEIAEYCRKK 77 + G E + ++N A + E VPG++ Q A R+ + A L+E ++ + C Sbjct: 52 LAGKREMAIITFKNGAIFQVE-VPGSQHIDSQKKAIERMKDTLRIAYLTEAKVEKLCVWN 110 Query: 78 GLYPAQIA 85 P IA Sbjct: 111 NKTPHAIA 118
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 48.6 bits (115), Expect = 3e-08 Identities = 101/420 (24%), Positives = 170/420 (40%), Gaps = 55/420 (13%) Query: 14 TLLMAGNASA---QETLRVLLEGHSTSDSIKALLPEFEKQTGIKVQAEIVPYSDLTSKAL 70 T++ + +A A + L + + G + + + +FEK TGIKV E + D + Sbjct: 17 TMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVE---HPDKLEEKF 73 Query: 71 LAFSSHSGRYDVVMDDWVHAV--GYASAGYITPVDQWMESDTAFYDGADFVKSYA---DT 125 ++ D++ W H GYA +G + + D AF D K Y D Sbjct: 74 PQVAATGDGPDIIF--WAHDRFGGYAQSGLLAEI----TPDKAFQD-----KLYPFTWDA 122 Query: 126 LRYKDGYYGLPVYGESTFLMYRKDLFEQYGIAVPKTFDELTAAAKTIKEKTEGKVAGITL 185 +RY P+ E+ L+Y KDL PKT++E+ A K +K K GK A + Sbjct: 123 VRYNGKLIAYPIAVEALSLIYNKDLLPN----PPKTWEEIPALDKELKAK--GKSA--LM 174 Query: 186 RGAQGIQNTFAWASFLWGYGGQWIDDNGK-----SAITSPQAVEATKSFVNILKNYGPIG 240 Q + F W G + +NGK + + A V+++KN Sbjct: 175 FNLQ--EPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNA 232 Query: 241 AANFGWQENRLVFQQGKAAMTIDSTVNGGFNEDPKESTVVGKVGYAPVPVQPGDHPGNSG 300 ++ E F +G+ AMTI NG + +++ KV Y + + Sbjct: 233 DTDYSIAE--AAFNKGETAMTI----NGPWAWSNIDTS---KVNYGVTVLPTFKGQPSKP 283 Query: 301 ALQVHGLYISSDSKKQDAAWKFISWATDKQTQMKSVELNPNAGVSSLSAINSDAFTKRYG 360 + V I++ S ++ A +F+ +++V + L A+ ++ + Sbjct: 284 FVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKD-----KPLGAVALKSYEEELA 338 Query: 361 AFKDGMLAALQNGNAK--YLPTIPQSTQIINITGIALSEALAGTQTVENALQQANTRNDK 418 KD +AA K +P IPQ + A+ A +G QTV+ AL+ A TR K Sbjct: 339 --KDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK 396
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 45.8 bits (108), Expect = 1e-06 Identities = 32/156 (20%), Positives = 55/156 (35%), Gaps = 19/156 (12%) Query: 1561 LVTGAFGGLGRLAVNWLREKGARRIALLAPRVDESWLRDVEGGQTRVCR------CDVGD 1614 +TGA G+G L +GA + A + L V R DV D Sbjct: 12 FITGAAQGIGEAVARTLASQGAH---IAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 1615 AGQLATVLDDLAAN-GGIAGAIHAAGVLADAPLQELDDHQLAAVFAVKAQAASQLLQTLR 1673 + + + + G I ++ AGVL + L D + A F+V + +++ Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 1674 NH-----DGRYLILYSSAAAT----LGAPGQSAHAL 1700 + G + + S+ A + A S A Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAA 164
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 51.2 bits (122), Expect = 2e-08 Identities = 22/70 (31%), Positives = 44/70 (62%) Query: 22 QQLRERLIQELNLTPQQLHEESNLIQAGLDSIRLMRWLHWFRKNGYRLTLRELYAAPTLA 81 + +R+++ + L TP+ + ++ +L+ GLDS+R+M + +R+ G +T EL PT+ Sbjct: 233 ENIRKQIAELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIE 292 Query: 82 AWNQLMLSRS 91 W +L+ +RS Sbjct: 293 EWQKLLTTRS 302
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 29.8 bits (67), Expect = 0.022 Identities = 13/116 (11%), Positives = 26/116 (22%), Gaps = 15/116 (12%) Query: 287 ESGTSSGQTAIGIQTSLPGYLKALGLGLVNTAGGVSYLLSDSYG--TDSRIATGVGISLS 344 + + + ++P L + + S SY D + Sbjct: 584 NAWQKGRDQMLALNVNIP-----FSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVY 638 Query: 345 DSNGSTMNFVGWG-------GCAQTQDCLTTADAGWYPILTGASGNGSHSAGYNNY 393 + N + + G A + A+ SHS Sbjct: 639 GTLLED-NNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQL 693
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 742 bits (1917), Expect = 0.0 Identities = 244/882 (27%), Positives = 392/882 (44%), Gaps = 72/882 (8%) Query: 6 LLVTHISSAADNNNQDDYIFDDALVRGSSLGLGSIARFNKKNSYDAGQYQVDMYMNNKFV 65 L V +A + + F+ + + ++RF G Y+VD+Y+NN ++ Sbjct: 30 LFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGYM 89 Query: 66 DRLKMLFVDKDNS--VEPCLSVAQLLQAGVKEEALKTAD--PKTPCLAFQSILPASDFRF 121 + F D+ + PCL+ AQL G+ ++ + C+ S++ + + Sbjct: 90 ATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHDATAQL 149 Query: 122 DHAKLRFDLSIPQKFVKNVPRGYVDPKNLTAGNTIGFSNYNLNQYHVDYNKEGIKRTTNS 181 D + R +L+IPQ F+ N RGY+ P+ G G NYN + V G ++ Sbjct: 150 DVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGG---NSHY 206 Query: 182 TYLSLNSGINIGMWRFRQQGSLRYDASRG-----TNWTSNRLYSQRALPTIGSEITLGET 236 YL+L SG+NIG WR R + Y++S W + +R + + S +TLG+ Sbjct: 207 AYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDG 266 Query: 237 FSSGQFFSSLGFLGVALSTDDRMLPESQRGYAPVVRGIARTNARVMVYQNNRSIYQTTVS 296 ++ G F + F G L++DD MLP+SQRG+APV+ GIAR A+V + QN IY +TV Sbjct: 267 YTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVP 326 Query: 297 PGAFEFNDLSVTHFGGDLTVEINEADGSVSTFQVPFASVPESLRPGYSRYSFAAGQVRDV 356 PG F ND+ GDL V I EADGS F VP++SVP R G++RYS AG+ R Sbjct: 327 PGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSG 386 Query: 357 GN---NETFSELTYQQGISNAITANTGIRLASGYQAIMLGGVF-THYIGALGLNTTYSHA 412 F + T G+ T G +LA Y+A G +GAL ++ T +++ Sbjct: 387 NAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANS 446 Query: 413 RLPDGEQQQGWMAKASFSRTFQPTNTTLSVAGYRYSTDGYRDLSDVLGVR---------- 462 LPD Q G + ++++ + T + + GYRYST GY + +D R Sbjct: 447 TLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQD 506 Query: 463 ----ATSNDSSWNSSTYRQRSRAEISLNQNFHRYGSLYLTASSQDYRDDRSRDSQLQLGY 518 + + + Y +R + ++++ Q R +LYL+ S Q Y + D Q Q G Sbjct: 507 GVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAGL 566 Query: 519 SNTFWRNTSFNLAISQQKTGGANKIYFVDPGSGMPASNGANTLATRETVAQMSISFPLGG 578 + + + ++ L+ S K R+ + ++++ P Sbjct: 567 NTA-FEDINWTLSYSLTKNAWQKG---------------------RDQMLALNVNIPFSH 604 Query: 579 SSSAP--------YVSAGAVNSRTSGASYQTSLSGTMGSDQTAGYSVDVARNEP---TNE 627 + S + + + GT+ D YSV + Sbjct: 605 WLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSG 664 Query: 628 NTLSGSLQKQLPTTSLSGSASRSPGYWQGSASARGSVAFHRGGVTLGPYLSDTFALIEAK 687 +T +L + + + S S Q G V H GVTLG L+DT L++A Sbjct: 665 STGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAP 724 Query: 688 GASGAKVMYGQGARIDRFGYALVPTLTPYRYNTLSLDPDGMDFNTELQDGERQIAPYAGS 747 GA AKV G R D GYA++P T YR N ++LD + + N +L + + P G+ Sbjct: 725 GAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGA 784 Query: 748 TVKVTFRTLNGYPALITIKMPDGSQLPMGTVVYNYNGKGTNDKNDIVGMVGQSSQAYLRA 807 V+ F+ G L+T+ + LP G +V T++ + G+V + Q YL Sbjct: 785 IVRAEFKARVGIKLLMTLT-HNNKPLPFGAMV-------TSESSQSSGIVADNGQVYLSG 836 Query: 808 EELSGTLTLVWGESSKERCQLDYDLGKPTDNDKQLYKLDALC 849 L+G + + WGE C +Y L P + L +L A C Sbjct: 837 MPLAGKVQVKWGEEENAHCVANYQL-PPESQQQLLTQLSAEC 877
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 160 bits (405), Expect = 8e-51 Identities = 87/248 (35%), Positives = 120/248 (48%), Gaps = 20/248 (8%) Query: 10 RRLTWSLIFSIGLHGSVVAALLYVSVEQMKIQPEIEDTPLAVTMVNIAEFAAPQPAAAAP 69 RR W + S+ +HG+VVA LLY SV Q+ P P++VTMV A Sbjct: 7 RRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQ-PISVTMVT-----------PAD 54 Query: 70 EPVQETPAVPEETPPVLEETPPEPEELPEPVPVPVPEPVKPKPKPVKKEVKKPEVKKTQ- 128 + P E E P E P+ PV + +P K K E K Sbjct: 55 LEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDV 114 Query: 129 ---APPDDKPFKSDEAALVANNAPVKSAPVASTPGLSTSAGPKALSKAKPSYPARALALG 185 PF++ A + ++ + P S ++GP+ALS+ +P YPARA AL Sbjct: 115 KPVESRPASPFENTAPARLTSSTATAATS---KPVTSVASGPRALSRNQPQYPARAQALR 171 Query: 186 IEGQVKVQYDIDESGRVTNVRVLEATPRNTFEREVKQVMRKWRFEA-VAAKNYVTTIVFK 244 IEGQVKV++D+ GRV NV++L A P N FEREVK MR+WR+E V I+FK Sbjct: 172 IEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFK 231 Query: 245 LDGKMEMN 252 ++G E+ Sbjct: 232 INGTTEIQ 239
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 158 bits (402), Expect = 3e-52 Identities = 71/180 (39%), Positives = 105/180 (58%), Gaps = 10/180 (5%) Query: 1 MKWITTLAPLSLALSLGISVANAASDASNTVSFGYAQSTLKIDGEKIGKDNKGFNLKYRH 60 MK I L+ L+ L+ + AA+ +TV+ GYAQS + K+ GFNLKYR+ Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAAT---STVTGGYAQSDAQGQMNKM----GGFNLKYRY 53 Query: 61 ELD-SVLGIVASFTHTKQNYGMPGDSDGKRKVEYYSLMVGPSWRFNEFVSAYALIGATQG 119 E D S LG++ SFT+T+++ K +YY + GP++R N++ S Y ++G G Sbjct: 54 EEDNSPLGVIGSFTYTEKSRTASSGDYNK--NQYYGITAGPAYRINDWASIYGVVGVGYG 111 Query: 120 KSTHTKPRMVSNTVSKTSMGYGAGLQFNPVKHVAIDTAYEYAKIEDVKIGTWIVGGGYRF 179 K T+ + S YGAGLQFNP+++VA+D +YE ++I V +GTWI G GYRF Sbjct: 112 KFQTTEYPTYKHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWIAGVGYRF 171
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.0 bits (70), Expect = 0.007 Identities = 9/16 (56%), Positives = 11/16 (68%) Query: 54 VVGESGCGKSTFARAI 69 + GESG GK ARA+ Sbjct: 165 ITGESGTGKELVARAL 180
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 30.2 bits (68), Expect = 0.048 Identities = 21/90 (23%), Positives = 33/90 (36%), Gaps = 5/90 (5%) Query: 251 RAAAQATKAQENADLSAATAKENFIQRLKAQADLQGKTASEIQAYKAAQLGVTEQAAPFI 310 A A K Q + L A ++ Q L +L ++ Q E+ Sbjct: 131 GAEADTLKTQ--SSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLT 188 Query: 311 AKLKEQESAWQNGALSAKQYRLALRQLPSQ 340 + +KEQ S WQN Q L L + ++ Sbjct: 189 SLIKEQFSTWQN---QKYQKELNLDKKRAE 215
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 104 bits (260), Expect = 3e-28 Identities = 37/249 (14%), Positives = 83/249 (33%), Gaps = 41/249 (16%) Query: 33 QTALFFGKDDRTAVTNSRQWPWEAIGQVET---ASGNLCTATLISPRLVLTAGHCVLTP- 88 + +DR +T++ + + ++ + + ++ +LT H V Sbjct: 66 HANVILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVVDATH 125 Query: 89 --PGNIDQAVALRFISDKGHWKYQITDLKTRVDAKLGQKLKADGDGWIVPPAAAAYDFAL 146 P + + + + + + +GD A F+ Sbjct: 126 GDPHALKAFPSAINQDNYPNGGFTAEQITKY---------SGEGD-------LAIVKFSP 169 Query: 147 IQLTNAAPIPIKPLPLWEGTANELTKALKLVNRKVTQAGYPLD-NLNTLYKHEDCLVTGW 205 + +KP + A VN+ +T GYP D + T+++ + + Sbjct: 170 NEQNKHIGEVVKPATM-------SNNAETQVNQNITVTGYPGDKPVATMWESKG--KITY 220 Query: 206 AQQGVLAHQCDTLPGDSGSPLLLKNGNSWSLIAIQSSAPAAKERYLADNRALSVT-AINN 264 + + + T G+SGSP+ + +I I + N A+ + + N Sbjct: 221 LKGEAMQYDLSTTGGNSGSPVFNEKNE---VIGIHWGGVPNE-----FNGAVFINENVRN 272 Query: 265 RLKKLVNKI 273 LK+ + I Sbjct: 273 FLKQNIEDI 281
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 29.4 bits (66), Expect = 0.032 Identities = 20/88 (22%), Positives = 33/88 (37%), Gaps = 14/88 (15%) Query: 1 MKVTVFGI-GYVGLVQATVLAEVGHDVLCID-IDANKVADLKKGRIAIFEPGLAPLVK-- 56 MK V G G++G + L E GH V+ ID ++ LK+ R+ + K Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 57 -ENYEAGRLQFSTD---------AQAGV 74 + E F++ + V Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAV 88
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 84.1 bits (208), Expect = 4e-20 Identities = 31/115 (26%), Positives = 48/115 (41%), Gaps = 1/115 (0%) Query: 10 ILVVEDEVVFRTVLAEYLGSLGATIHQAENGLAALYQLKGHSPDLILCDLAMPKMGGIEF 69 ILV +D+ RTVL + L G + N + DL++ D+ MP + Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 70 VEQLLLKGIKIPVLVISATDKMADIAQVLRLGVKDVLLKPIVDLNRLREAVLACL 124 + ++ +PVLV+SA + + G D L KP DL L + L Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGIIGRAL 119
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 62.5 bits (152), Expect = 7e-13 Identities = 66/356 (18%), Positives = 127/356 (35%), Gaps = 15/356 (4%) Query: 14 FLLFDNLLVVLGFFVVFPLISIRFVDQLGWAALVV---GLALGLRQLVQQGLGIFGGAIA 70 +L L +G ++ P++ + L + V G+ L L L+Q GA++ Sbjct: 9 VILSTVALDAVGIGLIMPVLPG-LLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALS 67 Query: 71 DRFGAKPMIVTGMLMRAAGFALMAMADEPWILWLACALSGLGGTLFDPPRTALVIKLTRP 130 DRFG +P+++ + A +A+MA A W+L++ ++G+ G A + +T Sbjct: 68 DRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA-GAYIADITDG 126 Query: 131 HERGRFYSLLMMQDSAGAVIGALIGSWLLQYDFHFVCWTGAAIFVLAAGWNAWLLPAYRI 190 ER R + + G V G ++G + + H + AA+ L +LLP Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHK 186 Query: 191 STVRAPMKEGLMRVLRDRRFVTYVLTLTGYYMLAVQVMLMLPI--------VVNELAGSP 242 R P++ + L R+ + + + + L+ + + Sbjct: 187 GE-RRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDA 245 Query: 243 AAVKWMYAIEAALSLTLLYPLARWSEKRFSLEQRLMAGLLIMTLSLFPIGMITHLQTLFM 302 + A L + R + LM G++ + T F Sbjct: 246 TTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFP 305 Query: 303 FICFFYMGSILAEPARETLGASLADSRARGSYMGFSRLGLALGGALGYTGGGWMYD 358 + G I PA + + + D +G G +L +G +Y Sbjct: 306 IMVLLASGGI-GMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA 360
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 66.6 bits (162), Expect = 3e-15 Identities = 56/231 (24%), Positives = 90/231 (38%), Gaps = 25/231 (10%) Query: 7 IILTGASGLIGSAIADALYKSGMNLVLACKRSQKLQDRYLSDDKSKRAYFWY-GDLTNEK 65 +TGA+ IG A+A L G ++ +KL+ S R + D+ + Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70 Query: 66 ACRELVEYAVQQMGGVDVLINCAGVFNFSALEEMTYSRITDTISTNLLAPIYLTHLVLPY 125 A E+ ++MG +D+L+N AGV + ++ T S N + V Y Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130 Query: 126 IKTSACPIIVNISSIAGFSSLPEGACYAASKWGLNGFIHSIREELRKKSIHICNI-SPCQ 184 + IV + S A YA+SK F + EL + +I CNI SP Sbjct: 131 MMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR-CNIVSPGS 189 Query: 185 VKT-----LSHHSDTAIRTIA-----------------PENIANAVILVLS 213 +T L + A + I P +IA+AV+ ++S Sbjct: 190 TETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 102 bits (256), Expect = 7e-26 Identities = 74/361 (20%), Positives = 138/361 (38%), Gaps = 60/361 (16%) Query: 317 RVLILGVNGFIGNHLTERLLQDDRYEVYGLDIGSD--------AISRFLGNPAFHFVEGD 368 + L+ G GFIG H+++RLL+ ++V G+D +D A L P F F + D Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 369 ISIHSEWIE--YHIKKCDVILPLVAIATPIEYT-RNPLRVFELDFEENLKIVRDCVKYN- 424 ++ E + + + + + Y+ NP + + L I+ C Sbjct: 61 LADR-EGMTDLFASGHFERVFISPHRLA-VRYSLENPHAYADSNLTGFLNILEGCRHNKI 118 Query: 425 KRIVFPSTSEVYGMCDDKEFDEDTSRLIVGPINKQRWIYSVSKQLLDRVIWAYGVKEGLK 484 + +++ S+S VYG+ F D ++ +Y+ +K+ + + Y GL Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTD------DSVDHPVSLYAATKKANELMAHTYSHLYGLP 172 Query: 485 FTLFRPFNWMGPRLDNLDAARIGSSRAITQLILNLVEGSPIKLVDGGAQKRCFTDIHDGI 544 T R F GP D A ++A+ +EG I + + G KR FT I D Sbjct: 173 ATGLRFFTVYGPWGRP-DMALFKFTKAM-------LEGKSIDVYNYGKMKRDFTYIDDIA 224 Query: 545 EALFRIIEN---------------RDGCCDGRIINIGNPTNEASIRELAEMLLTSFENHE 589 EA+ R+ + R+ NIGN ++ + + + L + Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGN-SSPVELMDYIQALEDALGIEA 283 Query: 590 LRDHFPPFAGFKDIESSAYYGKGYQDVEYRTPSIKNARRILHWQPEIAMQQTVTETLDFF 649 ++ P G DV + K ++ + PE ++ V ++++ Sbjct: 284 KKNMLPLQPG---------------DVLETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328 Query: 650 L 650 Sbjct: 329 R 329
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 32.5 bits (74), Expect = 0.002 Identities = 25/90 (27%), Positives = 36/90 (40%), Gaps = 4/90 (4%) Query: 220 LMYDLITCLTTTPLRLLSLVGSAIALLGF-TFSVLLVALRLIFGPEWAGGGVFTLFAVLF 278 L L+ L + L V A+A G+ L A +L+ G E G G F L A L Sbjct: 166 LWGGLLFNLLGGFVSLGDAVIGAMA--GYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALG 223 Query: 279 MFIGAQFV-GMGLLGEYIGRIYNDVRARPR 307 ++G Q + + LL +G R Sbjct: 224 AWLGWQALPIVLLLSSLVGAFMGIGLILLR 253
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.6 bits (71), Expect = 0.003 Identities = 17/66 (25%), Positives = 27/66 (40%), Gaps = 10/66 (15%) Query: 29 LIGPNGAGKSTLLASLAGL------LPASGEIVLAGKSLQHYEGHELAR----QRAYLSQ 78 L G G GKSTL+ +L GL G + + + +EL+ +RA Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRADAEA 660 Query: 79 QQSALS 84 ++ S Sbjct: 661 VKAFFS 666
>OUTRSURFACE#Outer surface protein signature. Length = 273 Score = 29.5 bits (66), Expect = 0.004 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 8/52 (15%) Query: 1 MKKYLLLFGVLSFMPLIAQSDVSLD------INMPGIN--LHLGDQDKRGYY 44 MKKYLL G++ + Q+ SLD +++PG L ++DK G Y Sbjct: 1 MKKYLLGIGLILALIACKQNVSSLDEKNSASVDLPGEMKVLVSKEKDKDGKY 52
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 117 bits (296), Expect = 2e-38 Identities = 36/89 (40%), Positives = 55/89 (61%) Query: 4 TKAEMSEHLFEKLGLSKRDAKDLVELFFEEVRRALENGEQVKLSGFGNFDLRDKNQRPGR 63 K ++ + E L+K+D+ V+ F V L GE+V+L GFGNF++R++ R GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 64 NPKTGEDIPITARRVVTFRPGQKLKSRVE 92 NP+TGE+I I A +V F+ G+ LK V+ Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 388 bits (997), Expect = e-138 Identities = 105/309 (33%), Positives = 179/309 (57%), Gaps = 7/309 (2%) Query: 21 LRAAALFTIVAFSSLISTAALAENNPSDTAKKFKVVTTFTIIQDIAQNIAGDVAVVESIT 80 ++ ++ S++I A + + + +K KVV T +II DI +NIAGD + SI Sbjct: 1 MKKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIV 60 Query: 81 KPGAEIHDYQPTPRDIVKAQSADLILWNGMNLER----WFEKFFESIK---DVPSAVVTA 133 G + H+Y+P P D+ K ADLI +NG+NLE WF K E+ K + V+ Sbjct: 61 PIGQDPHEYEPLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVSD 120 Query: 134 GITPLPIREGPYSGIANPHAWMSPSNALIYIENIRKALVEHDPAHAETYNRNAQAYAEKI 193 G+ + + G +PHAW++ N +I+ +NI K L DP + E Y +N + Y +K+ Sbjct: 121 GVDVIYLEGQNEKGKEDPHAWLNLENGIIFAKNIAKQLSAKDPNNKEFYEKNLKEYTDKL 180 Query: 194 KALDAPLRERLSRIPAEQRWLVTSEGAFSYLAKDYGFKEVYLWPINAEQQGIPQQVRHVI 253 LD +++ ++IPAE++ +VTSEGAF Y +K YG Y+W IN E++G P+Q++ ++ Sbjct: 181 DKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINTEEEGTPEQIKTLV 240 Query: 254 DIIRENKIPVVFSESTISDKPAKQVSKETGAQYGGVLYVDSLSGEKGPVPTYISLINMTV 313 + +R+ K+P +F ES++ D+P K VS++T ++ DS++ + +Y S++ + Sbjct: 241 EKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKEGDSYYSMMKYNL 300 Query: 314 DTIAKGFGQ 322 D IA+G + Sbjct: 301 DKIAEGLAK 309
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 120 bits (301), Expect = 5e-35 Identities = 75/252 (29%), Positives = 127/252 (50%), Gaps = 11/252 (4%) Query: 8 LKGKVALVTGCDTGLGQGMAIGLAEAGCDIIGVN-IVEPRETIEQ-VTALGRRFFSLTAD 65 ++GK+A +TG G+G+ +A LA G I V+ E E + + A R + AD Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 66 LSNIECIPSLLERAVAEFGHIDILVNNAGIIRREDAINFSEKDWDDVMNVNIKSVFFMSQ 125 + + I + R E G IDILVN AG++R + S+++W+ +VN VF S+ Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 126 AVAKQFIKQGNGGKIINVASMLSYQGGIRVPSYTASKSAVMGVTRLLANEWAKHGINVNA 185 +V+K + + G I+ V S + + +Y +SK+A + T+ L E A++ I N Sbjct: 126 SVSKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184 Query: 186 VAPGYMATNNTQQLRKDEERSKEILD--------RIPAGRWGLPDDLKGPVVFLASKASD 237 V+PG T+ L DE +++++ IP + P D+ V+FL S + Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244 Query: 238 YISGYTIAVDGG 249 +I+ + + VDGG Sbjct: 245 HITMHNLCVDGG 256
>PF06917#Periplasmic pectate lyase Length = 555 Score = 993 bits (2568), Expect = 0.0 Identities = 553/555 (99%), Positives = 555/555 (100%) Query: 1 MGFTGNIKGSDAMMINWLSAIRSYVDLVQSVGHSQLNPSPLLADGFDVLTHQPVVWEFPD 60 MGFTGNIKGSDAMMINWLSAIRSYVDLVQSVGHSQLNPSPLLADGFDVLTHQPVVWEFPD Sbjct: 1 MGFTGNIKGSDAMMINWLSAIRSYVDLVQSVGHSQLNPSPLLADGFDVLTHQPVVWEFPD 60 Query: 61 GHHTPISNFASQQNWLRTLDALSLVTQDPQYHQQARIQSGYFMQHGVHNESGLFYWGGHR 120 GHHTPISNFASQQNWLRTLDALSLVTQDPQYHQQARIQSGYFMQHGVHNESGLFYWGGHR Sbjct: 61 GHHTPISNFASQQNWLRTLDALSLVTQDPQYHQQARIQSGYFMQHGVHNESGLFYWGGHR 120 Query: 121 FLNLDTLKTEGPASKDQVYELKHHLPYYDLLITIDRERTLNFLQGFWHAHVEDWKTLDLG 180 FLNLDTLKTEGPASKDQV+ELKHHLPYYDLL+TIDRERTLNFLQGFWHAHVEDWKTLDLG Sbjct: 121 FLNLDTLKTEGPASKDQVHELKHHLPYYDLLVTIDRERTLNFLQGFWHAHVEDWKTLDLG 180 Query: 181 RHGNYSKQRDPQVFTHPRYDVVNPAELPKLPETKGLTFVNAGTDLIYAAYKYAEYTGDAA 240 RHGNYSKQRDPQVFTHPRYDVVNPAELPKLPETKGLTFVNAGTDLIYAAYKYAEYTGDAA Sbjct: 181 RHGNYSKQRDPQVFTHPRYDVVNPAELPKLPETKGLTFVNAGTDLIYAAYKYAEYTGDAA 240 Query: 241 AAAWGKHLYRQYVLARNPETGLPVYQFSSPQQRQPIPADDNQTQSWYGDRAKRQFGPEFG 300 AAAWGKHLYRQYVLARNPETGLPVYQFSSPQQRQPIPADDNQTQSWYGDRAKRQFGPEFG Sbjct: 241 AAAWGKHLYRQYVLARNPETGLPVYQFSSPQQRQPIPADDNQTQSWYGDRAKRQFGPEFG 300 Query: 301 EIAREANVLFRDMRPLLIDNPLAMLDILRQQPDAEVLQWVIDGLKNYYRFAYDVESNTLR 360 EIAREANVLFRDMRPLLIDNPLAMLDILRQQPDAEVLQWVIDGLKNYYRFAYDVESNTLR Sbjct: 301 EIAREANVLFRDMRPLLIDNPLAMLDILRQQPDAEVLQWVIDGLKNYYRFAYDVESNTLR 360 Query: 361 PLWNDGQDMSGYVLPRDGYYGVKGTVISPFPLDVDYLLPLVRAWRLSEDEELLDLIGVLL 420 PLWNDGQDMSGYVLPRDGYYGVKGTVISPFPLDVDYLLPLVRAWRLSEDEELLDLIGVLL Sbjct: 361 PLWNDGQDMSGYVLPRDGYYGVKGTVISPFPLDVDYLLPLVRAWRLSEDEELLDLIGVLL 420 Query: 421 LRWQLAELNKTQRRATLMAAQRPIASPYLLLALVELAEHCQCPTLFTLAWQIGDDLFKRH 480 LRWQLAELNKTQRRATLMAAQRPIASPYLLLALVELAEHCQCPTLFTLAWQIGDDLFKRH Sbjct: 421 LRWQLAELNKTQRRATLMAAQRPIASPYLLLALVELAEHCQCPTLFTLAWQIGDDLFKRH 480 Query: 481 YHRGLFVESAQHRYFRIDNPIALALLTLIAAKQDKLAAIPQFITNGGYIHGDYRVNGESR 540 YHRGLFVESAQHRYFRIDNPIALALLTLIAAKQDKLAAIPQFITNGGYIHGDYRVNGESR Sbjct: 481 YHRGLFVESAQHRYFRIDNPIALALLTLIAAKQDKLAAIPQFITNGGYIHGDYRVNGESR 540 Query: 541 TLYDIDFIYPTLLNQ 555 TLYDIDFIYPTLLNQ Sbjct: 541 TLYDIDFIYPTLLNQ 555
>BACINVASINB#Salmonella/Shigella invasin protein B signature. Length = 593 Score = 37.0 bits (85), Expect = 1e-04 Identities = 24/95 (25%), Positives = 42/95 (44%), Gaps = 10/95 (10%) Query: 60 EVRIGDRVVNNLAPKSRGIAM-VFQNYALYPHMTVKENLAFGLKLSKLPKDQIEAQVAEA 118 +V +G V N A + G+A VF A E LA L++ DQI+ + ++ Sbjct: 504 KVALGMEVTNTAAQSAGGVAEGVFIKNA-------SEALA-DFMLARFAMDQIQQWLKQS 555 Query: 119 AKIL-ELEDLLDRLPRQLSGGQAQRVAVGRAIVKK 152 +I E + + L + +S Q R I+++ Sbjct: 556 VEIFGENQKVTAELQKAMSSAVQQNADASRFILRQ 590
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 459 bits (1182), Expect = e-151 Identities = 141/825 (17%), Positives = 294/825 (35%), Gaps = 78/825 (9%) Query: 46 TLYLELVVNDRNFGST-VPISYRNNRYY----LSQSQLRTIGLPISEPLAPEIAIDN--- 97 T +++ +N+ + V + ++ L+++QL ++GL ++ + + Sbjct: 77 TYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNT-ASVSGMNLLADDAC 135 Query: 98 ------MAGVNVKYDGENQRLLINVPSEWLPKQQIEVTEQDDFNLAQSSLGALFNYDIYA 151 + + D QRL + +P ++ + + ++ L NY+ Sbjct: 136 VPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWD--PGINAGLLNYNFSG 193 Query: 152 TQGYPYSSLTHFSAWTEQRIFDRFGLLSNTGVYRTHFPSNNNTDDAKGYIRFDTQWQKND 211 A+ + G + S++++ +K + W + D Sbjct: 194 NSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERD 253 Query: 212 EEHLL-RYSTGDLITGALPWSSAIRLGGIQIARHFAIRPDLITYPLPQFSGQAAVPSTVD 270 L R + GD T + I G Q+A + PD P G A + V Sbjct: 254 IIPLRSRLTLGDGYTQGDIFDG-INFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVT 312 Query: 271 LYIDNFRTQSANINPGPFVINNAPRINGAGQATIVTTDALGRQISTSVPFYVASTLLKPG 330 + + + ++ + PGPF IN+ +G + +A G +VP+ L + G Sbjct: 313 IKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREG 372 Query: 331 VWDFSLSGGALRRNYAIRSADYGEMVASGVVRYGTTPWLTLEGRGDIAKEMHVIGGGVNF 390 +S++ G R A + + +G T+ G +A G+ Sbjct: 373 HTRYSITAGEYRSGNAQQEKPR---FFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGK 429 Query: 391 RMGLLGVLNSAYSISNTSNGAFNNVAEPLNTNNATPNRLPSPAASRRGRGNQRSLGYSYS 450 MG LG L+ + +N++ + + + + L + + + G N + +GY YS Sbjct: 430 NMGALGALSVDMTQANST------LPDDSQHDGQSVRFLYNKSLNESGT-NIQLVGYRYS 482 Query: 451 NA-FFNL--------NAQHIISSDEYSD----LANYKTPSLLSRRMTQLTGSLSLGSYGT 497 + +FN N +I + D +Y + R QLT + LG T Sbjct: 483 TSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTST 542 Query: 498 V----------GSGYFDVRDALGEQTRLINISYSTSLLHNSNFYSALNRELGRKGYNVQL 547 + G+ D + G T +I+++ S N + + + L Sbjct: 543 LYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQ------KGRDQMLAL 596 Query: 548 VWSIPLGPR-----------GSSSISATRTNDNQWIQQLNYSRSAPSNGGLGWNL--AYA 594 +IP S+S S + + + + + L +++ YA Sbjct: 597 NVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYA 656 Query: 595 NSTNNNNQ-YQQADIVWRTSMMESRMGLYGNSNNYNYWGGLTGSLVVMNRSVYASNMIND 653 + N+ A + +R + +G + + + G++G ++ V +ND Sbjct: 657 GGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLND 716 Query: 654 AFALVSTNGFSNIPVSYENQLIGTTNAKGYLLIPTVASYYQAKFQIDPMNLPADVMLPNV 713 LV G + V ENQ T+ +GY ++P Y + + +D L +V L N Sbjct: 717 TVVLVKAPGAKDAKV--ENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNA 774 Query: 714 ERRLAIGERSGYLINFPIKRISAVNIRITDASGQDLPKGSAIYTTGNIPISYVGWDGMVY 773 + + F + + + +T + + LP G+ + + + V +G VY Sbjct: 775 VANVVPTRGAIVRAEFKARVGIKLLMTLTH-NNKPLPFGAMVTSESSQSSGIVADNGQVY 833 Query: 774 IEQVAQLNNLRI-IRADNGTQCYSQFKLKTTEGIQDAG--TTVCR 815 + + +++ + C + ++L Q + CR Sbjct: 834 LSGMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 31.2 bits (70), Expect = 0.003 Identities = 21/83 (25%), Positives = 39/83 (46%), Gaps = 7/83 (8%) Query: 200 PSCTFDGPQKVDFGIVTSSNL-NNGGIERDLDFNITCKTDYGHYSATAAIFTQTSSADNN 258 P+CT + V++G + NL +GG ++D ++ C G T + ++ N Sbjct: 37 PACTVQNAE-VNWGDIEIQNLVQSGGNQKDFTVDMNCPYSLG----TMKVTITSNGQTGN 91 Query: 259 YIKVKDSQN-QEDRLLIKISDTN 280 I V ++ D LLI + ++N Sbjct: 92 SILVPNTSTASGDGLLIYLYNSN 114
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 88.1 bits (218), Expect = 5e-21 Identities = 38/129 (29%), Positives = 67/129 (51%), Gaps = 16/129 (12%) Query: 441 LFDSNSTKLKLNSQT--NEMMMELLSLVERNKEKKILIVGHSDNTGSSSMNMALSEQRAL 498 LF+ N LK Q +++ +L +L K+ ++++G++D GS + N LSE+RA Sbjct: 222 LFNFNKATLKPEGQAALDQLYSQLSNL--DPKDGSVVVLGYTDRIGSDAYNQGLSERRAQ 279 Query: 499 ALRDWLIKRSDITVDNFITKGMGASEPVATNHTEAGR---------EQNRRVEVLILPTQ 549 ++ D+LI + I D +GMG S PV N + + +RRVE+ + + Sbjct: 280 SVVDYLISKG-IPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVKGIK 338 Query: 550 DRTRMTEPK 558 D +T+P+ Sbjct: 339 D--VVTQPQ 345
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 750 bits (1939), Expect = 0.0 Identities = 278/571 (48%), Positives = 392/571 (68%), Gaps = 2/571 (0%) Query: 1 MISGILVSPGIAFGKALLLKEDEIVINRKKISADQVEQEVERFKAGRAKAAEQLEAIKTK 60 I+GI S G+A KA + E + I + I V E+E+ A K+ E+L AIK + Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSI--TDVSTEIEKLTAALEKSKEELRAIKDQ 61 Query: 61 AGVSLGEEKAAIFEGHIMLLEDEELEQEIIALIKDEHASADAAAYSVIEGQAKALEELDD 120 S+G +KA IF H+++L+D EL I I++E +A+ A V + E +D+ Sbjct: 62 TEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDN 121 Query: 121 EYLKERAADVRDIGKRLLKNILGLNIVDLSAIQDEVILVATDLTPSETAQLNLDKVLGFI 180 EY+KERAAD+RD+ KR+L +++G+ L+ I +E +++A DLTPS+TAQLN V GF Sbjct: 122 EYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFA 181 Query: 181 TDIGGRTSHTSIMARSLELPAIVGTSNVTKQVKNDDYLILDAVNNKVYLNPTADVIEQLK 240 TDIGGRTSH++IM+RSLE+PA+VGT VT+++++ D +I+D + V +NPT + ++ + Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241 Query: 241 AVKNQYITEKNELAKLKDLPAITLDGHQVEVVANIGTVRDIAGAERNGAEGVGLYRTEFL 300 + + +K E AKL P+ T DG VE+ ANIGT +D+ G NG EG+GLYRTEFL Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301 Query: 301 FMDRDSLPTEEEQFQAYKAVAEAMGSQAVIVRTMDIGGDKDLPYMNLPKEENPFLGWRAI 360 +MDRD LPTEEEQF+AYK V + M + V++RT+DIGGDK+L Y+ LPKE NPFLG+RAI Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361 Query: 361 RIAMDRKEILHAQLRAILRASAFGKLRIMFPMIISVEEVRELKAELELLKSQLREENKAF 420 R+ +++++I QLRA+LRAS +G L++MFPMI ++EE+R+ KA ++ K +L E Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDV 421 Query: 421 DETIEVGVMVETPAAAVIARHLAKEVDFFSIGTNDLTQYTLAVDRGNELISHLYNPMSPS 480 ++IEVG+MVE P+ AV A AKEVDFFSIGTNDL QYT+A DR NE +S+LY P P+ Sbjct: 422 SDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPA 481 Query: 481 VLGLIKQVIDASHAEGKWTGMCGELAGDERATLLLLGMGLDEFSMSAISIPRIKKIIRNT 540 +L L+ VI A+H+EGKW GMCGE+AGDE A LLLG+GLDEFSMSA SI + + Sbjct: 482 ILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKL 541 Query: 541 NFEDVKVLAEQALAQPTAKELMDLVTTFIEE 571 + E++K A++AL TA+E+ LV + Sbjct: 542 SKEELKPFAQKALMLDTAEEVEQLVKKTYLK 572
>PF06580#Sensor histidine kinase Length = 349 Score = 38.3 bits (89), Expect = 5e-05 Identities = 41/231 (17%), Positives = 83/231 (35%), Gaps = 59/231 (25%) Query: 239 ELRSPLARLQLAIGLAHQNPDNVDNAL----QRIEHESERLDKMIGEL-------LALSR 287 ++ S QL A NP + NAL I + + +M+ L L S Sbjct: 153 KMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLRYSN 212 Query: 288 AENHSLADD----DEYFDLQEL-------VKVVVNDARYEAQLPGVEIQLEVAAQSEYTV 336 A SLAD+ D Y L + + +N A + Q+P + +Q V Sbjct: 213 ARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLV-------- 264 Query: 337 KGNAELMRRAIENIVRNALRFSASGQQVKVTLSALDKRYQIQVADQGPGVEENKLSSIFD 396 EN +++ + G ++ + + + ++V + G +N Sbjct: 265 -----------ENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN------- 306 Query: 397 PFVRVKSAMSGKGYGLGLAITHK-VILAHGGQVEAR-NGEQGGLVITLRVP 445 + + G GL + + + +G + + + + +QG + + +P Sbjct: 307 ---------TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 89.1 bits (221), Expect = 1e-22 Identities = 32/122 (26%), Positives = 63/122 (51%), Gaps = 1/122 (0%) Query: 2 KILLVDDDLELGTMLKEYLGGEGFTAKHVLTGKAGIDGALSGDYTALILDIMLPDMSGID 61 IL+ DDD + T+L + L G+ + +GD ++ D+++PD + D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VLRQVRK-KSRLPIIMLTAKGDNIDRVIGLEMGADDYMPKPCYPRELVARLRAVLRRFEE 120 +L +++K + LP+++++A+ + + E GA DY+PKP EL+ + L + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 121 QP 122 +P Sbjct: 125 RP 126
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 60.2 bits (146), Expect = 6e-12 Identities = 40/259 (15%), Positives = 87/259 (33%), Gaps = 25/259 (9%) Query: 27 FRWISPPDKPSYITAVAEIRDLEQTVLADGTIKAQKQVTVGAQVSGQIKALHVTLGQQVE 86 F+ +S + + + E Q + K+ V +I + Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235 Query: 87 KNQLVAEI--DDLAQQNALKDAEEALKNVQAQRAAKIA--TQKNNQLTYQRQQQILAKGV 142 + + + ++A+ + E + + Q +++ +++ L Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL---- 291 Query: 143 GVRADFDS-IKATLEATQAEISALDAQIAQAEIAVSTAKLNLGYTKISSPIAGTVVAIPV 201 V F + I L T I L ++A+ E + I +P++ V + V Sbjct: 292 -VTQLFKNEILDKLRQTTDNIGLLTLELAKNE-------ERQQASVIRAPVSVKVQQLKV 343 Query: 202 -EEGQTVNAVQSAPTIIKVAQLDTMTVEAQISEADVVKVKTGMPVYFTILGEPEKRF--- 257 EG V + ++ V + DT+ V A + D+ + G + P R+ Sbjct: 344 HTEGGVVTTAE--TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYL 401 Query: 258 SATLRAIEPAPDSINDETT 276 ++ I D+I D+ Sbjct: 402 VGKVKNI--NLDAIEDQRL 418 Score = 48.7 bits (116), Expect = 3e-08 Identities = 17/167 (10%), Positives = 57/167 (34%), Gaps = 17/167 (10%) Query: 10 RLIGWVVLLLIIGGLLFFRWISPPDKPSYITAVAEIRDLEQTVLADGTIKAQKQV-TVGA 68 RL+ + ++ ++ + + + +E A+G + + + Sbjct: 58 RLVAYFIMGFLVIAFI---L-------------SVLGQVEIVATANGKLTHSGRSKEIKP 101 Query: 69 QVSGQIKALHVTLGQQVEKNQLVAEIDDLAQQNALKDAEEALKNVQAQRAAKIATQKNNQ 128 + +K + V G+ V K ++ ++ L + + +L + ++ ++ + Sbjct: 102 IENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIE 161 Query: 129 LTYQRQQQILAKGVGVRADFDSIKATLEATQAEISALDAQIAQAEIA 175 L + ++ + + + + + S Q Q E+ Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELN 208
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 92.6 bits (230), Expect = 8e-24 Identities = 33/134 (24%), Positives = 62/134 (46%) Query: 2 KILIAEDNAHIRNGLMEVLAHEGYRPIAAENGVQALALYRQQQPDFIILDIMMPELDGYK 61 IL+A+D+A IR L + L+ GY N D ++ D++MP+ + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VCREIRKHDWQTPIIFLSAKDEEIDRVIGLELGADDYISKPFGIHEMRARIKTIVRRCLR 121 + I+K P++ +SA++ + + E GA DY+ KPF + E+ I + R Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 122 KVPESAEDAGFPFG 135 + + +D+ Sbjct: 125 RPSKLEDDSQDGMP 138
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.041 Identities = 10/23 (43%), Positives = 14/23 (60%) Query: 30 MVALLGPSGSGKTTLLRIIAGLE 52 V L G G GK+TL+ + GL+ Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD 620
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 66.0 bits (161), Expect = 1e-14 Identities = 59/321 (18%), Positives = 115/321 (35%), Gaps = 70/321 (21%) Query: 1 MKILITGVSGYLGSQLANALMLE-HEVVGTVRAGSVCNRITDIGNVNL------------ 47 MK L+TG +G++G ++ L+ H+VVG + + D +V+L Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGI-------DNLNDYYDVSLKQARLELLAQPG 53 Query: 48 -----INVTDSGWIDKVL-SFSPDVVINTAALYGRKGELLS--ELVDANIQFPLRILE-- 97 I++ D + + S + V + + L + D+N+ L ILE Sbjct: 54 FQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGC 113 Query: 98 --------MLVST----GKGLFFQCGTSLPAD--VSQYALTKNQFTELAREYCNKFSGKF 143 + S+ G T D VS YA TK +A Y + + Sbjct: 114 RHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPA 173 Query: 144 IELKLEHFFGPFDDST----KFTTYVINSCRSHSDLKL-TAGLQRRDFIYINDLINA--- 195 L+ +GP+ KFT + + + G +RDF YI+D+ A Sbjct: 174 TGLRFFTVYGPWGRPDMALFKFT----KAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIR 229 Query: 196 ------------FKIMISKSESLISGESISIGSGHAVTIKEFVETVAKMTSYQGNLQFGA 243 + + S+ +IG+ V + ++++ + + Sbjct: 230 LQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNM-- 287 Query: 244 IPTRENELMYSCASLARIQEL 264 +P + +++ + A + E+ Sbjct: 288 LPLQPGDVLETSADTKALYEV 308
>SECA#SecA protein signature. Length = 901 Score = 1373 bits (3556), Expect = 0.0 Identities = 805/904 (89%), Positives = 852/904 (94%), Gaps = 3/904 (0%) Query: 1 MLIKLLTKVFGSRNDRTLRRMQKVVDVINRMEPDIEKLTDTELRAKTDEFRERLAKGEVL 60 MLIKLLTKVFGSRNDRTLRRM+KVV++IN MEP++EKL+D EL+ KT EFR RL KGEVL Sbjct: 1 MLIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVL 60 Query: 61 ENLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNA 120 ENLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNA Sbjct: 61 ENLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNA 120 Query: 121 LSGRGVHVVTVNDYLAQRDAENNRPLFEFLGLSIGINLPNMTAPAKRAAYAADITYGTNN 180 L+G+GVHVVTVNDYLAQRDAENNRPLFEFLGL++GINLP M APAKR AYAADITYGTNN Sbjct: 121 LTGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNN 180 Query: 181 EFGFDYLRDNMAFSPEERVQRQLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYIRVN 240 E+GFDYLRDNMAFSPEERVQR+LHYALVDEVDSILIDEARTPLIISGPAEDSSEMY RVN Sbjct: 181 EYGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVN 240 Query: 241 KLIPKLIRQEKEDSDSFQGEGHFSVDEKSRQVHLTERGLILIEQMLVEAGIMDEGESLYS 300 K+IP LIRQEKEDS++FQGEGHFSVDEKSRQV+LTERGL+LIE++LV+ GIMDEGESLYS Sbjct: 241 KIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYS 300 Query: 301 PANIMLMHHVTAALRAHVLFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAK 360 PANIMLMHHVTAALRAH LFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAK Sbjct: 301 PANIMLMHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAK 360 Query: 361 EGVEIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTIVVPTNRPMIR 420 EGV+IQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDT+VVPTNRPMIR Sbjct: 361 EGVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIR 420 Query: 421 KDLADLVYMTEQEKIGAIIEDIRERTANGQPVLVGTISIEKSEVVSAELTKAGIEHKVLN 480 KDL DLVYMTE EKI AIIEDI+ERTA GQPVLVGTISIEKSE+VS ELTKAGI+H VLN Sbjct: 421 KDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLN 480 Query: 481 AKFHAMEAEIVSQAGQPGAVTIATNMAGRGTDIVLGGSWQSEIAALEDPTEEQIAAIKAA 540 AKFHA EA IV+QAG P AVTIATNMAGRGTDIVLGGSWQ+E+AALE+PT EQI IKA Sbjct: 481 AKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKAD 540 Query: 541 WQIRHDAVLASGGLHIIGTERHESRRIDNQLRGRAGRQGDAGSSRFYLSMEDALMRIFAS 600 WQ+RHDAVL +GGLHIIGTERHESRRIDNQLRGR+GRQGDAGSSRFYLSMEDALMRIFAS Sbjct: 541 WQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFAS 600 Query: 601 DRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIY 660 DRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIY Sbjct: 601 DRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIY 660 Query: 661 SQRNELLDVSDVSETINSIREDVFKTTIDSYIPTQSLEEMWDIEGLEQRLKNDFDLDMPI 720 SQRNELLDVSDVSETINSIREDVFK TID+YIP QSLEEMWDI GL++RLKNDFDLD+PI Sbjct: 661 SQRNELLDVSDVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPI 720 Query: 721 AKWLEDEPQLHEETLRERILQQAIETYQRKEEVVGIEMMRNFEKGVMLQTLDSLWKEHLA 780 A+WL+ EP+LHEETLRERIL Q+IE YQRKEEVVG EMMR+FEKGVMLQTLDSLWKEHLA Sbjct: 721 AEWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLA 780 Query: 781 AMDYLRQGIHLRGYAQKDPKQEYKRESFAMFAAMLESLKYEVISVLSKVQVRMPEEVEAL 840 AMDYLRQGIHLRGYAQKDPKQEYKRESF+MFAAMLESLKYEVIS LSKVQVRMPEEVE L Sbjct: 781 AMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEEVEEL 840 Query: 841 EVQRREEAERLARQQQLSHQTDNSALMSEEEVKVANSLERKVGRNDPCPCGSGKKYKQCH 900 E QRR EAERLA+ QQLSHQ D+SA + + ERKVGRNDPCPCGSGKKYKQCH Sbjct: 841 EQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTG---ERKVGRNDPCPCGSGKKYKQCH 897 Query: 901 GRLQ 904 GRLQ Sbjct: 898 GRLQ 901
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 53.2 bits (128), Expect = 7e-10 Identities = 47/201 (23%), Positives = 72/201 (35%), Gaps = 18/201 (8%) Query: 171 IVKAVERCGLKVDQLIFAGLAASYAVLTEDERELGVCVVDIGGGTMDMAVYTGGALRHTK 230 I ++ + G + LI +AA+ G VVDIGGGT ++AV + + ++ Sbjct: 126 IRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSS 185 Query: 231 VIPYAGNVVTSDI------AYAFGTPPTDAEAIKVRHGCALGSIVSKDESVEVPSVGGRP 284 + G+ I Y AE IK G A + V V GR Sbjct: 186 SVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAY-----PGDEVREIEVRGRN 240 Query: 285 -----PRSLQRQTLAEVIEPRYTELLNLVNDEILQLQEQLRQQGVKHHLAAGIVLTGGAA 339 PR E++E E L + ++ EQ + G+VLTGG A Sbjct: 241 LAEGVPRGF-TLNSNEILEA-LQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGA 298 Query: 340 QIDGLAECAQRVFHAQVRIGQ 360 + L V + + Sbjct: 299 LLRNLDRLLMEETGIPVVVAE 319
>PF06580#Sensor histidine kinase Length = 349 Score = 35.2 bits (81), Expect = 5e-04 Identities = 18/82 (21%), Positives = 28/82 (34%), Gaps = 5/82 (6%) Query: 2 AIRRQPLIAPWLWPGLLAAGLIVAVALLAFAAIWHHAPTADWQSVWHDR-YLWHVIRFTF 60 A R WL + L V A + +W A T S+W ++ Sbjct: 58 AYRSFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVANT----SIWRLLAFINTKPVAFT 113 Query: 61 WQAFLSALLSVIPAIFLARALY 82 LS + +V+ F+ LY Sbjct: 114 LPLALSIIFNVVVVTFMWSLLY 135
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 60.7 bits (147), Expect = 4e-12 Identities = 40/124 (32%), Positives = 52/124 (41%), Gaps = 18/124 (14%) Query: 391 FTSDGAFRTGEATLSEEFINK-KNIERLGLALAPWPGDIEVIGHTDNKPFRSTSGNNNLK 449 SD F +ATL E + L P G + V+G+TD R S N Sbjct: 217 LKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTD----RIGSDAYNQG 272 Query: 450 LSAARASVVADKLRESTQINETHQREISAIGRGESDPLADNATEEGRKR---------NR 500 LS RA V D L S I ISA G GES+P+ N + ++R +R Sbjct: 273 LSERRAQSVVDYL-ISKGIPADK---ISARGMGESNPVTGNTCDNVKQRAALIDCLAPDR 328 Query: 501 RVDI 504 RV+I Sbjct: 329 RVEI 332
>ICENUCLEATIN#Ice nucleation protein signature. Length = 1258 Score = 32.0 bits (72), Expect = 0.008 Identities = 51/236 (21%), Positives = 88/236 (37%), Gaps = 8/236 (3%) Query: 407 TGMSVSATGISVSTTGTSLSVTGMSTSVTGVSVGFTLIGTS--FTGVSTSFTGVGTSFTG 464 +G + I ++T G++LS T S + G T +S G ++ T S Sbjct: 150 SGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLV 209 Query: 465 ASNSLTGVSNSMTGCSSSFTGTSNSMTGSSHSMTGMSTSITGHSMSQ-TGSSSSITGDST 523 A T + + + + T M GS + ST G S G S+ T Sbjct: 210 AGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGED 269 Query: 524 SFTGSSVSSTGSSVSTTGVSTSTTGSSTSTTGCSVSTTGSSTSTTGNSVSMTG----NST 579 S + ST ++ + ++ + T+ S+ ST T G + T T Sbjct: 270 SSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQT 329 Query: 580 STTGCSISTTGSSIGTVGSSISTTGSSVSTTGSSISTTGLSVSYTGAQYSDVGVDL 635 + G ++ S GT G S+ + +T ++ + L+ Y Q + G DL Sbjct: 330 AQKGSDLTAGYGSTGTAGDD-SSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDL 384 Score = 30.9 bits (69), Expect = 0.022 Identities = 31/143 (21%), Positives = 63/143 (44%), Gaps = 6/143 (4%) Query: 492 GSSHSMTGMSTSITGHSMSQTGSSSSITGDSTSFTGSSVSSTGSSVSTTGVSTSTTGSST 551 GS+ + S+ I G+ +QT +SI T+ GS+ ++ S T G +++T + Sbjct: 629 GSTSTAGADSSLIAGYGSTQTAGYNSIL---TAGYGSTQTAQEGSDLTAGYGSTSTAGAD 685 Query: 552 STTGCSVSTTGSSTSTTGNSVSMTGNSTSTTGCSISTTGSSIGTVGSS---ISTTGSSVS 608 S+ +T ++ + + T+ G +++ S T G+ I+ GS+ + Sbjct: 686 SSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQT 745 Query: 609 TTGSSISTTGLSVSYTGAQYSDV 631 + S T G + T + S + Sbjct: 746 ASYHSSLTAGYGSTQTAREQSVL 768 Score = 30.1 bits (67), Expect = 0.038 Identities = 32/143 (22%), Positives = 62/143 (43%), Gaps = 6/143 (4%) Query: 492 GSSHSMTGMSTSITGHSMSQTGSSSSITGDSTSFTGSSVSSTGSSVSTTGVSTSTTGSST 551 GS+ + S+ I G+ +QT S T+ GS+ ++ S TG +++T + Sbjct: 485 GSTSTAGYESSLIAGYGSTQTAGYGSTL---TAGYGSTQTAQNESDLITGYGSTSTAGAN 541 Query: 552 STTGCSVSTTGSSTSTTGNSVSMTGNSTSTTGCSISTTGSSIGTVGSS---ISTTGSSVS 608 S+ +T +++ + + T+ G ++ S GT GS I+ GS+ + Sbjct: 542 SSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQT 601 Query: 609 TTGSSISTTGLSVSYTGAQYSDV 631 + S T G + T + S + Sbjct: 602 ASYHSSLTAGYGSTQTAREQSVL 624
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 104 bits (261), Expect = 3e-29 Identities = 67/252 (26%), Positives = 106/252 (42%), Gaps = 10/252 (3%) Query: 3 KTILITGALSGIGNTATKLFSEMGYNVVFSGRRPEEGRVILDDLKRINKDVLYVNADMNS 62 K ITGA GIG + + G ++ PE+ ++ LK + AD+ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 63 ESDIKHLIEMTLERFGSLDVAVNCAGTVGETAEIQAVTQDNFHLVFNTNVLGTLLAMKYQ 122 + I + G +D+ VN AG + I +++ + + F+ N G A + Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVL-RPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 123 IPVMVERGKGSIINISSIAGLVGLPSTGIYVASKHAIEGLTKTAALEVATTGVRINSISP 182 M++R GSI+ + S V S Y +SK A TK LE+A +R N +SP Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 183 GPVEGKMFDRFLGHDENNKKAFIE--------MMPNKRFTTQEEVAHTIVFLAEDNVTAI 234 G E M L DEN + I+ +P K+ ++A ++FL I Sbjct: 188 GSTETDM-QWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246 Query: 235 TGQTITIDGGYT 246 T + +DGG T Sbjct: 247 TMHNLCVDGGAT 258
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 122 bits (307), Expect = 6e-36 Identities = 76/251 (30%), Positives = 118/251 (47%), Gaps = 16/251 (6%) Query: 2 NLFISGGASGIGRSVVIAALSKGWNV-GFSYHNNKEGAQQLLDIAVAEFPRQLCRAYQLD 60 FI+G A GIG +V S+G ++ Y+ K A A + D Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA----FPAD 65 Query: 61 VIDSGAVEYVGDRLLVDFSNIDAVVCNAGIDLPGNLVSMTDEDWALVLNTNLTGTFYLIR 120 V DS A++ + R+ + ID +V AG+ PG + S++DE+W + N TG F R Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 121 YFLPLFLANKYGRIVTL-SSLAKDGSSGQAAYAASKAGLVGLTKTTAKEYGHFGITANVV 179 + + G IVT+ S+ A + AAYA+SKA V TK E + I N+V Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 180 VPGLINTEI-----IGDD-----IKGIKNFFAQYAPVGRLGSPSEVAEAILFLVAKESSY 229 PG T++ ++ IKG F P+ +L PS++A+A+LFLV+ ++ + Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245 Query: 230 VNGAVFNVTGG 240 + V GG Sbjct: 246 ITMHNLCVDGG 256
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 72.8 bits (178), Expect = 3e-17 Identities = 54/255 (21%), Positives = 95/255 (37%), Gaps = 32/255 (12%) Query: 10 VLVTGGTKGIGRATVESFVKAGAKVYGTYFWGDNLDELENHFSQYLNRPVFLQADISDEE 69 +TG +GIG A + GA + + + L+++ + AD+ D Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70 Query: 70 ITTQLIEKIAQENKKIDILILNAAFAPQFKDTYKFRGLLDSIEHNSWPLITYIDC----- 124 ++ +I +E IDIL+ N A + GL+ S+ W ++ Sbjct: 71 AIDEITARIEREMGPIDILV-NVAGVLRP-------GLIHSLSDEEWEATFSVNSTGVFN 122 Query: 125 -----IKQHFGQYPGYVVAITSEGHRSCHITGYDYVAASKAVLETLTKYIG---ARENII 176 K + G +V + S + Y A+SKA TK +G A NI Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAY-ASSKAAAVMFTKCLGLELAEYNIR 181 Query: 177 INCISPGVVDTEAFELVFGKK--AQAFIRKFDPDF--------IVSPEAVGNVSVALCSG 226 N +SPG +T+ ++ + A+ I+ F + P + + + L SG Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241 Query: 227 LMDAVRGQVITVDNG 241 + + VD G Sbjct: 242 QAGHITMHNLCVDGG 256
>DPTHRIATOXIN#Diphtheria toxin signature. Length = 567 Score = 30.5 bits (68), Expect = 0.034 Identities = 19/54 (35%), Positives = 24/54 (44%), Gaps = 9/54 (16%) Query: 622 GIGKTETALALADSLFGGEKSLITINLSEYQEAHTVSQLKGSPPGYVGYGQGGV 675 GIG +A A AD + KS + N S Y G+ PGYV Q G+ Sbjct: 23 GIGAPPSAHAGADDVVDSSKSFVMENFSSYH---------GTKPGYVDSIQKGI 67
>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature. Length = 1104 Score = 30.1 bits (67), Expect = 0.003 Identities = 16/69 (23%), Positives = 26/69 (37%) Query: 35 YVYSSESTYGVEPNEKEVEEIIKMKPDVIDPGETLKLAPSILSLLKKNIRKDTGWRIGGR 94 Y+S G + + VE + ++ + E + +LS K NI K G Sbjct: 199 IQYNSNFRLGTKAQDGVVEALGRLIGNASADPEVINNCIYVLSDFKDNIDKYGSNYSKGN 258 Query: 95 YSFNSVGGG 103 FN + G Sbjct: 259 AVFNLMKGI 267
>PF05272#Virulence-associated E family protein Length = 892 Score = 32.0 bits (72), Expect = 0.009 Identities = 11/18 (61%), Positives = 13/18 (72%) Query: 408 GPNGIGKSTLLKTLLGEY 425 G GIGKSTL+ TL+G Sbjct: 603 GTGGIGKSTLINTLVGLD 620
>DPTHRIATOXIN#Diphtheria toxin signature. Length = 567 Score = 35.1 bits (80), Expect = 5e-04 Identities = 40/135 (29%), Positives = 55/135 (40%), Gaps = 41/135 (30%) Query: 3 HCNTSDLLSLEQALTK-MLSQATPLPATEVIPLSEAAGRITASAIT----------SPIA 51 H NT ++++ AL+ M++QA PL E++ + AA S I P Sbjct: 354 HHNTEEIVAQSIALSSLMVAQAIPL-VGELVDIGFAAYNFVESIINLFQVVHNSYNRPAY 412 Query: 52 VP-----PFANSAMDGYAVRWHELSDEI--------------------PLPVAGVAFAGA 86 P PF + DGYAV W+ + D I PLP+AGV Sbjct: 413 SPGHKTQPFLH---DGYAVSWNTVEDSIIRTGFQGESGHDIKITAENTPLPIAGVLLPTI 469 Query: 87 PFK-DVWPEKTCIRI 100 P K DV KT I + Sbjct: 470 PGKLDVNKSKTHISV 484
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 32.1 bits (73), Expect = 0.004 Identities = 30/177 (16%), Positives = 61/177 (34%), Gaps = 26/177 (14%) Query: 114 EATSRMADIMEQINSLRNMRMRLEQDSRDTQFSLQEAQ-------HQIDIISKDLRRYKI 166 E + I EQ ++ +N + + E + + + + L + Sbjct: 183 EVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS 242 Query: 167 LDKKFLIAKSEL---ERQADRLIN---------WKVKSDILQK------HNSRNQKSFPS 208 L K IAK + E + +N +++S+IL + Sbjct: 243 LLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILD 302 Query: 209 QFKNIDESIILLEKMMKMIEVGIEQLVIIAPIDGTLSVLDI-ELGQQIKSGEKISVI 264 + + ++I LL + E + VI AP+ + L + G + + E + VI Sbjct: 303 KLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVI 359
>PF05272#Virulence-associated E family protein Length = 892 Score = 32.0 bits (72), Expect = 0.007 Identities = 21/74 (28%), Positives = 29/74 (39%), Gaps = 17/74 (22%) Query: 24 PGVKALDNVNLKVRPYSIHALMGENGAGKSTLLKCLFGIYKKDSGSIIFQGQEIEFKSSK 83 PG K D + L G G GKSTL+ L G+ F + + K Sbjct: 591 PGCKF-DYSVV---------LEGTGGIGKSTLINTLVGLD-------FFSDTHFDIGTGK 633 Query: 84 EALEQGVSMVHQEL 97 ++ EQ +V EL Sbjct: 634 DSYEQIAGIVAYEL 647
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 78.3 bits (193), Expect = 8e-19 Identities = 33/150 (22%), Positives = 70/150 (46%), Gaps = 5/150 (3%) Query: 10 QSGSVLIVEDEPKLGQLLVDYLQAAGYRTQWLTNGAEVVATVRQTPPAIILLDLMLPGSD 69 ++L+ +D+ + +L L AGY + +N A + + +++ D+++P + Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 70 GITLCREIR-RFSDIPIVMVTAKTEEIDRLLGLEIGADDYICKPYSPREVVARVKTIL-- 126 L I+ D+P+++++A+ + + E GA DY+ KP+ E++ + L Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 127 --RRCSQQRHQPTDDAPLLINESRFQASYQ 154 RR S+ D PL+ + Q Y+ Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYR 151
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 34.0 bits (78), Expect = 0.001 Identities = 24/90 (26%), Positives = 38/90 (42%), Gaps = 21/90 (23%) Query: 170 LSTLLAAAVTWVLS-------------RGMLAPVKRLVEGTHRLAA------GDFST--R 208 L+TL+AA++ + ++A V+ V H LA G F Sbjct: 77 LATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFPGSFERLYC 136 Query: 209 VAVSSRDELGHLAQDFNQLASSLEKNEQMR 238 V++ + GHL N+LA E+ +QMR Sbjct: 137 AMVAAGETSGHLDAVLNRLADYTEQRQQMR 166
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 126 bits (318), Expect = 5e-34 Identities = 97/435 (22%), Positives = 182/435 (41%), Gaps = 17/435 (3%) Query: 20 FMQTLDTTIVNTALPSIAASLGENPLRMQSVIVSYVLTVAVMLPASGWLADRIGVKWVFF 79 F L+ ++N +LP IA + P V +++LT ++ G L+D++G+K + Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 80 SAIILFTFGSLMCAQSATLNE-LILSRVLQGVGGAMMVPVGRLTVMKIVPREQYMAAMAF 138 II+ FGS++ + LI++R +QG G A + + V + +P+E A Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 139 VTLPGQIGPLVGPALGGFLVEFASWHWIFLINLP-VGVIGALATLLLMPNHKMSTRRFDI 197 + +G VGPA+GG + + HW +L+ +P + +I + L+ FDI Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201 Query: 198 SGFIMLAIGMATLTLALDGHTGLGLSPLAIAGLILCGVIALGSYWWHALGNRFALFSLHL 257 G I++++G+ L + ++ V++ + H L Sbjct: 202 KGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252 Query: 258 FKNKIYTLGLVGSMSARIGSGMLPFMTPIFLQIGLGFSPFHAG-LMMIPMIIGSMGMKRI 316 KN + +G++ M P ++ S G +++ P + + I Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312 Query: 317 IVQVVNRFGYRRVLVNATLLLAVVSLSLPLVAIMGWTLLMPVVLFFQGMLNALRFSTMNT 376 +V+R G VL L+V L+ + + +++F G L+ + ++T Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTV-IST 371 Query: 377 LTLKTLPDRLASSGNSLLSMAMQLSMSIGVSTAGILLGTFAHHQVATNTPATHSAFLYS- 435 + +L + A +G SLL+ LS G++ G LL Q S +LYS Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYLYSN 431 Query: 436 -YLCMAIIIALPALI 449 L + II + L+ Sbjct: 432 LLLLFSGIIVISWLV 446
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 864 bits (2235), Expect = 0.0 Identities = 286/1035 (27%), Positives = 504/1035 (48%), Gaps = 36/1035 (3%) Query: 6 LFIQRPVATTLLTLAITLSGIIGFSLLPVSPLPQVDYPVIMVSASMPGADPETMASSVAT 65 FI+RP+ +L + + ++G + LPV+ P + P + VSA+ PGAD +T+ +V Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63 Query: 66 PLERALGRIAGVNEMTSTS-SLGSTRIILQFDLNRDINGAARDVQAALNAAQSLLPSGMP 124 +E+ + I + M+STS S GS I L F D + A VQ L A LLP + Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123 Query: 125 SRPTYRKMNPSDAPIMIMTLTSDT--FSQGQLYDYASTKLAQKIAQTEGVSDVTVGGSSL 182 + S + +M+ SD +Q + DY ++ + +++ GV DV + G+ Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182 Query: 183 PAVRVELNPSALFNQGVSLDAVRQAISAANVRRPQGSVDAAET------HWQVQANDEIK 236 A+R+ L+ L ++ V + N + G + + + A K Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 237 TAEGYRPLIVHYN-NGSPVRLQDVANVIDSVQDVRNAGMSAGQPAVLLVISREPGANIIA 295 E + + + N +GS VRL+DVA V ++ G+PA L I GAN + Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301 Query: 296 TVDRIRAELPALRASIPASIQLNIAQDRSPTIRASLDEVERSLVIAVALVILVVFIFLRS 355 T I+A+L L+ P +++ D +P ++ S+ EV ++L A+ LV LV+++FL++ Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361 Query: 356 GRATLIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENISRHL- 414 RATLIP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R + Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421 Query: 415 EAGVKPKVAALRGVREVGFTVLSMSISLVAVFIPLLLMAGLPGRLFREFAVTLSVAIGIS 474 E + PK A + + ++ ++ +++ L AVFIP+ G G ++R+F++T+ A+ +S Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481 Query: 475 LVISLTLTPMMCAWLLRSHPKGQQQRIRGFG----KVLLAIQQGYGRSLNWALSHTRWVM 530 ++++L LTP +CA LL+ + GF Y S+ L T + Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541 Query: 531 VVLLSTIALNVWLYISIPKTFFPEQDTGRMMGFIQADQSISFQSMQQKLKDFMQIVGADP 590 ++ +A V L++ +P +F PE+D G + IQ + + Q+ L + Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601 Query: 591 -----AVDSVTGFT-GGSRTNSGSMFISLKPLSER---QETAQQVITRLRGKLAKEPGAN 641 +V +V GF+ G N+G F+SLKP ER + +A+ VI R + +L K Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661 Query: 642 LFLSSVQDIRVGGRHSNAAYQFTLLADDLAALREWEPKVRAALAKL-----PQLADVNSD 696 + ++ I G + ++ L D + + R L + L V + Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718 Query: 697 QQDKGAEMALTYDRETMARLGIDVSEANALLNNAFGQRQISTIYQPLNQYKVVMEVAPEY 756 + A+ L D+E LG+ +S+ N ++ A G ++ K+ ++ ++ Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778 Query: 757 TQDVSSLDKMFVINSNGQSIPLSYFAKWQPANAPLAVNHQGLSAASTISFNLPDGGSLSE 816 +DK++V ++NG+ +P S F + + I G S + Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838 Query: 817 ATAAVERAMTELGVPSTVRGAFAGTAQVFQETLKSQLWLIMAAIATVYIVLGILYESYVH 876 A A +E ++L P+ + + G + + + L+ + V++ L LYES+ Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896 Query: 877 PLTILSTLPSAGVGALLALELFDAPFSLIALIGIMLLIGIVKKNAIMMVDFALDAQRNGN 936 P++++ +P VG LLA LF+ + ++G++ IG+ KNAI++V+FA D Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956 Query: 937 ISAREAIFQASLLRFRPIIMTTLAALFGALPLVLSSGDGAELRQPLGITIVGGLVVSQLL 996 EA A +R RPI+MT+LA + G LPL +S+G G+ + +GI ++GG+V + LL Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016 Query: 997 TLYTTPVIYLYFDRL 1011 ++ PV ++ R Sbjct: 1017 AIFFVPVFFVVIRRC 1031 Score = 78.0 bits (192), Expect = 1e-16 Identities = 59/350 (16%), Positives = 130/350 (37%), Gaps = 12/350 (3%) Query: 680 VRAALAKLPQLADVNSDQQDKGAEMALTYDRETMARLGI---DVSEANALLNNAFGQRQI 736 V+ L++L + DV M + D + + + + DV + N+ Q+ Sbjct: 162 VKDTLSRLNGVGDVQLFGAQY--AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQL 219 Query: 737 STIYQPLNQYKVVMEVAPEYTQDVSSLDKMFV-INSNGQSIPLSYFAK--WQPANAPLAV 793 Q +A ++ K+ + +NS+G + L A+ N + Sbjct: 220 GGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIA 279 Query: 794 NHQGLSAASTISFNLPDGGSLSEATAAVERAMTEL--GVPSTVRGAFA-GTAQVFQETLK 850 G AA +L + A++ + EL P ++ + T Q ++ Sbjct: 280 RINGKPAAGLGIKLATGANAL-DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIH 338 Query: 851 SQLWLIMAAIATVYIVLGILYESYVHPLTILSTLPSAGVGALLALELFDAPFSLIALIGI 910 + + AI V++V+ + ++ L +P +G L F + + + G+ Sbjct: 339 EVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGM 398 Query: 911 MLLIGIVKKNAIMMVDFALDAQRNGNISAREAIFQASLLRFRPIIMTTLAALFGALPLVL 970 +L IG++ +AI++V+ + +EA ++ ++ + +P+ Sbjct: 399 VLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAF 458 Query: 971 SSGDGAELRQPLGITIVGGLVVSQLLTLYTTPVIYLYFDRLRNRFSKQPL 1020 G + + ITIV + +S L+ L TP + + + + Sbjct: 459 FGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENK 508
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 872 bits (2254), Expect = 0.0 Identities = 289/1036 (27%), Positives = 501/1036 (48%), Gaps = 29/1036 (2%) Query: 13 SRLFILRPVATTLFMIAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVVTSSI 72 + FI RP+ + I +++AG + LPV+ P + P + V YPGA V ++ Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 73 TAPLERQFGQMSGLKQMASQS-SGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131 T +E+ + L M+S S S G+ ITL FQ D+A+ +VQ + AT LLP + Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 132 LPYPPIYNKVNPADPPILTLAVTATAIPMTQVE--DMVETRIAQKISQVTGVGLVTLSGG 189 + I + ++ + TQ + D V + + +S++ GVG V L G Sbjct: 122 VQQQGIS-VEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 190 QRPAVRVKLNAPAVAALGLDSETIRTAISNANVNSAKGSLDGP------TRSVTLSANDQ 243 Q A+R+ L+A + L + + N A G L G + ++ A + Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 244 MKSAEEYRDLII-AYQNGAPIRLQDVATIEQGAENNKLAAWANTQSAIVLNIQRQPGVNV 302 K+ EE+ + + +G+ +RL+DVA +E G EN + A N + A L I+ G N Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299 Query: 303 IATADSIREMLPELIKSLPKSVDVKVLTDRTSTIRASVNDVQFELLLAIALVVMVIYLFL 362 + TA +I+ L EL P+ + V D T ++ S+++V L AI LV +V+YLFL Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359 Query: 363 RNAAATIIPSIAVPLSLVGTFAAMYFLGFSINNLTLMALTIATGFVVDDAIVVIENISRY 422 +N AT+IP+IAVP+ L+GTFA + G+SIN LT+ + +A G +VDDAIVV+EN+ R Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 423 I-EKGEKPLDAALKGAGEIGFTIISLTFSLIAVLIPLLFMEDIVGRLFREFAVTLAVAIL 481 + E P +A K +I ++ + L AV IP+ F G ++R+F++T+ A+ Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 482 ISAVVSLTLTPMMCARML---SYESLRKQNRLSRASEKFFDWVIAHYAVALKKVLNHPWL 538 +S +V+L LTP +CA +L S E + FD + HY ++ K+L Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 539 TLSVAFSTLVLTVILYLLIPKGFFPLQDNGLIQGTLEAPQSVSFSNMAERQQQVAAIILK 598 L + + V+L+L +P F P +D G+ ++ P + + QV LK Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 599 DPA--VESLTSFVGVDGTNATLNNGRLQINLKPLSERDDRIP---QIITRLQESVSGVPG 653 + VES+ + G + N G ++LKP ER+ +I R + + + Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659 Query: 654 IKLYLQPVQDLTIDTQLSRTQYQFTLQ---ATSLEELSTWVPKLVNELQQK-APFQDVTS 709 ++ P I + T + F L + L+ +L+ Q A V Sbjct: 660 --GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717 Query: 710 DWQDQGLVAFVNVDRDSASRLGITMAAIDNALYNAFGQRLISTIYTQSNQYRVVLEHDVQ 769 + + + VD++ A LG++++ I+ + A G ++ + ++ ++ D + Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777 Query: 770 ATPGLAAFNDIRLTGIDGKGVPLSSIATIEERFGPLSINHLNQFPSATVSFNLAQGYSLG 829 + + + +G+ VP S+ T +G + N PS + A G S G Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837 Query: 830 EAVAAVTLAEKEIQLPADITTRFQGSTLAFQAALGSTLWLIIAAIVAMYIVLGVLYESFI 889 +A+A + +LPA I + G + + + L+ + V +++ L LYES+ Sbjct: 838 DAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895 Query: 890 HPITILSTLPTAGVGALLALMLTGNELDVIAIIGIILLIGIVKKNAIMMIDFALAAERDQ 949 P++++ +P VG LLA L + DV ++G++ IG+ KNAI++++FA + Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955 Query: 950 GMTPYDAIYQACLLRFRPILMTTLAALFGALPLMLSTGVGAELRQPLGVCMVGGLIVSQV 1009 G +A A +R RPILMT+LA + G LPL +S G G+ + +G+ ++GG++ + + Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015 Query: 1010 LTLFTTPVIYLLFDKL 1025 L +F PV +++ + Sbjct: 1016 LAIFFVPVFFVVIRRC 1031 Score = 84.1 bits (208), Expect = 2e-18 Identities = 77/517 (14%), Positives = 190/517 (36%), Gaps = 25/517 (4%) Query: 533 LNHPWLTLSVAFSTLVLTVILYLLIPKGFFPLQDNGLIQGTLEAPQSVSFSNMAERQQQV 592 + P +A ++ + L +P +P + + P + + Q V Sbjct: 6 IRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGA----DAQTVQDTV 61 Query: 593 AAIILKDPAVESLTSFVGVDGTNAT-LNNGRLQINL--KPLSERDDRIPQIITRLQESVS 649 +I +++ + ++T + G + I L + ++ D Q+ +LQ + Sbjct: 62 TQVI-----EQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATP 116 Query: 650 GVP-GIKLYLQPVQDLTIDTQLSRTQYQFTLQATSLEELSTWVPK-LVNELQQKAPFQDV 707 +P ++ V+ + + L + T+ +++S +V + + L + DV Sbjct: 117 LLPQEVQQQGISVEKSS-SSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDV 175 Query: 708 TSDWQDQGLVAFVNVDRDSASRLGITMAAIDNALYNAFGQRLISTIYTQSNQYRVVLEHD 767 + + +D D ++ +T + N L Q + L Sbjct: 176 QLFGAQYAMR--IWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233 Query: 768 VQATPGLAAFNDIR----LTGIDGKGVPLSSIATIEERFGPLSIN-HLNQFPSATVSFNL 822 + A + DG V L +A +E ++ +N P+A + L Sbjct: 234 IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKL 293 Query: 823 AQGYSLGEAVAAV--TLAEKEIQLPADI-TTRFQGSTLAFQAALGSTLWLIIAAIVAMYI 879 A G + + A+ LAE + P + +T Q ++ + + AI+ +++ Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353 Query: 880 VLGVLYESFIHPITILSTLPTAGVGALLALMLTGNELDVIAIIGIILLIGIVKKNAIMMI 939 V+ + ++ + +P +G L G ++ + + G++L IG++ +AI+++ Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413 Query: 940 DFALAAERDQGMTPYDAIYQACLLRFRPILMTTLAALFGALPLMLSTGVGAELRQPLGVC 999 + + + P +A ++ ++ + +P+ G + + + Sbjct: 414 ENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSIT 473 Query: 1000 MVGGLIVSQVLTLFTTPVIYLLFDKLARNTRGKNRHR 1036 +V + +S ++ L TP + K +N+ Sbjct: 474 IVSAMALSVLVALILTPALCATLLKPVSAEHHENKGG 510
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 43.3 bits (102), Expect = 1e-06 Identities = 22/115 (19%), Positives = 42/115 (36%), Gaps = 10/115 (8%) Query: 67 VIAANTVTVTSRVDGELMALHFTEGQQVKAGDLLAEIDPRPYEVQLTQAQGQLAKDQATL 126 + + + + + + EG+ V+ GD+L ++ A+ K Q++L Sbjct: 91 THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTA-------LGAEADTLKTQSSL 143 Query: 127 DNARRDLARYQKLSK---TGLISQQELDTQSSLVRQSEGSVKADQGAIDSAKLQL 178 AR + RYQ LS+ + + +L + SE V I Sbjct: 144 LQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTW 198 Score = 42.1 bits (99), Expect = 3e-06 Identities = 23/124 (18%), Positives = 54/124 (43%), Gaps = 4/124 (3%) Query: 108 YEVQLTQAQGQLAKDQATLDNARRDLARYQKLSKTGLISQQELDTQSSLVRQSEGSVKAD 167 E + +A +L ++ L+ ++ + + L++Q + +RQ+ ++ Sbjct: 257 QENKYVEAVNELRVYKSQLEQIESEILSAK--EEYQLVTQLFKNEILDKLRQTTDNIGLL 314 Query: 168 QGAIDSAKLQLTYSRITAPISGRV-GLKQVDVGNYITSGTATPIVVITQTHPVDVVFTLP 226 + + + S I AP+S +V LK G +T+ T +V++ + ++V + Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALVQ 373 Query: 227 ESDI 230 DI Sbjct: 374 NKDI 377
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.2 bits (70), Expect = 0.007 Identities = 8/31 (25%), Positives = 14/31 (45%) Query: 34 LTLLGPSGSGKTTSLMMLAGFETPTQGEITL 64 + L G G GK+T + L G + + + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDI 629
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 29.5 bits (66), Expect = 0.026 Identities = 10/57 (17%), Positives = 24/57 (42%), Gaps = 1/57 (1%) Query: 468 ISRVAVTAAWRQQGIARRMIAAEQAHARQQQ-CDFLSVSFGYTAELAHFWHRCGFRL 523 I +AV +R++G+ ++ A++ C + + HF+ + F + Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 27.6 bits (61), Expect = 0.016 Identities = 16/74 (21%), Positives = 29/74 (39%), Gaps = 3/74 (4%) Query: 22 QDLLSRSPDNASLLYKIASLYDVQGLELQAVPFYRAAIEHNLVGTELQAAYLGLGSTYRT 81 L S D LY +A G A ++A + + +LGLG+ + Sbjct: 26 AMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRF---FLGLGACRQA 82 Query: 82 LGLYQAALETFDHA 95 +G Y A+ ++ + Sbjct: 83 MGQYDLAIHSYSYG 96
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.2 bits (70), Expect = 0.007 Identities = 14/35 (40%), Positives = 18/35 (51%) Query: 31 VVSLLGPSGSGKTTLLRAVAGLEKPSQGHIIIGEK 65 V L G G GK+TL+ + GL+ S H IG Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTG 632
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 29.1 bits (65), Expect = 0.018 Identities = 23/104 (22%), Positives = 46/104 (44%), Gaps = 13/104 (12%) Query: 186 GVAVSGNIHLWVADTQTPESRENWLT----TLEKIKALKPAIVVPGHFLDNAPQTLESVT 241 GVA + N LWV++ P+S + LE + +KP+ +V +P+ L + Sbjct: 58 GVADTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIA 117 Query: 242 FTQNYLTTLNAEIPKAKDSAELIAVMKKHYPELKDESSLELSAK 285 + + D + +A+ +K E+ D +L+ +A+ Sbjct: 118 PGRGF---------NFSDGKQPLAMARKSLTEMADLLNLQSAAE 152
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 236 bits (604), Expect = 3e-79 Identities = 116/275 (42%), Positives = 152/275 (55%), Gaps = 4/275 (1%) Query: 27 VFFVSYLIFGAMVGSFLNVLIYRLPIMLANLSSR-SESHGEEIKMRSHLRNINLFQPGSF 85 ++F +F M+GSFLNV+I+RLPIML S+ NL P S Sbjct: 14 LYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEPPYNLMVPRSC 73 Query: 86 CHHCNESIPIKYNIPILGWIFLRGASRCCNKKISTRYLFIEVLAVIQTLLVLMIFKEDLL 145 C HCN I NIP+L W++LRG R C IS RY +E+L + ++ V M Sbjct: 74 CPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVAVAMTLAPGWG 133 Query: 146 ICTSLVLIWSLTALAFIDFDTYLLPDCMTIPLLWLGLLINIDTVFAPLTSAVLGAVSGYL 205 +L+L W L AL FID D LLPD +T+PLLW GLL N+ F L AV+GA++GYL Sbjct: 134 TLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGYL 193 Query: 206 FLWLSYWLFKIVRGVDGMGYGDFKLMAALGAWFGVSAVPFLILFSSFFGLVAYAIFYFFD 265 LW YW FK++ G +GMGYGDFKL+AALGAW G A+P ++L SS G Sbjct: 194 VLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILLR 253 Query: 266 KKDNGKEINYIAFGPYISLAGVLYLFLGSHVTNLF 300 K I FGPY+++AG + L G +T + Sbjct: 254 NHHQSKP---IPFGPYLAIAGWIALLWGDSITRWY 285
>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature. Length = 224 Score = 29.8 bits (67), Expect = 0.012 Identities = 14/65 (21%), Positives = 29/65 (44%), Gaps = 6/65 (9%) Query: 4 NGIALLMVLCALFLMSTMVMASYNYWFDIYYLAKNSQQRQKEKWILLGAEEKFVSKLIKN 63 NG+ALL+ ++F+M ++ +Y Y+ D + K + + + LIK Sbjct: 53 NGVALLL---SMFVMWPIMHDAYVYFEDEDVTFNDISSLSKH---VDEGLDGYRDYLIKY 106 Query: 64 TSEDR 68 + + Sbjct: 107 SDREL 111
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 207 bits (529), Expect = 2e-72 Identities = 87/136 (63%), Positives = 103/136 (75%) Query: 2 ANKKTKGFTLLEIMVVIVILGLLASLTIPSLMSNKNRADQQKAVSDISALENALDMYRLD 61 A K +GFTLLEIMVVIVI+G+LASL +P+LM NK +AD+QKAVSDI ALENALDMY+LD Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62 Query: 62 NGDYPTEQQGIAALVTKPNVPPLPQRYPSDGYIRRLPTDPWGNSYQMNNPGKHGQIDIFS 121 N YPT QG+ +LV P +PPL Y +GYI+RLP DPWGN Y + NPG+HG D+ S Sbjct: 63 NHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLS 122 Query: 122 IGPDRLPETEDDIGNW 137 GPD TEDDI NW Sbjct: 123 AGPDGEMGTEDDITNW 138
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 358 bits (921), Expect = e-124 Identities = 172/406 (42%), Positives = 264/406 (65%), Gaps = 7/406 (1%) Query: 1 MAVFKYVAISRSGTKITGDIDAENIRIARYLLYKKNMHVLSI-------KKRILLFNKYV 53 MA + Y A+ G K G +A++ R AR LL ++ + LS+ +K Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60 Query: 54 VKKNSNKTDLVLITRQIATLVNASMPLDEVLDIVGKQNSKSKMIEIIQRIRVNIQEGHSF 113 K + +DL L+TRQ+ATLV ASMPL+E LD V KQ+ K + +++ +R + EGHS Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120 Query: 114 ADALSPFPAVFSPLYKTMVTAGEVSGHLGLVLVRLADHIEQTQKIQRKIIQALIYPCVLV 173 ADA+ FP F LY MV AGE SGHL VL RLAD+ EQ Q+++ +I QA+IYPCVL Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180 Query: 174 LISLSVIIILLTAVVPNIVEQFSFSETALPLSTKVLMILSYSIKENVIFIMAIGVSAVIF 233 +++++V+ ILL+ VVP +VEQF + ALPLST+VLM +S +++ +++ ++ + Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240 Query: 234 LNRLLKINKINVFFHRHYLSLPMLGNMFVRINTSRYLRTLTTLHSNGVTIVQAMSISNAV 293 +L+ K V FHR L LP++G + +NT+RY RTL+ L+++ V ++QAM IS V Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300 Query: 294 LTNVYIKNKLNISVKLVSEGCSLSSSLVDSGVFPPIILHMIISGERSGKLDHMLETVAGV 353 ++N Y +++L+++ V EG SL +L + +FPP++ HMI SGERSG+LD MLE A Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360 Query: 354 QEEELMNQISIVMSLLEPTIIIVMAAFISFIILSILQPILEINSLV 399 Q+ E +Q+++ + L EP +++ MAA + FI+L+ILQPIL++N+L+ Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 543 bits (1401), Expect = 0.0 Identities = 310/610 (50%), Positives = 432/610 (70%), Gaps = 15/610 (2%) Query: 3 ISGKGIKSIHGMIFLFTLIMPLDIISANFSVSFKDVDIKEFINSVSKNINKTIIIDPTVQ 62 I I+S + +F ++ + FS SFK DI+EFIN+VSKN+NKT+IIDP+V+ Sbjct: 2 IIANVIRSFSLTLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVR 61 Query: 63 GLISIRSYENLDKDTYYQLFLNVLDVYGYAAIEMPHNVLKVISSKRAKGVVAPLPKEGVT 122 G I++RSY+ L+++ YYQ FL+VLDVYG+A I M + VLKV+ SK AK P+ + Sbjct: 62 GTITVRSYDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAP 121 Query: 123 FDGDELINRVIPLRYISAKKITPLLRQLNDNTESGSIINYDPSNILLITGRAAVVNRLHS 182 GDE++ RV+PL ++A+ + PLLRQLNDN GS+++Y+PSN+LL+TGRAAV+ RL + Sbjct: 122 GIGDEVVTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLT 181 Query: 183 IVTDLDQAGDNEIELYKLNYAIAADVVKIVNEAINPINNLKQEVSIVGKVIADERTNSIL 242 IV +D AGD + L++A AADVVK+V E + S+V V+ADERTN++L Sbjct: 182 IVERVDNAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVL 241 Query: 243 ISGDTYIRKKSILMIKKLDKRQSSDGNTKVVYMKYAQASKLLDVLNGISEGFHNEKKTKQ 302 +SG+ R++ I MIK+LD++Q++ GNTKV+Y+KYA+AS L++VL GIS +EK+ + Sbjct: 242 VSGEPNSRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAK 301 Query: 303 SNQWNQRPVAIKAYDQTNALVITADPDMMLALGEVIEKLDIRRAQVLVEAIIVETQNGEG 362 + + IKA+ QTNAL++TA PD+M L VI +LDIRR QVLVEAII E Q+ +G Sbjct: 302 PVAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADG 361 Query: 363 INLGVKWENKRSDDINF----IKNSDGLLNNNGWGIATTIT-----------GLTAGFYK 407 +NLG++W NK + F + S + N + T++ G+ AGFY+ Sbjct: 362 LNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQ 421 Query: 408 GNWDVLLSALSTNTNNNILATPSIVTLDNMEAEFNVGQEVPVLISTQTTTTDKVYNSISR 467 GNW +LL+ALS++T N+ILATPSIVTLDNMEA FNVGQEVPVL +QTT+ D ++N++ R Sbjct: 422 GNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVER 481 Query: 468 QSIGVMLKVKPQINKGDSVLLEIRQEVSSIADSSTVNTHNLGSVFNKRVVNNAVLVKSGE 527 +++G+ LKVKPQIN+GDSVLLEI QEVSS+AD+++ + +LG+ FN R VNNAVLV SGE Sbjct: 482 KTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGE 541 Query: 528 TVVVGGLLDKKSSTIVNKVPFLGDLPLIGWLFRQTKEKVEKSNLILFIKPTILRESDDYS 587 TVVVGGLLDK S +KVP LGD+P+IG LFR T +KV K NL+LFI+PT++R+ D+Y Sbjct: 542 TVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYR 601 Query: 588 VVTSKEYNKY 597 +S +Y + Sbjct: 602 QASSGQYTAF 611
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 44.6 bits (105), Expect = 5e-09 Identities = 19/62 (30%), Positives = 31/62 (50%) Query: 29 IKLVGVIEHSAPSESIAILEVKGKQTTHLTRENINYEDIVIVKIFTDRVIIKRNGKYYSL 88 + L GV+ S SIAI+ +Q + E + + IV I DRV+++ G+Y L Sbjct: 95 LSLTGVMAGDDDSRSIAIISKDNEQFSRGVNEEVPGYNAKIVSIRPDRVVLQYQGRYEVL 154 Query: 89 II 90 + Sbjct: 155 GL 156
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 92.2 bits (229), Expect = 9e-24 Identities = 32/123 (26%), Positives = 59/123 (47%), Gaps = 2/123 (1%) Query: 1 MMARRILVVEDEAPIREMVCFVLEQNGYQPLEAEDYDSAVARLSEPFPDLVLLDWMLPGG 60 M ILV +D+A IR ++ L + GY + + ++ DLV+ D ++P Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 SGIQFIKHMKREALTRDIPVMMLTARGEEEDRVRGLEVGADDYITKPFSPKELVARIKAV 120 + + +K+ D+PV++++A+ ++ E GA DY+ KPF EL+ I Sbjct: 61 NAFDLLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 121 MRR 123 + Sbjct: 119 LAE 121
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 41.7 bits (98), Expect = 2e-05 Identities = 32/222 (14%), Positives = 74/222 (33%), Gaps = 8/222 (3%) Query: 312 QYLAQLTPLT--QAVEQATAARQQQQLNQHEQETLIEQRIVPLDNLITQQQQTLSQLAGQ 369 L +LT L + ++ Q +L Q + L + + + Q + Sbjct: 122 DVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSE 181 Query: 370 IQQLRAKEQQNSQQLALNEQKLLQTHQRLQQLADYANLHAHHQHWEKHLPLWHEQFRQLQ 429 + LR Q QK + ++ A+ + A +E + + Sbjct: 182 EEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFS 241 Query: 430 LQQQQSAQSEQQLHQQTTLLATLQQQATTLSAQEKQQQVALAEARAQASYLQQKL--LVL 487 + A ++ + +Q + +Q +Q + + A+ + + Q +L Sbjct: 242 SLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL 301 Query: 488 EQ----QQPSAQLRQQLNEFNEQRQICQQLAALSPLAQQIQA 525 ++ L +L + E++Q A +S QQ++ Sbjct: 302 DKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKV 343 Score = 39.0 bits (91), Expect = 1e-04 Identities = 36/222 (16%), Positives = 72/222 (32%), Gaps = 30/222 (13%) Query: 449 LATLQQQATTLSAQEKQQQVALAEARAQASYLQQKLLVLEQQ----QPSAQLRQQLNEFN 504 L L +A TL Q Q L + R Q +L L + +P Q + Sbjct: 127 LTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLR 186 Query: 505 EQRQICQQLAALSPLAQQIQALYDKQQQQFTAQQQQLKQLEQQ---LTEKRQLYQQ-QKQ 560 I +Q + Q + DK++ + ++ + E + + + Sbjct: 187 LTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHK 246 Query: 561 HLVDLEALLEREKQIVTLEAERAKLQPGDACPLCGAVEHPAITAYQAVKPSETAVRVAKL 620 + A+LE+E + V E + ++ E+ + AK Sbjct: 247 QAIAKHAVLEQENKYVEAVNELRVYK-------------------SQLEQIESEILSAKE 287 Query: 621 RL-QVEQLYTEGTELRTQVASMQQHQQRIEQELQDHRQQLAA 661 V QL+ E+ ++ + + EL + ++ A Sbjct: 288 EYQLVTQLFKN--EILDKLRQTTDNIGLLTLELAKNEERQQA 327 Score = 38.3 bits (89), Expect = 2e-04 Identities = 28/206 (13%), Positives = 71/206 (34%), Gaps = 19/206 (9%) Query: 649 EQELQDHRQQLAAYQQRWQTLAQPLSL----AFTLNEPDALALWLEQHEQQEQACQLKLV 704 + Q Q Q R+Q L++ + L L + E+ + + Sbjct: 136 TLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL----- 190 Query: 705 EYERLTQQYQQAKDILTQLEQRQQEHQQQLALITERQKNAQQTYQQLQSQYQHQQEALIA 764 + +Q+ ++ Q E + + + + R + + +S+ +L+ Sbjct: 191 ----IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFS-SLLH 245 Query: 765 QQQVLNHTLTELSLSVPDADQQQDWLAQREEECQRWQQHQQEQQRLTIEQKTLETRIENE 824 +Q + H + E +A + + E+ + +E+ +L + E + Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK-- 303 Query: 825 RRHLQECIDQLSALSQQRQQAETLLQ 850 L++ D + L+ + + E Q Sbjct: 304 ---LRQTTDNIGLLTLELAKNEERQQ 326 Score = 34.0 bits (78), Expect = 0.004 Identities = 26/180 (14%), Positives = 72/180 (40%), Gaps = 13/180 (7%) Query: 835 LSALSQQRQQAETLLQQQIQQRQALFGEDIVAE-------VRQRLRLQQQQAELAQQNAE 887 + L Q R Q + + + + ++ + +R +++Q + Q + Sbjct: 145 QARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQ 204 Query: 888 K--ALQQAQSQLNRLSGELTGLEQQCQQYQQRATTTQAEL-QQALSTSEFADETALTAAL 944 K L + +++ + + E + + R + L +QA++ ++ Sbjct: 205 KELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEA 264 Query: 945 LSE--EERQHLQQLQQQLNERRQQAQIRLQQAR-EILDQHLQLCPQGVDKSSELTLLQQQ 1001 ++E + L+Q++ ++ +++ Q+ Q + EILD+ Q + EL +++ Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEER 324
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 28.3 bits (63), Expect = 0.045 Identities = 11/37 (29%), Positives = 21/37 (56%) Query: 218 DVIAEQAMNNYERRFAKSLAHVINLFDPDVVVLGGGM 254 D + E+A +N +R F+ + + LF+P +VV + Sbjct: 351 DSMLERAADNQDREFSSQMTLALGLFEPLLVVSMAAV 387
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.015 Identities = 16/68 (23%), Positives = 27/68 (39%), Gaps = 12/68 (17%) Query: 7 MVGARGAGKTTIGKALAQALGYRFVDTDL-------FMQQTSQMTVAEVVESEGWDGFRL 59 + G G GK+T+ L F DT +Q + + E+ E FR Sbjct: 601 LEGTGGIGKSTLINTLVGL--DFFSDTHFDIGTGKDSYEQIAGIVAYELSE---MTAFRR 655 Query: 60 RESMALQA 67 ++ A++A Sbjct: 656 ADAEAVKA 663
>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein signature. Length = 104 Score = 58.0 bits (140), Expect = 3e-13 Identities = 25/83 (30%), Positives = 45/83 (54%) Query: 147 QGRISPGEVDEVQLTLLMDIAKVTKISLRAALHRHLVEGATEEWVCSVYKMNQEDFWQNM 206 + + PG + E+ LL+ I+ + + A+ +LV G + + VC Y+MN F + Sbjct: 20 ESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHSRKEVCEKYQMNNGYFSTTL 79 Query: 207 RKLHRLNERVVQLLPFYTRQTSS 229 +L RLN +L P+YT ++S+ Sbjct: 80 GRLIRLNALAARLAPYYTDESSA 102
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 34.5 bits (79), Expect = 1e-04 Identities = 28/116 (24%), Positives = 47/116 (40%), Gaps = 8/116 (6%) Query: 66 IEREALLLWIARDEIGIIGTIQLVLCQKPNGLNRAEIQKLLVHSRSRRTGIGHKLIIAAE 125 +E E ++ E IG I++ + N A I+ + V R+ G+G L+ A Sbjct: 60 VEEEGKAAFLYYLENNCIGRIKI----RSNWNGYALIEDIAVAKDYRKKGVGTALLHKAI 115 Query: 126 NTAVQLRRGLIYLDTQS-GSSAESFYRAQGYRYVG-EIPDYACTPNGNYHPTAIYF 179 A + + L+TQ SA FY + + Y+ P N AI++ Sbjct: 116 EWAKENHFCGLMLETQDINISACHFYAKHHFIIGAVDTMLYSNFPTAN--EIAIFW 169
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 30.9 bits (70), Expect = 0.004 Identities = 17/66 (25%), Positives = 24/66 (36%), Gaps = 14/66 (21%) Query: 120 AEAI-SLLRHNRVVIFAAGTGNPFFTT-------------DSAACLRGIEIEADVVLKAT 165 AE I L+ +VI + G G P D A E+ AD+ + T Sbjct: 176 AETIKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILT 235 Query: 166 KVDGVY 171 V+G Sbjct: 236 DVNGAA 241
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 30.3 bits (68), Expect = 0.010 Identities = 9/44 (20%), Positives = 20/44 (45%), Gaps = 2/44 (4%) Query: 206 VFIGQSTRIYDRETGE--VHYGRVPAGSVVVSGNLPSKDGSYSL 247 VF+ + G V+Y + G + + G ++ G+Y++ Sbjct: 623 VFLSAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATEAGNYTV 666
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.0 bits (67), Expect = 0.028 Identities = 14/42 (33%), Positives = 22/42 (52%), Gaps = 4/42 (9%) Query: 31 VISIIGRSGSGKSTLLRCMNGLEDYQDGSIKLGGMTVTNRDS 72 + + G G GKSTL+ + GL+ + D +G T +DS Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG----TGKDS 635
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 47.2 bits (112), Expect = 7e-08 Identities = 35/163 (21%), Positives = 65/163 (39%), Gaps = 5/163 (3%) Query: 35 LETIATNFNLSVNQAGFIVTAAQLGYAVGLMFLVPLGDMFE-RRGLIVGMTLLAAGGMLI 93 L IA +FN ++ TA L +++G L D +R L+ G+ + G +I Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGS-VI 95 Query: 94 TAMSQNLTMMIIGTALTGLFSVVA--QLLVPLAATLAAPEKRGKVVGIIMSGLLLGILLA 151 + + ++I A L++ + A E RGK G+I S + +G + Sbjct: 96 GFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVG 155 Query: 152 RTVAGALATLGGWRTIYWVASALMFIMALVLWRCLPRYKQHTG 194 + G +A W + + + I L + L + + G Sbjct: 156 PAIGGMIAHYIHWSYLLLIPMITI-ITVPFLMKLLKKEVRIKG 197
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.017 Identities = 23/105 (21%), Positives = 37/105 (35%), Gaps = 12/105 (11%) Query: 12 LNSRAKRQKDFPYQEILLTRLSMHMHSKLLENRNKMLKAQGINETLFMALITLDAQESRS 71 + + P QE+ L + L R A+G + + T Sbjct: 745 PSPEDEEIYFRPEQELRLVETGVQGRLWALLTREGAPAAEGAAQKGYSVNTTF------- 797 Query: 72 IQPSELSAALG-----SSRTNATRIADELEKKGWIERRESHNDRR 111 + ++L ALG SS ++ D L + GW RE+ RR Sbjct: 798 VTIADLVQALGADPGKSSPMLEGQVRDWLNENGWEYLRETSGQRR 842
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 68.3 bits (167), Expect = 1e-14 Identities = 63/410 (15%), Positives = 118/410 (28%), Gaps = 99/410 (24%) Query: 25 LLLTAIFIMIGVAYLIYWFLVLRHHQ---ETDNAYISGNQVQIMSQVPGSVVSVHFENTD 81 L A FIM + ++ + SG +I V + + + Sbjct: 57 PRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGE 116 Query: 82 FVKSGDVLVTLDPTD-------AEQAFEQAK----------------------------- 105 V+ GDVL+ L + + QA+ Sbjct: 117 SVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYF 176 Query: 106 ----------------TALANSVRQTHQLIINSKQYQ-------ANIALKKTELSQAQND 142 + Q +Q +N + + A I + ++ Sbjct: 177 QNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSR 236 Query: 143 LKRRVVLGAAAVIGREELQHARDAVEAAQASLDMAVQQYNANQALVLNTPLE-------- 194 L L I + + + A L + Q ++ +L+ E Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296 Query: 195 -KQPAIEQAAAKMRDAWLT---------LQRTKVVSPISGYVSRRSVQ-VGAEISSGTPL 243 + + LT Q + + +P+S V + V G +++ L Sbjct: 297 KNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL 356 Query: 244 MAVVPADQ-LWIDANFKETQLANMRIGQPATI-VTDF----YGDDVVYQGKVVGLDMGTG 297 M +VP D L + A + + + +GQ A I V F YG GKV + Sbjct: 357 MVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGY---LVGKVKNI----- 408 Query: 298 SAFSLLPAQNATGNWIKVVQRLPVRIALDEKQLKEHPLRIGLSSLVKVDT 347 + ++ G V+ + K PL G++ ++ T Sbjct: 409 NLDAIE--DQRLGLVFNVIISIEENCLSTG--NKNIPLSSGMAVTAEIKT 454
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 41.2 bits (96), Expect = 3e-05 Identities = 43/290 (14%), Positives = 81/290 (27%), Gaps = 27/290 (9%) Query: 504 QLHEAEMAQPLEEATIERKRPEQPALATFSLPTEVPPEEAPTVAKAKPAVATPAAVSTDV 563 L+ E+ + T++ P +P+ P +A+ A P A +T Sbjct: 979 DLYNPEVEK--RNQTVDTTNITTPNNIQADVPSV--PSNNEEIARVDEAPVPPPAPATPS 1034 Query: 564 EQPGFFSRLFSGLKNMFGASAEAEVQPAEVVKTDTSENRRNDRRNPR--RQNNGRKERND 621 E AE Q ++ V+ + + +N ++ + N Sbjct: 1035 ETTE--------------TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANT 1080 Query: 622 RTPREGRDNSSRYNTNRDNT--SRDNANRDGANRDNSNRDNSGRDNVSREGREDQRRNNR 679 +T + S T T + + A + + +++Q + Sbjct: 1081 QTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQ 1140 Query: 680 RPAQPTTTSQGQTEVVEADKAQR----EEQPQRRGDRQRRRQDEKRQAPQEIKADVAEAP 735 A+P + + E EQP + Q V E P Sbjct: 1141 PQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE-QPVTESTTVNTGNSVVENP 1199 Query: 736 VIEEVQPEQEERQQVMQRRQRRQLNQKVRIQSANDELNTLESPVSAPVAQ 785 Q + + + + VR N E T S + VA Sbjct: 1200 ENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVAL 1249 Score = 37.7 bits (87), Expect = 3e-04 Identities = 45/331 (13%), Positives = 91/331 (27%), Gaps = 40/331 (12%) Query: 671 REDQRRNNRRPAQPTTTSQGQTEVVEADKAQREEQPQRRGDR-QRRRQDEKRQAPQEIKA 729 ++R TT + Q + P + + R DE P Sbjct: 984 EVEKRNQTVDTTNITTPNNIQAD-----------VPSVPSNNEEIARVDEAPVPPPAPAT 1032 Query: 730 DVAEAPVIEEVQPEQEERQQVMQRRQRRQLNQKVRIQSANDELNTLESPVSAPVAQVVVA 789 + E ++ + + ++ Q R + + N + + VAQ Sbjct: 1033 PSETTETVAENSKQESKTVEKNEQDATETTAQN-REVAKEAKSNVKANTQTNEVAQS--- 1088 Query: 790 EVQEEVKLLPQITAQTDDDSANERTTNNENGMPRRSRRSPRHLRVSGQRRRRYRDERYPA 849 E + T +T E+ +++ P+ ++ + + A Sbjct: 1089 -GSETKETQTTETKETATVEKEEK----AKVETEKTQEVPKVTSQVSPKQEQSETVQPQA 1143 Query: 850 QSAMPLAGAFASPEMASGKVWVRYPVTPVVEQVVVEQIAIEQTTTVEQTAIVEQVSVANI 909 + A E P + EQ A E ++ VEQ Sbjct: 1144 EPARENDPTVNIKE----------PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGN 1193 Query: 910 VTAQLPVEQVQNTVAEQESSATPSVMTTPTVAVATAAVTLAPQHKPGGSSSSAAAVPGRA 969 + P T +S + + ++ P + + R+ Sbjct: 1194 SVVENPENTTPATTQPTVNSESSNKPKN------RHRRSVRSV--PHNVEPATTSSNDRS 1245 Query: 970 PIVAAVPVVAETTAAETVVAKTEAAIDAVAV 1000 VA + + T A A+ +A A+ V Sbjct: 1246 T-VALCDLTSTNTNAVLSDARAKAQFVALNV 1275
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 977 bits (2528), Expect = 0.0 Identities = 328/570 (57%), Positives = 417/570 (73%), Gaps = 5/570 (0%) Query: 3 QISRQEYAGLFGPTTGDKIRLGDTNLFIEIEKDLRGYGEESVYGGGKSLRDGMGANNNLT 62 ++SR YA +FGPT GDK+RL DT LFIE+EKD +GEE +GGGK +RDGMG + +T Sbjct: 4 RMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMG-QSQVT 62 Query: 63 RDNGVLDLVITNVTIVDARLGVIKADVGIRDGKIAGIGKSGNPGVMDGVTQGMVVGVSTD 122 R+ G +D VITN I+D G++KAD+G++DG+IA IGK+GNP + GVT ++VG T+ Sbjct: 63 REGGAVDTVITNALILDH-WGIVKADIGLKDGRIAAIGKAGNPDMQPGVT--IIVGPGTE 119 Query: 123 AISGEHLILTAAGIDSHIHLISPQQAYHALSNGVATFFGGGIGPTDGTNGTTVTPGPWNI 182 I+GE I+TA G+DSHIH I PQQ AL +G+ GGG GP GT TT TPGPW+I Sbjct: 120 VIAGEGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHI 179 Query: 183 RQMLRSIEGLPVNVGILGKGNSYGRGPLLEQAIAGVVGYKVHEDWGATANALRHALRMAD 242 +M+ + + P+N+ GKGN+ G L+E + G K+HEDWG T A+ L +AD Sbjct: 180 ARMIEAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVAD 239 Query: 243 EVDIQVSVHTDSLNECGYVEDTIDAFEGRTIHTFHTEGAGGGHAPDIIRVASQTNVLPSS 302 E D+QV +HTD+LNE G+VEDTI A +GRTIH +HTEGAGGGHAPDIIR+ Q NV+PSS Sbjct: 240 EYDVQVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSS 299 Query: 303 TNPTLPYGVNSQAELFDMIMVCHNLNPNVPADVSFAESRVRPETIAAENVLHDMGVISMF 362 TNPT PY VN+ AE DM+MVCH+L+P +P D++FAESR+R ETIAAE++LHD+G S+ Sbjct: 300 TNPTRPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSII 359 Query: 363 SSDSQAMGRVGENWLRILQTADAMKAARGKLPEDAAGNDNFRVLRYVAKITINPAITQGV 422 SSDSQAMGRVGE +R QTAD MK RG+L E+ NDNFRV RY+AK TINPAI G+ Sbjct: 360 SSDSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGL 419 Query: 423 SHVIGSVEVGKMADLVLWDPRFFGAKPKMVIKGGMINWAAMGDPNASLPTPQPVFYRPMF 482 SH IGS+EVGK ADLVLW+P FFG KP MV+ GG I A MGDPNAS+PTPQPV YRPMF Sbjct: 420 SHEIGSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMF 479 Query: 483 GAMGKTLQDTCVTFVSQAALDDGVKEKAGLDRQVIAVKNCR-TISKRDLVRNDQTPNIEV 541 GA G++ ++ VTFVSQA+LD G+ + G+ ++++AV+N R I K ++ N TP+IEV Sbjct: 480 GAYGRSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEV 539 Query: 542 DPETFAVKVDGVHATCEPIATASMNQRYFF 571 DPET+ V+ DG TCEP M QRYF Sbjct: 540 DPETYEVRADGELLTCEPATVLPMAQRYFL 569
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 74.1 bits (182), Expect = 9e-18 Identities = 27/112 (24%), Positives = 52/112 (46%), Gaps = 1/112 (0%) Query: 1 MRIALESEGWRVFESETLQRGLIEAGTRKPDLIILDLGLPDGDGLNYIQDLRQWSA-IPI 59 + AL G+ V + DL++ D+ +PD + + + +++ +P+ Sbjct: 19 LNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDLPV 78 Query: 60 IVLSARNNEEDKVAALDAGADDYLSKPFGISELLARVRVALRRHSGASQESP 111 +V+SA+N + A + GA DYL KPF ++EL+ + AL + Sbjct: 79 LVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLE 130
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 23.2 bits (50), Expect = 0.048 Identities = 11/34 (32%), Positives = 20/34 (58%), Gaps = 3/34 (8%) Query: 11 SHEQVVARMLKKPAV---RAEYERLERQDFAIID 41 SH +++R L+ PAV + E+++ D I+D Sbjct: 189 SHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVD 222
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 68.6 bits (168), Expect = 3e-19 Identities = 32/79 (40%), Positives = 47/79 (59%) Query: 10 IVHLATELLWLVLLLSLPVVVVASTVGLVISLVQALTQIQDQTLQFLIKLLAVSATLLMT 69 +V + L+LVL+LS +VA+ +GL++ L Q +TQ+Q+QTL F IKLL V L + Sbjct: 4 LVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLL 63 Query: 70 YHWMGATLLNYTQQSFLQI 88 W G LL+Y +Q Sbjct: 64 SGWYGEVLLSYGRQVIFLA 82
>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature. Length = 224 Score = 224 bits (572), Expect = 1e-76 Identities = 86/220 (39%), Positives = 143/220 (65%), Gaps = 7/220 (3%) Query: 4 LNSSYQLIALLFMLSVLPLLVVMGTAFLKLSVVFSLLRNALGVQQVPPNIAIYGLALVLT 63 + + LIALL ++LP ++ GT F+K S+VF ++RNALG+QQ+P N+ + G+AL+L+ Sbjct: 1 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS 60 Query: 64 IFIMAPVGLDVQARLQNEELSNDIGALAHQIDQNALVPYRDFLQRNTDIEQVTFFNDIVQ 123 +F+M P+ D ++E+++ + + + L YRD+L + +D E V FF + Sbjct: 61 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL 120 Query: 124 NKWPE-------RYRDSVKPDSLLILMPAFTLSQLNEAFKIGLLLFLPFVAIDLIVSNIL 176 + R +D ++ S+ L+PA+ LS++ AFKIG L+LPFV +DL+VS++L Sbjct: 121 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL 180 Query: 177 LAMGMMMVSPMTLSLPFKLLVFVLVDGWSLVLGQLVGSYL 216 LA+GMMM+SP+T+S P KL++FV +DGW+L+ L+ Y+ Sbjct: 181 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQYM 220
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 51.5 bits (123), Expect = 1e-09 Identities = 22/81 (27%), Positives = 37/81 (45%) Query: 209 PPLAAVQLEDLPQTLVMEIGRLTLPLGEIKQLAVGQTLACQTHCYGEVNICLNGQSVGRG 268 L LP L + R + L E++ + Q L+ T+ V I NG +G G Sbjct: 220 TAETLPGLNQLPVKLEFVLYRKNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNG 279 Query: 269 SLLRCDEKLVVRIAQWGLQNG 289 L++ ++ L V I +W ++G Sbjct: 280 ELVQMNDTLGVEIHEWLSESG 300
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 29.0 bits (65), Expect = 0.005 Identities = 17/114 (14%), Positives = 40/114 (35%), Gaps = 7/114 (6%) Query: 5 QQRTLQRLLALRQRQERRLRQQLGQLRREQQQLENGRRRHQQLCQQLQQLAQWCGMLTPR 64 ++ + Q Q+ + L + R E+ + R++ L + + L Sbjct: 187 LTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSL--- 243 Query: 65 EADEQKVLRQAVYQAERQAKKQLNAWVAQGRQQVSAIERQ--QARLRRNQREQE 116 +Q + + AV + E + + + + Q+ IE + A+ Q Sbjct: 244 -LHKQAIAKHAVLEQENKY-VEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 63.4 bits (154), Expect = 8e-14 Identities = 44/188 (23%), Positives = 70/188 (37%), Gaps = 7/188 (3%) Query: 7 MLAIVLMTLSLSGCDME-LYSGLSEGEANQMLALLMLHQINAEKQIEKSGMVGLTVDKRQ 65 + +V M L D L+S LS+ + ++A L I + V + Sbjct: 35 VAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPYR--FANGSGA-IEVPADK 91 Query: 66 FINAVELLRQNGFPRQRFITVDELFPANQLVTSPTQEQAKMVFLKEQQLENMLSHMDGVI 125 L Q G P+ + EL + S EQ E +L + + V Sbjct: 92 VHELRLRLAQQGLPKGGAVGF-ELLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVK 150 Query: 126 HADVTVAMPM-SVDGKNPLPHTASVFIKYSPEVNLQSYQ-SQIKGLVRDAVPGIDYAKIS 183 A V +AMP S+ + +ASV + P L Q S + LV AV G+ ++ Sbjct: 151 SARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVT 210 Query: 184 VVMQPANY 191 +V Q + Sbjct: 211 LVDQSGHL 218
>PF05272#Virulence-associated E family protein Length = 892 Score = 37.0 bits (85), Expect = 1e-04 Identities = 18/78 (23%), Positives = 25/78 (32%), Gaps = 19/78 (24%) Query: 33 VLVGPSGCGKSTLLRMIAGLEEISGGTVGINDKDVTDVEPKMRDIAMVFQSYALYPQMTV 92 VL G G GKSTL+ + GL+ S D +D Y Sbjct: 600 VLEGTGGIGKSTLINTLVGLDFFS-------DTHFDI--GTGKDSYEQIAGIVAY----- 645 Query: 93 RENMGFALKMAKMSKADI 110 + +M +AD Sbjct: 646 --ELS---EMTAFRRADA 658
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 30.1 bits (68), Expect = 0.017 Identities = 25/87 (28%), Positives = 38/87 (43%), Gaps = 18/87 (20%) Query: 289 LASLGVLDILSSD--------------YYPASLMDAAF-RIAHDE--SNRFTLPQAVNLV 331 L +G I+SSD + A M R+ + ++ F + + + Sbjct: 350 LHDIGAFSIISSDSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKY 409 Query: 332 TRNPARALGLNDR-GVIAEGKRADLIL 357 T NPA A GL+ G + GKRADL+L Sbjct: 410 TINPAIAHGLSHEIGSLEVGKRADLVL 436
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.1 bits (62), Expect = 0.045 Identities = 13/47 (27%), Positives = 17/47 (36%), Gaps = 8/47 (17%) Query: 64 CVVLHGHSGSGKSTLLRSLYANYLPDSGHI--------WIKHQGEWI 102 VVL G G GKSTL+ +L H + + G Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVA 644
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 64.5 bits (157), Expect = 9e-13 Identities = 26/125 (20%), Positives = 58/125 (46%), Gaps = 17/125 (13%) Query: 735 MADQLVLVLEDEPDVRQTLCEQLHQLGYLTLETGDSRQALALMADVPDISIVISDLMLPG 794 M +LV +D+ +R L + L + GY T ++ +A +V++D+++P Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPD 59 Query: 795 DMTGAEVLQQARSVYPHLKLLLISGQD---------LRRSKNFMPEVELLRKPFNQQQLV 845 ++L + + P L +L++S Q+ + + +++P KPF+ +L+ Sbjct: 60 -ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLP------KPFDLTELI 112 Query: 846 QALQR 850 + R Sbjct: 113 GIIGR 117
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 99 bits (249), Expect = 2e-26 Identities = 35/122 (28%), Positives = 61/122 (50%), Gaps = 1/122 (0%) Query: 2 KPAILVVDDDTAICEVLRDVLNEHVFDVLLCHSGNEALQITATQPSIALILLDMMLPDIN 61 ILV DDD AI VL L+ +DV + + + A L++ D+++PD N Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDG-DLVVTDVVMPDEN 61 Query: 62 GLLVLQQVQKLRPSLPVVMLTGMGSESDMVVGLEMGADDYIAKPFNARVVVARVKAVLRR 121 +L +++K RP LPV++++ + + E GA DY+ KPF+ ++ + L Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 122 SE 123 + Sbjct: 122 PK 123
>adhesinb#Adhesin B signature. Length = 310 Score = 25.6 bits (56), Expect = 0.019 Identities = 6/27 (22%), Positives = 11/27 (40%) Query: 20 PEEYERIVSAYAAWTRVCREYEFNDGY 46 P E + IV++ + + Y Y Sbjct: 196 PGEKKMIVTSEGCFKYFSKAYNVPSAY 222
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 23.2 bits (50), Expect = 0.047 Identities = 10/35 (28%), Positives = 18/35 (51%) Query: 7 DVVMSDIDMPESELKKGMKGVISITTRTIKPYILI 41 D V++D + E L+ +K V S + P +L+ Sbjct: 78 DGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLV 112
>INTIMIN#Intimin signature. Length = 939 Score = 473 bits (1217), Expect = e-145 Identities = 267/884 (30%), Positives = 405/884 (45%), Gaps = 81/884 (9%) Query: 11 YTLGPGDSIQSIAKKYNITVDELKKLNAYRTFSKP-FASLTTGDEIEVPRKESSF----- 64 YTL G+++ ++K +I + + LN + S+ G +I +P K+ F Sbjct: 65 YTLKTGETVADLSKSQDINLSTIWSLNKHLYSSESEMMKAEPGQQIILPLKKLPFEYSAL 124 Query: 65 ---------------------FSNNPNENNKKDVDDLLARNAMGAG-----KLLSNDNTS 98 +P+ DD A +L S Sbjct: 125 PLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDDKALNYAAQQAASLGSQLQSRSLNG 184 Query: 99 DAASNMARSAVTNEINASSQQWLNQFGTARVQLNVDSDFKLDNSALDLLVPLKDSESSLL 158 D A + A N+ ++ Q WL +GTA V L ++F D S+LD L+P DSE L Sbjct: 185 DYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNF--DGSSLDFLLPFYDSEKMLA 242 Query: 159 FTQLGVRNKDSRNTVNIGAGIRQYQGDWMYGANTFFDNDLTGKNRRVGVGAEVATDYLKF 218 F Q+G R DSR T N+GAG R + + M G N F D D +G N R+G+G E DY K Sbjct: 243 FGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKS 302 Query: 219 SANTYFGLTGWHQSRDFSSYDERPADGFDIRTEAYLPAYPQLGGKLMYEKYRGDEVALFG 278 S N YF ++GWH+S + YDERPA+GFDIR YLP+YP LG KLMYE+Y GD VALF Sbjct: 303 SVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFN 362 Query: 279 KDDRQKDPHAVTLGVNYTPVPLVTIGAEHREGKGNNNNTSVNVQLNYRMGQPWNDQIDQS 338 D Q +P A T+GVNYTP+PLVT+G ++R G GN N+ ++Q Y+ +PW+ QI+ Sbjct: 363 SDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQQIEPQ 422 Query: 339 AVAANRTLAGSRYDLVERNNNIVLDYKKQELIHLVLPDRISGSGGGAITLTAQVRAKYGF 398 V RTL+GSRYDLV+RNNNI+L+YKKQ+++ L +P I+G+ + V++KYG Sbjct: 423 YVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTERSTQKIQLIVKSKYGL 482 Query: 399 SRIEWDATPLENAGG---STSPLTQSSLSVTLPFYQHILRTSNTHTISAVAYDAQGNASN 455 RI WD + L + GG + + LP Y SN + ++A AYD GN+SN Sbjct: 483 DRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQ--GGSNVYKVTARAYDRNGNSSN 540 Query: 456 RAVTSIEVTRPETMV----ISHLATTIDNATANGIATNTVQATVTDGDGQPIIGQLINFA 511 + +I V +V ++ +A A+G T ATV + Sbjct: 541 NVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNI 600 Query: 512 VNTQATLSTTEARTGANGTASTTLTHTVSGVSRVSVTLGSSSRSVDTTFV--ADESTAEI 569 V+ A LS A T +G A+ TL G VS + +++ V D++ A I Sbjct: 601 VSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASI 660 Query: 570 TAANLTVTTNDSVANGSDTNVVRAKVTDAYTNAVANQSVIFSASNGATVIDQTVITNAEG 629 T + +VANG D KV V+NQ V F+ + + + T T+ G Sbjct: 661 T--EIKADKTTAVANGQDAITYTVKVMKG-DKPVSNQEVTFT-TTLGKLSNSTEKTDTNG 716 Query: 630 IADSTLTNTTAGVSVVTATLGGQS---QQVDTTFKPGSTAAISLVKLADRAVADGIDQNE 686 A TLT+TT G S+V+A + + + + F T +++ V + Sbjct: 717 YAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVW 776 Query: 687 IQ-----VVLRDGTGN----AVPNVPMSIQADNGAIVVASTPNTGVDGTIN----ATFTN 733 +Q + G G + S+ A +G + + T + + AT+T Sbjct: 777 LQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVISSDNQTATYTI 836 Query: 734 LRAGESVVS------VTSPALVGMTMTMTFSADPRTAVVSTLAAIDNNAKADG-TDTNVV 786 +V + A+ + + + A K + + + Sbjct: 837 ATPNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANKYEYYKSSQTI 896 Query: 787 RAWVVDANGNSVPGVSVTFDAGNGAVLAQNPV----VTDRNGYA 826 +WV ++ GV+ T+D ++ QNP+ ++ N YA Sbjct: 897 ISWVQQTAQDAKSGVASTYD-----LVKQNPLNNIKASESNAYA 935 Score = 89.7 bits (222), Expect = 1e-19 Identities = 74/340 (21%), Positives = 120/340 (35%), Gaps = 29/340 (8%) Query: 2552 NALADGVTRNQVRAHVVDSTGNSVADMAVTFTANRGAQLSKVTVLTDNNGDAVNTLTNSL 2611 +A ADG A V + + A LS + T+ +G A TL + Sbjct: 569 SAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDK 628 Query: 2612 VGVTVVTAKLGTAGTPLTVDTVFTAGPLATLTLVTTV--NNAFADNSATNTVQATLKDV- 2668 G VV+AK TA ++ T +T + + A + + + T+K + Sbjct: 629 PGQVVVSAK--TAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMK 686 Query: 2669 SGNPIVGEVVAFAASNGATITATDGGVSNANGIVLATLTNGTAGVSTVTATIE----TLT 2724 P+ + V F + G +T+ ++ NG TLT+ T G S V+A + + Sbjct: 687 GDKPVSNQEVTFTTTLGKLSNSTE--KTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVK 744 Query: 2725 ETTDTTFIAMKNLDVTVNGTTFNGDAGFPTTGFVGATFKVNSGGDNSLYDWSSSAPALVS 2784 F + D + PT + + G N Y W S+ PA+ S Sbjct: 745 APEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIAS 804 Query: 2785 VSGD-GVVTFNAVFPTGTPTITISATPKGGGSPLSYSFRVNQWFINNNGATLNRADAITH 2843 V G VT T TIS + N + N + DA+ Sbjct: 805 VDASSGQVTLK-----EKGTTTISVISSDNQTATYTIATPNSLIVPNMSKRVTYNDAVNT 859 Query: 2844 CENVGYTMPTSTQVTNAATWMSGKRAVGNLWSEWGDFSAY 2883 C+N G +P+S + N++ WG + Y Sbjct: 860 CKNFGGKLPSSQNE------------LENVFKAWGAANKY 887 Score = 73.2 bits (179), Expect = 1e-14 Identities = 91/426 (21%), Positives = 137/426 (32%), Gaps = 45/426 (10%) Query: 821 DRNGYAENTLTNLAIGTTTVKATTVTDPVGQTVNTHFVAGAVDTITLTVPVNGAVANGVN 880 DRNG N+ N+ + T + V D VG T T A A+G Sbjct: 533 DRNG---NSSNNVLLTITVLSNGQVVDQVGVT-------------DFTADKTSAKADGTE 576 Query: 881 TNSVQAVVSDSGGNPVTGATVVFSSTNATAQVTTVIGTTGADGIATATLTNTVAGTSNVV 940 + A V +G V F+ + TA ++ T G AT TL + G V Sbjct: 577 AITYTATVKKNGVAQA-NVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVS 635 Query: 941 ATI----DTVNANIDTAFVAGAVATITLTAPV-NGAVADGADTNQVDALVEDANGNPITG 995 A +NAN FV A+IT AVA+G D V P++ Sbjct: 636 AKTAEMTSALNANA-VIFVDQTKASITEIKADKTTAVANGQDAITYTVKV-MKGDKPVSN 693 Query: 996 AAVVFSSANGATILSSTMNTGVNGVASTLLTHTVAGTSNVVATVDTVNANI---DTTFVA 1052 V F++ + +ST T NG A LT T G S V A V V ++ + F Sbjct: 694 QEVTFTT-TLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFT 752 Query: 1053 GAVATITLTTPVNGAVADGANSNSVQAVVSDSDGNPVTGAAVVFSSANATAQITTVIGTT 1112 V V + +Q + + G S+ A A + G Sbjct: 753 TLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQV 812 Query: 1113 GADGIATATLTNTVAGTSNVVATIDTVNANIDTAFVAGAVATITLTAPVNGAVADGADTN 1172 T T++ + TI T N + I D +T Sbjct: 813 TLKEKGTTTISVISSDNQTATYTIATPN------------SLIVPNMSKRVTYNDAVNTC 860 Query: 1173 QVDALVQDANGNAITGAAVVFSSANGADIIAPTMNTGVNGVASTLLTHTVAGTSNVVATI 1232 + ++ N + + +AN + + S + S V +T Sbjct: 861 KNFGGKLPSSQNELENVFKAWGAANKYEYY-----KSSQTIISWVQQTAQDAKSGVASTY 915 Query: 1233 DTISAN 1238 D + N Sbjct: 916 DLVKQN 921 Score = 70.9 bits (173), Expect = 7e-14 Identities = 70/374 (18%), Positives = 125/374 (33%), Gaps = 34/374 (9%) Query: 1921 NRVQSKDTTFIADRTTATIRASDLTITRNNALADGVATNAARVIVTDANGNPVPSMFVGY 1980 N V T + + +D T + +A ADG V Sbjct: 540 NNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFN 599 Query: 1981 TSDNGALLTPTSGMTDSSGTFSTTFTHTTAGISKVTAAIVTMGISQTKDAVFIADRSTAH 2040 A+L+ S T+ SG + T G V+A M + +AV D++ A Sbjct: 600 IVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKAS 659 Query: 2041 VSELIVVKNDSLANNSDRNIVQAHIKDAHGNVVTGMNVNFSATENVTLTANTVTTNSQGY 2100 ++E+ K ++AN D + V+ V F+ T L+ +T T++ GY Sbjct: 660 ITEIKADKTTAVANGQDAITYTVKVMK-GDKPVSNQEVTFT-TTLGKLSNSTEKTDTNGY 717 Query: 2101 AENTLRHNAPVTSAVTATVATDLVGLTEDVRFVAGAGARIELFRLNDGAVADGIQTNRVE 2160 A+ TL P S V+A V+ ++ + I +E Sbjct: 718 AKVTLTSTTPGKSLVSAR--------------VSDVAVDVKAPEVEFFTTLT-IDDGNIE 762 Query: 2161 ARVYDVSDNLVPN------SNVVFSADNGG---QLVQNDVQTDALGSAYVTVSNINTGVT 2211 V L N+ S NG + + + S VT+ G T Sbjct: 763 IVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLK--EKGTT 820 Query: 2212 KVTVTADGVSASTTTTFIADRDTATLVTDRFLITHDNAVANGVVENRVLLHLVDANDNSV 2271 ++V + S + T T+ + +V + ++ + V + + ++ N + Sbjct: 821 TISVIS---SDNQTATYTIATPNSLIVPN---MSKRVTYNDAVNTCKNFGGKLPSSQNEL 874 Query: 2272 SGVEVNFSATNGAS 2285 V + A N Sbjct: 875 ENVFKAWGAANKYE 888 Score = 61.6 bits (149), Expect = 4e-11 Identities = 50/212 (23%), Positives = 73/212 (34%), Gaps = 9/212 (4%) Query: 1340 VAGAVATITLTAPVNGAVADGVNTNSVQAVVSDSDGNAVTGATVVFSSANATAQITTVIG 1399 V V TA A ADG + A V +G A V F+ + TA ++ Sbjct: 554 VVDQVGVTDFTADKTSAKADGTEAITYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSA 612 Query: 1400 TTGADGIATATLTNTVAGTSNVVATI----DTVNANIDTTFVAGELENIVVSIINNNALA 1455 T G AT TL + G V A +NAN + + A+A Sbjct: 613 NTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVA 672 Query: 1456 NGADTNIVEAFVTDRFGNGVANQSLIFGTNGASIVGSSTVTTNLDGRVRASATHTVAGSS 1515 NG D V V+NQ + F T + +ST T+ +G + + T T G S Sbjct: 673 NGQDAITYTVKVMKG-DKPVSNQEVTFTTTL-GKLSNSTEKTDTNGYAKVTLTSTTPGKS 730 Query: 1516 NTVIAISGAHQGYA--RVTFVADVSTAQLKLT 1545 +S V F ++ + Sbjct: 731 LVSARVSDVAVDVKAPEVEFFTTLTIDDGNIE 762 Score = 59.7 bits (144), Expect = 2e-10 Identities = 66/358 (18%), Positives = 123/358 (34%), Gaps = 30/358 (8%) Query: 1633 VAGKAASIEMTMTKDNAVANNIDTNEVQVLVTDVDGNAINGAVVNLTSNSGMNITPNSVT 1692 V + + T K +A A+ + V N V + ++ NS Sbjct: 554 VVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSAN 613 Query: 1693 TGSDGTATATLTHTLAGSLPINARIDQVSKTINATF--IADASTAQI--IAGDMFIIVND 1748 T G AT TL G + ++A+ +++ +NA D + A I I D Sbjct: 614 TNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTA--- 670 Query: 1749 QVANGQAVNAVQARVTDSYGNPIKDQTVEFVLSNNGTIQYELDVTSVEGGVMVTFTNTLA 1808 VANGQ +V P+ +Q V F + G + + T G VT T+T Sbjct: 671 -VANGQDAITYTVKVMKG-DKPVSNQEVTFT-TTLGKLSNSTEKTDTNGYAKVTLTSTTP 727 Query: 1809 GITNVTATVVSSGSS-RNIDTTFIADVTTAHIAASDLMVIVDDAVADNLDKNEVHARVTD 1867 G + V+A V + + F +T IV V L + + Sbjct: 728 GKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIE----IVGTGVKGKLPTVWLQYGQVN 783 Query: 1868 AKGNVLSGQTVIFTSGNGAAITTVNGISDGDGLTKATLTHTLAGTSVVTARVGNRVQSKD 1927 K + +G+ ++ A + +T GT+ ++ + ++ Sbjct: 784 LKASGGNGKYTWRSANPAIASVDAS---------SGQVTLKEKGTTTISVISSD---NQT 831 Query: 1928 TTFIADRTTATIRASDLTITRNNALADGVATNAARVIVTDANGNPVPSMFVGYTSDNG 1985 T+ + I + +++ D V T ++ N + ++F + + N Sbjct: 832 ATYTIATPNSLIVPN---MSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANK 886 Score = 59.3 bits (143), Expect = 2e-10 Identities = 81/378 (21%), Positives = 131/378 (34%), Gaps = 31/378 (8%) Query: 675 DRAVADGIDQNEIQVVLRDGTGNAVPNVPMSIQADNG-AIVVASTPNTGVDGTINATFTN 733 A ADG + ++ G A NVP+S +G A++ A++ NT G T + Sbjct: 568 TSAKADGTEAITYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKS 626 Query: 734 LRAGESVVSVTSPALVGMTMTMTFSA----DPRTAVVSTLAAIDNNAKADGTDTNVVRAW 789 + G+ VVS + MT + +A D A ++ + A A A+G D + Sbjct: 627 DKPGQVVVSAKT---AEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDA-ITYTV 682 Query: 790 VVDANGNSVPGVSVTFDAGNGAVLAQNPVVTDRNGYAENTLTNLAIG--TTTVKATTVTD 847 V V VTF L+ + TD NGYA+ TLT+ G + + + V Sbjct: 683 KVMKGDKPVSNQEVTFTT-TLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAV 741 Query: 848 PVGQTVNTHFVAGAVDTITLTVPVNGAVANGVNTNSVQAVVSDSGGNPVTGATVVFSSTN 907 V F +D + + G V + T +Q + + G S+ Sbjct: 742 DVKAPEVEFFTTLTIDDGNIEIVGTG-VKGKLPTVWLQYGQVNLKASGGNGKYTWRSANP 800 Query: 908 ATAQVTTVIGTTGADGIATATLTNTVAGTSNVVATIDTVNANIDTAFVAGAVATITLTAP 967 A A V G T T++ + TI T N + I Sbjct: 801 AIASVDASSGQVTLKEKGTTTISVISSDNQTATYTIATPN------------SLIVPNMS 848 Query: 968 VNGAVADGADTNQVDALVEDANGNPITGAAVVFSSANGATILSSTMNTGVNGVASTLLTH 1027 D +T + ++ N + + +AN S+ + S + Sbjct: 849 KRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANKYEYYKSS-----QTIISWVQQT 903 Query: 1028 TVAGTSNVVATVDTVNAN 1045 S V +T D V N Sbjct: 904 AQDAKSGVASTYDLVKQN 921 Score = 54.3 bits (130), Expect = 9e-09 Identities = 85/438 (19%), Positives = 141/438 (32%), Gaps = 75/438 (17%) Query: 2150 VADGIQTNRVEARVYDVSDNLVPNSNVVFSADNGGQLVQNDVQTDALGSAYVTVSNINTG 2209 V G +V AR YD + N N + + + GQ+V G Sbjct: 518 VQGGSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQV------------------G 559 Query: 2210 VTKVTVTADGVSASTTTTFIADRDTATLVTDRFLITHDNAVANGVVENRVLLHLVDANDN 2269 VT T A T IT+ V N Sbjct: 560 VTDFTADKTSAKADGTEA----------------ITYTATVKK--------------NGV 589 Query: 2270 SVSGVEVNFSATNG-ASINA-SAITDINGFAIGVLTNTLSGPSDVTVTLVTPGGTESLTV 2327 + + V V+F+ +G A ++A SA T+ +G A L + P V V+ T T +L Sbjct: 590 AQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSD--KPGQVVVSAKTAEMTSALNA 647 Query: 2328 TPQFIADINTANIATGDFVIIDDGAVANSVDANEVRARVTDNQGNAIAGYSVVFSSQNGA 2387 D A+I AVAN DA +V ++ V F++ G Sbjct: 648 NAVIFVDQTKASITE--IKADKTTAVANGQDAITYTVKVMKG-DKPVSNQEVTFTTTLGK 704 Query: 2388 TITTSGITGVDGWASAKLTHIKAGESGILARLSRPMATVHTLMPYFIADVSTATLQLFNF 2447 ++ T +G+A LT G+S + AR+S V F ++ Sbjct: 705 LSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDD------ 758 Query: 2448 NPIPIIADGVMQFFVLGRV-FDANQNPVGGQQVAFSATNEVTLTESNGSISTPEGSVLLS 2506 I I+ GV + + G + T +N +I++ + S Sbjct: 759 GNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKY------TWRSANPAIASVDASS-GQ 811 Query: 2507 VTSTQAGVHPITGTLVSNNYTDTFGAAFIANKNTAQLSTLMVVDNNALADGVTRNQVRAH 2566 VT + G I+ N A + + + + D V + Sbjct: 812 VTLKEKGTTTISVISSDNQT-----ATYTIATPNSLI-VPNMSKRVTYNDAVNTCKNFGG 865 Query: 2567 VVDSTGNSVADMAVTFTA 2584 + S+ N + ++ + A Sbjct: 866 KLPSSQNELENVFKAWGA 883 Score = 50.5 bits (120), Expect = 1e-07 Identities = 93/491 (18%), Positives = 162/491 (32%), Gaps = 61/491 (12%) Query: 1900 LTKATLTHTLAGTSVVTARVGNRVQSKDTTFIADRTTATIRASDLTITRNNALADGVATN 1959 + + H + GT T ++ V+SK + +R+ G + Sbjct: 453 ILSLNIPHDINGTERSTQKIQLIVKSKYGLDRIVWDDSALRS-----------QGGQIQH 501 Query: 1960 AARVIVTDANGNPVPSMFVGYTSDNGALLTPTSGMTDSSGTFSTTFTHTTAGISKVTAAI 2019 + D ++ Y + T+ D +G S T I Sbjct: 502 SGSQSAQDYQ-----AILPAYVQGGSNVYKVTARAYDRNGNSSNNVLLT----------I 546 Query: 2020 VTMGISQTKDAVFIADRSTAHVSELIVVKNDSLANNSDRNIVQAHIKDAHGNVVTGMNVN 2079 + Q D V + D + K + A+ ++ A +K +G + V+ Sbjct: 547 TVLSNGQVVDQVGVTDFTAD--------KTSAKADGTEAITYTATVKK-NGVAQANVPVS 597 Query: 2080 FSATENV-TLTANTVTTNSQGYAENTLRHNAPVTSAVTATVATDLVGL-TEDVRFVAGAG 2137 F+ L+AN+ TN G A TL+ + P V+A A L V FV Sbjct: 598 FNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTK 657 Query: 2138 ARI-ELFRLNDGAVADGIQTNRVEARVYDVSDNLVPNSNVVFSADNGGQLVQNDVQTDAL 2196 A I E+ AVA+G +V D V N V F+ G+L + +TD Sbjct: 658 ASITEIKADKTTAVANGQDAITYTVKV-MKGDKPVSNQEVTFTTT-LGKLSNSTEKTDTN 715 Query: 2197 GSAYVTVSNINTGVTKVTVTADGVSASTTTTFIADRDTATLVTDRFLITHDNAVANGVVE 2256 G A VT+++ G + V+ V+ + T T+ V GV Sbjct: 716 GYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNI-----EIVGTGVKG 770 Query: 2257 NRVLLHLVDANDN-SVSGVEVNFSATNGASINASAITDINGFAIGVLTNTLSGPSDVTVT 2315 + L N SG ++ ++ A A D + + TL T++ Sbjct: 771 KLPTVWLQYGQVNLKASGGNGKYTWR--SANPAIASVDASSGQV-----TLKEKGTTTIS 823 Query: 2316 LVTPGGTESLTVTPQFIADINTANIATGDFVIIDDGAVANSVDANEVRARVTDNQGNAIA 2375 V ++ T T I T N + + ++V+ + + N + Sbjct: 824 -VISSDNQTATYT------IATPN-SLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELE 875 Query: 2376 GYSVVFSSQNG 2386 + + N Sbjct: 876 NVFKAWGAANK 886
>PF05860#haemagglutination activity domain. Length = 117 Score = 59.0 bits (143), Expect = 4e-13 Identities = 20/128 (15%), Positives = 42/128 (32%), Gaps = 18/128 (14%) Query: 53 TPPSTCRALTSYCIGMTETVVNIQAPDENGLSHNKYSKFDVVANGLFDVTTLNNRLAQEV 112 TP +T ++ ++ + L H+ + +F V +G Sbjct: 4 TPDTTLPINSNITTEGNTRIIERGTQAGSNLFHS-FQEFSVPTSGTA------------- 49 Query: 113 NGNSFLQDKSATIILNEVNSSHASLLDGNLRVDGGNAHIIIANPAGINCRGCSFTNASHV 172 F + I++ V S +DG +R A++ + NP GI + + Sbjct: 50 ---FFNNPTNIQNIISRVTGGSVSNIDGLIRA-NATANLFLINPNGIIFGQNARLDIGGS 105 Query: 173 TLTTGTPS 180 + + Sbjct: 106 FVGSTANR 113
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 44.9 bits (106), Expect = 4e-07 Identities = 33/157 (21%), Positives = 68/157 (43%), Gaps = 7/157 (4%) Query: 59 FNFIMPAMLTDLGLSMSDVGILGTLFYITYGCSKFVSGMISDRSNPRYFMGIGLVMTGII 118 N +P + D + + T F +T+ V G +SD+ + + G+++ Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFG 92 Query: 119 NILFGMSSSLLVLGALWILNAFFQGWG---WPPCSKILTSWY-SRSERGGWWAIWNTSHN 174 +++ + S +L I+ F QG G +P ++ + Y + RG + + + Sbjct: 93 SVIGFVGHSFF---SLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVA 149 Query: 175 FGGALIPLLVGVITLHFSWRYGMIIPGIIGVVIGLLM 211 G + P + G+I + W Y ++IP I + + LM Sbjct: 150 MGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLM 186
>PF06580#Sensor histidine kinase Length = 349 Score = 36.8 bits (85), Expect = 1e-04 Identities = 17/85 (20%), Positives = 33/85 (38%), Gaps = 10/85 (11%) Query: 426 VTNAYRHGAASR-----IEINARQDNQQIYLTISDNGK-GIDLASITPGYGLRGIQSRVS 479 V N +HG A I + +DN + L + + G + + G GL+ ++ R+ Sbjct: 264 VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQ 323 Query: 480 A-FGGNVSLSV---DNGTCLNVTLP 500 +G + + V +P Sbjct: 324 MLYGTEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 60.6 bits (147), Expect = 3e-13 Identities = 34/173 (19%), Positives = 63/173 (36%), Gaps = 20/173 (11%) Query: 4 RVVFIDDHDIVRSGFAQLLSLEEDIQVVGEFSSAKQARAGLPGLQANICICDISMPDENG 63 ++ DD +R+ Q LS V S+A + ++ + D+ MPDEN Sbjct: 5 TILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 64 LDLLKGLPS---GMGVIMLSMHDSPALVETALERGARGFLSKRCKPEDLISAVRTVGSGG 120 DLL + + V+++S ++ A E+GA +L K +LI + Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR----- 117 Query: 121 VYLMPEIAQQLARVAVDPLTRREREIAVLLAEG---MEVREIAESLGLSPKTV 170 A + L ++ L+ E+ + L + T+ Sbjct: 118 -------ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 57.6 bits (139), Expect = 1e-11 Identities = 41/193 (21%), Positives = 83/193 (43%), Gaps = 7/193 (3%) Query: 47 LNPQKVVILNPSVLDNADALHIKVAGVPQTSTHLPAFLSKYSGPE-YMNTGTLFEPDYEA 105 ++P ++V L ++ AL I GV T + ++S+ P+ ++ G EP+ E Sbjct: 33 IDPNRIVALEWLPVELLLALGIVPYGVADTINY-RLWVSEPPLPDSVIDVGLRTEPNLEL 91 Query: 106 LSQAKPDLIIAGGRAQDAYNKLSAIAPTIALDVDTQHFTQSLTQRT-EQLASIFGKEEEA 164 L++ KP ++ + L+ IAP + ++ +++ ++A + + A Sbjct: 92 LTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNLQSAA 151 Query: 165 KTLLGNFSSQVNAIKQKSANAGS---AMVLMISGGKMSAYTPGSRFGFIFDELGFTPAAT 221 +T L + + ++K + G+ + +I M + P S F I DE G A Sbjct: 152 ETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGIPNAWQ 211 Query: 222 FAESGRHGNVVTS 234 E+ G+ S Sbjct: 212 -GETNFWGSTAVS 223
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 59.0 bits (142), Expect = 9e-13 Identities = 45/204 (22%), Positives = 100/204 (49%), Gaps = 11/204 (5%) Query: 18 QFPPLRKVRQVAPSAADQTLDPAEYQKQLMAGFQEGISQGFDKGLAEGKEEGYQEGVRLG 77 +F P+ + + A+ +L+ Q Q+ A QG+ G+AEG+++G+++G + G Sbjct: 21 EFVPIVEPEETIIEEAEPSLEQQLAQLQMQAH-----EQGYQAGIAEGRQQGHKQGYQEG 75 Query: 78 HDDGLKKGRIEGRQSELASFNDVIKPFSGYITQLHTYLETYEQRRRDELLQLVEKVTRQV 137 GL++G E +S+ A + ++ +++ T L+ + L+Q+ + RQV Sbjct: 76 LAQGLEQGLAEA-KSQQAPIHARMQQL---VSEFQTTLDALDSVIASRLMQMALEAARQV 131 Query: 138 IRCELALQPAQLLTLVEEALAALPMVPQQLKVYLNPAEFGRINDV--APEKVQAWGLAAD 195 I + + L+ +++ L P+ + ++ ++P + R++D+ A + W L D Sbjct: 132 IGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGD 191 Query: 196 PDMVGGECRIVTETTEIDVGCQHR 219 P + G C++ + ++D R Sbjct: 192 PTLHPGGCKVSADEGDLDASVATR 215
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 172 bits (438), Expect = 3e-53 Identities = 85/334 (25%), Positives = 165/334 (49%), Gaps = 2/334 (0%) Query: 15 KSDTKGRSRLEQASILLLSIGEEAAAMVMQQLSREEVVCVSQMMSRLHNIKLDQARQALD 74 D + ++A+ILL+SIG E ++ V + LS+EE+ ++ +++L I + L Sbjct: 9 ILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLL 68 Query: 75 DFFQDYREQSGINGASRSYLQAILNKALGNDIAKSVINGIYGDEIRHRMTRLQWVDTPQL 134 +F + Q I Y + +L K+LG A +IN + ++ D + Sbjct: 69 EFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANI 128 Query: 135 VALIDQEHLQLQAVFLAFLPPDVAAAVLAYLDKDRQDDILYRIAKLDDVNRDVVDEL-DR 193 + I QEH Q A+ L++L P A+ +L+ L + Q ++ RIA +D + +VV E+ Sbjct: 129 LNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERV 188 Query: 194 LIERGVAVLSEHGSKVIGIKQAANIVNRIPGNQQQ-LLDQLGERDEEVLNELKDEMYEFF 252 L ++ ++ SE + G+ I+N ++ +++ L E D E+ E+K +M+ F Sbjct: 189 LEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFE 248 Query: 253 ILSRQSEATLQRLMDLIPMSDWAIALKGTEPALRQAIYDVLPKRQIQQLQNATQRTGAVP 312 + + ++QR++ I + A ALK + +++ I+ + KR L+ + G Sbjct: 249 DIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTR 308 Query: 313 VSRVEHIRKVIMAQVRELAEAGEIQVQLFAEQTM 346 VE ++ I++ +R+L E GEI + E+ + Sbjct: 309 RKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDV 342
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 283 bits (724), Expect = 1e-90 Identities = 154/565 (27%), Positives = 258/565 (45%), Gaps = 62/565 (10%) Query: 12 GQLGENTKTILMSAVALLVTAAIIFSLWRSSQGYTALFGSQENIPITQVVEVLEGEAIAY 71 +L N + L+ A + V + LW + Y LF + + +V L I Y Sbjct: 17 NRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPY 76 Query: 72 RINPDNGQVLVAENQLGKARILLAAKGITATLPIGYELMDKESMLGSSQFIQNVRYKRSL 131 R +G + V +++ + R+ LA +G+ +G+EL+D+E G SQF + V Y+R+L Sbjct: 77 RFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEK-FGISQFSEQVNYQRAL 135 Query: 132 EGELAQSMMALSAVEYARVHLGMSEASSFAISNHADNSASVVLRLRYGQTLSTEQVGAIV 191 EGELA+++ L V+ ARVHL M + S F + SASV + L G+ L Q+ A+V Sbjct: 136 EGELARTIETLGPVKSARVHLAMPKPSLF-VREQKSPSASVTVTLEPGRALDEGQISAVV 194 Query: 192 QLVAGSIPGMKPANVRVVDQHGELLSQAYQANSEGVPSVKSGTELAHYLQSTTEKNIANL 251 LV+ ++ G+ P NV +VDQ G LL+ Q+N+ G + + A+ ++S ++ I + Sbjct: 195 HLVSSAVAGLPPGNVTLVDQSGHLLT---QSNTSGRDLNDAQLKFANDVESRIQRRIEAI 251 Query: 252 LNSVIGANNYRISVSTQLDMSRIEETAEHYGPDPRIN------DENIQQENSNDDMAMGI 305 L+ ++G N V+ QLD + E+T EHY P+ + + E G+ Sbjct: 252 LSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGV 311 Query: 306 PGSLSNQPIPQSQAGQTPAAVSRSQAQ------------------------RKYIYDRNI 341 PG+LSNQP P ++A ++ AQ Y DR I Sbjct: 312 PGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTI 371 Query: 342 RHVRYPGYKLEKMTVAVVLN-KSLPVL--EQWTPEQQEELKRLIEDAAGIDVKRGDSLTI 398 RH + +E+++VAVV+N K+L T +Q ++++ L +A G KRGD+L + Sbjct: 372 RHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDKRGDTLNV 431 Query: 399 NMMAFAVP-TLIDEPVMPWWQEPSTFRWAELLGIGLLSLLVLW----FGVRPLMKRYSRK 453 F+ E +P+WQ+ S G LL L+V W VRP + R + Sbjct: 432 VNSPFSAVDNTGGE--LPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRVEE 489 Query: 454 GSENLPLAISSASADEALDHVDTGVDGAESSPRTENAFSASSLWKSDDLPEQGSGLETKI 513 + E + V+ + E Q G E Sbjct: 490 AK---AAQEQAQVRQETEEAVEVRLSKDEQL--------------QQRRANQRLGAEVMS 532 Query: 514 AHLQQLAQSETERTAEVIKQWINSN 538 +++++ ++ A VI+QW++++ Sbjct: 533 QRIREMSDNDPRVVALVIRQWMSND 557
>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature. Length = 103 Score = 44.3 bits (104), Expect = 5e-09 Identities = 23/73 (31%), Positives = 35/73 (47%), Gaps = 1/73 (1%) Query: 53 NNLSFSQVLNGAIKSVDQLQHVASEKQTAMDMGISD-DLTGTMLASQKASVAFSAMVQVR 111 +SF+ L+ A+ + Q A + +G L M QKASV+ +QVR Sbjct: 29 PTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVR 88 Query: 112 NKLTSALDDVMNT 124 NKL +A +VM+ Sbjct: 89 NKLVAAYQEVMSM 101
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 375 bits (965), Expect = e-130 Identities = 127/345 (36%), Positives = 186/345 (53%), Gaps = 22/345 (6%) Query: 14 HGFVANAPSSVSVFSLARRVAEFNVPVLVTGETGTGKECVAKYIHQKAMGDASPYIAVNC 73 V + + ++ + R+ + ++ +++TGE+GTGKE VA+ +H P++A+N Sbjct: 137 MPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINM 196 Query: 74 AAIPESMLEAILFGYEKGAFTGAIASVAGKFEQANGGTLLLDEIGDMPLALQVKLLRVLQ 133 AAIP ++E+ LFG+EKGAFTGA G+FEQA GGTL LDEIGDMP+ Q +LLRVLQ Sbjct: 197 AAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQ 256 Query: 134 EQEVERLGGHKPIPLDIRIIASTNKDLSVEIAEGRFRQDLYYRLSVVPIHILPLRERPED 193 + E +GG PI D+RI+A+TNKDL I +G FR+DLYYRL+VVP+ + PLR+R ED Sbjct: 257 QGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAED 316 Query: 194 ILPLVKAFINKYQSFLNVKIDITAEAQCELYKYTWPGNVRELENVIQRGIIMSNNGVI-- 251 I LV+ F+ + + EA + + WPGNVRELEN+++R + VI Sbjct: 317 IPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITR 376 Query: 252 ---------ELPSLGLPMAQGISSPVGETSLPF--------STIQPPDGENNIKLRGRLA 294 E+P + A S + + S Sbjct: 377 EIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEM 436 Query: 295 QYQYIVDLLQRHQGNKSKTAAFLGITPRALRYRLANMREDGIDIE 339 +Y I+ L +GN+ K A LG+ LR + +RE G+ + Sbjct: 437 EYPLILAALTATRGNQIKAADLLGLNRNTLRKK---IRELGVSVY 478
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 72.6 bits (178), Expect = 2e-19 Identities = 35/77 (45%), Positives = 50/77 (64%) Query: 54 RKMSLFSRIPVTLTLEVASVEIPLSELLTVNNDSVIELDKLAGEPLDIQVNGIMFGQAEV 113 + + L IPV LT+E+ + + ELL + SV+ LD LAGEPLDI +NG + Q EV Sbjct: 52 QDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEV 111 Query: 114 VVINEKYGLRIININSQ 130 VV+ +KYG+RI +I + Sbjct: 112 VVVADKYGVRITDIITP 128
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 219 bits (559), Expect = 1e-73 Identities = 111/236 (47%), Positives = 155/236 (65%), Gaps = 4/236 (1%) Query: 19 LVGGLLYSPLLLAQEGGITLFNTVQTATGQDYNVKIEILILMTLLGLLPIMMLMMTCFTR 78 V L +PL AQ GIT + GQ +++ ++ L+ +T L +P ++LMMT FTR Sbjct: 9 PVLLWLITPLAFAQLPGIT--SQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMMTSFTR 66 Query: 79 FIIVLAILRQALGLQQSPPNKVLTGIALALTLLVMRPVWTKIHQDAVIPFQQDEITLSQA 138 IIV +LR ALG +PPN+VL G+AL LT +M PV KI+ DA PF +++I++ +A Sbjct: 67 IIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKISMQEA 126 Query: 139 LGRAEAPLKNYMLAQTSTKSLDQMMAIA--QVSGEPQQQDLSVVTPAYVLSELKTAFQMG 196 L + PL+ +ML QT L +A P+ + ++ PAYV SELKTAFQ+G Sbjct: 127 LEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKTAFQIG 186 Query: 197 FMIYIPFLVIDLIVASILMAMGMMMLSPLIVSLPFKLMLFVLCDGWTLMVGTLTAS 252 F I+IPFL+IDL++AS+LMA+GMMM+ P ++LPFKLMLFVL DGW L+VG+L S Sbjct: 187 FTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQS 242
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 219 bits (560), Expect = 4e-66 Identities = 115/462 (24%), Positives = 215/462 (46%), Gaps = 48/462 (10%) Query: 12 KRRTFAIISHPDAGKTTITEKVLLFGHAIQTAGTVKGRGSSHHAKSDWMEMEKQRGISIT 71 K +++H DAGKTT+TE +L AI G+V ++D +E+QRGI+I Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGT----TRTDNTLLERQRGITIQ 57 Query: 72 TSVMQFPYGGCLVNLLDTPGHEDFSEDTYRTLTAVDCCLMVIDAAKGVEDRTRKLMEVTR 131 T + F + VN++DTPGH DF + YR+L+ +D +++I A GV+ +TR L R Sbjct: 58 TGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALR 117 Query: 132 LRDTPILTFMNKLDREIRDPMEVLDEVERELNIACSPITWPIGCGKSFKGVYHLHKDETY 191 P + F+NK+D+ D V +++ +L+ K + Sbjct: 118 KMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVI------------------KQKVE 159 Query: 192 LYQSGKGHTIQEVRIVKGLNNPDLDVAVGEDLAKQFRQELELVQGASHEFDHEAFLSGDL 251 LY + E + + +DL +++ L + + F + L Sbjct: 160 LYPNMCVTNFTESEQWDTVIEGN------DDLLEKYMSGKSLEALELEQEESIRFHNCSL 213 Query: 252 TPVFFGTALGNFGVDHMLDGLVEWAPAPMPRKTDTRVVVASEEKFTGFVFKIQANMDPKH 311 PV+ G+A N G+D++++ + + R + + G VFKI+ K Sbjct: 214 FPVYHGSAKNNIGIDNLIEVITNKFYSSTHR---------GQSELCGKVFKIE--YSEK- 261 Query: 312 RDRVAFMRVVSGRFEKGMKLRQVRTKKDVVISDALTFMAGDRSHVEEAYAGDIIGLHNHG 371 R R+A++R+ SG +R + K+ + I++ T + G+ +++AY+G+I+ L N Sbjct: 262 RQRLAYIRLYSGVLHLRDSVR-ISEKEKIKITEMYTSINGELCKIDKAYSGEIVILQNEF 320 Query: 372 ---TIQIGDTFTQGEDMKFTGIPNFAPELFRRIRLRDPLKQKQLLKGLVQLSEEG-AVQV 427 +GDT + + I N P L + P +++ LL L+++S+ ++ Sbjct: 321 LKLNSVLGDTKLLPQRER---IENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRY 377 Query: 428 FRPLSNNDLIVGAVGVLQFEVVSSRLKSEYNVEAVYESVNVS 469 + + +++I+ +G +Q EV + L+ +Y+VE + V Sbjct: 378 YVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVI 419
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 47.2 bits (112), Expect = 5e-10 Identities = 22/80 (27%), Positives = 33/80 (41%), Gaps = 1/80 (1%) Query: 18 DEATLFNIAIDPQYQRQGYGRLLLEHLIEQLEARNIVTLWLEVRASNARAIALYESLGFN 77 A + +IA+ Y+++G G LL IE + + L LE + N A Y F Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147 Query: 78 EVSVRRNYYPS-ANGREDAI 96 +V Y + E AI Sbjct: 148 IGAVDTMLYSNFPTANEIAI 167
>PF04183#IucA / IucC family Length = 580 Score = 27.9 bits (62), Expect = 0.017 Identities = 8/38 (21%), Positives = 14/38 (36%), Gaps = 2/38 (5%) Query: 32 HLPEDTRLLIVA--QQLPEHGDPLLCDVLRSLGLTPHQ 67 L D +++A + E+ PL + GL Sbjct: 358 WLKPDESPVLMATLMECDENNQPLAGAYIDRSGLDAET 395
>ANTHRAXTOXNA#Anthrax toxin LF subunit signature. Length = 800 Score = 31.3 bits (70), Expect = 0.006 Identities = 25/105 (23%), Positives = 40/105 (38%), Gaps = 18/105 (17%) Query: 106 WGTSGSSTVLVNAANFTAENLTIRNDFDFPANQAKAEGDPTKLKDTQAVALLLAEKSDKA 165 + S S + VNA N I+ + N+ + E K KD+ + ++ Sbjct: 21 FAISSSQAIEVNAMNEHYTESDIKRNHKTEKNKTEKE----KFKDSINNLVKTEFTNETL 76 Query: 166 RFRQVKLEGYQDTL----------YSKTGSRSYFTDCDISGHVDF 200 K++ QD L YS+ G YFTD D+ H + Sbjct: 77 ----DKIQQTQDLLKKIPKDVLEIYSELGGEIYFTDIDLVEHKEL 117
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 36.2 bits (83), Expect = 7e-04 Identities = 61/300 (20%), Positives = 99/300 (33%), Gaps = 41/300 (13%) Query: 18 AQRVKAALESREDVHHAEVNVHYAKVTGEADTHALIETIKQTGYQATEAQTPDVELHLSG 77 A S+++ E N A ET Q A EA+ +V+ + Sbjct: 1035 ETTETVAENSKQESKTVEKNEQDAT-----------ETTAQNREVAKEAK-SNVKANTQT 1082 Query: 78 LSCGHCTETVRKALEAVSGVISADVTLESANVYGKADIQTLIAAVEQAGYHATQQGIDSP 137 ++ + +A V E KA ++T ++ +Q SP Sbjct: 1083 NEVAQSGSETKETQTTETK-ETATVEKE-----EKAKVET--EKTQEVPKVTSQV---SP 1131 Query: 138 KTEPLTHSAQSQPESLAAAPNTVPATNVALATS-TVSDTNTVLPTNTTSTTS----TADT 192 K E S QP++ A N P N+ S T + +T P TS+ T T Sbjct: 1132 KQE---QSETVQPQAEPAREN-DPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTEST 1187 Query: 193 ASATSTAPVINPLPVTESVAQPAA-SEGESVQLLLTGMSCASCVSKVQNALQRVDGVQVA 251 T + V NP T + QP SE + S S V+ A + Sbjct: 1188 TVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPA--TTSSNDRS 1245 Query: 252 RVNLAERSALVTGTQNNEALIAAVKNAGYGAEIIEDEGERRERQQQ------MSQASMKR 305 V L + ++ T ++A A A + + + E + +S SM + Sbjct: 1246 TVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQYNVWVSNTSMNK 1305
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 115 bits (290), Expect = 9e-34 Identities = 37/119 (31%), Positives = 54/119 (45%), Gaps = 4/119 (3%) Query: 50 EEQARLQMQELQKNNIVYFGFDKYDIGSDFAQMLDAHAAFLRSN--PSDKVVVEGHADER 107 +Q + + V F F+K + + LD + L + VVV G+ D Sbjct: 205 APAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRI 264 Query: 108 GTPEYNIALGERRASAVKMYLQGKGVSADQISIVSYGKEKPAVLGHDEAAFAKNRRAVL 166 G+ YN L ERRA +V YL KG+ AD+IS G+ P V G+ K R A++ Sbjct: 265 GSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNP-VTGNTCDN-VKQRAALI 321
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 30.3 bits (68), Expect = 0.005 Identities = 21/112 (18%), Positives = 40/112 (35%), Gaps = 10/112 (8%) Query: 152 YNVAVSLALEKKQYDQAITAFQSFVKQYPKSTYQPNANYWLGQLYYNKGKKDDAAYYYAV 211 Y++A + + +Y+ A FQ+ Y LG G+ D A + Y+ Sbjct: 40 YSLAFN-QYQSGKYEDAHKVFQALCVLDH---YDSRFFLGLGACRQAMGQYDLAIHSYSY 95 Query: 212 VVKNYPKSPKSSEAMFKVGVIMQDKGQSDKAKA---VYQQVIKQYPNTDAAK 260 K P+ F + KG+ +A++ + Q++I Sbjct: 96 GAIMDIKEPRFP---FHAAECLLQKGELAEAESGLFLAQELIADKTEFKELS 144
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.7 bits (66), Expect = 0.015 Identities = 9/18 (50%), Positives = 12/18 (66%) Query: 48 LVLLGPSGAGKSSLLRVL 65 +VL G G GKS+L+ L Sbjct: 599 VVLEGTGGIGKSTLINTL 616
>PF04183#IucA / IucC family Length = 580 Score = 29.8 bits (67), Expect = 0.007 Identities = 14/65 (21%), Positives = 23/65 (35%), Gaps = 9/65 (13%) Query: 54 IQQIGGQQGLPDDNLSAQFRPYLSQSLYNDIQA--ARKQASNRTPAQVNKTQMISGDIFT 111 + Q+ + D + A+ L +L D+Q AR+ S +N D Sbjct: 78 LMQLKQVLSMSDATV-AEHMQDLYATLLGDLQLLKARRGLSASDLINLN------ADRLQ 130 Query: 112 SLREG 116 L G Sbjct: 131 CLLSG 135
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 76.4 bits (188), Expect = 9e-18 Identities = 71/366 (19%), Positives = 126/366 (34%), Gaps = 73/366 (19%) Query: 1 MKVLVTGATSGLGRNAVEYLRRQEISVIA---------TGRNQAMGALLTKLGAKFIHAD 51 MK LVTGA +G + + L V+ QA LL + G +F D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 52 LTDLVSSQAKAMLADVDTLWHCS-------SFTSPWGTEQAFALANVRATRRLGEWAAAY 104 L D + ++ S +P A+A +N+ + E Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPH----AYADSNLTGFLNILEGCRHN 116 Query: 105 GVENFIHISSPAIYFDYHHHRNIQEDFRPVRFANEFARSKAAGEEVIKLLALSNPQTH-- 162 +++ ++ SS ++Y + D + +A +K A E L+A + + Sbjct: 117 KIQHLLYASSSSVYGL-NRKMPFSTDDSVDHPVSLYAATKKANE----LMAHTYSHLYGL 171 Query: 163 -FTILRPQGLFGPHDK--VMLPRLLHMIKHYGTLLLPRGGDALVDMTYLENAVHAM---- 215 T LR ++GP + + L + + ++ + G D TY+++ A+ Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231 Query: 216 ---------WLATQSQKTLS---GRAYNITNQQPRPLRTIVQQLLDALDMKCRIRSVPYP 263 W S R YNI N P L +Q L DAL ++ + +P Sbjct: 232 DVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQ 291 Query: 264 MMDIMARAMEKMSNKAEKEPVLTHYAVAKLNFDLTLDTLRAEQELGYRPIISLDEGILRT 323 D VL A DT + +G+ P ++ +G+ Sbjct: 292 PGD-----------------VLETSA----------DTKALYEVIGFTPETTVKDGVKNF 324 Query: 324 ARWLKE 329 W ++ Sbjct: 325 VNWYRD 330
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 36.3 bits (84), Expect = 2e-04 Identities = 13/26 (50%), Positives = 18/26 (69%) Query: 5 RILVLGASGYIGQHLVPLLSQQGHQV 30 + LV GA+G+IG H+ L + GHQV Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQV 27
>FLAGELLIN#Flagellin signature. Length = 507 Score = 106 bits (266), Expect = 6e-29 Identities = 85/216 (39%), Positives = 113/216 (52%), Gaps = 8/216 (3%) Query: 9 VTIDINLQKIDSKSLGLGSYSVSGVSGALTSLTDTSVTGVTTTTALDFSDISTFAKGATV 68 V+ IN +K+ + + + + + L S + + V D + AK + + Sbjct: 299 VSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDL 358 Query: 69 HGIGDVGTDGAYADGYVIRTTDGKQYKGEVDATNGKVTFADDANGDPIDDATKLEAAAQF 128 G T +G +Y + + L Sbjct: 359 -------EANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAA 411 Query: 129 SPAGKATASPLETLDDAIKQVDGLRSSLGAVQNRFESAVTNLNNTVTNLTSARSRIEDAD 188 + PL ++D A+ +VD +RSSLGA+QNRF+SA+TNL NTVTNL SARSRIEDAD Sbjct: 412 AAKKSTAN-PLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDAD 470 Query: 189 YATEVSNMSRAQILQQAGTSVLSQANQVPQTVLSLL 224 YATEVSNMS+AQILQQAGTSVL+QANQVPQ VLSLL Sbjct: 471 YATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLL 506
>cloacin#Cloacin signature. Length = 551 Score = 30.5 bits (68), Expect = 0.006 Identities = 33/146 (22%), Positives = 54/146 (36%), Gaps = 29/146 (19%) Query: 56 QLLRRIDHSESQQQEWQ------------EKAELALRKDKEDLARAALIEKQ-KVMTLVE 102 Q+ +R D +QQEW E+A L + ED+AR E+Q K + + Sbjct: 295 QVKQRQDEENRRQQEWDATHPVEAAERNYERARAELNQANEDVAR--NQERQAKAVQVYN 352 Query: 103 TLKREVATVDETLSRMKHEITELENKLTETRA--------------RQQALTLRHQAASS 148 + K E+ ++TL+ EI + + A R Q QAA Sbjct: 353 SRKSELDAANKTLADAIAEIKQFNRFAHDPMAGGHRMWQMAGLKAQRAQTDVNNKQAAFD 412 Query: 149 SRDVRRQLDSGKLDEAMARFEQFERR 174 + + L AM ++ E + Sbjct: 413 AAAKEKSDADAALSSAMESRKKKEDK 438
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 346 bits (890), Expect = e-119 Identities = 124/344 (36%), Positives = 176/344 (51%), Gaps = 19/344 (5%) Query: 3 EQLDNLLGEANAFVDVLEQVSGLAKLNKPVLVIGERGTGKELIAHRLHYLSERWQGPFIS 62 + L+G + A ++ ++ L + + +++ GE GTGKEL+A LH +R GPF++ Sbjct: 134 QDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVA 193 Query: 63 LNCAALNENLLDSELFGHEAGAFTGAQKRHLGRFERADGGTLFLDELATAPMLVQEKLLR 122 +N AA+ +L++SELFGHE GAFTGAQ R GRFE+A+GGTLFLDE+ PM Q +LLR Sbjct: 194 INMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLR 253 Query: 123 VIEYGHLERVGGSQPLQVDVRLVCATNDNLPALAAAGKFRADLLDRLAFDVVQLPPLRER 182 V++ G VGG P++ DVR+V ATN +L G FR DL RL ++LPPLR+R Sbjct: 254 VLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDR 313 Query: 183 QQDIMLLAEHFAILMCRELGLPLFSGFTATAKEQLLEYRWPGNVRELKNVVERSV----- 237 +DI L HF +E F A E + + WPGNVREL+N+V R Sbjct: 314 AEDIPDLVRHFVQQAEKEGLDVK--RFDQEALELMKAHPWPGNVRELENLVRRLTALYPQ 371 Query: 238 -----------YRHSDSSLPLNNIIINPFASNQKGEIEGVDTPNEGGAVLPALPVD-LKH 285 R P+ + + +E P Sbjct: 372 DVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDR 431 Query: 286 WLHTSEHQMLTRALKQARFNQRKAAHLLGLTYHQLRGLLKKHTI 329 L E+ ++ AL R NQ KAA LLGL + LR +++ + Sbjct: 432 VLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475
>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature. Length = 171 Score = 31.5 bits (71), Expect = 0.002 Identities = 15/46 (32%), Positives = 25/46 (54%), Gaps = 3/46 (6%) Query: 144 LLLAIIVVAFVGPS-LEHAMFAVWLALLPRMVRTIYSAVHDELDKE 188 LL+ II + +GP L A +A R +R++ + V +EL +E Sbjct: 10 LLVFIIGLVVLGPQRLPVA--VKTVAGWIRALRSLATTVQNELTQE 53
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.3 bits (71), Expect = 0.006 Identities = 9/16 (56%), Positives = 14/16 (87%) Query: 38 LVGESGSGKSLIAKAI 53 + GESG+GK L+A+A+ Sbjct: 165 ITGESGTGKELVARAL 180
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 118 bits (298), Expect = 3e-34 Identities = 44/129 (34%), Positives = 72/129 (55%), Gaps = 2/129 (1%) Query: 2 LLYQGVSRFDFSAGQL-NDSSINHNPAIVQGAYHYGLGNTYTLYGGAQVAENYRSVAIGN 60 L +G +R+ +AG+ + ++ P Q +GL +T+YGG Q+A+ YR+ G Sbjct: 368 LQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGI 427 Query: 61 AFNT-PLGGVSMDITHAKSELAGDRRSSGNSYKIDYSKYVGETDTNLTLAAYRYSSGGYY 119 N LG +S+D+T A S L D + G S + Y+K + E+ TN+ L YRYS+ GY+ Sbjct: 428 GKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYF 487 Query: 120 SFREASLDR 128 +F + + R Sbjct: 488 NFADTTYSR 496
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 294 bits (754), Expect = 5e-95 Identities = 106/323 (32%), Positives = 160/323 (49%), Gaps = 21/323 (6%) Query: 4 VEFNADFIHGGG---VDVMRFMHENPVAPGVYDVTVIINGKNRGKHRIRFELSEGESTAE 60 + FN F+ D+ RF + + PG Y V + +N + F + E Sbjct: 47 LYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIV 106 Query: 61 PCFTLEQLDSIGLKIETSDTDLLVNGKAAPKDQCYNLRALIKDSHVNYNSGDLELSLTVP 120 PC T QL S+GL +T + D C L ++I D+ + G L+LT+P Sbjct: 107 PCLTRAQLASMGL-----NTASVSGMNLLADDACVPLTSMIHDATAQLDVGQQRLNLTIP 161 Query: 121 QFNLVHHPRGYIDSSLWDAGGTVGFLDYNSNVYSIFNGRSNSDVGSDNSNSYNSNIGLSA 180 Q + + RGYI LWD G G L+YN + + +G ++ +Y + L + Sbjct: 162 QAFMSNRARGYIPPELWDPGINAGLLNYNFSGN-----SVQNRIGGNSHYAY---LNLQS 213 Query: 181 GINLGEWRFRKRLNTTWSNSSG-----MHTQNLYGYAATDITALKSQLTIGDTNTQGSLF 235 G+N+G WR R ++++S Q++ + DI L+S+LT+GD TQG +F Sbjct: 214 GLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGYTQGDIF 273 Query: 236 DSYALRGVLLASDTRMLPEGIRNYSPIVRGIAETNARVTVTQRGQIIYETVVTPGAFELT 295 D RG LASD MLP+ R ++P++ GIA A+VT+ Q G IY + V PG F + Sbjct: 274 DGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTIN 333 Query: 296 DIGTMSYGGDLQMTITESDGRTR 318 DI GDLQ+TI E+DG T+ Sbjct: 334 DIYAAGNSGDLQVTIKEADGSTQ 356
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 32.7 bits (74), Expect = 4e-04 Identities = 32/113 (28%), Positives = 49/113 (43%), Gaps = 18/113 (15%) Query: 1 MRKLNLAVCAVALSVISSTSYAAAGGTVTFNGKLIADTCQVDTASENITVTLPTLSIQSL 60 M+K+ V L + + + A +TF GKLI C V +N V + IQ+L Sbjct: 1 MKKIRGLCLPVMLGAVLMSQHVHAADNLTFKGKLIIPACTV----QNAEVNWGDIEIQNL 56 Query: 61 AVAEAQDGS--KDFEIKVLDCP-------ATLTQVGAHFNAIDSSGVNPATGN 104 Q G KDF + ++CP T+T G N+I + A+G+ Sbjct: 57 ----VQSGGNQKDFTVD-MNCPYSLGTMKVTITSNGQTGNSILVPNTSTASGD 104
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 32.9 bits (75), Expect = 0.007 Identities = 35/164 (21%), Positives = 57/164 (34%), Gaps = 32/164 (19%) Query: 576 DDIRAVMELPQRLEAR----------VIGQPHALMQLGENIMTARAGLSDPRKPLGVFML 625 I + P+R ++ ++G+ A+ ++ + AR +D L + M+ Sbjct: 113 GIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVL--ARLMQTD----LTL-MI 165 Query: 626 VGPSGVGKTETALAIAESMYGGEQNMITINMSEYQESHTVSSLKGSPPGYVGYGEGGVLT 685 G SG GK A A+ + + INM+ S L G E G T Sbjct: 166 TGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFT 217 Query: 686 EAVRRKPYSV-------VLLDEIEKAHSDVHELFFQVFDKGQME 722 A R + LDEI D +V +G+ Sbjct: 218 GAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYT 261
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 73.3 bits (180), Expect = 2e-17 Identities = 29/122 (23%), Positives = 56/122 (45%), Gaps = 5/122 (4%) Query: 1 MTKSHTILIVDDHPLMRRGIKQLLGLDSRFDVVAEANNGSDAITEAAKFQPDVILLDLNM 60 MT + TIL+ DD +R + Q L + +DV +N + A D+++ D+ M Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVM 57 Query: 61 KGMSGLDTLKALRHNGSDARIIILTV-SDARSDVYAMIDAGADGYLLKDCEPEILLENIR 119 + D L ++ D +++++ + + + A + GA YL K + L+ I Sbjct: 58 PDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKAS-EKGAYDYLPKPFDLTELIGIIG 116 Query: 120 QA 121 +A Sbjct: 117 RA 118
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1304 bits (3377), Expect = 0.0 Identities = 672/1034 (64%), Positives = 819/1034 (79%), Gaps = 2/1034 (0%) Query: 43 MANFFIDRPIFAWVLAIILCLTGALAISTLPVEQYPNLAPPNVRISASYPGASAQTLENT 102 MANFFI RPIFAWVLAIIL + GALAI LPV QYP +APP V +SA+YPGA AQT+++T Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 103 VTQVIEQSMTGLDNLLYMSSQSSNSGSASVTLTFQAGTNPNEAMQQVQNQLQSAIKRLPQ 162 VTQVIEQ+M G+DNL+YMSS S ++GS ++TLTFQ+GT+P+ A QVQN+LQ A LPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 163 DVQQQGVSVSKSGDNTLMMVAFVSTDGSMDKQDISDYVASNLQDPLSRIEGVGSVDAFGS 222 +VQQQG+SV KS + LM+ FVS + + DISDYVASN++D LSR+ GVG V FG+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 223 QYAMRIWLDPNKLTNYQLTTSDIVSAIQSQNTQVAVGQLGGTPAVDNQALNATINAQSQL 282 QYAMRIWLD + L Y+LT D+++ ++ QN Q+A GQLGGTPA+ Q LNA+I AQ++ Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 283 QTPEEFREITLRVNQDGSLVTLGDVAKIELGSEKYDYLSRFNGQAASGMGIKLASGANEL 342 + PEEF ++TLRVN DGS+V L DVA++ELG E Y+ ++R NG+ A+G+GIKLA+GAN L Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 343 QTDKRVKARLAELAPFFPHGLEAKIAYETTPFVQASIKDVVKTLLEAILLVFLVMYLFLQ 402 T K +KA+LAEL PFFP G++ Y+TTPFVQ SI +VVKTL EAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 403 NFRATLIPTVAVPVVLLGTFAVLSAFGFSINTLTMFAIVLAIGLLVDDAIVVVENVERVM 462 N RATLIPT+AVPVVLLGTFA+L+AFG+SINTLTMF +VLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 463 SEEGLDPREATRKSMGQIQGALIGIALVLSAVFIPMAFFGGTTGAIYRQFSITIVSAMVL 522 E+ L P+EAT KSM QIQGAL+GIA+VLSAVFIPMAFFGG+TGAIYRQFSITIVSAM L Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 523 SVLVALILTPAMCATLLKPIAPGHHHAKRGFFGWFNRMFDRNSHRYERGVARVLHHSLRY 582 SVLVALILTPA+CATLLKP++ HH K GFFGWFN FD + + Y V ++L + RY Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 583 MLLYLLLLGGLALLFLKLPTSFLPLEDRGVFMAQVQLPVGSTQQQTLKVVEKVENYFLTE 642 +L+Y L++ G+ +LFL+LP+SFLP ED+GVF+ +QLP G+TQ++T KV+++V +Y+L Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 643 EKNNVLSVFATVGSGPGGNGQNVARLFIRLADWDQRTASTDSSFAIIERATKELSKIVEA 702 EK NV SVF G G QN F+ L W++R +S+ A+I RA EL KI + Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 703 KVSVSSPPAISGLGGSSGFDMELQDHGGHGHDKLMVARNQLLQMASQEPA-LTRVRHNGL 761 V + PAI LG ++GFD EL D G GHD L ARNQLL MA+Q PA L VR NGL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 762 DDSPQLQIDIDQRKAQALGVSLNDINSTLKTAWGSTYVNDFVDRGRVKKVYVQSEATARM 821 +D+ Q ++++DQ KAQALGVSL+DIN T+ TA G TYVNDF+DRGRVKK+YVQ++A RM Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 822 LPEDVNKWYVRNKNGGMVPFSAFSTTRWEYGSPRLERYNGYSALEIVGEAASGVSTGTAM 881 LPEDV+K YVR+ NG MVPFSAF+T+ W YGSPRLERYNG ++EI GEAA G S+G AM Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 Query: 882 DVMEKLVSQLPNGFGLEWTGMSYQERLSGSQAPALYAISLLVVFLCLAALYESWSIPFSV 941 +ME L S+LP G G +WTGMSYQERLSG+QAPAL AIS +VVFLCLAALYESWSIP SV Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 Query: 942 MLVVPLGVIGAVAATWMRGLENDVYFQVGLLTIIGLSAKNAILIVEFANEL-NNRGKDLV 1000 MLVVPLG++G + A + +NDVYF VGLLT IGLSAKNAILIVEFA +L GK +V Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 Query: 1001 EATLEASRQRLRPILMTSLAFIFGVLPMAISQGAGSGSQHAVGTGVMGGMISATVLAIFF 1060 EATL A R RLRPILMTSLAFI GVLP+AIS GAGSG+Q+AVG GVMGGM+SAT+LAIFF Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 Query: 1061 VPLFFVLVRRRFPG 1074 VP+FFV++RR F G Sbjct: 1021 VPVFFVVIRRCFKG 1034
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 79.5 bits (196), Expect = 3e-18 Identities = 53/154 (34%), Positives = 78/154 (50%), Gaps = 13/154 (8%) Query: 13 VNVGTIGHVDHGKTTLTAAI------TTVLAKTYGGSARAFDQIDNAPEEKARGITINTS 66 +N+G + HVD GKTTLT ++ T L G+ R DN E+ RGITI T Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRT----DNTLLERQRGITIQTG 59 Query: 67 HVEYDTPARHYAHVDCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQV 126 + +D PGH D++ + + +DGAIL+++A DG QTR R++ Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119 Query: 127 GVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQ 160 G+P I F+NK D + L V +++E LS Sbjct: 120 GIP-TIFFINKIDQNGID--LSTVYQDIKEKLSA 150
>SECETRNLCASE#Bacterial translocase SecE signature. Length = 127 Score = 161 bits (410), Expect = 7e-55 Identities = 109/127 (85%), Positives = 116/127 (91%) Query: 1 MSANTEAPGSGRGLETAKWLIVAVLLVVAIVGNYYYREYSLPLRALAVVVIIAVAGAVAL 60 MSANTEA GSGRGLE KW++V LL+VAIVGNY YR+ LPLRALAVV++IA AG VAL Sbjct: 1 MSANTEAQGSGRGLEAMKWVVVVALLLVAIVGNYLYRDIMLPLRALAVVILIAAAGGVAL 60 Query: 61 MTAKGKATVAFAREARTEVRKVIWPTRQETLHTTLIVAAVTAVMSLILWGLDGILVRLVS 120 +T KGKATVAFAREARTEVRKVIWPTRQETLHTTLIVAAVTAVMSLILWGLDGILVRLVS Sbjct: 61 LTTKGKATVAFAREARTEVRKVIWPTRQETLHTTLIVAAVTAVMSLILWGLDGILVRLVS 120 Query: 121 FITGLRF 127 FITGLRF Sbjct: 121 FITGLRF 127
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 26.3 bits (58), Expect = 0.048 Identities = 14/71 (19%), Positives = 26/71 (36%), Gaps = 2/71 (2%) Query: 4 KVQAYVKLQVAAGMANPSPPVGPALGQQ-GVNIMEFCKAFNAKTESIEKGLPIPVVITVY 62 +V+ + N P G + G N ++ KA AK ++ P + + Sbjct: 267 RVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYP 326 Query: 63 SDRSFTFVTKT 73 D + FV + Sbjct: 327 YDTT-PFVQLS 336
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 31.4 bits (71), Expect = 0.002 Identities = 29/152 (19%), Positives = 62/152 (40%), Gaps = 8/152 (5%) Query: 18 FCIFFVYSAYCGLTYFIPF-LKDIYGLPVALIGAYGIINQYGLKMVGGPVGGFLADKVAK 76 C ++ G +P+ +KD++ L A IG+ I ++ G +GG L D+ Sbjct: 263 LCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGP 322 Query: 77 SPTVYLKWTFLISAIAMILFIQLPHDSMNVYLGMMATLGFGAIIFSQRAI-FFAPMDEIG 135 + + TFL + F+ ++ + ++ ++ G + F++ I Sbjct: 323 LYVLNIGVTFLSVSFLTASFLL---ETTSWFMTIIIVFVLGGLSFTKTVISTIVSS---S 376 Query: 136 TSREHAGSAMAFGCIIGYMPSMFAYALYGSLL 167 ++ AG+ M+ ++ A+ G LL Sbjct: 377 LKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408
>PF06580#Sensor histidine kinase Length = 349 Score = 37.5 bits (87), Expect = 1e-04 Identities = 13/70 (18%), Positives = 30/70 (42%), Gaps = 10/70 (14%) Query: 421 ELDKSLIERIIDPLT--HLVRNSLDHGIEEPATRIAAGKSPVGNLTLSAEHQGGNICIEV 478 +++ ++++ + P+ LV N + HGI + G + L G + +EV Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIAQ--------LPQGGKILLKGTKDNGTVTLEV 296 Query: 479 IDDGAGLNRQ 488 + G+ + Sbjct: 297 ENTGSLALKN 306
>PF05272#Virulence-associated E family protein Length = 892 Score = 32.4 bits (73), Expect = 0.004 Identities = 23/88 (26%), Positives = 31/88 (35%), Gaps = 3/88 (3%) Query: 47 LLAVSSPQELTQIAEYFRTPLKVALTSGDKSSSSTSPIPGGGDDPTQQVGEVRKQINSEE 106 L VSSP A P K ++G + + PGGGDD GE + Sbjct: 384 LADVSSPTAAAGGAGGGEPPKKRDPSAG---AGTDPGGPGGGDDGEDPFGEWLDDEVARL 440 Query: 107 SRQEIHRLNKLREKLDQLIESDPRLKAL 134 + L R L + + S P L Sbjct: 441 RLRGRWLLKPRRAALIEALRSAPALAGC 468
>PF05844#YopD protein Length = 295 Score = 32.3 bits (73), Expect = 0.002 Identities = 11/28 (39%), Positives = 21/28 (75%), Gaps = 2/28 (7%) Query: 76 MDLMALLYRLLAKSRQQGMLSLERDIEN 103 ++L+ +L+R+ K+R+ G+ L+RD EN Sbjct: 74 VELLLILFRIAQKARELGV--LQRDNEN 99
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1342 bits (3475), Expect = 0.0 Identities = 806/1032 (78%), Positives = 918/1032 (88%) Query: 1 MAKFFIDRPIFAWVIAIIIMLAGALAIMKLPVAQYPTIAPPAITIAANYPGADATTVQNT 60 MA FFI RPIFAWV+AII+M+AGALAI++LPVAQYPTIAPPA++++ANYPGADA TVQ+T Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDNLLYMSSSSDSSGNVQLTLTFNSGTDPDIAQVQVQNKLQLAMPLLPQ 120 VTQVIEQNMNGIDNL+YMSS+SDS+G+V +TLTF SGTDPDIAQVQVQNKLQLA PLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGVSVEKSSSSFLMVAGFISEDGTMQQEDIADYVGSNIKDPISRTPGVGDVQLFGS 180 EVQQQG+SVEKSSSS+LMVAGF+S++ Q+DI+DYV SN+KD +SR GVGDVQLFG+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWMDPHKLNNYKLTPVDVINAIKIQNNQVAAGQLGGTPPVPGQELNSSIIAQTRL 240 QYAMRIW+D LN YKLTPVDVIN +K+QN+Q+AAGQLGGTP +PGQ+LN+SIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 TNAEEFSQILLKVNTDGSQVRLKDVAIVKLGAESYNIIARYNGKPAAGIGIKLATGANAL 300 N EEF ++ L+VN+DGS VRLKDVA V+LG E+YN+IAR NGKPAAG+GIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 NTSAAVKAELAKLQPFFPSGLTVVYPYDTTPFVKISINEVVKTLIEAIILVFLVMYLFLQ 360 +T+ A+KA+LA+LQPFFP G+ V+YPYDTTPFV++SI+EVVKTL EAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NFRATLIPTIAVPVVLLGTFAILSAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 N RATLIPTIAVPVVLLGTFAIL+AFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 QEEGLPPKEATKKSMEQIQGALVGIALVLSAVFVPMAFFGGATGAIYRQFSITIVSAMVL 480 E+ LPPKEAT+KSM QIQGALVGIA+VLSAVF+PMAFFGG+TGAIYRQFSITIVSAM L Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATMLKPIKKGDHGPKTGFFGWFNNMFEKSTHHYTDSVANILRSTGRY 540 SVLVALILTPALCAT+LKP+ H K GFFGWFN F+ S +HYT+SV IL STGRY Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 541 LVIYLAIVIGMAVLFMRLPSSFLPEEDQGVFLTMVQLPAGATQERTQKVLNHVTDYYLDK 600 L+IY IV GM VLF+RLPSSFLPEEDQGVFLTM+QLPAGATQERTQKVL+ VTDYYL Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 601 EKNVVNSVFTVNGFGFSGQGQNTGLAFVSLKNWDERKGEQNKVPAIVSRASAAFSKIKDG 660 EK V SVFTVNGF FSGQ QN G+AFVSLK W+ER G++N A++ RA KI+DG Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 661 MVFAFNLPAIVELGTATGFDFQLIDQGNLGHQQLTDARNQLLGMAAQHPDMLVGVRPNGL 720 V FN+PAIVELGTATGFDF+LIDQ LGH LT ARNQLLGMAAQHP LV VRPNGL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 721 EDTPQFKVEVDQEKAQALGVAISDINTTLGSAMGGSYVNDFIDRGRVKKVYVQADAPFRM 780 EDT QFK+EVDQEKAQALGV++SDIN T+ +A+GG+YVNDFIDRGRVKK+YVQADA FRM Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 781 LPDDIDKWYVRNNMGQMVSFATFSTAKWEYGSPRLERYNGLPSMEILGQAAPGKSTGEAM 840 LP+D+DK YVR+ G+MV F+ F+T+ W YGSPRLERYNGLPSMEI G+AAPG S+G+AM Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 Query: 841 DLMQELAAKLPSGVGYDWTGMSYQERLSGNQAPALYAISLIVVFLCLAALYESWSIPFSV 900 LM+ LA+KLP+G+GYDWTGMSYQERLSGNQAPAL AIS +VVFLCLAALYESWSIP SV Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 Query: 901 MLVVPLGVVGALLAATLRGLENDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGLV 960 MLVVPLG+VG LLAATL +NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+V Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 Query: 961 ESTLESVRMRLRPILMTSLAFILGVMPLVISSGAGSGAQNAVGTGVMGGMITATVLAIFF 1020 E+TL +VRMRLRPILMTSLAFILGV+PL IS+GAGSGAQNAVG GVMGGM++AT+LAIFF Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 Query: 1021 VPLFFVVVRRRF 1032 VP+FFVV+RR F Sbjct: 1021 VPVFFVVIRRCF 1032
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 39.4 bits (92), Expect = 2e-05 Identities = 22/166 (13%), Positives = 52/166 (31%), Gaps = 45/166 (27%) Query: 89 QIDPATYQAAYDSAKGDLAKAQASAQIAHLTVNRYKPLLGTNYISKQ---EYDQALSDAQ 145 +++ +A + + + + +++ ++ + LL I+K E + +A Sbjct: 206 ELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAV 265 Query: 146 QADATVLAAKAALES----------------------------------------ARINL 165 + +ES Sbjct: 266 NELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQ 325 Query: 166 AYTQVRSPISGRTGKSAV-TEGALVTSGQASAMTTVQQLDPMYVDV 210 + +R+P+S + + V TEG +VT+ + M V + D + V Sbjct: 326 QASVIRAPVSVKVQQLKVHTEGGVVTTAET-LMVIVPEDDTLEVTA 370
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 165 bits (420), Expect = 7e-54 Identities = 135/210 (64%), Positives = 164/210 (78%) Query: 1 MARKTKQKAEETRQQILDAAVREFSAHGVSRTSLTDIAIAAGVTRGAIYWHFKNKVDLFN 60 MARKTKQ+A+ETRQ ILD A+R FS GVS TSL +IA AAGVTRGAIYWHFK+K DLF+ Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EVWELSESKIDQLEIEYQAKYPDNPLRILRELLIYILVSTREDRRRRALMEIVFHKCEFV 120 E+WELSES I +LE+EYQAK+P +PL +LRE+LI++L ST + RRR LMEI+FHKCEFV Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 121 GEMTSVHDARKVLDLASYERIESVLQGCIDANQLPVNLNTHRAAIIMRAYITGLMENWLF 180 GEM V A++ L L SY+RIE L+ CI+A LP +L T RAAIIMR YI+GLMENWLF Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180 Query: 181 MPESFDIKQEAPVLIDAYLEMLGQSFSLRN 210 P+SFD+K+EA + LEM +LRN Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRN 210
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 26.0 bits (57), Expect = 0.034 Identities = 9/71 (12%), Positives = 27/71 (38%) Query: 47 IAGLNGQQPREGYNLQQMLEILTAQNVPIKLCKTCADARGIAGLTLVDGVEIGTLVELAQ 106 I +N ++ ++ ++E L VP ++ D R + ++ + I + Sbjct: 222 IWEINTEEEGTPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDS 281 Query: 107 WTLAAEKVLTF 117 ++ ++ Sbjct: 282 IAEQGKEGDSY 292
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 40.4 bits (94), Expect = 4e-05 Identities = 28/235 (11%), Positives = 58/235 (24%), Gaps = 21/235 (8%) Query: 35 SEVQSQLDLLSKQKILSPAEKLAQQDLTQTLE-YLDTIERTKQEANQLKQQLAQAPAKLR 93 S + +L K ++ + LE L+ + + L A L Sbjct: 95 SNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALA 154 Query: 94 QATEGLE-ALKSSSADTMTKESLANYSLRQLESRLNETLDNLQSAQEDLSAYNSQLIALQ 152 LE AL+ + + +++ + + + L Sbjct: 155 ARKADLEKALEGAMNFS-----------TADSAKIKTLEAEKAALEARQAELEKALEGAM 203 Query: 153 TQPERVQSAMYSASMRLMQIRNQLNGLTPNQESLRPTQQ--QELLAEQVMLNGQLDLERK 210 + + + + + L E + L+ + Sbjct: 204 NFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQA 263 Query: 211 NLEANTTLQDLLQKQRDYTTAHINQLERYVQLLQEVVSGKRLILSEKTVKEAQAQ 265 LE + +A I LE L+ K + + V A Q Sbjct: 264 ELEK---ALEGAMNFSTADSAKIKTLEAEKAALEAE---KADLEHQSQVLNANRQ 312 Score = 32.0 bits (72), Expect = 0.016 Identities = 36/201 (17%), Positives = 72/201 (35%), Gaps = 33/201 (16%) Query: 37 VQSQLDLLSKQKILSPAEKLAQQDLTQTLEYLDTIERTKQEANQLKQQ------------ 84 + L ++Q L A + A T + T+E K K Sbjct: 252 EAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANR 311 Query: 85 --LAQAPAKLRQATEGLEA-----------LKSSSADTMTKESLANYSLRQLESRLNETL 131 L + R+A + LEA ++S + + +QLE+ + Sbjct: 312 QSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLE 371 Query: 132 DNLQSAQEDLSAYNSQLIALQTQPERVQSAMYSASMRLMQIRNQLNGLTPNQESLRPTQQ 191 + + ++ + L A + ++V+ A+ A+ +L + L +ES + T++ Sbjct: 372 EQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKEL---EESKKLTEK 428 Query: 192 QELLAEQVMLNGQLDLERKNL 212 E+ L +L+ E K L Sbjct: 429 -----EKAELQAKLEAEAKAL 444
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 121 bits (305), Expect = 6e-40 Identities = 48/88 (54%), Positives = 65/88 (73%) Query: 2 NKSQLIDKIAAGADISKAAAGRALDAIITSVTESLKEGDDVALVGFGTFAVRERSARTGR 61 NK LI K+A +++K + A+DA+ ++V+ L +G+ V L+GFG F VRER+AR GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 62 NPQTGKEISIPAAKVPGFRAGKGLKDAV 89 NPQTG+EI I A+KVP F+AGK LKDAV Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90
>PF05272#Virulence-associated E family protein Length = 892 Score = 33.5 bits (76), Expect = 0.004 Identities = 15/76 (19%), Positives = 32/76 (42%), Gaps = 6/76 (7%) Query: 296 DWMLQVPWNSHSKVKKDLVKAQEVLDTDHYGLERVKDRILEYLAVQSRVSKIKGP----- 350 DW+ W+ +++K LV D+ +++ + V+++ P Sbjct: 537 DWVKAQQWDEVPRLEKWLVHVLGKTPDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKFD 596 Query: 351 -ILCLVGPPGVGKTSL 365 + L G G+GK++L Sbjct: 597 YSVVLEGTGGIGKSTL 612
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.4 bits (66), Expect = 0.032 Identities = 15/72 (20%), Positives = 27/72 (37%), Gaps = 13/72 (18%) Query: 61 RSSLPTPHEIRHHLDDYVIGQEPAKKVLAVAVYNHYKRLRNGDTSNGIELGKSNILLIGP 120 P+ E ++G+ A + +Y RL D +++ G Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQ----EIYRVLARLMQTD---------LTLMITGE 168 Query: 121 TGSGKTLLAETL 132 +G+GK L+A L Sbjct: 169 SGTGKELVARAL 180
>PF06291#Lambda prophage Bor protein Length = 102 Score = 27.7 bits (61), Expect = 0.012 Identities = 19/73 (26%), Positives = 34/73 (46%), Gaps = 3/73 (4%) Query: 2 LKKILFPLLAIFILAGCATTSNTLNVTPKVVLPTQDPTLMGVTISINGADQRRDAALAKV 61 +KK+LF ++ GCA + T+ P V P + T ++G Q++ AK+ Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITH---HFFVSGIGQKKTVDAAKI 62 Query: 62 NRDGQLVVLTPSR 74 + VV T ++ Sbjct: 63 CGGAENVVKTETQ 75
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 46.8 bits (111), Expect = 1e-07 Identities = 44/199 (22%), Positives = 77/199 (38%), Gaps = 15/199 (7%) Query: 221 RNNAWLI-LLLIVFYKMGDAFAASLSTTFLIRGVGFDAGEVGLVNKTLGLIATIIGALYG 279 R+N LI L ++ F+ + + ++S + VN L +I A+YG Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 280 GLLMQRLSLFRALMIFGILQAVSNMGYWLLAITDKNIFSMGSAIFLENLCGGMGTAAFVA 339 L +L + R L+ I+ + ++ + FS+ + + G G AAF A Sbjct: 71 KL-SDQLGIKRLLLFGIIINCFGS----VIGFVGHSFFSL---LIMARFIQGAGAAAFPA 122 Query: 340 LLM----TLCNKSFSATQFALLSALSAVGRVYVGP-IAGWFVEAHGWPLFYLFSIAAAIP 394 L+M K F L+ ++ A+G VGP I G W L + I Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMG-EGVGPAIGGMIAHYIHWSYLLLIPMITIIT 181 Query: 395 GLLLLYVCRQTLDHTQKTD 413 L+ + ++ + D Sbjct: 182 VPFLMKLLKKEVRIKGHFD 200
>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature. Length = 1291 Score = 33.1 bits (75), Expect = 0.005 Identities = 40/172 (23%), Positives = 63/172 (36%), Gaps = 26/172 (15%) Query: 437 LYLRNQSAATPWNFWAQTLYAHSRQSSGTYTPGYQTNGYGINVGVDRRFND--ESLFG-- 492 LY P N WA + S S G + YG + GVD N E++ G Sbjct: 1012 LYQFAPKYEKPTNVWANAIGGTSLNSGG------NASLYGTSAGVDAYLNGEVEAIVGGF 1065 Query: 493 VSLGYQNANIN---IHSYGNEKDVDSYELMAYTGWFDDRYFFNGNVNMGYNSNSSTRNIG 549 S GY + + ++S N + Y + F +++ F+ S+ S+ N Sbjct: 1066 GSYGYSSFSNQANSLNSGANNTNFGVYSRI-----FANQHEFDFEAQGALGSDQSSLNFK 1120 Query: 550 ENTGYQGNTKATADYNSLQMGYQVKAGMTFDL----DVVKLQPSVAYNYQWL 597 N YN L +A +D + + L+PSV +Y L Sbjct: 1121 SALLRDLNQS----YNYLAYSAATRASYGYDFAFFRNALVLKPSVGVSYNHL 1168
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 140 bits (355), Expect = 1e-38 Identities = 94/404 (23%), Positives = 167/404 (41%), Gaps = 17/404 (4%) Query: 18 LSLATFMQVLDSTIANVAIPTIAGDLGSSNSQGTWVITSFGVANAISIPVTGWLAKRVGE 77 L + +F VL+ + NV++P IA D + WV T+F + +I V G L+ ++G Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78 Query: 78 VRLFLWSTGLFVLASWLCGMSNS-LGMLIFFRVIQGLVAGPLIPLSQSLLLNNYPPAKRS 136 RL L+ + S + + +S +LI R IQG A L ++ P R Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138 Query: 137 MALALWSMTIVVAPIFGPILGGYISDNYHWGWIFFINIPIGLVVVLLAGSTLKGRETKTE 196 A L + + GP +GG I+ HW + + IP+ ++ + L +E + + Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIK 196 Query: 197 IRPIDTIGLVLLVVGIGALQIMLDQGKELDWFNSTEIIVLTVVAVVAITFLIVWELTDDH 256 D G++L+ VGI + ML F ++ I +V+V++ + Sbjct: 197 -GHFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKVTD 245 Query: 257 PVIDLSLFKSRNFTIGCLCLSLAYMLYFGAIVLLPQLLQEVYGYTATWAGLASAPVGILP 316 P +D L K+ F IG LC + + G + ++P ++++V+ + G G + Sbjct: 246 PFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMS 305 Query: 317 VLLS-PLIGRFAHRIDMRQLVTFSFIMYAVCFYWRAYTFEPGMDFGASAWPQFFQGFAIA 375 V++ + G R ++ +V F ++ E F G + Sbjct: 306 VIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFT 365 Query: 376 CFFMPLTTITLSGLPPERMAAASSLSNFMRTLAGSIGTSITTTL 419 ++TI S L + A SL NF L+ G +I L Sbjct: 366 K--TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 36.8 bits (85), Expect = 1e-04 Identities = 16/54 (29%), Positives = 22/54 (40%) Query: 812 VLVRSDLKGLGLGRALLEKMIRYARSHGLSRLTAVTMPNNRGMIGLAQKLGFTI 865 + V D + G+G ALL K I +A+ + L T N K F I Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.6 bits (71), Expect = 0.006 Identities = 15/56 (26%), Positives = 21/56 (37%), Gaps = 9/56 (16%) Query: 33 IVMVGPSGCGKSTLLRMVAGLERTTTGDIYIGDQRVTDLEPKDRGIAMVFQNYVLY 88 +V+ G G GKSTL+ + GL+ + D KD V Y Sbjct: 599 VVLEGTGGIGKSTLINTLVGLD-------FFSDTHFDIGTGKDS--YEQIAGIVAY 645
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 33.9 bits (77), Expect = 0.001 Identities = 41/173 (23%), Positives = 70/173 (40%), Gaps = 13/173 (7%) Query: 136 GRLLSQPFNSSTPVLYYNKEAFKKAGLDPEQPPKTWQELAADTAKLRAAGSSCGYASGWQ 195 G+L++ P L YNK+ L P PPKTW+E+ A +L+A G S + + Sbjct: 127 GKLIAYPIAVEALSLIYNKD------LLP-NPPKTWEEIPALDKELKAKGKSALMFNLQE 179 Query: 196 GWIQIENFSAWHGQPIASRNNGFDGTDAVLEFNKPLQVKHIQLLSDMNKKGDFTYFGRKD 255 + +A G N +D D + + + L D+ K Sbjct: 180 PYFTWPLIAADGGYAFKYENGKYDIKD--VGVDNAGAKAGLTFLVDLIKNKHMNADTDYS 237 Query: 256 ESTSKFYNGDCAITTASSGSLASIRHYAKFNFGVGMMPYDADAKNAPQNAIIG 308 + + F G+ A+T + ++I +K N+GV ++P K P +G Sbjct: 238 IAEAAFNKGETAMTINGPWAWSNIDT-SKVNYGVTVLP---TFKGQPSKPFVG 286
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 27.5 bits (61), Expect = 0.031 Identities = 19/83 (22%), Positives = 33/83 (39%), Gaps = 10/83 (12%) Query: 13 MKKTVIAIITMATLTSTAAYANTIEKDIRVEAEIISLMDVKRADDSNINKIKLTYDTVTN 72 MKK++IA+ A + A D+ + I + ++ R+ N + T Sbjct: 1 MKKSLIALTLAALPVAAMA-------DVTLYGTIKAGVETSRSVAHNGAQ---AASVETG 50 Query: 73 DGTYSHSEAIKVKARKQLGDKLK 95 G I K ++ LG+ LK Sbjct: 51 TGIVDLGSKIGFKGQEDLGNGLK 73
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 72.2 bits (177), Expect = 4e-15 Identities = 40/265 (15%), Positives = 80/265 (30%), Gaps = 27/265 (10%) Query: 442 SLARYQSPYVS----RYAPDSGST---SGSYTRRIGPTQLSYQFNQYRNNRQHRIQSGWD 494 L R + Y+S Y S + ++ +N Q G D Sbjct: 536 QLGRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQK----GRD 591 Query: 495 WQLPQFNLALSLGLQNGGQWNSHNNYGVFLNTTLSFGQSNASINTAYTQQQLNTSASYQK 554 L +++ W ++ + + + S+ S + + Sbjct: 592 ---QMLALNVNIPF---SHWLRSDSKSQWRHASASYSMS--HDLNGRMTNLAGVYGTLLE 643 Query: 555 EFIDNYGASTLGVSGSASGKLNSVGGFAKRSGSRGDISGRVGIDNQITNGGISYNGMLAL 614 + +Y T G ++ G G+ + + I +G + Sbjct: 644 DNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLA 703 Query: 615 SSQGVALGRSSYSGAALLIKAPALGGTPYSFHVEDSPI--TGGGTYAIPVPRYQDRFFVR 672 + GV LG+ + +L+KAP VE+ T YA+ +P + R Sbjct: 704 HANGVTLGQPL-NDTVVLVKAPGAKDAK----VENQTGVRTDWRGYAV-LPYATEYRENR 757 Query: 673 THTDRSDMDMNIQLPVNIVRAHPGQ 697 D + + N+ L + P + Sbjct: 758 VALDTNTLADNVDLDNAVANVVPTR 782
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 65.8 bits (160), Expect = 7e-15 Identities = 31/195 (15%), Positives = 67/195 (34%), Gaps = 8/195 (4%) Query: 70 ITQNIIEPAVEQRVNQPDDIVDLPTLPEQPEGQREITRKEPIKVKRPAENRATSRKPVNK 129 I+ ++ PA + + PE KE V + KP Sbjct: 50 ISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPK-PKPKPKPKPV 108 Query: 130 ETQESDSKQSSPAAAASAMLSGTSQQVAAAVNSDSSHRQQAQVSWKSRLQGHLMGFKRYP 189 + E + P + A + ++ ++ + S S + +YP Sbjct: 109 KKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYP 168 Query: 190 SSARKQQQQGTAMIRFVVDKNGYVSSVQLSHSSGTSALDREALAIIKRAQPLPKPPAELL 249 + A+ + +G ++F V +G V +VQ+ + + +RE ++R + P P Sbjct: 169 ARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGS-- 226 Query: 250 SQGQITLSLPVDFNL 264 + + + F + Sbjct: 227 -----GIVVNILFKI 236
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 348 bits (895), Expect = e-118 Identities = 91/424 (21%), Positives = 172/424 (40%), Gaps = 8/424 (1%) Query: 25 RYLNIGGGLVVIGFIGFLLWAGLAPLDKGVAVTGLLVVAENRKVIQPLQGGRIQQLHVTE 84 R + ++ + + + L ++ G L + K I+P++ ++++ V E Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKE 114 Query: 85 GDEIVSGQLLVTLDDTAIRNQRDNLQHQYLSALAQEARLTAEQNDLDVITFPQALLEH-- 142 G+ + G +L+ L Q L A ++ R +++ P+ L Sbjct: 115 GESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEP 174 Query: 143 ATQPAVERNIILQQQLLHHRRQAHLSEIARLSTQLTRHQARLDGLQAMRSNHQRQSNLFQ 202 Q E ++ L+ + ++ + L + +A + A + ++ S + + Sbjct: 175 YFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEK 234 Query: 203 QQLDSVQLLAKDGHIAKNKLLEMESQSTSLQARVEQSTSDIAEAHKLIDETEQHVLQRRE 262 +LD L IAK+ +LE E++ + S + + I ++ + Sbjct: 235 SRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQ 294 Query: 263 QYQSENSEQLAKAQQNTQELVQRLNIAEYELSHTRIFAPVSGSVIALAQHTVGGVVSSGQ 322 +++E ++L + N L L E + I APVS V L HT GGVV++ + Sbjct: 295 LFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354 Query: 323 ALMEIVPSGQPLFVEAQLPVELIDKVAVGLPVDLNFSAFNQSNTPRLQGSVWRIGADRIQ 382 LM IVP L V A + + I + VG + AF + L G V I D I+ Sbjct: 355 TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIE 414 Query: 383 PPPTSPPYYPLTVAIDL-----DPTELAIRPGMAVDVFIRTGERSLLSYLFKPFTDRLHL 437 + + ++I+ + + GMAV I+TG RS++SYL P + + Sbjct: 415 DQRLGLVFNVI-ISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTE 473 Query: 438 ALAE 441 +L E Sbjct: 474 SLRE 477
>PF06438#Heme acquisition protein HasAp Length = 205 Score = 232 bits (594), Expect = 2e-80 Identities = 66/214 (30%), Positives = 104/214 (48%), Gaps = 18/214 (8%) Query: 1 MSTTIQYNSNYADYSISSYLREWANNFGDIDQAPAETKDRGSFSG-SSTLFSGTQYAIGS 59 MS +I Y++ Y+ ++++ YL +W+ FGD++ P + D + G + F G+QYA+ S Sbjct: 1 MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS 60 Query: 60 SHSNPEGMIAEGDLKYSFM--PQHTFHGQIDTLQFGKDLATNAGGPSAGKHLEKIDITFN 117 + S+ IA GDL Y+ P HT G++D++ G L G S G L+ +++F+ Sbjct: 61 TASDA-AFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTL--TGGASSGGYALDSQEVSFS 117 Query: 118 ELDLSGEFDSGKSMTENHQGDMHKSVRGLMKGNPDPMLEVMKAKGINVDTAFKDLSIASQ 177 L L G+ G +HK V GLM G+ + + A VD + S Q Sbjct: 118 NLGLDSPIAQGRD------GTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQ 171 Query: 178 YPDSGYMSDAPM-----VDTVGVVDC-HDMLLAA 205 +G P V VGV + HD+ LAA Sbjct: 172 LAAAGVAHATPAAAAAEVGVVGVQELPHDLALAA 205
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 42.1 bits (99), Expect = 1e-06 Identities = 36/138 (26%), Positives = 58/138 (42%), Gaps = 18/138 (13%) Query: 133 VQTLLAAGYMPIISSIG----ITVEGQLMNVNA----DQAATALAATLGAD-LILLSDVS 183 ++ L+ G + I S G I +G++ V A D A LA + AD ++L+DV+ Sbjct: 179 IKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVN 238 Query: 184 GILDGKG----QRIAEMTAQKAEQLIAQGIITDG-MVVKVNAALDAARSLGRPVDIASWR 238 G G Q + E+ ++ + +G G M KV AA+ G IA Sbjct: 239 GAALYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHL- 297 Query: 239 HSEQLPALFNGVPIGTRI 256 E+ G GT++ Sbjct: 298 --EKAVEALEG-KTGTQV 312
>PF05272#Virulence-associated E family protein Length = 892 Score = 33.5 bits (76), Expect = 0.001 Identities = 13/32 (40%), Positives = 17/32 (53%) Query: 32 VVFVGPSGCGKSTLLRMIAGLEDITSGELLIG 63 VV G G GKSTL+ + GL+ + IG Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG 630
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 679 bits (1752), Expect = 0.0 Identities = 331/394 (84%), Positives = 367/394 (93%) Query: 10 IGKTARVLALSALTTLVLSSSAFAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVT 69 I AR+LALSALTT++ S+SA AKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVT Sbjct: 3 IKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVT 62 Query: 70 IEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAELTPSKAFQEKLFPFTWDA 129 +EHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAE+TP KAFQ+KL+PFTWDA Sbjct: 63 VEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDA 122 Query: 130 VRFNGKLIGYPVAVEALSLIYNKDLVKEAPKTWEEIPALDKTLRANGKSAIMWNLQEPYF 189 VR+NGKLI YP+AVEALSLIYNKDL+ PKTWEEIPALDK L+A GKSA+M+NLQEPYF Sbjct: 123 VRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEPYF 182 Query: 190 TWPVIAADGGYAFKFENGVYDAKNVGVNNAGAQAGLQFIVDLVKNKHINADTDYSIAEAA 249 TWP+IAADGGYAFK+ENG YD K+VGV+NAGA+AGL F+VDL+KNKH+NADTDYSIAEAA Sbjct: 183 TWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAA 242 Query: 250 FNKGETAMTINGPWAWSNIDKSKINYGVTLLPTFHGQPSKPFVGVLTAGINAASPNKELA 309 FNKGETAMTINGPWAWSNID SK+NYGVT+LPTF GQPSKPFVGVL+AGINAASPNKELA Sbjct: 243 FNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELA 302 Query: 310 TEFLENYLITDQGLAEVNKDKPLGAVALKSFQEQLAKDPRIAATMDNATNGEIMPNIPQM 369 EFLENYL+TD+GL VNKDKPLGAVALKS++E+LAKDPRIAATM+NA GEIMPNIPQM Sbjct: 303 KEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQM 362 Query: 370 AAFWYATRSAVLNAITGRQTVEAALNDAATRITK 403 +AFWYA R+AV+NA +GRQTV+ AL DA TRITK Sbjct: 363 SAFWYAVRTAVINAASGRQTVDEALKDAQTRITK 396
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 32.6 bits (74), Expect = 0.005 Identities = 15/66 (22%), Positives = 30/66 (45%), Gaps = 8/66 (12%) Query: 69 LAKETDLAGAIKSMFSGEKINR-------TEDRAVLHIALRNRSNTPIVVDGKDVMPEVN 121 AK +DL + + S + + D+ ++ I ++N IV DVM ++ Sbjct: 276 YAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNII-IKAHGQTNALIVTAAPDVMNDLE 334 Query: 122 AVLAKM 127 V+A++ Sbjct: 335 RVIAQL 340
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 29.4 bits (66), Expect = 0.032 Identities = 18/89 (20%), Positives = 29/89 (32%), Gaps = 5/89 (5%) Query: 214 DYTAALLGEALNVSRIDIWTDVPGIYTTDPRVVPAAKRIDKIAFEEAAEMATFGAKILHP 273 D L E +N I TDV G + + ++ EE + G Sbjct: 216 DLAGEKLAEEVNADIFMILTDVNGAALYYGT--EKEQWLREVKVEELRKYYEEGH--FKA 271 Query: 274 ATLLPAVRSDIPMFVGSSKDPAAGGTLVC 302 ++ P V + I F+ + A L Sbjct: 272 GSMGPKVLAAI-RFIEWGGERAIIAHLEK 299
>PF05860#haemagglutination activity domain. Length = 117 Score = 79.1 bits (195), Expect = 2e-19 Identities = 24/124 (19%), Positives = 42/124 (33%), Gaps = 21/124 (16%) Query: 45 VSSVNGTSVINIVQPSASGLSHNQFQDFNVGEKGAVLNNATSAGNSILAGQLAANQNLNG 104 +++ T +I + S L H+ FQ+F+V G N N Sbjct: 15 ITTEGNTRIIERGTQAGSNLFHS-FQEFSVPTSGTAFFN-------------------NP 54 Query: 105 QAASIILNEVISRNPSLLLGQQEIFGMTADYILANPNGITCNGCGFMNTNRESLVVGNPL 164 I++ V + S + G TA+ L NPNGI ++ + Sbjct: 55 TNIQNIISRVTGGSVSNIDGLIRANA-TANLFLINPNGIIFGQNARLDIGGSFVGSTANR 113 Query: 165 IEQG 168 ++ Sbjct: 114 LKFA 117
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 30.7 bits (69), Expect = 0.041 Identities = 18/83 (21%), Positives = 33/83 (39%), Gaps = 9/83 (10%) Query: 347 AGLEPLTIDANTLFVNVGERTN---VTGSARFKRLIKEEKYGEALDVARQQVESGAQIID 403 +P+ + + +TN VT + + E+ LD+ R QV A I + Sbjct: 298 QAAKPVAALDKNIIIKAHGQTNALIVTAAP--DVMNDLERVIAQLDIRRPQVLVEAIIAE 355 Query: 404 INMDEGMLDAEAAMVRFLNLIAG 426 + D L+ +++ N AG Sbjct: 356 VQ-DADGLNLG---IQWANKNAG 374
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 347 bits (892), Expect = e-121 Identities = 125/351 (35%), Positives = 199/351 (56%), Gaps = 2/351 (0%) Query: 1 MSTEKNEKPTPKRLKEAKEKGQVVKSVEITSGVQLVALVIYFLLTGYSLVEQAKALIRSS 60 MS EK E+PTPK++++A++KGQV KS E+ S +VAL + E L+ Sbjct: 1 MSGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIP 60 Query: 61 IIQLQQPLTLALARIGAECMTVLMHIVVVLGGALIVVTIIAGIAQVGPLLATKAVSFKGE 120 Q P + AL+ + + ++ L ++ I + + Q G L++ +A+ + Sbjct: 61 AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIK 120 Query: 121 RINPIQNAKQLFSLRSVFELMKSLLKVGVLTLIFGYLLMQYAPSFGYLTHCGSRCALPVF 180 +INPI+ AK++FS++S+ E +KS+LKV +L+++ ++ + L CG C P+ Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLL 180 Query: 181 STLMGWLLGSLIACYLVFSLMDYAFQRYTIMKQLKMSHDEVKREYKDSNGDPHIKQKRRQ 240 ++ L+ ++V S+ DYAF+ Y +K+LKMS DE+KREYK+ G P IK KRRQ Sbjct: 181 GQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQ 240 Query: 241 LQHEVQSGSFATNVRRSTAVVRNPTHFAVCLIYHPEETPLPIVIEKGHDEQAALIVSLAE 300 E+QS + NV+RS+ VV NPTH A+ ++Y ETPLP+V K D Q + +AE Sbjct: 241 FHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAE 300 Query: 301 QSGIPVVENIALARALHRDVACGDTIPEQFFEPVAALLRM--ALELDYQPS 349 + G+P+++ I LARAL+ D IP + E A +LR ++ Q S Sbjct: 301 EEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQHS 351
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 141 bits (356), Expect = 5e-43 Identities = 52/230 (22%), Positives = 105/230 (45%), Gaps = 4/230 (1%) Query: 5 LPGLTALALAMMRPYGILLILPLFTARSLGSSLLRNGLIVAIALPVTPLFLSAPIITNSS 64 L L ++R ++ P+ + RS+ + + GL + I + P + + S Sbjct: 10 LSWLNLYFWPLLRVLALISTAPILSERSVPKRV-KLGLAMMITFAIAPSLPANDVPVFS- 67 Query: 65 PVTWIGVLCTELLIGVVMGFVAALPFWAMNMAGFLIDTLRGATMSTLFNPGMGVESSLFG 124 + + ++LIG+ +GF F A+ AG +I G + +T +P + + Sbjct: 68 -FFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLA 126 Query: 125 VLFTQILTVLFLISGGFNQVLAALYGSYDSLPIGQGIQPAADLLLFLQTEWQMMFELCLC 184 + + +LFL G +++ L ++ +LPIG + L + +F L Sbjct: 127 RIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSL-IFLNGLM 185 Query: 185 FALPALLVMVLADLSLGLINRSARQLNVFFLAMPIKSALALFLLLISLPY 234 ALP + +++ +L+LGL+NR A QL++F + P+ + + L+ +P Sbjct: 186 LALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPL 235
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 78.7 bits (194), Expect = 3e-17 Identities = 36/173 (20%), Positives = 64/173 (36%), Gaps = 14/173 (8%) Query: 647 HILLVDDSETNRDITGMMLQQLGHQVTRADSGTTALAIGRQHRFDLVLMDIRMPVLDGLA 706 IL+ DD R + L + G+ V + T DLV+ D+ MP + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 707 TTARWRHDPANIDSHCMITALSANASPDEQIKTSQAGMNHYLSKPVTLGQLAEMLDLTAQ 766 R + + +SA + IK S+ G YL KP L E++ + + Sbjct: 65 LLPRIK----KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF---DLTELIGIIGR 117 Query: 767 FQLERGVDLSPQLSEPQPLLDL-ADSALSLKLYQSLQVLIQQAKDAIENLPVL 818 E S + Q + L SA ++Y+ ++ + +L ++ Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYR----VLARLMQT--DLTLM 164
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 59.1 bits (143), Expect = 2e-12 Identities = 26/127 (20%), Positives = 53/127 (41%), Gaps = 3/127 (2%) Query: 3 TKLLIVDDHELIIHGIKNMLAAYPRYLIVGQADNGLEVYNLCRQTEPDMVILDLGLPGMD 62 +L+ DD I + L+ V N ++ + D+V+ D+ +P + Sbjct: 4 ATILVADDDAAIRTVLNQALS--RAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 63 GLDVIIQLLRRWPALKILTLTARNEEHYASRTFNSGALGYVLKKSPQQILMAAIQTVAIG 122 D++ ++ + P L +L ++A+N A + GA Y+ K L+ I A+ Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR-ALA 120 Query: 123 KRYIDPA 129 + P+ Sbjct: 121 EPKRRPS 127
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 31.1 bits (70), Expect = 0.008 Identities = 7/43 (16%), Positives = 18/43 (41%) Query: 293 AYGAPKAITSFVVPTGYSFNLDGSTLYQSIAAIFIAQLYGIEL 335 + A + + TGY + +T+++S I + ++ Sbjct: 186 SNNAETQVNQNITVTGYPGDKPVATMWESKGKITYLKGEAMQY 228