>PF07201#Hypersensitivity response secretion protein HrpJ Length = 293 Score = 27.5 bits (61), Expect = 0.046 Identities = 7/51 (13%), Positives = 25/51 (49%) Query: 138 LQAVDAKVSELEELLPLLMKDRSLAKGVSHLLSTQLTRILRTHAAMSILGH 188 + V+ +V++ +P L + +++++ +S L ++ + + A + Sbjct: 80 VSDVEEQVNQYLSKVPELEQKQNVSELLSLLSNSPNISLSQLKAYLEGKSE 130
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 141 bits (357), Expect = 3e-39 Identities = 84/388 (21%), Positives = 152/388 (39%), Gaps = 86/388 (22%) Query: 5 IGIDLGTTNSCVAIMDGTQARVLENAEGDRTTPSIIAYTQDGET------LVGQPAKRQA 58 + IDLGT N+ + + Q VL PS++A QD VG AK+ Sbjct: 13 LSIDLGTANTLIYVKG--QGIVLNE-------PSVVAIRQDRAGSPKSVAAVGHDAKQML 63 Query: 59 VTNPQNTLFAIKRLIGRRFQDEEVQRDVSIMPYKIIGADNGDAWLDVKGQKMAPPQISAE 118 P N + AI+ +K +A ++ + Sbjct: 64 GRTPGN-IAAIR---------------------------------PMKDGVIADFFVTEK 89 Query: 119 VLKK-MKKTAEDYLGEPVTEAVITVPAYFNDAQRQATKDAGRIAGLEVKRIINEPTAAAL 177 +L+ +K+ + P ++ VP +R+A +++ + AG +I EP AAA+ Sbjct: 90 MLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAI 149 Query: 178 AYGL--DKEVGNRTIAVYDLGGGTFDISIIEIDEVDGEKTFEVLATNGDTHLGGEDFDTR 235 GL + G+ V D+GGGT ++++I ++ V + +GG+ FD Sbjct: 150 GAGLPVSEATGS---MVVDIGGGTTEVAVISLNGV---------VYSSSVRIGGDRFDEA 197 Query: 236 LINYLVDEFKKDQGIDLRNDPLAMQRLKEAAEKAKIELSSA----QQTDVNLPYITADAT 291 +INY+ + G + AE+ K E+ SA + ++ + Sbjct: 198 IINYVRRNYGSLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEG 244 Query: 292 GPKHMNIKVTRAKLESLVEDLVNRSIEPLKVALQD-AGLSVSDIND--VILVGGQTRMPM 348 P+ + + LE+L E + + + VAL+ SDI++ ++L GG + Sbjct: 245 VPRGFTLN-SNEILEALQEP-LTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRN 302 Query: 349 VQKKVAEFFGKEPRKDVNPDEAVAIGAA 376 + + + E G +P VA G Sbjct: 303 LDRLLMEETGIPVVVAEDPLTCVARGGG 330
>PF06580#Sensor histidine kinase Length = 349 Score = 29.8 bits (67), Expect = 0.025 Identities = 15/79 (18%), Positives = 27/79 (34%), Gaps = 3/79 (3%) Query: 4 RRQPLIPGWLIPGLCAAALMITVSLAAFLALWLNAPSGAWSTIWRDSYLWHVVRFSFWQA 63 R GWL + L + + +W A + W + +++ Sbjct: 60 RSFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLL---AFINTKPVAFTLPL 116 Query: 64 FLSAVLSVVPAVFLARALY 82 LS + +VV F+ LY Sbjct: 117 ALSIIFNVVVVTFMWSLLY 135
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 36.3 bits (84), Expect = 3e-04 Identities = 44/285 (15%), Positives = 87/285 (30%), Gaps = 41/285 (14%) Query: 26 DKVEAEQSLITVEGDKASMEVPSPQAGVVKEIKVSVGDKTETGALIMIFDSADGAADAAP 85 + V +T G S E+ + +VKEI V G+ G +++ + AD Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138 Query: 86 AKA--------EEKKEAAPAAAPAAAAAKDVHVPDIGSDEVEVTEVMVKVG------DTV 131 ++ + + + + + + V EV+ T Sbjct: 139 TQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTW 198 Query: 132 EAEQSLITVEGDKASMEVPAPFAGTVKEIKVNTGDKVSTGSLIMVFEVAGAAPAAAPAKA 191 + ++ + DK E A + ++ +K + A A + Sbjct: 199 QNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHK-QAIAKHAVLEQ 257 Query: 192 EAAPAAAAPAATGVKDVNVPDIGGDEVEVTEVMVK-----------VGDKVA-------- 232 E A V + E E+ + + DK+ Sbjct: 258 ENKYV----EAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313 Query: 233 AEQSLITVEGDKASMEVPAPFAGTVKEIKIST-GDKVKTGSLIMV 276 L E + + + AP + V+++K+ T G V T +MV Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358 Score = 29.8 bits (67), Expect = 0.040 Identities = 16/85 (18%), Positives = 32/85 (37%), Gaps = 4/85 (4%) Query: 229 DKVAAEQSLITVEGDKASMEVPAPFAGTVKEIKISTGDKVKTGSLIMVFEVEGAAPAAAP 288 + VA +T G S E+ VKEI + G+ V+ G ++ ++ A Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVL--LKLTALGAEADT 136 Query: 289 AKQEAAAPAPAAKAEKPAAPAAKAE 313 K +++ + + + E Sbjct: 137 LKTQSSLLQARLEQTRYQILSRSIE 161
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 499 bits (1286), Expect = 0.0 Identities = 245/296 (82%), Positives = 267/296 (90%) Query: 1 MRDLYPLTRRRLLTAMALSPLLWQMNTAQAAAIDPRRIVALEWLPVELLLALGITPYGVA 60 M L ++RRRLLTAMALSPLLWQMNTA AAAIDP RIVALEWLPVELLLALGI PYGVA Sbjct: 1 MSGLPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA 60 Query: 61 DVPNYKLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEKLARIAPGR 120 D NY+LWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPE LARIAPGR Sbjct: 61 DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGR 120 Query: 121 GFDFSDGKKPLAVARRSLVELAQTLNLEAAAEKHLAQYDRFIASQKPHFIRRGGRPLLMT 180 GF+FSDGK+PLA+AR+SL E+A LNL++AAE HLAQY+ FI S KP F++RG RPLL+T Sbjct: 121 GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLT 180 Query: 181 TLIDPRHMLVLGPNCLFQEVLDEYGIVNAWQGETNFWGSTAVSIDRLAMYKEADVICFDH 240 TLIDPRHMLV GPN LFQE+LDEYGI NAWQGETNFWGSTAVSIDRLA YK+ DV+CFDH Sbjct: 181 TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH 240 Query: 241 GNNTDMNALMATPLWQAMPFVRAGRFHRVPAVWFYGATLSTMHFVRILNNVLGGKA 296 N+ DM+ALMATPLWQAMPFVRAGRF RVPAVWFYGATLS MHFVR+L+N +GGKA Sbjct: 241 DNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA 296
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 29.8 bits (67), Expect = 0.008 Identities = 18/66 (27%), Positives = 24/66 (36%), Gaps = 14/66 (21%) Query: 120 AEAI-SLLRNNRVVILSAGTGNPFFTT-------------DSAACLRGIEIEADVVLKAT 165 AE I L+ +VI S G G P D A E+ AD+ + T Sbjct: 176 AETIKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILT 235 Query: 166 KVDGVF 171 V+G Sbjct: 236 DVNGAA 241
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 36.8 bits (85), Expect = 1e-04 Identities = 34/177 (19%), Positives = 70/177 (39%), Gaps = 3/177 (1%) Query: 213 FWLLFMILALGVFSGMVISSSSAQIGMTQYGLLSGAL-VVSLVSIFNSIGRLFWGGLTDK 271 WL + V + MV++ S I + V + + SIG +G L+D+ Sbjct: 17 IWLCILSF-FSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75 Query: 272 LGGYNTLVIVYLFTCLCMLLLFFFNGNTSVFYFSALGVGFAYAGILVIFPGLTSQNFGMR 331 LG L+ + C ++ F + S+ + G A + + ++ Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135 Query: 332 NQGLNYGFMYFGFAVGAVIAPYVTSAIAKYTGSYNTVFILTTVLLLIGVVLTLITKK 388 N+G +G + A+G + P + IA Y ++ + ++ + ++ L + KK Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIH-WSYLLLIPMITIITVPFLMKLLKK 191
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 120 bits (303), Expect = 4e-32 Identities = 84/368 (22%), Positives = 148/368 (40%), Gaps = 68/368 (18%) Query: 23 GIDLGTTNSLVATVRSGQAETLPDHEGRHLLPSVVHYQQQGHTVGYAARDNAAQDTTNTI 82 IDLGT N+L+ G + +E PSVV A + ++ Sbjct: 14 SIDLGTANTLIYVKGQG----IVLNE-----PSVV------------AIRQDRAGSPKSV 52 Query: 83 SSV----KRMMGRSLADIQARYPHLPYRFKASVNGLPMIDTAAGLLNPVRVSADILKALA 138 ++V K+M+GR+ +I A P M D G++ V+ +L+ Sbjct: 53 AAVGHDAKQMLGRTPGNIAAIRP--------------MKD---GVIADFFVTEKMLQHFI 95 Query: 139 ARA-SESLSGELDGVVITVPAYFDDAQRQGTKDAARLAGLHVLRLLNEPTAAAIAYGLDS 197 + S S V++ VP +R+ +++A+ AG + L+ EP AAAI GL Sbjct: 96 KQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPV 155 Query: 198 GKEGVIAVYDLGGGTFDISILRLSRGVFEVLATGGDSALGGDDFDHLLADYIREQAG--I 255 + V D+GGGT +++++ L+ V +GGD FD + +Y+R G I Sbjct: 156 SEATGSMVVDIGGGTTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYGSLI 210 Query: 256 ADRSDNRVQRELLDAAIAAKIALSDADTVRVNVAG---WQG-----EITREQFNDLISAL 307 + + R++ E+ A + + V G +G + + + + Sbjct: 211 GEATAERIKHEI-------GSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEP 263 Query: 308 VKRTLLACRRALKDAGVD-PQDVLE--VVMVGGSTRVPLVRERVGEFFGRTPLTAIDPDK 364 + + A AL+ + D+ E +V+ GG + + + E G + A DP Sbjct: 264 LTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPLT 323 Query: 365 VVAIGAAI 372 VA G Sbjct: 324 CVARGGGK 331
>INTIMIN#Intimin signature. Length = 939 Score = 328 bits (842), Expect = e-102 Identities = 178/587 (30%), Positives = 267/587 (45%), Gaps = 59/587 (10%) Query: 53 KGKSFKEQGADYFINSATQGFDNLTPEALES-QARSYLQNQITSSAQSYLEGVMSPYGKI 111 K ++ +Y A L +L A+ + A S L+ + YG Sbjct: 154 KSNMTDDKALNYAAQQAASLGSQLQSRSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTA 213 Query: 112 RTSLSVGEGGDLDGSSLDYFIPWYDNQSTLLFSQISAQRKEDRTIGNFGLGVRQNVGNWL 171 +L G + DGSSLD+ +P+YD++ L F Q+ A+ + R N G G R + + Sbjct: 214 EVNLQSGN--NFDGSSLDFLLPFYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPENM 271 Query: 172 LGGNAFYDYDFTRGHRRLGLGTEAWTDYLKFSGNYYHPLSDWKDSEDFDFYEERPARGWD 231 LG N F D DF+ + RLG+G E W DY K S N Y +S W +S + Y+ERPA G+D Sbjct: 272 LGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPANGFD 331 Query: 232 IRMESWLPFYPQLGAKLVYEQYYGDEVALFGTDNLQKDPHAVTLGLEYTPVPLVTVGTDY 291 IR +LP YP LGAKL+YEQYYGD VALF +D LQ +P A T+G+ YTP+PLVT+G DY Sbjct: 332 IRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDY 391 Query: 292 KAGTGDSNDFSVNATVNYQIGTPLAAQLDPENVKIQHSLMGSRTDFVDRNNFIILEYREK 351 + GTG+ ND + YQ P + Q++P+ V +L GSR D V RNN IILEY+++ Sbjct: 392 RHGTGNENDLLYSMQFRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQ 451 Query: 352 DPLDVTLWLKADATNEHPECVIEDTPEAAVGLEKCKWTVNALINHHYKIISASWQAKNNA 411 D L + + + T E I+ ++ GL++ W +AL + +I + Q+ + Sbjct: 452 DILSLNIPHDINGT-ERSTQKIQLIVKSKYGLDRIVWDDSALRSQGGQIQHSGSQSAQD- 509 Query: 412 ARTLVMPVVKANALTEGNNNSWNLVLPAWVNADTEEQRTALNTWKVRMTLEDEKGNKQNS 471 + +LPA+V + N +KV D GN N+ Sbjct: 510 ---------------------YQAILPAYV-------QGGSNVYKVTARAYDRNGNSSNN 541 Query: 472 GVVEITVQQDRKIELIVDNIADTDRSDHSHEASALADGEDGVVMDLLITDSFGDSTDRNG 531 ++ ITV + +VD + TD + + + SA ADG + + + Sbjct: 542 VLLTITVLSN---GQVVDQVGVTDFT--ADKTSAKADGTEAITYTATVK----------- 585 Query: 532 NELVDDAMTPVLYDSNDKKVTLAQTPCTTETPCVF--IASRDKEAGTVTLSSTLPGTFRW 589 V A PV ++ L+ T DK V + T T Sbjct: 586 KNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSAL 645 Query: 590 KAKADAYGDSNY--------VDVTFIGDNLSALNAVIYQVKAANPVN 628 A A + D T + + A+ + +K PV+ Sbjct: 646 NANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVS 692
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 87.8 bits (217), Expect = 3e-19 Identities = 111/462 (24%), Positives = 166/462 (35%), Gaps = 53/462 (11%) Query: 1589 DDSATDKLVITGDASGTTDLYINGIGDGAQTTNGIEVVDVGGVSTSDAFVLKN---EVNA 1645 D +DKLV+ DASG L++ G + N + +V S + F L N +V+ Sbjct: 491 DLGLSDKLVVMQDASGQHRLWVRNSGSEPASANTLLLVQTPLGSAA-TFTLANKDGKVDI 549 Query: 1646 SLYTYRLYWNESDNDWYLASKAQSDDDDSGGDDTPSDGGDDGGNVTPPDDGGDGGNVTPP 1705 Y YRL N + W L G P G P Sbjct: 550 GTYRYRLAAN-GNGQWSLV------------------GAKAPPAPKPAPQPGPQPPQPPQ 590 Query: 1706 DDGGDGGDVTPPDDGGDVAPQYRADIGAYMGNQ--WMARNLQMQTLYDREGSQYRNAD-G 1762 P A + G W A + L R G N D G Sbjct: 591 PQPEAPAPQPPAGRELSAAANAAVNTGGVGLASTLWYA---ESNALSKRLGELRLNPDAG 647 Query: 1763 SVWARFKAGKAESEAVSGNIDMDSNYSQFQLGGDILAWGNGQQSVTVGVMASYINADTDS 1822 W R A + + + +G D + F+LG D A +G +A Y D Sbjct: 648 GAWGRGFAQRQQLDNRAGR-RFDQKVAGFELGAD-HAVAVAGGRWHLGGLAGYTRGDRGF 705 Query: 1823 TGNRGADGSQFTSSGNVDGYNLGVYATWFADAQTHSGAYVDSWYQYGFYNN--SVESGDA 1880 TG+ G G+ D ++G YAT+ AD SG Y+D+ + N V D Sbjct: 706 TGDGG---------GHTDSVHVGGYATYIAD----SGFYLDATLRASRLENDFKVAGSDG 752 Query: 1881 GSESYDSTANAV--SLETGYRYDIALSNGNTVSLTPQAQVVWQNYSADSVKDNYGTRIDG 1938 + + V SLE G R+ A + L PQA++ + + G R+ Sbjct: 753 YAVKGKYRTHGVGASLEAGRRFTHA----DGWFLEPQAELAVFRAGGGAYRAANGLRVRD 808 Query: 1939 QDGDSWTTRLGLRVDGKLYKGSRTVIQPFAEANWLHTSD-DVSVSFDDATVKQDLPANRA 1997 + G S RLGL V ++ +QP+ +A+ L D +V + + +L RA Sbjct: 809 EGGSSVLGRLGLEVGKRIELAGGRQVQPYIKASVLQEFDGAGTVHTNGIAHRTELRGTRA 868 Query: 1998 ELKVGLQADIDKQWSVRAQVAGQTGSNDFGDLNGSLNLRYNW 2039 EL +G+ A + + S+ A G RY+W Sbjct: 869 ELGLGMAAALGRGHSLYASYEYSKGPKLAMPWTFHAGYRYSW 910 Score = 40.0 bits (93), Expect = 1e-04 Identities = 74/404 (18%), Positives = 136/404 (33%), Gaps = 59/404 (14%) Query: 112 TGTGLVIETSGGGA-----DDPDGGKYVSNAISLDHYAILELTDAKITTTGIYTQGISAA 166 T +G I+ SG A ++P N + + + T G A Sbjct: 63 TASGTTIKVSGRQAQGILLENPAAELQFRNGSVTSSGQLSDDGIRRFLGTVTVKAGKLVA 122 Query: 167 DGSKLTLTDSTLTIDGNFGVMTLYTGSEATLDGTIVEAANSSSAQVQQGSTLNVLDGSTI 226 D + L T DG + ++A++ + ++ A Q+++G+ + V S I Sbjct: 123 DHATLANVGDTWDDDG-IALYVAGEQAQASIADSTLQGA--GGVQIERGANVTV-QRSAI 178 Query: 227 TLAQGQINVVAGNTATDEG-STLNLSDSSVSS---AGTMSTIQGTNKAALNLTNATITHT 282 I + D S + L D++V++ +G + + + L L IT Sbjct: 179 VDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTLDGGHITGG 238 Query: 283 NASGAAVQANNATTLD---ISGGNITSAGM----------------------------GV 311 A+G A L I G+ + G GV Sbjct: 239 RAAGVAAMQGAVVHLQRATIRRGDAPAGGAVPGGAVPGGAVPGGFGPGGFGPVLDGWYGV 298 Query: 312 YILASDARIDGATINADGDGIFITSKKRSTSYEDLNALTVSDANVTSKTVALNIDG---- 367 + S + + + A G I + + +L+ NV A Sbjct: 299 DVSGSSVELAQSIVEAPELGAAIRVGRGARVTVSGGSLSAPHGNVIETGGARRFAPQAAP 358 Query: 368 -STTINDPIELTNSTFTA---PTAIKLGSKATIQAEKTMLTGNIVQTDASSSS----LSL 419 S T+ P +KL A+ ++ + + +S ++L Sbjct: 359 LSITLQAGAHAQGKALLYRVLPEPVKLTLTGGADAQGDIVATEL-PSIPGTSIGPLDVAL 417 Query: 420 SQGSTLTGSVDAMFTTLSLDDTSQWNMTDPSTVGNLTNDGDITL 463 + + TG+ A+ +LS+D+ + W MTD S VG L D ++ Sbjct: 418 ASQARWTGATRAV-DSLSIDNAT-WVMTDNSNVGALRLASDGSV 459
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 48.6 bits (116), Expect = 1e-08 Identities = 32/116 (27%), Positives = 51/116 (43%), Gaps = 9/116 (7%) Query: 64 VRDGIVWDFFGAVTLVRRHLDTLEQQLGCRFT-HAATSFPPGTDP---RISINVLESAGL 119 ++DG++ DFF +++ + + R + P G R + AG Sbjct: 76 MKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGA 135 Query: 120 EVSHVLDEPTAVA---DLLALDNAG--VVDIGGGTTGIAIVKQGKVTYSADEATGG 170 +++EP A A L + G VVDIGGGTT +A++ V YS+ GG Sbjct: 136 REVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGG 191
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 42.1 bits (99), Expect = 3e-06 Identities = 75/360 (20%), Positives = 129/360 (35%), Gaps = 30/360 (8%) Query: 16 SLFRIAFAVFLTYMTVGLPLPVIPLFVHHELGYSNTMV---GIAVGIQFFATVLTRGYAG 72 L I V L + +GL +PV+P + +L +SN + GI + + G Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLR-DLVHSNDVTAHYGILLALYALMQFACAPVLG 64 Query: 73 RLADQYGAKRSALQGMFACGLAGAAWLLAALLPVSAPIKFALLIVGRLILGFGESQLLTG 132 L+D++G + L + A + + A P +L +GR++ G + Sbjct: 65 ALSDRFGRRPVLLVSLA---GAAVDYAIMATAPF-----LWVLYIGRIVAGITGATGAVA 116 Query: 133 TLTWGLGLVGPTRSGKVMSWNGMAIYGALAAGAPLGLL---IHSHFGFAALAGTTMVLPL 189 G R+ + + + AG LG L H F A A + L Sbjct: 117 GAYIADITDGDERA-RHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFL 175 Query: 190 LAWAFNGTVRKVPAYTGERPSLWSVVGLIWKPGL-----------GLALQGVGFAVIGTF 238 K R +L + W G+ + L G A + Sbjct: 176 TGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVI 235 Query: 239 ISLYFVSNGWTMAGFTLTAFGGAFVLMRIL-FGWMPDRFGGVKVAVVSLLVETAGLLLLW 297 T G +L AFG L + + G + R G + ++ ++ + G +LL Sbjct: 236 FGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLA 295 Query: 298 LAPTAWIALVGAALTGAGCSLIFPALGVEVVKRVPAQVRGTALGGYAAFQDISYGVTGPL 357 A W+A L +G + PAL + ++V + +G G AA ++ + GPL Sbjct: 296 FATRGWMAFPIMVLLASG-GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLT-SIVGPL 353 Score = 29.8 bits (67), Expect = 0.017 Identities = 32/142 (22%), Positives = 47/142 (33%), Gaps = 8/142 (5%) Query: 252 GFTLTAFGGAFVLMRILFGWMPDRFGGVKVAVVSLLVETAGLLLLWLAPTAWIALVG--- 308 G L + + G + DRFG V +VSL ++ AP W+ +G Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105 Query: 309 AALTGAGCSLIFPALGVEVVKRVPAQVRGTALGGYAAFQDISYGVTGPLAGMLATSYGYP 368 A +TGA G + R G +A V GP+ G L + Sbjct: 106 AGITGA----TGAVAGAYIADITDGDERARHFGFMSACFGFGM-VAGPVLGGLMGGFSPH 160 Query: 369 SVFLAGAISAVVGILVTILSFR 390 + F A A + L Sbjct: 161 APFFAAAALNGLNFLTGCFLLP 182
>FbpA_PF05833#Fibronectin-binding protein Length = 577 Score = 28.7 bits (64), Expect = 0.026 Identities = 20/63 (31%), Positives = 31/63 (49%), Gaps = 6/63 (9%) Query: 204 VRNIVGS-LLEVGAHNQPESWIAELLAARDRTLAAATAKAEGLYLVAVDYPDRFDLPKPP 262 +NI GS ++ + PES + E AA LAA +K++ V VDY + ++ KP Sbjct: 496 TKNIPGSHVIVKNIMDIPESTLLE--AAN---LAAYYSKSQNSSNVPVDYTEVKNVKKPN 550 Query: 263 MGP 265 Sbjct: 551 GAK 553
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 28.3 bits (63), Expect = 0.019 Identities = 18/56 (32%), Positives = 25/56 (44%), Gaps = 3/56 (5%) Query: 68 MVTSFT---AVHDVARFGAEVLRASPRQADLMVVAGTCFTKMAPVIQRLYDQMLEP 120 M+TSFT V + R A P Q L + F M+PVI ++Y +P Sbjct: 60 MMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQP 115
>SECA#SecA protein signature. Length = 901 Score = 32.2 bits (73), Expect = 0.012 Identities = 46/189 (24%), Positives = 70/189 (37%), Gaps = 36/189 (19%) Query: 472 VDGIDSDLQNKIDVIVQALAGAKKPLIISGTNAGSSEVIQAAANVAKALKGRGADVGITM 531 VD +DS L ID A+ PLIISG SSE+ + + L + + T Sbjct: 208 VDEVDSIL---ID-------EARTPLIISGPAEDSSEMYKRVNKIIPHLIRQEKEDSETF 257 Query: 532 IA----------RSVNSMGLGM-------MGGGSLDDALGELETGNADAVVVLENDLHRH 574 R VN G+ + G +D+ N + + L H Sbjct: 258 QGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYSPANIMLMHHVTAALRAH 317 Query: 575 ASATRVNAALAKAPLVMVVDHQRTAIMENAHLV--LSAASFAESDGTVINNEGRA----- 627 A TR + K V++VD M+ L A A+ +G I NE + Sbjct: 318 ALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAK-EGVQIQNENQTLASIT 376 Query: 628 -QRFFQVYD 635 Q +F++Y+ Sbjct: 377 FQNYFRLYE 385
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 47.9 bits (114), Expect = 3e-08 Identities = 32/135 (23%), Positives = 56/135 (41%), Gaps = 16/135 (11%) Query: 185 PGAVAIVAEDSKVARAMLEKGLNAMGIPHQMHVTGKDAWERIQQLAQEAEAEGKPISEKI 244 GA +VA+D R +L + L+ G ++ W I + Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA-------------AGDG 48 Query: 245 ALVLTDLEMPEMDGFTLTRKIKTDERLKKIPVVIHSSLSGSANEDHIRKVKADGYVAK-F 303 LV+TD+ MP+ + F L +IK + +PV++ S+ + + A Y+ K F Sbjct: 49 DLVVTDVVMPDENAFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106 Query: 304 EINELSSVIQEVLER 318 ++ EL +I L Sbjct: 107 DLTELIGIIGRALAE 121
>AUTOINDCRSYN#Autoinducer synthesis protein signature. Length = 216 Score = 30.2 bits (68), Expect = 0.002 Identities = 11/74 (14%), Positives = 27/74 (36%), Gaps = 12/74 (16%) Query: 1 MIDWQDLHHSELTVPQLYALLKLRCAVFV--------VEQRCPYLDVDGDDLVGDNRHIL 52 M++ D++H+ L+ + L LR F + D + + ++ Sbjct: 1 MLEIFDVNHTLLSETKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNN----NTTYLF 56 Query: 53 GWHQDELVAYARIL 66 G + ++ R + Sbjct: 57 GIKDNTVICSLRFI 70
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 28.2 bits (63), Expect = 0.043 Identities = 16/75 (21%), Positives = 29/75 (38%), Gaps = 15/75 (20%) Query: 196 AERDTQAYLKLDHDFHYVFVKYADNKYISQAHLLISARLLAIRYRLDFTAEYITSSNRGH 255 A+R+ L F VF+ + A+RY L+ Y S+ G Sbjct: 62 ADREGMTDLFASGHFERVFISPH------RL---------AVRYSLENPHAYADSNLTGF 106 Query: 256 ATILDMLKNNNVEGV 270 IL+ ++N ++ + Sbjct: 107 LNILEGCRHNKIQHL 121
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 538 bits (1388), Expect = 0.0 Identities = 260/389 (66%), Positives = 298/389 (76%), Gaps = 17/389 (4%) Query: 1 MKVKVLSLLVPALLVAGAANAAEIYNKDGNKLDLFGKVDGLHYFSDDKGSDGDQTYMRIG 60 MK KVL+L++PALL AGAA+AAEIYNKDGNKLDL+GKVDGLHYFSDD DGDQTYMR+G Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60 Query: 61 FKGETQVNDQLTGYGQWEYQIQGNQTEG-SNDSWTRVAFAGLKFADAGSFDYGRNYGVTY 119 FKGETQ+NDQLTGYGQWEY +Q N TEG +SWTR+AFAGLKF D GSFDYGRNYGV Y Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120 Query: 120 DVTSWTDVLPEFGGDTYG-ADNFMQQRGNGYATYRNTDFFGLVDGLDFALQYQGKNGSVS 178 DV WTD+LPEFGGD+Y ADN+M R NG ATYRNTDFFGLVDGL+FALQYQGKN S S Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180 Query: 179 GEN--------TNGRSLLNQNGDGYGGSLTYAIGEGFSVGGAITTSKRTADQNNTANARL 230 ++ NG + NGDG+G S TY IG GFS G A TTS RT +Q N Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGG--T 238 Query: 231 YGNGDRATVYTGGLKYDANNVYLAAQYSQTYNATRFGTSNGNNPSTSYGFANKAQNFEVV 290 GD+A +T GLKYDANN+YLA YS+T N T +G ++ G ANK QNFEV Sbjct: 239 IAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDK---GYDGGVANKTQNFEVT 295 Query: 291 AQYQFDFGLRPSVAYLQSKGKDISNGYGASYGDQDIVKYVDVGATYYFNKNMSTYVDYKI 350 AQYQFDFGLRP+V++L SKGKD++ + D+D+VKY DVGATYYFNKN STYVDYKI Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYN-NVNGDDKDLVKYADVGATYYFNKNFSTYVDYKI 354 Query: 351 NLLDKND-FTRDAGINTDDIVALGLVYQF 378 NLLD +D F +DAGI+TDDIVALG+VYQF Sbjct: 355 NLLDDDDPFYKDAGISTDDIVALGMVYQF 383
>PF04335#VirB8 type IV secretion protein Length = 227 Score = 29.0 bits (65), Expect = 0.006 Identities = 10/30 (33%), Positives = 12/30 (40%) Query: 1 MNLRRKNRLWVVCAVLAGLALTTALVLYAL 30 R K WVV V LA + + AL Sbjct: 27 AAERSKKLAWVVAGVAGALATAGVVAVAAL 56
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 67.2 bits (164), Expect = 3e-15 Identities = 23/114 (20%), Positives = 48/114 (42%), Gaps = 2/114 (1%) Query: 9 VLIVDDHPLMRRGIRQLLELDPAFHVVAEAGDGASAIDLANRIEPDLILLDLNMKGLSGL 68 +L+ DD +R + Q L A + V + A+ + DL++ D+ M + Sbjct: 6 ILVADDDAAIRTVLNQALSR--AGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 69 DTLNALRRDGVTAQIIILTVSDSASDIYALIDAGADGYLLKDSDPEVLLEAIRK 122 D L +++ +++++ ++ + GA YL K D L+ I + Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 32.9 bits (75), Expect = 5e-04 Identities = 19/97 (19%), Positives = 33/97 (34%), Gaps = 4/97 (4%) Query: 77 VRKLIAALVGSVLEPLDTLQELADALGNDPNFATTVLNKLAGKQPLDETLTALSGKSVDG 136 + + L E +DT+ E A+G P + A +A V Sbjct: 46 LHEKFEELYDHAAETVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASE--MVQA 103 Query: 137 LIEYVGLRETISRAADALQKSQNGGDIPDKDLFVRRI 173 L+ ++ S + + ++ D DLFV I Sbjct: 104 LVN--DYKQISSESKFVIGLAEENQDNATADLFVGLI 138
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 30.8 bits (69), Expect = 0.026 Identities = 35/146 (23%), Positives = 59/146 (40%), Gaps = 21/146 (14%) Query: 557 AQLAEDEALRANTFAMATEATSSCE---DRVTFFLHQMKNVQLVHNAEKGQYDNDLA--- 610 AQL + +A +A A EA + + D +T L + N L HNA + +LA Sbjct: 60 AQLKKTQAEQAARAKAAAEAQAKAKANRDALTQRLKDIVNEALRHNASRTPSATELAHAN 119 Query: 611 -ALVATGREMFRLGKLEQIAREKVRTLALVDEIEVW-LAYQNKLKKSLGLTSVTSE---- 664 A + E RL K E+ AR+ E E A+Q ++ + +E Sbjct: 120 NAAMQAEDERLRLAKAEEKARK---------EAEAAEKAFQEAEQRRKEIEREKAETERQ 170 Query: 665 MRFFDVSGVTVTDLQDAELQVKAAEK 690 ++ + + L + V+ A+K Sbjct: 171 LKLAEAEEKRLAALSEEAKAVEIAQK 196
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 41.7 bits (98), Expect = 3e-06 Identities = 75/378 (19%), Positives = 119/378 (31%), Gaps = 24/378 (6%) Query: 21 LIVAFLTGIAGALQTPTLSIFLTDEVHA--RPGMVGFFFTGSAVIGIIVSQFLAGRSDKK 78 L L + L P L L D VH+ G A++ + L SD+ Sbjct: 11 LSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF 70 Query: 79 GDRKKLIVFCCVLGMLACVLFAWNRNYFILLFIGVFLSSFGSTANPQMFALAREHADRTG 138 G R+ +++ + + A ++L +IG ++ G T A A G Sbjct: 71 G-RRPVLLVSLAGAAVDYAIMATAPFLWVL-YIGRIVA--GITGATGAVAGAYIADITDG 126 Query: 139 REAVMFSSILRAQVSLAWVIGPPLAYALAMGFSFTVMYLSAAVAFTVCGVMVWFFLPSMR 198 E + A V GP L L GFS + +AA + + F LP Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLG-GLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESH 185 Query: 199 K-----DAPLATGTLEAPRRNR--RDTLLLFVICTLMWGTNSLYIINMPLFIINELHLPE 251 K A L + R R L + +M + +F + H Sbjct: 186 KGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDA 245 Query: 252 KLAGVMMGTAAGLEIPT-MLIAGYFAKRLGKRLLMCIAVVAGLCFYVGMLLA-HAPATLL 309 G+ + L +I G A RLG+R + + ++A Y+ + A Sbjct: 246 TTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFP 305 Query: 310 GLQLLNAIYIGILGGIGMLYFQDLMPGQAGSATTLYTNTIRVGWIIAGSLAG--IAAEIW 367 + LL GGIGM Q ++ Q S+ G + I+ Sbjct: 306 IMVLL------ASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIY 359 Query: 368 NYHAVFWFALVMIVATMF 385 W I Sbjct: 360 AASITTWNGWAWIAGAAL 377 Score = 41.7 bits (98), Expect = 4e-06 Identities = 17/101 (16%), Positives = 37/101 (36%) Query: 19 AFLIVAFLTGIAGALQTPTLSIFLTDEVHARPGMVGFFFTGSAVIGIIVSQFLAGRSDKK 78 A + V F+ + G + IF D H +G ++ + + G + Sbjct: 214 ALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAAR 273 Query: 79 GDRKKLIVFCCVLGMLACVLFAWNRNYFILLFIGVFLSSFG 119 ++ ++ + +L A+ ++ I V L+S G Sbjct: 274 LGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGG 314
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 49.9 bits (119), Expect = 1e-08 Identities = 65/425 (15%), Positives = 140/425 (32%), Gaps = 65/425 (15%) Query: 22 RVIICCFLVVMLDGFDTAAIGFIAPDIRTHWQLSASELAPLFGAGLLGLTAGALLCGPLA 81 +++I ++ + + PDI + + + A +L + G + G L+ Sbjct: 14 QILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73 Query: 82 DRFGRKRVIELCVALFGALSLLSAFS-PDIETLVLLRFLTGLGLGGAMPNTIT-MTSEYL 139 D+ G KR++ + + S++ L++ RF+ G G A P + + + Y+ Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG-AAAFPALVMVVVARYI 132 Query: 140 PARRRGALVTLMFCGFTLGSAMGGIVSAQLVPLIGWHGILALGGILPLLLFFGLLFALPE 199 P RG L+ +G +G + + I W +L ++P++ + F Sbjct: 133 PKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL----LIPMITIITVPFL--- 185 Query: 200 SPRWQVRRQLPQAVVARTVSAITGERYHDTQFFLHEAAAVAKGSIRQLFAGRQLVITLML 259 + L + V R LF + L++ Sbjct: 186 ------MKLLKKEV-----------RIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIV 228 Query: 260 WVVFFMSLLIIYLLSSWMPTLLNHRGIDLQQASWVTAAFQVGGTLGALLL---------- 309 V+ F+ + + ++ P + G ++ V + GT+ + Sbjct: 229 SVLSFL-IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVH 287 Query: 310 --------------------------GVLMDRLNPFRVLAVSYALGAVCIVMIGLSENG- 342 G+L+DR P VL + +V + Sbjct: 288 QLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETT 347 Query: 343 LWLMALAIFGTGIGISGSQVGLNALTATLYPTQSRATGVSWSNAIGRCGAIVGSLSGGMM 402 W M + I G+S ++ ++ + ++ Q G+S N G G + Sbjct: 348 SWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407 Query: 403 MALNF 407 +++ Sbjct: 408 LSIPL 412 Score = 41.8 bits (98), Expect = 4e-06 Identities = 40/169 (23%), Positives = 73/169 (43%), Gaps = 1/169 (0%) Query: 251 RQLVITLMLWVVFFMSLLIIYLLSSWMPTLLNHRGIDLQQASWVTAAFQVGGTLGALLLG 310 R I + L ++ F S+L +L+ +P + N +WV AF + ++G + G Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 311 VLMDRLNPFRVLAVSYALGAVCIVMIGLSENGLWLMALAIFGTGIGISGSQVGLNALTAT 370 L D+L R+L + V+ + + L+ +A F G G + + + A Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130 Query: 371 LYPTQSRATGVSWSNAIGRCGAIVGSLSGGMMM-ALNFSFDTLFFVIAI 418 P ++R +I G VG GGM+ +++S+ L +I I Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITI 179
>PF06580#Sensor histidine kinase Length = 349 Score = 217 bits (555), Expect = 9e-68 Identities = 60/216 (27%), Positives = 116/216 (53%), Gaps = 3/216 (1%) Query: 377 LGEGIAQLLSAQILAGQYERQKALLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQA 436 L G + + + ++ ++++ L AQ+NPHF+FNALN I+A+I D +A Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKA 193 Query: 437 SQLVQYLSTFFRKNLKR-PSEIVTLADEIEHVNAYLQIEKARFQSRLQVQLDVPSTLSRQ 495 +++ LS R +L+ + V+LADE+ V++YLQ+ +F+ RLQ + + + Sbjct: 194 REMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDV 253 Query: 496 KLPAFTLQPIVENAIKHGTSQLLDTGNVAIRARREGQHLMLDIEDNAGLYQPSAG-SSGL 554 ++P +Q +VEN IKHG +QL G + ++ ++ + L++E+ L + S+G Sbjct: 254 QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGT 313 Query: 555 GMSLVDKRLREHFGDDYGISVACEPDCFTRITLRLP 590 G+ V +RL+ +G + I ++ + + +P Sbjct: 314 GLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 75.6 bits (186), Expect = 6e-18 Identities = 49/215 (22%), Positives = 87/215 (40%), Gaps = 19/215 (8%) Query: 2 IKVLIVDDEPLARENLRILLQGQDDIEIVGECANAVEAIGAVHKLRPDVLFLDIQMPRIS 61 +L+ DD+ R L L V +NA + D++ D+ MP + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 62 GLEMVGMLDPEHRPYI--VFLTAFD--EYAIKAFEEHAFDYLLKPIEEKRLEKTLHRLRQ 117 +++ + + RP + + ++A + AIKA E+ A+DYL KP + L + R Sbjct: 62 AFDLLPRIK-KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 118 ERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMDDVAFVSSRMSGVYVT--SSEGKEGFT 175 E ++ L ++Q + + G S +A + + +T S GK Sbjct: 121 EPKRRPSKLEDDSQDGMPLV---GRSAAMQEIYRVLARLMQTDLTLMITGESGTGK---- 173 Query: 176 ELTLRTLESRTPLLRCHRQFL-VNMAHLQEIRLED 209 EL R L R + F+ +NMA + +E Sbjct: 174 ELVARALHDYGK--RRNGPFVAINMAAIPRDLIES 206
>PF06291#Lambda prophage Bor protein Length = 102 Score = 27.7 bits (61), Expect = 0.012 Identities = 12/32 (37%), Positives = 19/32 (59%) Query: 7 MALPLFALSLSVSITGCDQKNDTLQGKQNNMT 38 M LF+ +L++ ITGC Q+ T+ K +T Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVT 37
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 681 bits (1758), Expect = 0.0 Identities = 247/839 (29%), Positives = 389/839 (46%), Gaps = 26/839 (3%) Query: 2 LRMTPIASLVLLTLFTWQTQAIATETFDTHFMVGGMRDQKITNFHLDENKPIPGQYELDI 61 L + V + A F+ F+ + + + + PG Y +DI Sbjct: 23 LAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDI 82 Query: 62 YVNNQWRGKYDIIVADDPGST----CISTELLKNIGVISDGLQPQ---GATDCIALKDVV 114 Y+NN + D+ C++ L ++G+ + + C+ L ++ Sbjct: 83 YLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMI 142 Query: 115 RSGGYTFNIGVFRLDLSVPQAYVNEVEAGYVLPENWDRGINAFYTSYYASQYYSDYKNSG 174 ++G RL+L++PQA+++ GY+ PE WD GINA +Y S + G Sbjct: 143 HDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGG 202 Query: 175 SSESTYVRFNSGFNLLGWQAHADTTFNKTD-----GSSGEWKSNTLYLERGIAELLGTLR 229 +S Y+ SG N+ W+ +TT++ GS +W+ +LER I L L Sbjct: 203 NSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLT 262 Query: 230 AGDQYTSSEIFDSVRFTGVRLFRDMQMLPNSKQNFTPLVQGIAQTNALVTIEQNGFVVYQ 289 GD YT +IFD + F G +L D MLP+S++ F P++ GIA+ A VTI+QNG+ +Y Sbjct: 263 LGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYN 322 Query: 290 KEVPPGPFSIADLQLAGGGADLDVTVREADGSINTWLVPYASVPNMLQPGVSKYDFSAGR 349 VPPGPF+I D+ AG DL VT++EADGS + VPY+SVP + + G ++Y +AG Sbjct: 323 STVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGE 382 Query: 350 SHIEGADNQAD-FTQISYQYGLNNLLTLYGGTMLSNHYNAFTLGTGWNT-RIGAISLDAT 407 A + F Q + +GL T+YGGT L++ Y AF G G N +GA+S+D T Sbjct: 383 YRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMT 442 Query: 408 RAHSKQDNGDVFDGQSYQIAYNKYLTQTLTRFGLAAYRYSSQDYRTFNDHVWANNKNNYR 467 +A+S + DGQS + YNK L ++ T L YRYS+ Y F D ++ Sbjct: 443 QANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNI 502 Query: 468 RDKNDVYDI----ADYYQNDFGRKNTFSANVSQSLPEGWGAVSLSALWRDYWGRSGTSKD 523 ++ V + DYY + ++ V+Q L + LS + YWG S + Sbjct: 503 ETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQTYWGTSNVDEQ 561 Query: 524 YQISYSNTFQKINYTLSASQTYDE-DHNEDKRFNLFISIPFD--WGDGITTPRRHLNVSN 580 +Q + F+ IN+TLS S T + D+ L ++IPF + RH + S Sbjct: 562 FQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASY 621 Query: 581 STTFDDDGFTSNNIGLTGTAGSRDQFNYGVNVSH---QRQDSETTAGTNLTWNTPVATLN 637 S + D +G +N G+ GT + +Y V + +S +T L + N Sbjct: 622 SMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNAN 681 Query: 638 GSYSQSSNYTQTGGSISGGVVAWSGGLNLSSRLSDTFAIMQAPGLEGAYVNGQKYRTTNK 697 YS S + Q +SGGV+A + G+ L L+DT +++APG + A V Q T+ Sbjct: 682 IGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDW 741 Query: 698 KGTVVYDNLTPYRENHLMLDVSQSSSETELRGNRKVAAPYRGAVVLVNFDTDQRKPWFIK 757 +G V T YREN + LD + + +L P RGA+V F + Sbjct: 742 RGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMT 801 Query: 758 AQRPDGSPLIFGYDVVDHHGHNVGIVGQGSQLFIRTNDIPPEVSVPVDKEQGLSCSITF 816 + PL FG V + GIV Q+++ + +V V +E+ C + Sbjct: 802 L-THNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANY 859
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 26.8 bits (59), Expect = 0.019 Identities = 10/43 (23%), Positives = 17/43 (39%), Gaps = 7/43 (16%) Query: 3 SKLLPCALLLATSFAWAAPA-------TTGIDQYELKSFIADF 38 ++L LLL +S++WA L+ + DF Sbjct: 10 KRVLTGTLLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDF 52
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 37.1 bits (86), Expect = 1e-04 Identities = 33/153 (21%), Positives = 52/153 (33%), Gaps = 20/153 (13%) Query: 253 FSEIFFMLALPFFTKRFGIKKVLLLGLITAAIRYGFFVYGGAETYFTYALLFLGILLHGV 312 + L + RFG + VLL+ L AA+ Y +L++G ++ G+ Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAP-----FLWVLYIGRIVAGI 108 Query: 313 SYDFYYVTAYIYVDKKAPVHMRTAAQGLITLCCQGFGSLLGYRLGGVMMEKMFAYPQPVN 372 + V D R G ++ C GFG + G LGG+M P Sbjct: 109 TGATGAVAGAYIADI-TDGDERARHFGFMS-ACFGFGMVAGPVLGGLMGGFSPHAP---- 162 Query: 373 GLTFNWAGMWTFGAVMIAVIALLFMIFFRESDK 405 + A + + L ES K Sbjct: 163 ---------FFAAAALNGLNFLTGCFLLPESHK 186 Score = 33.3 bits (76), Expect = 0.002 Identities = 55/286 (19%), Positives = 93/286 (32%), Gaps = 17/286 (5%) Query: 29 LNKSGFSAGEIGWSYACTAIAAILSPILVGSVTDRFFSAQKVLAVLMFAGAVLMYFAAQQ 88 L S G A A+ ++G+++DRF ++ + ++ AGA + Y Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAI--- 89 Query: 89 TTFAGFFPLLLAYSLTYMPTIALTNSIAFANVPDVERDFPRIRVMGTIG-WIASGLACGF 147 A F +L + T A T ++A A + D+ R R G + G+ G Sbjct: 90 MATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAG- 147 Query: 148 LPQMLGY-NDISPTNIPLLITAASSALLGVFAFCLPDTPPKSTGKMDIKVMLGLDALVLL 206 P + G SP + P AA + L + L K + + L A Sbjct: 148 -PVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRW 205 Query: 207 RDKN------FLVFFFCSFLFAMPLAFYYIFANGYLTEVGMKNATGWMTLGQFSEIFFML 260 VFF + +P A + IF G + + Sbjct: 206 ARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAM 265 Query: 261 ALPFFTKRFGIKKVLLLGLITAAIRYGFFVYGGAETYFTYALLFLG 306 R G ++ L+LG+I Y + ++ L Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLA 311
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 75.3 bits (185), Expect = 8e-18 Identities = 27/140 (19%), Positives = 65/140 (46%), Gaps = 2/140 (1%) Query: 11 PRILIVEDEPKLGQLLIDYLRAASYAPTLINHGDKVLPYVRQTPPDLILLDLMLPGTDGL 70 IL+ +D+ + +L L A Y + ++ + ++ DL++ D+++P + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 71 TLCREIR-RFSDIPIVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTIL-RRC 128 L I+ D+P+++++A+ + + E GA DY+ KP+ E++ + L Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 129 KPQRELQQQDAESPLMIDES 148 + +L+ + ++ S Sbjct: 124 RRPSKLEDDSQDGMPLVGRS 143
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 31.0 bits (70), Expect = 0.010 Identities = 20/66 (30%), Positives = 26/66 (39%), Gaps = 14/66 (21%) Query: 187 RGLLAPVKRLVEGTHRLAAGDFTTRVTPTSADEL-----------GKLAQDFNQLASTLE 235 L+A V+ V H LA + P S + L G L N+LA E Sbjct: 104 SQLMAAVRSKVMEGHSLAD---AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTE 160 Query: 236 KNQQMR 241 + QQMR Sbjct: 161 QRQQMR 166
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 124 bits (314), Expect = 2e-33 Identities = 98/458 (21%), Positives = 201/458 (43%), Gaps = 26/458 (5%) Query: 12 LWIVALGFFMQSLDTTIVNTALPSMAKSLGESPLHMHMVVVSYVLTVAVMLPASGWLADK 71 +W+ L FF L+ ++N +LP +A + P + V +++LT ++ G L+D+ Sbjct: 17 IWLCILSFF-SVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75 Query: 72 IGVRNIFFAAIVLFTLGSLFCALSGTLNQ-LVLARVLQGVGGAMMVPVGRLTVMKIVPRA 130 +G++ + I++ GS+ + + L++AR +QG G A + + V + +P+ Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135 Query: 131 QYMAAMTFVTLPGQIGPLLGPALGGVLVEYASWHWIFLINIP-VGIVGAMATFMLMPNYT 189 A + +G +GPA+GG++ Y HW +L+ IP + I+ L+ Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEV 193 Query: 190 IETRRFDLPGFLLLAIGMAVLTLALDGSKSMGISPWTLAGLAAGGAAAILLYLLHAKKNS 249 FD+ G +L+++G+ L + L + L+++ H +K + Sbjct: 194 RIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVL---------SFLIFVKHIRKVT 244 Query: 250 GALFSLRLFCTPTFSLGLLGSFAGRIGSGMLPFMTPVFLQIGLGFSPFHAG-LMMIPMVL 308 L F +G+L M P ++ S G +++ P + Sbjct: 245 DPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTM 304 Query: 309 GSMGMKRIVVQIVNRFGYRRVLVATTLGLALVSLLFMSVALL----GWYYLLPLVLLLQG 364 + I +V+R G VL +G+ +S+ F++ + L W+ + +V +L G Sbjct: 305 SVIIFGYIGGILVDRRGPLYVL---NIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361 Query: 365 MVNSARFSSMNTLTLKDLPDTLASSGNSLLSMIMQLSMSIGVTIAGMLL--GMFGQQHIG 422 + S + ++T+ L A +G SLL+ LS G+ I G LL + Q+ + Sbjct: 362 L--SFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLP 419 Query: 423 IDSSATHHVFMYTWLCMAVIIALPAIIFARVPNDTQQN 460 ++ + +++ L + II + ++ V +Q++ Sbjct: 420 MEVDQSTYLYSNLLLLFSGIIVISWLVTLNVYKHSQRD 457
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 881 bits (2279), Expect = 0.0 Identities = 284/1035 (27%), Positives = 505/1035 (48%), Gaps = 36/1035 (3%) Query: 6 LFIYRPVATILIAAAITLCGILGFRLLPVAPLPQVDFPVIMVSASLPGASPETMASSVAT 65 FI RP+ ++A + + G L LPVA P + P + VSA+ PGA +T+ +V Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63 Query: 66 PLERSLGRIAGVNEMTSSS-SLGSTRIILEFNFDRDINGAARDVQAAINAAQSLLPGGMP 124 +E+++ I + M+S+S S GS I L F D + A VQ + A LLP + Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123 Query: 125 SRPTYRKANPSDAPIMILTLTSES--WSQGKLYDFASTQLAQTIAQIDGVGDVDVGGSSL 182 + S + +M+ S++ +Q + D+ ++ + T+++++GVGDV + G+ Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182 Query: 183 PAVRVGLNPQALFNQGVSLDEVREAIDSANVRRPQGAIEDSV------HRWQIQTNDELK 236 A+R+ L+ L ++ +V + N + G + + I K Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 237 TAAEYQPLIIHYN-NGAAVRLGDVASVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295 E+ + + N +G+ VRL DVA V ++ N KPA L I+ AN + Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301 Query: 296 TVDGIRAKLPELRAMIPAAIDLQIAQDRSPTIRASLQEVEETLAISVALVILVVFLFLRS 355 T I+AKL EL+ P + + D +P ++ S+ EV +TL ++ LV LV++LFL++ Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361 Query: 356 GRATLIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENIARHL- 414 RATLIP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R + Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421 Query: 415 EAGMKPLQAALQGTREVGFTVISMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGIS 474 E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481 Query: 475 LVVSLTLTPMMCGWMLKSSKPRTQQRKRGVG----RLLVALQQGYGTSLKWVLNHTRLVG 530 ++V+L LTP +C +LK + K G Y S+ +L T Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541 Query: 531 VVFLGTVALNIWLYIAIPKTFFPEQDTGVLMGGIQADQSISFQ----AMRGKLQDFMKII 586 +++ VA + L++ +P +F PE+D GV + IQ + + + ++K Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601 Query: 587 RD-DPAVNNVTGFT-GGSRVNSGMMFITLKPRGER---KETAQQVIDRLRVKLAKEPGAR 641 + +V V GF+ G N+GM F++LKP ER + +A+ VI R +++L K Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661 Query: 642 LFLMAVQDIRVGGRQANASYQYTLLSDSLAALREWEPKIRKALSAL-----PQLADVNSD 696 + + I G ++ L D + + R L + L V + Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718 Query: 697 QQDNGAEMNLIYDRDTMSRLGIDVQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRY 756 ++ A+ L D++ LG+ + N ++ A G ++ K+ ++ D ++ Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778 Query: 757 TQDISALEKMFVINRDGKAIPLSYFAQWRPANAPLSVNHQGLSAASTIAFNLPTGTSLSQ 816 ++K++V + +G+ +P S F + + I GTS Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838 Query: 817 ATEAINRTMTQLGVPPTVRGSFSGTAQVFQQTMNSQLILIVAAIATVYIVLGILYESYVH 876 A + ++L P + ++G + + + N L+ + V++ L LYES+ Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896 Query: 877 PLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRSGG 936 P++++ +P VG LLA LFN + ++G++ IG+ KNAI++V+FA + G Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956 Query: 937 LTPEQAIFQACLLRFRPIMMTTLAALFGALPLVLSDGDGSELRQPLGITIVGGLVMSQLL 996 +A A +R RPI+MT+LA + G LPL +S+G GS + +GI ++GG+V + LL Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016 Query: 997 TLYTTPVVYLFFDRL 1011 ++ PV ++ R Sbjct: 1017 AIFFVPVFFVVIRRC 1031
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 891 bits (2303), Expect = 0.0 Identities = 292/1036 (28%), Positives = 504/1036 (48%), Gaps = 29/1036 (2%) Query: 13 SRLFILRPVATTLLMAAILLAGIIGYRFLPVAALPEVDYPTIQVVTLYPGASPDVMTSAV 72 + FI RP+ +L +++AG + LPVA P + P + V YPGA + V Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 73 TAPLERQFGQMSGLKQMSSQS-SGGASVVTLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131 T +E+ + L MSS S S G+ +TL FQ D+A+ +VQ + AT LLP + Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 132 LPNPPIYSKVNPADPPIMTLAVTSNAMPMTQVE--DMVETRVAQKISQVSGVGLVTLAGG 189 + I S + +M S+ TQ + D V + V +S+++GVG V L G Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 190 QRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGP------ERAVTLSANDQ 243 Q A+R+ L+A + LT V + N A G L G + ++ A + Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 244 MQSADEYRRLII-AYQNGAPVRLGDVATVEQGAENSWLGAWANQAPAIVMNVQRQPGANI 302 ++ +E+ ++ + +G+ VRL DVA VE G EN + A N PA + ++ GAN Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299 Query: 303 IATADSIRQMLPQLTESLPKSVKVTVLSDRTTNIRASVRDTQFELMLAIALVVMIIYLFL 362 + TA +I+ L +L P+ +KV D T ++ S+ + L AI LV +++YLFL Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359 Query: 363 RNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMALTIATGFVVDDAIVVIENISRY 422 +N+ AT+IP +AVP+ L+GTFA++ +SIN LT+ + +A G +VDDAIVV+EN+ R Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 423 I-EKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAVTLAVAIL 481 + E P A K +I ++ + L AV IP+ F G G ++R+F++T+ A+ Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 482 ISAVVSLTLTPMMCARML---SQQSLRKQNRFSRACERMFDRVIASYGRGLAKVLNHPWL 538 +S +V+L LTP +CA +L S + + F FD + Y + K+L Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 539 TLSVAFATLLLSVMLWIVIPKGFFPVQDNGIIQGTLQAPQSSSYASMAQRQRQVAERILQ 598 L + + V+L++ +P F P +D G+ +Q P ++ + QV + L+ Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 599 DPA--VQSLTTFVGVDGANPTLNSARLQINLKPLDARDDR---VQQVISRLQTAVATIPG 653 + V+S+ T G + N+ ++LKP + R+ + VI R + + I Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659 Query: 654 VALYLQPTQDLTIDTQVSRTQYQFTLQ---ATTLDALSHWVPKL-QNALQSLPQLSEVSS 709 ++ P I + T + F L DAL+ +L A Q L V Sbjct: 660 G--FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717 Query: 710 DWQDRGLAAWVNVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTA 769 + + + VD++ A LG+S++D++ + A G ++ + ++ ++ + Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777 Query: 770 STPGLAALETIRLTSRDGGTVPLSAIARIEQRFAPLSINHLDQFPVTTFSFNVPEGYSLG 829 ++ + + S +G VP SA + + + P G S G Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837 Query: 830 DAVQAILDTEKTLALPADITTQFQGSTLAFQAALGSTVWLIVAAVVAMYIVLGVLYESFI 889 DA+ + + LPA I + G + + + L+ + V +++ L LYES+ Sbjct: 838 DAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895 Query: 890 HPITILSTLPTAGVGALLALIIAGSELDIIAIIGIILLIGIVKKNAIMMIDFALAAEREQ 949 P++++ +P VG LLA + + D+ ++G++ IG+ KNAI++++FA ++ Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955 Query: 950 GMSPRDAIFQACLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIAMVGGLLVSQV 1009 G +A A +R RPILMT+LA +LG LPL +S G G+ + +GI ++GG++ + + Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015 Query: 1010 LTLFTTPVIYLLFDRL 1025 L +F PV +++ R Sbjct: 1016 LAIFFVPVFFVVIRRC 1031
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 43.3 bits (102), Expect = 2e-06 Identities = 36/172 (20%), Positives = 71/172 (41%), Gaps = 10/172 (5%) Query: 154 KVALAQAQGQLAKDNATLANARRDLARYQQ---LAKTNLVSRQELDAQQAL--VNETQGT 208 K A+ + + + + L + L + + AK +L + L + +T Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN 310 Query: 209 IKADEANVASAQLQLDWSRITAPVSGRV-GLKQVDVGNQISSSDTAGIVVITQTHPIDLI 267 I +A + + S I APVS +V LK G +++++T +V++ + +++ Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVT 369 Query: 268 FTLPESDIATVVQAQKAGKTLVVEAWDRTNSHKL-SEGVLLSLDNQIDPTTG 318 + DI + Q A + VEA+ T L + ++LD D G Sbjct: 370 ALVQNKDIGFINVGQNA--IIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLG 419 Score = 41.4 bits (97), Expect = 6e-06 Identities = 20/122 (16%), Positives = 46/122 (37%), Gaps = 13/122 (10%) Query: 110 GTVTAA-NTVTVRSRVDGQLIALHFQEGQQVNAGDLLAQIDPSQFKVALAQAQGQLAKDN 168 G +T + + ++ + + + +EG+ V GD+L ++ + + Q Sbjct: 88 GKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQ------- 140 Query: 169 ATLANARRDLARYQQLAKTNLVSRQELDAQQALVNETQGTIKADEANVASAQLQLDWSRI 228 ++L AR + RYQ L+++ EL+ L + + L + Sbjct: 141 SSLLQARLEQTRYQILSRS-----IELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQF 195 Query: 229 TA 230 + Sbjct: 196 ST 197
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 107 bits (268), Expect = 2e-28 Identities = 81/361 (22%), Positives = 127/361 (35%), Gaps = 58/361 (16%) Query: 6 LITGVTGQDGSYLAEFLLEKGYEVHGIKRRASSFNTERVDHIYQDPH--------SCNPK 57 L+TG G G ++++ LLE G++V GI + N Y D P Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGI----DNLND------YYDVSLKQARLELLAQPG 53 Query: 58 FHLHYGDLTDASNLTRILQEVQPDEVYNLGAMSHVAVSFESPEYTADVDAMGTLRLLEAI 117 F H DL D +T + + V+ V S E+P AD + G L +LE Sbjct: 54 FQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGC 113 Query: 118 RFLGLEKKTRFYQASTSELYGLVQEIPQKETTPF-YPRSPYAVAKLYAYWITVNYRESYG 176 R ++ AS+S +YGL +++P +P S YA K + Y YG Sbjct: 114 RHNKIQ---HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170 Query: 177 IYACNGILFNHESPRRGETFVTRKITRAIANIAQGLESCLYLGNMDSLRDWGHAKDYVRM 236 + A F P K T+A+ G +Y RD+ + D Sbjct: 171 LPATGLRFFTVYGPWGRPDMALFKFTKAMLE---GKSIDVY-NYGKMKRDFTYIDD---- 222 Query: 237 QWMMLQQEQPEDFVIATGVQYSVRQFVELAAAQLGIKLRFEGEGINEKGIVVSVTGHDAP 296 IA + +R + A + + V G+ +P Sbjct: 223 --------------IAEAI---IRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSP 265 Query: 297 GVKPGDVIVAV--------DPRY--FRPAEVETLLGDPSKAHEKLGWKPEITLSEMVSEM 346 V+ D I A+ +P +V D +E +G+ PE T+ + V Sbjct: 266 -VELMDYIQALEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNF 324 Query: 347 V 347 V Sbjct: 325 V 325
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 87.9 bits (218), Expect = 7e-22 Identities = 64/344 (18%), Positives = 128/344 (37%), Gaps = 47/344 (13%) Query: 5 RIFVAGHRGMVGSAIVRQLAQRG-------------DVEL------VLRTRD----ELDL 41 + V G G +G + ++L + G DV L +L ++DL Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 42 LDGRAVQAFFAGAGIDQVYLAAAKVGGIVANNTYPADFIYENMMIESNIIHAAHLHNVNK 101 D + FA ++V+++ + + + P + N+ NI+ + + Sbjct: 62 ADREGMTDLFASGHFERVFISPHR-LAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 102 LLFLGSSCIYPKLARQPMAESELLQGTLEPTNEPYAIAKIAGIKLCESYNRQYGRDYRSV 161 LL+ SS +Y + P + P + YA K A + +Y+ YG + Sbjct: 121 LLYASSSSVYGLNRKMPFSTD---DSVDHPVS-LYAATKKANELMAHTYSHLYGLPATGL 176 Query: 162 MPTNLYGPHDNFHPDNSHVIPALLRRFHEAAQSHAPEVVVWGSGTPMREFLHVDDMAAAS 221 +YGP PD AL + + + +V + G R+F ++DD+A A Sbjct: 177 RFFTVYGPWGR--PDM-----ALFKFTKAMLEGKSIDV--YNYGKMKRDFTYIDDIAEAI 227 Query: 222 IHVMELA----REVWQENTAPMLSH-----INVGTGVDCTIRELAQTIAKVVGYQGRVVF 272 I + ++ + E P S N+G + + Q + +G + + Sbjct: 228 IRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNM 287 Query: 273 DAAKPDGTPRKLLDVTRLHQ-LGWYHEISLEAGLAGTYQWFLEN 315 +P D L++ +G+ E +++ G+ W+ + Sbjct: 288 LPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 176 bits (449), Expect = 1e-54 Identities = 83/359 (23%), Positives = 142/359 (39%), Gaps = 50/359 (13%) Query: 1 MKILITGGAGFIGSAVVRHIIKNTQDTVVNIDKLT--YAGNLE--SLSDISESNRYNFEH 56 MK L+TG AGFIG V + +++ VV ID L Y +L+ L +++ + F Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPG-FQFHK 58 Query: 57 ADICDSAEITRIFEQYQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYVLLEVARKYWS 116 D+ D +T +F + V V S+ P A+ ++N+ G +LE R Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN-- 116 Query: 117 ALGEDKKNNFRFHHISTDEVYGDLPHPDEVENSVTLPLFTETTAYAPSSPYSASKASSDH 176 + S+ VYG +P T+ + P S Y+A+K +++ Sbjct: 117 -------KIQHLLYASSSSVYGLNRK---------MPFSTDDSVDHPVSLYAATKKANEL 160 Query: 177 LVRAWRRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKPLPIYGKGDQIRDWLYV 236 + + YGLP YGP+ P+ + LEGK + +Y G RD+ Y+ Sbjct: 161 MAHTYSHLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYI 220 Query: 237 EDHA----RALHMVVTEGKA--------------GETYNIGGHNEKKNLDVVFTICDLLD 278 +D A R ++ YNIG + + +D + + D L Sbjct: 221 DDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG 280 Query: 279 EIVPKATSYREQITYVADRPGHDRRYAIDAGKISRELGWKPLETFESGIRKTVEWYLAN 337 +A + + +PG + D + +G+ P T + G++ V WY Sbjct: 281 ---IEA-----KKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 41.7 bits (98), Expect = 2e-06 Identities = 25/162 (15%), Positives = 58/162 (35%), Gaps = 27/162 (16%) Query: 1 MNILLFGKTGQVGWELQRSLAPVGN-LIALDV-----------HSKEFC---------GD 39 M L+ G G +G+ + + L G+ ++ +D E D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 40 FSNPKGVAETVRKLRPDVIVNAAAHTAVDKAESEPEL---AQLLNATSVEAIAKAANETG 96 ++ +G+ + + + + AV + P + L ++ + Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 97 AWVVHYSTDYVFPGTGDIPWQETDATS-PLNVYGKTKLAGEK 137 +++ S+ V+ +P+ D+ P+++Y TK A E Sbjct: 121 --LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANEL 160
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 72.5 bits (178), Expect = 2e-16 Identities = 62/352 (17%), Positives = 122/352 (34%), Gaps = 48/352 (13%) Query: 11 RVFVTGHTGFKGSWLSLWLTEMGAIVKGYALDAPTVPSLFEIVRLNDLMES----HIGDI 66 + VTG GF G +S L E G V G + RL L + H D+ Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 67 RDFEKLRNSIAEFKPEIVFHMAAQPLVRLSYEQPIETYSTNVMGTVHLLEAVKQVGNIKA 126 D E + + A E VF + VR S E P +N+ G +++LE + I+ Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN-KIQH 120 Query: 127 VVNITSDKCYDNREWVWGYRENEPMGGYD-------PYSNSKGCAELVASAFRNSFFNPA 179 ++ +S V+G P D Y+ +K EL+A + + + Sbjct: 121 LLYASSSS-------VYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY---- 169 Query: 180 NYEQHGVGLASVRAGNVIGGGDWAK-DRLIPDILRSFENNQQVIIRNPYSI-RPWQHVLE 237 G+ +R V G W + D + ++ + + + N + R + ++ + Sbjct: 170 -----GLPATGLRFFTVY--GPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDD 222 Query: 238 PLSGYIVVAQRLYTEGAKFSEG-------------WNFGPRDEDAKTVEFIVDKMVTLWG 284 I + + +++ +N G + + + + G Sbjct: 223 IAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIG--NSSPVELMDYIQALEDALG 280 Query: 285 DDASWLLDGENHPHEAHYLKLDCSKANMQLGWHPRWGLTETLGRIVKWHKAW 336 +A + P + D +G+ P + + + V W++ + Sbjct: 281 IEAKKNML-PLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331
>PERTACTIN#Pertactin signature. Length = 922 Score = 30.8 bits (69), Expect = 0.011 Identities = 23/82 (28%), Positives = 34/82 (41%), Gaps = 9/82 (10%) Query: 209 GDIGTVSFYPAHHITMGEGGAVFTKSGELKKIIESFRDWGRDCYCAPGCDNTCGKRFGQQ 268 G +G S + E A+ + GEL+ ++ WGR DN G+RF Q+ Sbjct: 629 GGVGLAS-----TLWYAESNALSKRLGELRLNPDAGGAWGRGFAQRQQLDNRAGRRFDQK 683 Query: 269 LGSLPQGYDHKYTYS----HLG 286 + G DH + HLG Sbjct: 684 VAGFELGADHAVAVAGGRWHLG 705
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 80.6 bits (199), Expect = 1e-19 Identities = 62/332 (18%), Positives = 126/332 (37%), Gaps = 56/332 (16%) Query: 8 VIVSGASGFIGKHLLEALKKSGISVVAITRDVIKNNSNAL---ANVRWCSWDNIEL---- 60 +V+GA+GFIG H+ + L ++G VV I D + + + A + + + Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGI--DNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 61 -----LVEELSIDSALIGIIHLATEYGHKTSSLINIE------DANVIKPLKLLDLAIKY 109 + +L + + S +E D+N+ L +L+ Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYS----LENPHAYADSNLTGFLNILEGCRHN 116 Query: 110 RADIF----------LNTDSFFAKKDFNYQHMRPYIITKRHFDEIGHYYANMHDISFVNM 159 + LN F+ D + Y TK+ + + H Y++++ + + Sbjct: 117 KIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGL 176 Query: 160 RLEHVYGP-GDGENKFIPYIIDCLNKKQSCVKCTTGEQIRDFIFVDDVVNAYLTILEN-- 216 R VYGP G + + L K V G+ RDF ++DD+ A + + + Sbjct: 177 RFFTVYGPWGRPDMALFKFTKAMLEGKSIDVY-NYGKMKRDFTYIDDIAEAIIRLQDVIP 235 Query: 217 ------RKEVPS-------YTEYQVGTGAGVSLKDFLVYLQNTMMPGSSSIFEFGAIEQR 263 E + Y Y +G + V L D++ L++ + + + + Sbjct: 236 HADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNM----LPLQ 291 Query: 264 DNEIMFSVANNKNL-KAMGWKPNFDYKKGIEE 294 +++ + A+ K L + +G+ P K G++ Sbjct: 292 PGDVLETSADTKALYEVIGFTPETTVKDGVKN 323
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 31.6 bits (71), Expect = 0.006 Identities = 17/84 (20%), Positives = 36/84 (42%), Gaps = 1/84 (1%) Query: 136 STTAEGAQRRLAEYIQQVDEEVAKELEVDLKDNITLQTKTLQESLETQEVVAQEQKDLRI 195 +T R +A+ + + + EV + T +T+T E+ ET V +E+ + Sbjct: 1058 ATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT-TETKETATVEKEEKAKVET 1116 Query: 196 KQIEEALRYADEAKITQPQIQQTQ 219 ++ +E + + Q Q + Q Sbjct: 1117 EKTQEVPKVTSQVSPKQEQSETVQ 1140
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 33.2 bits (75), Expect = 5e-04 Identities = 27/110 (24%), Positives = 43/110 (39%), Gaps = 10/110 (9%) Query: 53 LTDATAALQREVTERAKEQRRQHAADEERKRADEELAKIQADADAAERARGGLQQQLAAV 112 T+A ++LQ + AA + A A+ QA A+A +A +QQ A Sbjct: 193 FTEAISSLQIRMN-------TLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQAAIR 245 Query: 113 Q-RQLAGSETGRLSAIAAASQ--AKSETGILLAQLLGEADDLAGKFAKEA 159 A G + A AA ++ LAQ + +A + G+ A Sbjct: 246 AANTYAMPANGSVVATAAGRGLIQVAQGAASLAQAISDAIAVLGRVLASA 295
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 27.7 bits (62), Expect = 0.014 Identities = 17/79 (21%), Positives = 34/79 (43%), Gaps = 9/79 (11%) Query: 86 RTELDRRILASADLIKLNRKKAIDTTLSRFSGWASSIPSADSIALTGIQGT--MRETA-- 141 + +L ++ + +L K + A+D S S + + + L G G +RE A Sbjct: 4 KQDLIAKVAEATELTKKDSAAAVDAVFSAVSS---YLAKGEKVQLIGF-GNFEVRERAAR 59 Query: 142 -GHIQKAAEKVDYEARRVM 159 G + E++ +A +V Sbjct: 60 KGRNPQTGEEIKIKASKVP 78
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 46.6 bits (110), Expect = 2e-07 Identities = 29/137 (21%), Positives = 59/137 (43%), Gaps = 13/137 (9%) Query: 291 DSIPNEAEKMDEEKIVALINKAIDARMAKADSEAADLKAKAD--AEEAAKKEKADAEEKE 348 P A + + VA +K + K + +A + A+ A+EA KA+ + E Sbjct: 1025 VPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNE 1084 Query: 349 AEEAKAKA-DAEEKAAKEKADAEAKEKA--DTEEAERMAKEKADADVRREIAEL------ 399 ++ ++ + + KE A E +EKA +TE+ + + K + ++E +E Sbjct: 1085 VAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAE 1144 Query: 400 --KSRIPTELSDEERNE 414 + PT E +++ Sbjct: 1145 PARENDPTVNIKEPQSQ 1161 Score = 43.5 bits (102), Expect = 2e-06 Identities = 38/215 (17%), Positives = 78/215 (36%), Gaps = 26/215 (12%) Query: 314 DARMAKADSEAADL-KAKADAEEAAKKEKADAEEKEAEEAKAKADAEEKAAKEKADAEAK 372 KA+++ ++ ++ ++ +E E + E EE KAK + E+ K ++ Sbjct: 1072 AKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEE-KAKVETEKTQEVPKVTSQVS 1130 Query: 373 EKADTEEAERMAKEKA------------------DADVRREIAELKSRIPTELSDEERNE 414 K + E + E A AD + E S + +++ Sbjct: 1131 PKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVN 1190 Query: 415 VADAQVKADSVFSCFGKRAPVPLSGEKPLAYRRRLMIQLQEHSPDFKTV---DLSSIADS 471 ++ V+ + + V R R ++ H+ + T D S++A Sbjct: 1191 TGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALC 1250 Query: 472 ALLSVAEKTIYADAQKSA---ILSVGPGMLREIKR 503 L S + +DA+ A L+VG + + I + Sbjct: 1251 DLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQ 1285 Score = 37.0 bits (85), Expect = 2e-04 Identities = 27/132 (20%), Positives = 49/132 (37%), Gaps = 8/132 (6%) Query: 296 EAEK----MDEEKIVALINKAIDARMAKADSEAADLKAKADAEEAAKKEKADAEEKEAEE 351 E EK +D I N D +++E +A A ++ E AE Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043 Query: 352 AKAKADAEEKAAKEKADAEAKEKADTEEAERMAKEKAD----ADVRREIAELKSRIPTEL 407 +K ++ EK ++ + A+ + +EA+ K A E E ++ E Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103 Query: 408 SDEERNEVADAQ 419 + E+ E A + Sbjct: 1104 ATVEKEEKAKVE 1115
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 581 bits (1499), Expect = 0.0 Identities = 201/395 (50%), Positives = 279/395 (70%), Gaps = 5/395 (1%) Query: 4 KIMAINAGSSSLKFQLLEMPQGDMLCQGLIERIGMADAQVTIKTHSQKWQETVPVADHRD 63 KI+ IN GSSSLK+QL+E G++L +GL ERIG+ D+ +T + +K + + DH+D Sbjct: 2 KILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKD 61 Query: 64 AVTLLLEKLLG--YQIINSLRDIDGVGHRVAHGGEFFKDSTLVTDETLAQIERLAELAPL 121 A+ L+L+ L+ Y +I + +ID VGHRV HGGE+F S L+TD+ L I ELAPL Sbjct: 62 AIKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIELAPL 121 Query: 122 HNPVNALGIHVFRQLLPDAPSVAVFDTAFHQTLDEPAYIYPLPWHYYAELGIRRYGFHGT 181 HNP N GI Q++PD P VAVFDTAFHQT+ + AY+YP+P+ YY + IR+YGFHGT Sbjct: 122 HNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFHGT 181 Query: 182 SHKYVSGVLAEKLGVPLSALRVICCHLGNGSSICAIKNGRSVNTSMGFTPQSGVMMGTRS 241 SHKYVS AE L P+ +L++I CHLGNGSSI A+KNG+S++TSMGFTP G+ MGTRS Sbjct: 182 SHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGTRS 241 Query: 242 GDIDPSILPWIAQRESKTPQQLNQLLNNESGLLGVSGVSSDYRDVEQAA-NTGNRQAKLA 300 G IDPSI+ ++ ++E+ + +++ +LN +SG+ G+SG+SSD+RD+E AA G+++A+LA Sbjct: 242 GSIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQLA 301 Query: 301 LTLFAERIRATIGSYIMQMGGLDALVFTGGIGENSARARSAVCHNLQFLGLAVDEEKNQR 360 L +FA R++ TIGSY MGG+D +VFT GIGEN R + L+FLG +D+EKN+ Sbjct: 302 LNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKNKV 361 Query: 361 NA--TFIQTENALVKVAVINTNEELMIAQDVMRIA 393 I T ++ V V V+ TNEE MIA+D +I Sbjct: 362 RGEEAIISTADSKVNVMVVPTNEEYMIAKDTEKIV 396
>BONTOXILYSIN#Bontoxilysin signature. Length = 1196 Score = 30.6 bits (69), Expect = 0.011 Identities = 8/39 (20%), Positives = 17/39 (43%) Query: 190 SDFTDALAEKAAKLVFQYLPTAVEKGDCVATRGKMHNAS 228 SDF+ ++ K LV+ +L + + + G + Sbjct: 518 SDFSKVVSSKDKSLVYSFLDNLMSYLETIKNDGPIDTDK 556
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 27.3 bits (60), Expect = 0.023 Identities = 13/30 (43%), Positives = 15/30 (50%) Query: 90 PPPPVIEPEPEASEIAAVVSEAPAEEAPQE 119 PP PV+EPEPE I EAP + Sbjct: 64 PPEPVVEPEPEPEPIPEPPKEAPVVIEKPK 93
>PF03627#PapG Length = 336 Score = 36.9 bits (85), Expect = 1e-04 Identities = 22/93 (23%), Positives = 34/93 (36%), Gaps = 8/93 (8%) Query: 327 DDHVLDAVLPPDIP-------IPSIAEVQRALYDATKAVSGMPGEEVKQRLRTGTVVTTD 379 DD + LP D+P IP + +QR A +P K R ++ Sbjct: 152 DDIIFKVALPADLPLGDYSVTIPYTSGMQRHFASYLGARFKIPYNVAKTLPRENEMLFLF 211 Query: 380 DRNWELRYSASALRFNLSRAVAIDMESATIAAQ 412 R SA +L ++I+ + AAQ Sbjct: 212 KNIGGCRPSAQSLEIKHGD-LSINSANNHYAAQ 243
>PF05616#Neisseria meningitidis TspB protein Length = 501 Score = 27.8 bits (61), Expect = 0.026 Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 2/39 (5%) Query: 9 GTQTDPGTGKPSENPPAAPPSDGPASEKPHDPPAAPNKP 47 GT+ +P P NP A P +DG +P D PA P++P Sbjct: 348 GTRPNP-EPDPDLNPDANPDTDGQPGTRP-DSPAVPDRP 384
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 34.7 bits (79), Expect = 0.004 Identities = 37/228 (16%), Positives = 74/228 (32%), Gaps = 15/228 (6%) Query: 813 QLDQQIQLVEEKSETLEREIEDVERNNEHLKAVSAAPSFIWDDEPPLEDTRRQRSHRYTA 872 L++ ++ S +I+ +E L+A A E LE + Sbjct: 159 DLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAEL------EKALEGAMNFSTADSAK 212 Query: 873 LTDIEEKHRSVSSQWKKSRNLLLALQECEPDSKILFRDFPQELAEIADQIKRAEVAGRDI 932 + +E + +++++ L S AE A R + + Sbjct: 213 IKTLEAEKAALAARKADLEKALEGAMN---FSTADSAKIKTLEAEKAALEARQAELEKAL 269 Query: 933 KRYQPLINQIEKEYPLLREEYPENIAQVRQQVEQNEKTWQTSAMRVRLVKELDSVRAHLK 992 + + L E A+ Q++ +A R L ++LD+ R K Sbjct: 270 EGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVL---NANRQSLRRDLDASREAKK 326 Query: 993 QEYANAQKILEDEAQAQILLSGDQKRLEQDGDRIKQ---ELTTAKNEL 1037 Q A QK+ E ++ ++ L+ + KQ E + + Sbjct: 327 QLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQN 374
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 560 bits (1444), Expect = 0.0 Identities = 268/396 (67%), Positives = 310/396 (78%), Gaps = 17/396 (4%) Query: 1 MNRKVLALLVPALLVAGAANAAEIYNKNGNKLDLYGKVDGLRYFSDNAGDDGDQSYARIG 60 M RKVLAL++PALL AGAA+AAEIYNK+GNKLDLYGKVDGL YFSD++ DGDQ+Y R+G Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60 Query: 61 FKGETQINDMLTGYGQWEYNIKVNTTEGEGANSWTRLGFAGLKFGEYGSFDYGRNYGVIY 120 FKGETQIND LTGYGQWEYN++ NTTEGEGANSWTRL FAGLKFG+YGSFDYGRNYGV+Y Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120 Query: 121 DIEAWTDALPEFGGDTYTQTDVYMLGRTNGVATYRNTDFFGLVEGLNFALQYQGNNENGG 180 D+E WTD LPEFGGD+YT D YM GR NGVATYRNTDFFGLV+GLNFALQYQG NE+ Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180 Query: 181 AGEGTGNGGS----RKLARENGDGFGMSASYDFDFGLSLGAAYSSSDRTDNQVARGYGDG 236 A + + + +NGDGFG+S +YD G S GAAY++SDRT+ QV G Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAG---- 236 Query: 237 MNERNNYAGGETAEAWTVGAKYDAYNVYLAAMYAETRNMTYYGGGNGEDNGGIANKTQNF 296 AGG+ A+AWT G KYDA N+YLA MY+ETRNMT YG + +GG+ANKTQNF Sbjct: 237 ----GTIAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNF 292 Query: 297 EVVAQYQFDFGLRPSIAYLQSKGKDLGGQEVHRGNWRYTNKDLVKYVDVGMTYYFNKNMS 356 EV AQYQFDFGLRP++++L SKGKDL V+ +KDLVKY DVG TYYFNKN S Sbjct: 293 EVTAQYQFDFGLRPAVSFLMSKGKDLTYNNVNGD-----DKDLVKYADVGATYYFNKNFS 347 Query: 357 TYVDYKINLLDEDDDFYASNGIATDDIVGVGLVYQF 392 TYVDYKINLLD+DD FY GI+TDDIV +G+VYQF Sbjct: 348 TYVDYKINLLDDDDPFYKDAGISTDDIVALGMVYQF 383
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 213 bits (543), Expect = 5e-71 Identities = 231/260 (88%), Positives = 246/260 (94%) Query: 1 MIQVTSEQWLYWLHLYFWPLLRVLALISTAPILSERAIPKRVKLGLGIMITLVIAPSLPA 60 M+QVTSEQWL WL+LYFWPLLRVLALISTAPILSER++PKRVKLGL +MIT IAPSLPA Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60 Query: 61 NDTPLFSIAALWLAMQQILIGIALGFTMQFAFAAVRTAGEFIGLQMGLSFATFVDPGSHL 120 ND P+FS ALWLA+QQILIGIALGFTMQFAFAAVRTAGE IGLQMGLSFATFVDP SHL Sbjct: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120 Query: 121 NMPVLARIMDMLAMLLFLTFNGHLWLISLLVDTFHTLPIGSNPVNSNAFMALARAGGLIF 180 NMPVLARIMDMLA+LLFLTFNGHLWLISLLVDTFHTLPIG P+NSNAF+AL +AG LIF Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180 Query: 181 LNGLMLALPVITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGIMLMAALMPLIAPFC 240 LNGLMLALP+ITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGI LMAALMPLIAPFC Sbjct: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240 Query: 241 EHLFSEIFNLLADIVSEMPI 260 EHLFSEIFNLLADI+SE+P+ Sbjct: 241 EHLFSEIFNLLADIISELPL 260
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 67.5 bits (165), Expect = 1e-18 Identities = 23/78 (29%), Positives = 42/78 (53%) Query: 4 ESVMMMGTEAMKVALALAAPLLLVALITGLIISILQAATQINEMTLSFIPKIVAVFIAII 63 + ++ G +A+ + L L+ +VA I GL++ + Q TQ+ E TL F K++ V + + Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61 Query: 64 VAGPWMLNLLLDYVRTLF 81 + W +LL Y R + Sbjct: 62 LLSGWYGEVLLSYGRQVI 79
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 329 bits (844), Expect = e-117 Identities = 225/245 (91%), Positives = 234/245 (95%) Query: 1 MRRLLFLSLAGLWLFSPVAAAQLPGLISQPLAGGGQSWSLSVQTLVFITSLTFLPAILLM 60 MRRLL ++ LWL +P+A AQLPG+ SQPL GGGQSWSL VQTLVFITSLTF+PAILLM Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60 Query: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEQK 120 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSE+K Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120 Query: 121 ISMQEALDKGAQPLRAFMLRQTREADLALFARLANSGPLQGPEAVPMRILLPAYVTSELK 180 ISMQEAL+KGAQPLR FMLRQTREADL LFARLAN+GPLQGPEAVPMRILLPAYVTSELK Sbjct: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180 Query: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA Sbjct: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240 Query: 241 QSFYS 245 QSFYS Sbjct: 241 QSFYS 245
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 209 bits (534), Expect = 2e-73 Identities = 136/137 (99%), Positives = 136/137 (99%) Query: 1 MSDMNNPSDENTGALDDLWADALNEQKATTNKSAADAVFQQLGGGDVSGAMQDIDLIMDI 60 MSDMNNPSDENTGALDDLWADALNEQKATT KSAADAVFQQLGGGDVSGAMQDIDLIMDI Sbjct: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDI 60 Query: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV Sbjct: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120 Query: 121 RITDIITPSERMRRLSR 137 RITDIITPSERMRRLSR Sbjct: 121 RITDIITPSERMRRLSR 137
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 384 bits (987), Expect = e-136 Identities = 86/324 (26%), Positives = 148/324 (45%), Gaps = 10/324 (3%) Query: 5 ILSQAEIDALLNGDS--DTKDEPTPGIASDSDIRPYDPNTQRRVVRERLQALEIINERFA 62 +LSQ EID LL S D E I+ I YD + +E+++ L +++E FA Sbjct: 4 VLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETFA 63 Query: 63 RQFRMGLFNLLRRSPDITVGAIRIQPYHEFARNLPVPTNLNLIHLKPLRGTGLVVFSPSL 122 R L LR + V ++ Y EF R++P P+ L +I + PL+G ++ PS+ Sbjct: 64 RLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPSI 123 Query: 123 VFIAVDNLFGGDGRFPTKVEGREFTHTEQRVINRMLKLALEGYSDAWKAINPLEVEYVRS 182 F +D LFGG G+ KV+ R+ T E V+ ++ L ++W + L + Sbjct: 124 TFSIIDRLFGGTGQ-AAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQI 181 Query: 183 EMQVKFTNITTSPNDIVVNTPFHVEIGNLTGEFNICLPFSMIEPLRELLVNPPLENS--R 240 E +F I P+++VV ++G G N C+P+ IEP+ L + +S R Sbjct: 182 ETNPQFAQI-VPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240 Query: 241 HEDQNWRDNLVRQVQHSELELVANFADIPLRLSQILKLKPGDVLPIEKP---DRIIAHVD 297 + L ++ ++++VA + L + IL L+ GD++ + D + + Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300 Query: 298 GVPVLTSQYGTVNGQYALRVEHLI 321 Q G V + A ++ I Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324
>FLGHOOKFLIK#Flagellar hook-length control protein signature. Length = 375 Score = 406 bits (1044), Expect = e-143 Identities = 193/411 (46%), Positives = 233/411 (56%), Gaps = 40/411 (9%) Query: 1 MITLPQLITTDTDMTAGLTSGKTTGSAEDFLALLAGALGADGAQGKDARITLADLQAAGG 60 MI L LIT D D T L GK + +A+DFLALL+ AL + K A L Sbjct: 1 MIRLAPLITADVDTTT-LPGGKASDAAQDFLALLSEALAGETTTDKAAPQLL-------- 51 Query: 61 KLSKELLTQHGEPGQALKLADLLAQKAN---ATDETLTDLTQAQHLLSTLTPSLKTSALA 117 ++ + T GEP + ++D AQ+AN DET + Q + LT + + A Sbjct: 52 -VATDKPTTKGEPLISDIVSD--AQQANLLIPVDETPPVINDEQSTSTPLTTAQTMALAA 108 Query: 118 ALSKTAQHDEKTPALSDEDLASLSALFAMLPGQPVATPVAGETPAENHIALPSLLRGDMP 177 K DEK L+++ ASLSALFAMLPG V D P Sbjct: 109 VADKNTTKDEKADDLNEDVTASLSALFAMLPGFDNTPKVT-----------------DAP 151 Query: 178 SAPQEETHTLSFSEHEKGKTEASLARASDDRATGPALTPLVVAAAATSAKVEVDSPPAPV 237 S F++ T L A D A G PL A +K EV S P+PV Sbjct: 152 STVLPTEKPTLFTK----LTSEQLTTAQPDDAPGTPAQPLTPLVAEAQSKAEVISTPSPV 207 Query: 238 THGAAMPTLSSATAQPQPLPVASAPVLSAPLGSHEWQQTFSQQVMLFTRQGQQSAQLRLH 297 T AA P ++ QP LP +APVLSAPLGSHEWQQ+ SQ + LFTRQGQQSA+LRLH Sbjct: 208 T-AAASPLITPHQTQP--LPTVAAPVLSAPLGSHEWQQSLSQHISLFTRQGQQSAELRLH 264 Query: 298 PEELGQVHISLKLDDNQAQLQMVSPHSHVRAALEAALPMLRTQLAESGIQLGQSSISSES 357 P++LG+V ISLK+DDNQAQ+QMVSPH HVRAALEAALP+LRTQLAESGIQLGQS+IS ES Sbjct: 265 PQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPVLRTQLAESGIQLGQSNISGES 324 Query: 358 FAGQQQ-SSSQQQSSRAQHTDAFGAEDDIALAAPASLQAAARGNGAVDIFA 407 F+GQQQ +S QQQS R + + EDD L P SLQ GN VDIFA Sbjct: 325 FSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVSLQGRVTGNSGVDIFA 375
>FLGFLIJ#Flagellar FliJ protein signature. Length = 147 Score = 206 bits (526), Expect = 4e-72 Identities = 130/147 (88%), Positives = 138/147 (93%) Query: 1 MAQHGALETLKDLAEKEVDDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRSNLNTDMGNG 60 MA+HGAL TLKDLAEKEV+DAARLLGEMRRGCQQAEEQLKMLIDYQNEYR+NLN+DM G Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60 Query: 61 IASNRWINYQQFIQTLEKAIEQHRLQLTQWTQKVDLALKSWREKKQRLQAWQTLQDRQTA 120 I SNRWINYQQFIQTLEKAI QHR QL QWTQKVD+AL SWREKKQRLQAWQTLQ+RQ+ Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120 Query: 121 AALLAENRMDQKKMDEFAQRAAMRKPE 147 AALLAENR+DQKKMDEFAQRAAMRKPE Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147
>FLAGELLIN#Flagellin signature. Length = 507 Score = 264 bits (677), Expect = 6e-85 Identities = 250/507 (49%), Positives = 293/507 (57%), Gaps = 13/507 (2%) Query: 2 AQVINTNSLSLLTQNNLNKSQSALGTAIERLSSGLRINSAKDDAAGQAIANRFTANIKGL 61 AQVINTNSLSLLTQNNLNKSQS+L +AIERLSSGLRINSAKDDAAGQAIANRFT+NIKGL Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60 Query: 62 TQASRNANDGISIAQTTEGALNEINNNLQRVRELAVQSANSTNSQSDLDSIQAEITQRLN 121 TQASRNANDGISIAQTTEGALNEINNNLQRVREL+VQ+ N TNS SDL SIQ EI QRL Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120 Query: 122 EIDRVSGQTQFNGVKVLAQDNTLTIQVGANDGETIDIDLKQINSQTLGLDTLSVQDAYTP 181 EIDRVS QTQFNGVKVL+QDN + IQVGANDGETI IDL++I+ ++LGLD +V Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEA 180 Query: 182 KGTAVTRDVTTYKNGGTTLTAPNAAAIDTALGTTGAAGTAAVK----FKDGNYFVEVTGT 237 + T N +D G TA + + T Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDD 240 Query: 238 TKDGLYEATVDAAGAVTMTANKATVTGASTVTENQIVDAVTPTPVDTVAAATALTNAGVT 297 ++ + TA + GA + N V+ Sbjct: 241 AENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVS 300 Query: 298 GATGNTSLVKMSFEDKNGKVTDAGYALKVGNDYYAA------DYDEKTGEIKAKTVNYTD 351 + + G L+ + Y + +D+KT AK + Sbjct: 301 TTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEA 360 Query: 352 ATGATKTGAVKFGGANGKTEV---VTTVDGNTYQASDVKGHNFQSGGALSEAVTTKTENP 408 + GA T+ G T + A T NP Sbjct: 361 NNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANP 420 Query: 409 LAKIDAALAQVDALRSDLGAVQNRFNSAITNLGNTVNNLSEARSRIEDSDYATEVSNMSR 468 LA ID+AL++VDA+RS LGA+QNRF+SAITNLGNTV NL+ ARSRIED+DYATEVSNMS+ Sbjct: 421 LASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSK 480 Query: 469 AQILQQAGTSVLAQANQVPQNVLSLLR 495 AQILQQAGTSVLAQANQVPQNVLSLLR Sbjct: 481 AQILQQAGTSVLAQANQVPQNVLSLLR 507
>SECA#SecA protein signature. Length = 901 Score = 55.3 bits (133), Expect = 4e-11 Identities = 19/35 (54%), Positives = 22/35 (62%), Gaps = 1/35 (2%) Query: 186 PHTTPLQMPIK-AEVKVGRNDPCPCGSGKKFKQCC 219 + + E KVGRNDPCPCGSGKK+KQC Sbjct: 863 DSAAAAALAAQTGERKVGRNDPCPCGSGKKYKQCH 897
>cloacin#Cloacin signature. Length = 551 Score = 29.3 bits (65), Expect = 0.028 Identities = 14/46 (30%), Positives = 26/46 (56%) Query: 50 TPEAVEQDTTEHHPDPQPLENEPPVSQTEAGYQKIRAELHEARKNI 95 +P+ V+Q E + Q + PV E Y++ RAEL++A +++ Sbjct: 292 SPDQVKQRQDEENRRQQEWDATHPVEAAERNYERARAELNQANEDV 337
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 29.1 bits (65), Expect = 0.006 Identities = 12/49 (24%), Positives = 19/49 (38%) Query: 20 KVAQLVGSAPEALDTLQELADALGNDPNFAITVLNKLAGKQPLDETLTA 68 K +L A E +DT+ E A+G P + + A +A Sbjct: 49 KFEELYDHAAETVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSA 97
>SOPEPROTEIN#Salmonella type III secretion SopE effector protein signature. Length = 239 Score = 401 bits (1032), Expect = e-146 Identities = 163/237 (68%), Positives = 193/237 (81%) Query: 2 TNITLSTQHYRIHRSDVEPVKEKTTEKDIFAKSITAVRNSFISLSTSLSDRFSLHQQTDI 61 T ITLS Q++RI + + +KEK+TEK+ AKSI AV+N FI L + LS+RF H+ T+ Sbjct: 1 TKITLSPQNFRIQKQETTLLKEKSTEKNSLAKSILAVKNHFIELRSKLSERFISHKNTES 60 Query: 62 PTTHFHRGSASEGRAVLTSKTVKDFMLQKLNSLDIKGNASKDPAYARQTCEAILSAVYSN 121 THFHRGSASEGRAVLT+K VKDFMLQ LN +DI+G+ASKDPAYA QT EAILSAVYS Sbjct: 61 SATHFHRGSASEGRAVLTNKVVKDFMLQTLNDIDIRGSASKDPAYASQTREAILSAVYSK 120 Query: 122 NKDHCCKLLISKGVSITPFLKEIGEAAQNAGLPGEIKNGVFTPGGAGANPFVVPLIAAAS 181 NKD CC LLISKG++I PFL+EIGEAA+NAGLPG KN VFTP GAGANPF+ PLI++A+ Sbjct: 121 NKDQCCNLLISKGINIAPFLQEIGEAAKNAGLPGTTKNDVFTPSGAGANPFITPLISSAN 180 Query: 182 IKYPHMFINHNQQVSFKAYAEKIVMKEVTPLFNKGTMPTPQQFQLTIENIANKHLQN 238 KYP MFIN +QQ SFK YAEKI+M EV PLFN+ MPTPQQFQL +ENIANK++QN Sbjct: 181 SKYPRMFINQHQQASFKIYAEKIIMTEVAPLFNECAMPTPQQFQLILENIANKYIQN 237
>SECA#SecA protein signature. Length = 901 Score = 56.4 bits (136), Expect = 1e-12 Identities = 16/28 (57%), Positives = 21/28 (75%) Query: 92 IDGTRPQLGRNDPCPCGSGKKFKKCCGQ 119 ++GRNDPCPCGSGKK+K+C G+ Sbjct: 872 AQTGERKVGRNDPCPCGSGKKYKQCHGR 899
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 86.0 bits (213), Expect = 9e-21 Identities = 36/152 (23%), Positives = 60/152 (39%), Gaps = 3/152 (1%) Query: 10 ILIVEDEPVFRSLLDSWFSSLGATTALAGDGVDALELMGRFTPDLMICDIAMPRMNGLKL 69 IL+ +D+ R++L+ S G + + + DL++ D+ MP N L Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 70 VENLRNRGDQTPILVISATENMADIAKALRLGVEDVLLKPVKDLNRLRETVFACLYPNMF 129 + ++ P+LV+SA KA G D L KP DL L + L Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF-DLTELIGIIGRAL--AEP 122 Query: 130 NSRVEEEERLFRDWDAMVSNPTAAAQLLQELQ 161 R + E +D +V A ++ + L Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLA 154
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 27.9 bits (62), Expect = 0.002 Identities = 9/49 (18%), Positives = 18/49 (36%) Query: 8 KSGIIGFTSAVTILTTFFTGFRSSLRIVFEIPAAMLTAFAARFRCFFTI 56 K+ ++ F R++L +P +L FA ++I Sbjct: 342 KTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSI 390
>PF08280#M protein trans-acting positive regulator Length = 530 Score = 31.4 bits (71), Expect = 0.014 Identities = 23/105 (21%), Positives = 36/105 (34%), Gaps = 2/105 (1%) Query: 526 PIDVELTESCLIENDTLALSVIQQFSQLGAQIHLDDFGTGYSSLSQLARFPIDAVKLDQA 585 P+ V S I L S + FS I + ++ Q+ D V Sbjct: 425 PLVVVFVASNFINAHLLTDSFPRYFSDKS--IDFHSYYLLQDNVYQIPDLKPDLVITHSQ 482 Query: 586 FVRDIHKQPLSQSLVRAIVAVAQALNLQVIAEGVENAKEDAFLTK 630 + +H + V I L++Q + V+ K A LTK Sbjct: 483 LIPFVHHELTKGIAVAEISFDESILSIQELMYQVKEEKFQADLTK 527
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 52.0 bits (124), Expect = 4e-10 Identities = 52/260 (20%), Positives = 99/260 (38%), Gaps = 22/260 (8%) Query: 4 LSGKRILVTGVASKLSIAYGIAQAMHREGAEL-AFTYQNDKLKGRVEEFAAQLGSSIVLP 62 + GK +TG A I +A+ + +GA + A Y +KL+ V A+ + P Sbjct: 6 IEGKIAFITGAAQ--GIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63 Query: 63 CDVAEDASIDAMFAELGNVWPKFDGFVHSIGF---APGDQLDGDYVNAVTREGFKIAHDI 119 DV + A+ID + A + D V+ G L + A F + Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEAT----FSVN--- 116 Query: 120 SSYSFVAMAKACRTMLNP-GSALLTLSYLGAERAIPNYNVMGLAKASLEANVRYMANAMG 178 S+ F A + M++ +++T+ A + +KA+ + + + Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176 Query: 179 PEGVRVNAISAGPIRTLAASGI--------KDFRKMLAHCEAVTPIRRTVTIEDVGNSAA 230 +R N +S G T + + + L + P+++ D+ ++ Sbjct: 177 EYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVL 236 Query: 231 FLCSDLSAGISGEVVHVDGG 250 FL S + I+ + VDGG Sbjct: 237 FLVSGQAGHITMHNLCVDGG 256
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.0 bits (70), Expect = 0.007 Identities = 9/16 (56%), Positives = 14/16 (87%) Query: 38 LVGESGSGKSLIAKAI 53 + GESG+GK L+A+A+ Sbjct: 165 ITGESGTGKELVARAL 180
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 344 bits (883), Expect = e-118 Identities = 124/345 (35%), Positives = 178/345 (51%), Gaps = 22/345 (6%) Query: 7 AEFKDNLLGEANRFLEVLEQVSRLAPLDKPVLIIGERGTGKELIANRLHYLSSRWQGPLI 66 ++ L+G + E+ ++RL D ++I GE GTGKEL+A LH R GP + Sbjct: 133 SQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFV 192 Query: 67 SLNCAALNENLLDSELFGHEAGAFTGAQKRHPGRFERADGGTLFLDELATAPMLVQEKLL 126 ++N AA+ +L++SELFGHE GAFTGAQ R GRFE+A+GGTLFLDE+ PM Q +LL Sbjct: 193 AINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLL 252 Query: 127 RVIEYGELERVGGSQPLQVNVRLVCATNADLPAMVKEGTFRADLLDRLAFDVVQLPPLRE 186 RV++ GE VGG P++ +VR+V ATN DL + +G FR DL RL ++LPPLR+ Sbjct: 253 RVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRD 312 Query: 187 RQSDIMLMAEHFAIQMCRELRLPLFPGFTDRAKETLLHYAWPGNVRELKNVVERSVYRHG 246 R DI + HF Q +E F A E + + WPGNVREL+N+V R + Sbjct: 313 RAEDIPDLVRHFVQQAEKEGLDVK--RFDQEALELMKAHPWPGNVRELENLVRRLTALYP 370 Query: 247 SSE--------HPLDEIVIDPFQRHPAEPPAPALPAASVT------------PDLPLKLR 286 EI P ++ A + ++ A Sbjct: 371 QDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYD 430 Query: 287 EFQLQQEKALLQRSLQQAKFNQKRAADLLALTYHQFRALLKKHQL 331 + E L+ +L + NQ +AADLL L + R +++ + Sbjct: 431 RVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 87.0 bits (215), Expect = 9e-23 Identities = 68/248 (27%), Positives = 111/248 (44%), Gaps = 22/248 (8%) Query: 7 KSVLILGGSRGIGAAIVRRFSADGASVV-FSYAG-------SREAAEKLAAETGSIAIQT 58 K I G ++GIG A+ R ++ GA + Y S AE AE ++ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 59 DSADRDAVISLVREYGPLDILVVNAGVALFGDALEQDSDAIDRLFRINIHAPYHASVEAA 118 +A + + RE GP+DILV AGV G + + F +N ++AS + Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 119 RNMP--EGGRIIIIGSVNGDRMPVPGMAAYAASKSALQGLARGLARDFGPRGITINVVQP 176 + M G I+ +GS N +P MAAYA+SK+A + L + I N+V P Sbjct: 129 KYMMDRRSGSIVTVGS-NPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 177 GPIDTDI--------NPEDGPMKELMHSF---MAIKRHGRPEEVAGMVAWLAGPEASFVT 225 G +TD+ N + +K + +F + +K+ +P ++A V +L +A +T Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247 Query: 226 GAMHTIDG 233 +DG Sbjct: 248 MHNLCVDG 255
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 45.4 bits (107), Expect = 3e-08 Identities = 14/115 (12%), Positives = 40/115 (34%), Gaps = 5/115 (4%) Query: 7 SRTPGRPRQFDPEQAIETAQHLFHSRGYDAVSVADLTKAFGINPPSFYAAFGSKLGLYTR 66 +T ++ + ++ A LF +G + S+ ++ KA G+ + Y F K L++ Sbjct: 3 RKTKQEAQE-TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61 Query: 67 VLK----RYRMTDAIPLGALLRHDRPTAKCLIDVLMEAARRYAADPDATGCLVLE 117 + + + + ++ ++E+ + + Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHK 116
>INTIMIN#Intimin signature. Length = 939 Score = 217 bits (553), Expect = 2e-62 Identities = 116/409 (28%), Positives = 187/409 (45%), Gaps = 21/409 (5%) Query: 29 SDNEIQSWIAGTASSISPHLQEGTLE-DYAKGKIKALPGQAANHLVNEGIKNAFPEIIFR 87 +D++ ++ A A+S+ LQ +L DYAK + G A+ + +++ Sbjct: 158 TDDKALNYAAQQAASLGSQLQSRSLNGDYAKDTALGIAGNQASSQLQAWLQH-----YGT 212 Query: 88 GGVNLEDGAKYRSSEFDMFIPVQETTSSLLFGQLGFRDHDSSSFDGRTYVNVGVGYRQEV 147 VNL+ G + S D +P ++ L FGQ+G R DS R N+G G R + Sbjct: 213 AEVNLQSGNNFDGSSLDFLLPFYDSEKMLAFGQVGARYIDS-----RFTANLGAGQRFFL 267 Query: 148 NGWLLGVNTFLDADIRYSHLRGGIGGEVYKDSLAFSGNYYFPLTGWKTSAVHELHDERPA 207 +LG N F+D D + R GIGGE ++D S N YF ++GW S + +DERPA Sbjct: 268 PENMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPA 327 Query: 208 YGFDLRTKGTLPDFPWFSGELTYEQYYGDKVDLLGNGTLSRNPRAAGAALVWNPVPLLEV 267 GFD+R G LP +P +L YEQYYGD V L + L NP AA + + P+PL+ + Sbjct: 328 NGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTM 387 Query: 268 RAGYRDAGNGGSQAEGGLRVNYSFGTPLHEQLDYRNV-GAPSNTTNRRAFVDRNYDIVMA 326 YR + ++ Y F P +Q++ + V + + +R V RN +I++ Sbjct: 388 GIDYRHGTGNENDLLYSMQFRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILE 447 Query: 327 YREQAS-KIRITAMPVSGLSGTLVTLMATVDSRYPVEKVEWSGDAELLAGLQLQGSLGSG 385 Y++Q + I ++G + + V S+Y ++++ W A G Q+Q S Sbjct: 448 YKKQDILSLNIPH-DINGTERSTQKIQLIVKSKYGLDRIVWDDSALRSQGGQIQHSGSQS 506 Query: 386 -----LILPQLPLTATDGQEYSLYLTVTDSRGTRVTSERIPVRVTQDET 429 ILP Y + D G + + + V + Sbjct: 507 AQDYQAILP--AYVQGGSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQ 553
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 30.5 bits (69), Expect = 0.005 Identities = 21/147 (14%), Positives = 50/147 (34%), Gaps = 22/147 (14%) Query: 49 NIARS--LFHAISLMAIFIIAWGVGILLFFLVKQKARIHDISFLRLFLAAVLFFIPIVIE 106 +A+S + ++A+ + G+ F + I F A+ + +V Sbjct: 22 QVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSY---VVDN 78 Query: 107 FSLLTESFLWELFFIILLVALC---LSVGMRF--------YSKLMPVICFTQLSWVR--- 152 L + L + L+A+ + G K+ P+ ++ ++ Sbjct: 79 VLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRIFSIKSLV 138 Query: 153 ---RHCFTIVMLGFIIYFFIFSFFVGI 176 + +V+L +I+ I V + Sbjct: 139 EFLKSILKVVLLSILIWIIIKGNLVTL 165
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 149 bits (379), Expect = 3e-43 Identities = 97/369 (26%), Positives = 169/369 (45%), Gaps = 6/369 (1%) Query: 20 RRILPVFLLVGLYAASTAAVMSVLPFYIREMGGSPLII---GIIIATEAFSQFCAAPLIG 76 R ++ + V L A +M VLP +R++ S + GI++A A QF AP++G Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64 Query: 77 HLSDRVGRKRILIVTLAIAAISLLLLANAQCILFILLARTLFGISAGNLSAAAAYIADCT 136 LSDR GR+ +L+V+LA AA+ ++A A + + + R + GI+ + A AYIAD T Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADIT 124 Query: 137 HVRNRRQAIGILTGCIGLGGIVGAGVSGWLSRISLGAPIYAAFILVLGSALVAIWGLKDP 196 R + G ++ C G G + G + G + S AP +AA L + L + L + Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184 Query: 197 STTSRTTDKIASFSARAILKMPVLRVLIIVMLCHFFAYGMYSSQLPVFLSDTFIWNGLPF 256 R + + + A + ++ ++ FF + Q+P L F + + Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLV-GQVPAALWVIFGEDRFHW 243 Query: 257 GPKALSYLLMADGVINIFVQLFLLGWVSQYFSERKLIILIFALLCTGFLTAGIATTIPVL 316 + L A G+++ Q + G V+ ER+ ++L TG++ AT + Sbjct: 244 DATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWM- 302 Query: 317 VFAIVCISIADALAKPTYLAALSVHVSPARQGIVIGTAQALIAIADFISPVLGGFVLGYA 376 F I+ + + + P A LS V RQG + G+ AL ++ + P+L + + Sbjct: 303 AFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAAS 362 Query: 377 LYGVWIGIA 385 + W G A Sbjct: 363 I-TTWNGWA 370
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 51.4 bits (123), Expect = 3e-09 Identities = 64/350 (18%), Positives = 119/350 (34%), Gaps = 30/350 (8%) Query: 10 YALFNFI----GGWASDKVGPKTVFLIAALLWSVFCGLTGLVTGLWTMLIVRVLFGMAEG 65 YAL F G SD+ G + V L++ +V + LW + I R++ G+ Sbjct: 52 YALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA 111 Query: 66 PVSAAGNKIINNWISRKESATAIGIFSAGSPLGGAVSGPIVGLLALSLGWRPAFGIIFLF 125 + AG I + E A G SA G V+GP++G L F Sbjct: 112 TGAVAGA-YIADITDGDERARHFGFMSA-CFGFGMVAGPVLGGLMGGFSPHAPFFAAAAL 169 Query: 126 GLVWVLLWYFIVSDKPTMSKRLAPEERIDFENHEDVILSDDGRATPSLGYYMKQPMVWAT 185 + L F++ + +R E + + Sbjct: 170 NGLNFLTGCFLLPESHKGERRPLRREAL-------------NPLASFRWARGMTVVAALM 216 Query: 186 TLAFFSYNYILFFFLTWFPSYLNHSLHLDIKEISIATVIPWVIGAIGMVLGGVCSDVIYR 245 + FF + + + H D I I+ G + + + + + Sbjct: 217 AV-FFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLA---AFGILHSLAQAMITGPVAA 272 Query: 246 ITGNALLSRRLILGVCLAGAAVCVAVSGTVSTIGSAITLMSVSLFLLYLTGPIYWAVIQD 305 L R L + + + + A +M V L + P A++ Sbjct: 273 -----RLGERRALMLGMIADGTGYILLAFATRGWMAFPIM-VLLASGGIGMPALQAMLSR 326 Query: 306 VVHKDKVGSVGGAMHGLANISGIIGPLVTGFIVQFS-GKYDYAFYLAGAI 354 V +++ G + G++ L +++ I+GPL+ I S ++ ++AGA Sbjct: 327 QVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAA 376 Score = 32.9 bits (75), Expect = 0.002 Identities = 31/121 (25%), Positives = 49/121 (40%), Gaps = 13/121 (10%) Query: 252 LSRRLILGVCLAGAAVCVAVSGTVST-----IGSAITLMSVSLFLLYLTGPIYWAVIQDV 306 RR +L V LAGAAV A+ T IG + ++ + TG + A I D+ Sbjct: 70 FGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA------TGAVAGAYIADI 123 Query: 307 VHKDKVGSVGGAMHGLANISGIIGPLVTGFIVQFSGKYDYAFYLAGAIAIVSSLLVFVFV 366 D+ G M + GP++ G + FS F+ A A+ ++ L + Sbjct: 124 TDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHA--PFFAAAALNGLNFLTGCFLL 181 Query: 367 K 367 Sbjct: 182 P 182
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 470 bits (1210), Expect = e-168 Identities = 235/386 (60%), Positives = 275/386 (71%), Gaps = 20/386 (5%) Query: 3 KVVVLSAVAAAVMMAGAANAAEIYNKDGNKLDLYGKVDGLHYFSSNHSTDGDQSYIRMGI 62 K VL+ V A++ AGAA+AAEIYNKDGNKLDLYGKVDGLHYFS + S DGDQ+Y+R+G Sbjct: 2 KRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVGF 61 Query: 63 KGETQITDQLTGFGQWEYQVNANRPEDGDSSGSPQSWTRLGFAGLAFADMGSVDYGRNYG 122 KGETQI DQLTG+GQWEY V AN E SWTRL FAGL F D GS DYGRNYG Sbjct: 62 KGETQINDQLTGYGQWEYNVQANTTE----GEGANSWTRLAFAGLKFGDYGSFDYGRNYG 117 Query: 123 VLYDIGSWTDVLPEFGNDSYEASDNFMTGRANGVLTYRNNDFFGLVDGLNIALQYQGKND 182 VLYD+ WTD+LPEFG DSY +DN+MTGRANGV TYRN DFFGLVDGLN ALQYQGKN+ Sbjct: 118 VLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNE 177 Query: 183 GLSKEGDPLSNNAR---KSIAYQNGDGFGASATYDLGMGVSLGAAYTSSKRTLDQMTQDK 239 S + + N R I Y NGDGFG S TYD+GMG S GAAYT+S RT +Q+ Sbjct: 178 SQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGG 237 Query: 240 YD-NGDRAEAWTGGVKYDANNIYLAANYTRTYDMTYMGDTL----GGFAHKTDNWEMVGQ 294 GD+A+AWT G+KYDANNIYLA Y+ T +MT G T GG A+KT N+E+ Q Sbjct: 238 TIAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQ 297 Query: 295 YQFDNGLRPSLAFLQSRANDVD----GLGSFDLVKYIDVGSYYYFNKNMSAYVDYKINLL 350 YQFD GLRP+++FL S+ D+ DLVKY DVG+ YYFNKN S YVDYKINLL Sbjct: 298 YQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLL 357 Query: 351 KDGNP----SNPNTDNTVALGLVYEF 372 D +P + +TD+ VALG+VY+F Sbjct: 358 DDDDPFYKDAGISTDDIVALGMVYQF 383
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 30.5 bits (69), Expect = 0.008 Identities = 12/51 (23%), Positives = 21/51 (41%), Gaps = 1/51 (1%) Query: 22 GQGKVADYIPALASVEGSKLGI-AICTVDGQHYQAGDAHERFSIQSISKVL 71 + + I S ++G+ + G+ A A ERF + S KV+ Sbjct: 21 ASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVV 71
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 57.2 bits (138), Expect = 5e-11 Identities = 44/192 (22%), Positives = 85/192 (44%), Gaps = 8/192 (4%) Query: 36 LSDIAESFHMQTAQVGIMLTIYAWVVAVMSLPFMLLTSQMERRKLLICLFVLFIASHVLS 95 L DIA F+ A + T + ++ + + L+ Q+ ++LL+ ++ V+ Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 96 FLAWN-FTVLVISRIGIAFAHAIFWSITASLAIRLAPAGKRAQALSLIATGTALAMVLGL 154 F+ + F++L+++R A F ++ + R P R +A LI + A+ +G Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 155 PIGRVVGQYFGWRTTFFAIGMGALITLLCLIKLLPKLPSEHSGSLKSLPLLFRRPALMSL 214 IG ++ Y W + I M +IT+ L+KLL K + LMS+ Sbjct: 157 AIGGMIAHYIHW-SYLLLIPMITIITVPFLMKLLKK------EVRIKGHFDIKGIILMSV 209 Query: 215 YVLTVVVVTAHY 226 ++ ++ T Y Sbjct: 210 GIVFFMLFTTSY 221
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 39.8 bits (93), Expect = 1e-05 Identities = 38/199 (19%), Positives = 67/199 (33%), Gaps = 11/199 (5%) Query: 11 LGVDLIGYALTSALTIGVVFSLGFGILADKFDKKRYMLLAIIAFACGFIAIPMVHNVVLV 70 G+ L YAL + G L+D+F ++ +L+++ A + AI + V Sbjct: 45 YGILLALYALMQ-----FACAPVLGALSDRFGRRPVLLVSLAGAAVDY-AIMATAPFLWV 98 Query: 71 VLLFALINCAYSVFSTVLKAWFADNLTATTKTRIFSLNYTVLNIGWTVGPPLGTLLVMQS 130 + + ++ V A+ AD + R F G GP LG L+ S Sbjct: 99 LYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFS 158 Query: 131 INLPFWLAAICSAFPLVFIQVWVTRSVAASE-GKNAAIWSPSVLLRDKALL----WFTLS 185 + PF+ AA + + + S +P R + Sbjct: 159 PHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAV 218 Query: 186 AFLASFVGGAFASCISQYV 204 F+ VG A+ + Sbjct: 219 FFIMQLVGQVPAALWVIFG 237
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 76.1 bits (187), Expect = 3e-17 Identities = 48/194 (24%), Positives = 84/194 (43%), Gaps = 3/194 (1%) Query: 8 LVWLAGLSVLGFLATDMYLPAFAAIQADLQTPAAAVSASLSLFLAGFAVAQLLWGPLSDR 67 L+WL LS L + + I D P A+ + + F+ F++ ++G LSD+ Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75 Query: 68 YGRKPILLLGLSIFALGSLGMLWVESAAALLTL-RFVQAVGVCAATVIWQALVTDYYPSQ 126 G K +LL G+ I GS+ S +LL + RF+Q G A + +V Y P + Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135 Query: 127 KINRIFATIMPLVGLSPALAPLLGSWILTHFSWQAIFATLFVITLLLMLPALRLKPSVKA 186 + F I +V + + P +G I + W + L + ++ +P L + Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEV 193 Query: 187 RTEGQDKLTFATLL 200 R +G + L+ Sbjct: 194 RIKGHFDIKGIILM 207
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 387 bits (995), Expect = e-136 Identities = 126/350 (36%), Positives = 204/350 (58%), Gaps = 4/350 (1%) Query: 2 SEKTEQPTEKKLRDGRKEGQVVKSIEITSLFQLIALYLYFHFFTEKMILILIESITFTLQ 61 EKTEQPT KK+RD RK+GQV KS E+ S ++AL ++ + + + Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62 Query: 62 LVNKPFSYALTQL-SHALIESLTSALLFLGAGVIVATVGSVFLQVGVVIASKAIGFKSEH 120 PFS AL+ + + L+E L ++A + S +Q G +I+ +AI + Sbjct: 63 QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMA-IASHVVQYGFLISGEAIKPDIKK 121 Query: 121 INPVSNFKQIFSLHSVVELCKSSLKVIMLSLIFAFFFYYYASTFRALPYCGLACGVLVVS 180 INP+ K+IFS+ S+VE KS LKV++LS++ T LP CG+ C ++ Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLG 181 Query: 181 SLIKWLWVGVMVFYIVVGILDYSFQYYKIRKDLKMSKDDVKQEHKDLEGDPQMKTRRREM 240 +++ L V V ++V+ I DY+F+YY+ K+LKMSKD++K+E+K++EG P++K++RR+ Sbjct: 182 QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQF 241 Query: 241 QSEIQSGSLAQSVKQSVAVVRNPTHIAVCLGYHPTDMPIPRVLEKGSDAQANYIVNIAER 300 EIQS ++ ++VK+S VV NPTHIA+ + Y + P+P V K +DAQ + IAE Sbjct: 242 HQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEE 301 Query: 301 NCIPVVENVELARSLFFEVERGDKIPETLFEPVAALLRMVMK--IDYAHS 348 +P+++ + LAR+L+++ IP E A +LR + + I+ HS Sbjct: 302 EGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQHS 351
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 152 bits (387), Expect = 3e-48 Identities = 46/192 (23%), Positives = 84/192 (43%), Gaps = 4/192 (2%) Query: 1 MSLTFPILPIIYQQKIMMHIGKDYSWLGLVTGEVIIGFLIGFCAAVPFWAVDMAGFLLDT 60 M +TF I P + + + + L L +++IG +GF F AV AG ++ Sbjct: 48 MMITFAIAPSLPANDVPVF---SFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGL 104 Query: 61 LRGATMGTIFNSTIEAETSLFGLLFSQFLCVIFFISGGMEFILNILYESYQYLPPGRTLL 120 G + T + + + ++F G +++++L +++ LP G L Sbjct: 105 QMGLSFATFVDPASHLNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPL 164 Query: 121 FDRQFLKYIQAEWRTLYQLCISFSLPAIICMVLADLALGLLNRSAQQLNVFFFSMPLKSI 180 FL +A ++ + +LP I ++ +LALGLLNR A QL++F PL Sbjct: 165 NSNAFLALTKAGSL-IFLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLT 223 Query: 181 LVLLTLLISFPY 192 + + + P Sbjct: 224 VGISLMAALMPL 235
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 72.5 bits (178), Expect = 9e-21 Identities = 30/85 (35%), Positives = 50/85 (58%) Query: 4 SELTQFVTQLLWIVLFTSMPVVLVASVVGVIVSLVQALTQIQDQTLQFMIKLLAIAITLM 63 +L + L++VL S +VA+++G++V L Q +TQ+Q+QTL F IKLL + + L Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61 Query: 64 VSYPWLSGILLNYTRQIMLRIGEHG 88 + W +LL+Y RQ++ G Sbjct: 62 LLSGWYGEVLLSYGRQVIFLALAKG 86
>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature. Length = 224 Score = 231 bits (592), Expect = 9e-80 Identities = 79/215 (36%), Positives = 130/215 (60%), Gaps = 8/215 (3%) Query: 8 LQLIGILFLLSILPLIIVMGTSFLKLAVVFSILRNALGIQQVPPNIALYGLALVLSLFIM 67 + LI +L ++LP II GT F+K ++VF ++RNALG+QQ+P N+ L G+AL+LS+F+M Sbjct: 5 ISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMFVM 64 Query: 68 GPTLLAVKERWHPVQVAGAPFWT-SEWDSKALAPYRQFLQKNSEEKEANYFRNLIKRTWP 126 P + + V + S+ + L YR +L K S+ + +F N + Sbjct: 65 WPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQLKRQY 124 Query: 127 ED-------IKRKIKPDSLLILIPAFTVSQLTQAFRIGLLIYLPFLAIDLLISNILLAMG 179 + K +I+ S+ L+PA+ +S++ AF+IG +YLPF+ +DL++S++LLA+G Sbjct: 125 GEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVLLALG 184 Query: 180 MMMVSPMTISLPFKLLIFLLAGGWDLTLAQLVQSF 214 MMM+SP+TIS P KL++F+ GW L L+ + Sbjct: 185 MMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 51.1 bits (122), Expect = 3e-10 Identities = 21/67 (31%), Positives = 38/67 (56%) Query: 247 LEQIPQQVLFEIGRASLEIGQLRQLKTGDVLPVGGCFAPEVTIRVNDRIIGQGELIACGN 306 + IP ++ E+GR + I +L +L G V+ + G + I +N +I QGE++ + Sbjct: 57 IMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVAD 116 Query: 307 EFMVRIT 313 ++ VRIT Sbjct: 117 KYGVRIT 123
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 51.9 bits (124), Expect = 5e-10 Identities = 28/181 (15%), Positives = 65/181 (35%), Gaps = 11/181 (6%) Query: 3 LYRSLPEDEANQMLALLMQHHIDAEKKQEEDGVTLRVEQSQFINAVELLRLNGYPHRQFT 62 L+ +L + + ++A L Q +I + V + L G P + Sbjct: 53 LFSNLSDQDGGAIVAQLTQMNIPYRFA--NGSGAIEVPADKVHELRLRLAQQGLP-KGGA 109 Query: 63 TADKMFPANQLVVSPQEEQQKINFLKEQRIEGMLSQMEGVINAKVTIALPTYDEGS---- 118 ++ + +S EQ E + + + V +A+V +A+P + S Sbjct: 110 VGFELLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMP---KPSLFVR 166 Query: 119 NASPSSVAVFIKYSPQVNMEAFRVK-IKDLIEMSIPGLQYSKISILMQPAEFRMVPDVPA 177 S +V + P ++ ++ + L+ ++ GL ++++ Q + Sbjct: 167 EQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSG 226 Query: 178 R 178 R Sbjct: 227 R 227
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 79.2 bits (195), Expect = 1e-21 Identities = 26/127 (20%), Positives = 49/127 (38%) Query: 16 LKQLLSVDPETVYASGYASWQEGDYSRAVIDFSWLVMAQPWSWRAHIALAGTWMMLKEYT 75 L ++ S E +Y+ + +Q G Y A F L + + R + L + +Y Sbjct: 28 LNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYD 87 Query: 76 TAINFYGHALMLDASHPEPVYQTGVCLKMMGEPGLAREAFQTAIKMSYADASWSEIRQNA 135 AI+ Y + ++D P + CL GE A A ++ + E+ Sbjct: 88 LAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRV 147 Query: 136 QIMVDTL 142 M++ + Sbjct: 148 SSMLEAI 154
>PF05844#YopD protein Length = 295 Score = 30.4 bits (68), Expect = 0.004 Identities = 29/149 (19%), Positives = 48/149 (32%), Gaps = 19/149 (12%) Query: 9 VLPAPSL-LTPSSTPSPSGEGMGTESMLLLFDDIWTKLMELAKKLRDIMRSYNVVKQRLG 67 L AP L P + E + +LL+ I K EL RD + Q+ Sbjct: 50 ELNAPRQVLDPVRMEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQAIIHAQK-- 107 Query: 68 WELQVNVLQTQMKTIDEAFRASMITAGGAILSGVLTIGLGAVGGETGLIAGQAVGHTAGG 127 +DE + + A+++GV + VG L G+A+ Sbjct: 108 ------------AQVDEMRSGATLMIAMAVIAGVGALASAVVGSLGALKNGKAISQEK-- 153 Query: 128 VMGLGAGVAQRQSDQDKAIADLQQNGAQS 156 L + R D + L + + Sbjct: 154 --TLQKNIDGRNELIDAKMQALGKTSDED 180
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 87.3 bits (216), Expect = 9e-25 Identities = 38/144 (26%), Positives = 65/144 (45%), Gaps = 7/144 (4%) Query: 3 FFRRGGSLRMLL---DDDVTQPLNTLYRYAMQLMEVKEFAGAARLFQLLTIYDAWSFDYW 59 F + GG++ ML D L LY A + ++ A ++FQ L + D + ++ Sbjct: 18 FLKGGGTIAMLNEISSDT----LEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFF 73 Query: 60 FRLGECCQAQKHWGEAIYAYGRAAQIKIDAPQAPWAAAECYLACDNVCYAIKALKAVVRI 119 LG C QA + AI++Y A + I P+ P+ AAEC L + A L + Sbjct: 74 LGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133 Query: 120 CGEVSEHQILRQRAEKILQQLSDR 143 + +E + L R +L+ + + Sbjct: 134 IADKTEFKELSTRVSSMLEAIKLK 157
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 582 bits (1501), Expect = 0.0 Identities = 157/500 (31%), Positives = 261/500 (52%), Gaps = 15/500 (3%) Query: 11 LLFILNTAKSDELSWKGNDFTLYARQMPLAEVLHLLSENYDTAITISPLITATFSGKIPP 70 LL + + + + EL W + A+ L ++L NYD + +S I SG+ Sbjct: 17 LLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKINDKVSGQFEH 76 Query: 71 GPPVDILNNLAAQYDLLTWFDGSMLYVYPASLLKHQVITFNILSTGRFIHYLRSQNILSS 130 P D L ++A+ Y+L+ ++DG++LY++ S + ++I L+ I Sbjct: 77 DNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQALQRSGIWE- 135 Query: 131 PGCEVKEITGTKAVEVSGVPSCLTRISQLASVLDNALIKR--KDSAVSVSIYTLKYATAM 188 P + + V VSG P L + Q A+ L+ R K A+++ I+ LKYA+A Sbjct: 136 PRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEIFPLKYASAS 195 Query: 189 DTQYQYRDQSVVVPGVVSVL-REMSKTSVPVSSTNN-----GSPATQALPMFAADPRQNA 242 D YRD V PGV ++L R +S ++ + +N + A ADP NA Sbjct: 196 DRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIPQAATRASAQARVEADPSLNA 255 Query: 243 VIVRDYAANMAGYRKLITELDQRQQMIEISVKIIDVNAGDINQLGIDWGTAVSLGG---- 298 +IVRD M Y++LI LD+ IE+++ I+D+NA + +LG+DW + G Sbjct: 256 IIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDINADQLTELGVDWRVGIRTGNNHQV 315 Query: 299 --KKIAFNTGLNDGGASGFSTVISDTSNFMVRLNALEKSSQAYVLSQPSVVTLNNIQAVL 356 K + + GA G + R+N LE A V+S+P+++T N QAV+ Sbjct: 316 VIKTTGDQSNIASNGALGSLVDARGLDYLLARVNLLENEGSAQVVSRPTLLTQENAQAVI 375 Query: 357 DKNITFYTKLQGEKVAKLESITTGSLLRVTPRLLNDNGTQKIMLNLNIQDGQQSDTQSET 416 D + T+Y K+ G++VA+L+ IT G++LR+TPR+L +I LNL+I+DG Q S Sbjct: 376 DHSETYYVKVTGKEVAELKGITYGTMLRMTPRVLTQGDKSEISLNLHIEDGNQKPNSSGI 435 Query: 417 DPLPEVQNSEIASQATLLAGQSLLLGGFKQGKQIHSQNKIPLLGDIPVVGHLFRNDTTQV 476 + +P + + + + A + GQSL++GG + + + +K+PLLGDIP +G LFR + Sbjct: 436 EGIPTISRTVVDTVARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGALFRRKSELT 495 Query: 477 HSVIRLFLIKASVVNNGISH 496 +RLF+I+ +++ GI+H Sbjct: 496 RRTVRLFIIEPRIIDEGIAH 515
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 69.1 bits (169), Expect = 3e-14 Identities = 30/156 (19%), Positives = 57/156 (36%), Gaps = 13/156 (8%) Query: 691 ILLVDDADINRDIIGKMLVSLGQHVTIAASSNEALTLSQQQRFDLVLIDIRMPEIDGIEC 750 IL+ DD R ++ + L G V I +++ DLV+ D+ MP+ + + Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 751 VQLWHDEPNNLDPDCMFVALSASVAAEDIHRCKKNGIHHYITKPVTLATLARYISIAAEY 810 + PD + +SA + + G + Y+ KP L L Sbjct: 66 LP----RIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIG-------- 113 Query: 811 QLLRNIELQEQDPSRCSALLATDDVVI-NSKIFQSL 845 + R + ++ PS+ ++ S Q + Sbjct: 114 IIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEI 149
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 66.0 bits (161), Expect = 6e-15 Identities = 28/119 (23%), Positives = 50/119 (42%), Gaps = 2/119 (1%) Query: 1 MKEYKILLVDDHEIIINGIMNALLPWPHFKIVEHVKNGLEVYNACCAYEPDILILDLSLP 60 M IL+ DD I + AL + V N ++ A + D+++ D+ +P Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 GINGLDIIPQLHQRWPAMNILVYTAYQQEYMTIKTLAAGANGYVLKSSSQQVLLAALQT 119 N D++P++ + P + +LV +A IK GA Y+ K L+ + Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 84.5 bits (209), Expect = 2e-21 Identities = 31/127 (24%), Positives = 56/127 (44%) Query: 2 ATIHLLDDDTAVTNACAFLLESLGYDVKCWTQGADFLAQASLYQAGVVLLDMRMPVLDGQ 61 ATI + DDD A+ L GYDV+ + A + +V+ D+ MP + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 GVHDALRQCGSTLAVVFLTGHGDVPMAVEQMKRGAVDFLQKPVSVKPLQAALERALTVSS 121 + +++ L V+ ++ A++ ++GA D+L KP + L + RAL Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 122 AAVARRE 128 ++ E Sbjct: 124 RRPSKLE 130
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 31.3 bits (71), Expect = 0.006 Identities = 71/392 (18%), Positives = 134/392 (34%), Gaps = 28/392 (7%) Query: 8 TAVGLYFNYFVHGMGVILMSLNMSSLEQQWHTSAAGVSIVISSLGIGRLSVLLIA---GM 64 + + + +G+ L+ + L + S + L + L A G Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65 Query: 65 LSDRFGRRPFIILGTACYLIFFIGILYAQTIFVAYACGFLAGMANSFLDAGTYPSLMEAF 124 LSDRFGRRP +++ A + + + A ++V Y +AG+ + A + + Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATG-AVAGAYIADIT 124 Query: 125 PRSPSTANI-LIKAFVSGGQFLLPIIISLLVWANMWFGWSFLLAGAIMLINAL---FLLR 180 + + A G P++ L+ F A A+ +N L FLL Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLM--GGFSPHAPFFAAAALNGLNFLTGCFLLP 182 Query: 181 CPFP----PYPGRILKPKISQAPVTGVHHCSLIDLISYT--LYGYISMATFYLISQWLAQ 234 P L P S G+ + + + + L G + A + + + Sbjct: 183 ESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFH 242 Query: 235 YGQFVAGMSYTQSIKLLSIYTCGSLLCVFITAPLVRKTIRSTTLLMFYTFISFIALLTVC 294 + G+S + SL IT P+ + + LM + + Sbjct: 243 WDATTIGISLA------AFGILHSLAQAMITGPVAAR-LGERRALMLGMIADGTGYILLA 295 Query: 295 LHPQAYVVMIFAFVIGFSSAGGVVQIGLTLMAARF--PQEKGKATGIYYSAGSIATFTIP 352 + ++ ++ GG+ L M +R + +G+ G + S+ + P Sbjct: 296 FATRGWMAFPIMVLLAS---GGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGP 352 Query: 353 LITARISEMSIAHIMWFDTGIAAAGFLLALFI 384 L+ I SI + AA +LL L Sbjct: 353 LLFTAIYAASITTWNGWAWIAGAALYLLCLPA 384
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 34.5 bits (79), Expect = 8e-04 Identities = 32/166 (19%), Positives = 71/166 (42%), Gaps = 7/166 (4%) Query: 23 FLHGMSVITLAQNMTSLAQKFSTDSAGIAYLISGIGLGRLVSILFFGVLSDKFGRRAIIL 82 F ++ + L ++ +A F+ A ++ + L + +G LSD+ G + ++L Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 83 LGAVLYML----FFFGIPASPNLMIAFILAVCVGVANSALDTGGYPALMECFPKASGSAV 138 G ++ F G L++A + A AL + G A Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARY--IPKENRGKAF 141 Query: 139 ILVKAMVSFGQMIYPLIVSALLVNHIWYGYAVVIPGILFVLITLML 184 L+ ++V+ G+ + P I ++ ++I + Y ++IP I + + ++ Sbjct: 142 GLIGSIVAMGEGVGPAI-GGMIAHYIHWSYLLLIPMITIITVPFLM 186
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 28.5 bits (63), Expect = 0.012 Identities = 17/59 (28%), Positives = 25/59 (42%) Query: 49 QGLTVGIIILTIGVMAPIASGTLPPSTLIHSFVNWKSLVAIAVGVFVSWLGGRGITLMG 107 Q + L IG + + LPPS ++ N ++ A VS LG +TL G Sbjct: 174 QRSAIVDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTLDG 232
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 29.2 bits (65), Expect = 0.030 Identities = 15/54 (27%), Positives = 23/54 (42%), Gaps = 8/54 (14%) Query: 95 LRSLPSAALLFAGAAIIGCGIALG--------NVLLPGLIKRDFSQHVARLTGA 140 LR+ P L+ AG+A + +A+ L L +D VA+LT Sbjct: 19 LRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQM 72
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 28.4 bits (63), Expect = 0.002 Identities = 8/37 (21%), Positives = 17/37 (45%), Gaps = 5/37 (13%) Query: 4 LSWIIFGLIAGILAKWIMPG-----KDGGGFFMTIIL 35 + I+ G I+G++ W+ K ++ I+L Sbjct: 163 AAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILL 199
>PF07201#Hypersensitivity response secretion protein HrpJ Length = 293 Score = 26.3 bits (58), Expect = 0.003 Identities = 9/34 (26%), Positives = 13/34 (38%) Query: 7 LRILPGSLNKAKHLNAQQRQFRQFELFFKNRINH 40 L I+ L K K + Q + F FF + Sbjct: 255 LGIVISDLQKLKEFGSVSDQVKGFWQFFSEGKTN 288
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 197 bits (503), Expect = 2e-67 Identities = 67/187 (35%), Positives = 94/187 (50%), Gaps = 18/187 (9%) Query: 1 MKNIILSTLVITTSVLVVNVAQADTNAFSVGYAQSKVQDFKN-IRGVNVKYRYE-DDSPV 58 MK I + + + A T+ + GYAQS Q N + G N+KYRYE D+SP+ Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAATSTVTGGYAQSDAQGQMNKMGGFNLKYRYEEDNSPL 60 Query: 59 SFISSLSYLYGDRQASGSVEPEGIHYHDKFEVKYGSLMVGPAYRLSDNFSLYALAGVGTV 118 I S +Y R AS D + +Y + GPAYR++D S+Y + GVG Sbjct: 61 GVIGSFTYTEKSRTASSG---------DYNKNQYYGITAGPAYRINDWASIYGVVGVGYG 111 Query: 119 KATFKEHSTQDGDSFSNKISSRKTGFAWGAGVQMNPLENIVVDVGYEGSNISSTKINGFN 178 K E+ T D+ GF++GAG+Q NP+EN+ +D YE S I S + + Sbjct: 112 KFQTTEYPTYKHDT-------SDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWI 164 Query: 179 VGVGYRF 185 GVGYRF Sbjct: 165 AGVGYRF 171
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 27.3 bits (60), Expect = 0.031 Identities = 11/41 (26%), Positives = 15/41 (36%) Query: 83 TRTHQSNCNTRSQTHSSSTSKTRSSSVGFSVGGPVGASIGL 123 QS NT SQT + S + + S + V G Sbjct: 302 KNEDQSTQNTDSQTRTISKNTSTSRTHTSEVHGNAEVHASF 342
>cloacin#Cloacin signature. Length = 551 Score = 31.2 bits (70), Expect = 0.022 Identities = 44/200 (22%), Positives = 75/200 (37%), Gaps = 19/200 (9%) Query: 437 TPEAVEQDTTEHHPDPQPLENEPPVSQTEAGYQKIRAELHEARKNIP------------- 483 +P+ V+Q E + Q + PV E Y++ RAEL++A +++ Sbjct: 292 SPDQVKQRQDEENRRQQEWDATHPVEAAERNYERARAELNQANEDVARNQERQAKAVQVY 351 Query: 484 --PKNPVDVG-KQLAAARGEYVEGISDPNDP--KWVHNNYSASNQGEKEEVVPEEKQPAA 538 K+ +D K LA A E + +DP A + ++ + KQ A Sbjct: 352 NSRKSELDAANKTLADAIAEIKQFNRFAHDPMAGGHRMWQMAGLKAQRAQTDVNNKQAAF 411 Query: 539 EPEAVTRNADGTFDVSALFSAPSNQTEKTEARTERDGETPKESNQQETAG-DTGQEITTD 597 + A ++ SA+ S + +K A + E K + G D T+ Sbjct: 412 DAAAKEKSDADAALSSAMESRKKKEDKKRSAENNLNDEKNKPRKGFKDYGHDYHPAPKTE 471 Query: 598 GGSGTGGDEAGEAADPVENG 617 G G + G P +NG Sbjct: 472 NIKGLGDLKPGIPKTPKQNG 491
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.0 bits (67), Expect = 0.021 Identities = 8/22 (36%), Positives = 14/22 (63%) Query: 46 LTLLGPSGCGKTTVLRLIAGLE 67 + L G G GK+T++ + GL+ Sbjct: 599 VVLEGTGGIGKSTLINTLVGLD 620
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 57.0 bits (137), Expect = 4e-10 Identities = 45/263 (17%), Positives = 85/263 (32%), Gaps = 34/263 (12%) Query: 513 PSEEEYAERKRPEQPALATFAMPDVPPAPTPVEPAVSVATAKKDNVAVAQPAQPGLFSRF 572 P E+ + DVP P+ + A+ D V PA Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSN-----NEEIARVDEAPVPPPAPA------ 1031 Query: 573 LNALKQLFSGEETKAVETAAPKAEEKAERQQDRRKPRQNNRRDRNERRDTRDNR----AG 628 + E +K K E+ A QN + + + + N Sbjct: 1032 TPSETTETVAENSKQESKTVEKNEQDATE-----TTAQNREVAKEAKSNVKANTQTNEVA 1086 Query: 629 RDGGESRDDNRRNRRQTQQQNAEAR---DTRQQETAEKVKTGDEQQQTPRRERSRRRNDD 685 + G E+++ ++T E + +T + + KV + Q +P++E+S Sbjct: 1087 QSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTS----QVSPKQEQSETVQPQ 1142 Query: 686 KRQAQQEVKALNREEQPVQETEQEERVQQVQPRRKQRQLNQKVRFTNSAVVETVDTPVVV 745 A++ +N +E Q + QP ++ N + T S V T ++ V Sbjct: 1143 AEPARENDPTVNIKEPQSQTNTTAD---TEQPAKETSS-NVEQPVTESTTVNTGNSVVEN 1198 Query: 746 DEPRPVENVEQPVPAPRTELAKV 768 E + P +E + Sbjct: 1199 PENTTPATTQ---PTVNSESSNK 1218 Score = 35.4 bits (81), Expect = 0.001 Identities = 50/289 (17%), Positives = 93/289 (32%), Gaps = 32/289 (11%) Query: 718 RRKQRQLNQKVRFTNSAVVETVDTPVVVDEPRPVENVEQPVPAPRT---ELAKVDLPVVA 774 + K R +N + N V E + V N++ VP+ + E+A+VD V Sbjct: 968 KYKLRNVNGRYDLYNPEV-EKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVP 1026 Query: 775 DIAP----EQDDSVEPRDNTGMPRRSRRSPRHLRVSGQRRRRYRDERYPTQ-SPMPLTVA 829 AP E ++V + + Q R ++ + + + VA Sbjct: 1027 PPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVA 1086 Query: 830 CASPEMASGKVWIRYPIVRPQETQVVEEQREADLALPQPVVAEQQVIAATVALEPQASVQ 889 + E + +ET VE++ +A V E+ V Q S + Sbjct: 1087 QSGSETKETQT------TETKETATVEKEEKAK------VETEKTQEVPKVT--SQVSPK 1132 Query: 890 AVENVAVEPQTVAEPQTSEVVEVETTHPEVIAAPVDEQP---------QLIAESDTPVAQ 940 ++ V+PQ + V ++ + EQP Q + ES T Sbjct: 1133 QEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTG 1192 Query: 941 EVIADAEPVAETADASITVAEDVADVVVVEPEEETKAEAAVVEHTAEET 989 + + A TV + ++ ++ VE + Sbjct: 1193 NSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSS 1241
>FLAGELLIN#Flagellin signature. Length = 507 Score = 41.2 bits (96), Expect = 4e-06 Identities = 30/138 (21%), Positives = 59/138 (42%) Query: 1 MRISTQMMYEQNMSGITNSQAEWMKLGEQMSTGKRVTNPSDDPIAASQAVVLSQAQAQNS 60 I+T + + + SQ+ E++S+G R+ + DD + A + + Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61 Query: 61 QYALARTFATQKVSLEESVLSQVTTAIQTAQEKIVYAGNGTLSDDDRASLATDLQGIRDQ 120 Q + E L+++ +Q +E V A NGT SD D S+ ++Q ++ Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121 Query: 121 LMNLANSTDGNGRYIFAG 138 + ++N T NG + + Sbjct: 122 IDRVSNQTQFNGVKVLSQ 139
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 662 bits (1710), Expect = 0.0 Identities = 437/553 (79%), Positives = 487/553 (88%), Gaps = 8/553 (1%) Query: 2 SSLINHAMSGLNAAQAALNTVSNNINNYNVAGYTRQTTILAQANSTLGAGGWIGNGVYVS 61 SSLIN+AMSGLNAAQAALNT SNNI++YNVAGYTRQTTI+AQANSTLGAGGW+GNGVYVS Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60 Query: 62 GVQREYDAFITNQLRGAQNQSSGLTTRYEQMSKIDNLLADKSSSLSGSLQSFFTSLQTLV 121 GVQREYDAFITNQLR AQ QSSGLT RYEQMSKIDN+L+ +SSL+ +Q FFTSLQTLV Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120 Query: 122 SNAEDPAARQALIGKAEGLVNQFKTTDQYLRDQDKQVNIAIGSSVAQINNYAKQIANLND 181 SNAEDPAARQALIGK+EGLVNQFKTTDQYLRDQDKQVNIAIG+SV QINNYAKQIA+LND Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180 Query: 182 QISRMTGVGAGASPNDLLDQRDQLVSELNKIVGVEVSVQDGGTYNLTMANGYTLVQGSTA 241 QISR+TGVGAGASPN+LLDQRDQLVSELN+IVGVEVSVQDGGTYN+TMANGY+LVQGSTA Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240 Query: 242 RQLAAVPSSADPTRTTVAYVDEAAGNIEIPEKLLNTGSLGGLLTFRSQDLDKTRNTLGQL 301 RQLAAVPSSADP+RTTVAYVD AGNIEIPEKLLNTGSLGG+LTFRSQDLD+TRNTLGQL Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300 Query: 302 ALAFADAFNAQHTKGYDADGNKGKDFFSIGSPVVYSNSNNADKTVSLTAKVVDSTKVQAT 361 ALAFA+AFN QH G+DA+G+ G+DFF+IG P V N+ N V++ A V D++ V AT Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGD-VAIGATVTDASAVLAT 359 Query: 362 DYKIVFDGTDWQVTRTADNTTFTATKDADGKLEIDGLKVTVGTGAQKNDSFLLKPVSNAI 421 DYKI FD WQVTR A NTTFT T DA+GK+ DGL++T NDSF LKPVS+AI Sbjct: 360 DYKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAI 419 Query: 422 VDMNVKVTNEAEIAMASESKLDPDVDTGDSDNRNGQALLDLQ-NSNVVGGNKTFNDAYAT 480 V+M+V +T+EA+IAMASE D GDSDNRNGQALLDLQ NS VGG K+FNDAYA+ Sbjct: 420 VNMDVLITDEAKIAMASEE------DAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYAS 473 Query: 481 LVSDVGNKTSTLKTSSTTQANVVKQLYKQQQSVSGVNLDEEYGNLQRYQQYYLANAQVLQ 540 LVSD+GNKT+TLKTSS TQ NVV QL QQQS+SGVNLDEEYGNLQR+QQYYLANAQVLQ Sbjct: 474 LVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQ 533 Query: 541 TANALFDALLNIR 553 TANA+FDAL+NIR Sbjct: 534 TANAIFDALINIR 546
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 499 bits (1285), Expect = 0.0 Identities = 263/316 (83%), Positives = 289/316 (91%), Gaps = 3/316 (0%) Query: 1 MIGDGKLLASAAWDAQSLNELKAKAGQDPAANIRPVARQVEGMFVQMMLKSMREALPKDG 60 MI D KLLASAAWDAQSLNELKAKAG+DPAANIRPVARQVEGMFVQMMLKSMR+ALPKDG Sbjct: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60 Query: 61 LFSSDQTRLYTSMYDQQIAQQMTAGKGLGLADMMVKQMTGGQTMPADDAPQVPLKFSLET 120 LFSS+ TRLYTSMYDQQIAQQMTAGKGLGLA+MMVKQMT Q +P + P P+KF LET Sbjct: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120 Query: 121 VNSYQNQALTQLVRKAIPKTPDSSDAPLSGDSKDFLARLSLPARLASEQSGVPHHLILAQ 180 V YQNQAL+QLV+KA+P+ D S L GDSK FLA+LSLPA+LAS+QSGVPHHLILAQ Sbjct: 121 VVRYQNQALSQLVQKAVPRNYDDS---LPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQ 177 Query: 181 AALESGWGQRQILRENGEPSYNVFGVKATASWKGPVTEITTTEYENGEAKKVKAKFRVYS 240 AALESGWGQRQI RENGEPSYN+FGVKA+ +WKGPVTEITTTEYENGEAKKVKAKFRVYS Sbjct: 178 AALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYS 237 Query: 241 SYLEALSDYVALLTRNPRYAAVTTAATAEQGAVALQNAGYATDPNYARKLTSMIQQLKAM 300 SYLEALSDYV LLTRNPRYAAVTTAA+AEQGA ALQ+AGYATDP+YARKLT+MIQQ+K++ Sbjct: 238 SYLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSI 297 Query: 301 SEKVSKTYSANLDNLF 316 S+KVSKTYS N+DNLF Sbjct: 298 SDKVSKTYSMNIDNLF 313
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 429 bits (1104), Expect = e-153 Identities = 153/362 (42%), Positives = 215/362 (59%), Gaps = 9/362 (2%) Query: 5 LAGIVLALVATLAHAERIRDLTSVQGVRENSLIGYGLVVGLDGTGDQTTQTPFTTQTLNN 64 A L+ A RI+D+ S+Q R+N LIGYGLVVGL GTGD +PFT Q++ Sbjct: 14 SALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMRA 73 Query: 65 MLSQLGITVPTGTNMQLKNVAAVMVTASYPPFARQGQTIDVVVSSMGNAKSLRGGTLLMT 124 ML LGIT G + KN+AAVMVTA+ PPFA G +DV VSS+G+A SLRGG L+MT Sbjct: 74 MLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIMT 132 Query: 125 PLKGVDSQVYALAQGNILVGGAGASAGGSSVQVNQLNGGRITNGAIIERELPTQFGAGNT 184 L G D Q+YA+AQG ++V G A +++ R+ NGAIIERELP++F Sbjct: 133 SLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSVN 192 Query: 185 INLQLNDEDFTMAQQITDAINRAR----GYGSATALDARTVQVRVPSGNSSQVRFLADIQ 240 + LQL + DF+ A ++ D +N G A D++ + V+ P + R +A+I+ Sbjct: 193 LVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV-ADLTRLMAEIE 251 Query: 241 NMEVNVTPQDAKVVINSRTGSVVMNREVTLDSCAVAQGNLSVTVNRQLNVNQPNTPFGGG 300 N+ V T AKVVIN RTG++V+ +V + AV+ G L+V V V QP PF G Sbjct: 252 NLTVE-TDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-APFSRG 309 Query: 301 QTVVTPQTQIDLRQSGGSLQSVRSSANLNSVVRALNALGATPMDLMSILQSMQSAGCLRA 360 QT V PQT I Q G + ++ +L ++V LN++G +++ILQ ++SAG L+A Sbjct: 310 QTAVQPQTDIMAMQEGSKV-AIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQA 368 Query: 361 KL 362 +L Sbjct: 369 EL 370
>FLGLRINGFLGH#Flagellar L-ring protein signature. Length = 232 Score = 293 bits (752), Expect = e-104 Identities = 192/202 (95%), Positives = 200/202 (99%) Query: 1 MQGATTAQPIPGPVPVANGSIFQSAQPINYGYQPLFEDRRPRNIGDTLTIVLQENVSASK 60 +QGAT+AQP+PGP PVANGSIFQSAQPINYGYQPLFEDRRPRNIGDTLTIVLQENVSASK Sbjct: 31 VQGATSAQPVPGPTPVANGSIFQSAQPINYGYQPLFEDRRPRNIGDTLTIVLQENVSASK 90 Query: 61 SSSANASRDGKTSFGFDTVPRYLQGLFGNSRADMEASGGNSFNGKGGANASNTFSGTLTV 120 SSSANASRDGKT+FGFDTVPRYLQGLFGN+RAD+EASGGN+FNGKGGANASNTFSGTLTV Sbjct: 91 SSSANASRDGKTNFGFDTVPRYLQGLFGNARADVEASGGNTFNGKGGANASNTFSGTLTV 150 Query: 121 TVDQVLANGNLHVVGEKQIAINQGTEFIRFSGVVNPRTISGSNSVPSTQVADARIEYVGN 180 TVDQVL NGNLHVVGEKQIAINQGTEFIRFSGVVNPRTISGSN+VPSTQVADARIEYVGN Sbjct: 151 TVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRTISGSNTVPSTQVADARIEYVGN 210 Query: 181 GYINEAQNMGWLQRFFLNLSPM 202 GYINEAQNMGWLQRFFLNLSPM Sbjct: 211 GYINEAQNMGWLQRFFLNLSPM 232
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 43.8 bits (103), Expect = 4e-07 Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 14/81 (17%) Query: 3 SSLWIAKTGLDAQQTNMDVIANNLANVSTNGFKRQRAVFEDLLYQTIRQPGAQSSEQTTL 62 S + A +GL+A Q ++ +NN+++ + G+ RQ + + +TL Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47 Query: 63 PSGLQIGTGVRPVATERLHSQ 83 +G +G GV +R + Sbjct: 48 GAGGWVGNGVYVSGVQREYDA 68 Score = 41.1 bits (96), Expect = 3e-06 Identities = 11/41 (26%), Positives = 21/41 (51%) Query: 220 ETSNVNVAEELVNMIQVQRAYEINSKAVSTTDQMLQKLTQL 260 S VN+ EE N+ + Q+ Y N++ + T + + L + Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 41.1 bits (96), Expect = 6e-06 Identities = 17/48 (35%), Positives = 29/48 (60%) Query: 356 LTNGALEASNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILNTLVNLR 403 L+N S V+L +E N+ Q+ Y +NAQ ++T + I + L+N+R Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546 Score = 37.6 bits (87), Expect = 8e-05 Identities = 22/60 (36%), Positives = 31/60 (51%), Gaps = 4/60 (6%) Query: 2 SFSQAVSGLNAAATNLDVIGNNIANSATYGFKSGTASFAD----MFAGSKVGLGVKVAGI 57 + A+SGLNAA L+ NNI++ G+ T A + AG VG GV V+G+ Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62
>FbpA_PF05833#Fibronectin-binding protein Length = 577 Score = 27.9 bits (62), Expect = 0.013 Identities = 13/68 (19%), Positives = 24/68 (35%), Gaps = 7/68 (10%) Query: 31 LLNSAQAQNSYKDPAYDNDFGIEPPSALDNFTQAIQSQILGGLLTNINTGKPGRMVTNDF 90 LL S+ + + D P F ++ I + +I+ R+V Sbjct: 48 LLISSSSNYPR---IHLTDLTKPNPIKAPMFCMVLRKYISNAKIVDIHQINQDRIV---- 100 Query: 91 IIDIANRD 98 +ID + D Sbjct: 101 VIDFESTD 108
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 51.4 bits (123), Expect = 3e-09 Identities = 52/253 (20%), Positives = 91/253 (35%), Gaps = 24/253 (9%) Query: 56 AFLATAAFIGRPFGGALFGLLADKFGRKPLMMWSIVAYSVGTGLSGLASGVIMLTLSRFI 115 A A F P GAL +D+FGR+P+++ S+ +V + A + +L + R + Sbjct: 50 ALYALMQFACAPVLGAL----SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105 Query: 116 VGMGMAGEYACASTYAVESWPKHLKSKASAFLVSGFGIGNIIAAYFMPSFAEAYGWRAAF 175 G+ A + A + +++ F+ + FG G ++A + + A F Sbjct: 106 AGITGATGAVAGAYIA-DITDGDERARHFGFMSACFGFG-MVAGPVLGGLMGGFSPHAPF 163 Query: 176 FV-GLLPVLLVIYIRARAPESKEWEE--AKLSGPGKHSQSAWSVFSLSMKGLFNRA---- 228 F L L + PES + E + + W+ + L Sbjct: 164 FAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQ 223 Query: 229 ---QFPLTLCVFIVLFSIFGANWPIFGLLPTYLAGEGFDTGVVSNLMTAAAFGTVLGN-- 283 Q P L V+F +W + LA G ++ M LG Sbjct: 224 LVGQVPAAL---WVIFGEDRFHWDA-TTIGISLAAFGI-LHSLAQAMITGPVAARLGERR 278 Query: 284 -IVWGLCADRIGL 295 ++ G+ AD G Sbjct: 279 ALMLGMIADGTGY 291
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 62.3 bits (151), Expect = 4e-14 Identities = 30/158 (18%), Positives = 58/158 (36%), Gaps = 8/158 (5%) Query: 20 RQLILTAALAVFSQYGIHGARLEQVAERAGVSKTNLLYYYPSKEALYVAVMRQILDVWLA 79 RQ IL AL +FSQ G+ L ++A+ AGV++ + +++ K L+ + Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72 Query: 80 PLKAFRAEF--SPLEAIKEYIRLKLEVSRDYPQASRLF-CMEMLAGAPLLMEELTGDLKA 136 ++A+F PL ++E + LE + + L + M + + Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRN 132 Query: 137 LIDEKSALIAGWVHSG-----KLAPVSPHHLIFMIWAA 169 L E I + A + ++ Sbjct: 133 LCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGY 170
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 82.6 bits (204), Expect = 2e-20 Identities = 30/117 (25%), Positives = 56/117 (47%), Gaps = 1/117 (0%) Query: 2 KILLIEDNQKTIEWVRQGLTEAGYVVDYACDGRDGLHLALQEHYSLIILDIMLPGLDGWQ 61 IL+ +D+ + Q L+ AGY V + L++ D+++P + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VLRALRTAHQS-PVICLTARDSVEDRVKGLEAGANDYLVKPFSFAELLARVRAQLRQ 117 +L ++ A PV+ ++A+++ +K E GA DYL KPF EL+ + L + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>PF06580#Sensor histidine kinase Length = 349 Score = 33.7 bits (77), Expect = 0.001 Identities = 18/102 (17%), Positives = 38/102 (37%), Gaps = 15/102 (14%) Query: 348 ILLQRVLSNLLTNAIRYSDENAVIRIESAYDDNVAEIRVANPGSHPADADKLFRRFWRGD 407 +L+Q ++ N + + I + I ++ D+ + V N GS K Sbjct: 258 MLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE-------- 309 Query: 408 NARHTAGFGLGLSLVNA-IALLHGGSASYRYADEHNIFSVRL 448 G GL V + +L+G A + +++ + + Sbjct: 310 ------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345
>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature. Length = 538 Score = 662 bits (1710), Expect = 0.0 Identities = 185/396 (46%), Positives = 253/396 (63%), Gaps = 5/396 (1%) Query: 138 LNNQPWQTIKNTLTHNGHHYTNTQLPAAEMKIGAKDIFPSAYEGKGVCSWDTKNIHHANN 197 LNN+ W + ++H+G +Y PA+ MKIG K+IF Y GKG+C T+ H N Sbjct: 146 LNNKNWGPVNKNISHHGKNYGFQLTPASHMKIGNKNIFVKEYNGKGICCASTRESDHIAN 205 Query: 198 LWMSTVSVHEDGKDKTLFCGIRHGVLSPYH-EKDPLLRHVGAENKAKEVLTAALFSKPEL 256 +W+S V V ++GK+ +F GIRHGV+S Y +K+ R V A NKA+E+++AAL+S+PEL Sbjct: 206 MWLSKV-VDDEGKE--IFSGIRHGVISAYGLKKNSSERAVAARNKAEELVSAALYSRPEL 262 Query: 257 LNKALAGEAVSLKLVSVGLLTASNIFGKEGTMVEDQMRAWQSL-TQPGKMIHLKIRNKDG 315 L++AL+G+ V LK+VS LLT +++ G E +M++DQ+ A + L ++ G+ L IRN DG Sbjct: 263 LSQALSGKTVDLKIVSTSLLTPTSLTGGEESMLKDQVNALKGLNSKRGEPTKLLIRNSDG 322 Query: 316 DLQTVKIKPDVAAFNVGVNELALKLGFGLKASDSYNAEALHQLLGNDLRPEARPGGWVGE 375 L+ V + V FN GVNELALK+G G + D N E++ LLG++ GGW E Sbjct: 323 LLKEVSVNLKVVTFNFGVNELALKMGLGWRNVDKLNDESICSLLGDNFLKNGVIGGWAAE 382 Query: 376 WLAQYPDNYEVVNTLARQIKDIWKNNQHHKDGGEPYKLAQRLAMLAHEIDAVPAWNCKSG 435 + + P V LA QIK+I D GEPYKL+QR+ +LA+ I AVP WNCKSG Sbjct: 383 AIEKNPPCKNDVIYLANQIKEIINKKLQKNDNGEPYKLSQRMTLLAYTIGAVPCWNCKSG 442 Query: 436 KDRTGMMDSEIKREHISLHQTHMLSAPGSLPDSGGQKIFQKVLLNSGNLEIQKQNTGGAG 495 KDRTGM D+EIKRE I H+T S S S +++F +L+NSGN+EIQ+ NTG G Sbjct: 443 KDRTGMQDAEIKREIIRKHETGQFSQLNSKLSSEEKRLFSTILMNSGNMEIQEMNTGVPG 502 Query: 496 NKVMKNLSPEVLNLSYQKRVGDENIWQSVKGISSLI 531 NKVMK L L LSY +R+GD IW VKG SS + Sbjct: 503 NKVMKKLPLSSLELSYSERIGDSKIWNMVKGYSSFV 538
>PF07824#Type III secretion chaperone Length = 120 Score = 165 bits (419), Expect = 1e-56 Identities = 33/114 (28%), Positives = 63/114 (55%), Gaps = 1/114 (0%) Query: 1 MESLLNRLYDALGLDAPE-DEPLLIIDDGIQVYFNESDHTLEMCCPFMPLPDDILTLQHF 59 ME L + + ALG+ + + D+ +++DD + +Y + ++ + CPF LP++I L + Sbjct: 1 MEDLADVICRALGIPSIDTDDQAIMLDDDVLIYIEKEGDSINLLCPFCALPENINDLIYA 60 Query: 60 LRLNYTSAVTIGADADNTALVALYRLPQTSTEEEALTGFELFISNVKQLKEHYA 113 L LNY+ + + D + +L+A L + E+ E +IS V+ LK+ +A Sbjct: 61 LSLNYSEKICLATDDEGGSLIARLDLTGINEFEDIYVNTEYYISRVRWLKDEFA 114
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 480 bits (1236), Expect = e-173 Identities = 214/387 (55%), Positives = 264/387 (68%), Gaps = 29/387 (7%) Query: 2 MKRKILAAVIPALLAAATANAAEIYNKDGNKLDLYGKAVGRHVWTTTGDSKNADQTYAQI 61 MKRK+LA VIPALLAA A+AAEIYNKDGNKLDLYGK G H + + SK+ DQTY ++ Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLH-YFSDDSSKDGDQTYMRV 59 Query: 62 GFKGETQINTDLTGFGQWEYRTKADRAEGEQQNSNLVRLAFAGLKYAEVGSIDYGRNYGI 121 GFKGETQIN LTG+GQWEY +A+ EGE NS RLAFAGLK+ + GS DYGRNYG+ Sbjct: 60 GFKGETQINDQLTGYGQWEYNVQANTTEGEGANS-WTRLAFAGLKFGDYGSFDYGRNYGV 118 Query: 122 VYDVESYTDMAPYFSGETWGGAYTDNYMTSRAGGLLTYRNSDFFGLVDGLSFGIQYQGKN 181 +YDVE +TDM P F G+++ Y DNYMT RA G+ TYRN+DFFGLVDGL+F +QYQGKN Sbjct: 119 LYDVEGWTDMLPEFGGDSY--TYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKN 176 Query: 182 QDNHS---------------INSQNGDGVGYTMAYEFD-GFGVTAAYSNSKRTNDQQDRD 225 + + I NGDG G + Y+ GF AAY+ S RTN+Q + Sbjct: 177 ESQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAG 236 Query: 226 G---NGDRAESWAVGAKYDANNVYLAAVYAETRNMSIVENTVTD-TVEMANKTQNLEVVA 281 G GD+A++W G KYDANN+YLA +Y+ETRNM+ T +ANKTQN EV A Sbjct: 237 GTIAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTA 296 Query: 282 QYQFDFGLRPAISYVQSKGKQLNGAD---GSADLAKYIQAGATYYFNKNMNVWVDYRFNL 338 QYQFDFGLRPA+S++ SKGK L + DL KY GATYYFNKN + +VDY+ NL Sbjct: 297 QYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINL 356 Query: 339 LDEND--YSSSYVGTDDQAAVGITYQF 363 LD++D Y + + TDD A+G+ YQF Sbjct: 357 LDDDDPFYKDAGISTDDIVALGMVYQF 383
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 40.0 bits (93), Expect = 7e-05 Identities = 44/267 (16%), Positives = 90/267 (33%), Gaps = 20/267 (7%) Query: 347 QEKIERYEADLEELQIRLEEQNEVVAEAAEMQDENEARAEAAELEVDELKSQLADYQQAL 406 + K + + L+ +E E ++ A E +N+ ++ EL+++ AD ++AL Sbjct: 70 KLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKAL 129 Query: 407 DVQQTRAIQYNQAISALARAKELCHLPDLTPESAAEWLDTFQAKEQEATEKLLSLEQKMS 466 + + + I L K L A + + A + K+ Sbjct: 130 EGAMNFSTADSAKIKTLEAEKA-----ALAARKADL-----EKALEGAMNFSTADSAKIK 179 Query: 467 VAQTAHSQFEQAYQLVAAINGPLARSEAWDVARELLRDGVNQRHLAEQVQPLRMRLSELE 526 + + E + D + L + L R ++LE Sbjct: 180 TLEAEKAALEARQAELEKALE--------GAMNFSTADSAKIKTLEAEKAALAARKADLE 231 Query: 527 QRLREQQEAERLLAEFCKRQGKNFDIDELEALHQELEARIASLSDSVSSASEQRMALRQE 586 + L + K + + LEA ELE + + ++ S + L E Sbjct: 232 KALEGAMNFSTADSA--KIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAE 289 Query: 587 QEQLQSRIQHLMQRAPVWLAAQNSLNQ 613 + L++ L ++ V A + SL + Sbjct: 290 KAALEAEKADLEHQSQVLNANRQSLRR 316 Score = 38.5 bits (89), Expect = 2e-04 Identities = 39/283 (13%), Positives = 97/283 (34%), Gaps = 18/283 (6%) Query: 974 DSAEMLSGNSDLNEKLRQRLEQAEAERTRAREALRSHAAQLSQYSQVLASLKSSYDTKKE 1033 D + D N++L + L A+ + + ++L A+++ + A L+ + + Sbjct: 75 DLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMN 134 Query: 1034 LLNDLQRELQDIGVRADSGAEERA--RQRRDELHAQLSNNRSRRNQLEKALTFCEAEMEN 1091 +++ + + A +A + + + + ++ LE EA Sbjct: 135 FSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAE 194 Query: 1092 LTRKLRKLERDY-------HEMREQVVTAKAGWCAVMRMVKDNGVERRLHRRELAYLSAD 1144 L + L + + A + + ++ ++ L A+ Sbjct: 195 LEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAE 254 Query: 1145 ------ELRSMSDKALGALRLAVADNEHLRDVLRLSEDPKRPERKIQFFVAVYQHLRERI 1198 + GA+ + AD+ ++ + + + ++ V R+ + Sbjct: 255 KAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSL 314 Query: 1199 RQDIIRTDDPVEAIEQMEIELSRLTEELTSREQKLAISSRSVA 1241 R+D+ D EA +Q+E E +L E+ E R + Sbjct: 315 RRDL---DASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLD 354 Score = 37.4 bits (86), Expect = 4e-04 Identities = 49/288 (17%), Positives = 90/288 (31%), Gaps = 20/288 (6%) Query: 835 EAEIRRLNGRRVELERALATHE---NDNQQQRLQFEQAKEGVSALNRLLPRLNLLADETL 891 ++I+ L R+ +LE+AL + + E K ++A L + A Sbjct: 112 ASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFS 171 Query: 892 ADRVDEIQERLDEAQEAARFVQQYGNQLAKLEPVVSVLQSDPEQFEQLKEDYAWSQQMQR 951 +I+ E + L + + + E K A + Sbjct: 172 TADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLE 231 Query: 952 DARQQAFALAEVVERRAHFSYSDSAEMLSGNSDLNEKLRQRLEQAEAERTRAREALRSHA 1011 A + A + + ++ A + + ++L EK + + + L + Sbjct: 232 KALEGAMNFSTADSAKIKTLEAEKAALEARQAEL-EKALEGAMNFSTADSAKIKTLEAEK 290 Query: 1012 AQLSQYSQVLAS-----------LKSSYDTKKELLNDLQRELQDIGVRADSGAEERARQR 1060 A L L L+ D +E L+ E Q + + R R Sbjct: 291 AALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLR 350 Query: 1061 RDELHAQLSNNRSRRNQLEKALTFCEAEMENLTRKLRKLERDYHEMRE 1108 RD L +R + QLE E + + + L RD RE Sbjct: 351 RD-----LDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE 393 Score = 36.6 bits (84), Expect = 7e-04 Identities = 59/356 (16%), Positives = 114/356 (32%), Gaps = 32/356 (8%) Query: 261 HLISEATDYVAADYMRHANERRVHLDQALAFRRELYTSRKQLAAEQYKHVDMARELGEHN 320 + E D + + A + ++L+ + K + L E Sbjct: 53 EKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKA 112 Query: 321 GAEGSLEADY----QAASDHLNLVQTALRQQEKIERYEADLEELQIRLEEQNEVVAEAAE 376 LEA +A +N + + +E +A L + LE+ E + Sbjct: 113 SKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFST 172 Query: 377 MQDENEARAEAAELEVDELKSQL-ADYQQALDVQQTRAIQYNQAISALARAKELCHLPDL 435 EA + ++ +++L + A++ + + + A + Sbjct: 173 ADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEK 232 Query: 436 TPESAAEWLDTFQAKEQEATEKLLSLEQKMSVAQTAHSQFEQAYQLVAAINGPLARSEAW 495 E A + AK + + +LE + + + A ++ Sbjct: 233 ALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTA--------DSAKIK 284 Query: 496 DVARELLRDGVNQRHLAEQVQPLRMRLSELEQRLREQQEA-ERLLAEFCK---------- 544 + E + L Q Q L L + L +EA ++L AE K Sbjct: 285 TLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEA 344 Query: 545 -RQGKNFDID-------ELEALHQELEARIASLSDSVSSASEQRMALRQEQEQLQS 592 RQ D+D +LEA HQ+LE + S S A R+ ++Q++ Sbjct: 345 SRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEK 400
>FLAGELLIN#Flagellin signature. Length = 507 Score = 30.0 bits (67), Expect = 0.009 Identities = 17/83 (20%), Positives = 34/83 (40%), Gaps = 10/83 (12%) Query: 106 RLANEGIFTQQEL---YDELLTLADEAKLLKLVNNRSTGSDVDRQKLQEKVRSSLNRLRR 162 R AN+GI Q +E+ + L + T SD D + +Q++++ L + R Sbjct: 65 RNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDR 124 Query: 163 L-------GMVWFMGHDSSKFRI 178 + G+ + K ++ Sbjct: 125 VSNQTQFNGVKVLSQDNQMKIQV 147
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 117 bits (294), Expect = 4e-38 Identities = 33/88 (37%), Positives = 57/88 (64%), Gaps = 1/88 (1%) Query: 2 TKSELIERLATQQSHIPAKAVEDAVKEMLEHMASTLAQGERIEIRGFGSFSLHYRAPRTG 61 K +LI ++A + + + K AV + ++S LA+GE++++ GFG+F + RA R G Sbjct: 3 NKQDLIAKVA-EATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKG 61 Query: 62 RNPKTGDKVELEGKYVPHFKPGKELRDR 89 RNP+TG++++++ VP FK GK L+D Sbjct: 62 RNPQTGEEIKIKASKVPAFKAGKALKDA 89
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 32.2 bits (73), Expect = 0.003 Identities = 38/158 (24%), Positives = 61/158 (38%), Gaps = 6/158 (3%) Query: 8 VMLLLCGLLLLT-LAIAVLNTLVPLWLAQANLPTWQVGMVSSSYFTGNLVGTLFTGYLIK 66 +++ LC L + L VLN +P N P V++++ +GT G L Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74 Query: 67 RIGFNHSYYLASLIFAAGCVGLGVMVGFWSWMSW-RFIAGIGCAMIWVVVESALMCSGTS 125 ++G +I G V V F+S + RFI G G A +V + Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134 Query: 126 HNRGRLLAAYMMVYYMGTFLGQLLVSKVSGELLHVLPW 163 NRG+ + MG +G + G + H + W Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPA----IGGMIAHYIHW 168
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 52.0 bits (124), Expect = 2e-08 Identities = 56/320 (17%), Positives = 94/320 (29%), Gaps = 45/320 (14%) Query: 580 AAPAFSLATGGAPRPQVKEGIGPQLPRPNRVRVPTRRELASYGIKLPSQRIAEEKAREAE 639 + A P V ++ R + VP PS+ E A ++ Sbjct: 994 TTNITTPNNIQADVPSVPSN-NEEIARVDEAPVPPPAP------ATPSET-TETVAENSK 1045 Query: 640 RNQYETGVQLTDEEIDAMHQDELARQFAQSQQHRYGETYQHDTQQAEDDDTAAEAELARQ 699 + E DA R+ A+ + + +TQ E + +E + + Sbjct: 1046 QESKTVEKN----EQDATETTAQNREVAKEAK----SNVKANTQTNEVAQSGSETKETQT 1097 Query: 700 FAASQQQRYSGEQPAGAQPFSLDDLDFSPMKVLVDEGPHEPLFTPGVMPESTPVQQPVAP 759 + E+ A KV ++ P T V P+ +Q Sbjct: 1098 TETKETATVEKEEKA---------------KVETEKTQEVPKVTSQVSPKQ---EQSETV 1139 Query: 760 QPQPQYQQPQQPV--APQPQYQQPQQPVAPQPQYQQPQQPVAP----QPQYQQPQQPVAP 813 QPQ + + P +PQ Q QP + P P Sbjct: 1140 QPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENP 1199 Query: 814 QPQYQQPQQPTAPQDSLIHPLLMRNGDSRPLQ-RPTTPLPSLDLLTPPPSEVEPVDTFAL 872 + QPT +S P +N R ++ P P+ + S V D + Sbjct: 1200 ENTTPATTQPTVNSESSNKP---KNRHRRSVRSVPHNVEPA-TTSSNDRSTVALCDLTST 1255 Query: 873 EQMARLVEARLADFRIKADV 892 A L +AR + +V Sbjct: 1256 NTNAVLSDARAKAQFVALNV 1275 Score = 40.8 bits (95), Expect = 4e-05 Identities = 29/175 (16%), Positives = 54/175 (30%), Gaps = 17/175 (9%) Query: 405 QPQEAQSAPWQQPVPVASAPQYAATPATAAEYDSLAPQETQPQWQAPDAEQHWQPEPTHQ 464 P+ +Q PQ A PA + + D EQ + ++ Sbjct: 1122 VPKVTSQVSPKQEQSETVQPQ--AEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179 Query: 465 PEPVYQPEPIAAEPSHMPPPVIEQPVATEPEPDTEETRPARPPLYYFEEVEEKRAREREQ 524 +PV + + S + P P T+P ++E + + + + R Sbjct: 1180 EQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESS----------NKPKNRHRRSVRS 1229 Query: 525 LAAWYQPIPEPVKENVPVKPTVSVAPSIPPVEAVAAAASLDAGIKSGALAAGAAA 579 + EP + + TV++ A + A + AL G A Sbjct: 1230 VPH----NVEPATTSSNDRSTVALCDLT-STNTNAVLSDARAKAQFVALNVGKAV 1279
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 55.2 bits (133), Expect = 2e-10 Identities = 30/125 (24%), Positives = 49/125 (39%), Gaps = 17/125 (13%) Query: 4 RILVLGASGYIGQHLVFALSQQGHQVRA---------AARRVERLEKQRLANVSCHKVDL 54 + LV GA+G+IG H+ L + GHQV + + RLE HK+DL Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 55 HWPENLPALLRD--IDTVYYLVH------GMGEGGDFIAHERQAALNVRDALRQTPVKQL 106 E + L + V+ H + + LN+ + R ++ L Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121 Query: 107 IFLSS 111 ++ SS Sbjct: 122 LYASS 126
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 46.7 bits (111), Expect = 5e-08 Identities = 49/207 (23%), Positives = 78/207 (37%), Gaps = 25/207 (12%) Query: 1 MTQYASSLRSLAAGSVLLFLFASPVKAEEQTIAPPGVDAR-AWILMDYASGKVLAEGNAD 59 M + SL A ++ L + ASP E+ ++ + R I MD ASG+ L AD Sbjct: 1 MRYIRLCIISLLA-TLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRAD 59 Query: 60 EKLDPASLTKIMTSYVVGQALKAGKIKLTDMVTVGKDAWATGNPALRGSSVMFLKPGDQV 119 E+ S K++ V + AG +L + + +P V D + Sbjct: 60 ERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSP------VSEKHLADGM 113 Query: 120 SVADLNKGIIIQSGNDACIALADYVAGSQESFIGLMNAYAKRLGLTNTT---FQTVHGLD 176 +V +L I S N A L V G + A+ +++G T ++T Sbjct: 114 TVGELCAAAITMSDNSAANLLLATVGGPAG-----LTAFLRQIGDNVTRLDRWETELNEA 168 Query: 177 APGQF---STARDMA------LLGKAL 194 PG +T MA L + L Sbjct: 169 LPGDARDTTTPASMAATLRKLLTSQRL 195
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 253 bits (647), Expect = 1e-89 Identities = 156/171 (91%), Positives = 164/171 (95%) Query: 1 MKKIACLSALAAVLAFSAGTAVAATSTVTGGYAQSDAQGVANKMSGFNLKYRYEQDDNPL 60 MKKIACLSALAAVLAF+AGT+VAATSTVTGGYAQSDAQG NKM GFNLKYRYE+D++PL Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAATSTVTGGYAQSDAQGQMNKMGGFNLKYRYEEDNSPL 60 Query: 61 GVIGSFTYTEKDRTNGAGDYNKGQYYGITAGPAYRLNDWASIYGVVGVGYGKFQTTDYPT 120 GVIGSFTYTEK RT +GDYNK QYYGITAGPAYR+NDWASIYGVVGVGYGKFQTT+YPT Sbjct: 61 GVIGSFTYTEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVGVGYGKFQTTEYPT 120 Query: 121 YKHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWIAGVGYRF 171 YKHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWIAGVGYRF Sbjct: 121 YKHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWIAGVGYRF 171
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 144 bits (365), Expect = 5e-47 Identities = 31/146 (21%), Positives = 70/146 (47%), Gaps = 4/146 (2%) Query: 22 SESDKKATVELLNRQVIQFIDLSLITKQAHWNMRGANFIAVHEMLDGFRTALTDHLDTMA 81 +++++ LN Q+ + L + HW ++G +F +HE + + +DT+A Sbjct: 6 AKTNQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVDTIA 65 Query: 82 ERAVQLGGVALGTTQVINSKTPLKSYPLDIHNVQDHLKELADRYAVVANDVRKAIG---E 138 ER + +GG + T + + + + + ++ L + Y ++++ + IG E Sbjct: 66 ERLLAIGGQPVATVKEYTEHASITDGGNET-SASEMVQALVNDYKQISSESKFVIGLAEE 124 Query: 139 AKDEDTADIFTAASRDLDKFLWFIES 164 +D TAD+F +++K +W + S Sbjct: 125 NQDNATADLFVGLIEEVEKQVWMLSS 150
>SECA#SecA protein signature. Length = 901 Score = 29.8 bits (67), Expect = 0.023 Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 4/67 (5%) Query: 246 QQVLVFTRTKHGANHLAEQLNKDGIRSAAIHG-NKSQGARTRALADFKSGDIRVLVATDI 304 Q VLV T + + ++ +L K GI+ ++ + A A A + + V +AT++ Sbjct: 450 QPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNM 506 Query: 305 AARGLDI 311 A RG DI Sbjct: 507 AGRGTDI 513
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 118 bits (296), Expect = 2e-34 Identities = 76/255 (29%), Positives = 119/255 (46%), Gaps = 16/255 (6%) Query: 3 LKDKVAIITGAASARGLGFATAKLFAENGAKVVIIDLNGEAS---EAAAAALGEGHLGLA 59 ++ K+A ITGAA +G+G A A+ A GA + +D N E ++ A Sbjct: 6 IEGKIAFITGAA--QGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63 Query: 60 ANVADEVQVQAAIEQIMAKYGRVDVLVNNAGITQPLKLMDIKRANYDAVLDVSLRGTLLM 119 A+V D + +I + G +D+LVN AG+ +P + + ++A V+ G Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123 Query: 120 SQAVIPVMRAQKSGSIVCISSISAQRGGGIFGGPHYSAAKAGVLGLARAMARELGPDNVR 179 S++V M ++SGSIV + S A G Y+++KA + + + EL N+R Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPA--GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181 Query: 180 VNCITPGLIQTDITAGKLTDE---------MTANILAGIPMNRLGDAVDIARAALFLGSD 230 N ++PG +TD+ DE GIP+ +L DIA A LFL S Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241 Query: 231 LASYSTGITLDVNGG 245 A + T L V+GG Sbjct: 242 QAGHITMHNLCVDGG 256
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 45.6 bits (108), Expect = 3e-07 Identities = 59/398 (14%), Positives = 126/398 (31%), Gaps = 46/398 (11%) Query: 22 LTMIFLVYAINYADRTNIGAVLPFIIDEFHINNFEAGAIASMFFLGYAVSQIP----AGF 77 L +I A++ I VLP ++ + +N + L YA+ Q G Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLAL-YALMQFACAPVLGA 65 Query: 78 FIAKRGTRGLVSLSIFGFSAFTWLMGTVSSVFSLKMVRLGLGLSEGPCPVGLASTINNWF 137 + G R ++ +S+ G + +M T ++ L + R+ G++ V + I + Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAV-AGAYIADIT 124 Query: 138 PPKEKATATGVYIAATMFAPIIVPPLAVWIAVTWGWRWVFFSFAIPGIVAAIAWYLLVKS 197 E+A G +++A ++ P+ + + FF+ A + + L+ Sbjct: 125 DGDERARHFG-FMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL-- 181 Query: 198 KPSESGFVSQSELEEINAGRDIHKNTVRENILIADRFTLLDKIIRVKKMAPIDTAKRLFT 257 S G E +N + +A + + Sbjct: 182 PESHKGERRPLRREALNP------------------LASFRWARGMTVVAAL-----MAV 218 Query: 258 SKNILGDCLAYFMMVSVLYGLLTWIPLYLVKERGFDVMSMGFVASMPCIGGFIGAIGGGW 317 +F+M V ++ +D ++G + G + ++ Sbjct: 219 ----------FFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLA---AFGILHSLAQAM 265 Query: 318 ISDKVLGRRRKPTMMFTAISTVVMMLIMLNIPASTWAVCVGLFFVGLCLNIGWPAFTAYG 377 I+ V R + + + I+L W + + IG PA A Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLA-SGGIGMPALQAML 324 Query: 378 MAVSDTKTYPIASSIINSGGNLGGFVAPMAAGFLLDKT 415 D + + + +L V P+ + + Sbjct: 325 SRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAAS 362
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 30.4 bits (68), Expect = 0.013 Identities = 17/55 (30%), Positives = 24/55 (43%), Gaps = 5/55 (9%) Query: 230 VLQTAKALGIPVKGHVEQLSLLGGAQLVSRYQGLSADHIEYLDEAGVAAMRDGGT 284 VL+ +P G +S+LG ++L L HI AGVAAM+ Sbjct: 202 VLRDTNVTAVPASGAPAAVSVLGASELT-----LDGGHITGGRAAGVAAMQGAVV 251
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.8 bits (69), Expect = 0.005 Identities = 10/20 (50%), Positives = 13/20 (65%) Query: 34 IFLGPNGCGKSTLLRSLAGL 53 + G G GKSTL+ +L GL Sbjct: 600 VLEGTGGIGKSTLINTLVGL 619
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 33.3 bits (76), Expect = 0.003 Identities = 16/66 (24%), Positives = 27/66 (40%), Gaps = 3/66 (4%) Query: 503 AAAPAASSAPAT---APAGPGTPVTAPLAGNIWKVIAAEGQTVAEGDVLLILEAMKMETE 559 A A +G + + ++I EG++V +GDVLL L A+ E + Sbjct: 76 VLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEAD 135 Query: 560 IRAAQA 565 Q+ Sbjct: 136 TLKTQS 141 Score = 31.0 bits (70), Expect = 0.016 Identities = 16/56 (28%), Positives = 23/56 (41%), Gaps = 10/56 (17%) Query: 533 KVIAAEGQTVAEGDVLLILEAMKMETEIRAAQAGTVRGIAVKSGDAVSVGDTLMTL 588 V A G+ G EI+ + V+ I VK G++V GD L+ L Sbjct: 82 IVATANGKLTHSGRSK----------EIKPIENSIVKEIIVKEGESVRKGDVLLKL 127
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 29.0 bits (65), Expect = 0.021 Identities = 2/39 (5%), Positives = 19/39 (48%) Query: 56 LTQLQQQLSDNQSDIDSLRGQIQENQYQLNQVMERQKQI 94 + + + + + +++ + Q+++ + ++ E + + Sbjct: 254 VLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLV 292
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 115 bits (289), Expect = 2e-33 Identities = 36/119 (30%), Positives = 55/119 (46%), Gaps = 4/119 (3%) Query: 56 EEQARLQMQQLQQNNIVYFDLDKYDIRSDFAAMLDAHANFLRSN--PSYKVTVEGHADER 113 +Q + + V F+ +K ++ + A LD + L + V V G+ D Sbjct: 205 APAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRI 264 Query: 114 GTPEYNISLGERRANAVKMYLQGKGVSADQISIVSYGKEKPAVLGHDEAAYAKNRRAVL 172 G+ YN L ERRA +V YL KG+ AD+IS G+ P V G+ K R A++ Sbjct: 265 GSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNP-VTGN-TCDNVKQRAALI 321
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 62.8 bits (152), Expect = 1e-12 Identities = 29/199 (14%), Positives = 67/199 (33%), Gaps = 6/199 (3%) Query: 64 YNRQQDQQASARRAEEERKKLQQQQAEELQQKQAAEQERLKQLEKERLAAQEQQKQAEEA 123 YN + +++ Q E R+ + A + E Sbjct: 981 YNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETV 1040 Query: 124 AKLAQQQQQQAEEAAKAAADAKKKAEAEAAKAAADAKKKAEAEAVKAAADAKKKAEAEAA 183 A+ ++Q+ + E+ + A + + A +A ++ K + V A+ +E + Sbjct: 1041 AENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEV-----AQSGSETKET 1095 Query: 184 KAAADAKKKAEAEAAKAAAEAKKKAEAEAAKAAAEAKKKADAEAAKAAAEAKKKADAAAA 243 + + + KA E +K E + + K+ +E + AE ++ D Sbjct: 1096 QTTETKETATVEKEEKAKVETEKTQE-VPKVTSQVSPKQEQSETVQPQAEPARENDPTVN 1154 Query: 244 KAAADAKKKAAAEKAAAAE 262 ++ A+ A+ Sbjct: 1155 IKEPQSQTNTTADTEQPAK 1173 Score = 55.1 bits (132), Expect = 3e-10 Identities = 29/184 (15%), Positives = 63/184 (34%), Gaps = 4/184 (2%) Query: 55 VDPGAVVQQYNRQQDQQASARRAEEERKKLQQQQAEELQQKQAAEQERLKQLEKERLAAQ 114 VD + N Q D S EE ++ + +E E + ++ Sbjct: 992 VDTTNITTPNNIQADVP-SVPSNNEEIARVDEAPVPPPAPATPSETTETVA-ENSKQESK 1049 Query: 115 EQQKQAEEAAKLAQQQQQQAEEAAKAAADAKKKAEAEAAKAAADAKKKAEAEAVKAAADA 174 +K ++A + Q ++ A+EA + E + + + E + A + Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE-TATVEK 1108 Query: 175 KKKAEAEAAKAAADAKKKAEAEAAKAAAEA-KKKAEAEAAKAAAEAKKKADAEAAKAAAE 233 ++KA+ E K K ++ + +E + +AE K+ ++ A Sbjct: 1109 EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADT 1168 Query: 234 AKKK 237 + Sbjct: 1169 EQPA 1172 Score = 53.5 bits (128), Expect = 1e-09 Identities = 24/219 (10%), Positives = 71/219 (32%), Gaps = 21/219 (9%) Query: 66 RQQDQQASARRAEEERKKLQQQQAEE--LQQKQAAEQER------LKQLEKERLAAQEQQ 117 + A + E + + +Q A E Q ++ A++ + + E + ++ ++ Sbjct: 1035 ETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKE 1094 Query: 118 KQ-----------AEEAAKLAQQQQQQAEEAAKAAADAKKKAEAEAAKAAADAKKKAEAE 166 Q EE AK+ ++ Q E + + K+ ++E + A+ ++ + Sbjct: 1095 TQTTETKETATVEKEEKAKVETEKTQ--EVPKVTSQVSPKQEQSETVQPQAEPARENDPT 1152 Query: 167 AVKAAADAKKKAEAEAAKAAADAKKKAEAEAAKAAAEAKKKAEAEAAKAAAEAKKKADAE 226 ++ A+ + A + E ++ + E + A + Sbjct: 1153 VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVN 1212 Query: 227 AAKAAAEAKKKADAAAAKAAADAKKKAAAEKAAAAEGVD 265 + + + + + ++ + D Sbjct: 1213 SESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCD 1251 Score = 45.4 bits (107), Expect = 3e-07 Identities = 24/215 (11%), Positives = 59/215 (27%), Gaps = 6/215 (2%) Query: 59 AVVQQYNRQQDQQASARRAEEERKKLQQQQAEELQQKQAAEQERLKQLEKERLAAQEQQK 118 A Q Q + E K+ + EE + + + + E ++ +Q K Sbjct: 1078 ANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQ-----EVPKVTSQVSPK 1132 Query: 119 QAEEAAKLAQQQQQQAEEAAKAAADAKKKAEAEAAKAAADAKKKAEAEAVKAAADAKKKA 178 Q + Q + + + + + + A + + E + Sbjct: 1133 QEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTG 1192 Query: 179 EAEAAKAAADAKKKAEAEAAKAAAEAKKKAEAEAAKAAAEAKKKADAEAAKAAAEAKKKA 238 + + ++ K + ++ + A + + A Sbjct: 1193 NSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDL 1252 Query: 239 DAAAAKAA-ADAKKKAAAEKAAAAEGVDDLLGDLS 272 + A +DA+ KA + V + L Sbjct: 1253 TSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLE 1287
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 31.0 bits (70), Expect = 0.004 Identities = 12/41 (29%), Positives = 23/41 (56%), Gaps = 1/41 (2%) Query: 15 VDNAPRMQDYTLEGEEGRDM-MLLDALIQLKEKDPSLSFRR 54 ++N + T+E + + MLLDAL+++ + DP L + Sbjct: 339 IENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYV 379
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.8 bits (69), Expect = 0.006 Identities = 11/33 (33%), Positives = 16/33 (48%), Gaps = 6/33 (18%) Query: 50 KIDFTLTEGNRLALIGHNGSGKTTLLRVLAGAY 82 K D+++ L G G GK+TL+ L G Sbjct: 594 KFDYSVV------LEGTGGIGKSTLINTLVGLD 620
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 30.4 bits (68), Expect = 0.006 Identities = 16/87 (18%), Positives = 26/87 (29%), Gaps = 8/87 (9%) Query: 20 LTLVSSANIACGFHAGDAQTMLT---CVREALKNGVAIGAHPSFPDRDN--FGRT--AMV 72 + + IA G G T+LT V + A+ A PS ++DN G + Sbjct: 95 VEAPTGTFIASGVVVGK-DTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQI 153 Query: 73 LPPETVYAQTLYQIGALGAIVQAQGGV 99 + + V Sbjct: 154 TKYSGEGDLAIVKFSPNEQNKHIGEVV 180
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 37.9 bits (88), Expect = 2e-05 Identities = 14/68 (20%), Positives = 31/68 (45%), Gaps = 4/68 (5%) Query: 155 NPFTNGHRYLIQQAAAQCDWLHLFLVKEDTSRFPY---EDRLDLVLKGTTDIPRLTVHRG 211 +P T GH +I++ D +++ V + ++ P ++RL+ + K +P V Sbjct: 10 DPITFGHLDIIERGCRLFDQVYV-AVLRNPNKQPMFSVQERLEQIAKAIAHLPNAQVDSF 68 Query: 212 SEYIISRA 219 ++ A Sbjct: 69 EGLTVNYA 76
>STREPTOPAIN#Streptopain (C10) cysteine protease family signature. Length = 398 Score = 31.2 bits (70), Expect = 0.011 Identities = 17/73 (23%), Positives = 33/73 (45%), Gaps = 1/73 (1%) Query: 41 LDTNMKTQLRAYLEKLTKPVELIATLDDS-AKSAEIKELLAEIAELSDKVTFKEDNTLPV 99 D N K + +++E + ++ LD + A +AEIK+ + + S + + + N + Sbjct: 109 FDANGKENIASFMESYVEQIKENKKLDTTYAGTAEIKQPVVKSLLDSKGIHYNQGNPYNL 168 Query: 100 RKPSFLITNPGSQ 112 P PG Q Sbjct: 169 LTPVIEKVKPGEQ 181
>BCTLIPOCALIN#Bacterial lipocalin signature. Length = 171 Score = 28.4 bits (63), Expect = 0.018 Identities = 18/98 (18%), Positives = 41/98 (41%), Gaps = 13/98 (13%) Query: 50 QGITILKSFEAPGGMKGYLGKYQDMGVTIYLTPDGKHAISG--YMYNEKGENLSNALIEK 107 + + + FE YLGK+ ++ + G ++ + N+ G ++ N Sbjct: 21 ESVKPVSDFEL----NNYLGKWYEVARLDHSFERGLSQVTAEYRVRNDGGISVLN----- 71 Query: 108 EIYAPAGREMWQKMEKASWILDGKKDAPVVLYVFADPF 145 Y+ + W++ E ++ ++G D + + F PF Sbjct: 72 RGYSEE-KGEWKEAEGKAYFVNGSTDGYLKVSFFG-PF 107
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 338 bits (868), Expect = e-120 Identities = 105/257 (40%), Positives = 148/257 (57%), Gaps = 20/257 (7%) Query: 9 KTVWVTGAGKGIGYATALAFVDAGARVIGFDRE---------------FTQENYPFATEV 53 K ++TGA +GIG A A GA + D E +P Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP----- 63 Query: 54 MDVADAAQVAQVCQRVLQKTPRLDVLVNAAGILRMGATDALSVDDWQQTFAVNVGGAFNL 113 DV D+A + ++ R+ ++ +D+LVN AG+LR G +LS ++W+ TF+VN G FN Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123 Query: 114 FSQTMAQFRRQQGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALTVGLELAGCGVRCN 173 ++ G+IVTV S+ A PR M+AY +SKAA +GLELA +RCN Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183 Query: 174 VVSPGSTDTDMQRTLWVSEDAEQQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASDLA 233 +VSPGST+TDMQ +LW E+ +Q I+G E FK GIPL K+A+P +IA+ +LFL S A Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243 Query: 234 SHITLQDIVVDGGSTLG 250 HIT+ ++ VDGG+TLG Sbjct: 244 GHITMHNLCVDGGATLG 260
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 424 bits (1092), Expect = e-153 Identities = 147/299 (49%), Positives = 191/299 (63%), Gaps = 18/299 (6%) Query: 1 MAIPKLQSYALPTALDIPTNKVNWAFEPERAALLIHDMQDYFVSFWGRNCPMMDQVIANI 60 MAIP +Q Y +PTA D+P NKV+W +P RA LLIHDMQ+YFV + + ++ ANI Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60 Query: 61 AALRQYCKEHHIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQKVVEALTPDEADTV 120 L+ C + IPV YTAQP Q+ +DRALL D WGPGL P ++K++ L P++ D V Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120 Query: 121 LVKWRYSAFHRSPLEQMLKDTGRNQLIITGVYAHIGCMTTATDAFMRDIKPFMVADALAD 180 L KWRYSAF R+ L +M++ GR+QLIITG+YAHIGC+ TA +AFM DIK F V DA+AD Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180 Query: 181 FCREEHLMALNYVAGRSGRVVMTESLL------PTPVPASKA-----------ALRALIL 223 F E+H MAL Y AGR VMT+SLL P V + A +R I Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240 Query: 224 PLLDETDEPLD-DENLIDYGLDSVRMMGLAARWRKVHGDIDFVMLAKNPTIDAWWALLS 281 LL ET E + E+L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W LL+ Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 60.0 bits (145), Expect = 2e-12 Identities = 47/210 (22%), Positives = 82/210 (39%), Gaps = 21/210 (10%) Query: 105 EPNAETVAAQMPDLILISATGGDSALALYDQLSAIAPTLVINYDDKS-----WQSLLTQL 159 EPN E + P ++ SA G S + L+ IAP N+ D + LT++ Sbjct: 86 EPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPLAMARKSLTEM 141 Query: 160 GEITGQEKQAAARIAEFEAQLTTVKQRIALPPQPVSALVYTPAAHSANLWTPESAQGKLL 219 ++ + A +A++E + ++K R L ++ P S ++L Sbjct: 142 ADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEIL 201 Query: 220 TQLGFTLATLPRGLQTSKSQGKRHDIIQLGGENLAAGLNGESLFLFAGDNKDVAALYANP 279 + G A + + + + LAA + + L ++KD+ AL A P Sbjct: 202 DEYGIPNAW--------QGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATP 253 Query: 280 LLAHLPAVQNKRVYALGTETFRLDYYSATL 309 L +P V+ R + F Y ATL Sbjct: 254 LWQAMPFVRAGRFQRVPAVWF----YGATL 279
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 29.8 bits (67), Expect = 0.019 Identities = 70/397 (17%), Positives = 131/397 (32%), Gaps = 66/397 (16%) Query: 27 FISIVSLGLLGVAVPVQIQMMTHSTWQVGLSVTLTGGAMFIGLMVGGVLADRYERKKVIL 86 F S+++ +L V++P T IG V G L+D+ K+++L Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 87 LARGTCGIGFIGLCVNSLLPEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGRENLMQ 146 G + V ++A ++ G F +L ++ + +EN + Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL----VMVVVARYIPKENRGK 139 Query: 147 AGAITMLTVRLGSVISPMLGGILLASGGVAWNYGLAAAGTFITLLPLLTLPRLPVPPQPR 206 A + V +G + P +GG++ + W+Y L IT++ + L +L Sbjct: 140 AFGLIGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIP--MITIITVPFLMKLLKKEVRI 195 Query: 207 ------------------------------------------------ENPFIAL-LAAF 217 +PF+ L Sbjct: 196 KGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKN 255 Query: 218 RFLLASPLIGGIALLGGLVTMASAVRVLYPALAMSWQMSTAQIGLLYAAI-PLGAAIGAL 276 + L GGI T+A V ++ + Q+STA+IG + + I Sbjct: 256 IPFMIGVLCGGIIF----GTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311 Query: 277 TSGQLAHSVRPGLIMLVSTVG---SFLAVGVFAIMPVWIAGVICLALFGWLSAISSLLQY 333 G L P ++ + SFL W +I + + G LS +++ Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVIST 371 Query: 334 TLLQTQTPENMLGRMNGLWTAQNVTGDAIGAALLGGL 370 + + + M L + + G A++GGL Sbjct: 372 IVSSSLKQQEAGAGM-SLLNFTSFLSEGTGIAIVGGL 407
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 31.3 bits (71), Expect = 0.032 Identities = 9/68 (13%), Positives = 25/68 (36%) Query: 1 MTQRLPLVAAQPGIWMAEKLSDLPSAWSVAHYVELNGELDVALLAKAVAVGMQQADTLRM 60 M ++ +AA G+ +A+ L + + ++ L A+ ++ ++ Sbjct: 1 MHHKITGIAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKD 60 Query: 61 RFTEENGE 68 + G Sbjct: 61 QTEASMGA 68
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 29.1 bits (65), Expect = 0.039 Identities = 30/151 (19%), Positives = 47/151 (31%), Gaps = 15/151 (9%) Query: 128 GAESVIPAITGLTTTAGVFDSTGLLSLSQRPARLG--ILGGGYIGLEFASMFANFGTKVT 185 GA+ I A+ F + G + + + G I E S F V Sbjct: 136 GADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPS---KFKDSVN 192 Query: 186 IFEAAPQFLPREDRDIAQAITRILQEKGVELILNANVQAVSSKEGAVQVETPEGAHLVDA 245 + L D A + ++ + + S+E + V+ P A L Sbjct: 193 LVLQ----LRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQE--IAVQKPRVADLTRL 246 Query: 246 LLVASGRKPATAGLQLQNAGVAVNERGGIIV 276 + T A V +NER G IV Sbjct: 247 MAEIENLTVETD----TPAKVVINERTGTIV 273
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 62.5 bits (152), Expect = 2e-14 Identities = 20/69 (28%), Positives = 33/69 (47%), Gaps = 9/69 (13%) Query: 74 LDEALHHGAVLRVRPKAMTVAVIIAGLLPVLWGTGAGSEVMSRI---------TAPLLSL 124 + EA +R+RP MT I G+LP+ GAGS + + +A LL++ Sbjct: 959 VVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAI 1018 Query: 125 FIIPAAYKL 133 F +P + + Sbjct: 1019 FFVPVFFVV 1027
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 34.8 bits (80), Expect = 1e-05 Identities = 7/45 (15%), Positives = 17/45 (37%), Gaps = 1/45 (2%) Query: 4 KRYPEEFKIEAVRQVVER-GHSVSSAATHLDITTHSFYARIKKYG 47 R E + + + + AA L + ++ +I++ G Sbjct: 430 DRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELG 474
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 70.2 bits (172), Expect = 2e-16 Identities = 29/122 (23%), Positives = 58/122 (47%), Gaps = 2/122 (1%) Query: 1 MKPASVIIMDEHPIVRMSIEVLLGKNSNIQVVLKTDDSRTAIEYLRTYPVDLVILDIELP 60 M A++++ D+ +R + L + V T ++ T ++ DLV+ D+ +P Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYD--VRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 GTDGFTLLKRIKSIQEHTRILFLSSKSEAFYAGRAIRAGANGFVSKRKDLNDIYNAVKMI 120 + F LL RIK + +L +S+++ A +A GA ++ K DL ++ + Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 121 LS 122 L+ Sbjct: 119 LA 120
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 30.2 bits (68), Expect = 0.018 Identities = 16/149 (10%), Positives = 42/149 (28%), Gaps = 8/149 (5%) Query: 299 RSQLNYSEENLKQARASLERLYTALRGTDKSAAPAGGEAFEARFVEAMNDDFNTPEAY-- 356 + ++ +L QAR R R + + P E F ++ + Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK 192 Query: 357 SVLFDMAREVN--RLKGEDMTAA-NAMASHLRKISGVLGLLEQEPDVFLQSGAQADDGEV 413 + L + A + + + + + + + D F + Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS---LLHKQAI 249 Query: 414 AEIEALIQQRLDARKAKDWAAADAARDRL 442 A+ L Q+ + + +++ Sbjct: 250 AKHAVLEQENKYVEAVNELRVYKSQLEQI 278
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 358 bits (921), Expect = e-127 Identities = 126/310 (40%), Positives = 176/310 (56%), Gaps = 16/310 (5%) Query: 2 KTLVVALGGNALLQRGEALTAENQYRNIADAVPALARL-ARSYRLAIVHGNGPQVGLLAL 60 K +V+ALGGNAL QRG+ + E N+ +A + AR Y + I HGNGPQVG L L Sbjct: 3 KRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLLL 62 Query: 61 QNLAWKA---VEPYPLDVLVAESQGMIGYMLAQRLALEPDM----PPVTAVLTRIKVSAD 113 A +A + P+DV A SQG IGYM+ Q L E V ++T+ V + Sbjct: 63 HMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDKN 122 Query: 114 DPAFLEPEKFIGPVYSPEEQMALEATYGWHMKRD-GKYLRRVVASPAPRQIIESAAIELL 172 DPAF P K +GP Y E L GW +K D G+ RRVV SP P+ +E+ I+ L Sbjct: 123 DPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKL 182 Query: 173 LKEGHVVICSGGGGVPVAGEG---EGVEAVIDKDLAAALLAEQIAADGLIILTDADAVYE 229 ++ G +VI SGGGGVPV E +GVEAVIDKDLA LAE++ AD +ILTD + Sbjct: 183 VERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAAL 242 Query: 230 HWGTPQQRAIRQASPDELAPFAKAD----GAMGPKVTAVSGYVKRCGKPAWIGALSRIDD 285 ++GT +++ +R+ +EL + + G+MGPKV A +++ G+ A I L + + Sbjct: 243 YYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHLEKAVE 302 Query: 286 TLAGRAGTCI 295 L G+ GT + Sbjct: 303 ALEGKTGTQV 312
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 49.7 bits (119), Expect = 1e-08 Identities = 37/163 (22%), Positives = 57/163 (34%), Gaps = 32/163 (19%) Query: 4 DLIIKNGTVILENEARVIDIAVQGGKIAAIGEN------------LGEAKNVLDATGLIV 51 D +I N ++ DI ++ G+IAAIG+ +G V+ G IV Sbjct: 69 DTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIV 128 Query: 52 SPGMVDAHTHISEPGRTHWEGYETGTRAAAKGGITTMIEMPLNQLPATVDRET------- 104 + G +D+H H P + A G+T M+ PA T Sbjct: 129 TAGGMDSHIHFICPQQIE---------EALMSGLTCMLGGGTG--PAHGTLATTCTPGPW 177 Query: 105 -IELKFDAAKGKLTIDAAQLGGLVSYNLDRLHELDEVGVVGFK 146 I +AA ++ A G + L E+ G K Sbjct: 178 HIARMIEAADA-FPMNLAFAGKGNASLPGALVEMVLGGATSLK 219
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 32.0 bits (73), Expect = 0.001 Identities = 9/39 (23%), Positives = 21/39 (53%), Gaps = 2/39 (5%) Query: 488 ESIEKLADEVDENAKEAEKALEPFVERVKTLL--GDRVK 524 + I K+A+ + K++ A++ V + L G++V+ Sbjct: 6 DLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQ 44
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 45.1 bits (106), Expect = 9e-07 Identities = 52/275 (18%), Positives = 85/275 (30%), Gaps = 34/275 (12%) Query: 366 PEPETPRQSFAPVAPTAVMTPP--QVQQPSAP-----------APQTSPAPLPASTSQVL 412 PE E Q V T + TP Q PS P AP PAP S + Sbjct: 983 PEVEKRNQ---TVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTET 1039 Query: 413 AARNQLQRAQGVTKTKK--SEPAAASRARPVNNSALERLASVSERVQARPAPSALETAPV 470 A N Q ++ V K ++ +E A + R + + + E A Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQN-----------REVAKEAKSNVKANTQTNEVAQS 1088 Query: 471 KKEAYRWKATTPVVQTKEVVATPKALKKALEHEKTPELAAKLAAEAIERDPWAAQVSQLS 530 E T +TKE K K +E EKT E K+ ++ + + V + Sbjct: 1089 GSET----KETQTTETKETATVEKEEKAKVETEKTQE-VPKVTSQVSPKQEQSETVQPQA 1143 Query: 531 LPKLVEQVALNAWKEQNGNAVCLHLRSTQRHLNSSGAQQKLAQALSDLTGTTVELTIVED 590 P +N + Q+ + +S+ Q + + VE Sbjct: 1144 EPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTT 1203 Query: 591 DNPAVRTPLEWRQAIYEEKLAQARESIIADNNIQT 625 T + + ++ S+ + T Sbjct: 1204 PATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPAT 1238
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 115 bits (291), Expect = 8e-38 Identities = 49/88 (55%), Positives = 67/88 (76%) Query: 2 NKSQLIEKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTFAVKERAARTGR 61 NK LI K+A +++K + A+DA+ ++V+ L +G+ V L+GFG F V+ERAAR GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 62 NPQTGKEITIAAAKVPSFRAGKALKDAV 89 NPQTG+EI I A+KVP+F+AGKALKDAV Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 34.3 bits (78), Expect = 0.002 Identities = 34/133 (25%), Positives = 68/133 (51%), Gaps = 15/133 (11%) Query: 191 ERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMDDAPD- 249 LE A +E + +L R +++ ++ S+ +Q++A ++L E + + Sbjct: 291 AALEAEKADLEHQSQVLNAN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEA 344 Query: 250 ENEALKRKIDAAKMPKEAKEKAEAELQKLKMMSPMS-AEATVVRGYIDWMVQVPWNARSK 308 ++L+R +DA++ EAK++ EAE QKL+ + +S A +R +D + A+ + Sbjct: 345 SRQSLRRDLDASR---EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE----AKKQ 397 Query: 309 VKKDLRQAQEILD 321 V+K L +A L Sbjct: 398 VEKALEEANSKLA 410
>PF06291#Lambda prophage Bor protein Length = 102 Score = 26.9 bits (59), Expect = 0.023 Identities = 11/37 (29%), Positives = 18/37 (48%) Query: 17 NMLKKLLFPLVALFMLAGCATPPTTIDVAPKITLPQQ 53 N +KK+LF ++ GCA T+ P P++ Sbjct: 4 NKMKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKE 40
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 41.0 bits (96), Expect = 7e-06 Identities = 40/190 (21%), Positives = 76/190 (40%), Gaps = 15/190 (7%) Query: 221 RNNAWLI-LLLIVLYKLGDAFAMSLTTTFLIRGVGFDAGEVGVVNKTLGLLATIVGALYG 279 R+N LI L ++ + + + ++++ + VN L +I A+YG Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 280 GILMQRLSLFRALLIFGILQGASNAGYWLLSITDKNMFSMGAAVFFENLCGGMGTAAFVA 339 L +L + R LL I+ + ++ + FS+ + G G AAF A Sbjct: 71 K-LSDQLGIKRLLLFGIIINCFGS----VIGFVGHSFFSL---LIMARFIQGAGAAAFPA 122 Query: 340 LLM----TLCNKSFSATQFALLSALSAVGRVYVGPVAGWFVEAH-GWPTFYLFSVVAAVP 394 L+M K F L+ ++ A+G VGP G + + W L ++ + Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMG-EGVGPAIGGMIAHYIHWSYLLLIPMITIIT 181 Query: 395 GLLLLLVCRQ 404 L+ + ++ Sbjct: 182 VPFLMKLLKK 191
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 85.3 bits (211), Expect = 4e-20 Identities = 85/383 (22%), Positives = 155/383 (40%), Gaps = 26/383 (6%) Query: 16 GLGTVFSLRMLGMFMVLPVLTTY--GMALQGASEALIGIAIGIYGLAQAIFQIPFGLLSD 73 L TV L +G+ +++PVL + A GI + +Y L Q G LSD Sbjct: 10 ILSTVA-LDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSD 68 Query: 74 RIGRKPLIVGGLAVFVAGSVIAALSHSIWGIILGRALQG-SGAIAAAVMALLSDLTREQN 132 R GR+P+++ LA I A + +W + +GR + G +GA A A ++D+T Sbjct: 69 RFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDE 128 Query: 133 RTKAMAFIGVSFGITFAIAMVLGPIVTHSLGLNALFWMIAALATLGILLTIWVVPNSTNH 192 R + F+ FG VLG ++ +A F+ AAL L L +++P S Sbjct: 129 RARHFGFMSACFGFGMVAGPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCFLLPESHKG 187 Query: 193 VLNRESGMVKGSFSKVLAEPRLLKLNFGIMCLHILLMSTFVA-LPGQLADAGFPAAEHWK 251 + + G+ + L+ F+ L GQ+ A + + Sbjct: 188 ERRPLRREALNPLAS-------FRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDR 240 Query: 252 VYLATMVIAFA--------AVVPFIIYAEVKRRMKQVFLFCVGLI--VVAEIVLWGAGQH 301 + I + ++ +I V R+ + +G+I I+L A + Sbjct: 241 FHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRG 300 Query: 302 FWELVIGVQLFFLAFNL--MEALLPSLISKESPAGYKGTAMGVYSTSQFLGVALGGSLGG 359 + I V L + ++A+L + +E +G+ + S + +G L ++ Sbjct: 301 WMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA 360 Query: 360 WIDGTFDGQTVFLAGAVLAMVWL 382 T++G ++AGA L ++ L Sbjct: 361 ASITTWNG-WAWIAGAALYLLCL 382
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.043 Identities = 7/21 (33%), Positives = 12/21 (57%) Query: 46 VLALIGPSGSGKTTVLRAVAG 66 + L G G GK+T++ + G Sbjct: 598 SVVLEGTGGIGKSTLINTLVG 618
>PF06580#Sensor histidine kinase Length = 349 Score = 37.2 bits (86), Expect = 1e-04 Identities = 17/123 (13%), Positives = 38/123 (30%), Gaps = 28/123 (22%) Query: 290 TFTFEVDDSLSVLGNEEQLRSAISNLVYNAVNH----TPAGTHITVSWRRVAHGAEFCIQ 345 F +++ ++ + + + LV N + H P G I + + ++ Sbjct: 241 QFENQINPAIM---DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVE 297 Query: 346 DNGPGIAAEHIPRLTERFYRVDKARSRQTGGSGLGLAIVKHALNH---HESRLEIDSSPG 402 + G +G GL V+ L E+++++ G Sbjct: 298 NTGSLAL------------------KNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQG 339 Query: 403 KGT 405 K Sbjct: 340 KVN 342
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 98.0 bits (244), Expect = 7e-26 Identities = 34/149 (22%), Positives = 63/149 (42%), Gaps = 9/149 (6%) Query: 4 RILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNKLNEPWPDLILLDWMLPGGSGLQ 63 ILV +D+A IR ++ L + G+ + + + DL++ D ++P + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 64 FIKHLKREAMTRDIPVVMLTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRI 123 + +K+ D+PV++++A+ ++ E GA DY+ KPF EL+ I + Sbjct: 65 LLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA-- 120 Query: 124 SPMAVEEVIEMQGLSLDPGSHRVMTGDSP 152 E L D + G S Sbjct: 121 -----EPKRRPSKLEDDSQDGMPLVGRSA 144
>FRAGILYSIN#Fragilysin metallopeptidase (M10C) enterotoxin signature. Length = 405 Score = 29.3 bits (65), Expect = 0.028 Identities = 14/70 (20%), Positives = 29/70 (41%), Gaps = 4/70 (5%) Query: 149 KQQQLLHAIADYYQQQYQEACQLRGERKLPVIATGHLTTVGASKSDAVRDIYIGTLDAFP 208 K+ Q+++ IA++Y +++ + E++ T D + + I A Sbjct: 135 KEAQMMNEIAEFYAAPFKKTRAIN-EKEAFECIYDSRTRSA--GKD-IVSVKINIDKAKK 190 Query: 209 AQHFPPADYI 218 + P DYI Sbjct: 191 ILNLPECDYI 200
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 49.1 bits (117), Expect = 6e-08 Identities = 32/198 (16%), Positives = 71/198 (35%), Gaps = 13/198 (6%) Query: 373 TQQSHDRAQLSQWQQQLLSDTRQRDALPPLTLDLTPQALAEARALHTRQRPLRHRLAALQ 432 TQ S +A+L Q + Q+LS + + + LP L L P + R L ++ Sbjct: 139 TQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL------IK 192 Query: 433 GQILPKQKRQAQLQAAIARHHQEQAQYTQRLADKRLSYKTKAQELADVRTICEQ----EA 488 Q Q ++ Q + + + E+ R+ + + L D ++ + + Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKH 252 Query: 489 RIKDLESQRAHLQS--GQPCPLCGSTTHPAIAAYQALELSANQTRRDALEKEVKTLAEEG 546 + + E++ + ++A + +L + + L+K +T Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNI- 311 Query: 547 AALRGQLDALTQQLQRDE 564 L +L ++ Q Sbjct: 312 GLLTLELAKNEERQQASV 329
>PF06291#Lambda prophage Bor protein Length = 102 Score = 30.0 bits (67), Expect = 0.002 Identities = 21/68 (30%), Positives = 30/68 (44%), Gaps = 11/68 (16%) Query: 28 VNDKEIICSPDESNTHTFVILEGVVSLVRGDKVLIGIVQAPFIFGLADGVAKKEAQYKLI 87 V +K +P E+ TH F VS + K V A I G A+ V K E Q + Sbjct: 29 VGNKPTAVTPKETITHHFF-----VSGIGQKKT----VDAAKICGGAENVVKTETQQTFV 79 Query: 88 AESGCIGY 95 +G +G+ Sbjct: 80 --NGLLGF 85
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 120 bits (303), Expect = 6e-30 Identities = 100/436 (22%), Positives = 165/436 (37%), Gaps = 59/436 (13%) Query: 608 TYSANGEADNSYTDNVVA---ATGNYKVRIDNATGAGSVADYKGNELIRVNDVNTDATFS 664 + N AD +D +V A+G +++ + N+ GS L+ + + ATF+ Sbjct: 483 LFRMNVFADLGLSDKLVVMQDASGQHRLWVRNS---GSEPASANTLLLVQTPLGSAATFT 539 Query: 665 AAN---KADLGAYTYQAKQEGNTV------------------------------------ 685 AN K D+G Y Y+ GN Sbjct: 540 LANKDGKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPEAPAPQ 599 Query: 686 VLEQMELTDYANMALSIP--SANTNIWNLEQDTVGTRLTNARHGLADNGGAWVSYFGGNF 743 EL+ AN A++ + +W E + + RL R D GGAW F Sbjct: 600 PPAGRELSAAANAAVNTGGVGLASTLWYAESNALSKRLGELRL-NPDAGGAWGRGFAQRQ 658 Query: 744 NGDNGTIN-YDQDVNGIMVGVDTKVDGNNAKWIVGAAAGFAKGDLS---DRTGQVDQDSQ 799 DN +DQ V G +G D V +W +G AG+ +GD D G D Sbjct: 659 QLDNRAGRRFDQKVAGFELGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGHTD---- 714 Query: 800 SAYIYSSARFANN--IFVDGNLSYSHFNNDLSANMSDGTYVDGNTSSDAWGFGLKLGYDL 857 S ++ A + + ++D L S ND SDG V G + G L+ G Sbjct: 715 SVHVGGYATYIADSGFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGRRF 774 Query: 858 KLGDAGYVTPYGSVSGLFQSGDDYQLSNDMKVDGQSYDSMRYELGVDAGYTFTYSEDQAL 917 D ++ P ++ G Y+ +N ++V + S+ LG++ G + + + Sbjct: 775 THADGWFLEPQAELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAGGRQV 834 Query: 918 TPYFKLAYVYD-DSNNDADVNGDSIDNGVEGSAVRVGLGTQFSFTKNFSAYTDANYLGGG 976 PY K + + + D NG + + G+ +GLG + + S Y Y G Sbjct: 835 QPYIKASVLQEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYASYEYSKGP 894 Query: 977 DVDQDWSANVGVKYTW 992 + W+ + G +Y+W Sbjct: 895 KLAMPWTFHAGYRYSW 910
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 32.3 bits (73), Expect = 0.003 Identities = 19/69 (27%), Positives = 29/69 (42%) Query: 254 DVLREIRERTELPLGAYQVSGEYAMIKFAAMAGAIDEEKVVLESLGSIKRAGADLIFSYF 313 + E+ + +L L QV G A F +D E L I+ A +IF+ Sbjct: 466 NQFLELEKTKQLRLDTDQVYGNIATYNFENGRVRVDTGSNWSEVLPQIQETTARIIFNGK 525 Query: 314 ALDLAEKNI 322 L+L E+ I Sbjct: 526 DLNLVERRI 534
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 56.4 bits (136), Expect = 7e-11 Identities = 56/306 (18%), Positives = 108/306 (35%), Gaps = 17/306 (5%) Query: 19 FTSWMLDAFDFFILVFVLSDLAEWFHAS---VSDVSIAIMLTLAVRPIGALLFGRMAEKY 75 ++ LDA +++ VL L S + I + L ++ A + G +++++ Sbjct: 11 LSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF 70 Query: 76 GRRPILMLNILFFTVFELLSAWSPTFMAFLIFRVMYGVAMGGIWGVASSLAMETIPDRSR 135 GRRP+L++++ V + A +P I R++ G+ G VA + + R Sbjct: 71 GRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDER 129 Query: 136 ----GLMSGIFQAGYPCGYLFASVIFGLFYSMVGWRGMFLIGA---LPVVLLPYIWFKVP 188 G MS F G + A + G F A L Sbjct: 130 ARHFGFMSACFGFG-----MVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184 Query: 189 ESPVWLAARARKENTALLPVLRKQWKLCLYLVLVMAFFNFFSHGTQDLYPTFLKMQHGFD 248 R N + + L+ V L+ F + + +D Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWD 244 Query: 249 PHLISI-IAIFYNIAAMLGGIFYGTLSERIGRKKAIMIAAFLALPVLPLWAFSSGSFTIG 307 I I +A F + ++ + G ++ R+G ++A+M+ L AF++ + Sbjct: 245 ATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAF 304 Query: 308 LGAFLM 313 L+ Sbjct: 305 PIMVLL 310 Score = 33.6 bits (77), Expect = 0.001 Identities = 37/186 (19%), Positives = 77/186 (41%), Gaps = 10/186 (5%) Query: 3 TPLNWTTTQRHVAFASFTSWMLDAF-DFFILVFVLSDLAEWFHASVSDVSIAIMLTLAVR 61 W VA +++ ++V+ + FH + + I++ + Sbjct: 201 ASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG-EDRFHWDATTIGISLAAFGILH 259 Query: 62 PIG-ALLFGRMAEKYGRRPILMLNILF-FTVFELLSAWSPTFMAFLIFRVMYGVAMG--G 117 + A++ G +A + G R LML ++ T + LL+ + +MAF I ++ +G Sbjct: 260 SLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPA 319 Query: 118 IWGVASSLAMETIPDRSRGLMSGIFQAGYPCGYLFASVIFGLFYSMVGWRG-MFLIGA-L 175 + + S E + +G ++ + G L + I+ S+ W G ++ GA L Sbjct: 320 LQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA--ASITTWNGWAWIAGAAL 377 Query: 176 PVVLLP 181 ++ LP Sbjct: 378 YLLCLP 383
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 48.3 bits (115), Expect = 4e-08 Identities = 18/112 (16%), Positives = 37/112 (33%), Gaps = 7/112 (6%) Query: 74 ELRSRVGGTLDAVSVPEGRLVSRGQLLFQIDPRPFEVALDTAVAQLRQAEVLARQAQADF 133 E++ + + V EG V +G +L ++ E + L QA + + Q Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157 Query: 134 DRIQR-------LVASGAVSRKNADDVTATRNARQAQMQSAKAAVAAARLEL 178 I+ L + ++V + + Q + + L L Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNL 209 Score = 34.0 bits (78), Expect = 0.001 Identities = 20/106 (18%), Positives = 37/106 (34%), Gaps = 13/106 (12%) Query: 112 LDTAVAQLRQAEVLARQAQADFDRIQRLVASGAVSRKNADDVTATRNARQAQMQSAKAAV 171 L +QL Q E A+ ++ + +L + ++ + + Sbjct: 268 LRVYKSQLEQIESEILSAKEEYQLVTQLFKN---------EILDKLRQTTDNIGLLTLEL 318 Query: 172 AAARLELSWTRITAPIAGRVDRVLVTRGNLVSGGVAGNATLLTTIV 217 A + I AP++ +V ++ V GGV A L IV Sbjct: 319 AKNEERQQASVIRAPVSVKVQQLKVHT----EGGVVTTAETLMVIV 360
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1046 bits (2707), Expect = 0.0 Identities = 435/1040 (41%), Positives = 660/1040 (63%), Gaps = 19/1040 (1%) Query: 6 FFIARPIFAIVLSLLMLLAGAIAFLKLPLSEYPAVTPPTVQVSASYPGANPQVIADTVAA 65 FFI RPIFA VL++++++AGA+A L+LP+++YP + PP V VSA+YPGA+ Q + DTV Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63 Query: 66 PLEQVINGVDGMLYMNTQMAIDGRMVISIAFEQGTDPDMAQIQVQNRVSRALPRLPEEVQ 125 +EQ +NG+D ++YM++ G + I++ F+ GTDPD+AQ+QVQN++ A P LP+EVQ Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123 Query: 126 RIGVVTEKTSPDMLMVVHLVSPQKRYDSLYLSNFAIRQVRDELARLPGVGDVLVWGAGEY 185 + G+ EK+S LMV VS +S++ V+D L+RL GVGDV ++G +Y Sbjct: 124 QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG-AQY 182 Query: 186 AMRVWLDPAKIANRGLTASDIVTALREQNVQVAAGSVGQQPEASA-AFQMTVNTLGRLTS 244 AMR+WLD + LT D++ L+ QN Q+AAG +G P ++ R + Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242 Query: 245 EEQFGEIVVKIGADGEVTRLRDVARVTLGADAYTLRSLLNGEAAPALQIIQSPGANAIDV 304 E+FG++ +++ +DG V RL+DVARV LG + Y + + +NG+ A L I + GANA+D Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302 Query: 305 SNAIRGKMDELQQNFPQDIEYRIAYDPTVFVRASLQSVAITLLEALVLVVLVVVLFLQTW 364 + AI+ K+ ELQ FPQ ++ YD T FV+ S+ V TL EA++LV LV+ LFLQ Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362 Query: 365 RASIIPLVAVPVSLVGTFALMHLFGFSLNTLSLFGLVLSIGIVVDDAIVVVENVERHISQ 424 RA++IP +AVPV L+GTFA++ FG+S+NTL++FG+VL+IG++VDDAIVVVENVER + + Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422 Query: 425 GKSPG-EAAKKAMDEVTGPILSITSVLTAVFIPSAFLAGLQGEFYRQFALTIAISTILSA 483 K P EA +K+M ++ G ++ I VL+AVFIP AF G G YRQF++TI + LS Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482 Query: 484 INSLTLSPALAAILLRPHHDTTKADWLTRLMGTVTGGFFHRFNRFFDSASNRYVSAVRRA 543 + +L L+PAL A LL+P ++ GGFF FN FD + N Y ++V + Sbjct: 483 LVALILTPALCATLLKP---------VSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKI 533 Query: 544 VRGSVIVMVLYAGFVGLTWLGFHQVPNGFVPAQDKYYLVGIAQLPSGASLDRTEAVVKQM 603 + + +++YA V + F ++P+ F+P +D+ + + QLP+GA+ +RT+ V+ Q+ Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593 Query: 604 SAIALA--EPGVESVVVFPGLSVNGPVNVPNSALMFAMLKPFDEREDPSLSANAIAGKLM 661 + L + VESV G S +G N+ + F LKP++ER SA A+ + Sbjct: 594 TDYYLKNEKANVESVFTVNGFSFSG--QAQNAGMAFVSLKPWEERNGDENSAEAVIHRAK 651 Query: 662 HKFSHIPDGFIGIFPPPPVPGLGATGGFKLQIEDRAELGFEAMTKVQSEIMSKAMQTP-E 720 + I DGF+ F P + LG GF ++ D+A LG +A+T+ +++++ A Q P Sbjct: 652 MELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPAS 711 Query: 721 LANMLASFQTNAPQLQVDIDRVKAKSMGVSLTDIFETLQINLGSLYVNDFNRFGRTWRVM 780 L ++ + + Q ++++D+ KA+++GVSL+DI +T+ LG YVNDF GR ++ Sbjct: 712 LVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLY 771 Query: 781 AQADAPFRMQQEDIGLLKVRNAKGEMIPLSAFVTIMRQSGPDRIIHYNGFPSVDISGGPA 840 QADA FRM ED+ L VR+A GEM+P SAF T G R+ YNG PS++I G A Sbjct: 772 VQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAA 831 Query: 841 PGFSSGQATDAIEKIVRETLPEGMVFEWTDLVYQEKQAGNSALAIFALAVLLAFLILAAQ 900 PG SSG A +E + + LP G+ ++WT + YQE+ +GN A A+ A++ ++ FL LAA Sbjct: 832 PGTSSGDAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAAL 890 Query: 901 YNSWSLPFAVLLIAPMSLLSAIVGVWVSGGDNNIFTQIGFVVLVGLAAKNAILIVEFAR- 959 Y SWS+P +V+L+ P+ ++ ++ + N+++ +G + +GL+AKNAILIVEFA+ Sbjct: 891 YESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKD 950 Query: 960 AKEHDGADPLTAVLEASRLRLRPILMTSFAFIAGVVPLVLATGAGAEMRHAMGIAVFAGM 1019 E +G + A L A R+RLRPILMTS AFI GV+PL ++ GAG+ ++A+GI V GM Sbjct: 951 LMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGM 1010 Query: 1020 LGVTLFGLLLTPVFYVVVRR 1039 + TL + PVF+VV+RR Sbjct: 1011 VSATLLAIFFVPVFFVVIRR 1030 Score = 89.5 bits (222), Expect = 4e-20 Identities = 68/427 (15%), Positives = 143/427 (33%), Gaps = 36/427 (8%) Query: 643 FDEREDPSLSANAIAGKLMHKFSHIPDGFIGIFPPPPVPGLGATGGFKLQIEDRAELGFE 702 F DP ++ + KL +P + ++ + + ++ Sbjct: 94 FQSGTDPDIAQVQVQNKLQLATPLLPQEV----QQQGISVEKSSSSYLMVAGFVSDNP-- 147 Query: 703 AMTKVQSEIMSKAMQTPELANM--LASFQTNAPQLQVDI--DRVKAKSMGVSLTDIFETL 758 T+ + L+ + + Q Q + I D ++ D+ L Sbjct: 148 GTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQYAMRIWLDADLLNKYKLTPVDVINQL 207 Query: 759 QIN--------LGSLYVNDFNRFGRTWRVMAQADAPFRMQQEDIGLLKVR-NAKGEMIPL 809 ++ LG + + + P E+ G + +R N+ G ++ L Sbjct: 208 KVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNP-----EEFGKVTLRVNSDGSVVRL 262 Query: 810 SAFVTIMRQSGPDRII-HYNGFPSVDISGGPAPGFSSGQATDAIEKIV---RETLPEGM- 864 + +I NG P+ + A G ++ AI+ + + P+GM Sbjct: 263 KDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMK 322 Query: 865 ---VFEWTDLVYQEKQAGNSALAIFALAVLLAFLILAAQYNSWSLPFAVLLIAPMSLLSA 921 ++ T V + + + + A++L FL++ + + P+ LL Sbjct: 323 VLYPYDTTPFV---QLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGT 379 Query: 922 IVGVWVSGGDNNIFTQIGFVVLVGLAAKNAILIVE-FARAKEHDGADPLTAVLEASRLRL 980 + G N T G V+ +GL +AI++VE R D P A ++ Sbjct: 380 FAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQ 439 Query: 981 RPILMTSFAFIAGVVPLVLATGAGAEMRHAMGIAVFAGMLGVTLFGLLLTPVFYVVVRRM 1040 ++ + A +P+ G+ + I + + M L L+LTP + + Sbjct: 440 GALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKP 499 Query: 1041 ALKRENR 1047 + Sbjct: 500 VSAEHHE 506
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 134 bits (339), Expect = 7e-43 Identities = 59/183 (32%), Positives = 88/183 (48%), Gaps = 21/183 (11%) Query: 1 MKRRSSFLVFLGLLLASPLALANDQHTVSFGYAQTHLSSLKNSDSKDLRGFNFKYRYEFN 60 MK+ + +L + TV+ GYAQ+ N + GFN KYRYE + Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAATSTVTGGYAQSDAQGQMN----KMGGFNLKYRYEED 56 Query: 61 ET-WGMLGSFTATRNEMENYTWKEGKLHKNGSDSVDYGSLMFGPTYRFNDYVSLYGNAGI 119 + G++GSFT T + K Y + GP YR ND+ S+YG G+ Sbjct: 57 NSPLGVIGSFTYTEKSRTASSGDYNK--------NQYYGITAGPAYRINDWASIYGVVGV 108 Query: 120 ATMKF--------NKHSKEDSFAYGAGVIFNPVKSISIDASWEASRFFAVDTNTFGVSVG 171 KF + + F+YGAG+ FNP++++++D S+E SR +VD T+ VG Sbjct: 109 GYGKFQTTEYPTYKHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWIAGVG 168 Query: 172 YRF 174 YRF Sbjct: 169 YRF 171
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 761 bits (1966), Expect = 0.0 Identities = 262/880 (29%), Positives = 416/880 (47%), Gaps = 63/880 (7%) Query: 4 TINLNRKS-LALLIAIVCSGSAQG----EEYYFDPALLQGATYGQ-NIARFNE-QQTPSG 56 I +R + + + + C+ +AQ E YF+P L +++RF Q+ P G Sbjct: 17 HIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPG 76 Query: 57 DYLADVYVNGTLVTSSTNIRFNAVKEGQQTEPCLPLSVMKAAQIKSLPATDAA----TEC 112 Y D+Y+N + + ++ FN Q PCL + + + + + + C Sbjct: 77 TYRVDIYLNNGYMAT-RDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDAC 135 Query: 113 RPLREWVPHAGWQFDSATLRLLLTIPMTELTHKPRGYISPSEWDSGALALFLRHNTNWTH 172 PL + A Q D RL LTIP ++++ RGYI P WD G A L +N + Sbjct: 136 VPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNS 195 Query: 173 TENTDSHYRYQYLWSGLNMGVNLGLWQVRHQSNLRYANSNQS-GSAWRYNSVRTWVQRPV 231 +N Y + L G+N+G W++R + Y +S+ S GS ++ + TW++R + Sbjct: 196 VQN-RIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDI 254 Query: 232 ASINSILSLGDSYTDSSLFGSLSFNGAKLVTDERMRPQGKRGYAPEVRGVAASSAHVVVK 291 + S L+LGD YT +F ++F GA+L +D+ M P +RG+AP + G+A +A V +K Sbjct: 255 IPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIK 314 Query: 292 QLGKVIYETNVPPGPFYIDDLYNTRYQGDLEVEVIEASGKTSRFTVPYSSVPDSVRPGNW 351 Q G IY + VPPGPF I+D+Y GDL+V + EA G T FTVPYSSVP R G+ Sbjct: 315 QNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHT 374 Query: 352 HYSLAFGRVRQYY--DIENRFFEGTFQHGVNNTITLNLGSRIAQRYQAWLAGGVWATGM- 408 YS+ G R + RFF+ T HG+ T+ G+++A RY+A+ G G Sbjct: 375 RYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGAL 434 Query: 409 GAFGLNATWSNARAEHNDRQQGWRAELSYSKTFT-TGTNLVLAAYRYSTNGFRDLQDVLG 467 GA ++ T +N+ + + G Y+K+ +GTN+ L YRYST+G+ + D Sbjct: 435 GALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTY 494 Query: 468 VRREAKTGI-------------DYYSDTLHQRNRLSATVSQPLGRLGTLNLSASTADYYN 514 R DYY+ ++R +L TV+Q LGR TL LS S Y+ Sbjct: 495 SRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWG 554 Query: 515 NQSRITQLQMGYSNQWRNISYGVNIARQRTTWDYDRFYHGVNEPLDVSSRQKYTETTMSF 574 + Q Q G + + +I++ ++ + + W + ++ Sbjct: 555 TSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKG------------------RDQMLAL 596 Query: 575 NVSIPLDWGENRTSVA------MNYNQSSQSRSST---VSMTGSSGENSDLSWSVYGGYE 625 NV+IP S + +Y+ S + G+ E+++LS+SV GY Sbjct: 597 NVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYA 656 Query: 626 RYRNSNSDSSAPTTFGGNLQQNTRFGALRANYDQGDNYRQEGLGASGTLVLHSGGLTAGP 685 + NS S+ L +G Y D+ +Q G SG ++ H+ G+T G Sbjct: 657 GGGDGNSGSTG----YATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQ 712 Query: 686 YTSDTFALIHADGAQGAIVQNGQGAVVDRFGYAILPSLSPYRVNNVTLDTRKMRSDAELT 745 +DT L+ A GA+ A V+N G D GYA+LP + YR N V LDT + + +L Sbjct: 713 PLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLD 772 Query: 746 GGSQQIVPYAGAIARVNFATISGKAVLISVKMPDGGIPPMGADVFNGEGTNIGMVGQSGQ 805 +VP GAI R F G +L+++ + P GA V + + G+V +GQ Sbjct: 773 NAVANVVPTRGAIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQ 831 Query: 806 IYARIAHPSGSLLVRWGKEANQRCRVAYQLDLHTKEPFLY 845 +Y +G + V+WG+E N C YQL +++ L Sbjct: 832 VYLSGMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLT 871
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 33.0 bits (75), Expect = 5e-04 Identities = 14/62 (22%), Positives = 26/62 (41%), Gaps = 7/62 (11%) Query: 146 VGLAHVKLSNNTIPVGFGINETLSASKNNFAWGAGIGAKYAVTDNIMIDASYKYINAGKV 205 VG+ + K P S F++GAG+ ++ +N+ +D SY+ V Sbjct: 106 VGVGYGKFQTTEYPTYKH-----DTSDYGFSYGAGL--QFNPMENVALDFSYEQSRIRSV 158 Query: 206 SI 207 + Sbjct: 159 DV 160
>PF05775#Enterobacteria AfaD invasin protein Length = 142 Score = 92.3 bits (229), Expect = 7e-27 Identities = 38/132 (28%), Positives = 66/132 (50%), Gaps = 2/132 (1%) Query: 14 SVSLLVAASSLMPIANAAEKLQTTLRVGTYFRAGHVPDGMVLAQGWVTYHGSHSGFRVWS 73 S+SL + LM + + ++ TL Y + DG+ LA G + +HSGFRVW Sbjct: 4 SISLTLCGILLMLMGSFSQAADITLMNHKYM-GNLLHDGVKLATGRIICQDTHSGFRVWI 62 Query: 74 DEQKAGNTPAVLLLSGQQDPRHHIQVRLEGEGWQPDTVNGRGAILRTAADNAS-FSVVVD 132 + ++ G ++ + P+H++++R+ G GW G + T ++AS F + VD Sbjct: 63 NARQEGGGAGKYIVQSTEGPQHNLRIRISGNGWSSFVEKGIQGVFNTIKEDASIFYIEVD 122 Query: 133 GNQEVPADTWTL 144 GNQ+V + Sbjct: 123 GNQQVQPGKYLF 134
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 830 bits (2146), Expect = 0.0 Identities = 309/872 (35%), Positives = 452/872 (51%), Gaps = 52/872 (5%) Query: 4 KQPALLLFIAGVVHCANA-------HAYTFDASML-GDAAKGVDMSLFNQG-VQQPGTYR 54 K F+ V CA A F+ L D D+S F G PGTYR Sbjct: 20 KHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYR 79 Query: 55 VDVMVNGKRVDTRDVVFKLEKDGQGTPFLAPCLTVSQLSRYGVKTEDYPQLWKAAKTPDE 114 VD+ +N + TRDV F QG + PCLT +QL+ G+ T + A D Sbjct: 80 VDIYLNNGYMATRDVTFNTGDSEQG---IVPCLTRAQLASMGLNTASVSGMNLLAD--DA 134 Query: 115 CADL-SAIPQAKAVLDINNQQLQLSIPQVALRTKFKGIAPEDLWDDGIPAFLMNYSARTT 173 C L S I A A LD+ Q+L L+IPQ + + +G P +LWD GI A L+NY+ Sbjct: 135 CVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGN 194 Query: 174 QTDYKMDMERRDNSSWVQLQPGINIGAWRVRNATSWQR-----SGQQSGKWQAAYTYAER 228 ++ + +++ LQ G+NIGAWR+R+ T+W S KWQ T+ ER Sbjct: 195 SVQNRIG--GNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLER 252 Query: 229 GLYSLKSRLTLGQKTSQGEIFDSVPFTGVMLASDDNMVPYSERQFAPVVRGIARTQARVE 288 + L+SRLTLG +QG+IFD + F G LASDDNM+P S+R FAPV+ GIAR A+V Sbjct: 253 DIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVT 312 Query: 289 VKQNGYTIYNTTVAPGPFALRDLSVTDSSGDLHVTVWEADGSTQMFVVPYQTPAIALHQG 348 +KQNGY IYN+TV PGPF + D+ +SGDL VT+ EADGSTQ+F VPY + + +G Sbjct: 313 IKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREG 372 Query: 349 YLKYSLLAGRYRSSDSATDKAQIAQATLMYGLPWNLTAYGGIQSATHYQAALLGLGGSLG 408 + +YS+ AG YRS ++ +K + Q+TL++GLP T YGG Q A Y+A G+G ++G Sbjct: 373 HTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMG 432 Query: 409 RWGSLSVDGSDTHSQRQGEAVQQGASWRLRYSNQLTATGTNFFLTRWQYASQGYNTLSDV 468 G+LSVD + +S ++ G S R Y+ L +GTN L ++Y++ GY +D Sbjct: 433 ALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADT 492 Query: 469 LDSYRHNGNRL-------------WSWRENLQPSSRTTLMLSQSWGRHLGNLSLTGSRTD 515 S + N + + L ++Q GR L L+GS Sbjct: 493 TYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQT 551 Query: 516 WRNRPGHDDSYGLSWGTSIGGGSLSLNWNQNRTLWRNGAHRKENITSLWFSMPLSRWTGN 575 + D+ + T+ + +L+++ + W+ G ++ + +L ++P S W + Sbjct: 552 YWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKG---RDQMLALNVNIPFSHWLRS 608 Query: 576 -------NVSASWQMTSPSHGGQTQQVGVNGEAFSQ-QLDWEVRQSYRADAPPGGGNNSA 627 + SAS+ M+ +G T GV G L + V+ Y G+ Sbjct: 609 DSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGY 668 Query: 628 LHLAWNGAYGLLGGDYSYSRAMRQMGVNIAGGIVIHHHGVTLGQPLQGSVALVEAPGASG 687 L + G YG YS+S ++Q+ ++GG++ H +GVTLGQPL +V LV+APGA Sbjct: 669 ATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKD 728 Query: 688 VPVGGWPGVKTDFRGDTTVGNLNVYQENTVSLDPSRLPDDAEVTQTDVRVVPTEGAVVEA 747 V GV+TD+RG + Y+EN V+LD + L D+ ++ VVPT GA+V A Sbjct: 729 AKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRA 788 Query: 748 KFHTHIGARALMTLKREDGSAIPFGAQVTVNGQDGSAALVDTDSQVYLTGLADKGELTVK 807 +F +G + LMTL + +PFGA VT + S+ +V + QVYL+G+ G++ VK Sbjct: 789 EFKARVGIKLLMTLTH-NNKPLPFGAMVT-SESSQSSGIVADNGQVYLSGMPLAGKVQVK 846 Query: 808 WGA---QQCRVNYQLPAHKGIAGLYQMSGLCR 836 WG C NYQLP L Q+S CR Sbjct: 847 WGEEENAHCVANYQLPPESQQQLLTQLSAECR 878
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 27.7 bits (61), Expect = 0.019 Identities = 11/45 (24%), Positives = 20/45 (44%), Gaps = 9/45 (20%) Query: 104 TAKMSLEQYCSKAFSAGFVKPQNRKSLADVVMYYNGKPVGSFEYI 148 M+L++ AF GF +P + + Y GK + F++ Sbjct: 547 KPDMTLKEALKIAF--GFNEP-------NGNLQYQGKDITEFDFN 582
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.3 bits (71), Expect = 0.009 Identities = 31/198 (15%), Positives = 64/198 (32%), Gaps = 36/198 (18%) Query: 177 QQQSQERAARAELLQYQLKELNDFNPQAGEFEQIDEEYKRLANSGQLLTTSQNALALLAD 236 + QS AR E +YQ+ + + E + DE Y + + ++L + + Sbjct: 138 KTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFS- 196 Query: 237 GEDVNLQSQLYSAKQLVSELVGMDSKLSGILDMLEEATIQLTEASDELRHYCERLDLDPN 296 Q+Q Y Q L ++ +L A I E + Sbjct: 197 ----TWQNQKY---QKELNLDKKRAERLTVL-----ARINRYENLSRV------------ 232 Query: 297 RLFELEQRIAKQISLARKHHVSPEALPQLYQSLLEEQQQLDDQADSLETLTLAVNKHHQQ 356 + R+ SL K ++ ++LE++ + + + L + + + Sbjct: 233 ----EKSRLDDFSSLLHKQAIA-------KHAVLEQENKYVEAVNELRVYKSQLEQIESE 281 Query: 357 ALETAQALHQQRQFYAQE 374 L + Q + E Sbjct: 282 ILSAKEEYQLVTQLFKNE 299
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 27.9 bits (62), Expect = 0.012 Identities = 16/78 (20%), Positives = 30/78 (38%), Gaps = 8/78 (10%) Query: 20 GSRVLESSPAQMTAAVDVSKAGISKTFTTRNQLTRNQSILMHLVDGPFKKLIGGWK---- 75 G+ VLE P+ + +D G + + LT I +++G +++ + Sbjct: 113 GNAVLEVDPSITFSIIDRLFGGTGQAAKVQRDLT---DIENSVMEGVIVRILANVRESWT 169 Query: 76 -FTPLSPEACRIEFQLDF 92 L P +IE F Sbjct: 170 QVIDLRPRLGQIETNPQF 187
>INTIMIN#Intimin signature. Length = 939 Score = 45.1 bits (106), Expect = 6e-06 Identities = 63/315 (20%), Positives = 106/315 (33%), Gaps = 38/315 (12%) Query: 2674 TPAQTNGQPLLAFAQDKAGNTGIAAGFTAPDTRVPEAPIITNVVDDVGIYTGAIANGQ-- 2731 +N + A A D+ GN+ T + V D T A A+G Sbjct: 518 VQGGSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEA 577 Query: 2732 VTNDAQPTLNGTAQAGATVS--IYNNGALLGTTTANASGNWSFTPTGNLTEGSHAFT-AT 2788 +T A NG AQA VS I + A+L +AN +G+ T T L + Sbjct: 578 ITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVT--LKSDKPGQVVVS 635 Query: 2789 ATNANGTGSVSTAATVIVDTLAPGTPSGTLSADGGSLSGLAEANSTVTVTLT-------- 2840 A A T +++ A + VD + AD + +A +T T+ Sbjct: 636 AKTAEMTSALNANAVIFVDQTK--ASITEIKAD--KTTAVANGQDAITYTVKVMKGDKPV 691 Query: 2841 -----------GGVTLTT-TAGSNGAWSLTLPTKQIEGQLINVTATDAAGN-ASGTLGIT 2887 G ++ +T +NG +TL + L++ +D A + + + Sbjct: 692 SNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFF 751 Query: 2888 APVLPLAARDNITSLDLTSTAVTSTQSYSDYGLLLVGALGNVASVLGN------DTAQVE 2941 + I + T Y L G G N D + + Sbjct: 752 TTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQ 811 Query: 2942 FTIAEGGTGDVTIDA 2956 T+ E GT +++ + Sbjct: 812 VTLKEKGTTTISVIS 826 Score = 37.4 bits (86), Expect = 0.001 Identities = 60/295 (20%), Positives = 113/295 (38%), Gaps = 28/295 (9%) Query: 2147 IYNGSALVGTA-QVQANGSWSFT-------PSTSLGAGVWNLTATATDAAGNTSAASEIR 2198 +++ SAL Q+Q +GS S G+ V+ +TA A D GN+S + + Sbjct: 486 VWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNVYKVTARAYDRNGNSS-NNVLL 544 Query: 2199 SFTIDTTAPAAPVIDTVYDGTGPITGNLSSGQ--ITDEARPVISGTREAN--TTIRLYDN 2254 + T+ + V D T T + G IT A +G +AN + + Sbjct: 545 TITV-LSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSG 603 Query: 2255 GTLLAEIPADNSSSWRYTPDASLATGNHVITVIAVDAAGNASPV-SDSVNFVVDTTPPLT 2313 +L+ A+ + S + T +L + V++ A S + +++V FV T +T Sbjct: 604 TAVLSANSANTNGSGKAT--VTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASIT 661 Query: 2314 PVITSVSDDQAPGLGTIANGQN--TNDPTPTFSGTAEAGATITLYENGTVIGTTTAQ--P 2369 + A +ANGQ+ T + +T + +T + Sbjct: 662 EI--KADKTTA-----VANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDT 714 Query: 2370 DGAWSVSTSTLASGTHVITAVATDAAGNSSPNSTAFTLTVDTTAPQTPILMSVVD 2424 +G V+ ++ G +++A +D A + F T+ I+ + V Sbjct: 715 NGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVK 769 Score = 36.6 bits (84), Expect = 0.002 Identities = 60/263 (22%), Positives = 89/263 (33%), Gaps = 22/263 (8%) Query: 1467 DGVYTLTAIAADAAGNSSGVSNSFTFTVDTVPLQPPVVN--EILDDVAPVTGLLTDG--A 1522 VY +TA A D GNS SN+ T+ TV VV+ + D A T DG A Sbjct: 522 SNVYKVTARAYDRNGNS---SNNVLLTI-TVLSNGQVVDQVGVTDFTADKTSAKADGTEA 577 Query: 1523 FTNDRTLTINGSGENGSTVTIYDNGVAIGTALVTDGVWTFN-----TPELSEVSHALTFS 1577 T T+ NG + V+ + GTA+++ N T L Sbjct: 578 ITYTATVKKNGVAQANVPVSF---NIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVV 634 Query: 1578 ATDDAGNTTAQTQPITITVDITAPPAPTVQTVADDGTRVAGLADPYA-TVEIHHADGTLV 1636 + A T+A I VD T ++ AD T VA D TV++ D + Sbjct: 635 SAKTAEMTSALNANAVIFVDQTKASITEIK--ADKTTAVANGQDAITYTVKVMKGDKPVS 692 Query: 1637 GSAVANGTGEFVVTLSPAQTDGGTLTAIAIDRAGNNGPATNFPASDSGLPAVPAITAIED 1696 V T ++ S +TD + + + G + V A Sbjct: 693 NQEVTFTTTLGKLSNSTEKTDTNGYAKVTLT-STTPGKSLVSARVSDVAVDVKAPEVEFF 751 Query: 1697 DVGSVQGNIAA--GGATDDTMPT 1717 ++ G +PT Sbjct: 752 TTLTIDDGNIEIVGTGVKGKLPT 774
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 35.6 bits (82), Expect = 4e-04 Identities = 32/224 (14%), Positives = 63/224 (28%), Gaps = 32/224 (14%) Query: 209 DVVQTEARIESARSQLAQYQANLDSAKASLMSWLGWNSLNGINNDFPAKLARSCETATPD 268 + EA +S L Q + + S D P S E Sbjct: 128 TALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRL 187 Query: 269 DRLVPAVLAAW-AQANVARANLDYASAQ---MTPTISLEPSVQHYLNDKYPSHEVLDKTQ 324 L+ + W Q NLD A+ + I+ ++ + L Q Sbjct: 188 TSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQ 247 Query: 325 YSTWVKVEMPLYQGGGLTARRNAASHTVDAAQSTIQRTRLDVRQKLMEARSQAMSLASAL 384 V + ++ +L +SQ + S Sbjct: 248 AIAKHAVL-------------------------EQENKYVEAVNELRVYKSQLEQIES-- 280 Query: 385 QILRRQQQLSERTRELYQQQYLDLGSRPLLDVLNAEQEVYQARF 428 +IL +++ T +L++ + LD + ++ E+ + Sbjct: 281 EILSAKEEYQLVT-QLFKNEILDKLRQTTDNIGLLTLELAKNEE 323
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 243 bits (621), Expect = 3e-78 Identities = 95/432 (21%), Positives = 176/432 (40%), Gaps = 56/432 (12%) Query: 9 ERAFSGAGRIVLICSLLFLILGI-WAWFGRLDEVSTGNGKVIPSSREQVLQSLDGGILAQ 67 E S R+V + FL++ + G+++ V+T NGK+ S R + ++ ++ I+ + Sbjct: 50 ETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKE 109 Query: 68 LTVREGDRVQANQIVARLDPTRLASNVGESAAKYRASLASSARLTAEVSDLPL------- 120 + V+EG+ V+ ++ +L ++ ++ + + R + L Sbjct: 110 IIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELK 169 Query: 121 --AFPAELNGWPDLIAAETRLYKSR-----------RAQLADTEAELRDALASVNK---- 163 P N + + T L K + L AE LA +N+ Sbjct: 170 LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENL 229 Query: 164 ------ELTITQRLEKSGAASHVEVLRLQRQKSDLG---------------------LKI 196 L L A + VL + + + + Sbjct: 230 SRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEY 289 Query: 197 TDLRSQYYVQAREALSKANAEVDMLSAILKGREDSVTRLTVRSPVRGIVKNIQVTTIGGV 256 + + + + L + + +L+ L E+ +R+PV V+ ++V T GGV Sbjct: 290 QLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGV 349 Query: 257 IPPNGEMMEIVPVDDRLLIETRLSPRDIAFIHPGQRALVKITAYDYAIYGGLDGVVETIS 316 + +M IVP DD L + + +DI FI+ GQ A++K+ A+ Y YG L G V+ I+ Sbjct: 350 VTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN 409 Query: 317 PDTIQDKVKPEIFYYRVFIRTHQDYLQNKSGRRFSIVPGMIATVDIKTGEKTIVDYLIKP 376 D I+D+ + V I ++ L + + GM T +IKTG ++++ YL+ P Sbjct: 410 LDAIEDQRLGL--VFNVIISIEENCLSTG-NKNIPLSSGMAVTAEIKTGMRSVISYLLSP 466 Query: 377 F-NRAKEALRER 387 E+LRER Sbjct: 467 LEESVTESLRER 478
>ANTHRAXTOXNA#Anthrax toxin LF subunit signature. Length = 800 Score = 33.2 bits (75), Expect = 0.003 Identities = 22/112 (19%), Positives = 46/112 (41%), Gaps = 19/112 (16%) Query: 294 SEDYSADVKKALVKYHEMQHGNGNLSSDEWESLIAVDVLPEFKRNYEQFFR--NIVSTDA 351 +DY+ + +++ Y+E+ G I++D++ + K +F +S D+ Sbjct: 156 IKDYAINSEQSKEVYYEIGKG------------ISLDIISKDKSLDPEFLNLIKSLSDDS 203 Query: 352 NQ----YLSMGKRFLIMNQKVVDVCFLNSNSLQ-QHKLAFQGQGYVGVKQRD 398 + + K L +N K +D+ F+ N + QH + Y R Sbjct: 204 DSSDLLFSQKFKEKLELNNKSIDINFIKENLTEFQHAFSLAFSYYFAPDHRT 255
>FLAGELLIN#Flagellin signature. Length = 507 Score = 278 bits (712), Expect = 5e-90 Identities = 267/515 (51%), Positives = 314/515 (60%), Gaps = 18/515 (3%) Query: 2 AQVINTNSLSLLTQNNLNKSQSALGTAIERLSSGLRINSAKDDAAGQAIANRFTANIKGL 61 AQVINTNSLSLLTQNNLNKSQS+L +AIERLSSGLRINSAKDDAAGQAIANRFT+NIKGL Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60 Query: 62 TQASRNANDGISIAQTTEGALNEINNNLQRVRELAVQSANSTNSQSDLDSIQAEITQRLN 121 TQASRNANDGISIAQTTEGALNEINNNLQRVREL+VQ+ N TNS SDL SIQ EI QRL Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120 Query: 122 EIDRVSGQTQFNGVKVLAQDNTLTIQVGANDGETIDIDLKQINSQTLGLDSLNVQKAYDV 181 EIDRVS QTQFNGVKVL+QDN + IQVGANDGETI IDL++I+ ++LGLD NV + Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEA 180 Query: 182 KDTAVTTKAYANNGTTLDVSGLDDAAIKAATGGTNGTASVTGGAVKFDADNNKYFVTIGG 241 + + G G + + +G A VT D G Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSG-----AVVTDTTAPTVPDKVYVNAANGQ 235 Query: 242 FTGADAAKNGDYEVNVATDGTVTLAAGATKTTMPAGATTKTEVQELKDTPAVVSADAKNA 301 T DA N ++ T T A A + E + D K Sbjct: 236 LTTDDAENNTAVDLFKTTKST---AGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTG 292 Query: 302 LIAGGVDATDANGAELVKMSYTDKNGKTIEGGYALKAGDKYYAA------DYDEATGAIK 355 G +T NG ++ G L++ Y + +D+ T Sbjct: 293 NDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNES 352 Query: 356 AKTTSYTAADGTTKTAANQLGGVDG----KTEVVTIDGKTYNASKAAGHDFKAQPELAEA 411 AK + A + + + G + + VT+ GKT K A E A A Sbjct: 353 AKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAA 412 Query: 412 AAKTTENPLQKIDAALAQVDALRSDLGAVQNRFNSAITNLGNTVNNLSEARSRIEDSDYA 471 A K+T NPL ID+AL++VDA+RS LGA+QNRF+SAITNLGNTV NL+ ARSRIED+DYA Sbjct: 413 AKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYA 472 Query: 472 TEVSNMSRAQILQQAGTSVLAQANQVPQNVLSLLR 506 TEVSNMS+AQILQQAGTSVLAQANQVPQNVLSLLR Sbjct: 473 TEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.3 bits (78), Expect = 0.004 Identities = 46/217 (21%), Positives = 66/217 (30%), Gaps = 49/217 (22%) Query: 991 PPG----TVVAVVGRSGAGKSTLIKLLAGLYSPGSGQIRVGER-----------LIDAAS 1035 PG V + G G GKSTLI L GL +G + + Sbjct: 590 EPGCKFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSE 649 Query: 1036 LSDYRRQTGLVTQDVALFSGDIAENI-RYPRPNSSDTEVESAARRAGLFETV---QHL-- 1089 ++ +RR D + RY V+ R+ ++ T Q+L Sbjct: 650 MTAFRR------ADAEAVKAFFSSRKDRYRGA--YGRYVQDHPRQVVIWCTTNKRQYLFD 701 Query: 1090 PLGFRT--PVNNGG----TDLSAGQRQLIALA--------RAHLA--QAHILLLDEATAR 1133 G R PV G L + QL A A R + I E R Sbjct: 702 ITGNRRFWPVLVPGRANLVWLQKFRGQLFAEALHLYLAGERYFPSPEDEEIYFRPEQELR 761 Query: 1134 -IDRSAEERLMTSLTRVTHTEKRIALIVAHRLTTARR 1169 ++ + RL LTR A A + + Sbjct: 762 LVETGVQGRLWALLTREG---APAAEGAAQKGYSVNT 795
>PF06580#Sensor histidine kinase Length = 349 Score = 33.7 bits (77), Expect = 0.001 Identities = 23/101 (22%), Positives = 38/101 (37%), Gaps = 21/101 (20%) Query: 370 LLDNALKY----TPEQGIVTARLERDGDAVTLVVEDSGPGIDDEHIHLALQPFHRLDNVG 425 L++N +K+ P+ G + + +D VTL VE++G L N Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA--------------LKNTK 308 Query: 426 NVAGAGIGLALVND-IARLHRTHPHFSRSEALGGLYVRIRF 465 G GL V + + L+ T SE G + + Sbjct: 309 E--STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 96.8 bits (241), Expect = 2e-25 Identities = 35/122 (28%), Positives = 61/122 (50%), Gaps = 1/122 (0%) Query: 2 RLLLAEDNRELAHWLEKALVQNGFAVDCVFDGLAADHLLHSEMYALAVLDINMPGMDGLE 61 +L+A+D+ + L +AL + G+ V + + + L V D+ MP + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VVQRLRKRGQTLPVLLLTARSAVADRVKGLNVGADDYLPKPFELEE-LDARLRALLRRSA 120 ++ R++K LPVL+++A++ +K GA DYLPKPF+L E + RAL Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 121 GQ 122 Sbjct: 125 RP 126
>INTIMIN#Intimin signature. Length = 939 Score = 27.3 bits (60), Expect = 0.030 Identities = 20/69 (28%), Positives = 33/69 (47%), Gaps = 6/69 (8%) Query: 82 SVDDQVKTTTPAAESQFYTVKSGDTLSAISKQVYGNANLYNKIFEANKPMLKSPE---KI 138 D ++ T FYT+K+G+T++ +SK N + I+ NK + S K Sbjct: 48 GSDSKLLTHNSYQNRLFYTLKTGETVADLSKSQDINLST---IWSLNKHLYSSESEMMKA 104 Query: 139 YPGQVLRIP 147 PGQ + +P Sbjct: 105 EPGQQIILP 113
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 84.3 bits (208), Expect = 1e-21 Identities = 67/257 (26%), Positives = 120/257 (46%), Gaps = 7/257 (2%) Query: 3 QVAVVIGGGQTLGAFLCRGLAEEGYRVAVVDIQSDKAANVAQEINADFGEGMAYGFGADA 62 ++A + G Q +G + R LA +G +A VD +K V + A+ A F AD Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE--ARHAEAFPADV 66 Query: 63 TSEQSVLALSRGVDEIFGRVDLLVYSAGIAKAAFISDFQLGDFDRSLQVNLVGYFLCARE 122 ++ ++ ++ G +D+LV AG+ + I +++ + VN G F +R Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 123 FSRLMIRDGIQGRIIQINSKSGKVGSKHNSGYSAAKFGGVGLTQSLALDLAEYGITVHSL 182 S+ M D G I+ + S V + Y+++K V T+ L L+LAEY I + + Sbjct: 127 VSKYM-MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 183 MLGNLLKSPMFQSL-LPQYATKLGIKPDEVEQYYIDKVPLKRGCDYQDVLNMLLFYASPK 241 G+ ++ M SL + + IK +E + +PLK+ D+ + +LF S + Sbjct: 186 SPGS-TETDMQWSLWADENGAEQVIKGS-LETFKTG-IPLKKLAKPSDIADAVLFLVSGQ 242 Query: 242 ASYCTGQSINVTGGQVM 258 A + T ++ V GG + Sbjct: 243 AGHITMHNLCVDGGATL 259
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 27.1 bits (60), Expect = 0.044 Identities = 10/45 (22%), Positives = 18/45 (40%), Gaps = 5/45 (11%) Query: 1 MKPRQRQAAILEHLQKQGKCSVEEL-----AQYFDTTGTTIRKDL 40 M QR I E + + +EL ++ T T+ +D+ Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDI 45
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 374 bits (961), Expect = e-127 Identities = 122/340 (35%), Positives = 180/340 (52%), Gaps = 21/340 (6%) Query: 187 MIGLSPAMTQLKKEIEIVAGSDLNVLIGGETGTGKELVAKAIHQGSPRAVNPLVYLNCAA 246 ++G S AM ++ + + + +DL ++I GE+GTGKELVA+A+H R P V +N AA Sbjct: 139 LVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAA 198 Query: 247 LPESVAESELFGHVKGAFTGAISNRSGKFEMADNGTLFLDEIGELSLALQAKLLRVLQYG 306 +P + ESELFGH KGAFTGA + +G+FE A+ GTLFLDEIG++ + Q +LLRVLQ G Sbjct: 199 IPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQG 258 Query: 307 DIQRVGDDRSLRVDVRVLAATNRDLREEVLAGRFRADLFHRLSVFPLFVPPLRERGDDVV 366 + VG +R DVR++AATN+DL++ + G FR DL++RL+V PL +PPLR+R +D+ Sbjct: 259 EYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIP 318 Query: 367 LLAGYFCEQCRLRLGLSRVVLSPGARRHLLNYGWPGNVRELEHAIHRAVVLARATRAGDE 426 L +F +Q + GL A + + WPGNVRELE+ + R L E Sbjct: 319 DLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITRE 377 Query: 427 VVL-----EEQHFALS---------------EDVLPAPSAESFLALPACRNLRESTENFQ 466 ++ E + E+ + A ALP + Sbjct: 378 IIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEME 437 Query: 467 REMIRQALAQNNHNWAASARALETDVANLHRLAKRLGLKD 506 +I AL N +A L + L + + LG+ Sbjct: 438 YPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSV 477
>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature. Length = 1147 Score = 27.0 bits (59), Expect = 0.011 Identities = 19/75 (25%), Positives = 37/75 (49%), Gaps = 8/75 (10%) Query: 12 IDGNQAKVD--VCGIQRDVDLTLVGSCDENGQPRLGQWVLVHVGFAMSVINEAEARDTLD 69 I GNQ + D G+ D L ++NG+P G W+ + + F + ++ ++ D + Sbjct: 171 IIGNQIRTDQKFMGV-FDESLKERQEAEKNGEPTGGDWLDIFLSF---IFDKKQSSDVKE 226 Query: 70 ALQN--MFDVEPDVG 82 A+ + V+PD+ Sbjct: 227 AINQEPVPHVQPDIA 241
>BORPETOXINA#Bordetella pertussis toxin A subunit signature. Length = 269 Score = 30.5 bits (68), Expect = 0.007 Identities = 16/57 (28%), Positives = 30/57 (52%), Gaps = 8/57 (14%) Query: 201 IISDLTRKWSQAEVAGKLFMSVSSLKRKLAAEEVSFSKIYLDARMNQAIKLLRMGAG 257 ++ LT + Q + F+S SS +R ++++YL+ RM +A++ R G G Sbjct: 66 VLDHLTGRSCQVGSSNSAFVSTSSSRR--------YTEVYLEHRMQEAVEAERAGRG 114
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 42.6 bits (100), Expect = 7e-07 Identities = 33/167 (19%), Positives = 63/167 (37%), Gaps = 10/167 (5%) Query: 23 LLKGLDQEQANEVIAVLQMHNIEANKIDSGKLGYSITVAEPDFTAAVYWIKTYQLPPRPR 82 L L + ++A L NI + +G +I V + LP Sbjct: 53 LFSNLSDQDGGAIVAQLTQMNI-PYRFANG--SGAIEVPADKVHELRLRLAQQGLPKGGA 109 Query: 83 VEIAQMFPADSLVSSPRAEKARLYSAIEQRLEQSLQTMEGVLSARVHISYDIDA---GEN 139 V + + S +E+ A+E L ++++T+ V SARVH++ + E Sbjct: 110 VGFE-LLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQ 168 Query: 140 GRPPKPVHLSALAVYERGSPLAHQISDIKRFLKNSFADVDYDNISVV 186 P V ++ QIS + + ++ A + N+++V Sbjct: 169 KSPSASVTVTLEPGRALDEG---QISAVVHLVSSAVAGLPPGNVTLV 212
>PF07212#Hyaluronoglucosaminidase Length = 336 Score = 28.1 bits (62), Expect = 0.044 Identities = 12/39 (30%), Positives = 21/39 (53%) Query: 234 MSTSTLKRKLAEEGTSFSDIYLSARMNQAAKLLRIGNHN 272 +S +K++ +GT+ IY+++ KLLRI N Sbjct: 241 LSIDIVKKQKGGKGTAAQGIYINSTSGTTGKLLRIRNLG 279
>BACYPHPHTASE#Salmonella/Yersinia modular tyrosine phosphatase signature. Length = 468 Score = 302 bits (774), Expect = 2e-99 Identities = 67/212 (31%), Positives = 103/212 (48%), Gaps = 17/212 (8%) Query: 332 GKPVALAGSYPKNTPDALEAHMKMLLEKECSCLVVLTSEDQMQAKQ--LPPYFRGSYTFG 389 G +A YP LE+H +ML E L VL S ++ ++ +P YFR S T+G Sbjct: 252 GNTRTIACQYP--LQSQLESHFRMLAENRTPVLAVLASSSEIANQRFGMPDYFRQSGTYG 309 Query: 390 EVHTNSQKVSSASQGEAI--DQYNMQL-SCGEKRYTIPVLHVKNWPDHQPLPS--TEQLE 444 + S+ G+ I D Y + + G+K ++PV+HV NWPD + S T+ L Sbjct: 310 SITVESKMTQQVGLGDGIMADMYTLTIREAGQKTISVPVVHVGNWPDQTAVSSEVTKALA 369 Query: 445 YLADRVKNSNQNGAPGRSSS-----DKHLPMIHCLGGVGRTGTMAAALVLKDNPHSNL-- 497 L D+ + +N + SS K P+IHC GVGRT + A+ + D+ +S L Sbjct: 370 SLVDQTAETKRNMYESKGSSAVGDDSKLRPVIHCRAGVGRTAQLIGAMCMNDSRNSQLSV 429 Query: 498 EQVRADFRDSRNNRMLEDASQF-VQLKAMQAQ 528 E + + R RN M++ Q V +K + Q Sbjct: 430 EDMVSQMRVQRNGIMVQKDEQLDVLIKLAEGQ 461
>PF05932#Tir chaperone protein (CesT) Length = 127 Score = 44.0 bits (104), Expect = 1e-08 Identities = 17/128 (13%), Positives = 46/128 (35%), Gaps = 8/128 (6%) Query: 2 QAHQDIIANIGEKLGL-PLTFDDNNQCLLLLDSDIFTSIEAK--DDIWLLNGMIIPLSPV 58 ++ ++ + L + PL FDD+ C +++D+ ++ + LL G++ P Sbjct: 4 LFYKTLLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTLSCDYARERLLLIGLLEPH--- 60 Query: 59 CGDSIWRQIMVINGELAANNEGTLAYIDAAETLLLIHAI-TDLTNTYHIISQLESFVNQQ 117 D + ++ N L + + +I + + + ++ + Sbjct: 61 -KDIPQQCLLAGALNPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWM 119 Query: 118 EALKNILQ 125 + Q Sbjct: 120 RGWREASQ 127
>BACINVASINC#Salmonella/Shigella invasin protein C signature. Length = 409 Score = 515 bits (1327), Expect = 0.0 Identities = 407/409 (99%), Positives = 408/409 (99%) Query: 1 MLISNVGINPAAYLNNHSVENSSQTASQSVSAKDILNSIGISSSKVSDLGLSPTLSAPAP 60 MLISNVGINPAAYLNNHSVENSSQTASQSVSAKDILNSIGISSSKVSDLGLSPTLSAPAP Sbjct: 1 MLISNVGINPAAYLNNHSVENSSQTASQSVSAKDILNSIGISSSKVSDLGLSPTLSAPAP 60 Query: 61 GVLTQTPGTITSFLKASIQNTDMNQDLNALANNVTTKANEVVQTQLREQQAEVGKFFDIS 120 GVLTQTPGTITSFLKASIQNTDMNQDLNALANNVTTKANEVVQTQLREQQAEVGKFFDIS Sbjct: 61 GVLTQTPGTITSFLKASIQNTDMNQDLNALANNVTTKANEVVQTQLREQQAEVGKFFDIS 120 Query: 121 GMSSSAVALLAAANTLMLTLNQADSKLSGKLSLVSFDAAKTTASSMMREGMNALSGSISQ 180 GMSSSAVALLAAANTLMLTLNQADSKLSGKLSLVSFDAAKTTASSMMREGMNALSGSISQ Sbjct: 121 GMSSSAVALLAAANTLMLTLNQADSKLSGKLSLVSFDAAKTTASSMMREGMNALSGSISQ 180 Query: 181 SALQLGITGVGAKLEYKGLQNERGALKHNAAKIDKLTTESHSIKNVLNGQNSVKLGAEGV 240 SALQLGITGVGAKLEYKGLQNERGALKHNAAKIDKLTTESHSIKNVLNGQNSVKLGAEGV Sbjct: 181 SALQLGITGVGAKLEYKGLQNERGALKHNAAKIDKLTTESHSIKNVLNGQNSVKLGAEGV 240 Query: 241 DSLKSLNMKKTGTDATKNLNDATLKSNAGTSATESLGIKDSNKQISPEHQAILSKRLESV 300 DSLKSLNMKKTGTDATKNLNDATLKSNAGTSATESLGIK+SNKQISPEHQAILSKRLESV Sbjct: 241 DSLKSLNMKKTGTDATKNLNDATLKSNAGTSATESLGIKNSNKQISPEHQAILSKRLESV 300 Query: 301 ESDIRLEQNTMDMTRIDARKMQMTGDLIMKNSVTVGGIAGASGQYAATQERSEQQISQVN 360 ESDIRLEQNTMDMTRIDARKMQMTGDLIMKNSVTVGGIAGAS QYAATQERSEQQISQVN Sbjct: 301 ESDIRLEQNTMDMTRIDARKMQMTGDLIMKNSVTVGGIAGASRQYAATQERSEQQISQVN 360 Query: 361 NRVASTASDEARESSRKSTSLIQEMLKTMESINQSKASALAAIAGNIRA 409 NRVASTASDEARESSRKSTSLIQEMLKTMESINQSKASALAAIAGNIRA Sbjct: 361 NRVASTASDEARESSRKSTSLIQEMLKTMESINQSKASALAAIAGNIRA 409
>BACINVASINB#Salmonella/Shigella invasin protein B signature. Length = 593 Score = 842 bits (2176), Expect = 0.0 Identities = 593/593 (100%), Positives = 593/593 (100%) Query: 1 MVNDASSISRSGYTQNPRLAEAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGE 60 MVNDASSISRSGYTQNPRLAEAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGE Sbjct: 1 MVNDASSISRSGYTQNPRLAEAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGE 60 Query: 61 SAINTVGLKPPTDAAREKLSSEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKE 120 SAINTVGLKPPTDAAREKLSSEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKE Sbjct: 61 SAINTVGLKPPTDAAREKLSSEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKE 120 Query: 121 MGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAATKKLTQAQNKLQSLDPADPG 180 MGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAATKKLTQAQNKLQSLDPADPG Sbjct: 121 MGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAATKKLTQAQNKLQSLDPADPG 180 Query: 181 YAQAEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQN 240 YAQAEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQN Sbjct: 181 YAQAEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQN 240 Query: 241 QVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEF 300 QVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEF Sbjct: 241 QVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEF 300 Query: 301 QEETRKAEETNRIMGCIGKVLGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAA 360 QEETRKAEETNRIMGCIGKVLGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAA Sbjct: 301 QEETRKAEETNRIMGCIGKVLGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAA 360 Query: 361 TGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMV 420 TGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMV Sbjct: 361 TGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMV 420 Query: 421 AVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLG 480 AVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLG Sbjct: 421 AVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLG 480 Query: 481 NVGSKMGLQTNALSKELVGNTLNKVALGMEVTNTAAQSAGGVAEGVFIKNASEALADFML 540 NVGSKMGLQTNALSKELVGNTLNKVALGMEVTNTAAQSAGGVAEGVFIKNASEALADFML Sbjct: 481 NVGSKMGLQTNALSKELVGNTLNKVALGMEVTNTAAQSAGGVAEGVFIKNASEALADFML 540 Query: 541 ARFAMDQIQQWLKQSVEIFGENQKVTAELQKAMSSAVQQNADASRFILRQSRA 593 ARFAMDQIQQWLKQSVEIFGENQKVTAELQKAMSSAVQQNADASRFILRQSRA Sbjct: 541 ARFAMDQIQQWLKQSVEIFGENQKVTAELQKAMSSAVQQNADASRFILRQSRA 593
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 128 bits (322), Expect = 2e-40 Identities = 39/160 (24%), Positives = 72/160 (45%), Gaps = 4/160 (2%) Query: 4 QNNVSEERVAEMIWDAVSEGATLKDVHGIPQDMMDGLYAHAYEFYNQGRLDEAETFFRFL 63 Q + + + G T+ ++ I D ++ LY+ A+ Y G+ ++A F+ L Sbjct: 3 QETTDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQAL 62 Query: 64 CIYDFYNPDYTMGLAAVCQLKKQFQKACDLYAVAFTLLKNDYRPVFFTGQCQLLMRKAAK 123 C+ D Y+ + +GL A Q Q+ A Y+ + + R F +C L + A+ Sbjct: 63 CVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAE 122 Query: 124 ARQCF----ELVNERTEDESLRAKALVYLEALKTAETEQH 159 A EL+ ++TE + L + LEA+K + +H Sbjct: 123 AESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKEMEH 162
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 340 bits (875), Expect = e-118 Identities = 120/360 (33%), Positives = 205/360 (56%), Gaps = 19/360 (5%) Query: 1 MSSNKTEKPTKKRLEDSAKKGQSFKSKDLIIACLTLGGIAYLVSYGSFN-EFMGIIKIII 59 MS KTE+PT K++ D+ KKGQ KSK+++ L + A L+ + E + +I Sbjct: 1 MSGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIP 60 Query: 60 ADNFDQSMADYSLAVFGIGLKYLIPFMLLCL---VCSALPAL----LQAGFVLATEALKP 112 +QS +S A+ + L+ F LC +AL A+ +Q GF+++ EA+KP Sbjct: 61 ---AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKP 117 Query: 113 NLSALNPVEGAKKLFSMRTVKDTVKTLLYLSSFVVAAIICWKKYKVEIFSQLNGNIVGIA 172 ++ +NP+EGAK++FS++++ + +K++L + V+ +I+ W K + + L GI Sbjct: 118 DIKKINPIEGAKRIFSIKSLVEFLKSILKV---VLLSILIWIIIKGNLVTLLQLPTCGIE 174 Query: 173 VIWRELLLALVLTCLACA---LIVLLLDAIAEYFLTMKDMKMDKEEVKREMKEQEGNPEV 229 I L L + C +++ + D EY+ +K++KM K+E+KRE KE EG+PE+ Sbjct: 175 CITPLLGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEI 234 Query: 230 KSKRREVHMEILSEQVKSDIENSRLIVANPTHITIGIYFKPELMPIPMISVYETNQRALA 289 KSKRR+ H EI S ++ +++ S ++VANPTHI IGI +K P+P+++ T+ + Sbjct: 235 KSKRRQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQT 294 Query: 290 VRAYAEKVGVPVIVDIKLARSLFKTHRRYDLVSLEEIDEVLRLLVWLE--EVENAGKDVI 347 VR AE+ GVP++ I LAR+L+ + E+I+ +L WLE +E +++ Sbjct: 295 VRKIAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQHSEML 354
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 188 bits (478), Expect = 3e-61 Identities = 48/237 (20%), Positives = 104/237 (43%), Gaps = 4/237 (1%) Query: 12 LVASAALGFARVAPIFFFLPFLNSGVLSGAPRNAIIILVALGVWPHALNEAPPFLSVAMI 71 + RV + P L+ + + + +++ + P P S + Sbjct: 12 WLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFFAL 71 Query: 72 PLVLQEAAVGVMLGCLLSWPFWVMHALGCIIDNQRGATLSSSIDPANGIDTSEMANFLNM 131 L +Q+ +G+ LG + + F + G II Q G + ++ +DPA+ ++ +A ++M Sbjct: 72 WLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDM 131 Query: 132 FAAVVYLQNGGLVTMVDVLNKSYQLCDPMNEC--TPSLPPLLTFINQVAQNALVLASPVV 189 A +++L G + ++ +L ++ E + + L + + N L+LA P++ Sbjct: 132 LALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALPLI 191 Query: 190 LVLLLSEVFLGLLSRFAPQMNAFAISLTVKSGIAVLIMLLYFS--PVLPDNVLRLSF 244 +LL + LGLL+R APQ++ F I + + + +M +++ F Sbjct: 192 TLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIF 248
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 88.7 bits (220), Expect = 4e-27 Identities = 86/86 (100%), Positives = 86/86 (100%) Query: 1 MDDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCL 60 MDDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCL Sbjct: 1 MDDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCL 60 Query: 61 FLLSGWYGEVLLSYGRQVIFLALAKG 86 FLLSGWYGEVLLSYGRQVIFLALAKG Sbjct: 61 FLLSGWYGEVLLSYGRQVIFLALAKG 86
>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature. Length = 224 Score = 303 bits (777), Expect = e-107 Identities = 224/224 (100%), Positives = 224/224 (100%) Query: 1 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS 60 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS Sbjct: 1 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS 60 Query: 61 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL 120 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL Sbjct: 61 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL 120 Query: 121 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL 180 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL Sbjct: 121 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL 180 Query: 181 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQYMDIAT 224 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQYMDIAT Sbjct: 181 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQYMDIAT 224
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 537 bits (1384), Expect = 0.0 Identities = 301/303 (99%), Positives = 303/303 (100%) Query: 1 MSLRVRQIDRREWLLAQTATECQRHGREATLEYPTRQGMWVRLSDAEKRWSAWIQPGDWL 60 MSLRVRQIDRREWLLAQTATECQRHGREATLEYPTRQGMWVRLSDAEKRWSAWI+PGDWL Sbjct: 1 MSLRVRQIDRREWLLAQTATECQRHGREATLEYPTRQGMWVRLSDAEKRWSAWIKPGDWL 60 Query: 61 EHVSPALAGAAVSAGAEHLVVPWLAATERPFELPVPHLSCRRLCVENPVPGSALPEGKLL 120 EHVSPALAGAAVSAGAEHLVVPWLAATERPFELPVPHLSCRRLCVENPVPGSALPEGKLL Sbjct: 61 EHVSPALAGAAVSAGAEHLVVPWLAATERPFELPVPHLSCRRLCVENPVPGSALPEGKLL 120 Query: 121 HIMSDRGGLWFEYLPELPAVGGGRPKMLRWPLRFVIGSSDTQRSLLGRIGIGDVLLIRTS 180 HIMSDRGGLWFE+LPELPAVGGGRPKMLRWPLRFVIGSSDTQRSLLGRIGIGDVLLIRTS Sbjct: 121 HIMSDRGGLWFEHLPELPAVGGGRPKMLRWPLRFVIGSSDTQRSLLGRIGIGDVLLIRTS 180 Query: 181 RAEVYCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYR 240 RAEVYCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYR Sbjct: 181 RAEVYCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYR 240 Query: 241 KNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHEWLSESG 300 KNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHEWLSESG Sbjct: 241 KNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHEWLSESG 300 Query: 301 NGE 303 NGE Sbjct: 301 NGE 303
>SSPANPROTEIN#Salmonella invasion protein InvJ signature. Length = 336 Score = 600 bits (1547), Expect = 0.0 Identities = 333/336 (99%), Positives = 334/336 (99%) Query: 1 MGDVSAVSSSGNILLPQQDEVGGLSEALKKAVEKHKTEYSGDKKDRDYGDAFVMHKETAL 60 MGDVSAVSSSGNILLPQQDEVGGLSEALKKAVEKHKTEYSGDKKDRDYGDAFVMHKETAL Sbjct: 1 MGDVSAVSSSGNILLPQQDEVGGLSEALKKAVEKHKTEYSGDKKDRDYGDAFVMHKETAL 60 Query: 61 PVLLAAWRHGAPAKSEHHNGNVSGLHHNGKGELRIAEKLLKVTAEKSVGLISAEAKVDKS 120 P+LLAAWRHGAPAKSEHHNGNVSGLHHNGK ELRIAEKLLKVTAEKSVGLISAEAKVDKS Sbjct: 61 PLLLAAWRHGAPAKSEHHNGNVSGLHHNGKSELRIAEKLLKVTAEKSVGLISAEAKVDKS 120 Query: 121 AALLSPKNRPLESVSGKKLSADLKAVESVSEVTDNATGISDDNIKALPGDNKAIAGEGVR 180 AALLS KNRPLESVSGKKLSADLKAVESVSEVTDNATGISDDNIKALPGDNKAIAGEGVR Sbjct: 121 AALLSSKNRPLESVSGKKLSADLKAVESVSEVTDNATGISDDNIKALPGDNKAIAGEGVR 180 Query: 181 KEGAPLARDVAPARMAAANTGKPEDKDHKKVKDVSQLPLQPTTIADLSQLTGGDEKMPLA 240 KEGAPLARDVAPARMAAANTGKPEDKDHKKVKDVSQLPLQPTTIADLSQLTGGDEKMPLA Sbjct: 181 KEGAPLARDVAPARMAAANTGKPEDKDHKKVKDVSQLPLQPTTIADLSQLTGGDEKMPLA 240 Query: 241 AQSKPMMTIFPTADGVKGEDSSLTYRFQRWGNDYSVNIQARQAGEFSLIPSNTQVEHRLH 300 AQSKPMMTIFPTADGVKGEDSSLTYRFQRWGNDYSVNIQARQAGEFSLIPSNTQVEHRLH Sbjct: 241 AQSKPMMTIFPTADGVKGEDSSLTYRFQRWGNDYSVNIQARQAGEFSLIPSNTQVEHRLH 300 Query: 301 DQWQNGNPQRWHLTRDDQQNPQQQQHRQQSGEEDDA 336 DQWQNGNPQRWHLTRDDQQNPQQQQHRQQSGEEDDA Sbjct: 301 DQWQNGNPQRWHLTRDDQQNPQQQQHRQQSGEEDDA 336
>SSPAMPROTEIN#Salmonella surface presentation of antigen gene type M signature. Length = 147 Score = 169 bits (429), Expect = 3e-57 Identities = 141/147 (95%), Positives = 143/147 (97%) Query: 1 MHSLTRIKVLQRRCTVFHSQCESILLRYQDEDRGLQAEEEAILEQIAGLKLLLDTLRAEN 60 MHSLTRIKVLQRRCTVFHSQCESILLRYQDEDR LQ EEEAI+EQIAGLKLLLDTLRAEN Sbjct: 1 MHSLTRIKVLQRRCTVFHSQCESILLRYQDEDRRLQVEEEAIVEQIAGLKLLLDTLRAEN 60 Query: 61 RQLSREEIYTLLRKQSIVRRQIKDLELQIIQIQEKRSELEKKREEFQKKSKYWLRKEGNY 120 RQLSREEIY LLRKQSIVRRQIKDLELQIIQIQEKRSELEKKREEFQ+KSKYWLRKEGNY Sbjct: 61 RQLSREEIYALLRKQSIVRRQIKDLELQIIQIQEKRSELEKKREEFQEKSKYWLRKEGNY 120 Query: 121 QRWIIRQKRFYIQREIQQEEAESEEII 147 QRWIIRQKR YIQREIQQEEAESEEII Sbjct: 121 QRWIIRQKRLYIQREIQQEEAESEEII 147
>SSPAKPROTEIN#Invasion protein B family signature. Length = 133 Score = 114 bits (286), Expect = 8e-37 Identities = 21/76 (27%), Positives = 37/76 (48%) Query: 1 MGADSMVVLQQRAYEILMTIMEGCHFARGGQLLLGEQNGELTLKALVHPDFLSDGEKFST 60 A S V LQ AY IL ++ ++ + L + L L+ ++ D++ DG F+ Sbjct: 58 FDAPSDVKLQSSAYNILNLMLMNFSYSINELVELHRSDEYLQLRVVIKDDYVHDGIVFAE 117 Query: 61 ALNGFYNYLEVFSRSL 76 L+ FY +E+ + L Sbjct: 118 ILHEFYQRMEILNGVL 133
>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE) signature. Length = 372 Score = 604 bits (1558), Expect = 0.0 Identities = 371/372 (99%), Positives = 371/372 (99%) Query: 1 MIPGSTSGISFSRILSRQTSHQDATQHTDAQQAEIQQAAEDSSPGAEVQKFVQSTDEMSA 60 MIPGSTSGISFSRILSRQ SHQDATQHTDAQQAEIQQAAEDSSPGAEVQKFVQSTDEMSA Sbjct: 1 MIPGSTSGISFSRILSRQASHQDATQHTDAQQAEIQQAAEDSSPGAEVQKFVQSTDEMSA 60 Query: 61 ALAQFRNRRDYEKKSSNLSNSFERVLEDEALPKAKQILKLISVHGGALEDFLRQARSLFP 120 ALAQFRNRRDYEKKSSNLSNSFERVLEDEALPKAKQILKLISVHGGALEDFLRQARSLFP Sbjct: 61 ALAQFRNRRDYEKKSSNLSNSFERVLEDEALPKAKQILKLISVHGGALEDFLRQARSLFP 120 Query: 121 DPSDLVLVLRELLRRKDLEEIVRKKLESLLKHVEEQTDPKTLKAGINCALKARLFGKTLS 180 DPSDLVLVLRELLRRKDLEEIVRKKLESLLKHVEEQTDPKTLKAGINCALKARLFGKTLS Sbjct: 121 DPSDLVLVLRELLRRKDLEEIVRKKLESLLKHVEEQTDPKTLKAGINCALKARLFGKTLS 180 Query: 181 LKPGLLRASYRQFIQSESHEVEIYSDWIASYGYQRRLVVLDFIEGSLLTDIDANDASCSR 240 LKPGLLRASYRQFIQSESHEVEIYSDWIASYGYQRRLVVLDFIEGSLLTDIDANDASCSR Sbjct: 181 LKPGLLRASYRQFIQSESHEVEIYSDWIASYGYQRRLVVLDFIEGSLLTDIDANDASCSR 240 Query: 241 LEFGQLLRRLTQLKMLRSADLLFVSTLLSYSFTKAFNAEESSWLLLMLSLLQQPHEVDSL 300 LEFGQLLRRLTQLKMLRSADLLFVSTLLSYSFTKAFNAEESSWLLLMLSLLQQPHEVDSL Sbjct: 241 LEFGQLLRRLTQLKMLRSADLLFVSTLLSYSFTKAFNAEESSWLLLMLSLLQQPHEVDSL 300 Query: 301 LADIIGLNALLLSHKEHASFLQIFYQVCKAIPSSLFYEEYWQEELLMALRSMTDIAYKHE 360 LADIIGLNALLLSHKEHASFLQIFYQVCKAIPSSLFYEEYWQEELLMALRSMTDIAYKHE Sbjct: 301 LADIIGLNALLLSHKEHASFLQIFYQVCKAIPSSLFYEEYWQEELLMALRSMTDIAYKHE 360 Query: 361 MAEQRRTIEKLS 372 MAEQRRTIEKLS Sbjct: 361 MAEQRRTIEKLS 372
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 564 bits (1456), Expect = 0.0 Identities = 166/534 (31%), Positives = 269/534 (50%), Gaps = 57/534 (10%) Query: 1 MLACAALVLVAPGYSSE----KIPVTGSGFVAKDDSLRTFFDAMALQLKEPVIVSKMAAR 56 +L L+L + ++ E IP +VAK +SLR V+VS Sbjct: 12 VLTGTLLLLSSYSWAQELDWLPIPYV---YVAKGESLRDLLTDFGANYDATVVVSD-KIN 67 Query: 57 KKITGNFEFHDPNALLEKLSLQLGLIWYFDGQAIYIYDASEMRNAVVSLRNVSLNEFNNF 116 K++G FE +P L+ ++ L+WY+DG +YI+ SE+ + ++ L+ E Sbjct: 68 DKVSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQA 127 Query: 117 LKRSGLYNKNYPLRGDNRKGTFYVSGPPVYVDMVVNAATMMDKQND--GIELGRQKIGVM 174 L+RSG++ + R D YVSGPP Y+++V A +++Q + G I + Sbjct: 128 LQRSGIWEPRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEIF 187 Query: 175 RLNNTFVGDRTYNLRDQKMVIPGIATAIERLLQGEEQPLGNIVSSEPPAMPAFSANGEKG 234 L DRT + RD ++ PG+AT ++R+L + + P Sbjct: 188 PLKYASASDRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIP------------ 235 Query: 235 KAANYAGGMSLQEALKQNAAAGNIKIVAYPDTNSLLVKGTAEQVHFIEMLVKALDVAKRH 294 Q A + +A A ++ A P N+++V+ + E++ + L+ ALD Sbjct: 236 -----------QAATRASAQA---RVEADPSLNAIIVRDSPERMPMYQRLIHALDKPSAR 281 Query: 295 VELSLWIVDLNKSDLERLGTSWSGSI-----------TIGDKLGVSLNQSSISTLDG--- 340 +E++L IVD+N L LG W I T GD+ ++ N + S +D Sbjct: 282 IEVALSIVDINADQLTELGVDWRVGIRTGNNHQVVIKTTGDQSNIASNGALGSLVDARGL 341 Query: 341 SRFIAAVNALEEKKQATVVSRPVLLTQENVPAIFDNNRTFYTKLIGERNVALEHVTYGTM 400 +A VN LE + A VVSRP LLTQEN A+ D++ T+Y K+ G+ L+ +TYGTM Sbjct: 342 DYLLARVNLLENEGSAQVVSRPTLLTQENAQAVIDHSETYYVKVTGKEVAELKGITYGTM 401 Query: 401 IRVLPRFSADG---QIEMSLDIEDGNDKTPQSDTTTSVDALPEVGRTLISTIARVPHGKS 457 +R+ PR G +I ++L IEDGN Q ++ ++ +P + RT++ T+ARV HG+S Sbjct: 402 LRMTPRVLTQGDKSEISLNLHIEDGN----QKPNSSGIEGIPTISRTVVDTVARVGHGQS 457 Query: 458 LLVGGYTRDANTDTVQSIPFLGKLPLIGSLFRYSSKNKSNVVRVFMIEPKEIVD 511 L++GG RD + + +P LG +P IG+LFR S+ VR+F+IEP+ I + Sbjct: 458 LIIGGIYRDELSVALSKVPLLGDIPYIGALFRRKSELTRRTVRLFIIEPRIIDE 511
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 83.4 bits (206), Expect = 1e-19 Identities = 67/387 (17%), Positives = 143/387 (36%), Gaps = 48/387 (12%) Query: 16 FLDLINLFIASVAFPAMSVDLHTSISALAWVSNGYIAGLTLIVPFSAFLSRYLGARRLII 75 F ++N + +V+ P ++ D + ++ WV+ ++ ++ LS LG +RL++ Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 76 FSLILFSVAAAAAGFADSLHS-LVFWRIVQGAGGGLLIPVGQALTWQQFKPHERAGVSSV 134 F +I+ + S S L+ R +QGAG + + + R + Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 135 VMMVALLAPACSPAIGGLLVETCGWRWIFFATLPVAVLTLLLAYRWLNAASTT------- 187 + + + PAIGG++ W ++ + + L Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKG 203 Query: 188 --------------MASARLLHL-------------------PLLTDRLLRFAMIVYLCV 214 S + L P + L + + + Sbjct: 204 IILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVL 263 Query: 215 PGMFIGISVVGM-----FYLQNVAQLSPAAAGS-LMLPWSIASFVAIMLTGRYFNRLGPR 268 G I +V G + +++V QLS A GS ++ P +++ + + G +R GP Sbjct: 264 CGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPL 323 Query: 269 PLIIVGCLLQAAGILLLTNVTPATSHRVLMMIFALMGAGGSLCSSTAQSGAFLTIARRDM 328 ++ +G + L + + TS + ++I ++G G S + + ++ +++ Sbjct: 324 YVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTVISTIVSSSLKQQEA 382 Query: 329 PDASALWNLNRQLSFFLGAALLTLLLN 355 +L N LS G A++ LL+ Sbjct: 383 GAGMSLLNFTSFLSEGTGIAIVGGLLS 409
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 84.4 bits (209), Expect = 9e-21 Identities = 55/217 (25%), Positives = 92/217 (42%), Gaps = 31/217 (14%) Query: 1 MQIIITGGGGFLGQKLASALLNSSL------AFNELLLVDLKMPARLS--DSPRLRCLEA 52 M+ ++TG GF+G ++ LL + N+ V LK ARL P + + Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQ-ARLELLAQPGFQFHKI 59 Query: 53 DLT-QPGVLENVITANTSVVYHLAA-------IVSSHAEDDFDLGWKVNLDLTRQLLEAC 104 DL + G+ + + + V+ + + HA D NL +LE C Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYAD------SNLTGFLNILEGC 113 Query: 105 RRQPQKIRFVFSSSLAVYGG--TLPECVTDTTALTPRSSYGAQKAACELLVNDYTRKGYV 162 R + +++SS +VYG +P D+ P S Y A K A EL+ + Y+ + Sbjct: 114 RHNKIQ-HLLYASSSSVYGLNRKMPFSTDDSVD-HPVSLYAATKKANELMAHTYSHLYGL 171 Query: 163 DGLALRLPTICVRPGKPNRAASSFVSAIIREPLQGET 199 LR T+ G+P+ A F A+ L+G++ Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAM----LEGKS 204
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.3 bits (71), Expect = 0.005 Identities = 17/84 (20%), Positives = 36/84 (42%), Gaps = 12/84 (14%) Query: 240 IVATADGRVVYAGNALRGYGNLIIIKHNDDYLSAYAHNDTMLVREQQEVKAGQKIATMGS 299 IVATA+G++ ++G + IK ++ ++V+E + V+ G + + + Sbjct: 82 IVATANGKLTHSGRSK-------EIKP---IENSIV--KEIIVKEGESVRKGDVLLKLTA 129 Query: 300 TGTSSTRLHFEIRYKGKSVNPLRY 323 G + L + + RY Sbjct: 130 LGAEADTLKTQSSLLQARLEQTRY 153
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 63.3 bits (154), Expect = 1e-12 Identities = 48/138 (34%), Positives = 63/138 (45%), Gaps = 20/138 (14%) Query: 36 VDDGKSTLIGRLLHDTLQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITI 95 VD GK+TL LL+++ I +L S+ + R D ER++GITI Sbjct: 12 VDAGKTTLTESLLYNSGAI--TELGSVDKGTTR-------------TDNTLLERQRGITI 56 Query: 96 DVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCDLAILLIDARKGVLDQTRRHSFIS 155 F E K I DTPGH + + S D AILLI A+ GV QTR Sbjct: 57 QTGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTR--ILFH 114 Query: 156 TL--LGIKHLVVAINKMD 171 L +GI + INK+D Sbjct: 115 ALRKMGIPTIFF-INKID 131
>PF07675#Cleaved Adhesin Length = 1358 Score = 30.8 bits (69), Expect = 0.020 Identities = 20/92 (21%), Positives = 39/92 (42%), Gaps = 12/92 (13%) Query: 206 ILGQTYLPRKFKTTVVIP---PQND--IDLHANDMNFVAIAENGKLVGFNLLVGGGLSIE 260 ++ +P+ T +P PQN + A+ ++VAI+++G L G + G++ Sbjct: 240 VMPYRAMPKT--NTYTLPASLPQNQASYSIQASAGSYVAISKDGVLYGTGVANASGVATV 297 Query: 261 HGNK-----KTYARTASEFGYLPLEHTLAVAE 287 + K Y + YLP+ + E Sbjct: 298 NMTKQITENGNYDVVITRSNYLPVIKQIQAGE 329
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 628 bits (1620), Expect = 0.0 Identities = 231/856 (26%), Positives = 381/856 (44%), Gaps = 66/856 (7%) Query: 19 SQATEFNASLLDSGNLSNVDLTAFSREGYVAPGNYILDIWLNDQPVREQYPVRVVPVAGL 78 S FN L + DL+ F + PG Y +DI+LN+ + + V Sbjct: 44 SAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGYMATRD-VTFNTGDSE 102 Query: 79 DAAVICVTTDMVAMLGLKDKIIHGLKPVTGIPDGQCLELRSA--DSQVRYSAENQRLTFI 136 V C+T +A +GL + + + D C+ L S D+ + QRL Sbjct: 103 QGIVPCLTRAQLASMGLNTA---SVSGMNLLADDACVPLTSMIHDATAQLDVGQQRLNLT 159 Query: 137 IPQAWMRYQDPDWVPPSRWSDGVTAGLLDYSLMVNRYMPQQGETSTSYSLYGTAGFNLGA 196 IPQA+M + ++PP W G+ AGLL+Y+ N + G S L +G N+GA Sbjct: 160 IPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGA 219 Query: 197 WRLRSDYQYSRFDS-GQGASQSDFYLPQTYLFRALPALRSKLTLGQTYLSSAIFDSFRFA 255 WRLR + +S S S++ + T+L R + LRS+LTLG Y IFD F Sbjct: 220 WRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFR 279 Query: 256 GLTLASDERMLPPSLQGYAPKISGIANSNAQVTVSQNGRILYQTRVSPGPFELPDLSQ-N 314 G LASD+ MLP S +G+AP I GIA AQVT+ QNG +Y + V PGPF + D+ Sbjct: 280 GAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAG 339 Query: 315 ISGNLDVSVRESDGSVRTWQVNTASVPFMARQGQVRYKVAAGRPLYGGTHNNSTVSPDFL 374 SG+L V+++E+DGS + + V +SVP + R+G RY + AG G N P F Sbjct: 340 NSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSG---NAQQEKPRFF 396 Query: 375 LGEATWGAFNNTSLYGGLIASTGDYRSAALGIGQNMGLLGALSADVTRSDARLPHGQKQS 434 G ++YGG + YR+ GIG+NMG LGALS D+T++++ LP + Sbjct: 397 QSTLLHGLPAGWTIYGGTQLA-DRYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHD 455 Query: 435 GYSYRINYAKTFDKTGSTLAFVGYRFSDRHFLSMPEYLQRRATDGGD------------- 481 G S R Y K+ +++G+ + VGYR+S + + + R Sbjct: 456 GQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKF 515 Query: 482 ------AWHEKQSYTVTYSQSVPVLNMSAALSVSRLNYWNAQ-SNNNYMLSLNKVFSLGD 534 A++++ +T +Q + + LS S YW + + LN F Sbjct: 516 TDYYNLAYNKRGKLQLTVTQQLG-RTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAF---- 570 Query: 535 LQGLSASVSFARNQYTGG-GSQNQVYATISIPWGDSR-----------QVSYSVQKDNRG 582 + ++ ++S++ + G + ++IP+ SYS+ D G Sbjct: 571 -EDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNG 629 Query: 583 GLQQTVNYSD--FHNPDTTWNISAGHNRYDTGSN-SSFSGSVQSRLPWGQAAADATLQPG 639 + + + ++++ G+ G++ S+ ++ R +G A + Sbjct: 630 RMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDD 689 Query: 640 QYRSLGLSWYGSVTATAHGAAFSQSMAGNEPRMMIDTGDVAGVPVNGNSGV-TNRFGVGV 698 + L G V A A+G Q + N+ +++ V +GV T+ G V Sbjct: 690 -IKQLYYGVSGGVLAHANGVTLGQPL--NDTVVLVKAPGAKDAKVENQTGVRTDWRGYAV 746 Query: 699 VSAGSSYRRSDISVDVAALPEDVDVSSSVISQVLTEGAVGYRKIDASQGEQVLGHIRLAD 758 + + YR + +++D L ++VD+ ++V + V T GA+ + A G ++L + + Sbjct: 747 LPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLT-HN 805 Query: 759 GASPPFGALVVSGKTGRTAGMVGDGGLAYLTGLSGEDRRTLNVSW--DGRVQCRLTLPET 816 PFGA+V S +++G+V D G YL+G+ + V W + C Sbjct: 806 NKPLPFGAMVTSES-SQSSGIVADNGQVYLSGM--PLAGKVQVKWGEEENAHCVANYQLP 862 Query: 817 VTLSRGPL---LLPCR 829 + L CR Sbjct: 863 PESQQQLLTQLSAECR 878
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 97.7 bits (243), Expect = 5e-28 Identities = 52/183 (28%), Positives = 77/183 (42%), Gaps = 17/183 (9%) Query: 15 MNKMLLAGSAGIVLLSAAASPVWADDNASTFSLGYAQSH-TNHAGTLRGVRLANNYEMSP 73 M K+ SA +L+ A A ST + GYAQS + G L YE Sbjct: 1 MKKIACL-SALAAVLAFTAGTSVAA--TSTVTGGYAQSDAQGQMNKMGGFNLKYRYEEDN 57 Query: 74 D-WGLTTSFAWLNGSQRYSDESSNGRVTTRYYSLLAGPSWKINNQLSLYSQVGPVLLHQR 132 G+ SF + S SS +YY + AGP+++IN+ S+Y VG + Sbjct: 58 SPLGVIGSFTYTEKS---RTASSGDYNKNQYYGITAGPAYRINDWASIYGVVGVGYGKFQ 114 Query: 133 DH---GINESDSKVGYGYSAGVAYTPVSSVAITLGYEGADFDATHNSGSLNSNGFNLGVG 189 S G+ Y AG+ + P+ +VA+ YE + S++ + GVG Sbjct: 115 TTEYPTYKHDTSDYGFSYGAGLQFNPMENVALDFSYEQS------RIRSVDVGTWIAGVG 168 Query: 190 YRF 192 YRF Sbjct: 169 YRF 171
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 29.4 bits (66), Expect = 0.011 Identities = 38/160 (23%), Positives = 70/160 (43%), Gaps = 12/160 (7%) Query: 11 LVLIVIAIAINMIGGQLISMLKLPIFLDSIGTLISAVLLGPFIGMLTGLLTNLLWGLLTD 70 LV +V+ + + + LI + +P+ L +GT G I LT L GLL D Sbjct: 350 LVFLVMYLFLQNMRATLIPTIAVPVVL--LGTFAILAAFGYSINTLTMFGMVLAIGLLVD 407 Query: 71 PIAAAFAPVAMVIGLVAGWLARAGWFRTLPKVIVSGVVITLAVTLVAVPLRTALFGGVTG 130 V V+ + + +++ ++ G ++ +A+ L AV + A FGG TG Sbjct: 408 DAIVVVENVERVM-MEDKLPPKEATEKSMSQI--QGALVGIAMVLSAVFIPMAFFGGSTG 464 Query: 131 SGADLFVAWMHSMGQNLVESVAITVIGANLVDKILTAIIV 170 A +V ++A++V+ A ++ L A ++ Sbjct: 465 -------AIYRQFSITIVSAMALSVLVALILTPALCATLL 497
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.1 bits (62), Expect = 0.028 Identities = 10/21 (47%), Positives = 13/21 (61%) Query: 32 LALTGDNGAGKSTLLRIMAGL 52 + L G G GKSTL+ + GL Sbjct: 599 VVLEGTGGIGKSTLINTLVGL 619
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 49.7 bits (118), Expect = 2e-08 Identities = 40/238 (16%), Positives = 76/238 (31%), Gaps = 8/238 (3%) Query: 197 PNNAFDAEGLTKLTQETERRRRERNEVEQDVEVAVREKNRDALERKLEIEQQEAFMTLEQ 256 N A+ + + E R A + E E +QE+ + Sbjct: 999 TPNNIQAD-VPSVPSNNEEIARVDEAPVPPPAPATPSETT---ETVAENSKQESKTVEKN 1054 Query: 257 EQQVKTRTAEQNAKIAAFEAERHREAE-QTRILAERQIQETEIEREQAVRSRKVEAEREV 315 EQ TA+ + A EA+ + +A QT +A+ + E + + + VE E + Sbjct: 1055 EQDATETTAQN--REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKA 1112 Query: 316 RIKEIEQQQVTEIANQTKSIAIAAKSEQQSQAEARANDALADAVRAQ-QNVETTRQTAEA 374 +++ + Q+V ++ +Q +++ Q AR ND + Q Q T A Sbjct: 1113 KVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPA 1172 Query: 375 DRAKQVALIAAAQDAETKAVELTVRAKAEKEAAELQAAAIIELAEATRKKGLAEAEAQ 432 + V A Q E + + + + Sbjct: 1173 KETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSV 1230
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 29.0 bits (65), Expect = 0.027 Identities = 10/37 (27%), Positives = 20/37 (54%) Query: 347 GVFDILHAGHVSYLANARKLGDRLIVAVNSDASTKRL 383 G FD + GH+ + +L D++ VAV + + + + Sbjct: 7 GSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPM 43
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 533 bits (1375), Expect = 0.0 Identities = 175/400 (43%), Positives = 261/400 (65%), Gaps = 12/400 (3%) Query: 7 VLVINCGSSSIKFSVLDVATCDVLMAGIADGMNTENAFLSI--NGDK-PINLAHSNYEDA 63 +LVINCGSSS+K+ +++ +VL G+A+ + ++ L+ NG+K I +++DA Sbjct: 3 ILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKDA 62 Query: 64 LKAIAFELEKRDL-----TDSVALIGHRIAHGGELFTQSVIITDEIIDNIRRVSPLAPLH 118 +K + L D + +GHR+ HGGE FT SV+ITD+++ I LAPLH Sbjct: 63 IKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIELAPLH 122 Query: 119 NYANLSGIDAARRLFPAVRQVAVFDTSFHQTLAPEAYLYGLPWEYFSSLGVRRYGFHGTS 178 N AN+ GI A ++ P V VAVFDT+FHQT+ AYLY +P+EY++ +R+YGFHGTS Sbjct: 123 NPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFHGTS 182 Query: 179 HRYVSRRAYELLDLDEKNSGLIVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSG 238 H+YVS+RA E+L+ ++ +I HLGNG+SI AV+NG+S+DTSMG TPLEGL MGTRSG Sbjct: 183 HKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGTRSG 242 Query: 239 DVDFGAMAWIAKETGQTLSDLERVVNKESGLLGISGLSSDLR-VLEKAWHEGHERARLAI 297 +D ++++ ++ + ++ ++NK+SG+ GISG+SSD R + + A+ G +RA+LA+ Sbjct: 243 SIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQLAL 302 Query: 298 KTFVHRIARHIAGHAASLHRLDGIIFTGGIGENSVLIRQLVIEHLGVLGLTLDVEMNKQP 357 F +R+ + I +AA++ +D I+FT GIGEN IR+ +++ L LG LD E NK Sbjct: 303 NVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKNKVR 362 Query: 358 NSHGERIISANPSQVICAVIPTNEEKMIALDAIHL-GNVK 396 E IIS S+V V+PTNEE MIA D + ++K Sbjct: 363 GE--EAIISTADSKVNVMVVPTNEEYMIAKDTEKIVESLK 400
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 35.1 bits (81), Expect = 2e-04 Identities = 20/100 (20%), Positives = 36/100 (36%), Gaps = 15/100 (15%) Query: 144 KNITIIVQIESQLGVDNVDAIAATEGVDGIFVGPSDLA----------AALGHLGNASHP 193 +I + + +E + A + VD +G +DL + +L HP Sbjct: 423 DSIEVGIMVEIPSTAVAANLFA--KEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHP 480 Query: 194 DVQQTIQHIFARAKAHGKP---CGILAPVEADARRYLEWG 230 + + + + A + GK CG +A E L G Sbjct: 481 AILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLG 520
>PF03309#Bvg accessory factor Length = 271 Score = 30.1 bits (68), Expect = 0.008 Identities = 15/64 (23%), Positives = 24/64 (37%), Gaps = 3/64 (4%) Query: 4 LAIDIGGTKLAAALIDNN---LRISQRRELPTPASKTPDALREALKALVEPLRAEARQVA 60 LAID+ T LI + ++ Q+ + T T D L + L+ + Sbjct: 3 LAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTADELALTIDGLIGDDAERLTGAS 62 Query: 61 IAST 64 ST Sbjct: 63 GLST 66
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 58.7 bits (142), Expect = 2e-11 Identities = 80/455 (17%), Positives = 159/455 (34%), Gaps = 46/455 (10%) Query: 30 LLDGFDFVLIALVLTEVQSEFGLTTVQAASLISAAFISRWFGGLLLGAMGDRYGRRLAMV 89 + +++ + L ++ ++F + +A ++ G + G + D+ G + ++ Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 90 SSIILFSVGTLACGFAPGYTTMFI-ARLVIGMGMAGEYGSSATYVIESWPKHLRNKASGF 148 II+ G++ + ++ I AR + G G A V PK R KA G Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 149 LISGFSVGAVVAAQVYSLVVPVWGWRALFFIGILPIIFALWLRKNIPEAEDWKEKHAGKA 208 + S ++G V + ++ W L I ++ II +L K + + Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVR--------- 194 Query: 209 PVRTMVDILYRGEHRIINILMTFAAAAALWFCFAGNLQNAAIVAGLGLLCAVIFISFMVQ 268 +G I I++ + IV+ L +IF+ + + Sbjct: 195 ---------IKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSF---LIFVKHIRK 242 Query: 269 SSGK----RWPTGVMLMLVVLFAFLYSWPIQA---LLPTYLKTELAYDPHTVANVLFFSG 321 + + M+ VL + + ++P +K + +V+ F G Sbjct: 243 VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302 Query: 322 -FGAAVGCCVGGFLGDWLGTRK-AYVCSLLASQILIIPVFAIGGTNVWVLGLLLFFQQML 379 + +GG L D G + S + F + T W + +++ F Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETT-SWFMTIIIVFVLGG 361 Query: 380 GQGIAGILPKLIGGYFDTDQRAAGLGFTYNVGALGGALAP-ILGALIA-----QRL---- 429 ++ ++ + AG+ L I+G L++ QRL Sbjct: 362 LSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPME 421 Query: 430 -DLGTALAS---LSFSLTFVVILLIGLDMPSRVQR 460 D T L S L FS V+ L+ L++ QR Sbjct: 422 VDQSTYLYSNLLLLFSGIIVISWLVTLNVYKHSQR 456
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 69.7 bits (170), Expect = 4e-15 Identities = 34/187 (18%), Positives = 65/187 (34%), Gaps = 32/187 (17%) Query: 90 GLGSGVIIDAAKGYVLTNNHVINQAQKISIQL------------NDGREFDAKLIGGDDQ 137 + SGV++ K +LTN HV++ L +G ++ + Sbjct: 102 FIASGVVV--GKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159 Query: 138 SDIALLQIQN-------PSKLTQIAIADSDKLRVGDFAVAVGNPFGLGQTATSGIISALG 190 D+A+++ + ++++ + +V G P ++ + Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKP-------VATMW 212 Query: 191 RSGLNLEGLEN-FIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSIGIGFAIPSN 249 S + L+ +Q D S GNSG + N E+IGI+ I N Sbjct: 213 ESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW---GGVPNEFNGAVFINEN 269 Query: 250 MAQTLAQ 256 + L Q Sbjct: 270 VRNFLKQ 276
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 51.5 bits (123), Expect = 1e-09 Identities = 31/160 (19%), Positives = 59/160 (36%), Gaps = 26/160 (16%) Query: 55 RTLGSGVIMDQRGYIITNKHVINDADQIIVALQ------------DGRVFEALLVGSDSL 102 + SGV++ + ++TNKHV++ AL+ +G + Sbjct: 101 TFIASGVVVG-KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159 Query: 103 TDLAVLKI-------NATGGLPTIPINTKRTPHIGDVVLAIGNPYNLGQTITQGIISATG 155 DLA++K + + ++ + + G P + T + G Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--ESKG 216 Query: 156 RIGLNPTGRQNFLQTDASINHGNSGGALVNSLGELMGINT 195 +I + +Q D S GNSG + N E++GI+ Sbjct: 217 KI---TYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW 253
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 33.3 bits (76), Expect = 0.003 Identities = 16/66 (24%), Positives = 27/66 (40%), Gaps = 3/66 (4%) Query: 503 AAAPAASSAPAT---APAGPGTPVTAPLAGNIWKVIAAEGQTVAEGDVLLILEAMKMETE 559 A A +G + + ++I EG++V +GDVLL L A+ E + Sbjct: 76 VLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEAD 135 Query: 560 IRAAQA 565 Q+ Sbjct: 136 TLKTQS 141 Score = 30.6 bits (69), Expect = 0.017 Identities = 16/56 (28%), Positives = 23/56 (41%), Gaps = 10/56 (17%) Query: 533 KVIAAEGQTVAEGDVLLILEAMKMETEIRAAQAGTVRGIAVKSGDAVSVGDTLMTL 588 V A G+ G EI+ + V+ I VK G++V GD L+ L Sbjct: 82 IVATANGKLTHSGRSK----------EIKPIENSIVKEIIVKEGESVRKGDVLLKL 127
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 28.5 bits (63), Expect = 0.031 Identities = 37/167 (22%), Positives = 61/167 (36%), Gaps = 27/167 (16%) Query: 3 VAVLGAAGGIGQALALLLKNQLPSGSELSLYDIAPVTPGVAVDLSHIPTAVKIKGFSGED 62 + GAA GIG+A+A L G+ ++ D P V S A + F + Sbjct: 11 AFITGAAQGIGEAVARTL---ASQGAHIAAVDYNP-EKLEKVVSSLKAEARHAEAFPADV 66 Query: 63 ATPA------------LEGADVVLISAGVARK------PGMDRSDLFNVNAGIVKNLVQQ 104 A + D+++ AGV R + F+VN+ V N + Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 105 IAKTCPK----ACVGIITNPVNTT-VAIAAEVLKKAGVYDKNKLFGV 146 ++K + V + +NP ++AA KA K G+ Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGL 173
>DNABINDNGFIS#DNA-binding protein FIS signature. Length = 98 Score = 157 bits (399), Expect = 3e-54 Identities = 98/98 (100%), Positives = 98/98 (100%) Query: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ Sbjct: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60 Query: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN Sbjct: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 128 bits (324), Expect = 2e-39 Identities = 82/216 (37%), Positives = 130/216 (60%), Gaps = 3/216 (1%) Query: 1 MAKKTKADALKTRQHLIETAIAQFALRGVANTTLNDIADAADVTRGAIYWHFENKTQLFN 60 MA+KTK +A +TRQH+++ A+ F+ +GV++T+L +IA AA VTRGAIYWHF++K+ LF+ Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EVW-LQQPPLRELIQDRLTGCWNDNPLQDLREKFIAALQYIAAVPRQQALMQILYHKCEF 119 E+W L + + EL + +PL LRE I L+ R++ LM+I++HKCEF Sbjct: 61 EIWELSESNIGELELEYQAKF-PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119 Query: 120 HNGM-ISEQAIREKMGFHHQSLLEVLQRCMDKKLISGSLDLDVILIILHGSFSGIVKNWL 178 M + +QA R + + + L+ C++ K++ L II+ G SG+++NWL Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWL 179 Query: 179 MNPTSYDLYKQAPALVDNVLKMLSPDGSVRQLMPNE 214 P S+DL K+A V +L+M ++R NE Sbjct: 180 FAPQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 42.1 bits (99), Expect = 2e-06 Identities = 24/137 (17%), Positives = 48/137 (35%), Gaps = 15/137 (10%) Query: 24 ATYQADYDSAKGELAKSEAAAAIAHLTVKRYVPLVGTKYISQQEYDQAIADA-RQADAAV 82 + K +L + E+ A + Q + I D RQ + Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLV----------TQLFKNEILDKLRQTTDNI 311 Query: 83 VAAKAAVESARINLAYTKVTSPISGRIGKSNV-TEGALVTNGQSTELATVQQLDPIYVDV 141 + + + +P+S ++ + V TEG +VT + T + V + D + V Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTA 370 Query: 142 TQSSND--FMRLKQSVE 156 + D F+ + Q+ Sbjct: 371 LVQNKDIGFINVGQNAI 387 Score = 29.8 bits (67), Expect = 0.015 Identities = 15/90 (16%), Positives = 26/90 (28%), Gaps = 12/90 (13%) Query: 8 EGSDVEAGQSLYQIDPATYQADYDSAKGELAKSEAAAAIAHLTVKRYVPLVGTKYISQQE 67 EG V G L ++ +AD K++++ A L RY L E Sbjct: 114 EGESVRKGDVLLKLTALGAEAD-------TLKTQSSLLQARLEQTRYQIL-----SRSIE 161 Query: 68 YDQAIADARQADAAVVAAKAAVESARINLA 97 ++ + +L Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVLRLTSLI 191
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1391 bits (3602), Expect = 0.0 Identities = 917/1032 (88%), Positives = 974/1032 (94%) Query: 1 MANFFIRRPIFAWVLAIILMMAGALAIMQLPVAQYPTIAPPAVSISATYPGADAQTVQDT 60 MANFFIRRPIFAWVLAIILMMAGALAI+QLPVAQYPTIAPPAVS+SA YPGADAQTVQDT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGISVEKSSSSFLMVAGFVSDNPNTTQDDISDYVASNIKDSISRLNGVGDVQLFGA 180 EVQQQGISVEKSSSS+LMVAGFVSDNP TTQDDISDYVASN+KD++SRLNGVGDVQLFGA Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWLDANLLNKYQLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRL 240 QYAMRIWLDA+LLNKY+LTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 KDPEEFGKVTLRVNTDGSVVHLKDVARIELGGENYNVVARINGKPASGLGIKLATGANAL 300 K+PEEFGKVTLRVN+DGSVV LKDVAR+ELGGENYNV+ARINGKPA+GLGIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTATAIKAKLAELQPFFPQGMKVVYPYDTTPFVKISIHEVVKTLFEAIILVFLVMYLFLQ 360 DTA AIKAKLAELQPFFPQGMKV+YPYDTTPFV++SIHEVVKTLFEAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NIRATLIPTIAVPVVLLGTFAVLAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 N+RATLIPTIAVPVVLLGTFA+LAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 MEDNLSPREATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 MED L P+EATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATLLKPVSAEHHEKKSGFFGWFNTRFDHSVNHYTNSVSGIVRNTGRY 540 SVLVALILTPALCATLLKPVSAEHHE K GFFGWFNT FDHSVNHYTNSV I+ +TGRY Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 541 LIIYLLIVVGMAVLFLRLPTSFLPEEDQGVFLTMIQLPSGATQERTQKVLDQVTHYYLNN 600 L+IY LIV GM VLFLRLP+SFLPEEDQGVFLTMIQLP+GATQERTQKVLDQVT YYL N Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 601 EKANVESVFTVNGFSFSGQGQNSGMAFVSLKPWEERNGEENSVEAVIARATRAFSQIRDG 660 EKANVESVFTVNGFSFSGQ QN+GMAFVSLKPWEERNG+ENS EAVI RA +IRDG Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 661 LVFPFNMPAIVELGTATGFDFELIDQGGLGHDALTKARNQLLGMVAKHPDLLVRVRPNGL 720 V PFNMPAIVELGTATGFDFELIDQ GLGHDALT+ARNQLLGM A+HP LV VRPNGL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 721 EDTPQFKLDVDQEKAQALGVSLSDINETISAALGGYYVNDFIDRGRVKKVYVQADAQFRM 780 EDT QFKL+VDQEKAQALGVSLSDIN+TIS ALGG YVNDFIDRGRVKK+YVQADA+FRM Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 781 LPGDINNLYVRSANGEMVPFSTFSSARWIYGSPRLERYNGMPSMELLGEAAPGRSTGEAM 840 LP D++ LYVRSANGEMVPFS F+++ W+YGSPRLERYNG+PSME+ GEAAPG S+G+AM Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 Query: 841 SLMENLASQLPNGIGYDWTGMSYQERLSGNQAPALYAISLIVVFLCLAALYESWSIPFSV 900 +LMENLAS+LP GIGYDWTGMSYQERLSGNQAPAL AIS +VVFLCLAALYESWSIP SV Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 Query: 901 MLVVPLGVVGALLAASLRGLNNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMEKEGRGLI 960 MLVVPLG+VG LLAA+L NDVYF VGLLTTIGLSAKNAILIVEFAKDLMEKEG+G++ Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 Query: 961 EATLEASRMRLRPILMTSLAFILGVMPLVISRGAGSGAQNAVGTGVMGGMLTATLLAIFF 1020 EATL A RMRLRPILMTSLAFILGV+PL IS GAGSGAQNAVG GVMGGM++ATLLAIFF Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 Query: 1021 VPVFFVVVKRRF 1032 VPVFFVV++R F Sbjct: 1021 VPVFFVVIRRCF 1032
>adhesinb#Adhesin B signature. Length = 310 Score = 29.0 bits (65), Expect = 0.001 Identities = 14/68 (20%), Positives = 26/68 (38%), Gaps = 10/68 (14%) Query: 1 MKR---LIPVALLTTLLAGCAHDSPCVPVYDDQGRLVHTNTCMKGTTQDNWETAGAIAGG 57 MK+ L+ + L LA C+ + +V TN+ + T++ IAG Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKN-------IAGD 53 Query: 58 AAAVAGLT 65 + + Sbjct: 54 KINLHSIV 61
>ICENUCLEATIN#Ice nucleation protein signature. Length = 1258 Score = 34.0 bits (77), Expect = 0.008 Identities = 51/220 (23%), Positives = 86/220 (39%), Gaps = 20/220 (9%) Query: 362 ISGDRTVNTLTGDSSVTDGATGMVISGDGTTNTISGHSTVDNATGA---------LISGN 412 +T+ T S+++ +I+G G+T T ST+ G+ L++G Sbjct: 153 TQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGY 212 Query: 413 GTTTNFAGDIAVSG--GGTAIIIDGDNATIKNTGTSNISGAGSTGTVIDGNNARVNNDGD 470 G+T + + G T + G + T G + AG ++I G + D Sbjct: 213 GSTQTAGEESSQMAGYGSTQTGMKGSDLT---AGYGSTGTAGDDSSLIAGYGSTQTAGED 269 Query: 471 MTITDG-GTGGHITGDNVVIDNAGSTTVSGADATALYIEGDNALVINEGNQTISGGAVGT 529 ++T G G+ + + GST +GAD++ + G E QT G+ T Sbjct: 270 SSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQT 329 Query: 530 RIDGDD-----AHTTNTGDIAVDGAGSAAVIINGDNGSLT 564 G D T GD + AG + G++ SLT Sbjct: 330 AQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLT 369
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 40.0 bits (93), Expect = 2e-05 Identities = 36/197 (18%), Positives = 61/197 (30%), Gaps = 18/197 (9%) Query: 146 ANATQPAPGATSAEQTAGNTSQDISLPPISSTPTQGQSPVVADGQQRVEVQGDLNNALTQ 205 N Q + + + +PP + + VA+ ++ + N Sbjct: 1000 PNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDAT 1059 Query: 206 NPEQMNNVAVN---STLPTEPATVAPVRNGSTTRQAAVSEPTERHTTRPERKQAV----- 257 N S + T ++GS T++ +E E T E K V Sbjct: 1060 ETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKT 1119 Query: 258 ---------IEPKKPQTTAKTTTAEPKKPVAP-VKRTEPAAPAATPKATTTTAAPTATAS 307 + PK+ Q+ AEP + P V EP + T T A T++ Sbjct: 1120 QEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179 Query: 308 AAPVQTAKPAQASTTPV 324 PV + + V Sbjct: 1180 EQPVTESTTVNTGNSVV 1196
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 266 bits (682), Expect = 7e-86 Identities = 82/301 (27%), Positives = 133/301 (44%), Gaps = 18/301 (5%) Query: 94 LENRSISLQYADAAELAKAGEKLLSAKGTIMVDKRTNRLLLRDNRAVLAELEKWVSQMDL 153 L + +I D + +A SA+ + D N +++RD+ + ++ + +D Sbjct: 219 LSDATIQQVTVDNQRIPQAAT-RASAQARVEADPSLNAIIVRDSPERMPMYQRLIHALDK 277 Query: 154 PVAQVELAAHIVTINEKSLRELGVKWTLADATQAGAVGDVTTLSSDLSVAAATSRVGFNI 213 P A++E+A IV IN L ELGV W + T + T ++A+ G Sbjct: 278 PSARIEVALSIVDINADQLTELGVDWRVGIRTGNNHQVVIKTTGDQSNIASN----GALG 333 Query: 214 GRISGRLLDL---ELSALEQKQQLDIIASPRLLASHLQPASIKQGSEIPYQVSSGESGAT 270 + R LD ++ LE + +++ P LL A I SE Y +G+ A Sbjct: 334 SLVDARGLDYLLARVNLLENEGSAQVVSRPTLLTQENAQAVIDH-SETYYVKVTGKEVA- 391 Query: 271 SVEFKEAVLG--MEVTPTVLQKG---RIRLKLHISQNVPGQVLQQADGEVLAIDKQEIET 325 E K G + +TP VL +G I L LHI +G + I + ++T Sbjct: 392 --ELKGITYGTMLRMTPRVLTQGDKSEISLNLHIEDGNQKPNSSGIEG-IPTISRTVVDT 448 Query: 326 QVEVKSGETLALGGIFSRKNKSGSDSVPLLGDIPWLGQLFRHDGKEDERRELVVFITPRL 385 V G++L +GGI+ + VPLLGDIP++G LFR + R + I PR+ Sbjct: 449 VARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGALFRRKSELTRRTVRLFIIEPRI 508 Query: 386 V 386 + Sbjct: 509 I 509
>PF06580#Sensor histidine kinase Length = 349 Score = 33.7 bits (77), Expect = 0.001 Identities = 27/188 (14%), Positives = 71/188 (37%), Gaps = 45/188 (23%) Query: 270 INKDIEECNAIIEQFIDYLR------TGQEMPM--EMADLNSVL-------GEVIAAESG 314 I +D + ++ + +R +++ + E+ ++S L + + Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQ---- 241 Query: 315 YEREINTALQAGSIQVKMHPLSIKRAVANMVVNA--ARYGNGWIKVSSGTESHRAWFQVE 372 +E +IN A+ V++ P+ ++ V N + + G I + ++ +VE Sbjct: 242 FENQINPAIM----DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVE 297 Query: 373 DDGPGIKPEQRKHLFQPFVRGDSARSTSGTGLGLAIV-QRIIDNH--NGMLEIGTSERGG 429 + G ++ TG GL V +R+ + +++ + ++G Sbjct: 298 NTGSLALKNTKE----------------STGTGLQNVRERLQMLYGTEAQIKL-SEKQGK 340 Query: 430 LSIRAWLP 437 ++ +P Sbjct: 341 VNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 98.4 bits (245), Expect = 6e-26 Identities = 39/136 (28%), Positives = 72/136 (52%), Gaps = 3/136 (2%) Query: 6 KILVVDDDMRLRALLERYLTEQGFQVRSVANAEQMDRLLTRESFHLMVLDLMLPGEDGLS 65 ILV DDD +R +L + L+ G+ VR +NA + R + L+V D+++P E+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 66 ICRRLRSQSNPMPIIMVTAKGEEVDRIVGLEIGADDYIPKPFNPRELLARIRAVL---RR 122 + R++ +P+++++A+ + I E GA DY+PKPF+ EL+ I L +R Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 123 QANELPGAPSQEEAVI 138 + ++L ++ Sbjct: 125 RPSKLEDDSQDGMPLV 140
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 41.8 bits (98), Expect = 9e-06 Identities = 42/142 (29%), Positives = 66/142 (46%), Gaps = 30/142 (21%) Query: 1 MKKLTIGLIGNPNSGKTTLFNQL---TGARQRVGNW-AGVTV------ERKEG---QFAT 47 MK + IG++ + ++GKTTL L +GA +G+ G T ER+ G Q Sbjct: 1 MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI 60 Query: 48 T-----DHQVTLVDLPGTYSLTTISSQTSLDEQIACHYILSGDADLLINVVDASNLE-RN 101 T + +V ++D PG + SL +L G A LLI+ D + R Sbjct: 61 TSFQWENTKVNIIDTPG-HMDFLAEVYRSLS-------VLDG-AILLISAKDGVQAQTRI 111 Query: 102 LYLTLQLLELGIPCIVALNMLD 123 L+ L+ ++GIP I +N +D Sbjct: 112 LFHALR--KMGIPTIFFINKID 131
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 32.6 bits (74), Expect = 3e-04 Identities = 17/92 (18%), Positives = 32/92 (34%), Gaps = 16/92 (17%) Query: 55 VACIDDIVVGHLSIQVTQRPRRSHVADFGICVDARWHNRGIASTLIRTMID------MCD 108 + +++ +G + I+ + + D + D R G+ + L+ I+ C Sbjct: 69 LYYLENNCIGRIKIRSNWN-GYALIEDIAVAKDYRKK--GVGTALLHKAIEWAKENHFCG 125 Query: 109 NWLRVDRIELTVFVDNEPAVAVYKKYGFEIEG 140 L I N A Y K+ F I Sbjct: 126 LMLETQDI-------NISACHFYAKHHFIIGA 150
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 43.2 bits (101), Expect = 1e-06 Identities = 46/176 (26%), Positives = 73/176 (41%), Gaps = 17/176 (9%) Query: 133 SGHLLSQPFNSSTPVLYYNKDAFKKAGLDPEQPPKTWQELADYTAKLRAAGMKCGYASGW 192 +G L++ P L YNKD L P PPKTW+E+ +L+A G + Sbjct: 126 NGKLIAYPIAVEALSLIYNKD------LLP-NPPKTWEEIPALDKELKAKGKSALMFNLQ 178 Query: 193 QGWIQLENFSAWNGLPFASKNNGFDGTDAVLEF--NKPEQVKHIALLEEMNKKGDFSYVG 250 + + +A G F +N +D D ++ K + L++ + D Y Sbjct: 179 EPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDY-- 236 Query: 251 RKDESTEKFYNGDCAMTTASSGSLANIRQYAKFNYGVGMMPYDADIKGAPQNAIIG 306 + F G+ AMT + +NI +K NYGV ++P KG P +G Sbjct: 237 --SIAEAAFNKGETAMTINGPWAWSNIDT-SKVNYGVTVLP---TFKGQPSKPFVG 286
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.0 bits (67), Expect = 0.024 Identities = 15/114 (13%), Positives = 34/114 (29%), Gaps = 2/114 (1%) Query: 17 DKEQKQEQTEEQQIVEEQRPVEPPVETAADVDAQTPAHSKAETEAFAEEVVDVTEKVQES 76 +++ K E + Q++ + V P E + V Q + + +E T ++ Sbjct: 1109 EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADT 1168 Query: 77 EKP-QPVEPEPAAAIETAAPQIAVEREELPLPEEVKDEAISPEEWQAEAETVEV 129 E+P + + + PE P + + Sbjct: 1169 EQPAKETSSNVEQPVTESTTVNTGNSVV-ENPENTTPATTQPTVNSESSNKPKN 1221
>SHIGARICIN#Ribosome inactivating protein family signature. Length = 289 Score = 26.7 bits (59), Expect = 0.026 Identities = 6/29 (20%), Positives = 16/29 (55%) Query: 7 FFIIIIALIVVAASFRFVQQRREKAANEA 35 +++I AA ++F++Q+ K ++ Sbjct: 173 ALMVLIQSTSEAARYKFIEQQIGKRVDKT 201
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 30.2 bits (68), Expect = 0.040 Identities = 17/78 (21%), Positives = 34/78 (43%), Gaps = 3/78 (3%) Query: 336 AEERRAPIERFIDRFSRIYTPVIMVIALLVTLIPPLMFDGGWQEWIYKGLTLLLIGCPCA 395 E++ P E S+I ++ + +L + P+ F GG IY+ ++ ++ A Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVS---A 477 Query: 396 LVISTPAAITSGLAAAAR 413 + +S A+ A A Sbjct: 478 MALSVLVALILTPALCAT 495
>PF01206#SirA family protein Length = 76 Score = 104 bits (260), Expect = 7e-33 Identities = 28/72 (38%), Positives = 42/72 (58%) Query: 39 DHTLDALGLRCPEPVMMVRKTVRNMQTGETLLIIADDPATTRDIPGFCTFMEHDLLAQET 98 D +LDA GL CP P++ +KT+ M GE L ++A DP + +D F H+LL Q+ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 99 EGLPYRYLLRKA 110 E Y + L++A Sbjct: 65 EDGTYHFRLKRA 76
>PF04183#IucA / IucC family Length = 580 Score = 27.9 bits (62), Expect = 0.038 Identities = 17/91 (18%), Positives = 28/91 (30%), Gaps = 14/91 (15%) Query: 121 LGQILDVHVFNRLRQNRRWWLAPTASTLFGNISDTLAFFFIAFWRSPDAFMAEHWMEIAL 180 LG I + L+ + +TL + + AE W+ Sbjct: 347 LGVIWRENPCRWLKPDES---PVLMATLMECDENNQPL--AGAYIDRSGLDAETWLT--- 398 Query: 181 VDYCFKVLISIIFFLPMYGVLL-----NMLL 206 V++ + L YGV L N+ L Sbjct: 399 -QLFRVVVVPLYHLLCRYGVALIAHGQNITL 428
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 49.1 bits (117), Expect = 2e-08 Identities = 76/403 (18%), Positives = 137/403 (33%), Gaps = 42/403 (10%) Query: 1 MRLNLRIVSIVMFNFASYLTIGLPLAVLPGYVHD--AMGFSAFWAGLIISLQYFATLLSR 58 M+ N ++ I+ + IGL + VLPG + D G++++L Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60 Query: 59 PHAGRYADVLGPKKIVVFGLCGCFLSGFGYLLADIASAWPMISLLLLGLGRVILGI-GQS 117 P G +D G + +++ L G + Y + A L +L +GR++ GI G + Sbjct: 61 PVLGALSDRFGRRPVLLVSLAG---AAVDYAIMATAPF-----LWVLYIGRIVAGITGAT 112 Query: 118 FAGTGSTLWGVGVVGSLHIGRVISWNGIVTYGAMAMGAPLGVLCYAWGGLQGLALTVMGV 177 A G+ + + R + M G LG L G Sbjct: 113 GAVAGAYIADITDGDER--ARHFGFMSACFGFGMVAGPVLGGLM----GGFSPHAPFFAA 166 Query: 178 ALLAILLAL----------PRPSVKANKGKPLPFRAVLGRVWLYGMALALA-----SAGF 222 A L L L + P + + +A +A Sbjct: 167 AALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVG 226 Query: 223 GVIATFITLFYDAK-GWDGAAFALTLFSVAFVGT---RLLFPNGINRLGGLNVAMICFGV 278 V A +F + + WD ++L + + + ++ RLG M+ Sbjct: 227 QVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIA 286 Query: 279 EIIGLLLVGTAAMPWMAKIGVLLTGMGFSLVFPALGVVAVKAVPPQNQGAALATYTVFMD 338 + G +L+ A WMA ++L + PAL + + V + QG + Sbjct: 287 DGTGYILLAFATRGWMAFPIMVLLA-SGGIGMPALQAMLSRQVDEERQGQLQGSLAALTS 345 Query: 339 MSLGVTGPLAGLVMTWAGVPV----IYLAAAGLVAMALLLTWR 377 ++ + GPL + A + ++A A L + L R Sbjct: 346 LT-SIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387
>ENTSNTHTASED#Enterobactin synthetase component D signature. Length = 234 Score = 32.3 bits (73), Expect = 7e-04 Identities = 25/93 (26%), Positives = 44/93 (47%), Gaps = 6/93 (6%) Query: 22 RRASWLAGRVLLSRALSPL---PEMVYGEQGKPAFSAGTPLWFNLSHSGDTIALLLSDEG 78 R+A LAGR+ AL + G++ +P + G L+ ++SH T ++S + Sbjct: 46 RKAEHLAGRIAAVHALREVGVRTVPGMGDKRQPLWPDG--LFGSISHCATTALAVISRQ- 102 Query: 79 EVGCDIEVIRPRDNWRSLANTVFSLGEHAEMEA 111 +G DIE I + LA ++ E ++A Sbjct: 103 RIGIDIEKIMSQHTATELAPSIIDSDERQILQA 135
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 48.0 bits (114), Expect = 2e-08 Identities = 43/171 (25%), Positives = 73/171 (42%), Gaps = 7/171 (4%) Query: 200 REREHGTVEHLLVMPVTPFEIMMAKV-WSMGLVVLVVSGLSLMLMVKGVLGVPIEGSIPL 258 R T E +L + +I++ ++ W+ L +G + +V LG + L Sbjct: 93 RMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAG---IGVVAAALGY-TQWLSLL 148 Query: 259 FMLGV-ALSLFATTSIGIFMGTIARSMPQLGLLMILVLLPLQMLSGGSTPRESMPQAVQD 317 + L V AL+ A S+G+ + +A S LV+ P+ LSG P + +P Q Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208 Query: 318 IMLTMPTTHFVSLAQAILYRGAGLSIVWPQFLTLLAIGGVFFL-IALLRFR 367 +P +H + L + I+ + + + I FFL ALLR R Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRRR 259
>FbpA_PF05833#Fibronectin-binding protein Length = 577 Score = 24.8 bits (54), Expect = 0.011 Identities = 8/24 (33%), Positives = 10/24 (41%), Gaps = 1/24 (4%) Query: 16 KTAPAGMPEYD-VKTLRVRPREPK 38 A GM Y +T+ V P P Sbjct: 550 NGAKPGMVIYSTNQTIYVTPTNPN 573
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 34.1 bits (78), Expect = 8e-05 Identities = 20/52 (38%), Positives = 26/52 (50%), Gaps = 5/52 (9%) Query: 76 VAPDALRHGIGKALL----EYVQQR-FPLLSLEVYQKNQSAVNFYHALGFRI 122 VA D + G+G ALL E+ ++ F L LE N SA +FY F I Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 116 bits (293), Expect = 1e-33 Identities = 43/124 (34%), Positives = 64/124 (51%), Gaps = 11/124 (8%) Query: 108 LNMPNNVTFDSSSATLKPAGANTLTGVAMVLKEY--PKTAVNVVGYTDSTGSHDLNMRLS 165 + ++V F+ + ATLKP G L + L +V V+GYTD GS N LS Sbjct: 215 FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLS 274 Query: 166 QQRADSVASSLITQGVDASRIRTSGMGPANPIASNSTAEGK---------AQNRRVEITL 216 ++RA SV LI++G+ A +I GMG +NP+ N+ K A +RRVEI + Sbjct: 275 ERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334 Query: 217 SPLQ 220 ++ Sbjct: 335 KGIK 338
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 52.9 bits (127), Expect = 2e-09 Identities = 35/106 (33%), Positives = 53/106 (50%), Gaps = 16/106 (15%) Query: 3 IATAGHVDHGKTTLLQAI---TGV------------NADRLPEEKKRGMTIDLGYAYWPQ 47 I HVD GKTTL +++ +G D E++RG+TI G + Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65 Query: 48 PDGRVLGFIDVPGHEKFLSNMLAGVGGIDHALLVVACDDGVMAQTR 93 + +V ID PGH FL+ + + +D A+L+++ DGV AQTR Sbjct: 66 ENTKV-NIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTR 110
>PF03895#Serum resistance protein DsrA. Length = 79 Score = 72.6 bits (178), Expect = 1e-17 Identities = 18/80 (22%), Positives = 38/80 (47%), Gaps = 2/80 (2%) Query: 1369 VENKMSGGIASAMAMAGLPQAYAPGANMTSIAGGTFNGESAVAIGV-SMVSESGGWVYKL 1427 + ++ G+A+ A++ L Q G S A G + ++A+AIGV S +++ + Sbjct: 1 LSKELQTGLANQSALSMLVQPNGVGKTSVSAAVGGYRDKTALAIGVGSRITDRFTAKAGV 60 Query: 1428 QGTSNSQGDYSAAIGAGFQW 1447 + + G S G+++ Sbjct: 61 AFNTYN-GGMSYGASVGYEF 79
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 99.5 bits (248), Expect = 3e-26 Identities = 75/348 (21%), Positives = 125/348 (35%), Gaps = 67/348 (19%) Query: 2 IIVTGGAGFIGSNIVKALNDKGITDILVVDNLKD--------------GTKFVNLVDLNI 47 +VTG AGFIG ++ K L + G ++ +DNL D +++ Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 48 ADYMDKEDFLIQIMSGEELGDIEAIFHEGACSSTTEWDGKYMMDNNYQYSK-------EL 100 AD + + + G E +F + +Y ++N + Y+ + Sbjct: 62 ADR----EGMTDLF---ASGHFERVFISPHRLAV-----RYSLENPHAYADSNLTGFLNI 109 Query: 101 LHYCLEREIP-FLYASSAATYGGRTSD-FIESREYEKPLNVYGYSKFLFDEYVRQILPEA 158 L C +I LYASS++ YG F + P+++Y +K + Sbjct: 110 LEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY 169 Query: 159 NSQIVGFRYFNVYGPREGHKGSMASVAFHLNTQLNNGESPKLFEGSENFKRDFVYVGDVA 218 G R+F VYGP + MA F + G+S ++ KRDF Y+ D+A Sbjct: 170 GLPATGLRFFTVYGPWG--RPDMA--LFKFTKAMLEGKSIDVY-NYGKMKRDFTYIDDIA 224 Query: 219 AVNL------------WFLESGKSG-------IFNLGTGRAESFQAVADATLAY-HKKGS 258 + W +E+G ++N+G A + Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK 284 Query: 259 IEYIPFPDKLKGRYQAFTQADLTNLRNA-GYDKPFKTVAEGVTEYMAW 305 +P G T AD L G+ P TV +GV ++ W Sbjct: 285 KNMLPLQ---PGDVL-ETSADTKALYEVIGF-TPETTVKDGVKNFVNW 327
>SECA#SecA protein signature. Length = 901 Score = 39.5 bits (92), Expect = 5e-05 Identities = 26/79 (32%), Positives = 37/79 (46%), Gaps = 7/79 (8%) Query: 291 MRLVQGDV-----GSGKTLVAALAA-LRAIAHGKQVALMAPTELLAEQHANNFRSWFAPL 344 M L + + G GKTL A L A L A+ GK V ++ + LA++ A N R F L Sbjct: 92 MVLNERCIAEMRTGEGKTLTATLPAYLNALT-GKGVHVVTVNDYLAQRDAENNRPLFEFL 150 Query: 345 GVEVGWLAGKQKGKARQAQ 363 G+ VG A++ Sbjct: 151 GLTVGINLPGMPAPAKREA 169
>PERTACTIN#Pertactin signature. Length = 922 Score = 118 bits (297), Expect = 3e-29 Identities = 163/749 (21%), Positives = 288/749 (38%), Gaps = 90/749 (12%) Query: 230 TGDSSEGLRTGQSGSLIRLGDDATIETSGASSTGIYAASSSRTELGNNATITVNGASAHA 289 TG + G+ G+++ L ATI A + G + + Sbjct: 236 TGGRAAGV-AAMDGAIVHL-QRATIRRGDAPAGGAVPGGAVPGGAVPGG-FGPLLDGWYG 292 Query: 290 VYATNATVNLGENATISVNSASKAASYSKAPAGLYALSRGAINLAGGAAITMAGDNSSES 349 V +++TV+L A V + A+ +S G+++ G I G Sbjct: 293 VDVSDSTVDL---AQSIVEAPQLGAAIRAGRGARVTVSGGSLSAPHGNVIETGGGARRFP 349 Query: 350 YAISTETGGIVDGS--SGGRFVIDGDIRAAGATAASGTLPQ--------------QNSTI 393 S + + G+ G + T A G Q + + Sbjct: 350 PPASPLSITLQAGARAQGRALLYRVLPEPVKLTLAGGAQGQGDIVATELPPIPGASSGPL 409 Query: 394 KLNMTDNSRWDGASYITSATAGTGVISVQMSDATWNMTSSSTLTDLTLNSGATINFSH-- 451 + + +RW GA+ V S+ + +ATW MT +S + L L S +++F Sbjct: 410 DVALASQARWTGATRA--------VDSLSIDNATWVMTDNSNVGALRLASDGSVDFQQPA 461 Query: 452 EDGEPWQTLTINEDYVGNGGKLVFNTVLNDDDSETDRLQVLGNTSGNTFVAVNNIGGAGA 511 E G ++ L ++ G+G +F + D +D+L V+ + SG + V N G A Sbjct: 462 EAGR-FKVLMVDT-LAGSG---LFRMNVFADLGLSDKLVVMRDASGQHRLWVRNSGSEPA 516 Query: 512 QTIEGIEIVNVAGNSNGTFEKASR---IVAGAYDYNVVQKGKNWYLTSYIEPDEPIIPDP 568 + + +V S TF A++ + G Y Y + G + S + P P P Sbjct: 517 -SGNTMLLVQTPRGSAATFTLANKDGKVDIGTYRYRLAANGNGQW--SLVGAKAPPAPKP 573 Query: 569 VDPVIPDPVIPDPVDPDPVDPVIPDPVIPDPVDPDPVDPVIPDPTIPDIGQSDTPPITEH 628 P P P P P P P P P P P P ++ + + Sbjct: 574 APQPGPQPGPQPPQPPQPPQP----PQPPQPPQRQPEAPAPQPPAGRELSAAANAAVNTG 629 Query: 629 QFRPEVGSYLANNYAANTLFMTRLHDRLGETQYTDMLTGEKKVTSLWMRNVGAHTRFNDG 688 + A + A L RLGE + G W R + ++ Sbjct: 630 GVGLASTLWYAESNA--------LSKRLGELRLNPDAGG------AWGRGFAQRQQLDNR 675 Query: 689 SGQLKTRINSYVLQVGGDLAQWSTDGLDRWHIGAMAGYANSQNRTQSSVSDYHSRGQVTG 748 +G+ + ++G D A + G RWH+G +AGY + D G Sbjct: 676 AGRRFDQ-KVAGFELGADHA-VAVAG-GRWHLGGLAGYTRGD---RGFTGD--GGGHTDS 727 Query: 749 YSVGLYGTWYANNIDRSGAYVDTWMLYNWFDN--KVMGQDQAA--EKYKSKGITASVEAG 804 VG Y T+ AN+ G Y+D + + +N KV G D A KY++ G+ S+EAG Sbjct: 728 VHVGGYATYIANS----GFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGVSLEAG 783 Query: 805 YSFRLGESVHQSYWLQPKAQVVWMGVQADDNREANGTLVKDDTAGNLLTRMGVKAYINGH 864 F ++L+P+A++ V R ANG V+D+ ++L R+G++ Sbjct: 784 RRFAH----ADGWFLEPQAELAVFRVGGGAYRAANGLRVRDEGGSSVLGRLGLEV----G 835 Query: 865 NAIDDNKSREFQPFVEANWIHNTQPA-SVKMDDVS--SDMRGTKNIGELKVGIEGQITSR 921 I+ R+ QP+++A+ + A +V+ + ++ +++RGT+ EL +G+ + Sbjct: 836 KRIELAGGRQVQPYIKASVLQEFDGAGTVRTNGIAHRTELRGTR--AELGLGMAAALGRG 893 Query: 922 LNVWGNVAQQVGDQGYSNTQGLLGVKYSF 950 +++ + G + G +YS+ Sbjct: 894 HSLYASYEYSKGPKLAMPWTFHAGYRYSW 922
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 43.1 bits (101), Expect = 4e-07 Identities = 43/180 (23%), Positives = 63/180 (35%), Gaps = 22/180 (12%) Query: 1 MSTPANF--NGQRPAIDANDAVMLLIDHQSGLFQTVGD--MPMPELRARAAALAKIATLC 56 M T ++ N D N AV+L+ D Q+ P+ EL A L Sbjct: 11 MPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQL 70 Query: 57 NMPVITTASVPQ-------------GPNGPLIPE----IHANAPHA-QYVARKGEINAWD 98 +PV+ TA GP P I AP V K +A+ Sbjct: 71 GIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFK 130 Query: 99 NADFVQAVKATGRKTLIIAGTITSVCMAFPAISAVAEGYKVFAVIDASGTYSKMAQEITM 158 + ++ ++ GR LII G + A A E K F V DA +S ++ + Sbjct: 131 RTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMAL 190
>cloacin#Cloacin signature. Length = 551 Score = 27.4 bits (60), Expect = 0.033 Identities = 12/47 (25%), Positives = 20/47 (42%) Query: 30 NGNGGGHSNNAANQGNNGNGHKGNAGQKTEHRKNGGKPDHVESDISY 76 N GGG + G +G+G+ G G GG V + +++ Sbjct: 44 NPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAF 90
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 36.7 bits (85), Expect = 1e-04 Identities = 40/208 (19%), Positives = 77/208 (37%), Gaps = 13/208 (6%) Query: 33 ITVEFLPVSLLTP----MAQDLGISEGVAGQSVTVTAFVAMFSSLFITQIIQATDR--RY 86 + ++ + + L+ P + +DL S V + A A+ + +DR R Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRR 73 Query: 87 IVILFAVLLTA-SCLMVSFANSFTLLLLGRACLGLALGGFWAMSASLTMRLVPARTVPKA 145 V+L ++ A +++ A +L +GR G+ G A++ + + + Sbjct: 74 PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARH 132 Query: 146 LSVIFGAVSIALVIAAPLGSFLGGIIGWRNVFNAAAVMGVLCVIWVVKSLP-SLPGEPSH 204 + +V LG +GG F AAA + L + LP S GE Sbjct: 133 FGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRP 191 Query: 205 QKQ---NMFSLLQRPGVMAGMIAIFMSF 229 ++ N + + M + A+ F Sbjct: 192 LRREALNPLASFRWARGMTVVAALMAVF 219
>CABNDNGRPT#NodO calcium binding signature. Length = 479 Score = 28.4 bits (63), Expect = 0.030 Identities = 13/69 (18%), Positives = 27/69 (39%), Gaps = 9/69 (13%) Query: 51 TVKKAVDQLVREGVLVQVQGKGTFVKKENVAYPLGEGLLSFAEALASQKINFTTSVITSR 110 ++ +A Q+ RE V G F K N+ + F ++++S T V + Sbjct: 49 SIDQAAAQITREN--VSWNGTNVFGKSANLTF-------KFLQSVSSIPSGDTGFVKFNA 99 Query: 111 LEPANRFVA 119 + ++ Sbjct: 100 EQIEQAKLS 108
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 36.0 bits (83), Expect = 3e-04 Identities = 27/168 (16%), Positives = 64/168 (38%), Gaps = 16/168 (9%) Query: 49 FNIAQNDMISTYGLSMTELGMIGLGFSITYGVGKTLVSYYADGKNTKQFLPFMLILSAIC 108 N++ D+ + + + F +T+ +G + +D K+ L F +I++ Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN--- 89 Query: 109 MLGFSASMGAGSTSLFLMIAFYALSGFFQSTGGSCSYSTI----TKWTPRRKRGTFLGFW 164 F + +G S F ++ + F Q G + + + ++ P+ RG G Sbjct: 90 --CFGSVIGFVGHSFFSLLIM---ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144 Query: 165 NISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRF 212 +G + A+Y+ + + + P +I +I ++ Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSY---LLLIP-MITIITVPFLMKL 188
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 41.0 bits (96), Expect = 6e-06 Identities = 72/408 (17%), Positives = 138/408 (33%), Gaps = 60/408 (14%) Query: 29 RHILITIWLGYALFY--FTRKSFNAAAPEILASGILSRSDIGLLATLFYITYGVSKFVSG 86 RH I IWL F+ N + P+I + + T F +T+ + V G Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 87 IVSDRSNARYFMGIGLIATGVVNILFGFSTSLWAFALLWALNAFFQGFGS---PVCARLL 143 +SD+ + + G+I +++ S F L + F QG G+ P ++ Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHS---FFSLLIMARFIQGAGAAAFPALVMVV 127 Query: 144 TAWY-SRTERGGWWALWNTAHNVGGALIPLVMAAVALHYGWRVGMMVAGLLAIGVGMVLC 202 A Y + RG + L + +G + P + +A + W +++ + I V Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITV----- 182 Query: 203 WRLRDRPQAIGLPPVGDWRHDALEVAQQQEGAGLSRKEILAKYVLLNPYIWLLSLCYVLV 262 P + L ++ G L I+ + Y + VL Sbjct: 183 ------PFLMKLLKKEVRIKGHFDIK----GIILMSVGIVFFMLFTTSYSISFLIVSVLS 232 Query: 263 YVV-----RAAINDWGNLYMSETLGVDLVTANTAVSMFELGGFI-----------GALVA 306 +++ R + + + + + + + + + GF+ A Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292 Query: 307 GWGSDKLFNG----------------NRGPMNLIFAAGILLSVGSL---WLMPFASYVMQ 347 GS +F G RGP+ ++ LSV L +L+ S+ M Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMT 352 Query: 348 AACFFTTGFFVFGPQMLIGMAAAECSHKEAAGAATGFVGLFAYLGASL 395 F G F + +I + ++ AGA + ++L Sbjct: 353 IIIVFVLGGLSF-TKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 863 bits (2230), Expect = 0.0 Identities = 522/548 (95%), Positives = 536/548 (97%) Query: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQTQQTTQTTTTAAGSAADQGVPASGQGKM 60 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQ QQTTQTTTTAAGSAADQGVPASGQGK+ Sbjct: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKL 60 Query: 61 ITVKTDVLDLTINTRGGDVEQALLPAYPKELGSNEPFQLLETTPQFIYQAQSGLTGRDGP 120 I+VKTDVLDLTINTRGGDVEQALLPAYPKEL S +PFQLLET+PQFIYQAQSGLTGRDGP Sbjct: 61 ISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGP 120 Query: 121 DNPANGPRPLYNVEKDAFVLADGQNELQVPMTYTDAAGNTFTKTFVFKRGDYAVNVNYSV 180 DNPANGPRPLYNVEKDA+VLA+GQNELQVPMTYTDAAGNTFTKTFV KRGDYAVNVNY+V Sbjct: 121 DNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNTFTKTFVLKRGDYAVNVNYNV 180 Query: 181 QNTGEKPLEVSTFGQLKQSVNLPPHRDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD 240 QN GEKPLE+S+FGQLKQS+ LPPH DTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD Sbjct: 181 QNAGEKPLEISSFGQLKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD 240 Query: 241 NENLNVSSKGGWVAMLQQYFATAWIPRNDGTNNFYTANLGNGIVAIGYKAQPVLVQPGQT 300 NENLN+SSKGGWVAMLQQYFATAWIP NDGTNNFYTANLGNGI AIGYK+QPVLVQPGQT Sbjct: 241 NENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQT 300 Query: 301 GAMTSTLWVGPEIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIII 360 GAM STLWVGPEIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIII Sbjct: 301 GAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIII 360 Query: 361 ITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRQSQEMMALYKAEKVNPL 420 ITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQR SQEMMALYKAEKVNPL Sbjct: 361 ITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRISQEMMALYKAEKVNPL 420 Query: 421 GGCFPLIIQMPIFLALYYMLMGSIELRHAPFALWIHDLSAQDPYYILPILMGVTMFFIQK 480 GGCFPL+IQMPIFLALYYMLMGS+ELR APFALWIHDLSAQDPYYILPILMGVTMFFIQK Sbjct: 421 GGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQK 480 Query: 481 MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGL 540 MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGL Sbjct: 481 MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGL 540 Query: 541 HSREKKKS 548 HSREKKKS Sbjct: 541 HSREKKKS 548
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 26.7 bits (58), Expect = 0.043 Identities = 15/42 (35%), Positives = 21/42 (50%) Query: 70 AEAQVIIEQANKRRAQILDEAKTEAEQERTKIVAQAQAEIEA 111 A+A + ANK R Q EAK +AE++ + A A A Sbjct: 210 AKASIEAAAANKAREQAAAEAKRKAEEQARQQAAIRAANTYA 251
>YERSSTKINASE#Yersinia serine/threonine protein kinase signature. Length = 732 Score = 28.9 bits (64), Expect = 0.041 Identities = 16/41 (39%), Positives = 23/41 (56%) Query: 66 TETSDALATQLTALQKAQESQKAELEGIIKKQAAQLDDANR 106 TE L+ QL LQ+ QES KA+L +I + + D A + Sbjct: 598 TEAKITLSQQLNTLQQQQESAKAQLSILINRSGSWADVARQ 638
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 178 bits (454), Expect = 1e-50 Identities = 100/448 (22%), Positives = 170/448 (37%), Gaps = 87/448 (19%) Query: 4 NLRNIAIIAHVDHGKTTLVDKLLQQSGTFDARAETQE--RVMDSNDLEKERGITILAKNT 61 + NI ++AHVD GKTTL + LL SG + D+ LE++RGITI T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 62 AIKWNDYRINIVDTPGHADFGGEVERVMSMVDSVLLVVDAFDGPMPQTRFVTKKAFAHGL 121 + +W + ++NI+DTPGH DF EV R +S++D +L++ A DG QTR + G+ Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121 Query: 122 KPIVVINKVDRPGARPDWVVDQVFD-------------LFVNLDATDEQLD--------- 159 I INK+D+ G V + + L+ N+ T+ Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181 Query: 160 --------------------------------FPIIYASALNGIAGLDHEDMAEDMTPLY 187 FP+ + SA N I G+D+ L Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNI-GIDN---------LI 231 Query: 188 QAIIDHVPAPDVDLDGPLQMQISQLDYNNYVGVIGIGRIKRGKVKPNQQVTIIDSEGKTR 247 + I + + L ++ +++Y+ + R+ G + V I + E Sbjct: 232 EVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-- 289 Query: 248 NAKVGKVLTHLGLERIDSDIAEAGDIIAITGLG-ELN--ISDTICDPQNVEALPALSVDE 304 K+ ++ T + E D A +G+I+ + +LN + DT PQ + Sbjct: 290 --KITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQR----ERIENPL 343 Query: 305 PTVSMFFCVNTSPFCGKEGKFVTSRQILDRLNKELVHNVALRVEETEDADAFRVSGRGEL 364 P + + + D L LR +S G++ Sbjct: 344 PLLQTTVEPSKPQQREMLLDALLEISDSDPL---------LRYYVDSATHEIILSFLGKV 394 Query: 365 HLSVLIENMRRE-GFELAVSRPKVIFRE 391 + V ++ + E+ + P VI+ E Sbjct: 395 QMEVTCALLQEKYHVEIEIKEPTVIYME 422 Score = 32.5 bits (74), Expect = 0.005 Identities = 13/75 (17%), Positives = 29/75 (38%), Gaps = 1/75 (1%) Query: 398 EPYENVTLDVEEQHQGSVMQALGERKGDLKNMNPDGKGRVRLDYVIPSRGLIGFRSEFMT 457 EPY + + +++ + ++ + V L IP+R + +RS+ Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKN-NEVILSGEIPARCIQEYRSDLTF 595 Query: 458 MTSGTGLLYSTFSHY 472 T+G + + Y Sbjct: 596 FTNGRSVCLTELKGY 610
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 32.1 bits (73), Expect = 0.005 Identities = 24/153 (15%), Positives = 48/153 (31%), Gaps = 7/153 (4%) Query: 194 AALFSLCGLLFMWLCYAGVKERYVEVKQADSAQKAGILQSFRAIAGNRPLFILCVANLCT 253 AA + L + E + ++ + L SFR G + L Sbjct: 166 AAALNGLNFL---TGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIM 222 Query: 254 LAAFNVKLAIQVYYTQYVLN-DPILLSYM--GFFSMGCIFIGVFLMPTAVRRFGKKKVYI 310 V A+ V + + + D + F + + + R G+++ + Sbjct: 223 QLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLA-QAMITGPVAARLGERRALM 281 Query: 311 GGLLIWAVGDLLNYSFGDSSVSFVAFSCLAFFG 343 G++ G +L ++F LA G Sbjct: 282 LGMIADGTGYILLAFATRGWMAFPIMVLLASGG 314
>PHAGEIV#Gene IV protein signature. Length = 426 Score = 27.2 bits (60), Expect = 0.007 Identities = 13/58 (22%), Positives = 20/58 (34%), Gaps = 10/58 (17%) Query: 8 DMGRILLDLS--DDVIKRLDDLKVQRNLPRAELLREAVEQYLERQDRAETTISKALGL 63 G LL +S D++ L +LP ++L E + E AL Sbjct: 166 VDGSNLLVVSAPKDILDNLPQFLSTVDLPTDQILIEGL--------IFEVQQGDALDF 215
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 33.8 bits (77), Expect = 6e-04 Identities = 11/57 (19%), Positives = 15/57 (26%), Gaps = 1/57 (1%) Query: 30 EATPTASSQPATPAPSQTPETQSDESPAQPSAAKPETATQPPAAKPETPAQPEVDAE 86 + P +P P P PE + P K E P + E Sbjct: 67 QPPPEPVVEPE-PEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVE 122 Score = 33.4 bits (76), Expect = 0.001 Identities = 14/56 (25%), Positives = 21/56 (37%), Gaps = 4/56 (7%) Query: 32 TPTASSQPATPAPSQTPETQSDESPAQPSAAKPETATQPPAAKPETPAQPEVDAEE 87 P AP+ Q+ + P +P +PE P PE P + V E+ Sbjct: 45 APAQPISVTMVAPADLEPPQAVQPPPEP-VVEPEP---EPEPIPEPPKEAPVVIEK 96
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 30.8 bits (69), Expect = 0.002 Identities = 9/29 (31%), Positives = 16/29 (55%) Query: 9 ARSLVRERQRTGLSLAEIARRAGIAKSTL 37 A L ++ + SL EIA+ AG+ + + Sbjct: 20 ALRLFSQQGVSSTSLGEIAKAAGVTRGAI 48
>adhesinb#Adhesin B signature. Length = 310 Score = 27.5 bits (61), Expect = 0.046 Identities = 12/65 (18%), Positives = 23/65 (35%), Gaps = 11/65 (16%) Query: 167 EPVWAIGTGKSATPAQAQAVHKFIRDHIAKA-------DAKIAEQV----IIQYGGSVNA 215 +W I T + TP Q + + + +R + D + + V I + Sbjct: 221 AYIWEINTEEEGTPDQIKTLVEKLRKTKVPSLFVESSVDDRPMKTVSKDTNIPIYAKIFT 280 Query: 216 SNAAE 220 + AE Sbjct: 281 DSVAE 285
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 30.2 bits (68), Expect = 0.017 Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 3/36 (8%) Query: 49 TPKNILMIGPTGVGKTEIAR---RLAKLANAPFIKV 81 T +++ G +G GK +AR K N PF+ + Sbjct: 159 TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 43.5 bits (102), Expect = 7e-07 Identities = 24/138 (17%), Positives = 55/138 (39%), Gaps = 2/138 (1%) Query: 83 PNQLTSEQRQLLEQMQADMRQQPTQLNEVPWNEQTPEQRQQTLQRQRQAQQQQWTQTQPV 142 + T++ R++ ++ +++++ Q NEV + ++ Q T ++ +++ Sbjct: 1058 ATETTAQNREVAKEAKSNVKANT-QTNEVAQSGSETKETQTTETKETATVEKEEKAKVET 1116 Query: 143 QQPRTQPRVNEQPQTRTVQSAPAQPARQSQPPKQ-TASQQPYQDLLQTPAHTSAAAPKAA 201 ++ + P+V Q + QS QP + T + + Q T A T A + + Sbjct: 1117 EKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETS 1176 Query: 202 PITRAPEAPKTTAEKKDE 219 P TT + Sbjct: 1177 SNVEQPVTESTTVNTGNS 1194 Score = 32.3 bits (73), Expect = 0.003 Identities = 37/213 (17%), Positives = 60/213 (28%), Gaps = 31/213 (14%) Query: 65 QPGVRTPTEPSAGGE---VMNPNQLTSEQRQLLEQMQADMRQQPTQLNEVPWNEQTPEQR 121 P V + E A + V P T + + + + NE E T + R Sbjct: 1007 VPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNR 1066 Query: 122 QQTLQRQRQ---AQQQQWTQTQPVQQPRTQPRVNEQPQTRTVQSAPAQPARQSQPPKQTA 178 + + + Q + TQ ++ T + ++Q Sbjct: 1067 EVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQ------ 1120 Query: 179 SQQPYQDLLQTPAHTSAAAPKAAPITRAPEAPKTTAEKKDERRWMVQCGSFKGAEQAESV 238 + P TS +PK E + AE E V + + Sbjct: 1121 ---------EVPKVTSQVSPK----QEQSETVQPQAEPARENDPTVNIKEPQSQTNTTAD 1167 Query: 239 RAQLA------FEGFDSKITTNNGWNRVVIGPV 265 Q A E ++ TT N N VV P Sbjct: 1168 TEQPAKETSSNVEQPVTESTTVNTGNSVVENPE 1200 Score = 29.3 bits (65), Expect = 0.025 Identities = 29/196 (14%), Positives = 64/196 (32%), Gaps = 9/196 (4%) Query: 27 THHKKEESETLQNQKVTGNGLP-----PKPEERWRYIKELESRQPGVRTPTEPSAGGEVM 81 + + T QN++V + E + E + Q T T+ +A E Sbjct: 1053 KNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQ---TTETKETATVEKE 1109 Query: 82 NPNQLTSEQRQLLEQMQADMRQQPTQLNEVPWNEQTPEQRQQTLQRQRQAQQQQWTQTQP 141 ++ +E+ Q + ++ + + + Q V + P + ++ Q Q T Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAE-PARENDPTVNIKEPQSQTNTTADT 1168 Query: 142 VQQPRTQPRVNEQPQTRTVQSAPAQPARQSQPPKQTASQQPYQDLLQTPAHTSAAAPKAA 201 Q + EQP T + ++ A+ QP + + + Sbjct: 1169 EQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVR 1228 Query: 202 PITRAPEAPKTTAEKK 217 + E T++ + Sbjct: 1229 SVPHNVEPATTSSNDR 1244
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 626 bits (1617), Expect = 0.0 Identities = 195/570 (34%), Positives = 319/570 (55%), Gaps = 6/570 (1%) Query: 114 YRARSVCSGSAGGVLTPLSSLDLNALGELPTANDTETEQAALDNGLAML---IKHVEFRQ 170 ++ + + S + L+ N E + D TE L L ++ ++ + Sbjct: 3 HKITGIAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQT 62 Query: 171 LDSDGAASA-ILEAHRSLAGDASLRQHLLDGVL-RGLSCAQAIVESANHFCNEFARASSS 228 S GA A I AH + D L + + ++ A+ E ++ F + F + Sbjct: 63 EASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNE 122 Query: 229 YLQERALDVRDVCFQLLQHIYGEQRFPAPGQLTRPSICMAEELTPSQFLELDKTFLKGLL 288 Y++ERA D+RDV ++L H+ G + + + ++ +AE+LTPS +L+K F+KG Sbjct: 123 YMKERAADIRDVSKRVLGHLIGVET-GSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFA 181 Query: 289 LKSGGTTSHTVILARSFNIPTLVGVEIEALTPWRQQTVYIDGNAGAIVVAPDEPVTRYYQ 348 GG TSH+ I++RS IP +VG + V +DG G ++V P E + Y+ Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241 Query: 349 QEARVQDALREQQRIWLTQEARTADGIRMEVAANIAHSVEAQAAFSNGAEAVGLFRTEML 408 ++ + +++ + + + T DG +E+AANI + +NG E +GL+RTE L Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301 Query: 409 YMDRACAPDENELYNIFCQALESAKGRSIIVRTMDIGGDKPVDYLNIPAEANPFLGYRAV 468 YMDR P E E + + + ++ G+ +++RT+DIGGDK + YL +P E NPFLG+RA+ Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361 Query: 469 RIYEEYASLFTTQLRSILRASAHGNLKIMIPMISSMEEILWVKEKLAEAKQQLRNEHIPF 528 R+ E +F TQLR++LRAS +GNLK+M PMI+++EE+ K + E K +L +E + Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDV 421 Query: 529 DEKIPLGIMLEVPSVMFIIDQCCEEIDFFSIGSNDLTQYLLAVDRDNAKVTRHYNSLNPA 588 + I +GIM+E+PS + +E+DFFSIG+NDL QY +A DR N +V+ Y +PA Sbjct: 422 SDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPA 481 Query: 589 FLRALDFAVQAVHRQGKWIGLCGELGAKGSVLPLLVGLGLDEISMGAPSIPAAKARMAQL 648 LR +D ++A H +GKW+G+CGE+ +PLL+GLGLDE SM A SI A++++ +L Sbjct: 482 ILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKL 541 Query: 649 DSRACRQLLNQAMACRTSLEVEHLLAQFRM 678 + +A+ T+ EVE L+ + + Sbjct: 542 SKEELKPFAQKALMLDTAEEVEQLVKKTYL 571
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 496 bits (1280), Expect = e-180 Identities = 144/357 (40%), Positives = 210/357 (58%), Gaps = 3/357 (0%) Query: 2 QAATVVINRRALRHNLQRLRELAPASKLVAVVKANAYGHGLLETARTLPDADAFGVARLE 61 + ++ +AL+ NL +R+ A +++ +VVKANAYGHG+ + D F + LE Sbjct: 3 RPIQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLNLE 62 Query: 62 EALRLRAGGITQPILLLEGFFDAADLPTISAQCLHTAVHNQEQLAALEAVELAEPVTVWM 121 EA+ LR G PIL+LEGFF A DL L T VH+ QL AL+ L P+ +++ Sbjct: 63 EAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDIYL 122 Query: 122 KLDTGMHRLGVRPEEAEAFYQRLTHCKNVRQPVNIVSHFARADEPECGATEHQLDIFNAF 181 K+++GM+RLG +P+ +Q+L NV + + ++SHFA A+ P+ + Sbjct: 123 KVNSGMNRLGFQPDRVLTVWQQLRAMANVGE-MTLMSHFAEAEHPD--GISGAMARIEQA 179 Query: 182 CQGKPGQRSIAASGGILLWPQSHFDWARPGIILYGVSPLEHKPWGPDFGFQPVMSLTSSL 241 +G +RS++ S L P++HFDW RPGIILYG SP + G +PVM+L+S + Sbjct: 180 AEGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMTLSSEI 239 Query: 242 IAVRDHKAGEPVGYGGTWVSERDTRLGVVAMGYGDGYPRAAPSGTPVLVNGREVPIVGRV 301 I V+ KAGE VGYGG + + + R+G+VA GY DGYPR AP+GTPVLV+G VG V Sbjct: 240 IGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTMTVGTV 299 Query: 302 AMDMICVDLGPNAQDNAGDPVVLWGEGLPVERIAEMTKVSAYELITRLTSRVAMKYI 358 +MDM+ VDL P Q G PV LWG+ + ++ +A YEL+ L RV + + Sbjct: 300 SMDMLAVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRVPVVTV 356
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.0 bits (65), Expect = 0.012 Identities = 12/63 (19%), Positives = 24/63 (38%), Gaps = 14/63 (22%) Query: 133 KAWLEDKTNSNLLIEMVIPQADISFSDSLRLGYERGIILMKEIKKIYPDV-VIDMSVNSA 191 W+ ++ ++V+P + L+ IKK PD+ V+ MS + Sbjct: 40 WRWIAAGDGDLVVTDVVMPDEN-------------AFDLLPRIKKARPDLPVLVMSAQNT 86 Query: 192 ASS 194 + Sbjct: 87 FMT 89
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 266 bits (682), Expect = 5e-87 Identities = 87/422 (20%), Positives = 174/422 (41%), Gaps = 25/422 (5%) Query: 3 IIISLTILIIILTYFIEINSVVHGQGVITTKDNAQLISLSKGGTIQDIYVAEGDTVKKGE 62 I+ ++ IL+ ++ V G +T ++ I + +++I V EG++V+KG+ Sbjct: 63 FIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGD 122 Query: 63 LLAKVVNLDLQKEYQRYRTQKGYLDKDVNEI-------SFILDKENESGLITLDGTRSLS 115 +L K+ L E +TQ L + + S L+K E L +++S Sbjct: 123 VLLKLT--ALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVS 180 Query: 116 NKEVKANIELVHSQIRA-------KELKKTSLDSEISGLQEKLSSKEKELALLAEEINIL 168 +EV L+ Q KEL +E + +++ E + ++ Sbjct: 181 EEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDF 240 Query: 169 SPLVKKGISPYTNFLNKKQAYIKVKSEINDIESSITLKKDDIELVVNDIEALNNELRLSL 228 S L+ K L ++ Y++ +E+ +S + + +I + + + + + Sbjct: 241 SSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI 300 Query: 229 SKIISKNLQELEVVNSTLKVIEKQINEEDIYSPVDGVIYKINKSATTHGGVIQAADLLFE 288 + + + ++ L E++ I +PV + ++ T GGV+ A+ L Sbjct: 301 LDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQL--KVHTEGGVVTTAETLMV 358 Query: 289 IKPKVRTMLADVKILPKYRDQIYVDEAVKLDVQSIIQPKIKSYNATIDNISPDSYEENTG 348 I P+ T+ + K I V + + V++ + + NI+ D+ E+ Sbjct: 359 IVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRL 418 Query: 349 GTIQRYYKVIIAFDVNE----DDLRWLKPGMTVDASVITGKHSIMEYLLSPLMKGVDKAF 404 G + VII+ + N + L GM V A + TG S++ YLLSPL + V ++ Sbjct: 419 GL---VFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESL 475 Query: 405 SE 406 E Sbjct: 476 RE 477
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 49.7 bits (118), Expect = 3e-07 Identities = 49/190 (25%), Positives = 76/190 (40%), Gaps = 30/190 (15%) Query: 96 DSAQVEKKGNGKRRNKKEEEELKKQLDDAENAKK--EADKAK-EEAEKAKEAAEKALNEA 152 A+ + + + L++ LD + AKK EA+ K EE K EA+ ++L Sbjct: 293 LEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRD 352 Query: 153 FEVQNSSK-QIEEMLQNFLADNVAKDNLAQQSDASQQNTQA---KATQASKQNDAEKVLP 208 + +K Q+E Q N + S+AS+Q+ + + +A KQ + Sbjct: 353 LDASREAKKQLEAEHQKLEEQN-------KISEASRQSLRRDLDASREAKKQVEKALEEA 405 Query: 209 QPI-------NKNTSTGK--SNSSKNEEN-KLDAESVKEPLKVTLALAAES----NSGSK 254 NK K + K E KL+AE+ + LK LA AE +G Sbjct: 406 NSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEA--KALKEKLAKQAEELAKLRAGKA 463 Query: 255 DDSITNFTKP 264 DS T KP Sbjct: 464 SDSQTPDAKP 473 Score = 48.1 bits (114), Expect = 9e-07 Identities = 35/136 (25%), Positives = 63/136 (46%), Gaps = 4/136 (2%) Query: 98 AQVEKKGNGKRRNKKEEEELKKQLDDAENAKKEADKAKEEAEKAKEAAEKALNEAFEVQN 157 A+ +K + ++ + L++ LD + AKK+ +KA EEA A EK E E + Sbjct: 365 AEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKK 424 Query: 158 SSKQIEEMLQNFL-ADNVA-KDNLAQQSD--ASQQNTQAKATQASKQNDAEKVLPQPINK 213 +++ + LQ L A+ A K+ LA+Q++ A + +A +Q K +P Sbjct: 425 LTEKEKAELQAKLEAEAKALKEKLAKQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQA 484 Query: 214 NTSTGKSNSSKNEENK 229 + K N +K + Sbjct: 485 PQAGTKPNQNKAPMKE 500 Score = 43.5 bits (102), Expect = 3e-05 Identities = 17/115 (14%), Positives = 42/115 (36%), Gaps = 19/115 (16%) Query: 101 EKKGNGKRRNKKEEEELKKQLDDAENAKKEAD-------KAKEEAEKAKEAAEKALNEAF 153 ++ ++ + + + E + + + A ++L Sbjct: 260 ARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQ-SQVLNANRQSLRRDL 318 Query: 154 EVQNSSK-QIEEMLQNFLADNVAKDNLAQQSDASQQNTQAK---ATQASKQNDAE 204 + +K Q+E Q + + N + S+AS+Q+ + + +A KQ +AE Sbjct: 319 DASREAKKQLEAEHQ-----KLEEQN--KISEASRQSLRRDLDASREAKKQLEAE 366
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 33.1 bits (75), Expect = 0.003 Identities = 15/79 (18%), Positives = 26/79 (32%), Gaps = 2/79 (2%) Query: 291 ELDPREQKRREQF--GEPPPLPAPTPASEQSGGRERTTPPVTTLPADTSSQPPVTGLRSG 348 ++ P++++ EP PT ++ + TT +TSS S Sbjct: 1128 QVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTEST 1187 Query: 349 TLTTPGRPEAVPELQDNTA 367 T+ T PE Sbjct: 1188 TVNTGNSVVENPENTTPAT 1206
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 32.3 bits (73), Expect = 0.003 Identities = 28/125 (22%), Positives = 52/125 (41%), Gaps = 8/125 (6%) Query: 227 KVTSQAVSPLSVATTAKTPRNPFSASESGEKSTVPVQKTQAGPAAKLTSGKVKPSTELAP 286 KV+S V+ L TTA +P + + S + Q+TQ ++K + K++ L P Sbjct: 7 KVSSLFVATL---TTATLVSSPAANALSSKAMDNHPQQTQ---SSKQQTPKIQKGGNLKP 60 Query: 287 APAPSALSVASAPLNKAALGVPLTSSGAVKPGGTVQNSNPPSTVISRTAPVSGKTVFTPG 346 +V ++ + T++G P +Q P T I+ V T+ T Sbjct: 61 LEQREHANVILPNNDRH--QITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNK 118 Query: 347 ALLSS 351 ++ + Sbjct: 119 HVVDA 123
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 67.3 bits (164), Expect = 8e-14 Identities = 67/321 (20%), Positives = 118/321 (36%), Gaps = 30/321 (9%) Query: 254 GMNSDLYDDIRKTIEQMLTPKSGRFWLSAATGTLSVTDTPDVLERIGRYIEYQNKVLSRQ 313 G++S + + + K+ T L VT PDV+ + R I Q + Q Sbjct: 288 GISSTMQSEKQAAKPVAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIA-QLDIRRPQ 346 Query: 314 VQLNIQIVSVNQTRNEQLGLDWGLVYKSLHNFGATLTGSMANASTSAGSAGISILDTATG 373 V + I V LG+ W + F + + ++ AG+ + T + Sbjct: 347 VLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNS---GLPISTAIAGANQYNKDGTVSS 403 Query: 374 NAAKFSGSSLLIKALSEQGNVSMALN--QTDPTANL--TPVAYQLSNQQGVL-------- 421 + A S I A QGN +M L + ++ TP L N + Sbjct: 404 SLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPV 463 Query: 422 -TSSSSTATANVGVTSSQTVTTITTGLFMTMLPFIQENGDVQLQFAFSYTSPPQIEKFIS 480 T S +T+ N+ TV T G+ + + P I E V L+ +S S Sbjct: 464 LTGSQTTSGDNI----FNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTS 519 Query: 481 RDGNTRNDIPNTSTQGLARKVNLRSGQTLVLTGSEQQNLSANKQGT-FTPDNFILGG--- 536 D +T+ + V + SG+T+V+ G +++S D ++G Sbjct: 520 SDLGAT-----FNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFR 574 Query: 537 GQNGTRGRNTLVIMITPVLLR 557 + + L++ I P ++R Sbjct: 575 STSKKVSKRNLMLFIRPTVIR 595
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 33.7 bits (77), Expect = 0.002 Identities = 14/50 (28%), Positives = 19/50 (38%), Gaps = 6/50 (12%) Query: 223 KPAAPARAPHPWASQPPVSLLLGNCWLTREPLFASVAGWRFTDGECVPEG 272 +P A+ W SQ S L C + + GWR +G C P Sbjct: 552 QPLNKAQEVQKWLSQNNKSSYLTQCKMDKS------LGWRVVEGACTPAQ 595
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 53.3 bits (128), Expect = 7e-10 Identities = 50/266 (18%), Positives = 114/266 (42%), Gaps = 15/266 (5%) Query: 105 ALISAGMETGNIPAALMQADKLIVARRRILGQVIFASVFPAALAILSTGLLLANNLALVP 164 A+++AG +G++ A L + R+++ ++ A ++P L +++ ++ +VP Sbjct: 137 AMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVVAIAVVSILLSVVVP 196 Query: 165 TMSKMSDPARWTGAL----GFMNGVAKWSSEWGVASAATAAGLVLLSFWSLPRWRGRLRR 220 + + AL + G++ +G + L + + R+ Sbjct: 197 KVVEQF--IHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFRVMLRQEKRRVSF 254 Query: 221 CADWL-LPW--SVYKDLQGAVFLMNIGALLGSGVQELKALQIL-NGFAPPWLQERIEAAM 276 L LP + + L A + + L S V L+A++I + + + + R+ A Sbjct: 255 HRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVMSNDYARHRLSLAT 314 Query: 277 ECMSEGDSLGRALRNSGYDFPSREAVNYLSLLDKGDGAASLITNYADRWREQALARVARR 336 + + EG SL +AL + FP + ++ ++ S++ AD + +++ Sbjct: 315 DAVREGVSLHKALEQTAL-FP-PMMRHMIASGERSGELDSMLERAADNQDREFSSQM--- 369 Query: 337 ANATKLFSLVLIMSFFLLILMMVMQI 362 A LF +L++S ++L +V+ I Sbjct: 370 TLALGLFEPLLVVSMAAVVLFIVLAI 395
>PilS_PF08805#PilS N terminal Length = 185 Score = 95.0 bits (236), Expect = 7e-27 Identities = 46/193 (23%), Positives = 80/193 (41%), Gaps = 32/193 (16%) Query: 9 RQHQPDRGWGILEHGTIAIGTIIVLAIVGALVWSLWGKK----SVAVEVSNLQTVVTNAQ 64 R+ + D+G ++E + + V+ ++ A + L+ + E +N+ TV+ N + Sbjct: 20 RKKEQDKGATLME----VLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLTVIANMK 75 Query: 65 QLKQAQGGYNFTSGTTMTGTLIQQGGAPKAGWTIQGTASSGTATMWNGYGGQVVLAPVAS 124 LK + + TL QG P + + +A N +GG V + + Sbjct: 76 SLK----FQGRYTDSNYIKTLYAQGLLPS---DMIADTTGASAK--NPWGGSVT---ITT 123 Query: 125 NGFNNGFSVTTQKVPQADCISITTQLGSGGAFSAITINSTDYSDGLVSAEEAGKTCSSDS 184 + F+V VPQ +C+++ L S A S I S S A C+SDS Sbjct: 124 SSDKYSFNVVEANVPQKNCMAMVNALRSSSAISKINNTS-------TSTVSAATVCASDS 176 Query: 185 GMTGNNTLVFTHN 197 NTL F+ + Sbjct: 177 -----NTLTFSTD 184
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 49.8 bits (119), Expect = 2e-09 Identities = 34/143 (23%), Positives = 55/143 (38%), Gaps = 7/143 (4%) Query: 73 PLLERLMSLLFCLFLFRLTLTDAFTGFLPRELTIRCLIAGLVSALIAP--GFIGHFLTAT 130 P L +LL L LT D LP +LT+ L GL+ L+ + A Sbjct: 130 PGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAM 189 Query: 131 TALVIFGVWRYVTFRIHARECLGLGDVWLAGAIAAWLGGREGLYALL----IGVVLFVLW 186 ++ + + +E +G GD L A+ AWLG + LL +G + + Sbjct: 190 AGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGL 249 Query: 187 QISVR-RITEGGPMGPWLCAGAI 208 + ++ P GP+L Sbjct: 250 ILLRNHHQSKPIPFGPYLAIAGW 272
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 31.2 bits (70), Expect = 0.011 Identities = 11/89 (12%), Positives = 25/89 (28%), Gaps = 12/89 (13%) Query: 212 TMHTSIDMGGNNLNNTGTINAVTGNFSGNVA-------ATGNITANGTVTGQNVTAGSNV 264 + + NT T T GN G+++A + + + A Sbjct: 307 STQNTDSQTRTISKNTSTSRTHTSEVHGNAEVHASFFDIGGSVSAGFSNSNSSTVA---- 362 Query: 265 TAGNTITANNDIRSNNGWFITRGSKGWLN 293 ++++ + + LN Sbjct: 363 -IDHSLSLAGERTWAETMGLNTADTARLN 390
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 34.4 bits (79), Expect = 9e-04 Identities = 4/74 (5%), Positives = 29/74 (39%) Query: 76 YRKVQGRLDSLESDNKTLADENKELKKNNTNVDQQISQAVGQVRSEEAQKRAQLSSQVTD 135 + + + ++ + + ++++ + ++ ++E K Q + + Sbjct: 254 VLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313 Query: 136 LSSQVNQLLDQLKN 149 L+ ++ + ++ + Sbjct: 314 LTLELAKNEERQQA 327
>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE chaperone signature. Length = 130 Score = 30.8 bits (69), Expect = 0.008 Identities = 33/122 (27%), Positives = 43/122 (35%), Gaps = 20/122 (16%) Query: 393 YRAAITLLIKAQDKETLDKRYLDLSSKL--LNCGMEPINPEHDIGPLSSYMRALPMCFNP 450 + AIT L + D + K+ C + EH +G + M LP N Sbjct: 4 FEQAITQLFQQLSLSIPDTIEPVIGVKVGEFACHIT----EHPVGQI--LMFTLPSLDN- 56 Query: 451 QMDKHNWYTRLMFVQHFACLAPIYGRDTGTGHPGLTFWNRGGGPLSVDPLNKNDRTQNAH 510 +K + +F Q L PI D GHP L WNR PLN D Sbjct: 57 NDEKETLLSHNIFSQDI--LKPILSWDEVGGHPVL--WNR-------QPLNSLDNNSLYT 105 Query: 511 LL 512 L Sbjct: 106 QL 107
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 29.0 bits (65), Expect = 0.012 Identities = 26/147 (17%), Positives = 60/147 (40%), Gaps = 15/147 (10%) Query: 50 EFRARQRALASERTPALPPELAQLLTGQLALLWQAAVKQAEAGTLAAREQADTDIARADQ 109 E R +L E+ + Q +L L K+AE T+ AR +++R ++ Sbjct: 182 EEVLRLTSLIKEQFSTWQNQKYQK---ELNL----DKKRAERLTVLARINRYENLSRVEK 234 Query: 110 ERDEALAKVTALESELAVLREVVTERDRLLDEVRG----LRAEALPLREQVARLTATGEH 165 R L ++L + A+ + V E++ E +++ + ++ + Sbjct: 235 SR---LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL 291 Query: 166 LAAQLQ-DTKAELKETREDGRALQVEL 191 + + + +L++T ++ L +EL Sbjct: 292 VTQLFKNEILDKLRQTTDNIGLLTLEL 318
>SECA#SecA protein signature. Length = 901 Score = 30.6 bits (69), Expect = 0.019 Identities = 33/131 (25%), Positives = 46/131 (35%), Gaps = 24/131 (18%) Query: 288 EALRELFTESKAPLSLSNTTPNGLDLPKLSSLVDELS------------LTGKGLVMTMG 335 E L+ E +A L N +P+ ++V E S L G G+V+ Sbjct: 41 EELKGKTAEFRARLEKGEVLEN--LIPEAFAVVREASKRVFGMRHFDVQLLG-GMVLNER 97 Query: 336 -----KGGVGKTTVAASVAVLLAKRGHKVHL-TTSDPAAHLSYTLDGSLPN---LQVSRI 386 + G GKT A A L A G VH+ T +D A + L L V Sbjct: 98 CIAEMRTGEGKTLTATLPAYLNALTGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGIN 157 Query: 387 DPKVETERYRR 397 P + R Sbjct: 158 LPGMPAPAKRE 168
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 31.8 bits (72), Expect = 7e-04 Identities = 20/96 (20%), Positives = 38/96 (39%), Gaps = 5/96 (5%) Query: 43 LKKIRNQALPWVVALEEEKVIGYCYLTRYRERYAYRHTLEDSIYIHPDSQRQGTGKALLR 102 + + + + E IG + YA +ED I + D +++G G ALL Sbjct: 57 VSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYAL---IED-IAVAKDYRKKGVGTALLH 112 Query: 103 HVIAWAETHGYRQMIAIVGDSNNEGSLKVHQQVGFT 138 I WA+ + + ++ D N + + + F Sbjct: 113 KAIEWAKENHFCGLMLETQD-INISACHFYAKHHFI 147
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 31.0 bits (70), Expect = 0.009 Identities = 19/77 (24%), Positives = 36/77 (46%), Gaps = 1/77 (1%) Query: 38 IANDTSWGQPLIFSGLTLAMGIMGLISPISGRLLVSMGGRKVLQLGALLNGLGCLLLATS 97 IAND + T M + + + G+L +G +++L G ++N G ++ Sbjct: 40 IANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVG 99 Query: 98 HSLY-IYLMAWLVMGIG 113 HS + + +MA + G G Sbjct: 100 HSFFSLLIMARFIQGAG 116
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 28.8 bits (64), Expect = 0.036 Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 7/64 (10%) Query: 130 PPPPPPVVAKRVESAPRPTEPARNPFKSSDDRLTGVTSSNTVTRPAARASAGAGDKVVIA 189 P P P K+VE R +P + S + + RP + + A K V + Sbjct: 100 KPKPKPKPVKKVEQPKRDVKPVESRPASPFE-------NTAPARPTSSTATAATSKPVTS 152 Query: 190 IDAG 193 + +G Sbjct: 153 VASG 156
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 30.1 bits (68), Expect = 0.027 Identities = 26/161 (16%), Positives = 57/161 (35%), Gaps = 18/161 (11%) Query: 31 VENSLDAGATRVDIDIER---GGAKLIR-IRDNGCGIKKEELALALARHATSKIASLDDL 86 ++ SLD A + ++ I R A++ ++ N G E + A+ + +L++ Sbjct: 5 IQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLNLEEA 64 Query: 87 EAIISLGFRGEAL----------ASISSVSRLTLTSRTAEQAEAWQAYAEGRDMDVTVK- 135 + G++G L I RLT + Q +A Q +D+ +K Sbjct: 65 ITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDIYLKV 124 Query: 136 -PAAHPVGTTLEVLDLFYNTPARRKFMRTEK--TEFNHIDE 173 + +G + + + + + F + Sbjct: 125 NSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEH 165
>SECA#SecA protein signature. Length = 901 Score = 33.3 bits (76), Expect = 0.002 Identities = 26/144 (18%), Positives = 55/144 (38%), Gaps = 6/144 (4%) Query: 282 HVVDAADVRVQENIEAVNTVLEEIDAHEIPTLMVMNKIDMLDDFEPRIDRDEENK-PIRV 340 ++D +DV N + IDA+ P + ++ + + R+ D + PI Sbjct: 665 ELLDVSDVSETINSIREDVFKATIDAYIPPQSL--EEMWDIPGLQERLKNDFDLDLPIAE 722 Query: 341 WLSAQSGVGIPQLFQALTERLSGEVAQHTLRLPPQEGRLRSRFYQLQAIEKEWMEEDGSV 400 WL + + L + + + + + + R + LQ ++ W E ++ Sbjct: 723 WLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAAM 782 Query: 401 SLQVRMPIVDWRRLCKQEPALIEY 424 +R I R +++P EY Sbjct: 783 D-YLRQGIH-LRGYAQKDP-KQEY 803
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 29.0 bits (64), Expect = 0.030 Identities = 18/65 (27%), Positives = 30/65 (46%), Gaps = 3/65 (4%) Query: 225 NRMRAEREAVARRHRSQGQEEAEKLRAAADYEVTK---TLAEAERQGRIMRGEGDAEAAK 281 N+ R + A A+R + + +RAA Y + +A A +G I +G A A+ Sbjct: 220 NKAREQAAAEAKRKAEEQARQQAAIRAANTYAMPANGSVVATAAGRGLIQVAQGAASLAQ 279 Query: 282 LFADA 286 +DA Sbjct: 280 AISDA 284
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 45.3 bits (107), Expect = 3e-07 Identities = 42/225 (18%), Positives = 93/225 (41%), Gaps = 9/225 (4%) Query: 14 LMFGLFVAYLDRSNLSITLPTITHDLNIDGATASIVLTIYLIGYAFSNIFGGVFTQRYDP 73 L F + L+ L+++LP I +D N A+ + V T +++ ++ G + + Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78 Query: 74 KKIVILMVLIWSIATVFVGFTSSVYVILI-CRLVLGITEGIYWPQQSRFASDWFSDKERT 132 K++++ ++I +V S + +LI R + G + + + + R Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138 Query: 133 QANSIIQYYGQFLALGLGFMILSPLDAAFGWRNVFIITGVIGIVVVVPLYITMLKKQEEA 192 +A +I + G+G I + W + +I + ++ VP + +LKK+ Sbjct: 139 KAFGLIGSIVA-MGEGVGPAIGGMIAHYIHWSYLLLIPMI--TIITVPFLMKLLKKEV-- 193 Query: 193 PYYRAPAPTEKTKLTLESLGGTPFLLLIFTYITQGMLFWGITLWI 237 R + + L S+G F+L +Y ++ ++ I Sbjct: 194 ---RIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLI 235
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 37.0 bits (86), Expect = 1e-04 Identities = 44/205 (21%), Positives = 69/205 (33%), Gaps = 55/205 (26%) Query: 3 KNDILITGGHI--IDPARNINEINNLRIINDIIVDANKYPVTSETRIIHADGMIVTPGLI 60 K DI + G I I A N + + II V T +I +G IVT G + Sbjct: 85 KADIGLKDGRIAAIGKAGNPDMQPGVTII-----------VGPGTEVIAGEGKIVTAGGM 133 Query: 61 DYHAHVF-----YDATEGGVRPDMYMPPNGVTTVVDAGSAGTANFDAFYRTVICASKVRI 115 D H H +A +G+T ++ G+ A T I Sbjct: 134 DSHIHFICPQQIEEALM-----------SGLTCMLGGGTGPAHGTLA---TTCTPGPWHI 179 Query: 116 KAFLTVSPPGQTWSQENYDPDNI------DENKIHALFRQYRNVLQGLKLKVQTEDIAEY 169 + + + + P N+ + + AL LKL ED + Sbjct: 180 ARMIE--------AADAF-PMNLAFAGKGNASLPGALVEMVLGGATSLKLH---ED---W 224 Query: 170 GLKP--LTESLRIANDLKCPVAIHS 192 G P + L +A++ V IH+ Sbjct: 225 GTTPAAIDCCLSVADEYDVQVMIHT 249 Score = 29.7 bits (67), Expect = 0.024 Identities = 16/67 (23%), Positives = 26/67 (38%), Gaps = 16/67 (23%) Query: 310 THTPAVLLGMAAEIGTLAPGAFADIAIFKLKNRHVEFADIHGETLTGTHVLVPQMTIKSG 369 T PA+ G++ EIG+L G AD+ ++ V+ P M + G Sbjct: 410 TINPAIAHGLSHEIGSLEVGKRADLVLWNPAFFGVK----------------PDMVLLGG 453 Query: 370 EILFRQI 376 I + Sbjct: 454 TIAAAPM 460
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 45.3 bits (107), Expect = 4e-07 Identities = 69/394 (17%), Positives = 147/394 (37%), Gaps = 32/394 (8%) Query: 30 DTAVISGAIGSLTSYFHLSPAETGWAVSCVVVGCVIGSFSAGYLSKRFGRKKSLMVSALL 89 + V++ ++ + + F+ PA T W + ++ IG+ G LS + G K+ L+ ++ Sbjct: 29 NEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIII 88 Query: 90 FTISAVGTSLSYTFTHFVIY-RIIGGLAVGLAATVSPMYMSEVSPKNMRGRALSMQQFAI 148 +V + ++F +I R I G + + ++ PK RG+A + + Sbjct: 89 NCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIV 148 Query: 149 VFGQILIFYVNYKIASIAADTWLIELGWRYMFAAGIIPCILFCILVFLIPESPRW----- 203 G+ V I + A + W Y+ +I I L+ L+ + R Sbjct: 149 AMGE----GVGPAIGGMIAHY----IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFD 200 Query: 204 -----MMMIGREEETLKILTKISNEEHARHLLADIKTSLQNDQLNAHQKLNYRDGNVRFI 258 +M +G + T + + +++ + ++ G Sbjct: 201 IKGIILMSVGI--VFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPF 258 Query: 259 LILGCMIAMLQQVTGVNVMMYYAPIVLKDVTG-SAQEALFQTIWIGVIQ-LIGSIIGAMI 316 +I ++ V M P ++KDV S E I+ G + +I IG ++ Sbjct: 259 MIGVLCGGIIFGTVAGFVSM--VPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGIL 316 Query: 317 MDKMGRLSLMRKGTIGSIIGLLLTSWALYSQATGYFALFGMLFFMIFYALSWGVGAWVLI 376 +D+ G L ++ G + L + + T +F ++F + + + +I Sbjct: 317 VDRRGPLYVLNIGVTFLSVSFLTA--SFLLETTSWFMTIIIVFVLGGLSFT-----KTVI 369 Query: 377 SEIFPNRMRSQGMSISVGFMWMANFLVSQFFPMI 410 S I + ++ Q + + +FL I Sbjct: 370 STIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 58.4 bits (141), Expect = 3e-11 Identities = 79/383 (20%), Positives = 151/383 (39%), Gaps = 31/383 (8%) Query: 1 MFGYSTAVITGVVLP-LQQYYQLTPTETGWAVSSIVIGCIIGALVGGKIADKLGRKPALL 59 F ++ V LP + + P T W ++ ++ IG V GK++D+LG K LL Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 60 IIAIIFIASSLGAAMSES-FMIFSLSRIVCGFAVGMAGTASTMYMSELAPAEIRGKALGI 118 II S+ + S F + ++R + G + ++ P E RGKA G+ Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 119 YNISVVSGQVIVFIVNYLIAKGMPADVLVSQGWKTMLFAQVVPSIAMLAITLFLPESPAW 178 V G+ + + +IA + W +L ++ I + + L + Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYI--------HWSYLLLIPMITIITVPFLMKLLKKEVRI 195 Query: 179 CARNNRSEA--RSIKVLTRIYSGLTATDVAAIF---------DSMKETVRPQDNVAGGER 227 + S+ ++ + + + I +++ P + G Sbjct: 196 KGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLG-- 253 Query: 228 TNLKSSPVLRYILLVGCCIAVLQQFTGVNVMNYYAPLVLQNSSTEVVMFQTIFIAVCNVV 287 K+ P + +L G + F V+++ Y V Q S+ E+ + ++ Sbjct: 254 ---KNIPFMIGVLCGGIIFGTVAGF--VSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVII 308 Query: 288 GSFIGMILFDRYGRIPIMKIGTIGSIVGLLIASYGLYTHDTGYITIFGILFFMLLFAVSW 347 +IG IL DR G + ++ IG V L AS+ L T T + I+F + + + Sbjct: 309 FGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLET--TSWFMTIIIVFVLGGLSFTK 366 Query: 348 SVGAWVLISEVFPEKIKGFGMGL 370 +V + ++ S + ++ G GM L Sbjct: 367 TVISTIVSSSLKQQE-AGAGMSL 388
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.0 bits (65), Expect = 0.026 Identities = 11/59 (18%), Positives = 23/59 (38%), Gaps = 4/59 (6%) Query: 64 DKDVEVVIITASNEAHADVAVAALNANKYVFCEKP--LAVTAADCQRVIEAEQKNGKRM 120 D+ V++++A N A+ A Y + KP L R + ++ ++ Sbjct: 73 RPDLPVLVMSAQNTF--MTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKL 129
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 41.4 bits (97), Expect = 5e-06 Identities = 31/131 (23%), Positives = 55/131 (41%), Gaps = 6/131 (4%) Query: 240 NERHWDNTGFAMTLFGIAFIAVRFFCAKFPDRYGGATVATFSLLVEGTGLAVMWAAPSAG 299 +W NT F +T + K D+ G + F +++ G + + S Sbjct: 49 ASTNWVNTAFMLTFSIGTAVY-----GKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFF 103 Query: 300 AALIGAAITGCGCSLMFPSLGVEVVRR-VPPEIRGTALGVWSAFQDLAYGFTGPIAGLLT 358 + LI A + FP+L + VV R +P E RG A G+ + + G I G++ Sbjct: 104 SLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIA 163 Query: 359 PFIGYQQVFLL 369 +I + + L+ Sbjct: 164 HYIHWSYLLLI 174
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 32.0 bits (72), Expect = 0.015 Identities = 14/126 (11%), Positives = 31/126 (24%), Gaps = 2/126 (1%) Query: 372 STRKAEAAKKYQTEDFFNQVESKEYVEDALLFYLEKAKAAFPEKECSSPEKVIELLHGQL 431 E A + + +E +A+ A EK ++ Sbjct: 156 RKADLEKALEGAMNFSTADSAKIKTLEAEKA--ALEARQAELEKALEGAMNFSTADSAKI 213 Query: 432 AAKSEQLVRLNATWQTLSQVRATRELIDNDIEQYLDNLNKLLSGQEQKVTQLKSAKAEWK 491 + L A L + + L + E + +L+ A Sbjct: 214 KTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAM 273 Query: 492 KYRASE 497 + ++ Sbjct: 274 NFSTAD 279
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 68.7 bits (168), Expect = 1e-15 Identities = 28/141 (19%), Positives = 48/141 (34%), Gaps = 2/141 (1%) Query: 1 MDSITTLIVEDEPMLAEILVDTIKIFPQFSIVGIADKLESAKKQIRLYQPQLILLDNFLP 60 M T L+ +D+ + +L + V I + + I L++ D +P Sbjct: 1 MTGATILVADDDAAIRTVLNQALSR--AGYDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 DGKGIDLIRHTISTNYTGRIIFITADNHMDTISDALRMGVFDYLIKPVHYQRLQHTLERF 120 D DL+ ++ ++A N T A G +DYL KP L + R Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 121 TRYRSSLRSSEQANQTHVDAL 141 S + + L Sbjct: 119 LAEPKRRPSKLEDDSQDGMPL 139
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 30.2 bits (68), Expect = 0.018 Identities = 19/81 (23%), Positives = 30/81 (37%), Gaps = 13/81 (16%) Query: 104 DATYITVGNEKGQRLYHVNPDEIGKYMEGGDSDDALYNAKSYVSVRKGSLGSSLRGKSPI 163 + + G EK Q L V +E+ KY E G + GS+G + Sbjct: 238 NGAALYYGTEKEQWLREVKVEELRKYYEEG-------------HFKAGSMGPKVLAAIRF 284 Query: 164 QDSTGKVIGIVSVGYTLEQLE 184 + G+ I + +E LE Sbjct: 285 IEWGGERAIIAHLEKAVEALE 305
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 33.6 bits (77), Expect = 0.002 Identities = 15/67 (22%), Positives = 30/67 (44%), Gaps = 4/67 (5%) Query: 504 ASSAPVQAASP----VAPAGAGTPVTAPLAGNIWKVIATEGQTVAEGDVLLILEAMKMET 559 + V+ + + +G + + ++I EG++V +GDVLL L A+ E Sbjct: 75 SVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA 134 Query: 560 EIRAAQA 566 + Q+ Sbjct: 135 DTLKTQS 141 Score = 29.4 bits (66), Expect = 0.047 Identities = 15/56 (26%), Positives = 22/56 (39%), Gaps = 10/56 (17%) Query: 534 KVIATEGQTVAEGDVLLILEAMKMETEIRAAQAGTVRGIAVKSGDAVSVGDTLMTL 589 V G+ G EI+ + V+ I VK G++V GD L+ L Sbjct: 82 IVATANGKLTHSGRSK----------EIKPIENSIVKEIIVKEGESVRKGDVLLKL 127
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 37.9 bits (88), Expect = 2e-05 Identities = 20/102 (19%), Positives = 42/102 (41%), Gaps = 4/102 (3%) Query: 154 NPFTLGHRYLVEQAAAACDWLHLFVVKEDAS--FFSYTDRWALIEQGIGGIDNVTLHSGS 211 +P T GH ++E+ D +++ V++ FS +R I + I + N + S Sbjct: 10 DPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLPNAQVDSFE 69 Query: 212 AYMISRATFPGYFLKEKGV--VDDCHCQIDLQLFREHLAPAL 251 ++ A +G+ + D ++ + + LA L Sbjct: 70 GLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDL 111
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 696 bits (1799), Expect = 0.0 Identities = 243/885 (27%), Positives = 385/885 (43%), Gaps = 67/885 (7%) Query: 3 HYKKFRLSTLAAVVGIVLAVGPENSYAEAPIQFNTRFLDVKDDASLDLSRFSRKGYIMPG 62 H +K RL+ + + A + + A + FN RFL A DLSRF + PG Sbjct: 17 HIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPG 76 Query: 63 SYHLQVLVNQSQIAQDNVITYSVDNNDPDNTYPCLSPELVSLLGLKPEIADKMIWINAGQ 122 +Y + + +N +A +V + D+ PCL+ ++ +GL M + Sbjct: 77 TYRVDIYLNNGYMATRDVTFNTGDSEQ--GIVPCLTRAQLASMGLNTASVSGMNLLADDA 134 Query: 123 CLQPDQL-EGMETQTDLSQSTLTVIIPQAYLEYSDEEWDPPSRWDEGIPGVLFDYNVNSQ 181 C+ + Q D+ Q L + IPQA++ + PP WD GI L +YN + Sbjct: 135 CVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGN 194 Query: 182 WRHAEHDDGDEYDISGNGTVGANLGAWRLRADWQANYRHENDSEDKDNFGSSSEQNWDWN 241 Y N G N+GAWRLR + +Y + S S S+ W Sbjct: 195 SVQNRIGGNSHY-AYLNLQSGLNIGAWRLRDNTTWSYNSSDSS-------SGSKNKWQHI 246 Query: 242 RYYAWRAIPQLRAQLTLGEGSLESDIFDGFNYVGGSLITDDQMLPPNLRGYAPDISGVAR 301 + R I LR++LTLG+G + DIFDG N+ G L +DD MLP + RG+AP I G+AR Sbjct: 247 NTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIAR 306 Query: 302 TNAKVTVTQRGRVIYESQVPAGPFRIQDINET-VSGDLHVKIEEQSGQVQEYDVSTASIP 360 A+VT+ Q G IY S VP GPF I DI SGDL V I+E G Q + V +S+P Sbjct: 307 GTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVP 366 Query: 361 FLTRPGQVRYKLAAGRPQDWDHNMEGGFFTSAEASWGIANGWSLYGGAIGEQDYQALALG 420 L R G RY + AG + + E F + G+ GW++YGG Y+A G Sbjct: 367 LLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFG 426 Query: 421 LGRDLALLGAFSVDVTHSRATLPEGSAYGDGTIQGNSFRASYAKDFDDIDSRLTFAGYRF 480 +G+++ LGA SVD+T + +TLP D G S R Y K ++ + + GYR+ Sbjct: 427 IGKNMGALGALSVDMTQANSTLP-----DDSQHDGQSVRFLYNKSLNESGTNIQLVGYRY 481 Query: 481 SEENYMTMDEFIDTHNDDNDR-----------------QRTGHDKEMYTLTYSQNFSAIN 523 S Y + + + + + + LT +Q Sbjct: 482 STSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-T 540 Query: 524 VNAYINYTHRTYWNQPNQD-SYNLTLSHYFDVGEVRGISLSVNGFRNEYDNERDDGVYVS 582 Y++ +H+TYW N D + L+ F+ +LS + +N + RD + ++ Sbjct: 541 STLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINW---TLSYSLTKNAWQKGRDQMLALN 597 Query: 583 LSIPWGN-----------NRTLSYNGSFSDDNN-SNQVGYYERI--DDRNNYQINAGRAD 628 ++IP+ + + + SY+ S + +N G Y + D+ +Y + G A Sbjct: 598 VNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAG 657 Query: 629 -----NGATLDGYYRHQASYADIDVSANYQEGDYTSDGLNIQGGATLTAKGGALHRTSVN 683 +G+T ++ Y + ++ ++ D + GG A G L + Sbjct: 658 GGDGNSGSTGYATLNYRGGYGNANIGYSHS-DDIKQLYYGVSGGVLAHANGVTLGQPL-- 714 Query: 684 GGSRLLVDVGDEANVPISGYSTPVYTNAFGKAVIVDVNDYYRNLVKIDITQLPEDAEATL 743 + +LV + + + V T+ G AV+ +Y N V +D L ++ + Sbjct: 715 NDTVVLVKAPGAKDAKVENQTG-VRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDN 773 Query: 744 SIAQATLTEGAIGYRRMEVLSGKKAMASIRLRDGGTPPFGAEVYNSRQQQLGIVGEDGSV 803 ++A T GAI + G K + ++ + PFGA V + Q GIV ++G V Sbjct: 774 AVANVVPTRGAIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQV 832 Query: 804 YLIGINPGERLQVTW--EGKTQCEA--ALPDPLPGDLFSGLLLPC 844 YL G+ ++QV W E C A LP L + L C Sbjct: 833 YLSGMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAEC 877
>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature. Length = 167 Score = 38.9 bits (90), Expect = 2e-06 Identities = 36/136 (26%), Positives = 61/136 (44%), Gaps = 10/136 (7%) Query: 43 PPCTVTGGE---VEFGNVLTTKVDGVNYRQAVGYRLSCNGRVSDYLKLQIQGNAVTINGE 99 PPCT+ G+ V+FGN+ VD +SC + S L +++ GN + + Sbjct: 32 PPCTINNGQNIVVDFGNINPEHVDNSRGEVTKNISISCPYK-SGSLWIKVTGNTMGVGQN 90 Query: 100 SVLQTDVDGLGIRL-QTATDGALISPGNTQWLSFQYSGGSGPA-----IEAIPVKNNGVT 153 +VL T++ GI L Q ++ GN ++ + G A ++P +N Sbjct: 91 NVLATNITHFGIALYQGKGMSTPLTLGNGSGNGYRVTAGLDTARSTFTFTSVPFRNGSGI 150 Query: 154 LTGGAFNAGATLVVDY 169 L GG F A++ + Y Sbjct: 151 LNGGDFRTTASMSMIY 166
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 45.0 bits (106), Expect = 2e-08 Identities = 55/184 (29%), Positives = 78/184 (42%), Gaps = 39/184 (21%) Query: 1 MKKI---VLTMLMGGSLAAQ---AADNLKFHGTLISPPNCTINNDQTIDVEFGNLLINKI 54 MKKI L +++G L +Q AADNL F G LI P CT+ N +V +G++ I + Sbjct: 1 MKKIRGLCLPVMLGAVLMSQHVHAADNLTFKGKLIIPA-CTVQN---AEVNWGDIEIQNL 56 Query: 55 DGTRYAQ-------NVPYEITCDSTVRDETMAMTLTLSGSVSD--FNPAAVNTSVAGLGI 105 + Q N PY + TM +T+T +G + P S GL I Sbjct: 57 VQSGGNQKDFTVDMNCPYSLG--------TMKVTITSNGQTGNSILVPNTSTASGDGLLI 108 Query: 106 ELRQNDQ-----------PFTLGS-TITVNEQSIPVLKAIPVKKSGASLKEGGFDATATL 153 L ++ T G T T + I + + K + SL+ G F ATATL Sbjct: 109 YLYNSNNSGIGNAVTLGSQVTPGKITGTAPARKITLYAKLGYKGNMQSLQAGTFSATATL 168 Query: 154 QVDY 157 Y Sbjct: 169 VASY 172
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 37.7 bits (87), Expect = 8e-06 Identities = 41/170 (24%), Positives = 72/170 (42%), Gaps = 16/170 (9%) Query: 14 ILCGALILP--VSAADNLHFSGSLVASPCTLTMQGTGIAEVDFSSLDSSDFTPDGQSARK 71 ++ GA+++ V AADNL F G L+ CT+ AEV++ ++ + G +K Sbjct: 11 VMLGAVLMSQHVHAADNLTFKGKLIIPACTVQN-----AEVNWGDIEIQNLVQSG-GNQK 64 Query: 72 PLVFELTDCDSALSNGVQVTFTGTEATGMRGILAIDSHSGASGIGIGIETLSGVPVGMND 131 ++ C +L ++VT T TG ++ S + G+ I + + +G Sbjct: 65 DFTVDMN-CPYSLGT-MKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGNAV 122 Query: 132 EEGAIFT--LVTGNNALNLNAWVQRL----PGEDLIPGTFFASALVTFEY 175 G+ T +TG +L + L GTF A+A + Y Sbjct: 123 TLGSQVTPGKITGTAPARKITLYAKLGYKGNMQSLQAGTFSATATLVASY 172
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 34.4 bits (79), Expect = 9e-04 Identities = 72/429 (16%), Positives = 140/429 (32%), Gaps = 45/429 (10%) Query: 28 IQALLSVFLGYLAYYIVRNNFTLSTPYLKEQLDLSATQI---GLLSSCMLIAYGISKGVM 84 I L +V L + ++ P L L S G+L + + V+ Sbjct: 8 IVILSTVALDAVGIGLI----MPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVL 63 Query: 85 SSLADKASPKVFMACGLVLCAIVNVGLGFSSAFWIFAALVVFNGLFQGMGVGPSFITIAN 144 +L+D+ + + L A+ + + W+ + G+ G IA+ Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIAD 122 Query: 145 WFPRRERGRVGAFWNISHNVGGGIVA-PIVGAAFAILGSEHWQSASYIVPACVAVIFALI 203 ER R F +S G G+VA P++G A + A + + L Sbjct: 123 ITDGDERAR--HFGFMSACFGFGMVAGPVLGGLMGGFSPH----APFFAAAALNGLNFLT 176 Query: 204 VLVLGKGSPREEGLPSLEQMMPEEKVILKTKNTAKAPENMSAWQIFCTYVLRNKNAWYIS 263 L +PE + + P A ++ + Sbjct: 177 GCFL----------------LPE------SHKGERRPLRREALNPLASFRWARGMTVVAA 214 Query: 264 LVDVFVYMVRFGMISWLPIYLLTVKHFSKEQMSVAFLFFEWA---AIPSTLLAGWLSDKL 320 L+ VF M G + + F + ++ + ++ ++ G ++ +L Sbjct: 215 LMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARL 274 Query: 321 FKGRRMPLAMICMALIFVCLIGYWKSESLLMVTIFAAIVGCLIYVPQFLASVQTMEIVPS 380 + R + L MI ++ L + + + A G + Q + S Q E Sbjct: 275 GERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQG 334 Query: 381 FAVGSAVGLRGFMSYIFGASLGTSLFGVMVDKLGWYGGFYLLMGGIVCCILFCYLSHRGA 440 GS L ++ I G L T+++ + + G+ + G + + L RG Sbjct: 335 QLQGSLAALTS-LTSIVGPLLFTAIYAA---SITTWNGWAWIAGAALYLLCLPAL-RRGL 389 Query: 441 LELERQRQN 449 QR + Sbjct: 390 WSGAGQRAD 398
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 248 bits (635), Expect = 6e-80 Identities = 120/474 (25%), Positives = 192/474 (40%), Gaps = 73/474 (15%) Query: 7 SILLIDDDVDVLDAYTQMLEQAGYRVRGFTHPFEAKEWVKADWEGIVLSDVCMPGCSGID 66 +IL+ DDD + Q L +AGY VR ++ W+ A +V++DV MP + D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 67 LMTLFHQDDDQLPILLITGHGDVPMAVDAVKKGAWDFLQKPVDPGKLLILIEDALRQRRS 126 L+ + LP+L+++ A+ A +KGA+D+L KP D +L+ +I AL + + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 127 VIARRQYCQQTLQVELIGRSEWMNQFRQRLQQLAETDIAVWFYGEHGTGRMTGARYLHQL 186 ++ + Q L+GRS M + + L +L +TD+ + GE GTG+ AR LH Sbjct: 125 RPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183 Query: 187 GRNAKGPFVRYELT--PENAGQLETF-----------------IDQAQGGTLVLSHPEYL 227 G+ GPFV + P + + E F +QA+GGTL L + Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243 Query: 228 TREQQHHLAR-LQSLEHRP----------FRLVGVGSASLVEQAAANQIAAELYYCFAMT 276 + Q L R LQ E+ R+V + L + +LYY + Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303 Query: 277 QIACQSLSQRPDDIEPLFRHYLRKACLRLNHPVPEIAGELLKGIMRRAWPSNVRELANAA 336 + L R +DI L RH++++A + V E L+ + WP NVREL N Sbjct: 304 PLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELENLV 362 Query: 337 ELFAV-----------------------------------GVLPLAETVNPQLL------ 355 + E Q Sbjct: 363 RRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDA 422 Query: 356 LQEPSPLDRRVEEYERQIITEALNIHQGRINEVAEYLQIPRKKLYLRMKKYGLS 409 L DR + E E +I AL +G + A+ L + R L ++++ G+S Sbjct: 423 LPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>OMPTIN#Omptin serine protease signature. Length = 317 Score = 470 bits (1211), Expect = e-171 Identities = 149/320 (46%), Positives = 211/320 (65%), Gaps = 11/320 (3%) Query: 1 MKKHAIAVMMIAIFSESVYAESTLFIPDVSPDSVTTSLSVGVLNGKSRELVYD-TDTGRK 59 M+ + +++ + S +A + +PD++ +S+G L+GK++E VY + GRK Sbjct: 1 MRAKLLGIVLTTPIAISSFASTET--LSFTPDNINADISLGTLSGKTKERVYLAEEGGRK 58 Query: 60 ISQLDWKIKNVATLQGDLSWEPYSFMTLDARGWTSLASGSGYMVDHDWMSSEQPG-WTDR 118 +SQLDWK N A ++G ++W+ +++ A GWT+L S G MVD DWM S PG WTD Sbjct: 59 VSQLDWKFNNAAIIKGAINWDLMPQISIGAAGWTTLGSRGGNMVDQDWMDSSNPGTWTDE 118 Query: 119 SIHPDTSANYANEYDLNVKGWLLQGDNYKAGVTAGYQETRFSWTARGGSYIYDNGR---- 174 S HPDT NYANE+DLN+KGWLL NY+ G+ AGYQE+R+S+TARGGSYIY + Sbjct: 119 SRHPDTQLNYANEFDLNIKGWLLNEPNYRLGLMAGYQESRYSFTARGGSYIYSSEEGFRD 178 Query: 175 YIGNFPHGVRGIGYSQRFEMPYIGLAGDYRINDFECNVLFKYSDWVNAHDNDEHY--MRK 232 IG+FP+G R IGY QRF+MPYIGL G YR DFE FKYS WV + DNDEHY ++ Sbjct: 179 DIGSFPNGERAIGYKQRFKMPYIGLTGSYRYEDFELGGTFKYSGWVESSDNDEHYDPGKR 238 Query: 233 LTFREKTENSRYYGASIDAGYYITSNAKIFAEFAYSKYEEGKGGTQIIDKTSGDTAYFGG 292 +T+R K ++ YY +++AGYY+T NAK++ E A+++ KG T + D + +T+ + Sbjct: 239 ITYRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRVTNKKGNTSLYDHNN-NTSDYSK 297 Query: 293 DAAGIANNNYTVTAGLQYRF 312 + AGI N N+ TAGL+Y F Sbjct: 298 NGAGIENYNFITTAGLKYTF 317
>PF06580#Sensor histidine kinase Length = 349 Score = 29.1 bits (65), Expect = 0.026 Identities = 23/117 (19%), Positives = 45/117 (38%), Gaps = 20/117 (17%) Query: 199 WIIATMVWMFPAAGRAKIVVI-----ILMTWLIALGDTTHIVVGSVEILYLVFNGTLPWS 253 I W+ G+ + V+ I M W +A + L F T P + Sbjct: 61 SFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRL---------LAFINTKPVA 111 Query: 254 DFLWPFALPTLAGNICGGTFIFALMSHAQIRNDMSNKRKEEARLRGERLERERKKAE 310 F P AL ++ N+ TF+++L+ K ++A + ++ ++A+ Sbjct: 112 -FTLPLAL-SIIFNVVVVTFMWSLLYFGWHFF----KNYKQAEIDQWKMASMAQEAQ 162
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 398 bits (1024), Expect = e-144 Identities = 237/251 (94%), Positives = 248/251 (98%) Query: 1 MKLRLSALALGTTLLVGCASSGTEQQGRSDPFEGFNRTMYNFNFNVLDPYVVRPVAVAWR 60 MKLRLSALALGTTLLVGCASSGT+QQGRSDP EGFNRTMYNFNFNVLDPY+VRPVAVAWR Sbjct: 1 MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR 60 Query: 61 DYVPQPARNGLSNFTGNLEEPAIMVNYFLQGDPYQGMVHFTRFFLNTLLGMGGFIDVAGM 120 DYVPQPARNGLSNFTGNLEEPA+MVNYFLQGDPYQGMVHFTRFFLNT+LGMGGFIDVAGM Sbjct: 61 DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM 120 Query: 121 ANPKLQRVEPHRFGSTLGHYGVGYGPYMQLPFYGSFTLREDGGDMADTLYPVLSWLTWPM 180 ANPKLQR EPHRFGSTLGHYGVGYGPY+QLPFYGSFTLR+DGGDMAD LYPVLSWLTWPM Sbjct: 121 ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPVLSWLTWPM 180 Query: 181 SIGKWTIEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGKLKPQENPNAQA 240 S+GKWT+EGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGG+LKPQENPNAQA Sbjct: 181 SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA 240 Query: 241 IQDELKEIDSE 251 IQD+LK+IDSE Sbjct: 241 IQDDLKDIDSE 251
>PERTACTIN#Pertactin signature. Length = 922 Score = 28.9 bits (64), Expect = 0.019 Identities = 19/60 (31%), Positives = 22/60 (36%), Gaps = 4/60 (6%) Query: 99 PIPVETPKPKPVEKPKPQPKPQQPVVAASTPTPAPQPATDDKPAPTGKAYVVQLGALKNA 158 P P P+P P P+P PQ P P QP P G+ L A NA Sbjct: 569 PAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRE----LSAAANA 624 Score = 28.5 bits (63), Expect = 0.022 Identities = 16/49 (32%), Positives = 17/49 (34%) Query: 106 KPKPVEKPKPQPKPQQPVVAASTPTPAPQPATDDKPAPTGKAYVVQLGA 154 K P KP PQP PQ P P P P +A Q A Sbjct: 566 KAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPA 614
>ANTHRAXTOXNA#Anthrax toxin LF subunit signature. Length = 800 Score = 33.6 bits (76), Expect = 0.002 Identities = 13/37 (35%), Positives = 24/37 (64%), Gaps = 2/37 (5%) Query: 469 KDVDQQYLDFLDSLRND-DAKAVLFQNEM-ENLEMHN 503 K +D ++L+ + SL +D D+ +LF + E LE++N Sbjct: 186 KSLDPEFLNLIKSLSDDSDSSDLLFSQKFKEKLELNN 222
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 348 bits (894), Expect = e-118 Identities = 121/371 (32%), Positives = 185/371 (49%), Gaps = 24/371 (6%) Query: 122 NMSGVRRLQEQVVELNQLLYADHHE---KHHAIITENPEMLSNIAKAKRLAASNIPVTIV 178 +++ + + + + + + + ++ + M RL +++ + I Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMIT 166 Query: 179 GETGTGKELFSRLIHQCSKRANKPFIALNCGALPPTLIESTLFGTVRGAYTGAENS-QGY 237 GE+GTGKEL +R +H KR N PF+A+N A+P LIES LFG +GA+TGA+ G Sbjct: 167 GESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGR 226 Query: 238 LELANGGTLFLDELNAMPIEMQSKLLRFLQDKTFWRLGGQQQLHSDVRIVAAMNEAPVKL 297 E A GGTLFLDE+ MP++ Q++LLR LQ + +GG+ + SDVRIVAA N+ + Sbjct: 227 FEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQS 286 Query: 298 IQQERLRADLFYRLSVGMLTLPPLRARPEDIPLLANYFIDKYRNDVPQDIHGLSETARAD 357 I Q R DL+YRL+V L LPPLR R EDIP L +F+ + + D+ + A Sbjct: 287 INQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALEL 345 Query: 358 LLNHAWPGNVRMLENAIVRSMIMQEKDGLLKHIIF-------------------EQDELN 398 + H WPGNVR LEN + R + +D + + II ++ Sbjct: 346 MKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSIS 405 Query: 399 LGVPETAPENPLPSSPDPQYEGSLEVRVANYERHLIETALDTHQGNIAAAARSLNVSRTT 458 V E + G + +A E LI AL +GN AA L ++R T Sbjct: 406 QAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNT 465 Query: 459 LQYKVQKYAIR 469 L+ K+++ + Sbjct: 466 LRKKIRELGVS 476
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 31.7 bits (72), Expect = 0.006 Identities = 23/133 (17%), Positives = 47/133 (35%), Gaps = 20/133 (15%) Query: 87 VLKAIRDAGICAEANSQYEVRKCLEIGFRGDQIVFNGVVKKPADLEYAIANDLYLINVDS 146 + AI A N + E E G++G ++ G DLE + L Sbjct: 46 IWSAIGATDGFALLNLE-EAITLRERGWKGPILMLEGFFH-AQDLEIYDQHRLTT----C 99 Query: 147 LYELEHIDAIS-RKLKKVANVCVRVEPNVPSATHAELVTAFHAKSGLDLEQAEETCRRIL 205 ++ + A+ +LK ++ ++V + + + G ++ +++ Sbjct: 100 VHSNWQLKALQNARLKAPLDIYLKVN------------SGMN-RLGFQPDRVLTVWQQLR 146 Query: 206 AMPYVHLRGLHMH 218 AM V L H Sbjct: 147 AMANVGEMTLMSH 159
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 30.6 bits (69), Expect = 0.012 Identities = 36/179 (20%), Positives = 68/179 (37%), Gaps = 12/179 (6%) Query: 25 ILYFFNYMDRVNIGFAALRMNESLGITPEDFANISSIFFISYLIFQIPSSIGLQKLGARK 84 IL FF+ ++ + + + + P +++ F +++ I +LG ++ Sbjct: 21 ILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80 Query: 85 W--ISSIIIGWGAVTGLIFFAKDTQHIL-LARIFLGVFEAGFFPGMVYYLACWFPARERG 141 II +G+V G F +L +AR G A F ++ +A + P RG Sbjct: 81 LLLFGIIINCFGSVIG--FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138 Query: 142 KVNSFFMLSIAVASVLAAPMSGWIIEHLNTPDYEGWRWLFAIEGIPTVFLGILTFYLLP 200 K +A+ + + G I ++ W +L I I T+ LL Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYI------HWSYLLLIPMI-TIITVPFLMKLLK 190
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 79.9 bits (197), Expect = 1e-17 Identities = 29/104 (27%), Positives = 47/104 (45%) Query: 827 ILVVDDHPINRRLLADQLGSLGYQCKTANDGVDALNVLSKNAIDIVLSDVNMPNMDGYRL 886 ILV DD R +L L GY + ++ ++ D+V++DV MP+ + + L Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 887 TQRIRQLGLTLPVVGVTANALAEEKQRCLESGMDSCLSKPVTLD 930 RI++ LPV+ ++A + E G L KP L Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 47.9 bits (114), Expect = 1e-08 Identities = 27/145 (18%), Positives = 60/145 (41%), Gaps = 20/145 (13%) Query: 1 MNNMNVIIADDHPIVLFGIRKSLEQIEWVNVVGEFEDSTALINNLPKLDAHVLITDLSMP 60 M +++ADD + + ++L + + + ++ L + D +++TD+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 GDKYGDGITLIKYIKRHFPSLSIIVLTMNNNPAILSAVLDLDIEGIVLKQGA------PT 114 + L+ IK+ P L ++V++ N +A+ ++GA P Sbjct: 59 D---ENAFDLLPRIKKARPDLPVLVMSAQNTFM--TAIKA-------SEKGAYDYLPKPF 106 Query: 115 DLPKALAALQKGKKFTPESVSRLLD 139 DL + + + + S+L D Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLED 131
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 50.5 bits (121), Expect = 5e-09 Identities = 32/129 (24%), Positives = 56/129 (43%), Gaps = 20/129 (15%) Query: 106 AMMVHIRHTAHSQ-LPEAITQAVIGRPINFQGLGGDDANRQAQGILERAAKRAGFQEVVF 164 M+ H HS + ++ P+ R+A + +A+ AG +EV Sbjct: 89 KMLQHFIKQVHSNSFMRPSPRVLVCVPVGA-----TQVERRA---IRESAQGAGAREVFL 140 Query: 165 QYEPVAAGLDYEATLREEKRVLVVDIGGGTTDCSMLLMGPQWRQRADRENSLLGHSGCRV 224 EP+AA + + E +VVDIGGGTT+ +++ + ++ S R+ Sbjct: 141 IEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN-----------GVVYSSSVRI 189 Query: 225 GGNDLDIAL 233 GG+ D A+ Sbjct: 190 GGDRFDEAI 198 Score = 30.9 bits (70), Expect = 0.010 Identities = 20/65 (30%), Positives = 33/65 (50%), Gaps = 11/65 (16%) Query: 351 ALDQPLARILEQLQLAMDSAQEKPDV--------IYLTGGSARSPLIKKALSEQLPGIPV 402 AL +PL I+ + +A++ Q P++ + LTGG A + + L E+ GIPV Sbjct: 259 ALQEPLTGIVSAVMVALE--QCPPELASDISERGMVLTGGGALLRNLDRLLMEET-GIPV 315 Query: 403 AGGDD 407 +D Sbjct: 316 VVAED 320
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 367 bits (942), Expect = e-132 Identities = 191/235 (81%), Positives = 208/235 (88%), Gaps = 7/235 (2%) Query: 1 MSNELPWQVWTPDDLAPPPETFVPVEADNVTLTDDTPEPELTTEQQLEQELAQLKIQAHE 60 MS+ LPW+ WTPDDLAPP FVP+ T+ ++ E LEQ+LAQL++QAHE Sbjct: 1 MSDNLPWKTWTPDDLAPPQAEFVPIVEPEETIIEEA-------EPSLEQQLAQLQMQAHE 53 Query: 61 QGYNAGLAEGRQKGHAQGYQEGLAQGLEQGQAQAQTQQAPIHARMQQLVSEFQNTLDALD 120 QGY AG+AEGRQ+GH QGYQEGLAQGLEQG A+A++QQAPIHARMQQLVSEFQ TLDALD Sbjct: 54 QGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALD 113 Query: 121 SVIASRLMQMALEAARQVIGQTPAVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRV 180 SVIASRLMQMALEAARQVIGQTP VDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRV Sbjct: 114 SVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRV 173 Query: 181 EEMLGATLSLHGWRLRGDPTLHHGGCKVSADEGDLDASVATRWQELCRLAAPGVL 235 ++MLGATLSLHGWRLRGDPTLH GGCKVSADEGDLDASVATRWQELCRLAAPGV+ Sbjct: 174 DDMLGATLSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 339 bits (870), Expect = e-118 Identities = 114/329 (34%), Positives = 196/329 (59%), Gaps = 2/329 (0%) Query: 1 MSNLSGTDKSVILLMTIGEDRAAEVFKHLSTREVQALSTAMANVRQISNKQLTDVLSEFE 60 +S L+G K+ ILL++IG + +++VFK+LS E+++L+ +A + I+++ +VL EF+ Sbjct: 12 VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK 71 Query: 61 QEAEQFAALNINANEYLRSVLVKALGEERASSLLEDILETRDTTSGIETLNFMEPQSAAD 120 + + +Y R +L K+LG ++A ++ + L + + E + +P + + Sbjct: 72 ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINN-LGSALQSRPFEFVRRADPANILN 130 Query: 121 LIRDEHPQIIATILVHLKRSQAADILALFDERLRHDVMLRIATFGGVQPAALAELTEVLN 180 I+ EHPQ IA IL +L +A+ IL+ ++ +V RIA P + E+ VL Sbjct: 131 FIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLE 190 Query: 181 GLLDGQ-NLKRSKMGGVRTAAEIINLMKTQQEEAVITAVREFDGELAQKIIDEMFLFENL 239 L + + GGV EIIN+ + E+ +I ++ E D ELA++I +MF+FE++ Sbjct: 191 KKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDI 250 Query: 240 VDVDDRSIQRLLQEVDSESLLIALKGAEPPLREKFLRNMSQRAADILRDDLANRGPVRLS 299 V +DDRSIQR+L+E+D + L ALK + P++EK +NMS+RAA +L++D+ GP R Sbjct: 251 VLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRK 310 Query: 300 QVENEQKAILLIVRRLAETGEMVIGSGED 328 VE Q+ I+ ++R+L E GE+VI G + Sbjct: 311 DVEESQQKIVSLIRKLEEQGEIVISRGGE 339
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 740 bits (1911), Expect = 0.0 Identities = 528/530 (99%), Positives = 530/530 (100%) Query: 2 AGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPYRFANGSGAIEVPA 61 AGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPYRFANGSGAIEVPA Sbjct: 30 AGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPYRFANGSGAIEVPA 89 Query: 62 DKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRALEGELARTIETLGPV 121 DKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRALEGELARTIETLGPV Sbjct: 90 DKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRALEGELARTIETLGPV 149 Query: 122 KSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNV 181 KSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNV Sbjct: 150 KSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNV 209 Query: 182 TLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRIQRRIEAILSPIVGNGNVHAQVTAQL 241 TLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRIQRRIEAILSPIVGNGNVHAQVTAQL Sbjct: 210 TLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRIQRRIEAILSPIVGNGNVHAQVTAQL 269 Query: 242 DFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGALSNQPAPPNEAPIAT 301 DFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGALSNQPAPPNEAPIAT Sbjct: 270 DFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGALSNQPAPPNEAPIAT 329 Query: 302 PPTNQQNAQNTPQTSTSTNSNNAGPRNTQRNETSNYEVDRTIRHTKMNVGDIERLSVAVV 361 PPTNQQNAQNTPQTSTSTNSN+AGPR+TQRNETSNYEVDRTIRHTKMNVGDIERLSVAVV Sbjct: 330 PPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRHTKMNVGDIERLSVAVV 389 Query: 362 VNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDKRGDTLNVVNSPFSAVDNTGGELPFW 421 VNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDKRGDTLNVVNSPFSAVDNTGGELPFW Sbjct: 390 VNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDKRGDTLNVVNSPFSAVDNTGGELPFW 449 Query: 422 QQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRVEEAKAAQEQAQVRQETEEAVEV 481 QQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRVEEAKAAQEQAQVRQETEEAVEV Sbjct: 450 QQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRVEEAKAAQEQAQVRQETEEAVEV 509 Query: 482 RLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVVALVIRQWMSNDHE 531 RLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVVALVIRQWMSNDHE Sbjct: 510 RLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVVALVIRQWMSNDHE 559
>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature. Length = 103 Score = 114 bits (286), Expect = 1e-36 Identities = 90/103 (87%), Positives = 96/103 (93%) Query: 2 AAIQGIEGVISQLQATAMAARGQDTHSQSTVSFAGQLHAALDRISDRQTAARVQAEKFTL 61 +AIQGIEGVISQLQATAM+AR Q++ Q T+SFAGQLHAALDRISD QTAAR QAEKFTL Sbjct: 1 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 60 Query: 62 GEPGIALNDVMADMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 104 GEPG+ALNDVM DMQKASVSMQMGIQVRNKLVAAYQEVMSMQV Sbjct: 61 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103
>PF01206#SirA family protein Length = 76 Score = 92.5 bits (230), Expect = 6e-29 Identities = 16/71 (22%), Positives = 37/71 (52%) Query: 7 DYRLDMVGEPCPYPAVATLEAMPQLKKGEILEVVSDCPQSINNIPLDARNHGYTVLDIQQ 66 D LD G CP P + + + + GE+L V++ P S+ + ++ G+ +L+ ++ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 67 DGPTIRYLIQK 77 + T + +++ Sbjct: 65 EDGTYHFRLKR 75
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 29.8 bits (67), Expect = 0.021 Identities = 10/53 (18%), Positives = 17/53 (32%), Gaps = 2/53 (3%) Query: 164 RFTLLPMFRIPVKMQKVSAASPLTQKPDQARRRF--RLGMLVFIGMIGWALLT 214 R L R + + + A L + P R R M + ++L Sbjct: 26 RKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLG 78
>PF05844#YopD protein Length = 295 Score = 31.9 bits (72), Expect = 0.002 Identities = 12/28 (42%), Positives = 21/28 (75%), Gaps = 2/28 (7%) Query: 76 MDLLALLYRLMAKSRQQGMFSLERDIEN 103 ++LL +L+R+ K+R+ G+ L+RD EN Sbjct: 74 VELLLILFRIAQKARELGV--LQRDNEN 99
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 42.2 bits (99), Expect = 1e-06 Identities = 25/118 (21%), Positives = 46/118 (38%), Gaps = 11/118 (9%) Query: 162 FKTGSAEVEPYMRDILRAIAPVL---NGIPNRISLAGHTDDFPYANGEKGYSNWELSADR 218 F A ++P + L + L + + + G+TD G Y N LS R Sbjct: 223 FNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRI----GSDAY-NQGLSERR 277 Query: 219 ANASRRELVAGGLDNGKVLRVVGMAATMRLSDRGPDDAINRR--ISLLVLNKQAEQAI 274 A + L++ G+ K+ GM + ++ D+ R I L +++ E + Sbjct: 278 AQSVVDYLISKGIPADKI-SARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334
>PF06580#Sensor histidine kinase Length = 349 Score = 41.8 bits (98), Expect = 7e-06 Identities = 23/151 (15%), Positives = 49/151 (32%), Gaps = 52/151 (34%) Query: 378 ELDKSLIERIIDPLT--HLVRNSLDHGIEMPEKRLEAGKNVVGNLILSAEHQGGNICIEV 435 +++ ++++ + P+ LV N + HGI G ++L G + +EV Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEV 296 Query: 436 TDDGAGLNRERILAKAMSQGMAVNENMTDDEVGMLIFAPGFSTAEQVTDVSGRGVGMDVV 495 + G+ + G G+ V Sbjct: 297 ENTGSLALKNTK--------------------------------------ESTGTGLQNV 318 Query: 496 KRNIQEMGG---HVEIQSKQGSGTTIRILLP 523 + +Q + G +++ KQG +L+P Sbjct: 319 RERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 65.2 bits (159), Expect = 7e-14 Identities = 31/142 (21%), Positives = 62/142 (43%), Gaps = 6/142 (4%) Query: 1 MSKIRVLSVDDSALMRQIMTEIINSHSDMEMVATAPDPLVARDLIKKFNPDVLTLDVEMP 60 M+ +L DD A +R ++ + ++ V + I + D++ DV MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 RMDGLDFLEKLMRLRPMPVVMVSSLTGKGS-EVTLRALELGAIDFVTKPQLGIREGMLAY 119 + D L ++ + RP V+V ++ + + ++A E GA D++ KP + E + Sbjct: 59 DENAFDLLPRIKKARPDLPVLV--MSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGII 115 Query: 120 SEMIAEKVRTAARARIAAHKPM 141 +AE R ++ + M Sbjct: 116 GRALAEPKRRPSKLEDDSQDGM 137
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 88.7 bits (220), Expect = 7e-24 Identities = 29/105 (27%), Positives = 51/105 (48%), Gaps = 3/105 (2%) Query: 7 KFLVVDDFSTMRRIVRNLLKELGFNNVEEAEDGVDALNKLQAGGFGFIISDWNMPNMDGL 66 LV DD + +R ++ L G++ V + + AG +++D MP+ + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYD-VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 67 ELLKTIRADSAMSALPVLMVTAEAKKENIIAAAQAGASGYVVKPF 111 +LL I+ LPVL+++A+ I A++ GA Y+ KPF Sbjct: 64 DLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 420 bits (1082), Expect = e-149 Identities = 101/351 (28%), Positives = 179/351 (50%), Gaps = 14/351 (3%) Query: 7 DDKTEAPTPHRLEKAREEGQIPRSRELTSLLILLVGVCIIWFGGESLARQLAGMLSAGLH 66 +KTE PTP ++ AR++GQ+ +S+E+ S +++ ++ + + ++ + Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLML--IP 60 Query: 67 FDHRMVNDPNLILGQIILLIKAAMMALLPLIAGVVLVALISPVMLGGLIFSGKSLQPKFS 126 + + + + ++ PL+ L+A+ S V+ G + SG++++P Sbjct: 61 AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIK 120 Query: 127 KLNPLPGIKRMFSAQTGAELLKAVLKSTLVGCVTGFYLWHHWPQMMRLMAESPIVAMGNA 186 K+NP+ G KR+FS ++ E LK++LK L+ + + + +++L P + Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQL----PTCGIECI 176 Query: 187 LDLVGLCALLVVLGVIPMVGF------DVFFQIFSHLKKLRMSRQDIRDEFKESEGDPHV 240 L+G +L L VI VGF D F+ + ++K+L+MS+ +I+ E+KE EG P + Sbjct: 177 TPLLG--QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEI 234 Query: 241 KGKIRQMQRAAAQRRMMEDVPKADVIVTNPTHYSVALQYDENKMSAPKVVAKGAGLIALR 300 K K RQ + R M E+V ++ V+V NPTH ++ + Y + P V K Sbjct: 235 KSKRRQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQT 294 Query: 301 IREIGAEHRVPTLEAPPLARALYRHAEIGQQIPGQLYAAVAEVLAWVWQLK 351 +R+I E VP L+ PLARALY A + IP + A AEVL W+ + Sbjct: 295 VRKIAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQN 345
>INTIMIN#Intimin signature. Length = 939 Score = 247 bits (631), Expect = 5e-75 Identities = 126/444 (28%), Positives = 216/444 (48%), Gaps = 24/444 (5%) Query: 22 SFSLSLLLLTASGTIRAQAQDPFTQNRL----PDLGMMPESHEGEKHFAEMAKAFGEASM 77 F S L L S + A N+L PD+ + + ++A A + + Sbjct: 118 PFEYSALPLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDDKALNYAAQQAASLGSQL 177 Query: 78 KNNDLDTGEQARQFAFGQVRDVVSEQVNQQLESWLSAWGSASVDINVDNEGHFNGSRGSW 137 ++ L+ G+ A+ A G + Q + QL++WL +G+A V++ N F+GS + Sbjct: 178 QSRSLN-GDYAKDTALG----IAGNQASSQLQAWLQHYGTAEVNLQSGNN--FDGSSLDF 230 Query: 138 FIPLQDKQRYLTWSQLGLTQQTDGLVSNIGVGQRWAQDGWLLGYNTFYDNLLDENLQRAG 197 +P D ++ L + Q+G +N+G GQR+ +LGYN F D + R G Sbjct: 231 LLPFYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLG 290 Query: 198 FGAEAWGEYLRLSANYYQPFADWQT--HTATLEQRMARGYDINAQMRLPFYQHINTSVSL 255 G E W +Y + S N Y + W + ++R A G+DI LP Y + + Sbjct: 291 IGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMY 350 Query: 256 EQYFGDSVDLFDSGTGYHNPVALKLGLNYTPVPLLTMTAQHKQGESGVSQNNLGLTLNYR 315 EQY+GD+V LF+S NP A +G+NYTP+PL+TM ++ G + + Y+ Sbjct: 351 EQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQ 410 Query: 316 FGVPLKKQLAASEVAQSQSLRGSRYDTPQRNSLPTMEYRQRKTLTVFLATPPWDLTPGET 375 F P +Q+ V + ++L GSRYD QRN+ +EY+++ L++ + + T T Sbjct: 411 FDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNI-PHDINGTERST 469 Query: 376 VALKLQVRSVHGIRHLSWQGDTQALSLTAG----TDNRSTEGWTIIMPAWDHREGAANRW 431 ++L V+S +G+ + W D AL G + ++S + + I+PA+ +G +N + Sbjct: 470 QKIQLIVKSKYGLDRIVW--DDSALRSQGGQIQHSGSQSAQDYQAILPAY--VQGGSNVY 525 Query: 432 RLSVVVEDEKGQRVSSNEITLALT 455 +++ D G SSN + L +T Sbjct: 526 KVTARAYDRNGN--SSNNVLLTIT 547
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 72.2 bits (177), Expect = 6e-17 Identities = 33/117 (28%), Positives = 56/117 (47%), Gaps = 2/117 (1%) Query: 7 ATILLIDDHPMLRTGVKQLVSMAPDISVVGEASNGEQGIDLAESLDPDLILLDLNMPGMN 66 ATIL+ DD +RT + Q +S A V SN + D DL++ D+ MP N Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 67 GLETLDKLREKALSGRIVVFSVSNHEEDVVTALKRGADGYLLKDMEPEDLLKALQQA 123 + L ++++ ++V S N + A ++GA YL K + +L+ + +A Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118
>PF06580#Sensor histidine kinase Length = 349 Score = 51.4 bits (123), Expect = 4e-09 Identities = 30/123 (24%), Positives = 54/123 (43%), Gaps = 17/123 (13%) Query: 473 SARFGFTVKLDYQLPPRL----VPSHQAIHLLQIAREALSNALKH-----SHADDVVVTV 523 S +F ++ + Q+ P + VP L+Q E N +KH +++ Sbjct: 233 SIQFEDRLQFENQINPAIMDVQVPPM----LVQTLVE---NGIKHGIAQLPQGGKILLKG 285 Query: 524 TQCGKQVKLKVQDNGCGVPENAERSNHYGMIIMRDRAQSLRG-DCQVRRRETGGTEVTVT 582 T+ V L+V++ G +N + S G+ +R+R Q L G + Q++ E G + Sbjct: 286 TKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345 Query: 583 FIP 585 IP Sbjct: 346 LIP 348
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 29.8 bits (67), Expect = 0.026 Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 1/58 (1%) Query: 128 TPFSTFIIISLLCGFAGANF-ASSMANISFFFPKQKQGGALGLNGGLGNMGVSVMQLV 184 + FS I+ + G A F A M ++ + PK+ +G A GL G + MG V + Sbjct: 101 SFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAI 158
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 27.5 bits (61), Expect = 0.044 Identities = 20/105 (19%), Positives = 46/105 (43%), Gaps = 7/105 (6%) Query: 40 LVEVRSNSARALAEKKQLSR-RIEQATAQQIEWQEKAELA-LRKDKDDLARAALIEKQKL 97 + + R + +L K+ +++ + + + +E EL + + + L K++ Sbjct: 232 VEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAV--NELRVYKSQLEQIESEILSAKEEY 289 Query: 98 TDLIATLEQEVTLVDDTLARMKKEIGELENKLSETRARQQALMLR 142 + + E+ D L + IG L +L++ RQQA ++R Sbjct: 290 QLVTQLFKNEIL---DKLRQTTDNIGLLTLELAKNEERQQASVIR 331
>MPTASEINHBTR#Metalloprotease inhibitor signature. Length = 122 Score = 25.7 bits (56), Expect = 0.015 Identities = 6/43 (13%), Positives = 14/43 (32%) Query: 30 AGRGELSQSEQQRLLQLTDDAQRMRERIQALEDILDAEHPNWR 72 AG+ + + + A + + E L + +W Sbjct: 37 AGQLGIEATGSGVCAGPAEQANALAGDVACAEQWLGDKPVSWS 79
>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE chaperone signature. Length = 130 Score = 28.5 bits (63), Expect = 0.010 Identities = 16/34 (47%), Positives = 20/34 (58%), Gaps = 2/34 (5%) Query: 44 LKNQDPTNPLQNNELTTQLAQISTVSGIEKLNTT 77 L N+ P N L NN L TQL + V G E+L T+ Sbjct: 89 LWNRQPLNSLDNNSLYTQLEML--VQGAERLQTS 120
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 67.5 bits (165), Expect = 9e-15 Identities = 70/370 (18%), Positives = 123/370 (33%), Gaps = 71/370 (19%) Query: 1 MKVLVTGATSGLGRNAVEFLRNKGISVRA---------TGRNEAMGKLLEKMGAEFVHAD 51 MK LVTGA +G + + L G V +A +LL + G +F D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 52 LTELVSSQAKVMLAGIDTLWHCS-------SFTSPWGTQQAFDLANVRATRRLGEWAVAW 104 L + + ++ S +P A+ +N+ + E Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPH----AYADSNLTGFLNILEGCRHN 116 Query: 105 GVRNFIHISSPSLYFDYHHHRDIKEDFRPHRFANEFARSKAAGEEVINLLAQANPQT--- 161 +++ ++ SS S+Y + D + +A +K A E L+A Sbjct: 117 KIQHLLYASSSSVYGL-NRKMPFSTDDSVDHPVSLYAATKKANE----LMAHTYSHLYGL 171 Query: 162 RFTVLRPQSLFGPHDK--VFIPRLAHMMHHYGSVLLPHGGSALVDMTYYENAIHAMWLAS 219 T LR +++GP + + + + M S+ + + G D TY ++ A+ Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231 Query: 220 QPGCDHLPS--------------GRAYNITNGENRTLRSIVQKLIDELTIDCRIRSVPYP 265 R YNI N L +Q L D L I+ + +P Sbjct: 232 DVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQ 291 Query: 266 MLDMIARSMERFGKKSAKEPPLTHYGVSKLNFDFTLDTTRAQEELGYQPIITLDEGIERT 325 D+ T DT E +G+ P T+ +G++ Sbjct: 292 PGDV----------------LETS-----------ADTKALYEVIGFTPETTVKDGVKNF 324 Query: 326 ADWLRDHGNL 335 +W RD + Sbjct: 325 VNWYRDFYKV 334
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.006 Identities = 16/50 (32%), Positives = 22/50 (44%), Gaps = 1/50 (2%) Query: 33 LVLLGPSGAGKSSLLRVLNLLEMPRSGTLTIAGNHFDFTKTPSDKAIREL 82 +VL G G GKS+L+ L L+ S T G D + + EL Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDF-FSDTHFDIGTGKDSYEQIAGIVAYEL 647
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 30.9 bits (69), Expect = 0.004 Identities = 34/119 (28%), Positives = 50/119 (42%), Gaps = 31/119 (26%) Query: 83 FDAVMAG--MDITPEREKQVLFTTPYYDNSALFVGQQGKYTSVDQLKGKKVGVQNGTTHQ 140 D+V+A M + E +QV+ TP DNSAL + QL Q Sbjct: 112 LDSVIASRLMQMALEAARQVIGQTPTVDNSALI-------KQIQQL-----------LQQ 153 Query: 141 KFIMDKHPEITTVPYDSYQNAKLDLQNGRIDAVFGDTAVVTEW-LKANPKLAPVGDKVT 198 + + P++ P DLQ R+D + G T + W L+ +P L P G KV+ Sbjct: 154 EPLFSGKPQLRVHPD--------DLQ--RVDDMLGATLSLHGWRLRGDPTLHPGGCKVS 202
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 31.3 bits (71), Expect = 0.011 Identities = 21/106 (19%), Positives = 34/106 (32%), Gaps = 6/106 (5%) Query: 394 LMIGMITFQFSNFSFGIGNAAGLLFAGIML-GFLRANHPTFG-YIPQ--GALNMVKEFGL 449 L++ + +L+ G ++ G A G YI + FG Sbjct: 76 LLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF 135 Query: 450 MVFMAGVGLSAGSGISNGLGAVGGQM--LIAGLVVSLVPVVICFLF 493 M G G+ AG + +G A + L + CFL Sbjct: 136 MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 47.3 bits (112), Expect = 6e-09 Identities = 17/80 (21%), Positives = 33/80 (41%) Query: 7 RRANDPKRREKIIQATLEAVKTYGVHAVTHRKIAAIAQVPLGSMTYYFAGMDALLSEAFT 66 + + R+ I+ L GV + + +IA A V G++ ++F L SE + Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64 Query: 67 LFTENMSRQYQDFFAQVTDA 86 L N+ ++ A+ Sbjct: 65 LSESNIGELELEYQAKFPGD 84
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 33.3 bits (76), Expect = 0.002 Identities = 33/150 (22%), Positives = 65/150 (43%), Gaps = 6/150 (4%) Query: 218 LLIGVVVLAMAFAEGSANDWL-PLLMVDGHGFSP-TSGSLIYAGFTLGMTVGRFTGGWFI 275 +IGV+ + F + + P +M D H S GS+I T+ + + + GG + Sbjct: 258 FMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILV 317 Query: 276 DRYSRVTVVR-ASALM--GALGIGLIIFVDSDWVA-GVSVILWGLGASLGFPLTISAASD 331 DR + V+ + L ++ S ++ + +L GL + TI ++S Sbjct: 318 DRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSL 377 Query: 332 TGPDAPTRVSVVATTGYLAFLVGPPLLGYL 361 +A +S++ T +L+ G ++G L Sbjct: 378 KQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 44.1 bits (104), Expect = 6e-07 Identities = 66/356 (18%), Positives = 127/356 (35%), Gaps = 51/356 (14%) Query: 48 QAGLDWVPTSMTAYLAGGMFLQWLLGPLSDRIGRRPVMLAGVVWFIVTCLATLLAKNIEQ 107 A +WV T+ + G + G LSD++G + ++L G++ + + + Sbjct: 48 PASTNWVNTAFMLTFSIGTAV---YGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFS 104 Query: 108 FT-FLRFLQGISLCFIGAVGYAAIQESFEEAVCIKITALMANVALIAPLLGPLVGAAWVH 166 RF+QG A+ + + K L+ ++ + +GP +G H Sbjct: 105 LLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAH 164 Query: 167 VLPWEGMFILFAALAAIAFFGLQRAMPETATRRGE------------------------- 201 + W +L + I L + + + +G Sbjct: 165 YIHW-SYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSI 223 Query: 202 ------TLSFKALGRDYRLV---------IKNRRFVAGALALGFVSLPLLAWIAQSPIII 246 LSF + R V KN F+ G L G + + +++ P ++ Sbjct: 224 SFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMM 283 Query: 247 ISGEQLSSYEYG-LLQVPVFGALIAGNLVLARLTSRRTVRSLIVMGGWPIVAGLIIAAAA 305 QLS+ E G ++ P ++I + L RR ++ +G + + A+ Sbjct: 284 KDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTAS-- 341 Query: 306 TVVSSHAYLWMTAGLSVYAFGIGLANAGLVRLTLFSSDMSKGTVSAAMGMLQMLIF 361 + +MT + V+ G GL+ V T+ SS + + A M +L F Sbjct: 342 -FLLETTSWFMTIII-VFVLG-GLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSF 394
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 65.8 bits (160), Expect = 2e-15 Identities = 32/224 (14%), Positives = 72/224 (32%), Gaps = 25/224 (11%) Query: 6 TTTKGEQAKSQLIAAALAQFGEYGLHATT-RDIAALAGQNIAAITYYFGSKEDLYLACAQ 64 T + ++ + ++ AL F + G+ +T+ +IA AG AI ++F K DL+ + Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64 Query: 65 WIADFLGEKFRPHAEKAERLFSQPAPD-RDAIRELILLACKNMIMLLTQEDTVNLSKFIS 123 +GE E + P R+ + ++ L E + +F+ Sbjct: 65 LSESNIGELEL---EYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV- 120 Query: 124 REQLSPTSAYQLVHEQVIDPLHTHLTRLVAA---YTGCDANDTRMILHTHALLGEVLAFR 180 E A + + + D + L + A +I+ + ++ Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM--RGYISGLM--- 175 Query: 181 LGKETILLRTGWPQFDEEKAELIYQTVTCHIDLILHGLTQRSLD 224 W + + ++ ++L Sbjct: 176 ---------ENWLFAPQSFDLK--KEARDYVAILLEMYLLCPTL 208
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 61.0 bits (148), Expect = 2e-12 Identities = 48/286 (16%), Positives = 104/286 (36%), Gaps = 28/286 (9%) Query: 55 ASLNVDEGDAIKAGQVLGELDHAPYENALMQAKAGVSVAQAQYDLMLAGYRDEEIAQAAA 114 NV E + L + + ++N Q + + +A+ +LA E Sbjct: 175 YFQNVSEEEV-LRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233 Query: 115 AVRQAQAAYDYAQNFYNRQQGLWKSRTISA--NDLENARSSRDQAQATLKSAQDKLSQYR 172 R + + + L + N+L +S +Q ++ + SA+++ Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL-V 292 Query: 173 TGNREQDI----AQAKASLEQAKAQLAQAQLDLQDTTLIAPANGTLLTRAV-EPGSMLNA 227 T + +I Q ++ +LA+ + Q + + AP + + V G ++ Sbjct: 293 TQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTT 352 Query: 228 GSTVLTLSLT-RPVWVRAYVDERNLSQTQPGRDILLYTDGRPDKPYH---GKIGFVSPTA 283 T++ + + V A V +++ G++ ++ + P Y GK+ ++ A Sbjct: 353 AETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA 412 Query: 284 EFTPKTVETPDLRTDLVYRLRIIVT-------DADDALRQGMPVTV 322 D R LV+ + I + + + L GM VT Sbjct: 413 --------IEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTA 450
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.6 bits (71), Expect = 0.009 Identities = 22/89 (24%), Positives = 29/89 (32%), Gaps = 21/89 (23%) Query: 294 PRFEDAFIDLLGGAGTSESPLGSILHTVEGTAGETVIEAQELTKKFGDFAATDHVNFVVQ 353 PR E + +LG P + Q + K HV V++ Sbjct: 548 PRLEKWLVHVLGKTPDDYKP-------------RRLRYLQLVGKYI----LMGHVARVME 590 Query: 354 RGEIFG----LLGPNGAGKSTTFKMMCGL 378 G F L G G GKST + GL Sbjct: 591 PGCKFDYSVVLEGTGGIGKSTLINTLVGL 619 Score = 29.7 bits (66), Expect = 0.044 Identities = 11/23 (47%), Positives = 13/23 (56%) Query: 34 YVTGLVGPDGAGKTTLMRMLAGL 56 Y L G G GK+TL+ L GL Sbjct: 597 YSVVLEGTGGIGKSTLINTLVGL 619
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 45.7 bits (108), Expect = 1e-07 Identities = 35/139 (25%), Positives = 60/139 (43%), Gaps = 5/139 (3%) Query: 197 AREREQGTLDQLLVSPLTTWQIFVGKAVPALIVATFQATIVLAIGIWAYQIPFAGSLALF 256 R Q T + +L + L I +G+ A A IG+ A + + L+L Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGA---GIGVVAAALGYTQWLSLL 148 Query: 257 YFTMVI--YGLSLVGFGLLISSLCATQQQAFIGVFVFMMPAILLSGYVSPVENMPVWLQN 314 Y VI GL+ G+++++L + + + P + LSG V PV+ +P+ Q Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208 Query: 315 LTWINPIRHFTDITKQIYL 333 P+ H D+ + I L Sbjct: 209 AARFLPLSHSIDLIRPIML 227
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 35.9 bits (82), Expect = 1e-04 Identities = 29/64 (45%), Positives = 39/64 (60%), Gaps = 4/64 (6%) Query: 222 AEPGALIRQLAQGAPQYKEQLMT--IAEWLEEKGRTEGLQKGLQKGLEQGLAQGREAEAR 279 AEP +L +QLAQ Q EQ IAE ++G +G Q+GL +GLEQGLA+ + +A Sbjct: 36 AEP-SLEQQLAQLQMQAHEQGYQAGIAEG-RQQGHKQGYQEGLAQGLEQGLAEAKSQQAP 93 Query: 280 AIAR 283 AR Sbjct: 94 IHAR 97 Score = 30.1 bits (67), Expect = 0.009 Identities = 20/71 (28%), Positives = 32/71 (45%), Gaps = 12/71 (16%) Query: 233 QGAPQYKEQLMTIAEWLEEKGRTEGLQKGLQKGLEQGLAQGREAEARAIARKMLANGLEP 292 + P ++QL + E+G G+ +G Q+G +QG +G LA GLE Sbjct: 35 EAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEG------------LAQGLEQ 82 Query: 293 GLIASVTGITP 303 GL + + P Sbjct: 83 GLAEAKSQQAP 93
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 36.2 bits (83), Expect = 7e-04 Identities = 41/219 (18%), Positives = 81/219 (36%), Gaps = 18/219 (8%) Query: 92 RQKVAQAPEKMRQ-ATAALNALSDVDNDDEMRKTLSALSLRQLELRVA--QVLDDLQNSQ 148 R ++A+A EK R+ A AA A + + + + A + RQL+L A + L L Sbjct: 129 RLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEA 188 Query: 149 NDLAAYNSQLVSLQTQPERVQNAMYTASQQI-------QQIRNRLDGNNVGEAALRPSQQ 201 + +L + Q++ ++ + T + ++ L G A + Sbjct: 189 KAVEIAQKKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYK 248 Query: 202 VLLQAQQALLNAQID--------QQRKSLEGNTVLQDTLQKQRDYVTANSNRLEHQLQLL 253 L + + L D + + G +++ QKQ NR+ + + Sbjct: 249 ELDELVKKLSPRANDPLQNRPFFEATRRRVGAGKIREEKQKQVTASETRINRINADITQI 308 Query: 254 QEAVNSKRLTLTEKTAQEAISPDETARIQANPLVKQELD 292 Q+A++ A+ + + + Q N L Q D Sbjct: 309 QKAISQVSNNRNAGIARVHEAEENLKKAQNNLLNSQIKD 347
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 204 bits (519), Expect = 8e-69 Identities = 187/214 (87%), Positives = 199/214 (92%) Query: 1 MARKTKQQALETRQHILDVALRLFSQQGVSATSLAEIANAAGVTRGAIYWHFKNKSDLFS 60 MARKTKQ+A ETRQHILDVALRLFSQQGVS+TSL EIA AAGVTRGAIYWHFK+KSDLFS Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EIWELSESNIGELEIEYQAKFPDDPLSVLREILVHILEATVTEERRRLLMEIIFHKCEFV 120 EIWELSESNIGELE+EYQAKFP DPLSVLREIL+H+LE+TVTEERRRLLMEIIFHKCEFV Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 121 GEMVVVQQAQRSLCLESYDRIEQTLKHCINAKMLPENLLTRRAAILMRSFISGLMENWLF 180 GEM VVQQAQR+LCLESYDRIEQTLKHCI AKMLP +L+TRRAAI+MR +ISGLMENWLF Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180 Query: 181 APQSFDLKKEARAYVTILLEMYQLCPTLRASTVN 214 APQSFDLKKEAR YV ILLEMY LCPTLR N Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATN 214
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 43.3 bits (102), Expect = 1e-06 Identities = 33/216 (15%), Positives = 75/216 (34%), Gaps = 27/216 (12%) Query: 100 TYQATYDSAKGDLAKAQAAANIAELTVKRYQKLLGTQYISKQEYDQALADAQQATAAVVA 159 + Y A +L + + ++ + Q +++ ++ L +Q T + Sbjct: 256 EQENKYVEAVNELR--VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313 Query: 160 AKAAVETARINLAYTKVTSPISGRIGKSSV-TEGALVQNGQASALATVQQLDPIYVDVTQ 218 + + + +P+S ++ + V TEG +V + + V + D + V Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET-LMVIVPEDDTLEVTALV 372 Query: 219 SSNDFLRLKQELA------------NGSLKQENGKAKVDLVTSDGIKFPQSGTLEFSDVT 266 + D + G L KV + D I+ + G + ++ Sbjct: 373 QNKDIGFINVGQNAIIKVEAFPYTRYGYL-----VGKVKNINLDAIEDQRLGLVFNVIIS 427 Query: 267 VDQTTGSITLRAIFPNPDHTLLPGMFVRARLQEGTK 302 +++ S + I L GM V A ++ G + Sbjct: 428 IEENCLSTGNKNIP------LSSGMAVTAEIKTGMR 457 Score = 32.5 bits (74), Expect = 0.003 Identities = 24/133 (18%), Positives = 45/133 (33%), Gaps = 10/133 (7%) Query: 49 PLQITTELPGR-TVAYRIAEVRPQVSGIILKRNFV-EGSDIEAGVSLYQIDP-------A 99 ++I G+ T + R E++P + I+ K V EG + G L ++ Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIV-KEIIVKEGESVRKGDVLLKLTALGAEADTL 137 Query: 100 TYQATYDSAKGDLAKAQAAANIAELTVKRYQKLLGTQYISKQEYDQALADAQQATAAVVA 159 Q++ A+ + + Q + EL KL Y ++ L Sbjct: 138 KTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFST 197 Query: 160 AKAAVETARINLA 172 + +NL Sbjct: 198 WQNQKYQKELNLD 210
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1368 bits (3542), Expect = 0.0 Identities = 810/1033 (78%), Positives = 918/1033 (88%), Gaps = 1/1033 (0%) Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLAILKLPVAQYPTIAPPAVTISATYPGADAKTVQDT 60 M NFFI RPIFAWV+AII+M+AG LAIL+LPVAQYPTIAPPAV++SA YPGADA+TVQDT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDNLMYMSSNSDSTGTVQITLTFESGTDADIAQVQVQNKLQLAMPLLPQ 120 VTQVIEQNMNGIDNLMYMSS SDS G+V ITLTF+SGTD DIAQVQVQNKLQLA PLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGVSVEKSSSSFLMVVGVINTDGTMTQEDISDYVAANMKDPISRTSGVGDVQLFGS 180 EVQQQG+SVEKSSSS+LMV G ++ + TQ+DISDYVA+N+KD +SR +GVGDVQLFG+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWMNPTELTKYQLTPVDVINAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240 QYAMRIW++ L KY+LTPVDVIN +K QN Q+AAGQLGGTP + GQQLNASIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 TSTDEFGKILLKVNQDGSQVRLRDVAKIELGGENYDVIAKFNGQPASGLGIKLATGANAL 300 + +EFGK+ L+VN DGS VRL+DVA++ELGGENY+VIA+ NG+PA+GLGIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTATAIRAELKKMEPFFPPGMKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360 DTA AI+A+L +++PFFP GMK++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 TEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 E+ LPPKEAT KSM QIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATMLKPVAKGDHGEGKKGFFGWFNRLFDKSTHHYTDSVGNILRSTGR 540 SVLVALILTPALCAT+LKPV+ H E K GFFGWFN FD S +HYT+SVG IL STGR Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 541 YLLLYLIIVVGMAYLFVRLPSSFLPDEDQGVFLTMVQLPAGATQERTQKVLDEVTDYYLN 600 YLL+Y +IV GM LF+RLPSSFLP+EDQGVFLTM+QLPAGATQERTQKVLD+VTDYYL Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 601 KEKANVESVFAVNGFGFAGRGQNTGIAFVSLKDWADRPGEKNKVEAITQRATAAFSQIKD 660 EKANVESVF VNGF F+G+ QN G+AFVSLK W +R G++N EA+ RA +I+D Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659 Query: 661 AMVFAFNLPAIVELGTATGFDFELIDQAGLGHEKLTQARNQLFGEVAKYPDLLVGVRPNG 720 V FN+PAIVELGTATGFDFELIDQAGLGH+ LTQARNQL G A++P LV VRPNG Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719 Query: 721 LEDTPQFKIDIDQEKAQALGVSISDINTTLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYR 780 LEDT QFK+++DQEKAQALGVS+SDIN T+ A GG+YVNDFIDRGRVKK+YV ++AK+R Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779 Query: 781 MLPDDINDWYVRGSDGQMVPFSAFSSSRWEYGSPRLERYNGLPSMEILGQAAPGKSTGEA 840 MLP+D++ YVR ++G+MVPFSAF++S W YGSPRLERYNGLPSMEI G+AAPG S+G+A Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839 Query: 841 MAMMEELASKLPSGIGYDWTGMSYQERLSGNQAPALYAISLIVVFLCLAALYESWSIPFS 900 MA+ME LASKLP+GIGYDWTGMSYQERLSGNQAPAL AIS +VVFLCLAALYESWSIP S Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899 Query: 901 VMLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGL 960 VMLVVPLG++G LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+ Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959 Query: 961 VEATLEAVRMRLRPILMTSLAFMLGVMPLVISSGAGSGAQNAVGTGVLGGMVTATVLAIF 1020 VEATL AVRMRLRPILMTSLAF+LGV+PL IS+GAGSGAQNAVG GV+GGMV+AT+LAIF Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019 Query: 1021 FVPVFFVVVRRRF 1033 FVPVFFVV+RR F Sbjct: 1020 FVPVFFVVIRRCF 1032
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 51.4 bits (123), Expect = 3e-09 Identities = 70/356 (19%), Positives = 122/356 (34%), Gaps = 35/356 (9%) Query: 23 IFSLALGTFGLGMAEFGIMGVLTELARDVGITIPAAGH---MISFYAFGVVLGAPVMALF 79 + ++AL G+G+ IM VL L RD+ + H +++ YA APV+ Sbjct: 11 LSTVALDAVGIGL----IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 80 SSRFSLKHILLFLVMLCVMGNAIFTFSSSYLMLAVGRLVSGFPHGAFFGVGAIVLSKIIR 139 S RF + +LL + + AI + +L +GR+V+G GA + I Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIAD--IT 124 Query: 140 PGKVTAAVAGMVSGMTVANLVGIPVGTYLSPEFSWRYTFLLIAVFNIAVLTAIFFWVPDI 199 G A G +S +V PV L FS F A N F +P+ Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184 Query: 200 RDKAQGSLREQ----------FHFLRSPAPWLI--FAATMFGNAGVFAWFSYIKPFMMYI 247 + LR + + A + F + G W + Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG------E 238 Query: 248 SGFSETSMTFIMILVGLGM---VLGNLLSGKLSGRYTPLRIAVVTDLVIVLSLMALFFFS 304 F + T + L G+ + +++G ++ R R ++ ++ + Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLG---MIADGTGYILLA 295 Query: 305 GYKTASLTFAFICCAGLFALSAPLQILLLQNAKGGELLGAAGGQIAF--NLGSAIG 358 + F + + P +L E G G +A +L S +G Sbjct: 296 FATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVG 351
>PF06057#Type IV secretory pathway VirJ component Length = 243 Score = 29.8 bits (67), Expect = 0.014 Identities = 8/55 (14%), Positives = 17/55 (30%) Query: 277 FAIMKLPLADINAQNAMMHAGKSSEADVQGHVDGWINAHQQQFDGWVKEALAAQK 331 F + ++P + S +D + HV + + Q + Q Sbjct: 133 FVLNEMPARYRKNVLGAVLLSPSQSSDFEIHVSEMVTSDNQSARYLTLPEVNKQT 187
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 46.4 bits (110), Expect = 1e-07 Identities = 31/165 (18%), Positives = 66/165 (40%), Gaps = 2/165 (1%) Query: 34 LDTIAHHFSLSASSAGFIVTAAQLGYAAGLLFLVPLGDMFE-RRTLIVSMTLLAAGGMLI 92 L IA+ F+ +S ++ TA L ++ G L D +R L+ + + G ++ Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 93 TASSQSLSMMILGTALTGLFSVVAQILVPLA-ATLATPATRGKVVGTIMSGLLLGILLAR 151 S++I+ + G + LV + A RGK G I S + +G + Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 152 TVAGLLANLGGWRTVFWVASALMALMAVALWRGLPKLKSDTHLNY 196 + G++A+ W + + + + + +++ H + Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 74.5 bits (183), Expect = 1e-16 Identities = 62/418 (14%), Positives = 125/418 (29%), Gaps = 97/418 (23%) Query: 19 KRKTALLLLTLLFVIIAVAYGIYWFLVLRHIEETDDA----YVAGNQVQIMAQVSGSVTK 74 + L F++ + VL +E A +G +I + V + Sbjct: 51 TPVSRRPRLVAYFIMGFLVIAFILS-VLGQVEIVATANGKLTHSGRSKEIKPIENSIVKE 109 Query: 75 VWADNTDFVKEGDVLVTLDQT--------------------------------------- 95 + + V++GDVL+ L Sbjct: 110 IIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELK 169 Query: 96 -------------DAKQAFEKAKTALASSVRQTHQLMINSKQ-------LQANIDVQKTA 135 + + K ++ Q +Q +N + + A I+ + Sbjct: 170 LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENL 229 Query: 136 LAQAQSDLNRRVPLGNANLIGREELQHARDAVASAQAQLDVAIQQYNANQAMILNSNLED 195 +S L+ L + I + + + A +L V Q ++ IL++ E Sbjct: 230 SRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEY 289 Query: 196 QPAVQQAATEVRN------------------AWLALERTRIVSPMTGYVSRRAVQ-PGAQ 236 Q Q E+ + + + I +P++ V + V G Sbjct: 290 QLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGV 349 Query: 237 ISPTTPLMAVVPATD-LWVDANFKETQLANMRIGQPVTIITDIYGDDVKY---TGKVVGL 292 ++ LM +VP D L V A + + + +GQ I + + +Y GKV + Sbjct: 350 VTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF-PYTRYGYLVGKVKNI 408 Query: 293 DMGTGSAFSLLPAQNATGNWIKVVQRLPVRVELDARQLEQHPLRIGLSTLVTVDTANR 350 + ++ G V+ + + PL G++ + T R Sbjct: 409 -----NLDAIE--DQRLGLVFNVIISIEENCLSTGNK--NIPLSSGMAVTAEIKTGMR 457
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 129 bits (326), Expect = 8e-35 Identities = 94/405 (23%), Positives = 164/405 (40%), Gaps = 23/405 (5%) Query: 17 IALSLATFMQVLDSTIANVAIPTIAGNLGSSLSQGTWVITSFGVANAISIPLTGWLAKRF 76 I L + +F VL+ + NV++P IA + + WV T+F + +I + G L+ + Sbjct: 17 IWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQL 76 Query: 77 GEVKLFMWSTVAFAAASWACGVS-SSLNMLIFFRVVQGVVAGPLIPLSQSLLLNNYPPAK 135 G +L ++ + S V S ++LI R +QG A L ++ P Sbjct: 77 GIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKEN 136 Query: 136 RSIALALWSMTVIVAPICGPILGGYISDNYHWGWIFFINVPIGIAVVLMTLHTLRGRETH 195 R A L V + GP +GG I+ HW ++ I + I I V + L+ Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLKKEVRI 195 Query: 196 TERRRIDAVGLALLVIGIGSLQIMLDRGKELDWFSSQEIIILTVVAVIAISFLIVWELTD 255 D G+ L+ +GI + ML F++ I +V+V++ + Sbjct: 196 KG--HFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKV 243 Query: 256 DHPIVDLSLFKSRNFTIGCLCISLAYMLYFGAIVLLPQLLQEVYGYTATWAGLASAPVGI 315 P VD L K+ F IG LC + + G + ++P ++++V+ + G G Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303 Query: 316 IPVILS-PIIGRFAHKLDMRRLVTFSFIMYAVCFYWRAWTFEPGMDFGASAWPQFIQGF- 373 + VI+ I G + ++ +V F ++ S + I F Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL-----LETTSWFMTIIIVFV 358 Query: 374 --AVACFFMPLTTITLSGLPPERLAAASSLSNFTRTLAGSIGTSI 416 ++ ++TI S L + A SL NFT L+ G +I Sbjct: 359 LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403
>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature. Length = 167 Score = 35.5 bits (81), Expect = 4e-05 Identities = 37/144 (25%), Positives = 62/144 (43%), Gaps = 26/144 (18%) Query: 57 PPCTIGGAS---VEFGDVLTTKVGDASQTKPVGYSLNCDGRASDYLKLQIQGTTTTISGE 113 PPCTI V+FG++ V ++ S++C + S L +++ G T + Sbjct: 32 PPCTINNGQNIVVDFGNINPEHVDNSRGEVTKNISISCPYK-SGSLWIKVTGNTMGVGQN 90 Query: 114 QVLQTSVQGLGIRIQQ-------------AGNKQLVPVGI-TDWLNFTLSGSNGPELEAV 159 VL T++ GI + Q +GN V G+ T FT + +V Sbjct: 91 NVLATNITHFGIALYQGKGMSTPLTLGNGSGNGYRVTAGLDTARSTFTFT--------SV 142 Query: 160 PVKEPTTQLAGGDFNASATLVVDY 183 P + + L GGDF +A++ + Y Sbjct: 143 PFRNGSGILNGGDFRTTASMSMIY 166
>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature. Length = 167 Score = 40.5 bits (94), Expect = 6e-07 Identities = 44/165 (26%), Positives = 74/165 (44%), Gaps = 18/165 (10%) Query: 5 LILTLLITQFAC-AD-NLTFHGKLINPPACTINNGETLEVSFGSVIIDNIDGVNYLTEIP 62 L ++LL+T A AD + G + PP CTINNG+ + V FG++ +++D N E+ Sbjct: 6 LFISLLLTSVAVLADVQINIRGNVYIPP-CTINNGQNIVVDFGNINPEHVD--NSRGEVT 62 Query: 63 WPLTCDSSFRDDALTFTLSYLGTATPYSANALTTNVPELGIELQQNGTVFPPGT------ 116 ++ ++ +L ++ T N L TN+ GI L Q + P T Sbjct: 63 KNISISCPYKSGSLWIKVTG-NTMGVGQNNVLATNITHFGIALYQGKGMSTPLTLGNGSG 121 Query: 117 -----SLTIDES-SLPTLKAVPVKQPGKEPAEGDFEAFATLQVDY 155 + +D + S T +VP + GDF A++ + Y Sbjct: 122 NGYRVTAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMIY 166
>INTIMIN#Intimin signature. Length = 939 Score = 29.7 bits (66), Expect = 0.007 Identities = 28/120 (23%), Positives = 48/120 (40%), Gaps = 9/120 (7%) Query: 23 TLPAATPNVHYSGKLVAGACNLVVDNDTMATVDSHTIGSDNFDASGQTTPVPFKLSLQDC 82 L + + +SG A ++ + + + + +D +G ++ L++ Sbjct: 491 ALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNVYKVTARAYDRNGNSSN-NVLLTI--- 546 Query: 83 KTALANGVLVTFQGVEDSTLPGLLALEPSSEASGFAIGVE----TAAQQPVSINATVGTA 138 T L+NG +V GV D T A +EA + V+ A PVS N GTA Sbjct: 547 -TVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTA 605
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 62.9 bits (153), Expect = 3e-12 Identities = 25/116 (21%), Positives = 48/116 (41%), Gaps = 4/116 (3%) Query: 669 TVMAVDDNPANLKLIGALLEDKVQHVELCDSGHQAVDRAKQMQFDLILMDIQMPDMDGIR 728 T++ DD+ A ++ L V + + DL++ D+ MPD + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF- 63 Query: 729 ACELIHQL-PHQQQTPVIAVTAHAMAGQKEKLLSAGMNDYLAKPIEEEKLHNLLLR 783 +L+ ++ + PV+ ++A K G DYL KP + +L ++ R Sbjct: 64 --DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 141 bits (358), Expect = 2e-44 Identities = 59/143 (41%), Positives = 86/143 (60%), Gaps = 2/143 (1%) Query: 4 ALPFLIFYASFSLLLGIYDARTGLLPDRFTCPLLWGGLLYHQICLPERLPDALWGAIAGY 63 L L+ + L D LLPD+ T PLLWGGLL++ + L DA+ GA+AGY Sbjct: 134 TLAALLLT-WVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGY 192 Query: 64 GGFALIYWGYRLRYQKEGLGYGDVKYLAALGAWHCWETLPLLVFLAAMLACGGFGVALLV 123 +YW ++L KEG+GYGD K LAALGAW W+ LP+++ L++++ G+ L++ Sbjct: 193 LVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLV-GAFMGIGLIL 251 Query: 124 RGKSALINPLPFGPWLAVAGFIT 146 P+PFGP+LA+AG+I Sbjct: 252 LRNHHQSKPIPFGPYLAIAGWIA 274
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 36.8 bits (85), Expect = 1e-05 Identities = 18/103 (17%), Positives = 43/103 (41%), Gaps = 10/103 (9%) Query: 44 EYHESIDEMKHADKYIERILFLEGIPN--LQDLGKL------GIGEDVEEMLRSDLRLEL 95 E ++ E D ER+L + G P +++ + G EM+++ + Sbjct: 52 ELYDHAAE--TVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYK 109 Query: 96 EGAKDLREAIAYADSVHDYVSRDMMIEILADEEGHIDWLETEL 138 + + + + I A+ D + D+ + ++ + E + L + L Sbjct: 110 QISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWMLSSYL 152
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 79.5 bits (196), Expect = 3e-18 Identities = 57/198 (28%), Positives = 87/198 (43%), Gaps = 13/198 (6%) Query: 13 VNVGTIGHVDHGKTTLTAAI------TTVLAKTYGGAARAFDQIDNAPEEKARGITINTS 66 +N+G + HVD GKTTLT ++ T L G R DN E+ RGITI T Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRT----DNTLLERQRGITIQTG 59 Query: 67 HVEYDTPTRHYAHVDCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQV 126 + +D PGH D++ + + +DGAIL+++A DG QTR R++ Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119 Query: 127 GVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQYDFPGDDTPIVRGSALKALEGDAEWE 186 G+P I F+NK D + L V +++E LS + + +W+ Sbjct: 120 GIP-TIFFINKIDQNGID--LSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176 Query: 187 AKIIELAGFLDSYIPEPE 204 I L+ Y+ Sbjct: 177 TVIEGNDDLLEKYMSGKS 194
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 616 bits (1591), Expect = 0.0 Identities = 178/698 (25%), Positives = 305/698 (43%), Gaps = 81/698 (11%) Query: 9 RYRNIGISAHIDAGKTTTTERILFYTGVNHKIGEVHDGAATMDWMEQEQERGITITSAAT 68 + NIG+ AH+DAGKTT TE +L+ +G ++G V G D E++RGITI + T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 69 TAFWSGMAKQYEPHRINIIDTPGHVDFTIEVERSMRVLDGAVMVYCAVGGVQPQSETVWR 128 + W ++NIIDTPGH+DF EV RS+ VLDGA+++ A GVQ Q+ ++ Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114 Query: 129 QANKYKVPRIAFVNKMDRMGANFLKVVGQIKTRLGANPVPLQLAIGAEEGFTGVVDLVKM 188 K +P I F+NK+D+ G + V IK +L A V Q V M Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ----------KVELYPNM 164 Query: 189 KAINWNDADQGVTFEYEDIPADMQDLANEWHQNLIESAAEASEELMEKYLGGEELTEEEI 248 N+ +++Q ++ E +++L+EKY+ G+ L E+ Sbjct: 165 CVTNFTESEQ------------------------WDTVIEGNDDLLEKYMSGKSLEALEL 200 Query: 249 KQALRQRVLNNEIILVTCGSAFKNKGVQAMLDAVIDYLPSPVDVPAINGILDDGKDTPAE 308 +Q R N + V GSA N G+ +++ + + S Sbjct: 201 EQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH----------------- 243 Query: 309 RHASDDEPFSALAFKIATDPFVGNLTFFRVYSGVVNSGDTVLNSVKTARERFGRIVQMHA 368 FKI L + R+YSGV++ D+V S K + + + Sbjct: 244 ---RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEK-EKIKITEMYTSIN 299 Query: 369 NKREEIKEVRAGDIAAAIG----LKDVTTGDTLCDPENPIILERMEFPEPVISIAVEPKT 424 + +I + +G+I L V GDT P+ ER+E P P++ VEP Sbjct: 300 GELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQR----ERIENPLPLLQTTVEPSK 354 Query: 425 KADQEKMGLALGRLAKEDPSFRVWTDEESNQTIIAGMGELHLDIIVDRMKREFNVEANVG 484 +E + AL ++ DP R + D +++ I++ +G++ +++ ++ +++VE + Sbjct: 355 PQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIK 414 Query: 485 KPQVAYREAIRAKVTDIEGKHAKQSGGRGQYGHVVIDMYPLEPGSNPKGYEFINDIKGGV 544 +P V Y E K E + + + + + PL GS G ++ + + G Sbjct: 415 EPTVIYMERPLKKA---EYTIHIEVPPNPFWASIGLSVSPLPLGS---GMQYESSVSLGY 468 Query: 545 IPGEYIPAVDKGIQEQLKSGPLAGYPVVDLGVRLHFGSYHDVDSSELAFKLAASIAFKEG 604 + + AV +GI+ + G L G+ V D + +G Y+ S+ F++ A I ++ Sbjct: 469 LNQSFQNAVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQV 527 Query: 605 FKKAKPVLLEPIMKVEVETPEENTGDVIGDLSRRRGMLKGQESEVTGVKIHAEVPLSEMF 664 KKA LLEP + ++ P+E D + + + + V + E+P + Sbjct: 528 LKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQ 587 Query: 665 GYATQLRSLTKGRASYTMEFLKYDDAPNNVAQAVIEAR 702 Y + L T GR+ E Y + V + R Sbjct: 588 EYRSDLTFFTNGRSVCLTELKGYHVT---TGEPVCQPR 622
>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature. Length = 293 Score = 32.4 bits (73), Expect = 0.004 Identities = 27/82 (32%), Positives = 37/82 (45%), Gaps = 17/82 (20%) Query: 275 RTPISGDYRGYQVFSMPPPSSGGIHIVQILNI--LENFDMKKYGF-GSADAMQIMAEAEK 331 R P+ G+ R + SMPPP G H +I N+ + FD G+ G A I++E EK Sbjct: 77 RRPM-GETRNVSLISMPPPWRPGEHADRITNLKFFKQFD----GYVGGQTAWGILSELEK 131 Query: 332 YAYADRSEYLGDPDFVKVPWQA 353 Y P F WQ+ Sbjct: 132 GRY---------PTFSYQDWQS 144
>PF04619#Dr-family adhesin Length = 160 Score = 27.6 bits (61), Expect = 0.029 Identities = 11/60 (18%), Positives = 21/60 (35%), Gaps = 4/60 (6%) Query: 29 VGARYGHTMIEFDAKLSKDGEIFLLHDDNLERTSNGWGVAGELNWQD----LLRVDAGGW 84 +G ++ D + G+ FL+ D+N ++ W + D G W Sbjct: 70 LGCDARQVALKADTDNFEQGKFFLISDNNRDKLYVNIRPTDNSAWTTDNGVFYKNDVGSW 129
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.040 Identities = 10/29 (34%), Positives = 16/29 (55%) Query: 33 IVMVGPSGCGKSTLLRMVAGLERVTSGDI 61 +V+ G G GKSTL+ + GL+ + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHF 627
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 77.9 bits (192), Expect = 6e-18 Identities = 70/413 (16%), Positives = 136/413 (32%), Gaps = 82/413 (19%) Query: 3 KMKRHLVWWGAGILVAVAAIAWWMLRPAGIPEGFAASNGRI--EATEVDIATKIAGRIDT 60 + LV + + V A +L E A +NG++ +I + Sbjct: 54 SRRPRLVAY-FIMGFLVIAFILSVLGQV---EIVATANGKLTHSGRSKEIKPIENSIVKE 109 Query: 61 ILVSEGQFVRQGEVLAKMDTRV----------------LQEQRLEAI------------- 91 I+V EG+ VR+G+VL K+ L++ R + + Sbjct: 110 IIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELK 169 Query: 92 -----------------------AQIKEAESAVAAARALLEQRQSEMRAAQSVVKQREAE 128 Q ++ L+++++E + + + E Sbjct: 170 LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENL 229 Query: 129 LDSVSKRHVRSRSLSQRGAVSVQQLDDDRAAAESARAALETAKAQVSAAKAAIEAARTSI 188 R SL + A++ + + A L K+Q+ ++ I +A+ Sbjct: 230 SRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEY 289 Query: 189 IQ-------------AQTRVEAAQATERRIVADID--DSELKAPRDGRV-QYRVAEPGEV 232 QT T + S ++AP +V Q +V G V Sbjct: 290 QLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGV 349 Query: 233 LSAGGRVLNMVDLSDVY-MTFFLPTEQAGLLKIGGEARLVLDAAPDLRIPATISFVASVA 291 ++ ++ +V D +T + + G + +G A + ++A P R V V Sbjct: 350 VTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYG---YLVGKVK 406 Query: 292 QFTPKTVETHDERLKLMFRVKARIPPELLRQHLEYV--KTGLPGMAWVRLDER 342 +E D+RL L+F V I L + + +G+ A ++ R Sbjct: 407 NINLDAIE--DQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMR 457
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 30.1 bits (68), Expect = 0.022 Identities = 23/194 (11%), Positives = 57/194 (29%), Gaps = 40/194 (20%) Query: 12 TGLLLLLALAFVLFYEAINGFHDTANAVATVIY------TRAMRSQLAVVMAAVFNFFGV 65 L++AL+ +L + F + + ++A+ + V+ F Sbjct: 30 VSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFP 89 Query: 66 LLGGLSVAYAIVHML-------------------PTDLLLNMGSAHGLAMVFSMLLAAII 106 LL ++ H++ P + + S L +L ++ Sbjct: 90 LLTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVL 149 Query: 107 WNLGTWYFGLPASSSHTLIGAIIGIGLTNAMMTGTSVVDALNIPKVINIFGSLIISPIVG 166 ++ W + ++ + T + T ++ + L++ VG Sbjct: 150 LSILIWIIIKG------NLVTLLQLP-TCGIECITPLLGQI--------LRQLMVICTVG 194 Query: 167 LVFAGGLIFLLRRY 180 V + Y Sbjct: 195 FVVISIADYAFEYY 208
>PF06057#Type IV secretory pathway VirJ component Length = 243 Score = 29.4 bits (66), Expect = 0.016 Identities = 11/62 (17%), Positives = 23/62 (37%), Gaps = 4/62 (6%) Query: 108 DELARRGYHILLCVAGYTEQTEAELVATLLSRRPDGVVLTGIHH----TIELKKVILNAA 163 E+ ++ +LC+ G + L + + L+G H ++ K+I Sbjct: 181 PEVNKQTTVPMLCLYGKEDDAPLHLCPEVKQPNVTVMELSGGHSFDDDYDKVVKLIKGWL 240 Query: 164 IP 165 P Sbjct: 241 KP 242
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 40.2 bits (94), Expect = 1e-05 Identities = 68/410 (16%), Positives = 129/410 (31%), Gaps = 65/410 (15%) Query: 35 VAPIMSKELGFDPEA---MGLAFSSFGIAYVIMQLPGGWLLDRYGSRLVYGCALIGWSLV 91 V P + ++L + G+ + + + G L DR+G R V +L G ++ Sbjct: 27 VLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAV- 85 Query: 92 TMFQGTIYLYGSPLIVLVILRLLMGAIEAPAFPANSRLS--------VQWFPNNERGFVT 143 I L VL I R++ G A A + ++ + F GF++ Sbjct: 86 ---DYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHF-----GFMS 137 Query: 144 SVYQAAQYISLGIITPLMTIILHNLSWHFVFYYIGAIGV---MLGIFWLMKVKDPMHHPK 200 + + + P++ ++ S H F+ A+ + G F L + P Sbjct: 138 ACFGFGM-----VAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPL 192 Query: 201 VNQAEIDYIRSGGGEPSLGCKKEPQKITFAQIKTVCVNRMMIGVYIGQFCVTSITWFFLT 260 + A + ++ + F + + Sbjct: 193 ---------------------RREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAA 231 Query: 261 WFPTYLYQAKGMSILKVGFVASIPAIAGFIGGLLGGVFSDWLLKRGYSLTVARKLPVICG 320 + + +G A G + L + + + R ++ G Sbjct: 232 LWVIFGEDRFHWDATTIGISL---AAFGILHSLAQAMITGPVAARLGERRA-----LMLG 283 Query: 321 MLLSCV--IVIANYTSSEFVVIAAMSLAFFAKGFGNLGWCVLSDTSPKEVLGIAGGVFNM 378 M+ I++A T + LA G L +LS +E G G Sbjct: 284 MIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQ-AMLSRQVDEERQGQLQGSLAA 342 Query: 379 CGNMASIVTPLVIGVILANTQSFDFAILYVGSMGLIGLISYLFIVGPLDR 428 ++ SIV PL+ I A + + + G + G YL + L R Sbjct: 343 LTSLTSIVGPLLFTAIYAASITT-----WNGWAWIAGAALYLLCLPALRR 387
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 29.0 bits (65), Expect = 0.028 Identities = 21/87 (24%), Positives = 30/87 (34%), Gaps = 13/87 (14%) Query: 8 MTVI---GAGSYGTALAITLARNGHQVVLWGHD---PKHIATLEHDRCNVAFLPDVPFPD 61 M + AG G ++ L GHQVV G D + +L+ R + P F Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVV--GIDNLNDYYDVSLKQARLELLAQPGFQF-- 56 Query: 62 TLHLESDLATALAASRNILVVVPSHVF 88 + DLA + VF Sbjct: 57 ---HKIDLADREGMTDLFASGHFERVF 80
>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein signature. Length = 170 Score = 234 bits (598), Expect = 2e-82 Identities = 91/153 (59%), Positives = 118/153 (77%), Gaps = 4/153 (2%) Query: 3 EQNNTEMAFQIQRIYTKDVSFEAPNAPHVFQKDWQPEVKLDLDTASSQLADDVYEVVLRV 62 Q + QIQRIY KDVSFEAPN PH+FQ+DW+P++ DL T + Q+ DD+YEV L + Sbjct: 12 TQATQQPVLQIQRIYVKDVSFEAPNLPHIFQQDWEPKLSFDLSTEAKQVGDDLYEVCLNI 71 Query: 63 TVTASLGEE--TAFLCEVQQAGIFSISGIEGTQMAHCLGAYCPNILFPYARECITSLVSR 120 +V ++ AF+CEV+QAG+F+ISG+E QMAHCL + CPN+LFPYARE ++SLV+R Sbjct: 72 SVETTMESSGDVAFICEVKQAGVFTISGLEEMQMAHCLTSQCPNMLFPYARELVSSLVNR 131 Query: 121 GTFPQLNLAPVNFDALFMNYL--QQQAGEGTEE 151 GTFP LNL+PVNFDALFM+YL Q+QA + TEE Sbjct: 132 GTFPALNLSPVNFDALFMDYLQRQEQAEQTTEE 164
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 47.1 bits (112), Expect = 7e-08 Identities = 25/196 (12%), Positives = 62/196 (31%), Gaps = 21/196 (10%) Query: 45 RDQLKSIQADIAAKERDVRQQQQQRASLLAQLKAQEEAISAAARKLRETQSTLDQLNAQI 104 ++ + + Q +L +E S + + + LD+ A+ Sbjct: 157 SRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAER 216 Query: 105 DEMNASIAKLEQQKASQERNLAAQLDAAFRQGEHTGIQLILSGEESQRGQRLQAYFGYLN 164 + A I + E ++ L + + ++ Q + ++A Sbjct: 217 LTVLARINRYENLSRVEKSRLDD-FSSLLHKQAIAKHAVL-----EQENKYVEA-----V 265 Query: 165 QARQETIAELKQTREQVATQKAELEEKQSQQQTLLYEQRAQ-QAKLEQARNERKKTLAGL 223 + ++L+Q ++ + K E + + + ++ Q + E K Sbjct: 266 NELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEE-- 323 Query: 224 ESSIQQGQQQLSELRA 239 +QQ S +RA Sbjct: 324 -------RQQASVIRA 332
>PF06580#Sensor histidine kinase Length = 349 Score = 39.1 bits (91), Expect = 3e-05 Identities = 30/142 (21%), Positives = 55/142 (38%), Gaps = 11/142 (7%) Query: 365 LRPRQLDDLTLAQAIRSLLREMELESRGIVSHLDWRIDETALSESQRVTLFRVCQEGLNN 424 LR ++LA + + ++L S L + +V + Q + N Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPM-LVQTLVEN 266 Query: 425 IVKHA-----NASAVTLQGWQQDERLMLVIEDDGSGLPPGSHQ-QGFGLTGMRERVSALG 478 +KH + L+G + + + L +E+ GS + + G GL +RER+ L Sbjct: 267 GIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLY 326 Query: 479 G---TLTISCTHG-TRVSVSLP 496 G + +S G V +P Sbjct: 327 GTEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 61.0 bits (148), Expect = 2e-13 Identities = 23/116 (19%), Positives = 45/116 (38%), Gaps = 5/116 (4%) Query: 2 ITVALIDDHLIVRSGFAQLLGLEPDLQVVAEFGSGREALAGLPGRGVQVCICDISMPDIS 61 T+ + DD +R+ Q L V + + + + D+ MPD + Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 62 GLELLSQLPK---GMATIMLSVHDSPALVEQALNAGARGFLSKRCSPDELIAAVHT 114 +LL ++ K + +++S ++ +A GA +L K ELI + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>PF06872#EspG protein Length = 398 Score = 28.5 bits (63), Expect = 0.021 Identities = 14/54 (25%), Positives = 27/54 (50%) Query: 111 LLLEAGMEVNDDFKEPADHLAIYLELLSHLHFSLGESFQQRRMNKLRQKTLSSL 164 L+L+A +++N D+K+P + + +LL L L + + Q L+ L Sbjct: 29 LVLDATIKINSDYKKPWNEMTCAEKLLKILTLGLWNPKYSQDERQQFQGLLTVL 82
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 76.4 bits (188), Expect = 3e-18 Identities = 28/115 (24%), Positives = 56/115 (48%), Gaps = 1/115 (0%) Query: 16 HIVIVEDEPVTQARLQAYFEQEGYRVSVTDSGAGLRDIMEHEHVSLILLDINLPDENGLM 75 I++ +D+ + L + GY V +T + A L + L++ D+ +PDEN Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 76 LTRALRER-STVGIILVTGRCDQIDRIVGLEMGADDYVTKPLELRELVVRVKNLL 129 L +++ + +++++ + + I E GA DY+ KP +L EL+ + L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 56.0 bits (135), Expect = 4e-10 Identities = 28/162 (17%), Positives = 64/162 (39%), Gaps = 5/162 (3%) Query: 665 RLLLIEDNMLTQRITAEMLTGKGVKVSVAESANDALRCLAEGESFDVALVDFDLPDYDGL 724 +L+ +D+ + + + L+ G V + +A R +A G+ D+ + D +PD + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDENAF 63 Query: 725 TLAQQLMSQYPAMKRIGFSAH-VIDDNLRQRTAGLFCGIIQKPVPREELYRMIAHYLQGK 783 L ++ P + + SA ++ G + + KP EL +I L Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAY-DYLPKPFDLTELIGIIGRALAEP 122 Query: 784 SHNARAMLNEHQLAGDMASVGP--EKLRQWIALFKDSALPLV 823 + ++ Q + +++ + +A + L L+ Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLM 164
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 47.1 bits (112), Expect = 8e-08 Identities = 65/384 (16%), Positives = 118/384 (30%), Gaps = 36/384 (9%) Query: 53 AEMGYVFSAFAWLYTLCQIPGGWFLDRIGSRLTYFIAIFGWSVATLLQGFATGLLSLIGL 112 A G + + +A + C G DR G R +++ G +V + A L L Sbjct: 43 AHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIG 102 Query: 113 RAITGIFEAPAFPANNRMVTSWFPEHERASAVGFYTSGQFVGLAFLTPLLIWIQEMLSWH 172 R + GI A + ERA GF ++ G+ P+L + S H Sbjct: 103 RIVAGITGAT-GAVAGAYIADITDGDERARHFGFMSACFGFGMV-AGPVLGGLMGGFSPH 160 Query: 173 WVFIVTGGIGIIWSLVWFKVYQPPRLTKSLSQAELEYIRDGGGLVDGDAPAKKEARQPLT 232 F + + L + + P ++EA PL Sbjct: 161 APFFAAAALNGLNFLTGCFLLPESHKGE-------------------RRPLRREALNPLA 201 Query: 233 KADWKLVFHRKLVGVYLGQFAVNSTLWFFLTWFPNYLTQEKGITALKAGFMTTV-PFLAA 291 W + + F + + + A G L + Sbjct: 202 SFRWARGM-TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHS 260 Query: 292 FFGVLLSGWLADKLVKKGFSLGVARKTPIICGLLISTC--IMGANYTNDPLWIMALMAIA 349 +++G +A +L + ++ G++ I+ A T + ++ +A Sbjct: 261 LAQAMITGPVAARL---------GERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLA 311 Query: 350 FFGNGFASITWSLISSLAPMRLIGLTGGMFNFIGGLGGISVPLVIGYL-AQSYGFAPALV 408 G G ++ +++S G G + L I PL+ + A S Sbjct: 312 SGGIGMPALQ-AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWA 370 Query: 409 YISVVALLGALSYILLVGDVKRVG 432 +I+ AL L G G Sbjct: 371 WIAGAALYLLCLPALRRGLWSGAG 394
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 33.4 bits (76), Expect = 2e-04 Identities = 20/86 (23%), Positives = 33/86 (38%), Gaps = 9/86 (10%) Query: 87 LALRNGEVVGMISLHMQFHLHHANWIG--EIQELVVLPPMRGQKIGSQLLAWAEEEARQA 144 L +G I + +NW G I+++ V R + +G+ LL A E A++ Sbjct: 69 LYYLENNCIGRIKIR-------SNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKEN 121 Query: 145 GAELTELSTNIKRRDAHRFYLREGYK 170 L T A FY + + Sbjct: 122 HFCGLMLETQDINISACHFYAKHHFI 147
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 42.5 bits (100), Expect = 2e-06 Identities = 53/284 (18%), Positives = 104/284 (36%), Gaps = 40/284 (14%) Query: 85 FFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYATIGIWAPILLLLCKMAQGFSVGGE 144 G L D++GR+ +L +++ ++ + P +W +L + ++ G + G Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF-----LW---VLYIGRIVAGIT-GAT 112 Query: 145 YTGASIFVAEYSPDRKR----GFMGSWLDFGSIAGFVLGAGVVVLISTIVGEENFLEWGW 200 A ++A+ + +R GFM + FG +AG VLG G++ S Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG-GLMGGFSP------------ 159 Query: 201 RIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDREGLQDGPKVSFKEIATKHWRS 260 PFF A L + L K E+ P SF+ + Sbjct: 160 HAPFFAAAALNGLNFLTGCFLLPESH------KGERRPLRREALNPLASFRWARGMTVVA 213 Query: 261 LLSCIGLVIATNVTYYMLLTYMPSYLSHNLHYS-EDHGVLIIIAIMIGMLFVQPVMGLLS 319 L + ++ + + + H+ G+ + ++ L + G ++ Sbjct: 214 ALMAVFFIM--QLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVA 271 Query: 320 DRFGRRPFVIMGSIA-LFALAIPAFILINSNVIGLIFAGLLMLA 362 R G R +++G IA + AF + F +++LA Sbjct: 272 ARLGERRALMLGMIADGTGYILLAFA----TRGWMAFPIMVLLA 311 Score = 38.7 bits (90), Expect = 5e-05 Identities = 37/164 (22%), Positives = 73/164 (44%), Gaps = 16/164 (9%) Query: 286 LSHNLHYSEDHGVLI-IIAIMIGMLFVQPVMGLLSDRFGRRPFVIMGSIALFALAIPAFI 344 L H+ + +G+L+ + A+M PV+G LSDRFGRRP ++ ++L A+ I Sbjct: 35 LVHSNDVTAHYGILLALYALM--QFACAPVLGALSDRFGRRPVLL---VSLAGAAVDYAI 89 Query: 345 LINSNVIGLIFAGLLMLAVILNCFTGVMASTLPAMFPTHIR---YSALAAAFNISVLIAG 401 + + + +++ G ++A I V + + + R + ++A F ++AG Sbjct: 90 MATAPFLWVLYIG-RIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFG-MVAG 147 Query: 402 LTPTLAAWLVESSQDLMMPAYYLMVIAVIGLITGI-SMKETANR 444 P L + S P + + + +TG + E+ Sbjct: 148 --PVLGGLMGGFS--PHAPFFAAAALNGLNFLTGCFLLPESHKG 187
>PF06580#Sensor histidine kinase Length = 349 Score = 37.5 bits (87), Expect = 5e-05 Identities = 39/182 (21%), Positives = 78/182 (42%), Gaps = 34/182 (18%) Query: 184 ARLDQMMDSVSQLLQLARVGQSFSSGNYQEVKLLEDV-ILPSYDELNTM-LETR-QQTLL 240 + +M+ S+S+L++ S N ++V L +++ ++ SY +L ++ E R Q Sbjct: 191 TKAREMLTSLSELMR-----YSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245 Query: 241 LPESAADVVVRGDATLLRMLLRNLVENAHRY----SPEGTHITIHISADPDAI-MAVEDE 295 + + DV V ML++ LVEN ++ P+G I + + D + + VE+ Sbjct: 246 INPAIMDVQV------PPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299 Query: 296 GPGIDESKCGKLSEAFVRMDSRYGGIGLGLSIV-SRITQLHQGQFFLQNRTERTGTRAWV 354 G + + G GL V R+ L+ + ++ ++ A V Sbjct: 300 GSLA--------------LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345 Query: 355 LL 356 L+ Sbjct: 346 LI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 91.0 bits (226), Expect = 2e-23 Identities = 46/144 (31%), Positives = 69/144 (47%), Gaps = 1/144 (0%) Query: 2 KILIVEDDTLLLQGLILAAQTEGYACDGVSTARAAEHSLESGHYSLMVLDLGLPDEDGLH 61 IL+ +DD + L A GY S A + +G L+V D+ +PDE+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 FLTRIRQKKYTLPVLILTARDTLNDRITGLDVGADDYLVKPFALEELHARI-RALLRRHN 120 L RI++ + LPVL+++A++T I + GA DYL KPF L EL I RAL Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 121 NQGESELTVGNLTLNIGRHQAWRD 144 + E + +GR A ++ Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQE 148
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 32.1 bits (73), Expect = 0.005 Identities = 39/163 (23%), Positives = 60/163 (36%), Gaps = 13/163 (7%) Query: 80 CVFILVGAAAQYFILTYGIIIDRSMIANMMDTTPAETFALM-TPQMVLTLG---LSGVLA 135 CV +V A +L+ + +M P T LM V T G L +LA Sbjct: 177 CVLTVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLA 236 Query: 136 AVIAFWVKIRPATPRLRSGLYRLASVLISILLVILVAAFFYKDYASLFRNNKQLIKALSP 195 +AF V +R R+ L LI + L A + + + L + L++A+ Sbjct: 237 GFMAFRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRI 296 Query: 196 SNSIVASWSWYSHQRLANLPLVRIGEDAHRN--------PLML 230 S V S + H+ VR G H+ P+M Sbjct: 297 S-GDVMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMR 338
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 92.6 bits (230), Expect = 6e-24 Identities = 32/139 (23%), Positives = 58/139 (41%) Query: 1 MQQPQVWLVEDEQGIADTLIYTLQLEGFTVELFARGLPALEKARQQRPDAVILDVGLPDI 60 M + + +D+ I L L G+ V + + D V+ DV +PD Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 SGFELCRQLLERHPALPILFLTARSDEVDRLLGLEIGADDYVAKPFSPREVSARVRTLLR 120 + F+L ++ + P LP+L ++A++ + + E GA DY+ KPF E+ + L Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 121 RVKKFAAPSPVVRTGHFEL 139 K+ + L Sbjct: 121 EPKRRPSKLEDDSQDGMPL 139
>PF06580#Sensor histidine kinase Length = 349 Score = 28.7 bits (64), Expect = 0.049 Identities = 20/79 (25%), Positives = 32/79 (40%), Gaps = 16/79 (20%) Query: 374 NVLDNAIDFTPENGVITLSAQPMGEKAILQVTDSGCGIPDFALPRIFDRFYSLPRENGRK 433 N + + I P+ G I L L+V ++G SL +N ++ Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG----------------SLALKNTKE 309 Query: 434 SSGLGLAFVSEAARLLNGE 452 S+G GL V E ++L G Sbjct: 310 STGTGLQNVRERLQMLYGT 328
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 768 bits (1985), Expect = 0.0 Identities = 333/877 (37%), Positives = 489/877 (55%), Gaps = 70/877 (7%) Query: 14 LSFLFICCS----IKPALAHDHFNPLSLENDEPGVENVDLSVFEKGGQAE-GTYNVDIYI 68 LF+ C+ + A +FNP L +D DLS FE G + GTY VDIY+ Sbjct: 27 FVRLFVACAFAAQAPLSSAELYFNPRFLADD--PQAVADLSRFENGQELPPGTYRVDIYL 84 Query: 69 NNTSVETKNIAFKNKKSANNKLSLQPCLSVEQLKQWGVKTENFPELKN-DPNGCTDL-SL 126 NN + T+++ F + +++ + PCL+ QL G+ T + + + C L S+ Sbjct: 85 NNGYMATRDVTFN---TGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSM 141 Query: 127 LAGAVAKFNVIGNRLDLAIPQIALIADPREFVPTSEWDEGINAFLLNYSFTGSQDHDIDE 186 + A A+ +V RL+L IPQ + R ++P WD GINA LLNY+F+G+ + Sbjct: 142 IHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQN-RI 200 Query: 187 NRTENSEYANLRPGINIGAWRFRNYSTW-----NHDSDGQNSWDSAYTYVSRDIEFLKGQ 241 + Y NL+ G+NIGAWR R+ +TW + S +N W T++ RDI L+ + Sbjct: 201 GGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSR 260 Query: 242 LIAGENNTPADVFDSISFKGVQISSDDDMLPDSMKGFAPVIRGVAKSSAQVTVEQNGYTI 301 L G+ T D+FD I+F+G Q++SDD+MLPDS +GFAPVI G+A+ +AQVT++QNGY I Sbjct: 261 LTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDI 320 Query: 302 YKTNVPAGPFAINDLYPTGGSGDLYVTIKESDGSEQHFIVPYASVPVLQREGHLKYDLTV 361 Y + VP GPF IND+Y G SGDL VTIKE+DGS Q F VPY+SVP+LQREGH +Y +T Sbjct: 321 YNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITA 380 Query: 362 GRTRSSDTHSAQQNFAELTALYGLAGGITAYGGIESTLSNDIYHAALIGTGLNLGDLGAL 421 G RS + + F + T L+GL G T YGG + D Y A G G N+G LGAL Sbjct: 381 GEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLA---DRYRAFNFGIGKNMGALGAL 437 Query: 422 SLDVTNSWSKIKAGDVVSDTLTGQSWRIRYSKDIQSTGTNFTVAGYRYSTKDYYALEDVL 481 S+D+T + S + GQS R Y+K + +GTN + GYRYST Y+ D Sbjct: 438 SVDMTQANSTLPDD----SQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTT 493 Query: 482 DTYSD--------------------NSHYDHVRNRTDLSLSQDII-YGSISLTLYNEDYW 520 + + + + R + L+++Q + ++ L+ ++ YW Sbjct: 494 YSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYW 553 Query: 521 N-DTHTTSLGIGYNNTWHNVSYGINYSYTLNADNTQDEDDDTEDSNDQQISINISIPLDA 579 G N + ++++ ++YS T NA DQ +++N++IP Sbjct: 554 GTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQ---------KGRDQMLALNVNIPFSH 604 Query: 580 FMPS--------TYATYNMNSAKDGDTTHTVGLNGTALAQKNLSWSVQEGYSS---QEKA 628 ++ S A+Y+M+ +G T+ G+ GT L NLS+SVQ GY+ Sbjct: 605 WLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSG 664 Query: 629 TSGNVSATYNGTYADINGGYSYDNHMRRLNYGVQGGVLLHRNGLTLSQPMDDTIILVKAP 688 ++G + Y G Y + N GYS+ + +++L YGV GGVL H NG+TL QP++DT++LVKAP Sbjct: 665 STGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAP 724 Query: 689 GAAGVPVNNETGVDTDFRGYAVVPYASPYHRNEVSLDTTGIRKNIELIDTSKTLVPTRGA 748 GA V N+TGV TD+RGYAV+PYA+ Y N V+LDT + N++L + +VPTRGA Sbjct: 725 GAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGA 784 Query: 749 VVRAEYKTNIGYKALMVLTRINNLPVPFGATVSSLTKPDNHSSFVGDAGQAWLTGLEKQG 808 +VRAE+K +G K LM LT NN P+PFGA V+S + S V D GQ +L+G+ G Sbjct: 785 IVRAEFKARVGIKLLMTLTH-NNKPLPFGAMVTSES--SQSSGIVADNGQVYLSGMPLAG 841 Query: 809 RLLVKWGPTAADRCQVSYRIPSSPSASGVEILHEQCQ 845 ++ VKWG C +Y++P + L +C+ Sbjct: 842 KVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 27.7 bits (61), Expect = 0.023 Identities = 23/70 (32%), Positives = 35/70 (50%), Gaps = 9/70 (12%) Query: 11 MLTAV-ASTPVFAQNTITFNGKIYDQACTVQVNGSTDTTIDLGNYSKERIAEKGATTDYV 69 ML AV S V A + +TF GK+ ACTVQ + ++ G+ + + + G Sbjct: 12 MLGAVLMSQHVHAADNLTFKGKLIIPACTVQ-----NAEVNWGDIEIQNLVQSGGNQK-- 64 Query: 70 PFTVSLVSCP 79 FTV + +CP Sbjct: 65 DFTVDM-NCP 73