>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.
          Length = 522

 Score = 31.7 bits (71), Expect = 0.007
 Identities = 29/102 (28%), Positives = 52/102 (50%), Gaps = 8/102 (7%)

Query: 1   MLDPNKLRNNYDFFKKKLLERNVNEQLLNQFIQTDKLMRKNLQQLELANQKQSLLAKQVA 60
            + P  +++N   F+K+ +   +  +   +F++T KL+       EL  QK++L  ++ A
Sbjct: 96  FIQPKSVKSNL-MFEKEAVNFALMTRDYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEA 154

Query: 61  K---QKDNKKLLAESKELKQK----IENLNNAYKDSQNISQD 95
           K   QK  K    + KE + K    +ENL NA  + QN+S +
Sbjct: 155 KEQAQKAQKDKREKRKEERAKNRANLENLTNAMSNPQNLSNN 196
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 170 bits (431), Expect = 2e-47 Identities = 111/438 (25%), Positives = 175/438 (39%), Gaps = 86/438 (19%) Query: 5 NIRNFSIIAHIDHGKSTLSDRLLEHSLGFEKRL----LQAQMLDTMEIERERGITIKLNA 60 I N ++AH+D GK+TL++ LL +S G L D +ER+RGITI+ Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNS-GAITELGSVDKGTTRTDNTLLERQRGITIQTGI 60 Query: 61 VELKINVDNNNYLFHLIDTPGHVDFTYEVSRSLAACEGVLLLVDATQGIQAQ-------- 112 + N ++IDTPGH+DF EV RSL+ +G +LL+ A G+QAQ Sbjct: 61 ----TSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116 Query: 113 ------TI-------------SNAYLALENNL--EIIP-----------VINKIDMDNAD 140 TI S Y ++ L EI+ V N + + D Sbjct: 117 RKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176 Query: 141 ----------------IETTKDSLHNLL--GVEKNSICLV---SAKANLGIDQLIQTIIA 179 L S+ V SAK N+GID LI+ I Sbjct: 177 TVIEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITN 236 Query: 180 KIPPPKGEINRPLKALLFDSYYDPYKGVVCFIRVFDGCLKVNDKVRFIKSNSVYQIVELG 239 K L +F Y + + +IR++ G L + D VR + + +I E+ Sbjct: 237 KFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-KITEMY 295 Query: 240 VKTPFFE-KRDQLQAGDVGWFSAGIKKLRDVGVGDTIVSFDDQFTKPLAGYKKILPMIYC 298 K D+ +G++ KL V +GDT + Q + + LP++ Sbjct: 296 TSINGELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKL--LPQRERI----ENPLPLLQT 348 Query: 299 GLYPVDNSDYQNLKLAMEKIIISDAALEYEY--ETSQALGFGVRCGFLGLLHMDVIKERL 356 + P + L A+ +I SD L Y T + + FLG + M+V L Sbjct: 349 TVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEII-----LSFLGKVQMEVTCALL 403 Query: 357 EREYNLKLISAPPSVVYK 374 + +Y++++ P+V+Y Sbjct: 404 QEKYHVEIEIKEPTVIYM 421 Score = 35.2 bits (81), Expect = 9e-04 Identities = 19/105 (18%), Positives = 35/105 (33%), Gaps = 15/105 (14%) Query: 400 ISEPFVKVFIDLPDQYLGSVIDLCQNFRGQYESLNEIDINRKRICYLMPLGEIIYSFFDK 459 + EP++ I P +YL + ++ N + +P I + Sbjct: 535 LLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVD-TQLKNNEVILSGEIPARC-IQEYRSD 592 Query: 460 LKSISKGYASLNYEFYNYQ-------------HSQLEKVEIMLNK 491 L + G + E Y +S+++KV M NK Sbjct: 593 
LTFFTNGRSVCLTELKGYHVTTGEPVCQPRRPNSRIDKVRYMFNK 637
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.
          Length = 261

 Score = 27.0 bits (60), Expect = 0.012
 Identities = 7/30 (23%), Positives = 11/30 (36%)

Query: 47 VGQIIYRQRGTKIFAGQNVAMGSDNTLFAL 76
           G+II  Q G       + A   +  + A
Sbjct: 98 AGEIIGLQMGLSFATFVDPASHLNMPVLAR 127
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.
          Length = 166

 Score = 38.3 bits (89), Expect = 1e-05
 Identities = 21/73 (28%), Positives = 32/73 (43%), Gaps = 10/73 (13%)

Query: 7  IFGGSFDPIHNAHLYIAKHAIKKIKAQKLFFVPTYNGIFKN---NFHASNKDRIAMLKLA 63
          I+ GSFDPI   HL I +            F   Y  + +N       S ++R+  +  A
Sbjct: 4  IYPGSFDPITFGHLDIIERGC-------RLFDQVYVAVLRNPNKQPMFSVQERLEQIAKA 56

Query: 64 IKSVNNALVSNFD 76
          I  + NA V +F+
Sbjct: 57 IAHLPNAQVDSFE 69
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 35.8 bits (82), Expect = 2e-04 Identities = 28/222 (12%), Positives = 68/222 (30%), Gaps = 1/222 (0%) Query: 94 LEGEINRQVQQNSELFSQLKQSESEIIQMQQLVEAKEHQIEALNKQLHAIKEANKKLIEE 153 L+ + + N L + E+ ++ + + + ++ ++ L + Sbjct: 69 LKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKA 128 Query: 154 HESINVEELLKEYEVQCNEAIYKRDQHIQTVFEDKLALKDGEISETQSLLKTAEKEKQAL 213 E +++ EA + E L + + +KT E EK AL Sbjct: 129 LEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAAL 188 Query: 214 KKAYKLVVNSLQKHQKLTTEIEIDFTKLDEIIATIFDETKNPKTGFTNFIKQFEKTKAKL 273 + + +L+ +T L+ A + + + + AK+ Sbjct: 189 EARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKI 248 Query: 274 TKKIAEITKLDHSTPTNYQQETPASQQQLDQENEPIKPSKKS 315 AE L+ ++ + ++ IK + Sbjct: 249 KTLEAEKAALEARQA-ELEKALEGAMNFSTADSAKIKTLEAE 289
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
          Length = 478

 Score = 28.6 bits (64), Expect = 0.046
 Identities = 18/120 (15%), Positives = 40/120 (33%), Gaps = 1/120 (0%)

Query: 50  SPFAGTISAINVKVGDVVSIGQVMAVIGEKTSTPLVEPKPQPTEEVAKVKEAGASVVGEI 109
                 +  I VK G+ V  G V+  +           K Q +   A++++    ++
Sbjct: 101 PIENSIVKEIIVKEGESVRKGDVLLKL-TALGAEADTLKTQSSLLQARLEQTRYQILSRS 159

Query: 110 KVSDNLFPIFGVKPHATPAVKDTKVASSTNITVETTQKPESKTEQKTIAISTMRKAIAEA 169
              + L  +          V + +V   T++  E     +++  QK + +   R
Sbjct: 160 IELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTV 219
>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.
          Length = 1147

 Score = 29.7 bits (66), Expect = 0.016
 Identities = 27/85 (31%), Positives = 39/85 (45%), Gaps = 8/85 (9%)

Query: 37  TAANLYVQARNSIDSSF-NSAKAFANALANSANQFSKSSITNNLDQVK---KDLEQSLQK 92
           T   L  Q  N +   F +S K       N     + +  T N D+VK   KDLE+SL+K
Sbjct: 561 TTKGLSPQEANKLIKDFLSSNKELVGKTLNFNKAVADAKNTGNYDEVKKAQKDLEKSLRK 620

Query: 93  VD----EYKKNLESQNNLGNISQEK 113
            +    E +K LES++   N  + K
Sbjct: 621 REHLEKEVEKKLESKSGNKNKMEAK 645
>SSPAMPROTEIN#Salmonella surface presentation of antigen gene type M signature.
          Length = 147

 Score = 30.4 bits (68), Expect = 0.017
 Identities = 15/44 (34%), Positives = 28/44 (63%), Gaps = 2/44 (4%)

Query: 633 KKLSTIKRTKDGFEYKFKY--RKDFNEQRWIAKDFRIPLNKNVQ 674
           +K S +++ ++ F+ K KY  RK+ N QRWI +  R+ + + +Q
Sbjct: 94  EKRSELEKKREEFQEKSKYWLRKEGNYQRWIIRQKRLYIQREIQ 137
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.
          Length = 539

 Score = 28.5 bits (63), Expect = 0.046
 Identities = 16/77 (20%), Positives = 24/77 (31%), Gaps = 6/77 (7%)

Query: 210 DSYNFRLNSLKSKLDNALYSLDKTIQNTNENTANLEAIRHNLEQKIQNQSKQLRTNFDTQ 269
             +N  L    S     L   DK++         LEA + +LE+ ++
Sbjct: 84  KDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFS------T 137

Query: 270 KLDDKINELEIRMQKLT 286
               KI  LE     L
Sbjct: 138 ADSAKIKTLEAEKAALA 154
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 38.9 bits (90), Expect = 2e-04 Identities = 48/267 (17%), Positives = 83/267 (31%), Gaps = 5/267 (1%) Query: 376 DSLLKLETEYKALQHKINEFKNESATKSEELLNQERELFE---KRREIDTLLTQASLEYE 432 L KAL+ +E E + E+L ++ L E K +E++ E Sbjct: 71 LKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALE 130 Query: 433 HQRESSQLLKDKQNEVKQHFQNLEYAKKELDKERNLLDQQKKVDSEAIFQLKEKVAQERK 492 S K ++ L K +L+K DS I L+ + A Sbjct: 131 GAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 190 Query: 493 ELEELY--LVKKQKQDQKENELLFFEKQLKQHQADFENELEAKQQELFEAKHALERSFIK 550 EL L ++ + + K A + +LE + A Sbjct: 191 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 250 Query: 551 LEDKEKDLNTKAQQIANEFSQLKTDKSKSADFELMLQNEYENLQQEKQKLFQERTYFERN 610 LE ++ L + ++ + + L+ E L+ EK L + N Sbjct: 251 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNAN 310 Query: 611 AAVLSNRLQQKREELLQQKETLDQLTK 637 L L RE Q + +L + Sbjct: 311 RQSLRRDLDASREAKKQLEAEHQKLEE 337 Score = 36.2 bits (83), Expect = 0.001 Identities = 53/355 (14%), Positives = 105/355 (29%), Gaps = 9/355 (2%) Query: 550 KLEDKEKDLNTKAQQIANEFSQLKTDKSKSADFELMLQNEYENLQQEKQKLFQERTYFER 609 K E + L K ++ LK + + + + + + + E Sbjct: 61 KFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEA 120 Query: 610 NAAVLSNRLQQKREELLQQKETLDQLTKSFEQERLINQREHKELVASVEKQKEILGK--K 667 A L L+ + L K L ++ K Sbjct: 121 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 180 Query: 668 LQDFSQTSLNASKNLAEREMAIKFKEKEIEATEKQLLNDVNNAEVIQADLAQLNQSLNQE 727 L+ L + A K L + +ADL + + Sbjct: 181 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNF 240 Query: 728 RSELQNAKQRIADFHNDSLKKLNEYELSLQKRLQELQTLEANQKQHSYQNQAYFEGELDK 787 + + + + E E + T ++ + + +A E E Sbjct: 241 STADSAKIKTLEAEKAALEARQAELE-KALEGAMNFSTADSAKIKTLEAEKAALEAEKAD 299 Query: 788 LNREKQAFLNLRKKQTMEVDA---IKQRLSDKHQALNMQQAELDRKTHELNNAFLNHDAD 844 L + Q R+ ++DA K++L +HQ L Q + L Sbjct: 300 
LEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREA 359 Query: 845 QKSLQDQLATVKETQKLIDLERSAL---LEKQREFAENVAGFKRHWSNKTSQLQK 896 +K L+ + ++E K+ + R +L L+ RE + V ++K + L+K Sbjct: 360 KKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEK 414 Score = 35.8 bits (82), Expect = 0.002 Identities = 39/302 (12%), Positives = 102/302 (33%), Gaps = 4/302 (1%) Query: 1229 LKNLSQTYLANKNKAEYSQQQLQQKYTNLLDLKENLERTKDQLDKKHRSIFARLTKFAND 1288 ++ + + N + L L D + L +K R L++ A+ Sbjct: 55 VQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASK 114 Query: 1289 LRFEKKQLLKAQRIVDDKNRLLKENERNLHFLSNETERKRAVLEDQISYFEKQRKQATDA 1348 ++ + + ++ ++ + + L E A + + + + A Sbjct: 115 IQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAA-RKADLEKALEGAMNFSTA 173 Query: 1349 ILASHKEVKKKEGELQKLLVELETRKTKLNNDFAKFSRQREEFENQRLKLLELQKTLQTQ 1408 A K ++ ++ L+ ELE N S + + E ++ L + L+ Sbjct: 174 DSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKA 233 Query: 1409 TNSNNFKTKAIQEIENSYKRGMEELNFQKKEFDK---NKSRLYEYFRKMRDEIERKESQV 1465 + A + + L ++ E +K +E +++ + Sbjct: 234 LEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAAL 293 Query: 1466 KLVLKETQRKANLLEAQANKLNIEKNTIDFKEKELKAFKDKVDQDIDSTNKQRKELNELL 1525 + + + ++ +L A L + + +K+L+A K+++ + R+ L L Sbjct: 294 EAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDL 353 Query: 1526 NE 1527 + Sbjct: 354 DA 355 Score = 35.4 bits (81), Expect = 0.002 Identities = 48/359 (13%), Positives = 112/359 (31%), Gaps = 6/359 (1%) Query: 1200 EKQRQLVAIKTQCEKLSDEKKALNQKLVELKNLSQTYLANKNKAEYSQQQLQQKYTNLLD 1259 + + + ++ + ++ L E + Q A K E + + T Sbjct: 82 ALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSA 141 Query: 1260 LKENLERTKDQLDKKHRSIFARLTKFAND---LRFEKKQLLKAQRIVDDKNRLLKENERN 1316 + LE K L + + L N + K L + ++ + L++ Sbjct: 142 KIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEG 201 Query: 1317 LHFLSNETERKRAVLEDQISYFEKQRKQATDAILASHKEVKKKEGELQKLLVELETRKTK 1376 S K LE + + ++ A+ + +++ L E + + Sbjct: 202 
AMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEAR 261 Query: 1377 LNNDFAKFSRQREEFENQRLKLLELQKTLQTQTNSNNFKTKAIQEIENSYKRGMEELNFQ 1436 K+ L+ Q + + + +L+ Sbjct: 262 QAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDAS 321 Query: 1437 KKEFDKNKSRLYEYFRKMR-DEIERKESQVKLVLKETQRKANLLEAQANKLNIEKNTIDF 1495 ++ + ++ + + + E R+ + L +K LEA+ KL + + Sbjct: 322 REAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQ--LEAEHQKLEEQNKISEA 379 Query: 1496 KEKELKAFKDKVDQDIDSTNKQRKELNELLNENKLLQQSLIERERAINSKDSLLNKKIE 1554 + L+ D + K +E N L + L + L E ++ + + L K+E Sbjct: 380 SRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLE 438 Score = 31.6 bits (71), Expect = 0.034 Identities = 51/369 (13%), Positives = 113/369 (30%), Gaps = 3/369 (0%) Query: 952 EKNNQVKLELDNRFQALQNQKQDTVQAQLELEREQHQLNLEQTAF-NQANESLLKQREQL 1010 N + R Q +K + E+E +L +F N+A + + + Sbjct: 34 VVNTNEVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEE 93 Query: 1011 TKKIQAFHYELKKRNQFLALKGKRLFAKEQDQQRKDQEINWRFKQFEKEYTDFDEAKKRE 1070 + + K A K + L A++ D ++ + + + K Sbjct: 94 LSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAAL 153 Query: 1071 LEELEKIRRSLSQSNVELERKREKLATDFTNLNKVQHNTQINRDQLNSQIRQFLLERKNF 1130 + ++L + K+ T ++ L + + Sbjct: 154 AARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKI 213 Query: 1131 QRFSNEANAKKAFL--IKRLRSFASNLKLQKEALAIQKLEFDKRDEQQKKELQQATLQLE 1188 + E A A +++ A N A E ++ EL++A Sbjct: 214 KTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAM 273 Query: 1189 QFKFEKQNFDIEKQRQLVAIKTQCEKLSDEKKALNQKLVELKNLSQTYLANKNKAEYSQQ 1248 F + + A++ + L + + LN L+ K + E Q Sbjct: 274 NFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQ 333 Query: 1249 QLQQKYTNLLDLKENLERTKDQLDKKHRSIFARLTKFANDLRFEKKQLLKAQRIVDDKNR 1308 +L+++ +++L R D + + + A K + + +R +D Sbjct: 334 KLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE 393 Query: 1309 LLKENERNL 1317 K+ E+ L Sbjct: 394 AKKQVEKAL 402
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.
          Length = 1541

 Score = 30.0 bits (67), Expect = 0.004
 Identities = 20/92 (21%), Positives = 36/92 (39%), Gaps = 16/92 (17%)

Query: 73   IPTPVVKEIDQPA---------------VIPPVKAKPKATKKKTPVKSKPTSKSTKQTKP 117
            I TP   + D P+               V PP  A P  T +     SK  SK+ ++ +
Sbjct: 997  ITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQ 1056

Query: 118  KQSKPKSKQVQQTK-AKPTQIQTKKSNKKTRS 148
              ++  ++  +  K AK       ++N+  +S
Sbjct: 1057 DATETTAQNREVAKEAKSNVKANTQTNEVAQS 1088
>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.
          Length = 1291

 Score = 25.8 bits (56), Expect = 0.035
 Identities = 23/69 (33%), Positives = 31/69 (44%), Gaps = 1/69 (1%)

Query: 7   AQAKQVVGGLSFWTFSAGLIMIVNALTGVAHAVNDIFQSTTANANGSDDDNENKNNSYRS 66
           A  K  +G L  W  SAGL +I     G     ND   +TT N   +D    ++NNS
Sbjct: 304 ASNKTHIGTLDLWQ-SAGLNIIAPPEGGYKDKPNDKPSNTTQNNAKNDKQESSQNNSNTQ 362

Query: 67  KSNYFNTAR 75
             N  N+A+
Sbjct: 363 VINPPNSAQ 371