>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 113 bits (283), Expect = 2e-32 Identities = 73/248 (29%), Positives = 117/248 (47%), Gaps = 17/248 (6%) Query: 12 LNGKVAAVTGAASGIGLECARTLLAAGAKVVLIDREGEKLTKIVAELGENAF---ALQVD 68 + GK+A +TGAA GIG ARTL + GA + +D EKL K+V+ L A A D Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 69 LMQADQVDNILAGILNLTGRLDIFHANAGAYIGGPVAEGDPDVWDRVLHLNTNAAFRCVR 128 + + +D I A I G +DI AG G + + W+ +N+ F R Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 129 SVLPHMIAQKSGDIIFTSSIAGVVPVIWEPIYTASKFAVQAFVHTTRRQVSQHGVRVGAV 188 SV +M+ ++SG I+ S VP Y +SK A F +++++ +R V Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 189 LPGPVVTALLDD-WPKEKMEEALANGSLMQ------------PIEVAESVLFMVT-RSKN 234 PG T + W E E + GSL P ++A++VLF+V+ ++ + Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245 Query: 235 VTVRDLVI 242 +T+ +L + Sbjct: 246 ITMHNLCV 253
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 39.9 bits (93), Expect = 2e-05 Identities = 60/373 (16%), Positives = 138/373 (36%), Gaps = 43/373 (11%) Query: 42 ELGFTPAQASFAFTLYGLAAALSAWISGVVAEIITPQKTMLIGFVLWCVFHVLFLIFGLG 101 + PA ++ T + L ++ + G +++ + ++ +L G ++ C +I +G Sbjct: 43 DFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGS---VIGFVG 99 Query: 102 HANYPLILLFYGIRGFAYPLFLYSFIVAIVHNVKSDSASSALGWFWAVYSVGIGVFGSYI 161 H+ + L+++ I+G F +V + + ++ A G ++ ++G G G I Sbjct: 100 HSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEG-VGPAI 158 Query: 162 PSFTIPHIGEMGTLWLALVFCVTGGIIALVSLRHIQTPRHMQ---------NLTTREKFA 212 +I L + ++ +T + + + ++ H + F Sbjct: 159 GGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFT 218 Query: 213 ELGRAATL-------------------------LYTNRNILLSGMVRIINTLSLFGFAVI 247 + L L N ++ + I ++ GF + Sbjct: 219 TSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSM 278 Query: 248 MPMMFVDELGFTTSEWLQVWAAFFFTTIFSNVFWGIVAEKMGWMKVIRWFGCIGMALSSL 307 +P M D +T+E + + F S + +G + + + + IG+ S+ Sbjct: 279 VPYMMKDVHQLSTAE---IGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSV 335 Query: 308 AFYYIP-QHFGHNFAMALVPAIALGIFVAAFVPMAAVFP-ALEPQHKGAAISVYNLSAGL 365 +F ++ M ++ LG ++ + +L+ Q GA +S+ N ++ L Sbjct: 336 SFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFL 395 Query: 366 SNFLAPAIAVVLL 378 S AI LL Sbjct: 396 SEGTGIAIVGGLL 408
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 45.6 bits (108), Expect = 2e-07 Identities = 60/271 (22%), Positives = 95/271 (35%), Gaps = 17/271 (6%) Query: 29 LSKSGFSAGEIGWSYACTAIAAILSPILVGSLTDRFFSAQRVLAVLMFAGAILMYFAAQQ 88 L S G A A+ ++G+L+DRF +R + ++ AGA + Y Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAI--- 89 Query: 89 TTFAGFFPLLLAYSLTYMPTIALTNSIAFSNVADVERDFPRIRVMGTIG-WIGSGLVCGF 147 A F +L + T A T ++A + +AD+ R R G + G G+V G Sbjct: 90 MATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAG- 147 Query: 148 LPQMLG-LSDISPTNVPLLIAAGSSALLGVFALFLPDTPPKSRGKLDLKVMLGLDALILL 206 P + G + SP + P AA + L + FL K + + L A Sbjct: 148 -PVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRW 205 Query: 207 RDKN------FLVFFFCSFLFAMPLAFYYIFANGYLTEAGMKHATGWMTLGQFSEIFFML 260 VFF + +P A + IF G + + Sbjct: 206 ARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAM 265 Query: 261 ALPFFTKRFGIKKVLLLGLITAAIRYGFFVF 291 R G ++ L+LG+I Y F Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYILLAF 296 Score = 34.8 bits (80), Expect = 6e-04 Identities = 31/153 (20%), Positives = 52/153 (33%), Gaps = 20/153 (13%) Query: 253 FSEIFFMLALPFFTKRFGIKKVLLLGLITAAIRYGFFVFGGAENYFTYTLMFLGILLHGV 312 + L + RFG + VLL+ L AA+ Y ++++G ++ G+ Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAP-----FLWVLYIGRIVAGI 108 Query: 313 SYDFYYVTAYIYVDKKAPVSMRNAAQGLITLCCQGFGSLLGYRLGGVMMEKMFAYPQPVN 372 + V D R G ++ C GFG + G LGG+M P Sbjct: 109 TGATGAVAGAYIADI-TDGDERARHFGFMS-ACFGFGMVAGPVLGGLMGGFSPHAP---- 162 Query: 373 GFTFNWAGMWTFGAIMIAIIAVLFMVFFRESDK 405 + A + + + ES K Sbjct: 163 ---------FFAAAALNGLNFLTGCFLLPESHK 186
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 35.1 bits (81), Expect = 9e-04 Identities = 26/104 (25%), Positives = 45/104 (43%), Gaps = 17/104 (16%) Query: 14 AARGEIPFDLLLTGAQIVDMVTGEIREADVGITGEMIASVHPRGSR----------ADAH 63 R D ++T A I+D G I +AD+G+ IA++ G+ Sbjct: 61 VTREGGAVDTVITNALILDH-WG-IVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGT 118 Query: 64 QRHALDGAYLSPGLMDTHVHLESSHLPPERYAEIVLAQGTTAVF 107 + A +G ++ G MD+H+H + P++ E L G T + Sbjct: 119 EVIAGEGKIVTAGGMDSHIHF----ICPQQ-IEEALMSGLTCML 157
>PF06580#Sensor histidine kinase Length = 349 Score = 206 bits (526), Expect = 8e-64 Identities = 59/216 (27%), Positives = 117/216 (54%), Gaps = 3/216 (1%) Query: 343 LGEGIAQLLSAQILAGQYERQKALLTQSEIKLLHAQVNPHFLFNALNTLKAVIRRDSDQA 402 L G + + + ++ ++++ L AQ+NPHF+FNALN ++A+I D +A Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKA 193 Query: 403 GQLVQYLSTFFRKNLKR-PSEFVTLADEIEHVNAYLQIEKARFQANLQIQMLVPDALAHH 461 +++ LS R +L+ + V+LADE+ V++YLQ+ +F+ LQ + + A+ Sbjct: 194 REMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDV 253 Query: 462 QLPAFTLQPIVENAIKHGTSQHLGVGEITIRASQHQRWLQLDIEDNAGL-YQHNPNASGL 520 Q+P +Q +VEN IKHG +Q G+I ++ ++ + L++E+ L ++ ++G Sbjct: 254 QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGT 313 Query: 521 GMSLVDRRLRARFGADCGITVTCEAERFTRVTLRLP 556 G+ V RL+ +G + I ++ + + + +P Sbjct: 314 GLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 35.9 bits (83), Expect = 1e-04 Identities = 37/146 (25%), Positives = 60/146 (41%), Gaps = 6/146 (4%) Query: 4 LTLLLAVPFAPQAIAKTPVAATALQPEIASGSAMI-VDLASKKIIYQSQPDLVRPIASIT 62 ++LL +P A A + + +++ MI +DLAS + + + D P+ S Sbjct: 9 ISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTF 68 Query: 63 KVMTAMVVLDAHLPLDEMLTVDISHTPEMKGIYSRV---RLNSQISRRNMLLLALMSSEN 119 KV+ VL DE L I + + YS V L ++ + A+ S+N Sbjct: 69 KVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSDN 128 Query: 120 RAAASLAHHY--PGGYDAFIRAMNAK 143 AA L P G AF+R + Sbjct: 129 SAANLLLATVGGPAGLTAFLRQIGDN 154
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 32.9 bits (75), Expect = 0.004 Identities = 9/19 (47%), Positives = 13/19 (68%) Query: 557 HVKEGDKVKAGDLLLEFDR 575 VKEG+ V+ GD+LL+ Sbjct: 111 IVKEGESVRKGDVLLKLTA 129
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 546 bits (1407), Expect = 0.0 Identities = 264/384 (68%), Positives = 300/384 (78%), Gaps = 15/384 (3%) Query: 1 MKVKVLSLLVPALLVAGAANAAEIYNKDGNKLDLYGKIDGLHYFSDDKSVDGDQTYMRVG 60 MK KVL+L++PALL AGAA+AAEIYNKDGNKLDLYGK+DGLHYFSDD S DGDQTYMRVG Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60 Query: 61 VKGETQINDQLTGYGQWEYNVQANNTESSSDQAWTRLAFAGLKFGDAGSFDYGRNYGVVY 120 KGETQINDQLTGYGQWEYNVQAN TE +WTRLAFAGLKFGD GSFDYGRNYGV+Y Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120 Query: 121 DVTSWTDVLPEFGGDTYG-SDNFLQSRANGVATYRNSDFFGLVDGLNFALQYQGKNGSVS 179 DV WTD+LPEFGGD+Y +DN++ RANGVATYRN+DFFGLVDGLNFALQYQGKN S S Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180 Query: 180 ------GEGATNNGRGALKQNGDGFGTSVTYDIWDGISAGFAYSHSKRTDDQNSLSK--G 231 G NNG NGDGFG S TYDI G SAG AY+ S RT++Q + Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGGTIA 240 Query: 232 RGDNAETYTGGLKYDANNIYLATQYTQTYNATRFSGNGNADSISGYANKAQNFEVVAQYQ 291 GD A+ +T GLKYDANNIYLAT Y++T N T + G + G ANK QNFEV AQYQ Sbjct: 241 GGDKADAWTAGLKYDANNIYLATMYSETRNMTPY-GKTDKGYDGGVANKTQNFEVTAQYQ 299 Query: 292 FDFGLRPSVAYLQSKGKDIE----GFGDQDILKYVDLGATYYFNKNMSTYVDYKINLLDD 347 FDFGLRP+V++L SKGKD+ D+D++KY D+GATYYFNKN STYVDYKINLLDD Sbjct: 300 FDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLLDD 359 Query: 348 NN-FTRRAGISTDDVVALGLVYQF 370 ++ F + AGISTDD+VALG+VYQF Sbjct: 360 DDPFYKDAGISTDDIVALGMVYQF 383
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 49.8 bits (119), Expect = 2e-09 Identities = 31/170 (18%), Positives = 66/170 (38%), Gaps = 27/170 (15%) Query: 1 MNTMNVIIADDHPIVLFGIRKSLEQIEWVNVVGEFEDSTALINNLPKLDAHVLITDLSMP 60 M +++ADD + + ++L + + + ++ L + D +++TD+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 GDKYGDGITLIKYIKRHFPDLSIIVLTMNNNPAILSAVLDLDIEGIVLKQGA------PT 114 + L+ IK+ PDL ++V++ N +A+ ++GA P Sbjct: 59 D---ENAFDLLPRIKKARPDLPVLVMSAQNTFM--TAIKA-------SEKGAYDYLPKPF 106 Query: 115 DLPKALAALQKGKKFTPESVSRLLEKISASGYGDKRL---SPKESEVLRL 161 DL + + + + R K+ L S E+ R+ Sbjct: 107 DLTELIGIIGRAL----AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRV 152
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 84.9 bits (210), Expect = 4e-19 Identities = 32/126 (25%), Positives = 54/126 (42%), Gaps = 1/126 (0%) Query: 827 ILVVDDHPINRRLLADQLGSLGYQCVTANDGVDALGVLSKQHIDIVLSDVNMPNMDGYRL 886 ILV DD R +L L GY ++ ++ D+V++DV MP+ + + L Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 887 TQRIRELGMTLPVIGVTANALAEEKQRCLESGMDNCLSKPVTLD-IIKQTLAVYAARVRK 945 RI++ LPV+ ++A + E G + L KP L +I A R+ Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125 Query: 946 ERQERE 951 + + Sbjct: 126 PSKLED 131
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 29.4 bits (66), Expect = 0.029 Identities = 22/119 (18%), Positives = 46/119 (38%), Gaps = 6/119 (5%) Query: 59 GFSRGDLGFALSGISIAYGFSK-FIMGSVSDRSNPRIFLPAGLILAALVMLVMGFVPWAT 117 + +G +L+ I + ++ I G V+ R R L G+I +++ F Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301 Query: 118 SSIMVMFVLLFLCGWFQGMGWPPCGRTMVHWWSQKERGGIVSVWNCAHNVGGGIPPLLF 176 + +M +L G+G P + ++ +G + ++ + PLLF Sbjct: 302 MAFPIMVLLASG-----GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLF 355
>PF06580#Sensor histidine kinase Length = 349 Score = 33.7 bits (77), Expect = 0.001 Identities = 21/101 (20%), Positives = 39/101 (38%), Gaps = 21/101 (20%) Query: 370 LLDNAIRY----TPANGTVTLSLRRDGEGVVMEVEDSGPGIEDDQIQQALLPFQRLENVG 425 L++N I++ P G + L +D V +EVE++G + + Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE------------- 309 Query: 426 DNPGAGLGLALVTD-IARLHRSHPQLLRSETLGGLKVRLRF 465 G GL V + + L+ + Q+ SE G + + Sbjct: 310 ---STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 94.1 bits (234), Expect = 2e-24 Identities = 35/122 (28%), Positives = 61/122 (50%), Gaps = 1/122 (0%) Query: 2 RLLLAEDNRELAHWLEKALVQGGFAVDCVFDGRAADHLLQSEKYALAVLDIGMPGFDGLE 61 +L+A+D+ + L +AL + G+ V + + + L V D+ MP + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VVQRLRKRGQTLPVLLLTARSAVADRVKGLNVGADDYLPKPFELEE-LDARLRALLRRSE 120 ++ R++K LPVL+++A++ +K GA DYLPKPF+L E + RAL Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 121 GQ 122 Sbjct: 125 RP 126
>AUTOINDCRSYN#Autoinducer synthesis protein signature. Length = 216 Score = 28.7 bits (64), Expect = 0.009 Identities = 12/79 (15%), Positives = 28/79 (35%), Gaps = 12/79 (15%) Query: 1 MITWQDLHHAELTVPQLYALLQLRCAVFV--------VEQTCPYQDVDGDDLIGENRHLL 52 M+ D++H L+ + L LR F + D ++ +L Sbjct: 1 MLEIFDVNHTLLSETKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNNN----TTYLF 56 Query: 53 GWRGDELVAYARILKSDDD 71 G + + ++ R +++ Sbjct: 57 GIKDNTVICSLRFIETKYP 75
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 378 bits (973), Expect = e-136 Identities = 223/253 (88%), Positives = 236/253 (93%), Gaps = 2/253 (0%) Query: 1 MNFRLSALALGATLLVGCASSSSGDLPQGRSDPLEGFNRTMFDFNFNVVDPYVLRPVAVA 60 M RLSALALG TLLVGCASS G QGRSDPLEGFNRTM++FNFNV+DPY++RPVAVA Sbjct: 1 MKLRLSALALGTTLLVGCASS--GTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVA 58 Query: 61 WRDYVPQPARNGLSNFTSNLEEPAVMANYFLQGDPYKGMVHFTRFFLNTLLGMGGLIDVA 120 WRDYVPQPARNGLSNFT NLEEPAVM NYFLQGDPY+GMVHFTRFFLNT+LGMGG IDVA Sbjct: 59 WRDYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVA 118 Query: 121 GMANPKLQRVEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDEGGDMADGLYPVLSWLTW 180 GMANPKLQR EPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRD+GGDMAD LYPVLSWLTW Sbjct: 119 GMANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPVLSWLTW 178 Query: 181 PMSIGKWAVEGVETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGKLTPVENPNA 240 PMS+GKW +EG+ETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGG+L P ENPNA Sbjct: 179 PMSVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNA 238 Query: 241 QAIQGDLKDIDSQ 253 QAIQ DLKDIDS+ Sbjct: 239 QAIQDDLKDIDSE 251
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 96.5 bits (240), Expect = 7e-24 Identities = 57/175 (32%), Positives = 79/175 (45%), Gaps = 31/175 (17%) Query: 382 GWVRNGVPARMSFGLYQGERLRMPVLDAIRSWVPPPPPPEPQPKSKPVTKIARLDSMSLF 441 G + GV R FG QGE PV V P P P P+ ++K T L S LF Sbjct: 181 GMLSLGVSYR--FG--QGEA--APV-------VAPAPAPAPEVQTKHFT----LKSDVLF 223 Query: 442 DSGQSVLKPGSTKLLVN---SLVGIKAKSGWLIVVAGYTDNTGTSQLNQRLSQKRAEAVR 498 + ++ LKP L L + K G +VV GYTD G+ NQ LS++RA++V Sbjct: 224 NFNKATLKPEGQAALDQLYSQLSNLDPKDG-SVVVLGYTDRIGSDAYNQGLSERRAQSVV 282 Query: 499 DWMRDTGDVPESCFAVQGYGASRPAATNDT---------PEGRALNRRVEISLVP 544 D++ G +P + +G G S P N + A +RRVEI + Sbjct: 283 DYLISKG-IPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVKG 336
>FLAGELLIN#Flagellin signature. Length = 507 Score = 33.5 bits (76), Expect = 0.003 Identities = 15/53 (28%), Positives = 26/53 (49%), Gaps = 3/53 (5%) Query: 473 DAAQEQVSDQTSEQVRMLTQVKRALNERDLVLYAQPIQNSAGEGYHEILTRMR 525 DAA + ++++ + ++ LTQ R N D + AQ A + L R+R Sbjct: 43 DAAGQAIANRFTSNIKGLTQASRNAN--DGISIAQ-TTEGALNEINNNLQRVR 92
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 41.1 bits (96), Expect = 3e-06 Identities = 22/104 (21%), Positives = 34/104 (32%), Gaps = 3/104 (2%) Query: 98 PYERQVQQPVRPEEPVRQQPAQHQAPQPHVPSPQVQQPPQPVAPQQPVP---PQYQPQQS 154 P + + Q V+P +P P P P +P +P P + Q Q Sbjct: 52 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPK 111 Query: 155 PQQPVVQSAPQQPAQVQQPAPQPVAAQPQPVAEPQVSEPQQPAP 198 V+S P P + PA + ++P S P Sbjct: 112 RDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRA 155 Score = 35.0 bits (80), Expect = 3e-04 Identities = 16/66 (24%), Positives = 26/66 (39%), Gaps = 6/66 (9%) Query: 136 PQPVAPQQPVPPQYQPQQSPQQPVVQSAPQQPAQVQQPAPQPVAAQPQPVAEPQVSEPQQ 195 P A Q P P +P+ P+ P+ P + +P +P+P +P +Q Sbjct: 56 EPPQAVQPPPEPVVEPEPEPEPI-----PEPPKEAPVVIEKPK-PKPKPKPKPVKKVQEQ 109 Query: 196 PAPQPK 201 P K Sbjct: 110 PKRDVK 115
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 41.3 bits (97), Expect = 6e-06 Identities = 38/156 (24%), Positives = 63/156 (40%), Gaps = 23/156 (14%) Query: 55 WAMSSALVGCVFGALASGWCADKFGRKQPLIMAATLFTLSAWGTALAHSFDMFVVWRIVG 114 +A+ V GAL+ D+FGR+ L+++ + A A + + RIV Sbjct: 52 YALMQFACAPVLGALS-----DRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVA 106 Query: 115 GLGIGLASALSPMYIAEISPAAQRGRFVAVNQLTIVIGVLAAQLVNLMIAKPVASSATLA 174 G+ G A++ YIA+I+ +R R G ++A M+A PV + Sbjct: 107 GI-TGATGAVAGAYIADITDGDERARH---------FGFMSACFGFGMVAGPVL-GGLMG 155 Query: 175 EISASWNGQVGWRWMFGSGIVPAVVFLVLMFFVPES 210 S F + + + FL F +PES Sbjct: 156 GFSPHAP-------FFAAAALNGLNFLTGCFLLPES 184 Score = 29.0 bits (65), Expect = 0.039 Identities = 17/59 (28%), Positives = 32/59 (54%), Gaps = 2/59 (3%) Query: 325 LVDKIGRRKLMLLGAAGLTIIYALIGAAYGLGILGLPVLI--LVLAAIAIYALTLAPVT 381 L D+ GRR ++L+ AG + YA++ A L +L + ++ + A A+ +A +T Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADIT 124
>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature. Length = 591 Score = 32.8 bits (74), Expect = 0.008 Identities = 25/66 (37%), Positives = 32/66 (48%), Gaps = 13/66 (19%) Query: 160 MTGFTLYPDRALIEITGKVFNGNATPRH--FLWW-ANPAVKGGDAHQSVFPPDVTAVFDH 216 + G DR+ + KV GNATP +LW A PAV Q +F T VFD+ Sbjct: 218 LNGNEAGRDRSAMRYLSKVQYGNATPAADLYLWTSATPAV------QWLF----TLVFDY 267 Query: 217 GKRDVS 222 G+R V Sbjct: 268 GERGVD 273
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 85.9 bits (212), Expect = 4e-22 Identities = 67/257 (26%), Positives = 124/257 (48%), Gaps = 7/257 (2%) Query: 3 QVAVVIGGGQTLGEFLSRGLAAQGYRVAVVDIQSDKATRVAQAINDEYGEGMAYGFGADA 62 ++A + G Q +GE ++R LA+QG +A VD +K +V ++ E A F AD Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE--ARHAEAFPADV 66 Query: 63 TSEASVMALARGVDEIFARVDLLVYSAGIAKAAFISDFALGDFDRSLQVNLVGYFLCARE 122 A++ + ++ +D+LV AG+ + I + +++ + VN G F +R Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 123 FSRLMIRDGIKGRIIQINSKSGKVGSKHNSGYSAAKFGGVGLTQSLALDLAEYGITVHAL 182 S+ M D G I+ + S V + Y+++K V T+ L L+LAEY I + + Sbjct: 127 VSKYM-MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 183 MLGNLLKSPMFQSL-LPQYASKLGIAEDEVEQYYIDKVPLKRGCDYQDVLNVLMFYASPQ 241 G+ ++ M SL + ++ I +E + +PLK+ D+ + ++F S Q Sbjct: 186 SPGS-TETDMQWSLWADENGAEQVIKGS-LETFKTG-IPLKKLAKPSDIADAVLFLVSGQ 242 Query: 242 ASYCTGQSINITGGQVM 258 A + T ++ + GG + Sbjct: 243 AGHITMHNLCVDGGATL 259
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 27.9 bits (62), Expect = 0.028 Identities = 11/45 (24%), Positives = 19/45 (42%), Gaps = 5/45 (11%) Query: 1 MKPRQRQAAIVEHLQAQGKCSVEEL-----AQHFDTTGTTIRKDL 40 M QR I E + A + +EL ++ T T+ +D+ Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDI 45
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 364 bits (936), Expect = e-123 Identities = 133/399 (33%), Positives = 199/399 (49%), Gaps = 40/399 (10%) Query: 149 IAALVAGALN----------NALLIARLENQNVLPEASASYTPPDRQEIIGLSAPIQQLK 198 I A GA + +I R + + D ++G SA +Q++ Sbjct: 91 IKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIY 150 Query: 199 KEIDIVAASDLNVLISGETGTGKELVAKAVHQGSPRAANPLIYLNCAALPESVAESELFG 258 + + + +DL ++I+GE+GTGKELVA+A+H R P + +N AA+P + ESELFG Sbjct: 151 RVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFG 210 Query: 259 HVKGAFTGAISHRSGKFEMADNGTLFLDEIGELSLALQAKLLRVLQYGDIQRVGDDRSLR 318 H KGAFTGA + +G+FE A+ GTLFLDEIG++ + Q +LLRVLQ G+ VG +R Sbjct: 211 HEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIR 270 Query: 319 VDVRVLAATNRDLRQEVIAGRFRADLYHRLSVFPLSVPALRDRDDDVVLLAGYFCEQCRL 378 DVR++AATN+DL+Q + G FR DLY+RL+V PL +P LRDR +D+ L +F +Q Sbjct: 271 SDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE- 329 Query: 379 RMGLAQVVLSDATRASLRSWPWPGNVRELEHAIHRAVVLARATQAGDEVRLDPTHFQFAV 438 + GL +++ PWPGNVRELE+ + R L D + + + Sbjct: 330 KEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYP----QDVITREIIENELRS 385 Query: 439 DAPGLPAATSAVPMQAINLRAA-------------------------TEAFQREAISRAL 473 + P P +A ++++ A + I AL Sbjct: 386 EIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAAL 445 Query: 474 ADNHRNWAAAARALALDVANLHRLAKRLGLKESRPDRSS 512 N AA L L+ L + + LG+ R RS+ Sbjct: 446 TATRGNQIKAADLLGLNRNTLRKKIRELGVSVYRSSRSA 484
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 30.4 bits (68), Expect = 0.008 Identities = 17/108 (15%), Positives = 36/108 (33%), Gaps = 16/108 (14%) Query: 2 ATMLEVAKRAGVSKATVSRVLSGNGYVSQETKDRVFQAVAESGYRPNLLARNLATKKSQT 61 ++ E+AK AGV++ + K +F + E + +++ Sbjct: 32 TSLGEIAKAAGVTRGAIYWHFK--------DKSDLFSEIWELSESN--IGELELEYQAKF 81 Query: 62 LGLVVTNTLYHGVYFSELLFHVARMTEDKGRQLILADGKHSAEEEREA 109 G V L+ + ++ R+L++ H E E Sbjct: 82 PGDP------LSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 381 bits (979), Expect = e-127 Identities = 142/373 (38%), Positives = 205/373 (54%), Gaps = 41/373 (10%) Query: 350 YREIQRLKERLVDENLALTEQLNNVESEFGEIIGRSEAMHSVLKQVEMVAQSDSTVLILG 409 E+ + R + E +L + + ++GRS AM + + + + Q+D T++I G Sbjct: 108 LTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITG 167 Query: 410 ETGTGKELIARAIHNLSGRNARRMVKMNCAAMPAGLLESDLFGHERGAFTGASAQRIGRF 469 E+GTGKEL+ARA+H+ R V +N AA+P L+ES+LFGHE+GAFTGA + GRF Sbjct: 168 ESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRF 227 Query: 470 ELADKSSLFLDEVGDMPLELQPKLLRVLQEQEFERLGSNKLIRTDVRLIAATNRDLRQMV 529 E A+ +LFLDE+GDMP++ Q +LLRVLQ+ E+ +G IR+DVR++AATN+DL+Q + Sbjct: 228 EQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSI 287 Query: 530 ADREFRSDLYYRLNVFPIHLPPLRERPDDIPLLVKAFTFKIARRLGRNIDSIPAETLRTL 589 FR DLYYRLNV P+ LPPLR+R +DIP LV+ F + A + G ++ E L + Sbjct: 288 NQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFV-QQAEKEGLDVKRFDQEALELM 346 Query: 590 SHMEWPGNVRELENVIERAVLLTRGSVLQLSLPEMNIDAET------------------- 630 WPGNVRELEN++ R L V+ + E + +E Sbjct: 347 KAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQ 406 Query: 631 -----------MMAEVLPQEG-------EDEYQLIVRVLKESNGVVAGPKGAAQRLGLKR 672 + LP G E EY LI+ L + G AA LGL R Sbjct: 407 AVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIK---AADLLGLNR 463 Query: 673 TTLLSRMKRLGIN 685 TL +++ LG++ Sbjct: 464 NTLRKKIRELGVS 476
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 26.7 bits (59), Expect = 0.039 Identities = 16/138 (11%), Positives = 42/138 (30%), Gaps = 28/138 (20%) Query: 22 NDEVELTLAGGAKLVAIV--------------THSSKEALGLVAGKEAIAL----IKAPW 63 N + A A++ ++V + + L+ +EAI L K P Sbjct: 17 NLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLNLEEAITLRERGWKGPI 76 Query: 64 VTL--ATEDCGLKFSARNQFAGSVT--------RITEGAVNATVHIKTDAGFAIIAVVTN 113 + L L+ +++ V + +++K ++G + + Sbjct: 77 LMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDIYLKVNSGMNRLGFQPD 136 Query: 114 ESQEEMKLVEGSRVIALI 131 + + + + Sbjct: 137 RVLTVWQQLRAMANVGEM 154
>adhesinb#Adhesin B signature. Length = 310 Score = 319 bits (818), Expect = e-111 Identities = 95/301 (31%), Positives = 165/301 (54%), Gaps = 12/301 (3%) Query: 10 LIFTALLGLLAI-----APAQASEKFKVITTFTVIADMAQNVAGEAAQVSSITKPGAEIH 64 L+ A +GL A + S K V+ T ++IAD+ +N+AG+ + SI G + H Sbjct: 9 LLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSIVPVGQDPH 68 Query: 65 EYQPTPGDIKRAQGAQLILTNGLNLEL----WFARFYQHLKGVPE---VVVSEGIQPMGI 117 EY+P P D+K+ A LI NG+NLE WF + ++ K VSEG+ + + Sbjct: 69 EYEPLPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVSEGVDVIYL 128 Query: 118 SEGPYNGKPNPHAWMSADNALIYVDNIRDALSKYDPANAQTYRQNAAIYKEKIRQTMAPL 177 GK +PHAW++ +N +IY NI LS+ DPAN +TY +N Y EK+ Sbjct: 129 EGQSEKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEKLSALDKEA 188 Query: 178 KAELAKLPVEKRWLVTSEGAFSYLARDNGLKERYLWPINADQQGTPQQVRKTIDTMKKEH 237 K + +P EK+ +VTSEG F Y ++ + Y+W IN +++GTP Q++ ++ ++K Sbjct: 189 KEKFNNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEKLRKTK 248 Query: 238 IPTIFSESTISDKPARQVAREAGAHYGGVLYVDSLSAADGPVPTYLDLLRVTTQTIVQGI 297 +P++F ES++ D+P + V+++ ++ DS++ +Y +++ + I +G+ Sbjct: 249 VPSLFVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYNLEKIAEGL 308 Query: 298 N 298 + Sbjct: 309 S 309
>PF07675#Cleaved Adhesin Length = 1358 Score = 29.7 bits (66), Expect = 0.016 Identities = 18/51 (35%), Positives = 22/51 (43%) Query: 212 YTVMVKGTVLASGPTETTFTAANLELAFSGALRHVVLSGGEEQIITDDERP 262 YT+ T +ASG TETT+ +L F VV GE I T Sbjct: 1261 YTIYRNNTQIASGVTETTYRDPDLATGFYTYGVKVVYPNGESAIETATLNI 1311
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 41.7 bits (98), Expect = 4e-06 Identities = 28/119 (23%), Positives = 53/119 (44%), Gaps = 4/119 (3%) Query: 40 VFLALGGVFLDAYDLTTLSYGIDDVVREFQLSPLL---TGLVTSSIMVGTIIGNLIGGWL 96 + + L V LDA + + + ++R+ S + G++ + + + G L Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 97 TDKYGRYSVFMADMLFFVVSAIAAGLAPNVWVLIGARFLMGIGVGIDLPVAMSYLAEFS 155 +D++GR V + + V AP +WVL R + GI G VA +Y+A+ + Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADIT 124
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 29.9 bits (67), Expect = 0.020 Identities = 12/41 (29%), Positives = 21/41 (51%) Query: 56 KALSQFILSAKVAPGVGITGAVNHYYGKKVGNLITALYFLA 96 K +SQ+I++ + A G+ + A V I+ L FL+ Sbjct: 285 KGISQYIIAQRAAQGLSTSAAAAGLIASAVTLAISPLSFLS 325
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 660 bits (1703), Expect = 0.0 Identities = 253/846 (29%), Positives = 392/846 (46%), Gaps = 47/846 (5%) Query: 10 RLSTAVAMALFCFPPVSSGQESPGTVYQFNDGFIVG-SREKVDLSRFS-ASAITEGTYSL 67 + LF ++ FN F+ + DLSRF + GTY + Sbjct: 21 HRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRV 80 Query: 68 DVYTNNEWKGRYELNVTRDKDGNMGI-CYTREMLERYGIAAEKLNPQLSQQAGYCGRLKE 126 D+Y NN + ++ + C TR L G+ ++ C L Sbjct: 81 DIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTS 140 Query: 127 WRNEENVKDNLIQSSLRLEVSVPQIYEDQRLKNFVSPEFWDKGIAALNLGWMANTWTSHT 186 + L RL +++PQ + R + ++ PE WD GI A L + + + Sbjct: 141 MI--HDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQN 198 Query: 187 SAANGSDNSSAYLGVNAGFSWDGWLLKHIGNLDWQQQQG----KAHWNSNQTYLQRPIPQ 242 G ++ AYL + +G + W L+ + K W T+L+R I Sbjct: 199 R--IGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIP 256 Query: 243 INSIVSGGQIFTNGEFFDTIGLRGVNLATDDNMFPDGMRSYAPEIRGVAQSNALVTVRQG 302 + S ++ G +T G+ FD I RG LA+DDNM PD R +AP I G+A+ A VT++Q Sbjct: 257 LRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQN 316 Query: 303 NNIIYQTSVPPGPFTLQDVYPSGYGNDLEVSVKEADGSVQVFSVPYASVAQLLRPGMTRY 362 IY ++VPPGPFT+ D+Y +G DL+V++KEADGS Q+F+VPY+SV L R G TRY Sbjct: 317 GYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRY 376 Query: 363 ALSAGKV-DDNSLRNKPMLYQATWQRGLSNMFTGYTGVTGFDDYQAFLLGAGMNTG-IGA 420 +++AG+ N+ + KP +Q+T GL +T Y G D Y+AF G G N G +GA Sbjct: 377 SITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGA 436 Query: 421 LSFDITQSRLKS-DTLDEKGQSYRATFNRMFTDTQTSIVLAAYRYSTKGYYNLNDALYA- 478 LS D+TQ+ D GQS R +N+ ++ T+I L YRYST GY+N D Y+ Sbjct: 437 LSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSR 496 Query: 479 -------------VDQEKNRNNNYTLWRQKNGMTFTVNQNLPDGWGGFYLSGQIADYWNR 525 + K + + ++ + TV Q L YLSG YW Sbjct: 497 MNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQTYWGT 555 Query: 526 SGTEKQYQLSYNNMFGRLSWSASAQRVYTPDNSGHRRDDRISLNFSYPL--WFGEN---- 579 S ++Q+Q N F ++W+ S T + RD ++LN + P W + Sbjct: 556 SNVDEQFQAGLNTAFEDINWTLSYSL--TKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQ 613 Query: 580 -RTANLTSNTTFNNSHFGSSQIGVNGSLDSENNLNYGISTTAATGGQHD----VALNGSY 634 R A+ + + + + + ++ GV G+L +NNL+Y + T A GG + +Y Sbjct: 614 WRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNY 673 Query: 635 RTPWTTLNGSYSQGEGYRQSGLGASGTLIAHRHGVVFSPESGNTMALIEAKDAAGAMLPG 694 R + N YS + +Q G SG ++AH +GV +T+ L++A A A + Sbjct: 674 RGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVEN 733 Query: 695 SPGTRVDSNGYAILPYLRPYRINAVEIDPKGSQEDIAFERTVAQVVPWEGSVVKVSFDTT 754 G R D GYA+LPY YR N V +D +++ + VA VVP G++V+ F Sbjct: 734 QTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKAR 793 Query: 755 VQNTLTLQARQANGQPLPFAATILDPSGKDIGVVGQGSMMFISDAGIKQAI-VKW---SG 810 V L + N +PLPF A + S + G+V +++S + + VKW Sbjct: 794 VGIKLLMTLTH-NNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEEN 852 Query: 811 GQCAVD 816 C + Sbjct: 853 AHCVAN 858
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 987 bits (2553), Expect = 0.0 Identities = 688/869 (79%), Positives = 773/869 (88%), Gaps = 2/869 (0%) Query: 10 RLGARHARRRASGSARVFARFPLALLLAMQAFSAQAELYFNPRFLADDPAAVADLSSFEK 69 H R+ A F R +A A QA + AELYFNPRFLADDP AVADLS FE Sbjct: 12 NTQCLHIRKHRL--AGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFEN 69 Query: 70 GQEVPPGTYRVDIYLNNGFMTTRDVTFKSGENHHGLAPCLTRGQLASMGVNTAAVAGMNA 129 GQE+PPGTYRVDIYLNNG+M TRDVTF +G++ G+ PCLTR QLASMG+NTA+V+GMN Sbjct: 70 GQELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNL 129 Query: 130 LAADACVPMAEMIKESTSRFDVGQQRLYLTVPQAFMGNRARGYIPPELWDDGITAGLLNY 189 LA DACVP+ MI ++T++ DVGQQRL LT+PQAFM NRARGYIPPELWD GI AGLLNY Sbjct: 130 LADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNY 189 Query: 190 NFTGNNVHNDIGGSSNYAYLNLQSGLNLGAWRLRDNTTWSYSSGGSSSSNENKWQHVNSW 249 NF+GN+V N IGG+S+YAYLNLQSGLN+GAWRLRDNTTWSY+S SSS ++NKWQH+N+W Sbjct: 190 NFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTW 249 Query: 250 LERDIVPLRSRLTLGDSYTNGDVFDGINFRGAQLASDDNMLPDSQKGFAPVIHGIARGTA 309 LERDI+PLRSRLTLGD YT GD+FDGINFRGAQLASDDNMLPDSQ+GFAPVIHGIARGTA Sbjct: 250 LERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTA 309 Query: 310 QVSIKQNGYEIYQSTVPPGPFTIDDLYAAGNGGDLQVTIKETDGTRQVFTVPWSTVPVLQ 369 QV+IKQNGY+IY STVPPGPFTI+D+YAAGN GDLQVTIKE DG+ Q+FTVP+S+VP+LQ Sbjct: 310 QVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQ 369 Query: 370 REGHSRFALTAGEYRSGNDQQEKLKFFQGTLLHGLAAGWTLYGGSQLADRYRAFNLGVGK 429 REGH+R+++TAGEYRSGN QQEK +FFQ TLLHGL AGWT+YGG+QLADRYRAFN G+GK Sbjct: 370 REGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGK 429 Query: 430 NMGEFGAVSLDVTQANATLPDDSKHQGQSLRFLYNKSLNEVGTNIQLVGYRYSTRGYYSF 489 NMG GA+S+D+TQAN+TLPDDS+H GQS+RFLYNKSLNE GTNIQLVGYRYST GY++F Sbjct: 430 NMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNF 489 Query: 490 ADTTYSRMSGYDVETQDGVIQVKPKFTDYYNLAYSKRGKVQVSVTQQLGRTATLYLSGSH 549 ADTTYSRM+GY++ETQDGVIQVKPKFTDYYNLAY+KRGK+Q++VTQQLGRT+TLYLSGSH Sbjct: 490 ADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSH 549 Query: 550 QTYWSTGKADQQLQAGLNAAVDDINWTLSYSLTKNAWQQGRDQMLAVNVNIPFSHWLRSD 609 QTYW T D+Q QAGLN A +DINWTLSYSLTKNAWQ+GRDQMLA+NVNIPFSHWLRSD Sbjct: 550 QTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSD 609 Query: 610 SKSVWRHASASYSMSHDLDGRMTNLAGLYGTLLEDNNLSYSVQTGYAGGGNGGSGGTGYA 669 SKS WRHASASYSMSHDL+GRMTNLAG+YGTLLEDNNLSYSVQTGYAGGG+G SG TGYA Sbjct: 610 SKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYA 669 Query: 670 ALNYRGGYGNANVGYSRSDGLKQLYYGVSGGVLAHGNGVTLSQPLNDTVVLIKAPGADNV 729 LNYRGGYGNAN+GYS SD +KQLYYGVSGGVLAH NGVTL QPLNDTVVL+KAPGA + Sbjct: 670 TLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDA 729 Query: 730 KVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDDAVVSVVPTHGAIVRAD 789 KVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLD+AV +VVPT GAIVRA+ Sbjct: 730 KVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAE 789 Query: 790 FKAHVGMKLLMTLTRHGKPVPFGAMVTSANNQSGSIVADNGQVYLSGMPLAGKVQVTWGE 849 FKA VG+KLLMTLT + KP+PFGAMVTS ++QS IVADNGQVYLSGMPLAGKVQV WGE Sbjct: 790 FKARVGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGE 849 Query: 850 GPDASCVAEYQLPQESQQQALSQLSAVCR 878 +A CVA YQLP ESQQQ L+QLSA CR Sbjct: 850 EENAHCVANYQLPPESQQQLLTQLSAECR 878
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 49.1 bits (117), Expect = 2e-08 Identities = 67/362 (18%), Positives = 134/362 (37%), Gaps = 23/362 (6%) Query: 47 THSAMALGMIGLMQFLPSVLLALPAGHLADQFDRRRIVLLGQFIEWVALLGLVALTLLHW 106 H + L + LMQF + +L G L+D+F RR ++L+ V ++ Sbjct: 43 AHYGILLALYALMQFACAPVL----GALSDRFGRRPVLLVS------LAGAAVDYAIMAT 92 Query: 107 ADKIEIWGLVFLISVAKALEWPAITSMLPALVPPPILARAMAASSVGGQAAVIIGPTLGG 166 A + + + +++ + + + AR S ++ GP LGG Sbjct: 93 APFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGG 152 Query: 167 LLYVAGPEVVYGVSALFYLFSIVLVSQLRYERPPQTRLPMNLT--NLFAGVHFIRERKDV 224 L+ P + +A + + L E R P+ N A + R V Sbjct: 153 LMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVV 212 Query: 225 LGVISLDLFAVLLGGA-TALLPIFAQDILHTGPWGLG-MLRGAPSVGALLVGVWLSR--H 280 ++++ L+G AL IF +D H +G L + +L + Sbjct: 213 AALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAA 272 Query: 281 KLEKNVGLIMFASVAGFGLATLVFALSTQLWLSLLALAALGGFDMVSMVIRGALVQLDTP 340 +L + L++ G G L FA T+ W++ + L + ++ ++ Sbjct: 273 RLGERRALMLGMIADGTGYILLAFA--TRGWMAFPIMVLLASGGIGMPALQA-MLSRQVD 329 Query: 341 DDMRGRVNAVNAIFINTSNQLGEFESGLLAAWMGAVPAAALGGIGTLVVVAIWMTIFPHL 400 ++ +G++ A + ++ +G LL + A G + A+++ P L Sbjct: 330 EERQGQLQGSLAALTSLTSIVGP----LLFTAIYAASITTWNGWAWIAGAALYLLCLPAL 385 Query: 401 RK 402 R+ Sbjct: 386 RR 387
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 25.7 bits (56), Expect = 0.026 Identities = 9/28 (32%), Positives = 16/28 (57%) Query: 32 LSIIEHTDVDESLKGQGVGKQLVAKVVE 59 ++IE V + + +GVG L+ K +E Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIE 116
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 68.7 bits (168), Expect = 6e-15 Identities = 44/337 (13%), Positives = 98/337 (29%), Gaps = 78/337 (23%) Query: 11 KKWPLLALVLAAIVALILVIWQL-----QTSPETNDAYVYTDTIDVVPEVSGRIVEMPIR 65 + P L +I I + + + ++ P + + E+ ++ Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113 Query: 66 DNQRVKKGDLLFRLDPRP---------------------YQAMLDDA------------- 91 + + V+KGD+L +L YQ + Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173 Query: 92 ------------------RARLTTLDAQIMLTRRTIKAQEYNAQSVAAAVERAKALVKQT 133 + + +T Q + + +V A + R + L + Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233 Query: 134 TSSRTRLEPLVPQGFASQEDLDQARTAEKAARAELDATLLQAKQASAAVTGVDAMVAQRA 193 S L+ + ++ + + A EL Q +Q + + Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293 Query: 194 GIL-------------------AQIALAELHLEFTEVRAPFNGVVVALKT-TVGQYASAL 233 + ++A E + + +RAP + V LK T G + Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353 Query: 234 KPVFTLM-DDDRWYVIANFRETDLKNVRPGVAARVTI 269 + + ++ +DD V A + D+ + G A + + Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKV 390
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 29.7 bits (67), Expect = 0.033 Identities = 17/109 (15%), Positives = 40/109 (36%), Gaps = 13/109 (11%) Query: 393 LATLLALLMIVFIQPHTESLVGLLAMTLPV---MALSAWIAAGSERIAYAGIQIGFTFS- 448 + L+ + P +++L ++ L + A IA +Q GF S Sbjct: 53 FSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISG 112 Query: 449 ---------LAFLSWFGPLTNLTELRDRVIGILLGVLVSSIVHLYLWPD 488 + + + ++ L + + IL VL+S ++ + + + Sbjct: 113 EAIKPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGN 161
>SECA#SecA protein signature. Length = 901 Score = 28.3 bits (63), Expect = 0.014 Identities = 9/30 (30%), Positives = 14/30 (46%) Query: 118 LSELLCEEEVLTALLAAKSEKQLADIIAHA 147 +S L + + +L AK A I+A A Sbjct: 465 VSNELTKAGIKHNVLNAKFHANEAAIVAQA 494
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 108 bits (271), Expect = 2e-30 Identities = 77/262 (29%), Positives = 114/262 (43%), Gaps = 9/262 (3%) Query: 1 MNAQ-IEGRVAVVTGGSSGIGFETLRLLLGEGAKVAFCGRNPDRLASAHAALQNEYPGAE 59 MNA+ IEG++A +TG + GIG R L +GA +A NP++L ++L+ E AE Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60 Query: 60 VFSYRCDVLQEDEVQAFADAVAARFGGADMLINNAGQGYVAHFADTPREAWLHEAELKLF 119 ++ DV + + G D+L+N AG E W + Sbjct: 61 --AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNST 118 Query: 120 GVINPVKAFQPQLERSDIASITCVNSLLALQPEEHMIATSAARAALLNMTLTLSKDLVGK 179 GV N ++ + SI V S A P M A ++++AA + T L +L Sbjct: 119 GVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEY 178 Query: 180 GIRVNSILLGMVESG-QWQRRFENRADKQQSWPEWTADIAR-KRGIPMARLGKPQEPAQA 237 IR N + G E+ QW + +Q + K GIP+ +L KP + A A Sbjct: 179 NIRCNIVSPGSTETDMQWSLWADENGAEQVI----KGSLETFKTGIPLKKLAKPSDIADA 234 Query: 238 LLFLASPLASFTTGAALDVSGG 259 +LFL S A T L V GG Sbjct: 235 VLFLVSGQAGHITMHNLCVDGG 256
>PF06057#Type IV secretory pathway VirJ component Length = 243 Score = 27.9 bits (62), Expect = 0.033 Identities = 16/78 (20%), Positives = 28/78 (35%), Gaps = 10/78 (12%) Query: 28 KQMALSGYRVLAWDMPGYGESPMLPVAQANAGDYADALARMLDHA----GVEQAVVVGHS 83 + G+ V+ W Y P D ++D G ++ +++G+S Sbjct: 72 GILQQQGWPVVGWSSLKYYWKQKDP------KDVTQDTLAIIDKYQAEFGTQKVILIGYS 125 Query: 84 LGALVASAFAAKYPRRVR 101 GA V + P R R Sbjct: 126 FGAEVIPFVLNEMPARYR 143
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 39.5 bits (92), Expect = 2e-05 Identities = 25/132 (18%), Positives = 57/132 (43%), Gaps = 3/132 (2%) Query: 75 AFLLSYGFSSVLLSGLGDRIAPLRLLTGMMMVWCVLMVMMGFTHNYTVMVTLRILLGIAE 134 AF+L++ + + L D++ RLL +++ C V+ H++ ++ + + A Sbjct: 57 AFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG 116 Query: 135 GPLFPLAFAIVRHTF-PQRLQARATMLWLLGTPVGAAIGFPLSIWLLNTFGWQSTFFVM- 192 FP +V + P+ + +A L +G +G + + + W + Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM 176 Query: 193 -AMLTVPVLIFV 203 ++TVP L+ + Sbjct: 177 ITIITVPFLMKL 188
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 40.2 bits (94), Expect = 3e-06 Identities = 18/67 (26%), Positives = 30/67 (44%), Gaps = 2/67 (2%) Query: 159 NPFTLGHRHLVEHAAARCDWLHLFVVREDASFFPFTA--RLEMVRAGVAHLSNVSVHEGS 216 +P T GH ++E D +++ V+R F+ RLE + +AHL N V Sbjct: 10 DPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLPNAQVDSFE 69 Query: 217 QYIISRA 223 ++ A Sbjct: 70 GLTVNYA 76
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 62.9 bits (153), Expect = 1e-13 Identities = 26/125 (20%), Positives = 52/125 (41%), Gaps = 6/125 (4%) Query: 8 VLIVEDEHALAQLHAELIGKHPGLRLVGIASSLADAQTQIESKQPQLVLLDNYLPDGKGI 67 +L+ +D+ A+ + + + + G + S+ A I + LV+ D +PD Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 68 TLIG--NALLARTPCSVIFITAASDMDTCSLAIRNGAFDYILKPVSWKRLNQSLERFMQF 125 L+ P V+ ++A + T A GA+DY+ KP L + R + Sbjct: 64 DLLPRIKKARPDLP--VLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 126 AEQQR 130 +++ Sbjct: 122 PKRRP 126
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 42.9 bits (101), Expect = 3e-06 Identities = 40/205 (19%), Positives = 68/205 (33%), Gaps = 52/205 (25%) Query: 374 TDWLYLEVQAARENGFRVISMSINFQQIVSNSEYSFLAARIDEISNKLNVIFVISVGNLE 433 DW+ + A E +ISMS+ + L + + ++ + + GN Sbjct: 126 YDWIIQGIYYAIEQKVDIISMSLG-----GPEDVPELHEAVKKAVAS-QILVMCAAGN-- 177 Query: 434 DKKYRSEWPKTEEQVFRMLARFQQQDKVLQPADSVTSLSVGSVNHIENELITFQAPTRYT 493 +T D++ P +SVG++N + F Sbjct: 178 ---EGDGDDRT--------------DELGYPGCYNEVISVGAIN-FDRHASEF------- 212 Query: 494 RRGPATAYGIKPDLVHIGGIGDPNNSCYVTLDGNNHLCLNSH-GTSLAAPHIAKSIATID 552 + + DLV P T+ G + GTS+A PH+A ++A I Sbjct: 213 -----SNSNNEVDLV------APGEDILSTVPGG---KYATFSGTSMATPHVAGALALIK 258 Query: 553 FKSNESLNTN----TLKALLIHSAK 573 +N S + L A LI Sbjct: 259 QLANASFERDLTEPELYAQLIKRTI 283
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 29.5 bits (66), Expect = 0.015 Identities = 14/50 (28%), Positives = 19/50 (38%) Query: 106 GAGVAAFAPTAALAVATTFGTASTGTAIATLSGAAATNAALAWLGGGALA 155 GAG AAF + VA + G A + A + GG +A Sbjct: 114 GAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIA 163
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 668 bits (1725), Expect = 0.0 Identities = 255/855 (29%), Positives = 397/855 (46%), Gaps = 47/855 (5%) Query: 10 RLSTAIAVALCCFPPFSSGEENPGTVYQFNDGFIVG-SREKVDLSRFSTS-AISEGVYSL 67 V L F++ FN F+ + DLSRF + G Y + Sbjct: 21 HRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRV 80 Query: 68 DVYTNGEWKGRYDLKITAGKDGKMGV-CYTKAMLMQYGISPEKFNPQLSEKEGFCGRLQE 126 D+Y N + D+ G + V C T+A L G++ + + C L Sbjct: 81 DIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTS 140 Query: 127 WRNEENVKDTLIQSSLRLDISVPQIYEDQRLKNFVSPEFWDKGVAALNLGWMANAWNSHS 186 + L RL++++PQ + R + ++ PE WD G+ A L + + + Sbjct: 141 MI--HDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSG--NSV 196 Query: 187 SSSNGSDNSSAYLGVNAGLSWDGWLLKHIGNLNWQQQQG----KAHWNSNQTYLQRPIPQ 242 + G ++ AYL + +GL+ W L+ ++ K W T+L+R I Sbjct: 197 QNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIP 256 Query: 243 INSIVSGGQIFTNGEFFDTIGMRGVSLATDDNMFPDGMRSYAPEIRGVAQSNALVTVRQG 302 + S ++ G +T G+ FD I RG LA+DDNM PD R +AP I G+A+ A VT++Q Sbjct: 257 LRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQN 316 Query: 303 SNIIYQTTVPPGPFTLQDVYPSGYGSDLEVSVKEADGTVEVFSVPYASVAQLLRPGMTRY 362 IY +TVPPGPFT+ D+Y +G DL+V++KEADG+ ++F+VPY+SV L R G TRY Sbjct: 317 GYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRY 376 Query: 363 ALSAGKV-DDSSLRNKPMLYQGTWQHGLNNLFTGYTGVTGFDDYQAFLLGTGMNTG-IGA 420 +++AG+ ++ + KP +Q T HGL +T Y G D Y+AF G G N G +GA Sbjct: 377 SITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGA 436 Query: 421 LSFDVTHSRLKS-DTLDEQGQSYRATFNRMFTETQTSIVLAAYRYSTKGYYNLNDALYA- 478 LS D+T + D GQS R +N+ E+ T+I L YRYST GY+N D Y+ Sbjct: 437 LSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSR 496 Query: 479 -------------VDQEKNRNSNYTVWRQKNGMTFTVNQNLPDGWGGFYLSGRVADYWNR 525 + K + + ++ + TV Q L YLSG YW Sbjct: 497 MNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQTYWGT 555 Query: 526 SGTEKQYQFSYNNMYGRLSWSVGAQRVYTPDSSGHRRDDRVSLNFSYPL--WFGEN---- 579 S ++Q+Q N + ++W++ T ++ RD ++LN + P W + Sbjct: 556 SNVDEQFQAGLNTAFEDINWTLSYSL--TKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQ 613 Query: 580 -RTANLTSNTSFNNSRFASSQIGVNGSLDSENNLNYGVSTTTSTGGQHD----VALNGSY 634 R A+ + + S + + ++ GV G+L +NNL+Y V T + GG + +Y Sbjct: 614 WRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNY 673 Query: 635 RTPWTTLNGSYSQGEGYRQSGVGASGTLIAHRHGVVFSPESGTTMALIEAKDAAGAMLPG 694 R + N YS + +Q G SG ++AH +GV T+ L++A A A + Sbjct: 674 RGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVEN 733 Query: 695 SPGTRIDSNGYAILPYLRPYRINSVEIDPKGSNDDVAFDRTVAQVVPWEDSVVKVSFDTT 754 G R D GYA+LPY YR N V +D D+V D VA VVP ++V+ F Sbjct: 734 QTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKAR 793 Query: 755 VQNNITVLAHQANGLPLPFAATIFDPSGKEIGVVGQGSMMFISDTSVP-KATVKW---SG 810 V + ++ N PLPF A + S + G+V +++S + K VKW Sbjct: 794 VGIKL-LMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEEN 852 Query: 811 GQCSVDLSQAKTKET 825 C + + Sbjct: 853 AHCVANYQLPPESQQ 867
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 25.4 bits (55), Expect = 0.024 Identities = 5/24 (20%), Positives = 10/24 (41%) Query: 20 LRIKDVMEKLGIARATIYDWLNTK 43 + ++ + G+ R IY K Sbjct: 32 TSLGEIAKAAGVTRGAIYWHFKDK 55
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 26.7 bits (59), Expect = 0.007 Identities = 13/40 (32%), Positives = 19/40 (47%), Gaps = 2/40 (5%) Query: 5 THPHLPSGPF-TREQASGIAAQYDNVAIEDDQGTHFRLVI 43 +P P GPF E A +A + + E D G +R V+ Sbjct: 127 QNPTKPVGPFYDEETAKRLAREKGWIVKE-DSGRGWRRVV 165
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 29.0 bits (65), Expect = 0.027 Identities = 10/37 (27%), Positives = 20/37 (54%) Query: 347 GVFDILHAGHVSYLANARKLGDRLIVAVNSDASTKRL 383 G FD + GH+ + +L D++ VAV + + + + Sbjct: 7 GSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPM 43
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 35.6 bits (82), Expect = 4e-04 Identities = 29/120 (24%), Positives = 51/120 (42%), Gaps = 4/120 (3%) Query: 49 IALGGIFLDAYDLGSLAFGLKDITREFNLT---PAGTGMVASAITFGAIVGALLGGYLTD 105 + L + LDA +G + L + R+ + A G++ + A + G L+D Sbjct: 9 VILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSD 68 Query: 106 KIGRYRVFMADMLFFVVAAIACALAPNEYVLTGARFVMGLGVGIDLPVAMAFLSEFSKLK 165 + GR V + + V A AP +VL R V G+ G VA A++++ + Sbjct: 69 RFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADITDGD 127
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 1080 bits (2794), Expect = 0.0 Identities = 409/566 (72%), Positives = 471/566 (83%), Gaps = 2/566 (0%) Query: 4 ISRQAYADMFGPTTGDKVRLADTELWIEVEDDLTTYGEEVKFGGGKVIRDGMGQGQML-A 62 +SR AYA+MFGPT GDKVRLADTEL+IEVE D TT+GEEVKFGGGKVIRDGMGQ Q+ Sbjct: 5 MSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTRE 64 Query: 63 DDCVDLVLTNALIVDHWGIVKADIGIKDGRILAIGKAGNPDIQPGVNIPIGAATEVIAAE 122 VD V+TNALI+DHWGIVKADIG+KDGRI AIGKAGNPD+QPGV I +G TEVIA E Sbjct: 65 GGAVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGE 124 Query: 123 GKIVTAGGVDTHIHWICPQQAEEALVSGVTTMVGGGTGPAAGTHATTCTPGPWYISRMLQ 182 GKIVTAGG+D+HIH+ICPQQ EEAL+SG+T M+GGGTGPA GT ATTCTPGPW+I+RM++ Sbjct: 125 GKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMIE 184 Query: 183 AADSLPVNIGLLGKGNGSNPDALREQVAAGVIGLKIHEDWGATPAAINCALTVADEMDIQ 242 AAD+ P+N+ GKGN S P AL E V G LK+HEDWG TPAAI+C L+VADE D+Q Sbjct: 185 AADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADEYDVQ 244 Query: 243 VALHSDTLNESGFVEDTLAAIAGRTIHTFHTEGAGGGHAPDIITACAHPNILPSSTNPTL 302 V +H+DTLNESGFVEDT+AAI GRTIH +HTEGAGGGHAPDII C PN++PSSTNPT Sbjct: 245 VMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSSTNPTR 304 Query: 303 PYTVNTIDEHLDMLMVCHHLDPDIAEDVAFAESRIRRETIAAEDVLHDLGAFSLTSSDSQ 362 PYTVNT+ EHLDMLMVCHHL P I ED+AFAESRIR+ETIAAED+LHD+GAFS+ SSDSQ Sbjct: 305 PYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIISSDSQ 364 Query: 363 AMGRVGEVVLRTWQVAHRMKVQRGPLAEERGDNDNFRVKRYIAKYTINPALTHGIAHEVG 422 AMGRVGEV +RTWQ A +MK QRG L EE GDNDNFRVKRYIAKYTINPA+ HG++HE+G Sbjct: 365 AMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLSHEIG 424 Query: 423 SIEVGKLADLVLWSPAFFGVKPATVIKGGMIAIAPMGDINASIPTPQPVHYRPMFGALGS 482 S+EVGK ADLVLW+PAFFGVKP V+ GG IA APMGD NASIPTPQPVHYRPMFGA G Sbjct: 425 SLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFGAYGR 484 Query: 483 ARHHCRLTFLPQAAVDSGVAQRLNLQSATAVVKGCR-TVQKTDMIHNGLQPNITVDAQTY 541 +R + +TF+ QA++D+G+A RL + V+ R + K MIHN L P+I VD +TY Sbjct: 485 SRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVDPETY 544 Query: 542 EVRIDGELITSEPADVLPMAQRYFLF 567 EVR DGEL+T EPA VLPMAQRYFLF Sbjct: 545 EVRADGELLTCEPATVLPMAQRYFLF 570
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 251 bits (642), Expect = 3e-78 Identities = 96/382 (25%), Positives = 160/382 (41%), Gaps = 36/382 (9%) Query: 289 KPIVEEQGNSFILLLHPVEQMRQLMTSQLGKVSHTFAQMSSDDPETRRLIHFGRQAARGG 348 KP + I ++ + S+L S + + + + + Sbjct: 104 KPFDLTEL---IGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTD 160 Query: 349 FPVLLCGEEGVGKELVAQAIHNESERAGGPYIAVNCQLYADSVLGQDFMGS---APTDDE 405 +++ GE G GKELVA+A+H+ +R GP++A+N ++ + G A T + Sbjct: 161 LTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQ 220 Query: 406 NGRLSRLELASGGTLFLEKIEYLAPELQSALLQVIKQGVLTRLDARRLIPVDVKVIATTT 465 R E A GGTLFL++I + + Q+ LL+V++QG T + R I DV+++A T Sbjct: 221 TRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATN 280 Query: 466 VDLANLVEQNRFSRQLYYALHSFEIVIPPLRSRRNSIPSLVHNRLRSLEKRFSSRLKVDD 525 DL + Q F LYY L+ + +PPLR R IP LV + ++ EK + D Sbjct: 281 KDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQ 340 Query: 526 DALAQLVNYSWPGNDFELNSIIENIAISSDNGHIRLSNLPEYLFSERP------------ 573 +AL + + WPGN EL +++ + I + L SE P Sbjct: 341 EALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSG 400 Query: 574 ------------------GGDTASPLLPASLTFSAIEKEAIIHAARVTSGRVQEMSHLLN 615 GD P + +E I+ A T G + + LL Sbjct: 401 SLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLG 460 Query: 616 IGRTTLWRKMKQYDIDASQFKR 637 + R TL +K+++ + + R Sbjct: 461 LNRNTLRKKIRELGVSVYRSSR 482
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 27.5 bits (61), Expect = 0.037 Identities = 10/60 (16%), Positives = 24/60 (40%), Gaps = 6/60 (10%) Query: 31 KEIGDAD----HGLNMYRGFSKVVEKLP--SIADKDIGFILKNTGMTLLSNVGGASGPLF 84 K+ +AD +G+N+ G + KL + ++ + + G+ ++ G Sbjct: 77 KKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVSDGVDVIYLEGQNEKGKE 136
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 159 bits (405), Expect = 3e-45 Identities = 64/206 (31%), Positives = 104/206 (50%), Gaps = 1/206 (0%) Query: 260 GKALRYPLPAPRPVQQTGADIANEQRRLQQAIGQTLDDLNALTTLAEERYSADIAAIFSG 319 KA + P + + D++ E +L A+ ++ ++L A+ E AD A IF+ Sbjct: 17 AKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTEASMGADKAEIFAA 76 Query: 320 HHTLLDDPDLYEAACDILRQEQCNAEWAWYQVLADLSQQYRQLNDAYLQARYIDIDDLLH 379 H +LDDP+L + + EQ NAE+A +V + +++ Y++ R DI D+ Sbjct: 77 HLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNEYMKERAADIRDVSK 136 Query: 380 RTLRHLQGSSEEPLA-VHEPTIIIADDLFPSTVLQLDPRLVKGICLREGSEASHGAIIAR 438 R L HL G LA + E T+IIA+DL PS QL+ + VKG G SH AI++R Sbjct: 137 RVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSHSAIMSR 196 Query: 439 QAGIVCLCQQGDALQQIGDGESLTLD 464 I + + ++I G+ + +D Sbjct: 197 SLEIPAVVGTKEVTEKIQHGDMVIVD 222
>PF01206#SirA family protein Length = 76 Score = 102 bits (255), Expect = 1e-32 Identities = 27/71 (38%), Positives = 42/71 (59%) Query: 9 DHTLDAQGLRCPEPVMMVRKTVRTMPVGETLLIIADDPATTRDIPGFCRFMEHELLAQET 68 D +LDA GL CP P++ +KT+ TM GE L ++A DP + +D F + HELL Q+ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 69 AALPYRYLIRK 79 Y + +++ Sbjct: 65 EDGTYHFRLKR 75
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 45.2 bits (107), Expect = 3e-07 Identities = 76/379 (20%), Positives = 134/379 (35%), Gaps = 32/379 (8%) Query: 27 FASYLTIGLPLAVLPGYVHDVM--GFSAFWAGLVISLQYFATLLSRPHAGRYADLLGPKK 84 + IGL + VLPG + D++ G++++L P G +D G + Sbjct: 15 ALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRP 74 Query: 85 IVIFGLCGCFLSGLSYLLAAVGSGWPIFSLALLCLGRVILGI-GQSFAGTGSTLWGVGVV 143 +++ L G + + Y + A L +L +GR++ GI G + A G+ + + Sbjct: 75 VLLVSLAG---AAVDYAIMATAP-----FLWVLYIGRIVAGITGATGAVAGAYIADITDG 126 Query: 144 GSL--HIGRVISWNGIVTYGAMAMGAPLGVLCYSLVGLPGLAWAIMAVALVAILCALPRA 201 H G + + G +G +G A + L Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHK 186 Query: 202 AVR------ATKGKAMTFRAVLGRVWPYGMALALASAGFGTI-ATFITLFYDAK-GWDGA 253 R A A A V MA+ G + A +F + + WD Sbjct: 187 GERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDAT 246 Query: 254 AFALTLFSCAFVGT---RLLFPNGINRLGGLNVAMLCFSVEIVGLLLVGLAETSPIAKVG 310 ++L + + + ++ RLG ML + G +L+ A +A Sbjct: 247 TIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPI 306 Query: 311 -TFFAGAGFSLVFPALGVVAVKAVPQQNQGSALATYTVFMDLSLGITGPLAGLLMAWAGI 369 A G + PAL + + V ++ QG + L+ I GPL + A I Sbjct: 307 MVLLASGGIGM--PALQAMLSRQVDEERQGQLQGSLAALTSLT-SIVGPLLFTAIYAASI 363 Query: 370 ST----IYLAAAGLVAVAL 384 +T ++A A L + L Sbjct: 364 TTWNGWAWIAGAALYLLCL 382
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 42.9 bits (101), Expect = 5e-06 Identities = 45/261 (17%), Positives = 95/261 (36%), Gaps = 32/261 (12%) Query: 217 THRVVTTLAELKEQLKARYPQAQVLARGTVFY--SDYASQQAKQDISTLGIATLLGVILL 274 + +L+ +PQ + Y + + + + TL A +L V L+ Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKV---LYPYDTTPFVQLSIHEVVKTLFEAIML-VFLV 354 Query: 275 IVATFRSLRPLLLCVISVGIGGLAGTVVTLLLFG-ELHLMTLVMSMSIIGISADYTLYYL 333 + +++R L+ I+V + L GT L FG ++ +T+ + IG+ D + + Sbjct: 355 MYLFLQNMRATLIPTIAVPV-VLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413 Query: 334 --TERMVHGHDATPWQ----SLAKVRRTLLLALLTAVVAYL-IMMLAPFPGI--RQMSVF 384 ER++ P + S+++++ L+ + ++ + G RQ S+ Sbjct: 414 ENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSIT 473 Query: 385 AAVGLSASCLTVIFWHPLLS----RGLPVRPVPAMGLMLRWLA-AWRRNKKLYIGLP--- 436 ++ S L + P L + + G W + + Y Sbjct: 474 IVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKI 533 Query: 437 -------LTLALLSLAGIATL 450 L + L +AG+ L Sbjct: 534 LGSTGRYLLIYALIVAGMVVL 554 Score = 39.1 bits (91), Expect = 8e-05 Identities = 32/171 (18%), Positives = 57/171 (33%), Gaps = 33/171 (19%) Query: 635 ILTGLLVVALAVIACGAILRLGWRKGSIGLLPSVLSLGCGLAALAFSGHPVNLFSLLALV 694 + T + L + L+ R I + + L A LA G+ +N ++ +V Sbjct: 341 VKTLFEAIMLVFLVMYLFLQ-NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMV 399 Query: 695 LVLGIGI--------NYTLFF----SNPR----------GTPLTSLLAIILAMMTTLLTL 732 L +G+ + N P+ L + ++ A+ + Sbjct: 400 LAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFF 459 Query: 733 G------MLVFSATQAISSFGIVLVSGIFT----AFLLSPLAMPGKKEKKR 773 G FS T + VLV+ I T A LL P++ + K Sbjct: 460 GGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGG 510 Score = 37.1 bits (86), Expect = 3e-04 Identities = 35/182 (19%), Positives = 67/182 (36%), Gaps = 19/182 (10%) Query: 604 AALRALSEKQPGVAWVDRKSAFDELFTLYRHILTGLLVVALAVIACGAILRLGWRKGSIG 663 A + L+ K P D + + + + V C A L W Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 Query: 664 LLPSVLSLGCGLAALAFSGHPVNLFSLLALVLVLGI----GINYTLFFSNPRGTPLTSLL 719 +L L + L A +++ ++ L+ +G+ I F + ++ Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 Query: 720 -AIILA--------MMTTLLT-LGM--LVFSAT---QAISSFGIVLVSGIFTAFLLSPLA 764 A ++A +MT+L LG+ L S A ++ GI ++ G+ +A LL+ Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 Query: 765 MP 766 +P Sbjct: 1021 VP 1022
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 97.4 bits (242), Expect = 2e-26 Identities = 66/251 (26%), Positives = 122/251 (48%), Gaps = 15/251 (5%) Query: 3 RSVLVTGASKGIGRAIACQLAADGF-IVGVHYHRDAEGAQETLKSLRAAGGAGRTLSFDV 61 + +TGA++GIG A+A LA+ G I V Y + E ++ + SL+A DV Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDY--NPEKLEKVVSSLKAEARHAEAFPADV 66 Query: 62 ANREQCREVLEQEIATHGPWYGVVSNAGITRDGAFPALSDNDWDAVIHTNLDSFYNVIQP 121 + E+ + GP +V+ AG+ R G +LSD +W+A N +N + Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 122 CIMPMIGARQGGRIITLSSVSGVMGNRGQVNYSAAKAGIIGATKALAIELAKRKITVNCI 181 + + R+ G I+T+ S + Y+++KA + TK L +ELA+ I N + Sbjct: 127 -VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 182 APGLIDTGM---IAMEETALKEAMS--------IVPMKRMGQAEEVAGLASYLMSDIAGY 230 +PG +T M + +E ++ + +P+K++ + ++A +L+S AG+ Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245 Query: 231 VTRQVISINGG 241 +T + ++GG Sbjct: 246 ITMHNLCVDGG 256
>TETREPRESSOR#Tetracycline repressor protein signature. Length = 218 Score = 30.7 bits (69), Expect = 0.008 Identities = 16/60 (26%), Positives = 24/60 (40%), Gaps = 3/60 (5%) Query: 271 TQPQRETMQICMEQSLRMAGLSAEDIGY-ISAHGTATDRGDIAESQASAAVFGDRVPISS 329 + Q +T++ + + + G S D Y ISA T G + E Q A DR Sbjct: 106 DEKQYDTVETQL-RFMTENGFSLRDGLYAISAVSHFT-LGAVLEQQEHTAALTDRPAAPD 163
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 31.9 bits (72), Expect = 0.007 Identities = 19/105 (18%), Positives = 32/105 (30%), Gaps = 9/105 (8%) Query: 265 GLNNENQILNSTG------TTEGLL--VVTEAVNNSRSFFRARVSNGVHVLPGLHSLYAS 316 GLN LN+ G A NS V NGV+V G+ Y + Sbjct: 10 GLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYV-SGVQREYDA 68 Query: 317 LPSAGYAIEWFRNLFELDMPAFLRMVDTLRNEKDRVVAGSLDGIF 361 + ++ + +D + + +A + F Sbjct: 69 FITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFF 113
>SOPEPROTEIN#Salmonella type III secretion SopE effector protein signature. Length = 239 Score = 25.5 bits (55), Expect = 0.038 Identities = 17/55 (30%), Positives = 24/55 (43%), Gaps = 9/55 (16%) Query: 16 SAVATEKVVQYCK---SQGINVDPIQSNIGTIGKQDGMADLIIVTSAVKTELTTP 67 SAV ++ Q C S+GIN+ P IG K G+ K ++ TP Sbjct: 115 SAVYSKNKDQCCNLLISKGINIAPFLQEIGEAAKNAGLP------GTTKNDVFTP 163
>adhesinmafb#Neisseria meningitidis: adhesin MafB signature. Length = 467 Score = 30.4 bits (68), Expect = 0.010 Identities = 18/66 (27%), Positives = 31/66 (46%), Gaps = 2/66 (3%) Query: 4 SIKRAVTSYDVAKAAGVSQSAVSRAFTDGAKISPATREKVRKVAAELGYRPS--FIAQSL 61 ++ A +AKAA ++AVS F D K A + R++ YR + + L Sbjct: 315 NVAAAAKVAKLAKAAKPGKAAVSGDFADSYKKKLALSDSARQLYQNAKYREALDIHYEDL 374 Query: 62 ITRRSN 67 I R+++ Sbjct: 375 IRRKTD 380
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 34.1 bits (78), Expect = 9e-05 Identities = 18/52 (34%), Positives = 28/52 (53%), Gaps = 5/52 (9%) Query: 76 VAPDAIRRGIGSALLNEVKQ-----HYRWLSLEVYQKNVQAVNFYHAQGFRI 122 VA D ++G+G+ALL++ + H+ L LE N+ A +FY F I Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 114 bits (286), Expect = 2e-32 Identities = 42/124 (33%), Positives = 64/124 (51%), Gaps = 11/124 (8%) Query: 108 LNMPNNVTFDSSSANLKPAGANTLTGVAMVLKEYEKT--AVNVVGYTDSTGGQDLNMRLS 165 + ++V F+ + A LKP G L + L + +V V+GYTD G N LS Sbjct: 215 FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLS 274 Query: 166 QQRADSVASALITQGVAANRIRTSGMGPANPIASNTTAEGK---------AQNRRVEITL 216 ++RA SV LI++G+ A++I GMG +NP+ NT K A +RRVEI + Sbjct: 275 ERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334 Query: 217 SPLQ 220 ++ Sbjct: 335 KGIK 338
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 49.5 bits (118), Expect = 2e-08 Identities = 37/186 (19%), Positives = 85/186 (45%), Gaps = 3/186 (1%) Query: 19 KYQWMILALCFITVAMDGFDTAIIGFIASDLVQEWGVEKSALGPVMSAALVGLAVGALTA 78 ++ +++ LC ++ + ++ D+ ++ ++ V +A ++ ++G Sbjct: 11 RHNQILIWLCILSF-FSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVY 69 Query: 79 GPLADRIGRKKVLLMSIVVFGSFSLLTAFATSLNQLTLL-RFLTGLGLGAAMPNAATLMS 137 G L+D++G K++LL I++ S++ S L ++ RF+ G G A +++ Sbjct: 70 GKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVA 129 Query: 138 EYAPERRRALLVNLMFVGFPMGSSLGGFLSAWMIPHYGWQSVLILGGVMPLLLAVALVFL 197 Y P+ R L+ MG +G + + + W S L+L ++ ++ L+ L Sbjct: 130 RYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW-SYLLLIPMITIITVPFLMKL 188 Query: 198 LPESAR 203 L + R Sbjct: 189 LKKEVR 194 Score = 35.2 bits (81), Expect = 4e-04 Identities = 37/183 (20%), Positives = 72/183 (39%), Gaps = 11/183 (6%) Query: 260 LCMTYFLGLLIFYLLTSWLPLLIRETGASMSQASIITALFPLGGGIGVLILGALMDKINP 319 LC+ F +L +L LP + + + + + F L IG + G L D++ Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78 Query: 320 NKVVAVGWLLTGVFVFLVGFSTNNLVLMGVMVFIAGSIMNGAQSSM-PALAAG----FYP 374 +++ G ++ F ++GF ++ + I + GA ++ PAL + P Sbjct: 79 KRLLLFGIIINC-FGSVIGFVGHSF----FSLLIMARFIQGAGAAAFPALVMVVVARYIP 133 Query: 375 TQGRATGVAWMLGIGRFGGILG-AFSGAFLMQAQLSFVTIFTLLSIPAFLSAIALLVKYK 433 + R + I G +G A G S++ + +++I + LL K Sbjct: 134 KENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEV 193 Query: 434 TSK 436 K Sbjct: 194 RIK 196
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 30.2 bits (68), Expect = 0.018 Identities = 6/43 (13%), Positives = 16/43 (37%) Query: 211 RILELTPGALRSYGGNYADYQQQRDAEQQAARAALDHAATERR 253 ++ P Y N +Y + D + ++ + E++ Sbjct: 157 QLSAKDPNNKEFYEKNLKEYTDKLDKLDKESKDKFNKIPAEKK 199
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 142 bits (360), Expect = 1e-39 Identities = 98/418 (23%), Positives = 182/418 (43%), Gaps = 19/418 (4%) Query: 20 LMLVMLLSALDQTIVSTALPTIVGELGGL-DKLSWVVTAYILSSTIVVPLYGKFGDLFGR 78 L ++ S L++ +++ +LP I + +WV TA++L+ +I +YGK D G Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78 Query: 79 KIVLQVAIVLFLVGSALCGVAQNMTQLVLM-RGLQGLGGGGLMVISMAAVADVIPPANRG 137 K +L I++ GS + V + L++M R +QG G + M VA IP NRG Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138 Query: 138 RYQGLFGGVFGLATVIGPLIGGFLVQHASWRWIFYINLPLGLFALLVIGAVFHSSNKRSQ 197 + GL G + + +GP IGG + + W ++ +P+ + R + Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVRIK 196 Query: 198 HQVDWLGAIYLSMALLCIILFTSEGGSVRAWNDPQLWCILAFGVVGVIGFIYEERIAAEP 257 D G I +S+ ++ +LFT+ L V+ + F+ R +P Sbjct: 197 GHFDIKGIILMSVGIVFFMLFTTS----------YSISFLIVSVLSFLIFVKHIRKVTDP 246 Query: 258 IIPLSLFRNRSFLLCSLIGFVIGMSLFGSVTFLPLYLQVVKAATPTEAGMQLI-PLMGGL 316 + L +N F++ L G +I ++ G V+ +P ++ V + E G +I P + Sbjct: 247 FVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSV 306 Query: 317 LMTSIVSGRIISRTGRYRIFPILGTLSGMVGMVLLTRITIYSPMWQLYLFTAVLGMGLGL 376 ++ + G ++ R G + +G V + + + + + + VLG GL Sbjct: 307 IIFGYIGGILVDRRGPLYVL-NIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSF 364 Query: 377 VMQVLVLAVQNAMPAQMYGVATSGVTLFRSIGGSIGVALFGAVFTHVLQNNLQRLLPE 434 V+ V +++ Q G S + + G+A+ G + + L + QRLLP Sbjct: 365 TKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLD--QRLLPM 420
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 74.3 bits (182), Expect = 1e-18 Identities = 39/175 (22%), Positives = 78/175 (44%), Gaps = 10/175 (5%) Query: 12 RPGRPRGKKPGTASREQLMDIALMLFARQGIAHTSLNAIAKEAGVTPAMLHYYFSSREAL 71 R + ++ +R+ ++D+AL LF++QG++ TSL IAK AGVT ++++F + L Sbjct: 3 RKTKQEAQE----TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDL 58 Query: 72 VDQLLEERFMPLRSEIGQIFIEHPEDPVTAF----TLLIEALGTLAEKHNWFAPLWM-QE 126 ++ E + + + P DP++ ++E+ T + ++ E Sbjct: 59 FSEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCE 118 Query: 127 VIGEMPILRQHMHARFGDDKYHRMLATVKRWQEEGKLNPALSPELLFTTLISLVL 181 +GEM +++Q + Y R+ T+K E L L + + Sbjct: 119 FVGEMAVVQQ-AQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYIS 172
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 38.0 bits (88), Expect = 3e-06 Identities = 23/100 (23%), Positives = 42/100 (42%), Gaps = 10/100 (10%) Query: 23 EYNLRYLDATQFADLGVYFRDEAGVMLGGLIAKRKANW---LCIEYLWVSEASRGSGLGG 79 + ++ Y++ A Y + G I R NW IE + V++ R G+G Sbjct: 54 DMDVSYVEEEGKAAFLYYLENN----CIGRIKIRS-NWNGYALIEDIAVAKDYRKKGVGT 108 Query: 80 ELMRAAEKQAREEGCRHVLVDTFSFQ--ALPFYQKQGYQL 117 L+ A + A+E ++++T A FY K + + Sbjct: 109 ALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 53.9 bits (129), Expect = 3e-11 Identities = 23/148 (15%), Positives = 47/148 (31%), Gaps = 12/148 (8%) Query: 1 MSNALAARQKIRQDEIIAAARHCFRRHGFHGASMAQIASEAKLSVGQIYRYFANKDAIIA 60 M+ + + I+ A F + G S+ +IA A ++ G IY +F +K + + Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EMIRRIIDY---RIAEMDGKTQTDQIPRLLA-----WRQTLDEDDDALMLEM----AAEA 108 E+ E K D + L T+ E+ L++E+ Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 109 TRNPQIAAMMVEADKRMFANACAHIRKA 136 + + ++ Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHC 148
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 44.4 bits (105), Expect = 4e-07 Identities = 20/85 (23%), Positives = 39/85 (45%), Gaps = 8/85 (9%) Query: 42 PVSVISELTGRTTAS-LSAEVRPQVGGIIQKRLFTEGDRVKAGQALYQIDPASYRAAYNE 100 V +++ G+ T S S E++P I+++ + EG+ V+ G L ++ A + Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138 Query: 101 AAAALKQAQSLVTSDCQKAQRYAAL 125 ++L QA+ + RY L Sbjct: 139 TQSSLLQAR-------LEQTRYQIL 156
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1152 bits (2981), Expect = 0.0 Identities = 580/1031 (56%), Positives = 751/1031 (72%), Gaps = 6/1031 (0%) Query: 3 SRFFVRRPVFAWVIAILIMLAGVLAIQTLPVAQYPDVAPPAVKISATYTGASAETLENSV 62 + FF+RRP+FAWV+AI++M+AG LAI LPVAQYP +APPAV +SA Y GA A+T++++V Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 63 TQVIEQQLTGLDNLLYFTSTSSSDGSVDITVTFEQGTDPDTAQVQVQNKVQQAESRLPSE 122 TQVIEQ + G+DNL+Y +STS S GSV IT+TF+ GTDPD AQVQVQNK+Q A LP E Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 123 VQQSGVTVVKSQSNFLLILAVYDKNNKATSSDISDWLVSNMQDPLARVEGVGSLRVFGAE 182 VQQ G++V KS S++L++ N T DISD++ SN++D L+R+ GVG +++FGA+ Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ 181 Query: 183 YAMRIWMDPTKLASYALMPSDVQTAIEAQNVQISAGKIGALPSSSAQQLTATVRAQSRLQ 242 YAMRIW+D L Y L P DV ++ QN QI+AG++G P+ QQL A++ AQ+R + Sbjct: 182 YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 243 TVDQFKNIIVKSKSDGSVVRLGDVARVEMGSEDYTATSNLNGHPAAGIAVMMAPGANALD 302 ++F + ++ SDGSVVRL DVARVE+G E+Y + +NG PAAG+ + +A GANALD Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301 Query: 303 TATRVKSKIAEYQRQMPQGYDIAYPKDSTEFIKISVEDVIQTLFEAIILVVCVMYLFLQN 362 TA +K+K+AE Q PQG + YP D+T F+++S+ +V++TLFEAI+LV VMYLFLQN Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361 Query: 363 FRATLIPAVAVPVVLLGTFGVLALFGYSINTLTLFAMVLAIGLLVDDAIVVVENVERIMR 422 RATLIP +AVPVVLLGTF +LA FGYSINTLT+F MVLAIGLLVDDAIVVVENVER+M Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421 Query: 423 DEGLPAREATEKSMGEISGALIAIALVLSAVFLPMAFFGGSTGVIYRQFSVTIISAMMLS 482 ++ LP +EATEKSM +I GAL+ IA+VLSAVF+PMAFFGGSTG IYRQFS+TI+SAM LS Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481 Query: 483 VVVALTLTPALCGALL----SHSKPHTKGFFGAFNRLWGRTEQGYQHRVLGGLRRSAMMM 538 V+VAL LTPALC LL + + GFFG FN + + Y + V L + + Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541 Query: 539 GAFVLICGTMALAMWKLPGSFLPVEDQGEIMVQYTLPAGATAVRTAEVRRQVTDWFLSKE 598 + LI M + +LP SFLP EDQG + LPAGAT RT +V QVTD++L E Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601 Query: 599 KANTNVIFTVDGFNFSGSGQNAGMAFVSLKNWSERKGAENTAQAIALRATRDLSSIRDAS 658 KAN +FTV+GF+FSG QNAGMAFVSLK W ER G EN+A+A+ RA +L IRD Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661 Query: 659 LFAMTPPSVDGLGQSNGFTFELMASGGTDRDSLMKMRSQLLAAANQSP-ELQSVRANDLP 717 + P++ LG + GF FEL+ G D+L + R+QLL A Q P L SVR N L Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721 Query: 718 QMPQLQVDIDNDKAVSLGLSLSDVTDTLSSAWGGTYVNDFIDRGRVKKVYIQGESDARAV 777 Q ++++D +KA +LG+SLSD+ T+S+A GGTYVNDFIDRGRVKK+Y+Q ++ R + Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781 Query: 778 PSDLGKWFVRSSNDSMTPFSAFATTHWQYGPESLVRYNGATAFEIQGENASGFSSGAAME 837 P D+ K +VRS+N M PFSAF T+HW YG L RYNG + EIQGE A G SSG AM Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841 Query: 838 KMEALANSLPAGTTWAWSGMSLQEKLASGQAMSLYAISILVVFLCLAALYESWSVPFSVI 897 ME LA+ LPAG + W+GMS QE+L+ QA +L AIS +VVFLCLAALYESWS+P SV+ Sbjct: 842 LMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVM 901 Query: 898 MVIPLGLLGAALAAWMRDLSNDVYFQVALLTTIGLSSKNAILIVEFA-EAAVDEGYSLSR 956 +V+PLG++G LAA + + NDVYF V LLTTIGLS+KNAILIVEFA + EG + Sbjct: 902 LVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVE 961 Query: 957 AALRAAQTRLRPIVMTSLAFIAGVLPLAVATGAGANSRVAIGTGIIGGTLTATLLAVFFV 1016 A L A + RLRPI+MTSLAFI GVLPLA++ GAG+ ++ A+G G++GG ++ATLLA+FFV Sbjct: 962 ATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021 Query: 1017 PLFFVLVKRLF 1027 P+FFV+++R F Sbjct: 1022 PVFFVVIRRCF 1032
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 68.3 bits (167), Expect = 9e-15 Identities = 76/347 (21%), Positives = 123/347 (35%), Gaps = 16/347 (4%) Query: 2 PRISLSWALILGLLTAIGPLCTDFYLPALPDITRQLNATGTQTQFSLTAALIGLGLGQLF 61 P L L L A+G +P LP + R L + L L Q Sbjct: 3 PNRPLIVILSTVALDAVG---IGLIMPVLPGLLRDLVHSN-DVTAHYGILLALYALMQFA 58 Query: 62 FGP----LSDRIGRKKPLVLSLLLFIFSSAMCAATDDIHLLIGWRFLQGFAGAGGSVLSR 117 P LSDR GR+ L++SL A+ A + +L R + G GA G+V Sbjct: 59 CAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGA 118 Query: 118 SIARDKYQGARLTQFFALLMTVNGIAPVVSPVIGGYIITAFDWRILFWTMAGIGGVLLLL 177 IA D G + F + G V PV+GG + F F+ A + G+ L Sbjct: 119 YIA-DITDGDERARHFGFMSACFGFGMVAGPVLGGL-MGGFSPHAPFFAAAALNGLNFLT 176 Query: 178 SLTVLRETLPG-KDPATTHQQSDIPVLKNRPFMR--YCLIQAFMMAGLFSYIGSSSFVIQ 234 +L E+ G + P + + + M L+ F + L + ++ +VI Sbjct: 177 GCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIF 236 Query: 235 SE--YGMTALQFSLLFGVNGI-GLIIAAQIFSRLSRRYSADTLLRGGLSLAVLCAVITLF 291 E + A + GI + A I ++ R L G+ ++ F Sbjct: 237 GEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAF 296 Query: 292 LAWQHLPLPALIGLFFTVSFMSGISTVAGSKAMSEVSSTQSGTASAL 338 + P ++ L M + + + E G+ +AL Sbjct: 297 ATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAAL 343
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 27.3 bits (60), Expect = 0.029 Identities = 24/104 (23%), Positives = 38/104 (36%), Gaps = 24/104 (23%) Query: 31 AAIEKRQKEISDGLAS----AERAKKDLDLA-QANATDQLKKAKAEAQVIIEQANKRRSQ 85 +EK +++ ++ A A+ AK ++ Q N Q E Q K + Sbjct: 1049 KTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQ---TTETKETAT 1105 Query: 86 MLDEAKAEAEQERTKIVA----------------QAQAEIDAER 113 + E KA+ E E+T+ V Q QAE E Sbjct: 1106 VEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREN 1149
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 136 bits (344), Expect = 1e-37 Identities = 99/406 (24%), Positives = 174/406 (42%), Gaps = 16/406 (3%) Query: 12 LPWIAAMAFFMQALDATILNTALPAIAHSLNRSPLAMQSAIISYTLTVAMLIPVSGWLAD 71 L W+ ++FF L+ +LN +LP IA+ N+ P + ++ LT ++ V G L+D Sbjct: 16 LIWLCILSFF-SVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74 Query: 72 RFGTRKVFIIAVGLFTLGSLACALSSSLMELVIF-RVIQGIGGAMMMPVARLALLRAYPR 130 + G +++ + + + GS+ + S L+I R IQG G A + + + R P+ Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134 Query: 131 SELLPVLNFVTMPGLVGPILGPVLGGVLVTWASWHWIFLINIP-IGIIGILYARKYMPDF 189 + +G +GP +GG++ + HW +L+ IP I II + + K + Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKE 192 Query: 190 TTPRRRFDTSGFLLFGLSLVLFSSGIELFGEKIVATWIALSVIAFSVILLLAYIRHARRH 249 + FD G +L + +V F + L V SV+ L +++H R+ Sbjct: 193 VRIKGHFDIKGIILMSVGIVFFMLFTT------SYSISFLIV---SVLSFLIFVKHIRKV 243 Query: 250 PTPLISLSLFKTRTFSVGIAGNLATRLGTGCVPFLMPLMLQVGFGY-PAIVAGCMIAPTA 308 P + L K F +G+ ++P M++ A + +I P Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303 Query: 309 IGSIIAKSTVTQILRWLGYRKTLVGITIFIGLMIAQFSFQSPAMPVWMLLLPLFVLGMAM 368 + II ++ G L F+ + SF +M ++ +FVLG Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLS 363 Query: 369 STQFTSMNTITLADLTDDNASSGNSLLAVTQQLSISLGVAISAAVL 414 T+ T ++TI + L A +G SLL T LS G+AI +L Sbjct: 364 FTK-TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408
>SECA#SecA protein signature. Length = 901 Score = 27.1 bits (60), Expect = 0.041 Identities = 10/57 (17%), Positives = 24/57 (42%) Query: 15 KTREEMNQESRDRKRQKKHRGHAAGSRATGGDAASSGKKQSQQQDPRIGSKKPIPLG 71 + EE+ + + R+ + + D+A++ +Q + ++G P P G Sbjct: 832 RMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCG 888
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 598 bits (1544), Expect = 0.0 Identities = 203/478 (42%), Positives = 295/478 (61%), Gaps = 11/478 (2%) Query: 1 MQRGIAWIVDDDSSIRWVLERALTGAGLSCTTFESGNEVLDALTTKTPDVLLSDIRMPGM 60 M + DDD++IR VL +AL+ AG + + + D++++D+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 DGLALLKQIKQRHPMLPVIIMTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVDRAIS 120 + LL +IK+ P LPV++M+A + A+ A ++GA+DYLPKPFD+ E + ++ RA++ Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 121 HYQEQQQPRNAPINSPTADIIGEAPAMQDVFRIIGRLSRSSISVLINGESGTGKELVAHA 180 + + ++G + AMQ+++R++ RL ++ ++++I GESGTGKELVA A Sbjct: 121 EPKRRPSKLEDDSQDG-MPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179 Query: 181 LHRHSPRAKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTVRQGRFEQADGGTLFLDE 240 LH + R PF+A+NMAAIP+DLIESELFGHEKGAFTGA T GRFEQA+GGTLFLDE Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239 Query: 241 IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLEQRVQEGKFREDLFHR 300 IGDMP+D QTRLLRVL G++ VGG P++ DVRI+AAT+++L+Q + +G FREDL++R Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299 Query: 301 LNVIRVHLPPLRERREDIPRLARHFLQIAARELGVEAKLLHPETETALTRLAWPGNVRQL 360 LNV+ + LPPLR+R EDIP L RHF+Q A +E G++ K E + WPGNVR+L Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVREL 358 Query: 361 ENTCRWLTVMAAGQEVLTQDLPSELFETAIPDSPTHMQPDSWATLLGQWADRALRS---- 416 EN R LT + + + + +EL S + + Q + +R Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418 Query: 417 -----GHQNLLSEAQPEMERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME 469 L EME L+ AL T+G++ +AA LLG RNTL +K++ELG+ Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 179 bits (455), Expect = 9e-51 Identities = 100/448 (22%), Positives = 171/448 (38%), Gaps = 87/448 (19%) Query: 4 NLRNIAIIAHVDHGKTTLVDKLLQQSGTFDARAETQE--RVMDSNDLEKERGITILAKNT 61 + NI ++AHVD GKTTL + LL SG + D+ LE++RGITI T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 62 AIKWNDYRINIVDTPGHADFGGEVERVMSMVDSVLLVVDAMDGPMPQTRFVTKKAFAHGL 121 + +W + ++NI+DTPGH DF EV R +S++D +L++ A DG QTR + G+ Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121 Query: 122 KPIVVINKVDRPGARPDWVVDQVFD-------------LFVNLDATDEQLD--------- 159 I INK+D+ G V + + L+ N+ T+ Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181 Query: 160 --------------------------------FPIVYASALNGIAGLDHEDMAEDMTPLY 187 FP+ + SA N I G+D+ L Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNI-GIDN---------LI 231 Query: 188 QTIVDRVPAPNVDLDGPLQMQISQLDYNNYVGVIGIGRIKRGKVKPNQQVTIVDSEGKTR 247 + I ++ + L ++ +++Y+ + R+ G + V I + E Sbjct: 232 EVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-- 289 Query: 248 NGKVGKVLTHLGLERIDSDIAEAGDIIAITGLG-ELN--ISDTICDPQNVEALPALSVDE 304 K+ ++ T + E D A +G+I+ + +LN + DT PQ + Sbjct: 290 --KITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQR----ERIENPL 343 Query: 305 PTVSMFFCVNTSPFCGKEGKYVTSRQILDRLNKELVHNVALRVEETPDADAFRVSGRGEL 364 P + + + D L LR +S G++ Sbjct: 344 PLLQTTVEPSKPQQREMLLDALLEISDSDPL---------LRYYVDSATHEIILSFLGKV 394 Query: 365 HLSVLIENMRRE-GFEMAVSRPKVIFRE 391 + V ++ + E+ + P VI+ E Sbjct: 395 QMEVTCALLQEKYHVEIEIKEPTVIYME 422 Score = 30.6 bits (69), Expect = 0.019 Identities = 12/75 (16%), Positives = 29/75 (38%), Gaps = 1/75 (1%) Query: 398 EPFENVTLDVEEQHQGSVMQALGERKGDLKNMNPDGKGRVRLDYVIPSRGLIGFRSEFMT 457 EP+ + + +++ + ++ + V L IP+R + +RS+ Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKN-NEVILSGEIPARCIQEYRSDLTF 595 Query: 458 MTSGTGLLYSTFSHY 472 T+G + + Y Sbjct: 596 FTNGRSVCLTELKGY 610
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 33.9 bits (77), Expect = 0.001 Identities = 14/62 (22%), Positives = 23/62 (37%), Gaps = 10/62 (16%) Query: 137 FFETGRGIVDTIVAFSALAVFAWFGSGLLGFKAGIWFYSVIVISVGIIIFFVLNRDDDEV 196 F + D V+ V A S L G GIW +++ + I D +++ Sbjct: 464 FLTLEKKAADAGVS----YVVALLFSLLAGTTLGIWGIAIVTGILCSYI------DKNKL 513 Query: 197 KT 198 T Sbjct: 514 NT 515
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 38.3 bits (89), Expect = 5e-05 Identities = 30/156 (19%), Positives = 60/156 (38%), Gaps = 8/156 (5%) Query: 59 IAAQFGISPGLAATVNASVLVAALIGGLLANRVINRFGQKRAFIIGMGLCTIGAAAVAIA 118 IA F P VN + ++ IG + ++ ++ G KR + G+ + G+ + Sbjct: 40 IANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVG 99 Query: 119 PNIW-WVLVCRVVMGFGLGIDFPLATNAVAELRGSTSKKTGSSVNLWQMAWYVSTTVVYL 177 + + +++ R + G G FP A V R + G + L S + Sbjct: 100 HSFFSLLIMARFIQGAG-AAAFP-ALVMVVVARYIPKENRGKAFGLIG-----SIVAMGE 152 Query: 178 VLLPLLLSGVAEEQLWRYGIFVGAIFAVSIMILRYF 213 + P + +A W Y + + I +++ L Sbjct: 153 GVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKL 188 Score = 30.2 bits (68), Expect = 0.020 Identities = 18/109 (16%), Positives = 38/109 (34%), Gaps = 5/109 (4%) Query: 325 LCGITGGLIGSLILQRLGTRLQSMYGFALVTVALLALGALATTNPWLSLGLLGAIIFFHS 384 + I G IG +++ R G G ++V+ L L T W ++ ++ S Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLS 363 Query: 385 AGPGGLGMTIATLSYPPSIRPTGVGFARAIMRTGAIAGLIFWPMLWGAL 433 T+ + S++ G +++ + + G L Sbjct: 364 F-----TKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 32.5 bits (74), Expect = 0.004 Identities = 33/158 (20%), Positives = 58/158 (36%), Gaps = 12/158 (7%) Query: 298 IVCCPLVGWISDRIGQRKMYLFGAGFCVLFAFPFFFLLDTKSTPIIWCSMILGYNLGPTM 357 C P++G +SDR G+R + L A + + +++ I+ G T Sbjct: 57 FACAPVLGALSDRFGRRPVLLVS---LAGAAVDYAIMATAPFLWVLYIGRIVAGITGATG 113 Query: 358 MFAVQPTLFTRMFGIRVRYTG-LSFAYQFSAILGGLSPLIASSLLALGGGRPWYVALFLF 416 A R R+ G +S + F + G P++ + P++ A L Sbjct: 114 AVAGAYIADITDGDERARHFGFMSACFGFGMVAG---PVLGGLMGGFSPHAPFFAAAAL- 169 Query: 417 VISTLSFICVWLIEPH----SNNKKDAKKPLTSYCYIR 450 C L E H +++A PL S+ + R Sbjct: 170 NGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWAR 207
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 29.5 bits (66), Expect = 0.031 Identities = 20/78 (25%), Positives = 27/78 (34%), Gaps = 9/78 (11%) Query: 538 QMARRDNADPSGLGNT-LGWAWAWPLNRRILYNRASADPQGKPWDPKRQILKWDGAKWAG 596 + RD ADP+ + G+ W PL G K + G W Sbjct: 75 VLEARDGADPAPADDGWSGYRWL-PLRAG------RVATSGSIAGGKLNLAFAQGEAWTP 127 Query: 597 MDIPDYSAAAPGSDVGPF 614 D PD PG ++ PF Sbjct: 128 GDNPDV-LIFPGGEMTPF 144
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 69.5 bits (170), Expect = 4e-14 Identities = 40/296 (13%), Positives = 84/296 (28%), Gaps = 32/296 (10%) Query: 440 SDSSWSSIGSISATLPGGFSTVWVNQEKTIIGARLRRSDADNRAIGGTLNLNPLWSKLGT 499 +D+++S + + G V A +R L + + T Sbjct: 490 ADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKL-------QLTVTQQLGRTST 542 Query: 500 FSISYNDDRRYNSHYYTADYYQTLFSGAFGSLGLRAGIQRFNNGGSGSSSSTGKYVALDF 559 +S + Y + +Q + AF + N + +AL+ Sbjct: 543 LYLSG-SHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKG---RDQMLALNV 598 Query: 560 SLPLGNWFSAGMTHQNGYTMANLSARKQFDEGVV------------RTLGANISRAISGD 607 ++P +W + Q + A+ S + + L ++ +G Sbjct: 599 NIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGG 658 Query: 608 TGDDKTLSGGGYAQFDTRYANGTLNINSGADGYVNTNLTASGSVGWQGRNIAASGRTDGN 667 + +G + Y N + S +D SG V + + Sbjct: 659 GDGNSGSTGYATLNYRGGYGNANIGY-SHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDT 717 Query: 668 AGVIFNTDLDDDGKLSARVNGRVIQLTGKRNYL---PLSPYSRYEVELQNSKNSLD 720 ++ G A+V + T R Y + Y V L + + + Sbjct: 718 VVLV-----KAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADN 768
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 667 bits (1723), Expect = 0.0 Identities = 223/870 (25%), Positives = 379/870 (43%), Gaps = 69/870 (7%) Query: 8 KTLLALFIALACTSFFPA-AADETIEFNTAVLDASDRQNVDLQRFSEGNFVAPGDYLLDV 66 + + +AC A + + FN L + DL RF G + PG Y +D+ Sbjct: 23 LAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDI 82 Query: 67 HINGQEIAQQQVRYISDVDHPHKTLVCLSPQQLELLALKEDAL-KYTRPLAENCLDI-SR 124 ++N +A + V + + D + CL+ QL + L ++ + C+ + S Sbjct: 83 YLNNGYMATRDVTFNTG-DSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSM 141 Query: 125 LPG--IALNNSAGVLDITVPQAWMKYTDPNWTPPERWDNGITGLIFDYNLSGQATRYQQD 182 + L+ L++T+PQA+M + PPE WD GI + +YN SG + + + Sbjct: 142 IHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRI- 200 Query: 183 GGSYQSLSGYGQTGFNLGAWRVRSQYQANYT----SDTQGTRFDWDQFYAYRPLPMLAAK 238 GG+ Q+G N+GAWR+R +Y S ++ + R + L ++ Sbjct: 201 GGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSR 260 Query: 239 LTLGEIYLNSQIFDSVRFTGANLASDERMLPPNLQGYAPQVHGIAKSNAKVTVSQQQHVI 298 LTLG+ Y IFD + F GA LASD+ MLP + +G+AP +HGIA+ A+VT+ Q + I Sbjct: 261 LTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDI 320 Query: 299 YQTTVPAGPFNIEDL-RSSVRGTLDVRVEEQDGTVQTFQVNTADIPYLTRPGYIRYNAAV 357 Y +TVP GPF I D+ + G L V ++E DG+ Q F V + +P L R G+ RY+ Sbjct: 321 YNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITA 380 Query: 358 GKPSRYDHHLQGPAFYSGDFSWGMSNAWSLYGGALLTGNRYNAGSLGIGRDLSLLGALSA 417 G+ + + P F+ G+ W++YGG L RY A + GIG+++ LGALS Sbjct: 381 GEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLAD-RYRAFNFGIGKNMGALGALSV 439 Query: 418 DVTQSISRIKNQNQQKGLSFKLSYAKTFDEYNSAITFAGYRFSQRNFRSFSQFLDEQY-- 475 D+TQ+ S + + +Q G S + Y K+ +E + I GYR+S + +F+ + Sbjct: 440 DMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNG 499 Query: 476 -----------------ENNDSTGREKEMYTLTGNTTFFADDPRLATTLYLTYAHQNYWD 518 + + ++ LT R + TLYL+ +HQ YW Sbjct: 500 YNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQL----GRTS-TLYLSGSHQTYWG 554 Query: 519 RRSQDRYGLSVGHTFSFAGMEGISANLAAYRSEYQGKRDDSLSLSLSIPWRDGRSTEYQL 578 + D G +F + + + + ++ +Q RD L+L+++IP+ ++ + Sbjct: 555 TSNVDEQ-FQAGLNTAFEDI-NWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKS 612 Query: 579 Q------------NSGGRSSQMVSYSDNRDRNNP--WRVRAGLSEDGR----TAFDGYYQ 620 Q + GR + + +N + V+ G + G + Sbjct: 613 QWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLN 672 Query: 621 HRSMMAELESNLSWQQDRYISVGGTVRGGFTATRHGAAFHNSQASMNTARVMVDTDGVAN 680 +R S D + V GG A +G +N V+V G + Sbjct: 673 YRGGYGNANIGYS-HSDDIKQLYYGVSGGVLAHANGVTLGQ---PLNDTVVLVKAPGAKD 728 Query: 681 VPLNGQQ-AHSNRFGIAVVPDIVSYNSFDTRIDVDAMAEDIAATKAIVTSTLTEGAIGYQ 739 + Q ++ G AV+P Y +D + +A+++ A+ T GAI Sbjct: 729 AKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRA 788 Query: 740 RFAVAQGEKMMGLLRLADGSAPPFGAEIFNANGVSVAMVMDNGESWIAGVKPDETLSVVW 799 F G K++ L + PFGA + + + S +V DNG+ +++G+ + V W Sbjct: 789 EFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKW 847 Query: 800 G--GQTQCHLN--VPRHINPQG--NVLLPC 823 G C N +P Q + C Sbjct: 848 GEEENAHCVANYQLPPESQQQLLTQLSAEC 877
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 27.5 bits (61), Expect = 0.036 Identities = 8/42 (19%), Positives = 16/42 (38%), Gaps = 2/42 (4%) Query: 66 TSYHSGYFSLSVQGSLKAESGQYVVDRKLFFRYSRPLGHAAG 107 T++ ++LS + + Q D+ L + P H Sbjct: 568 TAFEDINWTLSYSLTK--NAWQKGRDQMLALNVNIPFSHWLR 607
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 27.3 bits (60), Expect = 0.008 Identities = 12/48 (25%), Positives = 23/48 (47%), Gaps = 9/48 (18%) Query: 36 TRGEIIAVGKGRILENGT--VQPLDVKV-------GDIVIFNDGYGVK 74 T E+ A+G+ ++L T +++ G++V ND GV+ Sbjct: 244 TLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVE 291
>BCTLIPOCALIN#Bacterial lipocalin signature. Length = 171 Score = 264 bits (675), Expect = 1e-93 Identities = 74/152 (48%), Positives = 106/152 (69%), Gaps = 1/152 (0%) Query: 25 PQGVTVVSPFDTQRYLGTWYEIARFDHQFESGLEKVTATYSLRDDGGLDVVNKGYNPDRG 84 P+ V VS F+ YLG WYE+AR DH FE GL +VTA Y +R+DGG+ V+N+GY+ ++G Sbjct: 20 PESVKPVSDFELNNYLGKWYEVARLDHSFERGLSQVTAEYRVRNDGGISVLNRGYSEEKG 79 Query: 85 MWQKTDGVAYFTGQPTRAALKVSFFGPFYGGYNVIALDKD-YRYALVCGPDRDYLWLLAR 143 W++ +G AYF T LKVSFFGPFYG Y V LD++ Y YA V GP+ +YLWLL+R Sbjct: 80 EWKEAEGKAYFVNGSTDGYLKVSFFGPFYGSYVVFELDRENYSYAFVSGPNTEYLWLLSR 139 Query: 144 SPKVSPEVKQQMLDIATRQGFDVSKLIWVNQR 175 +P V + + ++++ +GFD ++LI+V Q+ Sbjct: 140 TPTVERGILDKFIEMSKERGFDTNRLIYVQQQ 171
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 49.3 bits (117), Expect = 7e-08 Identities = 56/255 (21%), Positives = 104/255 (40%), Gaps = 32/255 (12%) Query: 26 KQIAQELEQAKAAKPAQPGTVEALQSALNALEERSASLERARQ-YQQVIDNFPKLFQTLR 84 + + LE A A ++ L++ ALE R A LE+A + +TL Sbjct: 228 ADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLE 287 Query: 85 SQIGNLPDEPRQVSTNLSTDALNQEILQVSSQLLEASRQAQQEQERARDIADSLNQLPQQ 144 ++ L E + + Q + L+ASR+A+++ E + N++ Sbjct: 288 AEKAALEAEKADLE---HQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKI--- 341 Query: 145 QSDARRQLNEVERRVGTQTSNTPLAQAQNLGLQAESARLKALVNELDLAQLSANNRQELS 204 S+A RQ + R + ++ L+AE +L+ ++S +RQ L Sbjct: 342 -SEASRQ--SLRRDLDA-------SREAKKQLEAEHQKLEEQN------KISEASRQSLR 385 Query: 205 RMRGDLAQKQ--GKLLDGYLQALRNQLNSQRQREAEKALESTELLAENSENLPPELVAQF 262 R DL + K ++ L+ ++L + + E ES +L + L +L A+ Sbjct: 386 R---DLDASREAKKQVEKALEEANSKLAALEKLNKE-LEESKKLTEKEKAELQAKLEAEA 441 Query: 263 KVNRELSQALNQQAQ 277 K L + L +QA+ Sbjct: 442 KA---LKEKLAKQAE 453 Score = 45.1 bits (106), Expect = 1e-06 Identities = 51/392 (13%), Positives = 114/392 (29%), Gaps = 80/392 (20%) Query: 33 EQAKAAKPAQPGTVEALQSALNAL-EERSASLERARQYQQVIDNFPKLFQTLRSQIGNLP 91 E + A +Q T+E +Q + E + + L ++ N Sbjct: 39 EVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAK 98 Query: 92 DEPRQVSTNLSTDALNQEILQVSSQLLEASRQAQQEQERARDIADSLNQLPQQQSDARRQ 151 ++ R+ +LS A + L+ + + + + + + L +++ + Sbjct: 99 EKLRKNDKSLSEKASKIQELEARKA--DLEKALEGAMNFSTADSAKIKTLEAEKAALAAR 156 Query: 152 LNEVERRVGTQTSNTPLAQAQNLGLQAESARLKALVNELDLAQLSANNRQELSRMRGDLA 211 ++E+ + + + A+ L+AE A L+A EL E + Sbjct: 157 KADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAEL-------EKALEGAMNFSTAD 209 Query: 212 QKQGKLLDGYLQALRNQLNSQRQREAEKALESTELLAENSENLPPELVAQFKVNRELSQA 271 + K L+ AL + + ST A+ + E + Sbjct: 210 SAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT-----------LEAEKAAL 258 Query: 272 LNQQAQRMDLVASQQRQAANQTLQVRQALNTLREQSQWLGSSNLLGEALRAQVARLPETP 331 +QA+ + + + +++ Sbjct: 259 EARQAELEKALEGAMNFSTADSAKIKTL------------------------------EA 288 Query: 332 KPQQLDTEMAQLRVQRLHYEDLLGSQPQLRLIRQADGEPLTSEQSRILEAQLRTQTELLN 391 + L+ E A L QS++L A ++ L+ Sbjct: 289 EKAALEAEKADLE-----------------------------HQSQVLNANRQSLRRDLD 319 Query: 392 SLLQGGDTLILELTKLKVSNGQLEDALKEINE 423 + + L E KL+ N E + + + Sbjct: 320 ASREAKKQLEAEHQKLEEQNKISEASRQSLRR 351
>cloacin#Cloacin signature. Length = 551 Score = 29.7 bits (66), Expect = 0.025 Identities = 19/67 (28%), Positives = 22/67 (32%), Gaps = 8/67 (11%) Query: 6 PGNNGQDRDPWGSSNNQGGNSGGNGNKGGREQGPPDLDDIFRKLSKKLGGLGGGKGGQGS 65 G D W S NN G G+G G G G GGG G G+ Sbjct: 29 VGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG--------HGNGGGNGNSGGGSGTGGN 80 Query: 66 GSSSQGP 72 S+ P Sbjct: 81 LSAVAAP 87
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 350 bits (899), Expect = e-118 Identities = 126/370 (34%), Positives = 195/370 (52%), Gaps = 18/370 (4%) Query: 123 DLSRFRKLQRHLSVLNEKL---FSRDAGEQPEIIHDSEAMQQVLDRAARLAASHVPVMVI 179 DL+ + ++ D+ + ++ S AMQ++ ARL + + +M+ Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMIT 166 Query: 180 GETGTGKELLANFVHNHSPRRHKPFIALNCGALPVTLIESTLFGTVKGGFTGAENTR-GY 238 GE+GTGKEL+A +H++ RR+ PF+A+N A+P LIES LFG KG FTGA+ G Sbjct: 167 GESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGR 226 Query: 239 LELAHGGTLFLDELNALPVDVQGKILRFLQEKTFWKVGGSKELKADIRIIAAMNESPFDM 298 E A GGTLFLDE+ +P+D Q ++LR LQ+ + VGG +++D+RI+AA N+ Sbjct: 227 FEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQS 286 Query: 299 IRQKRLRDDLFYRLEIGMVVIPPLRERKDEIIPLARHFMAKHQANSNKDVYPFLASVEKQ 358 I Q R+DL+YRL + + +PPLR+R ++I L RHF+ + + DV F + Sbjct: 287 INQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALEL 345 Query: 359 LLDYDWPGNVRMLENVIVRSLLLQKTPGPLTELFFNHETDVITHFSAE---ASSQSASLV 415 + + WPGNVR LEN++ R L E+ N I E A S S S+ Sbjct: 346 MKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSIS 405 Query: 416 ETYRENL----------TVEEGTLTDKLEQYEYQILIEALKEAHGCVAKAARMLGISRGA 465 + EN+ G L + EY +++ AL G KAA +LG++R Sbjct: 406 QAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNT 465 Query: 466 LQYKVKKHNI 475 L+ K+++ + Sbjct: 466 LRKKIRELGV 475
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 38.6 bits (89), Expect = 4e-05 Identities = 87/337 (25%), Positives = 140/337 (41%), Gaps = 58/337 (17%) Query: 88 ANFTTDLALKDEILPMDELFRYGDQKAGEFLVNEFWPAMHKNAQVMGTTYAIPFHNSTPI 147 A T D A +D++ P W A+ N +++ A P Sbjct: 103 AEITPDKAFQDKLYPFT------------------WDAVRYNGKLI----AYPIAVEALS 140 Query: 148 LYYNKTMFEQAGITQPPQTWAELLADAKKLTDESKGQWGIMLPSTNDDYGGWIFSALVRA 207 L YNK + + PP+TW E+ A K+L ++KG+ +M + + Y W L+ A Sbjct: 141 LIYNKDL-----LPNPPKTWEEIPALDKEL--KAKGKSALMF-NLQEPYFTW---PLIAA 189 Query: 208 NGG---KYFNEDYP-GEVYYNAPTTIGALRFWQDLIYKDKVMPSGVLNSKQISASFFSGK 263 +GG KY N Y +V + L F DLI K+K M + + A+F G+ Sbjct: 190 DGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLI-KNKHMNADT-DYSIAEAAFNKGE 247 Query: 264 VGMAMLSTGALGFMRENSKDFELGVAMLPA-KEQRAVPIGGASLVSFKGINDA--QKKVA 320 M + G + ++ GV +LP K Q + P G V GIN A K++A Sbjct: 248 TAMTI--NGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVG---VLSAGINAASPNKELA 302 Query: 321 YQFL-TYLVSPQVNGAWSRFTGYFSPRKAAYDTPEMKAYLQQDPRAAIALEQLKYAHPWY 379 +FL YL++ + A ++ A + L +DPR A +E + Sbjct: 303 KEFLENYLLTDEGLEAVNK-----DKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMP 357 Query: 380 STWETVAVRKAMENQLAAVVNDA--KVTPEAAVQTAQ 414 + + A A+ AV+N A + T + A++ AQ Sbjct: 358 NIPQMSAFWYAVR---TAVINAASGRQTVDEALKDAQ 391
>PF05272#Virulence-associated E family protein Length = 892 Score = 35.4 bits (81), Expect = 4e-04 Identities = 14/33 (42%), Positives = 18/33 (54%) Query: 30 VVLVGPSGCGKSTLLRLLAGLEPVSEGEIWLHD 62 VVL G G GKSTL+ L GL+ S+ + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGT 631
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 60.0 bits (145), Expect = 1e-13 Identities = 29/119 (24%), Positives = 56/119 (47%) Query: 9 AQHRKKDPARRHQQLLESAAMIAGRDGIASLSLNAVAREAGVSKGGLLHHFPNKQALIFA 68 A+ K++ Q +L+ A + + G++S SL +A+ AGV++G + HF +K L Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61 Query: 69 LFARLLAIMEEAISGLMAADNVSYGRFTRAYLHYLSDLTDTDESRQLMVLSLAMPDEPV 127 ++ + + E A R L ++ + T T+E R+L++ + E V Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 71.4 bits (175), Expect = 8e-16 Identities = 35/217 (16%), Positives = 75/217 (34%), Gaps = 30/217 (13%) Query: 1 MMTPEQKFARWVRVSIVAFLTI-FAWFIVADIWIPLTPDSTVMRVVTP------VSPRVS 53 + TP + R V I+ FL I F ++ + +T +T + P + Sbjct: 49 IETPVSRRPRLVAYFIMGFLVIAFILSVLG----QVEIVATANGKLTHSGRSKEIKPIEN 104 Query: 54 GYVSQVYVHNNSQVKKGDLLYELDPTPFINKVQAAQIAYEQAKLSNQQLDAQLAAARAN- 112 V ++ V V+KGD+L +L Q + QA+L + + N Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNK 164 Query: 113 -------------LRTAQFTARNDKVTLDRYQRLSTMQNVSQSDLDKVRTTWQTSEQSVS 159 + + R + +++ + + +LDK R T ++ Sbjct: 165 LPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARIN 224 Query: 160 ALNASIQNLLIQRGERDDNRNVTLQKY--RNALEEAQ 194 + + DD ++ ++ ++A+ E + Sbjct: 225 RYENLSRVE---KSRLDDFSSLLHKQAIAKHAVLEQE 258
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 30.6 bits (69), Expect = 0.004 Identities = 19/94 (20%), Positives = 32/94 (34%), Gaps = 14/94 (14%) Query: 115 FC-PIGTLAPVHSVDNLTIITEINGREADSWNTGDLQR-----------NAAELLSALSE 162 FC P T+ P+ S + R + + G L+ L ++ + Sbjct: 215 FCIPYITIEPIISKLSSQFWFSSVRRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRD 274 Query: 163 FATLNPGDAILIGTPHAR--VTLQPGDRVRILAE 194 L GD I + H L G+R + L + Sbjct: 275 ILGLRVGDIIRLHDTHVGDPFVLSIGNRKKFLCQ 308
>PF08280#M protein trans-acting positive regulator Length = 530 Score = 29.8 bits (67), Expect = 0.012 Identities = 10/27 (37%), Positives = 15/27 (55%) Query: 202 HLPLAEYARQVGLSATHLNYLCREFHG 228 LP+ E A + GL+ LN+ C E + Sbjct: 58 SLPITEVAEKTGLTFLQLNHYCEELNA 84
>2FE2SRDCTASE#Ferric iron reductase signature. Length = 262 Score = 383 bits (985), Expect = e-138 Identities = 182/264 (68%), Positives = 209/264 (79%), Gaps = 2/264 (0%) Query: 1 MVYRSASFRNDIDIIWQAPLLPAKDALANAIREKITTLRPHLLDFLRLDEEAPPCALTLA 60 M YRSA D+ IW+ L P LA A+R I R HLL+F+RLDE AP A+TLA Sbjct: 1 MAYRSAPLYEDV--IWRTHLQPQDPTLAQAVRATIAKHREHLLEFIRLDEPAPLNAMTLA 58 Query: 61 EWSAPTVLSSLLATWSDHIYRNQPTMPREQKPLLSLWAQWYIGLLVPPLMLALLSEETAI 120 +WS+P VLSSLLA +SDHIYRNQP M RE KPL+SLWAQWYIGL+VPPLMLALL++E A+ Sbjct: 59 QWSSPNVLSSLLAVYSDHIYRNQPMMIRENKPLISLWAQWYIGLMVPPLMLALLTQEKAL 118 Query: 121 SVAPERFRVEFHETGRAACFWIDVKADSSARSHSPQTRMETLVTNALLPVVQALEATGDI 180 V+PE F EFHETGR ACFW+DV D +A HSPQ RMETL++ AL+PVVQALEATG+I Sbjct: 119 DVSPEHFHAEFHETGRVACFWVDVCEDKNATPHSPQHRMETLISQALVPVVQALEATGEI 178 Query: 181 NGKLIWSNTGYLINWYLGEMKALLGDEQVTALRQHCFFEKQFADGQDNPLWRTVILREGL 240 NGKLIWSNTGYLINWYL EMK LLG+ V +LR FFEK +G+DNPLWRTV+LR+GL Sbjct: 179 NGKLIWSNTGYLINWYLTEMKQLLGEATVESLRHALFFEKTLTNGEDNPLWRTVVLRDGL 238 Query: 241 LVRRTCCQRNRLPDVHQCGDCTLK 264 LVRRTCCQR RLPDV QCGDCTLK Sbjct: 239 LVRRTCCQRYRLPDVQQCGDCTLK 262
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.7 bits (72), Expect = 0.017 Identities = 20/110 (18%), Positives = 37/110 (33%), Gaps = 18/110 (16%) Query: 34 CKALREEGYRVILVNS-----------NPATIMTDPEMADATYIEPIHWEVVRKIIEKER 82 +AL GY V + ++ + ++TD M D + + I+K R Sbjct: 20 NQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD------LLPRIKKAR 73 Query: 83 PDAVLPTMGGQTALNCALELERQGVLAEFGVTM-IGATADAIDKAEDRRR 131 PD + M Q A++ +G + I +A + Sbjct: 74 PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 233 bits (597), Expect = 2e-75 Identities = 104/405 (25%), Positives = 198/405 (48%), Gaps = 13/405 (3%) Query: 6 LWRWQGVDMQGQFCQGTQWNPGRLEVFQALQHERIIPLAIRRCAIKN---------TLWH 56 + +Q +D QG+ C+GTQ + Q L+ ++PL++ Sbjct: 3 QYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRK 62 Query: 57 PRYGS----QVVRQLAVLLQAGLSLAEGLELLAQQQPSAQWQALLRTLAQDLAQGVSLSA 112 R + + RQLA L+ A + L E L+ +A+Q L+ + + +G SL+ Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122 Query: 113 ALEKWPQAFAPLSLAMIRTGELTGKLDFCCLQLARQQQEQQQLADKVKKAVRYPAVILGL 172 A++ +P +F L AM+ GE +G LD +LA +++QQ+ ++++A+ YP V+ + Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182 Query: 173 ALMVVVAMLCFVLPEFAAIYQTFNTPLPLLTRLVIHASESLSYGWPMLILPIVLPAVLNL 232 A+ VV +L V+P+ + LPL TR+++ S+++ P ++L ++ + Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242 Query: 233 IACRRPHWLLQRQKMLHALPVVGKLKRGQRLSQIFTVLALTQSAGISFLQGLESVEDTLN 292 + R+ + + L LP++G++ RG ++ L++ ++ + LQ + D ++ Sbjct: 243 VMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVMS 302 Query: 293 CPLWRQRIQQAHLHISHGVPIWQALERSGGFTTLCLQLIRTGEASGSLDTMLENLARHHS 352 R R+ A + GV + +ALE++ F + +I +GE SG LD+MLE A + Sbjct: 303 NDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQD 362 Query: 353 EQTHYQAENLATLLEPALLLITGTIVGVLVVAMYLPIFHLGDAIS 397 + Q L EP L++ +V +V+A+ PI L +S Sbjct: 363 REFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLMS 407
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 49.1 bits (117), Expect = 2e-10 Identities = 22/69 (31%), Positives = 38/69 (55%) Query: 1 MDRQRGFTLIELMVVIGIIAILSAIGIPAYQNYLRKAALTDMLQTFVPYRTAIELCALEH 60 D+QRGFTL+E+MVVI II +L+++ +P KA + V A+++ L++ Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN 63 Query: 61 GGLTSCDAG 69 + + G Sbjct: 64 HHYPTTNQG 72
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 34.4 bits (79), Expect = 0.001 Identities = 43/285 (15%), Positives = 85/285 (29%), Gaps = 40/285 (14%) Query: 26 DKVEAEQSLITVEGDKASMEVPSPQAGVVKEIKVSVGDKTETGKLIMIFDSADGAADAAP 85 + V +T G S E+ + +VKEI V G+ G +++ + AD Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138 Query: 86 AKA--------EEKKEAAPVAAPAAAAAKDVHVPDIGGDEVEVTEIMVKVGDTVAAEQSL 137 ++ + + + + + + V E + T ++ Sbjct: 139 TQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEE---VLRLTSLIKEQF 195 Query: 138 ITVEGDKASMEVPAPFAGTVKEIKINTGDKVSTGSLIMVF--EVAGAAPAAAP---AQAA 192 T + K E E +L V + + A+ A Sbjct: 196 STWQNQKYQKE--LNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHA 253 Query: 193 APAAAAAPAAAAGAKDVHVPDIGGDEVEVTEVMVK-----------VGDKVA-------- 233 A V+ + E E+ + + DK+ Sbjct: 254 VLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313 Query: 234 AEQSLITVEGDKASMEVPAPFAGTVKEIKIST-GDKVSTGSLIMV 277 L E + + + AP + V+++K+ T G V+T +MV Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 67.7 bits (165), Expect = 2e-15 Identities = 52/246 (21%), Positives = 109/246 (44%), Gaps = 4/246 (1%) Query: 5 WVALKSIWAKEIHRFMRIWIQTLVPPVITMTLYFVIFGNLIGSRIGEMHGFTYMQFIVPG 64 W+A +W + + + + +L+ + +Y G +G +G + G +Y F+ G Sbjct: 16 WIA---VWRRNYIAWKKAALASLLGHLAEPLIYLFGLGAGLGVMVGRVGGVSYTAFLAAG 72 Query: 65 LIMMAVITNA-YANVASSFFSAKFQRNIEELLVAPVPTHVVIAGYVGGGVARGLCVGILV 123 ++ + +T A + + ++F + QR E +L + ++ G + + G + Sbjct: 73 MVATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGI 132 Query: 124 TAISLFFVPFQVHSWLFVGLTLILTAVLFSLAGLLNAVFAKTFDDISLIPTFVLTPLTYL 183 ++ Q S L+ + LT + F+ G++ A ++D T V+TP+ +L Sbjct: 133 GVVAAALGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFL 192 Query: 184 GGVFYSLTLLPPFWQALSHLNPIVYMISGFRYGFLGINDVPLVTTFGVLVVFIVLFYALC 243 G + + LP +Q + P+ + I R LG V + G L ++IV+ + L Sbjct: 193 SGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLS 252 Query: 244 WYLIQR 249 L++R Sbjct: 253 TALLRR 258
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 280 bits (718), Expect = 4e-97 Identities = 140/277 (50%), Positives = 169/277 (61%), Gaps = 16/277 (5%) Query: 1 MTLLSAFSLQFPLLWGGFLFIFGLTFGSFFNVVIHRLPLMMRQE---------------- 44 M LL + P L+ +F+F L GSF NVVIHRLP+M+ +E Sbjct: 1 MALLLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGV 60 Query: 45 ESARFNLCVPASFCPQCQRPLIWRDNIPLLSYLSLKGRARCCQAPISQRYPLTELASGLL 104 + +NL VP S CP C P+ +NIPLLS+L L+GR R CQAPIS RYPL EL + LL Sbjct: 61 DEPPYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALL 120 Query: 105 FVLAGYLLTPGLPLLGGLILLSTLLVLAIIDGQTQLLPDRLTLPLLWAGLLFNLNGTFVP 164 V L PG L L+L L+ L ID LLPD+LTLPLLW GLLFNL G FV Sbjct: 121 SVAVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVS 180 Query: 165 LSEAVIGAMTGYLSLWTVYWLFRLLTGKEALGYGDFKLLAALGAWSGWQILPQTLLCASA 224 L +AVIGAM GYL LW++YW F+LLTGKE +GYGDFKLLAALGAW GWQ LP LL +S Sbjct: 181 LGDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSL 240 Query: 225 SGLIWTLLQRRITLQSLDQPLAFGPWLALAGSGLFLW 261 G + + +P+ FGP+LA+AG LW Sbjct: 241 VGAFMGIGLILLRNHHQSKPIPFGPYLAIAGWIALLW 277
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 28.4 bits (63), Expect = 0.030 Identities = 16/46 (34%), Positives = 20/46 (43%), Gaps = 12/46 (26%) Query: 4 RQRGVALLMVLLILALMMVLASAMT------------ERTARLYQQ 37 RQRG LL ++LIL LM V A + + AR Q Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQ 47
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 29.5 bits (66), Expect = 0.006 Identities = 12/34 (35%), Positives = 22/34 (64%), Gaps = 1/34 (2%) Query: 7 RGFTLVEMLLALAILAAL-SIAAMAVLQNVLRAD 39 RGFTL+E+++ + I+ L S+ ++ N +AD Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKAD 41
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 30.6 bits (69), Expect = 7e-04 Identities = 18/66 (27%), Positives = 29/66 (43%), Gaps = 4/66 (6%) Query: 1 MKAQSGMTLIEVMVALVVF-ALAGLSVMQATLQQTRHMGRMEEKTLAGWLADNQLVQLKL 59 Q G TL+E+MV +V+ LA L V + + + + +N L KL Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVA--LENALDMYKL 61 Query: 60 EN-RWP 64 +N +P Sbjct: 62 DNHHYP 67
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 28.4 bits (63), Expect = 0.036 Identities = 9/44 (20%), Positives = 19/44 (43%), Gaps = 2/44 (4%) Query: 206 VYLGQSTKIYDRETGE--VHYGRVPAGSVVVSGNLPSKDGKYSL 247 V+L + G V+Y + G + + G ++ G Y++ Sbjct: 623 VFLSAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATEAGNYTV 666
>PF07212#Hyaluronoglucosaminidase Length = 336 Score = 31.6 bits (71), Expect = 0.003 Identities = 18/55 (32%), Positives = 24/55 (43%), Gaps = 3/55 (5%) Query: 47 IKAAKKAGNVAADGVIITKIDGTYGIILEVNCQTD---FVAKDGGFQAFANKVLD 98 +K K AA G+ I GT G +L + D +V DGGF A +D Sbjct: 246 VKKQKGGKGTAAQGIYINSTSGTTGKLLRIRNLGDDKFYVKHDGGFYAKKTSQID 300
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 29.8 bits (67), Expect = 0.008 Identities = 18/66 (27%), Positives = 24/66 (36%), Gaps = 14/66 (21%) Query: 120 AEAI-SLLRNNRVVILSAGTGNPFFTT-------------DSAACLRGIEIEADVVLKAT 165 AE I L+ +VI S G G P D A E+ AD+ + T Sbjct: 176 AETIKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILT 235 Query: 166 KVDGVF 171 V+G Sbjct: 236 DVNGAA 241
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 95.5 bits (237), Expect = 9e-26 Identities = 75/259 (28%), Positives = 112/259 (43%), Gaps = 20/259 (7%) Query: 4 LTGKKVFITGAEQGIGKETARKLIEAGCDIYIHYFSGEEGPRELIAIAQQRGGKAACGY- 62 + GK FITGA QGIG+ AR L G +I E + + + + A + Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGA--HIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63 Query: 63 ADLTSEADAARCVAEAAAFLGGIDILVNNVGGIIARKWLGEIDPQFWRTVIDVNMTTMLN 122 AD+ A A +G IDILVN V G++ + + + W VN T + N Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVN-VAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122 Query: 123 VTQSALPWLKEATDGASIVNLASLAGRSGGHSGSLVYSMAKGAVLTWTRSLAAELGEFGI 182 ++S ++ SIV + S S + Y+ +K A + +T+ L EL E+ I Sbjct: 123 ASRSVSKYMM-DRRSGSIVTVGSNPAGVPRTSMA-AYASSKAAAVMFTKCLGLELAEYNI 180 Query: 183 RVNAVAPGLILGTRFHNQHTTQASADRTIE-----------DIPLGRAGTPEDIARAICF 231 R N V+PG T Q + A + + IPL + P DIA A+ F Sbjct: 181 RCNIVSPG---STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLF 237 Query: 232 LASEYDGFISGATLDINGG 250 L S G I+ L ++GG Sbjct: 238 LVSGQAGHITMHNLCVDGG 256
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 55.0 bits (132), Expect = 1e-11 Identities = 32/174 (18%), Positives = 61/174 (35%), Gaps = 5/174 (2%) Query: 1 MAGRPRE---FDREHALLKARNLFWRQGYEGTSMSDLVAELGIASARIYKAFGSKEQLFR 57 MA + ++ R+H L A LF +QG TS+ ++ G+ IY F K LF Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 58 QAIVHYESQEGGFADRAFAA-ENNVQEAIKKMLVDAVH-LYSQAELPRGCMVVASAASVS 115 + ES G A + ++++L+ + ++ ++ Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 116 AENDQIKTWLAQHRLQRTQQIIDRLRQAVYNGELPDTTDADSLGDYFAVFLHGL 169 E ++ L+ +I L+ + LP ++ GL Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL 174
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 39.8 bits (93), Expect = 1e-05 Identities = 29/127 (22%), Positives = 49/127 (38%), Gaps = 17/127 (13%) Query: 119 DTLRALLDNNI---------VPVINENDAVATAEIKVGDNDNLSALAAILAGADKLLLLT 169 +T++ L++ + VPVI E+ + E V D D A AD ++LT Sbjct: 177 ETIKKLVERGVIVIASGGGGVPVILEDGEIKGVE-AVIDKDLAGEKLAEEVNADIFMILT 235 Query: 170 DQPGLFTADPRNNPQAELIKDVYGIDDALRAIAGDSVSGLGTGGMGTKLQAA-DVACRAG 228 D G + + +++V +++ + G MG K+ AA G Sbjct: 236 DVNGAALY--YGTEKEQWLREV-KVEELRKYYEEG---HFKAGSMGPKVLAAIRFIEWGG 289 Query: 229 IDTIIAA 235 IIA Sbjct: 290 ERAIIAH 296 Score = 29.0 bits (65), Expect = 0.032 Identities = 16/76 (21%), Positives = 33/76 (43%), Gaps = 13/76 (17%) Query: 4 SQTLVVKLGTSVLTGGSRRLNRAHIVELVRQCAQ----LHAMGHRIVIVTSG-------- 51 + +V+ LG + L ++ + +++ VR+ A+ + A G+ +VI Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61 Query: 52 -AIAAGREHLGYPELP 66 + AG+ G P P Sbjct: 62 LHMDAGQATYGIPAQP 77
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 537 bits (1385), Expect = 0.0 Identities = 230/384 (59%), Positives = 267/384 (69%), Gaps = 35/384 (9%) Query: 1 MKKSSLALMMMGLVASSATQAAEVYNKDGNKLDVYGKVKAMHYISDYDSKDGDQTYVRFG 60 MK+ LAL++ L+A+ A AAE+YNKDGNKLD+YGKV +HY SD SKDGDQTY+R G Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60 Query: 61 IKGETQINDQLTGYGRWESEFSGNKTESDSTQ-KTRLAFAGLKLKNYGSFDYGRNLGALY 119 KGETQINDQLTGYG+WE N TE + TRLAFAGLK +YGSFDYGRN G LY Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120 Query: 120 DVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFGVVDGLDMTLQYQGKNE--- 176 DVE WTDM PEFGGDS DN+MT RA+G+ATYRNTDFFG+VDGL+ LQYQGKNE Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180 Query: 177 -------------GREAKKQNGDGFGTSLSYDFGGSDFAISAAYTSSDRTNDQNLLAR-- 221 G + + NGDGFG S +YD G F+ AAYT+SDRTN+Q Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDI-GMGFSAGAAYTTSDRTNEQVNAGGTI 239 Query: 222 GVGKKAEAWATGLKYDANNIYLATMYSETRKMTP-------ISGGFANKTQNFEAVAQYQ 274 G KA+AW GLKYDANNIYLATMYSETR MTP GG ANKTQNFE AQYQ Sbjct: 240 AGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQ 299 Query: 275 FDFGLRPSLGYVLSKGKDIE----GVGNEDLVNYIDVGLTYYFNKNMNAFVDYKINQLNS 330 FDFGLRP++ +++SKGKD+ ++DLV Y DVG TYYFNKN + +VDYKIN L+ Sbjct: 300 FDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLLDD 359 Query: 331 DNKL----AINNDDIVALGMTYQF 350 D+ I+ DDIVALGM YQF Sbjct: 360 DDPFYKDAGISTDDIVALGMVYQF 383
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.026 Identities = 13/31 (41%), Positives = 19/31 (61%) Query: 35 VVLVGSSGCGKSTLLRMLIGLEPVTQGEIRV 65 VVL G+ G GKSTL+ L+GL+ + + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDI 629
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 53.7 bits (129), Expect = 5e-10 Identities = 40/183 (21%), Positives = 78/183 (42%), Gaps = 4/183 (2%) Query: 4 QSQMIFLLFIGYVFVYIDKTVTGFALLPIEKEFGLNAEQLGYITGIFFLAYSLFQVPAGW 63 +Q++ L I F +++ V +L I +F ++ F L +S+ G Sbjct: 12 HNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGK 71 Query: 64 LNDRIGYKTMLVLSLSALGIFALCFGALGLSFGLLLLF-RFLSGVGHSGYPCSCAKAVVS 122 L+D++G K +L+ + ++ G +G SF LL+ RF+ G G + +P V Sbjct: 72 LSDQLGIKRLLLFGIIINCFGSV-IGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130 Query: 123 NFSVENRTFAQSVLLSSAGLAMTIGPIIAVNALSLLGWHRSFAALGALVCVTAALIAWRV 182 ENR A ++ S + +GP I + W S+ L ++ + ++ Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SYLLLIPMITIITVPFLMKL 188 Query: 183 PRR 185 ++ Sbjct: 189 LKK 191
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.026 Identities = 9/23 (39%), Positives = 14/23 (60%) Query: 34 MVTLLGPSGCGKTTILRLVAGLE 56 V L G G GK+T++ + GL+ Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD 620
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 35.2 bits (81), Expect = 4e-04 Identities = 62/399 (15%), Positives = 122/399 (30%), Gaps = 31/399 (7%) Query: 38 VNYVLPALQTDLGLD---KGDIGLLGSLFYLTYGLSKFTAGLWHDCHGQRWFMGVGLFTT 94 + VLP L DL G+L +L+ L G D G+R + V L Sbjct: 24 IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGA 83 Query: 95 GLLNVVFAFGESLTLLLAVWSLNGFFQGWGWPPCARLLTHWYSRNERGFWWGCWNMSINI 154 + + A L +L + G G + +ER +G + Sbjct: 84 AVDYAIMATAPFLWVLYIGRIVAGITGATG-AVAGAYIADITDGDERARHFGFMSACFGF 142 Query: 155 GGAIIPLISAFAAHWWGWQSAMLTPGIISMALGIWLTLQLKGTPQEEGLPSVGAWRQDPL 214 G P++ + + ++ + L + + E P Sbjct: 143 GMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRP---------- 191 Query: 215 ELRQEQQSPPMGLWQMLRTTMLQNSMIWLLGVSYVLVYLIRIALNDWGNIWLTESHGVNL 274 LR+E +P R + L+ V +++ + ++ W + + Sbjct: 192 -LRREALNPLAS----FRWARGMTVVAALMAVFFIMQLVGQVPAALWV---IFGEDRFHW 243 Query: 275 LSANATVMLFEAGGLLGALFAGWGSDLLFSGQRAPMILLFTLGLMVSVAALWLAPVHHYA 334 + + L A G+L +L + + + L+ + + L + Sbjct: 244 DATTIGISL-AAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWM 302 Query: 335 LLAVCFFTVGFFVFGPQMLIGLAAVECGHK--AAAGSITGFLGLFAYLGAALAGWPLSLV 392 + + P + L+ + GS+ L + +G L + Sbjct: 303 AFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAAS 362 Query: 393 IERYGWPGMFSLLSVAAVLMGLLLMPLLMASMTTSAAQR 431 I W G +A + LL +P L + + A QR Sbjct: 363 ITT--WNG---WAWIAGAALYLLCLPALRRGLWSGAGQR 396
>PF06580#Sensor histidine kinase Length = 349 Score = 44.5 bits (105), Expect = 5e-07 Identities = 46/205 (22%), Positives = 80/205 (39%), Gaps = 43/205 (20%) Query: 337 QSQLVKRARDPAQTQAAASQIN-------------------ELARRIHHSTRQLLR-QLR 376 Q ++ A++ AQ A +QIN AR + S +L+R LR Sbjct: 151 QWKMASMAQE-AQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLR 209 Query: 377 PPALDELSFKEALHHL-----LNEFAFAERGIRCHFDYQLTATPASETVRFTLYRLLQEL 431 ++S + L + L F +R ++ PA V+ L+Q L Sbjct: 210 YSNARQVSLADELTVVDSYLQLASIQFEDR-----LQFENQINPAIMDVQVPPM-LVQTL 263 Query: 432 LNNVCKHA-----DASEVAITLFQQGEWLRLEVKDNGIGISPDKI--TGFGIQGMRERVS 484 + N KH ++ + + + LEV++ G + TG G+Q +RER+ Sbjct: 264 VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQ 323 Query: 485 ALGGE---LTLESQRG-TWVIVNLP 505 L G + L ++G +V +P Sbjct: 324 MLYGTEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 75.3 bits (185), Expect = 4e-18 Identities = 39/165 (23%), Positives = 67/165 (40%), Gaps = 20/165 (12%) Query: 2 IRVVLVDDHVVVRSGFAQLLSLEEDLDIVGQFSSAAEAWPALLRDDVNVAVMDIAMPDEN 61 +++ DD +R+ Q LS D+ S+AA W + D ++ V D+ MPDEN Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 62 GLSLLKRLRTQKPQFRAIILSIYDTPTFVQSALDAGASGYLTKRCGPEELVQAVRSVGMG 121 LL R++ +P +++S +T A + GA YL K EL+ + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---- 117 Query: 122 GHYLCADALRALRGGEQPARV-------LEGLTPREREVFDLLVK 159 AL + L G + +E++ +L + Sbjct: 118 -------ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 30.8 bits (69), Expect = 0.007 Identities = 18/69 (26%), Positives = 29/69 (42%) Query: 254 DILRDIRERSDLPLGAYQVSGEYAMIKFAAQAGAIDEEKVVLESLGAIKRAGADLIFSYF 313 + ++ + L L QV G A F +D E L I+ A +IF+ Sbjct: 466 NQFLELEKTKQLRLDTDQVYGNIATYNFENGRVRVDTGSNWSEVLPQIQETTARIIFNGK 525 Query: 314 ALDLAEKKI 322 L+L E++I Sbjct: 526 DLNLVERRI 534
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 118 bits (297), Expect = 3e-29 Identities = 94/414 (22%), Positives = 158/414 (38%), Gaps = 54/414 (13%) Query: 609 ATGNYKVRIDNADGKGSIADYKGKELVYVNDKNSTATFSAAN---KADLGAYTYQAKQEG 665 A+G +++ + N+ + L+ S ATF+ AN K D+G Y Y+ G Sbjct: 504 ASGQHRLWVRNSGSE---PASANTLLLVQTPLGSAATFTLANKDGKVDIGTYRYRLAANG 560 Query: 666 NTV------------------------------------VMEQSRLTDYANMALSIP--S 687 N L+ AN A++ Sbjct: 561 NGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNTGGVG 620 Query: 688 ANSNIWNLQQDTVATRLTQSRHGLTDNGGAWGSYFGGSFNGDNGTI-SYDQNVNGVMVGL 746 S +W + + ++ RL + R D GGAWG F DN +DQ V G +G Sbjct: 621 LASTLWYAESNALSKRLGELRL-NPDAGGAWGRGFAQRQQLDNRAGRRFDQKVAGFELGA 679 Query: 747 DSKIDGNDAKWIVGAAAGFVKGDLS---DRSGQVDQDSQTAYLYTSAHFANN-FFLDGSV 802 D + +W +G AG+ +GD D G D Y + + A++ F+LD ++ Sbjct: 680 DHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGHTDSVHVGGY---ATYIADSGFYLDATL 736 Query: 803 NYSHFNNELSANMSNGQYVDGSTSSDAWGFGLKLGYDAKLGHAGYVTPYGSISGLFQSGD 862 S N+ S+G V G + G L+ G ++ P ++ G Sbjct: 737 RASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGRRFTHADGWFLEPQAELAVFRAGGG 796 Query: 863 DYRLSNGMKVGGQSYDSMRYELGVDAGYTFTYGNDQALTPYFKLAYVYD-DASNHADVNG 921 YR +NG++V + S+ LG++ G + + PY K + + + D + NG Sbjct: 797 AYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAGGRQVQPYIKASVLQEFDGAGTVHTNG 856 Query: 922 DSINNGTEGSAVRVGLGTQFSFTRNFSAYSDVSYLGGGDVDQDWAANVGVKYTW 975 + G+ +GLG + R S Y+ Y G + W + G +Y+W Sbjct: 857 IAHRTELRGTRAELGLGMAAALGRGHSLYASYEYSKGPKLAMPWTFHAGYRYSW 910
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 124 bits (314), Expect = 3e-34 Identities = 92/385 (23%), Positives = 163/385 (42%), Gaps = 17/385 (4%) Query: 8 LISVWFGCFFTGLAISQILPFLPLYVSQLGVTSHEALSMWSGLTFSVTFLVSAIVSPMWG 67 LI + + I I+P LP + L ++ G+ ++ L+ +P+ G Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHY--GILLALYALMQFACAPVLG 64 Query: 68 SLADRKGRKLMLLRASLGMAIAILLQAFATNVWQLFILRAVMGLTSGYIPNAMALVASQV 127 +L+DR GR+ +LL + G A+ + A A +W L+I R V G+T A A +A Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADIT 124 Query: 128 PRERSGWALSTLSTAQISGVIGGPLLGGFLADHVGLRAVFFITAILLTVSFLVTLFLIKE 187 + +S G++ GP+LGG + A FF A L ++FL FL+ E Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPE 183 Query: 188 GVRPQTSKADRLSGREVLASLPYPGL---VISLFFTTLVIQLCNGSIGPILALF-IKSMA 243 + + + R LAS + V +L ++QL + +F Sbjct: 184 SHKGER-RPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFH 242 Query: 244 PDSNNIAFLAGMIAAVPGVSALISAPRLGKLGDRIGTSRILLATLCCAVVMFFAMSFVT- 302 D+ I +AA + +L A G + R+G R L+ + + ++F T Sbjct: 243 WDATTIGI---SLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATR 299 Query: 303 TPLQLGVLRFLLGFADGAMLPAVQTLLLKYSSDKVTGRIFGYNQSFMYLGNVVGPLIGA- 361 + ++ L G +PA+Q +L + ++ G++ G + L ++VGPL+ Sbjct: 300 GWMAFPIMVLLASG--GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357 Query: 362 --SVSAMAGFRWVFIATAVIVLINL 384 + S W +IA A + L+ L Sbjct: 358 IYAASITTWNGWAWIAGAALYLLCL 382
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 164 bits (417), Expect = 3e-52 Identities = 92/251 (36%), Positives = 134/251 (53%), Gaps = 8/251 (3%) Query: 7 LSGKRALITGSARGIGYLLAEGLAEYGAEIIINDRTQQKADAAAQALCAQGYRATGVAFD 66 + GK A ITG+A+GIG +A LA GA I D +K + +L A+ A D Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 67 VTRSAEVEQAVARIEAQIGAIDILINNAGIQRRYPFTEFPEDEWDQVIEVNQKGVFLVSQ 126 V SA +++ ARIE ++G IDIL+N AG+ R ++EW+ VN GVF S+ Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 127 QVAKYMMRRRSGKIINICSMQSELGRKTITPYAASKGAVKMLTRGMCVELAEYNIQVNGI 186 V+KYMM RRSG I+ + S + + R ++ YA+SK A M T+ + +ELAEYNI+ N + Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 187 APGYFATEMTAALVNDRD--------FSAWLYQRTPAARWGKPEELIGAAVYLASPAANF 238 +PG T+M +L D + P + KP ++ A ++L S A Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245 Query: 239 VNGHLLFVDGG 249 + H L VDGG Sbjct: 246 ITMHNLCVDGG 256
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 29.8 bits (67), Expect = 0.023 Identities = 26/139 (18%), Positives = 54/139 (38%), Gaps = 2/139 (1%) Query: 246 IGVYGFMMWMPSILKNAAQMDIVAVGWLAAVP-YLAAICLMLTVSWLSDKFQNRKLFIWP 304 V GF+ +P ++K+ Q+ +G + P ++ I L D+ + Sbjct: 270 GTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIG 329 Query: 305 LLLIAAVAFFGSWMIGNQSFWFSYGLLVLAAACMYAPYGPFFALIPELLPRNVSGVSMGL 364 + ++ V+F + + + WF ++V + ++ L + +G M L Sbjct: 330 VTFLS-VSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSL 388 Query: 365 INSFGALGAFLGAWLVGYL 383 +N L G +VG L Sbjct: 389 LNFTSFLSEGTGIAIVGGL 407
>TACYTOLYSIN#Bacterial thiol-activated pore-forming cytolysin signature. Length = 574 Score = 30.7 bits (69), Expect = 0.008 Identities = 16/70 (22%), Positives = 36/70 (51%), Gaps = 8/70 (11%) Query: 181 RKIAFFGSMDDPRDLSRFRGTEQAVAACGLKAYHIT----PRTISSVALGRQMFLQMQQS 236 ++I + S + P + + ++V L+ ++ P +S+VA GR +F++++ S Sbjct: 301 KQIFYTVSANLPNNPADVFD--KSVTLKELQRKGVSNEAPPLFVSNVAYGRTVFVKLETS 358 Query: 237 HP--DIDAIF 244 D++A F Sbjct: 359 SKSNDVEAAF 368
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.0 bits (67), Expect = 0.006 Identities = 19/69 (27%), Positives = 28/69 (40%), Gaps = 6/69 (8%) Query: 4 PIFLIGPRGCGKTTIGHALARARHYQFTDTDHALQER----EQRTVATIVEQEGWARFRE 59 + L G G GK+T+ + L F+DT + EQ E FR Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDF--FSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRR 655 Query: 60 LESEALKAA 68 ++EA+KA Sbjct: 656 ADAEAVKAF 664
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 40.6 bits (95), Expect = 3e-05 Identities = 27/152 (17%), Positives = 56/152 (36%), Gaps = 7/152 (4%) Query: 551 GQLEALLKQQVKEKEELDSLLQQEQALTSQ--WQTTISGLHCELQPQDDIPGWLAAQQES 608 G + L E + L + QA Q +Q + P+ +P Q S Sbjct: 121 GDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVS 180 Query: 609 EQQL-----YQHQQRLAWQAQQQAGEQQLRQLQQEQEQRRAQLEAELSPFALSVPQADRT 663 E+++ +Q WQ Q+ E L + + E+ A++ + + + D Sbjct: 181 EEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDF 240 Query: 664 AEWLAQREAESRLWQEKQNQFVALQEQLQQLT 695 + L ++ E++N++V +L+ Sbjct: 241 SSLLHKQAIAKHAVLEQENKYVEAVNELRVYK 272 Score = 34.0 bits (78), Expect = 0.003 Identities = 29/210 (13%), Positives = 67/210 (31%), Gaps = 18/210 (8%) Query: 116 LARCDDGQILADKVKDKLEL-TASLTGLDYGRFTRSMLLSQGQFAAFLNAKPKERAELLE 174 L + AD +K + L A L Y +RS+ L++ + + E Sbjct: 124 LLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEE 183 Query: 175 ELTGTEIYGQISAQVFEKHKLARNELEKLQAQASGV---LLLSDEQQQALQQSLQALTDE 231 L T + + + + L+K +A+ V + + + + L + Sbjct: 184 VLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSL 243 Query: 232 ERLQLAEQTRLQATQQWLLRQQELSAEASQSQIRLQEAQQALEQAQPQLAALLNAQPAEQ 291 Q + + + + E E + +L++ + + A+ + + Sbjct: 244 LHKQAIAKHAVLEQEN---KYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQ------ 294 Query: 292 LRPLWTRQQEQSAELAQTHRQVEEVNTRLQ 321 + E +L QT + + L Sbjct: 295 -----LFKNEILDKLRQTTDNIGLLTLELA 319 Score = 32.5 bits (74), Expect = 0.010 Identities = 25/211 (11%), Positives = 57/211 (27%), Gaps = 10/211 (4%) Query: 247 QWLLRQQELSAEASQSQIRLQEAQQALEQAQPQLAALLNAQPAEQLRPLWTRQQEQSAEL 306 LL+ L AEA + + Q LEQ + Q+ + L Q+ Sbjct: 122 DVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSE 181 Query: 307 AQTHRQVEEVNTRLQDRLRLRAGIRLAASRQMTRLQDAHHALNLWLKEHDSYRQWGNS-- 364 + R + + + L ++ +N + + + Sbjct: 182 EEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFS 241 Query: 365 -LAGWRAVFQQQARDAQQQ-NAVQQSLAETTRKLSELPPAALTLDADQVTASLAQHAAAR 422 L +A+ + + + + L +L ++ L Sbjct: 242 SLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEIL-----SAKEEYQLVTQLF 296 Query: 423 PLRQQLSTLHSRLLPLRQRQQQLQTAEQARR 453 + L L + +L E+ ++ Sbjct: 297 K-NEILDKLRQTTDNIGLLTLELAKNEERQQ 326 Score = 30.6 bits (69), Expect = 0.043 Identities = 19/165 (11%), Positives = 47/165 (28%), Gaps = 31/165 (18%) Query: 529 QARRDSLEREVKQLTEEGAQLRGQLEALLKQQVKEKEELDSLLQQEQALTSQWQTTISGL 588 + + + + + Q + R + +L + + + + +S Sbjct: 192 KEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS--------- 242 Query: 589 HCELQPQDDIPGWLAAQQESEQQLYQHQQRLAWQAQQQAGEQQLRQLQQEQEQRRAQLEA 648 L +Q + Q+ +A + + + Q E E A+ E Sbjct: 243 -------------LLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEY 289 Query: 649 ELSPFALSVPQADRTAEWLAQ-REAESRLWQEKQNQFVALQEQLQ 692 +L E L + R+ + + +E+ Q Sbjct: 290 QL-------VTQLFKNEILDKLRQTTDNI-GLLTLELAKNEERQQ 326
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 26.9 bits (59), Expect = 0.039 Identities = 17/61 (27%), Positives = 24/61 (39%), Gaps = 5/61 (8%) Query: 6 IVSEVDLQEVRNAVENATREVESRFDFR--NVEASFELNEKNETIKVLSESDFQINQLLD 63 IV +D+Q + N T E+ V+ F L KN T+ L QLL Sbjct: 202 IVETLDIQHI-EEENNTTETAETLPGLNQLPVKLEFVLYRKNVTLAELEA--MGQQQLLS 258 Query: 64 I 64 + Sbjct: 259 L 259
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 85.3 bits (211), Expect = 4e-20 Identities = 57/231 (24%), Positives = 97/231 (41%), Gaps = 13/231 (5%) Query: 16 GLGTVFSLRMLGMFMVLPVLTTY--GMALQGASEALIGLAIGIYGLAQAVFQIPFGLLSD 73 L TV L +G+ +++PVL + A G+ + +Y L Q G LSD Sbjct: 10 ILSTVA-LDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSD 68 Query: 74 RIGRKPLIVGGLLIFVIGSVIAALSDSIWGIILGRALQG-SGAIAAAVMALLSDLTREQN 132 R GR+P+++ L + I A + +W + +GR + G +GA A A ++D+T Sbjct: 69 RFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDE 128 Query: 133 RTKAMAFIGVSFGVTFAIAMVLGPIITHQLGLHALFWMIAILATIGILLTLWVVPNSHNH 192 R + F+ FG VLG ++ HA F+ A L + L +++P SH Sbjct: 129 RARHFGFMSACFGFGMVAGPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCFLLPESHKG 187 Query: 193 VLNRESGMVKGCFSKVLAEPKLLKLNFGIMCLHIMLMSTFVA-LPGQLEAA 242 + + G+ + ++ F+ L GQ+ AA Sbjct: 188 ERRPLRREALNPLAS-------FRWARGMTVVAALMAVFFIMQLVGQVPAA 231
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 53.7 bits (129), Expect = 8e-10 Identities = 34/181 (18%), Positives = 77/181 (42%), Gaps = 3/181 (1%) Query: 26 VFLGFCVIALDG-FDIAIMGFIAPTLKHEWGVTNYELGFVISAALIGLALGAILSGPLAD 84 + + C+++ + ++ P + +++ +V +A ++ ++G + G L+D Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74 Query: 85 WLGRKKIIVNSVFFFGFWTIVTAFSQN-IEQMIFFRFMTGLGLGAAMPNIGTLVSEYAPE 143 LG K++++ + F +++ + +I RF+ G G A + +V+ Y P+ Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134 Query: 144 RQRSFLITVIFCGFTFGAAAGGFSASWLIPRFGWHSLMALGGILPLLFAPLLIWLLPESV 203 R +I G G + W L+ + I ++ P L+ LL + V Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMIT-IITVPFLMKLLKKEV 193 Query: 204 R 204 R Sbjct: 194 R 194
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 35.6 bits (82), Expect = 4e-04 Identities = 41/196 (20%), Positives = 73/196 (37%), Gaps = 15/196 (7%) Query: 221 RNNAWLI-LLLIVLYKLGDAFAMSLTTTFLIRGVGFDAGEVGMVNKTLGLFATIVGALYG 279 R+N LI L ++ + + + ++++ + VN L +I A+YG Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 280 GVLMQRLTLFRALLIFGVLQGVSNAGYWLLSITDKHLYSMATAVFFENLCGGMGTAAFVA 339 L +L + R LL ++ S+ +S + + G G AAF A Sbjct: 71 K-LSDQLGIKRLLLFGIIINC-------FGSVIGFVGHSFFSLLIMARFIQGAGAAAFPA 122 Query: 340 LLM----TLCNKSFSATQFALLSALSAVGRVYVGP-IAGWFVEAHGWPTFYLFSVFAAVP 394 L+M K F L+ ++ A+G VGP I G W L + + Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMG-EGVGPAIGGMIAHYIHWSYLLLIPMITIIT 181 Query: 395 GIVLLLLCRQTLEYTQ 410 L+ L ++ + Sbjct: 182 VPFLMKLLKKEVRIKG 197
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.8 bits (67), Expect = 0.024 Identities = 16/73 (21%), Positives = 28/73 (38%), Gaps = 13/73 (17%) Query: 60 ERSALPTPHEIRHHLDDYVIGQEPAKKVLAVAVYNHYKRLRNGDTSNGVELGKSNILLIG 119 E P+ E ++G+ A + +Y RL D +++ G Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQ----EIYRVLARLMQTD---------LTLMITG 167 Query: 120 PTGSGKTLLAETL 132 +G+GK L+A L Sbjct: 168 ESGTGKELVARAL 180
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 34.3 bits (78), Expect = 0.002 Identities = 34/133 (25%), Positives = 68/133 (51%), Gaps = 15/133 (11%) Query: 191 ERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMDDAPD- 249 LE A +E + +L R +++ ++ S+ +Q++A ++L E + + Sbjct: 291 AALEAEKADLEHQSQVLNAN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEA 344 Query: 250 ENEALKRKIDAAKMPKEAKEKTEAELQKLKMMSPMS-AEATVVRGYIDWMVQVPWSARSK 308 ++L+R +DA++ EAK++ EAE QKL+ + +S A +R +D + A+ + Sbjct: 345 SRQSLRRDLDASR---EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE----AKKQ 397 Query: 309 VKKDLRQAQEILD 321 V+K L +A L Sbjct: 398 VEKALEEANSKLA 410
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 116 bits (293), Expect = 4e-38 Identities = 49/88 (55%), Positives = 65/88 (73%) Query: 2 NKSQLIDKIAAGADISKAAAGRALDALIASVTESLQAGDDVALVGFGTFAVKERAARTGR 61 NK LI K+A +++K + A+DA+ ++V+ L G+ V L+GFG F V+ERAAR GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 62 NPQTGKEITIAAAKVPGFRAGKALKDAV 89 NPQTG+EI I A+KVP F+AGKALKDAV Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.0 bits (65), Expect = 0.021 Identities = 12/64 (18%), Positives = 25/64 (39%), Gaps = 10/64 (15%) Query: 193 LAVLSQHLGLSMQDCMAFGDAMNDREMLGSVGRGVIMGN----------AMPQLKAELPH 242 VL+Q L + D +A + + +++ + +P++K P Sbjct: 16 RTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPD 75 Query: 243 LPVI 246 LPV+ Sbjct: 76 LPVL 79
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1376 bits (3564), Expect = 0.0 Identities = 803/1032 (77%), Positives = 908/1032 (87%) Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLSILKLPVAQYPTIAPPAVSITATYPGADAKTVQDT 60 M NFFI RPIFAWV+AII+M+AG L+IL+LPVAQYPTIAPPAVS++A YPGADA+TVQDT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDNLMYMSSNSDSTGTVQITLTFQSGTDADIAQVQVQNKLQLAMPLLPQ 120 VTQVIEQNMNGIDNLMYMSS SDS G+V ITLTFQSGTD DIAQVQVQNKLQLA PLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGVSVEKSSSSFLMVVGVINTNGTMTQEDISDYVGANMKDAISRTSGVGDVQLFGS 180 EVQQQG+SVEKSSSS+LMV G ++ N TQ+DISDYV +N+KD +SR +GVGDVQLFG+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWMDPNKLNNYQLTPVDVISAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240 QYAMRIW+D + LN Y+LTPVDVI+ +K QN Q+AAGQLGGTP + GQQLNASIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 KSADEFSNILLKVNQDGSQVRLRDVAKVELGGENYDIVAKFNNQPASGLGIKLATGANAL 300 K+ +EF + L+VN DGS VRL+DVA+VELGGENY+++A+ N +PA+GLGIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTANAIRAELAKMEPYFPSGLKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360 DTA AI+A+LA+++P+FP G+K++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NFRATLIPTIAVPVVLLGTFAILAIFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 N RATLIPTIAVPVVLLGTFAILA FG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 AEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 E+ LPPKEAT KSM QIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATMLKPIAKGGHGEHKGFFGWFNRMFDKSTHHYTDSVGNILRSTGRY 540 SVLVALILTPALCAT+LKP++ H GFFGWFN FD S +HYT+SVG IL STGRY Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 541 LVLYLIIVVGMAWLFVRLPSSFLPDEDQGVFLSMAQLPAGATQERTQKVLDEMTDYYLTK 600 L++Y +IV GM LF+RLPSSFLP+EDQGVFL+M QLPAGATQERTQKVLD++TDYYL Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 601 EKANVESVFAVNGFGFAGRGQNTGIAFVSLKDWSERPGSENKVEAITGRAMARFSQIKDA 660 EKANVESVF VNGF F+G+ QN G+AFVSLK W ER G EN EA+ RA +I+D Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 661 MVFAFNLPAIVELGTATGFDFQLIDQGGLGHEKLTQARNQLFGEVAKHPDLLVGVRPNGL 720 V FN+PAIVELGTATGFDF+LIDQ GLGH+ LTQARNQL G A+HP LV VRPNGL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 721 EDTPQFKVDIDQEKAQALGVSISDINTTLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYRM 780 EDT QFK+++DQEKAQALGVS+SDIN T+ A GG+YVNDFIDRGRVKK+YV ++AK+RM Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 781 LPEDIGNWYVRGSDGQMVPFSAFSTSHWEYGSPRLERYNGLPSMEILGQAAPGRSTGEAM 840 LPED+ YVR ++G+MVPFSAF+TSHW YGSPRLERYNGLPSMEI G+AAPG S+G+AM Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 Query: 841 TMMEELAKKLPTGIGYDWTGMSYQERLSGNQAPALYAISLIVVFLCLAALYESWSIPFSV 900 +ME LA KLP GIGYDWTGMSYQERLSGNQAPAL AIS +VVFLCLAALYESWSIP SV Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 Query: 901 MLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGLI 960 MLVVPLG++G LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG++ Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 Query: 961 EATLEAVRMRLRPILMTSLAFILGVMPLVISSGAGSGAQNAVGTGVMGGMVTATVLAIFF 1020 EATL AVRMRLRPILMTSLAFILGV+PL IS+GAGSGAQNAVG GVMGGMV+AT+LAIFF Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 Query: 1021 VPVFFVVVRRRF 1032 VPVFFVV+RR F Sbjct: 1021 VPVFFVVIRRCF 1032
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 41.0 bits (96), Expect = 7e-06 Identities = 30/215 (13%), Positives = 69/215 (32%), Gaps = 32/215 (14%) Query: 99 ATYQASYESAKGDLAKAQAAANIAQLTVKRYQKLLGTKYISQQDYDTAVADA-QQSNAAV 157 K L + ++ A+ + Q + + D +Q+ + Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLVT----------QLFKNEILDKLRQTTDNI 311 Query: 158 VAAKAAVETARINLAYTKVTSPISGRIGKSAV-TEGALVQNGQATALATVQQLDPIYVDV 216 + + + +P+S ++ + V TEG +V + T + V + D + V Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTA 370 Query: 217 TQSSNDFLRLKQELA----------NGTLKQENGKAKVELVTNDGLKYPQGGTLEFSDVT 266 + D + KV+ + D ++ + G + ++ Sbjct: 371 LVQNKDIGFINVGQNAIIKVEAFPYTRYGYLV---GKVKNINLDAIEDQRLGLVFNVIIS 427 Query: 267 VDQTTGSITLRAIFPNPDHTLLPGMFVRARLQEGT 301 +++ S + I L GM V A ++ G Sbjct: 428 IEENCLSTGNKNIP------LSSGMAVTAEIKTGM 456 Score = 30.6 bits (69), Expect = 0.013 Identities = 21/87 (24%), Positives = 38/87 (43%), Gaps = 8/87 (9%) Query: 48 APLQITTELPGR-TSAYRVAEVRPQVSGIILKRNFTEGSDIQAGVSLYQIDPATYQASYE 106 ++I G+ T + R E++P + I+ + EG ++ G L ++ +A Sbjct: 78 GQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA--- 134 Query: 107 SAKGDLAKAQAAANIAQLTVKRYQKLL 133 D K Q++ A+L RYQ L Sbjct: 135 ----DTLKTQSSLLQARLEQTRYQILS 157
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 195 bits (497), Expect = 2e-65 Identities = 173/212 (81%), Positives = 197/212 (92%) Query: 1 MARKTKQQALETRQHILDVALRLFSQQGVSSTSLAAIAKAAGVTRGAIYWHFKNKSDLFN 60 MARKTKQ+A ETRQHILDVALRLFSQQGVSSTSL IAKAAGVTRGAIYWHFK+KSDLF+ Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EIWVLSDASISDLEVEYRAKFPNDPLSVVREILVHILEATVTEERRRLMMEIIFHKCEFV 120 EIW LS+++I +LE+EY+AKFP DPLSV+REIL+H+LE+TVTEERRRL+MEIIFHKCEFV Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 121 GEMAVVQKAQRSLCLESYERIEHTLKECIAANMLPANLLTRRAAVLMRSYLSGLMENWLF 180 GEMAVVQ+AQR+LCLESY+RIE TLK CI A MLPA+L+TRRAA++MR Y+SGLMENWLF Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180 Query: 181 APDSFDLEKEARDYVAILLEMYQFCPTLRAPS 212 AP SFDL+KEARDYVAILLEMY CPTLR P+ Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPA 212
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 36.6 bits (84), Expect = 5e-04 Identities = 50/264 (18%), Positives = 102/264 (38%), Gaps = 17/264 (6%) Query: 27 VNQAL---AADVPDRSDIQNQLNTLNKQKELTPQDKLVQQDLVQTLEALDKIERIKSETT 83 VN+AL A+ P +++ + N + ++ + ++ + EA +K + E Sbjct: 98 VNEALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQ---EAE 154 Query: 84 QLRQQVEQAPAKMRQAVDSLNALSEVSDDEATRKTLSTLSLRQLESRLSQTLDDLQTAQN 143 Q R+++E+ A + L EA K L+ LS + + L AQ+ Sbjct: 155 QRRKEIEREKA---ETERQLKLA------EAEEKRLAALS--EEAKAVEIAQKKLSAAQS 203 Query: 144 DLATYNSQLVSLQTQPERVQNAMYAASQQLQQIRNRLNGTTTTGDEALRPSQQSLLLAQQ 203 ++ + ++ +L ++ +A A + L RN L + E ++ A Sbjct: 204 EVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYKELDELVKKLSPRAND 263 Query: 204 ALLNAQIEQQRKSLEGNTVLQDTLQKQRDYVSANSNRLEHQLQLLQEAVNSKRLTLTEKT 263 L N + + G +++ QKQ NR+ + +Q+A++ Sbjct: 264 PLQNRPFFEATRRRVGAGKIREEKQKQVTASETRINRINADITQIQKAISQVSNNRNAGI 323 Query: 264 AQEAVTPDETARIQSNPLVKQELD 287 A+ + + Q+N L Q D Sbjct: 324 ARVHEAEENLKKAQNNLLNSQIKD 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 28.6 bits (64), Expect = 0.037 Identities = 15/49 (30%), Positives = 23/49 (46%), Gaps = 2/49 (4%) Query: 6 TEENLLAFTTAARFGSFSKAADELGLTTSAISYTIKRMETGLDVVLFTR 54 E L+ A G+ KAAD LGL + + I+ + G+ V +R Sbjct: 436 MEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL--GVSVYRSSR 482
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 29.4 bits (66), Expect = 0.012 Identities = 23/120 (19%), Positives = 45/120 (37%), Gaps = 9/120 (7%) Query: 128 IALSAEPSRSTTFAAMERIKLAGGRVSFDPNIRADLWQD-----PELLHAC-LDRALRLA 181 +A+ P++ F+ ER++ ++ PN + D ++ A + R LR+ Sbjct: 32 VAVLRNPNKQPMFSVQERLEQIAKAIAHLPNAQVDSFEGLTVNYARQRQAGAILRGLRVL 91 Query: 182 NVVKLSEEELALIAGKEDLAEGVTALTQRYQPELLLVTQGKAGVLAAFQQQCTHFSAQPV 241 + E EL + + LA + + E ++ +A F HF V Sbjct: 92 SDF---ELELQMANTNKTLASDLETVFLTTSTEYSFLSSSLVKEVARFGGNVEHFVPSHV 148
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 81.3 bits (200), Expect = 2e-20 Identities = 60/251 (23%), Positives = 102/251 (40%), Gaps = 6/251 (2%) Query: 3 RVVVITGGGTGIGAACARLMAAAGETVFITGRRLTPLQALAEETGAVALAGDAANGD--- 59 ++ ITG GIG A AR +A+ G + L+ + A A +A D Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 60 -EWQAHLLPTILERAGHIDVLICSAGGMGNSAVVETSDRQWRAALESNLNSAFASVRACL 118 + I G ID+L+ AG + + SD +W A N F + R+ Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 119 PSLIARR-GNVLFVASIASLAAGPQACGYVTAKHAVIGLMRSVARDYGPQGVRANAICPG 177 ++ RR G+++ V S + Y ++K A + + + + +R N + PG Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188 Query: 178 WVTTPMADEEMQPLMDAQGISLEQAYQQVCRDVPLRRPASAQEIALACQFLCSPHASIIS 237 T M + + ++ + + +PL++ A +IA A FL S A I+ Sbjct: 189 STETDM-QWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247 Query: 238 GATLVADGGAS 248 L DGGA+ Sbjct: 248 MHNLCVDGGAT 258
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 101 bits (252), Expect = 4e-25 Identities = 76/407 (18%), Positives = 159/407 (39%), Gaps = 25/407 (6%) Query: 21 MLPLIDTSITNVALDSITHSLDASATQLELIVALYGVSFAVCLAMGSKLGDNYGRRRLFM 80 +++ + NV+L I + + + + ++F++ A+ KL D G +RL + Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 81 WGVALFGLASLLCGMANSIGGLL-AARTLQGAGAALIVPQILTTLHVTLKGSAHARAISL 139 +G+ + S++ + +S LL AR +QGAGAA ++ + + +A L Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 140 YGGIGGIAFIVGQMGGGWLVSADIAGLGWRNAFFINVPICLLVLAFSRRYVPETRREAHS 199 G I + VG GG + + W ++ + +P+ ++ + + Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY----IHW--SYLLLIPMITIITVPFLMKLLKKEVRIKG 197 Query: 200 AIDWQGTFSLALILCCLLFPMALGPELHWPWTLQLLLVAILPLLAWMRTSALRKQQRGEQ 259 D +G +++ + + L+V++L L +++ ++ Sbjct: 198 HFDIKGIILMSVGIVFFMLFTTSYSISF-------LIVSVLSFLIFVKH-----IRKVTD 245 Query: 260 PLLPPRLLKLTSIRFGMAIALLFFSAWSGFMFCMALTMQAGLGMAPWQSGNSFIALG-VA 318 P + P L K G+ + F +GF+ + M+ ++ + G+ I G ++ Sbjct: 246 PFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMS 305 Query: 319 YFVSALYAPRLIARFSMGRILLIGLAVQIAGLLLLMATFGHFGAQTSSLAMVPSTALIGY 378 + L+ R +L IG+ L F +T+S M + Sbjct: 306 VIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTAS-----FLLETTSWFMTIIIVFVLG 360 Query: 379 GQALIVNSFYRIGMRDISASDAGAGSAILSTLQQATLGLGPAILGSL 425 G + I + +AGAG ++L+ + G G AI+G L Sbjct: 361 GLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 57.7 bits (139), Expect = 1e-12 Identities = 25/111 (22%), Positives = 52/111 (46%), Gaps = 4/111 (3%) Query: 1 MARP---KSEDKKQALLEAATAAFAQSGV-AASTSAIARSAGVAEGTLFRYFATKDELLN 56 MAR ++++ +Q +L+ A F+Q GV + S IA++AGV G ++ +F K +L + Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 57 ELYLSIKSGLVKAMVSGLTPNEKRPKENARNIWDGYIDWSIRHPMEHKAIR 107 E++ +S + + + P R I ++ ++ + Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLME 111
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 41.4 bits (97), Expect = 4e-06 Identities = 30/98 (30%), Positives = 44/98 (44%), Gaps = 10/98 (10%) Query: 62 EGTIVQR-HFQDGQYVRKGDRLFTLDDAQPRAALALAEAELKSAQASLRQAQQLLTRYQQ 120 E +IV+ ++G+ VRKGD L L AEA+ Q+SL QA+ TRYQ Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALG-------AEADTLKTQSSLLQARLEQTRYQI 155 Query: 121 LKNNHSISRNDVDNARMQRDVAAAAVEQAKARVTTQQI 158 L SI N + ++ + V + + T I Sbjct: 156 LSR--SIELNKLPELKLPDEPYFQNVSEEEVLRLTSLI 191
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 902 bits (2332), Expect = 0.0 Identities = 410/1035 (39%), Positives = 608/1035 (58%), Gaps = 17/1035 (1%) Query: 1 MLTFFIRRPRFAMVIALLLTFIGAVSLKLIPVEQYPQITPPVVNVSASWPGASAADVAEA 60 M FFIRRP FA V+A++L GA+++ +PV QYP I PP V+VSA++PGA A V + Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 IAAPLETQLNGVDHMLYMESTSSDDGSYSLSLTFAAGTDADLAAIDVQNRVSQAVAQLPV 120 + +E +NG+D+++YM STS GS +++LTF +GTD D+A + VQN++ A LP Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 DAQQNGVQVRKRASNLLMGVSLYSPLETLSPLFVSNYASTQVREALARLPGVGEVQMFGA 180 + QQ G+ V K +S+ LM S + +S+Y ++ V++ L+RL GVG+VQ+FGA Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 RDYSMRVWLRPDRMNALNVTTDDIAQALREQNVQGAAGQVGTPPVFNGQQQTLTINGIGR 240 + Y+MR+WL D +N +T D+ L+ QN Q AAGQ+G P GQQ +I R Sbjct: 181 Q-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 241 LSDAESFSQIAIRSGEHGQLVRLAEVATIELGARSYSSGAQLNGKDSAYLGIYPTPTANA 300 + E F ++ +R G +VRL +VA +ELG +Y+ A++NGK +A LGI ANA Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299 Query: 301 LQVANAVRAELTRLHSRFPADLTWEVKFDTTRFVAATIKEIGVSLALTLLAVVAVVSLFL 360 L A A++A+L L FP + +DTT FV +I E+ +L ++ V V+ LFL Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359 Query: 361 QSWRATLIVTLAIPVSLIGSFAVLYLLGYSANTLSLFAIILALTMVVDDAIVVVESVETH 420 Q+ RATLI T+A+PV L+G+FA+L GYS NTL++F ++LA+ ++VDDAIVVVE+VE Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 421 LAE-GVGRAEATALALRQIAGPVIATTLVLLAVFVPVALLPGIVGELYRQFAVTLSTAVT 479 + E + EAT ++ QI G ++ +VL AVF+P+A G G +YRQF++T+ +A+ Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 480 VSSIVALTLTPALCALLLRPRPAQP----AAIWRAFNCALATLRDGYGALVEWMNRRLWL 535 +S +VAL LTPALCA LL+P A+ + FN + Y V + Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 536 ALVATLAAAGLVAFSFSQMPKGFLPQEDQGYFFASVQLPEAASLERTEAVMAQARELLLA 595 L+ + F ++P FLP+EDQG F +QLP A+ ERT+ V+ Q + L Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 596 NSA--VEDVIQVSGFNILNGTSASNGGFISVMLKEWHQRPPLNE----VMGKLQGQLMGL 649 N VE V V+GF+ A N G V LK W +R V+ + + +L + Sbjct: 600 NEKANVESVFTVNGFSF--SGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKI 657 Query: 650 PEATIMAFAPPTLPGLGNASGFNLRILAQAGQSSAELEQVTRQVLRMANQHP-QLSQVFT 708 + ++ F P + LG A+GF+ ++ QAG L Q Q+L MA QHP L V Sbjct: 658 RDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717 Query: 709 TWSSNVPQLTLKVDRDQASRLDVPVSRIFNSLQTAFGGTRAGDFSINNRVYPVVVQNEMQ 768 + Q L+VD+++A L V +S I ++ TA GGT DF RV + VQ + + Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777 Query: 769 WRERAEQIGELYVRSHHGERIRLSNLVTVTPTVGAPFIQQYNQFPSVSVSGSAAQGVSSR 828 +R E + +LYVRS +GE + S T G+P +++YN PS+ + G AA G SS Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837 Query: 829 TAMAVMEHILLTNLPQGYDYAWSGISWQEQQTGNQAIWIVLAAVAMAWLFLVAQYESWTL 888 AMA+ME++ + LP G Y W+G+S+QE+ +GNQA +V + + +L L A YESW++ Sbjct: 838 DAMALMENL-ASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896 Query: 889 PASVMLSVLFAIAGALLWLWIAGYANDVYVQIGLVLLIALAAKNAILIVEFAR-ARREEG 947 P SVML V I G LL + NDVY +GL+ I L+AKNAILIVEFA+ +EG Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956 Query: 948 MSVVDAAREGATRRFRAVLMTAVSFIIGIMPMMLATGAGAQSRRIIGTTVFSGMLVATAI 1007 VV+A R R +LMT+++FI+G++P+ ++ GAG+ ++ +G V GM+ AT + Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016 Query: 1008 GIVFIPSLFVLFQRL 1022 I F+P FV+ +R Sbjct: 1017 AIFFVPVFFVVIRRC 1031 Score = 76.0 bits (187), Expect = 4e-16 Identities = 89/526 (16%), Positives = 183/526 (34%), Gaps = 51/526 (9%) Query: 531 RRLWLALVATLAAAGLVAFSFSQMPKGFLPQEDQGYFFASVQLPEAASLERTEAVMAQAR 590 RR A V + A + Q+P P S P A + + V + Sbjct: 7 RRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV---TQ 63 Query: 591 ELLLANSAVEDVIQVSGFNILNGTSASNGGFISVMLKEWHQRPP---LNEVMGKLQGQLM 647 + + +++++ +S ++ + G +++ L P +V KLQ Sbjct: 64 VIEQNMNGIDNLMYMSS-------TSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATP 116 Query: 648 GLPEATIMAFAPPTLPGLGNASGFNLRILAQAGQSSAELEQVTRQVLRMANQHPQLSQV- 706 LP+ + ++S + + + + ++ V LS++ Sbjct: 117 LLPQE----VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVK--DTLSRLN 170 Query: 707 ----FTTWSSNVPQLTLKVDRDQASRLDVPVSRIFNSLQTAFGGTRAGDF------SINN 756 + + + + +D D ++ + + N L+ AG Sbjct: 171 GVGDVQLFGAQY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQ 229 Query: 757 RVYPVVVQNEMQWRERAEQIGELYVR-SHHGERIRLSNLVTVTPTVGA-PFIQQYNQFPS 814 ++ Q + E+ G++ +R + G +RL ++ V I + N P+ Sbjct: 230 LNASIIAQTRFK---NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPA 286 Query: 815 VSVSGSAAQGVSSR-TAMAVMEHI--LLTNLPQG------YDYAWSGISWQEQQTGNQAI 865 + A G ++ TA A+ + L PQG YD + Sbjct: 287 AGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFE 346 Query: 866 WIVLAAVAMAWLFLVAQYESWTLPASVMLSVLFAIAGALLWLWIAGYANDVYVQIGLVLL 925 I+L + M LFL ++ ++V + G L GY+ + G+VL Sbjct: 347 AIMLVFLVMY-LFL----QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLA 401 Query: 926 IALAAKNAILIVE-FARARREEGMSVVDAAREGATRRFRAVLMTAVSFIIGIMPMMLATG 984 I L +AI++VE R E+ + +A + ++ A++ A+ +PM G Sbjct: 402 IGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGG 461 Query: 985 AGAQSRRIIGTTVFSGMLVATAIGIVFIPSLFVLFQRLREWGHRRM 1030 + R T+ S M ++ + ++ P+L + H Sbjct: 462 STGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHEN 507
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 85.5 bits (211), Expect = 6e-22 Identities = 53/186 (28%), Positives = 82/186 (44%), Gaps = 3/186 (1%) Query: 6 VVFITGATSGFGEAAAQVFAEAGWSLVLSGRRLARLQKLYDRLSLLV-PVHIIELDVRDS 64 + FITGA G GEA A+ A G + +L+K+ L DVRDS Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69 Query: 65 EAVAAAVAELPAAFADIKTLINNAGLALAPQPAQKVDLQDWKTMIDTNVTGLVNVTHALL 124 A+ A + I L+N AG+ L P + ++W+ N TG+ N + ++ Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 125 PTLIQHGAGASIINVGSIAGQWPYPGSHVYGASKAFVKQFSYNLRCDLLGTGVRVTDLAP 184 ++ +G SI+ VGS P Y +SKA F+ L +L +R ++P Sbjct: 129 KYMMDRRSG-SIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 185 GIAETE 190 G ET+ Sbjct: 188 GSTETD 193
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 59.3 bits (143), Expect = 3e-13 Identities = 21/108 (19%), Positives = 42/108 (38%), Gaps = 2/108 (1%) Query: 10 RQRQLIDATLEAINAVGMHDATIAQIARRAGVSTGIISHYFKDKNGLLEATMRDITGQLR 69 ++ ++D L + G+ ++ +IA+ AGV+ G I +FKDK+ L + Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71 Query: 70 LAVLSRLHALPGASAERRLRAIVAGNFDDSQISSAAMKAWLAFWASSM 117 L PG LR I+ + + ++ + + Sbjct: 72 ELELEYQAKFPGDPLS-VLREILIHVLEST-VTEERRRLLMEIIFHKC 117
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 130 bits (327), Expect = 9e-39 Identities = 79/254 (31%), Positives = 130/254 (51%), Gaps = 12/254 (4%) Query: 13 LQGRVAFVTGAGSGIGQMIAYGLASAGARVVGFDLREDGGLAETVSHIEAIGGEACFYTG 72 ++G++AF+TGA GIG+ +A LAS GA + D + L + VS ++A A + Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEK-LEKVVSSLKAEARHAEAFPA 64 Query: 73 DVRQLSDLRAGVALAKSRFGRLDIAVNAAGIANANPALEMETEQWQRVIDINLTGVWNSC 132 DVR + + A + G +DI VN AG+ + E+W+ +N TGV+N+ Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124 Query: 133 KAEAELMLESGGGSIINIASMSGIIVNRGLEQAHYNSAKAGVIHLSKSLAMEWVGKGIRV 192 ++ ++ M++ GSI+ + S + + A Y S+KA + +K L +E IR Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSM--AAYASSKAAAVMFTKCLGLELAEYNIRC 182 Query: 193 NSISPGYTATPM--------NTRPEMVHQTRE-FENQTPIQRMAKVEEMAGPALFLASDA 243 N +SPG T T M N +++ + E F+ P++++AK ++A LFL S Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242 Query: 244 ASFCTGVDLVVDGG 257 A T +L VDGG Sbjct: 243 AGHITMHNLCVDGG 256
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 136 bits (344), Expect = 3e-41 Identities = 76/262 (29%), Positives = 119/262 (45%), Gaps = 12/262 (4%) Query: 3 RNFDNKTIVITGACRGIGAGIAERFARDGARLVMVSNAPR---VHETAETLRQRYQADIL 59 + + K ITGA +GIG +A A GA + V P ++ R+ Sbjct: 4 KGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE--- 60 Query: 60 SLEIDVTDEAQVQSLYQQAAARFGTIDVSIQNAGVITIDYFDRMPKADFEKVLAVNTTGV 119 + DV D A + + + G ID+ + AGV+ + ++E +VN+TGV Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120 Query: 120 WLCCREAAKYMVKQNYGSLINTSSGQGRQGFIYTPHYAASKMGVIGITQSLAHELAPWNI 179 + R +KYM+ + GS++ S YA+SK + T+ L ELA +NI Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180 Query: 180 TVNAFCPGIIESEMWDYNDRVWGEILSSEQKRYGKGELMAEWVEGIPMKRAGKPEDVAGL 239 N PG E++M +W + +EQ G E + GIP+K+ KP D+A Sbjct: 181 RCNIVSPGSTETDM---QWSLWADENGAEQVIKGSLE---TFKTGIPLKKLAKPSDIADA 234 Query: 240 VAFLASDDARYLTGQTINIDGG 261 V FL S A ++T + +DGG Sbjct: 235 VLFLVSGQAGHITMHNLCVDGG 256
>ENTSNTHTASED#Enterobactin synthetase component D signature. Length = 234 Score = 192 bits (490), Expect = 3e-64 Identities = 100/183 (54%), Positives = 128/183 (69%), Gaps = 1/183 (0%) Query: 1 MQTTHSTFLFAGHRLHQIDFDPATFAPHDILWLPHHQQLENCGRKRQMEHLAGRIAAACA 60 M T+H FAGHRLH +DFD ++F HD+LWLPHH +L + GRKR+ EHLAGRIAA A Sbjct: 1 MLTSHFPLPFAGHRLHIVDFDASSFREHDLLWLPHHDRLRSAGRKRKAEHLAGRIAAVHA 60 Query: 61 LKAVGIKGVPGTGDQRQPLWPVPWSGSISHCDTRALAVVAARPVGIDIENVLTPALATEL 120 L+ VG++ VPG GD+RQPLWP GSISHC T ALAV++ + +GIDIE +++ ATEL Sbjct: 61 LREVGVRTVPGMGDKRQPLWPDGLFGSISHCATTALAVISRQRIGIDIEKIMSQHTATEL 120 Query: 121 ESSIISPTERAVLKASGLPFELALTLAFSAKESGFKALPLTQQSGTGFMHFRITDIQGEV 180 SII ER +L+AS LPF LALTLAFSAKES +KA + GF ++T + Sbjct: 121 APSIIDSDERQILQASLLPFPLALTLAFSAKESVYKAFSDR-VTLPGFNSAKVTSLTATH 179 Query: 181 VTL 183 ++L Sbjct: 180 ISL 182
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 33.7 bits (77), Expect = 0.001 Identities = 40/187 (21%), Positives = 71/187 (37%), Gaps = 8/187 (4%) Query: 24 IARFISILSLGLLGVAIPVQIQMMTHSTWQVGLSVTLTGCSMFVGLMVGGVLADRYERKR 83 I F S+L+ +L V++P T + +G V G L+D+ KR Sbjct: 21 ILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80 Query: 84 LILLARGTCGVGFVGLCLNALLPEPSLIAIYLLGIWDGFFASLGVTALLAATPALVGREN 143 L+L G V + ++A ++ G F +L ++ + +EN Sbjct: 81 LLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL----VMVVVARYIPKEN 136 Query: 144 LMQAGAITMLTVRLGSVISPMIGGLLLATGGVAWNYGLAAAGTFITTLTLLRLPLLPPPP 203 +A + V +G + P IGG++ + W+Y L IT +T+ L L Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIP--MITIITVPFLMKLLKKE 192 Query: 204 QPREHPL 210 + Sbjct: 193 VRIKGHF 199
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 60.3 bits (146), Expect = 1e-12 Identities = 64/291 (21%), Positives = 104/291 (35%), Gaps = 35/291 (12%) Query: 39 AHTLESKPLRIVSTSVTLTGSLLAIDAPVVASGATTPNNRVADDRGFLRQWSAVAKERKL 98 AH P RIV+ LLA+ VAD + R W E L Sbjct: 28 AHAAAIDPNRIVALEWLPVELLLALGIVPYG---------VADTINY-RLW---VSEPPL 74 Query: 99 ARLYIG-----EPSAEAVAAQMPDLILISATGGDSALALYDQLSTIAPTLIINYDDKSWQ 153 I EP+ E + P ++ SA G S + L+ IAP N+ D Sbjct: 75 PDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQP 130 Query: 154 AL-----LTELGHITGQEKQATARIAEFDKQLATLKQQMKLPPQPVSALVYTVAAHSANL 208 LTE+ + + A +A+++ + ++K + L + + Sbjct: 131 LAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLV 190 Query: 209 WTPESAQGQMLEQLGFKLATLPAGLHASQSQGKRHDIIQLGGENLAAGLNGNSLFLFAGD 268 + P S ++L++ G +A Q + + + LAA + + L + Sbjct: 191 FGPNSLFQEILDEYGIP--------NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDN 242 Query: 269 GKDADAIYANPLLAHLPAVENKRVYPLGTETFRLDYYSAMLVLQRLATLFG 319 KD DA+ A PL +P V R + F SAM ++ L G Sbjct: 243 SKDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIG 293
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 417 bits (1072), Expect = e-150 Identities = 145/299 (48%), Positives = 197/299 (65%), Gaps = 20/299 (6%) Query: 1 MAIPKLQAYTLPEASDIPANKVNWAFEPSRAALLIHDMQEYFLNFWGEDSAMMAKVVANI 60 MAIP +Q Y +P ASD+P NKV+W +P+RA LLIHDMQ YF++ + ++ + ++ ANI Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60 Query: 61 AALRDFCKQNNIPVFYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQRVIAALAPDEHDTV 120 L++ C Q IPV YTAQP Q+ +DRALL D WGPGL P ++++I LAP++ D V Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120 Query: 121 LVKWRYSAFHRSPLEEMLKESGRDQLIITGVYAHIGCMTTATDAFMRDIKPFFVADALAD 180 L KWRYSAF R+ L EM+++ GRDQLIITG+YAHIGC+ TA +AFM DIK FFV DA+AD Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180 Query: 181 FSREEHLMSLKYVAGRSGRVVMTEELL--------PLPASKT-----------ALRALVL 221 FS E+H M+L+Y AGR VMT+ LL + + +R + Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240 Query: 222 PLLDESDEPMD-DENLIDYGLDSVRMMALAARWRKVHGDIDFVMLAKNPTIDAWWALLS 279 LL E+ E + E+L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W LL+ Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 330 bits (847), Expect = e-117 Identities = 105/265 (39%), Positives = 150/265 (56%), Gaps = 20/265 (7%) Query: 1 MAALDFQGQTVWVTGAGKGIGYATALAFVEAGARVSGFD---------------LAFEGD 45 M A +G+ ++TGA +GIG A A GA ++ D A + Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60 Query: 46 DYPFATHTLDVANAQQVAEVCGHLLKGIDRLDVLVNAAGILRMGATDELSLEAWQQTFAV 105 +P DV ++ + E+ + + + +D+LVN AG+LR G LS E W+ TF+V Sbjct: 61 AFP-----ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSV 115 Query: 106 NVGGAFNLFQQTMGQFRRQQGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALTVGLEL 165 N G FN + ++ G+IVTV S+ A PR M+AY +SKAA +GLEL Sbjct: 116 NSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLEL 175 Query: 166 AGSGVRCNLVSPGSTDTDMQRTLWVSEDAEQRRIRGFGEQFKLGIPLGKIARPQEIASTI 225 A +RCN+VSPGST+TDMQ +LW E+ ++ I+G E FK GIPL K+A+P +IA + Sbjct: 176 AEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAV 235 Query: 226 LFLASDHASHITLQDIVVDGGSTLG 250 LFL S A HIT+ ++ VDGG+TLG Sbjct: 236 LFLVSGQAGHITMHNLCVDGGATLG 260
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 31.3 bits (71), Expect = 0.015 Identities = 9/60 (15%), Positives = 25/60 (41%), Gaps = 1/60 (1%) Query: 172 VIILAVLAMIVVKALTHSPWG-TYTVAFTIPLAIFMGIYIRYLRPGRIGEVSVIGLVMLV 230 ++ ++ + + + A + W +V +PL I + L + ++GL+ + Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTI 934
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 135 bits (340), Expect = 6e-41 Identities = 85/253 (33%), Positives = 128/253 (50%), Gaps = 15/253 (5%) Query: 5 LTGKRALVTGASRGIGRAIALSLAQAGADVVITYEKSADKAQAVADEIVALGRRGAAIQA 64 + GK A +TGA++GIG A+A +LA GA + + + +K + V + A R A A Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEAFPA 64 Query: 65 DSANAQAIQQAVTRTVETLGGLDILVNNAGIARGGPLESLSLEDIDALINVNIRGVVIAI 124 D ++ AI + R +G +DILVN AG+ R G + SLS E+ +A +VN GV A Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124 Query: 125 QAALPHLPA--GGRIINIGSCLANRVASPGIAVYAMTKSALNSLTRGLARDLGPRGITVN 182 ++ ++ G I+ +GS A V +A YA +K+A T+ L +L I N Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAG-VPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183 Query: 183 LVHPGPTDSDMNPANGEQADSQRQLIA-----------VGHYGQPDDVAAAVTFLASPAA 231 +V PG T++DM + + Q+I + +P D+A AV FL S A Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243 Query: 232 GQISGTGLDVDGG 244 G I+ L VDGG Sbjct: 244 GHITMHNLCVDGG 256
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 36.8 bits (85), Expect = 2e-04 Identities = 29/177 (16%), Positives = 69/177 (38%), Gaps = 12/177 (6%) Query: 41 VTFYVCRLSFTVAKSALV----------ELGISPTELGMIGSALFFSYAIGKLVNGFVAD 90 + ++C LSF + +V + P + +A +++IG V G ++D Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74 Query: 91 HANVVRYMSLGLLLSAGMNLMMGMTSNALLLAVFWG-INGWAQSMGVGPCAVSLARWYGY 149 + R + G++++ +++ + + L + I G + V +AR+ Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134 Query: 150 KERGTFYGIWSTAHNIGEALTYIVIAAVIAAFGWQMGYLSTAALGAVGVALLLLFMR 206 + RG +G+ + +GE + + + W L + V L+ ++ Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMIT-IITVPFLMKLLK 190 Score = 29.5 bits (66), Expect = 0.028 Identities = 22/116 (18%), Positives = 41/116 (35%), Gaps = 3/116 (2%) Query: 284 SGIIGVNAIAGIAGTIIAGMLSDRLFPRNRSVMAGFISLLNTAGFALMLWSPHGYYTDIL 343 S II ++ I I G+L DR P + + F + + Sbjct: 296 SVIIFPGTMSVIIFGYIGGILVDRRGPLY---VLNIGVTFLSVSFLTASFLLETTSWFMT 352 Query: 344 AMIIFGATIGALTCFLGGLIAVDISSRKAAGAALGTIGIASYAGAGLGEFLTGIII 399 +I+F + T + I ++ AGA + + S+ G G + G ++ Sbjct: 353 IIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 37.1 bits (86), Expect = 1e-04 Identities = 13/32 (40%), Positives = 22/32 (68%) Query: 345 LETLLQESGNVVRAAERLGIHRNTLHQRIQRI 376 L L GN ++AA+ LG++RNTL ++I+ + Sbjct: 442 LAALTATRGNQIKAADLLGLNRNTLRKKIREL 473
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 100 bits (249), Expect = 3e-27 Identities = 74/252 (29%), Positives = 112/252 (44%), Gaps = 24/252 (9%) Query: 6 GKVAVITGGTQGVGAAIARQLAENGASGIIICGRNQEKGRLV--ADEIMSRTAAQVTFVR 63 GK+A ITG QG+G A+AR LA GA I N EK V + + +R A Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAH-IAAVDYNPEKLEKVVSSLKAEARHAEAF---P 63 Query: 64 ADLSSVDDCRAVIAKADELFKRVDVLVNAGGMTDRGSILDTTPERFDSIFATNVRGPFFL 123 AD+ + A+ + +D+LVN G+ G I + E +++ F+ N G F Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123 Query: 124 MQETIKIMRRENITGSIVNICSMSSLAGQPFIAAYCSSKGALATLTRNTAYALLRNRIRV 183 + K M +GSIV + S + + +AAY SSK A T+ L IR Sbjct: 124 SRSVSKYMMDRR-SGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182 Query: 184 NGLNIG----------WMASEGEDRIMKTYHGAQDDWLEKAAASQPFGRLIQPEEVARAV 233 N ++ G W G ++++K LE P +L +P ++A AV Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGS-------LETFKTGIPLKKLAKPSDIADAV 235 Query: 234 AFLASDESGLMT 245 FL S ++G +T Sbjct: 236 LFLVSGQAGHIT 247
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 47.5 bits (113), Expect = 3e-08 Identities = 35/171 (20%), Positives = 58/171 (33%), Gaps = 35/171 (20%) Query: 80 IMKALDAGAWGIICPMINTAEQARELVSCVRYPPAGSRSFGPTRVTFSAGQDYGQHADEQ 139 +++A G ++ PMI T E+ R+ + ++ S G V S + G Sbjct: 378 LLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEG---VDVSDSIEVG------ 428 Query: 140 VVCFAMIETAEAVRNLDAILDTPGLDGVYIGPADLTLGLTGRRYRTGFDRE--------- 190 M+E + +D IG DL +Y DR Sbjct: 429 ----IMVEIPSTAVAANLFA--KEVDFFSIGTNDLI------QYTMAADRMNERVSYLYQ 476 Query: 191 --EPEIVAAIQQILAAAHRAGKRAGL---HNGTPEYAAKAVSWGFDLVTVS 236 P I+ + ++ AAH GK G+ G + G D ++S Sbjct: 477 PYHPAILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMS 527
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 31.0 bits (70), Expect = 0.014 Identities = 14/74 (18%), Positives = 22/74 (29%), Gaps = 7/74 (9%) Query: 55 NFNEQHILAIAQAIAEDRAKNGITGPCYVGK------DTHALSEPAFISVLEVLAANGVD 108 +F I ++++ K I P G D + + L VL G Sbjct: 33 SFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEEQ-YYQFFLSVLDVYGFA 91 Query: 109 VIVQENNGFTPTPA 122 VI N + Sbjct: 92 VINMNNGVLKVVRS 105
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 85.3 bits (211), Expect = 2e-21 Identities = 33/122 (27%), Positives = 57/122 (46%), Gaps = 1/122 (0%) Query: 4 VLIIEDEHAIRRFLRTALEADGMRVFEADTLQRGLIEAATRKPDVVILDLGLPDGDGNEF 63 +L+ +D+ AIR L AL G V A D+V+ D+ +PD + + Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 64 IRDVRQ-WSQMPIIVLSARTEEHDKIAALDAGADDYLSKPFGIGELQARLRVAMRRHAGA 122 + +++ +P++V+SA+ I A + GA DYL KPF + EL + A+ Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125 Query: 123 QA 124 + Sbjct: 126 PS 127
>PF06580#Sensor histidine kinase Length = 349 Score = 34.5 bits (79), Expect = 0.002 Identities = 13/73 (17%), Positives = 28/73 (38%), Gaps = 8/73 (10%) Query: 760 HIQLALPDPLLLVHVDGPLFERVLINLLENALKYA----GSKAQIGITAQDDEQQLRLEV 815 + + ++ V V P ++ L+EN +K+ +I + D + LEV Sbjct: 241 QFENQINPAIMDVQV--PPM--LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEV 296 Query: 816 WDNGPGIPTGQEQ 828 + G ++ Sbjct: 297 ENTGSLALKNTKE 309
>BACINVASINB#Salmonella/Shigella invasin protein B signature. Length = 593 Score = 30.5 bits (68), Expect = 0.017 Identities = 36/115 (31%), Positives = 57/115 (49%), Gaps = 4/115 (3%) Query: 361 ILTLSARWSAAY-GHSSMPLMVLGLAVMGFAELFIDPVAMSQITRIEIPGVTGVLTGIYM 419 +LT+ + +A + G +S+ L +GLAVM E+ +S I + P + VL + Sbjct: 324 LLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAATGVSFIQQALNPIMEHVLKPLME 383 Query: 420 LLSGAIANYLAGVIAD-QTAQGAFDEAGATSYAID--AYINLFSQITWGALACVG 471 L+ AI L G+ D +TA+ A GA AI A I + + + GA A +G Sbjct: 384 LIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMVAVIVVVAVVGKGAAAKLG 438
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 31.7 bits (72), Expect = 0.002 Identities = 11/41 (26%), Positives = 23/41 (56%), Gaps = 1/41 (2%) Query: 14 VDDAPRMQDYTLEAEEGRDM-MLLDALMQLKEKDPSLSFRR 53 +++ + T+E + + MLLDAL+++ + DP L + Sbjct: 339 IENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYV 379
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 30.6 bits (69), Expect = 0.011 Identities = 25/196 (12%), Positives = 56/196 (28%), Gaps = 11/196 (5%) Query: 48 EVPASADGILDAVLEDEGATVLSRQILGRLREGNSAGKESAAKADAKESTPAQ-RQQASL 106 E+ + I+ ++ EG +V +L +L + ++ ++ Q R Q Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157 Query: 107 EEQSNDAL------SPAIRRLLAEHNLDAAAIKGTGVGGRLTREDVEKHLAAAPAKAE-A 159 + L + ++E + + +K L +AE Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERL 217 Query: 160 KAPAAAAAPVAQLGHRSEKRVPMTRLRKRVA---ERLLEAKNSTAMLTTFNEVNMKPIMD 216 A + + L + A +LE +N V + Sbjct: 218 TVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQ 277 Query: 217 LRKQYGDAFEKRHGIR 232 + + A E+ + Sbjct: 278 IESEILSAKEEYQLVT 293 Score = 28.6 bits (64), Expect = 0.045 Identities = 21/103 (20%), Positives = 44/103 (42%), Gaps = 4/103 (3%) Query: 26 KPGDSVQRDEVLVEIETDKVVLEVPASADGILDAVLEDEGATVLSRQI----LGRLREGN 81 K G+SV++ +VL+++ + + +L A LE +LSR I L L+ + Sbjct: 113 KEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPD 172 Query: 82 SAGKESAAKADAKESTPAQRQQASLEEQSNDALSPAIRRLLAE 124 ++ ++ + T ++Q S + + + AE Sbjct: 173 EPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAE 215
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 63.2 bits (153), Expect = 1e-12 Identities = 38/272 (13%), Positives = 84/272 (30%), Gaps = 8/272 (2%) Query: 55 VDPGAVVNNYNRQQQQQAS-SRRAAEQREKQAQQQ----AEELREKQAAEQERLKQLEQD 109 VD + N Q + S R +A A + + ++ + Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTV 1051 Query: 110 RLQAQEAAKQAKEEQKQAEEAAAKAAAKAAAAAKAKADSQAKEAQEAAAKAAADAKA--K 167 Q+A + + ++ A+EA + A A++ S+ KE Q K A + K Sbjct: 1052 EKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEK 1111 Query: 168 ADAQAKAAEAAAAKAAADAKKQAEAEAAKAAADAQKKAE-AEAAKKAQQQAEKKAQQEAA 226 A + + + + + KQ ++E + A+ ++ + K+ Q Q A E Sbjct: 1112 AKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQP 1171 Query: 227 KQAAAEKAAAEKAAEKAAEKAAAQKAASEKAAAEKAAAAEKAAAEKAEKAAAAKAAAAEK 286 + + + E + + K ++ + Sbjct: 1172 AKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVP 1231 Query: 287 AAADKAAKAAAAKAAAAKAAAAKKAAAAKDAD 318 + A ++ ++ A A +D Sbjct: 1232 HNVEPATTSSNDRSTVALCDLTSTNTNAVLSD 1263 Score = 57.4 bits (138), Expect = 8e-11 Identities = 29/255 (11%), Positives = 72/255 (28%), Gaps = 11/255 (4%) Query: 67 QQQQQASSRRAAEQREKQAQQQAEELREKQAAEQERLKQLEQDRLQAQEAAKQAKEEQKQ 126 +Q++ + EQ + + ++ A++ + + Q E A+ E K+ Sbjct: 1043 NSKQESKTVEKNEQDATET-----TAQNREVAKEAKSNV--KANTQTNEVAQS-GSETKE 1094 Query: 127 AEEAAAKAAAKAAAAAKAKADSQAKEAQEAAAKAAADAKAKADAQAKAAEAAAAKAAADA 186 + K A KAK +++ + + + +++ AE A Sbjct: 1095 TQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVN 1154 Query: 187 KKQAEAEAAKAAADAQKKAEAEAAKK---AQQQAEKKAQQEAAKQAAAEKAAAEKAAEKA 243 K+ +++ A Q E + + + A + Sbjct: 1155 IKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSE 1214 Query: 244 AEKAAAQKAASEKAAAEKAAAAEKAAAEKAEKAAAAKAAAAEKAAADKAAKAAAAKAAAA 303 + + + ++ A + A A+A A A Sbjct: 1215 SSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALN 1274 Query: 304 KAAAAKKAAAAKDAD 318 A + + + + Sbjct: 1275 VGKAVSQHISQLEMN 1289 Score = 54.7 bits (131), Expect = 6e-10 Identities = 24/198 (12%), Positives = 60/198 (30%), Gaps = 4/198 (2%) Query: 99 EQERLKQLEQDRLQAQEAAKQAKEEQKQAEEAAAKAAAKAAAAAKAKADSQAKEAQEAAA 158 E E+ Q D + A A + ++ + A Sbjct: 984 EVEKRNQT-VDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAE 1042 Query: 159 KAAADAKAKADAQAKAAEAAAAKAAADAKKQAEAEAAKAAAD-AQKKAEAEAAKKAQQQA 217 + ++K + A E A + ++ +A + AQ +E + + + Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETK--ETQTTET 1100 Query: 218 EKKAQQEAAKQAAAEKAAAEKAAEKAAEKAAAQKAASEKAAAEKAAAAEKAAAEKAEKAA 277 ++ A E ++A E ++ + ++ + Q+ + + A E + Sbjct: 1101 KETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQS 1160 Query: 278 AAKAAAAEKAAADKAAKA 295 A + A + + Sbjct: 1161 QTNTTADTEQPAKETSSN 1178 Score = 54.3 bits (130), Expect = 6e-10 Identities = 21/234 (8%), Positives = 68/234 (29%), Gaps = 3/234 (1%) Query: 65 NRQQQQQASSRRAAEQREKQAQQQAEELREKQAAEQERLKQLEQDRLQAQEAAKQAKEEQ 124 NR+ ++A S A + + Q E +E Q E + +E++ E K + + Sbjct: 1065 NREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPK 1124 Query: 125 KQAEEAAAKAAAKAAAAAKAKADSQAKEAQEAAAKAAADAKAKADAQAKAAEAAAAKAAA 184 ++ + + ++ + +A+ + K + A+ ++ Sbjct: 1125 VTSQVSPKQEQSE---TVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQ 1181 Query: 185 DAKKQAEAEAAKAAADAQKKAEAEAAKKAQQQAEKKAQQEAAKQAAAEKAAAEKAAEKAA 244 + + + + + + +++ + A ++ Sbjct: 1182 PVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSS 1241 Query: 245 EKAAAQKAASEKAAAEKAAAAEKAAAEKAEKAAAAKAAAAEKAAADKAAKAAAA 298 + + A ++ A + KA + + + + Sbjct: 1242 NDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQYN 1295 Score = 53.9 bits (129), Expect = 1e-09 Identities = 31/260 (11%), Positives = 74/260 (28%), Gaps = 4/260 (1%) Query: 59 AVVNNYNRQQQQQASSRRAAEQREKQAQQQAEELREKQAAEQERLKQLEQDRLQAQEAAK 118 V N ++ + + + A + Q ++ A+E + A + + + + Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098 Query: 119 QAKEEQKQAEEAAAKA-AAKAAAAAKAKADSQAKEAQEAAAKAAADAKAKADAQAKAAEA 177 + KE +E AK K K + K+ Q + A+ + D E Sbjct: 1099 ETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEP 1158 Query: 178 AAAKAAADAKKQAEAEAAKAAADAQKKAEAEAAKKAQQQAEKKAQQEAAKQAAAEKAAAE 237 + +Q E + ++ + + + A Q ++ Sbjct: 1159 QSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPEN-TTPATTQPTVNSESSN 1217 Query: 238 KAAEKAAEKAAAQKAASEKAAAEKAAAAEKAAAEKAEKAAAAKAAAAEKAAADKAAKAAA 297 K + + E A + A + + A ++ A + Sbjct: 1218 KPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLT--STNTNAVLSDARAKAQFVALNV 1275 Query: 298 AKAAAAKAAAAKKAAAAKDA 317 KA + + + + Sbjct: 1276 GKAVSQHISQLEMNNEGQYN 1295
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 36.4 bits (84), Expect = 2e-05 Identities = 20/127 (15%), Positives = 49/127 (38%), Gaps = 12/127 (9%) Query: 65 LSDQDKT---NYAYHLAKKGEYQAALNLLDSLKNGNTAEAWNYR-----GYATRKLGRTD 116 +S + A++ + G+Y+ A + +L + ++ R G + +G+ D Sbjct: 31 ISSDTLEQLYSLAFNQYQSGKYEDAHKVFQAL---CVLDHYDSRFFLGLGACRQAMGQYD 87 Query: 117 EGIGYYQRALALEPNYAKAREYLGEAWMVKGRRDLAQEQLKVISGICGQSCEEYRDLQAA 176 I Y ++ + + E + KG A+ L + + E+++L Sbjct: 88 LAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADK-TEFKELSTR 146 Query: 177 INGHPES 183 ++ E+ Sbjct: 147 VSSMLEA 153
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 30.1 bits (68), Expect = 0.021 Identities = 20/86 (23%), Positives = 35/86 (40%), Gaps = 16/86 (18%) Query: 51 VRNGRIEAI--------VPENDAPAGRSIDL---GGRLLTPGLIDCHTHLVFGGSRAQEW 99 +++GRI AI P G ++ G+++T G +D H H + Q Sbjct: 90 LKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHFI---CPQQIE 146 Query: 100 EQRLNGVSYQTISANGGGINSTVRAT 125 E ++G++ + G G AT Sbjct: 147 EALMSGLT--CMLGGGTGPAHGTLAT 170
>PF08280#M protein trans-acting positive regulator Length = 530 Score = 30.6 bits (69), Expect = 0.009 Identities = 15/52 (28%), Positives = 22/52 (42%), Gaps = 3/52 (5%) Query: 235 ITDFAADRLKVEEIHFLPYHTLGMNKYQLLSQPYTAPDKPLAAPELLAFAQH 286 +TD + I F Y+ L N YQ+ P PD + +L+ F H Sbjct: 441 LTDSFPRYFSDKSIDFHSYYLLQDNVYQI---PDLKPDLVITHSQLIPFVHH 489
>PF07520#Virulence protein SrfB Length = 1041 Score = 33.0 bits (75), Expect = 7e-04 Identities = 19/77 (24%), Positives = 31/77 (40%), Gaps = 14/77 (18%) Query: 87 VKFLANSMSFNNSPQSRL--------------ILTMMGAVAEFERSMIASRRNEGIALAK 132 + +A++M N P SR IL++ A + E++MI SR + + L K Sbjct: 467 AEVIAHAMVQINDPASRSRRSQSDLPRRLNRVILSLPTATSVQEQAMIRSRVSGALTLVK 526 Query: 133 EARKYKGKQKNKELHSK 149 E K + K Sbjct: 527 EMLGTKDGTSTIAVEGK 543
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 89.9 bits (223), Expect = 8e-23 Identities = 34/120 (28%), Positives = 63/120 (52%), Gaps = 1/120 (0%) Query: 4 KILLVEDDDDIAALLRLNLQDEGYQIVHEADGAQALVQLEKGGWDAAILDLMLPNVDGLE 63 IL+ +DD I +L L GY + ++ A + G D + D+++P+ + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 64 ICRRIRQMTRYLPVIIISARSSETHRVLGLEMGADDYLAKPFSVIELVARV-KALFRRQE 122 + RI++ LPV+++SA+++ + E GA DYL KPF + EL+ + +AL + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124
>PF06580#Sensor histidine kinase Length = 349 Score = 42.2 bits (99), Expect = 3e-06 Identities = 31/129 (24%), Positives = 51/129 (39%), Gaps = 26/129 (20%) Query: 367 RLQLSLAGGLPPVVADLSMMERVLTNLLDNAIRH----TPDGGSISLTARQQGAEMVVEV 422 RLQ + P + D+ + ++ L++N I+H P GG I L + + +EV Sbjct: 239 RLQFENQ--INPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEV 296 Query: 423 ADSGPGVSGELRATLFERPSVLEPGQSPESRGGLGLMIVRRMLQLHGGD---IQLVDVPA 479 ++G L + ES G GL VR LQ+ G I+L + Sbjct: 297 ENTGS----------------LALKNTKES-TGTGLQNVRERLQMLYGTEAQIKLSEKQG 339 Query: 480 GASFRFTLP 488 + +P Sbjct: 340 KVNAMVLIP 348
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 33.7 bits (77), Expect = 3e-04 Identities = 18/54 (33%), Positives = 28/54 (51%), Gaps = 8/54 (14%) Query: 6 KTQRVRRIIEMLDVNHSTTINQL-----AEYFTVSHMTIRRDIEELQKNGQVKV 54 K QR +I E++ N T ++L + + V+ T+ RDI+EL VKV Sbjct: 3 KGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKEL---HLVKV 53
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 41.2 bits (96), Expect = 3e-05 Identities = 53/328 (16%), Positives = 112/328 (34%), Gaps = 25/328 (7%) Query: 273 DYMRHANERRVHLDKALEYRRDLFTSRAQLAAEQYKHVDMARELQEHNGAEGDLEADYQA 332 A++ + + + + L + A+ K + E + DLE + Sbjct: 107 SLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEG 166 Query: 333 ASDHLNLVQTALRQQEKIERYEADLDELQIRLEEQNEVVAEATDLQEENEARAEAAELEV 392 + + KI+ EA+ L+ R E + + A + + A+ + E E Sbjct: 167 ------AMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEK 220 Query: 393 DELKSQLADYQQALDVQQTRAIQYNQALQALERAKALCHLPDLTPESADEWLETFQAKEQ 452 L ++ AD ++AL+ + + ++ LE E+ Sbjct: 221 AALAARKADLEKALEGAMNFSTADSAKIKTLE-------------AEKAALEARQAELEK 267 Query: 453 EATEKMLSLEQKMSVAQTAHGQFEQAYQLVAAINGPLARNEAWDVARE-LLRDGVNQRHL 511 M + +T + A + +++ + R+ L RD R Sbjct: 268 ALEGAMNFSTADSAKIKTLEAEKAALEAEKADLE---HQSQVLNANRQSLRRDLDASREA 324 Query: 512 AGQAQGLRSRLTELEQRL-REQQDAERQLSEFCKRQGK-RYEIDDLEALHQELEARIASL 569 Q + +L E + +Q R L + + + E LE ++ EA SL Sbjct: 325 KKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSL 384 Query: 570 SDSVSSASEQRMNLRQELEQLQSRTQSL 597 + ++ E + + + LE+ S+ +L Sbjct: 385 RRDLDASREAKKQVEKALEEANSKLAAL 412 Score = 35.4 bits (81), Expect = 0.002 Identities = 52/287 (18%), Positives = 97/287 (33%), Gaps = 18/287 (6%) Query: 835 EEEIRKLNSRRGELERALNAHESDNQQNRVQFEQ---AKEGVSALNRLLPRLNLLADDTL 891 +I++L +R+ +LE+AL + + + + + K ++A L + A + Sbjct: 112 ASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFS 171 Query: 892 ADRVDEIQERLDEAQEAARFIQQYGNQLAKLEPIVSVLQSDPEQFEQLKEDYAYAQQMQR 951 +I+ E + L + + + E K A + Sbjct: 172 TADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLE 231 Query: 952 DARQQAFALSEVVQRRAHFSYSDSAEMLNGNSDLNEKLRQRLEQAEAER------SRARD 1005 A + A S + ++ A + ++L + L + + A+ + Sbjct: 232 KALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKA 291 Query: 1006 AMRTHSAQLNQYNQVL----ASLKSSFDTKKELLSDLYKELQDIGVRADSGAEERARARR 1061 A+ A L +QVL SL+ D +E L E Q + + R RR Sbjct: 292 ALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRR 351 Query: 1062 DQLHTQLSNNRSRRNQLEKALTFCEAEMDNLTRRLRKLERDYCEMRE 1108 D L +R + QLE E + + L RD RE Sbjct: 352 D-----LDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE 393 Score = 31.6 bits (71), Expect = 0.027 Identities = 38/307 (12%), Positives = 104/307 (33%), Gaps = 18/307 (5%) Query: 935 QFEQLKEDYAYAQQMQRDARQQAFALSEVVQRRAHFSYSDSAEMLNGNSDLNEKLRQRLE 994 ++ +Q + Q+ E+ SD + D N++L + L Sbjct: 36 NTNEVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELS 95 Query: 995 QAEAERSRARDAMRTHSAQLNQYNQVLASLKSSFDTKKELLSDLYKELQDIGVRADSGAE 1054 A+ + + ++ ++++ + A L+ + + + +++ + + A Sbjct: 96 NAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAA 155 Query: 1055 ERA--RARRDQLHTQLSNNRSRRNQLEKALTFCEAEMDNLTRRLRKLERDY-------CE 1105 +A + + + ++ LE EA L + L Sbjct: 156 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKT 215 Query: 1106 MREQVVSAKAGWCAVMRLVKDNGVQRRLHRRELAYLSAD------ELRSMSDKALGALRL 1159 + + + A + + ++ ++ L A+ + GA+ Sbjct: 216 LEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNF 275 Query: 1160 AVSDNEHLRDVLRVSEDPKRPERKIQFFVAVYQHLRERIRQDIIRTDDPVEAIEQMEIEL 1219 + +D+ ++ + + + ++ V R+ +R+D+ D EA +Q+E E Sbjct: 276 STADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDL---DASREAKKQLEAEH 332 Query: 1220 SRLTEEL 1226 +L E+ Sbjct: 333 QKLEEQN 339
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 67.3 bits (164), Expect = 4e-16 Identities = 33/194 (17%), Positives = 55/194 (28%), Gaps = 10/194 (5%) Query: 19 REQIVEAAFEHFGHYGYEKTTVAELAKSIGFSKSYIYKFFDSKQAIGEVICANRLSLIME 78 R+ I++ A F G T++ E+AK+ G ++ IY F K + I S I E Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72 Query: 79 AVDAAIADAPSASEK------LRRLFGALTEAGSELF--FHDRKLYDIAAVAARDKWPST 130 A P + L +TE L K + +A + Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRN 132 Query: 131 EKYAEHLIKLIEGILVEGRKNGEFERKTPLDEATHAVYLVMCPFVNPVQLQFNLEAAPKA 190 IE L + A + + + K Sbjct: 133 --LCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKE 190 Query: 191 AVLLSSLILRSLAP 204 A +++L Sbjct: 191 ARDYVAILLEMYLL 204
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 40.2 bits (94), Expect = 9e-06 Identities = 18/102 (17%), Positives = 38/102 (37%), Gaps = 7/102 (6%) Query: 70 GKILERFVDTGQTVKRGQPLMRLDPVDLKLQALAQQQAVDAARA-RARKAISDEARYRGL 128 + E V G++V++G L++L + + L Q ++ AR + R I + Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNK 164 Query: 129 VASGAVSASEY------DQIKAAADSAKAELSAAQAQANVAQ 164 + + Y +++ K + S Q Q + Sbjct: 165 LPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKE 206
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 451 bits (1163), Expect = e-144 Identities = 231/1048 (22%), Positives = 424/1048 (40%), Gaps = 57/1048 (5%) Query: 8 LSALAVRERSITLFLIILITIAGIYSFFGLGRAEDPPFTVKQMTIISVWPGATAQEIQDQ 67 ++ +R L I++ +AG + L A+ P +++ + +PGA AQ +QD Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 68 VAEPLEKRLQELKWYDRTETYT-RPGMAFITLSLQDSTPPSQVQEEFYQARKKLGDEAKN 126 V + +E+ + + + + G ITL+ Q T P Q Q + KL Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQV---QVQNKLQLATPL 117 Query: 127 LPAGVIGPMINDEFSDVTFAL---FALKAKGEPQRLLVRDAE-TLRQQLLHVPGVKKVNI 182 LP V I+ E S ++ + F G Q + ++ L + GV V + Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177 Query: 183 IGEQ-AERIYISFSHARLATIGLSPQDIFNALNSQNVLTPAGSIET------RGAQIFIR 235 G Q A RI++ L L+P D+ N L QN AG + + I Sbjct: 178 FGAQYAMRIWLDA--DLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235 Query: 236 LDGAFDELQKIRDTPFIAQ--GKTLKLSDVATVERGYEDPPTLQIRNQSEPALLLGIVMR 293 F ++ G ++L DVA VE G E+ + R +PA LGI + Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVI-ARINGKPAAGLGIKLA 294 Query: 294 EGWNGLALGKALDAEAAKINAAMPLGMTLTKVTDQSVNISASVDEFMLK-FFAALLVVMM 352 G N L KA+ A+ A++ P GM + D + + S+ E + F A +LV ++ Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354 Query: 353 VCFVSMGWRVGVVVAAAVPLTLAIVFVVMEASGINFDRVTLGSLILALGLLVDDAIIAIE 412 + R ++ AVP+ L F ++ A G + + +T+ ++LA+GLLVDDAI+ +E Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414 Query: 413 MMV-VKMEEGYDRIKASAYAWSHTAAPMLAGTLVTAIGFMPNGFAQSTAGEYTSNMFWIV 471 + V ME+ +A+ + S ++ +V + F+P F + G + Sbjct: 415 NVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITI 474 Query: 472 GLALIASWLVAVVFTPYLGVKMLPDIA----KVAGGHAAIYNT---PRYNRFRRMLARVI 524 A+ S LVA++ TP L +L ++ + GG +NT N + + +++ Sbjct: 475 VSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKIL 534 Query: 525 ASKWGVAGSVVAIFVLAVLGMGLVKKQFFPTSDRPEVLVEVQMPYGTSIGLTSAAAAKIE 584 S I V+ + F P D+ L +Q+P G + T ++ Sbjct: 535 GSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVT 594 Query: 585 AWLQKQPEAKMVTAYIGQGSPRFYLAMAPELPDPSFARIVV-----LTDSQQSRDALKLR 639 + K +A + + + G + + + + A + + + S +A+ R Sbjct: 595 DYYLKNEKANVESVFTVNG-----FSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHR 649 Query: 640 LREAV-----ASGLAPEARVRVTQLVFGPYSPFPVAWRVMGPDSDKLRDIADKV-ESVLQ 693 + + + V + + G D L +++ Q Sbjct: 650 AKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQA--GLGHDALTQARNQLLGMAAQ 707 Query: 694 ASPMMRTVNTDWGPKVPALHFTLDQDRLQATGLTSNAVAQQLQFLLSGIPITSVREDIRS 753 + +V + +DQ++ QA G++ + + Q + L G + + R Sbjct: 708 HPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRV 767 Query: 754 VQVTGRAAGDIRLDPAKIEGFTLVGNAGQRIPLAQIGKVEVRMEDPLLRRRDRTPTITVR 813 ++ +A R+ P ++ + G+ +P + P L R + P++ ++ Sbjct: 768 KKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQ 827 Query: 814 GDIADNLQPPDVSAAIMKQLQPIVDSLPPGYRIDMAGSIEESGKATRAMAPLFPIMIALT 873 G P S M ++ + LP G D G + + L I + Sbjct: 828 G----EAAPGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVV 883 Query: 874 LLVIILQVRSMSAMVMVFLTAPLGLVGVVPALLLFNQPFGINALVGLIALSGILMRNTLI 933 L + S S V V L PLG+VGV+ A LFNQ + +VGL+ G+ +N ++ Sbjct: 884 FLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAIL 943 Query: 934 LIGQIHH-NEREGLDPYHAVIEATVQRARPVLLTAMAAVLAFIPLTHSVFWGT-----LA 987 ++ E+EG A + A R RP+L+T++A +L +PL S G+ + Sbjct: 944 IVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVG 1003 Query: 988 YTLIGGTLGGTIMTLVFLPAMYAIWFRI 1015 ++GG + T++ + F+P + + R Sbjct: 1004 IGVMGGMVSATLLAIFFVPVFFVVIRRC 1031 Score = 87.6 bits (217), Expect = 1e-19 Identities = 90/508 (17%), Positives = 182/508 (35%), Gaps = 42/508 (8%) Query: 535 VAIFVLAVLGMGLVKKQFFPTSDRPEVLVEVQMPYGTSIGLTSAAAAKIEAWLQKQPEAK 594 + + + L + + +PT P V V P + + IE + Sbjct: 17 IILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVIEQNMNGIDNLM 76 Query: 595 MVTAY-IGQGSPRFYLAMAPELPDPSFARIVVLTDSQQSRDALKLRLREAVASGLAPEAR 653 +++ GS L DP A+ Q ++ L + L E + Sbjct: 77 YMSSTSDSAGSVTITLTFQSGT-DPDIAQ-------VQVQNKL-----QLATPLLPQEVQ 123 Query: 654 VRVTQLVFGPYSPFPVAWRVMGPDSDKLRDIADKVESVLQASPMMRTVN-----TDWGPK 708 + + S VA V DI+D V S ++ + +N +G + Sbjct: 124 QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVK--DTLSRLNGVGDVQLFGAQ 181 Query: 709 VPALHFTLDQDRLQATGLT----SNAVAQQLQFLLSGIPITSVREDIRSVQVTGRAAGDI 764 A+ LD D L LT N + Q + +G + + + + A Sbjct: 182 Y-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 765 RLDPAKIEGFTLVGNA-GQRIPLAQIGKVEVRMED-PLLRRRDRTPTITVRGDIADNLQP 822 + +P + TL N+ G + L + +VE+ E+ ++ R + P + +A Sbjct: 241 K-NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299 Query: 823 PDVSAAIMKQLQPIVDSLPPGYRI----DMAGSIEESGKATRAMAPLFPIMIALTLLVII 878 D + AI +L + P G ++ D ++ S + I L LV+ Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLS---IHEVVKTLFEAIMLVFLVMY 356 Query: 879 LQVRSMSAMVMVFLTAPLGLVGVVPALLLFNQPFGINALVGLIALSGILMRNTLILIGQI 938 L +++M A ++ + P+ L+G L F + G++ G+L+ + ++++ + Sbjct: 357 LFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENV 416 Query: 939 H-HNEREGLDPYHAVIEATVQRARPVLLTAMAAVLAFIPL-----THSVFWGTLAYTLIG 992 + L P A ++ Q ++ AM FIP+ + + + T++ Sbjct: 417 ERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVS 476 Query: 993 GTLGGTIMTLVFLPAMYAIWFRIRPEVP 1020 ++ L+ PA+ A + Sbjct: 477 AMALSVLVALILTPALCATLLKPVSAEH 504
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 38.0 bits (88), Expect = 1e-06 Identities = 20/53 (37%), Positives = 27/53 (50%), Gaps = 2/53 (3%) Query: 28 DFFVDPSSRKQGVAEALIARAGQLAKESDPAFIMLSTATDNTQA--LYEKNGF 78 D V RK+GV AL+ +A + AKE+ +ML T N A Y K+ F Sbjct: 94 DIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.041 Identities = 8/18 (44%), Positives = 11/18 (61%) Query: 32 IVGPSGSGKTTLLRILAG 49 + G G GK+TL+ L G Sbjct: 601 LEGTGGIGKSTLINTLVG 618
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 34.9 bits (80), Expect = 3e-04 Identities = 11/55 (20%), Positives = 20/55 (36%) Query: 53 AKILEVSSKSPDALGVQLSAFNMGVGSLGGQFISVESLFQSSKVFTDSGTLRGPF 107 A I EV LG+Q + N G+ + + + + + GT+ Sbjct: 351 AIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSL 405
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 118 bits (297), Expect = 2e-34 Identities = 67/252 (26%), Positives = 114/252 (45%), Gaps = 9/252 (3%) Query: 4 LAGKYALITGGTSGIGLATAQTFIAEGAQVAVTGRNP---VALEQAQALLGNNGWVIAAD 60 + GK A ITG GIG A A+T ++GA +A NP + + + AD Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 61 SGDAAGQRQLAATLTDRWPQLDVVFVNAGDVTHASFGDWREAEWDRLMNINLKGPFFLLQ 120 D+A ++ A + +D++ AG + + EW+ ++N G F + Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 121 ALLPLLAN--PASVILCGSVSAHIGLPTSSAYAASKAGLLSLARTLSAELLPHGIRVNGL 178 ++ + + S++ GS A + + +AYA+SKA + + L EL + IR N + Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 179 SPGPVRTPALDKLGLS----AQALSDLQEEIKNLVPLGRMGTPQELANAALYLASDESSY 234 SPG T L Q + E K +PL ++ P ++A+A L+L S ++ + Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245 Query: 235 VLGSELRVDGGT 246 + L VDGG Sbjct: 246 ITMHNLCVDGGA 257
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 78.2 bits (192), Expect = 3e-19 Identities = 63/246 (25%), Positives = 110/246 (44%), Gaps = 23/246 (9%) Query: 8 ALITGASSGIGALYAERLAARGYHLILVARREERLQALAQELQRQYGIRADVLKADLSEE 67 A ITGA+ GIG A LA++G H+ V E+L+ + L+ + A+ AD+ + Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE-ARHAEAFPADVRDS 69 Query: 68 SGIRAVEARL--QSDPTIALVINNAGTAKMGGLLTTDVREHQMIHTLNTTALLRLSYAAL 125 + I + AR+ + P LV N AG + G + + E + ++N+T + S + Sbjct: 70 AAIDEITARIEREMGPIDILV-NVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 126 AAFSPRGQGTLINIASILALHALPGSAVYSASKAWVLSFTRALQEEFSDSGLRIQAVLPA 185 R G+++ + S A A Y++SKA + FT+ L E ++ +R V P Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188 Query: 186 ATATDLWPTSGVALDA---------------LPAGSVMTTEDLVDAALSGL-DQGENITL 229 +T TD+ + + +P + D+ DA L + Q +IT+ Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248 Query: 230 PPVHDL 235 H+L Sbjct: 249 ---HNL 251
>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature. Length = 428 Score = 26.5 bits (58), Expect = 0.037 Identities = 8/21 (38%), Positives = 13/21 (61%) Query: 56 LETIKTVIETAGGSMEDVTFN 76 L +K V+ T G ++D +FN Sbjct: 59 LLKLKPVLITDEGKIDDKSFN 79
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 47.5 bits (113), Expect = 5e-08 Identities = 64/365 (17%), Positives = 122/365 (33%), Gaps = 10/365 (2%) Query: 14 PAKAMVAAISGYAMDGFDLLILGFMLPAISVSLALSTSQA---GSLVTWTLIGAVLGGIF 70 P + ++ +S A+D + ++ +LP + L S G L+ + Sbjct: 3 PNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPV 62 Query: 71 FGHLSDRLGRIRVLTFTILMFSVFTGLCAVAQGYWDLLAYRTLAGMGLGGEFGIGMALIA 130 G LSDR GR VL ++ +V + A A W L R +AG+ G + A IA Sbjct: 63 LGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIA 121 Query: 131 EVWPASKRNRASAWVGIGWQLGVLLAAFITPLLLDIIGWRGMFLVGLLPALVSFAIRRGM 190 ++ +R R ++ + G++ + L+ F L L + Sbjct: 122 DITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181 Query: 191 GEPDAFTKDIAVTQQVSFTTRLRMLFADRATSKASIGILILCSVQNFGYYGLMIWMPSYL 250 E + + ++ R + + + +Q G +W+ + Sbjct: 182 PESHKGERRPLRREALNPLASFRWARGMTVV---AALMAVFFIMQLVGQVPAALWV-IFG 237 Query: 251 SSSFGFSLTKSGL-WTAVTVVGMTFGIWLFGVLADRFARWKIFLIYQVGAVVMVIIYAQL 309 F + T G+ A ++ + G +A R + L+ + A I Sbjct: 238 EDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGE-RRALMLGMIADGTGYILLAF 296 Query: 310 RDPTLMLFTGAVMGMFVNGMIGGYGALISDTYPLQVRATAQNVLFNLGRGVGGFGPLVIG 369 M F V+ + A++S + + Q L L GPL+ Sbjct: 297 ATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFT 356 Query: 370 LCVSH 374 + Sbjct: 357 AIYAA 361 Score = 31.7 bits (72), Expect = 0.005 Identities = 38/138 (27%), Positives = 52/138 (37%), Gaps = 18/138 (13%) Query: 278 LFGVLADRFARWKIFLIYQVGAVVMVIIYAQLRDPTL-MLFTGAVMGMFVNGMIGGYGAL 336 + G L+DRF R + L+ GA V I A P L +L+ G ++ GA Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMAT--APFLWVLYIGRIVAGITGATGAVAGAY 119 Query: 337 ISDTYPLQVRA-------TAQNVLFNLGRGVGGFGPLVIGLCVSHWSFTAAITLLALLYL 389 I+D RA G +GG +G H F AA A L Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGL----MGGFSPHAPFFAA----AALNG 171 Query: 390 LDIFATLFLLPKTQGSED 407 L+ FLLP++ E Sbjct: 172 LNFLTGCFLLPESHKGER 189
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 562 bits (1451), Expect = 0.0 Identities = 245/893 (27%), Positives = 414/893 (46%), Gaps = 65/893 (7%) Query: 1 MYRFLLRCIHTEEPRSQVSRISMILPVLFSSVSVSVFAGNDFEEAFLR-RDKNGVSQDVF 59 +Y+ +C+H + R + + + F++ + A F FL + F Sbjct: 8 LYQRNTQCLHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRF 67 Query: 60 MYQDPVMPGRRLTDIVINDRLREKTEIDFVSNGNNK-VIPCLSYRQLKASGIRVSHYSGW 118 + PG DI +N+ ++ F + + + ++PCL+ QL + G+ + SG Sbjct: 68 ENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGM 127 Query: 119 ETREGEAAGSSDAETSVPSRCEDLALRIPAAFVQYDHTHQVLNITVPQEAMDNERFTMIS 178 +A C L I A Q D Q LN+T+PQ M N I Sbjct: 128 NLLADDA-------------CVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIP 174 Query: 179 PAEWDHGTPSLRSSYSGYFYSSRLKGASGPGWKVDDSTTESAWLSLNTTGNAGPWRLYSI 238 P WD G + +Y+ S + + + A+L+L + N G WRL Sbjct: 175 PELWDPGINAGLLNYNFSGNSVQNRIGG---------NSHYAYLNLQSGLNIGAWRLRDN 225 Query: 239 DSFYRNDR-----HQWKSNHDRAYLARDIALLRSSLQVGEIYTRTSGTMTGAIPLRGISL 293 ++ N + K H +L RDI LRS L +G+ YT G + I RG L Sbjct: 226 TTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGYT--QGDIFDGINFRGAQL 283 Query: 294 ATSERMSLDNQYSYAPVIRGVARTNARLTVRQRDAVIYSTLLTPGAFAIDDLYTAQVGAD 353 A+ + M D+Q +APVI G+AR A++T++Q IY++ + PG F I+D+Y A D Sbjct: 284 ASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGD 343 Query: 354 LDVMVEESDGQIQSFRVPYTALPGMIRAGSIRYSLAAGTWRGPDGGTSEPALLSGTLEYG 413 L V ++E+DG Q F VPY+++P + R G RYS+ AG +R + +P TL +G Sbjct: 344 LQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHG 403 Query: 414 F-EHFTLNSASVMTENYQMFSSGAAWNIGAIGAFSADLAYARHSETWRDNRQREGTAARL 472 +T+ + + + Y+ F+ G N+GA+GA S D+ A S D++ G + R Sbjct: 404 LPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQAN-STLPDDSQHD-GQSVRF 461 Query: 473 LYARQFDVTGTSLQLLGYQYQSESFLDAGEFLARQSQS----------WIDGYAPDTTTW 522 LY + + +GT++QL+GY+Y + + + + + + D Sbjct: 462 LYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNL 521 Query: 523 QRRRRNRMEMTVSQNMNSVGNLYMTISQESFYGTGDKNSSLSAGAGTTVGSASVSLALTH 582 +R ++++TV+Q + LY++ S ++++GT + + AG T + +L+ + Sbjct: 522 AYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSL 581 Query: 583 NR--YQRLSDNQLTLSLSLPLSVWLPARQDAGF----LSYGLSRNKNNQYGQSLGYAGNS 636 + +Q+ D L L++++P S WL + + + SY +S + N + G G Sbjct: 582 TKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTL 641 Query: 637 -AGNDFSYSASLQRDTQGEYSQ----SGSLGWNSSRANITAGISHARDYRQYSAGMSGGV 691 N+ SYS G+ + +L + N G SH+ D +Q G+SGGV Sbjct: 642 LEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGV 701 Query: 692 TLYRGGVIMSPPLGNTVAIVETPGAENIRVSGINNARTDSAGRAVVTWLTPYRYNQINLD 751 + GV + PL +TV +V+ PGA++ +V RTD G AV+ + T YR N++ LD Sbjct: 702 LAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALD 761 Query: 752 -AGESDGAELQESSRKIVPTEGAAVLLRFATRSGRRALVEI-YSRKSIPLGALAYTESAP 809 +D +L + +VPT GA V F R G + L+ + ++ K +P GA+ +E Sbjct: 762 TNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSE--- 818 Query: 810 GVNETEEAGIVGQKGLVWLTGLDTHRAQVLNVIWGQRPEERCQIALSAPTEEQ 862 ++ +GIV G V+L+G+ A + V WG+ C P E Q Sbjct: 819 ---SSQSSGIVADNGQVYLSGM--PLAGKVQVKWGEEENAHCVANYQLPPESQ 866
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 34.2 bits (78), Expect = 3e-04 Identities = 39/175 (22%), Positives = 59/175 (33%), Gaps = 22/175 (12%) Query: 4 PANFNDSRPMIDVNDTAMLLIDHQSGLFQTVGD--MPMPELRARAATLAKMASLAGLPVI 61 P N D N +L+ D Q+ P+ EL A L G+PV+ Sbjct: 18 PQNKVSWV--PDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVV 75 Query: 62 TTASVPQ-------------GPNGPLIPE----IHENAPHA-QYIARKGEINAWDNPDFV 103 TA GP P I E AP + K +A+ + + Sbjct: 76 YTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLL 135 Query: 104 AAVKATGRKTLIIAGTITSVCMAFPSIAAVADGYRVFAVIDASGTYSKMAQEITL 158 ++ GR LII G + + A + + F V DA +S ++ L Sbjct: 136 EMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMAL 190
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 76.2 bits (187), Expect = 2e-18 Identities = 49/188 (26%), Positives = 85/188 (45%), Gaps = 8/188 (4%) Query: 3 QVILITGASSGFGALAARAFAHAGHIVYASMRDTAGRNAPQVQSTLEYARQHNVDLRTVE 62 ++ ITGA+ G G AR A G + A D +V S+L+ +H Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAV--DYNPEKLEKVVSSLKAEARHAEAFP--- 63 Query: 63 LDVQSQDSADAAIAQIIAQDGRLDVVVHNAGHMVYGPTEAFLPEQFSQLYDINVLGTQRV 122 DV+ + D A+I + G +D++V+ AG + G + E++ + +N G Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123 Query: 123 NRAALPQLRKQGQGLLLWVGSSSTRGGTPPY-LAPYFAAKAAMDAVAVSYAAELARWGIE 181 +R+ + + G ++ VGS+ G P +A Y ++KAA ELA + I Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNP--AGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181 Query: 182 TSIIVPGA 189 +I+ PG+ Sbjct: 182 CNIVSPGS 189
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 84.3 bits (208), Expect = 1e-21 Identities = 63/252 (25%), Positives = 107/252 (42%), Gaps = 12/252 (4%) Query: 3 VTNKTALVTGASRGIGRAIAERLAQDGFSVVVNYAGNANAAQETVKDIITKGGKAVAIQA 62 + K A +TGA++GIG A+A LA G + + N ++ V + + A A A Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQG-AHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64 Query: 63 DVSSEADVGRLFSEAKAVTGHLDVVVHSAGVMPMAKITPAGLADFDKVIHTNLRGAFLVL 122 DV A + + + + G +D++V+ AGV+ I +++ N G F Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124 Query: 123 ANAAESV--SEGGRIIALSTSVIAKSFPAYGPYIASKAGVEGLVHVLANELRGRDITVNA 180 + ++ + G I+ + ++ + Y +SKA L EL +I N Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184 Query: 181 VAPGPTGTD----LFFNGKSEEQI----SAIAKLA-PLERIGTPEEIASVVATLAGPDGS 231 V+PG T TD L+ + EQ+ K PL+++ P +IA V L Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244 Query: 232 WINSQVIRVNGG 243 I + V+GG Sbjct: 245 HITMHNLCVDGG 256
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 89.5 bits (222), Expect = 6e-23 Identities = 32/124 (25%), Positives = 59/124 (47%) Query: 2 RVLVVEDNALLRHHLKVQLQELGHQVDAAEDAKEADYYLGEHVPDIAIVDLGLPDEDGLS 61 +LV +D+A +R L L G+ V +A ++ D+ + D+ +PDE+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 LIRRWRSHDVSVPVLVLTAREGWQDKVEVLSAGADDYVTKPFHIEEVAARMQALLRRNSG 121 L+ R + +PVLV++A+ + ++ GA DY+ KPF + E+ + L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 122 LASQ 125 S+ Sbjct: 125 RPSK 128
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 27.2 bits (60), Expect = 0.023 Identities = 12/51 (23%), Positives = 20/51 (39%) Query: 101 QGQGLGTQALRAFEREMREQGIEQIRLRVAGDNQRARHVYASAGFWVTGIN 151 + +G+GT L +E + L N A H YA F + ++ Sbjct: 102 RKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAVD 152
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 67.4 bits (164), Expect = 2e-15 Identities = 48/186 (25%), Positives = 83/186 (44%), Gaps = 5/186 (2%) Query: 2 LRGKRAVITGGGSGFGQALAVWLAREGVSVDYCARRPADIQETSALIAAEGGTAQGYLCD 61 + GK A ITG G G+A+A LA +G + P +++ + + AE A+ + D Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 62 LAQPASIAQFSAQLLQSETPVDILILNAAQWLSGNLDDRSPPEIVDTLHSGLTSSVLLVQ 121 + A+I + +A++ + P+DIL+ A G + S E T T + Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 122 ALLPALRRSEQADIVSMIS-ACGIPHFTDSIAHPAFFASKHGLSGFTQTLSHQLAAENIR 180 ++ + IV++ S G+P + A+ +SK FT+ L +LA NIR Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPR----TSMAAYASSKAAAVMFTKCLGLELAEYNIR 181 Query: 181 VTGLYP 186 + P Sbjct: 182 CNIVSP 187
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 31.8 bits (72), Expect = 0.006 Identities = 27/108 (25%), Positives = 39/108 (36%), Gaps = 13/108 (12%) Query: 51 ELGWTDNSTTATFSAMTTAGMFLGALGGGIIGDKIGRKNAFILYEAIHIIAMVVGAFSPN 110 W + + TFS T G L D++G K + I+ V+G + Sbjct: 50 STNWVNTAFMLTFSIGT---AVYGKLS-----DQLGIKRLLLFGIIINCFGSVIGFVGHS 101 Query: 111 MNF-LIACRFVMGVGLGALLVTLFAGFTEYMPGRNR----GTWSSRVS 153 LI RF+ G G A + Y+P NR G S V+ Sbjct: 102 FFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVA 149
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 28.6 bits (64), Expect = 0.019 Identities = 25/153 (16%), Positives = 48/153 (31%), Gaps = 16/153 (10%) Query: 27 SADGVTLTGFAIGVLALPFLGLRWYGAALAAIVVNRLLDGLDGALARRRGLSDAGGFLDI 86 S + V+ L L ++ +++ L + A +D Sbjct: 26 SKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQA-------LSYVVDN 78 Query: 87 SLDFLFYALVPFGFIIADPAQNAVAGGWLLFSFIGTGSSFLAFAALAAKHQIANPGYAHK 146 L FY P + A A +A + + F+ ++ A+ + NP K Sbjct: 79 VLLEFFYLCFPLLTVAALMA---IASHVVQYGFL------ISGEAIKPDIKKINPIEGAK 129 Query: 147 SFYYLGGLTEGSETILLFVLCCLFPAHFAWLAW 179 + + L E ++IL VL + Sbjct: 130 RIFSIKSLVEFLKSILKVVLLSILIWIIIKGNL 162
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.024 Identities = 9/22 (40%), Positives = 12/22 (54%) Query: 29 VLTLMGPSGSGKSTLFAWMIGA 50 + L G G GKSTL ++G Sbjct: 598 SVVLEGTGGIGKSTLINTLVGL 619
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 33.9 bits (78), Expect = 2e-04 Identities = 15/61 (24%), Positives = 27/61 (44%), Gaps = 5/61 (8%) Query: 74 ANKTALTAVIAAETGKPQWEAVTEISAMINKIAISLKAYHSRTGESQTAMGDGSATLRHR 133 ANK L A +A T + ++ + A+ + ++ L GE +G G+ +R R Sbjct: 2 ANKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAK-----GEKVQLIGFGNFEVRER 56 Query: 134 P 134 Sbjct: 57 A 57
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.0 bits (70), Expect = 0.007 Identities = 9/16 (56%), Positives = 14/16 (87%) Query: 38 LVGESGSGKSLIAKAI 53 + GESG+GK L+A+A+ Sbjct: 165 ITGESGTGKELVARAL 180
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 41.3 bits (97), Expect = 2e-06 Identities = 24/95 (25%), Positives = 36/95 (37%), Gaps = 13/95 (13%) Query: 1 MRIFLTGATGFIGSRILNELLAAGHQVTGL---------ARSEASALALQTAGADVQYGT 51 M+ +TGA GFIG + LL AGHQV G+ + +A L G Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 52 LEEPRSLLEAVTR---CDAVIHTAFDHDFSRFVEN 83 L + R + + + V + +EN Sbjct: 61 LAD-REGMTDLFASGHFERVFISPHRLAVRYSLEN 94
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 89.6 bits (222), Expect = 2e-21 Identities = 75/398 (18%), Positives = 157/398 (39%), Gaps = 22/398 (5%) Query: 35 VINV-VPAMKSSLDISLETLTLAVSLSALFSGCFVVASGGLADKFGRMRMTHIGLGLSIA 93 V+NV +P + + + + + L G L+D+ G R+ G+ ++ Sbjct: 32 VLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCF 91 Query: 94 GSALLIVAQGPW-LFLAGRVLQGLSAACIMPATLALIKTWYEGKARQRAVSFWVIGSWGG 152 GS + V + L + R +QG AA + ++ + + R +A G Sbjct: 92 GSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMG 151 Query: 153 SGLCSFVGGAIATGLGWRWIFVFSIAVALVALMLLRGTPESRSAGAQQQKLDISGLLSLI 212 G+ +GG IA + W ++ + + + L++ + + DI G++ + Sbjct: 152 EGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKK---EVRIKGHFDIKGIILMS 208 Query: 213 ASLVLLNLFISKGHGWGWSSGLSLTMFAGSLLAAGFFIRSGLRKGDAALIDFALFRNRAY 272 +V LF + S++ S+L+ F++ +RK +D L +N + Sbjct: 209 VGIVFFMLFTTSY---------SISFLIVSVLSFLIFVKH-IRKVTDPFVDPGLGKNIPF 258 Query: 273 SAAVLSNFLLNGAI-GTMMITSIWLQKGHNMTPLETGGMTLGYLVTVLAMIR--VGEKLL 329 VL ++ G + G + + ++ H ++ E G + + + T+ +I +G L+ Sbjct: 259 MIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVII-FPGTMSVIIFGYIGGILV 317 Query: 330 QRYGARLPMMTGPLLTAVAIVLISCTFLDKSLYIVTVFLSNVLFGLGLGCYATPSTDTAV 389 R G + G +V+ + S FL ++ + + G GL T + Sbjct: 318 DRRGPLYVLNIGVTFLSVSFLTAS--FLLETTSWFMTIIIVFVLG-GLSFTKTVISTIVS 374 Query: 390 VNAPENKVGVASGIYKMGSSLGGAMGIAVTASLFALFL 427 + + + G + S L GIA+ L ++ L Sbjct: 375 SSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPL 412
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 348 bits (895), Expect = e-120 Identities = 127/341 (37%), Positives = 177/341 (51%), Gaps = 21/341 (6%) Query: 6 DNLLGEANSFLEVLEQVSRLAPLDKPVLVIGERGTGKELIANRLHFLSSRWQGPFISLNC 65 L+G + + E+ ++RL D +++ GE GTGKEL+A LH R GPF+++N Sbjct: 137 MPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINM 196 Query: 66 AALNENLLDSELFGHEAGAFTGASKRHPGRFERADGGTLFLDELATAPMLVQEKLLRVIE 125 AA+ +L++SELFGHE GAFTGA R GRFE+A+GGTLFLDE+ PM Q +LLRV++ Sbjct: 197 AAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQ 256 Query: 126 YGELERVGGSQPLQVNVRLVCATNADLPQRVEDGHFRADLLDRLAFDVVQLPPLRDRQSD 185 GE VGG P++ +VR+V ATN DL Q + G FR DL RL ++LPPLRDR D Sbjct: 257 QGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAED 316 Query: 186 IMLLANQFAIQMCRELGLPLFPGFSDRARETLLGYRWPGNIRELKNVVERSV--YRHGTS 243 I L F Q +E F A E + + WPGN+REL+N+V R Y Sbjct: 317 IPDLVRHFVQQAEKEGLDVK--RFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVI 374 Query: 244 DSELDNIIINPFHQHQPLQSPAADV-----------------AAHPTGPTLPVDLRAFQQ 286 E+ + P++ AA A+ Sbjct: 375 TREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLA 434 Query: 287 EQEKNLLQTSLQQAKYNQKQAAALLGLTYHQLRALLKKHQL 327 E E L+ +L + NQ +AA LLGL + LR +++ + Sbjct: 435 EMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 28.6 bits (64), Expect = 0.018 Identities = 24/132 (18%), Positives = 52/132 (39%), Gaps = 16/132 (12%) Query: 13 ANINSLLEKAEDPQKLVRLMIQEMED--TLVEVRSTSARALAEKKQLSRRIEQALAQQAE 70 A ++L + + L R+ ++D +L+ ++ + A+ E++ L Sbjct: 214 AERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVY-- 271 Query: 71 WQEKAELALRKDKEDLARAALIEKQKLTDLIGSLENEAQMVDETLTRMKKEIGELENKLS 130 + E + L K++ + +NE + + L + IG L +L+ Sbjct: 272 ---------KSQLEQIESEILSAKEEYQLVTQLFKNE---ILDKLRQTTDNIGLLTLELA 319 Query: 131 ETRARQQALTLR 142 + RQQA +R Sbjct: 320 KNEERQQASVIR 331
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 310 bits (795), Expect = e-102 Identities = 113/377 (29%), Positives = 172/377 (45%), Gaps = 38/377 (10%) Query: 174 VLTGAVAMLRSTVRMGRQLQTMTTQDTSAFSQILAVGPKMRHVVEQARKLAMLSAPLLIV 233 LT + ++ + ++ + D+ ++ M+ + +L L+I Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMIT 166 Query: 234 GDTGTGKDLFAHACHLASPRANKPYLALNCGSIPEDAVESELFG-------DAIQGKKGF 286 G++GTGK+L A A H R N P++A+N +IP D +ESELFG A G Sbjct: 167 GESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGR 226 Query: 287 FEQANGGSVLLDEIGEMSPRMQTKLLRFLNDGTFRRVGEDHEVHVDVRVICATQKNLLEL 346 FEQA GG++ LDEIG+M QT+LLR L G + VG + DVR++ AT K+L + Sbjct: 227 FEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQS 286 Query: 347 VQKGLFREDLYYRLNVLTLYLPPLRDCPQDIIPLTELFVSRFADEQGIPRPKLSADLSTV 406 + +GLFREDLYYRLNV+ L LPPLRD +DI L FV + E G+ + + + Sbjct: 287 INQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALEL 345 Query: 407 LTRYSWPGNVRQLKNAVYRALTQLEGYEMRPQDILLP---------------DHDVASLP 451 + + WPGNVR+L+N V R + + I S+ Sbjct: 346 MKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSIS 405 Query: 452 VGEEAM--------------EGSLDDITRRFERSVLTQ-LYRSFPSTRKLAKRLGVSHTA 496 E G D + E ++ L + + K A LG++ Sbjct: 406 QAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNT 465 Query: 497 IANKLREYGLSQKKGEE 513 + K+RE G+S + Sbjct: 466 LRKKIRELGVSVYRSSR 482
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 29.0 bits (65), Expect = 0.026 Identities = 32/170 (18%), Positives = 59/170 (34%), Gaps = 30/170 (17%) Query: 6 KVLILGATGGIGGEIARQLVR------------DQWDVHALRRHAPQNEEHGSITWISGD 53 K L+ GA G IG ++++L+ D +DV +L++ + + D Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDV-SLKQARLELLAQPGFQFHKID 60 Query: 54 ALNAEQVAS--AASACSVIVH-----AV-----NPPGYRNWEQLVLPMLHNTIHAAERNG 101 + E + A+ + AV NP Y L N + N Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAY---ADSNLTGFLNILEGCRHNK 117 Query: 102 -ALIVLPGTVYNYGPDA-FPLLREDAPQNPVTRKGAIRVQMEKALLAYAQ 149 ++ + YG + P +D+ +PV+ A + E Y+ Sbjct: 118 IQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSH 167
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 39.4 bits (92), Expect = 9e-06 Identities = 21/80 (26%), Positives = 35/80 (43%), Gaps = 12/80 (15%) Query: 1 MKILVAGATGSIGLHVVNTAIDMGHHPVAL---------VRNRRKVKRLPRGTDIFY-GD 50 MK LV GA G IG HV ++ GH V + + +++ L + F+ D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 51 VSMPETLSDLPKD--IDAVI 68 ++ E ++DL + V Sbjct: 61 LADREGMTDLFASGHFERVF 80
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 115 bits (290), Expect = 2e-33 Identities = 69/254 (27%), Positives = 113/254 (44%), Gaps = 9/254 (3%) Query: 4 LQGKRALITGGTSGIGLETAKLFVAEGARVIVTGVNPDSIAKAKVELGNDVLVVSADSAD 63 ++GK A ITG GIG A+ ++GA + NP+ + K L + A AD Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 64 VNAQKALAQTVQ---EHFGQLDIAFLNAGISLYMPIEVWTEEQFDLIYAINVKGPYFLMQ 120 V A+ + G +DI AG+ I ++E+++ +++N G + + Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 121 ALLPVFTS--SASVVFNTSINAHTGPVNSSVYGSTKAALLNMSKTLSNELLSRGIRINAV 178 ++ S S+V S A + + Y S+KAA + +K L EL IR N V Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 179 SPGPVNTPLYDKAGIPEEYHDQVMKNIVAT----IPAGRFGKPQEVAQAVLYFASDASAW 234 SPG T + E +QV+K + T IP + KP ++A AVL+ S + Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245 Query: 235 TVGSEIIIDGGVSI 248 + +DGG ++ Sbjct: 246 ITMHNLCVDGGATL 259
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 120 bits (301), Expect = 5e-35 Identities = 72/254 (28%), Positives = 120/254 (47%), Gaps = 10/254 (3%) Query: 2 SKPLSDKIALVTGGSTGIGLASAQELAAQGAKVY---ITGRRQQELDAAIALIGTSAKGI 58 +K + KIA +TG + GIG A A+ LA+QGA + + +++ +++ A+ Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF 62 Query: 59 RADVSRLEDLDKVYAQIAEESGRLDILFANAGGGDMLALGAITEEHFDRIFGTNVRGVLF 118 ADV +D++ A+I E G +DIL AG + ++++E ++ F N GV Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122 Query: 119 TVQKALPLLGA--GSSIILTASTVSVKGTANFSVYSASKAAVRNFARSWALDLQGRGIRV 176 + + SI+ S + + + Y++SKAA F + L+L IR Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182 Query: 177 NVVSPGPVKTPGLGGL----VAEEQR-QGLFDALAAQVPLGRIGEPAEVGKVVAFLASDA 231 N+VSPG +T L EQ +G + +PL ++ +P+++ V FL S Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242 Query: 232 ASFINAIELFVDGG 245 A I L VDGG Sbjct: 243 AGHITMHNLCVDGG 256
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 62.7 bits (152), Expect = 2e-14 Identities = 32/191 (16%), Positives = 61/191 (31%), Gaps = 23/191 (12%) Query: 1 MKVSKEQVRENRTRIVETASKLFRERGFDGVGVAELMSAAGLTHGGFYKHFGSKADLMAE 60 + +K++ +E R I++ A +LF ++G + E+ AAG+T G Y HF K+DL +E Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61 Query: 61 AMHCGFTRSAESTAGVDR--------------EKFIEYYLSRPHRDDMGKGCVMSALGAD 106 + E +E ++ R + + Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121 Query: 107 TARQSESIRETFA----AGIERQLAVLENEHETRADL-----IDTIAHLVGALVLSRACP 157 + + IE+ L ADL + + L+ + Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181 Query: 158 DNSALADEILD 168 S + Sbjct: 182 PQSFDLKKEAR 192
>PF05616#Neisseria meningitidis TspB protein Length = 501 Score = 29.3 bits (65), Expect = 0.026 Identities = 21/70 (30%), Positives = 30/70 (42%), Gaps = 2/70 (2%) Query: 71 QIPENPHKYPIVMLHGAGQFSRTWESTPDGREGFQNIFLRRGFSTYLVDQPRRGSAGRTT 130 ++ NP KY + G +S E P + + R G +V R S G TT Sbjct: 247 KVDANPDKY--IKATGYPGYSEKVEVAPGTKVNMGPVTDRNGNPVQVVATFGRDSQGNTT 304 Query: 131 VEGTVTPKPD 140 V+ V P+PD Sbjct: 305 VDVQVIPRPD 314
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 87.6 bits (217), Expect = 3e-22 Identities = 38/120 (31%), Positives = 66/120 (55%), Gaps = 2/120 (1%) Query: 2 RLLLVEDEEKTSTYLNRALGESGFTVDISADGAEGLHYALEFDYDAIILDVMLPGMDGYR 61 +L+ +D+ T LN+AL +G+ V I+++ A + D D ++ DV++P + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VLEGVRAV-KQTPVLMLSARGSVDERVKGLRLGADDYLPKPFSLIELVARI-QALVRRRA 119 +L ++ PVL++SA+ + +K GA DYLPKPF L EL+ I +AL + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 97.6 bits (243), Expect = 6e-24 Identities = 80/408 (19%), Positives = 160/408 (39%), Gaps = 21/408 (5%) Query: 27 MCVGMFIALIDIQIVSASLRDIGGGLSAGDDETVWVQTSYLIAEIIIIPLSGWLARVMST 86 +C+ F ++++ +++ SL DI + T WV T++++ I + G L+ + Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78 Query: 87 RWLFAASAAGFTLMSLLCGWAWNIQS-MIAFRALQGLAGGSMIPLVFTTAFAFFQGKQRV 145 + L S++ + S +I R +QG + LV + + R Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138 Query: 146 IAAATIGGLASLAPTLGPTVGGWITENFNWHWLFFINVIPGIYIAVAVPLLVKVDSADPS 205 A IG + ++ +GP +GG I +W +L I +I + VP L+K+ + Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMI----TIITVPFLMKLLKKEVR 194 Query: 206 LLRGADYLSILLLALSLGCLEYTLEEGPRWGWFDDATLTTTAWVALLCGVAFIIRTLRHP 265 + D I+L+++ + F + + V++L + F+ + Sbjct: 195 IKGHFDIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVT 244 Query: 266 QPVMDLRALQDRTFSLGCYFSFMAGVGIFATIYLTPLYLGSVRGFSALEIGLAV-FSTGL 324 P +D ++ F +G + + + + P + V S EIG + F + Sbjct: 245 DPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTM 304 Query: 325 FQVMSIPFYSWLANRVDLRWLLMAGLIGFAMSMY--SFVPITHDWGADQLLLPQAFRGLA 382 ++ L +R ++L G+ ++S SF+ T W +++ GL+ Sbjct: 305 SVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSW-FMTIIIVFVLGGLS 363 Query: 383 QQFAVAPTVTLTLGSLPPARLKLASGLFNLMRNLGGAIGIALCGTVLN 430 V T+ + SL L N L GIA+ G +L+ Sbjct: 364 FTKTVISTIVSS--SLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 107 bits (269), Expect = 4e-28 Identities = 57/411 (13%), Positives = 126/411 (30%), Gaps = 90/411 (21%) Query: 9 AFVLAALAVAALAAASYGAYWWHTGRFIQTTDDAYVGGDISAISSKISGYIQQLAVQDNM 68 F++ A ++ L A G+ G I + ++++ V++ Sbjct: 66 GFLVIAFILSVLGQVEIVAT--ANGKLT-------HSGRSKEIKPIENSIVKEIIVKEGE 116 Query: 69 AVKKGDLLIRIDDRDYRAARAKAV------------------------------------ 92 +V+KGD+L+++ A K Sbjct: 117 SVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYF 176 Query: 93 -----GEVAAQQAALADIIATRQLQ-----------QATIAASAASLQAAAAAAEKLAND 136 EV + + + +T Q Q +A A + + + Sbjct: 177 QNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSR 236 Query: 137 NRRYNALAASSAISA-------QIRDNASADYRRARAEQDKAKADKTAAERQLAVLDA-R 188 +++L AI+ A + R +++ ++ +++ +A+ + ++ Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296 Query: 189 QQQTLAALTQAQAN-------LEMATLNLSYTEIRAPFDGVIGNRRAWS-GSFVSSGTQL 240 + + L L Q N L + IRAP + + + G V++ L Sbjct: 297 KNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL 356 Query: 241 LSLVPA-HGLWIDANFKENQLAHMRAGQPVTIVADVLPNRTF---KGHVTSLAPATGSRF 296 + +VP L + A + + + GQ I + P + G V ++ Sbjct: 357 MVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA---- 412 Query: 297 SILPAENATGNFTKIVQRVPVRIALEGEGAKLDVLRPGLSVVVTVNEKSRR 347 + G ++ + G K L G++V + R Sbjct: 413 ---IEDQRLGLVFNVIISIEENCLSTGN--KNIPLSSGMAVTAEIKTGMRS 458
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 30.2 bits (68), Expect = 0.016 Identities = 66/382 (17%), Positives = 137/382 (35%), Gaps = 64/382 (16%) Query: 15 RIAIVMAFVQFTNALEYMALTPVFAFMAEGFSVPVSFSGYVSGMYTLGAVLSGIVAFYCI 74 +I I + + F + L M L +A F+ P + + +V+ + L + V Sbjct: 14 QILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73 Query: 75 DLFNKKQFLLTNMVLLGALTFLPTLT-THFDILLALRFCAGAVGGTTMGVGMSILINYAP 133 D K+ LL +++ + + + + F +L+ RF GA + M ++ Y P Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIP 133 Query: 134 ANLRGKMLATVIASFSLVSIVGMPAILFLCTHYGWHTAPGLISSLCLLSLPLIIFIIPKD 193 RGK + + ++ VG PAI + HY + LI + ++++P ++ ++ K+ Sbjct: 134 KENRGKAFGLIGSIVAMGEGVG-PAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKE 192 Query: 194 TAPSGVKRNLSIDAQTLLFASCTALVQFSPM-----LIIPILA----------------- 231 +K + I L+ + F+ LI+ +L+ Sbjct: 193 VR---IKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249 Query: 232 ---------------------------PLMTQYMGAQQNLLP-----LLFLSGGIAGYLS 259 ++ M L ++ G ++ + Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309 Query: 260 TKITGMLTSRLSALMLATISTLFLVASLLIPAM-----GYHNVFLFITLFLGASYSRLVC 314 I G+L R L + I FL S L + + + + + G S+++ V Sbjct: 310 GYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVI 369 Query: 315 ASSVAVQYPEDEQRASFTSLQT 336 ++ V+ + E A + L Sbjct: 370 STIVSSSLKQQEAGAGMSLLNF 391
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 56.9 bits (137), Expect = 3e-13 Identities = 36/108 (33%), Positives = 49/108 (45%), Gaps = 20/108 (18%) Query: 19 FGTQASDSTPSGSTSSAT--------TAITAHDKNGKLLP---------GALRVQFDVDA 61 F A S + ++AT + A +N P G ++V+FDV Sbjct: 125 FENTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTP 184 Query: 62 NGRVQNTQILESTTTPEFEHKVISIMKNEWRYEKGKPGKDHRIVVMIR 109 +GRV N QIL + FE +V + M+ WRYE GKPG IVV I Sbjct: 185 DGRVDNVQILSAKPANMFEREVKNAMRR-WRYEPGKPGSG--IVVNIL 229
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 68.2 bits (166), Expect = 8e-16 Identities = 62/250 (24%), Positives = 104/250 (41%), Gaps = 17/250 (6%) Query: 3 KKTLIIGGTSGIGFAVASALAEQGESLILAGRDSEKLARARQLLSQKSASVDTVVLDISK 62 K I G GIG AVA LA QG + + EKL + L ++ + D+ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 63 EEEVIQLSQTL----GEVDNIIVTAGSQAPGGALASLNLNEARLAFDTKFWGSIHVARHL 118 + +++ + G +D ++ AG P G + SL+ E F G + +R + Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRP-GLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 119 SKNIKAR--GTLTLTSGFVSRRTVAGAIVKTTMNAALESAVKVLAKELSP--LRVNAVSP 174 SK + R G++ + + AA K L EL+ +R N VSP Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 175 GLTDTEAYAGM--DPAAREKLLASAAEN----LPAKAFGRAEDIAKGYLFVMDNPFVTGT 228 G T+T+ + D E+++ + E +P K + DIA LF++ T Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247 Query: 229 LLDI--EGGA 236 + ++ +GGA Sbjct: 248 MHNLCVDGGA 257
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 112 bits (280), Expect = 6e-32 Identities = 69/252 (27%), Positives = 115/252 (45%), Gaps = 13/252 (5%) Query: 4 QNKVAVITGSTAGIGQAVAEQLHKYGAKVVIVSRSSEQAKQQAKRLTSQGQQALGIGCDV 63 + K+A ITG+ GIG+AVA L GA + V + E+ ++ L ++ + A DV Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66 Query: 64 SQPEQVQKMIDEVIKHFGRLDYAVNNAGLTGEHGKNITEQTVENWDKVIATSLSGVFYCL 123 + ++ + + G +D VN AG+ I + E W+ + + +GVF Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVL--RPGLIHSLSDEEWEATFSVNSTGVFNAS 124 Query: 124 KYEIPEMM-KFGGSIVNLSAVNGLVGIPGLAPYTVAKHGIIGLTQTAALEFASQGIRINA 182 + MM + GSIV + + V +A Y +K + T+ LE A IR N Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184 Query: 183 VAPGYVQTPRMTEF------PENIVRSFANSH----PMKRMAKMQEIADFILFLLSDNSA 232 V+PG +T E +++ + P+K++AK +IAD +LFL+S + Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244 Query: 233 FCTGGVYPIDGG 244 T +DGG Sbjct: 245 HITMHNLCVDGG 256
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 37.3 bits (86), Expect = 2e-05 Identities = 36/133 (27%), Positives = 48/133 (36%), Gaps = 13/133 (9%) Query: 7 ALLVIDMQQGLFRG-PASPHSSDTVLLNIRLLIENARQAQVPVFFARHIG---PDD---- 58 LL+ DMQ A + NIR L Q +PV + G PDD Sbjct: 32 VLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQNPDDRALL 91 Query: 59 ----SPFSEQSPLTQLIPELNVNAEQDIVFIKKYPSCFRDTELQLQLSLRGVKQLVIAGM 114 P P + I + D+V K S F+ T L + G QL+I G+ Sbjct: 92 TDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGI 151 Query: 115 KTEF-CVDTTCRA 126 C+ T C A Sbjct: 152 YAHIGCLVTACEA 164
>ICENUCLEATIN#Ice nucleation protein signature. Length = 1258 Score = 31.3 bits (70), Expect = 0.028 Identities = 84/365 (23%), Positives = 135/365 (36%), Gaps = 26/365 (7%) Query: 449 YGTQSVKTTDTGADATQAGSTVGSVNGDLTVRAGDNLTV-TGSDLIAGR-DMALSGKNVA 506 YG+ ++ A + DLT G T S LIAG +G N Sbjct: 644 YGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSI 703 Query: 507 ITAAENQSRQTHEVEQKTSGLTLALSGTAGSALNSVVQATQDAR-----SAGSSRLQALQ 561 +TA ++ E TSG + A S+L + +TQ A +AG Q + Sbjct: 704 LTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAR 763 Query: 562 GVKAALSGVQASQAARLDAAQGNDPANNNTVGV--SLSYGSQSSKSTQ-RSEQTTAQGSS 618 +G ++ A D++ + T G L+ G S+++ Q RS+ TT GS+ Sbjct: 764 EQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGST 823 Query: 619 LTAGRDLSITAREGDLNAVGSQLKAGNDVALSASRDINLVSAENTSLLEGKNDSHGG--- 675 TAG D S+ A GS AG + L+A + EN+ L G + Sbjct: 824 STAGADSSLIA------GYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYD 877 Query: 676 ---TVGVGIGVGSGGWGISVSASANKGKGSESGNGTTHSETTVDAGNRLTLNSGRDTTLT 732 G G +G I + + E+ + TT +T AG +L +G +T Sbjct: 878 SSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGST-- 935 Query: 733 GAQVSGDTVIADIGRNLTLTSEQDSDRYDSKQQNASAGGSFTFGSMSGSASVNLSKDKMH 792 Q + G + T+ + S + AG + + GS + + Sbjct: 936 --QTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLT 993 Query: 793 SNYDS 797 + Y S Sbjct: 994 AGYGS 998 Score = 30.9 bits (69), Expect = 0.035 Identities = 83/344 (24%), Positives = 116/344 (33%), Gaps = 26/344 (7%) Query: 449 YGTQSVKTTDTGADATQAGSTVGSVNGDLTVRAGDNLTV-TGSDLIAGRDMALSGKNVAI 507 YG+ ++ A + G DLT G T S LIAG + + Sbjct: 212 YGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSS 271 Query: 508 TAAENQSRQTHEVEQKTSGLTLAL--SGTAGSALNSVVQATQDARSAGSSRLQALQGVKA 565 A S QT QK S LT +GTAG+ + + + S A G Sbjct: 272 LTAGYGSTQT---AQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYG--- 325 Query: 566 ALSGVQASQAARLDAAQGNDPANNNTVGVSLSYGSQSSKSTQRSEQTTAQGSSLTAGRDL 625 S A + + L A G+ + + YGS QT + SSLTAG Sbjct: 326 --STQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGST---------QTAGEDSSLTAGYGS 374 Query: 626 SITAREG-DLNA-VGSQLKAGNDVALSASRDINLVSAENTSLLEGKNDSHGGTVGVGIGV 683 + TA++G DL A GS AG D +L A + E ++ G + G + Sbjct: 375 TQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTA 434 Query: 684 GSGGWGISVSASANKGKGSESGNGTTHSETTVDAGNRLTLNSGRDTTL----TGAQVSGD 739 G G G + S+ + S T G+ T G D T T Sbjct: 435 GYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYES 494 Query: 740 TVIADIGRNLTLTSEQDSDRYDSKQQNASAGGSFTFGSMSGSAS 783 ++IA G T Q A G S S + Sbjct: 495 SLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTA 538
>PF06057#Type IV secretory pathway VirJ component Length = 243 Score = 36.0 bits (83), Expect = 1e-04 Identities = 13/60 (21%), Positives = 27/60 (45%), Gaps = 2/60 (3%) Query: 97 PEQVWQDIDTVLEHVRVKFPWARVHLLGHSSGGGMLINYFTRFTPSQQS--DSLILLAPE 154 P+ V QD +++ + +F +V L+G+S G ++ + +LL+P Sbjct: 96 PKDVTQDTLAIIDKYQAEFGTQKVILIGYSFGAEVIPFVLNEMPARYRKNVLGAVLLSPS 155
>ICENUCLEATIN#Ice nucleation protein signature. Length = 1258 Score = 35.1 bits (80), Expect = 0.003 Identities = 116/489 (23%), Positives = 183/489 (37%), Gaps = 36/489 (7%) Query: 300 GSEVTGKGNVTLSAGH---DLSARGALLSSGAALNLGAGNDLTLEAGENS-QTLDERHKV 355 GS T + +L+AG+ + ++L++G AG D +L AG S QT + Sbjct: 741 GSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSIL 800 Query: 356 TGSSGWLSKTTTRTE-----DSVSRQTSRGSELNGDSVSLTAGHDLTLRGSSVAGSGDVA 410 T G R++ S S + S + G + TAG++ L AG G Sbjct: 801 TAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSIL----TAGYGSTQ 856 Query: 411 LLAGNDLLIGTQNEYSSELHLKQEKKSGLMSSGGIGFSYGTQSVKTTDTGADATQAGSTV 470 N L S+ + S L++ G + G S+ T G+ T ++ Sbjct: 857 TAQENSDLTTGYGSTSTAGY-----DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSD 911 Query: 471 GSVNGDLTLSAGENLTVT---GSDLIAGRDMAL-SGKNVAITAAENQSRQTHEVEQKTSG 526 + T +AG ++ GS A L +G + TA E S +G Sbjct: 912 LTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAG 971 Query: 527 LTLALSGTAGSALNSVVQATQDARSAGSSRLQALQGVKAAL-SGVQASQAARLDAAQGND 585 +L GS + Q+T A + + + A S A + L A G+ Sbjct: 972 YDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSS 1031 Query: 586 PANNNTVGVSLSYGSQSSKSTQRSEQTTAQGSSLTAGRDLSITAREGDLNAVGSQLKAGN 645 + ++ YGS + S RS T GSSL +GR S+TA GS A + Sbjct: 1032 LTSGIRSFLTAGYGS-TLISGLRSVLTAGYGSSLISGRRSSLTA------GYGSNQIASH 1084 Query: 646 DVALSASRDINLVSAENTSLLEGKNDSHGGTVGVGIGVGSGGWGISVSASANKGKGSESG 705 +L A + ++ + L+ GK S T G + SG + ++ K Sbjct: 1085 RSSLIAGPESTQITGNRSMLIAGKGSSQ--TAGYRSTLISGADSVQMAGERGKLIAGADS 1142 Query: 706 NGTTHSETTVDAGNNLILNSGRDTTLTGAQ----VSGDTVIADIGRNLTLTSEQDSDRYD 761 T + + AGNN L +G + LT ++GD G N LT+ S Sbjct: 1143 TQTAGDRSKLLAGNNSYLTAGDRSKLTAGNDCILMAGDRSKLTAGINSILTAGCRSKLIG 1202 Query: 762 SKQQNASAG 770 S +AG Sbjct: 1203 SNGSTLTAG 1211 Score = 32.4 bits (73), Expect = 0.018 Identities = 111/517 (21%), Positives = 183/517 (35%), Gaps = 50/517 (9%) Query: 294 SQSRSTGSEVTGKGNVTLSAGHDLSARGALLSSGAALNLGAGNDLTLEAGENS-QTLDER 352 +Q+ S++ T +AG + S L +G A + L AG S QT E Sbjct: 519 TQTAQNESDLITGYGSTSTAGANSS-----LIAGYGSTQTASYNSVLTAGYGSTQTAREG 573 Query: 353 HKVTGSSGWLSKTTTRTEDSVSRQTSRGSELNGDSVSLTAGHDLTLRGSSVAGSGDVALL 412 +T G T T DS ++ SLTAG+ GS+ L Sbjct: 574 SDLTAGYG---STGTAGSDSSIIAGYGSTQTASYHSSLTAGY-----GSTQTAREQSVLT 625 Query: 413 AGNDLLIGTQNEYSSELHLKQEKKSGLMSSGGIGFSYGTQSVKTTDTGADATQAGSTVGS 472 G G+ + ++ L S + + G S +T G+D T + + Sbjct: 626 TG----YGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTST 681 Query: 473 VNGDLTLSAGENLTVTGSDLIAGRDMALSGKNVAITAAENQSRQTHEVEQKTSGLTLALS 532 D +L AG T T +G N +TA ++ E TSG + Sbjct: 682 AGADSSLIAGYGSTQT------------AGYNSILTAGYGSTQTAQEGSDLTSGYGSTST 729 Query: 533 GTAGSALNSVVQATQDAR-----SAGSSRLQALQGVKAALSGVQASQAARLDAAQGNDPA 587 A S+L + +TQ A +AG Q + +G ++ A D++ Sbjct: 730 AGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYG 789 Query: 588 NNNTVGV--SLSYGSQSSKSTQ-RSEQTTAQGSSLTAGRDLSITAREGDLNAVGSQLKAG 644 + T G L+ G S+++ Q RS+ TT GS+ TAG D S+ A GS AG Sbjct: 790 STQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIA------GYGSTQTAG 843 Query: 645 NDVALSASRDINLVSAENTSLLEGKNDSHGGTVGVGIGVGSGGWGISVSASANKGKGSES 704 + L+A + EN+ L G + T G + +G + + Sbjct: 844 YNSILTAGYGSTQTAQENSDLTTGYGST--STAGYDSSLIAGYGSTQTAGYNSILTAGYG 901 Query: 705 GNGTTHSETTVDAGNNLILNSGRDTTLTGA----QVSGDTVIADIGRNLTLTSEQDSDRY 760 T + + G +G +++L Q + G + T+ + S Sbjct: 902 STQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLT 961 Query: 761 DSKQQNASAGGSFTFGSMSGSASVNLSKDKMHSNYDS 797 + AG + + GS + + + Y S Sbjct: 962 AGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGS 998
>PF05860#haemagglutination activity domain. Length = 117 Score = 86.0 bits (213), Expect = 1e-21 Identities = 22/141 (15%), Positives = 48/141 (34%), Gaps = 24/141 (17%) Query: 67 ANIVADAGAPKNQQPTVMQSANGTPQVNIQTPSAAGVSRNTYSQFDVNQQGAILNNSHKN 126 A I D P N + + + T + T + + + + + +F V G N+ Sbjct: 1 AQITPDTTLPIN---SNITTEGNTRIIERGTQAGSNLFHS-FQEFSVPTSGTAFFNN--- 53 Query: 127 VQSQLGGMVAGNPWLAKGEAKVILNEVNSRDPSRLNGMIEVAGKKAQVVIANPSGITCNG 186 + I++ V S ++G+I A + + NP+GI Sbjct: 54 ----------------PTNIQNIISRVTGGSVSNIDGLIRANAT-ANLFLINPNGIIFGQ 96 Query: 187 CGFINANRATLTTGQAQLNNG 207 ++ + + + +L Sbjct: 97 NARLDIGGSFVGSTANRLKFA 117
>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature. Length = 428 Score = 28.8 bits (64), Expect = 0.016 Identities = 33/139 (23%), Positives = 51/139 (36%), Gaps = 38/139 (27%) Query: 70 QQEALALSDELIAELKANDVIVIAAPMYNFNIPTQLKNYFDL---VARAGVTFRY----- 121 QQ D EL+ N + +I +F+I T+ K ++ L + + T Y Sbjct: 129 QQSIKQYIDAHREELERNQIKIIGI---DFDIETEYKWFYSLQFNIKESAFTTGYAIASW 185 Query: 122 -TEKGPEGLVTGKRAVVLTSRGGIHKDTPTDLVTPYLSTFLGFIGITDVNFVFAEGIAY- 179 +E+ KR V S GG F G+T N FA+GI Y Sbjct: 186 LSEQDES-----KRVV--ASFGGGA-----------------FPGVTTFNEGFAKGILYY 221 Query: 180 -GPEVAAKAQSDAKAAIDS 197 ++K + +DS Sbjct: 222 NQKHKSSKIYHTSPVKLDS 240
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 36.7 bits (85), Expect = 1e-04 Identities = 18/164 (10%), Positives = 49/164 (29%), Gaps = 28/164 (17%) Query: 140 AASAAQASLEAQKAAAAAADLTVATSVAAGYLTLLSLDEQLRVTRQTLKSRQDAYNLAKR 199 S + + +L + A L++ ++ + + + Sbjct: 187 LTSLIKEQFSTWQNQKYQKELNLDKKRAE----RLTVLARINRYENLSRVEKSRLDDFSS 242 Query: 200 QFETGYTSRLELM-------QADSELRSTRSQIPPLQHQIAQQENALSVLTGSNPGSIQR 252 ++ ++ +A +ELR +SQ+ ++ +I + ++T Sbjct: 243 LLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT--------- 293 Query: 253 QDFARLTPLALPSQLPSTLLNRRPDIVQAQRQLIAADATLASSQ 296 Q F ++ L +I +L + +S Sbjct: 294 QLFKN--------EILDKLRQTTDNIGLLTLELAKNEERQQASV 329
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 105 bits (263), Expect = 4e-27 Identities = 49/283 (17%), Positives = 98/283 (34%), Gaps = 16/283 (5%) Query: 82 NVKPGQVLFQIDDRIYKQRVHQAQATLAMKQAALKNNLQQRKSAEAVIQRNEAALQNARA 141 V L + ++ + +Q + L K+A L + E + + ++ L + + Sbjct: 183 EVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS 242 Query: 142 QNQKNQADLKRVQDLTADGSLSIRERDSARASAAQGSADIEQAKATLEMSRQDLQSTIVN 201 K V + ++ E ++ Q ++I AK ++ Q ++ I++ Sbjct: 243 LLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILD 302 Query: 202 RDALA-ADVASAGAALELAEIDLNNTRIVAPTAGQLGQISVR-LGAYVSAGTHLTSLVPP 259 + ++ L E + I AP + ++ Q+ V G V+ L +VP Sbjct: 303 KLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPE 362 Query: 260 QH--WVIANMKETQLAQIRIGQPTTFSVDALNGETF---SGTVQSISPATGVEFSAISPD 314 V A ++ + I +GQ V+A + G V++I+ D Sbjct: 363 DDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA-------IED 415 Query: 315 NATGNFVKIAQRIPVRIAVNDGQKNSSRLRPGMSVQVTIDTRE 357 G + I + L GM+V I T Sbjct: 416 QRLGLVFNVIISIEENCLSTGNKN--IPLSSGMAVTAEIKTGM 456
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 30.6 bits (69), Expect = 0.012 Identities = 26/195 (13%), Positives = 76/195 (38%), Gaps = 19/195 (9%) Query: 16 RIVPFIMLLYFIAFLDRVNIGFAALTMNNDLGFSPSVFGFGAGIFFLGYFLFEVPSNLIL 75 +I+ ++ +L F + L+ + + + + ND P+ + + L + Sbjct: 14 QILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNW----VNTAFMLTFSIGTAVY 69 Query: 76 HKVGARIWIARVMISWGIVSGA---MAFVQGTTSFYSL--RFLLGVAEAGFFPGIILYLS 130 K+ ++ I R+++ I++ + FV + + RF+ G A F +++ ++ Sbjct: 70 GKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVA 129 Query: 131 YWFPAKKRAQVTAIFMAAAPISTALGSPVSAALLEMHGLLGMTGWQWMFLLEAVPAVVLG 190 + P + R + + + + +G + + W ++ L+ ++ Sbjct: 130 RYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAH------YIHWSYLLLI----PMITI 179 Query: 191 VMVLFWLTDRPEKAS 205 + V F + ++ Sbjct: 180 ITVPFLMKLLKKEVR 194
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.007 Identities = 12/31 (38%), Positives = 14/31 (45%) Query: 37 LLGPSGCGKSTLLRLLAGLSVPAAGEIRFGD 67 L G G GKSTL+ L GL + G Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIGT 631
>PF05932#Tir chaperone protein (CesT) Length = 127 Score = 25.9 bits (57), Expect = 0.037 Identities = 7/17 (41%), Positives = 10/17 (58%) Query: 88 FYQNLVARLERSLGIGP 104 FY+ L+ RSL + P Sbjct: 5 FYKTLLDDFSRSLEMQP 21
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 47.7 bits (113), Expect = 4e-09 Identities = 24/157 (15%), Positives = 60/157 (38%), Gaps = 11/157 (7%) Query: 3 AAMRVALQGGLGAMTVRQIAAEAGVSTGQLHHHFTSIGELKAQAFVRLIREMLDIQLVAE 62 A+R+ Q G+ + ++ +IA AGV+ G ++ HF +L ++ + + +++L + Sbjct: 19 VALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQ 78 Query: 63 D-------ASWRERL---FSMLGSEDGRLDPYIRLWREGQLLCGSDSDIKEAYLLTMSMW 112 + RE L +E+ R ++ + G + +++A Sbjct: 79 AKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHK-CEFVGEMAVVQQAQRNLCLES 137 Query: 113 HEETVNIIRLGSASGEFHPADSAENIAWRLIALVCGL 149 ++ ++ + A + + GL Sbjct: 138 YDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL 174
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 56.4 bits (136), Expect = 8e-11 Identities = 77/360 (21%), Positives = 125/360 (34%), Gaps = 33/360 (9%) Query: 5 IFSLALGTFGLGMAEFGIMGVLPEIARDVGVSIPVA---GNMIAWYAFGVVIGAPVMALF 61 + ++AL G+G+ IM VLP + RD+ S V G ++A YA APV+ Sbjct: 11 LSTVALDAVGIGL----IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 62 SSRFSLKSVMLFLAALCIVGNTLFTFSSSYFMLATGRLVSGFPHGAFFGVGAIILSKVAP 121 S RF + V+L A V + + ++L GR+V+G GA I + + Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYI-ADITD 125 Query: 122 PGKVTAAVAGMIAGMTVANLVGVPAGTWLGHQFSWRYTFLGIAIFN-VAVLMSILWWVPT 180 + M A + G G +G FS F A N + L + Sbjct: 126 GDERARHFGFMSACFGFGMVAGPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCFLLPES 184 Query: 181 VFDRSTTRLREQFH---------FLSSPAPWLI--FAATLFGNAGVFTWFSYIKPFMIHV 229 RE + ++ A + F L G W F Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVI----FGEDR 240 Query: 230 SGFSESAMIAIMMLVG--LGMVIGNLLSGKISARYSPLRIAAATDGAIVVVMLLIFLFGE 287 + + I I + L + +++G ++AR R A +L+ Sbjct: 241 FHWD-ATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATR 299 Query: 288 HKTASLLLAFLCCAGLFALAAPLQILLLQNARGGEMLGAAGGQIAF--NLGSAIGAFCGG 345 A ++ L G+ A LQ +L + E G G +A +L S +G Sbjct: 300 GWMAFPIMVLLASGGIGMPA--LQAMLSRQV-DEERQGQLQGSLAALTSLTSIVGPLLFT 356
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 32.5 bits (74), Expect = 0.012 Identities = 24/138 (17%), Positives = 48/138 (34%), Gaps = 15/138 (10%) Query: 1068 RLAEYQQFLQREAVSIDAFRQHQQRAFNEERERWIASGQAHFDSQEVAADAGDEAPLQRG 1127 RL ++ L ++A++ A + + + E + ++ + E + E Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNE--LRVYKSQLEQIESEILSAKEEYQLVT 293 Query: 1128 QQGVESPISGNLWQVQAAAGSH----VRAGDVLVVLESMKMEIPLLAPCDGVVQQMNVQ- 1182 Q ++ I L Q G + + + AP VQQ+ V Sbjct: 294 QL-FKNEILDKLRQTTDNIGLLTLELAKNEERQQASV-------IRAPVSVKVQQLKVHT 345 Query: 1183 PGASVRAGQRVAVILEEN 1200 G V + + VI+ E+ Sbjct: 346 EGGVVTTAETLMVIVPED 363
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 97.2 bits (242), Expect = 2e-25 Identities = 39/127 (30%), Positives = 65/127 (51%), Gaps = 1/127 (0%) Query: 6 HILVVDDDHDIRELVTDYLNKSGYRATGAANGKAMWSVLQGQHVDLIVLDIMLPGDDGLI 65 ILV DDD IR ++ L+++GY +N +W + DL+V D+++P ++ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 66 LCRQLRSHGQQNIPVLMLTARTDDSDRILGLEMGADDYLVKPFVARELLARIKAILRRTR 125 L +++ ++PVL+++A+ I E GA DYL KPF EL+ I L + Sbjct: 65 LLPRIKKAR-PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 126 ALPPNLQ 132 P L+ Sbjct: 124 RRPSKLE 130
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 67.4 bits (164), Expect = 2e-15 Identities = 64/256 (25%), Positives = 110/256 (42%), Gaps = 18/256 (7%) Query: 4 LQNKHLLIVGGSSGIGFAIAQRAGLEGARLTLMGRSQQRLDEARAMLKAQNIKVCETLAC 63 ++ K I G + GIG A+A+ +GA + + + ++L++ + LKA+ + E Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEA-RHAEAFPA 64 Query: 64 DAHDHDAL----QACFAKLAPFDHLVSMVGDAMGGGFLAASMETIEHV--IHSKFLTNVV 117 D D A+ ++ P D LV++ G G + S E E ++S + N Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124 Query: 118 IGKLAAKKLRSGGSL-IFTSGTGGRAQHACASY-VGNLGIQALVQGLAAEMAPEG-RVNA 174 R GS+ S G + + A+Y + L E+A R N Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184 Query: 175 VAPTWTVT----PFWREQSREQ--VDNTRQHFASVIPLGRTAEIDELASAYLFLMKND-- 226 V+P T T W +++ + + + + F + IPL + A+ ++A A LFL+ Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244 Query: 227 FVTGQQLAVDGGIMLG 242 +T L VDGG LG Sbjct: 245 HITMHNLCVDGGATLG 260
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 28.2 bits (63), Expect = 0.037 Identities = 5/22 (22%), Positives = 12/22 (54%) Query: 77 VQHKNVIESAKKAGVRHIIYTS 98 N++E + ++H++Y S Sbjct: 104 TGFLNILEGCRHNKIQHLLYAS 125
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 46.0 bits (109), Expect = 2e-07 Identities = 47/235 (20%), Positives = 85/235 (36%), Gaps = 10/235 (4%) Query: 7 RSTIALLASSLLLTIGRGATLPFMTIFLTRQYQLEVD---KIGYALSIALTVGVVFSMGF 63 R I +L++ L +G G +P + L R D G L++ + + Sbjct: 5 RPLIVILSTVALDAVGIGLIMP-VLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVL 63 Query: 64 GILADKFDKKRYMLIAVLAFICGFIAIPLVNSVNLVVFFFALINCAYSVFSTVLKAWFAD 123 G L+D+F ++ +L+++ + AI V++ ++ V A+ AD Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDY-AIMATAPFLWVLYIGRIVAGITGATGAVAGAYIAD 122 Query: 124 VLSPEKKARIFSLNYTFLNIGWTVGPPIGTLLVMHSINLPFWLAAACAALP-LVGIQLFV 182 + +++AR F G GP +G L+ S + PF+ AAA L L G L Sbjct: 123 ITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLP 182 Query: 183 QRTSAAIAQDNATQWSPSVLLRD----RALMWFTLSGLLASFVGGSFASCISQYV 233 + +P R + + VG A+ + Sbjct: 183 ESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG 237
>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature. Length = 538 Score = 27.3 bits (60), Expect = 0.029 Identities = 17/47 (36%), Positives = 28/47 (59%), Gaps = 4/47 (8%) Query: 20 EQLAELAGLSVRTIQRIENGDR-PGLETLSALAAVFEVNVADITGDS 65 E+ + G+SV + QR++NG+R G+E L+ L + N +TG S Sbjct: 23 EETGKHKGVSVISYQRVKNGERNKGIEALNRL---YLQNQTSLTGKS 66
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 32.1 bits (73), Expect = 5e-04 Identities = 25/138 (18%), Positives = 53/138 (38%), Gaps = 13/138 (9%) Query: 10 GISGVLIYICYMDIRWRRIPNRATLLILLLSCLAGFTHMPYP----------AFILPGIL 59 ++ VL+ + ++D+ +P++ TL +L L +++ L Sbjct: 139 LLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGYLVLWSL 198 Query: 60 LALGFIAVMVKLMGAGDIKLVCALAVALSVPETGNFLLLTAIAGIPVSLASLFYFYFFAR 119 + + MG GD KL+ AL L LLL+++ G + + + Sbjct: 199 YWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLIL---LRNH 255 Query: 120 EQRATVPYALAISCGYWL 137 Q +P+ ++ W+ Sbjct: 256 HQSKPIPFGPYLAIAGWI 273
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 140 bits (355), Expect = 3e-38 Identities = 73/271 (26%), Positives = 130/271 (47%), Gaps = 25/271 (9%) Query: 178 YDNVINRLQLPSSNQVNVKLTVVEVSKEFTDNLGIEWS----------SLTLDSIINGGG 227 + VI +L + QV V+ + EV NLGI+W+ + L G Sbjct: 333 LERVIAQLDIRR-PQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAG 391 Query: 228 NNGINTN-------SPGVFNLLGFRRGFDAGNISTLINAVKNDAIARVLAQPNLTVLSGE 280 N N + + + + G GF GN + L+ A+ + +LA P++ L Sbjct: 392 ANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNM 451 Query: 281 SASFLVGGEIPIMVKDQDSV------TVQYKEYGIRLNITAKVEKRQKIRLYVSNELSSV 334 A+F VG E+P++ Q + TV+ K GI+L + ++ + + L + E+SSV Sbjct: 452 EATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSV 511 Query: 335 TGSYAYNDYQIP-TMRTRRSSSTIELADGDSFVIGGLLSEADKESLTKVPFIGDIPVLGA 393 + + + T TR ++ + + G++ V+GGLL ++ ++ KVP +GDIPV+GA Sbjct: 512 ADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGA 571 Query: 394 LARSSMTERSKSELVVFATVNLVKPQAEAAA 424 L RS+ + SK L++F +++ + E Sbjct: 572 LFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQ 602
>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature. Length = 547 Score = 28.1 bits (62), Expect = 0.021 Identities = 13/36 (36%), Positives = 22/36 (61%) Query: 56 DSLVVWKLDRLGRSVRDLITLVSELQEKGIHFRSLT 91 D +VWK+DRLG+ + IT+ + ++G F + T Sbjct: 158 DGKLVWKIDRLGQGEKSKITVWVKPLKEGCCFTAAT 193
>YERSSTKINASE#Yersinia serine/threonine protein kinase signature. Length = 732 Score = 27.4 bits (60), Expect = 0.040 Identities = 32/117 (27%), Positives = 49/117 (41%), Gaps = 19/117 (16%) Query: 73 GQIVYSTASGEPVEI-----SALGELPAGVTT--KAPAGSYQKWDGENWVNDAEAKHQAE 125 G +V+ ASGEPV I S GE P G T KAP E V + A +++ Sbjct: 274 GNVVFDRASGEPVVIDLGLHSRSGEQPKGFTESFKAP---------ELGVGNLGASEKSD 324 Query: 126 VSSAIELLTELMR--EANAKIAPLNDAVELGIQTDEEVMQLTEWKKYRVALSRIDTS 180 V + L + E N +I P N + VM + +R ++ ++T+ Sbjct: 325 VFLVVSTLLHCIEGFEKNPEIKP-NQGLRFITSEPAHVMDENGYPIHRPGIAGVETA 380
>SURFACELAYER#Lactobacillus surface layer protein signature. Length = 439 Score = 30.4 bits (68), Expect = 0.006 Identities = 17/68 (25%), Positives = 30/68 (44%), Gaps = 3/68 (4%) Query: 156 SAFIEVAAGGDITATTAGSATINAPEIVLNGNVTINGNLSQGMGESGGTATMHGPVTVTN 215 + + V A I A +A +A NA V +VT + + + +S + G +T + Sbjct: 23 ATAMPVNAATTINADSAINANTNAKYDV---DVTPSISAIAAVAKSDTMPAIPGSLTGSI 79 Query: 216 DVTAGGKS 223 + GKS Sbjct: 80 SASYNGKS 87
>ARGDEIMINASE#Bacterial arginine deiminase signature. Length = 409 Score = 30.6 bits (69), Expect = 4e-04 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 6/52 (11%) Query: 30 WFHGLDWNFIALASGVIIGVA-TYLTNLYFKRRWTKMYQ---QSLDRGYGGP 77 W G N +A+A G II + ++TN F+ K+++ L RG GGP Sbjct: 348 WNDGA--NVLAIAPGEIIAYSRNHVTNKLFEENGIKVHRIPSSELSRGRGGP 397
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 81.2 bits (200), Expect = 3e-22 Identities = 31/82 (37%), Positives = 52/82 (63%) Query: 26 APEQLVSTPPVYPYYALANHLDGEVKIRFDVGANGKVEKMWILTSEPQHLFDDAVISAVA 85 P L P YP A A ++G+VK++FDV +G+V+ + IL+++P ++F+ V +A+ Sbjct: 152 GPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMR 211 Query: 86 KWRFESNKPYKGMTKTIRFKLN 107 +WR+E KP G+ I FK+N Sbjct: 212 RWRYEPGKPGSGIVVNILFKIN 233
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 41.3 bits (97), Expect = 5e-06 Identities = 65/308 (21%), Positives = 119/308 (38%), Gaps = 39/308 (12%) Query: 33 PFFPVWLADVNHLTK--TETGVVFSAISLFAIICQPIFGLISDKLGLRKHLLWTITILLI 90 P P L D+ H G++ + +L C P+ G +SD+ G R LL + L Sbjct: 26 PVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLL----VSLA 81 Query: 91 LFA-PFFIFVFSPLLQVNIIAGALVGGLYLGIVFSSGSGAVEAYIERVSRANRFEYGKVR 149 A + I +P L V + G +V G+ G + + + RA F + Sbjct: 82 GAAVDYAIMATAPFLWV-LYIGRIVAGI-TGATGAVAGAYIADITDGDERARHFGF---- 135 Query: 150 VAGCVGWALCAS--ITGILFGIDPNITFWIASGFALVLGVLLWFSRPESSNS------AQ 201 ++ C G+ + A + G++ G P+ F+ A+ + + F PES + Sbjct: 136 MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRRE 195 Query: 202 VMDALGANRQAFSLRVAAELLRMPRFWGFIIYVVG--VASVYDVFDQQFANFFKGFFADP 259 ++ L + R A + V A L+ FI+ +VG A+++ +F + ++ Sbjct: 196 ALNPLASFRWARGMTVVAALM----AVFFIMQLVGQVPAALWVIFGEDRFHW-------- 243 Query: 260 RRGTEVFGFVTTGGELLNALI-MFCAPAIVNRIGAKNALLTAGMIMSVRILGSSFATTAV 318 G +L++L + R+G + AL+ GMI T Sbjct: 244 --DATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM-LGMIADGTGYILLAFATRG 300 Query: 319 EVVILKML 326 + M+ Sbjct: 301 WMAFPIMV 308
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 379 bits (974), Expect = e-135 Identities = 199/284 (70%), Positives = 235/284 (82%) Query: 1 MRQYRFALLPLLAALALPGWAHQATVTTVKQAESQLQGRVGYAELDLASGQLLAGYRSDE 60 MR R ++ LLA L L A + +K +ESQL GRVG E+DLASG+ L +R+DE Sbjct: 1 MRYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADE 60 Query: 61 RFPMMSTFKVLLCGAVLSRVDAGEEQLDRRIHYRQQDLVEYSPVTEKHLTDGLTVGELCA 120 RFPMMSTFKV+LCGAVL+RVDAG+EQL+R+IHYRQQDLV+YSPV+EKHL DG+TVGELCA Sbjct: 61 RFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCA 120 Query: 121 AAITLSDNTAANLLLTTLGGPQGLTSFLRHSGDQTSRLDRWETELNEARPGDVRDTTTPQ 180 AAIT+SDN+AANLLL T+GGP GLT+FLR GD +RLDRWETELNEA PGD RDTTTP Sbjct: 121 AAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPA 180 Query: 181 AMARTLRNLLTGRVLSSASQQQLQRWMVEDKVAGPLLRSVLPAGWFIADKTGAGNRGSRG 240 +MA TLR LLT + LS+ SQ+QL +WMV+D+VAGPL+RSVLPAGWFIADKTGAG RG+RG Sbjct: 181 SMAATLRKLLTSQRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGERGARG 240 Query: 241 IIAALGPDGKAARIVVIYLTGTPATMDERNKQIAAIGATLVTHW 284 I+A LGP+ KA RIVVIYL TPA+M ERN+QIA IGA L+ HW Sbjct: 241 IVALLGPNNKAERIVVIYLRDTPASMAERNQQIAGIGAALIEHW 284
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 94.7 bits (235), Expect = 2e-25 Identities = 59/180 (32%), Positives = 87/180 (48%), Gaps = 4/180 (2%) Query: 2 IILITGATAGFGESMTRRFIANGHKVIATGRREERLKTLQDELGNNLYTAQ---LDVRNR 58 I ITGA G GE++ R + G + A E+L+ + L A+ DVR+ Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69 Query: 59 AAIEEMIAGLPAEWQAIDVLVNNAGLALGLEPAHKASVEDWEDMIDTNNKGLVYMTRAVL 118 AAI+E+ A + E ID+LVN AG+ L H S E+WE N+ G+ +R+V Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 119 PGMVERNRGHIINIGSTAGSWPYSGGNVYGATKAFVRQFSLNLRTDLHGTAIRVTDVEPG 178 M++R G I+ +GS P + Y ++KA F+ L +L IR V PG Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 46.8 bits (111), Expect = 1e-07 Identities = 33/137 (24%), Positives = 58/137 (42%), Gaps = 21/137 (15%) Query: 73 VGAFIFGRMGDKIGRKRVLFITITMMGICTTLIGVLPTYAQVGIFAPVLLVTLRIIQGLG 132 +G ++G++ D++G KR+L I + + + V ++ + I A R IQG G Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMA-------RFIQGAG 116 Query: 133 AGAEISGAGTMLAEYAPKGKR----GIISSLVAMGTNCGTLSATAIWAIMFFLLDREELV 188 A A + ++A Y PK R G+I S+VAMG G I + Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMI----------AHYI 166 Query: 189 AWGWRIPFLASAVVMIF 205 W + + ++ + Sbjct: 167 HWSYLLLIPMITIITVP 183
>PF01206#SirA family protein Length = 76 Score = 90.2 bits (224), Expect = 4e-28 Identities = 16/71 (22%), Positives = 38/71 (53%) Query: 7 DYRLDMVGEPCPYPAVATLEAMPSLKKGEILEVVSDCPQSINNIPQDARNYGYTVLDIQQ 66 D LD G CP P + + + ++ GE+L V++ P S+ + ++ G+ +L+ ++ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 67 DGPTIRYLIQK 77 + T + +++ Sbjct: 65 EDGTYHFRLKR 75
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 27.4 bits (61), Expect = 0.045 Identities = 13/27 (48%), Positives = 17/27 (62%), Gaps = 1/27 (3%) Query: 1 MKIGIIG-AGFVGRSIAKLALAAGHDV 26 MK + G AGF+G ++K L AGH V Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQV 27
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 93.2 bits (231), Expect = 8e-25 Identities = 69/264 (26%), Positives = 114/264 (43%), Gaps = 25/264 (9%) Query: 6 ALEGKRVLITSGTKGAGAATVALFRQLGARVLT--CARHQPDAAVDALFVTA-------- 55 +EGK IT +G G A GA + + + V +L A Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64 Query: 56 DLSTASGCAVLATAVQELLGGVDIIVHMLGGSSSPAGGFAALSDALWQQELDLNLFPALR 115 D+ ++ + ++ +G +DI+V++ G G +LSD W+ +N Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAG--VLRPGLIHSLSDEEWEATFSVNSTGVFN 122 Query: 116 LDRLLLPSMLNSGKGVIIHVSSIQRKMPLPESTTAYAAAKSALSTYSKSLSKELSPRGVR 175 R + M++ G I+ V S +P S AYA++K+A ++K L EL+ +R Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVP-RTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181 Query: 176 VLSVAPGWIETEAAVRFAQRLAQQEGVDYAQGKQIIMDSLG----GIPLGRPAQPEEVAN 231 V+PG ET+ + D +Q+I SL GIPL + A+P ++A+ Sbjct: 182 CNIVSPGSTETD--------MQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIAD 233 Query: 232 LIAFLASDRAASITGAEYVIDGGT 255 + FL S +A IT +DGG Sbjct: 234 AVLFLVSGQAGHITMHNLCVDGGA 257
>PF07520#Virulence protein SrfB Length = 1041 Score = 29.2 bits (65), Expect = 0.027 Identities = 21/117 (17%), Positives = 44/117 (37%), Gaps = 11/117 (9%) Query: 29 ISQPALSLTIKGLEEGLGGALLQRSTRRVTLTQEG--EIFLPMARQLLADWDNAEEAMRQ 86 I+Q + + E GG + + + V ++ + +P+A +L+ ++AE Sbjct: 656 IAQAGGQFVAERMRELFGGDIGGQEQQTVQRRRQFSIRVLVPLAEAILSACEDAE----- 710 Query: 87 SFTLQRGKISIAAMPSFAANVLPEVLKAFRDRYAGINVT--VHDVINEQVIEMVREG 141 R I +A + + E A VT + D + + ++ EG Sbjct: 711 --EADRIDIPVADVLGLVPTPVGEEGDEEGHEDASPQVTDEILDYLEKPATQLGAEG 765
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 106 bits (266), Expect = 6e-30 Identities = 64/252 (25%), Positives = 105/252 (41%), Gaps = 3/252 (1%) Query: 3 LNGKVALVTGSTSGIGLGIAKVLAKSGAQLILNGFGDSASARAEVAQ--IGKTPGYHDAD 60 + GK+A +TG+ GIG +A+ LA GA + + + + + AD Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 61 LRDVQQIEAMMAYAEAEFGGVDIVINNAGIQHVAPVEHFAVEKWNDIIAINLSSVFHTSR 120 +RD I+ + A E E G +DI++N AG+ + + E+W ++N + VF+ SR Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 121 LALPGMRARNWGRIINIASVHGLVASKDKSAYVAAKHGVIGLSKTLALETARTGVTCNAI 180 M R G I+ + S V +AY ++K + +K L LE A + CN + Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 181 CPGWVLTPLVQQQIDKRIAEGCDPHQAREQLLAEKQPSGEFVTPEQLGELALFLCSEGAA 240 PG T + + E P + P + + LFL S A Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLET-FKTGIPLKKLAKPSDIADAVLFLVSGQAG 244 Query: 241 QVRGAAWNMDGG 252 + +DGG Sbjct: 245 HITMHNLCVDGG 256
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 119 bits (298), Expect = 1e-34 Identities = 70/254 (27%), Positives = 112/254 (44%), Gaps = 10/254 (3%) Query: 3 KVALVTGAAQGIGKAIALRLVKDGFAVAIADYNDTAAKAVAAEINQHGGRALAVKVDVSR 62 K+A +TGAAQGIG+A+A L G +A DYN + V + + A A DV Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 63 REQVFAAVEQARKTLGGFHVIVNNAGIAPSTPIEAIVEETVDKVYNINVKGVIWGIQAAV 122 + + + +G ++VN AG+ I ++ +E + +++N GV ++ Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 123 DAFKKEGHGGKIINACSQAGHVGNPELAVYSSSKFAVRGLTQTAARDLAPLGITVNGYCP 182 G I+ S V +A Y+SSK A T+ +LA I N P Sbjct: 129 KYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 183 GIVKTPM----WAEIDRQVSEAAGKPLGYGTAEFAKRITLGRLSEPEDVAACVSYLASPD 238 G +T M WA+ + G F I L +L++P D+A V +L S Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGS-----LETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242 Query: 239 SDYMTGQSLLIDGG 252 + ++T +L +DGG Sbjct: 243 AGHITMHNLCVDGG 256
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 29.8 bits (66), Expect = 0.014 Identities = 12/39 (30%), Positives = 21/39 (53%) Query: 247 QAVAALQQLLEILPANDARRQAIERQLAQAQAQAQASAR 285 +A+++LQ + L A A +A A+ QA A+A + Sbjct: 195 EAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRK 233
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 38.7 bits (90), Expect = 3e-05 Identities = 69/341 (20%), Positives = 116/341 (34%), Gaps = 23/341 (6%) Query: 30 VPLISLELAQQQTDTVYVGLLAALPPAGMMLSSFLSPALCRRFEIGTLLTANLILLALAT 89 +P + +L T + G+L AL + + AL RF +L +L A+ Sbjct: 28 LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDY 87 Query: 90 IASCLTVDLQQLLLPRFLTGIASGVIIVLGESWITGGAAGKNRATLTGIYASAFTGCQLA 149 L L + R + GI V G ++I G RA G ++ F +A Sbjct: 88 AIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIADITDGDERARHFGFMSACFGFGMVA 146 Query: 150 GPLL------ISAGADYQIWVLLAVVGLTAACLLMLRHLPGGSRESLAERA-------SW 196 GP+L S A + L + C L+ G R L A W Sbjct: 147 GPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL-PESHKGERRPLRREALNPLASFRW 205 Query: 197 RSLGAFLPVLASGVFCFAFFDASILALLPLYGMDK-GLNEAMAVLLVTIVLTGDAFFQAP 255 + L + F AL ++G D+ + + + + QA Sbjct: 206 ARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAM 265 Query: 256 L-GWVADKFGIRRVHLSCAVVFCLALAALPFLLASPVQLIVGCLILGAAAG--ALYTLSL 312 + G VA + G RR + + L F + + L+ G AL + Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLS 325 Query: 313 VRAGKTFSGQKLIMINALLGFFWSAGSVAGPVVSSLLISVS 353 + + GQ + L S S+ GP++ + + + S Sbjct: 326 RQVDEERQGQ----LQGSLAALTSLTSIVGPLLFTAIYAAS 362
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.020 Identities = 14/52 (26%), Positives = 22/52 (42%) Query: 9 KRFGDSHVLRGISCDIKPQEVVCIIGPSGSGKSTFLRCMNALESVSEGIVEV 60 K HV R + K V + G G GKST + + L+ S+ ++ Sbjct: 578 KYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDI 629
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 43.0 bits (101), Expect = 8e-07 Identities = 32/150 (21%), Positives = 57/150 (38%), Gaps = 14/150 (9%) Query: 39 PQRIVSMHDLDITIPLIELGVPPVASHGRTRPDGSHYLRSSAQLTGVDFDNSAIQFIGTA 98 P RIV++ L + + L+ LG+ P D +Y ++ +S I Sbjct: 35 PNRIVALEWLPVEL-LLALGIVPYGV-----ADTINYRLWVSEPP---LPDSVIDVGLRT 85 Query: 99 DIDLEAVAAAKPDLIITEPSRHVSVEQLEKIAPTVSIDHLQGSAP-----RIYSKLAQLT 153 + +LE + KP ++ S E L +IAP + G P + +++A L Sbjct: 86 EPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLL 145 Query: 154 GSQARLAILERRYQAQIAQLKAMVDTRNIT 183 Q+ +Y+ I +K R Sbjct: 146 NLQSAAETHLAQYEDFIRSMKPRFVKRGAR 175
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.020 Identities = 13/32 (40%), Positives = 18/32 (56%), Gaps = 4/32 (12%) Query: 43 LRPG---ESVALL-GPSGCGKSTLLRLLAGLE 70 + PG + +L G G GKSTL+ L GL+ Sbjct: 589 MEPGCKFDYSVVLEGTGGIGKSTLINTLVGLD 620
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 31.8 bits (72), Expect = 0.002 Identities = 32/151 (21%), Positives = 62/151 (41%), Gaps = 7/151 (4%) Query: 41 LLLWLCY--FFTLLVVYMLINWLPMLLIGQGFRAGQAAGVMFVLQLGAACGTLLLGALLD 98 +L+WLC FF++L +L LP + V L + GT + G L D Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74 Query: 99 KLSPLLMSLLIYSGM---LASLLALGCATSFPAMLFSGFVAGLFATGGQSVLYALAPLFY 155 +L + LL++ + S++ + F ++ + F+ G A +++ + + Sbjct: 75 QLG--IKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYI 132 Query: 156 RTEIRATGVGTAVAVGRLGAMSGPLLAGKML 186 E R G ++ +G GP + G + Sbjct: 133 PKENRGKAFGLIGSIVAMGEGVGPAIGGMIA 163
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 51.0 bits (122), Expect = 3e-10 Identities = 44/161 (27%), Positives = 74/161 (45%), Gaps = 5/161 (3%) Query: 1 MTTTTSGSASRL-MLTIGLCFMVALMEGLDLQAAGIAAAGMAHAFALDNMQMGWIFSAGI 59 M T+ S S R + I LC + L+ ++ +A+ F W+ +A + Sbjct: 1 MNTSYSQSNLRHNQILIWLCILS-FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFM 59 Query: 60 LGLLPGALVGGILADRHGRKRVLIASVMLFGLFSIATALAGS-FPLLLLARLMTGVGLGA 118 L G V G L+D+ G KR+L+ +++ S+ + S F LL++AR + G G A Sbjct: 60 LTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG-AA 118 Query: 119 ALPNLIA-LTSEAAGPRFRGTAVSLMYCGVPVGAALAAALG 158 A P L+ + + RG A L+ V +G + A+G Sbjct: 119 AFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIG 159
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 47.1 bits (112), Expect = 8e-08 Identities = 77/409 (18%), Positives = 133/409 (32%), Gaps = 49/409 (11%) Query: 31 LAIGTMINYLDRTVLGIAAPSLTSEL------GIDAAVMGIVFSAFAWTYALAQIPGGLF 84 L + LD +G+ P L L A GI+ + +A G Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 85 LDRFGNKVTYFLSLTLWSLFTLFHGMAIGLKTLLLCRFGLGVSEAPCFPVNSRVVSAWFP 144 DRFG + +SL ++ A L L + R G++ A V ++ Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGAT-GAVAGAYIADITD 125 Query: 145 QQERAKA----TAVYTVGEYLGLACFSPLLFWIMGSFGWRALFISVGAVGVLFALVWWRC 200 ERA+ +A + G G P+L +MG F A F + A+ L L Sbjct: 126 GDERARHFGFMSACFGFGMVAG-----PVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFL 180 Query: 201 YREPHEDKRLSQQEREHIVNGGGLVTASDQQTAFSWPLVRQLLGKRQILGASIGQFAGNT 260 E H+ +R + A + +F W ++ + + Sbjct: 181 LPESHKGERRPLRRE-----------ALNPLASFRWARGMTVVAALMAVFFIMQLVGQ-- 227 Query: 261 VLVFFLTWFPTYLATARHMPWLKVGIFAILPFLAAAGGVM---FGGWISDKLLKATGSAN 317 + + H +G AA G++ I+ + G Sbjct: 228 ---VPAALWVIFGEDRFHWDATTIG------ISLAAFGILHSLAQAMITGPVAARLGERR 278 Query: 318 LARKLPIVAGLL--MASSIISANWLSSDLAVILVMSFAFFGQGMVGLGWTLISDIAPKGL 375 ++ G++ I+ A +A +++ A G GM L ++S + Sbjct: 279 A-----LMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQ-AMLSRQVDEER 332 Query: 376 GGLTGGLFNFCANFAGILTPLVIGFIVAAFGDFFYALIYIGGAALLGVV 424 G G + I+ PL+ I AA + +I GAAL + Sbjct: 333 QGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLC 381
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 63.5 bits (154), Expect = 1e-14 Identities = 30/167 (17%), Positives = 66/167 (39%), Gaps = 4/167 (2%) Query: 1 MSVVAHDEAQSLKERIFTAAIVVFAEHGLSGARMEQIATEAQTTKRMVVYYFKTKEQLYQ 60 M+ EAQ ++ I A+ +F++ G+S + +IA A T+ + ++FK K L+ Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EVLQHVYARIRETEQQLGLEQLP-PVEALVQLVRWSVR--YHATHADFMRVICMENMQR- 116 E+ + + I E E + + P+ L +++ + + I + Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 117 GKWLQSSGQLKPLNRTALSILEDILQRGQQQGIFQEGLQARDVHRLI 163 G+ + L + +E L+ + + L R ++ Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM 167
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 119 bits (299), Expect = 6e-39 Identities = 35/89 (39%), Positives = 55/89 (61%) Query: 4 TKAEMSEYLFDKLGLSKRDAKELVELFFEEVRRALENGEQVKLSGFGNFDLRDKNQRPGR 63 K ++ + + L+K+D+ V+ F V L GE+V+L GFGNF++R++ R GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 64 NPKTGEDIPITARRVVTFRPGQKLKSRVE 92 NP+TGE+I I A +V F+ G+ LK V+ Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 219 bits (560), Expect = 3e-74 Identities = 180/240 (75%), Positives = 199/240 (82%), Gaps = 6/240 (2%) Query: 1 MTLDLPRRFPWPTLLSVAIHGAVVAGLLYTSVHQVIERPSPSQPIEITMVAPADLEPPQA 60 MTLDLPRRFPWPTLLSV IHGAVVAGLLYTSVHQVIE P+P+QPI +TMV PADLEPPQA Sbjct: 1 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQA 60 Query: 61 AQPVVEPVVEPEPEPEVVPEPEPPKEAPVVIHKPEPKPKPKPKPKPKPEKKVEQPKRDVK 120 QP EPVVEPEPEPE P PEPPKEAPVVI KP+PKPKPKPKP K + EQPKRDVK Sbjct: 61 VQPPPEPVVEPEPEPE--PIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQ---EQPKRDVK 115 Query: 121 PAETRVASPFETTNTAPARTQPNAAPATAKPTLTAPSGPRALSRNQPAYPARAQALRIEG 180 P E+R ASPFE T A T A AT+KP + SGPRALSRNQP YPARAQALRIEG Sbjct: 116 PVESRPASPFENTAPA-RLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEG 174 Query: 181 SVRVKFDVTPDGRVDNVEILSAQPANMFERDVKNALRKWRYEAGKPGTGVTMTIKFRLKG 240 V+VKFDVTPDGRVDNV+ILSA+PANMFER+VKNA+R+WRYE GKPG+G+ + I F++ G Sbjct: 175 QVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKING 234
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.0 bits (70), Expect = 0.007 Identities = 9/16 (56%), Positives = 11/16 (68%) Query: 55 VVGESGCGKSTFARAI 70 + GESG GK ARA+ Sbjct: 165 ITGESGTGKELVARAL 180
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 29.1 bits (65), Expect = 0.037 Identities = 17/58 (29%), Positives = 29/58 (50%), Gaps = 1/58 (1%) Query: 128 TPFSVFVIISLLCGFAGANF-ASSMANISFFFPKQKQGGALGLNGGLGNMGVSVMQLV 184 + FS+ ++ + G A F A M ++ + PK+ +G A GL G + MG V + Sbjct: 101 SFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAI 158
>PF06580#Sensor histidine kinase Length = 349 Score = 47.9 bits (114), Expect = 5e-08 Identities = 27/116 (23%), Positives = 47/116 (40%), Gaps = 9/116 (7%) Query: 476 FGFTVQLDYQLPPRFVPSHQAIHVLQIAREALSNALKHAQAT-----EVTVTVSLRDNQV 530 F +Q + Q+ P + ++Q E N +KH A ++ + + + V Sbjct: 236 FEDRLQFENQINPAIMDVQVPPMLVQTLVE---NGIKHGIAQLPQGGKILLKGTKDNGTV 292 Query: 531 RLVVADNGRGVPDQAERSNHYGLIIMRDRAQSLRG-DCQVRRRENGGTEVVVTFIP 585 L V + G + S GL +R+R Q L G + Q++ E G + IP Sbjct: 293 TLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 72.6 bits (178), Expect = 4e-17 Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 2/118 (1%) Query: 6 RATILLIDDHPMLRTGVKQLVSMASDIQVIGEASNGEQGIALAETLDPDLILLDLNMPGM 65 ATIL+ DD +RT + Q +S A V SN D DL++ D+ MP Sbjct: 3 GATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 66 NGLETLDKLREKSLSGRIVVFSVSNHEEDVVTALKRGADGYLLKDMEPEDLLKALQQA 123 N + L ++++ ++V S N + A ++GA YL K + +L+ + +A Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118
>INTIMIN#Intimin signature. Length = 939 Score = 251 bits (641), Expect = 1e-76 Identities = 133/445 (29%), Positives = 218/445 (48%), Gaps = 18/445 (4%) Query: 1 MPVSYRVTPLLPLLLLLAGLPARALQGNTAFSEKQAALPDLGIAPQVDDDARHFAEIAKK 60 +P Y PLL L+A A + G+T K + PD+ + DD A ++A Sbjct: 117 LPFEYSALPLLGSAPLVA---AGGVAGHTNKLTKMS--PDVTKSNMTDDKALNYAAQQAA 171 Query: 61 FGEASMSDNGLTTGEQARMLAIGKLGNELSHQLENWLSPWGNANVNLRVDKEGNFTGSQG 120 + + L G+ A+ A+G GN+ S QL+ WL +G A VNL+ NF GS Sbjct: 172 SLGSQLQSRSLN-GDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGN--NFDGSSL 228 Query: 121 NWFIPLQDNGDYLTWNQYSVTQRENDLVGNIGLGQRWRRGDWLLGYNSFYDKVLGEHIAR 180 ++ +P D+ L + Q ++ N+G GQR+ + +LGYN F D+ R Sbjct: 229 DFLLPFYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTR 288 Query: 181 GSIGAEAWGEYLRLSANYYHPVGSWQHG-DSLTQEQRMASGYDVTAQARLPFYQHINTSV 239 IG E W +Y + S N Y + W + ++R A+G+D+ LP Y + + Sbjct: 289 LGIGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKL 348 Query: 240 SVEQYFGDSVDLFHSGTGYRNPVAVSVGLNYTPVPLVTMTAKHKQGESGLSQNNVGLNLN 299 EQY+GD+V LF+S NP A +VG+NYTP+PLVTM ++ G + + Sbjct: 349 MYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFR 408 Query: 300 YRFGVPLKQQLAADEVAVSRSLRGSRYDSPERDNLPVVEYRQRKSLSVYLATPPWDLQPG 359 Y+F P QQ+ V R+L GSRYD +R+N ++EY+++ LS+ + + Sbjct: 409 YQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNI-PHDINGTER 467 Query: 360 ETVQLKLQIRSLHGIKSLKWQGDTQSLSLTSPVDTNSPDG---WSVILPAWSSDPGATNL 416 T +++L ++S +G+ + W D+ S + + + ILPA+ G +N+ Sbjct: 468 STQKIQLIVKSKYGLDRIVWD-DSALRSQGGQIQHSGSQSAQDYQAILPAYV--QGGSNV 524 Query: 417 WHLSVVVEDKTGQRVSSNEIALALT 441 + ++ D+ G SSN + L +T Sbjct: 525 YKVTARAYDRNGN--SSNNVLLTIT 547
>PF06580#Sensor histidine kinase Length = 349 Score = 212 bits (542), Expect = 1e-67 Identities = 57/234 (24%), Positives = 103/234 (44%), Gaps = 21/234 (8%) Query: 172 HANNIQRELRQKEQELTGELRKRVEIERSLHEAEFKALSYQINPHFLFNVLNTIGRLAFL 231 + + +Q E + ++ EA+ AL QINPHF+FN LN I L L Sbjct: 136 FGWHFFKNYKQAE-------IDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALI-L 187 Query: 232 EDASRTETMVHDFSDMMRYLLRKNNNGLITLGREMNYVNCYMAIQKVRMNERFDYVCDIP 291 ED ++ M+ S++MRY LR +N ++L E+ V+ Y+ + ++ +R + I Sbjct: 188 EDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQIN 247 Query: 292 EKYNETVCPFLILQPLVENFFNYVVEPRETKSHIILRATDDGKDVIIEIADNGDGISPED 351 + P +++Q LVEN + + I+L+ T D V +E+ + G Sbjct: 248 PAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNT 307 Query: 352 IEHILSGNQNRQKGGIGINNINNRLQLLFGENYGLQIASPHKPMLGTTVKLRFP 405 ++ G G+ N+ RLQ+L+G ++++ + P Sbjct: 308 ----------KESTGTGLQNVRERLQMLYGTEAQIKLSEKQG---KVNAMVLIP 348
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.030 Identities = 16/42 (38%), Positives = 21/42 (50%), Gaps = 1/42 (2%) Query: 53 VTLLGPSGCGKSTLLKMVAGLVEPSDGKLMLW-RRDSREKAQ 93 V L G G GKSTL+ + GL SD + +DS E+ Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIA 640
>BINARYTOXINA#Clostridial binary toxin A signature. Length = 454 Score = 30.4 bits (68), Expect = 0.018 Identities = 26/111 (23%), Positives = 46/111 (41%), Gaps = 10/111 (9%) Query: 147 RLASLGGLYCGGFGGIGS--INYGPLAAPGNVLSVKVMTVEASPRVLTIPAPQALLLHHA 204 LA + GG+ I + I+ GPL P L KV +E + ++ P P L+++ Sbjct: 280 ELADVNDYMRGGYTAINNYLISNGPLNNPNPELDSKVNNIENALKLT--PIPSNLIVYRR 337 Query: 205 YGSNGIILEVELALAPVHRWIERLDVFDDFADALKYANECLRSPGFVKRQL 255 G E L L +++ D F + K+ + + P F+ + Sbjct: 338 SGP----QEFGLTLTSPEYDFNKIENIDAFKE--KWEGKVITYPNFISTSI 382
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 520 bits (1342), Expect = 0.0 Identities = 168/397 (42%), Positives = 256/397 (64%), Gaps = 11/397 (2%) Query: 7 VLVINCGSSSIKFSVLDASSCDVLMAGIADGINSENAFLSIN-DGEPVK--LAQRNYEGA 63 +LVINCGSSS+K+ ++++ +VL G+A+ I ++ L+ N +GE +K ++++ A Sbjct: 3 ILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKDA 62 Query: 64 LKAIAFELEKRSL-----IDSVALIGHRIAHGGNIFTQSVVITEEVIDNIRRVSPLAPLH 118 +K + L + + +GHR+ HGG FT SV+IT++V+ I LAPLH Sbjct: 63 IKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIELAPLH 122 Query: 119 NYANLSGIASAQQLFPGVRQVAVFDTSFHQTLAPEAYLYGLPWKYFEELGVRRYGFHGTS 178 N AN+ GI + Q+ P V VAVFDT+FHQT+ AYLY +P++Y+ + +R+YGFHGTS Sbjct: 123 NPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFHGTS 182 Query: 179 HRYVAQRAQTLLALPEADSGLVIAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRCG 238 H+YV+QRA +L P ++ HLGNG+SI AV+NG+S+DTSMG TPLEGL MGTR G Sbjct: 183 HKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGTRSG 242 Query: 239 DVDFGAMAWIACETNQTLSDLERVVNKESGLLGISGLSSDLR-VLEKAWHAGHERARLAI 297 +D ++++ + N + ++ ++NK+SG+ GISG+SSD R + + A+ G +RA+LA+ Sbjct: 243 SIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQLAL 302 Query: 298 KTFVHRIARHIAGHAASLHRLDGIIFTGGIGENSLLIRRLVIEHLAVLGVTLDPDMNSLP 357 F +R+ + I +AA++ +D I+FT GIGEN IR +++ L LG LD + N + Sbjct: 303 NVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKNKVR 362 Query: 358 NSHGERIISTDSARVICAVIPTNEEKMIALDAIHLGK 394 E IIST ++V V+PTNEE MIA D + + Sbjct: 363 GE--EAIISTADSKVNVMVVPTNEEYMIAKDTEKIVE 397
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 46.3 bits (109), Expect = 1e-07 Identities = 73/325 (22%), Positives = 136/325 (41%), Gaps = 34/325 (10%) Query: 14 LAMLVISGSAAAATQVNALFM-TQAAYSENDIRAMTADFEKQNPDIKINLEFVPYEALHD 72 L ++ S SA A + L + N + + FEK + IK+ +E + L + Sbjct: 15 LTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEK-DTGIKVTVEHP--DKLEE 71 Query: 73 KIVAARGAGGNGYDVVLFDAIWPAEFSHFDLLQDVSARIPAAEREKVFPGAMNTVVYKGK 132 K A G+G D++ + ++ LL +++ A ++K++P + V Y GK Sbjct: 72 KFPQV-AATGDGPDIIFWAHDRFGGYAQSGLLAEITP--DKAFQDKLYPFTWDAVRYNGK 128 Query: 133 TLGMPWILDTKYLYYNKAMLDKAGIKQVPTTWQQV--LDDAKIIKDKGIVKYPLVWSWSQ 190 + P ++ L YNK +L P TW+++ LD K K + + L + Sbjct: 129 LIAYPIAVEALSLIYNKDLLPNP-----PKTWEEIPALDKELKAKGKSALMFNLQEPYFT 183 Query: 191 AEALVCDYTTLVSGFGGQFYQNGKLDFS-----TPASLKAVTLMKQSLDDGLSNPASREY 245 + D G+ ++ +NGK D + +T + + + N + Sbjct: 184 WPLIAAD-----GGYAFKY-ENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYS 237 Query: 246 LEEDVRKAFSNGDAAFALNWTYMYNMANDPKQSKVAGDVGIMPAPGDTPDKPGAVNGSMG 305 + E AF+ G+ A +N + + ++ SKV V ++P P KP G + Sbjct: 238 IAE---AAFNKGETAMTINGPWAW---SNIDTSKVNYGVTVLPTFKGQPSKPFV--GVLS 289 Query: 306 LGIAKASQHPEQAWQYI-HYLTSQP 329 GI AS + E A +++ +YL + Sbjct: 290 AGINAASPNKELAKEFLENYLLTDE 314
>PF05272#Virulence-associated E family protein Length = 892 Score = 35.0 bits (80), Expect = 5e-04 Identities = 15/55 (27%), Positives = 18/55 (32%), Gaps = 9/55 (16%) Query: 33 VLVGPSGCGKSTLLRLLAGLESLSAGTILMNNRPINDLDPADRDVAMVFQSYALY 87 VL G G GKSTL+ L GL+ S +D Y Sbjct: 600 VLEGTGGIGKSTLINTLVGLDFFSDTHF---------DIGTGKDSYEQIAGIVAY 645
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 61.8 bits (150), Expect = 1e-12 Identities = 60/208 (28%), Positives = 87/208 (41%), Gaps = 19/208 (9%) Query: 61 GRFSDRYGRRPALLAGLAAYALGCVLALAARDFTLLLVARMLSAFGAATGSVVSQTILRD 120 G SDR+GRRP LL LA A+ + A +L + R+++ ATG+V I D Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYI-AD 122 Query: 121 RFNGRQLAALFSVMGLVLAVSPAAGVYLGGAIVAGWGMHGVLTALALLAAVLLLLCCWLL 180 +G + A F M AG LGG ++ G+ H A A L + L C+LL Sbjct: 123 ITDGDERARHFGFMSACFGFGMVAGPVLGG-LMGGFSPHAPFFAAAALNGLNFLTGCFLL 181 Query: 181 PET-----RPSQHVHTAFLPLLRRMATDKPLALAVLLVAAFNVSLFSYYSLAPF-IFAQL 234 PE+ RP + L R +A L+ F + L A + IF + Sbjct: 182 PESHKGERRPLRREALNPLASFRWARGMTVVAA--LMAVFFIMQLVGQVPAALWVIFGED 239 Query: 235 RQGSVIFGYS----GIALAVGSLAGALF 258 R F + GI+LA + +L Sbjct: 240 R-----FHWDATTIGISLAAFGILHSLA 262
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 430 bits (1107), Expect = e-153 Identities = 204/388 (52%), Positives = 249/388 (64%), Gaps = 37/388 (9%) Query: 1 MKRKTLALLVPPLLLAGAVNAAEIYNKNGNKLDFYGKMVGEHIFTDTDRNNSDNNSRDTT 60 MKRK LAL++P LL AGA +AAEIYNK+GNKLD YGK+ G H F+D + D T Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGD-----QT 55 Query: 61 YARFGVKGETQINSDLTGYGRFEYNIKADKPEGEQG-SATRLAFAGLKFANYGSFDYGRN 119 Y R G KGETQIN LTGYG++EYN++A+ EGE S TRLAFAGLKF +YGSFDYGRN Sbjct: 56 YMRVGFKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRN 115 Query: 120 YGLVYDAAGYTDMLVEWGGDGLVATDNFMTGRTNGIATYRNSDFFGLVDGLNIGLQYQGK 179 YG++YD G+TDML E+GGD DN+MTGR NG+ATYRN+DFFGLVDGLN LQYQGK Sbjct: 116 YGVLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGK 175 Query: 180 N----------------NNRDRLKANGDGYSTSVDYSID-GFGFAAAYSNSDRTDEQTAD 222 N N D NGDG+ S Y I GF AAY+ SDRT+EQ Sbjct: 176 NESQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNA 235 Query: 223 RK----GENAEVWSLAAKYDANSIYTAVMYAESHNMTPMANGV------FANKTQNIEAV 272 G+ A+ W+ KYDAN+IY A MY+E+ NMTP ANKTQN E Sbjct: 236 GGTIAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVT 295 Query: 273 VQYQFDFGLRPSLGYVYAQGKDLGTNGGK--DAEIMNYIELGTWYYFNKNFNVYSAYKFN 330 QYQFDFGLRP++ ++ ++GKDL N D +++ Y ++G YYFNKNF+ Y YK N Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKIN 355 Query: 331 LIDHKDSII--TGAAEDDQFAVGITYQF 356 L+D D G + DD A+G+ YQF Sbjct: 356 LLDDDDPFYKDAGISTDDIVALGMVYQF 383
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 668 bits (1724), Expect = 0.0 Identities = 240/890 (26%), Positives = 402/890 (45%), Gaps = 66/890 (7%) Query: 6 KNKEVYFRLALITCMIKASLFSHSAFAEEYRFDNNLLAGSGFAKGISLEKFNDTEVIIAP 65 K++ F + L A+ S+ E F+ LA L +F + + + P Sbjct: 20 KHRLAGFFVRLFVACAFAAQAPLSS--AELYFNPRFLADDP-QAVADLSRFENGQELP-P 75 Query: 66 GQQNLDLVLNGTKIKSQVPVKFQKINPTDKSAQLCLDAELIKLLQLKIQPSS----LPQV 121 G +D+ LN + ++ V F +++ CL + + L S L Sbjct: 76 GTYRVDIYLNNGYMATR-DVTFNT-GDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADD 133 Query: 122 PCLSLADITRQGSWRIKPSTLSVEFSIPQIMLNRPPRDYIPVAEWDAGAPLLFLRHNTSY 181 C+ L + + ++ + +IPQ ++ R YIP WD G L +N S Sbjct: 134 ACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSG 193 Query: 182 TRNILRDIHYS-YLWSMINAGANMGMWQLRHQANLRYMQS-STAGNAYKWNSVRTWVQRP 239 R S Y + + +G N+G W+LR Y S S++G+ KW + TW++R Sbjct: 194 NSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERD 253 Query: 240 LPAISSELMVGDTYTDSAMFGSISFNGVKLYTDQSMWPQGKLGYAPEIRGVANTNARVLI 299 + + S L +GD YT +F I+F G +L +D +M P + G+AP I G+A A+V I Sbjct: 254 IIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTI 313 Query: 300 RQAGHLIYETQVPPGPFLIDDLYNTRSQGDIEVQIIETDGKSSFFVVPYAAVPGSMRPGN 359 +Q G+ IY + VPPGPF I+D+Y + GD++V I E DG + F VPY++VP R G+ Sbjct: 314 KQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGH 373 Query: 360 MSYQLAAGKVRNYYSVQNA--FAEGVLQYGVNNRFTANTGARFANNYQALLAGGVFAS-E 416 Y + AG+ R+ + Q F + L +G+ +T G + A+ Y+A G Sbjct: 374 TRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGA 433 Query: 417 LGALSLNTTWSHAQVENNQSKSGWRAEAAYSKTF-PSNTNIVLAAYRYSTAGYRDLQDVL 475 LGALS++ T +++ + ++ G Y+K+ S TNI L YRYST+GY + D Sbjct: 434 LGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTT 493 Query: 476 GVRRQQQNG-------------ANYYSDTLKQRNRFTATLSQNMDEYGTLSLSGSSTDYY 522 R N +YY+ +R + T++Q + TL LSGS Y+ Sbjct: 494 YSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYW 553 Query: 523 NNRSRITELQLSYNNIWKKLSYNINVGRQRSSWSNTNYIYSVNDAEYDSSRYQKYTENVV 582 + + Q N ++ +++ ++ +++W + ++ Sbjct: 554 GTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGR-------------------DQML 594 Query: 583 SIGFSIPLDWRDSR--------SSVSFDMTKNKNTRTA-MTTLSGSAGEENDFTYSLYAN 633 ++ +IP +S S+ M+ + N R + + G+ E+N+ +YS+ Sbjct: 595 ALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTG 654 Query: 634 HDKYTALEQGRSHTMRWGANVQERTRSGTMRASYAHSGKDHQLGLGTAGTLALHSGGVTY 693 + G + A + R G Y+HS QL G +G + H+ GVT Sbjct: 655 YAGGGDGNSGST----GYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTL 710 Query: 694 GPYASDTFALVEAKGASGARINNAQGARIDVFGYGIAPSLAPYRYNTISIDGNSLDQDVE 753 G +DT LV+A GA A++ N G R D GY + P YR N +++D N+L +V+ Sbjct: 711 GQPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVD 770 Query: 754 LEGGNVRVVPVRGAVPKVAFNTLGGTPALIAVMMPDGSPVPMGAEVQDRDGKNIGMAGQN 813 L+ VVP RGA+ + F G L+ + + P+P GA V ++ G+ N Sbjct: 771 LDNAVANVVPTRGAIVRAEFKARVGIKLLM-TLTHNNKPLPFGAMVTSESSQSSGIVADN 829 Query: 814 GQVYARLPDASGTLFIRW--DSKQTCRVNYQMPARNKASGNNFIHLNGIC 861 GQVY +G + ++W + C NYQ+P ++ L+ C Sbjct: 830 GQVYLSGMPLAGKVQVKWGEEENAHCVANYQLPPESQQQL--LTQLSAEC 877
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 146 bits (371), Expect = 1e-41 Identities = 74/424 (17%), Positives = 172/424 (40%), Gaps = 50/424 (11%) Query: 27 PPFRWLIVAMVCALTIVLVGFCSLGSYTKRETAKGVLTPESGIMTITALTAGTVTALPVR 86 R + ++ L I + LG TA G LT I + V + V+ Sbjct: 55 RRPRLVAYFIMGFLVIAFI-LSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113 Query: 87 EGAGVKKGERIATVSSEISTARYGQTREA--IARQLEIQSQGLTQQL------------- 131 EG V+KG+ + +++ + A +T+ + AR + + Q L++ + Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173 Query: 132 ---------------TNLEQRNAEALKSLQERSSLLAQQTTELDTIYRQRQ---RQIALS 173 + ++++ + ++ L ++ E T+ + + Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233 Query: 174 QKQVDKMAAMRAEGYASNTQVEQQESDLLDAKVRLQDVARQRIEIRQQHAQTRQQLREQP 233 + ++D +++ + + V +QE+ ++A L+ Q +I + +++ + Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293 Query: 234 LTYFQ----QKNDLQQKLSDITQSMMENESRRS-VDLRAPEEGTVSAVLVK-PGQIVSAG 287 + + + +T + +NE R+ +RAP V + V G +V+ Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353 Query: 288 QTIAMLLPDNAHLQARILLSSRAIGFIHTGQRVVLRYESFPWQKFGQHSGAVSEISTSPL 347 +T+ +++P++ L+ L+ ++ IGFI+ GQ +++ E+FP+ ++G G V I+ + Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI 413 Query: 348 SPQEIAGITGNTQIQEPLYQVKVTLDSQSVQAYGKQIGLRPGSGLDADFIVDKRRIYEWV 407 Q ++ V ++++ + K I L G + A+ R + ++ Sbjct: 414 EDQR----------LGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYL 463 Query: 408 LEPL 411 L PL Sbjct: 464 LSPL 467
>FbpA_PF05833#Fibronectin-binding protein Length = 577 Score = 30.6 bits (69), Expect = 0.015 Identities = 15/124 (12%), Positives = 40/124 (32%), Gaps = 5/124 (4%) Query: 301 DFYDEFIAAAHHRITTLSAEEYLFSLKDRVLKELEFFQAKSSALQQEVETLTDSFAEQKD 360 + + + S L + L ++F KD Sbjct: 233 EVCKDLFKEIQSNKFEFNCYTKNNSFVGFYCLNLMSKEDYKKIQYDSSSKLLENFYYAKD 292 Query: 361 ENAILHNQLHEIEERVTEQEKNIRQLTDKNNNMHHEITIKQQEFNEIYQHHKNLISSLSW 420 ++ L ++ ++++ V + T+K+ E +I++ + L+++ + Sbjct: 293 KSDRLKSKSSDLQKIVMNNINRCTKKDKI-----LNNTLKKCEDKDIFKLYGELLTANIY 347 Query: 421 KLTK 424 L K Sbjct: 348 ALKK 351
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 29.0 bits (65), Expect = 0.031 Identities = 15/43 (34%), Positives = 23/43 (53%), Gaps = 5/43 (11%) Query: 5 NILIVG-AGFSGVVIARQLAEQGHKVKIIDQRDHIGGNSYDTR 46 L+ G AGF G ++++L E GH+V ID + + YD Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLN----DYYDVS 40
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 582 bits (1502), Expect = 0.0 Identities = 268/333 (80%), Positives = 303/333 (90%) Query: 1 MKFLVTGAAGFIGFHASQRLLEAGHEVVGIDNMNDYYDVNLKQSRLDLLQSPLFSFYKTD 60 MK+LVTGAAGFIGFH S+RLLEAGH+VVGIDN+NDYYDV+LKQ+RL+LL P F F+K D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 61 LADREGIAQIFATEKFDRVIHLAAQAGVRYSLENPHAYADANLIGYLNILEGCRHTKVQH 120 LADREG+ +FA+ F+RV + VRYSLENPHAYAD+NL G+LNILEGCRH K+QH Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGIPTTGLRFFT 180 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG+P TGLRFFT Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180 Query: 181 VYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIVEAIVRVQDVIPHADPE 240 VYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDI EAI+R+QDVIPHAD + Sbjct: 181 VYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQ 240 Query: 241 WTVENGSPATSSAPYRIYNIGNSSPVELMDYITALEEALGMEAEKNMMPIQPGDVLETSA 300 WTVE G+PA S APYR+YNIGNSSPVELMDYI ALE+ALG+EA+KNM+P+QPGDVLETSA Sbjct: 241 WTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLETSA 300 Query: 301 DTKPLYDLVGFKPQTSVKDGVKNFVDWYKAYYN 333 DTK LY+++GF P+T+VKDGVKNFV+WY+ +Y Sbjct: 301 DTKALYEVIGFTPETTVKDGVKNFVNWYRDFYK 333
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.7 bits (72), Expect = 0.012 Identities = 40/236 (16%), Positives = 80/236 (33%), Gaps = 40/236 (16%) Query: 176 GNLVNEGGLLIKVDSISADIGTDFSIRKVTKLKAISDLQDRFTAVDQGKDTGILVLSLYG 235 G V +G +L+K+ ++ A+ +D +++ Q + L Sbjct: 115 GESVRKGDVLLKLTALGAE----------------ADTLKTQSSLLQARLEQTRYQILS- 157 Query: 236 DNPDLIKRTINSISHNYLSQNIARQAAQDAKSLDFLNQQLPKVRNDLDIAEDKLNRYRRL 295 + +L K + QN++ + SL + +Q +N E L++ R Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL--IKEQFSTWQNQKYQKELNLDKKRA- 214 Query: 296 SDSVDLSLEAKSVLDQIVNVDNQLNELTFRESEISQLYTKEHPTYKALLEKRKTLQDERV 355 E +VL +I +N R + S L K+ A+LE+ + Sbjct: 215 --------ERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVN 266 Query: 356 KLNKKVSSMPETQQEILRLSRDVESGRAVF------------MQLLNRQQELNIAK 399 +L S + + + EIL + + +F + EL + Sbjct: 267 ELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNE 322
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 39.2 bits (91), Expect = 9e-06 Identities = 52/208 (25%), Positives = 85/208 (40%), Gaps = 24/208 (11%) Query: 2 KRWLMLFAAL-PLF-----AHAAA---ERIISLGGDVTEIVYALDAQQQLVAKDSTSTW- 51 +R L+ AL PL AHAAA RI++L E++ AL VA T + Sbjct: 9 RRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA--DTINYR 66 Query: 52 -----PAAARSLPDVGYIRQLNTEGILSLRPTIVLASAQAQPSL-VLQKVEDSRVRVVNI 105 P S+ DVG + N E + ++P+ ++ SA PS +L ++ R + Sbjct: 67 LWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSD 126 Query: 106 PGGNDLSAIDKKIVAVADALGKQAEGDALRHTVAAQIAQIPTQPVGKR----VLFILSHG 161 G L+ K + +AD L Q+ + I + + V + +L L Sbjct: 127 -GKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLID- 184 Query: 162 GMNTMVAGQKTAADGAIQAAGLQNAMQG 189 + +V G + + G+ NA QG Sbjct: 185 PRHMLVFGPNSLFQEILDEYGIPNAWQG 212
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.034 Identities = 45/218 (20%), Positives = 64/218 (29%), Gaps = 50/218 (22%) Query: 35 MTALIGPNGAGKSTLLRLLTG--------FLSPNGGE--RHVEGRSLEHWSSEALSRRRA 84 L G G GKSTL+ L G F G + + G S RR Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRAD 657 Query: 85 VMLQRTQLHADWPVETVIAMGRSPWGKNPDPQMIQQVMAETGCDH------LAGRRY-PS 137 + + + R +G+ Q V+ T RR+ P Sbjct: 658 AEAVKAFFSSR--KDRY----RGAYGRYVQDHPRQVVIWCTTNKRQYLFDITGNRRFWPV 711 Query: 138 LSGGEQQRVQLARCLAQLW------HDDAPRGWLFLDEPTSALDLYYQQHLLRLLKRLTR 191 L G V L + QL+ + R + P + + LRL++ Sbjct: 712 LVPGRANLVWLQKFRGQLFAEALHLYLAGERYFP---SPEDEEIYFRPEQELRLVE---- 764 Query: 192 SGQLHVCVVLHDLNLAALWADRILLLHQGKLVAEGTPQ 229 LWA LL +G AEG Q Sbjct: 765 -----------TGVQGRLWA---LLTREGAPAAEGAAQ 788
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 128 bits (323), Expect = 3e-38 Identities = 77/252 (30%), Positives = 122/252 (48%), Gaps = 9/252 (3%) Query: 7 LTGKTALVTGSARGLGFAYAEGLAAAGARVILNDIRETLLAESVDALASKGYSAHGAAFD 66 + GK A +TG+A+G+G A A LA+ GA + D L + V +L ++ A D Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 67 VTSEEAIEKAFAELDAQDIHVDILINNAGIQYRKPMVELELENWQKVIDTNLTSAFLVSR 126 V AI++ A ++ + +DIL+N AG+ + L E W+ N T F SR Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 127 TAAKRMIARRSGGKIVNIGSLTSQAARPTVAPYTAAKGGIKMLTCSMAAEWAQFNIQSNA 186 + +K M+ RR G IV +GS + R ++A Y ++K M T + E A++NI+ N Sbjct: 126 SVSKYMMDRR-SGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184 Query: 187 IGPGYILTDMNTALIED--------KQFDNWVKDSNPSQRWGRPEELIGTAVFLSSKASD 238 + PG TDM +L D K K P ++ +P ++ +FL S + Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244 Query: 239 YINGQIIYVDGG 250 +I + VDGG Sbjct: 245 HITMHNLCVDGG 256
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 33.7 bits (77), Expect = 0.001 Identities = 49/336 (14%), Positives = 115/336 (34%), Gaps = 38/336 (11%) Query: 39 AASGLAEDLNITPGISSLLGALFFLGYFFFQVPGGIYAEKRSAKKLIFWSLILWGILATA 98 + +A D N P ++ + F L + G +++ K+L+ + +I+ + Sbjct: 36 SLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVI 95 Query: 99 TGMVHD-VKVLAVIRFLLGVAESVVMPAMLVFLSHWFTKKERSKANTFLFLGNPITVLWM 157 + H +L + RF+ G + ++V ++ + K+ R KA + + Sbjct: 96 GFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVG 155 Query: 158 SVLSGYLVDAFGWRGMFIIEGVPAIIWAAIWWFIFVDRPADAKWLTQQEKNDIEAAL--A 215 + G + W + +I P I + + + + + + + L Sbjct: 156 PAIGGMIAHYIHWSYLLLI---PMITIITVPFLMKLLKKEVR---IKGHFDIKGIILMSV 209 Query: 216 AEQTNIKEIKNYSEAFRSGKVLSL----------------------------SFIHFFWN 247 + +YS +F VLS Sbjct: 210 GIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIF 269 Query: 248 IGMYGFIMWLPSILKSASGLSIVATGWLSAAPYLLAVPLM-LAASWYSDKFLNRKIIVLL 306 + GF+ +P ++K LS G + P ++V + D+ ++ + Sbjct: 270 GTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIG 329 Query: 307 FLGLGAVCFIASFTIGAANFWLSYVLLMIAGGAMYT 342 L ASF + +++++ +++ + GG +T Sbjct: 330 VTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFT 365
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 69.9 bits (171), Expect = 4e-15 Identities = 56/390 (14%), Positives = 128/390 (32%), Gaps = 52/390 (13%) Query: 16 FLDLINMFIASVAFPGMSRVLNASVSELAWVSNGYIAGLTLVVPFSAWLTQRCGARRLFM 75 F ++N + +V+ P ++ N + WV+ ++ ++ L+ + G +RL + Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 76 LSLTLFSAAALACGLSDSLGS-LIFWRALQGMGGGLLIPVGQSLTWQLFQPHERAKLSAA 134 + + ++ + S S LI R +QG G + + + R K Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 135 VMLVGLLAPACSPAAGGLLVETFSWRGVYFASLPVALITLALAGVWLKNDDGP------- 187 + + + PA GG++ W Y +P+ I + L + Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201 Query: 188 ----------------ARPSRFLHL-------------------PLLADPLLRFAMLIYL 212 L P + L + + Sbjct: 202 KGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIG 261 Query: 213 CVPGVFIGINVVGL-----FYLQTITGMSPSATGA-LMLPWSLASFVAISLTGRFFNRLG 266 + G I V G + ++ + +S + G+ ++ P +++ + + G +R G Sbjct: 262 VLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRG 321 Query: 267 PRPFILSGCLLQAAGILLLTGVTPQSSMASLVFAFILMGAGGSLCSSTAQSCAFLNIANG 326 P + G + L +++ + + + G S + + ++ Sbjct: 322 PLYVLNIGVTFLSVSFLTA-SFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQ 380 Query: 327 DMPDASALWNINRQLSFLAGAALLSALLAV 356 + +L N LS G A++ LL++ Sbjct: 381 EAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 132 bits (332), Expect = 1e-39 Identities = 78/258 (30%), Positives = 124/258 (48%), Gaps = 10/258 (3%) Query: 2 AIENKVALVTGAGQGIGRGIALRLAKEGASLMLVDVNPEGIAAVAAEVEALGRKAATFVA 61 IE K+A +TGA QGIG +A LA +GA + VD NPE + V + ++A R A F A Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64 Query: 62 NIAERDQVYAAIDRAEKELGGFDIIVNNAGIAQVQALADVTPEEVDRIMRINVQGTLWGI 121 ++ + + R E+E+G DI+VN AG+ + + ++ EE + +N G Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124 Query: 122 QAAAKKFIDRKQNGKIINACSIAGHDGFALLGIYSATKFAVRALTQAAAKEYASRGITVN 181 ++ +K +DR+ G I+ S + Y+++K A T+ E A I N Sbjct: 125 RSVSKYMMDRRS-GSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183 Query: 182 AYCPGIVGTGM----WTEIDKRFAEITGAPVGETYKKYVEGIALGRAETPDDVASLVSYL 237 PG T M W + + I G+ ET+K GI L + P D+A V +L Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGS--LETFKT---GIPLKKLAKPSDIADAVLFL 238 Query: 238 AGPDSDYITGQSILIDGG 255 + +IT ++ +DGG Sbjct: 239 VSGQAGHITMHNLCVDGG 256
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 59.3 bits (143), Expect = 5e-13 Identities = 32/159 (20%), Positives = 68/159 (42%), Gaps = 10/159 (6%) Query: 11 REQKKRLTRQQLSNTATELFIKQGFDNVTVSDIAAAAKVSKMTVFNYFPRKEELYFDRID 70 +Q+ + TRQ + + A LF +QG + ++ +IA AA V++ ++ +F K +L+ + + Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64 Query: 71 EIHQLLQDALER---HRTLAPVAVFRALTRELIEQ------EHPLIRIDRRVADFWQGVA 121 + + P++V R + ++E L+ I +F +A Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMA 124 Query: 122 ASPAL-RVYALEQFAELTQALSNMLAARLSAPEHNPVAA 159 R LE + + Q L + + A++ + A Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRA 163
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 329 bits (845), Expect = e-113 Identities = 189/384 (49%), Positives = 250/384 (65%) Query: 2 NRTLSVIFMTIWLDAVGIGLIFPILPQLLKEVMHTADIAHYMGILAALYALMQFIFAPLL 61 NR L VI T+ LDAVGIGLI P+LP LL++++H+ D+ + GIL ALYALMQF AP+L Sbjct: 4 NRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVL 63 Query: 62 GALSDNFGRRPVLLVSLAGAVVNYLIMAFASHLWLLLLGRAIAGLTSANIAVAMAWLTDI 121 GALSD FGRRPVLLVSLAGA V+Y IMA A LW+L +GR +AG+T A AVA A++ DI Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADI 123 Query: 122 TPAGKRASRFGLFNATFGAGFIIGPVLGGVLGDYGVRLPFFVAAVLNSANVLLALFALQE 181 T +RA FG +A FG G + GPVLGG++G + PFF AA LN N L F L E Sbjct: 124 TDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPE 183 Query: 182 SRQPARQKMALAGLNPLRPMRWLFSVKGLLTIALVFFFLSATGEVYGTCWALWGQDMFHW 241 S + R+ + LNPL RW + + + VFF + G+V W ++G+D FHW Sbjct: 184 SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHW 243 Query: 242 NGLWIGLSLAAFGVCQTLAQAFLPGPATRLLGERGAILGGIACSCTALIVLSLAQQSWIV 301 + IG+SLAAFG+ +LAQA + GP LGER A++ G+ T I+L+ A + W+ Sbjct: 244 DATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMA 303 Query: 302 FAVMPLIALGGIGTPALQALATRQVQADNQGQLQGVLASTVSLASIVAPLVFSSLYFGFR 361 F +M L+A GGIG PALQA+ +RQV + QGQLQG LA+ SL SIV PL+F+++Y Sbjct: 304 FPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASI 363 Query: 362 TWWPGAIWLSVVVLYVFAVPLIYR 385 T W G W++ LY+ +P + R Sbjct: 364 TTWNGWAWIAGAALYLLCLPALRR 387
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 32.5 bits (74), Expect = 0.004 Identities = 34/188 (18%), Positives = 72/188 (38%), Gaps = 12/188 (6%) Query: 33 SFYGIRPLLILFMAATVYDGGMGLARENASAIVGIFAGSMYLAALPGGWLADNWLGQQKA 92 SF+ + ++L ++ + + + F + + G L+D LG ++ Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ-LGIKRL 81 Query: 93 VWYGSILIALGHLSIALSAVLGDNLFFIGLMFIVL---GSGLFKTCISVMVGTLYKKGDA 149 + +G I+ G ++ +G + F + +M + G+ F + V+V K Sbjct: 82 LLFGIIINCFG----SVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK--E 135 Query: 150 RRDGGFSLFYMGINMGSFIAPLISGWLIKSHGWHWGFGIGGIGMLVALVIFRVFAVPAMK 209 R F L + MG + P I G + +H HW + + + + V F + + Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGG--MIAHYIHWSYLLLIPMITIITVPFLMKLLKKEV 193 Query: 210 RYDSEVGL 217 R + Sbjct: 194 RIKGHFDI 201
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.7 bits (72), Expect = 0.002 Identities = 17/82 (20%), Positives = 33/82 (40%), Gaps = 12/82 (14%) Query: 149 AAGAGKVVYVGNQLRGYGNLIMIKHGEDYISAYAHNDKMMVNNGQNVKIGQQIATMGSSD 208 A GK+ + G IK E+ I +++V G++V+ G + + + Sbjct: 84 ATANGKLTHSGRSK-------EIKPIENSIV-----KEIIVKEGESVRKGDVLLKLTALG 131 Query: 209 ADSVRLHFQIRYRATAIDPLRY 230 A++ L Q ++ RY Sbjct: 132 AEADTLKTQSSLLQARLEQTRY 153
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 99.4 bits (247), Expect = 3e-27 Identities = 76/256 (29%), Positives = 121/256 (47%), Gaps = 20/256 (7%) Query: 5 LTGKRMVITGAARGLGFHFAKACAEQGADVVMCDILQGELAESAHRLREQGYRIESQTID 64 + GK ITGAA+G+G A+ A QGA + D +L + L+ + E+ D Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 65 LADASSIEEAFLAI-GERGKIDGLVNNAAMATGVGGKNMIDYDPDLWDRVMSVNVKGTWL 123 + D+++I+E I E G ID LVN A + ++ D + W+ SVN G + Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEE---WEATFSVNSTGVFN 122 Query: 124 VTRAAVPLL--REGAGIVNVASDTALWGAPR--LMAYVASKGAVIAMTRSMARELGEKRI 179 +R+ + R IV V S+ A G PR + AY +SK A + T+ + EL E I Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPA--GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180 Query: 180 RINAIAPGLTRVE----------ATEYVPAERHQLYENGRALSGAQQPEDVTGSVVWLLS 229 R N ++PG T + E V + ++ G L +P D+ +V++L+S Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240 Query: 230 DLSRFITGQLIPVNGG 245 + IT + V+GG Sbjct: 241 GQAGHITMHNLCVDGG 256
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 117 bits (294), Expect = 6e-35 Identities = 61/140 (43%), Positives = 87/140 (62%), Gaps = 2/140 (1%) Query: 6 LLVYSGLSLALCWYDLRYGLLPDRLTCPLLWSGLLYYLYYAPLRLEYAVGGAIGGYLAFT 65 L+ + + +AL + DL LLPD+LT PLLW GLL+ L + L AV GA+ GYL Sbjct: 137 ALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGYLVLW 196 Query: 66 LLYWLYRGIRGYEGLGYGDVKFLAALGAWHGWPMLPQLVFLATVFAGGVMLLVIILGKAP 125 LYW ++ + G EG+GYGD K LAALGAW GW LP ++ L+++ + + +I+L Sbjct: 197 SLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILLRNH- 255 Query: 126 GSLNNPLPFGPFLAAAGFWC 145 + P+PFGP+LA AG+ Sbjct: 256 -HQSKPIPFGPYLAIAGWIA 274
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 37.2 bits (86), Expect = 7e-06 Identities = 19/103 (18%), Positives = 43/103 (41%), Gaps = 10/103 (9%) Query: 44 EYHESIDEMKHADKYIERILFLEGIPN--LQDLGKL------GIGEDVEEMLQSDLRLEL 95 E ++ E D ER+L + G P +++ + G EM+Q+ + Sbjct: 52 ELYDHAAE--TVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYK 109 Query: 96 EGAKDLREAIAYADSVHDYVSRDMLIEILTDEEGHIDWLETEL 138 + + + + I A+ D + D+ + ++ + E + L + L Sbjct: 110 QISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWMLSSYL 152
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 79.5 bits (196), Expect = 4e-18 Identities = 53/154 (34%), Positives = 78/154 (50%), Gaps = 13/154 (8%) Query: 13 VNVGTIGHVDHGKTTLTAAI------TTVLAKTYGGSARAFDQIDNAPEEKARGITINTS 66 +N+G + HVD GKTTLT ++ T L G+ R DN E+ RGITI T Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRT----DNTLLERQRGITIQTG 59 Query: 67 HVEYDTPTRHYAHVDCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQV 126 + +D PGH D++ + + +DGAIL+++A DG QTR R++ Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119 Query: 127 GVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQ 160 G+P I F+NK D + L V +++E LS Sbjct: 120 GIP-TIFFINKIDQNGID--LSTVYQDIKEKLSA 150
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 611 bits (1577), Expect = 0.0 Identities = 176/698 (25%), Positives = 303/698 (43%), Gaps = 81/698 (11%) Query: 9 RYRNIGISAHIDAGKTTTTERILFYTGVNHKIGEVHDGAATMDWMEQEQERGITITSAAT 68 + NIG+ AH+DAGKTT TE +L+ +G ++G V G D E++RGITI + T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 69 TAFWSGMAKQYEPHRVNIIDTPGHVDFTIEVERSMRVLDGAVMVYCAVGGVQPQSETVWR 128 + W +VNIIDTPGH+DF EV RS+ VLDGA+++ A GVQ Q+ ++ Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114 Query: 129 QANKYKVPRIAFVNKMDRMGANFLKVVGQIKTRLGANPVPLQLAIGAEEGFTGVVDLVKM 188 K +P I F+NK+D+ G + V IK +L A V Q Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ------------------ 156 Query: 189 KAINWNEEDAGVTFTYEDIPADMQDLAEEWHQNLIESAAEASEELMEKYLGGEELTEEEI 248 V + + +E+W ++ E +++L+EKY+ G+ L E+ Sbjct: 157 ----------KVELYPNMCVTNFTE-SEQW-----DTVIEGNDDLLEKYMSGKSLEALEL 200 Query: 249 KKALRQRVLNNEIILVTCGSAFKNKGVQAMLDAVIDYLPSPVDVPAINGILDDGKDTPAE 308 ++ R N + V GSA N G+ +++ + + S Sbjct: 201 EQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH----------------- 243 Query: 309 RHASDEEPFAALAFKIATDPFVGNLTFFRVYSGVVNSGDTILNSVKSARERFGRIVQMHA 368 + FKI L + R+YSGV++ D++ S K + + + Sbjct: 244 ---RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKE-KIKITEMYTSIN 299 Query: 369 NKREEIKEVRAGDIAAAIG----LKDVTTGDTLCNPDHPIILERMEFPEPVISIAVEPKT 424 + +I + +G+I L V GDT P ER+E P P++ VEP Sbjct: 300 GELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQR----ERIENPLPLLQTTVEPSK 354 Query: 425 KADQEKMGLALGRLAKEDPSFRVWTDEESNQTIIAGMGELHLDIIVDRMKREFNVEANVG 484 +E + AL ++ DP R + D +++ I++ +G++ +++ ++ +++VE + Sbjct: 355 PQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIK 414 Query: 485 KPQVAYREAIRAKVTDIEGKHAKQSGGRGQYGHVVIDMYPLEPGSNPKGYEFINDIKGGV 544 +P V Y E K E + + + + + PL GS G ++ + + G Sbjct: 415 EPTVIYMERPLKKA---EYTIHIEVPPNPFWASIGLSVSPLPLGS---GMQYESSVSLGY 468 Query: 545 IPGEYIPAVDKGIQEQLKAGPLAGYPVVDLGVRLHFGSYHDVDSSELAFKLAASIAFKDG 604 + + AV +GI+ + G L G+ V D + +G Y+ S+ F++ A I + Sbjct: 469 LNQSFQNAVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQV 527 Query: 605 FKKAKPVLLEPIMKVEVETPEENTGDVIGDLSRRRGMLRGQESNVTGVVIHAEVPLSEMF 664 KKA LLEP + ++ P+E D + + + V++ E+P + Sbjct: 528 LKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQ 587 Query: 665 GYATQLRSLTKGRASYSMEFLKYDDAPNNVAQAVIEAR 702 Y + L T GR+ E Y + V + R Sbjct: 588 EYRSDLTFFTNGRSVCLTELKGYHVT---TGEPVCQPR 622
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 32.6 bits (74), Expect = 4e-04 Identities = 16/61 (26%), Positives = 25/61 (40%), Gaps = 9/61 (14%) Query: 84 MSVDAAFHGRGVGSALMREMINLCDNWLRIER----IELTVFADNSSALALYRKFGFEIE 139 ++V + +GVG+AL+ + W + E + L N SA Y K F I Sbjct: 95 IAVAKDYRKKGVGTALL----HKAIEWAK-ENHFCGLMLETQDINISACHFYAKHHFIIG 149 Query: 140 G 140 Sbjct: 150 A 150
>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature. Length = 293 Score = 33.9 bits (77), Expect = 0.001 Identities = 26/81 (32%), Positives = 36/81 (44%), Gaps = 15/81 (18%) Query: 275 RTPISGEYRGYQVFSMPPPSSGGIHIVQILNI--LENFDMHKYGFGSADAMQVMAEAEKH 332 R P+ GE R + SMPPP G H +I N+ + FD + G A +++E EK Sbjct: 77 RRPM-GETRNVSLISMPPPWRPGEHADRITNLKFFKQFDGY---VGGQTAWGILSELEKG 132 Query: 333 AYADRSEYLGDPDFVKVPWQA 353 Y P F WQ+ Sbjct: 133 RY---------PTFSYQDWQS 144
>PF06580#Sensor histidine kinase Length = 349 Score = 26.0 bits (57), Expect = 0.031 Identities = 10/45 (22%), Positives = 18/45 (40%), Gaps = 7/45 (15%) Query: 30 GYVIPSQQRMQNQMQVQQQQHQSMLKQDMQSQTRAQQQHLQTQLN 74 Y + Q ++ Q + SM ++ AQ L+ Q+N Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQE-------AQLMALKAQIN 171
>PF04619#Dr-family adhesin Length = 160 Score = 28.7 bits (64), Expect = 0.012 Identities = 13/60 (21%), Positives = 24/60 (40%), Gaps = 4/60 (6%) Query: 29 VGARFGHTMIEFDAKLSKDGQIFLLHDDHLERTSNGWGVAGELAWSE----LLKADAGSW 84 +G ++ D + G+ FL+ D++ ++ AW+ K D GSW Sbjct: 70 LGCDARQVALKADTDNFEQGKFFLISDNNRDKLYVNIRPTDNSAWTTDNGVFYKNDVGSW 129
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.040 Identities = 10/29 (34%), Positives = 16/29 (55%) Query: 33 IVMVGPSGCGKSTLLRMVAGLERVTSGDI 61 +V+ G G GKSTL+ + GL+ + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHF 627
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 38.9 bits (90), Expect = 3e-05 Identities = 44/176 (25%), Positives = 73/176 (41%), Gaps = 17/176 (9%) Query: 133 SGHLLSQPFNSSTPVLYYNKDAFKKAGLDPEQPPKTWQELAAYTDKLKAAGMKCGYASGW 192 +G L++ P L YNKD L P PPKTW+E+ A +LKA G + Sbjct: 126 NGKLIAYPIAVEALSLIYNKD------LLP-NPPKTWEEIPALDKELKAKGKSALMFNLQ 178 Query: 193 QGWIQIENFSAWHGLPVATKNNGFDGTDAVLEF--NKPEQVKHIAMLEAMNKKGDFSYFG 250 + + +A G +N +D D ++ K + +++ + D Y Sbjct: 179 EPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDY-- 236 Query: 251 RKDESTEKFYNGDCAITTASSGSLADIRQYAKFNYGVGMMPYDADVKGAPQNAIIG 306 + F G+ A+T + ++I +K NYGV ++P KG P +G Sbjct: 237 --SIAEAAFNKGETAMTINGPWAWSNIDT-SKVNYGVTVLP---TFKGQPSKPFVG 286
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 49.5 bits (118), Expect = 1e-08 Identities = 36/174 (20%), Positives = 75/174 (43%), Gaps = 3/174 (1%) Query: 21 RIFIVMLILGAINYIDRTSLSIAMPYITDEFGITDTRVVGVIHSAFFWAYALMQIPSGVI 80 +I I + IL + ++ L++++P I ++F V +AF +++ G + Sbjct: 14 QILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVN-TAFMLTFSIGTAVYGKL 72 Query: 81 ADKFKARNIIAIATILWGAFQAIAALCHSIFTLSI-SRIGLGIAESPIMPAGAKLMGTWL 139 +D+ + ++ I+ I + HS F+L I +R G + ++ ++ Sbjct: 73 SDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYI 132 Query: 140 TPTERGRGSMLLDGGAPLGTALGAVIIAGLIAQFDSWRMAFIIAGVGTIAIGFL 193 RG+ L+ +G +G I G+IA + W +I + I + FL Sbjct: 133 PKENRGKAFGLIGSIVAMGEGVGP-AIGGMIAHYIHWSYLLLIPMITIITVPFL 185
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 34.9 bits (80), Expect = 7e-04 Identities = 27/168 (16%), Positives = 64/168 (38%), Gaps = 16/168 (9%) Query: 49 FNIAQNDMISTYGLSMTQLGMIGLGFSITYGVGKTLVSYYADGKNTKQFLPFMLILSAIC 108 N++ D+ + + + F +T+ +G + +D K+ L F +I++ Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN--- 89 Query: 109 MLGFSASMGAGSTSLFLMIAFYALSGFFQSTGGSCSYSTI----TKWTPRRKRGTFLGFW 164 F + +G S F ++ + F Q G + + + ++ P+ RG G Sbjct: 90 --CFGSVIGFVGHSFFSLLIM---ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144 Query: 165 NISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRF 212 +G + A+Y+ + + + P +I +I ++ Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSY---LLLIP-MITIITVPFLMKL 188
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 40.2 bits (94), Expect = 1e-05 Identities = 67/407 (16%), Positives = 136/407 (33%), Gaps = 58/407 (14%) Query: 29 RHILLTIWLGYALFY--FTRKSFNAAVPEILASNVLTRSDIGLLATLFYITYGLSKFFSG 86 RH + IWL F+ N ++P+I + + T F +T+ + G Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 87 IVSDRSNARYFMGLGLIATGVVNILFGFSTSLWAFALLWALNAFFQGWGS---PVCARLL 143 +SD+ + + G+I +++ S F L + F QG G+ P ++ Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHS---FFSLLIMARFIQGAGAAAFPALVMVV 127 Query: 144 TAWY-SRTERGGWWALWNTAHNVGGALIPMVVGAAALHYGWRTGMMIAGMLAILAGLFLC 202 A Y + RG + L + +G + P + G A + W + +++ M+ I+ FL Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW-SYLLLIPMITIITVPFL- 185 Query: 203 WRLRDRPQVVGLPAVGDWRHDELEIAQQQEGAGLTRKEILTKYVLLNPYIWLLSLCYVLV 262 + L +I G L I+ + Y + VL Sbjct: 186 ---------MKLLKKEVRIKGHFDIK----GIILMSVGIVFFMLFTTSYSISFLIVSVLS 232 Query: 263 YVV-----RAAINDWGNLYMSETLGVDLVTANSAVTMFELGGFI-----------GALVA 306 +++ R + + + + + + + + + GF+ A Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292 Query: 307 GWGSDKLFNGNRGPMNLIFAAGILLSVGGLWLMPFASYVMQAACFFTTGFFVFGPQMLI- 365 GS +F G + + GIL+ G + + F T F + + Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMT 352 Query: 366 --------GMAAAECS---------HKEAAGAATGFVGLFAYLGASL 395 G++ + ++ AGA + ++L Sbjct: 353 IIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 69.1 bits (169), Expect = 4e-16 Identities = 31/174 (17%), Positives = 61/174 (35%), Gaps = 20/174 (11%) Query: 2 TTIALIDDHLIVRSGFAQLLGLEADFQVVAEFGSAREALAGLPGRGVQVCICDISMPDMS 61 TI + DD +R+ Q L + V +A + + + D+ MPD + Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 62 GLELLSQLPK---GMATIMLSVHDSPALVEQALNAGARGFLSKRCSPDELIAAVRTVAAG 118 +LL ++ K + +++S ++ +A GA +L K ELI + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---- 117 Query: 119 GCYLTPDIAMKLAAGRQDPLTRRERQVAEKLAQG---MAVKEIAAELALSPKTV 169 A+ R L + + + + + A L + T+ Sbjct: 118 --------ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 818 bits (2115), Expect = 0.0 Identities = 481/549 (87%), Positives = 509/549 (92%), Gaps = 2/549 (0%) Query: 1 MDSQRNLLIIALLFVSFMIWQAWEQDNNPQAQ-QQTTQTTTTAAGSAADQGVPASGQGKL 59 MDSQRNLL+IALLFVSFMIWQAWEQD NPQ Q QQTTQTTTTAAGSAADQGVPASGQGKL Sbjct: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKL 60 Query: 60 ITVKTDVLELTINTRGGDVEQALLPAYPKALKSTEPFQLLETTPQFIYQAQSGLTGRDGP 119 I+VKTDVL+LTINTRGGDVEQALLPAYPK L ST+PFQLLET+PQFIYQAQSGLTGRDGP Sbjct: 61 ISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGP 120 Query: 120 DNPANGARPLYNVDKDAFVLAEGQNEIVIPLTYTDKAGNVFTKTFTLKRGDYAVNVGYNV 179 DNPANG RPLYNV+KDA+VLAEGQNE+ +P+TYTD AGN FTKTF LKRGDYAVNV YNV Sbjct: 121 DNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNTFTKTFVLKRGDYAVNVNYNV 180 Query: 180 QNVGEKPLEISTFGQLKQTANLPTSRDTQTGGLSTMHTFRGAAYSTSESKYEKYKFDTIV 239 QN GEKPLEIS+FGQLKQ+ LP DT + + +HTFRGAAYST + KYEKYKFDTI Sbjct: 181 QNAGEKPLEISSFGQLKQSITLPPHLDTGSSNFA-LHTFRGAAYSTPDEKYEKYKFDTIA 239 Query: 240 DNENLNVSTKDGWVAMLQQYFTTAWVPHNAGTNSFYTANLGNGVVAIGYKSQPVLVQPGQ 299 DNENLN+S+K GWVAMLQQYF TAW+PHN GTN+FYTANLGNG+ AIGYKSQPVLVQPGQ Sbjct: 240 DNENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQ 299 Query: 300 TDKLESILWVGPAIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKFIHSFLGNWGFSII 359 T + S LWVGP IQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLK+IHSF+GNWGFSII Sbjct: 300 TGAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSII 359 Query: 360 VITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRQSQEMMALYKAEKVNP 419 +ITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQR SQEMMALYKAEKVNP Sbjct: 360 IITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRISQEMMALYKAEKVNP 419 Query: 420 LGGCFPLIIQMPIFLALYYMLSASVELRHAPFILWIHDLSAQDPYYILPIIMGATMFFIQ 479 LGGCFPL+IQMPIFLALYYML SVELR APF LWIHDLSAQDPYYILPI+MG TMFFIQ Sbjct: 420 LGGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQ 479 Query: 480 KMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVVYYIVSNLVTIIQQQLIYRGLEKRG 539 KMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLV+YYIVSNLVTIIQQQLIYRGLEKRG Sbjct: 480 KMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRG 539 Query: 540 LHSREKKKS 548 LHSREKKKS Sbjct: 540 LHSREKKKS 548
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 39.2 bits (91), Expect = 1e-06 Identities = 15/56 (26%), Positives = 26/56 (46%), Gaps = 3/56 (5%) Query: 69 IVDVAVDPAHQGKGLGRVIMEKIVSW-LDANACQGAYVTLVADVPG--LYAKFGFT 121 I D+AV ++ KG+G ++ K + W + + C T ++ YAK F Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 58.7 bits (142), Expect = 1e-11 Identities = 68/311 (21%), Positives = 115/311 (36%), Gaps = 14/311 (4%) Query: 5 LICSFALVLLYPSGIDMYLVGLPRIAQDLGASEAQLHIAFSVYLAGMAAAML----FAGR 60 LI + V L GI + + LP + +DL S + + + LA A G Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSN-DVTAHYGILLALYALMQFACAPVLGA 65 Query: 61 AADKSGRKPVAIFGSTTFILASLLCAQAQTSDLFLVGRFIQGIGAGSCYVVAFAVLRDTL 120 +D+ GR+PV + + + A A + +GR + GI G+ VA A + D Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADIT 124 Query: 121 DDRRRARVLSLLNGITCIIPVLAPVLGHLIMLKYPWQSLFYTMTGMGVMVGLLSVFMLRE 180 D RAR ++ V PVLG L M + + F+ + + L F+L E Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGL-MGGFSPHAPFFAAAALNGLNFLTGCFLLPE 183 Query: 181 TRPAVPSHTLAGQKNATESLLNRFFLSRIVITTLSVTAILTYVNVSPVLMMEEMGFDRGQ 240 + N S + +V ++V I+ V P + G DR Sbjct: 184 SHKGERRPLRREALNPLASFRWARGM-TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFH 242 Query: 241 YSMAMALMALV------SMTVSFSTPFALTLFKPRTLMLTSQVMFFTAGTLLSLATSQTV 294 + ++L S+ + T R ++ + T LL+ AT + Sbjct: 243 WDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWM 302 Query: 295 TMLGLALICAG 305 + L+ +G Sbjct: 303 AFPIMVLLASG 313
>PF05946#Toxin-coregulated pilus subunit TcpA Length = 199 Score = 28.7 bits (64), Expect = 0.013 Identities = 15/54 (27%), Positives = 20/54 (37%) Query: 130 WINIQHDPDEARAQLVALRRSVLKLTAEAPVRLQLLPGAGRLRTTNMQPVEALC 183 +I I+ A A L S V + P + L TN+ VE LC Sbjct: 133 YIAIKAGGAVALADLGDFENSAAAAETGVGVIKSIAPASKNLDLTNITHVEKLC 186
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 30.7 bits (69), Expect = 0.018 Identities = 12/66 (18%), Positives = 29/66 (43%), Gaps = 8/66 (12%) Query: 69 LAKETDLAGAIKSMFSGEKINR-------TEDRAVLHVALRNRSNTPIIVDGKDVMPEVN 121 AK +DL + + S + + D+ ++ + ++N I+ DVM ++ Sbjct: 276 YAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNII-IKAHGQTNALIVTAAPDVMNDLE 334 Query: 122 AVLEKM 127 V+ ++ Sbjct: 335 RVIAQL 340
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 28.0 bits (62), Expect = 0.015 Identities = 11/47 (23%), Positives = 13/47 (27%) Query: 98 PTETGHGYGEQIFSAVEARAKAAGDDWLWLEVLASNPGARRFYERSG 144 G G + AK L LE N A FY + Sbjct: 99 KDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHH 145
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 747 bits (1930), Expect = 0.0 Identities = 374/396 (94%), Positives = 390/396 (98%) Query: 1 MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK 60 MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK Sbjct: 1 MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK 60 Query: 61 VSIEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEISPDKAFQDKLYPFTW 120 V++EHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEI+PDKAFQDKLYPFTW Sbjct: 61 VTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTW 120 Query: 121 DAVRYNGKLIAYPIAVEALSLIYNKDLVPNPPKTWEEIPALDKELKAKGKSALMFNLQEP 180 DAVRYNGKLIAYPIAVEALSLIYNKDL+PNPPKTWEEIPALDKELKAKGKSALMFNLQEP Sbjct: 121 DAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEP 180 Query: 181 YFTWPLIAADGGYAFKFENGKYDVKNVGVDSAGAKAGLTFLVDLIKNKHMNADTDYSIAE 240 YFTWPLIAADGGYAFK+ENGKYD+K+VGVD+AGAKAGLTFLVDLIKNKHMNADTDYSIAE Sbjct: 181 YFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAE 240 Query: 241 AAFNKGETAMTINGPWAWSNIDKSKVNYGVTLLPTFKGKASKPFVGVLSAGINAASPNKE 300 AAFNKGETAMTINGPWAWSNID SKVNYGVT+LPTFKG+ SKPFVGVLSAGINAASPNKE Sbjct: 241 AAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKE 300 Query: 301 LAKEFLENYLLTDQGLDEVNKDKPLGAVALKSFQEKLEKDPRIAATMANAQKGEIMPNIP 360 LAKEFLENYLLTD+GL+ VNKDKPLGAVALKS++E+L KDPRIAATM NAQKGEIMPNIP Sbjct: 301 LAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIP 360 Query: 361 QMSAFWYAVRTAVINAASGRQTVDAALKDAQSRITK 396 QMSAFWYAVRTAVINAASGRQTVD ALKDAQ+RITK Sbjct: 361 QMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK 396
>PF05272#Virulence-associated E family protein Length = 892 Score = 33.5 bits (76), Expect = 0.001 Identities = 13/35 (37%), Positives = 18/35 (51%) Query: 32 VVFVGPSGCGKSTLLRMIAGLETITSGDLFIGDAR 66 VV G G GKSTL+ + GL+ + IG + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGK 633
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 91.8 bits (228), Expect = 1e-23 Identities = 36/120 (30%), Positives = 63/120 (52%), Gaps = 1/120 (0%) Query: 2 KPVILVVDDDSAIGELLSDVLSAHVFDVLLCQTGSDALAVAAQRTDISLVLLDMILPDTN 61 ILV DDD+AI +L+ LS +DV + + A LV+ D+++PD N Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDEN 61 Query: 62 GLLVLQQLQRSRPDLPVVMLTGLGSESDVVVGLEMGADDYIAKPFNSRVVVARVKAVLRR 121 +L +++++RPDLPV++++ + + E GA DY+ KPF+ ++ + L Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 29.4 bits (66), Expect = 0.018 Identities = 16/65 (24%), Positives = 25/65 (38%), Gaps = 5/65 (7%) Query: 55 KLAGDNVKVTLVSSGYDLGQQVAQIDNFIAAKVDMIIL---NAADSKGIGPAVKRAKDAG 111 L +KV + I I KVD+I + D + AVK+A + Sbjct: 111 DLLI--IKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSLGGPEDVPELHEAVKKAVASQ 168 Query: 112 IVVVA 116 I+V+ Sbjct: 169 ILVMC 173
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 28.3 bits (63), Expect = 0.047 Identities = 10/40 (25%), Positives = 19/40 (47%), Gaps = 2/40 (5%) Query: 228 FVYGMSGLLSGLGGVMSASRLYSANGNLGVGYELDAIAAV 267 ++G+ + GG A R + N G+G + A+ A+ Sbjct: 400 LLHGLPAGWTIYGGTQLADRYRAF--NFGIGKNMGALGAL 437
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 53.3 bits (128), Expect = 3e-09 Identities = 25/113 (22%), Positives = 54/113 (47%), Gaps = 4/113 (3%) Query: 741 LVLEDEADVRQTLCEQLHQLGWLTLEAGNGEQALQLLEASTEIALLISDLMLPGRLSGAE 800 LV +D+A +R L + L + G+ N + + A L+++D+++P + + Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDE-NAFD 64 Query: 801 VINETQRRFPTVAVLLISGQDLRPAHNPALPD--VELLRKPFSRTQLMQALRQ 851 ++ ++ P + VL++S Q+ A + L KPF T+L+ + + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 25.9 bits (57), Expect = 0.034 Identities = 8/48 (16%), Positives = 23/48 (47%) Query: 62 RQQSTDRQRQYDDRQRQIEDRRRQLDDRQRQLDQDRRQLENDQRRLDD 109 ++Q + Q Q ++ ++ +R + ++++ ++ RLDD Sbjct: 192 KEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDD 239
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 32.6 bits (74), Expect = 3e-04 Identities = 20/84 (23%), Positives = 37/84 (44%), Gaps = 5/84 (5%) Query: 51 LAQVEGETVGLIGLQLQFPLNFNAWIGEVQELVVLPQRRGLHVGQALLAWAEQEAREHGA 110 L +E +G I ++ N+N + ++++ V R VG ALL A + A+E+ Sbjct: 69 LYYLENNCIGRIKIRS----NWNGYA-LIEDIAVAKDYRKKGVGTALLHKAIEWAKENHF 123 Query: 111 QMVELSSGKARPDAHRFYLREGYT 134 + L + A FY + + Sbjct: 124 CGLMLETQDINISACHFYAKHHFI 147
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 31.2 bits (71), Expect = 0.008 Identities = 19/47 (40%), Positives = 25/47 (53%), Gaps = 8/47 (17%) Query: 315 ADDHDNAFSLPQAIRLVSK---NPAQALGLDDR-GVIAEGKRADMVL 357 D+DN + R ++K NPA A GL G + GKRAD+VL Sbjct: 394 TGDNDNF----RVKRYIAKYTINPAIAHGLSHEIGSLEVGKRADLVL 436
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.029 Identities = 14/42 (33%), Positives = 19/42 (45%) Query: 36 CVVLHGHSGSGKSTLLRSLYANYLPDTGHIHIRHGDEWVDLV 77 VVL G G GKSTL+ +L H I G + + + Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQI 639
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 80.1 bits (197), Expect = 1e-19 Identities = 56/211 (26%), Positives = 83/211 (39%), Gaps = 20/211 (9%) Query: 17 IKGQDLSDKRAVVTGAASGIGLETARALAGAGAEVTLAVRNIEAGKQAAEDIIRSTGNKN 76 + + + K A +TGAA GIG AR LA GA + N E K ++ Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPE--KLEKVVSSLKAEARH 58 Query: 77 IHVAYLDLTDRNSIAEFTSSWS---GALHILINNAGVMAMPETR--TREGWEAQFATNYL 131 D+ D +I E T+ G + IL+N AGV+ + E WEA F+ N Sbjct: 59 AEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNST 118 Query: 132 GHFALTKGLYGALKKAGGARVVVVSSSGHHFSPVVFDDIHYHIRPYDPWTAYGQSKTAMV 191 G F ++ + + +V V S+ P AY SK A V Sbjct: 119 GVFNASRSVSKYMMDRRSGSIVTVGSNPAG-------------VPRTSMAAYASSKAAAV 165 Query: 192 LFAVEASRRWKEEGIFVNAVMPGAIKSNLQR 222 +F E I N V PG+ ++++Q Sbjct: 166 MFTKCLGLELAEYNIRCNIVSPGSTETDMQW 196
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 48.9 bits (116), Expect = 1e-09 Identities = 28/146 (19%), Positives = 58/146 (39%), Gaps = 10/146 (6%) Query: 1 MARPLSPQKQ---AALLEAAVAIVAQSGLSATTAS-IAKRAEVAVGTLFTYFPTKEQLLN 56 MAR + Q +L+ A+ + +Q G+S+T+ IAK A V G ++ +F K L + Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 57 EVYLMLKQDMSDLLVSGYPKD-ADFHTQIMHIWRAYTEWSLKNPDGKQVLRLLTVSDLLT 115 E++ + + ++ +L + K D + + I E ++ R L + + Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEE-----RRRLLMEIIFH 115 Query: 116 PETLARTPEALNQVDKMFDEAIKQSL 141 + Q + + Sbjct: 116 KCEFVGEMAVVQQAQRNLCLESYDRI 141
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 82.4 bits (203), Expect = 1e-20 Identities = 60/237 (25%), Positives = 108/237 (45%), Gaps = 15/237 (6%) Query: 9 FITGASGGLGLALTRRVLEAGNTVVAAVRNPAALAELQQQFAGQLITE---KLDVTDYAR 65 FITGA+ G+G A+ R + G + A NP L ++ + DV D A Sbjct: 12 FITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAA 71 Query: 66 LPAIAKKHADS----DVIVNNAGGAIIGAMEEFTEQEIEHQFALNLLSPVHITRAFLPAL 121 + I + D++VN AG G + +++E E F++N + +R+ + Sbjct: 72 IDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKYM 131 Query: 122 RAKKQGRLIYITSMGGRVAFPGGAFYHAAKYGLEGFAESTAQEVAEFNIKVQIIEPGSIK 181 ++ G ++ + S V A Y ++K F + E+AE+NI+ I+ PGS + Sbjct: 132 MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGSTE 191 Query: 182 TNFQANVRWTEESDAYK--NGTVGQLRRWIAEHGEESNAGDPQKMADAI-YTLSQQA 235 T+ Q ++ W +E+ A + G++ + I P +ADA+ + +S QA Sbjct: 192 TDMQWSL-WADENGAEQVIKGSLETFKTGIP----LKKLAKPSDIADAVLFLVSGQA 243
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 43.5 bits (102), Expect = 1e-07 Identities = 20/179 (11%), Positives = 57/179 (31%), Gaps = 11/179 (6%) Query: 2 EEKTVQREDVLGEAIQILEIEGIANTTLEMVAERVSYPLADLKRFWPDREALLYDALRYL 61 +E R+ +L A+++ +G+++T+L +A+ + + D+ L + Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66 Query: 62 SHQVDAWRRQLLLDDTLTPEQKLLARYGALTQCVSNHRYPGCLFIAACTFYPDAQHPIHQ 121 + + P L + + + F+ Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLEST--VTEERRRLLMEIIFHKCEFVGEMA 124 Query: 122 LAEQQKQASLANTHELLTQL--------EVDDPAMVAKQMELIVEGCLSRLLIKRSQAD 172 + +Q ++ +++ + Q + M + ++ G +S L+ A Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMR-GYISGLMENWLFAP 182
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 688 bits (1776), Expect = 0.0 Identities = 217/1059 (20%), Positives = 437/1059 (41%), Gaps = 54/1059 (5%) Query: 1 MIEWIIRRSVANRFLVMMGALFLSLWGAWTIVHTPVDALPDLSDVQVIVKTSYPGQAPQI 60 M + IRR + A+ L + GA I+ PV P ++ V V +YPG Q Sbjct: 1 MANFFIRR----PIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQT 56 Query: 61 VENQVTWPLTTTMLSVPGAKTVRGFSQ-FGDSYVYVIFEDGTDPYWARSRVLEYLNQVQG 119 V++ VT + M + + S G + + F+ GTDP A+ +V L Sbjct: 57 VQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATP 116 Query: 120 KLPAGVSAEMGP-DATGVGWIFEYALVDRSGKHDLAELRSLQDWFLKYELKTIPNVSEVA 178 LP V + + + ++ V + ++ +K L + V +V Sbjct: 117 LLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQ 176 Query: 179 SVGGVVKEYQIVVDPQKLTQYGISLSAVKSALDASNQEAGGSSVELS------EAEYMVR 232 G +I +D L +Y ++ V + L N + + + + + Sbjct: 177 LFGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235 Query: 233 ASGYLQTLDDFNNIVLKTGDNGVPVFLRDVARVQTGPEMRRGIAELNGQGEVAGGVVILR 292 A + ++F + L+ +G V L+DVARV+ G E IA +NG+ AG + L Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLA 294 Query: 293 SGKNAREVISAVKHKLETLKSSLPEGVEIVTTYDRSQLIDRAIDNLSYKLLEEFIVVALV 352 +G NA + A+K KL L+ P+G++++ YD + + +I + L E ++V LV Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354 Query: 353 CALFLWHIRSALVAIISLPLGLCIAFIVMHFQGLNANIMSLGGIAIAVGAMVDAAIVMIE 412 LFL ++R+ L+ I++P+ L F ++ G + N +++ G+ +A+G +VD AIV++E Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414 Query: 413 NAHKRLEEWEHHHPGQKLANDTRWKIITDASVEVGPALFISLLIITLSFIPIFTLEGQEG 472 N + + E + + ++ AL ++++ FIP+ G G Sbjct: 415 NVERVMME----------DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464 Query: 473 KLFGPLAFTKTWSMAGAALLAIVVIPILMGFWIRGKIPAETSNPLNRF----------LI 522 ++ + T +MA + L+A+++ P L ++ + AE F + Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKP-VSAEHHENKGGFFGWFNTTFDHSV 523 Query: 523 RIYHPLLLKVLHWPKTTLLVALLSILTIIWPLSRVGGEFLPQINEGDLLYMPSTLPGISA 582 Y + K+L LL+ L + ++ R+ FLP+ ++G L M G + Sbjct: 524 NHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQ 583 Query: 583 AQAADMLQKTDKLIMT--VPEVARVFGKTGKAETATDSAPLEMVETTIQLKPQAQW-RAG 639 + +L + + V VF G + + + LKP + Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQN---AGMAFVSLKPWEERNGDE 640 Query: 640 MTMEKIIEELDKTVRLPGLANLWVPPIRNRIDMLSTGIKSPIGIKVSGTVLSDIDEVAER 699 + E +I + + + +++ + I +G + + + Sbjct: 641 NSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQ 700 Query: 700 IEVVARTVPG-VTSALAERLVGGRYLNIAINREKAARYGMTVGDVQLFVSSAIGGAMVGE 758 + +A P + S L + +++EKA G+++ D+ +S+A+GG V + Sbjct: 701 LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND 760 Query: 759 TVEGVERYPINIRYPQSYRDSPEALRQLPVLTPMKQQITLGDVAEVKVVTGPSMLKTENA 818 ++ + ++ +R PE + +L V + + + V G L+ N Sbjct: 761 FIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNG 820 Query: 819 RPASWIYIDARDRDMVSVVHDLQKTIAEQVQMKPGISVSYSGQFELLERANQKLKLMVPM 878 P+ I +A L + +A ++ GI ++G + + +V + Sbjct: 821 LPSMEIQGEAAPGTSSGDAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAI 878 Query: 879 TLMIIFVLLYLAFRRFGEALLIITSVPFALVGGIWFLYGMGFHLSVATGTGFIALAGVAA 938 + +++F+ L + + + ++ VP +VG + V G + G++A Sbjct: 879 SFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSA 938 Query: 939 EFGVVMLMYLRHAIEAHPSLENPQTFSVEKLDEALYQGAVLRVRPKAMTVAVIIAGLLPI 998 + ++++ + + +E + + EA +R+RP MT I G+LP+ Sbjct: 939 KNAILIVEFAKDLMEKEG----------KGVVEATLMAVRMRLRPILMTSLAFILGVLPL 988 Query: 999 LWGTGAGSEVMSRIAAPMIGGMITAPLLSLFIIPAAYKL 1037 GAGS + + ++GGM++A LL++F +P + + Sbjct: 989 AISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVV 1027
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 37.9 bits (88), Expect = 9e-05 Identities = 17/144 (11%), Positives = 37/144 (25%), Gaps = 7/144 (4%) Query: 330 ANANIGAARAAFFPSITLTSSLSGSSADLSRLFNPASGMWNFVPKIELPIFNAGRNQANL 389 A A+ +++ + + S + P + + + R + + Sbjct: 132 AEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLI 191 Query: 390 DLAEIRQQQSVVNYEQKIQAAFKEVADALALRQSLADQISAQQRYLASLQTTQQRARALY 449 Q E + E LA + ++ L + L Sbjct: 192 KEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS-------LL 244 Query: 450 QHGAVSYIEVLDAERSLFTTQQTL 473 A++ VL+ E L Sbjct: 245 HKQAIAKHAVLEQENKYVEAVNEL 268
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 84.1 bits (208), Expect = 6e-21 Identities = 35/117 (29%), Positives = 62/117 (52%) Query: 2 KILIVEDEKKTGEYLTKGLTESGFVVDLADNGLNGYHLAMTGDYDLLILDIMLPDVNGWD 61 IL+ +D+ L + L+ +G+ V + N + GD DL++ D+++PD N +D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 IVRMLRAANKGVPVLLLTALGSVEHRVKGLELGADDYLVKPFAFAELLARVRTLLRR 118 ++ ++ A +PVL+++A + +K E GA DYL KPF EL+ + L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>SECYTRNLCASE#Preprotein translocase SecY subunit signature. Length = 437 Score = 29.7 bits (67), Expect = 0.026 Identities = 18/76 (23%), Positives = 32/76 (42%), Gaps = 8/76 (10%) Query: 181 NQLKAKLISAASVISIVIIFIVLFV-VYQGHKPI------RQVSRQIQNITSKDLDVRLN 233 L I +VI++ +I + L V V Q + I R + R+ TS + +++N Sbjct: 213 GTLAGGWIEFGTVIAVGLIMVALVVFVEQAQRRIPVQYAKRMIGRRSYGGTSTYIPLKVN 272 Query: 234 PGSV-PVELERLVISF 248 V PV ++ Sbjct: 273 QAGVIPVIFASSLLYI 288
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 25.6 bits (56), Expect = 0.047 Identities = 12/36 (33%), Positives = 17/36 (47%), Gaps = 2/36 (5%) Query: 64 QPLMTFSALVRISLSWVVLLFILFSMAMGFTWLLRR 99 + LM S VR W +L L + M F +LR+ Sbjct: 214 RVLMGMSDAVRTFGPW--MLLALLAGFMAFRVMLRQ 247
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 47.2 bits (112), Expect = 3e-08 Identities = 41/172 (23%), Positives = 71/172 (41%), Gaps = 9/172 (5%) Query: 200 REREHGTIEHLLVMPVTPFEIMLAKI-WSMGLVVLVVSGLSLLLMVQGVLQVPIEGSIPL 258 R T E +L + +I+L ++ W+ L +G+ ++ G + L Sbjct: 93 RMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGY----TQWLSLL 148 Query: 259 FMLGV-ALSLFATTSIGIFMGTIARSMPQLGLLMILVLLPLQMLSGGSTPRESMPQAVQD 317 + L V AL+ A S+G+ + +A S LV+ P+ LSG P + +P Q Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208 Query: 318 IMLTMPTTHFVSLAQAILY--RGADFAIVWPQFLTLMAIGGVFFTIALLRFR 367 +P +H + L + I+ D L + + F + ALLR R Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQHV-GALCIYIVIPFFLSTALLRRR 259
>PF05272#Virulence-associated E family protein Length = 892 Score = 32.0 bits (72), Expect = 0.013 Identities = 18/81 (22%), Positives = 32/81 (39%), Gaps = 4/81 (4%) Query: 1 MRGVEQDTHPPVALLEH--VGQRFGTTVALRDITLSIPARQMVGLIGPDGVGKSSLLSLI 58 + G D + P L VG+ R + V L G G+GKS+L++ + Sbjct: 557 VLGKTPDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTL 616 Query: 59 SGARVIAQGNVMVLGGDMRDA 79 G + + + G +D+ Sbjct: 617 VGLDFFSDTHFDI--GTGKDS 635
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 67.5 bits (165), Expect = 1e-14 Identities = 54/311 (17%), Positives = 107/311 (34%), Gaps = 32/311 (10%) Query: 42 RIEATEVDIATKTAGRIDAILVKEGQFVHKGEVLARMDIRVLNEQRLEAAAQIKEAESAV 101 R + I + Q V + EVL ++ EQ Q + E + Sbjct: 152 RYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRL--TSLIKEQFSTWQNQKYQKELNL 209 Query: 102 AAAKALLDQRQSEMRATEAVVKQRQAELNSTAKRHVRSSALSQRGAVSAQQLDDDQAAAE 161 +A + + E + + ++ L+ + L + A++ + + + Sbjct: 210 DKKRAERLTVLARINRYENLSRVEKSRLDDFSS-------LLHKQAIAKHAVLEQENKYV 262 Query: 162 SARAALESARAQVSAAKAAIEAARTSIIQ-------------AQTRVDAAQATERRILAD 208 A L ++Q+ ++ I +A+ QT + T + Sbjct: 263 EAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNE 322 Query: 209 ID--DSELKAPRDGRI-QYRVAEPGEVLAAGGRVLNMVDLSDVY-MTFFLPTEQAGLLAL 264 S ++AP ++ Q +V G V+ ++ +V D +T + + G + + Sbjct: 323 ERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINV 382 Query: 265 GSEARIVLDAAPGLVVPAHISFVASVAQFTPKTVETSDERLKLMFRVKARIPPELLEQHL 324 G A I ++A P + V V +E D+RL L+F V I L Sbjct: 383 GQNAIIKVEAFPY---TRYGYLVGKVKNINLDAIE--DQRLGLVFNVIISIEENCLSTGN 437 Query: 325 EYVKTGLPGMA 335 + + GMA Sbjct: 438 KNIPLS-SGMA 447 Score = 65.6 bits (160), Expect = 6e-14 Identities = 38/194 (19%), Positives = 74/194 (38%), Gaps = 14/194 (7%) Query: 9 AWCLIGLLAVIAALIWWALRPPGLPQGFAGSNGRI--EATEVDIATKTAGRIDAILVKEG 66 A+ ++G L + A I L + A +NG++ +I + I+VKEG Sbjct: 61 AYFIMGFLVI--AFILSVLGQ--VEIV-ATANGKLTHSGRSKEIKPIENSIVKEIIVKEG 115 Query: 67 QFVHKGEVLARMDIRVLNEQRLEAAAQIKEAESAVAAAKALLDQRQSEMRATEAVVKQRQ 126 + V KG+VL ++ L A A + +S++ A+ + Q R+ E Sbjct: 116 ESVRKGDVLLKLT-------ALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPEL 168 Query: 127 AELNSTAKRHVRSSALSQRGAVSAQQLDDDQAAAESARAALESARAQVSAAKAAIEAART 186 + ++V + + ++ +Q Q L+ RA+ A I Sbjct: 169 KLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYEN 228 Query: 187 SIIQAQTRVDAAQA 200 ++R+D + Sbjct: 229 LSRVEKSRLDDFSS 242
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 147 bits (372), Expect = 1e-45 Identities = 86/256 (33%), Positives = 133/256 (51%), Gaps = 8/256 (3%) Query: 7 LQGKRILITGAGQGIGFVMAQGLAQYGAEIIINDISASRADDAVMKLRDEGAIAHSAVFN 66 ++GK ITGA QGIG +A+ LA GA I D + + + V L+ E A + + Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 67 VTDADAVENAIAKIEEETGAIDVLFNNAGIQRRHPFTEFPVQEWNDVISVNQTAVFLVSQ 126 V D+ A++ A+IE E G ID+L N AG+ R +EW SVN T VF S+ Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 127 AVAKRMVSRQRGKIVNICSMQSELGRDTITPYAAAKGAVKMLTRGMCVELARYNIQVNGI 186 +V+K M+ R+ G IV + S + + R ++ YA++K A M T+ + +ELA YNI+ N + Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 187 APGYFKTAMTQAL-VDD-------RAFTDWLCKRTPANRWGDPQELVGAAVFLSSRASDF 238 +PG +T M +L D+ + + P + P ++ A +FL S + Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245 Query: 239 VNGHLLFVDGGMLVAV 254 + H L VDGG + V Sbjct: 246 ITMHNLCVDGGATLGV 261
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 31.1 bits (70), Expect = 0.005 Identities = 16/66 (24%), Positives = 28/66 (42%), Gaps = 1/66 (1%) Query: 4 QRITLNDIAALAGVTKMTVSRYLRTPDKVKPETAERIASVIAEIGYEPDPDNPAMTSVAV 63 +L +IA AGVT+ + + + + E E S I E+ E P ++V Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGD-PLSV 88 Query: 64 PRIGVL 69 R ++ Sbjct: 89 LREILI 94
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 45.3 bits (107), Expect = 3e-07 Identities = 69/400 (17%), Positives = 150/400 (37%), Gaps = 41/400 (10%) Query: 21 CIISF---MDRVNISFALPGGMEADLGITSQMAGVASGIFFIGYLFLQIPGGRIAVNGSG 77 CI+SF ++ + ++ +LP + D + F + + G+++ Sbjct: 20 CILSFFSVLNEMVLNVSLPD-IANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78 Query: 78 KRFIAWSLMAWAVVSIATGFVTHEYQ--LLVLRFILGVSEGGMLPVVLTMVSNWFPEKEL 135 KR + + ++ S+ GFV H + L++ RFI G +V+ +V+ + P++ Sbjct: 79 KRLLLFGIIINCFGSVI-GFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENR 137 Query: 136 GRANAFVMMFAPLGGMLTAPVSGAIIAALDWRWLFIIEGLLSVVVLAVWWFMISDRPEEA 195 G+A + +G + + G I + W +L +I + + V + + +E Sbjct: 138 GKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLL----KKEV 193 Query: 196 R----------------------WLPEAERHYLVTTLAAERAAKLAEDAVSNAPVK-DVF 232 R + +L+ ++ + V++ V + Sbjct: 194 RIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLG 253 Query: 233 RNSGLMKLVILNFFYQTGDYGYTLWLPTILKGLTGGSMASVGFLAVLPFVATLAGI---Y 289 +N M V+ G+ +P ++K + S A +G +V+ F T++ I Y Sbjct: 254 KNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIG--SVIIFPGTMSVIIFGY 311 Query: 290 VISLFSDRSGKRRLWVRFSLYSFAAALVASVVLR-EHVVAAYIALVICGFFLKSATSPFW 348 + + DR G + + + L AS +L I + + G + T Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVIST 371 Query: 349 SMPGRIAAAEVAGSARGVINGLGNLGGFCGPYLVGVMIYL 388 + + E G+ ++N L G +VG ++ + Sbjct: 372 IVSSSLKQQEA-GAGMSLLNFTSFLSEGTGIAIVGGLLSI 410
>PRPHPHLPASEC#Prokaryotic zinc-dependent phospholipase C signature. Length = 398 Score = 27.7 bits (61), Expect = 0.023 Identities = 11/45 (24%), Positives = 20/45 (44%) Query: 106 KARKLYLTHIDAEVEGDTHFPDYDPDQWESVFSEFHDADAQNSHS 150 + L E++ + +PDYD + ++ F D D N+ S Sbjct: 63 RKNLEILKENMHELQLGSTYPDYDKNAYDLYQDHFWDPDTDNNFS 107
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 182 bits (464), Expect = 8e-62 Identities = 111/161 (68%), Positives = 133/161 (82%), Gaps = 1/161 (0%) Query: 1 MRQRGFTLLEMMLILLLMGVSAGMVMLAFPTSREDDAAHTLERFQTQLRFIRERGLQTGQ 60 MRQRGFTLLEMMLILLLMGVSAGMV+LAFP SR+D AA TL RF+ QLRF+++RGLQTGQ Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQ 60 Query: 61 FFGISIHPDRWQFMLLQPRDDAAANPATEESWYGYRWLPLPPGRVATAGQVASGKLTLSF 120 FFG+S+HPDRWQF++L+ RD A PA ++ W GYRWLPL GRVAT+G +A GKL L+F Sbjct: 61 FFGVSVHPDRWQFLVLEARDGADPAPA-DDGWSGYRWLPLRAGRVATSGSIAGGKLNLAF 119 Query: 121 PHDAQWTPGEQPDVLLFPGGEVTPFQLQIGSAEGIAVDARG 161 WTPG+ PDVL+FPGGE+TPF+L +G A GIA +ARG Sbjct: 120 AQGEAWTPGDNPDVLIFPGGEMTPFRLTLGEAPGIAFNARG 160
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 238 bits (609), Expect = 1e-84 Identities = 95/140 (67%), Positives = 110/140 (78%) Query: 1 MQRQRGFTLLEIMVVIVILGILASLVVPNLMGNKEKADRQKVVSDLVALEGALDMYKLDN 60 +QRGFTLLEIMVVIVI+G+LASLVVPNLMGNKEKAD+QK VSD+VALE ALDMYKLDN Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN 63 Query: 61 SRYPNSEQGLQALVSRPGAEPQARNYPEGGYIRRLPQDPWGNDYQLLSPGQHGQIDVFSV 120 YP + QGL++LV P P A NY + GYI+RLP DPWGNDY L++PG+HG D+ S Sbjct: 64 HHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLSA 123 Query: 121 GPDGMPDTNDDIGNWNIGKK 140 GPDG T DDI NW + KK Sbjct: 124 GPDGEMGTEDDITNWGLSKK 143
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 524 bits (1352), Expect = 0.0 Identities = 279/407 (68%), Positives = 335/407 (82%), Gaps = 4/407 (0%) Query: 1 MAVFRYQALDEHGKTQRGVQQADSARHARQLLREKGWLLLEIH-AVAQATPGSPRSLLTR 59 MA + YQALD GK RG Q+ADSAR ARQLLRE+G + L + L R Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60 Query: 60 R---TSASDLALLTRQLATLVAAAIPLEKALDAVAQQCEKAPLRTLMAGVRGKVLEGHSL 116 R S SDLALLTRQLATLVAA++PLE+ALDAVA+Q EK L LMA VR KV+EGHSL Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120 Query: 117 AESMRGYPGCFDQLYCAMVAAGETSGHLDSVLNRLADYTEQRQQLRARLLQAMIYPIVLT 176 A++M+ +PG F++LYCAMVAAGETSGHLD+VLNRLADYTEQRQQ+R+R+ QAMIYP VLT Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180 Query: 177 LVAISVIAILLSTVVPKVVDQFVHMKQALPFSTRLLMALSDLVRTAGPWLLLALIGGGLL 236 +VAI+V++ILLS VVPKVV+QF+HMKQALP STR+LM +SD VRT GPW+LLAL+ G + Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240 Query: 237 LHYALRQPARRLLWHKQLLRLPLIGRVARSINSARYARTLSILNASAVPLLLSMRISAEV 296 LRQ RR+ +H++LL LPLIGR+AR +N+ARYARTLSILNASAVPLL +MRIS +V Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300 Query: 297 LSNAWARRQLEAATESVREGISLHRALEMTALFPPMMRYMVASGEQSGELNGMLERAADN 356 +SN +AR +L AT++VREG+SLH+ALE TALFPPMMR+M+ASGE+SGEL+ MLERAADN Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360 Query: 357 QDRELSAQIQMALSLFEPLLVVAMAGMVLFIVLAILQPILQLNTLMS 403 QDRE S+Q+ +AL LFEPLLVV+MA +VLFIVLAILQPILQLNTLMS Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLMS 407
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 836 bits (2161), Expect = 0.0 Identities = 607/646 (93%), Positives = 632/646 (97%) Query: 13 TLMIFSSLLFRPLHAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML 72 TL+IF++LLFRP AEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML Sbjct: 13 TLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML 72 Query: 73 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGEGDEVVTRVV 132 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPG GDEVVTRVV Sbjct: 73 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVV 132 Query: 133 PLTNVAARDLAPLLRQLNDNAGAGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDR 192 PLTNVAARDLAPLLRQLNDNAG GSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDR Sbjct: 133 PLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDR 192 Query: 193 SVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNSVLVSGEPNSRQRI 252 SVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTN+VLVSGEPNSRQRI Sbjct: 193 SVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRI 252 Query: 253 ITMIKQLDRQQAVQGNTKVIYLKYAKAADLVEVLTGISSTLQSDKQSAKPVAALDKNIII 312 I MIKQLDRQQA QGNTKVIYLKYAKA+DLVEVLTGISST+QS+KQ+AKPVAALDKNIII Sbjct: 253 IAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIII 312 Query: 313 KAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGVQWANKN 372 KAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLG+QWANKN Sbjct: 313 KAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKN 372 Query: 373 AGMTQFNNSGLPMSTVIAGANQYNKDGTVTTSLASALSSFNGIAAGFYQGNWAMLLTALS 432 AGMTQF NSGLP+ST IAGANQYNKDGTV++SLASALSSFNGIAAGFYQGNWAMLLTALS Sbjct: 373 AGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALS 432 Query: 433 SSSKNDILATPSIVTLDNMEATFNVGQEVPVLSGSQTTSGDNIFNTVERKTVGIKLKVKP 492 SS+KNDILATPSIVTLDNMEATFNVGQEVPVL+GSQTTSGDNIFNTVERKTVGIKLKVKP Sbjct: 433 SSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKP 492 Query: 493 QINEGDSVLLEIEQEVSSVADSASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKS 552 QINEGDSVLLEIEQEVSSVAD+ASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKS Sbjct: 493 QINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKS 552 Query: 553 VTDTADKVPLLGDIPVIGALFRSSSKKVSKRNLMLFIRPTIIRDRDSYRQASSGQYNAFN 612 V+DTADKVPLLGDIPVIGALFRS+SKKVSKRNLMLFIRPT+IRDRD YRQASSGQY AFN Sbjct: 553 VSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFN 612 Query: 613 EAQTKQRGKENNEALLSDDRLHIYPQQDTVAFRQISAAIDAFNLGG 658 +AQ+KQRGKENN+A+L+ D L IYP+QDT AFRQ+SAAIDAFNLGG Sbjct: 613 DAQSKQRGKENNDAMLNQDLLEIYPRQDTAAFRQVSAAIDAFNLGG 658
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 216 bits (552), Expect = 6e-72 Identities = 102/261 (39%), Positives = 153/261 (58%), Gaps = 14/261 (5%) Query: 28 IAMLVTLLLIGQQLAKLSWRMILPAYSPAIEVNDAPEISPLIPAPKTELPV----FTLFG 83 I + +LL QQLA + WR+ LP +P V + PA + PV FTLFG Sbjct: 17 ILFYLLMLLFCQQLAMIFWRIGLPDNAPVSSV-------QITPAQARQQPVTLNDFTLFG 69 Query: 84 RAEKKPQSSAHDESL-DQAPLSSLKLRITGLLASSDAARSIVIMAKGNQQVSLMTGDSTP 142 + +K ++ A D S P S+L L +TG++A D +RSI I++K N+Q S + P Sbjct: 70 VSPEKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGVNEEVP 129 Query: 143 GNEARIIAILRDRIIVNYRGRNEAILLADDGSMKAAGAPDTALNSPLSKIRQQPQNILNY 202 G A+I++I DR+++ Y+GR E + L + G P +N L + + +Y Sbjct: 130 GYNAKIVSIRPDRVVLQYQGRYEVLGLYSQEDSGSDGVPGAQVNEQLQQRA--STTMSDY 187 Query: 203 LNISPVMVNEQLSGYRLNPGKDPTLFRESGLHENDLAVALNGLDLRDRQQAQQALKQLPE 262 ++ SP+M + +L GYRLNPG F GL +ND+AVALNGLDLRD +QA++A++++ + Sbjct: 188 VSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDNDMAVALNGLDLRDAEQAKKAMERMAD 247 Query: 263 LSEITLTVEREGQRQDIYLAL 283 + TLTVER+GQRQDIY+ Sbjct: 248 VHNFTLTVERDGQRQDIYMEF 268
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 36.9 bits (85), Expect = 2e-05 Identities = 30/123 (24%), Positives = 50/123 (40%), Gaps = 22/123 (17%) Query: 3 AQRVVMVV-DMQN---GVFATPRIERERCVAHINRLIRAADR----VIFI-----QHAED 49 R V+++ DMQN F A+I +L + V++ Q+ +D Sbjct: 28 PNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQNPDD 87 Query: 50 ---------GGLEEGGEGFALLAELEPPADALYVTKTACDAFYKTCLEQVLSEQQIHQFV 100 GL G ++ EL P D L +TK AF +T L +++ ++ Q + Sbjct: 88 RALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLI 147 Query: 101 ICG 103 I G Sbjct: 148 ITG 150
>SSPANPROTEIN#Salmonella invasion protein InvJ signature. Length = 336 Score = 30.9 bits (69), Expect = 0.008 Identities = 20/59 (33%), Positives = 28/59 (47%), Gaps = 11/59 (18%) Query: 62 QDPARIALRPSDVTLAQIPGRDAQQLIDADSGQPLAA----IDVIFPIVHGTLGEDGSL 116 +D +++ L+P+ + D QL D PLAA + IFP G GED SL Sbjct: 212 KDVSQLPLQPTTIA-------DLSQLTGGDEKMPLAAQSKPMMTIFPTADGVKGEDSSL 263
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 69.1 bits (169), Expect = 8e-15 Identities = 40/183 (21%), Positives = 87/183 (47%), Gaps = 5/183 (2%) Query: 422 IQIVEERTIGPTLGMQNIKQGLEACLAGLVVSILFMIF-FYKKFGLIATSALVANLVLIV 480 ++I ++GP + + + + + LA VV + ++ F +F L A ALV +++L V Sbjct: 135 LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTV 194 Query: 481 GIMSLLPGATLSMPGIAGIVLTLAVAVDANVLINERIKEEL--SNGRSVQQAIDEGYRGA 538 G+ ++L + +A ++ +++ V++ +R++E L ++ ++ Sbjct: 195 GLFAVL-QLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNET 253 Query: 539 FSSIFDANITTLIKVIILYAVGTGAIKGFAITTGIGVATSMFTAIVGTRAIVNLLYGGKR 598 S +TTL+ ++ + G I+GF GV T ++++ + IV L G R Sbjct: 254 LSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIV-LFIGLDR 312 Query: 599 VKK 601 K+ Sbjct: 313 NKE 315
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 346 bits (888), Expect = e-121 Identities = 100/309 (32%), Positives = 174/309 (56%), Gaps = 12/309 (3%) Query: 17 YDFMRWDYWAFGISGFLLIAAIVVMGVRGFNWGLDFTGGTVIEITLEKPAELDQMRQALQ 76 +DF RW + FG + ++IA++++ V G N+G+DF GGT I ++ R AL+ Sbjct: 14 FDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYRAALE 73 Query: 77 KAGFEEPQVQNFGSSR------DIMVRMPPVHDANISQELGSKVVSVINE------STSQ 124 + + M+R+ D ++ G++ ++N+ + Sbjct: 74 PLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAVDP 133 Query: 125 SATVKRIEFVGPSVGADLAQSGALALLAALICILVYVGFRFEWRLAAGVVIALAHDVVIT 184 + + E VGP V +L + +LLAA + I+ Y+ RFEW+ A G V+AL HDV++T Sbjct: 134 ALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLT 193 Query: 185 MGILSLFHIEIDLTIVASLMSVIGYSLNDSIVVSDRIRENFRKIRRGTPYEIFNVSLTQT 244 +G+ ++ ++ DLT VA+L+++ GYS+ND++VV DR+REN K + ++ N+S+ +T Sbjct: 194 VGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNET 253 Query: 245 LHRTLITSGTTLVVILMLYLFGGPVLEGFSLTMLIGVSIGTASSIYVASALALKLGMKRE 304 L RT++T TTL+ ++ + ++GG V+ GF M+ GV GT SS+YVA + L +G+ R Sbjct: 254 LSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLDRN 313 Query: 305 HMLQQKVEK 313 + +K Sbjct: 314 KEKKDPSDK 322
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 31.8 bits (72), Expect = 0.001 Identities = 14/56 (25%), Positives = 25/56 (44%), Gaps = 5/56 (8%) Query: 3 RRADRLFQIVQILRGRRLTT-----AALLAERLGVSERTIYRDIRDLSLSGVPVEG 53 + R +I +I+ + T L + V++ T+ RDI++L L VP Sbjct: 2 NKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKELHLVKVPTNN 57
>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx signature. Length = 294 Score = 533 bits (1374), Expect = 0.0 Identities = 285/294 (96%), Positives = 291/294 (98%) Query: 1 MKKTLLAAGAVLALSTSFTAGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLE 60 MKKTLLAAGAV+ALST+F AGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLE Sbjct: 1 MKKTLLAAGAVVALSTTFAAGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLE 60 Query: 61 YEAFAKKDWFDFYGYIDAPVFFGGNSTAKGIWNNGSPLFMEIEPRFSIDKLTNTDLSFGP 120 YEAFAKKDWFDFYGYIDAPVFFGGNSTAKGIWN GSPLFMEIEPRFSIDKLTNTDLSFGP Sbjct: 61 YEAFAKKDWFDFYGYIDAPVFFGGNSTAKGIWNKGSPLFMEIEPRFSIDKLTNTDLSFGP 120 Query: 121 FKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGLPMSLSLNVYAKYQWQNYGASNEN 180 FKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGLPMSLSLNVYAKYQWQNYGASNEN Sbjct: 121 FKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGLPMSLSLNVYAKYQWQNYGASNEN 180 Query: 181 EWDGYRFKVKYFVPLTDLWGGSLSYIGFTNFDWGSDLGDDNFYDMNGKHARTSNSIASSH 240 EWDGYRFKVKYFVPLTDLWGGSLSYIGFTNFDWGSDLGDDNFYD+NGKHARTSNSIASSH Sbjct: 181 EWDGYRFKVKYFVPLTDLWGGSLSYIGFTNFDWGSDLGDDNFYDLNGKHARTSNSIASSH 240 Query: 241 ILALNYAHWHYSVVARYWHNGGQWADDAKLNFGDGDFNVRSTGWGGYFVVGYNF 294 ILALNYAHWHYS+VARY+HNGGQWADDAKLNFGDG F+VRSTGWGGYFVVGYNF Sbjct: 241 ILALNYAHWHYSIVARYFHNGGQWADDAKLNFGDGPFSVRSTGWGGYFVVGYNF 294
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 43.5 bits (102), Expect = 3e-06 Identities = 39/217 (17%), Positives = 67/217 (30%), Gaps = 31/217 (14%) Query: 359 FHPQAPLPEPEVKPLPATLAAPVQAAPVSAP------------PVQPPPQNLPQTTSQVL 406 ++P+ V T +QA S P PV PP P T++ + Sbjct: 981 YNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETV 1040 Query: 407 AARSQLQ------RSQGAPTPKKSEPAAASRARPVNNAALE-----RLSSITERVQAR-- 453 A S+ + Q A A A+ A + + S T+ Q Sbjct: 1041 AENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTET 1100 Query: 454 --PAVAAVQEKAPAKKEAYRWKATTIVETVKEEVATPKALKKALEHEKTPELAAKLALEA 511 A +EKA + E + + + V + + ++ E + + E Sbjct: 1101 KETATVEKEEKAKVETEKTQ-EVPKVTSQVSPKQEQSETVQPQAEPAREND-PTVNIKEP 1158 Query: 512 VERDGWAAEVNQLA--VPKLVEQVALNAWKEQDGNRI 546 + A+ Q A VEQ + GN + Sbjct: 1159 QSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSV 1195
>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature. Length = 522 Score = 25.9 bits (56), Expect = 0.046 Identities = 25/84 (29%), Positives = 43/84 (51%), Gaps = 4/84 (4%) Query: 13 KQAQQMQ-DKMQKMQEEIAQLEVTGESGAGLVKVTINGAHNCRRVEIDPSLLEDDKDM-- 69 +QAQ+ Q DK +K +EE A+ E+ + N ++N E+ E++ D Sbjct: 156 EQAQKAQKDKREKRKEERAKNRANLENLTNAMSNPQNLSNNKNLSELIKQQRENELDQME 215 Query: 70 -LEDLVAAAFNDAARRIEETQKEK 92 LED+ A +A ++IEE K++ Sbjct: 216 RLEDMQEQAQANALKQIEELNKKQ 239
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 32.5 bits (74), Expect = 0.003 Identities = 18/150 (12%), Positives = 46/150 (30%), Gaps = 8/150 (5%) Query: 299 RSQLNYSEENLKQARSALERLYTALRGTDKSAEPAGGDAFEARFIEAMDDDFNTP----- 353 + ++ +L QAR R R + + P E F +++ Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK 192 Query: 354 EAYSVLFDMAREVNRLKTEDVAAANAMAAHMRKLSGVLGLLEQEPDVFLQSGAQADDGEV 413 E +S + + + A + A + + + + + D F + Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS---LLHKQAI 249 Query: 414 AEIELLIQQRLDARKAKDWAAADAARDRLN 443 A+ +L Q+ + + +++ Sbjct: 250 AKHAVLEQENKYVEAVNELRVYKSQLEQIE 279
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 51.0 bits (122), Expect = 2e-09 Identities = 23/111 (20%), Positives = 45/111 (40%), Gaps = 1/111 (0%) Query: 63 VVSLNQVDIQSQITGTVKRVAFQEGAFVRQGQLLFTLEDSTQQATLHRDQASRAQAQSQL 122 S +I+ VK + +EG VR+G +L L +A + Q+S QA+ + Sbjct: 91 THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQ 150 Query: 123 DKAQRDLARGRALKAQNDISASDWETLLSTRQQYVAQAQAAQEDILSAEAQ 173 + Q L+R L ++ D + ++ V + + ++ S Sbjct: 151 TRYQI-LSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQN 200 Score = 50.2 bits (120), Expect = 3e-09 Identities = 24/132 (18%), Positives = 50/132 (37%), Gaps = 4/132 (3%) Query: 103 TQQATLHRDQASRAQAQSQLDKAQRDLARGRALKAQNDISASDWETLLSTRQQYVAQAQA 162 Q+ +SQL++ + ++ + ++ +L +Q Sbjct: 256 EQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL--VTQLFKNEILDKLRQTTDNIGL 313 Query: 163 AQEDILSAEAQLGYTRIYAPVSGKTGALNVH-PGSLVQPGSSLPLVTVHQFDPVGISFTL 221 ++ E + + I APVS K L VH G +V +L +V V + D + ++ + Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTALV 372 Query: 222 AENDLNAVVGGL 233 D+ + G Sbjct: 373 QNKDIGFINVGQ 384
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 71.0 bits (174), Expect = 2e-16 Identities = 27/112 (24%), Positives = 53/112 (47%) Query: 2 RLLLVEDQTMAADYIARGLRENDFVVDVAHDGVDGLHFLLTNDYDLAILDVMLPGMNGWK 61 +L+ +D + + L + V + + ++ D DL + DV++P N + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 ILELARQAGKLTPVMFLTARDDVEDRVCGLELGAEDYLIKPFSFSELLARVR 113 +L ++A PV+ ++A++ + E GA DYL KPF +EL+ + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116
>PF06580#Sensor histidine kinase Length = 349 Score = 41.8 bits (98), Expect = 4e-06 Identities = 26/104 (25%), Positives = 48/104 (46%), Gaps = 25/104 (24%) Query: 366 LLSNALRH----TPSGGKIVLRAGRENGQIVLSVEDNGEGISPEHLPHIFDRFYRLDDAR 421 L+ N ++H P GGKI+L+ ++NG + L VE+ G + Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA-----------------LK 305 Query: 422 SNAENTGLGLALVKT-IAELHGG--RIVVTSTPHRGSCFSLLLP 462 + E+TG GL V+ + L+G +I ++ + + +L+P Sbjct: 306 NTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 28.3 bits (63), Expect = 0.033 Identities = 13/52 (25%), Positives = 24/52 (46%), Gaps = 3/52 (5%) Query: 78 LTVVFISIFGTLVTDNLSDALHVPLA---YSSLFFALALLVTFVLWYAREKT 126 L +V + I+G V A+ + YSS++ A +++ L +EK Sbjct: 266 LALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLDRNKEKK 317
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 43.4 bits (102), Expect = 6e-07 Identities = 31/137 (22%), Positives = 57/137 (41%), Gaps = 1/137 (0%) Query: 197 AREREQGTLDQLLVSPLATWQIFVGKAVPALIVATLQASIVLAIGIWAYQIPFAGSLLLF 256 R Q T + +L + L I +G+ A A L + + + + SLL Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGYTQWL-SLLYA 150 Query: 257 YFTMVIYGLSLVGFGLLISSLCSTQQQAFIGVFVFMMPAILLSGYVSPVENMPQWLQDVT 316 + + GL+ G+++++L + + + P + LSG V PV+ +P Q Sbjct: 151 LPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQTAA 210 Query: 317 WINPIRHFTDITKQIYL 333 P+ H D+ + I L Sbjct: 211 RFLPLSHSIDLIRPIML 227
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.8 bits (69), Expect = 0.018 Identities = 11/27 (40%), Positives = 14/27 (51%) Query: 30 IRAGYVTGLVGPDGAGKTTLMRMLAGL 56 + Y L G G GK+TL+ L GL Sbjct: 593 CKFDYSVVLEGTGGIGKSTLINTLVGL 619
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 69.1 bits (169), Expect = 4e-15 Identities = 48/282 (17%), Positives = 106/282 (37%), Gaps = 20/282 (7%) Query: 55 ASLNVDEGDKIQAGQILGQLDRAPYENALQQAQANVSTAQAQYDLMMAGYRAEEIAQAAA 114 NV E + L + + ++N Q + N+ +A+ ++A E Sbjct: 175 YFQNVSEEEV-LRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233 Query: 115 AVKQAQAAYDYAQNFYQRQ--LGLRQNSAISVNDLENARSSRDQAQATLKSAQDKLRQYR 172 + + + + L +VN+L +S +Q ++ + SA+++ + Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293 Query: 173 AGNRPQ---EIAQAKASLEQAQAALAQAKLDLHDTTLIAPSDGTLMTRAV-EPGSMLSAG 228 + + ++ Q ++ LA+ + + + AP + V G +++ Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353 Query: 229 GTVMTLSLT-HPVWVRAYIDEKNLGQAVPGREVLLYTDSRPNQPWHGKIGFVSPVAEFTP 287 T+M + + V A + K++G G+ ++ ++ P +G + V V Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFP-YTRYGYL--VGKVKNINL 410 Query: 288 KTVETPDLRTDLVYRLRIVVTDADDS-------LRQGMPVTV 322 +E D R LV+ + I + + S L GM VT Sbjct: 411 DAIE--DQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTA 450
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 64.6 bits (157), Expect = 6e-15 Identities = 29/227 (12%), Positives = 74/227 (32%), Gaps = 35/227 (15%) Query: 5 PVTTKGEQAKSQLIAAAIAQFGEYGLHATT-RDIAAQAGQNIAAIPYYFGSKDDLYLACA 63 + ++ + ++ A+ F + G+ +T+ +IA AG AI ++F K DL+ Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63 Query: 64 QWIADFISHNFRPHAEAAETLLATASPDKAAIRVLILRACQNMILLLTQDDTVNLSKFIS 123 + I E + +R +++ ++ + T++ L + I Sbjct: 64 ELSESNI------GELELEYQAKFPGDPLSVLREILIHVLESTV---TEERRRLLMEIIF 114 Query: 124 REQLSP------TPAYQLIHGQVIAPLHRYLTRLI---GAFTGLDADDTAMILHTHALLG 174 + A + + + + + L I L A+I+ + Sbjct: 115 HKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM--RGYIS 172 Query: 175 EILAFRLGRETILLRTGWTQFDEEKTEQIYQVIACHIDFVLQGLLQR 221 ++ W + + + ++ +L+ L Sbjct: 173 GLM------------ENWLFAPQSFD--LKKEARDYVAILLEMYLLC 205
>SECA#SecA protein signature. Length = 901 Score = 29.8 bits (67), Expect = 0.027 Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 4/67 (5%) Query: 246 QQVLVFTRTKHGANHLAEQLNKDGIRSAAIHG-NKSQGARTRALADFKSGGIRVLVATDI 304 Q VLV T + + ++ +L K GI+ ++ + A A A + + V +AT++ Sbjct: 450 QPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNM 506 Query: 305 AARGLDI 311 A RG DI Sbjct: 507 AGRGTDI 513
>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature. Length = 533 Score = 28.8 bits (64), Expect = 0.032 Identities = 18/48 (37%), Positives = 26/48 (54%), Gaps = 2/48 (4%) Query: 38 AELELGGILIALRIKGEGEAEMKGFYEAMQQKTLRLTPPVARPMPIVI 85 AE+E+G + KGE +A+ G +A +K +LTPP PI I Sbjct: 101 AEVEVGKG--EVDSKGEIKADSGGGTDAPIRKPFKLTPPQPTMSPISI 146
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 31.3 bits (71), Expect = 0.005 Identities = 36/139 (25%), Positives = 56/139 (40%), Gaps = 9/139 (6%) Query: 14 LVAGCALLVLVAPAV-QAAEQLPDAPS-IDAR-AWILMDYASGKVLSEGNADEKLDPASL 70 +++ A L L A Q EQ+ + S + R I MD ASG+ L+ ADE+ S Sbjct: 8 IISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMST 67 Query: 71 TKIMTSYVVGQALKAGKIKSTDMVTVGRDAWATGNPALRGSSVMFLKPGMQVSVEDLNKG 130 K++ V + AG + + + +P L GM +V +L Sbjct: 68 FKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSE----KHLADGM--TVGELCAA 121 Query: 131 VIIQSGNDASIAIADYVAG 149 I S N A+ + V G Sbjct: 122 AITMSDNSAANLLLATVGG 140
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 44.4 bits (105), Expect = 4e-07 Identities = 68/310 (21%), Positives = 121/310 (39%), Gaps = 22/310 (7%) Query: 31 IGNDMIQPGMLSVVEEFQVGNEWVPTSMTAYLAGGMFLQWL----LGPLSDRIGRRPVML 86 +G +I P + ++ + V + V LA +Q+ LG LSDR GRRPV+L Sbjct: 19 VGIGLIMPVLPGLLRDL-VHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLL 77 Query: 87 TGVVWFIVTCLATLLAQTIEQFTLLRFLQGISLCFIGAVGYAAIQESFEEAMCIKITALM 146 + V A + + R + GI+ GAV A I + + + M Sbjct: 78 VSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDGDERARHFGFM 136 Query: 147 ANVALIAPLLGPLVGAAWVHILPWEMMFVLFAVLAAIAFVGLQRAMPETATRMGEKLSMK 206 + + GP++G P F A L + F+ +PE+ L + Sbjct: 137 SACFGFGMVAGPVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRRE 195 Query: 207 ELGRDYGLVLKN-LRFVAGALATGFVSLPLLAWIAQSP--VIIISGEKATSYEYGLLQVP 263 L + VA +A F ++ + Q P + +I GE ++ + + Sbjct: 196 ALNPLASFRWARGMTVVAALMAVFF----IMQLVGQVPAALWVIFGEDRFHWDATTIGIS 251 Query: 264 I--FGAL--IAGNLVLARLTSRRTVRSLIILGGWPIMFGLLLSAVATVVSTHAYLWMTAG 319 + FG L +A ++ + +R R ++LG G +L A A T ++ Sbjct: 252 LAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA----TRGWMAFPIM 307 Query: 320 LSVYAFGIGL 329 + + + GIG+ Sbjct: 308 VLLASGGIGM 317
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 32.9 bits (75), Expect = 0.002 Identities = 32/150 (21%), Positives = 63/150 (42%), Gaps = 9/150 (6%) Query: 17 LFMFFFIPGLLMASWATRTPAIRDLLTLSTAEMGIVLFGLSIGSMSGILCS---AWLVKR 73 + I G + + ++D+ LSTAE+G V+ + G+MS I+ LV R Sbjct: 262 VLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVI--IFPGTMSVIIFGYIGGILVDR 319 Query: 74 FGTRKVIRTTMSCAVVGMIVLSVALWLTSAVLFAIGLAIFGASFGSAEVAINVEGAAVER 133 G V+ ++ V + S L TS + I + + G + V + +++++ Sbjct: 320 RGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQ 379 Query: 134 EMNKTVLPMMHGFYSFGTLFGAGVGMAVTG 163 + + +F + G G+A+ G Sbjct: 380 QEAGAGM----SLLNFTSFLSEGTGIAIVG 405 Score = 31.0 bits (70), Expect = 0.008 Identities = 33/150 (22%), Positives = 65/150 (43%), Gaps = 6/150 (4%) Query: 218 LLIGVVVLAMAFAEGSANDWL-PLLMVDGHGFSP-TSGSLIYAGFTLGMTIGRFTGGWFI 275 +IGV+ + F + + P +M D H S GS+I T+ + I + GG + Sbjct: 258 FMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILV 317 Query: 276 DRYSRVTVVR-ASALM--GALGIGLIIFVDNPWVA-GVSVLLWGLGASLGFPLTISAASD 331 DR + V+ + L ++ + ++ + +L GL + TI ++S Sbjct: 318 DRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSL 377 Query: 332 TGPDAPKRVSVVAITGYLAFLVGPPLLGFL 361 +A +S++ T +L+ G ++G L Sbjct: 378 KQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 51.9 bits (124), Expect = 1e-10 Identities = 21/156 (13%), Positives = 49/156 (31%), Gaps = 5/156 (3%) Query: 1 MAR--RPNDPQRRERILQATLDTIATHGIHAVTHRKIASCAEVPLGSLTYYFSGIEALIE 58 MAR + + R+ IL L + G+ + + +IA A V G++ ++F L Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 59 EAFRIFTRDMSVQYQQFFTSVSSREEACDAIAELIFSAQVTTARNMELMYQLYAFCSSQP 118 E + + ++ + + ++ I + + E L + Sbjct: 61 EIWELSESNIG---ELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKC 117 Query: 119 ALKAVMQHWMRRSQQTLEQWFEPATARGLDAFIEGM 154 M + + + ++ M Sbjct: 118 EFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKM 153
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 77.4 bits (190), Expect = 7e-19 Identities = 70/263 (26%), Positives = 122/263 (46%), Gaps = 8/263 (3%) Query: 3 VKDKVAAIFGGSGAIGSAVAHAMAREGARVYLGARDRQKLDRVAGEIRAAGGRAETFIVD 62 ++ K+A I G + IG AVA +A +GA + + +KL++V ++A AE F D Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 63 LLDERSTADRIAQLTQQSDGLDIVVNATGFVHNQGKEITTLSLAEFMQGITPFLAAQFNL 122 + D + + A++ ++ +DI+VN G + I +LS E+ + FN Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRP--GLIHSLSDEEWEATFSVNSTGVFNA 123 Query: 123 AKAVTPYMGGARPGAIITVVAPAATMAMPGHLGHIVGCAGSEAFIKALASELGPKNIRVL 182 +++V+ YM R G+I+TV + A + + A + F K L EL NIR Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183 Query: 183 GVRSHAITGAVEAGSYTAEVFAAKAQAMGLTVEQWVGGAAHSTMLKRLPTLTQVADVITF 242 V + ++ + E Q + ++E + G LK+L + +AD + F Sbjct: 184 IVSPGSTETDMQWSLWADE--NGAEQVIKGSLETFKTGIP----LKKLAKPSDIADAVLF 237 Query: 243 LASDRADAMTATVVNMTAGATTG 265 L S +A +T + + GAT G Sbjct: 238 LVSGQAGHITMHNLCVDGGATLG 260
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 34.4 bits (79), Expect = 0.001 Identities = 23/106 (21%), Positives = 36/106 (33%), Gaps = 6/106 (5%) Query: 394 LMIGMITFQFSSFSFGIGNAAGLLFAGIML-GFLRANHPTFG-YIPQ--GALNMVKEFGL 449 L++ + +L+ G ++ G A G YI + FG Sbjct: 76 LLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF 135 Query: 450 MVFMAGVGLSAGAGIGHGLGAIGGQM--LAAGLIVSLVPVVICFLF 493 M G G+ AG +G +G AA + L + CFL Sbjct: 136 MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181
>LCRVANTIGEN#Low calcium response V antigen signature. Length = 326 Score = 30.0 bits (67), Expect = 0.023 Identities = 14/44 (31%), Positives = 22/44 (50%), Gaps = 2/44 (4%) Query: 248 ENGLARQR--LEQERDADWAIRELLARMTQRLQGCESFDDVIKV 289 +NG+ R + LE + W +R +A M L DD++KV Sbjct: 95 QNGIKRVKEFLESSPNTQWELRAFMAVMHFSLTADRIDDDILKV 138
>YERSSTKINASE#Yersinia serine/threonine protein kinase signature. Length = 732 Score = 28.6 bits (63), Expect = 0.049 Identities = 16/51 (31%), Positives = 29/51 (56%), Gaps = 2/51 (3%) Query: 229 EISCLLATYMHLPEEECKKIEIAGYLHDIGKINIPLDILEKQSELTPDELR 279 E+S LL T HL K++++ G L D+ + + LD E++ + D+L+ Sbjct: 447 ELSDLLRT--HLSSAATKQLDMGGVLSDLDTMLVALDKAEREGGVDKDQLK 495
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 30.9 bits (70), Expect = 0.020 Identities = 18/91 (19%), Positives = 31/91 (34%), Gaps = 13/91 (14%) Query: 548 GSFGTVQYSQIGKAVQSGNVEPEKARTWEVGTRYDNGALSAEMGLFLINFNNQYDSNQTN 607 G F + NV EK + + + YDN AL A + Q + Sbjct: 187 GFFVQYGGAYKRHHQVQENVNIEKYQIHRLVSGYDNDALYASVA-----------VQQQD 235 Query: 608 DTVTARGKTRHSGLE--TQTRYDLGDLNPQL 636 + + +S E Y G++ P++ Sbjct: 236 AKLVEENYSHNSQTEVAATLAYRFGNVTPRV 266
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 83.2 bits (205), Expect = 4e-21 Identities = 50/185 (27%), Positives = 89/185 (48%), Gaps = 2/185 (1%) Query: 6 VLITGASTGIGAVYAERFARRGHDLVLVARDKAKLEILADRLRQENGISVDVLPADLTQA 65 ITGA+ GIG A A +G + V + KLE + L+ E + PAD+ + Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE-ARHAEAFPADVRDS 69 Query: 66 SDLAQVEARLREDTQ-IGILINNAGIAQSGSFTEQTPESIESLIALNVTALTRLASAVAP 124 + + ++ AR+ + I IL+N AG+ + G + E E+ ++N T + + +V+ Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129 Query: 125 RFVQAGEGSIVNISSVVGLAPEFAMTVYGATKAFVLFLSQGMNVELSSKGIYIQAVLPAG 184 + GSIV + S P +M Y ++KA + ++ + +EL+ I V P Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189 Query: 185 TYTEI 189 T T++ Sbjct: 190 TETDM 194
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 88.6 bits (219), Expect = 7e-23 Identities = 56/189 (29%), Positives = 84/189 (44%), Gaps = 12/189 (6%) Query: 18 RVALVTGASSGIGEASAIKLLAAGYTVYG----------TSRRGALAGKHPFPLLALDVT 67 ++A +TGA+ GIGEA A L + G + +H DV Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA-EAFPADVR 67 Query: 68 DDASVGAAIEELLRLEGRIDLLVNNAGFGIAPAAAEESSVEQARHMFDTNFLGIVRLTRA 127 D A++ + R G ID+LVN AG + P S E+ F N G+ +R+ Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 128 VIPHMRRQGSGRIINIGSILGVVPFPYVALYAASKYAVEGYTGALDHELRTQGIRVSVIE 187 V +M + SG I+ +GS VP +A YA+SK A +T L EL IR +++ Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 188 PAYMKTQFE 196 P +T + Sbjct: 187 PGSTETDMQ 195
>RTXTOXINC#Gram-negative bacterial RTX toxin-activating protein C signature. Length = 170 Score = 127 bits (320), Expect = 8e-40 Identities = 58/149 (38%), Positives = 82/149 (55%), Gaps = 3/149 (2%) Query: 19 NEAEVLGASVWLWMHSPMHRDAPLHALPTLLLPIIKRRQYVLIMENERPVFFLSWAWLNP 78 E+LG WLW SP+HR+ P+ +LP I+ QYVL+ ++ PV + SWA L+ Sbjct: 5 KPLEILGHVSWLWASSPLHRNWPVSLFAINVLPAIQANQYVLLTRDDYPVAYCSWANLSL 64 Query: 79 ESEARYLTRPAIEMPEADWNSGDRMWFCDWIAPFGHTAAMNALMRQDIFADHCARALYHR 138 E+E +YL + E DW SGDR WF DWIAPFG A+ MR+ F D RA+ Sbjct: 65 ENEIKYLNDVTSLVAE-DWTSGDRKWFIDWIAPFGDNGALYKYMRKK-FPDELFRAIRVD 122 Query: 139 GEQRGKRVVMFHGRRVSRVQARA-WQQAH 166 + +V FHG ++ + A ++Q H Sbjct: 123 PKTHVGKVSEFHGGKIDKQLANKIFKQYH 151
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 34.4 bits (79), Expect = 0.001 Identities = 22/177 (12%), Positives = 49/177 (27%), Gaps = 17/177 (9%) Query: 238 TGRYQGGVTLSLDNPFSLSDLLYFSASHDLDDNGGKKSRNYIAHYSVPYGYWMLGITGSD 297 +G + L++ + LY S SH + A + + ++ S Sbjct: 522 AYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSL 581 Query: 298 YDYH-----QTVAGLN----------GDYRYSGKSKNLDVQLSRVLHRSGTQKTTFTYDV 342 + LN D + + + +S L+ T + Sbjct: 582 TKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTL 641 Query: 343 LARESRNYIDDTEVGVQRRQTAAWRVGLQHRHYIGQATLDLGASYQRGTR--WFGAQ 397 L + +Y T + + G ++G S+ + ++G Sbjct: 642 LEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVS 698
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 29.8 bits (67), Expect = 0.005 Identities = 14/48 (29%), Positives = 26/48 (54%), Gaps = 5/48 (10%) Query: 1 MHKTARQKYLVDLISENGQVSISELVEKLQ-----VSADTLRRDLADL 43 M+K R + ++I+ N + ELV+ L+ V+ T+ RD+ +L Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKEL 48
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.014 Identities = 13/42 (30%), Positives = 19/42 (45%) Query: 162 VVKEVNRDGEVVWEWRAWEHLSPEDYPVHTIFDRRHWPMING 203 V RDG W+WR W+ P +P H + R ++ G Sbjct: 194 VYSRSQRDGSEAWKWRGWDDPRPLYFPSHRAPESRTVVLVEG 235
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 266 bits (681), Expect = 9e-87 Identities = 116/478 (24%), Positives = 196/478 (41%), Gaps = 73/478 (15%) Query: 8 ILLIDDDSDVLDAYTQLLTQAGHRVYACGDPLRAQDLVSEAWPGIVLSDVCMPHCSGIDL 67 IL+ DDD+ + Q L++AG+ V + ++ +V++DV MP + DL Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 68 LKILLKHDPHLPVILITGHGDVPMAVEAVKKGAWDFLQKPVNPGQLLDLIDKALAQRQSQ 127 L + K P LPV++++ A++A +KGA+D+L KP + +L+ +I +ALA+ + + Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125 Query: 128 VVRRQWQKEQLEDNLIGRSEWVTQMRQTLQQLAETDVAVYFHGERGTGRTLAACYLHRLS 187 + + + L+GRS + ++ + L +L +TD+ + GE GTG+ L A LH Sbjct: 126 PSKLEDDSQDG-MPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYG 184 Query: 188 TRGDRPII----------------FG---DILQGQEAPLQAWCEQAQGGTLVLRNIEFLS 228 R + P + FG G + EQA+GGTL L I + Sbjct: 185 KRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMP 244 Query: 229 QAQQ--LVLAHQQAQDEP---------PCRLVATGLQPLIALVGNNLILPDLYYCFAMTQ 277 Q L+ QQ + R+VA + L + L DLYY + Sbjct: 245 MDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVP 304 Query: 278 QFCPPLAQRLDDIEPLFLHYLQQACLRLNHPVPELPAALAKKLVNRHWPNNVRELAN--- 334 PPL R +DI L H++QQA + V + + WP NVREL N Sbjct: 305 LRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELENLVR 363 Query: 335 -AAKLFAVGVMPLAEIPNPLLHQV------------------------------------ 357 L+ V+ I N L ++ Sbjct: 364 RLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDAL 423 Query: 358 -EPMQLDQRVEDYERQIIIEALNIHQGRINDVSEYLQIPRKKLYLRMKKFGLDKHHYR 414 D+ + + E +I+ AL +G ++ L + R L ++++ G+ + Sbjct: 424 PPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYRSS 481
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 32.1 bits (73), Expect = 0.005 Identities = 37/172 (21%), Positives = 62/172 (36%), Gaps = 11/172 (6%) Query: 52 TPYLKEQLDLSATQI---GMLSSCMLIAYGISKGVMSSLADKASPKVFMACGLVMCAAVN 108 P L L S G+L + + V+ +L+D+ + + L A Sbjct: 28 LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDY 87 Query: 109 VGLGFSSAFWIFVALVVINGLFQGMGVGPSFITIANWFPRRERGRVGAFWNISHNVGGGI 168 + + W+ ++ G+ G IA+ ER R F +S G G+ Sbjct: 88 AIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIADITDGDERAR--HFGFMSACFGFGM 144 Query: 169 VA-PIVGTAFALLGTEHWQTASYIVPAAVAVLFAVVVLILGKGSPRSEGLPA 219 VA P++G L+G A + AA+ L + L S + E P Sbjct: 145 VAGPVLG---GLMGGFSPH-APFFAAAALNGLNFLTGCFLLPESHKGERRPL 192
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 47.1 bits (112), Expect = 8e-08 Identities = 58/339 (17%), Positives = 102/339 (30%), Gaps = 32/339 (9%) Query: 56 VTFSLLIILQTFFSPFQGRLVERFGPRLLISLGTVMAGLSWVLSAQVSGLAALYL--VYG 113 + +L ++Q +P G L +RFG R ++ + A + + + A L LY+ + Sbjct: 47 ILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVA 106 Query: 114 GLGGLGTGIVYIGVVGLM--VRWFPRHRGFAAGAVAAGYGMGAIMTTFPVSVSLGQYGLE 171 G+ G TG V + + RH GF + G G ++ +G + Sbjct: 107 GITG-ATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGL-----MGGFSPH 160 Query: 172 QTMTVFGLLFAAVGFLASQGLKLPPDSGALPASQTVAQSSRQFTSREMLRQPLFWLMFAM 231 L L P + F + + LM Sbjct: 161 APFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMT-VVAALMAVF 219 Query: 232 MAMMSTSG-------LMVTSQMAVFAEDFGISKAVVFGMAALPLALTIDRFTNGLTRPLF 284 M + + A GIS A + +L A+ + Sbjct: 220 FIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAM------------IT 267 Query: 285 GFISDRYGRENTMFIAFALEGVAMTLWLACRDDPMLFVLLSGVVFFGWGEIFSLFPSTLT 344 G ++ R G + + +G L M F ++ V+ G + L+ Sbjct: 268 GPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIM--VLLASGGIGMPALQAMLS 325 Query: 345 DTFGSEYASSNYGWLYISQGIGSIFGGPLAALLYQHTQG 383 E G L + SI G L +Y + Sbjct: 326 RQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASIT 364
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 36.8 bits (85), Expect = 1e-05 Identities = 25/89 (28%), Positives = 33/89 (37%), Gaps = 9/89 (10%) Query: 46 EGETIWLAHDPQGELAGFIAV---WMPDHFIHHLHVAPARQGCGVGKMLLQALPGW---- 98 EG+ +L + G I + W I + VA + GVG LL W Sbjct: 63 EGKAAFLYYLEN-NCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKEN 121 Query: 99 QEHGYRLKCLTRNRNALAFYAASGFVTIG 127 G L+ N +A FYA F IG Sbjct: 122 HFCGLMLETQDINISACHFYAKHHF-IIG 149
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 37.6 bits (87), Expect = 8e-05 Identities = 28/164 (17%), Positives = 58/164 (35%), Gaps = 2/164 (1%) Query: 26 LGVFGLIVAEFLPASLLTPMAASLGVSEGMAGQAVTATALVALVTGLLITPATKNIDRRW 85 L F ++ L SL +A TA L + + + + + Sbjct: 22 LSFFSVLNEMVLNVSLPD-IANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80 Query: 86 VLMFFSVLQIVSSLMVAFA-SSLEFLLLGRLLLGIAIGGFWSMSTATAMRLVPAALVPKA 144 +L+F ++ S++ S L++ R + G F ++ R +P KA Sbjct: 81 LLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKA 140 Query: 145 LAVIFSAVSIATVVAAPLGSYLGGLIGWRSVFILCTVPSVLALL 188 +I S V++ V +G + I W + ++ + + Sbjct: 141 FGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPF 184
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 32.8 bits (75), Expect = 0.001 Identities = 14/71 (19%), Positives = 32/71 (45%), Gaps = 3/71 (4%) Query: 22 GRGKVADYIPALASVSGNKLGV-AICTVEGQHYSAGDAHERFSIQSISKVL--SLVVAMN 78 + + I S ++G+ + G+ +A A ERF + S KV+ V+A Sbjct: 21 ASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARV 80 Query: 79 HYQEDEIWQRV 89 ++++ +++ Sbjct: 81 DAGDEQLERKI 91
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 38.0 bits (88), Expect = 4e-06 Identities = 10/50 (20%), Positives = 19/50 (38%) Query: 80 IDPEHRGQKLGERLLEALETEALRRDCHTVRLETGIYQQAAVRLYTRWGY 129 + ++R + +G LL A + LET +A Y + + Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146
>ARGDEIMINASE#Bacterial arginine deiminase signature. Length = 409 Score = 29.8 bits (67), Expect = 0.010 Identities = 6/76 (7%), Positives = 22/76 (28%), Gaps = 2/76 (2%) Query: 113 TLDFPACATQESTPPAALLAGLGIERAVSARRGRAWLIELASREEVAAVRPTIAAMTPGE 172 L + L A + + ++ + + L ++ + + Sbjct: 79 VLVSSVALENKFISQFILEAEIKTDFTINLLK--DYFSSLTIDNMISKMISGVVTEELKN 136 Query: 173 HKVTITASGDGEYDFV 188 + ++ +G F+ Sbjct: 137 YTSSLDDLVNGANLFI 152
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 59.5 bits (144), Expect = 8e-12 Identities = 42/156 (26%), Positives = 71/156 (45%), Gaps = 2/156 (1%) Query: 36 LSDIADSFGMETAQVGMMLTIYAWVVALMSLPFMLLTSQMERRRLLIGLFILFIASHVLS 95 L DIA+ F A + T + ++ + + L+ Q+ +RLL+ I+ V+ Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 96 FFAWS-FNVLVISRIGIAFAHAVFWSITSALAIRLAPPGKRAQALSLIATGTALAMVFGI 154 F S F++L+++R A F ++ + R P R +A LI + A+ G Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 155 PIGRIIGQYFGWRTTFLAIGLGALVTLVCLFKLLPK 190 IG +I Y W L I + ++T+ L KLL K Sbjct: 157 AIGGMIAHYIHWSYLLL-IPMITIITVPFLMKLLKK 191
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 35.2 bits (81), Expect = 6e-04 Identities = 70/365 (19%), Positives = 113/365 (30%), Gaps = 75/365 (20%) Query: 107 CAAALWGSWLEKVGPRLVSLVATLCWCGGLLLSALAIYSHQLWLLWLG---AGVIGGIGL 163 A + G+ ++ G R V LV+ G + A+ + LW+L++G AG+ G G Sbjct: 58 ACAPVLGALSDRFGRRPVLLVSLA---GAAVDYAIMATAPFLWVLYIGRIVAGITGATGA 114 Query: 164 G-LGYIA---PVSTLLRWF-----PDKRGMAAGMAIMGFGGGALVGAPLANWLMNFFAGP 214 YIA R F GM AG + G GG AP F A Sbjct: 115 VAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPF------FAAA- 167 Query: 215 QGNGVWQSILALAGIYSLFMLCGTFGYRLPPMGWHPSGEKAQKVMRNHSTIQVHVCVAWR 274 ++ L + F+L + P+ A T+ VA Sbjct: 168 -------ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTV-----VAAL 215 Query: 275 TPQFWLLWLVLWLNVTAGIGILGMASPMLQEIFAGKLLSQDLSWNELSGEQLKRIATMAA 334 F+++ LV + ++ + GE Sbjct: 216 MAVFFIMQLV---------------GQVPAALWV------------IFGEDRFHWDATTI 248 Query: 335 GFTGLLSLFNILGRFFWASVSDMC----GRKTTFALIFFLGALLYGTLPLTNHGGYVALF 390 G + L+ F IL A ++ G + L Y L G Sbjct: 249 GIS--LAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRG------ 300 Query: 391 VCALCVIISMYGGGFA--TIPAYLADVFGSQMVGAIHGRLLTAWSAAGISGPVLVNYLRE 448 A +++ + GG + A L+ + G + G L S I GP+L + Sbjct: 301 WMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA 360 Query: 449 YQLAQ 453 + Sbjct: 361 ASITT 365
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 392 bits (1009), Expect = e-133 Identities = 137/347 (39%), Positives = 195/347 (56%), Gaps = 25/347 (7%) Query: 265 DEPQIEQLIGECRPMRQLKKLISRIARGPSSVMVAGESGTGKEVVARAIHKLSDRQAKPF 324 D L+G M+++ ++++R+ + ++M+ GESGTGKE+VARA+H R+ PF Sbjct: 132 DSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPF 191 Query: 325 IAINCAAIPEQLLESELFGYVKGAFTGASANGKPGLIQAANHGTLFLDEIGDMPLTLQAK 384 +AIN AAIP L+ESELFG+ KGAFTGA G + A GTLFLDEIGDMP+ Q + Sbjct: 192 VAINMAAIPRDLIESELFGHEKGAFTGAQTRST-GRFEQAEGGTLFLDEIGDMPMDAQTR 250 Query: 385 LLRAIESREIQPVGASHPVPVDIRIISATNQNLAQFIAEGKFREDLYYRLNVIPLRLPPL 444 LLR ++ E VG P+ D+RI++ATN++L Q I +G FREDLYYRLNV+PLRLPPL Sbjct: 251 LLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPL 310 Query: 445 RERQDDIELLIHHFLQLHTRRLELVYPGVSPEVIGLLKRYHWPGNIRELSNLMEYLVNVV 504 R+R +DI L+ HF+Q + L E + L+K + WPGN+REL NL+ L + Sbjct: 311 RDRAEDIPDLVRHFVQQ-AEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLT-AL 368 Query: 505 PSGEVIDASLLPPNLIC--------SGKTTGRDASLPGEMAEVKEEEGS----------- 545 +VI ++ L S+ + E + + Sbjct: 369 YPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGL 428 Query: 546 --TALEAMEKQMIRDALTRYS-NKKQAADELGIGIATLYRKIKKYEL 589 L ME +I ALT N+ +AAD LG+ TL +KI++ + Sbjct: 429 YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 42.8 bits (101), Expect = 2e-06 Identities = 38/178 (21%), Positives = 65/178 (36%), Gaps = 49/178 (27%) Query: 1 MRVLIKNGIVVNADGRAKQDLLIENGLVSRLAH----------QITLDQPCDTIDAEGCY 50 + +I N ++++ G K D+ +++G ++ + I + + I EG Sbjct: 68 VDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKI 127 Query: 51 VMPGGIDVHTHF----NIDTGLARSCDDFFTGTRAAACGGTTTIIDHMGFGPA-GCR--- 102 V GG+D H HF I+ L G T ++ G GPA G Sbjct: 128 VTAGGMDSHIHFICPQQIEEALM---------------SGLTCMLGG-GTGPAHGTLATT 171 Query: 103 -------LRHQLEAYHGYAAYKAVIDYSFHGVIQHINHAILDEIPMMVEAGISSFKLY 153 + +EA + ++ +F G L E MV G +S KL+ Sbjct: 172 CTPGPWHIARMIEAADAFP-----MNLAFAGKGNASLPGALVE---MVLGGATSLKLH 221
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 425 bits (1095), Expect = e-153 Identities = 153/314 (48%), Positives = 205/314 (65%), Gaps = 8/314 (2%) Query: 1 MSKKIVLALGGNALGDD-----LAGQMQAVKQTAQAIVDLIAQGHQVVVTHGNGPQVGMI 55 M K++V+ALGGNAL M V++TA+ I ++IA+G++VV+THGNGPQVG + Sbjct: 1 MGKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSL 60 Query: 56 NLAFEAAAKTEAHTPMLPMSVCVALSQGYIGYDLQNALREALLSRGMSVPVATLITQVEV 115 L +A T P PM V A+SQG+IGY +Q AL+ L RGM V T+ITQ V Sbjct: 61 LLHMDAGQATYG-IPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIV 119 Query: 116 DANDPAFKHPTKPIGSFFSEEEARQLTRQ-GYTMKEDAGRGYRRVVASPQPVDIIEKETV 174 D NDPAF++PTKP+G F+ EE A++L R+ G+ +KED+GRG+RRVV SP P +E ET+ Sbjct: 120 DKNDPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETI 179 Query: 175 KAMMEAGHVVITVGGGGIPVIREGHHLRGASAVIDKDWASAKLAAMIDADMLIILTAVEK 234 K ++E G +VI GGGG+PVI E ++G AVIDKD A KLA ++AD+ +ILT V Sbjct: 180 KKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNG 239 Query: 235 VAINFGKADEQWLDTLTLSDAERYIREGHFAKGSMLPKVEAAASFARSRPGRQALITVLS 294 A+ +G EQWL + + + +Y EGHF GSM PKV AA F G +A+I L Sbjct: 240 AALYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEW-GGERAIIAHLE 298 Query: 295 KAQEGIEGKTGTVI 308 KA E +EGKTGT + Sbjct: 299 KAVEALEGKTGTQV 312
>ARGDEIMINASE#Bacterial arginine deiminase signature. Length = 409 Score = 31.7 bits (72), Expect = 0.002 Identities = 21/131 (16%), Positives = 48/131 (36%), Gaps = 14/131 (10%) Query: 60 PDAHFVEDTAVVMPELAVITHPGAPSRQGEV----ASIA--PVLAG-FRSLVHMSERGHM 112 P+ F D + I RQ E PV ++ E + Sbjct: 157 PNVLFTRDPFASIGNGVTINKMFTKVRQRETIFAEYIFKYHPVYKENVPIWLNRWEEASL 216 Query: 113 DGGDVLLVGKQ-FFVGQTARTDEQGIREFAAAV-EPHG--YQVTSIEV---SAGLHLKSI 165 +GGD L++ K +G + RT+ + + + A ++ + + + ++ + +HL ++ Sbjct: 217 EGGDELVLNKGLLVIGISERTEAKSVEKLAISLFKNKTSFDTILAFQIPKNRSYMHLDTV 276 Query: 166 VNYVGRNTLLL 176 + + Sbjct: 277 FTQIDYSVFTS 287
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 50.2 bits (120), Expect = 9e-09 Identities = 27/146 (18%), Positives = 54/146 (36%), Gaps = 16/146 (10%) Query: 56 GAALAPVQAATATSESVPRYLSGLGTVTAA-NTVTVRSRVDGQLLAIHFQEGQQVKAGDL 114 +A + + E V + G +T + + ++ + + I +EG+ V+ GD+ Sbjct: 67 FLVIAFILSVLGQVEIV---ATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDV 123 Query: 115 LAEIDPSQFKVALAQAQGQLAKDKATLANARRDLARYQQLVKSNLVSRQELDTQQSLVSE 174 L ++ A+ K +++L AR + RYQ L +S EL+ L Sbjct: 124 LLKLTAL-------GAEADTLKTQSSLLQARLEQTRYQILSRS-----IELNKLPELKLP 171 Query: 175 TEGTIKADEAAVASAQLQLNWSRITA 200 E + L + + Sbjct: 172 DEPYFQNVSEEEVLRLTSLIKEQFST 197 Score = 40.2 bits (94), Expect = 1e-05 Identities = 30/167 (17%), Positives = 64/167 (38%), Gaps = 11/167 (6%) Query: 126 ALAQAQGQLAKDKATLANARRDLARYQQLVKSNLVSRQELDTQQSLVSETEGTIKAD--E 183 +A +L K+ L ++ ++ +L + L + T Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILSAKE----EYQLVTQLFKNEILDKLRQTTDNIGLLT 315 Query: 184 AAVASAQLQLNWSRITAPIDGRV-GLKQVDIGNQISSSDTTGIVVLTQTHPIDLVFTLPE 242 +A + + S I AP+ +V LK G +++++T ++V +++ + Sbjct: 316 LELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPED-DTLEVTALVQN 374 Query: 243 NDIATVVQAQKAGKTLQVEAWDRTNKQKI-SEGSLLSLDNQIDATTG 288 DI + Q A ++VEA+ T + + ++LD D G Sbjct: 375 KDIGFINVGQNA--IIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLG 419
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 894 bits (2313), Expect = 0.0 Identities = 293/1036 (28%), Positives = 509/1036 (49%), Gaps = 29/1036 (2%) Query: 13 SRLFILRPVATTLLMVAIMLAGIIGYRFLPVSALPEVDYPTIQVVTLYPGASPDVVTSAI 72 + FI RP+ +L + +M+AG + LPV+ P + P + V YPGA V + Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 73 TAPLERQFGQMSGLKQMSSQS-SGGASVVTLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131 T +E+ + L MSS S S G+ +TL FQ D+A+ +VQ + AT LLP + Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 132 LPNPPVYSKVNPADPPIMTLAVTSSAIPMTQVE--DMVETRVAQKISQVSGVGLVTLAGG 189 + + S + +M S TQ + D V + V +S+++GVG V L G Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 190 QRPAVRVKLNAEAIAALGLTSETIRTAITSANVNSAKGSLDGP------SRAVTLSANDQ 243 Q A+R+ L+A+ + LT + + N A G L G ++ A + Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 244 MQSAEDYRRLII-AYQNGAPIRLGDVATIEQGAENSWLGAWANNQQAIVMNVQRQPGANI 302 ++ E++ ++ + +G+ +RL DVA +E G EN + A N + A + ++ GAN Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299 Query: 303 IATADSVRQMLPQLTESLPKSVKVQVLSDRTTNIRASVSDTQFELMLAIALVVMIIYLFL 362 + TA +++ L +L P+ +KV D T ++ S+ + L AI LV +++YLFL Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359 Query: 363 RNVPATIIPGVAVPLSLVGTFAVMVFLDFSINNLTLMALTIATGFVVDDAIVVIENISRY 422 +N+ AT+IP +AVP+ L+GTFA++ +SIN LT+ + +A G +VDDAIVV+EN+ R Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 423 I-EKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAIL 481 + E P A K +I ++ + L AV IP+ F G G ++R+F+IT+ A+ Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 482 ISAVVSLTLTPMMCARML---SHESLRKQNRFSRASERFFERVIAAYGRMLSRVLNHPWL 538 +S +V+L LTP +CA +L S E + F F+ + Y + ++L Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 539 TLSVALGTLVLSIMLWVFIPKGFFPIQDNGIIQGTLQAPQSASFANMAQRQEQVSAAILK 598 L + + ++L++ +P F P +D G+ +Q P A+ + +QV+ LK Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 599 DPA--VESLTSYVGVDGTNPALNSARLQINLKPLDERDDR---VQTVISRLQKSVDGIPG 653 + VES+ + G + A N+ ++LKP +ER+ + VI R + + I Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR- 658 Query: 654 VELYLQPTQDLTIDTTVSRTQYQFTLQ---ANSLDALSNWVPQLLTRL-QALPQLSDVSS 709 + ++ P I + T + F L DAL+ QLL Q L V Sbjct: 659 -DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717 Query: 710 DWQDKGLAAYIKVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHDTR 769 + + ++VD++ A LG+S++D++ + A G ++ + ++ ++ D + Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777 Query: 770 ATPGLAALDNIRLTSSGGGIVPLKSIASVEQRFAPLSINHLDQFPVTTISFNVPDNYSLG 829 +D + + S+ G +VP + + + + + P I S G Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837 Query: 830 DAVDAILTAEQALDLPTDIRTQFQGSTLAFQSALGNTVWLVVAAVVAMYIVLGVLYESFI 889 DA+ + L P I + G + + + LV + V +++ L LYES+ Sbjct: 838 DAMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895 Query: 890 HPITILSTLPTAGVGALLALWLAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQ 949 P++++ +P VG LLA L + DV ++G++ IG+ KNAI++++FA ++ Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955 Query: 950 DMPPREAIYQACLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLMLSQV 1009 EA A +R RPILMT+LA +LG LPL +S G G+ + +GIG++GG++ + + Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015 Query: 1010 LTLFTTPVIYLLFDRL 1025 L +F PV +++ R Sbjct: 1016 LAIFFVPVFFVVIRRC 1031
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 886 bits (2291), Expect = 0.0 Identities = 280/1035 (27%), Positives = 501/1035 (48%), Gaps = 36/1035 (3%) Query: 6 LFIYRPVATILISLAITLCGVLGFRLLPVAPLPQVDFPVIMISASLPGASPETMASSVAT 65 FI RP+ ++++ + + G L LPVA P + P + +SA+ PGA +T+ +V Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63 Query: 66 PLERSLGRIAGVSEMTSTS-SLGSTRIIMEFNFDRDINGAARDVQAAINAAQSLLPSGMP 124 +E+++ I + M+STS S GS I + F D + A VQ + A LLP + Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123 Query: 125 SRPTYRKANPSDAPIMILTLTSDT--YSQGELYDFASTQLAQTIAQIDGVGDVDVGGSSL 182 + S + +M+ SD +Q ++ D+ ++ + T+++++GVGDV + G+ Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182 Query: 183 PAVRVDLNPQALFNQGVSLDAVRTAIDNANVRKPQG------ALEDHSSRWQVQTNDELK 236 A+R+ L+ L ++ V + N + G AL + K Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 237 TAAEYQPLIVHYN-NGAAVRLGDVAQISDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295 E+ + + N +G+ VRL DVA++ ++ N KPA L I+ AN + Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301 Query: 296 TVDSIRARLPELQKTIPAAIDLQIAQDRSPTIRASLEEVEQTLVISVALVILVVFLFLRS 355 T +I+A+L ELQ P + + D +P ++ S+ EV +TL ++ LV LV++LFL++ Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361 Query: 356 GRATLIPAVAVPVSLIGTFAAMYLCDFSLNNLSLMALTIATGFVVDDAIVVLENISRHL- 414 RATLIP +AVPV L+GTFA + +S+N L++ + +A G +VDDAIVV+EN+ R + Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421 Query: 415 EAGMKPLQAALQGSREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGIS 474 E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481 Query: 475 LAVSLTLTPMMCGWLLKSGKPSEPTRNRGFG----RLLVAVQGGYGKSLKWVLKHSRITG 530 + V+L LTP +C LLK GF Y S+ +L + Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541 Query: 531 LVLLGTMALSVWLYISIPKTFFPEQDTGVLMGGIQADQSISFQ----AMRGKLQDFMQII 586 L+ +A V L++ +P +F PE+D GV + IQ + + + +++ Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601 Query: 587 R-EDPAVDNVTGFT-GGSRVNSGMMFITLKPRDQRH---ETAQQIIDRLRKKLAKEPGAN 641 + +V V GF+ G N+GM F++LKP ++R+ +A+ +I R + +L K Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661 Query: 642 LFLMAVQDIRVGGRQANASYQYSLLSDDLSALREWEPKIRKALAAL-----PELADVNSD 696 + + I G ++ L D + + R L + L V + Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718 Query: 697 QQDNGAEMDLIYDRDTMSRLGISVQDANNLLNNAFGQRQISTIYQPLNQYKVVMEVDPVY 756 ++ A+ L D++ LG+S+ D N ++ A G ++ K+ ++ D + Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778 Query: 757 TQDVSALDKMFVINSEGKPIPLAYFARWQPANAPLSVNHQGLSAASTISFNLPTGKSLSE 816 +DK++V ++ G+ +P + F + + I G S + Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838 Query: 817 ASDAINRAMTQLGVPSSVRGSYAGTAQVFQQTMNSQVILILAAIATVYIVLGILYESYVH 876 A + ++L P+ + + G + + + N L+ + V++ L LYES+ Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896 Query: 877 PLTILSTLPSAGVGALLALELFGAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRNGN 936 P++++ +P VG LLA LF + ++G++ IG+ KNAI++V+FA + Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956 Query: 937 LPPEEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLL 996 EA A +R RPI+MT+LA + G LPL +S G GS + +GI ++GG+V + LL Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016 Query: 997 TLYTTPVVYLFFDRL 1011 ++ PV ++ R Sbjct: 1017 AIFFVPVFFVVIRRC 1031
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 120 bits (303), Expect = 7e-32 Identities = 97/431 (22%), Positives = 185/431 (42%), Gaps = 17/431 (3%) Query: 20 FMQSLDTTIVNTALPSMAKSLGESPLHMHMVIVSYVLTVAIMLPASGWLADRVGVRNIFF 79 F L+ ++N +LP +A + P + V +++LT +I G L+D++G++ + Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 80 SAIILFTLGSLFCAQAAT-LDQLVLARVLQGVGGAMMVPVGRLTVMKIVPRAQYMAAMTF 138 II+ GS+ + L++AR +QG G A + + V + +P+ A Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 139 VTLPGQVGPLLGPALGGLLVEYASWHWIFLINIP-VGIIGAIATLWLMPNYTLQTRRFDM 197 + +G +GPA+GG++ Y HW +L+ IP + II + L+ FD+ Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201 Query: 198 FGFILLALGMATLTLALDGKKGLGISPLLLSALVVIGVASILLYLWYARGNDNALFSLNL 257 G IL+++G+ L IS L++S V S L+++ + R + L Sbjct: 202 KGIILMSVGIVFFML---FTTSYSISFLIVS------VLSFLIFVKHIRKVTDPFVDPGL 252 Query: 258 FRNRTFSLGLGGSFAGRIGSGMLPFMTPVFLQIGLGFTPFHAG-MMMIPMVLGSMGMKRI 316 +N F +G+ M P ++ + G +++ P + + I Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312 Query: 317 VVQVVNRFGYRRVLMVATVGLALVSLLFMAVALAGWYWLLPVVLFVQGMINSSRFSSMNT 376 +V+R G VL + L+ VS L + L W + +++ S + ++T Sbjct: 313 GGILVDRRGPLYVLNIGVTFLS-VSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVIST 371 Query: 377 LTLKDLPDDLASSGNSMLSMVMQLSMSIGVTLAGMLL--GLFGQQHISADAAATHQVFLY 434 + L A +G S+L+ LS G+ + G LL L Q+ + + + ++ Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYLYSN 431 Query: 435 TYLSMAVIIAL 445 L + II + Sbjct: 432 LLLLFSGIIVI 442
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 32.9 bits (75), Expect = 0.003 Identities = 22/77 (28%), Positives = 32/77 (41%), Gaps = 8/77 (10%) Query: 214 ILAALATFPLARGLLAPVKRLVEGTHRLAA------GDFTT--RVAVSSSDELGRLAQDF 265 +A + P L+A V+ V H LA G F V++ + G L Sbjct: 93 AVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFPGSFERLYCAMVAAGETSGHLDAVL 152 Query: 266 NQLASTLERNQQMRRDL 282 N+LA E+ QQMR + Sbjct: 153 NRLADYTEQRQQMRSRI 169 Database: VIFASCDB Posted date: Jun 1, 2014 9:04 PM Number of letters in database: 79,683 Number of sequences in database: 213 Lambda K H 0.321 0.136 0.381 Gapped Lambda K H 0.267 0.0668 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 213 Number of Hits to DB: 230,413,705 Number of extensions: 10463398 Number of successful extensions: 40301 Number of sequences better than 5.0e-02: 1246 Number of HSP's gapped: 37709 Number of HSP's successfully gapped: 2373 Length of database: 79,683 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits)