PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: xantho.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
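
The E-value setting above (0.05) can be checked against the E-values printed for the virulence-factor hits further down this page. The following is a minimal sketch, not PredictBias code: the helper names are hypothetical, and the rule "a hit is kept when its E-value is at or below the cutoff" is an assumption. It also handles the abbreviated form `e-145` used in the alignment output, which a plain float conversion would reject.

```python
# Hypothetical helpers, not part of PredictBias: compare E-values copied from
# the hit tables on this page against the E-value cutoff shown in the inputs.
E_VALUE_CUTOFF = 0.05  # "E-value (RPSBlast)" from the input parameters


def parse_evalue(text: str) -> float:
    """Convert an E-value string as printed on this page to a float.

    Very small values are abbreviated as 'e-145' (no leading mantissa),
    which float() rejects, so a leading '1' is added in that case.
    """
    text = text.strip()
    if text.lower().startswith("e"):
        text = "1" + text
    return float(text)


def passes_cutoff(evalue_text: str, cutoff: float = E_VALUE_CUTOFF) -> bool:
    """Return True when the E-value is at or below the cutoff (assumed rule)."""
    return parse_evalue(evalue_text) <= cutoff


# E-values as printed for a few hits on this page.
reported = {
    "YERSSTKINASE": "0.030",
    "ACRIFLAVINRP": "e-145",
    "RTXTOXIND": "4e-08",
    "INTIMIN": "0.026",
}

for hit, evalue in reported.items():
    print(hit, parse_evalue(evalue), passes_cutoff(evalue))
```

Under that assumed rule, every hit listed in the sketch falls below the cutoff.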
 
C) Potential GIs and PAIs in NZ_CP014347
S.No  Start              End                Bias  Virulence  Insertion elements  Prediction
1     XADLMG695_RS00100  XADLMG695_RS00415  Y     Y          Y                   Pathogenicity Island (biased-composition)

LocusTag           DNBias  CDNBias  %GCBias     Product
XADLMG695_RS00100  3       26       -2.762413   IS3 family transposase
XADLMG695_RS00110  3       24       -3.244245   hypothetical protein
XADLMG695_RS00115  3       23       -3.249579   TolC family protein
XADLMG695_RS00120  2       25       -3.791593   efflux RND transporter periplasmic adaptor
XADLMG695_RS00125  2       26       -4.111225   CusA/CzcA family heavy metal efflux RND
XADLMG695_RS00130  2       34       -4.445379   cation transporter
XADLMG695_RS00135  2       36       -5.214226   phosphoethanolamine--lipid A transferase
XADLMG695_RS00140  4       41       -5.573732   helix-turn-helix transcriptional regulator
XADLMG695_RS00150  5       44       -6.373562   hypothetical protein
XADLMG695_RS00160  7       41       -8.194014   hypothetical protein
XADLMG695_RS00170  7       41       -9.334074   sce7726 family protein
XADLMG695_RS00175  4       46       -8.067506   hypothetical protein
XADLMG695_RS00180  3       36       -6.188046   ImmA/IrrE family metallo-endopeptidase
XADLMG695_RS00185  2       22       -4.219559   hypothetical protein
XADLMG695_RS21445  0       20       -2.574683   transposase
XADLMG695_RS21450  0       19       -1.830412   DDE-type integrase/transposase/recombinase
XADLMG695_RS21455  1       20       -1.948456   alpha/beta fold hydrolase
XADLMG695_RS00200  1       18       -2.167345   efflux RND transporter permease subunit
XADLMG695_RS00205  0       18       -2.231984   efflux RND transporter periplasmic adaptor
XADLMG695_RS00210  1       26       -2.518792   efflux transporter outer membrane subunit
XADLMG695_RS00220  3       28       -4.682608   response regulator transcription factor
XADLMG695_RS00225  2       30       -5.096290   HAMP domain-containing histidine kinase
XADLMG695_RS00230  1       30       -5.710430   thermonuclease family protein
XADLMG695_RS21465  1       32       -5.861297   ROK family protein
XADLMG695_RS00235  0       33       -5.630665   TonB-dependent receptor plug domain-containing
XADLMG695_RS00240  2       35       -6.164788   glycerophosphodiester phosphodiesterase family
XADLMG695_RS00245  0       39       -5.484778   glycerophosphodiester phosphodiesterase family
XADLMG695_RS00255  2       38       -3.380653   IS3 family transposase
XADLMG695_RS21470  1       38       -3.322328   trypsin-like serine protease
XADLMG695_RS00270  3       38       -4.306047   phage tail protein
XADLMG695_RS21475  3       39       -4.779225   nuclear transport factor 2 family protein
XADLMG695_RS00285  5       44       -7.360217   NAD(P)-dependent alcohol dehydrogenase
XADLMG695_RS00290  3       44       -7.202319   YafY family transcriptional regulator
XADLMG695_RS00300  4       53       -7.379855   hypothetical protein
XADLMG695_RS00305  4       54       -7.379958   glycosyltransferase family 2 protein
XADLMG695_RS22975  0       54       -5.831658   hypothetical protein
XADLMG695_RS00315  2       65       -7.595764   hypothetical protein
XADLMG695_RS00320  3       62       -8.364609   DUF4189 domain-containing protein
XADLMG695_RS00330  2       50       -7.292793   hypothetical protein
XADLMG695_RS00335  3       44       -7.627049   lipase
XADLMG695_RS21480  3       37       -7.193193   hypothetical protein
XADLMG695_RS00340  4       35       -7.675379   type III effector
XADLMG695_RS00345  3       30       -6.257799   cellulase
XADLMG695_RS00350  1       20       -4.683411   DegV family protein
XADLMG695_RS00360  1       18       -4.904332   peptide deformylase
XADLMG695_RS00370  -2      13       -0.307483   arsenate reductase (glutaredoxin)
XADLMG695_RS00380  -1      10       2.578439    hypothetical protein
XADLMG695_RS00385  -1      10       3.202828    serine protease
XADLMG695_RS00400  1       10       2.718930    cellulose biosynthesis protein BcsC
XADLMG695_RS00405  1       10       2.900288    cellulase
XADLMG695_RS00415  1       11       3.114901    cellulose biosynthesis cyclic di-GMP-binding
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS00115  YERSSTKINASE  29     0.030    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 29.3 bits (65), Expect = 0.030
Identities = 32/118 (27%), Positives = 49/118 (41%), Gaps = 5/118 (4%)

Query: 87 QAEVTVTLSGVLERGGKLDARRTLALARIDSLAPQREIARLDLLAETARRYLAITAAIRQ 146
Q LS ++ R G +L R DS P + A R+ +A AAI
Sbjct: 615 QESAKAQLSILINRSGSWADVARQSLQRFDSTRPVVKFGTEQYTA-IHRQMMAAHAAITL 673

Query: 147 REIAELDIEQRKRTVDAARRRLEAGASPESVVLTAKAALAEAELDRDRAAQAERTARL 204
+E++E + R TVD+ ++ G S L + + + E R+ AER RL
Sbjct: 674 QEVSEFTDDMRNFTVDSIPLLIQLGRSS----LMDEHLVEQREKLRELTTIAERLNRL 727
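
The percentages in the statistics line of each alignment block on this page can be recomputed from the printed counts. The sketch below is a hypothetical helper, not part of PredictBias or BLAST. It parses a line such as `Identities = 32/118 (27%), Positives = 49/118 (41%), Gaps = 5/118 (4%)` and assumes the printed percentage is the integer part of count/length, which matches the lines shown here (for example 73/522 is printed as 13%).

```python
import re

# Matches one "Name = count/length (pct%)" entry in a statistics line.
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line: str) -> dict:
    """Return counts and percentages for each entry found in a statistics line.

    'computed_pct' uses integer truncation, an assumption that agrees with
    the percentages printed on this page.
    """
    stats = {}
    for name, count, length, pct in STAT_RE.findall(line):
        count, length = int(count), int(length)
        stats[name] = {
            "count": count,
            "length": length,
            "reported_pct": int(pct),
            "computed_pct": 100 * count // length,
        }
    return stats


line = "Identities = 32/118 (27%), Positives = 49/118 (41%), Gaps = 5/118 (4%)"
for name, values in parse_stats(line).items():
    print(name, values)
```

Blocks without gaps omit the `Gaps` entry, so the returned dictionary then holds only `Identities` and `Positives`.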


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS00125  ACRIFLAVINRP  767    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 767 bits (1983), Expect = 0.0
Identities = 247/1058 (23%), Positives = 436/1058 (41%), Gaps = 57/1058 (5%)

Query: 5 IIRTSIANRWLVMTMTVVLIAIGVWSFNQLPIDATPDITNVQVQVNTAAPGYSPLEAEQR 64
+ I + ++L+ G + QLP+ P I V V+ PG +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 65 VTYPIETAMAGLPKMENFRSIS-RYGLSQITVVFKDGTDIYFARQQVAERLQQVKSQIPA 123
VT IE M G+ + S S G IT+ F+ GTD A+ QV +LQ +P
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 124 NIEPTLGPIATGMGEIFSYTIDADPKAKKTDGTPYTATDLRTLQDWVIRPQLRNIPGVTE 183
++ + +D T D+ ++ L + GV +
Sbjct: 121 EVQQQGISVEKSSSSYLMVA------GFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGD 174

Query: 184 VNTLGGYKREVHITPDPSRLRSLGLTLDDVVKALQLNNQNVGAGYIER----NGQQFLVR 239
V G + + I D L LT DV+ L++ N + AG + GQQ
Sbjct: 175 VQLFGA-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233

Query: 240 IP--GQVADISQIEQVVL-ARREGAVIRMRDVAKVADGAELRTGAATQNGHEVVLGTVVM 296
I + + + +V L +G+V+R++DVA+V G E A NG + +
Sbjct: 234 IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKL 293

Query: 297 LIGSNSRDVSQAAAAKLKDAAKSLPAGVTATPVYDRTKLVDRTIATVAKNLTEGAVLVIV 356
G+N+ D ++A AKL + P G+ YD T V +I V K L E +LV +
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353

Query: 357 ILFLLLGNFRAALITALVIPLAMLFTLTGMARGGISANLMSLG--ALDFGLIVDGAVIII 414
+++L L N RA LI + +P+ +L T +A G S N +++ L GL+VD A++++
Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413

Query: 415 ENCLSRFGHRQHELGRQMTLAERFETTASATAEVIRPSLFGLGIITAVYLPIFALTGVEG 474
EN E E T + +++ + +++AV++P+ G G
Sbjct: 414 ENV---------ERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464

Query: 475 KMFHPMAITVVLALTGAMVLALTFVPAAIALMLGG---KVEEKENWLMGWLRR------- 524
++ +IT+V A+ ++++AL PA A +L + E + GW
Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVN 524

Query: 525 RYEPLLDMSLRRGKWVAVGAILLLACSGVLFTRLGSEFVPNLDEGDFAMQAMRIPGTSLT 584
Y + L + L++A VLF RL S F+P D+G F G +
Sbjct: 525 HYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQE 584

Query: 585 QSVNMQRLLEQRLLKVPEIERVFSKIGTAEVASDPMPPSIGDTFVMVKPRDQWPDPDKPK 644
++ + + LK E V S + + G FV +KP ++ +
Sbjct: 585 RTQKVLDQVTDYYLKN-EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSA 643

Query: 645 AELVAQVEQLVARVPGNNYEFTQPIQM-RTNELISGVRADVA-INVYGDDLATLLKIGQQ 702
++ + + + ++ F P M EL + D I+ G L + Q
Sbjct: 644 EAVIHRAKMELGKIRDG---FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQ 700

Query: 703 IEAVAKKVTGA-ADVRVEQASGLPLLEVVPNRLALASYGLTTDDVQSTVATAVGGEVAGK 761
+ +A + + VR ++ ++ + G++ D+ T++TA+GG
Sbjct: 701 LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND 760

Query: 762 LFEGDRRFDVVVRLPESLRQDPAALESLPIPLGPASDPQASGVSGPRTIPLSSVAKVVAS 821
+ R + V+ R P ++ L + A+G +P S+
Sbjct: 761 FIDRGRVKKLYVQADAKFRMLPEDVDKLYV-------RSANGE----MVPFSAFTTSHWV 809

Query: 822 EGANQINRYNGKRRIAVTANVRDRDLGGFVSELQGVINANVQPPSGYWIEYGGSFEQLIS 881
G+ ++ RYNG + + G L + N + P+G ++ G Q
Sbjct: 810 YGSPRLERYNGLPSMEIQGEAAPGTSSGDAMAL--MENLASKLPAGIGYDWTGMSYQERL 867

Query: 882 ASKRLAIVVPATLVIIFALLFWAFRSVKDSAIVFSGVPLALTGGILALTVRGIPLSISAG 941
+ + +V + V++F L + S V VPL + G +LA T+ +
Sbjct: 868 SGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFM 927

Query: 942 IGFIALSGVAVLNGLVLISFIRGLRE-QGEPLESAVREGALSRLRPVLMTAFVASLGFVP 1000
+G + G++ N ++++ F + L E +G+ + A RLRP+LMT+ LG +P
Sbjct: 928 VGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLP 987

Query: 1001 MALNVGAGSEVQRPLATVVIGGIVSSTLLTLVVLPVLY 1038
+A++ GAGS Q + V+GG+VS+TLL + +PV +
Sbjct: 988 LAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFF 1025



Score = 91.1 bits (226), Expect = 1e-20
Identities = 73/522 (13%), Positives = 169/522 (32%), Gaps = 40/522 (7%)

Query: 3 ERIIRTSIANRWLVMTMTVVLIAIGVWSFNQLPIDATPDITNVQVQVNTAAPGYSPLEAE 62
+ + + + + +++A V F +LP P+ P + E
Sbjct: 527 TNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERT 586

Query: 63 QRVTYPIETAMAGLPKMENFRSISRYG-----------LSQITVV-FKDGTDIYFARQQV 110
Q+V + K + G ++ +++ +++ + + V
Sbjct: 587 QKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAV 646

Query: 111 AERLQQVKSQIP-ANIEPTLGPIATGMGEIFSYTIDADPKAKKTDGTPYTATDLRTLQDW 169
R + +I + P P +G + D + G + A L ++
Sbjct: 647 IHRAKMELGKIRDGFVIPFNMPAIVELGTATGF----DFELIDQAGLGHDA--LTQARNQ 700

Query: 170 VIRPQLRNIPGVTEVNTLGGYKR-EVHITPDPSRLRSLGLTLDDVVKALQLNNQNVGAGY 228
++ ++ + V G + + D + ++LG++L D+ + +
Sbjct: 701 LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND 760

Query: 229 IERNGQQFLVRIPGQ---VADISQIEQVVLARREGAVIRMRDVAKVADGAELRTGAATQN 285
G+ + + ++++ + G ++ + +
Sbjct: 761 FIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVY-----GSPRL 815

Query: 286 GHEVVLGTVVMLIGSNSRDVSQAAAAKLKDAAKSLPAGVTATPVYDRTKLVDRTIATVAK 345
L ++ + + S A A +++ A LPAG+ + +
Sbjct: 816 ERYNGLPSMEIQGEAAPGTSSGDAMALMENLASKLPAGIG-YDWTGMSYQERLSGNQAPA 874

Query: 346 NLTEGAVLVIVILFLLLGNFRAALITALVIPLAMLFTLTGMARGGISANLMSLGAL--DF 403
+ V+V + L L ++ + LV+PL ++ L ++ + L
Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTI 934

Query: 404 GLIVDGAVIIIENCLSRFGHRQHELGRQMTLAERFETTASATAEVIRPSLFGLGIITAVY 463
GL A++I+E + G+ E T A +RP L
Sbjct: 935 GLSAKNAILIVE----FAKDLMEKEGK-----GVVEATLMAVRMRLRPILMTSLAFILGV 985

Query: 464 LPIFALTGVEGKMFHPMAITVVLALTGAMVLALTFVPAAIAL 505
LP+ G + + I V+ + A +LA+ FVP +
Sbjct: 986 LPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVV 1027


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS00205  ACRIFLAVINRP  453    e-145    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 453 bits (1168), Expect = e-145
Identities = 233/1039 (22%), Positives = 430/1039 (41%), Gaps = 63/1039 (6%)

Query: 13 LTLFTALLVLVGGVLTFLNFPSQEEPSVTIRDAVVQLAYPGMPTEKVETLLARPVEENLR 72
A+++++ G L L P + P++ V YPG + V+ + + +E+N+
Sbjct: 11 FAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVIEQNMN 70

Query: 73 SLAGIKKIV-TTVRPGSAILQITAHDSVADLPALWLRVRAKAAEVGGALPAG---TMGPF 128
+ + + T+ GS + +T D ++V+ K LP
Sbjct: 71 GIDNLMYMSSTSDSAGSVTITLTFQSGT-DPDIAQVQVQNKLQLATPLLPQEVQQQGISV 129

Query: 129 VDDDFGRVSVASIAVTAPGFSMSEMRGPL-RKMREQLYTLPGVERVSLYGLQEDRIYIAF 187
+ VA PG + ++ + +++ L L GV V L+G + + I
Sbjct: 130 EKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG-AQYAMRIWL 188

Query: 188 DRVRLVEAGLSPASVIDQLRRQNVVVPGGLVSASGMA------MTVATSGEVGNVQALKQ 241
D L + L+P VI+QL+ QN + G + + ++ N + +
Sbjct: 189 DADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEEFGK 248

Query: 242 VLINTQGSGGAREIALGALAQVQVMPADPPETAAIYQGQPAVVVAVSMASGYNVVSFGKA 301
V + G + L +A+V + + A G+PA + + +A+G N + KA
Sbjct: 249 VTLRVNSDGS--VVRLKDVARV-ELGGENYNVIARINGKPAAGLGIKLATGANALDTAKA 305

Query: 302 LREKLVETASLLPTGFQQHVVTFQADVVDREMSKMHQVMGETVVIVMAVVMLFLG-WRTG 360
++ KL E P G + V + ++ + + E +++V V+ LFL R
Sbjct: 306 IKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRAT 365

Query: 361 LIVGAIVPLTILGSLILMRVLNVELQTVSIAAIILALGLLVDNGIVIAEDIERRL-MAGE 419
LI VP+ +LG+ ++ + T+++ ++LA+GLLVD+ IV+ E++ER +
Sbjct: 366 LIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKL 425

Query: 420 ERFHACVEAGRTLAIPLLTSSLVIVLAFSPFFFGQTSTNEYMRSLAVVLAITLLGSWLLS 479
A ++ + L+ ++V+ F P F ST R ++ + + S L++
Sbjct: 426 PPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVA 485

Query: 480 ITVTPLLCLYFARAHAGEHNEDN------YNSKFYRA---YRAVIERLLEFKLLFISTML 530
+ +TP LC + + EH+E+ +N+ F + Y + ++L ++
Sbjct: 486 LILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYA 545

Query: 531 LALGGAVVVLSSIPYDFLPKSDRLQFQIPVTLRAGADSRQTLQSVETMSRW-LADKRANP 589
L + G VV+ +P FLP+ D+ F + L AGA +T + ++ ++ + L +++AN
Sbjct: 546 LIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANV 605

Query: 590 EIVDSIGYVADGGPRIVLGLNPPLPGSNIAYFTVSVKPKTD-------IDQVIDRVRQYV 642
E V ++ + G N VS+KP + + VI R + +
Sbjct: 606 ESVFTVNGFSFSGQ-----------AQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMEL 654

Query: 643 RKTLPDVRAEPKR----FSLG-ATEAGVAVYRVTGADEQVLRTAADQIADALRSLPGTL- 696
+ D P LG AT + G L A +Q+ P +L
Sbjct: 655 -GKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLV 713

Query: 697 DVTDDWQARIPRYIVQVDQLKARRAGVSSDDIAQALQLRYSGVPASQIRDDGVDVPILLR 756
V + ++ ++VDQ KA+ GVS DI Q + G + D G + ++
Sbjct: 714 SVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQ 773

Query: 757 GDAGERAGNGSPAD--TLVYPQSGGKPLPLSAIASIQHDSEPSTLMRRNLERAITVTGRN 814
DA R P D L + G+ +P SA + L R N ++ + G
Sbjct: 774 ADAKFR---MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEA 830

Query: 815 PDSTTSEMVAALADKIAKITLPPGYRIELGGEIEDSAEANQALLEYMPHALGAILLLFIW 874
T+S AL + +A LP G + G + + + + L
Sbjct: 831 APGTSSGDAMALMENLAS-KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAA 889

Query: 875 QFNSFRKLFIVVSAIPFVLIGAALALLVTGYPFGFMATFGLLALAGIIVNNAVLLLERI- 933
+ S+ V+ +P ++G LA + GLL G+ NA+L++E
Sbjct: 890 LYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAK 949

Query: 934 EAELAEGLPRREAVISAAVKRLRPIVMTKLTCIVGLIPLMLFAGP---LWTGMAITMIGG 990
+ EG EA + A RLRPI+MT L I+G++PL + G + I ++GG
Sbjct: 950 DLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGG 1009

Query: 991 LALGTLVTLGMIPILYDVL 1009
+ TL+ + +P+ + V+
Sbjct: 1010 MVSATLLAIFFVPVFFVVI 1028



Score = 108 bits (272), Expect = 4e-26
Identities = 82/418 (19%), Positives = 161/418 (38%), Gaps = 28/418 (6%)

Query: 619 AYFTVSVKPKTDIDQVIDRVR---QYVRKTLPDVRAEPKRFSLGATEAGVAVYRVTGAD- 674
T++ + TD D +V+ Q LP + ++ + + V +
Sbjct: 88 VTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQQGISVEKSSSSYLMVAGFVSDNP 147

Query: 675 ----EQVLRTAADQIADALRSLPGTLDVTDDWQARIPRYIVQVDQLKARRAGVSSDDIAQ 730
+ + A + D L L G DV R + D L ++ D+
Sbjct: 148 GTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQYAMRIWLDADLLNKY--KLTPVDVIN 205

Query: 731 ALQLRYSGVPASQIRDDGVDVPILLRGDAGERAGNGSPAD---TLVYPQSGGKPLPLSAI 787
L+++ + A Q+ L + +P + + S G + L +
Sbjct: 206 QLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDV 265

Query: 788 ASIQHDSEP-STLMRRNLERAITVT-GRNPDSTTSEMVAALADKIAKI--TLPPGYRIEL 843
A ++ E + + R N + A + + + A+ K+A++ P G ++
Sbjct: 266 ARVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVL- 324

Query: 844 GGEIEDSAEANQALLEYMPHAL-GAILLLFIWQF---NSFRKLFIVVSAIPFVLIGAALA 899
D+ Q + + L AI+L+F+ + + R I A+P VL+G
Sbjct: 325 --YPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAI 382

Query: 900 LLVTGYPFGFMATFGLLALAGIIVNNAVLLLERIEAELAE-GLPRREAVISAAVKRLRPI 958
L GY + FG++ G++V++A++++E +E + E LP +EA + + +
Sbjct: 383 LAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGAL 442

Query: 959 VMTKLTCIVGLIPLMLFAG---PLWTGMAITMIGGLALGTLVTLGMIPILYDVLFGLR 1013
V + IP+ F G ++ +IT++ +AL LV L + P L L
Sbjct: 443 VGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPV 500



Score = 89.1 bits (221), Expect = 5e-20
Identities = 87/526 (16%), Positives = 183/526 (34%), Gaps = 50/526 (9%)

Query: 2 NLTRSALASSRLTLFTALLVLVGGVLTFLN-----FPSQEEPSVTIRDAVVQLAYPGMPT 56
N L S+ L L++ G V+ FL P +++ ++QL G
Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLT---MIQLP-AGATQ 583

Query: 57 EKVETLLARPVEENLRSLAGIKKIVTTVR--------PGSAILQIT------AHDSVADL 102
E+ + +L + + L++ + V TV + + ++ +
Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSA 643

Query: 103 PALWLRVRAKAAEVGGALPAGTMGPFVDD--DFGRVSVASIAVTAPGF-SMSEMRGPLRK 159
A+ R + + ++ P + + I G ++++ R L
Sbjct: 644 EAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLG 703

Query: 160 MREQLYTLPGVERVSLYGLQ-EDRIYIAFDRVRLVEAGLSPASVIDQL------RRQNVV 212
M Q + V GL+ + + D+ + G+S + + + N
Sbjct: 704 MAAQH--PASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDF 761

Query: 213 VPGGLVSASGMAMTVATSGEVGNVQALKQVLINTQGSGGAREIALGALAQVQVMPADPPE 272
+ G V + A + + + ++ + S + A +
Sbjct: 762 IDRGRVKKLYVQ---ADAKFRMLPEDVDKLYVR---SANGEMVPFSAFTTSHWVYG--SP 813

Query: 273 TAAIYQGQPAVVVAVSMASGYNVVSFGKALREKLVETASLLPTGFQQHVVTFQADVVDRE 332
Y G P++ + A G AL E L LP G + T +
Sbjct: 814 RLERYNGLPSMEIQGEAAPGT-SSGDAMALMENLASK---LPAGIG-YDWTGMSYQERLS 868

Query: 333 MSKMHQVMGETVVIV-MAVVMLFLGWRTGLIVGAIVPLTILGSLILMRVLNVELQTVSIA 391
++ ++ + V+V + + L+ W + V +VPL I+G L+ + N + +
Sbjct: 869 GNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMV 928

Query: 392 AIILALGLLVDNGIVIAEDI-ERRLMAGEERFHACVEAGRTLAIPLLTSSLVIVLAFSPF 450
++ +GL N I+I E + G+ A + A R P+L +SL +L P
Sbjct: 929 GLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPL 988

Query: 451 FFGQTSTNEYMRSLAVVLAITLLGSWLLSITVTPLLCLYFARAHAG 496
+ + ++ + + ++ + LL+I P+ + R G
Sbjct: 989 AISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRCFKG 1034


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS00210  RTXTOXIND     48     4e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 47.5 bits (113), Expect = 4e-08
Identities = 16/105 (15%), Positives = 33/105 (31%)

Query: 66 GGRIKAIYVDVGDRVREGQLLAQLDLEPARLRLQQAQANAASAAADLRERKIQLDQQTAM 125
+K I V G+ VR+G +L +L A + Q++ A + +I
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELN 163

Query: 126 FTDGATSQATLTTATVAADAARARLQVAESDRALAQRALRQADIR 170
V+ + + + + Q Q ++
Sbjct: 164 KLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELN 208



Score = 34.0 bits (78), Expect = 8e-04
Identities = 20/140 (14%), Positives = 42/140 (30%), Gaps = 14/140 (10%)

Query: 89 LDLEPARLRLQQAQANAASAAADLRERKIQLDQQTAMFTDGATSQAT--LTTATVAADAA 146
L+ E + S + + ++ + T ++ L T
Sbjct: 255 LEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLL 314

Query: 147 RARLQVAESDRALAQRALRQADIRAPFDGNVVARLQQPHVD--VPAGQGVLQLEGQGRTQ 204
L E + + IRAP V +L+ V + ++ + + T
Sbjct: 315 TLELAKNEER-------QQASVIRAPVSV-KVQQLKVHTEGGVVTTAETLMVIVPEDDTL 366

Query: 205 VV-ALLPP-QVADLSPGSTV 222
V AL+ + ++ G
Sbjct: 367 EVTALVQNKDIGFINVGQNA 386


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS00220  HTHFIS        86     2e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 85.7 bits (212), Expect = 2e-21
Identities = 34/115 (29%), Positives = 59/115 (51%), Gaps = 1/115 (0%)

Query: 1 MAAKKVLVVEDDADSASVLEAYLRREGFDVAIAADGIRAVQLHAQWKPDLVLLDMMLPAL 60
M +LV +DDA +VL L R G+DV I ++ + A DLV+ D+++P
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 SGIEVLSAIRLVG-DTPVIMVTAIGDEPEKLGALRYGADDYVVKPYSPKEVVARV 114
+ ++L I+ D PV++++A + A GA DY+ KP+ E++ +
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS00300  BCTERIALGSPF  30     0.012    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 30.2 bits (68), Expect = 0.012
Identities = 22/84 (26%), Positives = 33/84 (39%), Gaps = 6/84 (7%)

Query: 220 AQGRQLLKRGGRF-VDIHPTPAKFLQSVFNSTLKVVVCKPRKEILKKVAVAAQDGLLKTT 278
Q RQLL+ G + + +S + RK L +A L T
Sbjct: 26 RQARQLLRERGLVPLSVDENRGDQQKS-----GSTGLSLRRKIRLSTSDLALLTRQLATL 80

Query: 279 VGASVPLKAAIDLLAQLENGKRLG 302
V AS+PL+ A+D +A+ L
Sbjct: 81 VAASMPLEEALDAVAKQSEKPHLS 104


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS22980  FbpA_PF05833  28     0.021    Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 27.5 bits (61), Expect = 0.021
Identities = 13/71 (18%), Positives = 27/71 (38%), Gaps = 6/71 (8%)

Query: 75 KAKKAAPS-SSTIKVLRQEIAELKKLKAELITVIASQRAELDRARKSLIELGADPIVRSM 133
K KK+ + + + +E+ L + + A E++ +K LIE G ++
Sbjct: 392 KLKKSEEAANEQLLQNEEELNYLYSVLTNINN--ADNYDEIEEIKKELIETG---YIKFK 446

Query: 134 QRNFRNKRAKE 144
+ K
Sbjct: 447 KIYKSKKSKTS 457


2     XADLMG695_RS00745  XADLMG695_RS00875  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag           DNBias  CDNBias  %GCBias     Product
XADLMG695_RS00745  -1      12       -3.080780   mannose-1-phosphate
XADLMG695_RS00750  0       11       -2.907880   nucleotide sugar dehydrogenase
XADLMG695_RS00755  -2      14       -1.900587   dTDP-4-dehydrorhamnose reductase
XADLMG695_RS00760  -1      14       -2.886487   dTDP-4-dehydrorhamnose 3,5-epimerase
XADLMG695_RS00765  -3      13       -2.851747   glucose-1-phosphate thymidylyltransferase RfbA
XADLMG695_RS00770  -2      14       -2.387456   dTDP-glucose 4,6-dehydratase
XADLMG695_RS00775  -2      17       -2.249222   electron transfer flavoprotein subunit beta/FixA
XADLMG695_RS00780  -2      19       -3.274772   electron transfer flavoprotein subunit
XADLMG695_RS00785  -2      21       -4.702395   flippase-like domain-containing protein
XADLMG695_RS00790  -1      20       -5.993191   UbiA family prenyltransferase
XADLMG695_RS00795  0       24       -6.839576   FAD-binding oxidoreductase
XADLMG695_RS00800  0       29       -8.773644   SDR family oxidoreductase
XADLMG695_RS00805  2       33       -9.811567   class I SAM-dependent methyltransferase
XADLMG695_RS00810  2       36       -9.825501   NAD-dependent epimerase/dehydratase family
XADLMG695_RS00815  3       43       -10.495263  NAD(P)/FAD-dependent oxidoreductase
XADLMG695_RS00820  3       48       -10.686441  GtrA family protein
XADLMG695_RS00825  3       46       -10.382378  hypothetical protein
XADLMG695_RS22995  3       45       -10.314295  hypothetical protein
XADLMG695_RS00830  3       45       -10.664908  hypothetical protein
XADLMG695_RS00835  3       43       -10.394651  hypothetical protein
XADLMG695_RS23000  2       40       -9.824659   hypothetical protein
XADLMG695_RS00840  2       39       -9.984840   glycosyltransferase
XADLMG695_RS00845  3       34       -8.704836   glycosyltransferase
XADLMG695_RS00850  2       21       -6.635999   class I SAM-dependent methyltransferase
XADLMG695_RS00855  3       17       -5.579327   CatB-related O-acetyltransferase
XADLMG695_RS00860  2       15       -4.053761   ABC transporter ATP-binding protein
XADLMG695_RS00865  3       14       -2.422039   ABC transporter permease
XADLMG695_RS00870  3       14       -0.979446   cystathionine gamma-synthase
XADLMG695_RS00875  2       14       -0.581448   pyridoxal-phosphate dependent enzyme
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS00770  NUCEPIMERASE  183    1e-57    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 183 bits (467), Expect = 1e-57
Identities = 88/346 (25%), Positives = 136/346 (39%), Gaps = 42/346 (12%)

Query: 5 LVTGGAGFIGGNFVLEAVARGVRVVNLDALT--YAGNLNTL-ASLDGNPDHVFVKGDIGD 61
LVTG AGFIG + + G +VV +D L Y +L L P F K D+ D
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLAD 63

Query: 62 GPLVASLLHEHQPDAVLNFAAESHVDRSIEGPGAFIQTNVVGTLALLEAVRDHWKALPKE 121
+ L + V V S+E P A+ +N+ G L +LE R
Sbjct: 64 REGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRH-------- 115

Query: 122 RQDAFRFLHVSTDEVYGTLGETGKFTETTPYA-PNSPYSASKAASDHLVRAFRHTYGLPV 180
L+ S+ VYG L F+ P S Y+A+K A++ + + H YGLP
Sbjct: 116 -NKIQHLLYASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPA 173

Query: 181 LTTNCSNNYGPYHFPEKLIPLVIAKALAGEPLPVYGDGKQVRDWLFVSDHCEAIRTVL-- 238
YGP+ P+ + L G+ + VY GK RD+ ++ D EAI +
Sbjct: 174 TGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDV 233

Query: 239 ----------------AKGKVGETYNVGGNSERQNIEVVQAICALLDQHRPRDDGKPRAS 282
A YN+G +S + ++ +QA+ L +
Sbjct: 234 IPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG----------IEA 283

Query: 283 QITYVTDRPGHDRRYAIDASKLKNELGWEPAYTFEQGIAQTVHWYL 328
+ + +PG + D L +G+ P T + G+ V+WY
Sbjct: 284 KKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYR 329


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS00800  DHBDHDRGNASE  57     1e-11    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 56.6 bits (136), Expect = 1e-11
Identities = 48/204 (23%), Positives = 78/204 (38%), Gaps = 7/204 (3%)

Query: 4 VLIIGATSAIAEATARRYAARGAAVHLLGRQAPRLETIAADLTTRGARTSIGVLDVNDNA 63
I GA I EA AR A++GA + + +LE + + L DV D+A
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 64 RHGEVLDAAWAALGGVDVVLIAHGTLPDQAACNASVELSLREFATNGTSTIALCAALVPR 123
E+ +G +D+++ G L + S E F+ N T ++
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130

Query: 124 LTS--GATLAVISSVAGDRGRASNYLYGSAKAAVTAYLSGLGQRLRPQGINVLTIKPGFV 181
+ ++ + S R S Y S+KAA + LG L I + PG
Sbjct: 131 MMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGST 190

Query: 182 DTPMTAAFKKGALWAKPDQIAKGI 205
+T M + +LWA + + I
Sbjct: 191 ETDM-----QWSLWADENGAEQVI 209


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS00810  NUCEPIMERASE  51     2e-09    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 51.3 bits (123), Expect = 2e-09
Identities = 62/293 (21%), Positives = 114/293 (38%), Gaps = 60/293 (20%)

Query: 7 KIVLTGAAGLVGQNLIVEMKQQGYTQLVAIDK---------HEHNLEILRKLHPDVKTIL 57
K ++TGAAG +G ++ + + G+ Q+V ID + LE+L + P +
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGH-QVVGIDNLNDYYDVSLKQARLELLAQ--PGFQFHK 58

Query: 58 ADLAEPGVWSEAFA--GARLIVQLHAQITGKF-----RTLFDRNNLQATENVLKACVDHQ 110
DLA+ ++ FA + ++ ++ D NL N+L+ C ++
Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADS-NLTGFLNILEGCRHNK 117

Query: 111 IPYMVHISSSVV------NSVATDD--------YTETKKLQEALV----RNSGIPHCVLR 152
I ++++ SSS V +TDD Y TKK E + G+P LR
Sbjct: 118 IQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLR 177

Query: 153 PTLMFG-WFDPK--HLGWLSRFMARTPVFPIPGDGKFMRQPLYERDFCRCIVQCIEREPA 209
++G W P + + + + GK R Y D I++ + P
Sbjct: 178 FFTVYGPWGRPDMALFKFTKAMLEGKSI-DVYNYGKMKRDFTYIDDIAEAIIRLQDVIPH 236

Query: 210 GD------------------IYDIVGATRVDYVDIIRTIKRAKQLRTVIVHIP 244
D +Y+I ++ V+ +D I+ ++ A + +P
Sbjct: 237 ADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLP 289


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS00850  PF02370       30     0.006    M protein repeat
		>PF02370#M protein repeat

Length = 168

Score = 30.5 bits (68), Expect = 0.006
Identities = 20/98 (20%), Positives = 38/98 (38%), Gaps = 5/98 (5%)

Query: 224 QLGKLIGANIDIGTLAQEQGRLIGLVNEHQAARQIINQEVVDLKANLEQRIDALHRA--N 281
Q L+G N D L + +G+ + E + R+ + + Q D ++
Sbjct: 49 QYRALMGENQD---LRKREGQYQDKIEELEKERKEKQERPERREKFERQHQDKHYQEQQK 105

Query: 282 LLGEQLSQTQEILALREKDNQELNASLLSMTRELERMR 319
++ Q + K+ Q +AS + R+LE R
Sbjct: 106 KHQQEQQQLEAEKQKLAKEKQISDASRQGLNRDLEASR 143


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS00865  ABC2TRNSPORT  37     4e-05    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein

signature.
Length = 262

Score = 36.8 bits (85), Expect = 4e-05
Identities = 26/89 (29%), Positives = 39/89 (43%)

Query: 151 LTVLYFPLVIFPLVLVSAGVTWFFAALGVYYRDIGQITGLLATVLLFMSPALYPVSSLPP 210
L++LY VI L A + AL Y L+ T +LF+S A++PV LP
Sbjct: 145 LSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPI 204

Query: 211 SMQKLIYLNPLTFIIEQSRNVLMWGLPPD 239
Q PL+ I+ R +++ D
Sbjct: 205 VFQTAARFLPLSHSIDLIRPIMLGHPVVD 233


3     XADLMG695_RS01070  XADLMG695_RS01125  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag           DNBias  CDNBias  %GCBias     Product
XADLMG695_RS01070  2       22       -0.853165   GtrA family protein
XADLMG695_RS01075  3       22       -0.584895   chorismate mutase
XADLMG695_RS01080  4       25       -1.214942   F0F1 ATP synthase subunit epsilon
XADLMG695_RS01085  4       26       -1.199428   F0F1 ATP synthase subunit beta
XADLMG695_RS01090  2       21       -1.755213   F0F1 ATP synthase subunit gamma
XADLMG695_RS01095  2       21       -1.179239   F0F1 ATP synthase subunit alpha
XADLMG695_RS01100  2       18       -1.929353   F0F1 ATP synthase subunit delta
XADLMG695_RS01105  2       19       -2.299097   F0F1 ATP synthase subunit B
XADLMG695_RS01110  2       24       -0.331867   F0F1 ATP synthase subunit C
XADLMG695_RS01115  1       21       0.160361    F0F1 ATP synthase subunit A
XADLMG695_RS01120  3       22       1.332190    hypothetical protein
XADLMG695_RS01125  2       21       1.555118    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS01125  OMPADOMAIN    31     0.004    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 31.1 bits (70), Expect = 0.004
Identities = 17/73 (23%), Positives = 23/73 (31%), Gaps = 8/73 (10%)

Query: 198 QGQYLNTSW-GDFGDYDGDLSRANAIAEYRFTKNFGIFAGYDWFKLDVDREGSDGLVGLK 256
QY +T + + G + A A Y+ G GYDW G G
Sbjct: 36 WSQYHDTGFINNNGPTHENQLGAGAFGGYQVNPYVGFEMGYDWL-------GRMPYKGSV 88

Query: 257 QEFKGPVAGVTLA 269
+ GV L
Sbjct: 89 ENGAYKAQGVQLT 101


4     XADLMG695_RS01205  XADLMG695_RS23010  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag           DNBias  CDNBias  %GCBias     Product
XADLMG695_RS01205  2       13       1.564881    hypothetical protein
XADLMG695_RS01210  0       12       -0.284833   response regulator
XADLMG695_RS01215  1       18       -2.350431   alpha/beta fold hydrolase
XADLMG695_RS01220  3       25       -4.510313   DUF1415 domain-containing protein
XADLMG695_RS01230  2       25       -4.188858   SDR family oxidoreductase
XADLMG695_RS01235  3       25       -4.073596   hypothetical protein
XADLMG695_RS01240  2       27       -4.583425   hypothetical protein
XADLMG695_RS23010  3       24       -3.950729   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS01210  HTHFIS        74     3e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.5 bits (183), Expect = 3e-16
Identities = 34/118 (28%), Positives = 56/118 (47%), Gaps = 1/118 (0%)

Query: 460 LVFEDMDTNRLVIGNLLTRAGHRVSFQVDGTDAVQRIREAAPDLVFLDLHMPGTSGWDAL 519
LV +D R V+ L+RAG+ V + + I DLV D+ MP + +D L
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLL 66

Query: 520 REARDAMSALPPIIVLTADTRTDSMRDASAAGVAGYLPKPINAHELLALLAQHASHAR 577
+ A L P++V++A + AS G YLPKP + EL+ ++ + + +
Sbjct: 67 PRIKKARPDL-PVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS01230  DHBDHDRGNASE  94     5e-25    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 94.0 bits (233), Expect = 5e-25
Identities = 65/259 (25%), Positives = 104/259 (40%), Gaps = 13/259 (5%)

Query: 11 NPSPLQDRVVVITGGAQGIGRGIAQAVLGAGGSVVIGDLDADAGKACLQ-EWALPRRSAF 69
N ++ ++ ITG AQGIG +A+ + G + D + + + + A R +
Sbjct: 2 NAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA 61

Query: 70 VRCDAARQAQATRLIEAALKRFGRIDGLVNNAGVPDPHIAALPQLDWDTWNSRLS-SLHG 128
D A + + G ID LVN AGV + L + W + S + G
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVL--RPGLIHSLSDEEWEATFSVNSTG 119

Query: 129 AFLCSKQALPALRQAPEGGAIINIASTRAWQSEPHSEAYAAAKGGLVAFTHALALSEGPH 188
F S+ + G+I+ + S A AYA++K V FT L L +
Sbjct: 120 VFNASRSVSKYMM-DRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEY 178

Query: 189 -VRVNSISPGWISTEAWRA--PQRRRAPKLSRRDHAQH----PAGRVGTPEDIAQLAVYL 241
+R N +SPG T+ + A ++ + P ++ P DIA AV
Sbjct: 179 NIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIAD-AVLF 237

Query: 242 LAPQLSGFVTGQDFIVDGG 260
L +G +T + VDGG
Sbjct: 238 LVSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS23010  INTIMIN       31     0.026    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 31.2 bits (70), Expect = 0.026
Identities = 10/30 (33%), Positives = 18/30 (60%)

Query: 194 GPGTYTITAVATDNNGNTGNSQAVSVSITQ 223
G Y +TA A D NGN+ N+ +++++
Sbjct: 521 GSNVYKVTARAYDRNGNSSNNVLLTITVLS 550


5     XADLMG695_RS01665  XADLMG695_RS01780  Y     Y          Y                   Pathogenicity Island (biased-composition)

LocusTag           DNBias  CDNBias  %GCBias     Product
XADLMG695_RS01665  2       12       -0.318968   DUF1653 domain-containing protein
XADLMG695_RS01670  2       15       -0.414889   hypothetical protein
XADLMG695_RS01675  1       17       -0.994384   SulP family inorganic anion transporter
XADLMG695_RS01685  2       17       -1.023200   chloride channel protein
XADLMG695_RS01690  2       18       -1.537181   hypothetical protein
XADLMG695_RS01695  0       13       -1.623080   type I methionyl aminopeptidase
XADLMG695_RS01700  1       18       -3.297795   ParD-like family protein
XADLMG695_RS01705  1       17       -3.085181   LysR family transcriptional regulator
XADLMG695_RS01710  1       13       -3.510584   SDR family oxidoreductase
XADLMG695_RS01715  1       13       -2.978333   nuclear transport factor 2 family protein
XADLMG695_RS01720  1       13       -1.618028   *hypothetical protein
XADLMG695_RS01725  0       12       -1.573999   RNA polymerase sigma factor RpoD
XADLMG695_RS01735  -2      12       0.380415    D-tyrosyl-tRNA(Tyr) deacylase
XADLMG695_RS01740  -1      12       1.064586    lauroyl acyltransferase
XADLMG695_RS01745  0       13       3.318166    N-acetyltransferase
XADLMG695_RS01755  0       14       2.571011    GTP cyclohydrolase II RibA
XADLMG695_RS01770  1       15       2.223752    PH domain-containing protein
XADLMG695_RS01775  0       14       2.515735    PH domain-containing protein
XADLMG695_RS01780  0       16       3.007016    CDP-glycerol glycerophosphotransferase family
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS01720  DHBDHDRGNASE  115  2e-33  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 115 bits (290), Expect = 2e-33
Identities = 75/251 (29%), Positives = 120/251 (47%), Gaps = 8/251 (3%)

Query: 16 GKIALVTGGSSGIGLAAAKRLALEGATVV---ISGRRQQELDRAVAEIGHGATAVRADIS 72
GKIA +TG + GIG A A+ LA +GA + + + +++ ++ A A AD+
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 73 VGAELDAVMDGIATAHGRLDLLLANAGGGEFAPIESITEAGFDKYFNINVKGTLLTVQKA 132
A +D + I G +D+L+ AG I S+++ ++ F++N G +
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 133 LPLMGA--GSAIVVTGSIAANQGVSNFGVYAATKAALRSFVRTWASELRARDIRVNLIAP 190
M +IV GS A ++ YA++KAA F + EL +IR N+++P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 191 GVVVTPAYRS---ELGMSEEDIDAYLDQIKQKAPLGRSASPDEMAKAMSFLASDDASYIT 247
G T S + +E+ I L+ K PL + A P ++A A+ FL S A +IT
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247

Query: 248 GIELTVDGGLT 258
L VDGG T
Sbjct: 248 MHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS01755  SACTRNSFRASE  38  7e-06  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.6 bits (87), Expect = 7e-06
Identities = 15/61 (24%), Positives = 25/61 (40%), Gaps = 1/61 (1%)

Query: 82 SVEHSIYVHRDHRGKGLGRLLLQALIAAAQARGVHVLVGGIDASNQASIALHEQFGFTHA 141
+E I V +D+R KG+G LL I A+ L+ N ++ + + F
Sbjct: 91 LIED-IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIG 149

Query: 142 G 142

Sbjct: 150 A 150


6  XADLMG695_RS01870  XADLMG695_RS01895  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS01870  0  14  4.068110  signal peptide peptidase SppA
XADLMG695_RS01875  -1  12  4.481514  MATE family efflux transporter
XADLMG695_RS01880  -1  12  4.165588  DUF3667 domain-containing protein
XADLMG695_RS01885  -1  13  4.687073  DUF3106 domain-containing protein
XADLMG695_RS01890  -2  12  4.223888  hypothetical protein
XADLMG695_RS01895  -2  12  3.382520  primosomal protein N'
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS01870  PF07520  31  0.024  Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 30.7 bits (69), Expect = 0.024
Identities = 30/140 (21%), Positives = 45/140 (32%), Gaps = 33/140 (23%)

Query: 303 ADSDADSGFR-----NIGFNDYLSQLQAQRSPMDSRPQVAVVVAAGEISGGEQPAGRIGG 357
+ D + R IG QL +R + P + A I AG+I
Sbjct: 922 SAQDPTAIVRMHSPVYIGAR----QLPLERWT--TTPLYRLDFANDSI------AGKIKL 969

Query: 358 ESTAALLRQARDDDEVKAVVLRVDSPGGEVFASEQIRREVV---ALKQAGKPV-----VV 409
L+R+ D DE E +E++R A G + V+
Sbjct: 970 PVKVELVREDDDFDE--------AETSLEKLRAERVREVFRVDAAEDAEGTMIKNDDVVL 1021

Query: 410 SMGDLAASGGYWISMNADRI 429
S+ L YW+ RI
Sbjct: 1022 SLHTLGFEDEYWLDTGVFRI 1041


7  XADLMG695_RS02520  XADLMG695_RS21605  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS02520  2  11  -0.185520  Do family serine endopeptidase
XADLMG695_RS02525  -1  9  0.304587  hypothetical protein
XADLMG695_RS02530  1  11  1.321442  hypothetical protein
XADLMG695_RS02535  2  13  1.526087  hypothetical protein
XADLMG695_RS02540  1  13  1.478781  AI-2E family transporter
XADLMG695_RS02545  4  14  1.521450  HAD family hydrolase
XADLMG695_RS02550  3  14  1.098550  leucyl aminopeptidase family protein
XADLMG695_RS02555  3  15  0.797828  cytochrome b
XADLMG695_RS21605  2  17  0.488046  response regulator transcription factor
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS02525  V8PROTEASE  82  2e-19  V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 82.4 bits (203), Expect = 2e-19
Identities = 31/193 (16%), Positives = 71/193 (36%), Gaps = 40/193 (20%)

Query: 110 LGSGVIIDAQKGYVLTNHHVIENADDVQVTL------------GDGRTVKAEFIGSDADT 157
+ SGV++ K +LTN HV++ L +G + +
Sbjct: 103 IASGVVVG--KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEG 160

Query: 158 DIALIRIKAD--------NLTDIKLADSNALRVGDFVVAIGNPFG---FTQTVTSGIVSA 206
D+A+++ + + ++++ +V + G P T + G ++
Sbjct: 161 DLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMWESKGKITY 220

Query: 207 VGRSGIRGLGYQNFIQTDASINPGNSGGALVNLQGQLVGINTASFNPQGSMAGNIGLGLA 266
+ +Q D S GNSG + N + +++GI+ + N + +
Sbjct: 221 L---------KGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWGGVPNE----FNGAVFIN 267

Query: 267 --IPSNLARNVVE 277
+ + L +N+ +
Sbjct: 268 ENVRNFLKQNIED 280


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS02575  HTHFIS  62  1e-13  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 62.2 bits (151), Expect = 1e-13
Identities = 26/116 (22%), Positives = 50/116 (43%), Gaps = 2/116 (1%)

Query: 2 IRVLLAEDQALLRGALVALLGLEDDIAVVGSAGDGESAWRELQRLQPDVLVTDIEMPGLT 61
+L+A+D A +R L L V + + WR + D++VTD+ MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GLELAQRIQRQALPVRVMIVTTFARPGFLRRALDAGVAGYLLKDAPAEQLVDALRQ 117
+L RI++ + V++++ +A + G YL K +L+ + +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


8  XADLMG695_RS02650  XADLMG695_RS02775  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS02650  2  12  1.082535  roadblock/LC7 domain-containing protein
XADLMG695_RS02655  0  13  0.859467  hypothetical protein
XADLMG695_RS02660  0  13  1.688143  hypothetical protein
XADLMG695_RS02665  0  11  2.854905  GTPase
XADLMG695_RS02670  1  9  3.655965  dTMP kinase
XADLMG695_RS02675  1  11  4.365855  *SPOR domain-containing protein
XADLMG695_RS02680  0  11  4.234081  type III pantothenate kinase
XADLMG695_RS02685  1  10  4.396804  bifunctional biotin--[acetyl-CoA-carboxylase]
XADLMG695_RS21620  1  13  3.392851  hypothetical protein
XADLMG695_RS02705  1  12  1.835092  zinc-dependent peptidase
XADLMG695_RS02715  -2  9  -0.459985  DUF1501 domain-containing protein
XADLMG695_RS02725  -3  10  -0.161034  DUF1800 domain-containing protein
XADLMG695_RS02730  -3  10  -0.458305  sensor histidine kinase
XADLMG695_RS02740  0  14  -0.997400  response regulator transcription factor
XADLMG695_RS02745  -1  12  -1.180231  hypothetical protein
XADLMG695_RS02750  -1  14  -2.380768  tRNA dihydrouridine(20/20a) synthase DusA
XADLMG695_RS02755  0  18  -4.306787  hypothetical protein
XADLMG695_RS02760  1  21  -4.533388  ankyrin repeat domain-containing protein
XADLMG695_RS02765  1  14  -3.572333  catalase
XADLMG695_RS02770  0  19  -3.716342  hypothetical protein
XADLMG695_RS02775  0  17  -3.109336  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS21620  IGASERPTASE  31  0.006  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.2 bits (70), Expect = 0.006
Identities = 26/126 (20%), Positives = 34/126 (26%), Gaps = 18/126 (14%)

Query: 46 PTVTGAGGPVRPAAAPTESTAAAG------------RASTAPAPSPSPASGPASVPTPAV 93
P V V T + A R AP P P+PA+ + T A
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAE 1042

Query: 94 AGP--SASTSAAPPAATTAAAQAPRGAAE----TSAAANAAAKPANGVATTAAQPSPAPS 147
S + AT AQ A E A +G T Q +
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102

Query: 148 PPAAER 153
E+
Sbjct: 1103 TATVEK 1108


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS02705  PF03309  114  4e-33  Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 114 bits (288), Expect = 4e-33
Identities = 57/252 (22%), Positives = 92/252 (36%), Gaps = 25/252 (9%)

Query: 5 LFDLGNSRFKYAPLHGNRAGQ--VQAWAHGAE--------AMDAAALAALPSGRI--AYV 52
D+ N+ + G+ VQ W E A+ L + R+ A
Sbjct: 4 AIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTADELALTIDGLIGDDAERLTGASG 63

Query: 53 ASVAAPALTQRMIACLQERFTQVRIVRTTAECA-GIRIAYADPSRFGVDRFLALLGARG- 110
S P++ + L++ + V V GI + +P G DR + L A
Sbjct: 64 LSTV-PSVLHEVRVMLEQYWPNVPHVLIEPGVRTGIPLLVDNPKEVGADRIVNCLAAYHK 122

Query: 111 -DAPVLVAGVGTALTIDVLGADGLHHGGRIAASPTTMREALHARAEQLPA---SGGDYVE 166
+V G+++ +DV+ A G GG IA +A AR+ L + V
Sbjct: 123 YGTAAIVVDFGSSICVDVVSAKGEFLGGAIAPGVQVSSDAAAARSAALRRVELTRPRSV- 181

Query: 167 LAIDTDDALTSG----CDGAAVALIERSLQHAQRSLGVPVRLLVHGGGAPPLLPLLPDA- 221
+ +T + + +G G L+ R G V ++ G AP +LP L
Sbjct: 182 IGKNTVECMQAGAVFGFAGLVDGLVNRIRDDVDGFSGADVAVVATGHTAPLVLPDLRTVE 241

Query: 222 TFRAALVLDGLA 233
+ L LDGL
Sbjct: 242 HYDRHLTLDGLR 253


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS02725  PF07201  30  0.013  Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 30.2 bits (68), Expect = 0.013
Identities = 20/124 (16%), Positives = 39/124 (31%), Gaps = 11/124 (8%)

Query: 177 GKGGLDARQAQILSQMYDSTPLAAAAREGLALRQQVTAQLREEME---QAG-RGAASART 232
GK + Q ++L + D+ L +Q + EE G R A
Sbjct: 127 GKSEEPSEQFKMLCGLRDALKGRPELAHLSHLVEQALVSMAEEQGETIVLGARITPEAYR 186

Query: 233 FADETRRMATLMRERYRLGFVDVGG----WDT-HANQGSVEGGLANNLRNLGEGLAAYAD 287
+ +R+ YR + G W + + + + + L + L+A
Sbjct: 187 ESQSGVNPLQPLRDTYRDAVMGYQGIYAIWSDLQKRFPNGD--IDSVILFLQKALSADLQ 244

Query: 288 ALGP 291
+
Sbjct: 245 SQQS 248


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS02740  HTHFIS  97  1e-25  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 97.2 bits (242), Expect = 1e-25
Identities = 29/117 (24%), Positives = 58/117 (49%)

Query: 2 RILLVEDEAPLRETLAARLKREGFAVDAAQDGEEGLYMGREVPFDVGIIDLGLPKMSGME 61
IL+ +D+A +R L L R G+ V + D+ + D+ +P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 LIKALRDEGKKFPVLILTARSSWQDKVEGLKQGADDYLVKPFHVEELLARVNALLRR 118
L+ ++ PVL+++A++++ ++ ++GA DYL KPF + EL+ + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


9  XADLMG695_RS02825  XADLMG695_RS21640  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS02825  -1  14  -4.220980  DNA/RNA non-specific endonuclease
XADLMG695_RS02830  2  24  -6.702260  porphobilinogen synthase
XADLMG695_RS02840  3  37  -9.631304  hypothetical protein
XADLMG695_RS02845  2  33  -8.970626  hypothetical protein
XADLMG695_RS21635  1  23  -6.969846  type IV secretion system protein
XADLMG695_RS02855  0  22  -5.818002  DUF4189 domain-containing protein
XADLMG695_RS21640  -1  16  -3.602176  hypothetical protein
10  XADLMG695_RS03045  XADLMG695_RS03105  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS03045  0  13  3.779713  glucose-6-phosphate dehydrogenase
XADLMG695_RS03050  1  13  4.866909  glycoside hydrolase family 15 protein
XADLMG695_RS21665  2  14  6.051892  hypothetical protein
XADLMG695_RS03055  0  14  4.593412  ankyrin repeat domain-containing protein
XADLMG695_RS03060  -1  13  2.269312  YcgL domain-containing protein
XADLMG695_RS03065  -2  12  1.790164  beta-ketoacyl-[acyl-carrier-protein] synthase
XADLMG695_RS03070  -1  11  0.745355  beta-ketoacyl synthase chain length factor
XADLMG695_RS03075  0  12  0.474106  glycosyltransferase family 2 protein
XADLMG695_RS03080  0  13  0.318419  tryptophan 7-halogenase
XADLMG695_RS03085  2  13  1.356324  3-oxoacyl-ACP reductase FabG
XADLMG695_RS03090  1  14  1.399095  hypothetical protein
XADLMG695_RS03095  2  15  2.137393  hotdog family protein
XADLMG695_RS03100  1  15  1.908281  MMPL family transporter
XADLMG695_RS03105  2  14  1.456304  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS03045  PF03544  29  0.044  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 29.2 bits (65), Expect = 0.044
Identities = 12/79 (15%), Positives = 20/79 (25%), Gaps = 4/79 (5%)

Query: 510 PPPRRPPEVRAGAATPVKKAAKKTARKVGKAPAKKLASASARRASAAAKPSQDAATSSGG 569
P P PE A ++K K K P +R + + +
Sbjct: 78 PEPEPIPEPPKEAPVVIEKPKPKPKPK----PKPVKKVEQPKRDVKPVESRPASPFENTA 133

Query: 570 AAKKTVRKRATSKVAKAAG 588
A+ T +
Sbjct: 134 PARPTSSTATAATSKPVTS 152


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS03070  PF06776  34  4e-04  Invasion associated locus B
		>PF06776#Invasion associated locus B

Length = 214

Score = 33.8 bits (77), Expect = 4e-04
Identities = 16/66 (24%), Positives = 21/66 (31%)

Query: 21 AAGAFARGGALQDTPARPAPQLLAANERRRAPDTVAVSLDAALAACSAAGRDPASLPSIF 80
A A+Q PA +P L + R + A A S D A
Sbjct: 17 TNHAVPALKAIQMGPAELSPMLASCRRLARRNGARLMLAGAMAIALSFGWSDRADAQGAV 76

Query: 81 TSTYGD 86
S +GD
Sbjct: 77 RSVHGD 82


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS03085  DHBDHDRGNASE  117  5e-34  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 117 bits (293), Expect = 5e-34
Identities = 80/249 (32%), Positives = 128/249 (51%), Gaps = 14/249 (5%)

Query: 6 RRALVTGGSGDLGGAICRHLAAQGRHVIVHANRNLTRADEVVAAIVADGGSAQAVAFDVA 65
+ A +TG + +G A+ R LA+QG H+ + N + ++VV+++ A+ A+A DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAA-VDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 66 DAQASAAALERLL-EAGPIQIVVNNAGIHDDAPMAGMNAEQWHRVIDVSLHGFFNVTQPL 124
D+ A R+ E GPI I+VN AG+ + ++ E+W V+ G FN ++ +
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 125 LLPMARTRWGRIVSVSSVAAVLGNRGQTNYAAAKAALHGASKSLSREMASRGIAVNVVAP 184
M R G IV+V S A + YA++KAA +K L E+A I N+V+P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 185 GVIESDM-----VGDSFAPEVIKQL-------VPAGRVGKPDEVAALVAFLCSEPAGYIN 232
G E+DM ++ A +VIK +P ++ KP ++A V FL S AG+I
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247

Query: 233 GQVIGVNGG 241
+ V+GG
Sbjct: 248 MHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS03100  ACRIFLAVINRP  48  2e-07  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 47.5 bits (113), Expect = 2e-07
Identities = 31/155 (20%), Positives = 65/155 (41%), Gaps = 18/155 (11%)

Query: 638 VLGALVLAALLLAVTVAIALRSPRRIVRVLLPMALTTVLILAILRGTGVELNLFHLIALI 697
V+ L A +L+ + + + L++ R + + + + + AIL G +N + ++
Sbjct: 340 VVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMV 399

Query: 698 LAAGLGLDYAL-----FFDHAGDDHADQLRTLH--------ALIVCSLMTLLVF---ALL 741
LA GL +D A+ +D AL+ +++ VF A
Sbjct: 400 LAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFF 459

Query: 742 AASSIPVLRAIGSTVALGVLFNFILALLVSREPAL 776
S+ + R T+ + + ++AL+++ PAL
Sbjct: 460 GGSTGAIYRQFSITIVSAMALSVLVALILT--PAL 492



Score = 41.0 bits (96), Expect = 2e-05
Identities = 31/163 (19%), Positives = 59/163 (36%), Gaps = 20/163 (12%)

Query: 246 ARTQGEAQWIGTLDTVGLVLLLLVAYRSWKIPVLGVLPLASAGLAGLGAVALLFDGVHGI 305
+ +A + + V + L L Y SW IPV +L + + L A LF+ + +
Sbjct: 866 RLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAA-TLFNQKNDV 924

Query: 306 TVAFGF-TLIGVVQ-------DYPIHLFSHQRPGLDPRENARH-----LWPTLATGVVST 352
G T IG+ ++ L ++ G E L P L T ++
Sbjct: 925 YFMVGLLTTIGLSAKNAILIVEFAKDL--MEKEGKGVVEATLMAVRMRLRPILMT-SLAF 981

Query: 353 CIAYVTFLFSGVDG---LRQLAVFTIAGLATAAVTTRWLLPAL 392
+ + S G + + + G+ +A + + +P
Sbjct: 982 ILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVF 1024


11  XADLMG695_RS03185  XADLMG695_RS03340  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS03185  -1  17  3.161692  DNA polymerase I
XADLMG695_RS03190  0  15  4.015186  DUF2782 domain-containing protein
XADLMG695_RS03195  0  14  4.103585  type VI secretion system protein TssA
XADLMG695_RS03200  1  14  4.030881  ShlB/FhaC/HecB family hemolysin
XADLMG695_RS03205  0  9  4.662042  hypothetical protein
XADLMG695_RS03210  0  9  4.858332  serine/threonine-protein phosphatase
XADLMG695_RS03215  0  9  4.503439  type VI secretion system-associated protein
XADLMG695_RS03220  0  13  3.535626  type VI secretion system membrane subunit TssM
XADLMG695_RS03225  0  12  3.513623  DotU family type IV/VI secretion system protein
XADLMG695_RS21685  0  11  3.651484  aldo/keto reductase family oxidoreductase
XADLMG695_RS23060  -1  11  2.228836  LysR family transcriptional regulator
XADLMG695_RS03245  0  10  1.873150  type VI secretion system-associated FHA domain
XADLMG695_RS03250  1  12  2.584653  hypothetical protein
XADLMG695_RS03255  3  12  3.450073  type VI secretion system tip protein VgrG
XADLMG695_RS03260  4  14  3.586317  tetratricopeptide repeat protein
XADLMG695_RS03265  3  13  3.163215  PAAR domain-containing protein
XADLMG695_RS03270  6  14  4.197115  protein kinase
XADLMG695_RS03275  2  14  3.425308  sigma-70 family RNA polymerase sigma factor
XADLMG695_RS03280  2  12  3.634879  RNA polymerase sigma factor
XADLMG695_RS03285  2  16  2.935749  FecR domain-containing protein
XADLMG695_RS03290  0  16  2.281775  TonB-dependent receptor
XADLMG695_RS03295  -2  16  1.867391  histidine-type phosphatase
XADLMG695_RS03300  -1  17  1.468536  endonuclease
XADLMG695_RS03305  -3  15  1.290649  TetR/AcrR family transcriptional regulator
XADLMG695_RS03310  0  15  -0.780321  amidase
XADLMG695_RS03315  1  18  -1.896488  hypothetical protein
XADLMG695_RS03320  1  19  -1.451662  DUF4124 domain-containing protein
XADLMG695_RS03325  2  28  -2.676990  hypothetical protein
XADLMG695_RS21695  3  40  -6.610052  hypothetical protein
XADLMG695_RS21700  4  40  -5.576881  type VI secretion system baseplate subunit TssG
XADLMG695_RS21705  2  23  -3.719967  type VI secretion system baseplate subunit TssE
XADLMG695_RS03340  2  18  -2.187059  virulence protein SciE type
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS23060  OMPADOMAIN  35  5e-04  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 35.3 bits (81), Expect = 5e-04
Identities = 22/106 (20%), Positives = 37/106 (34%), Gaps = 26/106 (24%)

Query: 237 SLSAPISAQAAQWGIAPATPPDAAPVPPPVRLKQQLSAQERAGLLRVDEQADGQTRVRLS 296
LS +S + Q AP P AP P L
Sbjct: 182 MLSLGVSYRFGQGEAAPVVAPAPAPAPEVQT-----------------------KHFTLK 218

Query: 297 SAAMFASGGVEVEPQQRGLIAQIAAAIEQL---PGRVIVVGHTDDV 339
S +F ++P+ + + Q+ + + L G V+V+G+TD +
Sbjct: 219 SDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRI 264


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS03280  YERSSTKINASE  38  2e-04  Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 37.8 bits (87), Expect = 2e-04
Identities = 34/149 (22%), Positives = 62/149 (41%), Gaps = 31/149 (20%)

Query: 149 HPAIAQIHDVGTDAHG---QPYLVMEYLRGEPITWWCDEHRLSL-----HARV------- 193
HP +A +H + +G + L+M+ + G W C + +L ++
Sbjct: 190 HPNLANVHGMAVVPYGNRKEEALLMDEVDG----WRCSDTLRTLADSWKQGKINSEAYWG 245

Query: 194 ---LLMLRVSEAVQHAHQKGVIHRDLKPSNVLVSEIDGRPMPGVIDFGIAIDATNPGTTC 250
+ R+ + H + GV+H D+KP NV+ G P+ VID G+ +
Sbjct: 246 TIKFIAHRLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPV--VIDLGLH------SRSG 297

Query: 251 AHDRG-TPGYMSPEQARGAQDVDARSDIY 278
+G T + +PE G +SD++
Sbjct: 298 EQPKGFTESFKAPELGVGNLGASEKSDVF 326


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS03295  ACRIFLAVINRP  29  0.026  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.4 bits (66), Expect = 0.026
Identities = 19/73 (26%), Positives = 31/73 (42%), Gaps = 4/73 (5%)

Query: 178 RLQLLQGQASFDVAADRRRLQVRALGLRVEDIGTAFDIALHGQQARVDVSAGRVH--VWR 235
R L+ A F + D+ + Q ALG+ + DI AL G + GRV +
Sbjct: 716 RPNGLEDTAQFKLEVDQEKAQ--ALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQ 773

Query: 236 ADHPAQPMLANLG 248
AD + + ++
Sbjct: 774 ADAKFRMLPEDVD 786


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS03315  HTHTETR  55  7e-12  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 55.0 bits (132), Expect = 7e-12
Identities = 26/155 (16%), Positives = 53/155 (34%), Gaps = 8/155 (5%)

Query: 2 AATRIAQAHGYSGLNVRSLAEDVGIKAASLYHHFPSKADLAAAVAKRYWEDSAATLDALS 61
A R+ G S ++ +A+ G+ ++Y HF K+DL + + + +
Sbjct: 19 VALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQ 78

Query: 62 AET-KEPMKALRRYPETFRRSLENGNRICL---CSFMAAEYDDLPDIVKDEVQAFADVNI 117
A+ +P+ LR S R L F E+ +V+ + +
Sbjct: 79 AKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESY 138

Query: 118 AWLSKMLVAA----EVVGAKDAKKRARTIFAAIGG 148
+ + L + ++ A + I G
Sbjct: 139 DRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG 173


12  XADLMG695_RS03430  XADLMG695_RS03520  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS03430  5  13  3.361066  50S ribosomal protein L33
XADLMG695_RS03435  3  9  2.252935  50S ribosomal protein L28
XADLMG695_RS03440  2  10  1.805623  cation transporter
XADLMG695_RS03445  2  9  1.831004  CusA/CzcA family heavy metal efflux RND
XADLMG695_RS03450  2  10  0.710830  hypothetical protein
XADLMG695_RS03455  1  12  1.011647  efflux RND transporter periplasmic adaptor
XADLMG695_RS03460  0  14  0.011859  TolC family protein
XADLMG695_RS03465  0  13  1.332388  hypothetical protein
XADLMG695_RS03470  0  12  2.789045  DUF885 family protein
XADLMG695_RS03475  1  11  3.151956  16S rRNA (guanine(527)-N(7))-methyltransferase
XADLMG695_RS03480  -1  11  2.749726  alkaline phosphatase D family protein
XADLMG695_RS23070  -1  11  3.476506  right-handed parallel beta-helix
XADLMG695_RS03485  -1  12  3.808742  4'-phosphopantetheinyl transferase superfamily
XADLMG695_RS03490  2  18  -1.913038  GlsB/YeaQ/YmgE family stress response membrane
XADLMG695_RS03495  2  20  -3.106434  GFA family protein
XADLMG695_RS03500  2  20  -3.508541  exodeoxyribonuclease III
XADLMG695_RS03510  1  27  -5.437239  ROK family transcriptional regulator
XADLMG695_RS03520  3  11  2.047225  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS03480  ACRIFLAVINRP  757  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 757 bits (1956), Expect = 0.0
Identities = 244/1074 (22%), Positives = 416/1074 (38%), Gaps = 70/1074 (6%)

Query: 5 IIRFAIAQRWLMLALTGVLIAIGAWSFSRLPIDATPDITNVQVQVNTAAPGYSPLESEQR 64
+ F I + L +L+ GA + +LP+ P I V V+ PG +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 65 VTFPLETVLAGLPGLESTRSLS-RYGLSQVTAVFADGTDLYFARQQVAERLQQVKSQLPA 123
VT +E + G+ L S S G +T F GTD A+ QV +LQ LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 124 DLEPQLGPIATGLGEIFMYTVEAKPNARKPDGSAWTATDLRTLQDWVVRPQLRNVPGVTE 183
+++ Q + M D T D+ V+ L + GV +
Sbjct: 121 EVQQQGISVEKSSSSYLMVA------GFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGD 174

Query: 184 VNTIGGYARQIHITPDPARLVALGFTLDEVAQAVESNNRNIGAGYI------ERNGQQFL 237
V G + I D L T +V ++ N I AG +
Sbjct: 175 VQLFGA-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233

Query: 238 VRVPGQVDDIAQIGAIVLD-RRAGVPIRVRDVAQVGEGRELRTGAATQDGSEVVLGTVFM 296
+ + + + G + L G +R++DVA+V G E A +G + +
Sbjct: 234 IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKL 293

Query: 297 LVGANSRTVAQAAAQRLEVANASLPAGVQAVPVYDRTALVDRTIVTVAKNLIEGALLVIV 356
GAN+ A+A +L P G++ + YD T V +I V K L E +LV +
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353

Query: 357 VLFLLLGNVRAALITAAVIPLAMLFTLTGMVRGGVSGNLMSLG--ALDFGLIVDGAVIIV 414
V++L L N+RA LI +P+ +L T + G S N +++ L GL+VD A+++V
Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413

Query: 415 ENCLRRFGEAQLRLGRVLERDERFELTAEASAEVIRPSLFGLGIITAVYLPVFALTGIEG 474
EN R E +L E T ++ +++ + +++AV++P+ G G
Sbjct: 414 ENVERVMMEDKLP---------PKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464

Query: 475 KMFHPMAITVVLALTGAMLLSLTFVPAAIALLLGGKVAEHE----------NRAMRWARG 524
++ +IT+V A+ ++L++L PA A LL AEH N +
Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVN 524

Query: 525 VYAPLLDRALRHSRWVGIAALATVALCAVLATRLGSEFIPNLDEGDIALHALRIPGTSLE 584
Y + + L + + VA VL RL S F+P D+G G + E
Sbjct: 525 HYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQE 584

Query: 585 --QAITMQSTLEKRIKQFPEVAHVFGKLGTAEVATDPMPPSVADTFLIMHPRAQWPDPRK 642
Q + Q T + V VF G + + F+ + P +
Sbjct: 585 RTQKVLDQVTDYYLKNEKANVESVFTVNG---FSFSGQAQNAGMAFVSLKPWEERNGDEN 641

Query: 643 PKAQLLAEIEEAVKQLPGNNYEFTQPIQM-RMNELISGVRADVA-IKVYGDDLDTLVTLG 700
++ + + ++ F P M + EL + D I G D L
Sbjct: 642 SAEAVIHRAKMELGKIRDG---FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQAR 698

Query: 701 QRVQEIASAVPGA-ADVSLEQATGLPMLAVVPDRAALAGYGLNPGVVQDTVAAAVGGQEA 759
++ +A+ P + V + D+ G++ + T++ A+GG
Sbjct: 699 NQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYV 758

Query: 760 GQLFEGDRRFDIVVRLPEALRQDPTALADLPIPLRGDGERADADESSRAAGWRSGEPSTV 819
+ R + V+ R P + L + +GE V
Sbjct: 759 NDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSA-NGE-------------------MV 798

Query: 820 PLREVAKVDTVLGPNQINREDGKRRIVITANVRDRDLGGFVAEVQQRVKAQVKLPTGYWI 879
P V G ++ R +G + I G + + + KLP G
Sbjct: 799 PFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLA--SKLPAGIGY 856

Query: 880 GYGGTFEQLISASQRLAWVVPGTLLLIFALLYWSFGSLRDALVVFSGVPLALTGGVVALA 939
+ G Q + + +V + +++F L + S + V VPL + G ++A
Sbjct: 857 DWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAAT 916

Query: 940 LRGLALSISAGVGFIALSGVAVLNGLVMIAFVRSL-RAEGMPLEQALREGALARLRPVLM 998
L + VG + G++ N ++++ F + L EG + +A RLRP+LM
Sbjct: 917 LFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILM 976

Query: 999 TALVAALGFVPMAFNVGAGAEVQRPLATVVIGGIVSSTLLTLLVLPVLYRWLHR 1052
T+L LG +P+A + GAG+ Q + V+GG+VS+TLL + +PV + + R
Sbjct: 977 TSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030



Score = 79.5 bits (196), Expect = 4e-17
Identities = 75/361 (20%), Positives = 137/361 (37%), Gaps = 35/361 (9%)

Query: 708 SAVPGAADVSLEQATGLPMLAVVPDRAALAGYGLNPGVVQDTVAAAVGGQEAGQLFEGDR 767
S + G DV L + + D L Y L P V + + AGQL
Sbjct: 167 SRLNGVGDVQL--FGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQL----- 219

Query: 768 RFDIVVRLPEALRQDPTALADLPIPLRGDGERADADESSRAAGWRSGEPSTVPLREVAKV 827
P Q A + + +E + + + S V L++VA+V
Sbjct: 220 -----GGTPALPGQQLNAS------IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARV 268

Query: 828 -DTVLGPNQINREDGKRRIVITANVRDRDLGGFVAEVQQRVKAQVK-----LPTGYWIGY 881
N I R +GK + + G + + +KA++ P G + Y
Sbjct: 269 ELGGENYNVIARINGKPAAGLGIKLAT---GANALDTAKAIKAKLAELQPFFPQGMKVLY 325

Query: 882 GGTFEQLISASQRLAWVVPGTLL----LIFALLYWSFGSLRDALVVFSGVPLALTGGVVA 937
+ S V TL L+F ++Y ++R L+ VP+ L G
Sbjct: 326 PYDTTPFVQLSIH---EVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAI 382

Query: 938 LALRGLALSISAGVGFIALSGVAVLNGLVMIAFV-RSLRAEGMPLEQALREGALARLRPV 996
LA G +++ G + G+ V + +V++ V R + + +P ++A + +
Sbjct: 383 LAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGAL 442

Query: 997 LMTALVAALGFVPMAFNVGAGAEVQRPLATVVIGGIVSSTLLTLLVLPVLYRWLHRERAP 1056
+ A+V + F+PMAF G+ + R + ++ + S L+ L++ P L L + +
Sbjct: 443 VGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSA 502

Query: 1057 R 1057

Sbjct: 503 E 503



Score = 76.4 bits (188), Expect = 4e-16
Identities = 85/523 (16%), Positives = 159/523 (30%), Gaps = 40/523 (7%)

Query: 3 TNIIRFAIAQRWLMLALTGVLIAIGAWSFSRLPIDATPDITNVQVQVNTAAPGYSPLESE 62
TN + + L + +++A F RLP P+ P + E
Sbjct: 527 TNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERT 586

Query: 63 QRVTFPLETVLAGLPGLESTRSLSRYGLSQVTAVFADGTDLYFARQQVAERLQQVKSQ-- 120
Q+V + + G S G + + + ER S
Sbjct: 587 QKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAG-MAFVSLKPWEERNGDENSAEA 645

Query: 121 LPADLEPQLGPIATGLGEIFMYTVEAKPNARKPDGSAWT---ATDLRTLQDWVVRPQL-- 175
+ + +LG I G F + L R QL
Sbjct: 646 VIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQA--RNQLLG 703

Query: 176 ---RNVPGVTEVNTIGGY-ARQIHITPDPARLVALGFTLDEVAQAVESNNRNIGAGYIER 231
++ + V G Q + D + ALG +L ++ Q + +
Sbjct: 704 MAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFID 763

Query: 232 NGQQFLVRVPGQVD---DIAQIGAIVLDRRAGVPIRVRDVAQVGEGRELRTGAATQDGSE 288
G+ + V + + + G + + +
Sbjct: 764 RGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVY-----GSPRLERY 818

Query: 289 VVLGTVFMLVGANSRTVAQAAAQRLEVANASLPAGVQAVPVYDRTALVDRTIVTVAKNLI 348
L ++ + A T + A +E + LPAG+ YD T + + ++ +
Sbjct: 819 NGLPSMEIQGEAAPGTSSGDAMALMENLASKLPAGIG----YDWTGMSYQERLSGNQAPA 874

Query: 349 EGAL---LVIVVLFLLLGNVRAALITAAVIPLAMLFTLTGMVRGGVSGNLMSLGAL--DF 403
A+ +V + L L + + V+PL ++ L ++ + L
Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTI 934

Query: 404 GLIVDGAVIIVENCLRRFGEAQLRLGRVLERDERFELTAEASAEVIRPSLFGLGIITAVY 463
GL A++IVE + + G+ E T A +RP L
Sbjct: 935 GLSAKNAILIVE----FAKDLMEKEGK-----GVVEATLMAVRMRLRPILMTSLAFILGV 985

Query: 464 LPVFALTGIEGKMFHPMAITVVLALTGAMLLSLTFVPAAIALL 506
LP+ G + + I V+ + A LL++ FVP ++
Sbjct: 986 LPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVI 1028


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS03485  RTXTOXIND  34  7e-04  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 34.4 bits (79), Expect = 7e-04
Identities = 22/125 (17%), Positives = 38/125 (30%), Gaps = 8/125 (6%)

Query: 171 AVGAGSIADQHEVQGLLTPAEGAQAQATARFPGPVRSLRVNVGDQVRA-GQVLATVESNL 229
V + +++ + A+ T F + D + LA E
Sbjct: 269 RVYKSQLE---QIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQ 325

Query: 230 SLTTYSVSAPISGTVLARNA-SLGSNAGEGQALFEIA-DLSTLWVDLHIFGADAGHITAG 287
+ + AP+S V + G + L I + TL V + D G I G
Sbjct: 326 QASV--IRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVG 383

Query: 288 APVTV 292
+
Sbjct: 384 QNAII 388


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS03520  ENTSNTHTASED  29  0.010  Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 29.2 bits (65), Expect = 0.010
Identities = 21/71 (29%), Positives = 34/71 (47%), Gaps = 2/71 (2%)

Query: 68 SHSGEYLLVGLGQGVRLGVDLERIRARPRVLEIAQRFFHPDEIALLAALAPDAQHALFFR 127
SH L + + R+G+D+E+I ++ E+A DE +L A AL
Sbjct: 89 SHCATTALAVISRQ-RIGIDIEKIMSQHTATELAPSIIDSDERQILQASLLPFPLALTL- 146

Query: 128 LWCAKEALLKA 138
+ AKE++ KA
Sbjct: 147 AFSAKESVYKA 157


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS03530  SECYTRNLCASE  26  0.018  Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 25.9 bits (57), Expect = 0.018
Identities = 16/83 (19%), Positives = 33/83 (39%), Gaps = 2/83 (2%)

Query: 3 IIIWLIVGG-IVGWLASIIMRRDAQQGIILNVVVGIVGALIAGFL-FGGGINQAITLWTF 60
++I + G +V WL +I R G+ + + + I + A F
Sbjct: 163 MVICMTAGTCVVMWLGELITDRGIGNGMSILMFISIAATFPSALWAIKKQGTLAGGWIEF 222

Query: 61 VWSLVGAVILLAIVNLVTRGRLR 83
+ +I++A+V V + + R
Sbjct: 223 GTVIAVGLIMVALVVFVEQAQRR 245


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS03550  CHANLCOLICIN  27  0.050  Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 27.0 bits (59), Expect = 0.050
Identities = 21/74 (28%), Positives = 31/74 (41%), Gaps = 1/74 (1%)

Query: 8 RTAAARGDAAAQRYLLAQRAADLMQRAVAAAPAGTQPTLSPDAEREVAVIVSELEALALA 67
A A+ A A R L QR D++ A+ A P+ + A A + +E E L LA
Sbjct: 75 AAAEAQAKAKANRDALTQRLKDIVNEALRHN-ASRTPSATELAHANNAAMQAEDERLRLA 133

Query: 68 GHRDAIDTLAQVVE 81
+ A+ E
Sbjct: 134 KAEEKARKEAEAAE 147


13  XADLMG695_RS03870  XADLMG695_RS03890  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS03870  0  11  4.109800  class II aldolase/adducin family protein
XADLMG695_RS21755  2  15  5.212438  FGGY-family carbohydrate kinase
XADLMG695_RS03875  1  14  4.592463  DUF2147 domain-containing protein
XADLMG695_RS03880  1  15  4.555172  SMP-30/gluconolactonase/LRE family protein
XADLMG695_RS03885  1  15  4.637075  endo-1,4-beta-xylanase
XADLMG695_RS03890  1  15  4.292826  DUF4982 domain-containing protein
14 | XADLMG695_RS03965 | XADLMG695_RS21770 | Y | N | N | Genomic Island
LocusTag | DNBias | CDNBias | %GCBias | Product
XADLMG695_RS03965 | 1 | 16 | -5.349352 | hypothetical protein
XADLMG695_RS03975 | 2 | 39 | -9.552243 | hypothetical protein
XADLMG695_RS03980 | 7 | 51 | -13.520502 | hypothetical protein
XADLMG695_RS03985 | 4 | 50 | -10.883822 | YdcH family protein
XADLMG695_RS03990 | 7 | 56 | -12.503810 | zinc-binding dehydrogenase
XADLMG695_RS23095 | 6 | 44 | -10.067860 | glycerol-3-phosphate 1-O-acyltransferase PlsB
XADLMG695_RS21770 | 5 | 34 | -7.523581 | hypothetical protein
15 | XADLMG695_RS04065 | XADLMG695_RS23115 | Y | N | N | Genomic Island
LocusTag | DNBias | CDNBias | %GCBias | Product
XADLMG695_RS04065 | 1 | 11 | -3.738305 | hypothetical protein
XADLMG695_RS04070 | 4 | 24 | -4.313431 | hypothetical protein
XADLMG695_RS23580 | 5 | 37 | -5.951104 | hypothetical protein
XADLMG695_RS23105 | 3 | 37 | -8.889780 | hypothetical protein
XADLMG695_RS04085 | -1 | 22 | -3.113260 | hypothetical protein
XADLMG695_RS04090 | 0 | 29 | -5.995143 | hypothetical protein
XADLMG695_RS04095 | 5 | 40 | -9.906430 | hypothetical protein
XADLMG695_RS04100 | 8 | 53 | -12.775906 | TonB-dependent receptor
XADLMG695_RS04105 | 4 | 26 | -3.944069 | sugar kinase
XADLMG695_RS04110 | 4 | 20 | -2.809341 | trans-2-enoyl-CoA reductase family protein
XADLMG695_RS23115 | 3 | 17 | -1.912798 | winged helix-turn-helix domain-containing
16 | XADLMG695_RS04220 | XADLMG695_RS04480 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
XADLMG695_RS04220 | 3 | 23 | -4.935344 | TldD/PmbA family protein
XADLMG695_RS21800 | 4 | 33 | -8.143780 | TldD/PmbA family protein
XADLMG695_RS23120 | 1 | 16 | -4.023713 | TldD/PmbA family protein
XADLMG695_RS04230 | -1 | 13 | -2.138528 | MoxR family ATPase
XADLMG695_RS04235 | 0 | 10 | -2.404745 | DUF58 domain-containing protein
XADLMG695_RS21805 | 1 | 12 | -2.553066 | BatA domain-containing protein
XADLMG695_RS04245 | -2 | 10 | -0.317765 | hypothetical protein
XADLMG695_RS04250 | -1 | 10 | 1.359404 | EcsC family protein
XADLMG695_RS04255 | 2 | 13 | 3.844231 | SRPBCC domain-containing protein
XADLMG695_RS04260 | 3 | 13 | 6.107180 | hypothetical protein
XADLMG695_RS04270 | 3 | 11 | 6.692921 | thioredoxin family protein
XADLMG695_RS04275 | 4 | 11 | 6.462459 | host attachment protein
XADLMG695_RS04280 | 2 | 13 | 5.830716 | NUDIX domain-containing protein
XADLMG695_RS04290 | 1 | 23 | -0.042947 | SDR family oxidoreductase
XADLMG695_RS04300 | 0 | 34 | 1.760604 | M4 family metallopeptidase
XADLMG695_RS23125 | 0 | 34 | 2.060186 | hypothetical protein
XADLMG695_RS04305 | -1 | 31 | 2.507843 | SRPBCC family protein
XADLMG695_RS04310 | -1 | 25 | 3.180614 | HDOD domain-containing protein
XADLMG695_RS04315 | 0 | 25 | 3.486863 | CsbD family protein
XADLMG695_RS04320 | 1 | 24 | 3.432452 | hypothetical protein
XADLMG695_RS04325 | 2 | 12 | 3.903441 | hypothetical protein
XADLMG695_RS04330 | 1 | 13 | 3.671735 | hypothetical protein
XADLMG695_RS04340 | 3 | 26 | -2.751396 | hypothetical protein
XADLMG695_RS04345 | 5 | 36 | -5.731112 | DUF2971 domain-containing protein
XADLMG695_RS04355 | 5 | 39 | -6.950013 | NAD(P)H-dependent oxidoreductase
XADLMG695_RS04360 | 8 | 48 | -10.662680 | hypothetical protein
XADLMG695_RS21830 | 5 | 47 | -9.956907 | hypothetical protein
XADLMG695_RS04365 | 4 | 49 | -10.288314 | LysR family transcriptional regulator
XADLMG695_RS23130 | 1 | 38 | -5.836751 | SDR family oxidoreductase
XADLMG695_RS04375 | 0 | 25 | -3.524849 | SDR family NAD(P)-dependent oxidoreductase
XADLMG695_RS04380 | 0 | 21 | -2.585477 | type II toxin-antitoxin system Phd/YefM family
XADLMG695_RS04385 | -1 | 18 | -2.145572 | hemolysin III family protein
XADLMG695_RS04390 | 0 | 13 | 0.989961 | DEAD/DEAH box helicase
XADLMG695_RS04395 | 0 | 14 | 0.834737 | ribonuclease H-like domain-containing protein
XADLMG695_RS04400 | 1 | 16 | 0.410896 | avirulence protein
XADLMG695_RS04405 | 1 | 13 | 1.066233 | ROK family protein
XADLMG695_RS04410 | 1 | 13 | 1.256703 | TonB-dependent receptor
XADLMG695_RS04415 | 2 | 13 | 2.028145 | DUF1868 domain-containing protein
XADLMG695_RS21855 | -1 | 10 | 0.784947 | saccharopine dehydrogenase NADP-binding
XADLMG695_RS04425 | 0 | 11 | 1.258992 | hypothetical protein
XADLMG695_RS04430 | -1 | 10 | 1.542436 | hypothetical protein
XADLMG695_RS04435 | -1 | 10 | 1.399550 | hypothetical protein
XADLMG695_RS04440 | -1 | 9 | 1.510610 | BlaI/MecI/CopY family transcriptional regulator
XADLMG695_RS04450 | -1 | 8 | 1.014565 | LysR family transcriptional regulator
XADLMG695_RS04455 | 0 | 10 | 2.017839 | NmrA family NAD(P)-binding protein
XADLMG695_RS04460 | 2 | 11 | 2.485022 | hypothetical protein
XADLMG695_RS04465 | 4 | 16 | 2.623700 | DNA topoisomerase IB
XADLMG695_RS04470 | 4 | 15 | 2.868213 | Elastase inhibitor AFLEI Flags: Precursor
XADLMG695_RS04475 | 3 | 12 | 2.688470 | glutamate synthase large subunit
XADLMG695_RS04480 | 2 | 10 | 2.841408 | FAD-dependent oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS04265 | HTHFIS | 37 | 1e-04 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 36.7 bits (85), Expect = 1e-04
Identities = 34/164 (20%), Positives = 60/164 (36%), Gaps = 14/164 (8%)

Query: 9 LAAQRAQLGTLRAAL--AQAVVGQDAVVEQLL--IGLLAGG--HCLLEGAPGLGKTLLVR 62
LA + + L +VG+ A ++++ + L ++ G G GK L+ R
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVAR 178

Query: 63 SLGQA---LELQFRRVQ---FTPDLMPSDILGTELLEEDHGTGHRHFRFQQGPIFTNLLL 116
+L F + DL+ S++ G E RF+Q T L
Sbjct: 179 ALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGT--LF 236

Query: 117 ADELNRTPPKTQAALLEAMSERTVSYAGTTYALPAPFFVLATQN 160
DE+ P Q LL + + + G + + ++A N
Sbjct: 237 LDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATN 280


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS04335 | THERMOLYSIN | 286 | 1e-94 | Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 286 bits (732), Expect = 1e-94
Identities = 118/288 (40%), Positives = 164/288 (56%), Gaps = 23/288 (7%)

Query: 76 YDAEQGTALPGTLVRD--EGAPATQDVAVTEAYDYLGATHDFFQTVYGRNSIDAAGMPLI 133
YD T LPG+L D A+ D A +A+ Y G +D+++ V+GR S D + +
Sbjct: 270 YDGRNRTVLPGSLWADGDNQFFASYDAAAVDAHYYAGVVYDYYKNVHGRLSYDGSNAAIR 329

Query: 134 GTVHYERGYDNAFWNGEQMVFGDGDGEVFNRFTIALDVVGHELTHGVTERTANLIYQGQS 193
TVHY RGY+NAFWNG QMV+GDGDG+ F F+ +DVVGHELTH VT+ TA L+YQ +S
Sbjct: 330 STVHYGRGYNNAFWNGSQMVYGDGDGQTFLPFSGGIDVVGHELTHAVTDYTAGLVYQNES 389

Query: 194 GALNESISDVFGVLIKQYTLRQSADQADWIIGAGLLMPGIQGVGLRSMQAPGSAYDDPAL 253
GA+NE++SD+FG L++ Y + DW IG + PG+ G LRSM P
Sbjct: 390 GAINEAMSDIFGTLVEFY----ANRNPDWEIGEDIYTPGVAGDALRSMSDP--------- 436

Query: 254 GKDPQPATMAGYVDTQEDDGGVHYNSGIPNHAFYRAA-------VAIGGAAWEKTGRIWY 306
K P + +D+GGVH NSGI N A Y + V++ G +K G+I+Y
Sbjct: 437 AKYGDPDHYSKRYTGTQDNGGVHTNSGIINKAAYLLSQGGVHYGVSVTGIGRDKMGKIFY 496

Query: 307 RALTGGELAAGADFATFADLTASVASADYGANSREAVAVRQAWRDVGV 354
RAL L ++F+ A+ YG+ S+E +V+QA+ VGV
Sbjct: 497 RALV-YYLTPTSNFSQLRAACVQAAADLYGSTSQEVNSVKQAFNAVGV 543


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS04410 | DHBDHDRGNASE | 104 | 3e-29 | 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 104 bits (261), Expect = 3e-29
Identities = 81/249 (32%), Positives = 123/249 (49%), Gaps = 12/249 (4%)

Query: 6 KVALVTGASRGIGAAIAQRLAGDGFAVVLNYAGHADEADQQVRSIEAAGGRAIGVQADVS 65
K+A +TGA++GIG A+A+ LA G A + + ++ ++ V S++A A ADV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQG-AHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 66 DPAAVERLFAAAEAAFGGVDVLVNNAGIMQLATLADSDDGLFDKHIAINLKGTFNTLRQA 125
D AA++ + A E G +D+LVN AG+++ + D ++ ++N G FN R
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 126 AR--RLRNGGRIVNLSTSVVGLKLETYGVYAATKAAVETLTAILSKELRGRAITVNAVAP 183
++ R G IV + ++ G+ + YA++KAA T L EL I N V+P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 184 GPTGTA----LFLDGKSPELI-----ERLSKANPLERLGCPDDIAAAVAFLVGPDGGWIN 234
G T T L+ D E + E PL++L P DIA AV FLV G I
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247

Query: 235 GQVLRANGG 243
L +GG
Sbjct: 248 MHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS04415 | DHBDHDRGNASE | 74 | 1e-17 | 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 74.3 bits (182), Expect = 1e-17
Identities = 47/188 (25%), Positives = 82/188 (43%), Gaps = 8/188 (4%)

Query: 3 KRILVTGASSGFGRLAAQALAAAGHTVYASMRDTAGRNAGVAQAMAELADKQQLALHTVE 62
K +TGA+ G G A+ LA+ G + A + V+ AE +
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF-----P 63

Query: 63 LDVQSQASADAAVASIVAQAGGLDVVVHNAGHMVFGPAEAFTAEQLAQVYDINVLGTQRV 122
DV+ A+ D A I + G +D++V+ AG + G + + E+ + +N G
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 123 NRAALPQLRAQQQGLLVWVSSSSSAGGTPPY-LGPYFAAKAAMDALAVQYARELARWGIE 181
+R+ + ++ G +V V S+ G P + Y ++KAA ELA + I
Sbjct: 124 SRSVSKYMMDRRSGSIVTV--GSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181

Query: 182 TSIIVPGA 189
+I+ PG+
Sbjct: 182 CNIVSPGS 189


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS04450 | TYPE3IMRPROT | 31 | 0.017 | Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 30.9 bits (70), Expect = 0.017
Identities = 6/33 (18%), Positives = 10/33 (30%)

Query: 418 YYTGGFDQFLSNLYKHYQINPLHSQDAPRRAAL 450
G +S L + P+ + A L
Sbjct: 138 LTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFL 170


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS04485 | GPOSANCHOR | 52 | 5e-09 | Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 52.0 bits (124), Expect = 5e-09
Identities = 27/144 (18%), Positives = 57/144 (39%), Gaps = 4/144 (2%)

Query: 439 AVHDALAADNHDLELQQDRMQAAQEALQDAHEELASLGPELAQAKQEAQQQAREAEQQIR 498
A +A LE ++ + A + L+ A E + + + + + E +
Sbjct: 204 NFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQA 263

Query: 499 EMTQQHRQAQYAYAAAVRQADALSRRQVEM-AKQAALQGRAEAERGQREAAQAQRQA--- 554
E+ + A A + L + + A++A L+ +++ R++ + A
Sbjct: 264 ELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASRE 323

Query: 555 AQAQVEAEKARAEADRAQAEAERA 578
A+ Q+EAE + E +EA R
Sbjct: 324 AKKQLEAEHQKLEEQNKISEASRQ 347



Score = 50.1 bits (119), Expect = 2e-08
Identities = 31/153 (20%), Positives = 57/153 (37%), Gaps = 6/153 (3%)

Query: 431 DASRDAQAAVHDALAADNHDLELQQDRMQAAQEALQDAHEELASLGPELAQAKQEAQQQA 490
+ + + A +A LE ++ ++A Q L+ A E + AK + +
Sbjct: 231 EKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFS-TADSAKIKTLEAE 289

Query: 491 REAEQQIREMTQQHRQAQYAYAAAVRQADALSRR-----QVEMAKQAALQGRAEAERGQR 545
+ A + + + Q A ++R+ SR + E K +EA R
Sbjct: 290 KAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSL 349

Query: 546 EAAQAQRQAAQAQVEAEKARAEADRAQAEAERA 578
+ A+ Q+EAE + E +EA R
Sbjct: 350 RRDLDASREAKKQLEAEHQKLEEQNKISEASRQ 382



Score = 32.0 bits (72), Expect = 0.008
Identities = 21/66 (31%), Positives = 36/66 (54%), Gaps = 1/66 (1%)

Query: 431 DASRDAQAAVHDALAADNHDLELQQDRMQAAQEALQDAHEELASLGPEL-AQAKQEAQQQ 489
DASR+A+ V AL N L + + +E+ + +E A L +L A+AK ++
Sbjct: 389 DASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEKL 448

Query: 490 AREAEQ 495
A++AE+
Sbjct: 449 AKQAEE 454


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS04500 | NUCEPIMERASE | 35 | 2e-04 | Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 35.1 bits (81), Expect = 2e-04
Identities = 28/128 (21%), Positives = 42/128 (32%), Gaps = 32/128 (25%)

Query: 1 MSILVTGATGTVGSLVTQGLADAGAQV--------------KALVRQQGKRPFPAGVTEV 46
M LVTGA G +G V++ L +AG QV K + +P G
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQP---GFQFH 57

Query: 47 VADLTDVASMRAALA--PVLTLFLLNAVT--------PDEVTQALIA-----LNLAKEAG 91
DL D M A +F+ P + + L +
Sbjct: 58 KIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK 117

Query: 92 IERIVYLS 99
I+ ++Y S
Sbjct: 118 IQHLLYAS 125


17 | XADLMG695_RS04835 | XADLMG695_RS05005 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
XADLMG695_RS04835 | -1 | 17 | 3.160521 | tetratricopeptide repeat protein
XADLMG695_RS04840 | -1 | 17 | 3.552314 | YbhB/YbcL family Raf kinase inhibitor-like
XADLMG695_RS04845 | -1 | 15 | 3.804392 | NAD(P)-dependent alcohol dehydrogenase
XADLMG695_RS04850 | -1 | 15 | 3.425495 | META and DUF4377 domain-containing protein
XADLMG695_RS04855 | -2 | 17 | 2.349161 | undecaprenyl-diphosphate phosphatase
XADLMG695_RS04860 | 0 | 18 | 0.654057 | type I glutamate--ammonia ligase
XADLMG695_RS04865 | 1 | 19 | 0.768664 | P-II family nitrogen regulator
XADLMG695_RS04875 | 1 | 21 | 1.394575 | ammonium transporter
XADLMG695_RS04880 | 0 | 20 | 2.092740 | histidine kinase
XADLMG695_RS04885 | -1 | 17 | 3.266914 | nitrogen regulation protein NR(I)
XADLMG695_RS04890 | -2 | 14 | 1.727582 | superoxide dismutase family protein
XADLMG695_RS04895 | -1 | 13 | 0.973516 | superoxide dismutase family protein
XADLMG695_RS04900 | -1 | 13 | 0.457389 | hypothetical protein
XADLMG695_RS04905 | -1 | 10 | -0.108642 | hypothetical protein
XADLMG695_RS04910 | -1 | 10 | -0.257092 | hypothetical protein
XADLMG695_RS23520 | 1 | 10 | 0.120775 | acetyl-CoA C-acetyltransferase
XADLMG695_RS23525 | 3 | 8 | 2.287829 | porphyrin biosynthesis protein
XADLMG695_RS04925 | 3 | 9 | 2.351381 | uroporphyrinogen-III C-methyltransferase
XADLMG695_RS04930 | 4 | 7 | 2.313293 | uroporphyrinogen-III synthase
XADLMG695_RS04935 | 6 | 9 | 2.344629 | glycosyltransferase family 25 protein
XADLMG695_RS04940 | 4 | 9 | 2.251787 | YiiD C-terminal domain-containing protein
XADLMG695_RS04945 | 2 | 11 | 1.252341 | hypothetical protein
XADLMG695_RS04950 | -2 | 12 | 0.912530 | rhodanese-like domain-containing protein
XADLMG695_RS04960 | 0 | 11 | -0.259071 | protein-export chaperone SecB
XADLMG695_RS04965 | -2 | 11 | 0.318291 | NAD(P)-dependent glycerol-3-phosphate
XADLMG695_RS04970 | -1 | 12 | 0.759876 | Ax21 family protein
XADLMG695_RS04975 | -1 | 13 | 1.654572 | ubiquinone-dependent pyruvate dehydrogenase
XADLMG695_RS04980 | -1 | 14 | 1.435743 | sigma-54-dependent Fis family transcriptional
XADLMG695_RS04990 | 2 | 14 | 2.984626 | hypothetical protein
XADLMG695_RS05000 | 3 | 12 | 3.295206 | hypothetical protein
XADLMG695_RS05005 | 2 | 10 | 1.504789 | MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS04855 | SYCDCHAPRONE | 35 | 3e-04 | Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 35.3 bits (81), Expect = 3e-04
Identities = 19/102 (18%), Positives = 31/102 (30%), Gaps = 3/102 (2%)

Query: 96 DPNQFNAYVMQAHLAVARGDLDEAERLSRTAARLAPEHPQLLAVDGVVEMRRGHSDRALA 155
Q + + + G ++A ++ + L + G G D A+
Sbjct: 35 TLEQLYSLAFNQYQS---GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIH 91

Query: 156 LLTRAAEQLPDDARVLFSLGFAYLQKEHFAFAERAFERVIEL 197
+ A + R F LQK A AE EL
Sbjct: 92 SYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS04900 | HTHFIS | 495 | e-175 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 495 bits (1276), Expect = e-175
Identities = 206/471 (43%), Positives = 283/471 (60%), Gaps = 14/471 (2%)

Query: 8 SHIWVVDDDRSVRFVLSTALRDAGYAVDGFDSAAAALQALAMRPTPDLLFTDVRMPGEDG 67
+ I V DDD ++R VL+ AL AGY V +AA + +A DL+ TDV MP E+
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA-AGDGDLVVTDVVMPDENA 62

Query: 68 LTLLDKLKSKHPQLPVIVMSAYTDVASTAGAFRGGAHEFLSKPFDLDDAVALAARALPDA 127
LL ++K P LPV+VMSA + A GA+++L KPFDL + + + RAL +
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 128 DAGVEEILATRLAEGSASLIGDTPAMQALFRAIGRLAQAPLSVLINGETGTGKELVARAL 187
++ ++ L+G + AMQ ++R + RL Q L+++I GE+GTGKELVARAL
Sbjct: 123 KRRPSKLEDD--SQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 188 HNESPRARKPFVALNTAAIPAELLESELFGHETGAFTGATKRHIGRFEQADGGTLFLDEI 247
H+ R PFVA+N AAIP +L+ESELFGHE GAFTGA R GRFEQA+GGTLFLDEI
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 248 GDMPLPLQTRLLRVLAENEFFRVGGRELIRVDVRVIAATHQDLEALVEQGRFRADLLHRL 307
GDMP+ QTRLLRVL + E+ VGGR IR DVR++AAT++DL+ + QG FR DL +RL
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 308 DVVRLQLPPLRERRGDIAQLAENFLAMAGRKLDMLPKRLSSAALEDLRQYDWPGNVRELE 367
+VV L+LPPLR+R DI L +F+ A K + KR ALE ++ + WPGNVRELE
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFVQQA-EKEGLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 368 NVCWRLAALATSDIIDVVDV---------DAALARGGRRHRSGRSDGQWDDMLSSWAAQR 418
N+ RL AL D+I + D+ + + R S ++ + + A
Sbjct: 360 NLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419

Query: 419 LSE-GAQGLHAEARERLDRTLLEAALQLTQGRRAEAAARLGLGRNTVTRKL 468
GL+ ++ L+ AAL T+G + +AA LGL RNT+ +K+
Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKI 470


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS04970 | SECBCHAPRONE | 195 | 5e-67 | Bacterial protein-transport SecB chaperone protein ...
		>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein signature.

Length = 170

Score = 195 bits (498), Expect = 5e-67
Identities = 64/160 (40%), Positives = 99/160 (61%), Gaps = 3/160 (1%)

Query: 1 MSDEILNGAAAPADAAAGPAFTIEKIYVKDVSFESPNAPAVFNDANQPELQLNLNQKVQR 60
MS+E AA A P I++IYVKDVSFE+PN P +F +P+L +L+ + ++
Sbjct: 1 MSEENQVNAAD-TQATQQPVLQIQRIYVKDVSFEAPNLPHIFQQDWEPKLSFDLSTEAKQ 59

Query: 61 LNDNAFEVVLAVTLTCTA--GGKTAYVAEVQQAGVFGLVGLDPQAIDVLLGTQCPNILFP 118
+ D+ +EV L +++ T G A++ EV+QAGVF + GL+ + L +QCPN+LFP
Sbjct: 60 VGDDLYEVCLNISVETTMESSGDVAFICEVKQAGVFTISGLEEMQMAHCLTSQCPNMLFP 119

Query: 119 YVRTLVSDLIQAGGFPPFYLQPINFEALYAETLRQRQNEG 158
Y R LVS L+ G FP L P+NF+AL+ + L++++
Sbjct: 120 YARELVSSLVNRGTFPALNLSPVNFDALFMDYLQRQEQAE 159


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS04980 | OUTRMMBRANEA | 28 | 0.034 | Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 27.6 bits (61), Expect = 0.034
Identities = 20/94 (21%), Positives = 32/94 (34%), Gaps = 10/94 (10%)

Query: 49 KASYAIAPNFHVFGDYSKQ--NADDNNNVFENTDSDFQQWGV-GVGFNHEIATSTDFVAR 105
K Y I + ++ AD +NV + D V G E A + + R
Sbjct: 103 KLGYPITDDLDIYTRLGGMVWRADTKSNV-YGKNHDTGVSPVFAGGV--EYAITPEIATR 159

Query: 106 VAYRKL----DLDTPNINFDGYSVEAGLRNAFGE 135
+ Y+ D T D + G+ FG+
Sbjct: 160 LEYQWTNNIGDAHTIGTRPDNGMLSLGVSYRFGQ 193


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS04995 | HTHFIS | 463 | e-163 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 463 bits (1193), Expect = e-163
Identities = 178/478 (37%), Positives = 262/478 (54%), Gaps = 37/478 (7%)

Query: 2 ARILIIDDDAAFRTTLQVTLRSLGHAVVAAENGPDGLARLSEGGIDMAFVDFRMPGMDGI 61
A IL+ DDDAA RT L L G+ V N ++ G D+ D MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 AVLRARLDDAQARQVPLVMLTAHVSSGNTIEAMTLGAFDHLVKPVGRADIVEVVERALLS 121
+L R+ A+ +P+++++A + I+A GA+D+L KP +++ ++ RAL
Sbjct: 64 DLLP-RIKKARPD-LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 122 RADAQAAAADSSPAPVEDDDALVGHSPAMRTVHKRIGLAAASDLPVLITGETGTGKELAA 181
+ +D LVG S AM+ +++ + +DL ++ITGE+GTGKEL A
Sbjct: 122 PKRRPSKL----EDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVA 177

Query: 182 RALHRASPRASAPFVAVNCAAIPLELMESELFGHRKGAFSGASSDRRGLIREADGGTLFL 241
RALH R + PFVA+N AAIP +L+ESELFGH KGAF+GA + G +A+GGTLFL
Sbjct: 178 RALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFL 237

Query: 242 DEIGDMPLPMQAKLLRFLQEGEVTPLGGSGPQKVDVRVLAATHRDLAACVADGRFRSDLR 301
DEIGDMP+ Q +LLR LQ+GE T +GG P + DVR++AAT++DL + G FR DL
Sbjct: 238 DEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLY 297

Query: 302 YRLNVVPIELPPLPERGQDILLLAQHFL---SADAARAQSLSPAAQERLLAHRWPGNVRE 358
YRLNVVP+ LPPL +R +DI L +HF+ + + A E + AH WPGNVRE
Sbjct: 298 YRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRE 357

Query: 359 LRNVMQRSQVLVRGASIDAADLDD---------------------ALGEAGELPPPQPSA 397
L N+++R L I +++ ++ +A E Q A
Sbjct: 358 LENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFA 417

Query: 398 -------VTGTLPEAVARLETQMIRSALEQSQGNRAEAARRLGIHRQLLYRKLEEYGL 448
+G +A +E +I +AL ++GN+ +AA LG++R L +K+ E G+
Sbjct: 418 SFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS05010 | TCRTETA | 34 | 0.001 | Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 33.6 bits (77), Expect = 0.001
Identities = 64/375 (17%), Positives = 121/375 (32%), Gaps = 12/375 (3%)

Query: 30 PFLSVFLQSKGWSVAAIGTVMSVGGIAGMLATTPAGALVDATRRKRAVVVIGCLAILLAT 89
P L L A G ++++ + GAL D R R V+++ +
Sbjct: 29 PGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGR-RPVLLVSLAGAAVDY 87

Query: 90 ALIWLQPTSSGVVAAQIASALAAAGIGPALTGITLGLVHAHGFDHQLARNQVANHAGNVL 149
A++ P + +I + + A G + G V
Sbjct: 88 AIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDGDERARHFGFMSACFGFGMVA 146

Query: 150 AAVLAGWLGWRYGFAAVFLLTAFFGALALVAVLAIPAATIDHRAARGLATTNGGDALSGW 209
VL G +G + A F A L + + + H+ R + L+ +
Sbjct: 147 GPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCFLLPES--HKGERRPLRREALNPLASF 203

Query: 210 RVLLTCRPLALLAVTLGLFHLGNAAMLPLYGMAIVAAHAGDPSALTATTIVVAQATMVVV 269
R +A L + L L+ + D + + + +
Sbjct: 204 RWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQ 263

Query: 270 ALLAMRWIRVHGHWWVLLVAFMALPLRALVAASVIHGWGVFPVQILDGLGAGLQSVVVPA 329
A++ G L++ +A ++ A GW FP+ +L G + +PA
Sbjct: 264 AMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGG----IGMPA 319

Query: 330 LVARLLQGTGRVNVG--QGAVMTVQGVGAALSPAFGGWL-AHAFGYRVAFLTLGAIALLA 386
L A L + G QG++ + + + + P + A + + + AL
Sbjct: 320 LQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYL 379

Query: 387 VALWAGCRGMLQAAA 401
+ L A RG+ A
Sbjct: 380 LCLPALRRGLWSGAG 394


18 | XADLMG695_RS05335 | XADLMG695_RS05485 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
XADLMG695_RS05335 | -2 | 14 | 3.057553 | ATP-dependent helicase HrpB
XADLMG695_RS05340 | -1 | 15 | 4.028720 | hypothetical protein
XADLMG695_RS05345 | -2 | 15 | 3.666349 | DUF1456 family protein
XADLMG695_RS05350 | -2 | 15 | 3.533991 | hydroxyisourate hydrolase
XADLMG695_RS05355 | -1 | 17 | 1.164673 | FAD-dependent urate hydroxylase HpyO
XADLMG695_RS05360 | 0 | 15 | 1.778485 | 2-oxo-4-hydroxy-4-carboxy-5-ureidoimidazoline
XADLMG695_RS05365 | 3 | 11 | 2.265768 | oxalurate catabolism protein HpxZ
XADLMG695_RS05375 | 4 | 9 | 2.521753 | allantoinase PuuE
XADLMG695_RS05380 | 2 | 9 | 2.004782 | alanine--glyoxylate aminotransferase family
XADLMG695_RS05385 | 1 | 10 | 1.631282 | allantoate amidohydrolase
XADLMG695_RS05390 | 1 | 12 | 2.346233 | LysR family transcriptional regulator
XADLMG695_RS05395 | 0 | 12 | 3.377926 | alpha/beta hydrolase
XADLMG695_RS05400 | 0 | 10 | 2.724672 | hypothetical protein
XADLMG695_RS05405 | 0 | 9 | 2.783425 | gamma-glutamyltransferase family protein
XADLMG695_RS05410 | 2 | 10 | 3.294098 | AtzE family amidohydrolase
XADLMG695_RS05415 | 3 | 11 | 3.753432 | nucleoside hydrolase
XADLMG695_RS05420 | 3 | 12 | 3.327461 | adenosine deaminase
XADLMG695_RS05425 | 2 | 14 | 2.436272 | NCS2 family permease
XADLMG695_RS05430 | 2 | 14 | 2.212848 | oxidoreductase
XADLMG695_RS05435 | 2 | 17 | 2.049983 | aromatic ring-hydroxylating dioxygenase subunit
XADLMG695_RS05440 | -1 | 17 | 0.144267 | LysR family transcriptional regulator
XADLMG695_RS05445 | -1 | 16 | 0.145334 | TatD family hydrolase
XADLMG695_RS05450 | -1 | 16 | 0.218314 | glyoxalase/bleomycin resistance/dioxygenase
XADLMG695_RS05460 | -1 | 19 | -0.314600 | hypothetical protein
XADLMG695_RS05465 | -2 | 17 | 0.293021 | LysR family transcriptional regulator
XADLMG695_RS05470 | -1 | 16 | 2.400888 | MFS transporter
XADLMG695_RS05475 | 0 | 15 | 1.896744 | VOC family protein
XADLMG695_RS05480 | 2 | 12 | 2.087287 | MerR family transcriptional regulator
XADLMG695_RS05485 | 2 | 11 | 2.346061 | alpha/beta hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS05415 | SALSPVBPROT | 32 | 0.006 | Salmonella virulence plasmid 65kDa B protein signature.
		>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature.

Length = 591

Score = 32.0 bits (72), Expect = 0.006
Identities = 12/30 (40%), Positives = 18/30 (60%), Gaps = 3/30 (10%)

Query: 56 GDGFWLIHEPDGRVHAIDACGRAAQAATLD 85
GD FWL+H+ +G +H + G+ A A D
Sbjct: 155 GDDFWLLHDSNGILHLL---GKTAAARLSD 181


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS05475 | TCRTETA | 58 | 2e-11 | Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 57.9 bits (140), Expect = 2e-11
Identities = 95/380 (25%), Positives = 140/380 (36%), Gaps = 47/380 (12%)

Query: 47 VQPLLPEFAHAF-KVDAATASLPLSLATGALALAIFC--AGAVSENLGRRGLMFVSIALA 103
+ P+LP + TA + LA AL GA+S+ GRR ++ VS+A A
Sbjct: 24 IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGA 83

Query: 104 AVLNLIAAFLPHWGALVVVRTLSGIALGGVPAVAMVYLGEELPASK-------MGAATGL 156
AV I A P L + R ++GI G AVA Y+ + + M A G
Sbjct: 84 AVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGF 142

Query: 157 -YVAGNAFGGMSGRIVMSVLTDHTDWRTALAVLSVFDLLCALAFFWLLPPS----RNFVR 211
VAG GG+ G + + L L +LLP S R +R
Sbjct: 143 GMVAGPVLGGLMGGFSP---------HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLR 193

Query: 212 RHGINLRFHLRAWAGHLRDRNLPFLFALPFLLM---GVFVCLYNYAGFRLGGPEFGLSQS 268
R +N R WA + + L A+ F++ V L+ G F +
Sbjct: 194 REALNPLASFR-WARGMT--VVAALMAVFFIMQLVGQVPAALWVI----FGEDRFHWDAT 246

Query: 269 QIGMIFSAYVFGIVSS----SVAGAASDRFGRGPVVTTGIVLCVLGMALTLAHVLALVVA 324
IG+ + FGI+ S + G + R G + G++ G L +
Sbjct: 247 TIGISLA--AFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAF 304

Query: 325 GIVVVTIGFFIAHSAASAWVSRLGGAHRSHAASLYLLAYYAGSSVIGALGGWFW------ 378
I+V+ I A A +SR R L A + +S++G L
Sbjct: 305 PIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASIT 364

Query: 379 QHGGWAALVGLCLTLLMLAL 398
GWA + G L LL L
Sbjct: 365 TWNGWAWIAGAALYLLCLPA 384


19 | XADLMG695_RS05585 | XADLMG695_RS05695 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
XADLMG695_RS05585 | 2 | 13 | -0.368710 | SDR family oxidoreductase
XADLMG695_RS05590 | 2 | 26 | -2.613249 | helix-turn-helix transcriptional regulator
XADLMG695_RS05600 | 4 | 28 | -3.666503 | hypothetical protein
XADLMG695_RS05605 | 9 | 40 | -5.746376 | MAPEG family protein
XADLMG695_RS05610 | 9 | 50 | -6.835420 | hypothetical protein
XADLMG695_RS05615 | 9 | 51 | -8.220635 | hypothetical protein
XADLMG695_RS05620 | 10 | 52 | -10.425880 | hypothetical protein
XADLMG695_RS05625 | 11 | 55 | -10.070176 | SRPBCC domain-containing protein
XADLMG695_RS05630 | 12 | 57 | -10.311787 | hypothetical protein
XADLMG695_RS21925 | 8 | 48 | -8.726118 | hypothetical protein
XADLMG695_RS05635 | 8 | 45 | -8.675125 | hypothetical protein
XADLMG695_RS05640 | 7 | 45 | -7.943407 | hypothetical protein
XADLMG695_RS23165 | 3 | 38 | -4.804843 | hypothetical protein
XADLMG695_RS05645 | 0 | 21 | -3.146098 | dihydroxy-acid dehydratase
XADLMG695_RS21940 | -1 | 13 | -1.372976 | cellulase family glycosylhydrolase
XADLMG695_RS23175 | -1 | 12 | -0.471742 | YkgJ family cysteine cluster protein
XADLMG695_RS05670 | 0 | 11 | 0.168247 | gamma carbonic anhydrase family protein
XADLMG695_RS05675 | 1 | 10 | 0.542795 | hypothetical protein
XADLMG695_RS05680 | 1 | 11 | 1.052930 | MFS transporter
XADLMG695_RS05685 | 1 | 12 | 1.517842 | HD-GYP domain-containing protein
XADLMG695_RS05695 | 2 | 12 | 1.432149 | MarR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS05600 | NUCEPIMERASE | 37 | 4e-05 | Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 37.5 bits (87), Expect = 4e-05
Identities = 26/130 (20%), Positives = 42/130 (32%), Gaps = 35/130 (26%)

Query: 8 ILVTGASGQLGALVVEALLGHLPANRIVA---------TARDTASLAEFAKRDIAVRQAD 58
LVTGA+G +G V + LL +++V + A L A+ + D
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEA--GHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 59 YANPHSLD--------------AAFAGVGRVL-----LVSSNAVGQRVPQHRNVIEAAKR 99
A+ + V L SN G N++E +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTG-----FLNILEGCRH 115

Query: 100 AGVELLAYTS 109
++ L Y S
Sbjct: 116 NKIQHLLYAS 125


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS05710 | TCRTETB | 39 | 2e-05 | Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 39.5 bits (92), Expect = 2e-05
Identities = 30/172 (17%), Positives = 70/172 (40%), Gaps = 2/172 (1%)

Query: 29 LLTMLDGFDVMAMAFTAPHVSADWQLSGKQLGMLFSAGLIGMALGALGLAPLADRIGRRA 88
+L+ + M + + P ++ D+ + +A ++ ++G L+D++G +
Sbjct: 21 ILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80

Query: 89 LTLACLAILTVGMGLSALASTAWQ-LGALRLLTGLGIGGMLACVAVTAGEFSSPRWRNTA 147
L L + I G + + + + L R + G G A V V + R A
Sbjct: 81 LLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKA 140

Query: 148 IVLQVTGYPVGATLGGTIAELLMQQWSWPAVFVLGAVASLLCVPLVLAFLPE 199
L + +G +G I ++ W + ++ + +++ VP ++ L +
Sbjct: 141 FGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLI-PMITIITVPFLMKLLKK 191


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS05715 | cdtoxina | 30 | 0.018 | Cytolethal distending toxin A signature.
		>cdtoxina#Cytolethal distending toxin A signature.

Length = 258

Score = 29.7 bits (66), Expect = 0.018
Identities = 15/40 (37%), Positives = 15/40 (37%), Gaps = 9/40 (22%)

Query: 63 APTSSGASAAPAVPPSPAPAPAPAAPE-----PPEPAAAP 97
PT P P P P P PA P PEP AP
Sbjct: 43 GPTVPS----PDEPGLPLPGPGPALPTNGAIPIPEPGTAP 78


20 | XADLMG695_RS05785 | XADLMG695_RS05810 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
XADLMG695_RS05785 | 1 | 10 | 3.038349 | 3-oxoadipate enol-lactonase
XADLMG695_RS05790 | 1 | 9 | 3.305043 | 4-carboxymuconolactone decarboxylase
XADLMG695_RS05795 | 1 | 10 | 3.042649 | alpha/beta fold hydrolase
XADLMG695_RS05800 | 0 | 11 | 3.223013 | helix-turn-helix domain-containing protein
XADLMG695_RS05805 | 1 | 12 | 3.800216 | serine protein kinase RIO
XADLMG695_RS05810 | -1 | 12 | 3.058492 | alpha/beta hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS05820 | PF06057 | 27 | 0.029 | Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 26.7 bits (59), Expect = 0.029
Identities = 6/27 (22%), Positives = 16/27 (59%)

Query: 55 LPAHTRSLITLSMMIALGHDEEFKLHV 81
+PA R + +++++ +F++HV
Sbjct: 138 MPARYRKNVLGAVLLSPSQSSDFEIHV 164


21 | XADLMG695_RS05905 | XADLMG695_RS06015 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
XADLMG695_RS05905 | 2 | 14 | -1.666362 | hypothetical protein
XADLMG695_RS05915 | 4 | 18 | -3.271037 | CesT family type III secretion system chaperone
XADLMG695_RS23540 | 4 | 24 | -3.160031 | hypothetical protein
XADLMG695_RS05930 | 3 | 24 | -3.446089 | hypothetical protein
XADLMG695_RS05935 | 3 | 24 | -4.057615 | hypothetical protein
XADLMG695_RS21985 | -2 | 30 | -2.241993 | type III secretion system export apparatus
XADLMG695_RS05940 | -3 | 29 | -1.765427 | type III secretion system export apparatus
XADLMG695_RS05945 | -3 | 29 | -1.910700 | type III secretion system cytoplasmic ring
XADLMG695_RS05950 | -2 | 21 | -3.179067 | type III secretion system protein SctP
XADLMG695_RS05960 | -1 | 21 | -1.437753 | FHIPEP family type III secretion protein
XADLMG695_RS05965 | -1 | 21 | -1.051901 | type III secretion system export apparatus
XADLMG695_RS05970 | 1 | 17 | -1.601626 | HrpB1 family type III secretion system apparatus
XADLMG695_RS05975 | -1 | 14 | -2.444985 | type III secretion protein HrpB2
XADLMG695_RS05980 | -2 | 14 | -2.504439 | type III secretion inner membrane ring
XADLMG695_RS05985 | -1 | 15 | -2.265735 | type III secretion protein HrpB4
XADLMG695_RS05990 | -2 | 13 | -3.104700 | type III secretion system stator protein SctL
XADLMG695_RS06000 | -1 | 16 | -1.829731 | type III secretion system ATPase SctN
XADLMG695_RS06005 | 1 | 14 | 0.262483 | type III secretion protein HrpB7
XADLMG695_RS06010 | 2 | 15 | 0.716590 | type III secretion system export apparatus
XADLMG695_RS06015 | 2 | 16 | 0.112895 | type III secretion system outer membrane ring
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS05975 | TYPE3IMQPROT | 62 | 2e-16 | Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 61.7 bits (150), Expect = 2e-16
Identities = 24/78 (30%), Positives = 43/78 (55%)

Query: 4 DDLVRFTSEALLLCLKVSLPVVGVAALAGLLIAFVQAVMSLQDASISFALKLVVVVAAIA 63
DDLV ++AL L L +S VA + GLL+ Q V LQ+ ++ F +KL+ V +
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 VTAPWGASAIMQFGQALM 81
+ + W ++ +G+ ++
Sbjct: 62 LLSGWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
XADLMG695_RS05980 | TYPE3IMPPROT | 246 | 2e-85 | Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 246 bits (630), Expect = 2e-85
Identities = 80/219 (36%), Positives = 130/219 (59%), Gaps = 8/219 (3%)

Query: 3 MPDVGSLLLVVIMLGLLPFAAMVVTSYTKIVVVLGLLRNAIGVQQVPPNMVLNGVALLVS 62
M + SL+ ++ LLPF T + K +V ++RNA+G+QQ+P NM LNGVALL+S
Sbjct: 1 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS 60

Query: 63 CFVMAPVGMEAFKA-AQNYGAGSDNSRVVVLLDACREPFRQFLLKHTREREKAFFMRSAQ 121
FVM P+ +A+ +D S + +D + +R +L+K++ FF +
Sbjct: 61 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL 120

Query: 122 QIWPKDKAAT-------LKSDDLLVLAPAFTLSELTEAFRIGFLLYLVFIVIDLVVANAL 174
+ ++ T ++ + L PA+ LSE+ AF+IGF LYL F+V+DLVV++ L
Sbjct: 121 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL 180

Query: 175 MAMGLSQVTPTNVAIPFKLLLFVAMDGWSMLIHGLVLSY 213
+A+G+ ++P ++ P KL+LFVA+DGW++L GL+L Y
Sbjct: 181 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS05985    TYPE3OMOPROT    64    9e-14    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein

family signature.
Length = 303

Score = 63.9 bits (155), Expect = 9e-14
Identities = 40/177 (22%), Positives = 74/177 (41%), Gaps = 15/177 (8%)

Query: 144 PTQLPAWLAALRVNTRLRIGGRTASAALLQSLRPGDVLLHCTASAAVTSGELLWGIAGGA 203
P LR R IG +LL + GDVLL T+ A V G
Sbjct: 138 PAVGGGRPKMLRWPLRFVIGSSDTQRSLLGRIGIGDVLLIRTSRAEVYCYAKKLG----- 192

Query: 204 VLRAPVRLNLQQMILEATPTMQHDTFE---PDVAPSTSNVAELELPVQLEVDQLALSLST 260
++ I+ T +QH E + A + + +L + ++ + + ++L+
Sbjct: 193 -----HFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYRKNVTLAE 247

Query: 261 LSGLQPGQILELSVPVDQADIRLVVYGQTIGTGRLLAVGEHLGVQILS-MSESTHAD 316
L + Q+L L + ++ ++ G +G G L+ + + LGV+I +SES + +
Sbjct: 248 LEAMGQQQLLSLPTNAEL-NVEIMANGVLLGNGELVQMNDTLGVEIHEWLSESGNGE 303


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS06000    TYPE3IMSPROT    332    e-115    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 332 bits (854), Expect = e-115
Identities = 115/345 (33%), Positives = 191/345 (55%), Gaps = 2/345 (0%)

Query: 1 MSEEKTEKPTEKKLRDARKDGEVPVSPDVTAAAVLFGALLVMKSAGDYFADHVRALMTIG 60
MS EKTE+PT KK+RDARK G+V S +V + A++ ++ DY+ +H LM I
Sbjct: 1 MSGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIP 60

Query: 61 FDFPENTRDAAAINRALGHLGIQGLLLMLPFLAACLIAGVAGGAFQTGLNASLKPVAPKF 120
+ + A++ + ++ ++ L P L + +A Q G S + + P
Sbjct: 61 AE-QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDI 119

Query: 121 DSLNPAAGVKKLFSLRSLINLLKLIIKAILIGVVLWVGIRALMPMIIGLAYETPLDIAQI 180
+NP G K++FS++SL+ LK I+K +L+ +++W+ I+ + ++ L I +
Sbjct: 120 KKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPL 179

Query: 181 AWHTLGMLFALGVLLFVLVGAADWSVQHWLFIRDKRMSKDEQKREFKESEGDPEIKGKRK 240
L L + + FV++ AD++ +++ +I++ +MSKDE KRE+KE EG PEIK KR+
Sbjct: 180 LGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRR 239

Query: 241 EFAKELVFGDPRERVAKAKVMVVNPTHYAVALAYEPDDFGLPQVVAKGVDDGALELRALA 300
+F +E+ + RE V ++ V+V NPTH A+ + Y+ + LP V K D +R +A
Sbjct: 240 QFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIA 299

Query: 301 HNQGIPIVANPPLARALY-QVELGDAIPEPLFETVAVVLRWVDEL 344
+G+PI+ PLARALY + IP E A VLRW++
Sbjct: 300 EEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQ 344


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS06015    FLGMRINGFLIF    79    6e-19    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 79.2 bits (195), Expect = 6e-19
Identities = 43/188 (22%), Positives = 81/188 (43%), Gaps = 11/188 (5%)

Query: 3 ALRCLVVLLVALLLSACSQQ---LYSGLTENDANDMLEVLLHAGVDASKVTPDDGKTWAV 59
A V ++VA++L A + L+S L++ D ++ L + + G A+
Sbjct: 30 AGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPY-RFANGSG---AI 85

Query: 60 NAPHDQVSYSLEVLRAHGLPHEQHANLG-EMFKKDGLISTPTEERVRFIYGVSQQLSQTL 118
P D+V L GLP + +G E+ ++ + E+V + + +L++T+
Sbjct: 86 EVPADKVHELRLRLAQQGLP--KGGAVGFELLDQEKFGISQFSEQVNYQRALEGELARTI 143

Query: 119 SNIDGVISADVEIVLPNNDPLSTSVKPSSAAVFIKFRVGSDLT-SLVPNIKTMVMHSVEG 177
+ V SA V + +P K SA+V + G L + + +V +V G
Sbjct: 144 ETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAG 203

Query: 178 LTYENVSV 185
L NV++
Sbjct: 204 LPPGNVTL 211


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS06035    IGASERPTASE    29    0.009    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.3 bits (65), Expect = 0.009
Identities = 13/75 (17%), Positives = 24/75 (32%), Gaps = 13/75 (17%)

Query: 93 AEQAQAAADQSLQSARDELASVQQALSKLQAQAQV-------------YADKAASARRAR 139
+E + A+ S Q ++ + Q A +V + A S +
Sbjct: 1034 SETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETK 1093

Query: 140 QAQRDAAEEEDAVEA 154
+ Q +E VE
Sbjct: 1094 ETQTTETKETATVEK 1108


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS06040    TYPE3IMRPROT    177    6e-57    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 177 bits (451), Expect = 6e-57
Identities = 52/238 (21%), Positives = 105/238 (44%), Gaps = 3/238 (1%)

Query: 8 LLAISSQGVSLLALLALCGVRVFVMFIVLPATAQDSLPGIARNGVIYVLSSFIAYGQPAD 67
L S Q +S L L +RV + P ++ S+P + G+ +++ IA PA+
Sbjct: 2 LQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPAN 61

Query: 68 ALAKIQTVGLVGVVFKEAFIGLLIGFAASTVFWIAESVGLLIDDLAGYNNVQMTNPLSGQ 127
+ L + ++ IG+ +GF F + G +I G + +P S
Sbjct: 62 DVPVFSFFAL-WLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120

Query: 128 QSTPVSTVLLQLAIVSFYALGGMLLLLGALFESFRWWPLTQLGPNMGSVAESFVIQQSDS 187
++ ++ LA++ F G L L+ L ++F P+ + S A + +
Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGG--EPLNSNAFLALTKAGSL 178

Query: 188 MMTAVVKLSAPVMLVLVLVDLAIGLVARAADKLEPSNLSQPIRGVLALLLLALLTSVF 245
+ + L+ P++ +L+ ++LA+GL+ R A +L + P+ + + L+A L +
Sbjct: 179 IFLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLI 236


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS06045    TYPE3OMGPROT    334    e-109    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein

family signature.
Length = 607

Score = 334 bits (857), Expect = e-109
Identities = 101/288 (35%), Positives = 155/288 (53%), Gaps = 13/288 (4%)

Query: 320 DVGGGAELASDAPVIEADPRTNAILIRDRPERMQSYGTLIQQLDNRPKLLQIDATIIEIR 379
+ A AS +EADP NAI++RD PERM Y LI LD +++ +I++I
Sbjct: 233 RIPQAATRASAQARVEADPSLNAIIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDIN 292

Query: 380 DGAMQDLGVDWRFHSQHTDIQTGNGSGSQLGFNGALSGAATDGATTPAGGTLTAVLGDAG 439
+ +LGVDWR I+TGN + G S A++GA G+L G
Sbjct: 293 ADQLTELGVDWR-----VGIRTGNNHQVVIKTTGDQSNIASNGAL----GSLVDARGL-- 341

Query: 440 RYLMTRVSALETTNKAKIVSSPQVATLDNVEAVMDHKQQAFVRVSGYASADLYNLSAGVS 499
YL+ RV+ LE A++VS P + T +N +AV+DH + +V+V+G A+L ++ G
Sbjct: 342 DYLLARVNLLENEGSAQVVSRPTLLTQENAQAVIDHSETYYVKVTGKEVAELKGITYGTM 401

Query: 500 LRVLPSVVPGSPNGQMRLDVRIEDGQLGSNT--VDGIPVITSSEITTQAFVNEGQSLLIA 557
LR+ P V+ ++ L++ IEDG N+ ++GIP I+ + + T A V GQSL+I
Sbjct: 402 LRMTPRVLTQGDKSEISLNLHIEDGNQKPNSSGIEGIPTISRTVVDTVARVGHGQSLIIG 461

Query: 558 GYAYDADETDLNAVPGLSKIPLLGNLFKHRQKSGSRMQRLFLLTPHVV 605
G D L+ VP L IP +G LF+ + + R RLF++ P ++
Sbjct: 462 GIYRDELSVALSKVPLLGDIPYIGALFRRKSELTRRTVRLFIIEPRII 509



Score = 250 bits (639), Expect = 5e-77
Identities = 72/230 (31%), Positives = 115/230 (50%), Gaps = 6/230 (2%)

Query: 15 MAAVLMLSLLPLLSPHADAAQVPWHSRTFKYVADNKDLKEVLRDLSASQSIATWISPEVT 74
VL +LL LLS ++ A ++ W + YVA + L+++L D A+ +S ++
Sbjct: 9 FKRVLTGTLL-LLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKIN 67

Query: 75 GTLSGKFE-TSPQKFLDDLAATYGFVWYYDGAVLRIWGANESKSATLSLGTASTKSLRDA 133
+SG+FE +PQ FL +A+ Y VWYYDG VL I+ +E S + L + L+ A
Sbjct: 68 DKVSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQA 127

Query: 134 LARMRLDDPRFPVRYDEAAHVAVVSGPPGYVDTVSAIAKQVEQGVRQR----DATEVQVF 189
L R + +PRF R D + + VSGPP Y++ V A +EQ + R A +++F
Sbjct: 128 LQRSGIWEPRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEIF 187

Query: 190 QLHYAQAADHTTRIGGQDVQIPGMASLLRSMYGARGAPVAAIAGPSANFG 239
L YA A+D T +V PG+A++L+ + +
Sbjct: 188 PLKYASASDRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIPQA 237


22    XADLMG695_RS06265    XADLMG695_RS06325    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
XADLMG695_RS06265214-0.010363Na+/H+ antiporter subunit C
XADLMG695_RS06270215-0.474847monovalent cation/H+ antiporter subunit A
XADLMG695_RS06275216-0.387244DUF962 domain-containing protein
XADLMG695_RS06285214-0.547755M4 family metallopeptidase
XADLMG695_RS06290013-1.088335peptidoglycan DD-metalloendopeptidase family
XADLMG695_RS06305-317-2.010032hypothetical protein
XADLMG695_RS06310-222-2.658103hypothetical protein
XADLMG695_RS06315023-2.695998hypothetical protein
XADLMG695_RS06320223-2.644846phosphoribosylaminoimidazolesuccinocarboxamide
XADLMG695_RS06325319-1.600123DnaJ domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS06310    THERMOLYSIN    311    e-102    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 311 bits (799), Expect = e-102
Identities = 170/504 (33%), Positives = 235/504 (46%), Gaps = 54/504 (10%)

Query: 50 PADAFIARDSIVDADGTEHVRFDRTYQGLPVIGGDVVVHSRRGVMRELSQTMDTTV---R 106
+ + +D G +RF++ +G +V H G + LS T+ +
Sbjct: 72 ARERLSLIGNKLDELGHTVMRFEQAIAASLCMGAVLVAHVNDGELSSLSGTLIPNLDKRT 131

Query: 107 PSLVPGIDAATALRVAGSQFDVAQDAA-------PRASLALYAGQGAPRLVYEVIYSGVK 159
I A +A L +Y + PRL YEV +
Sbjct: 132 LKTEAAISIQQAEMIAKQDVADRVTKERPAAEEGKPTRLVIYPDEETPRLAYEVNVRFLT 191

Query: 160 PDQTPTEMHYIVDAVNQRILESWDTVHTACSGGT------STAGTGRSLYAGSVTVNTTR 213
P P Y++DA + ++L W+ + A GG ST G GR + +NTT
Sbjct: 192 PV--PGNWIYMIDAADGKVLNKWNQMDEAKPGGAQPVAGTSTVGVGRGVLGDQKYINTTY 249

Query: 214 CSSTS-YEMTDLSRGSGA-TYNMRNSTSGNGTLVTDDDNAWGSGTTGDTVTAAVDAHYGV 271
S Y + D +RGSG TY+ RN T G+L D DN + AAVDAHY
Sbjct: 250 SSYYGYYYLQDNTRGSGIFTYDGRNRTVLPGSLWADGDNQF----FASYDAAAVDAHYYA 305

Query: 272 ALTWDYYRTMHSRTGIANDGAGARSRVHYGSRYNNAFWQDSCFCMTFGDGDGSTFTPLV- 330
+ +DYY+ +H R A RS VHYG YNNAFW S M +GDGDG TF P
Sbjct: 306 GVVYDYYKNVHGRLSYDGSNAAIRSTVHYGRGYNNAFWNGSQ--MVYGDGDGQTFLPFSG 363

Query: 331 SVDVAGHEMTHGVTSRTAALTYSGESGGLNEATSDIMGTMVEYSAANSAEPGNYLIGEKI 390
+DV GHE+TH VT TA L Y ESG +NEA SDI GT+VE+ A + ++ IGE I
Sbjct: 364 GIDVVGHELTHAVTDYTAGLVYQNESGAINEAMSDIFGTLVEFYANRNP---DWEIGEDI 420

Query: 391 IPNNSTGTLALRYMFKPSLDGKS---PDCYSSSLGSLNVHYSSGVANHFYYLLAEGAVVP 447
G ALR M P+ G Y+ + + VH +SG+ N YLL++G
Sbjct: 421 YTPGVAGD-ALRSMSDPAKYGDPDHYSKRYTGTQDNGGVHTNSGIINKAAYLLSQG---- 475

Query: 448 SGFGSGTSYNLTPTSLVCSGSTALTAIGRAAASRIWYRALTVYMTSSTNYAAARRATLSA 507
G Y ++ +T IGR +I+YRAL Y+T ++N++ R A + A
Sbjct: 476 -----GVHYGVS-----------VTGIGRDKMGKIFYRALVYYLTPTSNFSQLRAACVQA 519

Query: 508 ATDLYGSTSTQYRAVAAAWSAVSV 531
A DLYGSTS + +V A++AV V
Sbjct: 520 AADLYGSTSQEVNSVKQAFNAVGV 543


23    XADLMG695_RS06395    XADLMG695_RS06475    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
XADLMG695_RS06395211-2.180838cAMP-activated global transcriptional regulator
XADLMG695_RS06400211-2.262705adenosylmethionine decarboxylase
XADLMG695_RS06405313-1.791430multidrug efflux SMR transporter
XADLMG695_RS06410514-2.4293472-polyprenyl-3-methyl-6-methoxy-1,4-benzoquinone
XADLMG695_RS06415619-2.90478250S ribosomal protein L13
XADLMG695_RS06420316-2.75717930S ribosomal protein S9
XADLMG695_RS06425120-0.185544**RNA pyrophosphohydrolase
XADLMG695_RS06430321-1.246399(2Fe-2S)-binding protein
XADLMG695_RS06435-1160.279386bacterioferritin
XADLMG695_RS064400131.916616EAL domain-containing response regulator
XADLMG695_RS064551122.563454DUF4126 domain-containing protein
XADLMG695_RS064601132.821258hypothetical protein
XADLMG695_RS064650132.938913polymer-forming cytoskeletal protein
XADLMG695_RS064700133.467310iron-sulfur cluster insertion protein ErpA
XADLMG695_RS064750123.474724NAD(+) diphosphatase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS06465    HELNAPAPROT    29    0.004    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family

signature.
Length = 153

Score = 29.5 bits (66), Expect = 0.004
Identities = 19/103 (18%), Positives = 41/103 (39%), Gaps = 10/103 (9%)

Query: 44 EYKESIDEMKHADKLSDRILFLEGLPNF---QALGKLRIGENP-----TEMFRCDLALER 95
E + E D +++R+L + G P + I + +EM + + +
Sbjct: 52 ELYDHAAE--TVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYK 109

Query: 96 EAVVVLREAVAYAETVKDYVSRQLLVDILESEEEHIDWLETQL 138
+ + + AE +D + L V ++E E+ + L + L
Sbjct: 110 QISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWMLSSYL 152


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS06475    HTHFIS    66    2e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.0 bits (161), Expect = 2e-13
Identities = 30/148 (20%), Positives = 61/148 (41%), Gaps = 11/148 (7%)

Query: 115 RVLIVEDDRSQALFAQSVLHGAGMHAQVEMTAASVPQAIQDYHPDLILMDLHMPELDGIR 174
+L+ +DD + L AG ++ AA++ + I DL++ D+ MP+ +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 175 LTTLIRQQPGQQLLPIVFLTGDPDPERQFEVLDSGADDFLTKPIRPRHLIAAVSNRIRRA 234
L I++ + LP++ ++ + + GA D+L KP +
Sbjct: 65 LLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDL--------TELIGI 114

Query: 235 RQQALQQVGEQVSVRS-NPETGLPTRGH 261
+AL + + S + + G+P G
Sbjct: 115 IGRALAEPKRRPSKLEDDSQDGMPLVGR 142


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS06485    GPOSANCHOR    39    1e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 39.3 bits (91), Expect = 1e-05
Identities = 21/79 (26%), Positives = 29/79 (36%), Gaps = 1/79 (1%)

Query: 66 EAALQQARRSQAQQRRQIEQLQQRQVNLAMSDKISRAANTEVQASLAERDEQIAALRADV 125
A Q RR R +QL+ L +KIS A+ ++ L E L A+
Sbjct: 308 NANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEH 367

Query: 126 AFYERLVG-STAQRKGLNA 143
E S A R+ L
Sbjct: 368 QKLEEQNKISEASRQSLRR 386


24    XADLMG695_RS06515    XADLMG695_RS06805    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
XADLMG695_RS065152141.569498hypothetical protein
XADLMG695_RS065200142.138161bifunctional
XADLMG695_RS065251130.513149CDP-alcohol phosphatidyltransferase family
XADLMG695_RS06530-1110.277595phosphatase PAP2/dual specificity phosphatase
XADLMG695_RS06535-1130.099773hypothetical protein
XADLMG695_RS06540-1110.026804TIGR04222 domain-containing membrane protein
XADLMG695_RS06545-2110.231721CDP-alcohol phosphatidyltransferase family
XADLMG695_RS22040-114-0.4415141-acyl-sn-glycerol-3-phosphate acyltransferase
XADLMG695_RS065502122.170129phosphatidate cytidylyltransferase
XADLMG695_RS065555162.676916DNA-binding transcriptional regulator Fis
XADLMG695_RS065604172.261047DUF3426 domain-containing protein
XADLMG695_RS065655161.565177helix-turn-helix domain-containing protein
XADLMG695_RS065703151.456304hypothetical protein
XADLMG695_RS065752142.57303450S ribosomal protein L11 methyltransferase
XADLMG695_RS065851142.708794hypothetical protein
XADLMG695_RS065901153.113762hypothetical protein
XADLMG695_RS065950115.050553acetyl-CoA carboxylase biotin carboxylase
XADLMG695_RS06610-2122.775329four helix bundle protein
XADLMG695_RS06615-1101.991421acetyl-CoA carboxylase biotin carboxyl carrier
XADLMG695_RS066200121.053349type II 3-dehydroquinate dehydratase
XADLMG695_RS066251110.880296protein-disulfide reductase DsbD
XADLMG695_RS06630-1172.125200divalent-cation tolerance protein CutA
XADLMG695_RS066350172.146860endonuclease
XADLMG695_RS066400193.289896co-chaperone GroES
XADLMG695_RS066450233.551743chaperonin GroEL
XADLMG695_RS066500262.627950hypothetical protein
XADLMG695_RS066550202.1377753-deoxy-7-phosphoheptulonate synthase
XADLMG695_RS066601190.334829hypothetical protein
XADLMG695_RS06675-1130.241489hypothetical protein
XADLMG695_RS06680-480.649852SMP-30/gluconolactonase/LRE family protein
XADLMG695_RS06685-371.188045hypothetical protein
XADLMG695_RS06690-291.355751bifunctional [glutamate--ammonia
XADLMG695_RS066952132.691616VUT family protein
XADLMG695_RS067003142.802356S8/S53 family peptidase
XADLMG695_RS067052132.649909hypothetical protein
XADLMG695_RS067150122.984081DUF2145 domain-containing protein
XADLMG695_RS067201122.295582mitochondrial fission ELM1 family protein
XADLMG695_RS067251122.802305malonic semialdehyde reductase
XADLMG695_RS067300112.167515polyisoprenoid-binding protein
XADLMG695_RS06735-1122.287379hypothetical protein
XADLMG695_RS067400142.457073histidine-type phosphatase
XADLMG695_RS067451123.258105siderophore-interacting protein
XADLMG695_RS067502133.745381helix-turn-helix transcriptional regulator
XADLMG695_RS067550142.701514malonate decarboxylase subunit alpha
XADLMG695_RS06760-2111.819454malonate decarboxylase acyl carrier protein
XADLMG695_RS06765091.539533biotin-independent malonate decarboxylase
XADLMG695_RS067701101.565593biotin-independent malonate decarboxylase
XADLMG695_RS067751120.521914malonate decarboxylase
XADLMG695_RS067853121.123166triphosphoribosyl-dephospho-CoA synthase MdcB
XADLMG695_RS067907172.347325malonate decarboxylase subunit epsilon
XADLMG695_RS067955121.677910hypothetical protein
XADLMG695_RS068003111.590558GntR family transcriptional regulator
XADLMG695_RS06805391.532407hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS06575    cloacin    37    2e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 37.0 bits (85), Expect = 2e-04
Identities = 20/55 (36%), Positives = 25/55 (45%), Gaps = 3/55 (5%)

Query: 441 GTAALIGTPWADYHSLRAPHGHAGGSGSSCGGGGGDSGGDGGSSDGGGCGGCGGG 495
G A G+ W+ ++ P G GSG GGG G G G + GGG G G
Sbjct: 30 GGGASDGSGWSSENN---PWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS06595    DNABINDNGFIS    113    9e-37    DNA-binding protein FIS signature.
		>DNABINDNGFIS#DNA-binding protein FIS signature.

Length = 98

Score = 113 bits (284), Expect = 9e-37
Identities = 37/74 (50%), Positives = 55/74 (74%)

Query: 16 KSPLREHVAQSVRRYLRDLDGSDADDVYEIVLREMEIPLFVEVLNHCEGNQSRAAALLGI 75
+ PLR+ V Q+++ Y L+G D +D+YE+VL E+E PL V+ + GNQ+RAA ++GI
Sbjct: 24 QKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQPLLDMVMQYTRGNQTRAALMMGI 83

Query: 76 HRATLRKKLKEYGL 89
+R TLRKKLK+YG+
Sbjct: 84 NRGTLRKKLKKYGM 97


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS06705    IGASERPTASE    31    0.023    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.2 bits (70), Expect = 0.023
Identities = 21/119 (17%), Positives = 35/119 (29%), Gaps = 14/119 (11%)

Query: 131 AMRAPAAMQAPRAAVAAAKGIAETPAASANAGTSTATPNVEHTATPPPAQSLRITA-TTN 189
++ A K P + N T P E ++ + T T N
Sbjct: 1138 TVQPQAEPARENDPTVNIK----EPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGN 1193

Query: 190 ATSATPTNRTP---------ESAKRADAQARTPRSSTAWTLQFDRIVAEQVQAVSLRQL 239
+ P N TP ES+ + + R S ++ + V+L L
Sbjct: 1194 SVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDL 1252


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS06735    SUBTILISIN    41    1e-05    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 41.0 bits (96), Expect = 1e-05
Identities = 47/235 (20%), Positives = 71/235 (30%), Gaps = 52/235 (22%)

Query: 200 YNDLKQAY---GYPSYQTMIGAPGKQQRLDGSGSTIAVLIGSDVLDTDIAAMFDHERFSR 256
Y +KQ P MI AP + G G +A VLDT DH
Sbjct: 10 YQVIKQEQQVNEIPRGVEMIQAPAVWNQTRGRGVKVA------VLDTGC--DADHPDLK- 60

Query: 257 YAGNHANPTLYARRYVAGAKPGVQDGNR-----AAAREATLDVDMALGGAPGAHVLLYVI 311
G +D N A AT + + +G AP A +L+ +
Sbjct: 61 ---ARIIGGRNFTDDDEGDPEIFKDYNGHGTHVAGTIAATENENGVVGVAPEADLLIIKV 117

Query: 312 PDL----SIDSILAGYRQIVQDNEADVVSSSFGFCEQVFTAAYNGKDATSILGVFDSVFK 367
+ D I+ G ++ D++S S G +D K
Sbjct: 118 LNKQGSGQYDWIIQGIYYAIEQK-VDIISMSLG----------GPEDVPE----LHEAVK 162

Query: 368 QGNAQGISFVAPSGDNAGLDCPDTQYLVEGKNGRYVPSVHWPAADAYVTAVGGGN 422
+ A I + +G+ EG + +P V +VG N
Sbjct: 163 KAVASQILVMCAAGN-------------EGDGDDRTDELGYPGCYNEVISVGAIN 204


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS06760    PF06776    31    0.004    Invasion associated locus B
		>PF06776#Invasion associated locus B

Length = 214

Score = 30.7 bits (69), Expect = 0.004
Identities = 13/51 (25%), Positives = 20/51 (39%)

Query: 8 LLPLALTLAIAACSKPADNAAAPAAETPAAATAPADAAAAPAPAPAAAAST 58
+ P L+ +A+C + A A A A A + + A A A S
Sbjct: 29 MGPAELSPMLASCRRLARRNGARLMLAGAMAIALSFGWSDRADAQGAVRSV 79


25    XADLMG695_RS06850    XADLMG695_RS06920    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
XADLMG695_RS06850116-4.013799hypothetical protein
XADLMG695_RS06855323-5.331375hypothetical protein
XADLMG695_RS06860326-5.361794hypothetical protein
XADLMG695_RS06865424-5.996692hypothetical protein
XADLMG695_RS06870526-5.682369transposase
XADLMG695_RS06875748-8.265678hypothetical protein
XADLMG695_RS06880235-3.743749hypothetical protein
XADLMG695_RS06890333-4.039231HNH endonuclease
XADLMG695_RS06895436-6.023084hypothetical protein
XADLMG695_RS06900540-7.313514type I toxin-antitoxin system SymE family toxin
XADLMG695_RS22060746-9.157483SDR family oxidoreductase
XADLMG695_RS06910546-8.562744AraC family transcriptional regulator
XADLMG695_RS06915546-8.088812hypothetical protein
XADLMG695_RS23210649-8.769685hypothetical protein
XADLMG695_RS06920130-3.247461hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS23210    CHANLCOLICIN    29    0.024    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 29.3 bits (65), Expect = 0.024
Identities = 20/57 (35%), Positives = 27/57 (47%), Gaps = 7/57 (12%)

Query: 124 GTGGLRGHGVGGGSGGLRSRRTDFYSVAAL--VSRWTKARTAGVDADQTIRAGANQA 178
GT G G GGG GG +S S AA+ ++W+ A+ A+Q RA A
Sbjct: 27 GTPDGSGSGGGGGKGGSKSE-----SSAAIHATAKWSTAQLKKTQAEQAARAKAAAE 78


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS06945    DHBDHDRGNASE    82    8e-21    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 82.0 bits (202), Expect = 8e-21
Identities = 54/222 (24%), Positives = 96/222 (43%), Gaps = 5/222 (2%)

Query: 3 IENKVVVITGAGSGMGRATALHLAALGAKVVLGARREARIAEVARQITLSGGQAVYRPTD 62
IE K+ ITGA G+G A A LA+ GA + ++ +V + A P D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 63 VTVHEEVLALADLACSQFGRLDVMVNNAGISPLSRFDALQVEAWNAMIDVNLRGVLHGIA 122
V + + + G +D++VN AG+ +L E W A VN GV +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 123 AALPIFGRQQSSHVINVVSTAGLRIVPTMGVYAATKNAVRTISEALRQESGPH-IRVTEI 181
+ ++S ++ V S +M YA++K A ++ L E + IR +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 182 SPGMVQSELLDT--VSDPALRQTLQAQSEA--SGMPAEAIAR 219
SPG ++++ + + Q ++ E +G+P + +A+
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAK 227


26    XADLMG695_RS07575    XADLMG695_RS07615    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
XADLMG695_RS075754131.471552GspH/FimT family pseudopilin
XADLMG695_RS075803121.309245type II secretion system minor pseudopilin GspI
XADLMG695_RS075854151.607036type II secretion system minor pseudopilin GspJ
XADLMG695_RS075902181.358408type II secretion system minor pseudopilin GspK
XADLMG695_RS075953201.254894type II secretion system protein GspL
XADLMG695_RS076005221.908111type II secretion system protein M
XADLMG695_RS076104202.109045type II secretion system protein N
XADLMG695_RS076152140.792293GntR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS07600    BCTERIALGSPH    51    8e-11    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 50.7 bits (121), Expect = 8e-11
Identities = 20/79 (25%), Positives = 38/79 (48%), Gaps = 5/79 (6%)

Query: 9 RGFTLLEMLAVLVIAALASTLVVMTLPDTRRDLHDHADTLAS---ALIHARDEAILSLRM 65
RGFTLLEM+ +L++ +++ +V++ P +R D A TLA L + + + +
Sbjct: 4 RGFTLLEMMLILLLMGVSAGMVLLAFPASRDD--SAAQTLARFEAQLRFVQQRGLQTGQF 61

Query: 66 VEVGIDAGGYGFRRQAQQQ 84
V + + F +
Sbjct: 62 FGVSVHPDRWQFLVLEARD 80


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS07610    BCTERIALGSPG    35    4e-05    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 35.2 bits (81), Expect = 4e-05
Identities = 12/48 (25%), Positives = 26/48 (54%)

Query: 1 MIRKQRTRGFTLIELLVALAVFALVAAAAVMVMRQSIDQRDAVRARLQ 48
M + RGFTL+E++V + + ++A+ V + + ++ D +A
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSD 48


27    XADLMG695_RS07845    XADLMG695_RS07875    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
XADLMG695_RS078450123.074581response regulator transcription factor
XADLMG695_RS078500144.130534FAD-binding protein
XADLMG695_RS078600154.875107metal-dependent hydrolase
XADLMG695_RS078650164.770394YraN family protein
XADLMG695_RS07870-1164.814729penicillin-binding protein activator
XADLMG695_RS07875-1174.47048816S rRNA
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS07855    HTHFIS    96    4e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 96.4 bits (240), Expect = 4e-25
Identities = 34/118 (28%), Positives = 61/118 (51%), Gaps = 1/118 (0%)

Query: 11 ARVLIVDDEPQIRRFLDISLRAQGYRVLQAGTGEEGLALLAGQGAELVVLDIGLPDRDGH 70
A +L+ DD+ IR L+ +L GY V +A +LVV D+ +PD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 71 EVLREIRQ-WSNVPVIMLTVRAGETEKVAALDAGANDYVTKPFGVQELMARIRALLRQ 127
++L I++ ++PV++++ + + A + GA DY+ KPF + EL+ I L +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


28    XADLMG695_RS08555    XADLMG695_RS08630    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
XADLMG695_RS085552183.534704branched-chain amino acid aminotransferase
XADLMG695_RS085601162.214464S8 family serine peptidase
XADLMG695_RS085650151.477215S8 family serine peptidase
XADLMG695_RS085701131.558330Cys-tRNA(Pro) deacylase
XADLMG695_RS085802112.276609asparaginase
XADLMG695_RS085852121.896790xylanase
XADLMG695_RS085903141.654572DUF2069 domain-containing protein
XADLMG695_RS085953131.166985NAD(P)H:quinone oxidoreductase
XADLMG695_RS086002161.131014YihY family inner membrane protein
XADLMG695_RS086052120.999967TlpA family protein disulfide reductase
XADLMG695_RS08610-1130.274754acylphosphatase
XADLMG695_RS08615-1150.685209hypothetical protein
XADLMG695_RS086202131.757230polyphosphate kinase 2
XADLMG695_RS086252163.211905molybdenum cofactor biosynthesis protein B
XADLMG695_RS086302172.463226tetratricopeptide repeat protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS08595    SUBTILISIN    203    1e-62    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 203 bits (518), Expect = 1e-62
Identities = 97/331 (29%), Positives = 143/331 (43%), Gaps = 53/331 (16%)

Query: 147 QWAFGTTNAGL---NIRPAWDKSTGANVVVAVIDTGI-VSHPDLDANILPGYDFISDATA 202
+ G+ W+++ G V VAV+DTG HPDL A I+ G +F
Sbjct: 16 EQQVNEIPRGVEMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIGGRNFT----- 70

Query: 203 ARDGNGRDNNPADEGDWNSTSGCTTSNSSWHGTHVAGTVAAVTNNTTGVAGTAFNAKVVP 262
+ + +P D+N HGTHVAGT+AA T N GV G A A ++
Sbjct: 71 ----DDDEGDPEIFKDYN-----------GHGTHVAGTIAA-TENENGVVGVAPEADLLI 114

Query: 263 VRVLGRCG-GSLSDIADAIIWASGGTVSGVPANPNAAEVINMSLGGGGTCSSTMQSAING 321
++VL + G G I I +A ++I+MSLGG + A+
Sbjct: 115 IKVLNKQGSGQYDWIIQGIYYA----------IEQKVDIISMSLGGPED-VPELHEAVKK 163

Query: 322 AVSRGTTVVVAAGNSAANVSG----SLPANCANVIAVAATTSAGAKASYSNYGSGIDVSA 377
AV+ V+ AAGN P VI+V A + +SN + +D+ A
Sbjct: 164 AVASQILVMCAAGNEGDGDDRTDELGYPGCYNEVISVGAINFDRHASEFSNSNNEVDLVA 223

Query: 378 PGSGILSTLNSGTTTPGNASYASYNGTSMAAPHVAGVVALVQSVAPTT----LTPAAVET 433
PG ILST+ G YA+++GTSMA PHVAG +AL++ +A + LT +
Sbjct: 224 PGEDILSTVPGGK-------YATFSGTSMATPHVAGALALIKQLANASFERDLTEPELYA 276

Query: 434 LLKNTARALPGACSGGCGAGIVDADAAVTAA 464
L L + G G++ A +
Sbjct: 277 QLIKRTIPLGNS-PKMEGNGLLYLTAVEELS 306


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS08605    SUBTILISIN    203    5e-63    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 203 bits (519), Expect = 5e-63
Identities = 97/339 (28%), Positives = 142/339 (41%), Gaps = 57/339 (16%)

Query: 134 DPGVPQQWAMGATAASL---NIRPAWDRSTGKGIVVAVIDTGI-TNHPDLAANVLPGYDF 189
+ Q+ + + W+++ G+G+ VAV+DTG +HPDL A ++ G +F
Sbjct: 10 YQVIKQEQQVNEIPRGVEMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIGGRNF 69

Query: 190 IVDPATARDGTARDANAADQGDWAAANECGPGASASNSSWHGTHVAGIVAAVGNNAVGVV 249
++ G + + HGTHVAG +AA N GVV
Sbjct: 70 TD------------------------DDEGDPEIFKDYNGHGTHVAGTIAATENE-NGVV 104

Query: 250 GTAFNAKILPLRVLGRCG-GYMSDIADAIVWASGGKVSGVPTNPNPATVINLSLGGAGTC 308
G A A +L ++VL + G G I I +A +I++SLGG
Sbjct: 105 GVAPEADLLIIKVLNKQGSGQYDWIIQGIYYA----------IEQKVDIISMSLGGPED- 153

Query: 309 SATLNNAIAAAVTRGSAVVVAAGNSNLDVST----SVPANCANVIAVAATTSAGAKASFS 364
L+ A+ AV V+ AAGN P VI+V A + FS
Sbjct: 154 VPELHEAVKKAVASQILVMCAAGNEGDGDDRTDELGYPGCYNEVISVGAINFDRHASEFS 213

Query: 365 NFGKGVDIAAPGQSIVSTLNTGTTAPGNAAYAVYSGTSMAAPHVAGVVALMQSVALN--- 421
N VD+ APG+ I+ST+ G YA +SGTSMA PHVAG +AL++ +A
Sbjct: 214 NSNNEVDLVAPGEDILSTVPGGK-------YATFSGTSMATPHVAGALALIKQLANASFE 266

Query: 422 -PLTPATVKALLKASARPMPVACTQGCGAGLVNADGAVA 459
LT + A L P+ + G GL+
Sbjct: 267 RDLTEPELYAQLIKRTIPLGNSPKM-EGNGLLYLTAVEE 304


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
XADLMG695_RS08665    SYCDCHAPRONE    37    8e-05    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD

chaperone signature.
Length = 168

Score = 36.8 bits (85), Expect = 8e-05
Identities = 14/81 (17%), Positives = 28/81 (34%)

Query: 46 VQRALALHPGHPEAVARLGRVRWAQQRHAEAATLLQQASDLVPQHPGIALWLGHALEDAG 105
+ + E + L ++ ++ +A + Q L L LG + G
Sbjct: 25 IAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMG 84

Query: 106 QPEQAAAAYTRAHRLLPDEPY 126
Q + A +Y+ + EP
Sbjct: 85 QYDLAIHSYSYGAIMDIKEPR 105


29    XADLMG695_RS09160    XADLMG695_RS09220    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
XADLMG695_RS091602190.616121glutaredoxin 3
XADLMG695_RS09165423-0.682490carboxymuconolactone decarboxylase family
XADLMG695_RS09170522-1.595995isocitrate dehydrogenase
XADLMG695_RS09180418-2.326836Bax inhibitor-1/YccA family protein
XADLMG695_RS09210417-1.888389*****trigger factor
XADLMG695_RS09215213-1.707929ATP-dependent Clp endopeptidase proteolytic
XADLMG695_RS09220313-1.463037ATP-dependent Clp protease ATP-binding subunit
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS09220  HTHFIS  33  0.002  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.3 bits (76), Expect = 0.002
Identities = 15/85 (17%), Positives = 32/85 (37%), Gaps = 10/85 (11%)

Query: 60 QSARSSLPKPREILEVLDQY----VIGQLRAKRTLAVAVYNHYKRIESRSKNDDVELAK- 114
+ A LPKP ++ E++ + R + + S + + +
Sbjct: 96 KGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155

Query: 115 -----SNILLVGPTGSGKTLLAETL 134
+++ G +G+GK L+A L
Sbjct: 156 LMQTDLTLMITGESGTGKELVARAL 180


30  XADLMG695_RS22185  XADLMG695_RS09475  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS22185-128-3.857706nuclear transport factor 2 family protein
XADLMG695_RS09320026-3.920040cyclic pyranopterin monophosphate synthase MoaC
XADLMG695_RS09325629-3.465397MoaD/ThiS family protein
XADLMG695_RS09330830-4.033470molybdenum cofactor biosynthesis protein MoaE
XADLMG695_RS093401038-6.151304hypothetical protein
XADLMG695_RS093451031-4.203582hypothetical protein
XADLMG695_RS09350931-5.419174hypothetical protein
XADLMG695_RS09355932-6.073975helix-turn-helix transcriptional regulator
XADLMG695_RS09360735-6.679104NYN domain-containing protein
XADLMG695_RS09365735-6.416712hypothetical protein
XADLMG695_RS09370634-6.566084hypothetical protein
XADLMG695_RS23245640-7.794095hypothetical protein
XADLMG695_RS23250837-4.452417hypothetical protein
XADLMG695_RS09375836-4.389249hypothetical protein
XADLMG695_RS09380835-5.063612hypothetical protein
XADLMG695_RS22190938-4.956378MobA/MobL family protein
XADLMG695_RS09390833-3.099589hypothetical protein
XADLMG695_RS09395630-1.938424hypothetical protein
XADLMG695_RS09400426-4.583605thermonuclease family protein
XADLMG695_RS22195212-3.635523hypothetical protein
XADLMG695_RS09405210-2.170032H-NS histone family protein
XADLMG695_RS09410110-2.227271glycogen debranching protein GlgX
XADLMG695_RS0941519-2.05034730S ribosomal protein S6--L-glutamate ligase
XADLMG695_RS0942029-2.049794hypothetical protein
XADLMG695_RS0942519-1.623223response regulator transcription factor
XADLMG695_RS09430111-2.023296HAMP domain-containing histidine kinase
XADLMG695_RS09435114-2.436875type I toxin-antitoxin system SymE family toxin
XADLMG695_RS09440113-2.026962RHS repeat protein
XADLMG695_RS09445119-4.205911dephospho-CoA kinase
XADLMG695_RS09450323-5.022525prepilin peptidase
XADLMG695_RS09460219-5.134218type II secretion system F family protein
XADLMG695_RS09465122-5.895758pilin
XADLMG695_RS09470222-6.579016type IV-A pilus assembly ATPase PilB
XADLMG695_RS09475118-5.834746hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS09440  HTHFIS  88  2e-22  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.3 bits (219), Expect = 2e-22
Identities = 32/119 (26%), Positives = 60/119 (50%), Gaps = 2/119 (1%)

Query: 2 RILVIEDNSDIAANLGDYLEDRGHTVDFAADGVTGLHLAVVHEFDAIVLDLNLPGMDGIE 61
ILV +D++ I L L G+ V ++ T + D +V D+ +P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VCRKLRNEARKQTPVLMLTARDSLDNKLAGFDSGADDYLIKPFALQE-VEVRLNALSRR 119
+ +++ +AR PVL+++A+++ + + GA DYL KPF L E + + AL+
Sbjct: 65 LLPRIK-KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS09470  PREPILNPTASE  330  e-116  Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.
Length = 290

Score = 330 bits (848), Expect = e-116
Identities = 129/282 (45%), Positives = 176/282 (62%), Gaps = 1/282 (0%)

Query: 1 MAFLDQHPGLGFPAAAGLGLLIGSFLNVVILRLPKRMEWQWRRDAREILELPDI-YEPPP 59
+ P L F L+IGSFLNVVI RLP +E +W+ + R D + PP
Sbjct: 5 LELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEPP 64

Query: 60 PGIVVEPSHDPVTGDKLKWWENIPVLSWAMLRGKSRYSGKPISIQYPLVELLTSILCVAS 119
++V S P + ENIP+LSW LRG+ R PIS +YPLVELLT++L VA
Sbjct: 65 YNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVAV 124

Query: 120 VWRFGFGWQGFGAIVLSCFLVAMSGIDLRHKLLPDQLTLPLMWLGLVGSMDNLYMPAKPA 179
GW A++L+ LVA++ IDL LLPDQLTLPL+W GL+ ++ ++ A
Sbjct: 125 AMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDA 184

Query: 180 LLGAAVGYVSLWTVWWLFKQLTGKEGMGHGDFKLLAALGAWCGLKGILPIILISSLVGAI 239
++GA GY+ LW+++W FK LTGKEGMG+GDFKLLAALGAW G + + ++L+SSLVGA
Sbjct: 185 VIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAF 244

Query: 240 LGSIWLVAKGRDRATPIPFGPYLAIAGWVVFFWGNDLVDGYL 281
+G ++ + ++ PIPFGPYLAIAGW+ WG+ + YL
Sbjct: 245 MGIGLILLRNHHQSKPIPFGPYLAIAGWIALLWGDSITRWYL 286
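
The E-values in this report are printed in more than one style ("0.002", "2e-22", and the mantissa-free "e-116"), and the last form is not accepted by Python's `float()`. The sketch below shows one way to normalize them and apply the RPSBlast E-value cutoff of 0.05 listed under the input parameters. The helper name `parse_evalue` is hypothetical, and treating the cutoff as inclusive (`<=`) is an assumption, since the report does not say how PredictBias compares against it.

```python
def parse_evalue(text):
    """Convert a BLAST-style E-value string to a float (hypothetical helper).

    Very small values may be printed without a mantissa ('e-116'),
    which float() rejects, so a leading '1' is added in that case.
    """
    text = text.strip()
    if text.lower().startswith("e"):
        text = "1" + text
    return float(text)


# Cutoff taken from the input parameters; inclusive comparison is an assumption.
E_VALUE_CUTOFF = 0.05

# (locus tag, hit, E-value) triples copied from summary rows above.
hits = [
    ("XADLMG695_RS09220", "HTHFIS", "0.002"),
    ("XADLMG695_RS09440", "HTHFIS", "2e-22"),
    ("XADLMG695_RS09470", "PREPILNPTASE", "e-116"),
]

kept = [
    (tag, hit, parse_evalue(e))
    for tag, hit, e in hits
    if parse_evalue(e) <= E_VALUE_CUTOFF
]
```

All three example rows pass the cutoff, which is expected because the report only lists hits that were already judged significant.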


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS09475  BCTERIALGSPF  382  e-133  Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.
Length = 408

Score = 382 bits (982), Expect = e-133
Identities = 113/405 (27%), Positives = 211/405 (52%), Gaps = 9/405 (2%)

Query: 23 LFLWEGTDKRGIKMKGEQTARNMNMLRAELRRQGINPSIVKLK--------PKPLFGAAG 74
+ ++ D +G K +G Q A + R LR +G+ P V L
Sbjct: 3 QYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRK 62

Query: 75 KKITPKDIAFFSRQMATMMKSGVPIVGSLEIIGEGHKNPRMKKMVGQVRTDIEGGSSLYE 134
+++ D+A +RQ+AT++ + +P+ +L+ + + + P + +++ VR+ + G SL +
Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122

Query: 135 SISRHPVQFDELYRNLVRAGEGAGVLETVLDTVATYKENIEALKGKIKKALFYPAMVIAV 194
++ P F+ LY +V AGE +G L+ VL+ +A Y E + ++ +I++A+ YP ++ V
Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182

Query: 195 ALIVSAILLIFVVPQFEEVFKGFGAELPAFTQMIVGASRFMVSYWWIMFFVIAGAIVGFV 254
A+ V +ILL VVP+ E F LP T++++G S + ++ M + + F
Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242

Query: 255 FAYKRSPSMQHTMDRLILRVPVIGQIMHNSSIARFARTTAVTFKAGVPLVEALGIVAGAT 314
R + + R +L +P+IG+I + AR+ART ++ + VPL++A+ I
Sbjct: 243 VML-RQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVM 301

Query: 315 GNRVYEDAVLRMRDDVSVGYPVNMAMKQVNLFPHMVIQMTAIGEEAGALDSMLFKVAEYF 374
N + D V G ++ A++Q LFP M+ M A GE +G LDSML + A+
Sbjct: 302 SNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQ 361

Query: 375 EQEVNNAVDALSSLLEPMIMVFIGVVVGGMVIGMYLPIFKLGAVV 419
++E ++ + L EP+++V + VV +V+ + PI +L ++
Sbjct: 362 DREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS09480  BCTERIALGSPG  47  1e-09  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.
Length = 145

Score = 46.8 bits (111), Expect = 1e-09
Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 5/71 (7%)

Query: 1 MKKQNGFTLIELMIVVAIIAILAAIALPAYQDYTVRGRVSEAMVAASAAKTVVAENAANG 60
KQ GFTL+E+M+V+ II +LA++ +P + G +A + + V ENA +
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVP-----NLMGNKEKADKQKAVSDIVALENALDM 58

Query: 61 SALNSGWTPPT 71
L++ P T
Sbjct: 59 YKLDNHHYPTT 69


31  XADLMG695_RS23255  XADLMG695_RS09770  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS232554142.191411cobalamin biosynthesis protein
XADLMG695_RS096456142.169625threonine-phosphate decarboxylase
XADLMG695_RS096506162.198781cobyric acid synthase
XADLMG695_RS096556152.877423bifunctional adenosylcobinamide
XADLMG695_RS096607143.298124nicotinate-nucleotide--dimethylbenzimidazole
XADLMG695_RS096657143.117371histidine phosphatase family protein
XADLMG695_RS096706142.969735adenosylcobinamide-GDP ribazoletransferase
XADLMG695_RS096753143.064147type III PLP-dependent enzyme
XADLMG695_RS096802122.732899iron transporter
XADLMG695_RS096852122.518659MFS transporter
XADLMG695_RS096904122.507296siderophore biosynthesis protein, IucA/IucC
XADLMG695_RS096954131.980758carboxylate--amine ligase
XADLMG695_RS097002121.053292TonB-dependent siderophore receptor
XADLMG695_RS097053121.0293854-hydroxy-2-oxovalerate aldolase
XADLMG695_RS097102140.565688IS3 family transposase
XADLMG695_RS09715015-1.016560hypothetical protein
XADLMG695_RS09720-220-2.545566hypothetical protein
XADLMG695_RS09725-222-3.059892hypothetical protein
XADLMG695_RS09730-220-2.347533Kef family K(+) transporter
XADLMG695_RS09740-225-2.518711TonB-dependent receptor
XADLMG695_RS09745-115-2.042966TonB-dependent receptor
XADLMG695_RS09750-19-1.370564VOC family protein
XADLMG695_RS09755-111-1.038315TonB-dependent receptor
XADLMG695_RS09760012-0.310964ATP-binding cassette domain-containing protein
XADLMG695_RS09765111-0.386383YcxB family protein
XADLMG695_RS09770214-0.350084LysR family transcriptional regulator AmpR
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS09700  ALARACEMASE  40  1e-05  Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 39.8 bits (93), Expect = 1e-05
Identities = 47/224 (20%), Positives = 78/224 (34%), Gaps = 32/224 (14%)

Query: 31 DLAALHTHAAWMRAQLPAQCELFYAAKANA----EPPVLRTLATHVDGFEAASGGELAWL 86
DL AL + + +R Q ++ KANA + + DGF + E L
Sbjct: 10 DLQALKQNLSIVR-QAATHARVWSVVKANAYGHGIERIWSAIGA-TDGFALLNLEEAITL 67

Query: 87 HAQQPQAPLLFGGPGKLDTELAQAAALPDCTVHVESLRELERLAAIATHGGRCVPVFLRM 146
+ + P+L G + + T V S +L+ L + ++L++
Sbjct: 68 RERGWKGPILMLE-GFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARL--KAPLDIYLKV 124

Query: 147 NIAVPGAQSTRLMMGGQPSPFGLDPCDLDAAMQRLQASPSLRLEGFHFHLMSHQRNATAQ 206
N + RL G P + Q+L+A ++ LMSH A
Sbjct: 125 NSGM-----NRL---------GFQPDRVLTVWQQLRAMANVGEMT----LMSHFAEAEHP 166

Query: 207 LHLVAAYLRTVQQWRQTYALGPLRVNAGGGFGVDYLAPEASFDW 250
+ A R ++Q + N+ PEA FDW
Sbjct: 167 DGISGAMAR-IEQAAEGLECRRSLSNSAATL----WHPEAHFDW 205


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS09705  PF04183  293  7e-94  IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 293 bits (751), Expect = 7e-94
Identities = 94/468 (20%), Positives = 165/468 (35%), Gaps = 46/468 (9%)

Query: 100 DAQALARCLLQALASTQAINPELLAQSANSVAVT------AAFLRQAQLTAATGEAMIDA 153
D LA+ LL L +++ +A+ + T R+ + D
Sbjct: 69 DEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSASDLINLNADR 128

Query: 154 EQSMLWGHALHPTPKSREGVDLDQVLACAPEARASFQLFWF-------------RIDPRL 200
Q +L GH K R G + + APE +F+L W +D
Sbjct: 129 LQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRCDNEMDIHQ 188

Query: 201 LRIQGRDVRATLR-----QLSGSDDLY---PCHPWEAQRLLDAPLLRTMQARGLITPIGP 252
L D + R Q +G D + P HPW+ Q+ + + A G + +G
Sbjct: 189 LLTAAMDPQEFARFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADF-AEGRMVSLGE 247

Query: 253 LGDALRPTSSVRTLYHPE--LAYFLKCSVHVRLTNCVRKNAWYELESAVALSELLAPSWR 310
GD S+RTL + +K + + T+C R + + S L +
Sbjct: 248 FGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWLQQVFA 307

Query: 311 ALAMQV-PGFDVMLEPAATSLDVALVDPALHAADPLAARTLSESFGILYRQGIPAAQRAR 369
A V G ++ EPAA V +AA A E G+++R+ +
Sbjct: 308 TDATLVQSGAVILGEPAA-----GYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLKPD 362

Query: 370 WQPQVAAALFTCDAQGNSVCAARLRALGSAQMNRRTATLLWFGAYAGLLLDGVWSALFQH 429
P + A L CD + A + G W +++ ++ L ++
Sbjct: 363 ESPVLMATLMECDENNQPLAGAYIDRSGLDAET-------WLTQLFRVVVVPLYHLLCRY 415

Query: 430 GIALEPHLQNTVIGFADGWPTRVWIRDLEGT-KLLAHHWPETRLRGVGERARQSLYYTPE 488
G+AL H QN + +G P RV ++D +G +L+ +PE + + + R
Sbjct: 416 GVALIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPE--MDSLPQEVRDVTSRLSA 473

Query: 489 QGWNRVAYCALVNNLAEAIFHLSQGDAALETQLWQCVGEIALRWQQRH 536
+ I L E + +Q + + + ++H
Sbjct: 474 DYLIHDLQTGHFVTVLRFISPLMVRLGVPERRFYQLLAAVLSDYMKKH 521


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS09710  TCRTETA  61  3e-12  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 60.6 bits (147), Expect = 3e-12
Identities = 50/156 (32%), Positives = 67/156 (42%), Gaps = 3/156 (1%)

Query: 20 LGMPLFLPQVLAELAPAA-AVGWSGVLYVLPTLCTALTASSWGRWADRHGRKRSLLRAQL 78
L MP+ LP +L +L + G+L L L A G +DR GR+ LL +
Sbjct: 23 LIMPV-LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLA 81

Query: 79 GLALGFAIAGFAPSLSWLVIGLVVQGTCGGSLAAANAYLASQPQAGPLARALDWTQYSAR 138
G A+ +AI AP L L IG +V G G + A A AY+A AR +
Sbjct: 82 GAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFG 141

Query: 139 LAMVSAPALLGLALALGPAQSLYRALALLPLIAFAL 174
MV+ P L GL P + A A L + F
Sbjct: 142 FGMVAGPVLGGLMGGFSPHAPFFAA-AALNGLNFLT 176



Score = 35.6 bits (82), Expect = 3e-04
Identities = 35/110 (31%), Positives = 42/110 (38%), Gaps = 4/110 (3%)

Query: 278 LLPGLALFAVACVWQALLHDALALAVARLLFGL-GMLFALRGLNRSLAHIASGHGAGRLF 336
LL LA AV A L + R++ G+ G A+ G +A I G R F
Sbjct: 76 LLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG--AYIADITDGDERARHF 133

Query: 337 GRFDACGKWAGVFAGAAAGALAQASGPATPFLAAALAAAAAALTVVVRFP 386
G AC G+ AG G L P PF AAA LT P
Sbjct: 134 GFMSACFG-FGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLP 182


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS09715  PF04183  148  2e-40  IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 148 bits (376), Expect = 2e-40
Identities = 95/437 (21%), Positives = 145/437 (33%), Gaps = 62/437 (14%)

Query: 81 DSWIVRSDDGVHV---ERGAHAWLH-------RISAELDAQT--QQL-HRAYADEADCAA 127
D + + ERG WL + AQT QL +A A
Sbjct: 35 DRYCINLPGAQWRFIAERGIWGWLWIDAQTLRCADEPVLAQTLLMQLKQVLSMSDATVAE 94

Query: 128 AHRGLARQAYHAQAPALRTALHHPDAAERAYRCDQLASYRD-HPFYPTARAKAGLDAAEL 186
+ L L+ + D+L HP + + + G L
Sbjct: 95 HMQDLYATLLGDLQ-LLKARRGLSASDLINLNADRLQCLLSGHPKFVFNKGRRGWGKEAL 153

Query: 187 RHYAPEFAPTFALHWLAIPQALAQCTSAAP------------AELWPDFASLGLPPELAA 234
YAPE+A TF LHWLA+ + + + F+ + L
Sbjct: 154 ERYAPEYANTFRLHWLAVKREHMIWRCDNEMDIHQLLTAAMDPQEFARFSQVWQENGLDH 213

Query: 235 THLPWPVHPLMWERLEQEGFA--LPEDVLR----APNAWLDVRPSLSVRTLVPPQHPQ-L 287
LP PVHP W++ F E + + W S+RTL L
Sbjct: 214 NWLPLPVHPWQWQQKIATDFIADFAEGRMVSLGEFGDQW---LAQQSLRTLTNASRRGGL 270

Query: 288 HLKLPIPMRTLGALNLRLIKPSTLYDGHWMERALRHIDALDPALQGRCVFV-DESHGGHV 346
+KLP+ + R I + G R L+ + A D L + E G+V
Sbjct: 271 DIKLPLTIYNTSC--YRGIPGRYIAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYV 328

Query: 347 -------------GQTRHLAYLVRRYPAL---DDATLVPVAALCAPMPDGRPMAIHLAER 390
L + R P D + V +A L + +P+A +R
Sbjct: 329 SHEGYAALARAPYRYQEMLGVIWRENPCRWLKPDESPVLMATLMECDENNQPLAGAYIDR 388

Query: 391 FAHGDVLRWWRDYTELLLAVHLRLWLGYGIALEANQQNSVLVYSDGQATRLLMKDN-DAA 449
D W +++ L YG+AL A+ QN L +G R+L+KD
Sbjct: 389 SGL-DAETWLTQLFRVVVVPLYHLLCRYGVALIAHGQNITLAMKEGVPQRVLLKDFQGDM 447

Query: 450 RIALPQLRAALPELDAL 466
R+ + PE+D+L
Sbjct: 448 RLVKEE----FPEMDSL 460


32  XADLMG695_RS22250  XADLMG695_RS10130  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS22250-1123.300566hypothetical protein
XADLMG695_RS100600123.457239ATP-dependent DNA helicase
XADLMG695_RS232651113.927045tRNA
XADLMG695_RS10070-1123.340711ADP-ribosylglycohydrolase family protein
XADLMG695_RS10075-2133.850632energy transducer TonB
XADLMG695_RS100800134.097561glutathione synthase
XADLMG695_RS100850123.146012twitching motility response regulator PilG
XADLMG695_RS10095-3112.307367response regulator
XADLMG695_RS10100-2101.555289chemotaxis protein CheW
XADLMG695_RS101051130.642350methyl-accepting chemotaxis protein
XADLMG695_RS101103121.473706Hpt domain-containing protein
XADLMG695_RS101153111.754684glutamate methyltransferase
XADLMG695_RS101202112.117041chemotaxis protein CheW
XADLMG695_RS101252112.295453GNAT family N-acetyltransferase
XADLMG695_RS101302102.183850hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS10080  BACINVASINC  29  0.006  Salmonella/Shigella invasin protein C signature.
		>BACINVASINC#Salmonella/Shigella invasin protein C signature.

Length = 409

Score = 29.5 bits (65), Expect = 0.006
Identities = 25/97 (25%), Positives = 40/97 (41%), Gaps = 6/97 (6%)

Query: 69 RDTAKSKRQAGDLAGAAAALDQALGLVSGDPAILQERAEVSVLQADWPAAERFAKQAIDL 128
R A+ + GDL + + S A QER+E + Q + A + +A +
Sbjct: 315 RIDARKMQMTGDLIMKNSVTVGGIAGASRQYAATQERSEQQISQVNNRVASTASDEARES 374

Query: 129 GSKTGPLCRRHWATIEQSRLARGEKENAASAKAQIAG 165
K+ L + T+E ++ ASA A IAG
Sbjct: 375 SRKSTSLIQEMLKTMESI------NQSKASALAAIAG 405


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS10105  PF03544  132  3e-39  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 132 bits (332), Expect = 3e-39
Identities = 42/262 (16%), Positives = 88/262 (33%), Gaps = 37/262 (14%)

Query: 11 MDDGRRLMMTLVISLLLHGVLILGVGFAVSEDAPLVPTLDVIFSQTSAPLTPKQADFLAQ 70
+D RR ++S+ +HG ++ G+ + +P AP P +A
Sbjct: 8 LDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELP----------APAQPISVTMVAP 57

Query: 71 ANQQGGGDHDTAQRPRDSQPGVVPQDRTGLAPQAQRATSVNAPEPTQTRVVTSRRGEQAV 130
A D P P+ P+ + P +
Sbjct: 58 A--------DLEPPQAVQPP---PEPVVEPEPEPEPIPEPPKEAPVV------------I 94

Query: 131 PTPQPNPQTDPLTPAEAQRIQRDAEMARLAAEVHLRSEQYAKRPNRKFVSASTREYAYAN 190
P+P P+ P + ++ +RD + + A+ + +A+++
Sbjct: 95 EKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVA 154

Query: 191 YLRAWVDRAERVGNLNYPDDARRRRLGGKVVISVGVRRDGSVESSRVLVSSGVPLLDDAA 250
+ R YP A+ R+ G+V + V DG V++ ++L + + +
Sbjct: 155 SGPRALSRN----QPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREV 210

Query: 251 LRVVQLAQPFPPLPKTKDDVDI 272
++ + P P + V+I
Sbjct: 211 KNAMRRWRYEPGKPGSGIVVNI 232


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS10115  HTHFIS  73  2e-18  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.3 bits (180), Expect = 2e-18
Identities = 28/115 (24%), Positives = 49/115 (42%), Gaps = 2/115 (1%)

Query: 15 KVMVIDDSKTIRRTAETLLKREGCEVVTATDGFEALAKIADQQPQIIFVDIMMPRLDGYQ 74
++V DD IR L R G +V ++ IA ++ D++MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 75 TCALIKGNQLFKSTPVIMLSSKDGLFDKARGRIVGSEQYLTKPFTREELLSAIRT 129
IK + PV+++S+++ + G+ YL KPF EL+ I
Sbjct: 65 LLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS10120  HTHFIS  88  1e-23  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.0 bits (218), Expect = 1e-23
Identities = 35/116 (30%), Positives = 57/116 (49%), Gaps = 2/116 (1%)

Query: 2 ARIILIEDSPTDRAVFSQWLEKAGHTVVATDNAEEGLELVRSQAPDLVLMDVVLPGMSGF 61
A I++ +D R V +Q L +AG+ V T NA + + DLV+ DVV+P + F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 QATRALARDQATKDIPVLLVSTKGMETDRAWGLRQGASDYIVKPPREDDLIARIRQ 117
+ + + D+PVL++S + +GA DY+ KP +LI I +
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS10135  HTHFIS  68  4e-13  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.5 bits (165), Expect = 4e-13
Identities = 24/116 (20%), Positives = 53/116 (45%), Gaps = 2/116 (1%)

Query: 2276 QVPLVMVVDDSLTMRKVTSRVLERHNLDVTTARDGVEALELLEERVPDLMLLDIEMPRMD 2335
++V DD +R V ++ L R DV + + DL++ D+ MP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 2336 GYELATAMRADPRFKAVPIVMITSRSGEKHRQRAFEIGVQRYLGKPYQELDLMRNV 2391
++L ++ +P++++++++ +A E G YL KP+ +L+ +
Sbjct: 62 AFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115


33  XADLMG695_RS10270  XADLMG695_RS10335  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS10270214-0.016833glycine cleavage system protein GcvH
XADLMG695_RS10275215-1.185097hypothetical protein
XADLMG695_RS10280720-2.310363hypothetical protein
XADLMG695_RS10285316-1.255990hypothetical protein
XADLMG695_RS10290313-1.426708serine hydrolase
XADLMG695_RS10295512-1.359581MFS transporter
XADLMG695_RS23270110-1.165419DUF1304 domain-containing protein
XADLMG695_RS10300-111-0.479180acyl-CoA dehydrogenase
XADLMG695_RS10310012-1.685725alpha/beta hydrolase
XADLMG695_RS10315113-1.767112TetR/AcrR family transcriptional regulator
XADLMG695_RS10320113-1.369733TonB-dependent receptor
XADLMG695_RS10325113-1.468689hypothetical protein
XADLMG695_RS10330213-1.630906Hsp33 family molecular chaperone HslO
XADLMG695_RS10335213-1.843448monofunctional biosynthetic peptidoglycan
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS10310  TCRTETA  39  4e-05  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 38.7 bits (90), Expect = 4e-05
Identities = 56/312 (17%), Positives = 107/312 (34%), Gaps = 41/312 (13%)

Query: 76 FTLQVLFTCTFLIMVLLQPVYGALVSRYPRR-VFLPGVYGFFIATLLL-----FYVLFDS 129
+L L+ PV GAL R+ RR V L + G + ++ +VL+
Sbjct: 43 AHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIG 102

Query: 130 GVPG--RGMAFFLWVTVFNLFAVAVFWSFMADVFSNAQARSYYGYIGAAGTLGAFLGPVL 187
+ G AV +++AD+ + ++G++ A G GPVL
Sbjct: 103 RIVAGITGATG------------AVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVL 150

Query: 188 TRVLVERIGIAHLMLVSAGFLAVCVVCVLRLRLWAVAREQEGQLSSGEVPMGGDVLGGLK 247
++ H +A L L E + L +
Sbjct: 151 GGLMGG-FS-PHAPFFAAAALNGLNFLTGCFLL----PESHKGERRPLRREALNPLASFR 204

Query: 248 LIVREPLLRWLAFMVLFGVGVGTLLYNEQAALVRRLYTDAAAATAYYSSIDLAIN----- 302
++ L V L + A + ++ + ++ + + I+
Sbjct: 205 WARGMTVVAALMA-----VFFIMQLVGQVPAALWVIFGE---DRFHWDATTIGISLAAFG 256

Query: 303 ALALVLQLLVTRALLSRFGIAPALLIPGVAIMLGYAALAASPLPMMIAIVQVITRSSEFA 362
L + Q ++T + +R G AL++ +A GY LA + M + V+ S
Sbjct: 257 ILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLAS--GG 314

Query: 363 LAKPARETLYTR 374
+ PA + + +R
Sbjct: 315 IGMPALQAMLSR 326


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS10330  HTHTETR  62  3e-14  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.3 bits (151), Expect = 3e-14
Identities = 32/187 (17%), Positives = 66/187 (35%), Gaps = 14/187 (7%)

Query: 29 QAALDLIAEQGVGAVAVEPLARRLGVTKGSFYWHFPSRDALLQAALERWEIFEQKEVFGS 88
AL L ++QGV + ++ +A+ GVT+G+ YWHF + L E E +
Sbjct: 18 DVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEY 77

Query: 89 LEDVP-DPSARLRA----LFQLVAHEVKPHVIYSELLKALDHPAVRPVIDRVSQRRLDYL 143
P DP + LR + + E + ++ + + V+ + +
Sbjct: 78 QAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLES 137

Query: 144 IASFRQ---AGLSR------TDAQHRARLAYAAYVGFLQLSLQLQQPKPAREDFEAYVEH 194
Q + + A + G ++ L Q +++ YV
Sbjct: 138 YDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAI 197

Query: 195 VIQTLIP 201
+++ +
Sbjct: 198 LLEMYLL 204


34  XADLMG695_RS11365  XADLMG695_RS11400  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS113650103.095961DNA-3-methyladenine glycosylase 2 family
XADLMG695_RS11370193.864968hypothetical protein
XADLMG695_RS113750104.229397hypothetical protein
XADLMG695_RS113801114.496844DUF4019 domain-containing protein
XADLMG695_RS113851114.279554MAPEG family protein
XADLMG695_RS11390-1103.026110hypothetical protein
XADLMG695_RS113953152.370684RNA-binding S4 domain-containing protein
XADLMG695_RS114002141.947055hypothetical protein
35  XADLMG695_RS11785  XADLMG695_RS11810  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS11785312-1.68848630S ribosomal protein S12 methylthiotransferase
XADLMG695_RS11790310-1.968139transcription elongation factor GreB
XADLMG695_RS11795410-1.979662rhomboid family intramembrane serine protease
XADLMG695_RS1180049-2.451965DUF4013 domain-containing protein
XADLMG695_RS11805511-2.801110GDP-mannose pyrophosphatase NudK
XADLMG695_RS11810214-0.456984DUF853 family protein
36  XADLMG695_RS12150  XADLMG695_RS12275  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS12150112-3.214898pilus assembly protein
XADLMG695_RS12160215-4.165477type IV pilin protein
XADLMG695_RS12165217-4.540245*GspH/FimT family pseudopilin
XADLMG695_RS12170320-5.883270excinuclease ABC subunit UvrB
XADLMG695_RS12175426-6.983209*hypothetical protein
XADLMG695_RS12180341-7.979830type IV secretory system conjugative DNA
XADLMG695_RS12185248-9.043187TcpQ domain-containing protein
XADLMG695_RS12190434-7.794798type IV secretion system protein
XADLMG695_RS12195428-7.123064TrbG/VirB9 family P-type conjugative transfer
XADLMG695_RS12210418-5.675637hypothetical protein
XADLMG695_RS12215522-7.588512P-type DNA transfer ATPase VirB11
XADLMG695_RS12225538-9.806495lytic transglycosylase domain-containing
XADLMG695_RS12230543-11.175449TrbC/VirB2 family protein
XADLMG695_RS12240762-13.531223VirB3 family type IV secretion system protein
XADLMG695_RS12245867-15.144050VirB4 family type IV secretion/conjugal transfer
XADLMG695_RS12250867-15.902795alpha-glucosidase
XADLMG695_RS12255654-13.637752hypothetical protein
XADLMG695_RS12260652-12.829411TonB-dependent receptor
XADLMG695_RS12265229-9.195954hypothetical protein
XADLMG695_RS12270227-8.550219glycoside hydrolase family 97 protein
XADLMG695_RS12275019-6.310172Six-hairpin glycosidase-like protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS12200  BCTERIALGSPG  50  7e-11  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.
Length = 145

Score = 50.3 bits (120), Expect = 7e-11
Identities = 25/106 (23%), Positives = 49/106 (46%), Gaps = 18/106 (16%)

Query: 1 MKRTAAQVRGFTLIELMIVVAVVAILSAIAYPSYTEHVRKSRRAQAKVDLVEYGQLAERF 60
M+ T Q RGFTL+E+M+V+ ++ +L+++ P+ + K+ + +A D+V
Sbjct: 1 MRATDKQ-RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIV--------- 50

Query: 61 HTVQNTYSGFTLPTNVSPR-EGGTAAYTLALTQQ------TQSGYV 99
++N + L + P G + A T + GY+
Sbjct: 51 -ALENALDMYKLDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYI 95


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS12210  BCTERIALGSPH  29  0.006  Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.
Length = 170

Score = 29.1 bits (65), Expect = 0.006
Identities = 16/107 (14%), Positives = 40/107 (37%), Gaps = 13/107 (12%)

Query: 7 LSARGYTAVQLLIVMAVIGIGAAIGVPSFKSLIEWQRATTRVHLLTAHLAMARSLAVTQG 66
+ RG+T +++++++ ++G+ A + + +F + + A + A L + + G
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRD-DSAAQTLARFEAQLRFVQQRGLQTG 59

Query: 67 EPVSLCPSTDGTRCRTDRIWSQGWILFKDPGRGGQPPTSASVIRAEY 113
+ + + W R G P A + Y
Sbjct: 60 QFFGV------------SVHPDRWQFLVLEARDGADPAPADDGWSGY 94


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS12240  PF04335  212  3e-70  VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 212 bits (542), Expect = 3e-70
Identities = 52/230 (22%), Positives = 104/230 (45%), Gaps = 12/230 (5%)

Query: 14 QVGAAVQKAVNYEVSIADLARRSEKRAWIVATLSMLVTVMTAGGYYYMLPLKEKVPYLVM 73
++ A ++A ++E A RS+K AW+VA ++ + + PLK PY++
Sbjct: 9 ELKAYFEEAASWERDKLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVIT 68

Query: 74 ADAYSGTSTIAKLEPNFGGRAISTSEALARSNIARFIIARESYDASNISDRDWNTVVAMA 133
D +G ++I G I+ EA+ + +A ++ RE + A+ + ++ V+ M+
Sbjct: 69 VDRNTGEASI--AAKLHGDATITYDEAVRKYFLATYVRYREGWIAAAREE-YFDAVMVMS 125

Query: 134 TTGVLAEYRALHAANNAARPFNVYGRNRAIRISILSITLIGGKGKPFTGATVRFQRSLYD 193
+ + +N P N+ + + I ++ +GG A V F +
Sbjct: 126 ARPEQDRWSRFYKTDNPQSPQNILANRTDVFVEIKRVSFLGGN-----VAQVYFTKESVT 180

Query: 194 KSSTVSTLLDNKIATMEFAYQDNLQMSDDLRVENPLGFRVSDYRVDNDYS 243
S++ T + +AT+++ D + R +NPLG++V YR D +
Sbjct: 181 GSNSTKT---DAVATIKYKV-DGTPSKEVDRFKNPLGYQVESYRADVEVP 226


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS12245  TYPE4SSCAGX  36  1e-04  Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.
Length = 522

Score = 35.9 bits (82), Expect = 1e-04
Identities = 26/89 (29%), Positives = 42/89 (47%), Gaps = 10/89 (11%)

Query: 44 TGLGITTQVELSPNEKILDYSTGFTGGWELTRRENVFYLKPKNVDVD-------TNMMIR 96
T L T ++L +E I +TGF GW + N +++PK+V + N +
Sbjct: 59 TSLDNVTVIQLEKDETISYITTGFNKGWSIVPNSNHIFIQPKSVKSNLMFEKEAVNFALM 118

Query: 97 TATHSYILELK---VVATDWQRLEQAKQA 122
T + L+ K V A D + LE+ K+A
Sbjct: 119 TRDYQEFLKTKKLIVDAPDPKELEEQKKA 147



Score = 28.6 bits (63), Expect = 0.027
Identities = 10/27 (37%), Positives = 17/27 (62%)

Query: 165 YDYDYSTRTKKSWLVPSRVYDDGKFTY 191
Y+Y + + ++PS ++DDG FTY
Sbjct: 401 YNYYQAPEKRSKHIMPSEIFDDGTFTY 427


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS12255  MYCMG045  32  0.004  Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 31.6 bits (71), Expect = 0.004
Identities = 17/37 (45%), Positives = 23/37 (62%), Gaps = 3/37 (8%)

Query: 187 TTFMKALVNHIP--NEERLVTIEDARELFISQPNSVH 221
T +KA+V H N+ RLV I+DAR +F S N V+
Sbjct: 171 TDVIKAIVKHKDRFNDNRLVFIDDARTIF-SLANIVN 206


37  XADLMG695_RS12535  XADLMG695_RS12560  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS12535-3113.409969proline racemase family protein
XADLMG695_RS12540-1113.682004FAD-binding oxidoreductase
XADLMG695_RS125450114.460810(2Fe-2S)-binding protein
XADLMG695_RS125502153.972384FAD-dependent oxidoreductase
XADLMG695_RS125553154.640914dihydrodipicolinate synthase family protein
XADLMG695_RS125603144.248300aldehyde dehydrogenase family protein
38  XADLMG695_RS12655  XADLMG695_RS12670  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS12655315-3.10510016S rRNA (guanine(966)-N(2))-methyltransferase
XADLMG695_RS12660215-3.140246pantetheine-phosphate adenylyltransferase
XADLMG695_RS23295418-3.451757hypothetical protein
XADLMG695_RS12665314-2.328849YfhL family 4Fe-4S dicluster ferredoxin
XADLMG695_RS22385313-2.278393gamma-glutamyltransferase
XADLMG695_RS12670214-1.225330glycoside hydrolase family 9 protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS12685  LPSBIOSNTHSS  210  1e-72  Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.
Length = 166

Score = 210 bits (535), Expect = 1e-72
Identities = 72/152 (47%), Positives = 105/152 (69%)

Query: 9 AVYPGTFDPITNGHIDLVNRAAPLFERVVVGVAYSPSKGPALSLERRVALAQEALAAHTN 68
A+YPG+FDPIT GH+D++ R LF++V V V +P+K P S++ R+ +A+A N
Sbjct: 3 AIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLPN 62

Query: 69 VEVRGFDTLLAHFVREMGAGVLLRGLRAVSDFEYEFQMASMNRHLIPEVETLFLTPAEQY 128
+V F+ L ++ R+ AG +LRGLR +SDFE E QMA+ N+ L ++ET+FLT + +Y
Sbjct: 63 AQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTSTEY 122

Query: 129 SFISSSLVREIARLGGDVSGFVPASVVEALRQ 160
SF+SSSLV+E+AR GG+V FVP+ V AL
Sbjct: 123 SFLSSSLVKEVARFGGNVEHFVPSHVAAALYD 154


39  XADLMG695_RS12775  XADLMG695_RS12810  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS12775112-4.103195DUF1629 domain-containing protein
XADLMG695_RS12780213-4.520190hypothetical protein
XADLMG695_RS12785528-8.460189hypothetical protein
XADLMG695_RS12790636-4.992007carbohydrate porin
XADLMG695_RS12795634-4.403005PTS fructose transporter subunit IIBC
XADLMG695_RS22400424-3.3650481-phosphofructokinase
XADLMG695_RS23305218-1.740205phosphoenolpyruvate--protein phosphotransferase
XADLMG695_RS12800215-1.014424LacI family DNA-binding transcriptional
XADLMG695_RS128050121.159167multidrug efflux RND transporter permease
XADLMG695_RS12810-1183.015542efflux RND transporter periplasmic adaptor
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS12805  IGASERPTASE  54  6e-09  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 53.5 bits (128), Expect = 6e-09
Identities = 37/238 (15%), Positives = 78/238 (32%), Gaps = 20/238 (8%)

Query: 725 QARMQASVAAQARQEREQQERVAQEQHVAQVREHLQQAQPEHE-DRSQSEQAVQAQAVLE 783
A + Q + E+ E+ A E AQ RE ++A+ + + +E A E
Sbjct: 1036 TTETVAENSKQESKTVEKNEQDATET-TAQNREVAKEAKSNVKANTQTNEVAQSGSETKE 1094

Query: 784 GQRQAEQQRELEERQVQERQADNQQREQQDRQAQETRQVEAQEGQARQAQDQQQQTQALE 843
Q ++ E++ + + + E+ + T QV ++ Q+ Q Q + + +
Sbjct: 1095 TQTTETKETATVEKEEKAK----VETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREND 1150

Query: 844 PTQDQRQQASQQPDTQLHAPELALTQQTTLPQSQEDACSRLETQNQPANERLAPDAHDSL 903
PT + ++ SQ T + + T ++ + + +
Sbjct: 1151 PTVNIKEPQSQTNTT----ADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPAT 1206

Query: 904 KQTSEAGDAQSHLAQGAERALESQAVQSRDTARIQVPLSEGRESGNPPLQSAQADAVS 961
Q + ++ + R++ S S N A D S
Sbjct: 1207 TQPTVNSESSNKPKNRHRRSVRSVPHNVEPAT----------TSSNDRSTVALCDLTS 1254


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits       Score  E-value  Comments
XADLMG695_RS12810  FLAGELLIN  29     0.012    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 29.2 bits (65), Expect = 0.012
Identities = 19/75 (25%), Positives = 34/75 (45%), Gaps = 7/75 (9%)

Query: 7 AAMAEMMATLNASNTSLQETITVLTTLVASMQQREQRLRDV-VAEQ------LQVLQRAA 59
+ + + ++L A IT L V ++ R+ D A + Q+LQ+A
Sbjct: 429 SKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAG 488

Query: 60 SSADAKVNRVLENAL 74
+S A+ N+V +N L
Sbjct: 489 TSVLAQANQVPQNVL 503


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS12830  PHPHTRNFRASE  577    0.0      Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 577 bits (1488), Expect = 0.0
Identities = 208/568 (36%), Positives = 321/568 (56%), Gaps = 11/568 (1%)

Query: 274 AIVGIGASPGVAIGIVHRLRAAQTEVADQPV-GLGDGGAQLHDALTRTRQQLAAIQDDTQ 332
I GI AS GVAI ++ + + +L AL +++++L AI+D T+
Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTE 63

Query: 333 RRLGASDAAIFKAQAELLNDTDLITR-TCQLMVEGHGVAWSWHQAVEQMASGLAALGNPV 391
+GA A IF A +L+D +L+ ++ E ++ + + S ++ N
Sbjct: 64 ASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNEY 123

Query: 392 LAGRAADLRDVGRRVLAQLDPAAAGAGLTDLPEQPCILLASDLSPSDTANLDTARVLGLA 451
+ RAAD+RDV +RVL L G+ L + E +++A DL+PSDTA L+ V G A
Sbjct: 124 MKERAADIRDVSKRVLGHLIGVETGS-LATIAE-ETVIIAEDLTPSDTAQLNKQFVKGFA 181

Query: 452 TAQGGPTSHTAILSRTLGLPALVAAGGQLLDIEDGVTAIIDGSSGRLYIDPSAQDLDAAR 511
T GG TSH+AI+SR+L +PA+V I+ G I+DG G + ++P+ +++ A
Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241

Query: 512 THIAEQQAIREREAAQRALPAETSDGHHIDIGANVNLPDQVAMALTQGAEGVGLMRTEFL 571
A + ++ A P+ T DG H+++ AN+ P V L G EG+GL RTEFL
Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301

Query: 572 FLESGRTPSEDEQHATYLAMAQALDGRPLIVRALDIGGDKQVAHLELPHEENPFLGVRGA 631
+++ + P+E+EQ Y + Q +DG+P+++R LDIGGDK++++L+LP E NPFLG R
Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361

Query: 632 RLLLRRPDLLEPQLRALYRAAKDGARLSIMFPMITSVPELVALRAICARIRVDLDA---- 687
RL L + D+ QLRAL RA+ G L +MFPMI ++ EL +AI + L +
Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYG-NLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVD 420

Query: 688 --PEVPIGIMIEVPAAAAQADVLARHADFFSIGTNDLTQYVLAIDRQNPELAAEADSLHP 745
+ +GIM+E+P+ A A++ A+ DFFSIGTNDL QY +A DR N ++ HP
Sbjct: 421 VSDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHP 480

Query: 746 AVLRMIRSTIDGARKHERWVGVCGGLAGDAFGASLLAGLGVQELSMTPNDIPAVKARLRG 805
A+LR++ I A +WVG+CG +AGD LL GLG+ E SM+ I +++L
Sbjct: 481 AILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLK 540

Query: 806 AALSQLQQLAEQALACETAEQVRALEAK 833
+ +L+ A++AL +TAE+V L K
Sbjct: 541 LSKEELKPFAQKALMLDTAEEVEQLVKK 568


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS12840  ACRIFLAVINRP  1081   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1081 bits (2796), Expect = 0.0
Identities = 518/1038 (49%), Positives = 706/1038 (68%), Gaps = 17/1038 (1%)

Query: 1 MPKFFIEHPVFAWVVAILISLAGVISILNLGIESYPTIAPPQVTVTANFPGASADTAEKA 60
M FFI P+FAWV+AI++ +AG ++IL L + YPTIAPP V+V+AN+PGA A T +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQQLTGIDHLLYFNSSSAANGRVTITLTFETGTDADIAQVQVQNKVSLATPRLPS 120
VTQVIEQ + GID+L+Y +S+S + G VTITLTF++GTD DIAQVQVQNK+ LATP LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVTQQGVVVAKANAGFLMVAALRSDNPSINRDALNDIVGSRVLEQISRVPGVGSTNQFGA 180
EV QQG+ V K+++ +LMVA SDNP +D ++D V S V + +SR+ GVG FGA
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 EYAMNIWLNPEKLQGYNLSATQVLTAVRNQNVQFAAGSVGADPTPEGISFTATVSAEGRF 240
+YAM IWL+ + L Y L+ V+ ++ QN Q AAG +G P G A++ A+ RF
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 SSPEQFENIILRTDNNGATVRLKDVARVTVGPSNYGFDTQYNGKPTGAFGIQLLPGANAL 300
+PE+F + LR +++G+ VRLKDVARV +G NY + NGKP GI+L GANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 NVSEAVGAKLDELQPTFPQGVTWFAPYESTTFVRISIEEVIHTLVEAIVLVFLVMLLFLQ 360
+ ++A+ AKL ELQP FPQG+ PY++T FV++SI EV+ TL EAI+LVFLVM LFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATVIPTLVIPVALLGTFFGMYMIGFTINQLTLFAMVLAIGIVVDDAIVVIENVERIM 420
N RAT+IPT+ +PV LLGTF + G++IN LT+F MVLAIG++VDDAIVV+ENVER+M
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 SEEHLEPKAATQKAMTQITGAVVAITVVLAAVFIPSSLQPGASGAIYKQFALTIAMSMGF 480
E+ L PK AT+K+M+QI GA+V I +VL+AVFIP + G++GAIY+QF++TI +M
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SAFLALSFTPALCAAFLK---STHSTKKNWVYRTFDKYYDKLAHRYVGVVGHTLKRSPPW 537
S +AL TPALCA LK + H K + F+ +D + Y VG L + +
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 538 MIAFVALVVLCGFLFTRMPGSFLPEEDQGFAVAIVQLPPGATKIRTNEAFAQMRAVLEKQ 597
++ + +V LF R+P SFLPEEDQG + ++QLP GAT+ RT + Q+ K
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 598 PA--VEGMLQIAGFSFLGSGENVGMGFIRLKPWEERDV---TAEQLIQQLNGAFYGIKGA 652
VE + + GFSF G +N GM F+ LKPWEER+ +AE +I + I+
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 653 QIFVVNLPTVQGLGQFGGFDMWLQDRSGAGQEALINARNIVLGKAAEKQDALVGVRPNGL 712
+ N+P + LG GFD L D++G G +AL ARN +LG AA+ +LV VRPNGL
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 713 ENSPQLQLHVDRVQAQSMGLDVSDIYSSIQLMLAPVYVNDYFAEGRIKRVNMRADDQFRA 772
E++ Q +L VD+ +AQ++G+ +SDI +I L YVND+ GR+K++ ++AD +FR
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 773 GPESLRDFFTPSATATGADGQPAMIPLSNVVKAEWNYASPALNRYNGYSAVNIVGNPAPG 832
PE + + S A+G+ M+P S + W Y SP L RYNG ++ I G APG
Sbjct: 781 LPEDVDKLYVRS-----ANGE--MVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPG 833

Query: 833 GSSGQAMSAMEDIVNNDLPPGFGFDWSGMSYQEIIAGNAATLLLALSVVVVFLCLAALYE 892
SSG AM+ ME++ + LP G G+DW+GMSYQE ++GN A L+A+S VVVFLCLAALYE
Sbjct: 834 TSSGDAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYE 892

Query: 893 SWSIPVAVLLVVPIGVLGAITFSMLRGLPNDLYFKIGMITVIGLAAKNAILIVEFAVE-Q 951
SWSIPV+V+LVVP+G++G + + L ND+YF +G++T IGL+AKNAILIVEFA +
Sbjct: 893 SWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLM 952

Query: 952 RAAGKTLREATLEAAHLRFRPILMTSFAFILGVLPLAISTGAGANSRHSIGTGVIGGMVF 1011
GK + EATL A +R RPILMTS AFILGVLPLAIS GAG+ +++++G GV+GGMV
Sbjct: 953 EKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVS 1012

Query: 1012 ATVLGVIFIPLFFVVVRR 1029
AT+L + F+P+FFVV+RR
Sbjct: 1013 ATLLAIFFVPVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits       Score  E-value  Comments
XADLMG695_RS12845  RTXTOXIND  40     1e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 40.2 bits (94), Expect = 1e-05
Identities = 17/108 (15%), Positives = 37/108 (34%)

Query: 59 RSADVRARVDGVVLKRLYTEGANVTEGQPLFQIDPSQLKATLLQAQGQLAAAEATYTNAK 118
RS +++ + +V + + EG +V +G L ++ +A L+ Q L A T +
Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154

Query: 119 IAATRARSLAPQQYVSRADIDTAEANERSSGANVQQARGAVEAARIQL 166
I + + + +E + + Q
Sbjct: 155 ILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQK 202



Score = 32.1 bits (73), Expect = 0.004
Identities = 13/51 (25%), Positives = 24/51 (47%), Gaps = 4/51 (7%)

Query: 59 RSADVRARVDGVVLK-RLYTEGANVTEGQPLFQIDPSQLKATLLQAQGQLA 108
+++ +RA V V + +++TEG VT + L I P L+ +
Sbjct: 326 QASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPED---DTLEVTALVQ 373


40  XADLMG695_RS13525  XADLMG695_RS13545  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag           DNBias  CDNBias  %GCBias     Product
XADLMG695_RS13525  3       15       -0.754493   chemotaxis response regulator CheY
XADLMG695_RS13530  3       15       -1.471386   RNA polymerase sigma factor FliA
XADLMG695_RS13535  6       15       -2.243118   MinD/ParA family protein
XADLMG695_RS22465  5       37       -6.454698   flagellar biosynthesis protein FlhF
XADLMG695_RS13540  3       35       -5.603658   flagellar biosynthesis protein FlhA
XADLMG695_RS22470  2       20       -3.470032   flagellar biosynthesis protein FlhB
XADLMG695_RS13545  2       15       -2.093992   bifunctional diguanylate
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits    Score  E-value  Comments
XADLMG695_RS13570  HTHFIS  93     2e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 93.0 bits (231), Expect = 2e-25
Identities = 31/105 (29%), Positives = 50/105 (47%), Gaps = 3/105 (2%)

Query: 6 RILIVDDFSTMRRIVKNLLGDLGFTNTAEAEDGNSALAALRAGPFDFVVTDWNMPGMTGI 65
IL+ DD + +R ++ L G+ + + + AG D VVTD MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 66 DLLRNIRADAKLKHLPVMMVTAEAKREQIIEAAQCGVNGYIIKPF 110
DLL I+ LPV++++A+ I+A++ G Y+ KPF
Sbjct: 64 DLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS13595  TYPE3IMSPROT  347    e-120    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 347 bits (891), Expect = e-120
Identities = 104/344 (30%), Positives = 182/344 (52%), Gaps = 2/344 (0%)

Query: 8 GERTELPTEKRLREAREQGNIPQSRELSTAAVFGAGVFALMVLARGIGDGAAVWMKTALS 67
GE+TE PT K++R+AR++G + +S+E+ + A+ A LM L+ + + M +
Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLM--LIP 60

Query: 68 PDPKMRENPMALFGHFGDLLLQLLWVMLPLIGICLAAGLAGPLMMSGLRFSGKAIMPDLS 127
+ AL ++LL+ ++ PL+ + +A ++ G SG+AI PD+
Sbjct: 61 AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIK 120

Query: 128 KLNPANGIKRMWGSNSLAELIKSVLRLLFVGLAASFCISKGLHGLRSLVNQPLEQAIGNG 187
K+NP G KR++ SL E +KS+L+++ + + I L L L +E
Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLL 180

Query: 188 LDFTKSLLFYTAGALVLLAAIDAPYQKWNWLRKLKMTREEIKREMKESEGSPEVKGRIRQ 247
+ L+ V+++ D ++ + ++++LKM+++EIKRE KE EGSPE+K + RQ
Sbjct: 181 GQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQ 240

Query: 248 MQMQMSQRQMMEAVPKADVVLMNPTHYAVALKYEGGKMRAPIVVAKGVDEMAFRIREAGE 307
++ R M E V ++ VV+ NPTH A+ + Y+ G+ P+V K D +R+ E
Sbjct: 241 FHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAE 300

Query: 308 QHRVAIVTAPPLARALHREAQIGKEIPVRLYSVVAQVLSYVYQL 351
+ V I+ PLARAL+ +A + IP A+VL ++ +
Sbjct: 301 EEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQ 344


41  XADLMG695_RS13610  XADLMG695_RS13745  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag           DNBias  CDNBias  %GCBias     Product
XADLMG695_RS13610  2       15       2.065950    flagellar assembly protein FliH
XADLMG695_RS13615  3       18       0.311578    flagellar motor switch protein FliG
XADLMG695_RS13620  3       21       -0.070862   flagellar M-ring protein FliF
XADLMG695_RS13625  2       22       1.907975    flagellar hook-basal body complex protein FliE
XADLMG695_RS13635  2       28       3.034095    glycosyltransferase
XADLMG695_RS13640  1       28       3.498278    hypothetical protein
XADLMG695_RS13645  2       30       4.020406    FkbM family methyltransferase
XADLMG695_RS13650  2       28       3.916842    3-deoxy-manno-octulosonate cytidylyltransferase
XADLMG695_RS13655  2       29       3.152748    FkbM family methyltransferase
XADLMG695_RS13660  -2      19       2.633460    acetyltransferase
XADLMG695_RS13665  -2      11       0.446576    aromatic ring-hydroxylating dioxygenase subunit
XADLMG695_RS13670  -2      9        -0.658170   acetyltransferase
XADLMG695_RS13675  -2      10       -1.335034   SDR family oxidoreductase
XADLMG695_RS13680  -2      16       -3.450563   ketoacyl-ACP synthase III
XADLMG695_RS13685  -2      18       -4.057376   acyl carrier protein
XADLMG695_RS23320  3       42       -12.138074  DegT/DnrJ/EryC1/StrS family aminotransferase
XADLMG695_RS13690  3       38       -11.129166  sigma-54-dependent Fis family transcriptional
XADLMG695_RS13700  2       34       -10.270604  response regulator transcription factor
XADLMG695_RS23325  1       30       -8.444070   RNA polymerase factor sigma-54
XADLMG695_RS13705  1       24       -7.207825   response regulator transcription factor
XADLMG695_RS13710  1       14       -3.621557   PilZ domain-containing protein
XADLMG695_RS13715  0       11       -3.439085   hypothetical protein
XADLMG695_RS13720  0       10       -1.556772   flagellar export chaperone FliS
XADLMG695_RS13725  2       10       -1.815190   flagellar filament capping protein FliD
XADLMG695_RS13730  0       11       -1.092820   flagellin
XADLMG695_RS13735  1       11       -0.642532   flagellar hook-associated protein FlgL
XADLMG695_RS13740  1       11       -0.155733   flagellar hook-associated protein FlgK
XADLMG695_RS13745  2       14       0.617202    flagellar assembly peptidoglycan hydrolase FlgJ
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits     Score  E-value  Comments
XADLMG695_RS13665  FLGFLIH  43     3e-07    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 42.9 bits (100), Expect = 3e-07
Identities = 36/159 (22%), Positives = 76/159 (47%), Gaps = 7/159 (4%)

Query: 51 HEGFARGHAEGFAQGQSEVRRLTAQIDGILDNFTRPLARLENEVVGALGELAVRIAGQLV 110
EG A+G +G A+ +S+ + A++ ++ F L L++ + L ++A+ A Q++
Sbjct: 73 QEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVI 132

Query: 111 GRVYQADPQLLADLVGEAVDAVGGAGREVEVRLHPDDITALLPHLAPSSTT---RVAPDM 167
G+ D L + + + + ++R+HPDD+ + L + + R+ D
Sbjct: 133 GQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDP 192

Query: 168 SLSRGDLRVHAESVRIDGTLDARLRAALETVMRKSGAGL 206
+L G +V A+ +G LDA + + + R + G+
Sbjct: 193 TLHPGGCKVSAD----EGDLDASVATRWQELCRLAAPGV 227


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS13670  FLGMOTORFLIG  308    e-106    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 308 bits (791), Expect = e-106
Identities = 106/329 (32%), Positives = 200/329 (60%)

Query: 1 MTGVQRAAVLLLSLGESDAAEVLKHMDPKEVQKIGIAMATMTGISRDQVEKVMDEFNGEL 60
+TG Q+AA+LL+S+G +++V K++ +E++ + +A + I+ + + V+ EF +
Sbjct: 15 LTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKELM 74

Query: 61 AGKTSLGVGADDYIRNVLIQALGADKAGGLIDRILLGRNTTGLDTLKWMDPRAVADLVRN 120
+ + G DY R +L ++LG KA +I+ + + + ++ DP + + ++
Sbjct: 75 MAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQQ 134

Query: 121 EHPQIIAIVMAHLDSDQAAEALKLLPERTRADVLLRIATLDGIPPNALSELNDIMERQFA 180
EHPQ IA+++++LD +A+ L LP + +V RIA +D P + E+ ++E++ A
Sbjct: 135 EHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKLA 194

Query: 181 GNQNLKSSNVGGIKVAANILNFLDTGSDQGVLGEIGKIDADLAGKIQDLMFVFDNLVDLD 240
+ ++ GG+ I+N D +++ ++ + + D +LA +I+ MFVF+++V LD
Sbjct: 195 SLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLLD 254

Query: 241 DRGLQTLLREVSGERLGLALRGADVKVREKITRNMSQRAAEILLEDMEARGPVRLADVEA 300
DR +Q +LRE+ G+ L AL+ D+ V+EKI +NMS+RAA +L EDME GP R DVE
Sbjct: 255 DRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVEE 314

Query: 301 AQKEILTIVRRLADEGAISLGGAGAEAMV 329
+Q++I++++R+L ++G I + G E ++
Sbjct: 315 SQQKIVSLIRKLEEQGEIVISRGGEEDVL 343


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS13675  FLGMRINGFLIF  351    e-116    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 351 bits (902), Expect = e-116
Identities = 187/575 (32%), Positives = 300/575 (52%), Gaps = 45/575 (7%)

Query: 16 KAGQWFDRVRSLQITRKLTMMAMIALAVAAGLAVFFWSQKPGYQSLYTGLDEKGNAEAAD 75
K +W +R+R+ ++ ++ + AVA +A+ W++ P Y++L++ L ++
Sbjct: 11 KPLEWLNRLRANP---RIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVA 67

Query: 76 LLRTAQIPYKIDQGTGAISVPQDRLYDARLKLAGSGLTGKETGGGFELMEKDPGFGVSQF 135
L IPY+ G+GAI VP D++++ RL+LA GL K GFEL++++ FG+SQF
Sbjct: 68 QLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLP-KGGAVGFELLDQEK-FGISQF 125

Query: 136 VESARYQHALETELSRTIGTLRPVREARVHLAIPKPSAFTRQRDVASASVVLELRGGQGL 195
E YQ ALE EL+RTI TL PV+ ARVHLA+PKPS F R++ SASV + L G+ L
Sbjct: 126 SEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRAL 185

Query: 196 ERNQVDAIVNLVASSIPDMTPERVTVVDQSGRMLSIADPNSDAAQHAAQFEQVRRQESSY 255
+ Q+ A+V+LV+S++ + P VT+VDQSG +L+ ++ + AQ + ES
Sbjct: 186 DEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLN-DAQLKFANDVESRI 244

Query: 256 NQRIRELLEPMTGPGRVNPETSVDMDFSVVEEARELYN----GEPAKLRSEQVSD-TSTS 310
+RI +L P+ G G V+ + + +DF+ E+ E Y+ A LRS Q++
Sbjct: 245 QRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVG 304

Query: 311 ATGPQGPPGATSNSPGQPPAPAVAGAPGT--------PAAANGQAAAPATPTESSKSATR 362
A P G PGA SN P PP A P T P + + A P + ++ T
Sbjct: 305 AGYPGGVPGALSNQP-APPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETS 363

Query: 363 NYELDRTLQHTRQPAGRIKRVSVAVLLDNVPRPGAKGKMVEQPLTAAELTRIEGLVKQAV 422
NYE+DRT++HT+ G I+R+SVAV+++ K PLTA ++ +IE L ++A+
Sbjct: 364 NYEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKP----LPLTADQMKQIEDLTREAM 419

Query: 423 GFDAARGDTVSVMNAPFVREAVAGEEGPKWWEDPRVQNGLRLLVGAVVVLALLF----GV 478
GF RGDT++V+N+PF G E P W + + L ++VL + +
Sbjct: 420 GFSDKRGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAG-RWLLVLVVAWILWRKA 478

Query: 479 VRPTLRQLTGVTAVKDKQGKAGKDGTPQSADVRMVEDDDDLMPRLEEDTAQIGQDKKTPI 538
VRP L + +Q + ++ + A + D+ L Q ++
Sbjct: 479 VRPQLTRRVEEAKAAQEQAQVRQET--EEAVEVRLSKDEQL------------QQRRANQ 524

Query: 539 ALPDAYEERMRLAREAVKADSKRVAQVVKGWVASE 573
L E + RE D + VA V++ W++++
Sbjct: 525 RLG--AEVMSQRIREMSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits         Score  E-value  Comments
XADLMG695_RS13680  FLGHOOKFLIE  60     3e-15    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 60.1 bits (145), Expect = 3e-15
Identities = 28/84 (33%), Positives = 48/84 (57%)

Query: 40 AGAQGTPATQAPSFSETLRGAIGGVNEAQQKAGALSKAFEMGDPNADLARVMVASQQSQV 99
A AQ + SF+ L A+ +++ Q A ++ F +G+P L VM Q++ V
Sbjct: 20 ARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASV 79

Query: 100 AFRATVEVRNRLVQAYQDVMNMPL 123
+ + ++VRN+LV AYQ+VM+M +
Sbjct: 80 SMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS13725  DHBDHDRGNASE  109    7e-31    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 109 bits (273), Expect = 7e-31
Identities = 68/260 (26%), Positives = 127/260 (48%), Gaps = 13/260 (5%)

Query: 7 FNPFSLADKRILVSGASSGLGRAIALGCARMGGELIVSGRDPQRLDATLADLRAISERPH 66
N + K ++GA+ G+G A+A A G + +P++L+ ++ L+A +
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 67 QALRADLTVATERASLVAALS---APLHGVVHSAGISRLCPARMVGEAHLREVQATNVDA 123
AD+ + + A + P+ +V+ AG+ R + + + N
Sbjct: 61 A-FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTG 119

Query: 124 PILLTQGLLKRNLIAADGAIVFIASIAAHIGVAGVGAYSASKAALIAYARCLAMEVVKRH 183
++ + K + G+IV + S A + + AY++SKAA + + +CL +E+ + +
Sbjct: 120 VFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179

Query: 184 IRVNCLSPALVDTPLL-------DATAQVV-GSLETERSNYPLG-FGRPDDVANAAIFLL 234
IR N +SP +T + + QV+ GSLET ++ PL +P D+A+A +FL+
Sbjct: 180 IRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 235 SGASRWITGTSLVMDGGLTI 254
SG + IT +L +DGG T+
Sbjct: 240 SGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits     Score  E-value  Comments
XADLMG695_RS13730  PF04183  29     0.028    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 29.1 bits (65), Expect = 0.028
Identities = 16/45 (35%), Positives = 22/45 (48%), Gaps = 4/45 (8%)

Query: 71 ERLQWKREEIDALIVVTQSPDYPIPATAII--LQDRLGLSHATVA 113
ER W IDA + D P+ A ++ L+ L +S ATVA
Sbjct: 51 ERGIWGWLWIDAQTLRCA--DEPVLAQTLLMQLKQVLSMSDATVA 93


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits    Score  E-value  Comments
XADLMG695_RS13745  HTHFIS  437    e-152    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 437 bits (1125), Expect = e-152
Identities = 173/489 (35%), Positives = 249/489 (50%), Gaps = 16/489 (3%)

Query: 1 MSESRILLIDSDAVRAERTVSLLEFMDFNPRWVTDGADINPGRHRHDEWMAVMVGSAQDA 60
M+ + IL+ D DA L ++ R ++ A + R ++V
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLW--RWIAAGDGDLVVTDVVMP 58

Query: 61 -AQADKFFDWLADAKLPPPVLLMEGSPSAFAQTHGLHEANVWALDTPLRHAQLEALLRRA 119
A + A+ PVL+M + + L P +L ++ RA
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 120 S--LKRLDAEHQAGVQQDSGPTGNSEAVTRLRRLIDQVAAFDTTVLVLGESGTGKEVVAR 177
KR ++ + Q G S A+ + R++ ++ D T+++ GESGTGKE+VAR
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVAR 178

Query: 178 AIHQHSPRRDGPFVAINCGAIPPDLLESELFGHEKGAFTGALTTRKGRFEMAEGGTLLLD 237
A+H + RR+GPFVAIN AIP DL+ESELFGHEKGAFTGA T GRFE AEGGTL LD
Sbjct: 179 ALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLD 238

Query: 238 EIGDMSLPMQVKLLRVLQERSFERVGGGQTIRCNVRVIAATHRNLESRISDGQFREDLFY 297
EIGDM + Q +LLRVLQ+ + VGG IR +VR++AAT+++L+ I+ G FREDL+Y
Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYY 298

Query: 298 RLNVFPIEMPALRERVDDLAMLVQTIAGQLARTGRGEVRFADEALQALRGYDWPGNVREL 357
RLNV P+ +P LR+R +D+ LV+ Q + G RF EAL+ ++ + WPGNVREL
Sbjct: 299 RLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVREL 358

Query: 358 TNLVERLAVLHPGGLVRVQDLPARYRGDFASAIPVELPPEPELVTAPVEVSALPSNVVTL 417
NLV RL L+P ++ + + R + + + ++ V
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418

Query: 418 QPKTADAEPSATSSLPDDGIDLRGHMANIELALINEALERTQGVVAHAAQLLGLRRTTLV 477
+A +E LI AL T+G AA LLGL R TL
Sbjct: 419 FG-----------DALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLR 467

Query: 478 EKLRKYGID 486
+K+R+ G+
Sbjct: 468 KKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits    Score  E-value  Comments
XADLMG695_RS13750  HTHFIS  55     3e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 55.2 bits (133), Expect = 3e-12
Identities = 20/118 (16%), Positives = 44/118 (37%), Gaps = 2/118 (1%)

Query: 1 MSKLTVLLVDDHEGFINAAMRHFRKVEWLDIVGSAANGLEAIERSESLRPNVVLMDLAMP 60
M+ T+L+ DD + + + V +N + ++V+ D+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 EMGGLQATRLIKTQDDPPYIVIASHFDDAEHREHALRAGADNFVSKLSYIQEVMPILE 118
+ IK +++ S + A GA +++ K + E++ I+
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits    Score  E-value  Comments
XADLMG695_RS13760  HTHFIS  72     6e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.8 bits (176), Expect = 6e-17
Identities = 35/160 (21%), Positives = 66/160 (41%), Gaps = 9/160 (5%)

Query: 2 RVIIVDDHTLVRAGLSRLLQTFAGIDVVGEASNAQQALDMTSLHRPDLVLMDLSLPGRSG 61
+++ DD +R L++ L + AG DV SNA + DLV+ D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQAL-SRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 LDAMTDVLRAAPRTHVVMMSMHDDPVHVRDALDRGAVGFVVKDAAPLELELALRAAAAGQ 121
D + + +A P V++MS + + A ++GA ++ K EL + A
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA---- 118

Query: 122 VFLSPQISSKMIAPMLGREKPVGIAALSPRQREILREIGR 161
+ + + + + S +EI R + R
Sbjct: 119 ---LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits       Score  E-value  Comments
XADLMG695_RS13785  FLAGELLIN  135    2e-37    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 135 bits (341), Expect = 2e-37
Identities = 124/360 (34%), Positives = 181/360 (50%), Gaps = 10/360 (2%)

Query: 2 AQVINTNVMSLNAQRNLNTSSASMSTSIQRLSSGLRINSAKDDAAGLAISERFTTQIRGL 61
AQVINTN +SL Q NLN S +S+S++I+RLSSGLRINSAKDDAAG AI+ RFT+ I+GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVASRNANDGISLAQTAEGAMVEIGSNLQRIRELSVQSSNATNSSTDRDALNSEVKQLTA 121
ASRNANDGIS+AQT EGA+ EI +NLQR+RELSVQ++N TNS +D ++ E++Q
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 EIDRVANQTNFNGTKLLDGSFSGALFQVGADAGQTIGINSIVDANVDSLGKANFAAAVSG 181
EIDRV+NQT FNG K+L QVGA+ G+TI I + +V SLG F
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQ-MKIQVGANDGETITI-DLQKIDVKSLGLDGFNVNGPK 178

Query: 182 AGVTGTATASGSVSGISLSFNDASGSAKSVTIADVKIAAGDTAADVNKKVASAINDKLDQ 241
G +S ++ + + + + +K +A N +L
Sbjct: 179 EATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTT 238

Query: 242 TGMYASIKSDGSLQIESLKAGQDFTSLSAG--------TSSAAGITVGAGITTASAASGS 293
+ D +S + +++ T G+T T + +G
Sbjct: 239 DDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGK 298

Query: 294 TASTLSSLDISTFSGSQKALEIVDKALTAVNSSRADMGAVQNRFTSTIANLSATSENLSA 353
++T++ ++ A A T +S V +FT + +++
Sbjct: 299 VSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDL 358



Score = 97.4 bits (242), Expect = 4e-24
Identities = 74/340 (21%), Positives = 133/340 (39%), Gaps = 3/340 (0%)

Query: 60 GLDVASRNANDGISLAQTAEGAMVEIGSNLQRIRELSVQSSNATNSSTDRDALNSEVKQL 119
G +V L + + + + +S A + T + +V
Sbjct: 171 GFNVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVN 230

Query: 120 TAEIDRVANQTNFNGTKLLDGSFSGALFQVGADAGQTIGINSIVDANVDSLGKANFAAAV 179
A + N L + A A D G
Sbjct: 231 AANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTK 290

Query: 180 SGAGVTGTATASGSVSGISLSFNDASGSAKSVTIADVKIAAGDTAADVNKKVASAINDKL 239
+G G + + + ++L+ D + A +V A ++ + + VN + K
Sbjct: 291 TGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKN 350

Query: 240 DQTGMYASIKSDGSLQIESLKAGQDFTSLSAGTSSAAGITVGAGITTASAASGSTASTLS 299
+ + ++ + + ++ +T+ + ++ ++
Sbjct: 351 ESAKLSDLEANNAVKGESKITVN---GAEYTANAAGDKVTLAGKTMFIDKTASGVSTLIN 407

Query: 300 SLDISTFSGSQKALEIVDKALTAVNSSRADMGAVQNRFTSTIANLSATSENLSASRSRIR 359
+ + L +D AL+ V++ R+ +GA+QNRF S I NL T NL+++RSRI
Sbjct: 408 EDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIE 467

Query: 360 DTDYAKETAELTRTQILQQAGTAMLAQAKSVPQNVLSLLQ 399
D DYA E + +++ QILQQAGT++LAQA VPQNVLSLL+
Sbjct: 468 DADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits       Score  E-value  Comments
XADLMG695_RS13790  FLAGELLIN  59     2e-11    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 58.5 bits (141), Expect = 2e-11
Identities = 62/349 (17%), Positives = 111/349 (31%), Gaps = 6/349 (1%)

Query: 4 RISTSMMYSQSVASMGAKQSRLNQFESQLSSGQRLVTAKDDPVAAGTAVGLDRALAAITR 63
I+T+ + + ++ QS L+ +LSSG R+ +AKDD A + +T+
Sbjct: 3 VINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQ 62

Query: 64 FGENANNVQNRLGLQENALSQAGDKMARVTELAVQASNSSLSPDDRKAIASELTALRESM 123
NAN+ + E AL++ + + RV EL+VQA+N + S D K+I E+ E +
Sbjct: 63 ASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEI 122

Query: 124 VSLANSTDGTGRYLFGGTADGSAPFIKSNG---SVTYNGDQTQKQVEVAPDTFVSDTLPG 180
++N T G + ++G ++ + +
Sbjct: 123 DRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATV 182

Query: 181 SEIFMRIRTGDGTVDAHANAGNTGTGLLLDFSRDASTGSWNGGSYSVQFTAADTYEVRDS 240
++ + G A + +T V
Sbjct: 183 GDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAE 242

Query: 241 TNTVVGTGTYKEG--EDINAAGVRMRISGAPAVGDSFQIGASGTKDVFSTID-DLVGALN 297
NT V + A + I G G + T D + D + +
Sbjct: 243 NNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTT 302

Query: 298 SDTLTAPQKAAMINTLQTSMRDITQASSKMIDARTSGGAQLSAIDNANA 346
+ A I ++ T SSK + G N
Sbjct: 303 INGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNE 351



Score = 36.6 bits (84), Expect = 1e-04
Identities = 50/269 (18%), Positives = 83/269 (30%), Gaps = 1/269 (0%)

Query: 127 ANSTDGTGRYLFGGTADGSAPFIKSNGSVTYNGDQTQKQVEVAPDTFVSDTLPGSEIFMR 186
AN T D + G+ + DTF + +
Sbjct: 232 ANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKT 291

Query: 187 IRTGDGTVDAHANAGNTGTGLLLDFSRDASTGSWNGGSYSVQFTAADTYEVRDSTNTVVG 246
G+G V N + + A+ + S +T+ + T
Sbjct: 292 GNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNE 351

Query: 247 TGTYKEGEDINAAGVRMRISGAPAVGDSFQIGASGTKDVFSTIDDLVGALNSDTLTAPQK 306
+ + E NA +I+ A + G T + D A TL
Sbjct: 352 SAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFID-KTASGVSTLINEDA 410

Query: 307 AAMINTLQTSMRDITQASSKMIDARTSGGAQLSAIDNANALLESNEVTLKTSLSSIRDLD 366
AA + + I A SK+ R+S GA + D+A L + L ++ S I D D
Sbjct: 411 AAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDAD 470

Query: 367 YASAIGQYQLEKASLQAAQTIFQQMQSSS 395
YA+ + + QA ++ Q
Sbjct: 471 YATEVSNMSKAQILQQAGTSVLAQANQVP 499


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits        Score  E-value  Comments
XADLMG695_RS13795  FLGHOOKAP1  227    7e-69    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 227 bits (580), Expect = 7e-69
Identities = 141/437 (32%), Positives = 220/437 (50%), Gaps = 8/437 (1%)

Query: 2 SIMSTGTSALIAFQRALSTVSHNVANINTEGYSRQRVEFATRTPTDMGYAFVGNGAKITD 61
S+++ S L A Q AL+T S+N+++ N GY+RQ A T +VGNG ++
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSG 61

Query: 62 VGRVADQLAISRLLDSGGELSRLQQLSSLSNRVDALYSNTATNVAGLWSNFFDSTSAVSS 121
V R D ++L + + S L +++D + S + +++A +FF S + S
Sbjct: 62 VQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVS 121

Query: 122 NASSTAERQSMLDSGNSLATRFKQLNGQMDSLSNEVNSGLTSSVDEVNRLTQQIAKLNGT 181
NA A RQ+++ L +FK + + +VN + +SVD++N +QIA LN
Sbjct: 122 NAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQ 181

Query: 182 I----GSSAQAAAPDMLDQRDALVSKLVGFTGGTAVIQDGGFMNVFTAGGQPLVVGTTSS 237
I G A A+ ++LDQRD LVS+L G +QDGG N+ A G LV G+T+
Sbjct: 182 ISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTAR 241

Query: 238 KLVTAADPYEPTKLQVAMQTQGQNVSLSASSL--GGQIGGLLEFRSSVLEPTQAELGRLA 295
+L +P++ VA L G +GG+L FRS L+ T+ LG+LA
Sbjct: 242 QLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLA 301

Query: 296 VGMASTFNAGHSQGMDLYGAMGGNFFNIGSPAVAANPSNTGSASLSASFSNVSAVDGQNV 355
+ A FN H G D G G +FF IG PAV N N G ++ A+ ++ SAV +
Sbjct: 302 LAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATDY 361

Query: 356 TLSFDGTNWKAINASTGSAVPMTGTGTAADPLVLNGVSMVVGGTPASGDKFLLQPTAGLA 415
+SFD W+ ++ + T T A + +G+ + GTPA D F L+P +
Sbjct: 362 KISFDNNQWQVTRLASNTT--FTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAI 419

Query: 416 GSLSVAITDPSRIAAAT 432
++ V ITD ++IA A+
Sbjct: 420 VNMDVLITDEAKIAMAS 436



Score = 82.3 bits (203), Expect = 1e-18
Identities = 38/105 (36%), Positives = 56/105 (53%)

Query: 517 AGSSDNGNAKLLANIDDAKALSGGTVTLNGALSGLTTSVGSAARAASYSADAQKVINDQA 576
AG SDN N + L ++ GG + N A + L + +G+ S+ Q + Q
Sbjct: 440 AGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQL 499

Query: 577 QASRDSISGVNLDEEAANMLKLQQAYQAAAQMISTADTIFQAILG 621
+ SISGVNLDEE N+ + QQ Y A AQ++ TA+ IF A++
Sbjct: 500 SNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits     Score  E-value  Comments
XADLMG695_RS13800  FLGFLGJ  130    5e-37    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 130 bits (327), Expect = 5e-37
Identities = 63/140 (45%), Positives = 82/140 (58%), Gaps = 4/140 (2%)

Query: 218 FVAKIWTHAQKAARELGVDPRALVAQAALETGWGRRGI--GNGGDSNNLFGIKATG-WSG 274
F+A++ AQ A+++ GV ++AQAALE+GWG+R I NG S NLFG+KA+G W G
Sbjct: 152 FLAQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKG 211

Query: 275 DKVTTGTHEYVNGVKTTETADFRAYGSAEESFADYVRLLKNNSRYQTALQAGTDIKGFAR 334
T EY NG A FR Y S E+ +DYV LL N RY A+ + A+
Sbjct: 212 PVTEITTTEYENGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRY-AAVTTAASAEQGAQ 270

Query: 335 GLQQAGYATDPGYAAKIAAI 354
LQ AGYATDP YA K+ +
Sbjct: 271 ALQDAGYATDPHYARKLTNM 290



Score = 71.3 bits (174), Expect = 5e-16
Identities = 57/178 (32%), Positives = 85/178 (47%), Gaps = 22/178 (12%)

Query: 4 AASPIDLNPSTKADPA-KIDKVSRQLEGQFAQMLVKSMRDASGGDPMFPGQNQ-MFREMY 61
A S +L DPA I V+RQ+EG F QM++KSMRDA D +F ++ ++ MY
Sbjct: 15 AQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDGLFSSEHTRLYTSMY 74

Query: 62 DQQMAKALTDGKGLGLSAMISKQLSGDTGGPALNTSL--------------NTAEAAKAY 107
DQQ+A+ +T GKGLGL+ M+ KQ++ + P +T N A +
Sbjct: 75 DQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETVVRYQNQALSQLVQ 134

Query: 108 ALVAGKRDASLPLPARDGAATGVTTSSVAKAALGAGNLSGIGMSQVLDLIAGRTGAGE 165
V D SLP ++ A ++ A A SG+ +L A +G G+
Sbjct: 135 KAVPRNYDDSLPGDSKAFLA------QLSLPAQLASQQSGVPHHLILAQAALESGWGQ 186


42  XADLMG695_RS14135  XADLMG695_RS14180  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag           DNBias  CDNBias  %GCBias     Product
XADLMG695_RS14135  2       14       0.415208    DUF1349 domain-containing protein
XADLMG695_RS14140  0       19       -1.011731   phosphodiesterase
XADLMG695_RS14145  0       14       -2.045288   autotransporter-associated beta strand
XADLMG695_RS22505  4       16       0.019161    response regulator transcription factor
XADLMG695_RS14150  4       16       0.127310    HAMP domain-containing histidine kinase
XADLMG695_RS14155  4       16       0.316142    efflux transporter outer membrane subunit
XADLMG695_RS14170  4       15       0.716967    efflux RND transporter permease subunit
XADLMG695_RS14175  3       13       0.556144    efflux RND transporter periplasmic adaptor
XADLMG695_RS14180  4       13       0.942693    S8 family serine peptidase
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14185  HTHFIS        94     1e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 93.7 bits (233), Expect = 1e-24
Identities = 25/131 (19%), Positives = 52/131 (39%)

Query: 5 APVVYLIDDDASMRAALEDLFASVGLQVYAFGSTDQFLAHRLHEVPACLVLDIRMPGQSG 64
+ + DDDA++R L + G V + +V D+ MP ++
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 65 MEFHRRMVESGVALPTIFITGHGDIAMSVEAMKNGAIEFLTKPFRDQALLDAIQDGIRRD 124
+ R+ ++ LP + ++ +++A + GA ++L KPF L+ I +
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 125 RARRQSEAVAA 135
+ R +
Sbjct: 123 KRRPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14200  ACRIFLAVINRP  645    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 645 bits (1665), Expect = 0.0
Identities = 233/1034 (22%), Positives = 426/1034 (41%), Gaps = 43/1034 (4%)

Query: 11 QRRGIVWLVFVLIALYGTWSWTQLPVEAYPDIADVTSQVVTQVPGLGAEEVEQQITVPLE 70
+R W++ +++ + G + QLPV YP IA V PG A+ V+ +T +E
Sbjct: 7 RRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVIE 66

Query: 71 RALMGTPGLHVLRSRSLFA-LSLITLVFDDGTEGYFARQRVLERIQAVT--LPYGA-IPG 126
+ + G L + S S A ITL F GT+ A+ +V ++Q T LP G
Sbjct: 67 QNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQQG 126

Query: 127 LDPYTSPTGEIYRYTLES--KTRSLRELSDLQFWTVIPRLQKVPGVADVTNFGGLTTQFS 184
+ S + + S + ++SD V L ++ GV DV FG
Sbjct: 127 ISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA-QYAMR 185

Query: 185 LALEPDRLTRYGVSLQQVKSAITSNNAD------GGGSVMDRGEQSYVIRGIGLLHSLQD 238
+ L+ D L +Y ++ V + + N GG + + + I + ++
Sbjct: 186 IWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEE 245

Query: 239 IGNVVVSSS-NGVPVLVKDLGEVRYDNVERRGILGKDGNPDTIEGIALLLKDSNPSVALQ 297
G V + + +G V +KD+ V I +G P L +N +
Sbjct: 246 FGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKP-AAGLGIKLATGANALDTAK 304

Query: 298 GIHSAVEELNNSVLPKDVKVVPYLDRTALIDATLHTVSATLTEGMLLVCVVLLIFLGSPR 357
I + + EL P+ +KV+ D T + ++H V TL E ++LV +V+ +FL + R
Sbjct: 305 AIKAKLAELQPF-FPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMR 363

Query: 358 AAAIVSLTIPLSLLIAFIFMHHLKIPANLLSLG--AIDFGILVDGAVVLVENVLRLREEN 415
A I ++ +P+ LL F + N L++ + G+LVD A+V+VENV R+ E+
Sbjct: 364 ATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMED 423

Query: 416 SQRALTARDAIDATLQVARPIFFGMAVIGCAYLPLLAFERIEYKLFSPMAYAVGAALIGA 475
A + Q+ + V+ ++P+ F ++ + + +A+ +
Sbjct: 424 KLPPKEA--TEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 476 LLVALTLIPGLAWLAFRKPRKMLH-----------NRALETLGQRYRAVLERSVGRRGWL 524
+LVAL L P L KP H N + Y + + +G G
Sbjct: 482 VLVALILTPALCAT-LLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 525 LACAALALCVLAVLGGSIGRDFLPYIDEGSLWLQVQMPPGITLDKAATMANALRKATL-- 582
L AL + + VL + FLP D+G +Q+P G T ++ + + + L
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 583 EFPEVSYVVTQTGRNDDGTDYWTPSHTEASVGLRPYKDWP-AGMDKQALIAALGARYAQM 641
E V V T G + G + A V L+P+++ +A+I ++
Sbjct: 601 EKANVESVFTVNGFSFSGQ---AQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKI 657

Query: 642 PGYTVSMMQPMIDGVQDKLSGAHSDLTVKVFGDDLQQVRGVADQVAAALHKVPGA-ADIA 700
V +G +L + G + +Q+ + P + +
Sbjct: 658 RDGFVIPFNMPAIVELGTATGFDFEL-IDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 701 VDVEPPLPNLQVRFDRAAAARYGINAADVSDLISTGIGGSPIGQMYLGEKSYDLTVRFPQ 760
+ ++ D+ A G++ +D++ IST +GG+ + + L V+
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADA 776

Query: 761 RYRNDPQAIGALRLRTAAGAEIPLSAVASITTTSGRSVIVREMGRRNIIVRLNVRGRDLS 820
++R P+ + L +R+A G +P SA + G + R G ++ ++
Sbjct: 777 KFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAP---G 833

Query: 821 SFLSDAQATLARQVRVDPQHMQLVWGGQFENLQRAQARLLVVLPTTLCIMFVLLFGAFGN 880
+ DA A + P + W G + + + ++ + ++F+ L + +
Sbjct: 834 TSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYES 893

Query: 881 LRQPTLVLAAVPLAMIGGLAALHLRGMTLNVSSAVGFVALFGVAVLNAVLMLAQIHRLRH 940
P V+ VPL ++G L A L +V VG + G++ NA+L++ L
Sbjct: 894 WSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLME 953

Query: 941 DVGMPLREAVVAGAVSRMRPVLMTATVAALGLAPAMLATGLGSDVQRPLATVVVGGLVTA 1000
G + EA + R+RP+LMT+ LG+ P ++ G GS Q + V+GG+V+A
Sbjct: 954 KEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSA 1013

Query: 1001 TVLTLVLLPSLYYL 1014
T+L + +P + +
Sbjct: 1014 TLLAIFFVPVFFVV 1027



Score = 92.6 bits (230), Expect = 4e-21
Identities = 67/344 (19%), Positives = 137/344 (39%), Gaps = 15/344 (4%)

Query: 682 VADQVAAALHKVPGAADIAVDVEPPLPNLQVRFDRAAAARYGINAADVSDLISTG----I 737
VA V L ++ G D+ + +++ D +Y + DV + +
Sbjct: 158 VASNVKDTLSRLNGVGDVQLFGAQYA--MRIWLDADLLNKYKLTPVDVINQLKVQNDQIA 215

Query: 738 GGSPIGQMYLGEKSYDLTVRFPQRYRNDPQAIGALRLRTAA-GAEIPLSAVASITTTS-G 795
G G L + + ++ R++N P+ G + LR + G+ + L VA +
Sbjct: 216 AGQLGGTPALPGQQLNASIIAQTRFKN-PEEFGKVTLRVNSDGSVVRLKDVARVELGGEN 274

Query: 796 RSVIVREMGRRNIIVRLNVR-GRDLSSFLSDAQATLARQVRVDPQHMQLVWGGQFENLQR 854
+VI R G+ + + + G + +A LA PQ M++++ +
Sbjct: 275 YNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQ 334

Query: 855 AQARLLV---VLPTTLCIMFVLLFGAFGNLRQPTLVLAAVPLAMIGGLAALHLRGMTLNV 911
+V L + + LF N+R + AVP+ ++G A L G ++N
Sbjct: 335 LSIHEVVKTLFEAIMLVFLVMYLF--LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINT 392

Query: 912 SSAVGFVALFGVAVLNAVLMLAQIHRLRHDVGMPLREAVVAGAVSRMRPVLMTATVAALG 971
+ G V G+ V +A++++ + R+ + +P +EA ++ A V +
Sbjct: 393 LTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAV 452

Query: 972 LAPAMLATGLGSDVQRPLATVVVGGLVTATVLTLVLLPSLYYLM 1015
P G + R + +V + + ++ L+L P+L +
Sbjct: 453 FIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATL 496


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14205  RTXTOXIND     55     1e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 55.2 bits (133), Expect = 1e-10
Identities = 38/238 (15%), Positives = 76/238 (31%), Gaps = 52/238 (21%)

Query: 115 AELANAYSEAGKARATLEQARLELARQKTLAADSISAARDLQAAQQAFDSAGNDARAASD 174
AE + + + L +L A + + + A N+ R
Sbjct: 214 AERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKS 273

Query: 175 RLAQLGVAAQASSHR--------------------------------------RYVLRAP 196
+L Q+ ++ V+RAP
Sbjct: 274 QLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAP 333

Query: 197 IAGRVVDLSA-ALGGFWNDTSASLMTVADISQVWLTASVPEREVGQVFEGQPVTASLDAY 255
++ +V L GG ++ V + + +TA V +++G + GQ ++A+
Sbjct: 334 VSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF 393

Query: 256 PGQRF---VGHVQHV--DDLLDPAT-------RTLKVRVALTNRDGL-LKPGMFARAQ 300
P R+ VG V+++ D + D +++ T + L GM A+
Sbjct: 394 PYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAE 451



Score = 36.3 bits (84), Expect = 2e-04
Identities = 26/132 (19%), Positives = 46/132 (34%), Gaps = 9/132 (6%)

Query: 80 RLVRVVPPLAGRVVALPKTLGDTVHAGDVLCVLDSAELANAYSEAGKARATLEQARLELA 139
R + P V + G++V GDVL L + A ++ K +++L QARLE
Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTA---LGAEADTLKTQSSLLQARLEQT 151

Query: 140 RQKTLAADSISAARDLQAAQQAFDSAGNDARAASDRLAQLGVAAQASSHRR---YVLRAP 196
R S S + + D + + L + + S + Y
Sbjct: 152 R---YQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELN 208

Query: 197 IAGRVVDLSAAL 208
+ + + L
Sbjct: 209 LDKKRAERLTVL 220


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14210  SUBTILISIN    121    5e-32    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 121 bits (304), Expect = 5e-32
Identities = 72/325 (22%), Positives = 117/325 (36%), Gaps = 43/325 (13%)

Query: 78 NADLAQQAGARGQGVKLAVLDDNLVPSYAPISGKVDSFNDYTASPGTPESSANALRGHGT 137
A G+GVK+AVLD + + ++ ++T GHGT
Sbjct: 30 QAPAVWNQTR-GRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGHGT 88

Query: 138 IVSALVLGSAQDGFAGGVAPDADLFYARICAENSCGTQQTRRAAVDLAAA-GVRIANLSI 196
V+ + + + GVAP+ADL ++ + G + A V I ++S+
Sbjct: 89 HVAGTIAATENENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSL 148

Query: 197 GASYPDATASANAALAWKYALTPLVQADALIVASTGNEGAAEAS-----YPAATPVQEAS 251
G A K A V + L++ + GNEG + YP
Sbjct: 149 GGPEDVPELHE----AVKKA----VASQILVMCAAGNEGDGDDRTDELGYPGCYN----- 195

Query: 252 VRNNWLAVGAINIDSAGNAAGLTSYSNHCGAAAQWCLVAPGTYTAPALAGTELGGQIAGT 311
++VGAIN D + +SN + LVAPG + G + +GT
Sbjct: 196 ---EVISVGAINFDR-----HASEFSNSN---NEVDLVAPGEDILSTVPGGKY-ATFSGT 243

Query: 312 SFSTAAVSGVAAQVLGVYPW-----MSASNLQQTLLTTATDLGDPGVDALYGWGLVNAAK 366
S +T V+G A + + ++ L L+ LG + G GL+
Sbjct: 244 SMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLG--NSPKMEGNGLLYLTA 301

Query: 367 AIKGPGQFASNWAANVTAGYDSTFS 391
+ + + AG ST S
Sbjct: 302 V----EELSRIFDTQRVAGILSTAS 322


43  XADLMG695_RS14235  XADLMG695_RS14290  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag           DNBias  CDNBias  %GCBias     Product
XADLMG695_RS14235  1       27       -3.064492   alpha/beta fold hydrolase
XADLMG695_RS14240  -2      26       -2.191872   hypothetical protein
XADLMG695_RS14245  -1      27       -2.326202   AraC family transcriptional regulator
XADLMG695_RS14250  -2      32       -2.724206   SDR family oxidoreductase
XADLMG695_RS14255  -2      31       -2.850812   nuclear transport factor 2 family protein
XADLMG695_RS14260  -2      28       -2.962888   hypothetical protein
XADLMG695_RS14265  -1      30       -3.289833   type VI secretion system contractile sheath
XADLMG695_RS14270  -2      32       -4.624610   type VI secretion system contractile sheath
XADLMG695_RS14280  -2      26       -4.880724   type VI secretion system tube protein Hcp
XADLMG695_RS14285  -2      16       -4.193414   ImpE protein
XADLMG695_RS14290  -1      13       -4.472450   type VI secretion system baseplate subunit TssE
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14290  DHBDHDRGNASE  84     1e-21    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 83.9 bits (207), Expect = 1e-21
Identities = 65/257 (25%), Positives = 109/257 (42%), Gaps = 19/257 (7%)

Query: 3 VIVITGGSRGIGAGTALECAKRGMGVILTYQSQAEAAAAVVEEIKAKGGRAVALGLDVGD 62
+ ITG ++GIG A A +G + E VV +KA+ A A DV D
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHI-AAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 63 VATFGEFKDAVARQLEDGWQVKKLSGLVNNAGHGLFNAIETVTEQQFDALCDVHLKGPFF 122
A E + R++ + LVN AG I ++++++++A V+ G F
Sbjct: 69 SAAIDEITARIEREMG------PIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 123 LTQALLPLF--ERGASIVNLTSATTRSATAGVAPYAACKGGLEVLTRYMAKEFGERGIRI 180
++++ R SIV + S +A YA+ K + T+ + E E IR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 181 NAVSPGAIRTELGGGM---DEAFEAVLSSQTA-------LGRIGEPEDVAHVIAMLLSQD 230
N VSPG+ T++ + + E V+ L ++ +P D+A + L+S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 231 GQWINGQSIDVSGGYNL 247
I ++ V GG L
Sbjct: 243 AGHITMHNLCVDGGATL 259


44  XADLMG695_RS14365  XADLMG695_RS23340  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag           DNBias  CDNBias  %GCBias     Product
XADLMG695_RS14365  -2      11       3.496508    protein kinase
XADLMG695_RS14370  -1      12       4.015991    hypothetical protein
XADLMG695_RS14375  -1      12       3.842912    type VI secretion system protein TssA
XADLMG695_RS14380  0       10       3.846241    LysR family transcriptional regulator
XADLMG695_RS14385  0       10       4.126831    two-component system VirA-like sensor kinase
XADLMG695_RS14390  -1      10       4.011896    c-type cytochrome
XADLMG695_RS14395  1       10       3.529158    hypothetical protein
XADLMG695_RS14400  1       10       2.842206    hypothetical protein
XADLMG695_RS14405  0       12       2.376478    hypothetical protein
XADLMG695_RS14410  0       16       1.020205    hypothetical protein
XADLMG695_RS14420  0       17       0.402936    zonular occludens toxin
XADLMG695_RS14425  0       19       -0.183971   DUF2523 domain-containing protein
XADLMG695_RS14430  -1      23       -0.951880   hypothetical protein
XADLMG695_RS14435  -1      42       -2.097478   hypothetical protein
XADLMG695_RS14440  2       45       -2.280825   hypothetical protein
XADLMG695_RS14445  4       38       -3.905492   hypothetical protein
XADLMG695_RS14455  2       38       -4.149270   hypothetical protein
XADLMG695_RS23330  2       37       -4.034823   hypothetical protein
XADLMG695_RS14460  2       34       -3.602240   DUF3693 domain-containing protein
XADLMG695_RS22520  1       27       -4.581667   helix-turn-helix transcriptional regulator
XADLMG695_RS14465  1       31       -6.052854   YeeE/YedE family protein
XADLMG695_RS14470  0       34       -6.233328   transporter
XADLMG695_RS23335  1       34       -5.922831   TIGR01244 family phosphatase
XADLMG695_RS14480  3       38       -7.638347   sulfite exporter TauE/SafE family protein
XADLMG695_RS14485  2       36       -7.498331   SDR family oxidoreductase
XADLMG695_RS14490  0       34       -5.910652   GMC family oxidoreductase
XADLMG695_RS23340  0       30       -3.325912   alpha/beta hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14430  HTHFIS        34     0.002    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.4 bits (79), Expect = 0.002
Identities = 27/121 (22%), Positives = 48/121 (39%), Gaps = 4/121 (3%)

Query: 688 ASLLLLCDDAAELDRLEEMLAALGHEPVGMLELPAAVAMATADPMRFDGVLLK-RDRAGD 746
A++L+ DDAA L + L+ G++ A D V+ +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD--GDLVVTDVVMPDEN 61

Query: 747 AERAIGALHAAAPTLPLILATRATSLATR-KGLGGAITEIIAQPFDLGALAMALERALSR 805
A + + A P LP+++ + + T K + + +PFDL L + RAL+
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 806 R 806

Sbjct: 122 P 122


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS22520  adhesinmafb   27     0.029    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 26.6 bits (58), Expect = 0.029
Identities = 7/26 (26%), Positives = 13/26 (50%)

Query: 67 VPSPPHRSNAANVIYLRDVIQRRHEE 92
+ P ++ A ++ D QR+H E
Sbjct: 21 LIQPALAADLAQDPFITDNAQRQHYE 46


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14530  DHBDHDRGNASE  115    3e-33    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 115 bits (290), Expect = 3e-33
Identities = 76/263 (28%), Positives = 111/263 (42%), Gaps = 7/263 (2%)

Query: 4 GIKQRIALISGGDSGMGKETARQLLEAGVRVAITDLPNGTLDQAVAELSGLGEII-AIEG 62
GI+ +IA I+G G+G+ AR L G +A D L++ V+ L A
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 63 DVTQEQDVTRIWTQVRAQLGEPDIYVNAAGVTGATGDFLEVSDAGWLETLDINLMGAVRM 122
DV + I ++ ++G DI VN AGV G +SD W T +N G
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVL-RPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 123 CRQAIPAMRRKQWGRIVLFASEDAVQPYVDELAYCASKAGILSLAKGLSKAYGADNVLVN 182
R M ++ G IV S A P AY +SKA + K L N+ N
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 183 TVSPAFIATPMTDKMMQKRAQENGTSVEEAIASFLDEERPGMALKRRGRPEEVASVVAFL 242
VSP T+ MQ + E+ I L+ + G+ LK+ +P ++A V FL
Sbjct: 184 IVSPG-----STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFL 238

Query: 243 CSERASFINGAGVRVDSGSVFTI 265
S +A I + VD G+ +
Sbjct: 239 VSGQAGHITMHNLCVDGGATLGV 261


45  XADLMG695_RS15050  XADLMG695_RS15160  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag           DNBias  CDNBias  %GCBias     Product
XADLMG695_RS15050  -1      16       3.301968    hypothetical protein
XADLMG695_RS23560  0       15       3.002695    glutamine-hydrolyzing GMP synthase
XADLMG695_RS15060  0       16       3.094305    IMP dehydrogenase
XADLMG695_RS15065  1       16       2.963056    bifunctional methylenetetrahydrofolate
XADLMG695_RS15070  1       16       2.606218    DUF1244 domain-containing protein
XADLMG695_RS15075  1       18       1.974707    cation:proton antiporter
XADLMG695_RS15080  2       16       -0.084653   UTP--glucose-1-phosphate uridylyltransferase
XADLMG695_RS15085  3       13       -0.185487   polysaccharide biosynthesis protein
XADLMG695_RS15090  3       10       -0.381749   glycosyltransferase family 4 protein
XADLMG695_RS15100  1       7        -0.863485   lipopolysaccharide assembly protein LapB
XADLMG695_RS15105  1       8        -0.928729   LapA family protein
XADLMG695_RS15110  1       8        -1.062760   integration host factor subunit beta
XADLMG695_RS15115  -1      13       -1.038226   30S ribosomal protein S1
XADLMG695_RS15120  -2      12       -1.646757   (d)CMP kinase
XADLMG695_RS15125  -2      13       -1.644472   50S ribosomal protein L36
XADLMG695_RS15135  1       15       -2.418933   hypothetical protein
XADLMG695_RS15140  2       15       -2.349317   agmatine deiminase family protein
XADLMG695_RS15145  4       17       -2.795039   carbon-nitrogen hydrolase
XADLMG695_RS15150  6       21       -2.666809   N-acetyltransferase
XADLMG695_RS15160  2       16       -0.771523   TraB/GumN family protein
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS15110  HTHFIS        31     0.008    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.3 bits (71), Expect = 0.008
Identities = 16/77 (20%), Positives = 29/77 (37%), Gaps = 8/77 (10%)

Query: 219 VGAAVGVGGDTEQRIELLAAAGVDVVIVDTAHGHSQGVIDRVAWVKKTYPQLQVIGGNIV 278
G V + + +AA D+V+ D D + +KK P L V+ ++
Sbjct: 26 AGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA-FDLLPRIKKARPDLPVL---VM 81

Query: 279 TG----DAALALMDAGA 291
+ A+ + GA
Sbjct: 82 SAQNTFMTAIKASEKGA 98


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS15135  NUCEPIMERASE  81     7e-19    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 81.4 bits (201), Expect = 7e-19
Identities = 61/342 (17%), Positives = 113/342 (33%), Gaps = 71/342 (20%)

Query: 286 TVMVTGAGGSIGSEVCRQCARHGARRI----------VLLEIDELALLTIDSDLRRLFPD 335
+VTGA G IG V ++ G + + V L+ L LL P
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLA--------QPG 53

Query: 336 IEVVRVLGDCGDPAVVAHALNTATPDAVFHAAAYKQVPLLEEQLREAVRNNVLATENVAR 395
+ + D D + + + VF + V E +N+ N+
Sbjct: 54 FQFHK--IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILE 111

Query: 396 ACQRARIETFVFIST---------------DKAVEPVNVLGASKRYAEMICQSLDA-RDA 439
C+ +I+ ++ S+ D PV++ A+K+ E++ +
Sbjct: 112 GCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL 171

Query: 440 PTRFITVRFGNVLDSAGS---VVPLFREQIRQGGPVTV-THPDVTRYFMTIPEACQLVIQ 495
P +RF V G + F + + +G + V + + R F I + + +I+
Sbjct: 172 PA--TGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIR 229

Query: 496 A------------------AASASHGAIYTLDMGEPVPIRLLAEQMIRLAGKQPGKDVAI 537
AAS + +Y + PV + I+ G +
Sbjct: 230 LQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVEL----MDYIQALEDALGIEAKK 285

Query: 538 LYTGLRPGEKLHE----TLFYSDEDYRPTAHPKILEAGVREF 575
L+PG+ L Y + P ++ GV+ F
Sbjct: 286 NMLPLQPGDVLETSADTKALYEVIGFTPETT---VKDGVKNF 324


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS15155  DNABINDINGHU  117    5e-38    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 117 bits (294), Expect = 5e-38
Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 1/89 (1%)

Query: 2 TKSELIEILARRQAHLKSDDVDLAVKSLLEMMGQALSDGDRIEIRGFGSFSLHYRPPRLG 61
K +LI +A L D AV ++ + L+ G+++++ GFG+F + R R G
Sbjct: 3 NKQDLIAKVAE-ATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKG 61

Query: 62 RNPKTGESVALPGKHVPHFKPGKELRERV 90
RNP+TGE + + VP FK GK L++ V
Sbjct: 62 RNPQTGEEIKIKASKVPAFKAGKALKDAV 90


46  XADLMG695_RS15700  XADLMG695_RS15805  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag           DNBias  CDNBias  %GCBias     Product
XADLMG695_RS15700  -1      9        3.112287    formylglycine-generating enzyme family protein
XADLMG695_RS15705  -1      9        2.954014    ribonuclease D
XADLMG695_RS15710  0       9        1.879756    DNA ligase D
XADLMG695_RS15715  0       9        1.949151    DUF3606 domain-containing protein
XADLMG695_RS15720  0       8        1.280223    H-NS histone family protein
XADLMG695_RS15725  1       8        0.379276    *integrase arm-type DNA-binding domain-containing
XADLMG695_RS15735  4       14       -3.544470   hypothetical protein
XADLMG695_RS15740  6       20       -4.728901   hypothetical protein
XADLMG695_RS15745  10      37       -8.404555   helix-turn-helix transcriptional regulator
XADLMG695_RS15750  9       39       -8.046295   ParB N-terminal domain-containing protein
XADLMG695_RS15760  9       43       -8.089862   AlpA family phage regulatory protein
XADLMG695_RS23350  5       38       -6.478919   diguanylate cyclase
XADLMG695_RS22575  0       27       -2.622200   purine-binding chemotaxis protein CheW
XADLMG695_RS22580  -1      16       -0.967798   IS3 family transposase
XADLMG695_RS15765  0       19       -1.030970   recombinase family protein
XADLMG695_RS15770  0       19       -0.830262   Tn3 family transposase
XADLMG695_RS15780  1       19       -0.842340   histidine--tRNA ligase
XADLMG695_RS15790  2       15       -2.396412   helix-turn-helix domain-containing protein
XADLMG695_RS15795  2       16       -2.556929   murein L,D-transpeptidase catalytic domain
XADLMG695_RS15805  2       13       -1.982524   threonine synthase
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS15750  TCRTETOQM     27     0.022    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 27.1 bits (60), Expect = 0.022
Identities = 11/37 (29%), Positives = 18/37 (48%)

Query: 12 AQAKAKLLDELQKLEEQEKTERASEASSAHATIVSLL 48
Q + LLD L ++ + + R S+ H I+S L
Sbjct: 355 PQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFL 391


47  XADLMG695_RS16430  XADLMG695_RS16500  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag           DNBias  CDNBias  %GCBias     Product
XADLMG695_RS16430  2       12       0.578451    class I SAM-dependent methyltransferase
XADLMG695_RS16435  3       14       0.715014    chain-length determining protein
XADLMG695_RS16440  4       13       1.020406    hypothetical protein
XADLMG695_RS16445  4       12       1.249814    glycosyltransferase
XADLMG695_RS16450  4       12       1.513800    O-antigen translocase
XADLMG695_RS16455  4       12       1.936611    DegT/DnrJ/EryC1/StrS family aminotransferase
XADLMG695_RS16460  2       11       2.357205    GNAT family N-acetyltransferase
XADLMG695_RS16465  2       12       1.558085    hypothetical protein
XADLMG695_RS16470  2       15       0.913871    acetyltransferase
XADLMG695_RS16475  2       15       0.244182    WxcM-like domain-containing protein
XADLMG695_RS16480  2       16       -0.394646   cytochrome b
XADLMG695_RS16485  3       17       -0.711326   hypothetical protein
XADLMG695_RS16490  3       17       -1.102938   c-type cytochrome
XADLMG695_RS16495  2       18       -1.202183   hypothetical protein
XADLMG695_RS16500  2       15       -1.195424   sigma-70 family RNA polymerase sigma factor
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS23370  BCTERIALGSPF  29     0.025    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 29.0 bits (65), Expect = 0.025
Identities = 15/74 (20%), Positives = 26/74 (35%), Gaps = 13/74 (17%)

Query: 223 KQVSFFGAPLPALVAPDGDLGDTLGTWHLNAAWALLALVLLHIGAAL----------WHH 272
+Q LP + D + T+ W LLAL+ + + +H
Sbjct: 200 EQFIHMKQALPLSTRVLMGMSDAVRTF---GPWMLLALLAGFMAFRVMLRQEKRRVSFHR 256

Query: 273 LVLRDGLLRRVLPG 286
+L L+ R+ G
Sbjct: 257 RLLHLPLIGRIARG 270


48  XADLMG695_RS16790  XADLMG695_RS16900  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag           DNBias  CDNBias  %GCBias     Product
XADLMG695_RS16790  3       10       1.119713    iron-sulfur cluster assembly accessory protein
XADLMG695_RS16795  4       10       1.053866    asparagine--tRNA ligase
XADLMG695_RS16800  5       11       0.647091    hypothetical protein
XADLMG695_RS16810  6       12       -0.354526   hypothetical protein
XADLMG695_RS16815  6       15       -0.511890   hypothetical protein
XADLMG695_RS16820  5       21       -1.952077   FMN-binding negative transcriptional regulator
XADLMG695_RS16825  4       18       -2.080042   response regulator transcription factor
XADLMG695_RS16830  4       17       -1.432585   hypothetical protein
XADLMG695_RS16835  0       14       -1.286690   hypothetical protein
XADLMG695_RS16840  -1      13       -1.511086   hypothetical protein
XADLMG695_RS16845  0       18       -2.560243   hypothetical protein
XADLMG695_RS16850  2       18       -1.972141   DUF2589 domain-containing protein
XADLMG695_RS16855  2       19       -0.922560   DUF2589 domain-containing protein
XADLMG695_RS16860  2       21       -1.397958   N-acetylmuramidase
XADLMG695_RS16870  1       25       -1.576892   carbonate dehydratase
XADLMG695_RS16875  1       26       -2.158154   3-hydroxyanthranilate 3,4-dioxygenase
XADLMG695_RS16885  3       22       -2.429614   FUSC family protein
XADLMG695_RS16890  4       19       -3.598808   kynureninase
XADLMG695_RS16895  1       15       -3.282418   FAD-dependent monooxygenase
XADLMG695_RS16900  2       16       -3.164474   exodeoxyribonuclease I
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS16875  HTHFIS        50     2e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.8 bits (119), Expect = 2e-09
Identities = 33/164 (20%), Positives = 58/164 (35%), Gaps = 20/164 (12%)

Query: 1 MMKTRIVVAADRTILVEGMVALLQKVPGIEVVGHAEDGLACLQIAAREQPDIVLVDVLLP 60
M I+VA D + + L + G +V + + A D+V+ DV++P
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRA-GYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 GLNGIDLTRRLMQRSPNSRAICIAPSDACTQASAVFEAGAKAYLARTSRFAELLRAIQCV 120
N DL R+ + P+ + ++ + A E GA YL + EL+ I
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 121 IQDQTY-----------------ISPQMSRSLIAGLRRAAKADS 147
+ + S M + + L R + D
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAM-QEIYRVLARLMQTDL 161


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS16925  BCTERIALGSPF  31     0.002    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 31.3 bits (71), Expect = 0.002
Identities = 11/40 (27%), Positives = 16/40 (40%), Gaps = 1/40 (2%)

Query: 134 PLKNIETDFPPVFDRFYRSLALRTCSQCGHLHPAPERYAT 173
L + FP F+R Y ++ + GHL R A
Sbjct: 119 SLADAMKCFPGSFERLYCAM-VAAGETSGHLDAVLNRLAD 157


49  XADLMG695_RS17345  XADLMG695_RS17440  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag           DNBias  CDNBias  %GCBias     Product
XADLMG695_RS17345  2       22       1.605915    outer membrane protein assembly factor BamE
XADLMG695_RS17350  3       24       1.439503    RnfH family protein
XADLMG695_RS17355  2       21       2.526320    type II toxin-antitoxin system RatA family
XADLMG695_RS17360  1       21       2.477196    SsrA-binding protein SmpB
XADLMG695_RS17365  1       20       1.673459    site-specific integrase
XADLMG695_RS17370  -3      25       -0.454429   hypothetical protein
XADLMG695_RS17375  -1      25       -0.222309   hypothetical protein
XADLMG695_RS17380  0       28       -0.483297   hypothetical protein
XADLMG695_RS17385  1       31       -1.495148   hypothetical protein
XADLMG695_RS17390  2       36       -2.896483   helix-turn-helix transcriptional regulator
XADLMG695_RS17395  3       39       -2.911228   MobA/MobL family protein
XADLMG695_RS17405  10      36       -4.929722   conjugal transfer protein TraD
XADLMG695_RS17410  10      39       -4.139982   hypothetical protein
XADLMG695_RS17415  9       38       -3.766868   hypothetical protein
XADLMG695_RS17420  7       31       -1.538269   hypothetical protein
XADLMG695_RS17425  7       31       -0.202647   hypothetical protein
XADLMG695_RS17430  7       31       -0.251062   GGDEF domain-containing protein
XADLMG695_RS17440  7       31       -0.399984   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS17450  PilS_PF08805  58     5e-12    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 58.0 bits (140), Expect = 5e-12
Identities = 42/161 (26%), Positives = 69/161 (42%), Gaps = 16/161 (9%)

Query: 56 GYTLVEVLLVLGVSSAMAAAGWLLFGPTSVAADVKQTQMDLSETANAIDRSLGIVGGYSG 115
G TL+EVLLV+GV +AA+ + L+ Q ++ + +SL G Y+
Sbjct: 27 GATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLTVIANM-KSLKFQGRYTD 85

Query: 116 --LSTSLVLSDGLAAQRLRQNDG--LRNAWGGSVSFWPNTVKRGNDSFLVETRDVPKAAC 171
+L L + + G +N WGGSV+ ++ SF V +VP+ C
Sbjct: 86 SNYIKTLYAQGLLPSDMIADTTGASAKNPWGGSVTITTSS---DKYSFNVVEANVPQKNC 142

Query: 172 AKLIAAMAGDPAVADARVNGESVYLDEKYDPASAAVACERD 212
++ A+ + A +++N S SAA C D
Sbjct: 143 MAMVNALRS--SSAISKINNTST------STVSAATVCASD 175


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS17475  STREPKINASE   29     0.004    Streptococcus streptokinase protein signature.
		>STREPKINASE#Streptococcus streptokinase protein signature.

Length = 440

Score = 28.5 bits (63), Expect = 0.004
Identities = 16/45 (35%), Positives = 21/45 (46%)

Query: 18 FWLSESAMPTREELATRLDALQEQLPKLSADEDADFDYLDFQARA 62
F AM + E A L A+QEQL D F+ +DF + A
Sbjct: 89 FATDSGAMSHKLEKADLLKAIQEQLIANVHSNDDYFEVIDFASDA 133


50  XADLMG695_RS17795  XADLMG695_RS17865  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag           DNBias  CDNBias  %GCBias     Product
XADLMG695_RS17795  2       17       2.055385    GGDEF domain-containing protein
XADLMG695_RS17800  -1      16       1.728345    UMP kinase
XADLMG695_RS17805  -2      20       1.134532    ribosome recycling factor
XADLMG695_RS17810  -2      20       0.194633    di-trans,poly-cis-decaprenylcistransferase
XADLMG695_RS17820  1       23       -0.602973   phosphatidate cytidylyltransferase
XADLMG695_RS17825  2       22       -0.852416   1-deoxy-D-xylulose-5-phosphate reductoisomerase
XADLMG695_RS17830  2       17       -0.099378   RIP metalloprotease RseP
XADLMG695_RS17835  2       13       0.829401    outer membrane protein assembly factor BamA
XADLMG695_RS17840  2       13       1.215830    hypothetical protein
XADLMG695_RS17845  0       16       0.208077    UDP-3-O-(3-hydroxymyristoyl)glucosamine
XADLMG695_RS17855  0       15       0.288988    3-hydroxyacyl-ACP dehydratase FabZ
XADLMG695_RS17860  2       16       -0.339739   acyl-ACP--UDP-N-acetylglucosamine
XADLMG695_RS17865  3       17       -0.860987   lipid-A-disaccharide synthase
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS17840  CARBMTKINASE  34     2e-04    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 34.4 bits (79), Expect = 2e-04
Identities = 17/70 (24%), Positives = 25/70 (35%), Gaps = 14/70 (20%)

Query: 113 DFIRRRAIRHL-EKGRIAIFAAGTGNPFFTTDSG-------------AALRAIEIGADLL 158
+ I+ L E+G I I + G G P D A E+ AD+
Sbjct: 172 GHVEAETIKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIF 231

Query: 159 LKATKVDGVY 168
+ T V+G
Sbjct: 232 MILTDVNGAA 241


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS17890  BCTERIALGSPF  29     0.022    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 29.0 bits (65), Expect = 0.022
Identities = 13/54 (24%), Positives = 22/54 (40%), Gaps = 1/54 (1%)

Query: 193 GRPRGINSEGLKR-RGFDAERITAIKRAYRTLYVAGLPLADAKAQLAEQAESSE 245
+ G L+R + + R TL A +PL +A +A+Q+E
Sbjct: 49 QQKSGSTGLSLRRKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPH 102


51XADLMG695_RS18885XADLMG695_RS18980Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
XADLMG695_RS18885214-0.552194hypothetical protein
XADLMG695_RS188900140.139896HAMP domain-containing histidine kinase
XADLMG695_RS18895-2120.432912response regulator transcription factor
XADLMG695_RS18900-2130.461985glycosyltransferase family 39 protein
XADLMG695_RS18905-2120.857363phosphatase PAP2 family protein
XADLMG695_RS18910-1121.193541phosphoethanolamine transferase
XADLMG695_RS189152111.342732M2 family metallopeptidase
XADLMG695_RS189202100.783959hypothetical protein
XADLMG695_RS189251100.020256hypothetical protein
XADLMG695_RS1893009-1.089628hypothetical protein
XADLMG695_RS18935111-1.879798hypothetical protein
XADLMG695_RS18940-112-3.846727hypothetical protein
XADLMG695_RS18945-112-4.987181hypothetical protein
XADLMG695_RS18950121-6.965244hypothetical protein
XADLMG695_RS18955241-10.893677hypothetical protein
XADLMG695_RS18965440-9.972268hypothetical protein
XADLMG695_RS18970338-9.942425hypothetical protein
XADLMG695_RS23415145-9.717283multidrug efflux MFS transporter
XADLMG695_RS18975033-6.884110peptidase domain-containing ABC transporter
XADLMG695_RS18980222-3.645946HlyD family efflux transporter periplasmic
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS18930  HTHFIS  83  1e-20  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.3 bits (206), Expect = 1e-20
Identities = 31/136 (22%), Positives = 57/136 (41%), Gaps = 4/136 (2%)

Query: 2 RLLVIEDNRNMVANLFDYFEARGHTLDAAPDGVTGLHLATTQQYDALILDWMMPRMDGPE 61
+LV +D+ + L G+ + + T D ++ D +MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLRRLREQHQSELPVIMLTARDELPDKIAGFRAGADDYLTKPFALPE---LEVRIEALLA 118
+L R+++ + +LPV++++A++ I GA DYL KPF L E + R A
Sbjct: 65 LLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 119 RAHGRRRGKLLQVADL 134
R + L
Sbjct: 124 RRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS18960  56KDTSANTIGN  29  0.003  Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 28.8 bits (64), Expect = 0.003
Identities = 20/61 (32%), Positives = 26/61 (42%), Gaps = 6/61 (9%)

Query: 36 QIFNQNMQQQISLSQQQAMNQVQMAAAAKCVAMIERTSECKNQQSIDQMVKDIEKLIKDM 95
+ Q QQQ QQQA Q A AA V ++ I Q+ KD+ KL +
Sbjct: 335 VMPPQAQQQQGQGQQQQAQATAQEAVAAAAVRLL------NGSDQIAQLYKDLVKLQRHA 388

Query: 96 G 96
G
Sbjct: 389 G 389


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS19005  TCRTETA  53  1e-09  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 52.9 bits (127), Expect = 1e-09
Identities = 83/340 (24%), Positives = 127/340 (37%), Gaps = 44/340 (12%)

Query: 47 IQQTISVYLLAYGLMSIAHGP----LSDAWGRKRVILGGLALFVAGSIGCALSQDLPTLL 102
+ + L Y LM A P LSD +GR+ V+L LA A + L L
Sbjct: 41 VTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLY 100

Query: 103 AFRALQGLSAGVGMIVGRAVIRDLFHGPDAQRLMSQVSMIFGIAPAIAPIIGGWILLSGA 162
R + G++ G + G A I D+ G + R +S FG P++GG L+ G
Sbjct: 101 IGRIVAGITGATGAVAG-AYIADITDGDERARHFGFMSACFGFGMVAGPVLGG--LMGGF 157

Query: 163 GWPLIFWFLVVFGLVLLIATLTWLPETHPVEARTPLQFKRLMQDYVRIGFNPRFQRLAAA 222
F+ + + LPE+H E R PL+ + L F A
Sbjct: 158 SPHAPFFAAAALNGLNFLTGCFLLPESHKGE-RRPLRREALNP---LASFRWARGMTVVA 213

Query: 223 GSFNFAGIFLYIASAPVLIMQHLKLGEGDFAWLFIPTIGGMTLGSF----------LSGR 272
I + P + + GE F W T G++L +F ++G
Sbjct: 214 ALMAVFFIMQLVGQVPAALW--VIFGEDRFHW--DATTIGISLAAFGILHSLAQAMITGP 269

Query: 273 MAGRMQPVRQIRIGFICCGVAALANLAYTFAVAQIALPWAVLPIFL----AGMGMALIFP 328
+A R+ R + +G I G + T W PI + G+GM +
Sbjct: 270 VAARLGERRALMLGMIADGTGYILLAFATRG-------WMAFPIMVLLASGGIGMPALQA 322

Query: 329 ILALAVLDMYPQQRGLASSLQAFTQLMTNTVVAGVLSPLL 368
+L+ V + +Q L SL A T L ++ PLL
Sbjct: 323 MLSRQVDE--ERQGQLQGSLAALTSL------TSIVGPLL 354


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS19015  RTXTOXIND  114  7e-30  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 114 bits (287), Expect = 7e-30
Identities = 69/428 (16%), Positives = 142/428 (33%), Gaps = 51/428 (11%)

Query: 65 RWCVGLLMTTVILLLVGFFRLGFARSE---TLYGTVVPAGGLIAVTTPQSGVVVQVGAAQ 121
R + + L++ F + E T G + +G + ++ +V ++ +
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKE 114

Query: 122 GQRVAAGQLLFVLSA-EHRDDRGRPTQRAAAVLAEQQRLAVEAMAQ-------------- 166
G+ V G +L L+A D + EQ R + + +
Sbjct: 115 GESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEP 174

Query: 167 ------------LRAQGRVQQQAAARALAGLRDRLQQIDAELD----LLRHWQQLTQSIE 210
L + + Q L + AE + ++ L++ +
Sbjct: 175 YFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEK 234

Query: 211 QR---YRTALTRGLVSQQFVDEKQADVLDQRAHTLELQRERMALADALAQAQAEVQQLPL 267
R + + L + +++ V E++ ++ + + + + A+ E Q +
Sbjct: 235 SRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQ 294

Query: 268 ----SVHQQLAMAGAGLQE-DRRAAIEQAAASRWEVRAPRAGRVA-LRPLQRGQAVAQGQ 321
+ +L + A + +RAP + +V L+ G V +
Sbjct: 295 LFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354

Query: 322 RLADLLPTSMATEVVLYAPSRAAGLIGPGMPVQLRFDALPYQHYGQFAGQVVEIAA-APE 380
L ++P EV ++ G I G ++ +A PY YG G+V I A E
Sbjct: 355 TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIE 414

Query: 381 PPRVDSTSASEPLYRVRVRLAGDAALRAGRTAVLRPGMRVQGTLALEWRRFSQWAFEPLS 440
R ++ V + + + + L GM V + R + PL
Sbjct: 415 DQR------LGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLE 468

Query: 441 -SLHGTLR 447
S+ +LR
Sbjct: 469 ESVTESLR 476


52  XADLMG695_RS19265  XADLMG695_RS19345  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
XADLMG695_RS19265127-5.892920hypothetical protein
XADLMG695_RS19270222-6.298995DEAD/DEAH box helicase family protein
XADLMG695_RS19275225-6.390556TetR/AcrR family transcriptional regulator
XADLMG695_RS23425224-6.577386SDR family oxidoreductase
XADLMG695_RS19280323-7.032013hypothetical protein
XADLMG695_RS19285323-6.309615hypothetical protein
XADLMG695_RS19290013-3.078010M1 family metallopeptidase
XADLMG695_RS19295118-3.677097adenylosuccinate synthase
XADLMG695_RS19300118-3.662470DUF2065 family protein
XADLMG695_RS19305-114-3.028630acyltransferase family protein
XADLMG695_RS19310-210-0.862420protease modulator HflC
XADLMG695_RS19315-210-1.612569FtsH protease activity modulator HflK
XADLMG695_RS19320-112-2.569528twitching motility response regulator PilH
XADLMG695_RS19325-210-1.937486DnaJ domain-containing protein
XADLMG695_RS19330-211-0.987199Hsp20/alpha crystallin family protein
XADLMG695_RS19335015-1.338857peroxiredoxin
XADLMG695_RS19340214-1.088572ferritin-like domain-containing protein
XADLMG695_RS19345214-1.088062penicillin-binding protein 1C
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS19310  HTHTETR  50  7e-10  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 50.0 bits (119), Expect = 7e-10
Identities = 18/123 (14%), Positives = 41/123 (33%), Gaps = 4/123 (3%)

Query: 6 SRARGRPRAFDAEQAVATAQRLFHASGYDALSVADLTAALGINPPSFYAAFGSKAGLYAR 65
++ + + + A RLF G + S+ ++ A G+ + Y F K+ L++
Sbjct: 5 TKQEAQETR---QHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 66 ILDR-YAQTGAIPLPQLLDADRPLADALADVLEHAARCYAADPAATGCLVLEGTRSNDAQ 124
I + + G + L L ++L H + + + +
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 125 ARE 127

Sbjct: 122 EMA 124


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS19315  DHBDHDRGNASE  76  1e-18  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 75.9 bits (186), Expect = 1e-18
Identities = 60/251 (23%), Positives = 108/251 (43%), Gaps = 24/251 (9%)

Query: 5 KNKSVLVLGGSRGIGAAIVRRFVAEGARVT-----FTYAGSAEAAQRLAGETGST--AVL 57
+ K + G ++GIG A+ R ++GA + ++ + A +
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 58 ADSADRDAVIATV-RRSGPLDVLVVNSGIALFGDALDQDPDA-VDRLFRINVHAPYHAAV 115
DSA D + A + R GP+D+LV +G+ G + D + F +N ++A+
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPG-LIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 116 EAARQMPS--GGRIIVIGSVNGDRMPLPGMASYALSKSALQGLARGLARDFGPRGITINV 173
++ M G I+ +GS N +P MA+YA SK+A + L + I N+
Sbjct: 126 SVSKYMMDRRSGSIVTVGS-NPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184

Query: 174 VQPGPIDTDA--------NPENGPMKDLMHSF---MAIKRHGRAEEVAGMVAWLAGPEAS 222
V PG +TD N +K + +F + +K+ + ++A V +L +A
Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244

Query: 223 FVTGAMHTIDG 233
+T +DG
Sbjct: 245 HITMHNLCVDG 255


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS19360  HTHFIS  83  1e-21  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.6 bits (204), Expect = 1e-21
Identities = 29/116 (25%), Positives = 56/116 (48%), Gaps = 2/116 (1%)

Query: 2 ARILIVDDSPSQLLGIQRIVEKLGHETITATDGAAGVEAAKESLPDLVLMDVVMPNLNGF 61
A IL+ DD + + + + + G++ ++ A DLV+ DVVMP+ N F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 QATRTLKREPTTQHIPVILVTTKDQDTDRMWGMRQGARAYITKPFSEDELLEVMER 117
+K+ +PV++++ ++ + +GA Y+ KPF EL+ ++ R
Sbjct: 64 DLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS19380  HELNAPAPROT  41  7e-07  Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 40.6 bits (95), Expect = 7e-07
Identities = 27/143 (18%), Positives = 59/143 (41%), Gaps = 2/143 (1%)

Query: 33 TESYHADREKVIELLNTALATEYVCTLRYYRHYFMAKGMLADAVKGEFLEHAQQEQEHAH 92
TE+ ++ V LNT L+ ++ + +R ++ KG + +F E E
Sbjct: 3 TENAKTNQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVD 62

Query: 93 KLAERIVQLGGEP-DLNPDTLTKRSHAEYKEGTDLRDMVKENLIAERIAIDSYREMIDFI 151
+AER++ +GG+P + S + T +MV+ + + + +I
Sbjct: 63 TIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYKQISSESKFVIGLA 122

Query: 152 GD-KDTTTKRILESILAQEEEHA 173
+ +D T + ++ + E+
Sbjct: 123 EENQDNATADLFVGLIEEVEKQV 145


53  XADLMG695_RS19555  XADLMG695_RS23470  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
XADLMG695_RS195552200.319659hypothetical protein
XADLMG695_RS19560326-1.262748DNA repair protein RadC
XADLMG695_RS19565429-1.895404hypothetical protein
XADLMG695_RS19570327-3.401506hypothetical protein
XADLMG695_RS19575330-3.286538hypothetical protein
XADLMG695_RS23435537-5.095691RNA-directed DNA polymerase
XADLMG695_RS23440336-5.853592ATP-binding protein
XADLMG695_RS23445434-5.198059helix-turn-helix domain-containing protein
XADLMG695_RS23565434-5.445091hypothetical protein
XADLMG695_RS19600753-7.963646hypothetical protein
XADLMG695_RS227951156-9.140684hypothetical protein
XADLMG695_RS228001155-8.983647hypothetical protein
XADLMG695_RS196101454-10.917942hypothetical protein
XADLMG695_RS234501559-12.180060hypothetical protein
XADLMG695_RS234551156-11.979594tyrosine-type recombinase/integrase
XADLMG695_RS196151151-10.728407*autotransporter domain-containing esterase
XADLMG695_RS196201039-9.091131sulfur carrier protein ThiS
XADLMG695_RS228051141-10.128629thiazole synthase
XADLMG695_RS228101244-11.528743tRNA (guanosine(46)-N7)-methyltransferase TrmB
XADLMG695_RS196251242-10.790526SLC13 family permease
XADLMG695_RS228151339-8.719135Rieske (2Fe-2S) protein
XADLMG695_RS196301236-7.894953hypothetical protein
XADLMG695_RS196351340-8.590026hypothetical protein
XADLMG695_RS196401446-10.523646fumarylacetoacetate hydrolase family protein
XADLMG695_RS196451347-9.240458large-conductance mechanosensitive channel
XADLMG695_RS196501455-12.744136M28 family peptidase
XADLMG695_RS196551349-12.178916LacI family DNA-binding transcriptional
XADLMG695_RS19660831-8.062256cytochrome P450
XADLMG695_RS23460831-8.273017TonB-dependent receptor
XADLMG695_RS23465521-4.717244glycoside hydrolase family 2
XADLMG695_RS23470415-2.643518glycoside hydrolase family 97 protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS19655  adhesinb  28  0.005  Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 28.3 bits (63), Expect = 0.005
Identities = 10/32 (31%), Positives = 16/32 (50%)

Query: 1 MKNARIALVVLTMALGLTACGGKPSSDNAKEA 32
MK R +++L +GL AC + SS +
Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSS 32


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS19680  OMPADOMAIN  32  0.008  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 31.8 bits (72), Expect = 0.008
Identities = 26/124 (20%), Positives = 51/124 (41%), Gaps = 10/124 (8%)

Query: 367 FGGFAGFGRM-DADFGNRNGSFKQDDTTLGGFFGWYTGPVWVNAQVSYGWLSYDVDREVQ 425
G G+ + D F N NG ++ G F G+ P +V ++ Y WL +
Sbjct: 30 TGAKLGWSQYHDTGFINNNGPTHENQLGAGAFGGYQVNP-YVGFEMGYDWLG-------R 81

Query: 426 LGPATRVHSGSPDGSNLTAALNAGYSLGEGNLKYGPVAGLTWQK-IKLDGYTESNDSATA 484
+ V +G+ + GY + + Y + G+ W+ K + Y +++D+ +
Sbjct: 82 MPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLGGMVWRADTKSNVYGKNHDTGVS 141

Query: 485 LGYA 488
+A
Sbjct: 142 PVFA 145


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS19690  HTHFIS  31  0.003  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.3 bits (71), Expect = 0.003
Identities = 26/162 (16%), Positives = 50/162 (30%), Gaps = 26/162 (16%)

Query: 105 TKLEVLGDERTLYPDVVQTLKAAEQLVADGFEVMVYTSDDPILAKRLEEIGCVAVMPLAA 164
+ V D+ + + Q L A G++V + ++ + G + V +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA------GYDVRITSNAATLWRWIAAGDGDLVVTDVVM 57

Query: 165 PIGSGLGIQNKYNLLEII--ENAKVPIIVDAGVGTASDAAIAMELGCDGVLMNTAIAGAR 222
P + LL I +P++V + T A A E GA
Sbjct: 58 PDENAFD------LLPRIKKARPDLPVLVMSAQNTFMTAIKASE------------KGAY 99

Query: 223 DPILMASAMRKAIEAGREAFLAGRIPRKRYASASSPVDGVIG 264
D + + + I A + + S ++G
Sbjct: 100 DYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVG 141


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS19725  MECHCHANNEL  146  2e-48  Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 146 bits (370), Expect = 2e-48
Identities = 75/136 (55%), Positives = 97/136 (71%), Gaps = 7/136 (5%)

Query: 1 MGMVSEFKQFAMRGNVIDLAVGVVIGAAFGKIVTALVEKIIMPPIGWAIGNVDFSRLAWV 60
M ++ EF++FAMRGNV+DLAVGV+IGAAFGKIV++LV IIMPP+G IG +DF + A
Sbjct: 1 MSIIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVT 60

Query: 61 LKPAGVDATGKEIPAVAIGYGDFINTVVQFLIIAFAIFLVVKLINRVTHRK--PDAPKGP 118
L+ A D IPAV + YG FI V FLI+AFAIF+ +KLIN++ +K P A P
Sbjct: 61 LRDAQGD-----IPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRKKEEPAAAPAP 115

Query: 119 SEEVLLLREIRDALKN 134
++E +LL EIRD LK
Sbjct: 116 TKEEVLLTEIRDLLKE 131


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS19755  PHPHLIPASEA1  31  0.015  Bacterial phospholipase A1 protein signature.
		>PHPHLIPASEA1#Bacterial phospholipase A1 protein signature.

Length = 289

Score = 31.5 bits (71), Expect = 0.015
Identities = 18/60 (30%), Positives = 24/60 (40%), Gaps = 10/60 (16%)

Query: 917 WNDDVGYLDASLSYDVNDHLTLYAQATNLTGESERRYAQWTNHYFDQNIFERRYYAGLRL 976
WN G + LSY + H+ LY Q + GES D N + R G+ L
Sbjct: 236 WNTGYGGAELGLSYPITKHVRLYTQVYSGYGES----------LIDYNFNQTRVGVGVML 285


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS19765  RTXTOXIND  30  0.025  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.2 bits (68), Expect = 0.025
Identities = 18/121 (14%), Positives = 40/121 (33%), Gaps = 8/121 (6%)

Query: 18 LIAAPAAAQSLRVQSPDARTQVEFTLRADG-VPSYRVL-YRNTLVLGDAPLGLDLGRGNK 75
+ ++ + + D + L + + VL N V L + + +
Sbjct: 223 INRYENLSRVEKSRLDDFSS-----LLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQ 277

Query: 76 LGRDMTLQSSTTELHDSRFTLPV-GKTRQARDHYRALRVQLTDTQHRKLGIELRAYDDGV 134
+ ++ +L F + K RQ D+ L ++L + R+ +RA
Sbjct: 278 IESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVK 337

Query: 135 A 135

Sbjct: 338 V 338


54  XADLMG695_RS20725  XADLMG695_RS20810  Y  N  N  Genomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
XADLMG695_RS20725216-3.065129hypothetical protein
XADLMG695_RS20730218-3.361624type II toxin-antitoxin system VapC family
XADLMG695_RS20735318-4.185026plasmid pRiA4b ORF-3 family protein
XADLMG695_RS20740220-1.610074hypothetical protein
XADLMG695_RS20745-115-0.293060orotidine-5'-phosphate decarboxylase
XADLMG695_RS20750-1130.4692765'-nucleotidase, lipoprotein e(P4) family
XADLMG695_RS207550141.531775YceH family protein
XADLMG695_RS234850152.543715radical SAM protein
XADLMG695_RS20760-1142.820610STM4012 family radical SAM protein
XADLMG695_RS207652153.217049STM4013/SEN3800 family hydrolase
XADLMG695_RS207703172.780576STM4014 family protein
XADLMG695_RS207753143.100404STM4015 family protein
XADLMG695_RS207803153.483331AAA family ATPase
XADLMG695_RS207855144.460458hypothetical protein
XADLMG695_RS207906144.913894hypothetical protein
XADLMG695_RS207955124.304943hypothetical protein
XADLMG695_RS208005144.855106hypothetical protein
XADLMG695_RS208055174.666779DUF4272 domain-containing protein
XADLMG695_RS208103123.033874DNA helicase Rep
55  XADLMG695_RS20890  XADLMG695_RS23575  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
XADLMG695_RS20890-2133.273989hypothetical protein
XADLMG695_RS20895-2203.755154YchJ family protein
XADLMG695_RS20900-2163.926805TSUP family transporter
XADLMG695_RS209050102.810400ATP-binding protein
XADLMG695_RS209153101.988350hypothetical protein
XADLMG695_RS209205140.852165DUF4194 domain-containing protein
XADLMG695_RS209254111.012367DUF3375 domain-containing protein
XADLMG695_RS209304100.653537GTP cyclohydrolase I FolE
XADLMG695_RS209355100.991548MarR family transcriptional regulator
XADLMG695_RS235753160.182094DUF1656 domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS20925  SECA  33  2e-04  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 32.9 bits (75), Expect = 2e-04
Identities = 10/16 (62%), Positives = 10/16 (62%)

Query: 8 DPCPCGRPANYAQCCG 23
DPCPCG Y QC G
Sbjct: 883 DPCPCGSGKKYKQCHG 898


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS20935  IGASERPTASE  34  0.004  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 34.3 bits (78), Expect = 0.004
Identities = 38/240 (15%), Positives = 74/240 (30%), Gaps = 32/240 (13%)

Query: 1116 RLKLPERARGDEPAVADAVDAAPSIEAAGEPAGVQGAVSADGMAI---DGAAVPTASPAT 1172
L PE + ++ + +I+A +V ++ I D A VP +PAT
Sbjct: 979 DLYNPEVEKRNQTVDTTNITTPNNIQAD------VPSVPSNNEEIARVDEAPVPPPAPAT 1032

Query: 1173 --DETLSAAPETQTGQRTPAV-----ATANKQNR------------NAKTAKTASSTRAA 1213
+ T + A ++ +T QNR N +T + A S
Sbjct: 1033 PSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSET 1092

Query: 1214 AQSQAMQTKKSTLRTPALASKAASDKRASGASAVPSAAASRSSVGKTTRSSKTPGKPIAA 1273
++Q +TK++ +K ++K + + ++ +
Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPT 1152

Query: 1274 TNASSATSGKGATAS----AAVKTAAAKPRATAASTGQPVRGAGKTSSKRAATTASPAKT 1329
N S TA A ++ + T ++T + T P
Sbjct: 1153 VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVN 1212


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS23575  INTIMIN  28  0.027  Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 27.7 bits (61), Expect = 0.027
Identities = 10/53 (18%), Positives = 16/53 (30%)

Query: 98 TATATATATATATATATATATATATARVCISNTMLKAAKQQSSKAAKQQSSKA 150
T A T T+T + +ARV +KA + +
Sbjct: 709 TEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNI 761


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS20950  PHPHTRNFRASE  32  0.008  Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 31.7 bits (72), Expect = 0.008
Identities = 37/224 (16%), Positives = 74/224 (33%), Gaps = 44/224 (19%)

Query: 48 AVLHERLQRQLDALRADELSRELPRTAQAYLAHWLAQGWLERRLPEGATEEEYELSRATT 107
A +H ++ ++S E+ + A ++ + ++ E+ A
Sbjct: 19 AFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTEASMGADKAEIFAAHL 78

Query: 108 QAI-------RFIAGLRESSSSATESRLSLVIQQLVQLAGQTEADPEL--RLAALRDERA 158
+ + +A E L V V + + + + R A +RD
Sbjct: 79 LVLDDPELVDGIKGKIENEQMNA-EYALKEVSDMFVSMFESMD-NEYMKERAADIRDVSK 136

Query: 159 RIDAEIERVASGRVAALDGKRALERARDLIHLSDELAEDFHRVRDDFEQLNRQFRERIID 218
R+ + V +G +A + + + ++++L D QLN+QF +
Sbjct: 137 RVLGHLIGVETGSLATIA--------EETVIIAEDLTPS------DTAQLNKQFVKGFAT 182

Query: 219 DEGAR-------------------GDVLEQLFDGVDVIADSEAG 243
D G R +V E++ G VI D G
Sbjct: 183 DIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEG 226


56  XADLMG695_RS21060  XADLMG695_RS21220  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
XADLMG695_RS21060229-4.683146hypothetical protein
XADLMG695_RS21065329-6.411831hypothetical protein
XADLMG695_RS21070636-7.967745hypothetical protein
XADLMG695_RS22925640-8.544375urea carboxylase
XADLMG695_RS23490744-9.920346allophanate hydrolase
XADLMG695_RS21085221-3.454474hypothetical protein
XADLMG695_RS23495016-0.886375hypothetical protein
XADLMG695_RS21090018-0.905248nuclear transport factor 2 family protein
XADLMG695_RS23500-118-1.405011TetR family transcriptional regulator
XADLMG695_RS22930-119-2.167916TetR/AcrR family transcriptional regulator
XADLMG695_RS21105-119-2.185153SDR family oxidoreductase
XADLMG695_RS21110129-4.505551hypothetical protein
XADLMG695_RS22935236-7.527557hypothetical protein
XADLMG695_RS21125231-6.800064hypothetical protein
XADLMG695_RS21130221-3.886744M61 family metallopeptidase
XADLMG695_RS23505017-1.915565nucleoside hydrolase
XADLMG695_RS21135118-2.020179HigA family addiction module antidote protein
XADLMG695_RS21140-120-2.386911AAA family ATPase
XADLMG695_RS21145-120-2.355249exodeoxyribonuclease V subunit beta
XADLMG695_RS21150-116-0.350684exodeoxyribonuclease V subunit gamma
XADLMG695_RS21160-3140.969633autotransporter domain-containing protein
XADLMG695_RS21165-4121.426388phage tail protein
XADLMG695_RS21170-1154.778166phage tail protein
XADLMG695_RS21180-2145.085083phage tail protein
XADLMG695_RS21190-2135.195188GNAT family N-acetyltransferase
XADLMG695_RS21195-2144.616667hypothetical protein
XADLMG695_RS22955-1144.513534ATP-binding cassette domain-containing protein
XADLMG695_RS21205-1144.586955ABC transporter permease
XADLMG695_RS212100133.610819outer membrane lipid asymmetry maintenance
XADLMG695_RS212151132.963987ABC transporter substrate-binding protein
XADLMG695_RS212204141.968060STAS domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS21140  HTHTETR  69  5e-17  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 69.3 bits (169), Expect = 5e-17
Identities = 35/190 (18%), Positives = 69/190 (36%), Gaps = 13/190 (6%)

Query: 7 DTQQKILATAEALIYQHGIHATGMDLLVKTSGVARKSIYRHFDNKDEVAAAALNARDVRW 66
+T+Q IL A L Q G+ +T + + K +GV R +IY HF +K ++ + +
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70

Query: 67 LAWFRQQCDK-----ADRPEARILRMFTVLKEWFQSEGYRGCAF--INTAGEVGDPDDPV 119
+ K ++ + + F GE+
Sbjct: 71 GELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130

Query: 120 RKIARHHKQKLLDYTLELTGQLGITQPDALARQLLLLMEGAIT---VSRVMGDE--DAAD 174
R + ++ L+ + + D + R+ ++M G I+ + + + D
Sbjct: 131 RNLCLESYDRIEQT-LKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKK 189

Query: 175 TARDIAQLLL 184
ARD +LL
Sbjct: 190 EARDYVAILL 199


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS21145  HTHTETR  57  2e-12  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 56.9 bits (137), Expect = 2e-12
Identities = 23/93 (24%), Positives = 44/93 (47%), Gaps = 3/93 (3%)

Query: 6 PLRADAQRNRERLLAAAEQVFLERGAEA-SMEDVAKRAGVGIGTLYRRFPTRESLFAAAY 64
+ +AQ R+ +L A ++F ++G + S+ ++AK AGV G +Y F + LF+ +
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 65 SGRFLSLAAASHARASSL--DALAALRAYLEDL 95
++ + D L+ LR L +
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHV 96


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS21150  DHBDHDRGNASE  104  3e-29  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 104 bits (261), Expect = 3e-29
Identities = 65/254 (25%), Positives = 111/254 (43%), Gaps = 12/254 (4%)

Query: 2 GKRFGGKVVVVTGGTDGIGLVTAKAFSAEGAQVY---ITGRRQDRLDAAVAEIGGGAVGV 58
K GK+ +TG GIG A+ +++GA + + +++ +++ A
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF 62

Query: 59 QGDVGVPEDMDRLYACIQQEHGRLDVVFANAGVSESAALGEIDIAHLERLLATNIKGTVF 118
DV +D + A I++E G +D++ AGV + + E + N G
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 119 TVQNALPLMAS--GGAVILAGSVAGSKGIGALSVYSATKAAIRSFARTWTSDLKRRGIRV 176
++ M G+++ GS +++ Y+++KAA F + +L IR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 177 NVMSPGMVHTPAMQTYLDANAGAE-------DAFKQMIPFGRLGDAEEIAEAVLFLASDA 229
N++SPG T + GAE + FK IP +L +IA+AVLFL S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 230 SSFIAGHELFIDGG 243
+ I H L +DGG
Sbjct: 243 AGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS21155  PRTACTNFAMLY  27  0.044  Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 26.9 bits (59), Expect = 0.044
Identities = 18/59 (30%), Positives = 25/59 (42%)

Query: 97 SEPGSERTAAGQSIPSQASELSGTWTNNGGDNLAPMVAHMQRLGTVSDAGAAGAGGTIT 155
S+PG RTA+G +I + G N L + G +SD G GT+T
Sbjct: 56 SDPGGVRTASGTTIKVSGRQAQGILLENPAAELQFRNGSVTSSGQLSDDGIRRFLGTVT 114


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS21205  PF05616  35  0.002  Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 34.7 bits (79), Expect = 0.002
Identities = 27/95 (28%), Positives = 37/95 (38%), Gaps = 10/95 (10%)

Query: 299 PTPEMSPLPSTAPHPDAFPLPPAGEGARRAGEGSPPTDFPSTAPDPDAFPLLPAGEGARR 358
P P+++P +A P+A PLP A P + P T P+P+ P L
Sbjct: 311 PRPDLTP--GSAEAPNAQPLPEVSP-AENPANNPAPNENPGTRPNPEPDPDLNPDANPDT 367

Query: 359 AGEGSAPTDLPSTAPDPGVFPLPPAGEGARRAGEG 393
G+ P T PD P P G + EG
Sbjct: 368 DGQ-------PGTRPDSPAVPDRPNGRHRKERKEG 395



Score = 30.9 bits (69), Expect = 0.029
Identities = 32/100 (32%), Positives = 37/100 (37%), Gaps = 15/100 (15%)

Query: 290 QGLPPPCDTP----TPEMSPLPSTAP--HPDAFPLPPAGEGARRAGE--------GSPPT 335
Q +P P TP P PLP +P +P P P G R E +P T
Sbjct: 308 QVIPRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDT 367

Query: 336 DF-PSTAPDPDAFPLLPAGEGARRAGEGSAPTDLPSTAPD 374
D P T PD A P P G + EG L PD
Sbjct: 368 DGQPGTRPDSPAVPDRPNGRHRKERKEGEDGGLLCKFFPD 407


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS21220  INTIMIN  41  4e-05  Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 41.2 bits (96), Expect = 4e-05
Identities = 71/400 (17%), Positives = 114/400 (28%), Gaps = 50/400 (12%)

Query: 433 NVSNVTGATVSDGQGLGTIVNDDAQPALSIDDVSVNEGNSGTTTATFTVSLSAASGQTVT 492
N SN T++ G +V+ + D S + T T TV + + V
Sbjct: 537 NSSNNVLLTITVLSN-GQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVP 595

Query: 493 VNYATADGTATAG-------SDYVARSGTLSFAPGVTAQGVAVTVNGDTAVEPNETFSVG 545
V++ GTA A S PG A T +A+ N V
Sbjct: 596 VSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVV-SAKTAEMTSALNANAVIFVD 654

Query: 546 LSGASNATIARATGTGTILNDDAVVTISPTSLPAATAGTAYSQTLTASGGTPGYSFVI-- 603
+ AS I A T + N +T + + + T T + G S
Sbjct: 655 QTKASITEIK-ADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTD 713

Query: 604 SAGTLPAGMTLNAAGVLSGTPTASGSFNFTV---TATDSGVPTSGSRAYTLTVAGANVTL 660
+ G +T G + S V T + G L
Sbjct: 714 TNGYAKVTLTSTTPGKSLVSARVSDV-AVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKL 772

Query: 661 PATTLPAGTAGQAYSSAITPATGGIAPYSYALIAGALPAGITLNSSSGTLTGTTTSVGSF 720
P L G A+GG Y++ A+ A + +S TL T+
Sbjct: 773 PTVWLQYGQVNL-------KASGGNGKYTWRSANPAI-ASVDASSGQVTLKEKGTTT--- 821

Query: 721 NFSVTATDSTSGTPSQGTRGYTLNIAAPTIALAPATVPTATRGTAYSQTLTAS------- 773
++ S + T + YT+ I + T +
Sbjct: 822 ---ISVISSDNQTAT-----YTIATPNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNE 873

Query: 774 --------GGTAAYTYAITSGALPAGITLASNGTLSGTAT 805
G Y Y +S + + + + SG A+
Sbjct: 874 LENVFKAWGAANKYEYYKSSQTIISWVQQTAQDAKSGVAS 913


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS21240  SACTRNSFRASE  43  1e-07  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 42.6 bits (100), Expect = 1e-07
Identities = 29/92 (31%), Positives = 46/92 (50%), Gaps = 3/92 (3%)

Query: 74 YRQQFADADFLIVQANGLSIGRLYLHRAAAHHTLV-DISLLPDWRGKGIGSHLIAHAQAC 132
Y ++ A FL N IGR+ + + L+ DI++ D+R KG+G+ L+ A
Sbjct: 59 YVEEEGKAAFLYYLENNC-IGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEW 117

Query: 133 ARDAG-CVLSLHVLHANPAARRLYARHEFVAG 163
A++ C L L N +A YA+H F+ G
Sbjct: 118 AKENHFCGLMLETQDINISACHFYAKHHFIIG 149


57  XADLMG695_RS00500  XADLMG695_RS00570  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
XADLMG695_RS00500016-0.812127type II secretion system secretin GspD
XADLMG695_RS005052162.862868hypothetical protein
XADLMG695_RS005101152.649336type II secretion system protein M
XADLMG695_RS005151142.060990PilN domain-containing protein
XADLMG695_RS005201141.038099general secretion pathway protein GspK
XADLMG695_RS005250150.140780type II secretion system protein J
XADLMG695_RS00530-3140.713050prepilin-type N-terminal cleavage/methylation
XADLMG695_RS005350152.268191GspH/FimT family pseudopilin
XADLMG695_RS005400152.024801type II secretion system major pseudopilin GspG
XADLMG695_RS005450162.145736type II secretion system F family protein
XADLMG695_RS00550-1132.505520type II secretion system ATPase GspE
XADLMG695_RS005550132.846854S8 family serine peptidase
XADLMG695_RS005600142.955686hypothetical protein
XADLMG695_RS00565-2122.510183S8 family serine peptidase
XADLMG695_RS00570-2142.808757phosphoribosylformylglycinamidine synthase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS00500  BCTERIALGSPD  351  e-113  Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 351 bits (901), Expect = e-113
Identities = 157/684 (22%), Positives = 277/684 (40%), Gaps = 106/684 (15%)

Query: 91 ASSGSATFNFEGESVQAVVKAILGDMLGQNYVIAPGVQGTVTLATPNPVSPAQALNLLEM 150
A++ + +F+G +Q + + + L + +I P V+GT+T+ + + ++ Q
Sbjct: 25 AAAEEFSASFKGTDIQEFINTVSKN-LNKTVIIDPSVRGTITVRSYDMLNEEQYYQFFLS 83

Query: 151 VLG-WNNARMVFSGGRYNIVPA-DQALAGTVAPSTASPSAARGFEVRVVPLKFISASEMK 208
VL + A + + G +V + D A S A+P RVVPL ++A ++
Sbjct: 84 VLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLA 143

Query: 209 KVLEPYARPNAIVGTD---PARNVITLGGTRAELENYLRTVQIFDVDWLSGMSVGVFPIQ 265
+L NA VG+ NV+ + G A ++ L V+ VD SV P+
Sbjct: 144 PLLRQL-NDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVE--RVDNAGDRSVVTVPLS 200

Query: 266 SGKAEKVSADLEKVFGEQSKT--PSAGMFRFMPLENANAVLVI---TPQPRYLDQIQQWL 320
A V + ++ + SK+ P + + + E NAVLV + R + I+Q L
Sbjct: 201 WASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIKQ-L 259

Query: 321 DRIDSAGGGVRLFSYELKYIKAKDLADRLSEVFGGGRGNSGGSPSLVPGGVVNMLGNNSG 380
DR + G ++ LKY KA DL + L+ + +
Sbjct: 260 DRQQATQGNTKVIY--LKYAKASDLVEVLTGISSTMQSEKQA-------------AKPVA 304

Query: 381 SADRDESLGSSSGATGGSIGGASDGSSQSGTSGSFGGSNGSGMLQLQPSTNQNGSV---T 437
+ D++ + G + ++ P +
Sbjct: 305 ALDKNIII-------------------------KAHGQTNALIVTAAPDVMNDLERVIAQ 339

Query: 438 LDVEGGKVGVSAVAETNTLIVRATAQAWSSIRDVIEKLDVMPMQVHIEAQIAEVTLTGDL 497
LD+ +V V A+ + E D + + I+ +T
Sbjct: 340 LDIRRPQVLVEAI--------------------IAEVQDADGLNLGIQWANKNAGMTQFT 379

Query: 498 QYGVNWYFENAVTNPFNSDG---SGGPALPSAAGRRIWGDISGSITNNGVAWTFLGKNAA 554
G+ A N +N DG S + S+ G G N A
Sbjct: 380 NSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQG--------------NWA 425

Query: 555 AIISALDQVTNLRLLQTPSVFVRNNAEATLNVGSRIPINSTSINTGLGTDASYSSVQYID 614
+++AL T +L TPS+ +N EAT NVG +P+ + S T D +++V+
Sbjct: 426 MLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTT--SGDNIFNTVERKT 483

Query: 615 TGVILKVRPRVTKDGMVFLDIVQEVSTPGARPAACTAATATTINSAACNVDINTRRVKTE 674
G+ LKV+P++ + V L+I QEVS+ A A + S+ NTR V
Sbjct: 484 VGIKLKVKPQINEGDSVLLEIEQEVSSV---------ADAASSTSSDLGATFNTRTVNNA 534

Query: 675 AAVQNGDTIMLAGLIDDNTSDGSNGIPFLSKLPVVGALFGRKTQNSSRREVIVLITPSIV 734
V +G+T+++ GL+D + SD ++ +P L +PV+GALF ++ S+R +++ I P+++
Sbjct: 535 VLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVI 594

Query: 735 RNPQEARDLTDEYGAKFNAMKPLS 758
R+ E R + FN +
Sbjct: 595 RDRDEYRQASSGQYTAFNDAQSKQ 618


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits         Score  E-value  Comments
XADLMG695_RS00505  IGASERPTASE  32     0.002    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.3 bits (73), Expect = 0.002
Identities = 25/134 (18%), Positives = 40/134 (29%), Gaps = 28/134 (20%)

Query: 160 NGQGGQPPTANAAARGAATGAQPVPPP---------DAAALVPPQPPQPQPV-------A 203
N Q P + A PVPPP + A Q +
Sbjct: 1002 NIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATET 1061

Query: 204 PGQQQQPGGQAPPTVP--PQRSDGAQEAPRPSDDQMRAIRE----------RIEARRRQL 251
Q ++ +A V Q ++ AQ + Q +E ++E + Q
Sbjct: 1062 TAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQE 1121

Query: 252 QQQRQSGSTPGQTQ 265
+ S +P Q Q
Sbjct: 1122 VPKVTSQVSPKQEQ 1135


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS00530  PilS_PF08805  35     4e-05    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 34.9 bits (80), Expect = 4e-05
Identities = 8/55 (14%), Positives = 24/55 (43%), Gaps = 4/55 (7%)

Query: 1 MRHQRGYTLIEVIVAFALLALALSLLLGSLSGAARQVRAADESTRATLHAQSLLA 55
+G TL+EV++ ++ + + S ++ +S+ + +++A
Sbjct: 22 KEQDKGATLMEVLLVVGVIVVLAASAYKLYSMV----QSNIQSSNEQNNVLTVIA 72


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS00535  PilS_PF08805  35     7e-05    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 34.9 bits (80), Expect = 7e-05
Identities = 12/61 (19%), Positives = 26/61 (42%), Gaps = 3/61 (4%)

Query: 23 RGTSLLEMLLVIALIAIAGVLAAAALNG---GIDGMRLRTAGKAIAAQLRYTRTQAIATG 79
+G +L+E+LLV+ +I + A + I + + A ++ + Q T
Sbjct: 26 KGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLTVIANMKSLKFQGRYTD 85

Query: 80 T 80
+
Sbjct: 86 S 86


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS00540  BCTERIALGSPG  136    2e-44    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 136 bits (345), Expect = 2e-44
Identities = 40/132 (30%), Positives = 60/132 (45%), Gaps = 18/132 (13%)

Query: 15 QAGMSLLEIIIVIVLIGAVLTLVGSRVLGGADRGKANLAKSQIQTLAGKIENFQLDTGKL 74
Q G +LLEI++VIV+IG + +LV ++G ++ A S I L ++ ++LD
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHY 66

Query: 75 PSKLDDLVTQPGDSSGWLGPYAKPAELN------------DPWGHAIEYRAPGDGQPFDL 122
P+ T G S P P N DPWG+ PG+ +DL
Sbjct: 67 PT------TNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120

Query: 123 ISLGKDGKPGGS 134
+S G DG+ G
Sbjct: 121 LSAGPDGEMGTE 132


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS00545  BCTERIALGSPF  430    e-152    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 430 bits (1107), Expect = e-152
Identities = 133/411 (32%), Positives = 213/411 (51%), Gaps = 12/411 (2%)

Query: 1 MPLYRYKALDAHGEMLDGQMEAASDAEVALRLQEQGHLPV---ETRLATGENDSPSLRML 57
M Y Y+ALDA G+ G EA S + L+E+G +P+ E R ++ S L L
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLS-L 59

Query: 58 LREKPFDNAALVQFTQQLATLIGAGQPLDRALSILMDLPEDEKSRRVIGDVRDTVRGGAP 117
R+ + L T+QLATL+ A PL+ AL + E +++ VR V G
Sbjct: 60 RRKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHS 119

Query: 118 LSSALERQHGLFSKLYINMVRAGEAGGSMQDTLQRLADYLERSRALRGKVINALIYPAIL 177
L+ A++ G F +LY MV AGE G + L RLADY E+ + +R ++ A+IYP +L
Sbjct: 120 LADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVL 179

Query: 178 LAVVGCALLFLLGYVVPQFAQMYESLDVALPWFTQAVLSVGLLVRDW--WIVLVVVPGVL 235
V + LL VVP+ + + + ALP T+ ++ + VR + W++L ++ G +
Sbjct: 180 TVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFM 239

Query: 236 G--LWLDRKRRNAAFRASLDQWLLRQKVVGSLIARLETARLTRTLGTLLRNGVPLLAAIG 293
+ L +++R +F + LL ++G + L TAR RTL L + VPLL A+
Sbjct: 240 AFRVMLRQEKRRVSF----HRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMR 295

Query: 294 IARNVMSNLALVEDVANAADDVKNGHGLSMSLARGKRFPRLALQMIQVGEESGALDTMLL 353
I+ +VMSN ++ A D V+ G L +L + FP + MI GE SG LD+ML
Sbjct: 296 ISGDVMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLE 355

Query: 354 KTADTFELETAQAIDRALAALVPFITLVLASVVGLVIISVLVPLYDLTNAI 404
+ AD + E + + AL P + + +A+VV +++++L P+ L +
Sbjct: 356 RAADNQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits        Score  E-value  Comments
XADLMG695_RS00555  SUBTILISIN  205    4e-63    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 205 bits (524), Expect = 4e-63
Identities = 104/359 (28%), Positives = 148/359 (41%), Gaps = 69/359 (19%)

Query: 156 PQLVPNDPFYAQYQWHLSNPNGGINAPGAWDLSQGAGVVVAVLDTGILPDHPDFAGNLLQ 215
Q++ + + + I AP W+ ++G GV VAVLDTG DHPD ++
Sbjct: 10 YQVIKQEQQVNEIPRGV----EMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIG 65

Query: 216 GYDFITDAEVSRRPTDARVPGALDYGDWEEADNVCYAGSQAQESSWHGTHVSGTVAEATN 275
G +F D E + + HGTHV+GT+A AT
Sbjct: 66 GRNFTDDDEGDPEIFK--------------------------DYNGHGTHVAGTIA-ATE 98

Query: 276 NGVGMAGVAPKATILPVRVLGRCG-GYTSDIADAIVWASGGSVDGVPTNTNPAEVINMSL 334
N G+ GVAP+A +L ++VL + G G I I +A VD +I+MSL
Sbjct: 99 NENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVD----------IISMSL 148

Query: 335 GGGEPCDSATQLAINGAVSRGTTVVVAAGNSSEDASN----HSPASCNNTITVGATRITG 390
GG E A+ AV+ V+ AAGN + P N I+VGA
Sbjct: 149 GGPEDVP-ELHEAVKKAVASQILVMCAAGNEGDGDDRTDELGYPGCYNEVISVGAINFDR 207

Query: 391 GIAYYSNYGSKVDLSGPGGGGSVDGNPGGYVWQAGYTGATTPTSGSYTYMGLGGTSMASP 450
+ +SN ++VDL PG T Y GTSMA+P
Sbjct: 208 HASEFSNSNNEVDLVAPGED-------------------ILSTVPGGKYATFSGTSMATP 248

Query: 451 HVAGVVALVQSAAIGLGEGPLTPAAVEALLKQTSRPFPVTPPASTPIGSGIVDAKAALE 509
HVAG +AL++ A E LT + A L + + P +P G+G++ A E
Sbjct: 249 HVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNSPKME---GNGLLYLTAVEE 304


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits       Score  E-value  Comments
XADLMG695_RS00560  OMADHESIN  53     1e-08    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 52.6 bits (125), Expect = 1e-08
Identities = 55/154 (35%), Positives = 78/154 (50%), Gaps = 21/154 (13%)

Query: 71 GRGAAAPASKATAIGANSHASATGAVATGANSSASGVNSSAIGRQTNAIGENAVAIGYNS 130
G A+A + AIGA + A+ AVA GA S A+GVNS AIG + A+G++AV G S
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 131 FVRQSG----------ENGVALGANAGVTGANSVALGAGSRTHEDDVVSVGSGNGRGG-- 178
++ G + GVA+G N+ NSVA+G S + S+ G+
Sbjct: 122 TAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDR 181

Query: 179 ---------PATRRITNVTAGVNATDAVNVAQLR 203
R++T++ AG TDAVNVAQL+
Sbjct: 182 ENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLK 215



Score = 52.2 bits (124), Expect = 2e-08
Identities = 64/200 (32%), Positives = 96/200 (48%), Gaps = 13/200 (6%)

Query: 2133 AAAVGSITPAATSTAVGTAAVANHVTGTAIGGSAYAHGPNDTAIGSNARVNADGSTAVGA 2192
A + SI AT+ A AAVA A G ++ A GP A+G +A STA
Sbjct: 67 AKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKD 126

Query: 2193 NTQIAAVATNA---VAMGEGAQVTAASGTAIGQGARATAQG--AVALGQGSVADRANTVS 2247
I A A+ + VA+G ++ A + AIG + A ++A+G S DR N+VS
Sbjct: 127 GVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVS 186

Query: 2248 VGSVGGERQVANVAAGTRATDAVNKGQLDSGVAAANSYTDSRYNAMADSFESYQGDIEDR 2307
+G RQ+ ++AAGT+ TDAVN QL + T+ R + + +Y +
Sbjct: 187 IGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYADN---- 242

Query: 2308 LNKGQLDSGVAAANSYTDSR 2327
+ S + AN+YTDS+
Sbjct: 243 ----KSSSVLGIANNYTDSK 258



Score = 51.8 bits (123), Expect = 3e-08
Identities = 60/172 (34%), Positives = 88/172 (51%), Gaps = 19/172 (11%)

Query: 342 GVGAYAAGTQSSAFGAVANAAGDYATAIGTQTSASGTSSTAVGGPVDYIPGLGFFVQTQA 401
G+ A A G S A GA A AA A A+G + A+G +S A+G ++A
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGP------------LSKA 109

Query: 402 SGEASTALGAGATASGTYTTAVGTLSEASGTEATAVGYFAYAPGEGATAVGPESWASGEL 461
G+++ GA +TA A+G + S T AVG+ + A + + A+G S +
Sbjct: 110 LGDSAVTYGAASTAQKD-GVAIGARASTSDT-GVAVGFNSKADAKNSVAIGHSSHVAANH 167

Query: 462 STALGYYSTARGANSVATRANTVSVGADGAERQITNVAAGTEGTDAVNLDQL 513
YS A G S R N+VS+G + RQ+T++AAGT+ TDAVN+ QL
Sbjct: 168 G-----YSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQL 214



Score = 51.1 bits (121), Expect = 4e-08
Identities = 65/209 (31%), Positives = 101/209 (48%), Gaps = 8/209 (3%)

Query: 1276 GGYSSASGFNSTALGNFSTASGSNTVAVGGDATATGAYSIAAGQGSVASGYNSVSVGGAL 1335
G +SA G +S A+G + A+ VAVG + ATG S+A G S A G ++V+ G A
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 1336 ------LGLLPTEASGDYSTAVGGAAWAPGLNSTALGNFAGSTGEG--SVALGAGSVADR 1387
+ + ++ D AVG + A NS A+G+ + S+A+G S DR
Sbjct: 122 TAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDR 181

Query: 1388 DFAVSVGSAGNERQITNVAAGTQGTDAVNLDQLNAVAEAGAATSKYFQASGSADSDAGAY 1447
+ +VS+G RQ+T++AAGT+ TDAVN+ QL E + A A+++A A
Sbjct: 182 ENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYAD 241

Query: 1448 VDGDNALAAGEGANATGTGTTALGAGAQA 1476
+ L + + T A +A
Sbjct: 242 NKSSSVLGIANNYTDSKSAETLENARKEA 270



Score = 48.7 bits (115), Expect = 2e-07
Identities = 60/182 (32%), Positives = 87/182 (47%), Gaps = 4/182 (2%)

Query: 1125 ATADGDYSSAFGSSSQATAIGAVAIGSGASATAQYANAAGYNAAASGFGSVSNGAFSQAS 1184
A AD ++ Q + A+G A G NA+A G S++ GA ++A+
Sbjct: 23 AFADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAA 82

Query: 1185 GDYAVAVGGESEAAGAQSTALGAAAGAYGDGSLAVGALSEAQGSESTAMGYFASASGESA 1244
AVAVG S A G S A+G + A GD ++ GA S AQ + A+G AS S ++
Sbjct: 83 KGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQ-KDGVAIGARASTS-DTG 140

Query: 1245 TAVGAESVADGTSAAAFGFGAEATSN--YSTALGGYSSASGFNSTALGNFSTASGSNTVA 1302
AVG S AD ++ A G + +N YS A+G S NS ++G+ S +A
Sbjct: 141 VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLA 200

Query: 1303 VG 1304
G
Sbjct: 201 AG 202



Score = 47.6 bits (112), Expect = 5e-07
Identities = 61/197 (30%), Positives = 90/197 (45%), Gaps = 46/197 (23%)

Query: 895 GGYASASGFFATAVGNNSRAVDYYATALGGDSMASGYFSTAVGGSSVASGRGATAMGVDS 954
G ASA G + A+G + A A A+G S+A+G S A+G S A G A G S
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 955 AARSDRDTAVGTESVADGGDSTALGANARADNYYSVALGTYALATGTSATSIGGQSYAPG 1014
A+ D VA+G A + T
Sbjct: 122 TAQKD-----------------------------GVAIGARASTSDT------------- 139

Query: 1015 TESVALGWQSNASGEQSISLGSGAYTPADN--SVALGAGSLADRANTVSVGAAGTERQIA 1072
VA+G+ S A + S+++G ++ A++ S+A+G S DR N+VS+G RQ+
Sbjct: 140 --GVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLT 197

Query: 1073 NVAAGTEGTDAVNLDQL 1089
++AAGT+ TDAVN+ QL
Sbjct: 198 HLAAGTKDTDAVNVAQL 214



Score = 46.8 bits (110), Expect = 9e-07
Identities = 66/250 (26%), Positives = 106/250 (42%), Gaps = 23/250 (9%)

Query: 1631 GFIPARASGTGAAAFGAGAWATADYTTAIGWNSYADGVNASALGQSAAALADNTLALGGG 1690
G + A A G + A GA A A A+G S A GVN+ A+G + AL D+ + G
Sbjct: 61 GGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAA 120

Query: 1691 SRADAVGASAVGVDASATGINSTGVGRQVNAIGENAVSVGYNSFVRQSAVNGVALGANAG 1750
S A G A+G AS + + V+VG+NS + ++
Sbjct: 121 STAQKDGV-AIGARASTS---------------DTGVAVGFNSKADAKNSVAIGHSSHVA 164

Query: 1751 ATGADSVALGSGSRTYEADTVSIGSGNGRGGPATRRIVNVSAGQAATDAVNKGQLDALAA 1810
A S+A+G S+T ++VSIG + R++ +++AG TDAVN QL
Sbjct: 165 ANHGYSIAIGDRSKTDRENSVSIGHES-----LNRQLTHLAAGTKDTDAVNVAQLKKEIE 219

Query: 1811 DVQTTSGMLKTTGDGVASATGDRATAA--GAGATASGARSVAVASGSRASATGASAMGVD 1868
Q + A+A D +++ G + ++S +R A S ++
Sbjct: 220 KTQENTNKRSAELLANANAYADNKSSSVLGIANNYTDSKSAETLENARKEAFAQSKDVLN 279

Query: 1869 SSASGVNSTA 1878
+ + NS A
Sbjct: 280 MAKAHSNSVA 289



Score = 46.0 bits (108), Expect = 1e-06
Identities = 50/144 (34%), Positives = 73/144 (50%), Gaps = 3/144 (2%)

Query: 826 GANAAAADTDSIAVGTYANAYGPRAISLGGQSRATGDDSIALGWGAQAEGEQGIALGAGG 885
G NA+A SIA+G A A A+++G S ATG +S+A+G ++A G+ + GA
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 886 QADAYSTAIGGYASASGFFATAVGNNSRAVDYYATALGGDS--MASGYFSTAVGGSSVAS 943
A AIG AS S AVG NS+A + A+G S A+ +S A+G S
Sbjct: 122 TAQKDGVAIGARASTSD-TGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTD 180

Query: 944 GRGATAMGVDSAARSDRDTAVGTE 967
+ ++G +S R A GT+
Sbjct: 181 RENSVSIGHESLNRQLTHLAAGTK 204



Score = 44.9 bits (105), Expect = 3e-06
Identities = 56/170 (32%), Positives = 78/170 (45%), Gaps = 4/170 (2%)

Query: 607 ASGDASTAVGSASQATANGATALGYESIANGADSTALGVGSVAFGDTSTAVGGASVAFGT 666
A +A Q + N ALG E A G+ + A G S A+G + A
Sbjct: 25 ADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKG 84

Query: 667 DSAAFGANAAAGGTASTAIGANSSAFGERTVALGGASNASGDDSIALGASSQASALGTTA 726
+ A GA + A G S AIG S A G+ V G AS A D +A+GA + S G A
Sbjct: 85 AAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQ-KDGVAIGARASTSDTG-VA 142

Query: 727 VGSNANASIANATAVGFNS--SAGDDYATALGGDSNASGYFSTAVGGTSI 774
VG N+ A N+ A+G +S +A Y+ A+G S S ++G S+
Sbjct: 143 VGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESL 192



Score = 42.6 bits (99), Expect = 2e-05
Identities = 39/141 (27%), Positives = 71/141 (50%)

Query: 1178 GAFSQASGDYAVAVGGESEAAGAQSTALGAAAGAYGDGSLAVGALSEAQGSESTAMGYFA 1237
G + A G +++A+G +EAA + A+GA + A G S+A+G LS+A G + G +
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 1238 SASGESATAVGAESVADGTSAAAFGFGAEATSNYSTALGGYSSASGFNSTALGNFSTASG 1297
+A + S +D A F A+A ++ + + +A+ S A+G+ S
Sbjct: 122 TAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDR 181

Query: 1298 SNTVAVGGDATATGAYSIAAG 1318
N+V++G ++ +AAG
Sbjct: 182 ENSVSIGHESLNRQLTHLAAG 202



Score = 41.0 bits (95), Expect = 5e-05
Identities = 45/139 (32%), Positives = 78/139 (56%), Gaps = 11/139 (7%)

Query: 1566 AAFGGYSESTGRLSSALGYGAVASSDYSTAVGAVALASGASAVAVGEFSEAIGDESVAVG 1625
A G + + G S A+G A A+ + AVGA ++A+G ++VA+G S+A+GD +V G
Sbjct: 59 GAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYG 118

Query: 1626 GSTFFGFIPARASGTGAAAFGAGAWATADYTTAIGWNSYADGVNASALGQSAAALADN-- 1683
++ + A GA A +T+D A+G+NS AD N+ A+G S+ A++
Sbjct: 119 AAS--------TAQKDGVAIGARA-STSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGY 169

Query: 1684 TLALGGGSRADAVGASAVG 1702
++A+G S+ D + ++G
Sbjct: 170 SIAIGDRSKTDRENSVSIG 188



Score = 41.0 bits (95), Expect = 6e-05
Identities = 42/129 (32%), Positives = 67/129 (51%), Gaps = 4/129 (3%)

Query: 756 GGDSNASGYFSTAVGGTSIANGRGATAIGYETIGNGTASTALGFASVAWGDGGTAIGTES 815
G +++A G S A+G T+ A A A+G +I G S A+G S A GD G S
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 816 LAYGDNSTAVGANAAAADTDSIAVGTYANAYGPRAISLGGQSRATGDD--SIALGWGAQA 873
A D A+GA A+ +DT +AVG + A ++++G S + SIA+G ++
Sbjct: 122 TAQKD-GVAIGARASTSDT-GVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKT 179

Query: 874 EGEQGIALG 882
+ E +++G
Sbjct: 180 DRENSVSIG 188



Score = 40.7 bits (94), Expect = 7e-05
Identities = 48/146 (32%), Positives = 72/146 (49%), Gaps = 10/146 (6%)

Query: 1827 ASATGDRATAAGAGATASGARSVAVASGSRASATGASAMGVDSSASGVNSTAMGRQTNSI 1886
A A A A GAG+ A+G SVA+ S+A A G S+A R + S
Sbjct: 79 AEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARASTS- 137

Query: 1887 GENGVALGYNSFVRESGSNAVALGANAGASGADSVALGSGSRTYEANTVSVGSGNGRGGP 1946
+ GVA+G+NS S A+ ++ A+ S+A+G S+T N+VS+G +
Sbjct: 138 -DTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHES----- 191

Query: 1947 ATRRIVNVGAGTIASASTDAINGGQL 1972
R++ ++ AGT TDA+N QL
Sbjct: 192 LNRQLTHLAAGT---KDTDAVNVAQL 214



Score = 40.7 bits (94), Expect = 7e-05
Identities = 48/149 (32%), Positives = 71/149 (47%), Gaps = 11/149 (7%)

Query: 544 AAGSNALADSDYSTALGSSSAASAQGATAVGSGANATTDNATAVGFNSTAVAQNTTALGG 603
A G NA A +S A+G+++ A+ A AVG+G+ AT N+ A+G S A+ + G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 604 NSSASGDASTAVGSASQATANGATALGYESIANGADSTALGVG---------SVAFGDTS 654
S+A D AS T++ A+G+ S A+ +S A+G S+A GD S
Sbjct: 120 ASTAQKDGVAIGARAS--TSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRS 177

Query: 655 TAVGGASVAFGTDSAAFGANAAAGGTAST 683
SV+ G +S A GT T
Sbjct: 178 KTDRENSVSIGHESLNRQLTHLAAGTKDT 206



Score = 40.7 bits (94), Expect = 8e-05
Identities = 37/103 (35%), Positives = 59/103 (57%), Gaps = 4/103 (3%)

Query: 1836 AAGAGATASGARSVAVASGSRASATGASAMGVDSSASGVNSTAMGRQTNSIGENGVALGY 1895
A G A+A G S+A+ + + A+ A A+G S A+GVNS A+G + ++G++ V G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 1896 NSFVRESGSNAVALGANAGASGADSVALGSGSRTYEANTVSVG 1938
S ++ G VA+GA A S VA+G S+ N+V++G
Sbjct: 120 ASTAQKDG---VAIGARASTSDT-GVAVGFNSKADAKNSVAIG 158



Score = 37.6 bits (86), Expect = 7e-04
Identities = 53/182 (29%), Positives = 83/182 (45%), Gaps = 18/182 (9%)

Query: 549 ALADSDYSTALGSSSAASAQGATAVGSGANATTDNATAVGFNSTAVAQNTTALGGNSSAS 608
A AD ++ S A+G A G N++A ++ A+G + A+
Sbjct: 23 AFADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAA 82

Query: 609 GDASTAVGSASQATANGATALGYESIANGADSTALGVGSVAFGDTSTAVGGASVAFGTDS 668
A+ AVG+ SIA G +S A+G S A GD++ G AS A D
Sbjct: 83 KGAAVAVGAG--------------SIATGVNSVAIGPLSKALGDSAVTYGAASTA-QKDG 127

Query: 669 AAFGANAAAGGTASTAIGANSSAFGERTVALGGASNASGDD--SIALGASSQASALGTTA 726
A GA A+ T A+G NS A + +VA+G +S+ + + SIA+G S+ + +
Sbjct: 128 VAIGARASTSDTG-VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVS 186

Query: 727 VG 728
+G
Sbjct: 187 IG 188



Score = 37.2 bits (85), Expect = 0.001
Identities = 36/130 (27%), Positives = 64/130 (49%)

Query: 1455 AAGEGANATGTGTTALGAGAQAVVDNATAVGVGALAGGTGAAALGSNAQAVGENSSAVGS 1514
A G A+A G + A+GA A+A A AVG G++A G + A+G ++A+G+++ G+
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 1515 NALASDIGATANGAGAQAISTYTTALGSEAVASDNQAIAAGFRSTASSVGSAAFGGYSES 1574
+ A G + + + S+A A ++ AI A+ S A G S++
Sbjct: 120 ASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKT 179

Query: 1575 TGRLSSALGY 1584
S ++G+
Sbjct: 180 DRENSVSIGH 189



Score = 36.4 bits (83), Expect = 0.002
Identities = 52/180 (28%), Positives = 78/180 (43%), Gaps = 2/180 (1%)

Query: 705 ASGDDSIALGASSQASALGTTAVGSNANASIANATAVGFNSSAGDDYATALGGDSNASGY 764
A D I + Q S A+G A G N+SA ++ A+G + A+
Sbjct: 25 ADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKG 84

Query: 765 FSTAVGGTSIANGRGATAIGYETIGNGTASTALGFASVAWGDGGTAIGTESLAYGDNSTA 824
+ AVG SIA G + AIG + G ++ G AS A DG AIG + D A
Sbjct: 85 AAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDG-VAIGARAST-SDTGVA 142

Query: 825 VGANAAAADTDSIAVGTYANAYGPRAISLGGQSRATGDDSIALGWGAQAEGEQGIALGAG 884
VG N+ A +S+A+G ++ S+ R+ D ++ G ++ Q L AG
Sbjct: 143 VGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAG 202



Score = 35.6 bits (81), Expect = 0.002
Identities = 45/142 (31%), Positives = 73/142 (51%), Gaps = 4/142 (2%)

Query: 1113 AQGEDATAAGSNATADGDYSSAFGSSSQATAIGAVAIGSGASATAQYANAAGYNAAASGF 1172
+ A G NA+A G +S A G++++A AVA+G+G+ AT + A G + A G
Sbjct: 53 VRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGD 112

Query: 1173 GSVSNGAFSQASGDYAVAVGGESEAAGAQSTALGAAAGAYGDGSLAVGALSE--AQGSES 1230
+V+ GA S A D VA+G + + A+G + A S+A+G S A S
Sbjct: 113 SAVTYGAASTAQKD-GVAIGARASTSDT-GVAVGFNSKADAKNSVAIGHSSHVAANHGYS 170

Query: 1231 TAMGYFASASGESATAVGAESV 1252
A+G + E++ ++G ES+
Sbjct: 171 IAIGDRSKTDRENSVSIGHESL 192



Score = 32.9 bits (74), Expect = 0.016
Identities = 44/149 (29%), Positives = 68/149 (45%), Gaps = 20/149 (13%)

Query: 236 AAGDGANATGTATTALGTGANAVANNATAVGANALASGQNSAAFGHNAQANGPASVAVGG 295
A G A+A G + A+G A A A AVGA ++A+G NS A GP S A+G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAI-------GPLSKALGD 112

Query: 296 AAVDEDGEPLVTNGGVPVTTGATSAGVGGTAVGASANADGFAASSFGVGAYAAGTQSSAF 355
+AV GV + A+++ G AVG ++ AD + + G ++ A
Sbjct: 113 SAVTYGAASTAQKDGVAIGARASTSDT-GVAVGFNSKADAKNSVAIGHSSHVA------- 164

Query: 356 GAVANAAGDYATAIGTQTSASGTSSTAVG 384
A Y+ AIG ++ +S ++G
Sbjct: 165 -----ANHGYSIAIGDRSKTDRENSVSIG 188



Score = 32.6 bits (73), Expect = 0.021
Identities = 44/133 (33%), Positives = 65/133 (48%), Gaps = 4/133 (3%)

Query: 1105 GTGTGTADAQGEDATAAGSNATADGDYSSAFGSSSQATAIGAVAIGSGASATAQYANAAG 1164
G G A A+G + A G+ A A + A G+ S AT + +VAIG + A A G
Sbjct: 59 GAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYG 118

Query: 1165 YNAAASGFGSVSNGAFSQASGDYAVAVGGESEAAGAQSTALGAAA--GAYGDGSLAVGAL 1222
+ A G V+ GA + S D VAVG S+A S A+G ++ A S+A+G
Sbjct: 119 AASTAQKDG-VAIGARASTS-DTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDR 176

Query: 1223 SEAQGSESTAMGY 1235
S+ S ++G+
Sbjct: 177 SKTDRENSVSIGH 189



Score = 32.2 bits (72), Expect = 0.026
Identities = 51/171 (29%), Positives = 72/171 (42%), Gaps = 4/171 (2%)

Query: 535 AIAQGVDSVAAGSNALADSDYSTALGSSSAASAQGATAVGSGANATTDNATAVGFNSTAV 594
A A D + + + ALG A G A+A ++ A+G + A
Sbjct: 23 AFADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAA 82

Query: 595 AQNTTALGGNSSASGDASTAVGSASQATANGATALGYESIANGADSTALGVGSVAFGDTS 654
A+G S A+G S A+G S+A + A G S A D A+G + DT
Sbjct: 83 KGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQ-KDGVAIGARAST-SDTG 140

Query: 655 TAVGGASVAFGTDSAAFG--ANAAAGGTASTAIGANSSAFGERTVALGGAS 703
AVG S A +S A G ++ AA S AIG S E +V++G S
Sbjct: 141 VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHES 191


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits        Score  E-value  Comments
XADLMG695_RS00565  SUBTILISIN  204    1e-62    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 204 bits (520), Expect = 1e-62
Identities = 102/379 (26%), Positives = 146/379 (38%), Gaps = 80/379 (21%)

Query: 139 QVDLRMYPLQASGALPNDPLLQTNQWHLIDPVGGINVAQAWKTTQGEGVVVAVLDTGILP 198
+V + Y + + + + + I W T+G GV VAVLDTG
Sbjct: 4 KVHIIPYQV-----IKQEQQVNEIPRGV----EMIQAPAVWNQTRGRGVKVAVLDTGCDA 54

Query: 199 DHPDLAGNLLAGYDFITDPFFSRRATAERVPGALDLGDWIAEDGDCGLFSVASDSSWHGT 258
DHPDL ++ G +F D D G + D + HGT
Sbjct: 55 DHPDLKARIIGGRNFT--------------------------DDDEGDPEIFKDYNGHGT 88

Query: 259 HVAGTVAEATNNGIGGAGVAYRAKVLPVRVLGHCG-GRLSDISDAIVWASGGHVDGVPDN 317
HVAGT+A AT N G GVA A +L ++VL G G+ I I +A VD
Sbjct: 89 HVAGTIA-ATENENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVD----- 142

Query: 318 RDPAEVINLSLGGGGACGSTMQAAIDGAVARGTAVVVAAGNSTADVSTMA----PANCAN 373
+I++SLGG + A+ AVA V+ AAGN P
Sbjct: 143 -----IISMSLGGPED-VPELHEAVKKAVASQILVMCAAGNEGDGDDRTDELGYPGCYNE 196

Query: 374 VIAVAATRATGGLADYSNFGRQIDLAGPGGSSMSFVTNDGPIRSFVWQTLYTGKTTPTSG 433
VI+V A +++SN ++DL PG + T+ GK S
Sbjct: 197 VISVGAINFDRHASEFSNSNNEVDLVAPG--------------EDILSTVPGGKYATFS- 241

Query: 434 QFTYGGTHYAGTSMASPHVAGTAALVQSALIADGKPPLSPAAMESLLKRTARPFPVSIPV 493
GTSMA+PHVAG AL++ A + L+ + + L + P S
Sbjct: 242 ----------GTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNS--- 288

Query: 494 ATPAGAGIVDAGAAVARAL 512
G G++ A +
Sbjct: 289 PKMEGNGLLYLTAVEELSR 307


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits       Score  E-value  Comments
XADLMG695_RS00580  OMADHESIN  31     0.044    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 30.6 bits (68), Expect = 0.044
Identities = 45/176 (25%), Positives = 66/176 (37%), Gaps = 27/176 (15%)

Query: 632 PKMHRDAAHPAAPQWPVLQTASLDLQQAGLRVLA--HPTVASKSFLVTIGDRSVGGLTAR 689
P + +P P PV L+ G+ +A A+K V +G S+
Sbjct: 45 PALG--LEYPVRP--PVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIA-TGVN 99

Query: 690 EQMIGPWQLPLADCAITLAGFDTFEGEAMSIGERTPLALLNAAASARMAVGEAITNLCAA 749
IGP L D A+T T + + ++IG R + + +AVG
Sbjct: 100 SVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARA------STSDTGVAVG--------- 144

Query: 750 PVQRLDSIKLSANWMAAAGHSGEDALLYDAVRAIGMELCPALELSVPVGKDSLSMQ 805
+S K A A GHS A + AIG E SV +G +SL+ Q
Sbjct: 145 ----FNS-KADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQ 195
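
The per-hit summaries in these reports ("Score = ... bits (...), Expect = ..." followed by "Identities = a/b (p%)") can be pulled out mechanically. The following is a minimal Python sketch, not part of PredictBias itself; the function names `parse_evalue` and `parse_hsps` are hypothetical. It assumes the legacy BLAST text layout shown on this page, including the convention that an E-value printed as "e-113" stands for 1e-113.

```python
import re

# Patterns for the two summary lines that open each alignment block.
HSP_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)")


def parse_evalue(text):
    """Convert a printed E-value to float.

    Legacy BLAST text output drops the leading mantissa for values
    such as 1e-113 and prints "e-113", which float() rejects.
    """
    text = text.rstrip(",")
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hsps(report):
    """Return one dict per 'Score = ...' line found in the report text."""
    hsps = []
    for line in report.splitlines():
        m = HSP_RE.search(line)
        if m:
            hsps.append({
                "bits": float(m.group(1)),
                "raw_score": int(m.group(2)),
                "evalue": parse_evalue(m.group(3)),
            })
            continue
        m = IDENT_RE.search(line)
        if m and hsps:
            # Attach identities (matches, alignment length) to the latest hit.
            hsps[-1]["identities"] = (int(m.group(1)), int(m.group(2)))
    return hsps
```

Filtering the returned list on `evalue` against the page's RPSBlast threshold (0.05) would reproduce the cut-off applied to these hits, assuming the threshold is applied per alignment.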


58  XADLMG695_RS01935  XADLMG695_RS01975  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS01935  0       15       1.051130   response regulator
XADLMG695_RS01940  1       13       0.867977   ABC transporter permease
XADLMG695_RS01945  1       12       0.090410   cell division ATP-binding protein FtsE
XADLMG695_RS01950  0       11       0.151203   ATP-dependent RNA helicase RhlB
XADLMG695_RS01955  -2      11       -0.758206  thioredoxin TrxA
XADLMG695_RS01960  -1      11       -0.974121  transcription termination factor Rho
XADLMG695_RS01965  -2      10       -1.095025  hypothetical protein
XADLMG695_RS01970  -2      11       -1.328381  bifunctional isocitrate dehydrogenase
XADLMG695_RS01975  -2      13       -0.970786  LysR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits    Score  E-value  Comments
XADLMG695_RS01935  HTHFIS  58     1e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 57.9 bits (140), Expect = 1e-11
Identities = 31/115 (26%), Positives = 46/115 (40%), Gaps = 12/115 (10%)

Query: 140 GATVLYIEDSRVVAEATKRMLERQSLKVVHVLTAEDAFALLTAESLGRTERRIDVVLTDV 199
GAT+L +D + + L R V A + + A D+V+TDV
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-------GDLVVTDV 55

Query: 200 TLKGELNGRDVVGRIRIDFAYGKRRLPVLVMTGDTNPRNQSELLRAGANDLVQKP 254
+ E N D++ RI+ LPVLVM+ + GA D + KP
Sbjct: 56 VMPDE-NAFDLLPRIKKARP----DLPVLVMSAQNTFMTAIKASEKGAYDYLPKP 105



Score = 51.0 bits (122), Expect = 2e-09
Identities = 22/88 (25%), Positives = 39/88 (44%), Gaps = 4/88 (4%)

Query: 12 DAPRVMVVDGSKLVRKLIADVLKRDLPNVQVIGCSNIAEARQALEAGAVDLVTTSLSLPD 71
++V D +R ++ L R V SN A + + AG DLV T + +PD
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRA--GYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 72 GDGLTLARSVRETAGQAYVPVIVVSGDA 99
+ L +++ + +PV+V+S
Sbjct: 60 ENAFDLLPRIKKA--RPDLPVLVMSAQN 85


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits     Score  E-value  Comments
XADLMG695_RS01940  PF01540  37     5e-05    Adhesin lipoprotein
		>PF01540#Adhesin lipoprotein

Length = 475

Score = 37.4 bits (86), Expect = 5e-05
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 13/102 (12%)

Query: 34 MRKPWATLLTIVVMALALALPLGLSIALDNVKLLAGSVQQSREINLFLKVDVAADAAQAL 93
M+K +T+ +A LP+ +I+ ++ KL E N K D A A AL
Sbjct: 1 MKKSKKIFITLCGIAATAVLPIA-TISCNDDKL--------AEKNGKEKADAALKQANAL 51

Query: 94 AGELRARPDVAKVTLRTPEQGLAELRESAKLDEAADALGDNP 135
A EL+ PD +K+ L T + +AE +S K A + GD P
Sbjct: 52 AEELKKNPDYSKI-LETLNKEIAEATKSFK---EAGSYGDYP 89


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits     Score  E-value  Comments
XADLMG695_RS01950  cloacin  30     0.026    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.1 bits (67), Expect = 0.026
Identities = 23/66 (34%), Positives = 26/66 (39%), Gaps = 6/66 (9%)

Query: 433 GGGRGGPGGGSRSGSGGGRRDGAGADGKPRPRRKPRVEGQAPATSAPSA-TPVVAAAAVE 491
GGG G GG SGGG G P V PA S P A V+ +A
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAP-----VAFGFPALSTPGAGGLAVSISAGA 111

Query: 492 ASSTIA 497
S+ IA
Sbjct: 112 LSAAIA 117


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits     Score  E-value  Comments
XADLMG695_RS01960  FLGFLIH  31     0.008    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 31.3 bits (70), Expect = 0.008
Identities = 19/59 (32%), Positives = 29/59 (49%), Gaps = 3/59 (5%)

Query: 96 EGGQSQQQRFNQAQQQQNQTQGQNQG--QGQQQGQNQNQNAGQNQQGQGGQGQGQNQQG 152
E S +Q+ Q Q Q ++ QG G +G+QQG Q G Q + G + ++QQ
Sbjct: 35 EAEPSLEQQLAQLQMQAHE-QGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQA 92


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits     Score  E-value  Comments
XADLMG695_RS01965  PF03544  38     2e-05    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 38.0 bits (88), Expect = 2e-05
Identities = 18/92 (19%), Positives = 30/92 (32%), Gaps = 19/92 (20%)

Query: 136 PPRYPEAAFRAGATGVVYLMLKIGRDGKVADLIAEQVNLTSLVPESKRARLRQVFADAAS 195
P+YP A G V + + DG+V ++ + A+ +F
Sbjct: 164 QPQYPARAQALRIEGQVKVKFDVTPDGRVDNV------------QILSAKPANMFEREVK 211

Query: 196 KKARTWTFLPPTEGPEVEAPYWVMRVPVSFDI 227
R W + P G + V + F I
Sbjct: 212 NAMRRWRYEPGKPGSGI-------VVNILFKI 236


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits     Score  E-value  Comments
XADLMG695_RS01975  PF09025  29     0.016    YopR Core
		>PF09025#YopR Core

Length = 143

Score = 28.8 bits (64), Expect = 0.016
Identities = 19/68 (27%), Positives = 27/68 (39%), Gaps = 12/68 (17%)

Query: 38 QMRALEQRLG--YPLLQRHARGVTATPQGQQLLDRIAPHLDAIA----------EAFEPF 85
Q+ A EQ LG P R G+ G++LL R A L + A P
Sbjct: 28 QVLAFEQALGGEPPAAGRRLAGLENGALGERLLQRFAQPLQGLEADRLELKAMLRAELPL 87

Query: 86 GARREDTL 93
G +++ L
Sbjct: 88 GRQQQTFL 95


59  XADLMG695_RS02005  XADLMG695_RS02020  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS02005  1       17       1.377033   efflux RND transporter periplasmic adaptor
XADLMG695_RS02010  1       14       0.995384   efflux RND transporter permease subunit
XADLMG695_RS02015  1       14       0.796646   efflux RND transporter permease subunit
XADLMG695_RS02020  1       12       0.858200   VOC family protein
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits       Score  E-value  Comments
XADLMG695_RS02010  RTXTOXIND  54     7e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 53.7 bits (129), Expect = 7e-10
Identities = 30/149 (20%), Positives = 57/149 (38%), Gaps = 22/149 (14%)

Query: 64 ASALGTVTAL-NTVTVSPQVGGQLMSLNFKEGQEVKKGDLLAQIDPRT-------LQASY 115
A+A G +T + + P + + KEG+ V+KGD+L ++ Q+S
Sbjct: 84 ATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSL 143

Query: 116 DQALAAKRQNQALLA---TSRVNYQRSNDPAYKQYVS-----------RTDLDTQRNQVA 161
QA + + Q L +++ + D Y Q VS + T +NQ
Sbjct: 144 LQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKY 203

Query: 162 QYEAAVAANDAQMRSAQVQLQFTRVTAPI 190
Q E + A+ + ++ + +
Sbjct: 204 QKELNLDKKRAERLTVLARINRYENLSRV 232



Score = 35.2 bits (81), Expect = 4e-04
Identities = 22/177 (12%), Positives = 64/177 (36%), Gaps = 29/177 (16%)

Query: 93 EGQEVKKGDLLAQIDPRTLQASYDQ-------ALAAKR----------QNQALLATSRVN 135
+ + ++ +LA+I+ + ++ +L K+ +N+ + A + +
Sbjct: 210 DKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELR 269

Query: 136 YQRSNDPAYKQYVSRTDLD-TQRNQVAQYEAAVAANDAQMRSAQV---------QLQFTR 185
+S + + + Q+ + E + + Q +
Sbjct: 270 VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASV 329

Query: 186 VTAPIDGIAGIRGV-DVGNIVTSSSTIVTLT-QIRPIYVSFNLPERELQAVRTGQTA 240
+ AP+ V G +VT++ T++ + + + V+ + +++ + GQ A
Sbjct: 330 IRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNA 386


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS02015  ACRIFLAVINRP  734    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 734 bits (1896), Expect = 0.0
Identities = 298/1072 (27%), Positives = 497/1072 (46%), Gaps = 65/1072 (6%)

Query: 4 STIFIRRPIATSLLMAGVLLLGILGYRQLPVSALPEIDAPSLVVTTQYPGANATTMASLV 63
+ FIRRPI +L +++ G L QLPV+ P I P++ V+ YPGA+A T+ V
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 64 TTPLERQFGQISGLQMMTSDS-SAGLSTIILQFSMERDIDIASQDVQAAIRQAT--LPSS 120
T +E+ I L M+S S SAG TI L F D DIA VQ ++ AT LP
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 121 LPYQPVYNRVNPADAAILTLKLTSDS--LPLREVNRYADAILAQRLSQVPGVGLVSIAGN 178
+ Q + + + ++ SD+ +++ Y + + LS++ GVG V + G
Sbjct: 122 VQQQGIS-VEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 179 VRPAVRIQVNPAQLSNMGLTMESLRSALTQTNVSAPKGSLN------GKTQSYSIGTNDQ 232
A+RI ++ L+ LT + + L N G L G+ + SI +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 233 LTDAAQYRETII-SYKDGRPVRLADVANVVDGVENDQLAAWADGKQAVLLEIRRQPGANI 291
+ ++ + + DG VRL DVA V G EN + A +GK A L I+ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 292 VQTVEQIRNILPQLRSVLPADVHLEVFSDRTETIRASVHEVKFTLVLTIALVVAVIFVFL 351
+ T + I+ L +L+ P + + D T ++ S+HEV TL I LV V+++FL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 352 RRLWATIIPSVAVPLSLAGTFGVMAFAGMSLDNLSLMALVVATGFVVDDAIVMIENIVRY 411
+ + AT+IP++AVP+ L GTF ++A G S++ L++ +V+A G +VDDAIV++EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 412 IEQGKSGP-EAAEIGAKQIGFTVLSLTVSLVAVFLPLLLMPGVTGRLFHEFAWVLSIAVV 470
+ + K P EA E QI ++ + + L AVF+P+ G TG ++ +F+ + A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 471 ISMLVSLTLTPMMCAYLLKPDALPEGEDAHERATAAGKTNLWTRTVGAYERSLDWVLAHQ 530
+S+LV+L LTP +CA LLKP + E ++ + +V Y S+ +L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHE--NKGGFFGWFNTTFDHSVNHYTNSVGKILGST 537

Query: 531 PLTLAVAIGAVALTVVLYVAIPKGLLPEQDTGLITGVVQADQNVAFPQMEQRTQAVAAAL 590
L + VA VVL++ +P LPE+D G+ ++Q + ++ V
Sbjct: 538 GRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYY 597

Query: 591 RKDPA--VTGVAAFIGAGTMNPTLNQGQLSIVLKTRGEREG----LDEVLPRLQKAVAGI 644
K+ V V G N G + LK ER G + V+ R + + I
Sbjct: 598 LKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKI 657

Query: 645 PGVALFLKPVQDV-TLDTRVAATEYQYSISDVDSSELATWAGRMTESMRKLP-ELADVDN 702
+ + + L T + + L ++ + P L V
Sbjct: 658 RDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 703 NLANQGRALELSIDRDKASMLGVPMQTIDDTLYDSFGQRQISTIFTELNQYRVVLEVAPE 762
N +L +D++KA LGV + I+ T+ + G ++ ++ ++ +
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 763 FRTSTALMNQLAVASNGSGALTGTNATSFGQVTSSNSSTATGVGAQNTGIVVGAGSIIPL 822
FR +++L V S G ++P
Sbjct: 778 FRMLPEDVDKLYVRSA-------------------------------------NGEMVPF 800

Query: 823 AALAEAKVTNTPLVVSHQQQLPAVTISFNLAPGHSLSQAVAAIEKAREELKMPTQVHAQF 882
+A + + LP++ I APG S A+A +E K+P + +
Sbjct: 801 SAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLAS--KLPAGIGYDW 858

Query: 883 VGKAAEFTGSQTDIVWLLLASIVVIYIVLGVLYESYIHPLTIISTLPPAGVGALLALMLC 942
G + + S L+ S VV+++ L LYES+ P++++ +P VG LLA L
Sbjct: 859 TGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLF 918

Query: 943 GLSLSVDGIVGIVLLIGIVKKNAIMMIDFAIDA-RREGASAHDAIRRACLLRFRPIMMTT 1001
V +VG++ IG+ KNAI++++FA D +EG +A A +R RPI+MT+
Sbjct: 919 NQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTS 978

Query: 1002 AAAMLGALPLALGTGIGSELRRPLGIAIVGGLLLSQLVTLYTTPVIYLYMER 1053
A +LG LPLA+ G GS + +GI ++GG++ + L+ ++ PV ++ + R
Sbjct: 979 LAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030



Score = 76.4 bits (188), Expect = 4e-16
Identities = 58/319 (18%), Positives = 117/319 (36%), Gaps = 14/319 (4%)

Query: 747 FTELNQYRVVLEVAPEFRTSTALMNQLAVASNGS-GALTGTNATSFGQVTSSNSSTATGV 805
LN+Y++ L Q + G G + +
Sbjct: 190 ADLLNKYKLTPV-----DVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPE 244

Query: 806 GAQNTGIVVGA-GSIIPLAALAEAKVT--NTPLVVSHQQQLPAVTISFNLAPGHSLSQAV 862
+ V + GS++ L +A ++ N ++ + PA + LA G +
Sbjct: 245 EFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLATGANALDTA 303

Query: 863 AAIEKAREELK--MPTQVHAQFVGKAAEF-TGSQTDIVWLLLASIVVIYIVLGVLYESYI 919
AI+ EL+ P + + F S ++V L +I+++++V+ + ++
Sbjct: 304 KAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMR 363

Query: 920 HPLTIISTLPPAGVGALLALMLCGLSLSVDGIVGIVLLIGIVKKNAIMMIDFAIDARRE- 978
L +P +G L G S++ + G+VL IG++ +AI++++ E
Sbjct: 364 ATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMED 423

Query: 979 GASAHDAIRRACLLRFRPIMMTTAAAMLGALPLALGTGIGSELRRPLGIAIVGGLLLSQL 1038
+A ++ ++ +P+A G + R I IV + LS L
Sbjct: 424 KLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVL 483

Query: 1039 VTLYTTPVIYLYMERAGER 1057
V L TP + + +
Sbjct: 484 VALILTPALCATLLKPVSA 502


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS02020  ACRIFLAVINRP  751    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 751 bits (1941), Expect = 0.0
Identities = 287/1030 (27%), Positives = 489/1030 (47%), Gaps = 26/1030 (2%)

Query: 7 FIKRPIGTSLLAIGLFVIGLMCYLRLGVAALPNIQIPIIFVHATQSGADASTMASTVTAP 66
FI+RPI +LAI L + G + L+L VA P I P + V A GADA T+ TVT
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 67 LERHLGQLPGIDRMRSSS-SESSSLVVLVFQSSRNIDSAAQDIQTAINASQSDLPSGLGT 125
+E+++ + + M S+S S S + L FQS + D A +Q + + LP +
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 126 PMYSKANPNDDPVIAIALTSET--QSADELYNVADSLLAQRLRQITGISSVDIAGASTPA 183
S + ++ S+ + D++ + S + L ++ G+ V + GA A
Sbjct: 125 QGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY-A 183

Query: 184 VRVDVDLRALNALGLTPDDLRNAVRAANVTSPTGFL------SDGNTTMAIISNDSVSKA 237
+R+ +D LN LTP D+ N ++ N G L +II+
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNP 243

Query: 238 ADFAQLAISTQSNGRIVRLGDVATVYDGQQDAYQAAWFNGKPAVVMYAFTRAGANIVETV 297
+F ++ + S+G +VRL DVA V G ++ A NGKPA + GAN ++T
Sbjct: 244 EEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTA 303

Query: 298 DQVKAQIPELRSYLQPGTTLTPYFDRTPTIRASLHEVQATLMISLAMVILTMALFLRRLA 357
+KA++ EL+ + G + +D TP ++ S+HEV TL ++ +V L M LFL+ +
Sbjct: 304 KAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMR 363

Query: 358 PTLIAAVTVPLSLAGSALVMYVLGFTLNNLSLLALVIAIGFVVDDAIVVIENVMRHL-DE 416
TLI + VP+ L G+ ++ G+++N L++ +V+AIG +VDDAIVV+ENV R + ++
Sbjct: 364 ATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMED 423

Query: 417 GMSRLDAALAGAREIGFTIVSITASLVAVFIPMLFASGMIGAFFREFTVTLVAAIVVSML 476
+ +A +I +V I L AVFIPM F G GA +R+F++T+V+A+ +S+L
Sbjct: 424 KLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVL 483

Query: 477 VSLTLTPALCSRFLSPHTEP--EKPGRFGAWLDRMHERMLRVYTVALDFSLRHALLLSLT 534
V+L LTPALC+ L P + E G F W + + + YT ++ L L
Sbjct: 484 VALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLI 543

Query: 535 PLLLIAATIFLGSAVKKGSFPAQDTGLIWGRANSSATVSFADMVSRQRRITDMLMADP-- 592
L++A + L + P +D G+ A + ++TD + +
Sbjct: 544 YALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKA 603

Query: 593 ---AVKTVGARLGSGRQGSSASFNIELKKRDE--GRRDTTAEVVARLSAKADRYPDLDLR 647
+V TV SG+ ++ + LK +E G ++ V+ R + + D
Sbjct: 604 NVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR--DGF 661

Query: 648 LRAIQDLPSDGGGGTSQGAQYRVSLQGNDLAQLQEWLPKLQAALKKNP-HLRDVGTDVDT 706
+ G + + G L + +L ++P L V +
Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721

Query: 707 AGLRQNIVIDRAKAARLGISVGAIDGALYGAFGQRSISTIYSDLNQYSVVVNALPSQTAT 766
+ + +D+ KA LG+S+ I+ + A G ++ + V A
Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781

Query: 767 PKALDQIFVPNRAGQMVPITAVATQAPGLAPPQIIHENQYTTMDLSYNLAPGVSTGEADL 826
P+ +D+++V + G+MVP +A T P++ N +M++ APG S+G+A
Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841

Query: 827 IIKSTVQGLRMPDGIRLS-GDDSFNVQHSPNSMGILLLAAVLTVYIVLGMLYESLIHPVT 885
++++ ++P GI S+ + S N L+ + + V++ L LYES PV+
Sbjct: 842 LMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 886 ILSTLPAAGVGALLALFITNTELSVISMIALVLLIGIVKKNAIMMIDFALVAQRVHGMDA 945
++ +P VG LLA + N + V M+ L+ IG+ KNAI++++FA G
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 946 RAAAREASIVRFRPIMMTTMVAILAAVPLAVGLGEGSELRRPLGIAMIGGLVFSQSLTLL 1005
A A +R RPI+MT++ IL +PLA+ G GS + +GI ++GG+V + L +
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 1006 STPALYVIFS 1015
P +V+
Sbjct: 1020 FVPVFFVVIR 1029



Score = 109 bits (274), Expect = 3e-26
Identities = 81/506 (16%), Positives = 165/506 (32%), Gaps = 31/506 (6%)

Query: 2 NISAPFIKRPIGTSLLAIGLFVIGLMCYLRLGVAALPNIQIPIIFVHA-TQSGADASTMA 60
N + L+ + ++ +LRL + LP + +GA
Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQ 587

Query: 61 STVT----------APLERHLGQLPGIDRMRSSSSESSSLVVLVFQSSRNIDS-AAQDIQ 109
+ + + G + + + V L RN D +A+ +
Sbjct: 588 KVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVI 647

Query: 110 TAINASQSDLPSGLGTPMYSKANPNDDPVIAIALTSETQSA-----DELYNVADSLLAQR 164
+ G P + A E D L + LL
Sbjct: 648 HRAKMELGKIRDGFVIPF--NMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMA 705

Query: 165 LRQITGISSVDIAG-ASTPAVRVDVDLRALNALGLTPDDLRNAVRAANVTSPTGFLSDGN 223
+ + SV G T +++VD ALG++ D+ + A + D
Sbjct: 706 AQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRG 765

Query: 224 TTMAIIS---NDSVSKAADFAQLAISTQSNGRIVRLGDVATVYDGQQDAYQAAWFNGKPA 280
+ D +L + + +NG +V T + + + +NG P+
Sbjct: 766 RVKKLYVQADAKFRMLPEDVDKLYVRS-ANGEMVPFSAFTTSHWVYG-SPRLERYNGLPS 823

Query: 281 VVMYAFTRAGANIVETVDQVKAQIPELRSYLQPGTTLTPYFDRTPTIRASLHEVQATLMI 340
+ + G + A + L S L G + + R S ++ A + I
Sbjct: 824 MEIQGEAAPGTS----SGDAMALMENLASKLPAGIGYD-WTGMSYQERLSGNQAPALVAI 878

Query: 341 SLAMVILTMALFLRRLAPTLIAAVTVPLSLAGSALVMYVLGFTLNNLSLLALVIAIGFVV 400
S +V L +A + + + VPL + G L + + ++ L+ IG
Sbjct: 879 SFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSA 938

Query: 401 DDAIVVIENVM-RHLDEGMSRLDAALAGAREIGFTIVSITASLVAVFIPMLFASGMIGAF 459
+AI+++E EG ++A L R I+ + + + +P+ ++G
Sbjct: 939 KNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGA 998

Query: 460 FREFTVTLVAAIVVSMLVSLTLTPAL 485
+ ++ +V + L+++ P
Sbjct: 999 QNAVGIGVMGGMVSATLLAIFFVPVF 1024


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS02025  YERSSTKINASE  30     0.024    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 30.1 bits (67), Expect = 0.024
Identities = 20/75 (26%), Positives = 34/75 (45%), Gaps = 4/75 (5%)

Query: 117 DEAALSANPFRVFTSLLRLELIEDAALRAQAEQILQQRQIFTAGALQLIERHERQGGLDA 176
D ++ R + LLR L A + +L L +++ ER+GG+D
Sbjct: 436 DVRRITPKKLRELSDLLRTHLSSAATKQLDMGGVLSDLDTM----LVALDKAEREGGVDK 491

Query: 177 DQARQFVAEALETFR 191
DQ + F + L+T+R
Sbjct: 492 DQLKSFNSLILKTYR 506


60  XADLMG695_RS02400  XADLMG695_RS02450  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS02400  0       14       2.345632   YifB family Mg chelatase-like AAA ATPase
XADLMG695_RS02405  -1      13       2.186004   lipocalin family protein
XADLMG695_RS02410  0       12       0.794600   EAL domain-containing protein
XADLMG695_RS02420  0       12       0.627779   acyl-CoA desaturase
XADLMG695_RS02425  -2      12       0.155653   ferredoxin reductase
XADLMG695_RS02430  -2      10       0.168227   HTH-type transcriptional repressor FabR
XADLMG695_RS02435  -1      11       -0.920930  slipin family protein
XADLMG695_RS02450  -1      13       -1.427480  RtcB family protein
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits    Score  E-value  Comments
XADLMG695_RS02405  HTHFIS  33     0.003    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.9 bits (75), Expect = 0.003
Identities = 43/246 (17%), Positives = 70/246 (28%), Gaps = 57/246 (23%)

Query: 127 LAAAQAGRRLIVPLANGAEAAIAGHVEAFTARTL------LDVCATLNGSQKAPAAELAA 180
L + R + L A+ ++A D+ + +A A
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 181 QALGARALPDMADVRGQP----HARRALEIAAAGGHHLLLVGSPGCGKTLLASRLPGLLP 236
+ D + G+ R L L++ G G GK L+A L
Sbjct: 126 PSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALH---- 181

Query: 237 EASEAEALETAAITSISGRGLDLARWRQRPYRAPHHTASPVALVG------------GGT 284
D + R P+ A + A P L+ G
Sbjct: 182 ---------------------DYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQ 220

Query: 285 HPRPGEISLSHNGVLFLDEL----PEWQRQTLEVLREPLESGVVTIARASRSVDFPARFQ 340
G + G LFLDE+ + Q + L VL++ + +
Sbjct: 221 TRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQG------EYTTVGGRTPIRSDVR 274

Query: 341 LVAAMN 346
+VAA N
Sbjct: 275 IVAATN 280


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS02410  BCTLIPOCALIN  110    2e-33    Bacterial lipocalin signature.
		>BCTLIPOCALIN#Bacterial lipocalin signature.

Length = 171

Score = 110 bits (276), Expect = 2e-33
Identities = 59/155 (38%), Positives = 88/155 (56%), Gaps = 12/155 (7%)

Query: 5 PELATVPS-LDLNRYLGTWYEIARLPTRFEDADCTDVSAHYTLEDDGSVRVQNRCFTAE- 62
PE S +LN YLG WYE+ARL FE + V+A Y + +DG + V NR ++ E
Sbjct: 20 PESVKPVSDFELNNYLGKWYEVARLDHSFERG-LSQVTAEYRVRNDGGISVLNRGYSEEK 78

Query: 63 GELEEAVGQARAIDD-THSRLEVTFLPEGLRWIPFTKGDYWVMRIDAD-YTAALVGSPDR 120
GE +EA G+A ++ T L+V+F PF G Y V +D + Y+ A V P+
Sbjct: 79 GEWKEAEGKAYFVNGSTDGYLKVSFFG------PFY-GSYVVFELDRENYSYAFVSGPNT 131

Query: 121 KYLWLLARLPQLDENIAQAYLAHAREQGFDLSPLI 155
+YLWLL+R P ++ I ++ ++E+GFD + LI
Sbjct: 132 EYLWLLSRTPTVERGILDKFIEMSKERGFDTNRLI 166


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits     Score  E-value  Comments
XADLMG695_RS02435  HTHTETR  51     5e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 50.8 bits (121), Expect = 5e-10
Identities = 20/102 (19%), Positives = 49/102 (48%), Gaps = 1/102 (0%)

Query: 12 PPSRKPAISREDLIAAALSLIGPHRSLSTVSLREVAREAGIAPNSFYRQFRDMDELAVAL 71
++ +R+ ++ AL L + +S+ SL E+A+ AG+ + Y F+D +L +
Sbjct: 4 KTKQEAQETRQHILDVALRLFS-QQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 72 IDLAGRSLRTIIGQARQRATSTDRSVIRVSVEAFMEQLRADD 113
+L+ ++ + + + + SV+R + +E ++
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEE 104


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits     Score  E-value  Comments
XADLMG695_RS02455  PF00577  31     0.007    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 31.4 bits (71), Expect = 0.007
Identities = 23/88 (26%), Positives = 37/88 (42%), Gaps = 16/88 (18%)

Query: 23 VRGVPLEEQAHAQL--RNIAAVPFVGPW----VAVMP-----DVHLGKGATVGSVIPTRG 71
+ +E Q + R A +P+ + VA+ +V L V +V+PTRG
Sbjct: 726 AKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLD--NAVANVVPTRG 783

Query: 72 AIIPAAVGVDIGCGMAAVRTTLRANDLP 99
AI+ A +G + TL N+ P
Sbjct: 784 AIVRAEFKARVG---IKLLMTLTHNNKP 808


61  XADLMG695_RS02485  XADLMG695_RS02520  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS02485  0       13       0.968374   TIGR01777 family protein
XADLMG695_RS02490  0       13       0.405147   response regulator transcription factor
XADLMG695_RS02495  -2      15       0.310677   HAMP domain-containing protein
XADLMG695_RS02500  -2      13       -0.391181  hypothetical protein
XADLMG695_RS02505  0       14       -0.215588  DUF3313 domain-containing protein
XADLMG695_RS02510  1       15       -0.085547  hypothetical protein
XADLMG695_RS02520  2       11       -0.185520  Do family serine endopeptidase
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS02495  NUCEPIMERASE  39     1e-05    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 39.4 bits (92), Expect = 1e-05
Identities = 15/27 (55%), Positives = 18/27 (66%)

Query: 1 MHLLITGGTGFIGQALCPALLQAGHQV 27
M L+TG GFIG + LL+AGHQV
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQV 27


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits    Score  E-value  Comments
XADLMG695_RS02500  HTHFIS  86     1e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 85.7 bits (212), Expect = 1e-21
Identities = 26/155 (16%), Positives = 60/155 (38%), Gaps = 5/155 (3%)

Query: 2 HLLLVEDDTMLASAICDGVRQQSWTVDHVGHANAAKTVLVDHRYSAVLLDIGLPGESGLS 61
+L+ +DD + + + + + + V +A + V+ D+ +P E+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VIRFMRSHYDATPVIALTARGQLTDRIRGLDAGADDYLVKPFQFDELMARLRAVTRRSQG 121
++ ++ PV+ ++A+ I+ + GA DYL KPF EL+ + +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 RVVPLLSHGD-----VCLDPGSRKVTKDGKWVALS 151
R L V +++ + + +
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQT 159


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits     Score  E-value  Comments
XADLMG695_RS02505  PF06580  33     0.002    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.9 bits (75), Expect = 0.002
Identities = 29/156 (18%), Positives = 61/156 (39%), Gaps = 31/156 (19%)

Query: 209 LETARRSNRLAEQLLDLARLDAGISSAAYHQVEMGELISHVLDEFSVQADAR---QMQLQ 265
LE ++ + L +L R S+A QV + + ++ V + + ++Q +
Sbjct: 187 LEDPTKAREMLTSLSELMRYSLRYSNA--RQVSLADELTVVDSYLQLA-SIQFEDRLQFE 243

Query: 266 VEASPCLLRCDVDAVGILIRNLVDNAIRYG----RLHGKVEVSCGYCVRADVLHPFLQVS 321
+ +P ++ V +L++ LV+N I++G GK+ + D L+V
Sbjct: 244 NQINPAIMDVQVPP--MLVQTLVENGIKHGIAQLPQGGKILLK----GTKDNGTVTLEVE 297

Query: 322 DDGPGVPEGAQTTIFERFYRVPGSAVQGSGIGLSLV 357
+ G + + + +G GL V
Sbjct: 298 NTGSLALKNTK---------------ESTGTGLQNV 318


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits         Score  E-value  Comments
XADLMG695_RS02520  IGASERPTASE  36     2e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 35.8 bits (82), Expect = 2e-04
Identities = 20/145 (13%), Positives = 40/145 (27%)

Query: 111 QKLTATKDAAKQTLTSTTQAAKQKLSSTSAAAKKKITDTKATTKRKLETAKANAKAEAAA 170
+K T D T + QA + S + + + AE +
Sbjct: 986 EKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSK 1045

Query: 171 LSAKTAAKSAARKTAVATVNARTAAKKAAKKAVAKSAAAKKPLVKPAAKKAPVAKQTATR 230
+KT K+ T N A + + + +
Sbjct: 1046 QESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETAT 1105

Query: 231 QAAVKKAPLKKAVTKTALKKAAKVT 255
+KA ++ T+ K ++V+
Sbjct: 1106 VEKEEKAKVETEKTQEVPKVTSQVS 1130


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits        Score  E-value  Comments
XADLMG695_RS02525  V8PROTEASE  82     2e-19    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 82.4 bits (203), Expect = 2e-19
Identities = 31/193 (16%), Positives = 71/193 (36%), Gaps = 40/193 (20%)

Query: 110 LGSGVIIDAQKGYVLTNHHVIENADDVQVTL------------GDGRTVKAEFIGSDADT 157
+ SGV++ K +LTN HV++ L +G + +
Sbjct: 103 IASGVVVG--KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEG 160

Query: 158 DIALIRIKAD--------NLTDIKLADSNALRVGDFVVAIGNPFG---FTQTVTSGIVSA 206
D+A+++ + + ++++ +V + G P T + G ++
Sbjct: 161 DLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMWESKGKITY 220

Query: 207 VGRSGIRGLGYQNFIQTDASINPGNSGGALVNLQGQLVGINTASFNPQGSMAGNIGLGLA 266
+ +Q D S GNSG + N + +++GI+ + N + +
Sbjct: 221 L---------KGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWGGVPNE----FNGAVFIN 267

Query: 267 --IPSNLARNVVE 277
+ + L +N+ +
Sbjct: 268 ENVRNFLKQNIED 280


62  XADLMG695_RS03485  XADLMG695_RS03535  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS03485  -1      12       3.808742   4'-phosphopantetheinyl transferase superfamily
XADLMG695_RS03490  2       18       -1.913038  GlsB/YeaQ/YmgE family stress response membrane
XADLMG695_RS03495  2       20       -3.106434  GFA family protein
XADLMG695_RS03500  2       20       -3.508541  exodeoxyribonuclease III
XADLMG695_RS03510  1       27       -5.437239  ROK family transcriptional regulator
XADLMG695_RS03520  3       11       2.047225   hypothetical protein
XADLMG695_RS03535  1       11       1.772605   response regulator transcription factor
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS03520  ENTSNTHTASED  29     0.010    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 29.2 bits (65), Expect = 0.010
Identities = 21/71 (29%), Positives = 34/71 (47%), Gaps = 2/71 (2%)

Query: 68 SHSGEYLLVGLGQGVRLGVDLERIRARPRVLEIAQRFFHPDEIALLAALAPDAQHALFFR 127
SH L + + R+G+D+E+I ++ E+A DE +L A AL
Sbjct: 89 SHCATTALAVISRQ-RIGIDIEKIMSQHTATELAPSIIDSDERQILQASLLPFPLALTL- 146

Query: 128 LWCAKEALLKA 138
+ AKE++ KA
Sbjct: 147 AFSAKESVYKA 157


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS03530  SECYTRNLCASE  26     0.018    Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 25.9 bits (57), Expect = 0.018
Identities = 16/83 (19%), Positives = 33/83 (39%), Gaps = 2/83 (2%)

Query: 3 IIIWLIVGG-IVGWLASIIMRRDAQQGIILNVVVGIVGALIAGFL-FGGGINQAITLWTF 60
++I + G +V WL +I R G+ + + + I + A F
Sbjct: 163 MVICMTAGTCVVMWLGELITDRGIGNGMSILMFISIAATFPSALWAIKKQGTLAGGWIEF 222

Query: 61 VWSLVGAVILLAIVNLVTRGRLR 83
+ +I++A+V V + + R
Sbjct: 223 GTVIAVGLIMVALVVFVEQAQRR 245


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS03550  CHANLCOLICIN  27     0.050    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 27.0 bits (59), Expect = 0.050
Identities = 21/74 (28%), Positives = 31/74 (41%), Gaps = 1/74 (1%)

Query: 8 RTAAARGDAAAQRYLLAQRAADLMQRAVAAAPAGTQPTLSPDAEREVAVIVSELEALALA 67
A A+ A A R L QR D++ A+ A P+ + A A + +E E L LA
Sbjct: 75 AAAEAQAKAKANRDALTQRLKDIVNEALRHN-ASRTPSATELAHANNAAMQAEDERLRLA 133

Query: 68 GHRDAIDTLAQVVE 81
+ A+ E
Sbjct: 134 KAEEKARKEAEAAE 147


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits    Score  E-value  Comments
XADLMG695_RS03560  HTHFIS  72     6e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.2 bits (177), Expect = 6e-17
Identities = 34/131 (25%), Positives = 59/131 (45%), Gaps = 5/131 (3%)

Query: 2 TTLLIADDHPLFREALRGAVQRVMPGVELFEADNV-DALYTLADAQPDADLLLMDLNMPG 60
T+L+ADD R L A+ R G ++ N +A D L++ D+ MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA--GYDVRITSNAATLWRWIAAGDGD--LVVTDVVMPD 59

Query: 61 AQGFSALVHMRSLHPQLPVVVVSAREEPTVMRRAIDHGAFGFIPKSADSDTIGRALATVL 120
F L ++ P LPV+V+SA+ +A + GA+ ++PK D + + L
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 121 DGERWIPAEAQ 131
+ P++ +
Sbjct: 120 AEPKRRPSKLE 130


63  XADLMG695_RS03700  XADLMG695_RS03730  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS03700  -2      14       0.024250   hypothetical protein
XADLMG695_RS03705  -2      18       -0.275298  hypothetical protein
XADLMG695_RS03710  -2      17       0.592648   twin-arginine translocase subunit TatC
XADLMG695_RS03720  -2      11       1.715651   twin-arginine translocase subunit TatB
XADLMG695_RS03725  -2      12       1.830268   Sec-independent protein translocase subunit
XADLMG695_RS03730  -2      13       1.672063   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits     Score  E-value  Comments
XADLMG695_RS03750  PF04335  31     0.005    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 31.0 bits (70), Expect = 0.005
Identities = 13/70 (18%), Positives = 28/70 (40%), Gaps = 11/70 (15%)

Query: 168 LLWLLLTIATF--AAMTLALFVM-------PPQVMFDRSTGGHALRESLRASLHNLP--A 216
L W++ +A A +A+ + P + DR+TG ++ L A
Sbjct: 34 LAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRNTGEASIAAKLHGDATITYDEA 93

Query: 217 MLVFFVLAFI 226
+ +F+ ++
Sbjct: 94 VRKYFLATYV 103


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits         Score  E-value  Comments
XADLMG695_RS03770  TATBPROTEIN  84     1e-22    Bacterial sec-independent translocation TatB protein...
		>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature.

Length = 171

Score = 83.9 bits (207), Expect = 1e-22
Identities = 47/171 (27%), Positives = 74/171 (43%), Gaps = 9/171 (5%)

Query: 1 MFDIGVGELTLIAVVALVVLGPERLPKAARFAGLWVRRARMQWDSVKQELERELEAEELK 60
MFDIG EL L+ ++ LVVLGP+RLP A + W+R R +V+ EL +EL+ +E +
Sbjct: 1 MFDIGFSELLLVFIIGLVVLGPQRLPVAVKTVAGWIRALRSLATTVQNELTQELKLQEFQ 60

Query: 61 RSLQDVQ-ASLREAEDQLRTTQQQVEQGARALHEDVGRDIDIRASATPVATPLELAHADL 119
SL+ V+ ASL +L+ + ++ Q A + A + AH
Sbjct: 61 DSLKKVEKASLTNLTPELKASMDELRQAA--------ESMKRSYVANDPEKASDEAHTIH 112

Query: 120 SASPNVDTAAGATEAAGTAHTAPVIAQAQPIAPAPQQPLVPAPHDTRVPAP 170
+ + AA A T + +P A + + AP
Sbjct: 113 NPVVKDNEAAHEGVTPAAAQTQASSPEQKPETTPEPVVKPAADAEPKTAAP 163


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits         Score  E-value  Comments
XADLMG695_RS03775  TATBPROTEIN  31     2e-04    Bacterial sec-independent translocation TatB protein...
		>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature.

Length = 171

Score = 31.1 bits (70), Expect = 2e-04
Identities = 10/41 (24%), Positives = 18/41 (43%)

Query: 1 MGGFSIWHWLIVLVIVLLVFGTKRLTSGAKDLGSAVKEFKK 41
M L+V +I L+V G +RL K + ++ +
Sbjct: 1 MFDIGFSELLLVFIIGLVVLGPQRLPVAVKTVAGWIRALRS 41


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits       Score  E-value  Comments
XADLMG695_RS03780  PERTACTIN  29     0.032    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 28.9 bits (64), Expect = 0.032
Identities = 21/81 (25%), Positives = 26/81 (32%), Gaps = 5/81 (6%)

Query: 207 NERPSTDVIAFRDRLEEATYTARANRSTDAAADGAPPVPRPQTPPPAQAQQPANVPPPAS 266
N+ D+ +R RL A N APP P+P P Q PP
Sbjct: 538 NKDGKVDIGTYRYRL-----AANGNGQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPP 592

Query: 267 EASTVPMQPSTTPPAQQGFQP 287
+ P P P A P
Sbjct: 593 QPPQPPQPPQRQPEAPAPQPP 613


64  XADLMG695_RS04960  XADLMG695_RS05005  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS04960  0       11       -0.259071  protein-export chaperone SecB
XADLMG695_RS04965  -2      11       0.318291   NAD(P)-dependent glycerol-3-phosphate
XADLMG695_RS04970  -1      12       0.759876   Ax21 family protein
XADLMG695_RS04975  -1      13       1.654572   ubiquinone-dependent pyruvate dehydrogenase
XADLMG695_RS04980  -1      14       1.435743   sigma-54-dependent Fis family transcriptional
XADLMG695_RS04990  2       14       2.984626   hypothetical protein
XADLMG695_RS05000  3       12       3.295206   hypothetical protein
XADLMG695_RS05005  2       10       1.504789   MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS04970  SECBCHAPRONE  195    5e-67    Bacterial protein-transport SecB chaperone protein ...
		>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein signature.

Length = 170

Score = 195 bits (498), Expect = 5e-67
Identities = 64/160 (40%), Positives = 99/160 (61%), Gaps = 3/160 (1%)

Query: 1 MSDEILNGAAAPADAAAGPAFTIEKIYVKDVSFESPNAPAVFNDANQPELQLNLNQKVQR 60
MS+E AA A P I++IYVKDVSFE+PN P +F +P+L +L+ + ++
Sbjct: 1 MSEENQVNAAD-TQATQQPVLQIQRIYVKDVSFEAPNLPHIFQQDWEPKLSFDLSTEAKQ 59

Query: 61 LNDNAFEVVLAVTLTCTA--GGKTAYVAEVQQAGVFGLVGLDPQAIDVLLGTQCPNILFP 118
+ D+ +EV L +++ T G A++ EV+QAGVF + GL+ + L +QCPN+LFP
Sbjct: 60 VGDDLYEVCLNISVETTMESSGDVAFICEVKQAGVFTISGLEEMQMAHCLTSQCPNMLFP 119

Query: 119 YVRTLVSDLIQAGGFPPFYLQPINFEALYAETLRQRQNEG 158
Y R LVS L+ G FP L P+NF+AL+ + L++++
Sbjct: 120 YARELVSSLVNRGTFPALNLSPVNFDALFMDYLQRQEQAE 159


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS04980  OUTRMMBRANEA  28     0.034    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 27.6 bits (61), Expect = 0.034
Identities = 20/94 (21%), Positives = 32/94 (34%), Gaps = 10/94 (10%)

Query: 49 KASYAIAPNFHVFGDYSKQ--NADDNNNVFENTDSDFQQWGV-GVGFNHEIATSTDFVAR 105
K Y I + ++ AD +NV + D V G E A + + R
Sbjct: 103 KLGYPITDDLDIYTRLGGMVWRADTKSNV-YGKNHDTGVSPVFAGGV--EYAITPEIATR 159

Query: 106 VAYRKL----DLDTPNINFDGYSVEAGLRNAFGE 135
+ Y+ D T D + G+ FG+
Sbjct: 160 LEYQWTNNIGDAHTIGTRPDNGMLSLGVSYRFGQ 193


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits    Score  E-value  Comments
XADLMG695_RS04995  HTHFIS  463    e-163    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 463 bits (1193), Expect = e-163
Identities = 178/478 (37%), Positives = 262/478 (54%), Gaps = 37/478 (7%)

Query: 2 ARILIIDDDAAFRTTLQVTLRSLGHAVVAAENGPDGLARLSEGGIDMAFVDFRMPGMDGI 61
A IL+ DDDAA RT L L G+ V N ++ G D+ D MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 AVLRARLDDAQARQVPLVMLTAHVSSGNTIEAMTLGAFDHLVKPVGRADIVEVVERALLS 121
+L R+ A+ +P+++++A + I+A GA+D+L KP +++ ++ RAL
Sbjct: 64 DLLP-RIKKARPD-LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 122 RADAQAAAADSSPAPVEDDDALVGHSPAMRTVHKRIGLAAASDLPVLITGETGTGKELAA 181
+ +D LVG S AM+ +++ + +DL ++ITGE+GTGKEL A
Sbjct: 122 PKRRPSKL----EDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVA 177

Query: 182 RALHRASPRASAPFVAVNCAAIPLELMESELFGHRKGAFSGASSDRRGLIREADGGTLFL 241
RALH R + PFVA+N AAIP +L+ESELFGH KGAF+GA + G +A+GGTLFL
Sbjct: 178 RALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFL 237

Query: 242 DEIGDMPLPMQAKLLRFLQEGEVTPLGGSGPQKVDVRVLAATHRDLAACVADGRFRSDLR 301
DEIGDMP+ Q +LLR LQ+GE T +GG P + DVR++AAT++DL + G FR DL
Sbjct: 238 DEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLY 297

Query: 302 YRLNVVPIELPPLPERGQDILLLAQHFL---SADAARAQSLSPAAQERLLAHRWPGNVRE 358
YRLNVVP+ LPPL +R +DI L +HF+ + + A E + AH WPGNVRE
Sbjct: 298 YRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRE 357

Query: 359 LRNVMQRSQVLVRGASIDAADLDD---------------------ALGEAGELPPPQPSA 397
L N+++R L I +++ ++ +A E Q A
Sbjct: 358 LENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFA 417

Query: 398 -------VTGTLPEAVARLETQMIRSALEQSQGNRAEAARRLGIHRQLLYRKLEEYGL 448
+G +A +E +I +AL ++GN+ +AA LG++R L +K+ E G+
Sbjct: 418 SFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits     Score  E-value  Comments
XADLMG695_RS05010  TCRTETA  34     0.001    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 33.6 bits (77), Expect = 0.001
Identities = 64/375 (17%), Positives = 121/375 (32%), Gaps = 12/375 (3%)

Query: 30 PFLSVFLQSKGWSVAAIGTVMSVGGIAGMLATTPAGALVDATRRKRAVVVIGCLAILLAT 89
P L L A G ++++ + GAL D R R V+++ +
Sbjct: 29 PGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGR-RPVLLVSLAGAAVDY 87

Query: 90 ALIWLQPTSSGVVAAQIASALAAAGIGPALTGITLGLVHAHGFDHQLARNQVANHAGNVL 149
A++ P + +I + + A G + G V
Sbjct: 88 AIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDGDERARHFGFMSACFGFGMVA 146

Query: 150 AAVLAGWLGWRYGFAAVFLLTAFFGALALVAVLAIPAATIDHRAARGLATTNGGDALSGW 209
VL G +G + A F A L + + + H+ R + L+ +
Sbjct: 147 GPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCFLLPES--HKGERRPLRREALNPLASF 203

Query: 210 RVLLTCRPLALLAVTLGLFHLGNAAMLPLYGMAIVAAHAGDPSALTATTIVVAQATMVVV 269
R +A L + L L+ + D + + + +
Sbjct: 204 RWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQ 263

Query: 270 ALLAMRWIRVHGHWWVLLVAFMALPLRALVAASVIHGWGVFPVQILDGLGAGLQSVVVPA 329
A++ G L++ +A ++ A GW FP+ +L G + +PA
Sbjct: 264 AMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGG----IGMPA 319

Query: 330 LVARLLQGTGRVNVG--QGAVMTVQGVGAALSPAFGGWL-AHAFGYRVAFLTLGAIALLA 386
L A L + G QG++ + + + + P + A + + + AL
Sbjct: 320 LQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYL 379

Query: 387 VALWAGCRGMLQAAA 401
+ L A RG+ A
Sbjct: 380 LCLPALRRGLWSGAG 394


65  XADLMG695_RS05560  XADLMG695_RS05585  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS05560  -2      8        -0.954757  FMN reductase
XADLMG695_RS05565  -2      9        -0.960267  DUF1852 domain-containing protein
XADLMG695_RS05570  -2      8        -0.632706  methionine synthase
XADLMG695_RS05575  -1      8        0.547763   2-keto-3-deoxygluconate transporter
XADLMG695_RS05580  0       10       0.854452   OprO/OprP family phosphate-selective porin
XADLMG695_RS05585  2       13       -0.368710  SDR family oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits    Score  E-value  Comments
XADLMG695_RS05565  HTHFIS  33     5e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.3 bits (76), Expect = 5e-04
Identities = 31/177 (17%), Positives = 56/177 (31%), Gaps = 22/177 (12%)

Query: 6 PLRVVAVSGGMQRPSKAVALAEHLLELIADQVPCERHLVEIGALAPHFAGALWRTQVPGA 65
PLR R L H ++ + +++ + PG
Sbjct: 309 PLR--------DRAEDIPDLVRHFVQQAE------KEGLDVKRFDQEALELMKAHPWPGN 354

Query: 66 VEQALCLVEQADILVVATPVYRGSFTGLFKHFFDFIDQDALIDTPVLLAATGGSDRHALV 125
V + LV + L + R + + + + AA GS +
Sbjct: 355 VRELENLVRRLTALYPQDVITR-------EIIENELRSEIPDSPIEKAAARSGSLSISQA 407

Query: 126 IDHQLRPLFSFFQARTLPLGVYATDRDFLDYRVHNEALAERARLAVQRALPLIELTR 182
++ +R F+ F P G+Y ++Y + AL R +A L+ L R
Sbjct: 408 VEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAAL-TATRGNQIKAADLLGLNR 463


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits     Score  E-value  Comments
XADLMG695_RS05580  INTIMIN  28     0.047    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 28.5 bits (63), Expect = 0.047
Identities = 19/83 (22%), Positives = 33/83 (39%), Gaps = 2/83 (2%)

Query: 218 TIARTGASGVLLGVAVIAITGLPLLLADRWIGGGNGTAGVAASSTAGAAVATPALIAGMA 277
T+ + G + + V+ ++G +L A+ G+G A V S V A A M
Sbjct: 583 TVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEM- 641

Query: 278 PQFAPAAPAATALVASAVIVTSL 300
A A A + + +T +
Sbjct: 642 -TSALNANAVIFVDQTKASITEI 663


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS05585  PF05616  32  0.006  Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 32.0 bits (72), Expect = 0.006
Identities = 23/68 (33%), Positives = 26/68 (38%), Gaps = 3/68 (4%)

Query: 44 GRDKLGTF---VQVDENGKLPASAMPATPAQPLPPAPGATTPADTAVAQAAPAPAPVATP 100
GRD G VQV L + A AQPLP A PA+ P P P
Sbjct: 296 GRDSQGNTTVDVQVIPRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEP 355

Query: 101 APAKSGDA 108
P + DA
Sbjct: 356 DPDLNPDA 363


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS05600  NUCEPIMERASE  37  4e-05  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 37.5 bits (87), Expect = 4e-05
Identities = 26/130 (20%), Positives = 42/130 (32%), Gaps = 35/130 (26%)

Query: 8 ILVTGASGQLGALVVEALLGHLPANRIVA---------TARDTASLAEFAKRDIAVRQAD 58
LVTGA+G +G V + LL +++V + A L A+ + D
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEA--GHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 59 YANPHSLD--------------AAFAGVGRVL-----LVSSNAVGQRVPQHRNVIEAAKR 99
A+ + V L SN G N++E +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTG-----FLNILEGCRH 115

Query: 100 AGVELLAYTS 109
++ L Y S
Sbjct: 116 NKIQHLLYAS 125


66  XADLMG695_RS21985  XADLMG695_RS06020  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS21985  -2      30       -2.241993  type III secretion system export apparatus
XADLMG695_RS05940  -3      29       -1.765427  type III secretion system export apparatus
XADLMG695_RS05945  -3      29       -1.910700  type III secretion system cytoplasmic ring
XADLMG695_RS05950  -2      21       -3.179067  type III secretion system protein SctP
XADLMG695_RS05960  -1      21       -1.437753  FHIPEP family type III secretion protein
XADLMG695_RS05965  -1      21       -1.051901  type III secretion system export apparatus
XADLMG695_RS05970  1       17       -1.601626  HrpB1 family type III secretion system apparatus
XADLMG695_RS05975  -1      14       -2.444985  type III secretion protein HrpB2
XADLMG695_RS05980  -2      14       -2.504439  type III secretion inner membrane ring
XADLMG695_RS05985  -1      15       -2.265735  type III secretion protein HrpB4
XADLMG695_RS05990  -2      13       -3.104700  type III secretion system stator protein SctL
XADLMG695_RS06000  -1      16       -1.829731  type III secretion system ATPase SctN
XADLMG695_RS06005  1       14       0.262483   type III secretion protein HrpB7
XADLMG695_RS06010  2       15       0.716590   type III secretion system export apparatus
XADLMG695_RS06015  2       16       0.112895   type III secretion system outer membrane ring
XADLMG695_RS06020  1       14       0.271958   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS05975  TYPE3IMQPROT  62  2e-16  Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein

family signature.
Length = 86

Score = 61.7 bits (150), Expect = 2e-16
Identities = 24/78 (30%), Positives = 43/78 (55%)

Query: 4 DDLVRFTSEALLLCLKVSLPVVGVAALAGLLIAFVQAVMSLQDASISFALKLVVVVAAIA 63
DDLV ++AL L L +S VA + GLL+ Q V LQ+ ++ F +KL+ V +
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 VTAPWGASAIMQFGQALM 81
+ + W ++ +G+ ++
Sbjct: 62 LLSGWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS05980  TYPE3IMPPROT  246  2e-85  Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein

family signature.
Length = 224

Score = 246 bits (630), Expect = 2e-85
Identities = 80/219 (36%), Positives = 130/219 (59%), Gaps = 8/219 (3%)

Query: 3 MPDVGSLLLVVIMLGLLPFAAMVVTSYTKIVVVLGLLRNAIGVQQVPPNMVLNGVALLVS 62
M + SL+ ++ LLPF T + K +V ++RNA+G+QQ+P NM LNGVALL+S
Sbjct: 1 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS 60

Query: 63 CFVMAPVGMEAFKA-AQNYGAGSDNSRVVVLLDACREPFRQFLLKHTREREKAFFMRSAQ 121
FVM P+ +A+ +D S + +D + +R +L+K++ FF +
Sbjct: 61 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL 120

Query: 122 QIWPKDKAAT-------LKSDDLLVLAPAFTLSELTEAFRIGFLLYLVFIVIDLVVANAL 174
+ ++ T ++ + L PA+ LSE+ AF+IGF LYL F+V+DLVV++ L
Sbjct: 121 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL 180

Query: 175 MAMGLSQVTPTNVAIPFKLLLFVAMDGWSMLIHGLVLSY 213
+A+G+ ++P ++ P KL+LFVA+DGW++L GL+L Y
Sbjct: 181 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS05985  TYPE3OMOPROT  64  9e-14  Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein

family signature.
Length = 303

Score = 63.9 bits (155), Expect = 9e-14
Identities = 40/177 (22%), Positives = 74/177 (41%), Gaps = 15/177 (8%)

Query: 144 PTQLPAWLAALRVNTRLRIGGRTASAALLQSLRPGDVLLHCTASAAVTSGELLWGIAGGA 203
P LR R IG +LL + GDVLL T+ A V G
Sbjct: 138 PAVGGGRPKMLRWPLRFVIGSSDTQRSLLGRIGIGDVLLIRTSRAEVYCYAKKLG----- 192

Query: 204 VLRAPVRLNLQQMILEATPTMQHDTFE---PDVAPSTSNVAELELPVQLEVDQLALSLST 260
++ I+ T +QH E + A + + +L + ++ + + ++L+
Sbjct: 193 -----HFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYRKNVTLAE 247

Query: 261 LSGLQPGQILELSVPVDQADIRLVVYGQTIGTGRLLAVGEHLGVQILS-MSESTHAD 316
L + Q+L L + ++ ++ G +G G L+ + + LGV+I +SES + +
Sbjct: 248 LEAMGQQQLLSLPTNAEL-NVEIMANGVLLGNGELVQMNDTLGVEIHEWLSESGNGE 303


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS06000  TYPE3IMSPROT  332  e-115  Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 332 bits (854), Expect = e-115
Identities = 115/345 (33%), Positives = 191/345 (55%), Gaps = 2/345 (0%)

Query: 1 MSEEKTEKPTEKKLRDARKDGEVPVSPDVTAAAVLFGALLVMKSAGDYFADHVRALMTIG 60
MS EKTE+PT KK+RDARK G+V S +V + A++ ++ DY+ +H LM I
Sbjct: 1 MSGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIP 60

Query: 61 FDFPENTRDAAAINRALGHLGIQGLLLMLPFLAACLIAGVAGGAFQTGLNASLKPVAPKF 120
+ + A++ + ++ ++ L P L + +A Q G S + + P
Sbjct: 61 AE-QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDI 119

Query: 121 DSLNPAAGVKKLFSLRSLINLLKLIIKAILIGVVLWVGIRALMPMIIGLAYETPLDIAQI 180
+NP G K++FS++SL+ LK I+K +L+ +++W+ I+ + ++ L I +
Sbjct: 120 KKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPL 179

Query: 181 AWHTLGMLFALGVLLFVLVGAADWSVQHWLFIRDKRMSKDEQKREFKESEGDPEIKGKRK 240
L L + + FV++ AD++ +++ +I++ +MSKDE KRE+KE EG PEIK KR+
Sbjct: 180 LGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRR 239

Query: 241 EFAKELVFGDPRERVAKAKVMVVNPTHYAVALAYEPDDFGLPQVVAKGVDDGALELRALA 300
+F +E+ + RE V ++ V+V NPTH A+ + Y+ + LP V K D +R +A
Sbjct: 240 QFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIA 299

Query: 301 HNQGIPIVANPPLARALY-QVELGDAIPEPLFETVAVVLRWVDEL 344
+G+PI+ PLARALY + IP E A VLRW++
Sbjct: 300 EEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQ 344


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS06015  FLGMRINGFLIF  79  6e-19  Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 79.2 bits (195), Expect = 6e-19
Identities = 43/188 (22%), Positives = 81/188 (43%), Gaps = 11/188 (5%)

Query: 3 ALRCLVVLLVALLLSACSQQ---LYSGLTENDANDMLEVLLHAGVDASKVTPDDGKTWAV 59
A V ++VA++L A + L+S L++ D ++ L + + G A+
Sbjct: 30 AGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPY-RFANGSG---AI 85

Query: 60 NAPHDQVSYSLEVLRAHGLPHEQHANLG-EMFKKDGLISTPTEERVRFIYGVSQQLSQTL 118
P D+V L GLP + +G E+ ++ + E+V + + +L++T+
Sbjct: 86 EVPADKVHELRLRLAQQGLP--KGGAVGFELLDQEKFGISQFSEQVNYQRALEGELARTI 143

Query: 119 SNIDGVISADVEIVLPNNDPLSTSVKPSSAAVFIKFRVGSDLT-SLVPNIKTMVMHSVEG 177
+ V SA V + +P K SA+V + G L + + +V +V G
Sbjct: 144 ETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAG 203

Query: 178 LTYENVSV 185
L NV++
Sbjct: 204 LPPGNVTL 211


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS06035  IGASERPTASE  29  0.009  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.3 bits (65), Expect = 0.009
Identities = 13/75 (17%), Positives = 24/75 (32%), Gaps = 13/75 (17%)

Query: 93 AEQAQAAADQSLQSARDELASVQQALSKLQAQAQV-------------YADKAASARRAR 139
+E + A+ S Q ++ + Q A +V + A S +
Sbjct: 1034 SETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETK 1093

Query: 140 QAQRDAAEEEDAVEA 154
+ Q +E VE
Sbjct: 1094 ETQTTETKETATVEK 1108


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS06040  TYPE3IMRPROT  177  6e-57  Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 177 bits (451), Expect = 6e-57
Identities = 52/238 (21%), Positives = 105/238 (44%), Gaps = 3/238 (1%)

Query: 8 LLAISSQGVSLLALLALCGVRVFVMFIVLPATAQDSLPGIARNGVIYVLSSFIAYGQPAD 67
L S Q +S L L +RV + P ++ S+P + G+ +++ IA PA+
Sbjct: 2 LQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPAN 61

Query: 68 ALAKIQTVGLVGVVFKEAFIGLLIGFAASTVFWIAESVGLLIDDLAGYNNVQMTNPLSGQ 127
+ L + ++ IG+ +GF F + G +I G + +P S
Sbjct: 62 DVPVFSFFAL-WLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120

Query: 128 QSTPVSTVLLQLAIVSFYALGGMLLLLGALFESFRWWPLTQLGPNMGSVAESFVIQQSDS 187
++ ++ LA++ F G L L+ L ++F P+ + S A + +
Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGG--EPLNSNAFLALTKAGSL 178

Query: 188 MMTAVVKLSAPVMLVLVLVDLAIGLVARAADKLEPSNLSQPIRGVLALLLLALLTSVF 245
+ + L+ P++ +L+ ++LA+GL+ R A +L + P+ + + L+A L +
Sbjct: 179 IFLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLI 236


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS06045  TYPE3OMGPROT  334  e-109  Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein

family signature.
Length = 607

Score = 334 bits (857), Expect = e-109
Identities = 101/288 (35%), Positives = 155/288 (53%), Gaps = 13/288 (4%)

Query: 320 DVGGGAELASDAPVIEADPRTNAILIRDRPERMQSYGTLIQQLDNRPKLLQIDATIIEIR 379
+ A AS +EADP NAI++RD PERM Y LI LD +++ +I++I
Sbjct: 233 RIPQAATRASAQARVEADPSLNAIIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDIN 292

Query: 380 DGAMQDLGVDWRFHSQHTDIQTGNGSGSQLGFNGALSGAATDGATTPAGGTLTAVLGDAG 439
+ +LGVDWR I+TGN + G S A++GA G+L G
Sbjct: 293 ADQLTELGVDWR-----VGIRTGNNHQVVIKTTGDQSNIASNGAL----GSLVDARGL-- 341

Query: 440 RYLMTRVSALETTNKAKIVSSPQVATLDNVEAVMDHKQQAFVRVSGYASADLYNLSAGVS 499
YL+ RV+ LE A++VS P + T +N +AV+DH + +V+V+G A+L ++ G
Sbjct: 342 DYLLARVNLLENEGSAQVVSRPTLLTQENAQAVIDHSETYYVKVTGKEVAELKGITYGTM 401

Query: 500 LRVLPSVVPGSPNGQMRLDVRIEDGQLGSNT--VDGIPVITSSEITTQAFVNEGQSLLIA 557
LR+ P V+ ++ L++ IEDG N+ ++GIP I+ + + T A V GQSL+I
Sbjct: 402 LRMTPRVLTQGDKSEISLNLHIEDGNQKPNSSGIEGIPTISRTVVDTVARVGHGQSLIIG 461

Query: 558 GYAYDADETDLNAVPGLSKIPLLGNLFKHRQKSGSRMQRLFLLTPHVV 605
G D L+ VP L IP +G LF+ + + R RLF++ P ++
Sbjct: 462 GIYRDELSVALSKVPLLGDIPYIGALFRRKSELTRRTVRLFIIEPRII 509



Score = 250 bits (639), Expect = 5e-77
Identities = 72/230 (31%), Positives = 115/230 (50%), Gaps = 6/230 (2%)

Query: 15 MAAVLMLSLLPLLSPHADAAQVPWHSRTFKYVADNKDLKEVLRDLSASQSIATWISPEVT 74
VL +LL LLS ++ A ++ W + YVA + L+++L D A+ +S ++
Sbjct: 9 FKRVLTGTLL-LLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKIN 67

Query: 75 GTLSGKFE-TSPQKFLDDLAATYGFVWYYDGAVLRIWGANESKSATLSLGTASTKSLRDA 133
+SG+FE +PQ FL +A+ Y VWYYDG VL I+ +E S + L + L+ A
Sbjct: 68 DKVSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQA 127

Query: 134 LARMRLDDPRFPVRYDEAAHVAVVSGPPGYVDTVSAIAKQVEQGVRQR----DATEVQVF 189
L R + +PRF R D + + VSGPP Y++ V A +EQ + R A +++F
Sbjct: 128 LQRSGIWEPRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEIF 187

Query: 190 QLHYAQAADHTTRIGGQDVQIPGMASLLRSMYGARGAPVAAIAGPSANFG 239
L YA A+D T +V PG+A++L+ + +
Sbjct: 188 PLKYASASDRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIPQA 237


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS06050  ECOLIPORIN  27  0.024  E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 26.8 bits (59), Expect = 0.024
Identities = 13/50 (26%), Positives = 17/50 (34%)

Query: 63 QQAGQSNGSPSQYTQMLMNIVGDILQAQNGGGFGGGAGGDFGGGLGVSLA 112
Q +S + GD ++ NG GFG D G G A
Sbjct: 173 QGKNESQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAA 222


67  XADLMG695_RS06120  XADLMG695_RS06150  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS06120  -2      9        1.247733   SDR family oxidoreductase
XADLMG695_RS06125  -1      9        0.858478   FAD/NAD(P)-binding protein
XADLMG695_RS06130  0       17       2.659229   VirK family protein
XADLMG695_RS06135  0       19       1.881369   TetR/AcrR family transcriptional regulator
XADLMG695_RS23195  1       13       3.544397   efflux RND transporter periplasmic adaptor
XADLMG695_RS06140  -3      8        1.829015   efflux RND transporter permease subunit
XADLMG695_RS06150  -2      8        1.632421   oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS06135  DHBDHDRGNASE  117  3e-34  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 117 bits (295), Expect = 3e-34
Identities = 79/261 (30%), Positives = 114/261 (43%), Gaps = 9/261 (3%)

Query: 1 MPTPAIRPQRVLIAGGSRGIGLAIAEGFVRGGAHVSICARNAAGLAQAADALAAHGTPVH 60
M I + I G ++GIG A+A GAH++ N L + +L A
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 61 TLACDLADAAQIDAYVQAAAQALGGLDVVINNAS----GFGHGNDDASWQAGLDVDLMAA 116
D+ D+A ID + +G +D+++N A G H D W+A V+
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 117 VRCNRAALPYLRLSDAAVILNISSINAQRPTPRAIAYSTAKAALDYYTTTLAAELARERI 176
+R+ Y+ + I+ + S A P AY+++KAA +T L ELA I
Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180

Query: 177 RVNAISPGSIE--FPDGLWDTRSREEPELY---ARIRDSIPFGGFGQVQHVADAALFLAS 231
R N +SPGS E LW + E + + IP + +ADA LFL S
Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240

Query: 232 PQASWITGQVLAVDGGQSLGV 252
QA IT L VDGG +LGV
Sbjct: 241 GQAGHITMHNLCVDGGATLGV 261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS06160  HTHTETR  58  7e-13  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 58.5 bits (141), Expect = 7e-13
Identities = 27/194 (13%), Positives = 56/194 (28%), Gaps = 6/194 (3%)

Query: 18 DVRDQIVVAATEHFSRYGYEKTAVSDLAREIGFSKAYIYKFFESKQAIGEMICSHCLGEI 77
+ R I+ A FS+ G T++ ++A+ G ++ IY F+ K + I I
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70

Query: 78 -EAEVLAAVSAAASPPEKLRSLFKTIIEASLRLYSRERKLYEIATSA-ATERWPPVI--- 132
E E+ P LR + ++E+++ R + I V
Sbjct: 71 GELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130

Query: 133 -AYEGHIQALLQEILVQGRQNGDFERKTPLDELTQATYLVMRPYINPVLLQHSLDHAGDV 191
+++ L + + + L
Sbjct: 131 RNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKE 190

Query: 192 PLLLSSLVLRSLSP 205
+++L
Sbjct: 191 ARDYVAILLEMYLL 204


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS06165  RTXTOXIND  43  1e-06  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 43.3 bits (102), Expect = 1e-06
Identities = 16/102 (15%), Positives = 33/102 (32%), Gaps = 7/102 (6%)

Query: 70 GKVSERLVDAGQRVKRGQALMRIDPVDLQLAARAQQDAVAAARARAQ-------QTAEDE 122
V E +V G+ V++G L+++ + + Q ++ AR ++
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNK 164

Query: 123 ARYRDLRGTGAISASAYDQIKAAADAAKAQLSAAQAQADVAR 164
L + +++ K Q S Q Q
Sbjct: 165 LPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKE 206



Score = 32.5 bits (74), Expect = 0.003
Identities = 11/102 (10%), Positives = 29/102 (28%), Gaps = 5/102 (4%)

Query: 98 QLAARAQQDAVAAARARAQQTAEDEARYRDLRGTGAISASAYDQIKAAADAAKAQLSAAQ 157
+ + V ++ ++ A+ T D+++ +
Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQT----TDNIGLLT 315

Query: 158 AQADVARNANRYTDLLADADGVVMDTLV-EPGQVVAAGQTVV 198
+ + + + A V V G VV +T++
Sbjct: 316 LELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLM 357


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS06170  ACRIFLAVINRP  434  e-137  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 434 bits (1118), Expect = e-137
Identities = 227/1048 (21%), Positives = 429/1048 (40%), Gaps = 65/1048 (6%)

Query: 8 LSALAVRERSITLFLIFLISLAGLVAFLKLGRAEDPAFTIKVMTIVTAWPGATPQEIQDQ 67
++ +R L ++ +AG +A L+L A+ P +++ +PGA Q +QD
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 68 VAEKLEKRMQELRWYDRTETYT-RPGLAFTTLTLMDSTPP----GEVQEQFYQARKKAGD 122
V + +E+ M + + + G TLT T P +VQ + A
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPL--- 117

Query: 123 EVANLPAGVIGPLINDEYADVTFAL---FALKAKGEPQRLLARDAE-TLRQRILHVPGVK 178
LP V I+ E + ++ + F G Q ++ ++ + + GV
Sbjct: 118 ----LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVG 173

Query: 179 KVNIIGEQPERIFVEFSHERLATLGVSPQDVFAALNAQNALNAAGSVETRGP------QV 232
V + G Q + + + L ++P DV L QN AAG +
Sbjct: 174 DVQLFGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNA 232

Query: 233 FIRLDGALDSLQKIRDTPLVVQ--GRTLKLSDIATVERGYEDPSTFMIRSGGEPALLLGI 290
I + ++ L V G ++L D+A VE G E+ + R G+PA LGI
Sbjct: 233 SIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVI-ARINGKPAAGLGI 291

Query: 291 IMRDGWNGLDLGKSLDAEVGAINAELPLGMRLSKVTDQAVNIDASVGEFMTKFFVALLVV 350
+ G N LD K++ A++ + P GM++ D + S+ E + F A+++V
Sbjct: 292 KLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLV 351

Query: 351 MLVCFVSMG-WRVGIVVAAAVPLTLAAVFVVMLATGKNFDRITLGSLILALGLLVDDAII 409
LV ++ + R ++ AVP+ L F ++ A G + + +T+ ++LA+GLLVDDAI+
Sbjct: 352 FLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIV 411

Query: 410 AIEMMV-VKMEEGYSRVAASAYAWSHTAAPMLSGTLVTAVGFMPNGFAASTAGEYTSNMF 468
+E + V ME+ A+ + S ++ +V + F+P F + G
Sbjct: 412 VVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFS 471

Query: 469 WIVGIALIVSWVVAVVFTPYLGVKMLPDLKKVEGGHAA--------MYDTPRYNRFRDAL 520
+ A+ +S +VA++ TP L +L + + +D N + +++
Sbjct: 472 ITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFD-HSVNHYTNSV 530

Query: 521 GRVIASKWLVAGSVVGLFVLAVVGMGIVKKQFFPISDRPEVLVEVQLPYGTSINQTSAAA 580
G+++ S + VV + F P D+ L +QLP G + +T
Sbjct: 531 GKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVL 590

Query: 581 AKVEAWLSKQKEAKIVTAYIGQGAPRFFLAMGPELPDPSFAKIVV-----RTDDQHERDA 635
+V + K ++A + + + G + + + A + + R D++ +A
Sbjct: 591 DQVTDYYLKNEKANVESVFTVNG-----FSFSGQAQNAGMAFVSLKPWEERNGDENSAEA 645

Query: 636 LKLRLREAIAQ-----GLASEARVRVTQLTFGPYSKFPVAYRVSGPDPTVLRGIAAQVMQ 690
+ R + + + + V T + + +G L Q++
Sbjct: 646 VIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELID--QAGLGHDALTQARNQLLG 703

Query: 691 VMQDSP-MLRTVNTDWGVRTPTLHFSLDQDRLQAVGLTSTAVAQQLQFLLTGVPITLVRE 749
+ P L +V + T +DQ++ QA+G++ + + Q + L G + +
Sbjct: 704 MAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFID 763

Query: 750 DIRSVQVVARSAGDTRLDPARIADFTLAGGNGQRVPLSQVGKVDVRMEEPVMRRRDRVPT 809
R ++ ++ R+ P + + NG+ VP S P + R + +P+
Sbjct: 764 RGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPS 823

Query: 810 ITVGGDVDDQLQPPDVSAAITRQLQPIIDKLPGGYQIREAGSIEESGKATTAMLPLFPIM 869
+ + G+ P S ++ + KLP G G + + L I
Sbjct: 824 MEIQGEA----APGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAIS 879

Query: 870 LAATLLIIILQVRSISAMVMVFLTSPLGLIGVVPTLILFQQPFGINALVGLIALSGILMR 929
L + S S V V L PLG++GV+ LF Q + +VGL+ G+ +
Sbjct: 880 FVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAK 939

Query: 930 NTLILIGQIHH-NEAEGLDPFHALVEATVQRTRPVILTALAAILAFIPLTHSVFWGT--- 985
N ++++ E EG A + A R RP+++T+LA IL +PL S G+
Sbjct: 940 NAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQ 999

Query: 986 --LAYTLIGGTLAGTVLTLVFLPAMYSI 1011
+ ++GG ++ T+L + F+P + +
Sbjct: 1000 NAVGIGVMGGMVSATLLAIFFVPVFFVV 1027



Score = 70.3 bits (172), Expect = 3e-14
Identities = 59/330 (17%), Positives = 122/330 (36%), Gaps = 24/330 (7%)

Query: 712 LHFSLDQDRLQAVGLT----STAVAQQLQFLLTGVPITLVREDIRSVQVVARSAGDTRLD 767
+ LD D L LT + Q + G + + + + +
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK-N 242

Query: 768 PARIADFTL-AGGNGQRVPLSQVGKVDVRMEE-PVMRRRDRVPTITVGGDVDDQLQPPDV 825
P TL +G V L V +V++ E V+ R + P +G + D
Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302

Query: 826 SAAITRQLQPIIDKLPGGYQIREA----GSIEESGKATTAMLPLFPIMLAATLLIIILQV 881
+ AI +L + P G ++ ++ S L IML L++ L +
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTL-FEAIMLVF--LVMYLFL 359

Query: 882 RSISAMVMVFLTSPLGLIGVVPTLILFQQPFGINA--LVGLIALSGILMRNTLILIGQIH 939
+++ A ++ + P+ L+G IL + IN + G++ G+L+ + ++++ +
Sbjct: 360 QNMRATLIPTIAVPVVLLGTF--AILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVE 417

Query: 940 -HNEAEGLDPFHALVEATVQRTRPVILTALAAILAFIPL-----THSVFWGTLAYTLIGG 993
+ L P A ++ Q ++ A+ FIP+ + + + T++
Sbjct: 418 RVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSA 477

Query: 994 TLAGTVLTLVFLPAMYSIWFKIRPDPGSGN 1023
++ L+ PA+ + K N
Sbjct: 478 MALSVLVALILTPALCATLLKPVSAEHHEN 507


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS06175  DHBDHDRGNASE  94  1e-24  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 94.0 bits (233), Expect = 1e-24
Identities = 56/191 (29%), Positives = 82/191 (42%), Gaps = 14/191 (7%)

Query: 45 VVLITGVSSGIGRAAAEHFARTGCIVYGSVRHLAGATPLTAVELVE--------MDIRDA 96
+ ITG + GIG A A A G + + + + E D+RD+
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 97 ASVQRAVDGIIARAGRIDVLVNNAGTNLVGAIEETSVDEAAALFDINVLGILRTVQAVQA 156
A++ I G ID+LVN AG G I S +E A F +N G+ A
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVF------NA 123

Query: 157 VQAVLPHMRARGQGRIVNVSSVLGFLPAPYMGVYAASKHAVEGLSETLDHELRQFGISVT 216
++V +M R G IV V S +P M YA+SK A ++ L EL ++ I
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 217 LVEPAYTKTSL 227
+V P T+T +
Sbjct: 184 IVSPGSTETDM 194


68  XADLMG695_RS06435  XADLMG695_RS06480  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS06435  -1      16       0.279386   bacterioferritin
XADLMG695_RS06440  0       13       1.916616   EAL domain-containing response regulator
XADLMG695_RS06455  1       12       2.563454   DUF4126 domain-containing protein
XADLMG695_RS06460  1       13       2.821258   hypothetical protein
XADLMG695_RS06465  0       13       2.938913   polymer-forming cytoskeletal protein
XADLMG695_RS06470  0       13       3.467310   iron-sulfur cluster insertion protein ErpA
XADLMG695_RS06475  0       12       3.474724   NAD(+) diphosphatase
XADLMG695_RS06480  -1      13       2.974850   regulatory signaling modulator protein AmpE
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS06465  HELNAPAPROT  29  0.004  Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family

signature.
Length = 153

Score = 29.5 bits (66), Expect = 0.004
Identities = 19/103 (18%), Positives = 41/103 (39%), Gaps = 10/103 (9%)

Query: 44 EYKESIDEMKHADKLSDRILFLEGLPNF---QALGKLRIGENP-----TEMFRCDLALER 95
E + E D +++R+L + G P + I + +EM + + +
Sbjct: 52 ELYDHAAE--TVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYK 109

Query: 96 EAVVVLREAVAYAETVKDYVSRQLLVDILESEEEHIDWLETQL 138
+ + + AE +D + L V ++E E+ + L + L
Sbjct: 110 QISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWMLSSYL 152


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS06475  HTHFIS  66  2e-13  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.0 bits (161), Expect = 2e-13
Identities = 30/148 (20%), Positives = 61/148 (41%), Gaps = 11/148 (7%)

Query: 115 RVLIVEDDRSQALFAQSVLHGAGMHAQVEMTAASVPQAIQDYHPDLILMDLHMPELDGIR 174
+L+ +DD + L AG ++ AA++ + I DL++ D+ MP+ +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 175 LTTLIRQQPGQQLLPIVFLTGDPDPERQFEVLDSGADDFLTKPIRPRHLIAAVSNRIRRA 234
L I++ + LP++ ++ + + GA D+L KP +
Sbjct: 65 LLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDL--------TELIGI 114

Query: 235 RQQALQQVGEQVSVRS-NPETGLPTRGH 261
+AL + + S + + G+P G
Sbjct: 115 IGRALAEPKRRPSKLEDDSQDGMPLVGR 142


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS06485  GPOSANCHOR  39  1e-05  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 39.3 bits (91), Expect = 1e-05
Identities = 21/79 (26%), Positives = 29/79 (36%), Gaps = 1/79 (1%)

Query: 66 EAALQQARRSQAQQRRQIEQLQQRQVNLAMSDKISRAANTEVQASLAERDEQIAALRADV 125
A Q RR R +QL+ L +KIS A+ ++ L E L A+
Sbjct: 308 NANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEH 367

Query: 126 AFYERLVG-STAQRKGLNA 143
E S A R+ L
Sbjct: 368 QKLEEQNKISEASRQSLRR 386


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS06505  BCTERIALGSPF  30  0.010  Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 30.2 bits (68), Expect = 0.010
Identities = 17/82 (20%), Positives = 31/82 (37%), Gaps = 9/82 (10%)

Query: 4 TLVAV-VVALTLGHLVPAQVAKLRNFAWFGQWLRRLDSYAAGRGAWQGRYGVLLAVLPAL 62
T+VA+ VV++ L +VP +V + F Q L G +G + +
Sbjct: 180 TVVAIAVVSILLSVVVP-KVVEQ--FIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLA 236

Query: 63 LVLLVQWLLD-----DVWHGFL 79
+ + +L +H L
Sbjct: 237 GFMAFRVMLRQEKRRVSFHRRL 258


69  XADLMG695_RS07490  XADLMG695_RS07585  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS07490  -1      14       0.717387   response regulator
XADLMG695_RS07495  -1      10       -0.065766  hybrid sensor histidine kinase/response
XADLMG695_RS07500  -1      8        -0.184413  2-amino-4-hydroxy-6-
XADLMG695_RS07505  -1      10       -0.404617  pteridine reductase
XADLMG695_RS07510  0       10       -0.157731  class I SAM-dependent methyltransferase
XADLMG695_RS07520  -1      9        0.342502   hypothetical protein
XADLMG695_RS07530  -2      10       0.362895   TonB-dependent receptor
XADLMG695_RS07535  -2      12       0.449359   2OG-Fe(II) oxygenase
XADLMG695_RS07540  -1      10       0.265296   hypothetical protein
XADLMG695_RS07545  -1      9        -0.801425  TonB-dependent receptor
XADLMG695_RS07550  0       10       -0.586980  PDZ domain-containing protein
XADLMG695_RS07555  0       9        -0.214389  type II secretion system secretin GspD
XADLMG695_RS07560  1       10       0.227230   type II secretion system ATPase GspE
XADLMG695_RS07565  1       10       0.423584   type II secretion system inner membrane protein
XADLMG695_RS07570  1       10       0.430312   type II secretion system major pseudopilin GspG
XADLMG695_RS07575  4       13       1.471552   GspH/FimT family pseudopilin
XADLMG695_RS07580  3       12       1.309245   type II secretion system minor pseudopilin GspI
XADLMG695_RS07585  4       15       1.607036   type II secretion system minor pseudopilin GspJ
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS07525  HTHFIS  54  3e-11  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 53.7 bits (129), Expect = 3e-11
Identities = 24/124 (19%), Positives = 51/124 (41%), Gaps = 14/124 (11%)

Query: 1 MTAIRTILLAEDSPADAEMAVDALREARLANPIVHVEDGVEAMDYLLRRGVFADREEGLP 60
MT IL+A+D A + AL R + + ++ G
Sbjct: 1 MTGAT-ILVADDDAAIRTVLNQALS--RAGYDVRITSNAATLWRWI---------AAGDG 48

Query: 61 AVLLLDIKMPRLDGLEVLKQVRSDETLKRLPVVILSSSREESDLARSWDLGVNAYVVKPV 120
+++ D+ MP + ++L +++ LPV+++S+ ++ + G Y+ KP
Sbjct: 49 DLVVTDVVMPDENAFDLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106

Query: 121 DVDQ 124
D+ +
Sbjct: 107 DLTE 110


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS07530  HTHFIS  69  2e-14  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 68.7 bits (168), Expect = 2e-14
Identities = 35/147 (23%), Positives = 62/147 (42%), Gaps = 4/147 (2%)

Query: 12 KILLVEDSPEDAELLSDQLLEAGLDAAFERVDSEPSLRAALDEFQPDIVLSDLSMPGFSG 71
IL+ +D +L+ L AG D + +L + D+V++D+ MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDV--RITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 72 HQALRLVRQNGA-TPFIFVSGTMGEETAVKALQDGANDYIIKH-NPTRLPSAVIRAIREA 129
L +++ P + +S TA+KA + GA DY+ K + T L + RA+ E
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 130 RADLERQRVESELMRAQRLESLAMLAA 156
+ + +S+ S AM
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEI 149



Score = 41.4 bits (97), Expect = 7e-06
Identities = 28/126 (22%), Positives = 53/126 (42%), Gaps = 15/126 (11%)

Query: 380 GQRILLVDGEATRLSLLGNALSSQGYQPQLASDGAAALQLVQQHAMPDLVIIDSDIILLS 439
G IL+ D +A ++L ALS GY ++ S+ A + + DLV+ D + +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDEN 61

Query: 440 AVSVLLSMQELGYQGPAIVLED-------VGAPLQRAHFPADLPVHVLRKPLEMRRVFRA 492
A +L +++ P +V+ + A + A+ L KP ++ +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAY-------DYLPKPFDLTELIGI 114

Query: 493 VSHALE 498
+ AL
Sbjct: 115 IGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS07540  DHBDHDRGNASE  117  5e-34  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 117 bits (293), Expect = 5e-34
Identities = 80/253 (31%), Positives = 126/253 (49%), Gaps = 16/253 (6%)

Query: 6 KVVLITGAARRIGAQIATTLHAAGYRVALHAHRSADALDARVAELCAQRAGSAHALHADL 65
K+ ITGAA+ IG +A TL + G +A + + L+ V+ L A+ A A A AD+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAE-ARHAEAFPADV 66

Query: 66 RLPDAPAQLVADCLAAFGRLDGVVNNASAFYPTPVGAATAAQWDELFAVNARAPFFIAQA 125
R A ++ A G +D +VN A P + + + +W+ F+VN+ F +++
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 126 AAAQLRQRR-GAIVNLTDLHAQQPMRNHPLYGASKSALEMLTRSLALELAPQ-VRVNAVA 183
+ + RR G+IV + A P + Y +SK+A M T+ L LELA +R N V+
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 184 PGAI-------LWPEEGKSADAKQALLAR----TPLARIGTPEEIAEAVRWLLDD-ASFV 231
PG+ LW +E + + L PL ++ P +IA+AV +L+ A +
Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 232 TGHTLHVDGGRQL 244
T H L VDGG L
Sbjct: 247 TMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS07560  FLGBIOSNFLIP  31  0.006  Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP

signature.
Length = 245

Score = 30.6 bits (69), Expect = 0.006
Identities = 11/61 (18%), Positives = 24/61 (39%), Gaps = 3/61 (4%)

Query: 25 EQALQPLLDQGWNEQDAIDAVEALVRAHIQQHAQANGLPMPVRV---PALQQDTDASLLA 81
A QP ++ + Q+A++ +R + + + L + R+ LQ +
Sbjct: 110 VDAYQPFSEEKISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRI 169

Query: 82 L 82
L
Sbjct: 170 L 170


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS07565  PRTACTNFAMLY  30  0.018  Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 30.4 bits (68), Expect = 0.018
Identities = 21/45 (46%), Positives = 22/45 (48%), Gaps = 2/45 (4%)

Query: 39 PPAPAPAPTPAPTPAPTPAPAPSGPAADCPSG--FSNVGTIASNT 81
PPAP PAP P P P P P P PA P+G S A NT
Sbjct: 572 PPAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNT 616


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS07575  BCTERIALGSPC  43  4e-07  Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C

signature.
Length = 272

Score = 43.4 bits (102), Expect = 4e-07
Identities = 41/185 (22%), Positives = 68/185 (36%), Gaps = 20/185 (10%)

Query: 83 IVLHGVRAGG-AQAAAYLSGSDGRQGVYRVGDTV-ATGVVVQAIAADHVLLRAGGSVRRI 140
+ L GV AG + + D Q V + V + +I D V+L+ G +
Sbjct: 95 LSLTGVMAGDDDSRSIAIISKDNEQFSRGVNEEVPGYNAKIVSIRPDRVVLQYQGRYEVL 154

Query: 141 ALGESGAAAAALPPAATGATASAAVAATVQSNVSAAAGTSAATAVDPQQLLASAGLRASA 200
L + + P A V + A T+ + V ++ L+
Sbjct: 155 GLYSQEDSGSDGVPGAQ-----------VNEQLQQRASTTMSDYVSFSPIMNDNKLQ--- 200

Query: 201 DGGGFTIMPRGDGALLRQAGLAPGDVLTQINGRTL-DAEHLRELQDELRDGQSATLTCRR 259
G+ + P + GL D+ +NG L DAE ++ + + D + TLT R
Sbjct: 201 ---GYRLNPGPKSDSFYRVGLQDNDMAVALNGLDLRDAEQAKKAMERMADVHNFTLTVER 257

Query: 260 DGQTH 264
DGQ
Sbjct: 258 DGQRQ 262


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS07580  BCTERIALGSPD  368    e-120    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 368 bits (946), Expect = e-120
Identities = 209/679 (30%), Positives = 334/679 (49%), Gaps = 60/679 (8%)

Query: 8 WLLSAALLFALPAVPMTALHAADAPAVRLQDVDLRAFIQDVSRATGITFIVDTRVQGSVN 67
+ L+ + AL P AA+ + + D++ FI VS+ T I+D V+G++
Sbjct: 10 FSLTLLIFAALLFRPA----AAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTIT 65

Query: 68 VARAQAMSEADLLGMLLAVLRANGLIAVSSGPSTYRIIPDDTAAQQPG-----SAANGNL 122
V ++E L+VL G ++ +++ A +A
Sbjct: 66 VRSYDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGD 125

Query: 123 GFATQVFTLQRVDARSAAEILKPLIGRGGVIMAM--PQGNSLLIADYADNLRRIRTLVAQ 180
T+V L V AR A +L+ L GV + N LL+ A ++R+ T+V +
Sbjct: 126 EVVTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVER 185

Query: 181 IDTDR-AAIDTVTLRNSSAQELARTLTSLF----GQGGERSNVLSVLPVESSNSLIVRGD 235
+D ++ TV L +SA ++ + +T L S V +V+ E +N+++V G+
Sbjct: 186 VDNAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGE 245

Query: 236 PALVQRVVRTAVDLDGRAERRGDVSVVRLQHASAEQLLPVLQQLVGQAPGNEAQAGQDTR 295
P QR++ LD + +G+ V+ L++A A L+ VL + +E QA +
Sbjct: 246 PNSRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTG-ISSTMQSEKQAAKPV- 303

Query: 296 TNAVDVAAASGAAQTQVIAPAAGKRPVIVRY-PGSNALIINADPETQRALMDVIRQLDVH 354
AA + +I++ +NALI+ A P+ L VI QLD+
Sbjct: 304 --------------------AALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIR 343

Query: 355 REQVLVEAIVVEISDTAAKRLGVQLLLAGRNGTVPLLATQYSGAAPGIVPLAAAAAGTRS 414
R QVLVEAI+ E+ D LG+Q A +N + TQ++ + +P++ A AG
Sbjct: 344 RPQVLVEAIIAEVQDADGLNLGIQW--ANKNAGM----TQFTNSG---LPISTAIAGANQ 394

Query: 415 GNADDDSVLEQARNVAAQSLLGLSGGLIGLAGQSDDAVFGMIIDAVKSDTGSNLLSTPSI 474
N D + SL G+A + M++ A+ S T +++L+TPSI
Sbjct: 395 YNKDGTV---------SSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSI 445

Query: 475 MTLDNEQARILVGQEVPITTGEVLGAANDNPFRTIQRQDVGVELEVRPQINTAGGITLAI 534
+TLDN +A VGQEVP+ TG + DN F T++R+ VG++L+V+PQIN + L I
Sbjct: 446 VTLDNMEATFNVGQEVPVLTGSQTTS-GDNIFNTVERKTVGIKLKVKPQINEGDSVLLEI 504

Query: 535 KQEVSAIAGPVSAQSSEL--VFNKRQIETRVVVENGAIVALGGLLDQNDRQTVEKVPLLG 592
+QEVS++A S+ SS+L FN R + V+V +G V +GGLLD++ T +KVPLLG
Sbjct: 505 EQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLG 564

Query: 593 DVPGLGALFRHRSRNRDKTNLMVFIRPTIIRDAADAQRMTAPRYNYLRERQLADGDPEAA 652
D+P +GALFR S+ K NLM+FIRPT+IRD + ++ ++ +Y + Q E
Sbjct: 565 DIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENN 624

Query: 653 LDALVRDYLRAQPPQLPAG 671
L +D L P Q A
Sbjct: 625 DAMLNQDLLEIYPRQDTAA 643


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS07590  BCTERIALGSPF  341    e-117    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 341 bits (877), Expect = e-117
Identities = 173/405 (42%), Positives = 242/405 (59%), Gaps = 8/405 (1%)

Query: 1 MPQFDYTVLDLHGRNRHGVISADSVHGARAQLEQRQWVPVRVEAAAATASTSG------- 53
M Q+ Y LD G+ G ADS AR L +R VP+ V+ SG
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 54 RAARFSGKDLVLFTRQLATLVETA-PLEEALRTIGTQSERRGVRRVTSRTHALVVEGFRL 112
R R S DL L TRQLATLV + PLEEAL + QSE+ + ++ + + V+EG L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 113 SDAMARQGKAFPALYRAMVAAGESAGALPQVLERLADLLERQAQVRSKLQSALVYPAALA 172
+DAM +F LY AMVAAGE++G L VL RLAD E++ Q+RS++Q A++YP L
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 173 VTAGAVVIVLMTFVVPKVVDQFDSMGRALPWLTRAVIGVSQFLLHAGIPLLVALVVALVA 232
V A AVV +L++ VVPKVV+QF M +ALP TR ++G+S + G +L+AL+ +A
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 233 TARLLQRPALRLAADRALLRAPLLGRLIRDLHAARMARTLAIMVNSGLPLMEGLMIAART 292
+L++ R++ R LL PL+GR+ R L+ AR ARTL+I+ S +PL++ + I+
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 293 VDNRALRLATDSMVTAIREGGSLAAAMKRAGVFPPTLLYMASSGENSGRLAPMLERAADY 352
+ N R A+REG SL A+++ +FPP + +M +SGE SG L MLERAAD
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 353 LEREFEAFTAAAMSLLEPAIIVLLGGVVAVIVLSILLPILQFNTL 397
+REF + A+ L EP ++V + VV IVL+IL PILQ NTL
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTL 405


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS07595  BCTERIALGSPG  183    1e-62    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 183 bits (466), Expect = 1e-62
Identities = 65/141 (46%), Positives = 94/141 (66%), Gaps = 3/141 (2%)

Query: 13 LTARRRTRGFTLVELMVVIVIIGLLATVVMINVMPSQDRAMVEKARADVAVLEQALETYR 72
+ A + RGFTL+E+MVVIVIIG+LA++V+ N+M ++++A +KA +D+ LE AL+ Y+
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60

Query: 73 LDNLSYPSTEQGLQALLNPPSGLTRPERYRQGGYIRRLPEDPWGHAYQYRRPGRQGGFDV 132
LDN YP+T QGL++L+ P+ Y + GYI+RLP DPWG+ Y PG G +D+
Sbjct: 61 LDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120

Query: 133 YSLGADGAEGGDADNADIGNW 153
S G DG G + DI NW
Sbjct: 121 LSAGPDGEMGTE---DDITNW 138


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS07600  BCTERIALGSPH  51     8e-11    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 50.7 bits (121), Expect = 8e-11
Identities = 20/79 (25%), Positives = 38/79 (48%), Gaps = 5/79 (6%)

Query: 9 RGFTLLEMLAVLVIAALASTLVVMTLPDTRRDLHDHADTLAS---ALIHARDEAILSLRM 65
RGFTLLEM+ +L++ +++ +V++ P +R D A TLA L + + + +
Sbjct: 4 RGFTLLEMMLILLLMGVSAGMVLLAFPASRDD--SAAQTLARFEAQLRFVQQRGLQTGQF 61

Query: 66 VEVGIDAGGYGFRRQAQQQ 84
V + + F +
Sbjct: 62 FGVSVHPDRWQFLVLEARD 80


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS07610  BCTERIALGSPG  35     4e-05    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 35.2 bits (81), Expect = 4e-05
Identities = 12/48 (25%), Positives = 26/48 (54%)

Query: 1 MIRKQRTRGFTLIELLVALAVFALVAAAAVMVMRQSIDQRDAVRARLQ 48
M + RGFTL+E++V + + ++A+ V + + ++ D +A
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSD 48


70  XADLMG695_RS09425  XADLMG695_RS09490  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS09425  1       9        -1.623223  response regulator transcription factor
XADLMG695_RS09430  1       11       -2.023296  HAMP domain-containing histidine kinase
XADLMG695_RS09435  1       14       -2.436875  type I toxin-antitoxin system SymE family toxin
XADLMG695_RS09440  1       13       -2.026962  RHS repeat protein
XADLMG695_RS09445  1       19       -4.205911  dephospho-CoA kinase
XADLMG695_RS09450  3       23       -5.022525  prepilin peptidase
XADLMG695_RS09460  2       19       -5.134218  type II secretion system F family protein
XADLMG695_RS09465  1       22       -5.895758  pilin
XADLMG695_RS09470  2       22       -6.579016  type IV-A pilus assembly ATPase PilB
XADLMG695_RS09475  1       18       -5.834746  hypothetical protein
XADLMG695_RS09480  -1      11       -1.204450  hypothetical protein
XADLMG695_RS09485  -1      13       -1.225315  sigma-54-dependent Fis family transcriptional
XADLMG695_RS09490  -1      22       -0.054556  HAMP domain-containing histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS09440  HTHFIS        88     2e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.3 bits (219), Expect = 2e-22
Identities = 32/119 (26%), Positives = 60/119 (50%), Gaps = 2/119 (1%)

Query: 2 RILVIEDNSDIAANLGDYLEDRGHTVDFAADGVTGLHLAVVHEFDAIVLDLNLPGMDGIE 61
ILV +D++ I L L G+ V ++ T + D +V D+ +P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VCRKLRNEARKQTPVLMLTARDSLDNKLAGFDSGADDYLIKPFALQE-VEVRLNALSRR 119
+ +++ +AR PVL+++A+++ + + GA DYL KPF L E + + AL+
Sbjct: 65 LLPRIK-KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS09470  PREPILNPTASE  330    e-116    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 330 bits (848), Expect = e-116
Identities = 129/282 (45%), Positives = 176/282 (62%), Gaps = 1/282 (0%)

Query: 1 MAFLDQHPGLGFPAAAGLGLLIGSFLNVVILRLPKRMEWQWRRDAREILELPDI-YEPPP 59
+ P L F L+IGSFLNVVI RLP +E +W+ + R D + PP
Sbjct: 5 LELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEPP 64

Query: 60 PGIVVEPSHDPVTGDKLKWWENIPVLSWAMLRGKSRYSGKPISIQYPLVELLTSILCVAS 119
++V S P + ENIP+LSW LRG+ R PIS +YPLVELLT++L VA
Sbjct: 65 YNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVAV 124

Query: 120 VWRFGFGWQGFGAIVLSCFLVAMSGIDLRHKLLPDQLTLPLMWLGLVGSMDNLYMPAKPA 179
GW A++L+ LVA++ IDL LLPDQLTLPL+W GL+ ++ ++ A
Sbjct: 125 AMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDA 184

Query: 180 LLGAAVGYVSLWTVWWLFKQLTGKEGMGHGDFKLLAALGAWCGLKGILPIILISSLVGAI 239
++GA GY+ LW+++W FK LTGKEGMG+GDFKLLAALGAW G + + ++L+SSLVGA
Sbjct: 185 VIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAF 244

Query: 240 LGSIWLVAKGRDRATPIPFGPYLAIAGWVVFFWGNDLVDGYL 281
+G ++ + ++ PIPFGPYLAIAGW+ WG+ + YL
Sbjct: 245 MGIGLILLRNHHQSKPIPFGPYLAIAGWIALLWGDSITRWYL 286


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS09475  BCTERIALGSPF  382    e-133    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 382 bits (982), Expect = e-133
Identities = 113/405 (27%), Positives = 211/405 (52%), Gaps = 9/405 (2%)

Query: 23 LFLWEGTDKRGIKMKGEQTARNMNMLRAELRRQGINPSIVKLK--------PKPLFGAAG 74
+ ++ D +G K +G Q A + R LR +G+ P V L
Sbjct: 3 QYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRK 62

Query: 75 KKITPKDIAFFSRQMATMMKSGVPIVGSLEIIGEGHKNPRMKKMVGQVRTDIEGGSSLYE 134
+++ D+A +RQ+AT++ + +P+ +L+ + + + P + +++ VR+ + G SL +
Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122

Query: 135 SISRHPVQFDELYRNLVRAGEGAGVLETVLDTVATYKENIEALKGKIKKALFYPAMVIAV 194
++ P F+ LY +V AGE +G L+ VL+ +A Y E + ++ +I++A+ YP ++ V
Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182

Query: 195 ALIVSAILLIFVVPQFEEVFKGFGAELPAFTQMIVGASRFMVSYWWIMFFVIAGAIVGFV 254
A+ V +ILL VVP+ E F LP T++++G S + ++ M + + F
Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242

Query: 255 FAYKRSPSMQHTMDRLILRVPVIGQIMHNSSIARFARTTAVTFKAGVPLVEALGIVAGAT 314
R + + R +L +P+IG+I + AR+ART ++ + VPL++A+ I
Sbjct: 243 VML-RQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVM 301

Query: 315 GNRVYEDAVLRMRDDVSVGYPVNMAMKQVNLFPHMVIQMTAIGEEAGALDSMLFKVAEYF 374
N + D V G ++ A++Q LFP M+ M A GE +G LDSML + A+
Sbjct: 302 SNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQ 361

Query: 375 EQEVNNAVDALSSLLEPMIMVFIGVVVGGMVIGMYLPIFKLGAVV 419
++E ++ + L EP+++V + VV +V+ + PI +L ++
Sbjct: 362 DREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS09480  BCTERIALGSPG  47     1e-09    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 46.8 bits (111), Expect = 1e-09
Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 5/71 (7%)

Query: 1 MKKQNGFTLIELMIVVAIIAILAAIALPAYQDYTVRGRVSEAMVAASAAKTVVAENAANG 60
KQ GFTL+E+M+V+ II +LA++ +P + G +A + + V ENA +
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVP-----NLMGNKEKADKQKAVSDIVALENALDM 58

Query: 61 SALNSGWTPPT 71
L++ P T
Sbjct: 59 YKLDNHHYPTT 69


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS09500  HTHFIS        515    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 515 bits (1328), Expect = 0.0
Identities = 169/474 (35%), Positives = 256/474 (54%), Gaps = 17/474 (3%)

Query: 6 SALVVDDERDIRELLVLTLGRMGLRISTAANLAEARELLANNPYDLCLTDMRLPDGNGIE 65
+ LV DD+ IR +L L R G + +N A +A DL +TD+ +PD N +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 66 LVTEIAKHYPQTPVAMITAFGSMDLAVEALKAGAFDFVSKPVDIGVLRGLVKHALELNNR 125
L+ I K P PV +++A + A++A + GA+D++ KP D+ L G++ AL R
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 126 DRPAPPPPPPEQASRLLGDSSAMESLRATISKVARSQAPVYIVGESGVGKELVARTIHEQ 185
RP+ + L+G S+AM+ + ++++ ++ + I GESG GKELVAR +H+
Sbjct: 125 -RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183

Query: 186 GARAAGPFVPVNCGAIPAELMESEFFGHKKGSFTGAHADKPGLFQAAHGGTLFLDEVAEL 245
G R GPFV +N AIP +L+ESE FGH+KG+FTGA G F+ A GGTLFLDE+ ++
Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243

Query: 246 PLQMQVKLLRAIQEKSVRPVGASSESLVDVRILSATHKDLGDLVSDGRFRHDLYYRINVI 305
P+ Q +LLR +Q+ VG + DVRI++AT+KDL ++ G FR DLYYR+NV+
Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303

Query: 306 ELRVPPLRERGGDLPQLAAAIIARLAHSHGRPIPLLTQSALDALNHYGFPGNVRELENIL 365
LR+PPLR+R D+P L + + A G + Q AL+ + + +PGNVRELEN++
Sbjct: 304 PLRLPPLRDRAEDIPDLVRHFVQQ-AEKEGLDVKRFDQEALELMKAHPWPGNVRELENLV 362

Query: 366 ERALALAEDDQISATDLRLPAH---------------GGHRLAAPPGGAAAEPREAVVDI 410
R AL D I+ + G ++ + + D
Sbjct: 363 RRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDA 422

Query: 411 DPASAALPSYIEQLERAAIQKALEENRWNKTKTAAQLGITFRALRYKLKKLGME 464
P S + ++E I AL R N+ K A LG+ LR K+++LG+
Sbjct: 423 LPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS09510  FLGMOTORFLIM  33     0.003    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 32.6 bits (74), Expect = 0.003
Identities = 19/91 (20%), Positives = 35/91 (38%), Gaps = 9/91 (9%)

Query: 210 TGVLVVDTHNHISLANEAALSLLG-DG---DQRTPSTDLSLVALTPELARRLQRWRSGWR 265
VL VD S+ L G G + TD+ + + R L R W
Sbjct: 114 NAVLEVDP----SITFSIIDRLFGGTGQAAKVQRDLTDIENSVMEGVIVRILANVRESW- 168

Query: 266 EEEAPLQLGADRPEVQPRFVRLLADSDLALV 296
+ L+ + E P+F +++ S++ ++
Sbjct: 169 TQVIDLRPRLGQIETNPQFAQIVPPSEMVVL 199


71  XADLMG695_RS09675  XADLMG695_RS09690  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS09675  3       14       3.064147   type III PLP-dependent enzyme
XADLMG695_RS09680  2       12       2.732899   iron transporter
XADLMG695_RS09685  2       12       2.518659   MFS transporter
XADLMG695_RS09690  4       12       2.507296   siderophore biosynthesis protein, IucA/IucC
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS09700  ALARACEMASE   40     1e-05    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 39.8 bits (93), Expect = 1e-05
Identities = 47/224 (20%), Positives = 78/224 (34%), Gaps = 32/224 (14%)

Query: 31 DLAALHTHAAWMRAQLPAQCELFYAAKANA----EPPVLRTLATHVDGFEAASGGELAWL 86
DL AL + + +R Q ++ KANA + + DGF + E L
Sbjct: 10 DLQALKQNLSIVR-QAATHARVWSVVKANAYGHGIERIWSAIGA-TDGFALLNLEEAITL 67

Query: 87 HAQQPQAPLLFGGPGKLDTELAQAAALPDCTVHVESLRELERLAAIATHGGRCVPVFLRM 146
+ + P+L G + + T V S +L+ L + ++L++
Sbjct: 68 RERGWKGPILMLE-GFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARL--KAPLDIYLKV 124

Query: 147 NIAVPGAQSTRLMMGGQPSPFGLDPCDLDAAMQRLQASPSLRLEGFHFHLMSHQRNATAQ 206
N + RL G P + Q+L+A ++ LMSH A
Sbjct: 125 NSGM-----NRL---------GFQPDRVLTVWQQLRAMANVGEMT----LMSHFAEAEHP 166

Query: 207 LHLVAAYLRTVQQWRQTYALGPLRVNAGGGFGVDYLAPEASFDW 250
+ A R ++Q + N+ PEA FDW
Sbjct: 167 DGISGAMAR-IEQAAEGLECRRSLSNSAATL----WHPEAHFDW 205


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS09705  PF04183       293    7e-94    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 293 bits (751), Expect = 7e-94
Identities = 94/468 (20%), Positives = 165/468 (35%), Gaps = 46/468 (9%)

Query: 100 DAQALARCLLQALASTQAINPELLAQSANSVAVT------AAFLRQAQLTAATGEAMIDA 153
D LA+ LL L +++ +A+ + T R+ + D
Sbjct: 69 DEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSASDLINLNADR 128

Query: 154 EQSMLWGHALHPTPKSREGVDLDQVLACAPEARASFQLFWF-------------RIDPRL 200
Q +L GH K R G + + APE +F+L W +D
Sbjct: 129 LQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRCDNEMDIHQ 188

Query: 201 LRIQGRDVRATLR-----QLSGSDDLY---PCHPWEAQRLLDAPLLRTMQARGLITPIGP 252
L D + R Q +G D + P HPW+ Q+ + + A G + +G
Sbjct: 189 LLTAAMDPQEFARFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADF-AEGRMVSLGE 247

Query: 253 LGDALRPTSSVRTLYHPE--LAYFLKCSVHVRLTNCVRKNAWYELESAVALSELLAPSWR 310
GD S+RTL + +K + + T+C R + + S L +
Sbjct: 248 FGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWLQQVFA 307

Query: 311 ALAMQV-PGFDVMLEPAATSLDVALVDPALHAADPLAARTLSESFGILYRQGIPAAQRAR 369
A V G ++ EPAA V +AA A E G+++R+ +
Sbjct: 308 TDATLVQSGAVILGEPAA-----GYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLKPD 362

Query: 370 WQPQVAAALFTCDAQGNSVCAARLRALGSAQMNRRTATLLWFGAYAGLLLDGVWSALFQH 429
P + A L CD + A + G W +++ ++ L ++
Sbjct: 363 ESPVLMATLMECDENNQPLAGAYIDRSGLDAET-------WLTQLFRVVVVPLYHLLCRY 415

Query: 430 GIALEPHLQNTVIGFADGWPTRVWIRDLEGT-KLLAHHWPETRLRGVGERARQSLYYTPE 488
G+AL H QN + +G P RV ++D +G +L+ +PE + + + R
Sbjct: 416 GVALIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPE--MDSLPQEVRDVTSRLSA 473

Query: 489 QGWNRVAYCALVNNLAEAIFHLSQGDAALETQLWQCVGEIALRWQQRH 536
+ I L E + +Q + + + ++H
Sbjct: 474 DYLIHDLQTGHFVTVLRFISPLMVRLGVPERRFYQLLAAVLSDYMKKH 521


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS09710  TCRTETA       61     3e-12    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 60.6 bits (147), Expect = 3e-12
Identities = 50/156 (32%), Positives = 67/156 (42%), Gaps = 3/156 (1%)

Query: 20 LGMPLFLPQVLAELAPAA-AVGWSGVLYVLPTLCTALTASSWGRWADRHGRKRSLLRAQL 78
L MP+ LP +L +L + G+L L L A G +DR GR+ LL +
Sbjct: 23 LIMPV-LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLA 81

Query: 79 GLALGFAIAGFAPSLSWLVIGLVVQGTCGGSLAAANAYLASQPQAGPLARALDWTQYSAR 138
G A+ +AI AP L L IG +V G G + A A AY+A AR +
Sbjct: 82 GAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFG 141

Query: 139 LAMVSAPALLGLALALGPAQSLYRALALLPLIAFAL 174
MV+ P L GL P + A A L + F
Sbjct: 142 FGMVAGPVLGGLMGGFSPHAPFFAA-AALNGLNFLT 176



Score = 35.6 bits (82), Expect = 3e-04
Identities = 35/110 (31%), Positives = 42/110 (38%), Gaps = 4/110 (3%)

Query: 278 LLPGLALFAVACVWQALLHDALALAVARLLFGL-GMLFALRGLNRSLAHIASGHGAGRLF 336
LL LA AV A L + R++ G+ G A+ G +A I G R F
Sbjct: 76 LLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG--AYIADITDGDERARHF 133

Query: 337 GRFDACGKWAGVFAGAAAGALAQASGPATPFLAAALAAAAAALTVVVRFP 386
G AC G+ AG G L P PF AAA LT P
Sbjct: 134 GFMSACFG-FGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLP 182


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS09715  PF04183       148    2e-40    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 148 bits (376), Expect = 2e-40
Identities = 95/437 (21%), Positives = 145/437 (33%), Gaps = 62/437 (14%)

Query: 81 DSWIVRSDDGVHV---ERGAHAWLH-------RISAELDAQT--QQL-HRAYADEADCAA 127
D + + ERG WL + AQT QL +A A
Sbjct: 35 DRYCINLPGAQWRFIAERGIWGWLWIDAQTLRCADEPVLAQTLLMQLKQVLSMSDATVAE 94

Query: 128 AHRGLARQAYHAQAPALRTALHHPDAAERAYRCDQLASYRD-HPFYPTARAKAGLDAAEL 186
+ L L+ + D+L HP + + + G L
Sbjct: 95 HMQDLYATLLGDLQ-LLKARRGLSASDLINLNADRLQCLLSGHPKFVFNKGRRGWGKEAL 153

Query: 187 RHYAPEFAPTFALHWLAIPQALAQCTSAAP------------AELWPDFASLGLPPELAA 234
YAPE+A TF LHWLA+ + + + F+ + L
Sbjct: 154 ERYAPEYANTFRLHWLAVKREHMIWRCDNEMDIHQLLTAAMDPQEFARFSQVWQENGLDH 213

Query: 235 THLPWPVHPLMWERLEQEGFA--LPEDVLR----APNAWLDVRPSLSVRTLVPPQHPQ-L 287
LP PVHP W++ F E + + W S+RTL L
Sbjct: 214 NWLPLPVHPWQWQQKIATDFIADFAEGRMVSLGEFGDQW---LAQQSLRTLTNASRRGGL 270

Query: 288 HLKLPIPMRTLGALNLRLIKPSTLYDGHWMERALRHIDALDPALQGRCVFV-DESHGGHV 346
+KLP+ + R I + G R L+ + A D L + E G+V
Sbjct: 271 DIKLPLTIYNTSC--YRGIPGRYIAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYV 328

Query: 347 -------------GQTRHLAYLVRRYPAL---DDATLVPVAALCAPMPDGRPMAIHLAER 390
L + R P D + V +A L + +P+A +R
Sbjct: 329 SHEGYAALARAPYRYQEMLGVIWRENPCRWLKPDESPVLMATLMECDENNQPLAGAYIDR 388

Query: 391 FAHGDVLRWWRDYTELLLAVHLRLWLGYGIALEANQQNSVLVYSDGQATRLLMKDN-DAA 449
D W +++ L YG+AL A+ QN L +G R+L+KD
Sbjct: 389 SGL-DAETWLTQLFRVVVVPLYHLLCRYGVALIAHGQNITLAMKEGVPQRVLLKDFQGDM 447

Query: 450 RIALPQLRAALPELDAL 466
R+ + PE+D+L
Sbjct: 448 RLVKEE----FPEMDSL 460


72  XADLMG695_RS09850  XADLMG695_RS09905  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS09850  -3      18       1.638122   Holliday junction branch migration DNA helicase
XADLMG695_RS09855  -1      17       2.178195   tol-pal system-associated acyl-CoA thioesterase
XADLMG695_RS09860  0       16       1.202239   protein TolQ
XADLMG695_RS09865  0       15       1.030319   protein TolR
XADLMG695_RS09870  1       15       0.828014   cell envelope integrity protein TolA
XADLMG695_RS09875  0       16       0.465688   Tol-Pal system beta propeller repeat protein
XADLMG695_RS09880  0       10       1.088206   peptidoglycan-associated lipoprotein Pal
XADLMG695_RS09885  -1      12       0.631318   tol-pal system protein YbgF
XADLMG695_RS09890  0       16       0.318199   7-carboxy-7-deazaguanine synthase QueE
XADLMG695_RS09895  0       15       0.239018   *7-cyano-7-deazaguanine synthase QueC
XADLMG695_RS09900  0       17       0.048436   MEKHLA domain-containing protein
XADLMG695_RS09905  1       18       0.368272   response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS09885  FERRIBNDNGPP  28     0.046    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 28.4 bits (63), Expect = 0.046
Identities = 20/72 (27%), Positives = 28/72 (38%), Gaps = 17/72 (23%)

Query: 17 AADASIRPKRLADYLGQQPVRE----QMEIYIQAAKAR-----------GEAMD--HVLI 59
A A +AD L Q E Q E +I++ K R +D H+L+
Sbjct: 131 LAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLV 190

Query: 60 FGPPGLGKTTLS 71
FGP L + L
Sbjct: 191 FGPNSLFQEILD 202


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS09905  IGASERPTASE   61     3e-12    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 61.2 bits (148), Expect = 3e-12
Identities = 46/280 (16%), Positives = 86/280 (30%), Gaps = 35/280 (12%)

Query: 39 LWSPE-----RSVEPAAGDPSMEASLDVSAAEARVARQALKATPVETPPPPAPLPEPAPE 93
L++PE ++V+ DV + + A PP PA E
Sbjct: 980 LYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTET 1039

Query: 94 DSVPPPQ--PIPEPRPQDA--PTPQQAQAQERVAQPDKVDQERVDALAISAEKAKQEQEA 149
+ Q E QDA T Q + K + V A + E A+ E
Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREV-------AKEAKSNVKANTQTNEVAQSGSET 1092

Query: 150 KRRQEQIDLTERKRQEEAEQKLRLAKQQEEAD------AKKKQAAAQQAAEEAERQKKIA 203
K Q ++E + K+ K QE K++Q+ Q E R+
Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPT 1152

Query: 204 EIRRQRAQADKEMALAEQKLRQVAAARAQQASAAAATSAQPTAGQGGTSTDLSAKYAAAI 263
++ A EQ ++ ++ Q + + + + + +T +
Sbjct: 1153 VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVN 1212

Query: 264 QQ-------------KVLAQWVRPPSVPPGQKCTINIRQL 290
+ + + V P + + T+ + L
Sbjct: 1213 SESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDL 1252



Score = 37.0 bits (85), Expect = 1e-04
Identities = 33/217 (15%), Positives = 62/217 (28%), Gaps = 16/217 (7%)

Query: 47 EPAAGDPSMEASLDVSAAE-----ARVARQALKATPVETPPPPAPLPEPAPEDSVPPPQP 101
+ + EA +V A A+ + + ET E + V +
Sbjct: 1062 TAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATV--EKEEKAKVETEKT 1119

Query: 102 IPEPRPQDAPTPQQAQAQERVAQPDKVDQERVDALAISAEKAKQEQEAKRRQEQIDLTER 161
P+ +P+Q Q++ Q + +E + I +++ A Q + +
Sbjct: 1120 QEVPKVTSQVSPKQEQSETVQPQAEP-ARENDPTVNIKEPQSQTNTTADTEQPAKETSSN 1178

Query: 162 KRQEEAEQKLRLAKQQEEADAKKKQAAAQQAAEEAERQKKIAEIRRQRAQADKEMALAEQ 221
Q E + + A Q +E K R+ + +
Sbjct: 1179 VEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVR-------SVP 1231

Query: 222 KLRQVAAARAQQASAAAATSAQPTAGQGGTSTDLSAK 258
+ A + S A T S D AK
Sbjct: 1232 HNVEPATTSSNDRSTVALCDLTSTNTNAVLS-DARAK 1267


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS09915  OMPADOMAIN    106    3e-30    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 106 bits (266), Expect = 3e-30
Identities = 35/112 (31%), Positives = 51/112 (45%), Gaps = 11/112 (9%)

Query: 67 VYFDLDQDSLKPEFQAIMACHAKYLR--DRPSSRITLQGNADERGSREYNMGLGERRGNA 124
V F+ ++ +LKPE QA + L D + + G D GS YN GL ERR +
Sbjct: 221 VLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQS 280

Query: 125 VSSSLQAAGGSASQLTVVSYGEERPVCTESNE---------SCWSQNRRVEI 167
V L + G A +++ GE PV + + C + +RRVEI
Sbjct: 281 VVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEI 332


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS09920  RTXTOXIND     34     7e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 33.6 bits (77), Expect = 7e-04
Identities = 23/88 (26%), Positives = 40/88 (45%), Gaps = 7/88 (7%)

Query: 30 RVAVLEQQQANSQANNDL---LNQLQQARSDLQALRSTVEQLQHD--NEQLKQ--QSKDQ 82
+ AVLEQ+ +A N+L +QL+Q S++ + + + + NE L + Q+ D
Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN 310

Query: 83 YLDLDGRLNRLEGAGGATPPLPPATGSV 110
L L + E A+ P + V
Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKV 338


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS09940  PF06580       36     2e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.0 bits (83), Expect = 2e-04
Identities = 40/209 (19%), Positives = 70/209 (33%), Gaps = 24/209 (11%)

Query: 146 EHKQHEQHLQLLINELN-HRVKNSLVMVQSLARQSFTNAGGLADAQEKLDARLLALSRAH 204
E L L ++N H + N+L +++L + ++ L L R
Sbjct: 155 ASMAQEAQLMALKAQINPHFMFNALNNIRALILED-------PTKAREMLTSLSELMRYS 207

Query: 205 DTLTRENWVS-ADVLELTRDAAALYESHDSQRFTLQGDSCRLDP--RRALALSMALHELC 261
+ VS AD L + L R + ++P M + L
Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ---INPAIMDVQVPPMLVQTLV 264

Query: 262 TNALKHG-ALSLPAGNVMVSWERSTRGEQERLELIWRESGGPPVQP-PTHKGFGTRLLER 319
N +KHG A G +++ + + L +G ++ G G + +
Sbjct: 265 ENGIKHGIAQLPQGGKILL----KGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRE 320

Query: 320 GLKHDLKGE---VELSFDPAGVCFRVSIP 345
L+ L G ++LS V V IP
Sbjct: 321 RLQM-LYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS09945  HTHFIS        48     1e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 47.5 bits (113), Expect = 1e-09
Identities = 13/82 (15%), Positives = 37/82 (45%), Gaps = 3/82 (3%)

Query: 4 RVLLVEDESLVAMLLEDCLAELGYEVAATVADVDAALQAVQEGNLDLALLDINLGGTLSF 63
+L+ +D++ + +L L+ GY+V ++ + + G+ DL + D+ + +F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDV-RITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 64 PIAEELDAR--GVPYIFVTGYA 83
+ + +P + ++
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQN 85


73  XADLMG695_RS22250  XADLMG695_RS10110  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS22250  -1      12       3.300566   hypothetical protein
XADLMG695_RS10060  0       12       3.457239   ATP-dependent DNA helicase
XADLMG695_RS23265  1       11       3.927045   tRNA
XADLMG695_RS10070  -1      12       3.340711   ADP-ribosylglycohydrolase family protein
XADLMG695_RS10075  -2      13       3.850632   energy transducer TonB
XADLMG695_RS10080  0       13       4.097561   glutathione synthase
XADLMG695_RS10085  0       12       3.146012   twitching motility response regulator PilG
XADLMG695_RS10095  -3      11       2.307367   response regulator
XADLMG695_RS10100  -2      10       1.555289   chemotaxis protein CheW
XADLMG695_RS10105  1       13       0.642350   methyl-accepting chemotaxis protein
XADLMG695_RS10110  3       12       1.473706   Hpt domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS10080  BACINVASINC   29     0.006    Salmonella/Shigella invasin protein C signature.
		>BACINVASINC#Salmonella/Shigella invasin protein C signature.

Length = 409

Score = 29.5 bits (65), Expect = 0.006
Identities = 25/97 (25%), Positives = 40/97 (41%), Gaps = 6/97 (6%)

Query: 69 RDTAKSKRQAGDLAGAAAALDQALGLVSGDPAILQERAEVSVLQADWPAAERFAKQAIDL 128
R A+ + GDL + + S A QER+E + Q + A + +A +
Sbjct: 315 RIDARKMQMTGDLIMKNSVTVGGIAGASRQYAATQERSEQQISQVNNRVASTASDEARES 374

Query: 129 GSKTGPLCRRHWATIEQSRLARGEKENAASAKAQIAG 165
K+ L + T+E ++ ASA A IAG
Sbjct: 375 SRKSTSLIQEMLKTMESI------NQSKASALAAIAG 405


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS10105  PF03544       132    3e-39    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 132 bits (332), Expect = 3e-39
Identities = 42/262 (16%), Positives = 88/262 (33%), Gaps = 37/262 (14%)

Query: 11 MDDGRRLMMTLVISLLLHGVLILGVGFAVSEDAPLVPTLDVIFSQTSAPLTPKQADFLAQ 70
+D RR ++S+ +HG ++ G+ + +P AP P +A
Sbjct: 8 LDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELP----------APAQPISVTMVAP 57

Query: 71 ANQQGGGDHDTAQRPRDSQPGVVPQDRTGLAPQAQRATSVNAPEPTQTRVVTSRRGEQAV 130
A D P P+ P+ + P +
Sbjct: 58 A--------DLEPPQAVQPP---PEPVVEPEPEPEPIPEPPKEAPVV------------I 94

Query: 131 PTPQPNPQTDPLTPAEAQRIQRDAEMARLAAEVHLRSEQYAKRPNRKFVSASTREYAYAN 190
P+P P+ P + ++ +RD + + A+ + +A+++
Sbjct: 95 EKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVA 154

Query: 191 YLRAWVDRAERVGNLNYPDDARRRRLGGKVVISVGVRRDGSVESSRVLVSSGVPLLDDAA 250
+ R YP A+ R+ G+V + V DG V++ ++L + + +
Sbjct: 155 SGPRALSRN----QPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREV 210

Query: 251 LRVVQLAQPFPPLPKTKDDVDI 272
++ + P P + V+I
Sbjct: 211 KNAMRRWRYEPGKPGSGIVVNI 232


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS10115  HTHFIS        73     2e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.3 bits (180), Expect = 2e-18
Identities = 28/115 (24%), Positives = 49/115 (42%), Gaps = 2/115 (1%)

Query: 15 KVMVIDDSKTIRRTAETLLKREGCEVVTATDGFEALAKIADQQPQIIFVDIMMPRLDGYQ 74
++V DD IR L R G +V ++ IA ++ D++MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 75 TCALIKGNQLFKSTPVIMLSSKDGLFDKARGRIVGSEQYLTKPFTREELLSAIRT 129
IK + PV+++S+++ + G+ YL KPF EL+ I
Sbjct: 65 LLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS10120  HTHFIS        88     1e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.0 bits (218), Expect = 1e-23
Identities = 35/116 (30%), Positives = 57/116 (49%), Gaps = 2/116 (1%)

Query: 2 ARIILIEDSPTDRAVFSQWLEKAGHTVVATDNAEEGLELVRSQAPDLVLMDVVLPGMSGF 61
A I++ +D R V +Q L +AG+ V T NA + + DLV+ DVV+P + F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 QATRALARDQATKDIPVLLVSTKGMETDRAWGLRQGASDYIVKPPREDDLIARIRQ 117
+ + + D+PVL++S + +GA DY+ KP +LI I +
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS10135  HTHFIS  68  4e-13  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.5 bits (165), Expect = 4e-13
Identities = 24/116 (20%), Positives = 53/116 (45%), Gaps = 2/116 (1%)

Query: 2276 QVPLVMVVDDSLTMRKVTSRVLERHNLDVTTARDGVEALELLEERVPDLMLLDIEMPRMD 2335
++V DD +R V ++ L R DV + + DL++ D+ MP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 2336 GYELATAMRADPRFKAVPIVMITSRSGEKHRQRAFEIGVQRYLGKPYQELDLMRNV 2391
++L ++ +P++++++++ +A E G YL KP+ +L+ +
Sbjct: 62 AFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115


74  XADLMG695_RS10415  XADLMG695_RS10460  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS10415  -2  12  0.487247  hybrid sensor histidine kinase/response
XADLMG695_RS10420  -3  22  -0.411489  hypothetical protein
XADLMG695_RS10430  -2  19  -0.323805  hybrid sensor histidine kinase/response
XADLMG695_RS10435  -2  17  -0.208861  hybrid sensor histidine kinase/response
XADLMG695_RS10440  -2  17  -0.199532  multidrug transporter subunit MdtD
XADLMG695_RS10445  -2  16  -0.126513  uroporphyrinogen decarboxylase
XADLMG695_RS10450  -1  9  1.166777  WGR domain-containing protein
XADLMG695_RS10455  -2  15  2.016528  3-dehydroquinate synthase
XADLMG695_RS10460  -2  16  2.132487  shikimate kinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS10430  HTHFIS  71  2e-14  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 70.6 bits (173), Expect = 2e-14
Identities = 24/126 (19%), Positives = 51/126 (40%)

Query: 1063 LLLVEDDATVAQVIVGLLQTRGHHVTHVLHGLAALAEVSTRNFDAGLCDLDLPGLDGAAL 1122
+L+ +DDA + V+ L G+ V + ++ + D + D+ +P + L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 1123 VAQLRARGVRFPIVAVTARADTDAEPQAMAAGCNGFLRKPVTGDLLAQALARVLADADDG 1182
+ +++ P++ ++A+ +A G +L KP L + R LA+
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 1183 QRDREA 1188
E
Sbjct: 126 PSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS10440  HTHFIS  72  4e-15  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.2 bits (177), Expect = 4e-15
Identities = 21/101 (20%), Positives = 46/101 (45%)

Query: 1052 RILLVEDDPTVAEVISGLLTNRGHRVVHAAHGLAALSETVDGGFDIALLDLDLPGLDGFA 1111
IL+ +DD + V++ L+ G+ V ++ G D+ + D+ +P + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1112 LASQLRQLGHRFPLLAVTARADSAAQTQAKAAGFDGFMRKP 1152
L ++++ P+L ++A+ +A G ++ KP
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP 105


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS10450  HTHFIS  76  4e-16  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 4e-16
Identities = 29/121 (23%), Positives = 51/121 (42%)

Query: 1058 RILLVEDDPTIAEVIIGLLRAQGHSVVHAPHGLAALTEAADNTFDLALLDLDLPGLDGFA 1117
IL+ +DD I V+ L G+ V + A DL + D+ +P + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1118 LARQLRVFGYEMPLIAVTARSDEAAEPSAQEAGFDRFLRKPLTGEMLATTIAEALRHARA 1177
L +++ ++P++ ++A++ A E G +L KP L I AL +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 1178 R 1178
R
Sbjct: 125 R 125


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS10455  TCRTETB  126  4e-34  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 126 bits (319), Expect = 4e-34
Identities = 85/408 (20%), Positives = 176/408 (43%), Gaps = 17/408 (4%)

Query: 17 LLWLVSLAIFMQMLDATIVNTALPSMARSLRESPLQMQSVVFSYALAVAMFIPASGWIAD 76
L+WL L+ F +L+ ++N +LP +A + P V ++ L ++ G ++D
Sbjct: 16 LIWLCILS-FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 77 RFGTRRTFLAAIIVFTLGSLLCAAAQ-HLPQLVAARVVQGIGGAMLLPVGRLAVLKTVAR 135
+ G +R L II+ GS++ L+ AR +QG G A + + V + + +
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 136 ADFLRAMSFIAIPALIGPLIGPTLGGWLVEVASWHWVFLINLP-IGVLGFIAALKIMPDH 194
+ +A I +G +GP +GG + HW +L+ +P I ++ +K++
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAH--YIHWSYLLLIPMITIITVPFLMKLLKKE 192

Query: 195 YGDARKRFDLVGYLMLAFGMVALSLALDGISELGLRHAFVMLLAIGGLAALAGYWLHAGN 254
+ FD+ G ++++ G+V L S F+++ + + + H
Sbjct: 193 -VRIKGHFDIKGIILMSVGIVFFMLFTTSYSIS-----FLIVSVL----SFLIFVKHIRK 242

Query: 255 TPNALFPLALFKVASYRIGILGNLFARVGSGSMPFLIPLLLQVGLGMSPMNAG-LMMVPV 313
+ L K + IG+L ++P +++ +S G +++ P
Sbjct: 243 VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302

Query: 314 ALAGMAAKRAAVKLVGRFGYRRVLMLNTVLVGVAMASFALIDIGQPLWLRLVQLACFGAV 373
++ + LV R G VL + + V+ + + + ++ ++ + G +
Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL 362

Query: 374 NSLQFTVMNTVTLRDLDRDQASPGNSLLSMVMMLATGFGAAAAGSLLA 421
+ + TV++T+ L + +A G SLL+ L+ G G A G LL+
Sbjct: 363 SFTK-TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS10460  RTXTOXINA  31  0.005  Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family

signature.
Length = 1024

Score = 31.5 bits (71), Expect = 0.005
Identities = 33/126 (26%), Positives = 45/126 (35%), Gaps = 15/126 (11%)

Query: 174 ELLHHLLGTVTDAVIAYLAAQRAAGAQALQVFDTWGGVLSPAMYREFSLPYLTRIARELE 233
EL +LG V + Y+ AQRA AQ L G+++ A+ S IA + +
Sbjct: 274 ELTTKVLGNVGKGISQYIIAQRA--AQGLSTSAAAAGLIASAVTLAISPLSFLSIADKFK 331

Query: 234 R----GTGAERTP---------LVLFGKGNGAYVADLAASGAEAVGVDWTISLADAAQRA 280
R ++R L F K GA A L V IS A
Sbjct: 332 RANKIEEYSQRFKKLGYDGDSLLAAFHKETGAIDASLTTISTVLASVSSGISAAATTSLV 391

Query: 281 GGRVAL 286
G V+
Sbjct: 392 GAPVSA 397


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS10475  CARBMTKINASE  27  0.035  Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 27.1 bits (60), Expect = 0.035
Identities = 10/26 (38%), Positives = 17/26 (65%)

Query: 60 RQHETETLQALLEQDNKLISTGGGAV 85
E ET++ L+E+ +I++GGG V
Sbjct: 172 GHVEAETIKKLVERGVIVIASGGGGV 197


75  XADLMG695_RS11160  XADLMG695_RS11200  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS11160  2  19  0.527250  chemotaxis protein CheA
XADLMG695_RS11165  3  17  1.005576  STAS domain-containing protein
XADLMG695_RS11170  3  17  0.602740  transcription-repair coupling factor
XADLMG695_RS11180  1  15  1.330518  N-acetyltransferase
XADLMG695_RS11185  1  13  1.537505  23S rRNA (adenine(2030)-N(6))-methyltransferase
XADLMG695_RS11190  1  13  1.635812  two-component system response regulator CreB
XADLMG695_RS11200  -1  11  1.369158  two-component system sensor histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS11180  PF06580  36  3e-04  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.4 bits (84), Expect = 3e-04
Identities = 12/52 (23%), Positives = 23/52 (44%), Gaps = 8/52 (15%)

Query: 400 LVRNAMDHGIEPADVRVARGKPARGTVGLNAYHDSGSIVIQITDDGGGLNRD 451
LV N + HGI P G + L D+G++ +++ + G ++
Sbjct: 263 LVENGIKHGIAQ--------LPQGGKILLKGTKDNGTVTLEVENTGSLALKN 306


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS11200  SACTRNSFRASE  35  6e-05  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 34.9 bits (80), Expect = 6e-05
Identities = 11/53 (20%), Positives = 21/53 (39%)

Query: 109 ILVSSFVAGQGLGRQLMRKLVKWARRKYLDCLFGDVLQSNVPMLQLAESLGFK 161
I V+ +G+G L+ K ++WA+ + L + N+ F
Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS11215  HTHFIS  83  1e-20  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.0 bits (205), Expect = 1e-20
Identities = 36/139 (25%), Positives = 62/139 (44%), Gaps = 1/139 (0%)

Query: 4 SAARVLVVEDEAAIADTVLYALRSEGYAPEHCLLGRDALTRLRADPADVVILDVGLPDIN 63
+ A +LV +D+AAI + AL GY + A D+V+ DV +PD N
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 64 GFEVCRTLRS-FSEVPVIFLTARNDEIDRVLGLELGADDYMAKPFSPRELVARVRARLRR 122
F++ ++ ++PV+ ++A+N + + E GA DY+ KPF EL+ + L
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 123 RHAGAAAESGWQPHGAFAI 141
+ G +
Sbjct: 122 PKRRPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS11220  PF06580  36  3e-04  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.6 bits (82), Expect = 3e-04
Identities = 25/103 (24%), Positives = 40/103 (38%), Gaps = 23/103 (22%)

Query: 384 LLENA----IAFSKQDSHVRLHARLRDGRWELVVEDRGSGVPDYALERVFERFYSLARPQ 439
L+EN IA Q + L +G L VE+ GS +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA-----------------LK 305

Query: 440 TGQRSSGLGLPFVRE-VARLHGGDVMLG-NRHGGGARAVLRLP 480
+ S+G GL VRE + L+G + + + G A++ +P
Sbjct: 306 NTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


76  XADLMG695_RS11255  XADLMG695_RS11300  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS11255  0  13  -0.267834  TetR/AcrR family transcriptional regulator
XADLMG695_RS11260  -1  12  -0.420627  efflux RND transporter periplasmic adaptor
XADLMG695_RS11265  0  11  -0.153877  efflux RND transporter permease subunit
XADLMG695_RS11270  -1  11  -0.122300  efflux transporter outer membrane subunit
XADLMG695_RS11275  -1  11  -0.031341  TetR/AcrR family transcriptional regulator
XADLMG695_RS11280  -1  13  -0.638769  cupin domain-containing protein
XADLMG695_RS11285  2  14  1.541555  AraC family transcriptional regulator
XADLMG695_RS11290  3  16  2.727875  hypothetical protein
XADLMG695_RS11295  1  13  2.509157  LysR family transcriptional regulator
XADLMG695_RS11300  1  13  2.174252  MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS11270  HTHTETR  73  4e-18  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 73.1 bits (179), Expect = 4e-18
Identities = 24/180 (13%), Positives = 53/180 (29%), Gaps = 10/180 (5%)

Query: 5 ENPMRVRTEEKREAIVQAASEVFLELGFEGASMSQIAARVGGSKRTLYGYFPSKEELFVA 64
+ +E R+ I+ A +F + G S+ +IA G ++ +Y +F K +LF
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 65 VAKDMSDRYFDPLLHALSQSSGPVDEAL-QRFGEDVLRFLCAPPNITSWQTIIGVSGRSA 123
+ + + + E ++ L + + ++ +
Sbjct: 62 IW----ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKC 117

Query: 124 VGALYFSAGQEEGIQRFAEYLQAQVDCGLLHCEDTLLAAHQYAALLESETLMPCLFGALK 183
+ QR L + A A L + + G +
Sbjct: 118 EFV--GEMAVVQQAQRNLCLESYDRIEQTL---KHCIEAKMLPADLMTRRAAIIMRGYIS 172


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS11275  RTXTOXIND  42  3e-06  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 42.1 bits (99), Expect = 3e-06
Identities = 20/112 (17%), Positives = 37/112 (33%), Gaps = 12/112 (10%)

Query: 68 EIRPQVGGIIQSRQFTEGGDVKAGQTLYQIDPAQYRASYASAQASLAKAEATLRTAQLKA 127
EI+P I++ EG V+ G L ++ A+A K +++L A+L+
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALG-------AEADTLKTQSSLLQARLEQ 150

Query: 128 ERYKELAQIKAISQQEGDDTDAALGQAKADVAAGKASVETARINLAFARLDA 179
RY+ L E + + +L +
Sbjct: 151 TRYQIL-----SRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFST 197



Score = 29.8 bits (67), Expect = 0.020
Identities = 14/36 (38%), Positives = 17/36 (47%), Gaps = 1/36 (2%)

Query: 67 SEIRPQVGGIIQSRQ-FTEGGDVKAGQTLYQIDPAQ 101
S IR V +Q + TEGG V +TL I P
Sbjct: 328 SVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPED 363


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS11280  ACRIFLAVINRP  1220  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1220 bits (3157), Expect = 0.0
Identities = 666/1034 (64%), Positives = 809/1034 (78%), Gaps = 3/1034 (0%)

Query: 1 MARFFIDRPIFAWVLAIIVMLAGILSIATLPIAQYPSIAPPAVAITANYPGASAQTLEDT 60
MA FFI RPIFAWVLAII+M+AG L+I LP+AQYP+IAPPAV+++ANYPGA AQT++DT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQKMKGLDHLSYMASTSESSGAVTITLTFENGTDPDTAQVQVQNKLSLATPLLPQ 120
VTQVIEQ M G+D+L YM+STS+S+G+VTITLTF++GTDPD AQVQVQNKL LATPLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQQQGVTVTKSATNFLNVLAFTSEDGSMSDSDLSDYVAANVQETISRVEGVGDTTLFGS 180
EVQQQG++V KS++++L V F S++ + D+SDYVA+NV++T+SR+ GVGD LFG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRVWMDPNKLSNFNLTPVDVRNALQAQNAQISAGQLGALPAVANQQLNATITAQTRL 240
QYAMR+W+D + L+ + LTPVDV N L+ QN QI+AGQLG PA+ QQLNA+I AQTR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 KTAEQFESILLRTQSDGAQVRLRDVARIELGSESYNTVGRYNGKPAAGLAIKLATGANAL 300
K E+F + LR SDG+ VRL+DVAR+ELG E+YN + R NGKPAAGL IKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTVRAIDKSLEEQEKFFPPGMKVQKPYDTTPFVRISIEQVVHTLVEAVVLVFLVMYLFLQ 360
DT +AI L E + FFP GMKV PYDTTPFV++SI +VV TL EA++LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATLIPTIAVPVVLLGTFGVLAAFGFTINTLTMFAMVLAIGLLVDDAIVVVENVERVM 420
N RATLIPTIAVPVVLLGTF +LAAFG++INTLTMF MVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 SEEQLSPKDATRKSMDQISGALIGVALVLAAVFVPMAFFSGSTGVIYRQFSITIVSAMTL 480
E++L PK+AT KSM QI GAL+G+A+VL+AVF+PMAFF GSTG IYRQFSITIVSAM L
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVAMILTPALCATLLKPVHKGHGLATTGFFGWFNRLFDRGNTGYQGVVRHMLGKGWRY 540
SVLVA+ILTPALCATLLKPV H GFFGWFN FD Y V +LG RY
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 541 MLAYAALLALVVFGFMKLPVGFLPDEDQGTLFVLVQLPPGATNARTSDVLKQVEHHFLVD 600
+L YA ++A +V F++LP FLP+EDQG ++QLP GAT RT VL QV ++L +
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 601 QKDSVAGVFAVTGFSFAGSGQNVGFAFVKLRPWDERTGKGQSVTDVAAKAGAFFAGIRDA 660
+K +V VF V GFSF+G QN G AFV L+PW+ER G S V +A IRD
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 661 KVFAFAPPAVSELGNATGFDLMLQDRANLGHAALMQARNQLLAELSQD-KRLVAVRPNGQ 719
V F PA+ ELG ATGFD L D+A LGH AL QARNQLL +Q LV+VRPNG
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 720 EDTPEFKLEIDPHKAQAMGVSISDINDTFSSAWGSTYVNDFIDKGRVKKVMLQADAPYRM 779
EDT +FKLE+D KAQA+GVS+SDIN T S+A G TYVNDFID+GRVKK+ +QADA +RM
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 780 NPQDIDHWFVRNSAGTMVPFNAFATASWQSGSPRLERYNSVPSMEILGMALPGAASSGEA 839
P+D+D +VR++ G MVPF+AF T+ W GSPRLERYN +PSMEI G A PG SSG+A
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPG-TSSGDA 839

Query: 840 MQIVEAAAAKLPPGIGFEWTGLSRQEKASSGQTGLLYSVSILIVFLCLAALYESWAIPFS 899
M ++E A+KLP GIG++WTG+S QE+ S Q L ++S ++VFLCLAALYESW+IP S
Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 900 VILVVPLGVFGTLLAAMLTWKMNDVYFQVGLLTTIGLASKNAILIVEFARELHE-SGKSL 958
V+LVVPLG+ G LLAA L + NDVYF VGLLTTIGL++KNAILIVEFA++L E GK +
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 959 VAAALEAARMRLRPILMTSLAFILGVVPLVLTSGAGAGAQHALGTAVIGGMVSGTVLAIF 1018
V A L A RMRLRPILMTSLAFILGV+PL +++GAG+GAQ+A+G V+GGMVS T+LAIF
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 1019 FVPLFFVLVCGLFQ 1032
FVP+FFV++ F+
Sbjct: 1020 FVPVFFVVIRRCFK 1033



Score = 56.8 bits (137), Expect = 5e-10
Identities = 45/334 (13%), Positives = 109/334 (32%), Gaps = 18/334 (5%)

Query: 714 VRPNGQEDTPEFKLEIDPHKAQAMGVSISDINDTFSSA----WGSTYVNDFIDKGRVKKV 769
V+ G + ++ +D ++ D+ + G+
Sbjct: 175 VQLFGAQY--AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNA 232

Query: 770 MLQADAPYRMNPQDIDHWFVR-NSAGTMVPFNAFATASWQSGSPR-LERYNSVPSMEILG 827
+ A ++ NP++ +R NS G++V A + + R N P+ +
Sbjct: 233 SIIAQTRFK-NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGI 291

Query: 828 MALPGA---ASSGEAMQIVEAAAAKLPPGIGFEWTGLSRQEKASSGQTGLLYSV--SILI 882
GA ++ + P G+ + ++ ++ +I++
Sbjct: 292 KLATGANALDTAKAIKAKLAELQPFFPQGMKVLYP-YDTTPFVQLSIHEVVKTLFEAIML 350

Query: 883 VFLCLAALYESWAIPFSVILVVPLGVFGTLLAAMLTWKMNDVYFQVGLLT-TIGLASKNA 941
VFL + ++ + VP+ + GT A + + + + + IGL +A
Sbjct: 351 VFLVMYLFLQNMRATLIPTIAVPVVLLGTF-AILAAFGYSINTLTMFGMVLAIGLLVDDA 409

Query: 942 ILIVE-FARELHESGKSLVAAALEAARMRLRPILMTSLAFILGVVPLVLTSGAGAGAQHA 1000
I++VE R + E A ++ ++ ++ +P+ G+
Sbjct: 410 IVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQ 469

Query: 1001 LGTAVIGGMVSGTVLAIFFVPLFFVLVCGLFQRR 1034
++ M ++A+ P +
Sbjct: 470 FSITIVSAMALSVLVALILTPALCATLLKPVSAE 503


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS11285  RTXTOXIND  34  0.001  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 34.4 bits (79), Expect = 0.001
Identities = 33/202 (16%), Positives = 59/202 (29%), Gaps = 28/202 (13%)

Query: 229 SQLTLRQAQTTVETARVDVERYTA-QVAQDRNALVLLVGRSVPVELLPHALPDNASVEGN 287
++ + Q+++ AR++ RY + + N L P LP P +V
Sbjct: 132 AEADTLKTQSSLLQARLEQTRYQILSRSIELNKL--------PELKLP-DEPYFQNVSEE 182

Query: 288 VLASVPAGLPSQLLQRRPDILEAERNLRAANANIGAARAAFFPSISLTASTGSSSSSLSR 347
+ + + + Q + + E NL A A +L+ S S
Sbjct: 183 EVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS 242

Query: 348 LFDAGTRAWSFVPTLTLPIFNAGRNRANLDMAKANRDIEVARYEKSIQSA---------- 397
L + + A ++ +E E I SA
Sbjct: 243 LLHKQ-----AIAKHAVLEQENKYVEAVNELRVYKSQLEQI--ESEILSAKEEYQLVTQL 295

Query: 398 -FREVSDALAQRDTLGRQLQAQ 418
E+ D L Q L +
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLE 317


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS11290  HTHTETR  63  2e-14  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 63.5 bits (154), Expect = 2e-14
Identities = 40/211 (18%), Positives = 67/211 (31%), Gaps = 16/211 (7%)

Query: 5 APPVAPRRAPHEKRGAILAAAGVLFQQHGFDRTSMDTIAERAMVSKATVYAHFASKEVLF 64
A R IL A LF Q G TS+ IA+ A V++ +Y HF K LF
Sbjct: 2 ARKTKQEAQET--RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59

Query: 65 RTTLEALAQASPNRWTALLELQGPLERRLAAVADAVLRVSASSMRDDAAYGLVRPPLLPG 124
E + N LE Q +V +L S + L+ +
Sbjct: 60 ---SEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHK 116

Query: 125 QMREEMWTLCFGRYDTM-------MRTLLAREVQRGALVIDNVPDASVH-FFGLMTGRPA 176
+ + + L ++ L D + + G ++G
Sbjct: 117 CEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLME 176

Query: 177 TAAARDDATDAQSVQLDADAYVSGAVALFLR 207
+ D + +A YV+ + ++L
Sbjct: 177 NWLFAPQSFDLKK---EARDYVAILLEMYLL 204


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS11315  TCRTETA  62  1e-12  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 61.8 bits (150), Expect = 1e-12
Identities = 87/368 (23%), Positives = 134/368 (36%), Gaps = 22/368 (5%)

Query: 15 ALLALTIGAFGIGTTEFVIMGLLQQVAADLGVSLSAAGLLISGYALGVFVGAPVLTLASA 74
L + + A GIG V+ GLL+ + + G+L++ YAL F APVL S
Sbjct: 10 ILSTVALDAVGIGLIMPVLPGLLRDLVHS-NDVTAHYGILLALYALMQFACAPVLGALSD 68

Query: 75 RLPRKAVLVGLMLIFTVGNVACALAPDYTSLMVARVLTSLAHGTFFGVGAVVATSLVPAE 134
R R+ VL+ + V A AP L + R++ + T GA +A + +
Sbjct: 69 RFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA-DITDGD 127

Query: 135 RRASAISLMFAGLTVATLLGVPAGAWLGLQLGWRATFWAVAAIGVLATASVAVWVPAAAG 194
RA M A + G G +G A F+A AA+ L + +P +
Sbjct: 128 ERARHFGFMSACFGFGMVAGPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCFLLPESHK 186

Query: 195 AATPASWRQEVAVLQRGQVLLALAITVVGYAGVFAVFTYIQ-----PLLLQVT------G 243
R+ + L A + A + AVF +Q P L V
Sbjct: 187 GERRPLRREALNPL----ASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFH 242

Query: 244 FAQAAVSPVLLVFGV-GMIVGNLLGGRLADR-RPTAALLGSLAALVVVLGALGFALHSKA 301
+ + L FG+ + ++ G +A R AL+ + A L FA
Sbjct: 243 WDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWM 302

Query: 302 AMVAVVGLLGVAAF--ATVAPLQLRVLEHARGAGQNLASSLNIAAFNLGNALGAWLGGVV 359
A +V L A A L +V E +G Q ++L +G L +
Sbjct: 303 AFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAAS 362

Query: 360 IATQAGLV 367
I T G
Sbjct: 363 ITTWNGWA 370


77  XADLMG695_RS12130  XADLMG695_RS12195  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS12130  -1  12  1.051445  GspH/FimT family pseudopilin
XADLMG695_RS12135  0  13  1.078742  type IV pilus modification protein PilV
XADLMG695_RS12140  0  9  1.764946  PilW family protein
XADLMG695_RS12145  -1  10  1.507665  pilus assembly protein
XADLMG695_RS12150  1  12  -3.214898  pilus assembly protein
XADLMG695_RS12160  2  15  -4.165477  type IV pilin protein
XADLMG695_RS12165  2  17  -4.540245  *GspH/FimT family pseudopilin
XADLMG695_RS12170  3  20  -5.883270  excinuclease ABC subunit UvrB
XADLMG695_RS12175  4  26  -6.983209  *hypothetical protein
XADLMG695_RS12180  3  41  -7.979830  type IV secretory system conjugative DNA
XADLMG695_RS12185  2  48  -9.043187  TcpQ domain-containing protein
XADLMG695_RS12190  4  34  -7.794798  type IV secretion system protein
XADLMG695_RS12195  4  28  -7.123064  TrbG/VirB9 family P-type conjugative transfer
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS22355  BCTERIALGSPG  38  4e-06  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 37.9 bits (88), Expect = 4e-06
Identities = 15/49 (30%), Positives = 29/49 (59%)

Query: 5 RSRGFTLIELMVTIAVLAIVVAIGYPSFQGVLRSNRVAAANNELIALLN 53
+ RGFTL+E+MV I ++ ++ ++ P+ G A ++++AL N
Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALEN 54


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS12185  PilS_PF08805  35  9e-05  PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 34.5 bits (79), Expect = 9e-05
Identities = 28/125 (22%), Positives = 45/125 (36%), Gaps = 7/125 (5%)

Query: 5 ARFNARSSRGFTLIEVLIAILVLAFGLLGFALLQTMNVRFVQSANYRTQATNLAYDLTDQ 64
AR +G TL+EVL+ + V+ L +M +QS+N + + ++
Sbjct: 18 ARRKKEQDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLTVIANMKSL 77

Query: 65 MRSNRYLVTQYTAATFAAGSVTPTGACAYPTGTAVPVAQNIARWQCQVA-KALGDKAAAT 123
RY + Y +A G + T A+N W V DK +
Sbjct: 78 KFQGRYTDSNYIKTLYAQGLLPSDM----IADTTGASAKNP--WGGSVTITTSSDKYSFN 131

Query: 124 VTYVN 128
V N
Sbjct: 132 VVEAN 136


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS12200  BCTERIALGSPG  50  7e-11  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 50.3 bits (120), Expect = 7e-11
Identities = 25/106 (23%), Positives = 49/106 (46%), Gaps = 18/106 (16%)

Query: 1 MKRTAAQVRGFTLIELMIVVAVVAILSAIAYPSYTEHVRKSRRAQAKVDLVEYGQLAERF 60
M+ T Q RGFTL+E+M+V+ ++ +L+++ P+ + K+ + +A D+V
Sbjct: 1 MRATDKQ-RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIV--------- 50

Query: 61 HTVQNTYSGFTLPTNVSPR-EGGTAAYTLALTQQ------TQSGYV 99
++N + L + P G + A T + GY+
Sbjct: 51 -ALENALDMYKLDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYI 95


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS12210  BCTERIALGSPH  29  0.006  Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 29.1 bits (65), Expect = 0.006
Identities = 16/107 (14%), Positives = 40/107 (37%), Gaps = 13/107 (12%)

Query: 7 LSARGYTAVQLLIVMAVIGIGAAIGVPSFKSLIEWQRATTRVHLLTAHLAMARSLAVTQG 66
+ RG+T +++++++ ++G+ A + + +F + + A + A L + + G
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRD-DSAAQTLARFEAQLRFVQQRGLQTG 59

Query: 67 EPVSLCPSTDGTRCRTDRIWSQGWILFKDPGRGGQPPTSASVIRAEY 113
+ + + W R G P A + Y
Sbjct: 60 QFFGV------------SVHPDRWQFLVLEARDGADPAPADDGWSGY 94


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS12240  PF04335  212  3e-70  VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 212 bits (542), Expect = 3e-70
Identities = 52/230 (22%), Positives = 104/230 (45%), Gaps = 12/230 (5%)

Query: 14 QVGAAVQKAVNYEVSIADLARRSEKRAWIVATLSMLVTVMTAGGYYYMLPLKEKVPYLVM 73
++ A ++A ++E A RS+K AW+VA ++ + + PLK PY++
Sbjct: 9 ELKAYFEEAASWERDKLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVIT 68

Query: 74 ADAYSGTSTIAKLEPNFGGRAISTSEALARSNIARFIIARESYDASNISDRDWNTVVAMA 133
D +G ++I G I+ EA+ + +A ++ RE + A+ + ++ V+ M+
Sbjct: 69 VDRNTGEASI--AAKLHGDATITYDEAVRKYFLATYVRYREGWIAAAREE-YFDAVMVMS 125

Query: 134 TTGVLAEYRALHAANNAARPFNVYGRNRAIRISILSITLIGGKGKPFTGATVRFQRSLYD 193
+ + +N P N+ + + I ++ +GG A V F +
Sbjct: 126 ARPEQDRWSRFYKTDNPQSPQNILANRTDVFVEIKRVSFLGGN-----VAQVYFTKESVT 180

Query: 194 KSSTVSTLLDNKIATMEFAYQDNLQMSDDLRVENPLGFRVSDYRVDNDYS 243
S++ T + +AT+++ D + R +NPLG++V YR D +
Sbjct: 181 GSNSTKT---DAVATIKYKV-DGTPSKEVDRFKNPLGYQVESYRADVEVP 226


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS12245  TYPE4SSCAGX  36  1e-04  Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein

signature.
Length = 522

Score = 35.9 bits (82), Expect = 1e-04
Identities = 26/89 (29%), Positives = 42/89 (47%), Gaps = 10/89 (11%)

Query: 44 TGLGITTQVELSPNEKILDYSTGFTGGWELTRRENVFYLKPKNVDVD-------TNMMIR 96
T L T ++L +E I +TGF GW + N +++PK+V + N +
Sbjct: 59 TSLDNVTVIQLEKDETISYITTGFNKGWSIVPNSNHIFIQPKSVKSNLMFEKEAVNFALM 118

Query: 97 TATHSYILELK---VVATDWQRLEQAKQA 122
T + L+ K V A D + LE+ K+A
Sbjct: 119 TRDYQEFLKTKKLIVDAPDPKELEEQKKA 147



Score = 28.6 bits (63), Expect = 0.027
Identities = 10/27 (37%), Positives = 17/27 (62%)

Query: 165 YDYDYSTRTKKSWLVPSRVYDDGKFTY 191
Y+Y + + ++PS ++DDG FTY
Sbjct: 401 YNYYQAPEKRSKHIMPSEIFDDGTFTY 427


78  XADLMG695_RS23300  XADLMG695_RS12830  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
XADLMG695_RS23300  0  16  1.905375  protein translocase subunit SecD
XADLMG695_RS12750  -1  17  1.393262  protein translocase subunit SecF
XADLMG695_RS12760  0  18  0.287316  hypothetical protein
XADLMG695_RS12765  0  13  -1.148641  hypothetical protein
XADLMG695_RS12770  0  12  -2.398837  hypothetical protein
XADLMG695_RS12775  1  12  -4.103195  DUF1629 domain-containing protein
XADLMG695_RS12780  2  13  -4.520190  hypothetical protein
XADLMG695_RS12785  5  28  -8.460189  hypothetical protein
XADLMG695_RS12790  6  36  -4.992007  carbohydrate porin
XADLMG695_RS12795  6  34  -4.403005  PTS fructose transporter subunit IIBC
XADLMG695_RS22400  4  24  -3.365048  1-phosphofructokinase
XADLMG695_RS23305  2  18  -1.740205  phosphoenolpyruvate--protein phosphotransferase
XADLMG695_RS12800  2  15  -1.014424  LacI family DNA-binding transcriptional
XADLMG695_RS12805  0  12  1.159167  multidrug efflux RND transporter permease
XADLMG695_RS12810  -1  18  3.015542  efflux RND transporter periplasmic adaptor
XADLMG695_RS12815  -1  20  2.365821  NAD-glutamate dehydrogenase
XADLMG695_RS12820  -2  20  2.961680  DHA2 family efflux MFS transporter permease
XADLMG695_RS12825  -2  17  2.628938  response regulator
XADLMG695_RS12830  -2  18  1.358702  response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS12780  SECFTRNLCASE  88  4e-21  Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 88.4 bits (219), Expect = 4e-21
Identities = 36/175 (20%), Positives = 83/175 (47%), Gaps = 3/175 (1%)

Query: 439 VIGPSLGAENVERGVTAVVYSFLFTLVFFTIYYRVFGAITSV-ALLFNLLIVVAVMSLFG 497
+GP + E V V +++ + + + + + + A+ +V AL+ ++L+ V + ++
Sbjct: 142 SVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTVGLFAVLQ 201

Query: 498 ATMTLPGFAGLALSVGLSVDANVLINERIREELRL--GVPAKSAIAAGYEKAGGTILDAN 555
L A L G S++ V++ +R+RE L +P + + + +
Sbjct: 202 LKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNETLSRTVMTG 261

Query: 556 LTGLIVAVALYAFGTGPLKGFALTMMIGIFASMFTAITVSRALAVLIYGSRKKLK 610
+T L+ V + +G ++GF M+ G+F ++++ V++ + + I R K K
Sbjct: 262 MTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLDRNKEK 316


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS12785  SECFTRNLCASE  282  2e-96  Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 282 bits (724), Expect = 2e-96
Identities = 98/320 (30%), Positives = 160/320 (50%), Gaps = 10/320 (3%)

Query: 4 FPLHLIPNDTKIDFMSWRKPVLILMLVLAVASVGIIVGKGFNYALEFTGGTLVQTSFQKT 63
F L L+P T DF W+ +V+ +ASV + + G N+ ++F GGT ++T
Sbjct: 3 FRLKLVPEKTNFDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTA 62

Query: 64 VDVDQVREKLSKAGFENAQVQNAR------GGNEVMIRLQPHGQNNNRDDAAR---TVAE 114
+DV R L + + R + MIR+Q + +
Sbjct: 63 IDVGVYRAALEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVN 122

Query: 115 DVRKAVTSDENPATVQPGEFVGPQVGKDLALNGVYATVFMLVGFLIYIAFRFEWKFAVVA 174
V A+T+ + + E VGP+V +L V++ + V + YI RFEW+FA+ A
Sbjct: 123 KVETALTAVDPALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGA 182

Query: 175 SLTALFDLLVTVAFVSLTGREFDLTVLAGLLSVMGFAINDIIVVFDRVRENFRALRVEPL 234
+ + D+L+TV ++ +FDLT +A LL++ G++IND +VVFDR+REN + PL
Sbjct: 183 VVALVHDVLLTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPL 242

Query: 235 -EVLNRSINQTLSRTVITAVMFFLSALALYIYGGESMEGLAETHMIGAVIVVISSVIVAV 293
+V+N S+N+TLSRTV+T + L+ + + I+GG+ + G + G SSV VA
Sbjct: 243 RDVMNLSVNETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAK 302

Query: 294 PMLSIGPFAVTKQDLLPKAK 313
++ K+ P K
Sbjct: 303 NIVLFIGLDRNKEKKDPSDK 322


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS12805  IGASERPTASE  54  6e-09  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 53.5 bits (128), Expect = 6e-09
Identities = 37/238 (15%), Positives = 78/238 (32%), Gaps = 20/238 (8%)

Query: 725 QARMQASVAAQARQEREQQERVAQEQHVAQVREHLQQAQPEHE-DRSQSEQAVQAQAVLE 783
A + Q + E+ E+ A E AQ RE ++A+ + + +E A E
Sbjct: 1036 TTETVAENSKQESKTVEKNEQDATET-TAQNREVAKEAKSNVKANTQTNEVAQSGSETKE 1094

Query: 784 GQRQAEQQRELEERQVQERQADNQQREQQDRQAQETRQVEAQEGQARQAQDQQQQTQALE 843
Q ++ E++ + + + E+ + T QV ++ Q+ Q Q + + +
Sbjct: 1095 TQTTETKETATVEKEEKAK----VETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREND 1150

Query: 844 PTQDQRQQASQQPDTQLHAPELALTQQTTLPQSQEDACSRLETQNQPANERLAPDAHDSL 903
PT + ++ SQ T + + T ++ + + +
Sbjct: 1151 PTVNIKEPQSQTNTT----ADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPAT 1206

Query: 904 KQTSEAGDAQSHLAQGAERALESQAVQSRDTARIQVPLSEGRESGNPPLQSAQADAVS 961
Q + ++ + R++ S S N A D S
Sbjct: 1207 TQPTVNSESSNKPKNRHRRSVRSVPHNVEPAT----------TSSNDRSTVALCDLTS 1254


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
XADLMG695_RS12810  FLAGELLIN  29  0.012  Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 29.2 bits (65), Expect = 0.012
Identities = 19/75 (25%), Positives = 34/75 (45%), Gaps = 7/75 (9%)

Query: 7 AAMAEMMATLNASNTSLQETITVLTTLVASMQQREQRLRDV-VAEQ------LQVLQRAA 59
+ + + ++L A IT L V ++ R+ D A + Q+LQ+A
Sbjct: 429 SKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAG 488

Query: 60 SSADAKVNRVLENAL 74
+S A+ N+V +N L
Sbjct: 489 TSVLAQANQVPQNVL 503


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS12830  PHPHTRNFRASE  577    0.0      Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 577 bits (1488), Expect = 0.0
Identities = 208/568 (36%), Positives = 321/568 (56%), Gaps = 11/568 (1%)

Query: 274 AIVGIGASPGVAIGIVHRLRAAQTEVADQPV-GLGDGGAQLHDALTRTRQQLAAIQDDTQ 332
I GI AS GVAI ++ + + +L AL +++++L AI+D T+
Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTE 63

Query: 333 RRLGASDAAIFKAQAELLNDTDLITR-TCQLMVEGHGVAWSWHQAVEQMASGLAALGNPV 391
+GA A IF A +L+D +L+ ++ E ++ + + S ++ N
Sbjct: 64 ASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNEY 123

Query: 392 LAGRAADLRDVGRRVLAQLDPAAAGAGLTDLPEQPCILLASDLSPSDTANLDTARVLGLA 451
+ RAAD+RDV +RVL L G+ L + E +++A DL+PSDTA L+ V G A
Sbjct: 124 MKERAADIRDVSKRVLGHLIGVETGS-LATIAE-ETVIIAEDLTPSDTAQLNKQFVKGFA 181

Query: 452 TAQGGPTSHTAILSRTLGLPALVAAGGQLLDIEDGVTAIIDGSSGRLYIDPSAQDLDAAR 511
T GG TSH+AI+SR+L +PA+V I+ G I+DG G + ++P+ +++ A
Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241

Query: 512 THIAEQQAIREREAAQRALPAETSDGHHIDIGANVNLPDQVAMALTQGAEGVGLMRTEFL 571
A + ++ A P+ T DG H+++ AN+ P V L G EG+GL RTEFL
Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301

Query: 572 FLESGRTPSEDEQHATYLAMAQALDGRPLIVRALDIGGDKQVAHLELPHEENPFLGVRGA 631
+++ + P+E+EQ Y + Q +DG+P+++R LDIGGDK++++L+LP E NPFLG R
Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361

Query: 632 RLLLRRPDLLEPQLRALYRAAKDGARLSIMFPMITSVPELVALRAICARIRVDLDA---- 687
RL L + D+ QLRAL RA+ G L +MFPMI ++ EL +AI + L +
Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYG-NLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVD 420

Query: 688 --PEVPIGIMIEVPAAAAQADVLARHADFFSIGTNDLTQYVLAIDRQNPELAAEADSLHP 745
+ +GIM+E+P+ A A++ A+ DFFSIGTNDL QY +A DR N ++ HP
Sbjct: 421 VSDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHP 480

Query: 746 AVLRMIRSTIDGARKHERWVGVCGGLAGDAFGASLLAGLGVQELSMTPNDIPAVKARLRG 805
A+LR++ I A +WVG+CG +AGD LL GLG+ E SM+ I +++L
Sbjct: 481 AILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLK 540

Query: 806 AALSQLQQLAEQALACETAEQVRALEAK 833
+ +L+ A++AL +TAE+V L K
Sbjct: 541 LSKEELKPFAQKALMLDTAEEVEQLVKK 568


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS12840  ACRIFLAVINRP  1081   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1081 bits (2796), Expect = 0.0
Identities = 518/1038 (49%), Positives = 706/1038 (68%), Gaps = 17/1038 (1%)

Query: 1 MPKFFIEHPVFAWVVAILISLAGVISILNLGIESYPTIAPPQVTVTANFPGASADTAEKA 60
M FFI P+FAWV+AI++ +AG ++IL L + YPTIAPP V+V+AN+PGA A T +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQQLTGIDHLLYFNSSSAANGRVTITLTFETGTDADIAQVQVQNKVSLATPRLPS 120
VTQVIEQ + GID+L+Y +S+S + G VTITLTF++GTD DIAQVQVQNK+ LATP LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVTQQGVVVAKANAGFLMVAALRSDNPSINRDALNDIVGSRVLEQISRVPGVGSTNQFGA 180
EV QQG+ V K+++ +LMVA SDNP +D ++D V S V + +SR+ GVG FGA
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 EYAMNIWLNPEKLQGYNLSATQVLTAVRNQNVQFAAGSVGADPTPEGISFTATVSAEGRF 240
+YAM IWL+ + L Y L+ V+ ++ QN Q AAG +G P G A++ A+ RF
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 SSPEQFENIILRTDNNGATVRLKDVARVTVGPSNYGFDTQYNGKPTGAFGIQLLPGANAL 300
+PE+F + LR +++G+ VRLKDVARV +G NY + NGKP GI+L GANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 NVSEAVGAKLDELQPTFPQGVTWFAPYESTTFVRISIEEVIHTLVEAIVLVFLVMLLFLQ 360
+ ++A+ AKL ELQP FPQG+ PY++T FV++SI EV+ TL EAI+LVFLVM LFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATVIPTLVIPVALLGTFFGMYMIGFTINQLTLFAMVLAIGIVVDDAIVVIENVERIM 420
N RAT+IPT+ +PV LLGTF + G++IN LT+F MVLAIG++VDDAIVV+ENVER+M
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 SEEHLEPKAATQKAMTQITGAVVAITVVLAAVFIPSSLQPGASGAIYKQFALTIAMSMGF 480
E+ L PK AT+K+M+QI GA+V I +VL+AVFIP + G++GAIY+QF++TI +M
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SAFLALSFTPALCAAFLK---STHSTKKNWVYRTFDKYYDKLAHRYVGVVGHTLKRSPPW 537
S +AL TPALCA LK + H K + F+ +D + Y VG L + +
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 538 MIAFVALVVLCGFLFTRMPGSFLPEEDQGFAVAIVQLPPGATKIRTNEAFAQMRAVLEKQ 597
++ + +V LF R+P SFLPEEDQG + ++QLP GAT+ RT + Q+ K
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 598 PA--VEGMLQIAGFSFLGSGENVGMGFIRLKPWEERDV---TAEQLIQQLNGAFYGIKGA 652
VE + + GFSF G +N GM F+ LKPWEER+ +AE +I + I+
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 653 QIFVVNLPTVQGLGQFGGFDMWLQDRSGAGQEALINARNIVLGKAAEKQDALVGVRPNGL 712
+ N+P + LG GFD L D++G G +AL ARN +LG AA+ +LV VRPNGL
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 713 ENSPQLQLHVDRVQAQSMGLDVSDIYSSIQLMLAPVYVNDYFAEGRIKRVNMRADDQFRA 772
E++ Q +L VD+ +AQ++G+ +SDI +I L YVND+ GR+K++ ++AD +FR
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 773 GPESLRDFFTPSATATGADGQPAMIPLSNVVKAEWNYASPALNRYNGYSAVNIVGNPAPG 832
PE + + S A+G+ M+P S + W Y SP L RYNG ++ I G APG
Sbjct: 781 LPEDVDKLYVRS-----ANGE--MVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPG 833

Query: 833 GSSGQAMSAMEDIVNNDLPPGFGFDWSGMSYQEIIAGNAATLLLALSVVVVFLCLAALYE 892
SSG AM+ ME++ + LP G G+DW+GMSYQE ++GN A L+A+S VVVFLCLAALYE
Sbjct: 834 TSSGDAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYE 892

Query: 893 SWSIPVAVLLVVPIGVLGAITFSMLRGLPNDLYFKIGMITVIGLAAKNAILIVEFAVE-Q 951
SWSIPV+V+LVVP+G++G + + L ND+YF +G++T IGL+AKNAILIVEFA +
Sbjct: 893 SWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLM 952

Query: 952 RAAGKTLREATLEAAHLRFRPILMTSFAFILGVLPLAISTGAGANSRHSIGTGVIGGMVF 1011
GK + EATL A +R RPILMTS AFILGVLPLAIS GAG+ +++++G GV+GGMV
Sbjct: 953 EKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVS 1012

Query: 1012 ATVLGVIFIPLFFVVVRR 1029
AT+L + F+P+FFVV+RR
Sbjct: 1013 ATLLAIFFVPVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS12845  RTXTOXIND     40     1e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 40.2 bits (94), Expect = 1e-05
Identities = 17/108 (15%), Positives = 37/108 (34%)

Query: 59 RSADVRARVDGVVLKRLYTEGANVTEGQPLFQIDPSQLKATLLQAQGQLAAAEATYTNAK 118
RS +++ + +V + + EG +V +G L ++ +A L+ Q L A T +
Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154

Query: 119 IAATRARSLAPQQYVSRADIDTAEANERSSGANVQQARGAVEAARIQL 166
I + + + +E + + Q
Sbjct: 155 ILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQK 202



Score = 32.1 bits (73), Expect = 0.004
Identities = 13/51 (25%), Positives = 24/51 (47%), Gaps = 4/51 (7%)

Query: 59 RSADVRARVDGVVLK-RLYTEGANVTEGQPLFQIDPSQLKATLLQAQGQLA 108
+++ +RA V V + +++TEG VT + L I P L+ +
Sbjct: 326 QASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPED---DTLEVTALVQ 373


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS12860  TCRTETB       113    2e-29    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 113 bits (285), Expect = 2e-29
Identities = 79/411 (19%), Positives = 162/411 (39%), Gaps = 17/411 (4%)

Query: 23 LILACAI-FMEQMDATVLATALPTLARDFGVAAPAMSIAMTSYLLALAVLIPASGAIADR 81
LI C + F ++ VL +LP +A DF + + T+++L ++ G ++D+
Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 82 FGLRRVFGASIWVFVGGSILCSLADS-LPTMVAARVLQGAGGAMMAPLGRLILLRTVERR 140
G++R+ I + GS++ + S ++ AR +QGAG A L +++ R + +
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 141 HLVSAMAWTLVPAFIGPMLGPPLGGFFVSYLDWRWIFYINVPIGIAGFLLVRRFIPEIPS 200
+ A +G +GP +GG Y+ W ++ I + I I + + + +
Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLKKEVR 194

Query: 201 ESAPARFDLRGFLLCGTALGCLLFGLEMVSQQNGVGQASWLLAIGGSAGLG-YLWHARHH 259
FD++G +L + + S I ++ H R
Sbjct: 195 IKGH--FDIKGIILMSVGIVFFMLFTTS---------YSISFLIVSVLSFLIFVKHIRKV 243

Query: 260 PAPLLDLSLLRIASFRLSVIGGALMRITQGAQPFLLPLLFQIGFGMSAAHSGRLILATAL 319
P +D L + F + V+ G ++ T ++P + + +S A G +I+
Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303

Query: 320 GALLMRS-ITPQLLRRFGYRNSLIGNGVLASLGYMVCAFFRPDWPPSVMFGLLLCCGAFM 378
++++ I L+ R G L S+ ++ +F + ++ G +
Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GL 362

Query: 379 SFQFAAYNTIAYENVPAARMSRASSLYTTLQQLMLSVGVCAGAMILNLAML 429
SF +TI ++ SL L G+ +L++ +L
Sbjct: 363 SFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLL 413


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS12865  HTHFIS        64     2e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.7 bits (155), Expect = 2e-13
Identities = 31/145 (21%), Positives = 62/145 (42%), Gaps = 4/145 (2%)

Query: 1 MPSRPLLCVDDESSNLATLRQLL-RDDFALVFAKSGAEALDAVTRHTPKLILLDVELPDM 59
M +L DD+++ L Q L R + + + A + L++ DV +PD
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 DGYAVARALKQQPSSNAIPILFVTSRNSEHDERLGLEAGAADYVSKPYSPALLKARIGTQ 119
+ + + +K+ +P+L ++++N+ E GA DY+ KP+ L IG
Sbjct: 61 NAFDLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 120 LKLAENARLAQQYREAIHLLGTAGQ 144
L R ++ ++ + G+
Sbjct: 119 LAE-PKRRPSKLEDDSQDGMPLVGR 142


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS12870  HTHFIS        74     2e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.7 bits (181), Expect = 2e-15
Identities = 36/142 (25%), Positives = 60/142 (42%), Gaps = 4/142 (2%)

Query: 1029 LEGAHLLLVDDSDINCEVAQRILEGEGAMVTVAHDGEQAVSTLKRAPNLFHLVLMDVQMP 1088
+ GA +L+ DD V + L G V + + + LV+ DV MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD--GDLVVTDVVMP 58

Query: 1089 VVDGYEATRRLRQIPALASLPVIALTAGAFRPQQEKALEAGMNGFIAKPFNVEELVTAIR 1148
+ ++ R+++ A LPV+ ++A KA E G ++ KPF++ EL+ I
Sbjct: 59 DENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116

Query: 1149 HFLQPGTRRIPSLPHEAQAHAG 1170
L RR L ++Q
Sbjct: 117 RALAEPKRRPSKLEDDSQDGMP 138



Score = 61.4 bits (149), Expect = 1e-11
Identities = 29/138 (21%), Positives = 55/138 (39%), Gaps = 17/138 (12%)

Query: 891 PRVLIADDHDAALNNLVRIATELGWRVDAVASGHAALQAIEHATEPYDIFLLDWRMPDID 950
+L+ADD A L + + G+ V ++ + I A D+ + D MPD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWI--AAGDGDLVVTDVVMPDEN 61

Query: 951 GVAIARQIRARATPGPH-PVIVM---------VTAYERRLLEQHPEQQDLDAVMTKPVTG 1000
+ +I+ P PV+VM + A E+ + P+ DL ++ + G
Sbjct: 62 AFDLLPRIKKA---RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIG--IIG 116

Query: 1001 AALHRLVEQLLEQRPGAR 1018
AL + + ++
Sbjct: 117 RALAEPKRRPSKLEDDSQ 134


79  XADLMG695_RS12895  XADLMG695_RS12920  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias     Product
XADLMG695_RS12895  -2      14       -0.082315   glutamine synthetase
XADLMG695_RS12900  -1      13       0.693527    aspartate aminotransferase family protein
XADLMG695_RS12905  -1      13       1.068900    polyamine ABC transporter substrate-binding
XADLMG695_RS12910  -1      11       1.281945    HlyD family secretion protein
XADLMG695_RS12915  -2      13       0.668425    multidrug efflux MFS transporter
XADLMG695_RS12920  -1      12       1.056623    efflux transporter outer membrane subunit
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS12940  adhesinmafb   32     0.005    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 32.0 bits (72), Expect = 0.005
Identities = 29/128 (22%), Positives = 43/128 (33%), Gaps = 21/128 (16%)

Query: 17 SALRRWLKERSITEVECLVPDITGNARG--KIIPADKFSHDYGTRLPEGIFATTVTGDFP 74
A+ RW++E P+ + A K + P V+GDF
Sbjct: 294 EAVDRWIQEN---------PNAAETVEAVFNVAAAAKVAKLAKAAKPG---KAAVSGDFA 341

Query: 75 DDYYALTSPSDSDMHLRPDASTVRMVPWAADPTAQVIHDCYTKDGQPHEL-APRNVLRRV 133
D Y + SDS L +A + + + D +K E+ A N
Sbjct: 342 DSYKKKLALSDSARQLYQNAKYREALDIHYEDLIRRKTDGSSKFINGREIDAVTN----- 396

Query: 134 LDAYAQAK 141
DA QAK
Sbjct: 397 -DALIQAK 403


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS12955  RTXTOXIND     93     7e-23    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 93.4 bits (232), Expect = 7e-23
Identities = 52/371 (14%), Positives = 116/371 (31%), Gaps = 83/371 (22%)

Query: 81 SVAVAPRVSGYVTKVLVSDNQIVEAGQPLLQIDDRTYQATLQQAEAAIAARQADIVAATA 140
S + P + V +++V + + V G LL++ +A + ++++ + +
Sbjct: 96 SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQI 155

Query: 141 NVSAQESALLQARTQVTAAAASLKFAQAEVKRFAPLAASGADTHEHQES-LQHDLARARA 199
+ E L + ++ EV R L T ++Q+ + +L + RA
Sbjct: 156 LSRSIELNKLPELK-LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRA 214

Query: 200 QYDAAQAQAKAGESQIQASRAQLE------------------------QAQAGVKQATAD 235
+ A+ E+ + +++L+ +A ++ +
Sbjct: 215 ERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQ 274

Query: 236 ADQARVAVEDTRLTSRIH------------------------------------------ 253
+Q + + ++
Sbjct: 275 LEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPV 334

Query: 254 -GRVGD-KTVQVGQFLGAGTRTMTIVPQESLYLV-ANFKETQVGLMRPGQPAEIEVDALS 310
+V K G + M IVP++ V A + +G + GQ A I+V+A
Sbjct: 335 SVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFP 394

Query: 311 GVK---LHGKVESLSPGTGSQFALLPPENATGNFTKVVQRVPVRIRVLAGEEARKVLVPG 367
+ L GKV++++ + G V+ + + L G
Sbjct: 395 YTRYGYLVGKVKNINLDA-------IEDQRLGLVFNVIISIEENCLSTGNKNIP--LSSG 445

Query: 368 MSVEVTVDTRS 378
M+V + T
Sbjct: 446 MAVTAEIKTGM 456


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS12960  TCRTETB       103    1e-25    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 103 bits (258), Expect = 1e-25
Identities = 83/407 (20%), Positives = 165/407 (40%), Gaps = 20/407 (4%)

Query: 25 WLAVLAGTIGSFMATLDISIVNAALPTIQGEVGASGTEGTWISTAYLVAEIIMIPLTGWF 84
WL +L SF + L+ ++N +LP I + W++TA+++ I + G
Sbjct: 18 WLCIL-----SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKL 72

Query: 85 VRTLGLRNFLLICAVMFTAFSVVCGLSTS-LSMMIIGRVGQGLAGGALIPTALTIVATRL 143
LG++ LL ++ SV+ + S S++I+ R QG A + +VA +
Sbjct: 73 SDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYI 132

Query: 144 PPSQQTMGTALFGMTVIMGPVIGPLLGGWLTENVSWHYAFFINVPICVGLVALLLLGLRH 203
P + L G V MG +GP +GG + + W Y + +P+ + L+ L
Sbjct: 133 PKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSY--LLLIPMITIITVPFLMKLLK 190

Query: 204 EKGDWAGLLNADWLGIYGLTAGLGGLTVVLEEGQRERWFESSEINTLSLMALSGFIALVI 263
++ G D GI ++ G+ + +L F +S + ++++ F+ V
Sbjct: 191 KEVRIKGHF--DIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVK 238

Query: 264 SQFRRRPPVIRLSLLLQRSFGAVFIMVMAVGMILFGVMYMIPQFLAVISGYNTEQAGYVL 323
+ P + L F + + + G + M+P + + +T + G V+
Sbjct: 239 HIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVI 298

Query: 324 LLSGLPTVLLMPMMPKLLETVDVRILVIAGLICFAAACFVNLSLTADTVGTHFVAGQLLQ 383
+ G +V++ + +L + V+ + F + F+ S +T +
Sbjct: 299 IFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFV 358

Query: 384 GCGLALAMMSLNQAAISSVPPELAGDASGLFNAGRNLGGSVGLALIS 430
GL+ ++ SS+ + AG L N L G+A++
Sbjct: 359 LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVG 405


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS12965  RTXTOXIND     35     8e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 34.8 bits (80), Expect = 8e-04
Identities = 35/230 (15%), Positives = 66/230 (28%), Gaps = 33/230 (14%)

Query: 65 DPLLTQLVTQALADSPNLRAA--QARLRANRALAQQRRAERLPKLNASAVYAYAEPPQTI 122
D LL A AD+ +++ QARL R R E KL +
Sbjct: 122 DVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELN-KLPELKLPDEPYFQNVS 180

Query: 123 VDTLGGLQQGQPGQPPAAGSQALDLEKTEIYSAGFDASWELDFFGRRRRAAEGALAQAQA 182
+ + L Q +Q E ++R LA+
Sbjct: 181 EEEVLRLTSLIKEQFSTWQNQKYQKELN---------------LDKKRAERLTVLARINR 225

Query: 183 SEAELADAQVQLA-----AEVGQV----YLNYRG----LQARLAIADANLDKIRQSLQLV 229
E + +L + L L + + L++I +
Sbjct: 226 YENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSA 285

Query: 230 QQRRGQGVASDLQVEQIATQVQQQQAQRLPLEMQSQEALDQLALMVGREP 279
++ V + +I +++Q L ++ + ++ V R P
Sbjct: 286 KEEYQL-VTQLFK-NEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAP 333


80  XADLMG695_RS22470  XADLMG695_RS13625  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias     Product
XADLMG695_RS22470  2       20       -3.470032   flagellar biosynthesis protein FlhB
XADLMG695_RS13545  2       15       -2.093992   bifunctional diguanylate
XADLMG695_RS22475  1       15       -1.631009   GGDEF domain-containing protein
XADLMG695_RS13550  1       16       -0.327492   diguanylate cyclase
XADLMG695_RS13555  0       19       0.648456    flagellar biosynthetic protein FliR
XADLMG695_RS13565  0       27       2.710368    flagellar biosynthetic protein FliQ
XADLMG695_RS13570  0       27       2.475979    flagellar type III secretion system pore protein
XADLMG695_RS13575  0       25       2.801805    flagellar biosynthetic protein FliO
XADLMG695_RS13580  0       21       2.811317    flagellar motor switch protein FliN
XADLMG695_RS13585  0       18       2.908651    flagellar motor switch protein FliM
XADLMG695_RS13590  1       17       2.419304    flagellar basal body-associated FliL family
XADLMG695_RS13595  1       14       2.265171    flagellar hook-length control protein FliK
XADLMG695_RS13600  1       15       2.360816    flagellar export protein FliJ
XADLMG695_RS13605  1       12       2.314147    FliI/YscN family ATPase
XADLMG695_RS13610  2       15       2.065950    flagellar assembly protein FliH
XADLMG695_RS13615  3       18       0.311578    flagellar motor switch protein FliG
XADLMG695_RS13620  3       21       -0.070862   flagellar M-ring protein FliF
XADLMG695_RS13625  2       22       1.907975    flagellar hook-basal body complex protein FliE
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS13595  TYPE3IMSPROT  347    e-120    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 347 bits (891), Expect = e-120
Identities = 104/344 (30%), Positives = 182/344 (52%), Gaps = 2/344 (0%)

Query: 8 GERTELPTEKRLREAREQGNIPQSRELSTAAVFGAGVFALMVLARGIGDGAAVWMKTALS 67
GE+TE PT K++R+AR++G + +S+E+ + A+ A LM L+ + + M +
Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLM--LIP 60

Query: 68 PDPKMRENPMALFGHFGDLLLQLLWVMLPLIGICLAAGLAGPLMMSGLRFSGKAIMPDLS 127
+ AL ++LL+ ++ PL+ + +A ++ G SG+AI PD+
Sbjct: 61 AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIK 120

Query: 128 KLNPANGIKRMWGSNSLAELIKSVLRLLFVGLAASFCISKGLHGLRSLVNQPLEQAIGNG 187
K+NP G KR++ SL E +KS+L+++ + + I L L L +E
Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLL 180

Query: 188 LDFTKSLLFYTAGALVLLAAIDAPYQKWNWLRKLKMTREEIKREMKESEGSPEVKGRIRQ 247
+ L+ V+++ D ++ + ++++LKM+++EIKRE KE EGSPE+K + RQ
Sbjct: 181 GQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQ 240

Query: 248 MQMQMSQRQMMEAVPKADVVLMNPTHYAVALKYEGGKMRAPIVVAKGVDEMAFRIREAGE 307
++ R M E V ++ VV+ NPTH A+ + Y+ G+ P+V K D +R+ E
Sbjct: 241 FHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAE 300

Query: 308 QHRVAIVTAPPLARALHREAQIGKEIPVRLYSVVAQVLSYVYQL 351
+ V I+ PLARAL+ +A + IP A+VL ++ +
Sbjct: 301 EEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQ 344


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS13610  GPOSANCHOR    34     0.002    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 34.3 bits (78), Expect = 0.002
Identities = 20/74 (27%), Positives = 27/74 (36%), Gaps = 16/74 (21%)

Query: 768 KLLRRKRELEQLVAKRT-------AELEQDKRDLEAARAEL-SLKATHDELTGLLN---- 815
L K LE A A + +RDL+A+R L+A H +L
Sbjct: 285 TLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEA 344

Query: 816 -RAGI---LAALRE 825
R + L A RE
Sbjct: 345 SRQSLRRDLDASRE 358


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS13615  TYPE3IMRPROT  124    1e-36    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 124 bits (313), Expect = 1e-36
Identities = 79/239 (33%), Positives = 131/239 (54%), Gaps = 2/239 (0%)

Query: 23 WTMLRTGALLTAMPLIGTRAVPGRVRVMLAGTLAMVLAPILPPVPEWDGFTAQAVLSIAR 82
W +LR AL++ P++ R+VP RV++ LA + +AP LP L++ +
Sbjct: 18 WPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFFALWLAV-Q 76

Query: 83 ELAVGASMGFMLKLIFEAGALAGELVSQSTGLSFAQMSDPMRGVTSGVIAQWFYLGFGLL 142
++ +G ++GF ++ F A AGE++ GLSFA DP + V+A+ + LL
Sbjct: 77 QILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLALLL 136

Query: 143 FFSANGHLAVIALLVDSYKALPIGTALPDAGAFAEVAPTLFLQILRGGLTLALPMMVAML 202
F + NGHL +I+LLVD++ LPIG ++ AF + I GL LALP++ +L
Sbjct: 137 FLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALT-KAGSLIFLNGLMLALPLITLLL 195

Query: 203 AVNLAFGALAKAAPALNPMQLGLPLTVLLGLFLLSSFASEFAPPVQRMFDTAFDAAREL 261
+NLA G L + AP L+ +G PLT+ +G+ L+++ AP + +F F+ ++
Sbjct: 196 TLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFNLLADI 254


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS13620  TYPE3IMQPROT  43     3e-09    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 43.2 bits (102), Expect = 3e-09
Identities = 17/69 (24%), Positives = 32/69 (46%)

Query: 13 GLVTVLWIAGPMLLAVLVVGVVIGVVQAATQLNEPTIAFVAKAVALTATLFATGSMLLGH 72
L VL ++G + ++G+++G+ Q TQL E T+ F K + + LF
Sbjct: 11 ALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLSGWYGEV 70

Query: 73 LVEFTIALF 81
L+ + +
Sbjct: 71 LLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS13625  FLGBIOSNFLIP  239    2e-81    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 239 bits (612), Expect = 2e-81
Identities = 123/228 (53%), Positives = 161/228 (70%), Gaps = 1/228 (0%)

Query: 51 PAGSNQLPSLPNVSVGRIGDQPVSLPLQTLLLMTAITLLPSMLLVLTAFTRITIVLGLLR 110
P QLP + + + G Q SLP+QTL+ +T++T +P++LL++T+FTRI IV GLLR
Sbjct: 17 PLAFAQLPGITSQPL-PGGGQSWSLPVQTLVFITSLTFIPAILLMMTSFTRIIIVFGLLR 75

Query: 111 QALGTGQTPSNQVLLGLAMFLTALVMMPVWQKMWGAGLQPYLNNQIDFSTAWTLTTQPLR 170
ALGT P NQVLLGLA+FLT +M PV K++ QP+ +I A QPLR
Sbjct: 76 NALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKISMQEALEKGAQPLR 135

Query: 171 AFMLAQIRETDLMTFAGMAGDSKYAGPDAVPFPVLVASFVTSELKTAFEIGFLIFIPFVI 230
FML Q RE DL FA +A GP+AVP +L+ ++VTSELKTAF+IGF IFIPF+I
Sbjct: 136 EFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKTAFQIGFTIFIPFLI 195

Query: 231 IDLVVASVLMSMGMMMLSPMLISAPFKILLFILVDGWVLVVGTLAASF 278
IDLV+ASVLM++GMMM+ P I+ PFK++LF+LVDGW L+VG+LA SF
Sbjct: 196 IDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS13635  FLGMOTORFLIN  113    7e-36    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 113 bits (284), Expect = 7e-36
Identities = 50/90 (55%), Positives = 74/90 (82%)

Query: 22 DQNAADLNLDVILDVPVTLSLEVGRARIPIRNLLQLNQGSVVELERGAGEPLDVYVNGTL 81
D + A ++D+I+D+PV L++E+GR R+ I+ LL+L QGSVV L+ AGEPLD+ +NG L
Sbjct: 46 DVSGAMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYL 105

Query: 82 IAHGEVVVINDRFGIRLTDVVSPSERIRRL 111
IA GEVVV+ D++G+R+TD+++PSER+RRL
Sbjct: 106 IAQGEVVVVADKYGVRITDIITPSERMRRL 135


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS13640  FLGMOTORFLIM  259    1e-86    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 259 bits (662), Expect = 1e-86
Identities = 91/327 (27%), Positives = 163/327 (49%), Gaps = 14/327 (4%)

Query: 3 VSDLLSQDEIDALLHGVDSGAVNTEPEPLPGEARQ-----YDLSSQDRIIRGRMPTLEMV 57
++++LSQDEID LL + SG + E + YD D+ + +M TL ++
Sbjct: 1 MTEVLSQDEIDQLLTAISSG--DASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLM 58

Query: 58 NERFARLWRIGLFNLIRRSADLSVRGIDLVKFNEYMHSLYVPTNLNLIRFKPLRGTGLIV 117
+E FARL L +R + V +D + + E++ S+ P+ L +I PL+G ++
Sbjct: 59 HETFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLE 118

Query: 118 FEPTLVFTVVDNFFGGDGRFHTRIEGREFTATEMRVIQLMLKQTFADLKEAWAPVMDVDF 177
+P++ F+++D FGG G+ R+ T E V++ ++ + A+++E+W V+D+
Sbjct: 119 VDPSITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRP 176

Query: 178 EYINSEINPHFANIVTPREYVVVCRFHVELEGGGGEIHITLPYSMLEPIRELLDAG--IQ 235
E NP FA IV P E VV+ ++ G ++ +PY +EPI L +
Sbjct: 177 RLGQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFS 236

Query: 236 SDRNDRDDSWNVMLREQLDTAEVTLSSVLASKRMSLRQLTGLKVGDIL---PIDLPAQVP 292
S R + +LR++L T ++ + + + S R+S+R + GL+VGDI+ +
Sbjct: 237 SVRRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFV 296

Query: 293 LCVEDIPLFTGEFGVSNGNNAVKITAV 319
L + + F + GV A +I
Sbjct: 297 LSIGNRKKFLCQPGVVGKKIAAQILER 323


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS13650  FLGHOOKFLIK   48     5e-08    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 47.5 bits (112), Expect = 5e-08
Identities = 40/176 (22%), Positives = 80/176 (45%), Gaps = 6/176 (3%)

Query: 247 AAKALEPAADDSAAAAAPDAPAFVLPTTTAPALSRLQDPAPIFSASPTPTPELGSDTFDD 306
A+ L P ++ + A + + +P ++ Q A+P + LGS +
Sbjct: 183 PAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEWQQ 242

Query: 307 AIGARMSWLADQKIGHAHIKVTPNEMGPVEVRLHLEGDKVNASFTAANADTRQALEQSLP 366
++ +S Q A +++ P ++G V++ L ++ ++ + + R ALE +LP
Sbjct: 243 SLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALP 302

Query: 367 RLREMLGQNGFQLGQADV------GQQQRNSSGNRNGGNDSGNGLTLDDAPPVGIP 416
LR L ++G QLGQ+++ GQQQ S ++ + L +D + +P
Sbjct: 303 VLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVP 358


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS13655  FLGFLIJ       27     0.016    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 27.5 bits (60), Expect = 0.016
Identities = 35/142 (24%), Positives = 59/142 (41%), Gaps = 4/142 (2%)

Query: 1 MMQSKRIDPLLRRAQEQEDKVARDLAERQRVLETHQSRLEELRRYAEEYANSQMAGTSAV 60
M + + L A+++ + AR L E +R + + +L+ L Y EY N+ + SA
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 ALSNR----RAFLDRLDSAVLQQAQTVQSNIAKVEAERTRLLLASREKQVLEQLAASYRA 116
SNR + F+ L+ A+ Q Q + KV+ + Q + L
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 117 QENKVIERRDQREMDDLGARRA 138
R DQ++MD+ R A
Sbjct: 121 AALLAENRLDQKKMDEFAQRAA 142


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS13665  FLGFLIH       43     3e-07    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 42.9 bits (100), Expect = 3e-07
Identities = 36/159 (22%), Positives = 76/159 (47%), Gaps = 7/159 (4%)

Query: 51 HEGFARGHAEGFAQGQSEVRRLTAQIDGILDNFTRPLARLENEVVGALGELAVRIAGQLV 110
EG A+G +G A+ +S+ + A++ ++ F L L++ + L ++A+ A Q++
Sbjct: 73 QEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVI 132

Query: 111 GRVYQADPQLLADLVGEAVDAVGGAGREVEVRLHPDDITALLPHLAPSSTT---RVAPDM 167
G+ D L + + + + ++R+HPDD+ + L + + R+ D
Sbjct: 133 GQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDP 192

Query: 168 SLSRGDLRVHAESVRIDGTLDARLRAALETVMRKSGAGL 206
+L G +V A+ +G LDA + + + R + G+
Sbjct: 193 TLHPGGCKVSAD----EGDLDASVATRWQELCRLAAPGV 227


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS13670  FLGMOTORFLIG  308    e-106    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 308 bits (791), Expect = e-106
Identities = 106/329 (32%), Positives = 200/329 (60%)

Query: 1 MTGVQRAAVLLLSLGESDAAEVLKHMDPKEVQKIGIAMATMTGISRDQVEKVMDEFNGEL 60
+TG Q+AA+LL+S+G +++V K++ +E++ + +A + I+ + + V+ EF +
Sbjct: 15 LTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKELM 74

Query: 61 AGKTSLGVGADDYIRNVLIQALGADKAGGLIDRILLGRNTTGLDTLKWMDPRAVADLVRN 120
+ + G DY R +L ++LG KA +I+ + + + ++ DP + + ++
Sbjct: 75 MAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQQ 134

Query: 121 EHPQIIAIVMAHLDSDQAAEALKLLPERTRADVLLRIATLDGIPPNALSELNDIMERQFA 180
EHPQ IA+++++LD +A+ L LP + +V RIA +D P + E+ ++E++ A
Sbjct: 135 EHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKLA 194

Query: 181 GNQNLKSSNVGGIKVAANILNFLDTGSDQGVLGEIGKIDADLAGKIQDLMFVFDNLVDLD 240
+ ++ GG+ I+N D +++ ++ + + D +LA +I+ MFVF+++V LD
Sbjct: 195 SLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLLD 254

Query: 241 DRGLQTLLREVSGERLGLALRGADVKVREKITRNMSQRAAEILLEDMEARGPVRLADVEA 300
DR +Q +LRE+ G+ L AL+ D+ V+EKI +NMS+RAA +L EDME GP R DVE
Sbjct: 255 DRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVEE 314

Query: 301 AQKEILTIVRRLADEGAISLGGAGAEAMV 329
+Q++I++++R+L ++G I + G E ++
Sbjct: 315 SQQKIVSLIRKLEEQGEIVISRGGEEDVL 343


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS13675  FLGMRINGFLIF  351    e-116    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 351 bits (902), Expect = e-116
Identities = 187/575 (32%), Positives = 300/575 (52%), Gaps = 45/575 (7%)

Query: 16 KAGQWFDRVRSLQITRKLTMMAMIALAVAAGLAVFFWSQKPGYQSLYTGLDEKGNAEAAD 75
K +W +R+R+ ++ ++ + AVA +A+ W++ P Y++L++ L ++
Sbjct: 11 KPLEWLNRLRANP---RIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVA 67

Query: 76 LLRTAQIPYKIDQGTGAISVPQDRLYDARLKLAGSGLTGKETGGGFELMEKDPGFGVSQF 135
L IPY+ G+GAI VP D++++ RL+LA GL K GFEL++++ FG+SQF
Sbjct: 68 QLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLP-KGGAVGFELLDQEK-FGISQF 125

Query: 136 VESARYQHALETELSRTIGTLRPVREARVHLAIPKPSAFTRQRDVASASVVLELRGGQGL 195
E YQ ALE EL+RTI TL PV+ ARVHLA+PKPS F R++ SASV + L G+ L
Sbjct: 126 SEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRAL 185

Query: 196 ERNQVDAIVNLVASSIPDMTPERVTVVDQSGRMLSIADPNSDAAQHAAQFEQVRRQESSY 255
+ Q+ A+V+LV+S++ + P VT+VDQSG +L+ ++ + AQ + ES
Sbjct: 186 DEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLN-DAQLKFANDVESRI 244

Query: 256 NQRIRELLEPMTGPGRVNPETSVDMDFSVVEEARELYN----GEPAKLRSEQVSD-TSTS 310
+RI +L P+ G G V+ + + +DF+ E+ E Y+ A LRS Q++
Sbjct: 245 QRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVG 304

Query: 311 ATGPQGPPGATSNSPGQPPAPAVAGAPGT--------PAAANGQAAAPATPTESSKSATR 362
A P G PGA SN P PP A P T P + + A P + ++ T
Sbjct: 305 AGYPGGVPGALSNQP-APPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETS 363

Query: 363 NYELDRTLQHTRQPAGRIKRVSVAVLLDNVPRPGAKGKMVEQPLTAAELTRIEGLVKQAV 422
NYE+DRT++HT+ G I+R+SVAV+++ K PLTA ++ +IE L ++A+
Sbjct: 364 NYEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKP----LPLTADQMKQIEDLTREAM 419

Query: 423 GFDAARGDTVSVMNAPFVREAVAGEEGPKWWEDPRVQNGLRLLVGAVVVLALLF----GV 478
GF RGDT++V+N+PF G E P W + + L ++VL + +
Sbjct: 420 GFSDKRGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAG-RWLLVLVVAWILWRKA 478

Query: 479 VRPTLRQLTGVTAVKDKQGKAGKDGTPQSADVRMVEDDDDLMPRLEEDTAQIGQDKKTPI 538
VRP L + +Q + ++ + A + D+ L Q ++
Sbjct: 479 VRPQLTRRVEEAKAAQEQAQVRQET--EEAVEVRLSKDEQL------------QQRRANQ 524

Query: 539 ALPDAYEERMRLAREAVKADSKRVAQVVKGWVASE 573
L E + RE D + VA V++ W++++
Sbjct: 525 RLG--AEVMSQRIREMSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS13680  FLGHOOKFLIE   60     3e-15    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 60.1 bits (145), Expect = 3e-15
Identities = 28/84 (33%), Positives = 48/84 (57%)

Query: 40 AGAQGTPATQAPSFSETLRGAIGGVNEAQQKAGALSKAFEMGDPNADLARVMVASQQSQV 99
A AQ + SF+ L A+ +++ Q A ++ F +G+P L VM Q++ V
Sbjct: 20 ARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASV 79

Query: 100 AFRATVEVRNRLVQAYQDVMNMPL 123
+ + ++VRN+LV AYQ+VM+M +
Sbjct: 80 SMQMGIQVRNKLVAAYQEVMSMQV 103


81	XADLMG695_RS13675	XADLMG695_RS13840	N	Y	N	Pathogenicity Island (unbiased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
XADLMG695_RS13675	-2	10	-1.335034	SDR family oxidoreductase
XADLMG695_RS13680	-2	16	-3.450563	ketoacyl-ACP synthase III
XADLMG695_RS13685	-2	18	-4.057376	acyl carrier protein
XADLMG695_RS23320	3	42	-12.138074	DegT/DnrJ/EryC1/StrS family aminotransferase
XADLMG695_RS13690	3	38	-11.129166	sigma-54-dependent Fis family transcriptional
XADLMG695_RS13700	2	34	-10.270604	response regulator transcription factor
XADLMG695_RS23325	1	30	-8.444070	RNA polymerase factor sigma-54
XADLMG695_RS13705	1	24	-7.207825	response regulator transcription factor
XADLMG695_RS13710	1	14	-3.621557	PilZ domain-containing protein
XADLMG695_RS13715	0	11	-3.439085	hypothetical protein
XADLMG695_RS13720	0	10	-1.556772	flagellar export chaperone FliS
XADLMG695_RS13725	2	10	-1.815190	flagellar filament capping protein FliD
XADLMG695_RS13730	0	11	-1.092820	flagellin
XADLMG695_RS13735	1	11	-0.642532	flagellar hook-associated protein FlgL
XADLMG695_RS13740	1	11	-0.155733	flagellar hook-associated protein FlgK
XADLMG695_RS13745	2	14	0.617202	flagellar assembly peptidoglycan hydrolase FlgJ
XADLMG695_RS13755	0	15	0.399814	flagellar basal body P-ring protein FlgI
XADLMG695_RS13760	1	20	-0.573466	flagellar basal body L-ring protein FlgH
XADLMG695_RS13765	1	20	-0.800537	flagellar basal-body rod protein FlgG
XADLMG695_RS13770	1	21	-0.808146	flagellar basal-body rod protein FlgF
XADLMG695_RS13775	0	21	-0.377030	flagellar hook protein FlgE
XADLMG695_RS13780	0	21	0.102730	flagellar basal body rod modification protein
XADLMG695_RS13785	-1	21	0.466205	flagellar basal body rod protein FlgC
XADLMG695_RS13790	-1	19	0.552250	flagellar basal body rod protein FlgB
XADLMG695_RS13795	-1	18	0.660424	chemotaxis protein CheV
XADLMG695_RS13800	0	20	0.553055	flagellar basal body P-ring formation protein
XADLMG695_RS13805	1	19	-0.317152	flagellar biosynthesis anti-sigma factor FlgM
XADLMG695_RS13810	0	18	-1.102708	flagella protein
XADLMG695_RS13820	1	17	-1.474848	sensor histidine kinase
XADLMG695_RS13830	2	17	-0.645532	EAL domain-containing protein
XADLMG695_RS13840	1	11	-0.976409	PAS domain S-box protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS13725	DHBDHDRGNASE	109	7e-31	2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 109 bits (273), Expect = 7e-31
Identities = 68/260 (26%), Positives = 127/260 (48%), Gaps = 13/260 (5%)

Query: 7 FNPFSLADKRILVSGASSGLGRAIALGCARMGGELIVSGRDPQRLDATLADLRAISERPH 66
N + K ++GA+ G+G A+A A G + +P++L+ ++ L+A +
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 67 QALRADLTVATERASLVAALS---APLHGVVHSAGISRLCPARMVGEAHLREVQATNVDA 123
AD+ + + A + P+ +V+ AG+ R + + + N
Sbjct: 61 A-FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTG 119

Query: 124 PILLTQGLLKRNLIAADGAIVFIASIAAHIGVAGVGAYSASKAALIAYARCLAMEVVKRH 183
++ + K + G+IV + S A + + AY++SKAA + + +CL +E+ + +
Sbjct: 120 VFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179

Query: 184 IRVNCLSPALVDTPLL-------DATAQVV-GSLETERSNYPLG-FGRPDDVANAAIFLL 234
IR N +SP +T + + QV+ GSLET ++ PL +P D+A+A +FL+
Sbjct: 180 IRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 235 SGASRWITGTSLVMDGGLTI 254
SG + IT +L +DGG T+
Sbjct: 240 SGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS13730	PF04183	29	0.028	IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 29.1 bits (65), Expect = 0.028
Identities = 16/45 (35%), Positives = 22/45 (48%), Gaps = 4/45 (8%)

Query: 71 ERLQWKREEIDALIVVTQSPDYPIPATAII--LQDRLGLSHATVA 113
ER W IDA + D P+ A ++ L+ L +S ATVA
Sbjct: 51 ERGIWGWLWIDAQTLRCA--DEPVLAQTLLMQLKQVLSMSDATVA 93


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS13745	HTHFIS	437	e-152	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 437 bits (1125), Expect = e-152
Identities = 173/489 (35%), Positives = 249/489 (50%), Gaps = 16/489 (3%)

Query: 1 MSESRILLIDSDAVRAERTVSLLEFMDFNPRWVTDGADINPGRHRHDEWMAVMVGSAQDA 60
M+ + IL+ D DA L ++ R ++ A + R ++V
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLW--RWIAAGDGDLVVTDVVMP 58

Query: 61 -AQADKFFDWLADAKLPPPVLLMEGSPSAFAQTHGLHEANVWALDTPLRHAQLEALLRRA 119
A + A+ PVL+M + + L P +L ++ RA
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 120 S--LKRLDAEHQAGVQQDSGPTGNSEAVTRLRRLIDQVAAFDTTVLVLGESGTGKEVVAR 177
KR ++ + Q G S A+ + R++ ++ D T+++ GESGTGKE+VAR
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVAR 178

Query: 178 AIHQHSPRRDGPFVAINCGAIPPDLLESELFGHEKGAFTGALTTRKGRFEMAEGGTLLLD 237
A+H + RR+GPFVAIN AIP DL+ESELFGHEKGAFTGA T GRFE AEGGTL LD
Sbjct: 179 ALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLD 238

Query: 238 EIGDMSLPMQVKLLRVLQERSFERVGGGQTIRCNVRVIAATHRNLESRISDGQFREDLFY 297
EIGDM + Q +LLRVLQ+ + VGG IR +VR++AAT+++L+ I+ G FREDL+Y
Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYY 298

Query: 298 RLNVFPIEMPALRERVDDLAMLVQTIAGQLARTGRGEVRFADEALQALRGYDWPGNVREL 357
RLNV P+ +P LR+R +D+ LV+ Q + G RF EAL+ ++ + WPGNVREL
Sbjct: 299 RLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVREL 358

Query: 358 TNLVERLAVLHPGGLVRVQDLPARYRGDFASAIPVELPPEPELVTAPVEVSALPSNVVTL 417
NLV RL L+P ++ + + R + + + ++ V
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418

Query: 418 QPKTADAEPSATSSLPDDGIDLRGHMANIELALINEALERTQGVVAHAAQLLGLRRTTLV 477
+A +E LI AL T+G AA LLGL R TL
Sbjct: 419 FG-----------DALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLR 467

Query: 478 EKLRKYGID 486
+K+R+ G+
Sbjct: 468 KKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS13750	HTHFIS	55	3e-12	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 55.2 bits (133), Expect = 3e-12
Identities = 20/118 (16%), Positives = 44/118 (37%), Gaps = 2/118 (1%)

Query: 1 MSKLTVLLVDDHEGFINAAMRHFRKVEWLDIVGSAANGLEAIERSESLRPNVVLMDLAMP 60
M+ T+L+ DD + + + V +N + ++V+ D+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 EMGGLQATRLIKTQDDPPYIVIASHFDDAEHREHALRAGADNFVSKLSYIQEVMPILE 118
+ IK +++ S + A GA +++ K + E++ I+
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS13760	HTHFIS	72	6e-17	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.8 bits (176), Expect = 6e-17
Identities = 35/160 (21%), Positives = 66/160 (41%), Gaps = 9/160 (5%)

Query: 2 RVIIVDDHTLVRAGLSRLLQTFAGIDVVGEASNAQQALDMTSLHRPDLVLMDLSLPGRSG 61
+++ DD +R L++ L + AG DV SNA + DLV+ D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQAL-SRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 LDAMTDVLRAAPRTHVVMMSMHDDPVHVRDALDRGAVGFVVKDAAPLELELALRAAAAGQ 121
D + + +A P V++MS + + A ++GA ++ K EL + A
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA---- 118

Query: 122 VFLSPQISSKMIAPMLGREKPVGIAALSPRQREILREIGR 161
+ + + + + S +EI R + R
Sbjct: 119 ---LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS13785	FLAGELLIN	135	2e-37	Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 135 bits (341), Expect = 2e-37
Identities = 124/360 (34%), Positives = 181/360 (50%), Gaps = 10/360 (2%)

Query: 2 AQVINTNVMSLNAQRNLNTSSASMSTSIQRLSSGLRINSAKDDAAGLAISERFTTQIRGL 61
AQVINTN +SL Q NLN S +S+S++I+RLSSGLRINSAKDDAAG AI+ RFT+ I+GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVASRNANDGISLAQTAEGAMVEIGSNLQRIRELSVQSSNATNSSTDRDALNSEVKQLTA 121
ASRNANDGIS+AQT EGA+ EI +NLQR+RELSVQ++N TNS +D ++ E++Q
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 EIDRVANQTNFNGTKLLDGSFSGALFQVGADAGQTIGINSIVDANVDSLGKANFAAAVSG 181
EIDRV+NQT FNG K+L QVGA+ G+TI I + +V SLG F
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQ-MKIQVGANDGETITI-DLQKIDVKSLGLDGFNVNGPK 178

Query: 182 AGVTGTATASGSVSGISLSFNDASGSAKSVTIADVKIAAGDTAADVNKKVASAINDKLDQ 241
G +S ++ + + + + +K +A N +L
Sbjct: 179 EATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTT 238

Query: 242 TGMYASIKSDGSLQIESLKAGQDFTSLSAG--------TSSAAGITVGAGITTASAASGS 293
+ D +S + +++ T G+T T + +G
Sbjct: 239 DDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGK 298

Query: 294 TASTLSSLDISTFSGSQKALEIVDKALTAVNSSRADMGAVQNRFTSTIANLSATSENLSA 353
++T++ ++ A A T +S V +FT + +++
Sbjct: 299 VSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDL 358



Score = 97.4 bits (242), Expect = 4e-24
Identities = 74/340 (21%), Positives = 133/340 (39%), Gaps = 3/340 (0%)

Query: 60 GLDVASRNANDGISLAQTAEGAMVEIGSNLQRIRELSVQSSNATNSSTDRDALNSEVKQL 119
G +V L + + + + +S A + T + +V
Sbjct: 171 GFNVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVN 230

Query: 120 TAEIDRVANQTNFNGTKLLDGSFSGALFQVGADAGQTIGINSIVDANVDSLGKANFAAAV 179
A + N L + A A D G
Sbjct: 231 AANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTK 290

Query: 180 SGAGVTGTATASGSVSGISLSFNDASGSAKSVTIADVKIAAGDTAADVNKKVASAINDKL 239
+G G + + + ++L+ D + A +V A ++ + + VN + K
Sbjct: 291 TGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKN 350

Query: 240 DQTGMYASIKSDGSLQIESLKAGQDFTSLSAGTSSAAGITVGAGITTASAASGSTASTLS 299
+ + ++ + + ++ +T+ + ++ ++
Sbjct: 351 ESAKLSDLEANNAVKGESKITVN---GAEYTANAAGDKVTLAGKTMFIDKTASGVSTLIN 407

Query: 300 SLDISTFSGSQKALEIVDKALTAVNSSRADMGAVQNRFTSTIANLSATSENLSASRSRIR 359
+ + L +D AL+ V++ R+ +GA+QNRF S I NL T NL+++RSRI
Sbjct: 408 EDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIE 467

Query: 360 DTDYAKETAELTRTQILQQAGTAMLAQAKSVPQNVLSLLQ 399
D DYA E + +++ QILQQAGT++LAQA VPQNVLSLL+
Sbjct: 468 DADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS13790	FLAGELLIN	59	2e-11	Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 58.5 bits (141), Expect = 2e-11
Identities = 62/349 (17%), Positives = 111/349 (31%), Gaps = 6/349 (1%)

Query: 4 RISTSMMYSQSVASMGAKQSRLNQFESQLSSGQRLVTAKDDPVAAGTAVGLDRALAAITR 63
I+T+ + + ++ QS L+ +LSSG R+ +AKDD A + +T+
Sbjct: 3 VINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQ 62

Query: 64 FGENANNVQNRLGLQENALSQAGDKMARVTELAVQASNSSLSPDDRKAIASELTALRESM 123
NAN+ + E AL++ + + RV EL+VQA+N + S D K+I E+ E +
Sbjct: 63 ASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEI 122

Query: 124 VSLANSTDGTGRYLFGGTADGSAPFIKSNG---SVTYNGDQTQKQVEVAPDTFVSDTLPG 180
++N T G + ++G ++ + +
Sbjct: 123 DRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATV 182

Query: 181 SEIFMRIRTGDGTVDAHANAGNTGTGLLLDFSRDASTGSWNGGSYSVQFTAADTYEVRDS 240
++ + G A + +T V
Sbjct: 183 GDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAE 242

Query: 241 TNTVVGTGTYKEG--EDINAAGVRMRISGAPAVGDSFQIGASGTKDVFSTID-DLVGALN 297
NT V + A + I G G + T D + D + +
Sbjct: 243 NNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTT 302

Query: 298 SDTLTAPQKAAMINTLQTSMRDITQASSKMIDARTSGGAQLSAIDNANA 346
+ A I ++ T SSK + G N
Sbjct: 303 INGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNE 351



Score = 36.6 bits (84), Expect = 1e-04
Identities = 50/269 (18%), Positives = 83/269 (30%), Gaps = 1/269 (0%)

Query: 127 ANSTDGTGRYLFGGTADGSAPFIKSNGSVTYNGDQTQKQVEVAPDTFVSDTLPGSEIFMR 186
AN T D + G+ + DTF + +
Sbjct: 232 ANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKT 291

Query: 187 IRTGDGTVDAHANAGNTGTGLLLDFSRDASTGSWNGGSYSVQFTAADTYEVRDSTNTVVG 246
G+G V N + + A+ + S +T+ + T
Sbjct: 292 GNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNE 351

Query: 247 TGTYKEGEDINAAGVRMRISGAPAVGDSFQIGASGTKDVFSTIDDLVGALNSDTLTAPQK 306
+ + E NA +I+ A + G T + D A TL
Sbjct: 352 SAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFID-KTASGVSTLINEDA 410

Query: 307 AAMINTLQTSMRDITQASSKMIDARTSGGAQLSAIDNANALLESNEVTLKTSLSSIRDLD 366
AA + + I A SK+ R+S GA + D+A L + L ++ S I D D
Sbjct: 411 AAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDAD 470

Query: 367 YASAIGQYQLEKASLQAAQTIFQQMQSSS 395
YA+ + + QA ++ Q
Sbjct: 471 YATEVSNMSKAQILQQAGTSVLAQANQVP 499


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS13795	FLGHOOKAP1	227	7e-69	Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 227 bits (580), Expect = 7e-69
Identities = 141/437 (32%), Positives = 220/437 (50%), Gaps = 8/437 (1%)

Query: 2 SIMSTGTSALIAFQRALSTVSHNVANINTEGYSRQRVEFATRTPTDMGYAFVGNGAKITD 61
S+++ S L A Q AL+T S+N+++ N GY+RQ A T +VGNG ++
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSG 61

Query: 62 VGRVADQLAISRLLDSGGELSRLQQLSSLSNRVDALYSNTATNVAGLWSNFFDSTSAVSS 121
V R D ++L + + S L +++D + S + +++A +FF S + S
Sbjct: 62 VQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVS 121

Query: 122 NASSTAERQSMLDSGNSLATRFKQLNGQMDSLSNEVNSGLTSSVDEVNRLTQQIAKLNGT 181
NA A RQ+++ L +FK + + +VN + +SVD++N +QIA LN
Sbjct: 122 NAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQ 181

Query: 182 I----GSSAQAAAPDMLDQRDALVSKLVGFTGGTAVIQDGGFMNVFTAGGQPLVVGTTSS 237
I G A A+ ++LDQRD LVS+L G +QDGG N+ A G LV G+T+
Sbjct: 182 ISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTAR 241

Query: 238 KLVTAADPYEPTKLQVAMQTQGQNVSLSASSL--GGQIGGLLEFRSSVLEPTQAELGRLA 295
+L +P++ VA L G +GG+L FRS L+ T+ LG+LA
Sbjct: 242 QLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLA 301

Query: 296 VGMASTFNAGHSQGMDLYGAMGGNFFNIGSPAVAANPSNTGSASLSASFSNVSAVDGQNV 355
+ A FN H G D G G +FF IG PAV N N G ++ A+ ++ SAV +
Sbjct: 302 LAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATDY 361

Query: 356 TLSFDGTNWKAINASTGSAVPMTGTGTAADPLVLNGVSMVVGGTPASGDKFLLQPTAGLA 415
+SFD W+ ++ + T T A + +G+ + GTPA D F L+P +
Sbjct: 362 KISFDNNQWQVTRLASNTT--FTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAI 419

Query: 416 GSLSVAITDPSRIAAAT 432
++ V ITD ++IA A+
Sbjct: 420 VNMDVLITDEAKIAMAS 436



Score = 82.3 bits (203), Expect = 1e-18
Identities = 38/105 (36%), Positives = 56/105 (53%)

Query: 517 AGSSDNGNAKLLANIDDAKALSGGTVTLNGALSGLTTSVGSAARAASYSADAQKVINDQA 576
AG SDN N + L ++ GG + N A + L + +G+ S+ Q + Q
Sbjct: 440 AGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQL 499

Query: 577 QASRDSISGVNLDEEAANMLKLQQAYQAAAQMISTADTIFQAILG 621
+ SISGVNLDEE N+ + QQ Y A AQ++ TA+ IF A++
Sbjct: 500 SNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS13800	FLGFLGJ	130	5e-37	Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 130 bits (327), Expect = 5e-37
Identities = 63/140 (45%), Positives = 82/140 (58%), Gaps = 4/140 (2%)

Query: 218 FVAKIWTHAQKAARELGVDPRALVAQAALETGWGRRGI--GNGGDSNNLFGIKATG-WSG 274
F+A++ AQ A+++ GV ++AQAALE+GWG+R I NG S NLFG+KA+G W G
Sbjct: 152 FLAQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKG 211

Query: 275 DKVTTGTHEYVNGVKTTETADFRAYGSAEESFADYVRLLKNNSRYQTALQAGTDIKGFAR 334
T EY NG A FR Y S E+ +DYV LL N RY A+ + A+
Sbjct: 212 PVTEITTTEYENGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRY-AAVTTAASAEQGAQ 270

Query: 335 GLQQAGYATDPGYAAKIAAI 354
LQ AGYATDP YA K+ +
Sbjct: 271 ALQDAGYATDPHYARKLTNM 290



Score = 71.3 bits (174), Expect = 5e-16
Identities = 57/178 (32%), Positives = 85/178 (47%), Gaps = 22/178 (12%)

Query: 4 AASPIDLNPSTKADPA-KIDKVSRQLEGQFAQMLVKSMRDASGGDPMFPGQNQ-MFREMY 61
A S +L DPA I V+RQ+EG F QM++KSMRDA D +F ++ ++ MY
Sbjct: 15 AQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDGLFSSEHTRLYTSMY 74

Query: 62 DQQMAKALTDGKGLGLSAMISKQLSGDTGGPALNTSL--------------NTAEAAKAY 107
DQQ+A+ +T GKGLGL+ M+ KQ++ + P +T N A +
Sbjct: 75 DQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETVVRYQNQALSQLVQ 134

Query: 108 ALVAGKRDASLPLPARDGAATGVTTSSVAKAALGAGNLSGIGMSQVLDLIAGRTGAGE 165
V D SLP ++ A ++ A A SG+ +L A +G G+
Sbjct: 135 KAVPRNYDDSLPGDSKAFLA------QLSLPAQLASQQSGVPHHLILAQAALESGWGQ 186


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS13805	FLGPRINGFLGI	361	e-126	Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 361 bits (928), Expect = e-126
Identities = 156/364 (42%), Positives = 220/364 (60%), Gaps = 9/364 (2%)

Query: 10 LLAAAVALCALAAPASAERIKDLAQVGGVRGNALVGYGLVVGLDGSGDRTSQAPFTVQSL 69
+ +A L A A RIKD+A + R N L+GYGLVVGL G+GD +PFT QS+
Sbjct: 12 VFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSM 71

Query: 70 KNLLGELGVNVPANVNPQLKNVAAVAIHAELPPFAKPGQPIDITVSSIANAVSLRGGSLL 129
+ +L LG+ KN+AAV + A LPPFA PG +D+TVSS+ +A SLRGG+L+
Sbjct: 72 RAMLQNLGITTQGG-QSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLI 130

Query: 130 MAPLKGADGQVYAMAQGNLVVGGFGAQGKDGSRVSVNVPSVGRIPNGATVERALPDVFAG 189
M L GADGQ+YA+AQG L+V GF AQG D + ++ V + R+PNGA +ER LP F
Sbjct: 131 MTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFKD 189

Query: 190 TGEITLNLHQNDFTTVSRMVAAIDS----SFGAGTARAVDGVTVAVRSPTDPGARIGLLS 245
+ + L L DF+T R+ +++ +G A D +AV+ P L++
Sbjct: 190 SVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKP-RVADLTRLMA 248

Query: 246 RLENVELSPGDAPAKVVVNARTGTVVIGQLVRVMPAAIAHGSLTVTISENTNVSQPGAFS 305
+EN+ + D PAKVV+N RTGT+VIG VR+ A+++G+LTV ++E+ V QP FS
Sbjct: 249 EIENLTVET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFS 307

Query: 306 GGRTAVTQQSTITATSEGSRMFKFEGGTTLDQIVRAVNEVGAAPGDLVAILEALKQAGAL 365
G+TAV Q+ I A EGS++ E G L +V +N +G ++AIL+ +K AGAL
Sbjct: 308 RGQTAVQPQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQGIKSAGAL 366

Query: 366 TAEL 369
AEL
Sbjct: 367 QAEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS13810	FLGLRINGFLGH	148	2e-46	Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 148 bits (374), Expect = 2e-46
Identities = 79/199 (39%), Positives = 111/199 (55%), Gaps = 15/199 (7%)

Query: 39 VPVVAPVAQPTAGAIYAAGPSLN-----LYGDRRARDVGDLLTVNLVESTTASSTANTSI 93
VP PVA G+I+ + +N L+ DRR R++GD LT+ L E+ +AS +++ +
Sbjct: 40 VPGPTPVA---NGSIFQSAQPINYGYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANA 96

Query: 94 SKKDATTM---AAPTLLGAPLTVGGLNVLENSTSGDRSFAGKGNTAQSNRMQGSVTVTVM 150
S+ T P L +V SG +F GKG SN G++TVTV
Sbjct: 97 SRDGKTNFGFDTVPRYLQGLFGNARADV---EASGGNTFNGKGGANASNTFSGTLTVTVD 153

Query: 151 QRLPNGNLVIQGQKQLRLTQGDELVQVQGIVRAADIAPDNTVPSSKVADARIAYGGRGAI 210
Q L NGNL + G+KQ+ + QG E ++ G+V I+ NTVPS++VADARI Y G G I
Sbjct: 154 QVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRTISGSNTVPSTQVADARIEYVGNGYI 213

Query: 211 AQSNAMGWLSRFFNSRLSP 229
++ MGWL RFF + LSP
Sbjct: 214 NEAQNMGWLQRFFLN-LSP 231


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS13815	FLGHOOKAP1	39	1e-05	Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 39.2 bits (91), Expect = 1e-05
Identities = 12/41 (29%), Positives = 20/41 (48%)

Query: 219 LEGSNVNTVEELVSMIETQRAYEMNAKAISTTDSMLGYLNN 259
S VN EE ++ Q+ Y NA+ + T +++ L N
Sbjct: 504 QSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 37.6 bits (87), Expect = 3e-05
Identities = 19/82 (23%), Positives = 31/82 (37%), Gaps = 20/82 (24%)

Query: 5 LWVAKTGLDAQQTRMSVISNNLANTNTTGFKRDRAAFEDLLYQQVRAPGGSTSAQTQLPT 64
+ A +GL+A Q ++ SNN+++ N G+ R T T
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQT-----------------TIMAQANST 46

Query: 65 ---GLQLGTGVRVVSTFKGFDQ 83
G +G GV V + +D
Sbjct: 47 LGAGGWVGNGVYVSGVQREYDA 68


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS13820	FLGHOOKAP1	30	0.009	Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 30.3 bits (68), Expect = 0.009
Identities = 9/31 (29%), Positives = 19/31 (61%)

Query: 5 LYVAMTGARASLQAQSTVSHNLANVDTVGFK 35
+ AM+G A+ A +T S+N+++ + G+
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYT 34


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS13825	FLGHOOKAP1	46	2e-07	Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 46.1 bits (109), Expect = 2e-07
Identities = 25/69 (36%), Positives = 37/69 (53%), Gaps = 3/69 (4%)

Query: 2 GFNTSLSGINAANADLNVTSNNIANVNTTGFKESRAEFADMFQSTSYGLSRNAVGSGVRV 61
N ++SG+NAA A LN SNNI++ N G+ A Q+ S + VG+GV V
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA---QANSTLGAGGWVGNGVYV 59

Query: 62 SNVAQQFSQ 70
S V +++
Sbjct: 60 SGVQREYDA 68



Score = 44.2 bits (104), Expect = 7e-07
Identities = 31/188 (16%), Positives = 69/188 (36%), Gaps = 16/188 (8%)

Query: 232 LQFSDTGALTTPANGIIAMDPFTPSTGAGVLN-MQLNVTGSTQYGEAFALRDTRQDGYAS 290
+ F + T + G + ++L TG+ ++F L+ A
Sbjct: 363 ISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSD---AI 419

Query: 291 GKLNEISIDTSGVVFARYSNGADKPLGQVALSSFVNPQGLQSQGNNMWA-ESY------- 342
++ + D + + A + D + ++ G ++Y
Sbjct: 420 VNMDVLITDEAKIAMASEEDAGDSDNRNGQ-ALLDLQSNSKTVGGAKSFNDAYASLVSDI 478

Query: 343 ---TSGAARTGAPDTSDLGQIESGSLEASTVDLTEQLVNMIVAQRNFQANSQMISTQDQV 399
T+ + A + + Q+ + S V+L E+ N+ Q+ + AN+Q++ T + +
Sbjct: 479 GNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAI 538

Query: 400 TQTIINIR 407
+INIR
Sbjct: 539 FDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS13845	HTHFIS	39	2e-05	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 38.7 bits (90), Expect = 2e-05
Identities = 15/75 (20%), Positives = 29/75 (38%), Gaps = 9/75 (12%)

Query: 184 VLVVDDSRVARQQIRSVLDQLGVSATLLSDGRQALDHLLQVAASGENPADRYAMVISDIE 243
+LV DD R + L + G + S+ + A +V++D+
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA---------AGDGDLVVTDVV 56

Query: 244 MPAMDGYTLTTEIRR 258
MP + + L I++
Sbjct: 57 MPDENAFDLLPRIKK 71


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS13860	PYOCINKILLER	28	0.007	Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 28.2 bits (62), Expect = 0.007
Identities = 15/66 (22%), Positives = 24/66 (36%), Gaps = 2/66 (3%)

Query: 35 DKLSALQALEAAMPAGEEERLRELAEANRANGALLARRRREVNWALRHLGRTESAPSYDA 94
+ +S+LQ + A + A R A A+R+ E R +A +Y
Sbjct: 195 EAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEE--QARQQAAIRAANTYAM 252

Query: 95 KGQSSV 100
SV
Sbjct: 253 PANGSV 258


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS13875	HTHFIS	99	2e-24	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 99 bits (249), Expect = 2e-24
Identities = 34/115 (29%), Positives = 56/115 (48%), Gaps = 1/115 (0%)

Query: 447 TLLLLDDEENVLRSLVRLFRRDGYRILAAGNVRDAFDLLATNDVQVILSDQRMSDMSGTE 506
T+L+ DD+ + L + R GY + N + +A D ++++D M D + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 507 FLGRVKMLYPDTVRLVLSGYTDLATVTEAINRGAIYRFLTKPWNDDELREHIRQA 561
L R+K PD LV+S T +A +GA Y +L KP++ EL I +A
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGA-YDYLPKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS13880	PF06580	39	3e-05	Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.5 bits (92), Expect = 3e-05
Identities = 18/84 (21%), Positives = 29/84 (34%), Gaps = 10/84 (11%)

Query: 622 NALRHA---CAGEVHLRLHSI-DADSFRLEVSDDGDGFEPEGPR--GLGLIVMRERAQTV 675
N ++H + L D + LEV + G G GL +RER Q +
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQML 325

Query: 676 GG---TLAIESAPGAGTRVTLRLP 696
G + + G + +P
Sbjct: 326 YGTEAQIKLSEKQG-KVNAMVLIP 348


82	XADLMG695_RS22505	XADLMG695_RS14180	N	Y	N	Pathogenicity Island (unbiased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
XADLMG695_RS22505	4	16	0.019161	response regulator transcription factor
XADLMG695_RS14150	4	16	0.127310	HAMP domain-containing histidine kinase
XADLMG695_RS14155	4	16	0.316142	efflux transporter outer membrane subunit
XADLMG695_RS14170	4	15	0.716967	efflux RND transporter permease subunit
XADLMG695_RS14175	3	13	0.556144	efflux RND transporter periplasmic adaptor
XADLMG695_RS14180	4	13	0.942693	S8 family serine peptidase
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS14185	HTHFIS	94	1e-24	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 93.7 bits (233), Expect = 1e-24
Identities = 25/131 (19%), Positives = 52/131 (39%)

Query: 5 APVVYLIDDDASMRAALEDLFASVGLQVYAFGSTDQFLAHRLHEVPACLVLDIRMPGQSG 64
+ + DDDA++R L + G V + +V D+ MP ++
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 65 MEFHRRMVESGVALPTIFITGHGDIAMSVEAMKNGAIEFLTKPFRDQALLDAIQDGIRRD 124
+ R+ ++ LP + ++ +++A + GA ++L KPF L+ I +
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 125 RARRQSEAVAA 135
+ R +
Sbjct: 123 KRRPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS14200	ACRIFLAVINRP	645	0.0	Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 645 bits (1665), Expect = 0.0
Identities = 233/1034 (22%), Positives = 426/1034 (41%), Gaps = 43/1034 (4%)

Query: 11 QRRGIVWLVFVLIALYGTWSWTQLPVEAYPDIADVTSQVVTQVPGLGAEEVEQQITVPLE 70
+R W++ +++ + G + QLPV YP IA V PG A+ V+ +T +E
Sbjct: 7 RRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVIE 66

Query: 71 RALMGTPGLHVLRSRSLFA-LSLITLVFDDGTEGYFARQRVLERIQAVT--LPYGA-IPG 126
+ + G L + S S A ITL F GT+ A+ +V ++Q T LP G
Sbjct: 67 QNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQQG 126

Query: 127 LDPYTSPTGEIYRYTLES--KTRSLRELSDLQFWTVIPRLQKVPGVADVTNFGGLTTQFS 184
+ S + + S + ++SD V L ++ GV DV FG
Sbjct: 127 ISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA-QYAMR 185

Query: 185 LALEPDRLTRYGVSLQQVKSAITSNNAD------GGGSVMDRGEQSYVIRGIGLLHSLQD 238
+ L+ D L +Y ++ V + + N GG + + + I + ++
Sbjct: 186 IWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEE 245

Query: 239 IGNVVVSSS-NGVPVLVKDLGEVRYDNVERRGILGKDGNPDTIEGIALLLKDSNPSVALQ 297
G V + + +G V +KD+ V I +G P L +N +
Sbjct: 246 FGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKP-AAGLGIKLATGANALDTAK 304

Query: 298 GIHSAVEELNNSVLPKDVKVVPYLDRTALIDATLHTVSATLTEGMLLVCVVLLIFLGSPR 357
I + + EL P+ +KV+ D T + ++H V TL E ++LV +V+ +FL + R
Sbjct: 305 AIKAKLAELQPF-FPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMR 363

Query: 358 AAAIVSLTIPLSLLIAFIFMHHLKIPANLLSLG--AIDFGILVDGAVVLVENVLRLREEN 415
A I ++ +P+ LL F + N L++ + G+LVD A+V+VENV R+ E+
Sbjct: 364 ATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMED 423

Query: 416 SQRALTARDAIDATLQVARPIFFGMAVIGCAYLPLLAFERIEYKLFSPMAYAVGAALIGA 475
A + Q+ + V+ ++P+ F ++ + + +A+ +
Sbjct: 424 KLPPKEA--TEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 476 LLVALTLIPGLAWLAFRKPRKMLH-----------NRALETLGQRYRAVLERSVGRRGWL 524
+LVAL L P L KP H N + Y + + +G G
Sbjct: 482 VLVALILTPALCAT-LLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 525 LACAALALCVLAVLGGSIGRDFLPYIDEGSLWLQVQMPPGITLDKAATMANALRKATL-- 582
L AL + + VL + FLP D+G +Q+P G T ++ + + + L
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 583 EFPEVSYVVTQTGRNDDGTDYWTPSHTEASVGLRPYKDWP-AGMDKQALIAALGARYAQM 641
E V V T G + G + A V L+P+++ +A+I ++
Sbjct: 601 EKANVESVFTVNGFSFSGQ---AQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKI 657

Query: 642 PGYTVSMMQPMIDGVQDKLSGAHSDLTVKVFGDDLQQVRGVADQVAAALHKVPGA-ADIA 700
V +G +L + G + +Q+ + P + +
Sbjct: 658 RDGFVIPFNMPAIVELGTATGFDFEL-IDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 701 VDVEPPLPNLQVRFDRAAAARYGINAADVSDLISTGIGGSPIGQMYLGEKSYDLTVRFPQ 760
+ ++ D+ A G++ +D++ IST +GG+ + + L V+
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADA 776

Query: 761 RYRNDPQAIGALRLRTAAGAEIPLSAVASITTTSGRSVIVREMGRRNIIVRLNVRGRDLS 820
++R P+ + L +R+A G +P SA + G + R G ++ ++
Sbjct: 777 KFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAP---G 833

Query: 821 SFLSDAQATLARQVRVDPQHMQLVWGGQFENLQRAQARLLVVLPTTLCIMFVLLFGAFGN 880
+ DA A + P + W G + + + ++ + ++F+ L + +
Sbjct: 834 TSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYES 893

Query: 881 LRQPTLVLAAVPLAMIGGLAALHLRGMTLNVSSAVGFVALFGVAVLNAVLMLAQIHRLRH 940
P V+ VPL ++G L A L +V VG + G++ NA+L++ L
Sbjct: 894 WSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLME 953

Query: 941 DVGMPLREAVVAGAVSRMRPVLMTATVAALGLAPAMLATGLGSDVQRPLATVVVGGLVTA 1000
G + EA + R+RP+LMT+ LG+ P ++ G GS Q + V+GG+V+A
Sbjct: 954 KEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSA 1013

Query: 1001 TVLTLVLLPSLYYL 1014
T+L + +P + +
Sbjct: 1014 TLLAIFFVPVFFVV 1027



Score = 92.6 bits (230), Expect = 4e-21
Identities = 67/344 (19%), Positives = 137/344 (39%), Gaps = 15/344 (4%)

Query: 682 VADQVAAALHKVPGAADIAVDVEPPLPNLQVRFDRAAAARYGINAADVSDLISTG----I 737
VA V L ++ G D+ + +++ D +Y + DV + +
Sbjct: 158 VASNVKDTLSRLNGVGDVQLFGAQYA--MRIWLDADLLNKYKLTPVDVINQLKVQNDQIA 215

Query: 738 GGSPIGQMYLGEKSYDLTVRFPQRYRNDPQAIGALRLRTAA-GAEIPLSAVASITTTS-G 795
G G L + + ++ R++N P+ G + LR + G+ + L VA +
Sbjct: 216 AGQLGGTPALPGQQLNASIIAQTRFKN-PEEFGKVTLRVNSDGSVVRLKDVARVELGGEN 274

Query: 796 RSVIVREMGRRNIIVRLNVR-GRDLSSFLSDAQATLARQVRVDPQHMQLVWGGQFENLQR 854
+VI R G+ + + + G + +A LA PQ M++++ +
Sbjct: 275 YNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQ 334

Query: 855 AQARLLV---VLPTTLCIMFVLLFGAFGNLRQPTLVLAAVPLAMIGGLAALHLRGMTLNV 911
+V L + + LF N+R + AVP+ ++G A L G ++N
Sbjct: 335 LSIHEVVKTLFEAIMLVFLVMYLF--LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINT 392

Query: 912 SSAVGFVALFGVAVLNAVLMLAQIHRLRHDVGMPLREAVVAGAVSRMRPVLMTATVAALG 971
+ G V G+ V +A++++ + R+ + +P +EA ++ A V +
Sbjct: 393 LTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAV 452

Query: 972 LAPAMLATGLGSDVQRPLATVVVGGLVTATVLTLVLLPSLYYLM 1015
P G + R + +V + + ++ L+L P+L +
Sbjct: 453 FIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATL 496


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
XADLMG695_RS14205	RTXTOXIND	55	1e-10	Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 55.2 bits (133), Expect = 1e-10
Identities = 38/238 (15%), Positives = 76/238 (31%), Gaps = 52/238 (21%)

Query: 115 AELANAYSEAGKARATLEQARLELARQKTLAADSISAARDLQAAQQAFDSAGNDARAASD 174
AE + + + L +L A + + + A N+ R
Sbjct: 214 AERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKS 273

Query: 175 RLAQLGVAAQASSHR--------------------------------------RYVLRAP 196
+L Q+ ++ V+RAP
Sbjct: 274 QLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAP 333

Query: 197 IAGRVVDLSA-ALGGFWNDTSASLMTVADISQVWLTASVPEREVGQVFEGQPVTASLDAY 255
++ +V L GG ++ V + + +TA V +++G + GQ ++A+
Sbjct: 334 VSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF 393

Query: 256 PGQRF---VGHVQHV--DDLLDPAT-------RTLKVRVALTNRDGL-LKPGMFARAQ 300
P R+ VG V+++ D + D +++ T + L GM A+
Sbjct: 394 PYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAE 451



Score = 36.3 bits (84), Expect = 2e-04
Identities = 26/132 (19%), Positives = 46/132 (34%), Gaps = 9/132 (6%)

Query: 80 RLVRVVPPLAGRVVALPKTLGDTVHAGDVLCVLDSAELANAYSEAGKARATLEQARLELA 139
R + P V + G++V GDVL L + A ++ K +++L QARLE
Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTA---LGAEADTLKTQSSLLQARLEQT 151

Query: 140 RQKTLAADSISAARDLQAAQQAFDSAGNDARAASDRLAQLGVAAQASSHRR---YVLRAP 196
R S S + + D + + L + + S + Y
Sbjct: 152 R---YQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELN 208

Query: 197 IAGRVVDLSAAL 208
+ + + L
Sbjct: 209 LDKKRAERLTVL 220


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14210  SUBTILISIN    121    5e-32    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 121 bits (304), Expect = 5e-32
Identities = 72/325 (22%), Positives = 117/325 (36%), Gaps = 43/325 (13%)

Query: 78 NADLAQQAGARGQGVKLAVLDDNLVPSYAPISGKVDSFNDYTASPGTPESSANALRGHGT 137
A G+GVK+AVLD + + ++ ++T GHGT
Sbjct: 30 QAPAVWNQTR-GRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGHGT 88

Query: 138 IVSALVLGSAQDGFAGGVAPDADLFYARICAENSCGTQQTRRAAVDLAAA-GVRIANLSI 196
V+ + + + GVAP+ADL ++ + G + A V I ++S+
Sbjct: 89 HVAGTIAATENENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSL 148

Query: 197 GASYPDATASANAALAWKYALTPLVQADALIVASTGNEGAAEAS-----YPAATPVQEAS 251
G A K A V + L++ + GNEG + YP
Sbjct: 149 GGPEDVPELHE----AVKKA----VASQILVMCAAGNEGDGDDRTDELGYPGCYN----- 195

Query: 252 VRNNWLAVGAINIDSAGNAAGLTSYSNHCGAAAQWCLVAPGTYTAPALAGTELGGQIAGT 311
++VGAIN D + +SN + LVAPG + G + +GT
Sbjct: 196 ---EVISVGAINFDR-----HASEFSNSN---NEVDLVAPGEDILSTVPGGKY-ATFSGT 243

Query: 312 SFSTAAVSGVAAQVLGVYPW-----MSASNLQQTLLTTATDLGDPGVDALYGWGLVNAAK 366
S +T V+G A + + ++ L L+ LG + G GL+
Sbjct: 244 SMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLG--NSPKMEGNGLLYLTA 301

Query: 367 AIKGPGQFASNWAANVTAGYDSTFS 391
+ + + AG ST S
Sbjct: 302 V----EELSRIFDTQRVAGILSTAS 322


83  XADLMG695_RS14535  XADLMG695_RS14565  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS14535  0       21       -1.097904  epoxide hydrolase
XADLMG695_RS14540  0       24       -1.739435  NmrA family NAD(P)-binding protein
XADLMG695_RS14545  -1      23       -1.995781  TetR/AcrR family transcriptional regulator
XADLMG695_RS14550  -1      25       -2.075902  hypothetical protein
XADLMG695_RS14555  0       26       -1.833673  NfuA family Fe-S biogenesis protein
XADLMG695_RS14560  -1      26       -2.320984  4a-hydroxytetrahydrobiopterin dehydratase
XADLMG695_RS14565  -1      28       -2.281792  energy transducer TonB
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14580  HTHFIS        30     0.016    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.016
Identities = 16/76 (21%), Positives = 30/76 (39%)

Query: 85 LNVRSPEPGALPLLLTHGWPGSILEFRDVIGPLSHPVAHGGKASDAFHLVIPSLPGFGFS 144
L+V+ + AL L+ H WPG++ E +++ L+ + + S
Sbjct: 333 LDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPI 392

Query: 145 GKPTARGWGVGRTAAA 160
K AR + + A
Sbjct: 393 EKAAARSGSLSISQAV 408


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14585  NUCEPIMERASE  34     8e-04    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 33.6 bits (77), Expect = 8e-04
Identities = 17/69 (24%), Positives = 30/69 (43%), Gaps = 6/69 (8%)

Query: 9 IVVAGATGNLGYRIAAALKDQGAAVVALVRHGAG------QSRVTALEGRGVQVRRVEFD 62
+V GA G +G+ ++ L + G VV + Q+R+ L G Q +++
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLA 62

Query: 63 DAERLRDAI 71
D E + D
Sbjct: 63 DREGMTDLF 71


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14590  HTHTETR       60     3e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.0 bits (145), Expect = 3e-13
Identities = 28/121 (23%), Positives = 47/121 (38%), Gaps = 2/121 (1%)

Query: 12 RPPPDKAGDVDRRLLDAALQLFLERGFEHTSCEDIARLAGAGKASLYARYANKDAIFEAV 71
R +A + + +LD AL+LF ++G TS +IA+ AG + ++Y + +K +F +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 72 IRRDVQTQPLPAASSAPLDLEARLRLAGRAILAHALQ-PQTVAMMRLVVGTSIRAPELAA 130
R IL H L+ T RL++ E
Sbjct: 63 WELSES-NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 131 E 131
E
Sbjct: 122 E 122


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14595  PF03544       35     4e-04    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 35.3 bits (81), Expect = 4e-04
Identities = 16/111 (14%), Positives = 30/111 (27%), Gaps = 3/111 (2%)

Query: 317 VIPERPQIAAPAARLREISPTVRMPEVAVRPAELPNVPDPAPAPVAAAPIVPATPATPDP 376
+ + P + E P PE P + V P P P
Sbjct: 59 DLEPPQAVQPPPEPVVEPEP---EPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPK 115

Query: 377 RPAPVAAPSAQAAAQPAPSQASPAQSERSSSAAAAASMPAKPAASSHAGPA 427
R + + + + ++++ S+ + P A S P
Sbjct: 116 RDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQ 166



Score = 33.0 bits (75), Expect = 0.003
Identities = 16/101 (15%), Positives = 30/101 (29%), Gaps = 1/101 (0%)

Query: 345 VRPAELPNVPDPAPAPVAAAPIVPATPATPDP-RPAPVAAPSAQAAAQPAPSQASPAQSE 403
V PA+L P P P P+P + APV + +P P +
Sbjct: 55 VAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQP 114

Query: 404 RSSSAAAAASMPAKPAASSHAGPAPADRSGGWDVAANADDW 444
+ + + ++ A P + + +
Sbjct: 115 KRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVAS 155



Score = 30.7 bits (69), Expect = 0.013
Identities = 30/123 (24%), Positives = 40/123 (32%), Gaps = 4/123 (3%)

Query: 154 PTSEASTQAAATAASSPAHAGVS--AAESEPAASPTPMPAQPATDPVERPDMAQAPENAP 211
P S A A P A EP P P+P P PV P+ P
Sbjct: 46 PAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP 105

Query: 212 EPVQAASEPITAEIPQVTVQVPPVTIESPLQVTETPVATNDFVVPPPPTITLTPRAIERA 271
+PV+ +P P + P +P T P ++ PRA+ R
Sbjct: 106 KPVKKVEQPKRDVKPVESRPASPFENTAP--ARPTSSTATAATSKPVTSVASGPRALSRN 163

Query: 272 APQ 274
PQ
Sbjct: 164 QPQ 166


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14610  PF03544       55     7e-12    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 54.6 bits (131), Expect = 7e-12
Identities = 15/103 (14%), Positives = 35/103 (33%), Gaps = 5/103 (4%)

Query: 18 GGCGKSPQQAAAPTVAPTELAAVKTPPPEYSPQLACAGVGGTTVLRVVVGPQGSPTDVSV 77
+ + T + A+ P+Y + + G ++ V P G +V +
Sbjct: 138 TSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQI 197

Query: 78 AQSSGQPVLDEAAQTRVREWQFKAATRNGQAVAQTIQVPVSFK 120
+ + + + +R W+++ V V + FK
Sbjct: 198 LSAKPANMFEREVKNAMRRWRYEPGKPGSGIV-----VNILFK 235


84  XADLMG695_RS14780  XADLMG695_RS14810  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS14780  -2      9        -0.340695  efflux RND transporter permease subunit
XADLMG695_RS22535  -3      9        0.658234   efflux RND transporter permease subunit
XADLMG695_RS14785  -2      12       1.527347   efflux RND transporter periplasmic adaptor
XADLMG695_RS14790  -1      15       2.115644   c-type cytochrome
XADLMG695_RS14795  -1      13       0.912271   cytochrome c
XADLMG695_RS14800  -1      12       0.648890   **hypothetical protein
XADLMG695_RS14805  -2      12       0.503923   ***response regulator
XADLMG695_RS14810  -2      11       0.297209   PAS domain S-box protein
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14820  ACRIFLAVINRP  556    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 556 bits (1435), Expect = 0.0
Identities = 232/1043 (22%), Positives = 448/1043 (42%), Gaps = 59/1043 (5%)

Query: 3 VAAFSIRRPVTTIMCFVSLVVVGLIAAFRLPLEALPDISAPFLFVQLPYTGSTPDEVERN 62
+A F IRRP+ + + L++ G +A +LP+ P I+ P + V Y G+ V+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 LVRPAEEALATMTGIKRMRSTATADG-ANIFIEFSDWDRDIAIAASDARERLDAVRDDFP 121
+ + E+ + + + M ST+ + G I + F D IA + +L P
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQS-GTDPDIAQVQVQNKLQLATPLLP 119

Query: 122 EDLQRFHVFKWSSSDEPVLKVRLAS---QTDLTGAYDMLDREFKRRIERIPGVAKVEISG 178
+++Q+ + SS ++ S T D + K + R+ GV V++ G
Sbjct: 120 QEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179

Query: 179 APPNEVEIAIAPDRLTAHDLSLNDLSERLGKLNFSVSAGQI------DDNGQRIRVQPIG 232
A + I + D L + L+ D+ +L N ++AGQ+ +
Sbjct: 180 AQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 233 ELRDLQELRELVLNAKG----VRLGDIAEVRLKPTRMNYGRRLDGRPAIGLDVYKERSAN 288
++ +E ++ L VRL D+A V L N R++G+PA GL + AN
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 289 LVEVSKAALKEVEDIRAQ-PALRDVQVKVIDNQGKAVTSSLAELAEAGAVGLLLSITVLF 347
++ +KA ++ +++ P ++V + V S+ E+ + ++L V++
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQ--GMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMY 356

Query: 348 FFLRHWPSTLMVTLAIPICFAITLGFMYFVGVTLNILTMMGLLLAVGMLVDNAVVVVESI 407
FL++ +TL+ T+A+P+ T + G ++N LTM G++LA+G+LVD+A+VVVE++
Sbjct: 357 LFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENV 416

Query: 408 YQERERMPDQPQLAALLGTRSVAIALSAGTLCHCIVFVPNLFGETNNISIFMAQIAITIS 467
+ P+ A + AL + VF+P + + Q +ITI
Sbjct: 417 ERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIP-MAFFGGSTGAIYRQFSITIV 475

Query: 468 VSLLASWLVAISLIPMLSARMKTPPMVTSEHG------------VIARLQRRYAKVLAWT 515
++ S LVA+ L P L A + P V++EH Y +
Sbjct: 476 SAMALSVLVALILTPALCATLLKP--VSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKI 533

Query: 516 LAHRG-WSVAGIILVSAISLVPMKLTKVDMFGGDGGNEAFIQYQWKGSYTREQLGEEIGR 574
L G + + ++V+ + ++ ++L + D G Q T+E+ + + +
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQG-VFLTMIQLPAGATQERTQKVLDQ 592

Query: 575 VENYLQANRAK--YHITQIYSWFSEVEGSNTVVTFDASKVKDLPPLLEKIRKELPRSARA 632
V +Y N + + + + N + F + K + E + + A+
Sbjct: 593 VTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKM 652

Query: 633 DYSIGNQG----------DGGNGNQGVQVQLV---GDSTDALKALADDVIPLLAQR-KEL 678
+ G G +L+ G DAL + ++ + AQ L
Sbjct: 653 ELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASL 712

Query: 679 RDVHVDTGDRTSELAIRVDRERAAAFGFSAEQVASFVGLALRGTPLREFRRGDNEVPVWV 738
V + + T++ + VD+E+A A G S + + AL GT + +F ++V
Sbjct: 713 VSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYV 772

Query: 739 RFAGAEQSKPEDLASFTVRTKDGRSVPLLSLVEVQIRPAATQIGRTNRQTTLTIKANLAE 798
+ + PED+ VR+ +G VP + + ++ R N ++ I+ A
Sbjct: 773 QADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAP 832

Query: 799 KVTMPEARAAMEAPLKAMSFPAGYSYTFDGGDYQNDGEAMNQMVFNLVIALVMIYVVMAA 858
+ +A A ME + PAG Y + G + + NQ + I+ V++++ +AA
Sbjct: 833 GTSSGDAMALMENLASKL--PAGIGYDW-TGMSYQERLSGNQAPALVAISFVVVFLCLAA 889

Query: 859 VFESLLFPAAIMSGVLFSIFGVFWLFWITGTSFGIMSFIGILVLMGVVVNNGIVMIEHIN 918
++ES P ++M V I GV + + +G+L +G+ N I+++E
Sbjct: 890 LYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAK 949

Query: 919 NLRRR-GMGRTQALVEGSRERLRPIMMTMGTAILAMVPISLTSTTMFSDGPPYFPMARAI 977
+L + G G +A + R RLRPI+MT IL ++P+++++ + +
Sbjct: 950 DLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGA---GSGAQNAVGIGV 1006

Query: 978 AGGLAFSTVVSLLFLPTIYAILD 1000
GG+ +T++++ F+P + ++
Sbjct: 1007 MGGMVSATLLAIFFVPVFFVVIR 1029


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14825  ACRIFLAVINRP  664    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 664 bits (1715), Expect = 0.0
Identities = 263/1142 (23%), Positives = 481/1142 (42%), Gaps = 137/1142 (11%)

Query: 24 LVAFATRRRVTIAMITVTMLLFGLIALRSLKVNLLPDLSYPTLTVRTEYTGAAPAEIETL 83
+ F RR + ++ + +++ G +A+ L V P ++ P ++V Y GA ++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 84 VTEPVEEAVGVVKNLRKLKSIS-RTGQSDVVLEFAWGTNMDQASLEVRDKMEAL--SLPL 140
VT+ +E+ + + NL + S S G + L F GT+ D A ++V++K++ LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 141 ETKPPVLLRFNPSTEPIMRLALSPKQAPASDTDAIRQLTGLRRYADEDLKKKLEPVAGVA 200
E + + S+ +M + + Y ++K L + GV
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFV-------SDNPGTTQDDISDYVASNVKDTLSRLNGVG 173

Query: 201 AVKVGGGLEDEIQVDIDQQKLAQLNLPIDNVITRLKEENVNISGGRL------EEGSQRY 254
V++ G + +++ +D L + L +VI +LK +N I+ G+L
Sbjct: 174 DVQLFGA-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNA 232

Query: 255 LVRTVNQFVDLDEIRNMLVTTQSSSGSAAEAAMQQMYAIAASTGSQAALAAAAEVQSTSS 314
+ +F + +E + + S
Sbjct: 233 SIIAQTRFKNPEEFGKVTLRVNSD------------------------------------ 256

Query: 315 SSSSIAGGMPVRLKDVAQVRQGYKEREAIIRLGGKEAVELAIYKEGDANTVSTAAALRKR 374
G VRLKDVA+V G + I R+ GK A L I AN + TA A++ +
Sbjct: 257 -------GSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAK 309

Query: 375 LEQLKATVPGDVEITTIEDQSHFIEHAISDVKKDAVIGGVLAILIIFLFLRDGWSTFVIS 434
L +L+ P +++ D + F++ +I +V K +L L+++LFL++ +T + +
Sbjct: 310 LAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPT 369

Query: 435 LSLPVSIITTFFFMGQLGLSLNVMSLGGLALATGLVVDDSIVVLESIAKA-RERGLSVLD 493
+++PV ++ TF + G S+N +++ G+ LA GL+VDD+IVV+E++ + E L +
Sbjct: 370 IAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKE 429

Query: 494 AAIAGTREVSMAVMASTLTTIAVFLPLVFVEGIAGQLFRDQALTVAIAIAISLVVSMTLI 553
A ++ A++ + AVF+P+ F G G ++R ++T+ A+A+S++V++ L
Sbjct: 430 ATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILT 489

Query: 554 PMLSSLKGAPPMAFPDEPSHPDWQPEQRWLKPVAAGRRGAGASVRYGFFGAAWAVVKVWR 613
P L + LKPV+A + GFFG
Sbjct: 490 PALCA----------------------TLLKPVSAEHHEN----KGGFFGWFNTT----- 518

Query: 614 GLSRVVGPVMRKASDLAMAPYARAERGYLAMLPAALRRPWLVLGLAAAAFIGTVFLVPML 673
+ + Y + L L + A G V L L
Sbjct: 519 --------------------FDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRL 558

Query: 674 GADLIPQLAQDRFEMTVKLPSGTPLAQTDAVVRELQ--LAHDKDPGVASLYGVSGSGTRL 731
+ +P+ Q F ++LP+G +T V+ ++ ++ V S++ V+G
Sbjct: 559 PSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVESVFTVNGFSF-- 616

Query: 732 DANPTESGENIGKLTVVMAG-----GGSPAVEAAATERLRSSMVGHPGAQV-DFARPALF 785
+ +N G V + G + EA R + + V F PA+
Sbjct: 617 ----SGQAQNAGMAFVSLKPWEERNGDENSAEAVI-HRAKMELGKIRDGFVIPFNMPAIV 671

Query: 786 SF--STPLEVEL---RGQDLGELERAGQKLAAMLRAN-GHYADVKSTVEEGFPEIQIRFD 839
+T + EL G L +A +L M + V+ E + ++ D
Sbjct: 672 ELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVD 731

Query: 840 QERAGALGLTTRQIADVIVKKVRGDVATRYSFRDRKIDVLVRAQQADRASVDAIRQLIVN 899
QE+A ALG++ I I + G + R R + V+A R + + +L V
Sbjct: 732 QEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVR 791

Query: 900 PGSSRPVRLAAVAEVLATTGPSEIHRADQTRVAIVSASL-KDIDLGGAVREVETMVRKDP 958
+ V +A G + R + + G A+ +E + K P
Sbjct: 792 SANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASKLP 851

Query: 959 LAAGVGMHIGGQGEELAQSVKSLLFAFGLAIFLVYLVMASQFESLLHPFVILFTIPLAMV 1018
AG+G G + S ++ +V+L +A+ +ES P ++ +PL +V
Sbjct: 852 --AGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIV 909

Query: 1019 GAVLALLMTGKPISVVVFIGLILLVGLVTKNAIILIDKVNQLRE-DGVPKREALIEGARS 1077
G +LA + + V +GL+ +GL KNAI++++ L E +G EA + R
Sbjct: 910 GVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRM 969

Query: 1078 RLRPIIMTTLCTLFGFLPLAVAMGEGAEVRAPMAITVIGGLLVSTLLTLLVIPVVYDLLD 1137
RLRPI+MT+L + G LPLA++ G G+ + + I V+GG++ +TLL + +PV + ++
Sbjct: 970 RLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIR 1029

Query: 1138 RR 1139
R
Sbjct: 1030 RC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14830  RTXTOXIND     55     2e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 54.8 bits (132), Expect = 2e-10
Identities = 41/259 (15%), Positives = 87/259 (33%), Gaps = 34/259 (13%)

Query: 33 EGEAKAAEEKKAVDAVPVEIAKAARRAVAASYTGTAALEPRAEAQVVAKTSGVALSVMVE 92
+ E +++ V + + Q +AK +V+ +
Sbjct: 204 QKELNLDKKRAERLTV-LARINRYENLSRVEKSRLDDFSSLLHKQAIAK-----HAVLEQ 257

Query: 93 EGQKVSAGQALVRLDPDRAHL--AVAQSEAQLRKLENSYRRATQLVGQQLVSA-ADVDQL 149
E + V A L + + ++ + + + ++ + +L ++ L
Sbjct: 258 ENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN---EILDKLRQTTDNIGLL 314

Query: 150 KFDVENSRAQHRLASLELSYTTVQAPISGVIASRSIKT-GNFVQINTPIFRIV-DDSQLE 207
+ + ++AP+S + + T G V + IV +D LE
Sbjct: 315 -------TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLE 367

Query: 208 ATLNVPERELATLKSGQPVTLLADALPGQQF---VGKVDRIAP--VVDSGSGT-FRVVCA 261
T V +++ + GQ + +A P ++ VGKV I + D G F V+ +
Sbjct: 368 VTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIIS 427

Query: 262 FGQGAEA-------LQPGM 273
+ + L GM
Sbjct: 428 IEENCLSTGNKNIPLSSGM 446



Score = 43.7 bits (103), Expect = 8e-07
Identities = 16/74 (21%), Positives = 33/74 (44%), Gaps = 9/74 (12%)

Query: 78 VVAKTSGVALSVMVEEGQKVSAGQALVRLDPDRAHLAVAQSEAQLRKLENSYR--RATQL 135
+ + + ++V+EG+ V G L++L +EA K ++S R Q
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTA-------LGAEADTLKTQSSLLQARLEQT 151

Query: 136 VGQQLVSAADVDQL 149
Q L + ++++L
Sbjct: 152 RYQILSRSIELNKL 165


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14890  HTHFIS        80     2e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.9 bits (197), Expect = 2e-17
Identities = 32/130 (24%), Positives = 58/130 (44%), Gaps = 4/130 (3%)

Query: 1002 RVLLVDDDQDSREAVMQFLMLAGAQVQAAGSVDAAEHCLANAHFDVLVSDIAMPLRDGYD 1061
+L+ DDD R + Q L AG V+ + +A D++V+D+ MP + +D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1062 LIRTVRSGRADLPRHIPAIALTAYVREEDRDRAVVAGFDAHMGKPVEPPGLVDLIERLIL 1121
L+ ++ R DLP + ++A +A G ++ KP + L+ +I R +
Sbjct: 65 LLPRIKKARPDLPV----LVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 1122 PTHAVRAALE 1131
+ LE
Sbjct: 121 EPKRRPSKLE 130


85  XADLMG695_RS14840  XADLMG695_RS14910  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS14840  0       9        -0.841604  class III poly(R)-hydroxyalkanoic acid synthase
XADLMG695_RS14855  0       9        -0.987562  CDP-alcohol phosphatidyltransferase family
XADLMG695_RS14885  -1      8        -0.591351  3-hydroxybutyrate dehydrogenase
XADLMG695_RS14890  -1      8        -0.511950  8-oxo-dGTP diphosphatase
XADLMG695_RS14900  -3      10       -1.104250  DUF1249 domain-containing protein
XADLMG695_RS14905  -3      10       -0.915573  kinase/pyrophosphorylase
XADLMG695_RS14910  -1      13       -0.534795  phosphoenolpyruvate synthase
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14925  RTXTOXIND     33     0.002    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 32.9 bits (75), Expect = 0.002
Identities = 26/158 (16%), Positives = 47/158 (29%), Gaps = 17/158 (10%)

Query: 151 REENAPWLDMPAFGLNRN----HQSRLQKLARAQ----QEFQAQSEAYGEQLKAAIEQAF 202
P L +P +N RL L + Q Q + Q E ++ +A
Sbjct: 161 ELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVL 220

Query: 203 ARFASKLSEHESSGSQLTSARALFD------LWIEAAEESYADVALSEQFRKVYGGFANA 256
AR + S+L +L + E Y + +VY
Sbjct: 221 ARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEA---VNELRVYKSQLEQ 277

Query: 257 HMRLRAALQEEVEQLSERFGMPTRSEMDAAHRRIAELE 294
+ +EE + +++ F ++ I L
Sbjct: 278 IESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLT 315


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14935  DHBDHDRGNASE  101    7e-28    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 101 bits (252), Expect = 7e-28
Identities = 71/256 (27%), Positives = 109/256 (42%), Gaps = 13/256 (5%)

Query: 2 RSILITGAGSGIGAGIASQLAADGHHLLVSDVQLAAAERTADALRQVGGSAEALALDVTD 61
+ ITGA GIG +A LA+ G H+ D E+ +L+ AEA DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 62 ANSIAQALAHASRAPQ---VLVNNAGLQQVAALEEFPMQQWALLVDVMLTGAARLSRAVL 118
+ +I + A R +LVN AG+ + + ++W V TG SR+V
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 119 PGMRAAGYGRIVNIGSIHSLVASPYKSAYVAAKHGLVGLAKVIALETADCDITVNTLCPS 178
M G IV +GS + V +AY ++K V K + LE A+ +I N + P
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 179 YVRT----PLVERQIADQARTRGIAEDAVVRDVMLKPMPKGAFIEYDELAGTVAFLMSHA 234
T L + + +G E +P + ++A V FL+S
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKT------GIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 235 ARNITGQSIAIDGGWT 250
A +IT ++ +DGG T
Sbjct: 243 AGHITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14945  BACTRLTOXIN   29     0.011    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 28.7 bits (64), Expect = 0.011
Identities = 7/30 (23%), Positives = 14/30 (46%)

Query: 73 YDLCDPVTGEPDPSAYVRLYRDARQAETTH 102
YD+ + D S Y+ +Y D + ++
Sbjct: 225 YDMMPAPGDKFDQSKYLMMYNDNKTVDSKS 254


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14950  CLENTEROTOXN  32     0.003    Clostridium enterotoxin signature.
		>CLENTEROTOXN#Clostridium enterotoxin signature.

Length = 319

Score = 31.6 bits (71), Expect = 0.003
Identities = 14/64 (21%), Positives = 21/64 (32%), Gaps = 2/64 (3%)

Query: 3 TIRPVFYVSDGTGITAETIGHSLLTQF--SGFNFVTDRMSFIDDADKARDAAMRVRAAGE 60
+ V+ G T+E I S+ F + T S A +V A
Sbjct: 78 SKEVSINVNFSVGFTSEFIQASVEYGFGITIGEQNTIERSVSTTAGPNEYVYYKVYATYR 137

Query: 61 RYQV 64
+YQ
Sbjct: 138 KYQA 141


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS14955  PHPHTRNFRASE  279    2e-86    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 279 bits (715), Expect = 2e-86
Identities = 138/574 (24%), Positives = 234/574 (40%), Gaps = 89/574 (15%)

Query: 260 KAIRMVYSDVPGERVRTEDTPAE---LRSTFSISDEDVQELSKQAL---------VIEKH 307
KA + +V E+ D E L + S E+++ + Q + H
Sbjct: 18 KAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTEASMGADKAEIFAAH 77

Query: 308 YGRPMDIEWAKDGVSGKLFIVQARPETVKSRSHATQIERFALEAKGAKILAEGRAVGAKI 367
D E + GK+ Q E + F E+ + + E RA A I
Sbjct: 78 LLVLDDPELVDG-IKGKIENEQMNAEYALKEVSDMFVSMF--ESMDNEYMKE-RA--ADI 131

Query: 368 GSGVARVVRSLDDMNRVQAGD-----VLIA-DMTDPDWEPVMK-RASAIVTNRGGRTCHA 420
RV+ L + V+IA D+T D + K T+ GGRT H+
Sbjct: 132 RDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSHS 191

Query: 421 AIIARELGVPAVVGSGNATAVIKDGQEVTVSCAEG---------DTGFIYEGKLAFERTT 471
AI++R L +PAVVG+ T I+ G V V EG + E + AFE+
Sbjct: 192 AIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYEEKRAAFEKQK 251

Query: 472 TDLGNMPPAP--------LKIMMNVANPERAFDFGQLPNAGIGLARLEMIIASHIGVHPN 523
+ + P +++ N+ P+ GIGL R E + +
Sbjct: 252 QEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFLYMDR-----D 306

Query: 524 ALLEYDKQDADVRKKIDAKIAGYGDPVSFYVNRLAEGIATLTASVAPNTVIVRLSDFKSN 583
L ++Q ++ + G P V++R D +
Sbjct: 307 QLPTEEEQFEAYKEVVQRM---DGKP-----------------------VVIRTLDIGGD 340

Query: 584 EYANLIGGSRYEPHEENPMIGFRGASRYVAPSFTKAFALECKAVLKVRNEMGLDNLWVMI 643
+ + + P E NP +GFR + F + +A+L+ NL VM
Sbjct: 341 KELSYL----QLPKELNPFLGFRAIRLCL--EKQDIFRTQLRALLRAS---TYGNLKVMF 391

Query: 644 PFVRTLEEGRKVIEVLEQNGLKQGENG------LKIIMMCELPSNALLADEFLEIFDGFS 697
P + TLEE R+ ++++ K G +++ +M E+PS A+ A+ F + D FS
Sbjct: 392 PMIATLEELRQAKAIMQEEKDKLLSEGVDVSDSIEVGIMVEIPSTAVAANLFAKEVDFFS 451

Query: 698 IGSNDLTQLTLGLDRDSSIVAHLFDERNLAVKKLLSLAIKSARAKGKYVGICGQGPSDHP 757
IG+NDL Q T+ DR + V++L+ + A+ +L+ + IK+A ++GK+VG+CG+ D
Sbjct: 452 IGTNDLIQYTMAADRMNERVSYLYQPYHPAILRLVDMVIKAAHSEGKWVGMCGEMAGD-E 510

Query: 758 ELAEWLMQEGIESVSLNPDTVVDTWLRLAKLKSE 791
L+ G++ S++ +++ +L KL E
Sbjct: 511 VAIPLLLGLGLDEFSMSATSILPARSQLLKLSKE 544


86  XADLMG695_RS14955  XADLMG695_RS14985  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS14955  1       12       -0.790350  MFS transporter
XADLMG695_RS14960  1       12       -1.190880  amidohydrolase
XADLMG695_RS14965  1       15       -0.755630  DoxX family protein
XADLMG695_RS14970  0       13       -0.411268  hydrolase
XADLMG695_RS14975  0       13       0.294264   helix-turn-helix transcriptional regulator
XADLMG695_RS14980  0       11       0.024012   sensor histidine kinase
XADLMG695_RS14985  -1      12       0.102375   response regulator transcription factor
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS15000  TCRTETB       45     6e-07    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 44.9 bits (106), Expect = 6e-07
Identities = 36/163 (22%), Positives = 68/163 (41%), Gaps = 8/163 (4%)

Query: 24 LWLAILG--SNIGTWINDVAASWVMAEQTGSPLMVAAVQSATTLPVVLLALVAGTLADIV 81
+WL IL S + + +V+ + + P V +A L + V G L+D +
Sbjct: 17 IWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQL 76

Query: 82 DRRRYLLLTQAWMLLVAGLLALLAHLQLLTPWVLVALTFAMGVGAAMAMPAQAAIVSELV 141
+R LL +++ +++ + +L+ F G GAA +V+ +
Sbjct: 77 GIKRLLLFG----IIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYI 132

Query: 142 PRPMLASAVALNSIGMNIARSIGPAVGGLIVAQFGPPWAFLLN 184
P+ A L + + +GPA+GG+I W++LL
Sbjct: 133 PKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIH--WSYLLL 173


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS15005  UREASE        35     6e-04    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 35.5 bits (82), Expect = 6e-04
Identities = 25/84 (29%), Positives = 34/84 (40%), Gaps = 9/84 (10%)

Query: 32 SSAPGKSPMVDLVVRNARITTLDPRQPTATAIAVADGRIVAVGD-------DAKIMALAR 84
S + VD V+ NA I LD I + DGRI A+G + +
Sbjct: 59 SQVTREGGAVDTVITNALI--LDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGP 116

Query: 85 GVRAIDAQGRRLLPGLNDSHTHLI 108
G I +G+ + G DSH H I
Sbjct: 117 GTEVIAGEGKIVTAGGMDSHIHFI 140


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS15015  ISCHRISMTASE  43     3e-07    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 43.1 bits (101), Expect = 3e-07
Identities = 27/139 (19%), Positives = 56/139 (40%), Gaps = 5/139 (3%)

Query: 53 KGFKVPVILTTVAEKSFSGPLFPELPDIFPGEPVFDRTSMNAWEDQGVIDRVNALGKQRL 112
F P + + E+ L PE + V + +A++ +++ + G+ +L
Sbjct: 92 TDFWGPGLNSGPYEEKIITELAPE-----DDDLVLTKWRYSAFKRTNLLEMMRKEGRDQL 146

Query: 113 VIAGLWTSVCIVGPTLSAIEQGFQVYVITDACGDVSDEAHERAVTRMVQAGAAPMTSVQY 172
+I G++ + + A + + + + DA D S E H+ A+ A + +
Sbjct: 147 IITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCAFTVMTDSL 206

Query: 173 LLELQRDWARSDTYALTTG 191
L +LQ A + TG
Sbjct: 207 LDQLQNAPADVQKTSANTG 225


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS15030  HTHFIS        73     4e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.6 bits (178), Expect = 4e-17
Identities = 33/132 (25%), Positives = 59/132 (44%), Gaps = 5/132 (3%)

Query: 8 TEVLVVDDHPLLRDGLSAMLAAE-HDMRVVGEAEDGEQAVACYTRLRPDVVLMDLQMPRV 66
+LV DD +R L+ L+ +D+R+ A + +A D+V+ D+ MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA---AGDGDLVVTDVVMPDE 60

Query: 67 DGVQAIQRIRQVDSAAKVIVLTTYTGDVRAVRALQAGACGYLLKSALRRELVDTI-RDVR 125
+ + RI++ V+V++ + A++A + GA YL K EL+ I R +
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 126 RGQRRHVPASVA 137
+RR
Sbjct: 121 EPKRRPSKLEDD 132


87  XADLMG695_RS15640  XADLMG695_RS15665  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS15640  -2      12       0.253962   beta-ketoacyl-ACP reductase
XADLMG695_RS15645  -2      11       -0.166255  polyhydroxyalkanoate synthesis repressor PhaR
XADLMG695_RS15650  -1      11       0.283856   TraB/GumN family protein
XADLMG695_RS15655  -1      11       1.016919   DUF1684 domain-containing protein
XADLMG695_RS15660  -2      11       1.373054   DNA mismatch repair endonuclease MutL
XADLMG695_RS15665  -3      10       1.221768   N-acetylmuramoyl-L-alanine amidase
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS15665  DHBDHDRGNASE  137    1e-41    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 137 bits (345), Expect = 1e-41
Identities = 82/252 (32%), Positives = 123/252 (48%), Gaps = 10/252 (3%)

Query: 4 RVALVTGGTGGIGTAICKRLADQGHRVASNFRNEEKARDWQQRMQAQGYEVALFRGDVAS 63
++A +TG GIG A+ + LA QG +A+ N EK ++A+ F DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 64 SEHARALVEEVEASLGPIEVLVNNAGITRDTTFHRMSAEQWHEVINTNLNSVFNVTRPVI 123
S + +E +GPI++LVN AG+ R H +S E+W + N VFN +R V
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 124 EGMRKRGWGRVIQISSINGLKGQYGQANYAAAKAGMHGFTISLARENAAFGVTVNTVSPG 183
+ M R G ++ + S + A YA++KA FT L E A + + N VSPG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 184 YVATDM--VMAVPEEVRAKIVA--------DIPTGRLGRPEEIAYAVAFLVAEEAAWITG 233
TDM + E +++ IP +L +P +IA AV FLV+ +A IT
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248

Query: 234 SNLDINGGHHMG 245
NL ++GG +G
Sbjct: 249 HNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS15670  cloacin       27     0.037    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 27.4 bits (60), Expect = 0.037
Identities = 14/44 (31%), Positives = 15/44 (34%), Gaps = 1/44 (2%)

Query: 143 GAGFGRPGGPG-TPPNPPGASGLGSGPMGTGTHGSAGGNHGTTG 185
G G G G G + N P G GSG G G G
Sbjct: 28 GVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS15690  CHANLCOLICIN  30     0.022    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 30.4 bits (68), Expect = 0.022
Identities = 27/92 (29%), Positives = 41/92 (44%), Gaps = 3/92 (3%)

Query: 322 LQDALAHTRAGVTPNSIGSEGATDMGGTFGGMGSLAGSGAPRDGSTSGSGNGTYGYASWT 381
++ A+A+ + GV + G T + GT G GS G G + GS S S + A W+
Sbjct: 1 METAVAYYKDGVPYDDKGQVIITLLNGTPDGSGS--GGGGGKGGSKSESSAAIHATAKWS 58

Query: 382 PSQTPLGLRVDEARAAYSALYAAPPSSAQQSA 413
+Q ARA +A A + A + A
Sbjct: 59 TAQLKKTQAEQAARAK-AAAEAQAKAKANRDA 89


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS15695  TONBPROTEIN   33     0.002    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 33.0 bits (75), Expect = 0.002
Identities = 30/139 (21%), Positives = 42/139 (30%), Gaps = 1/139 (0%)

Query: 139 SAVAAAAPTAVPAPRPLNAQAEAARATAALAASAQRASSVPPPQPS-TPPPAPPVPASAM 197
+ VA T+V L A A+ T A + +V PP P P P
Sbjct: 22 AVVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEP 81

Query: 198 PTVTQAPVPTTVATGVPTPRPATSASAPAPTGVAGNAPNRASVTNANANANVASGAGVAG 257
P + P P+P V AS A A + S A
Sbjct: 82 PKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAA 141

Query: 258 SSASAAAILNGGRAPMGAP 276
+S ++ +G RA
Sbjct: 142 TSKPVTSVASGPRALSRNQ 160


88  XADLMG695_RS17290  XADLMG695_RS17320  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS17290  -1      13       1.213203   HlyD family efflux transporter periplasmic
XADLMG695_RS17295  0       11       1.998309   TolC family protein
XADLMG695_RS17300  -1      11       2.274876   hypothetical protein
XADLMG695_RS17305  -1      12       2.166128   prephenate dehydrogenase
XADLMG695_RS17310  0       11       1.628286   pyridoxal kinase
XADLMG695_RS17315  -1      10       1.909075   molecular chaperone DnaJ
XADLMG695_RS17320  0       10       1.952539   molecular chaperone DnaK
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS17320  RTXTOXIND     75     5e-17    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 75.3 bits (185), Expect = 5e-17
Identities = 29/194 (14%), Positives = 70/194 (36%), Gaps = 18/194 (9%)

Query: 16 AMRPPS-IAKAVAWMLLIGIGIAAAILALAPWVQTASGKGQVVSLDPSDRQQPVTAFVPG 74
P S + VA+ ++ + IA + L A+ G++ S R + +
Sbjct: 49 IETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLT---HSGRSKEIKPIENS 105

Query: 75 RVERWYVHDGQHVSKGDPIARVGDLDPDLLTRLASERAQAQAEIAAIQQSRAVASIDVAR 134
V+ V +G+ V KGD + ++ L + + Q+ A ++Q+R
Sbjct: 106 IVKEIIVKEGESVRKGDVLLKLTALG----AEADTLKTQSSLLQARLEQTRYQILSRSIE 161

Query: 135 SRQLLAEGLAGRRDYELTQIKVAEADAKLAES-----RAKLTRIDIQLNRQSAQLVRAPR 189
+L L ++ + L + + + + ++ L+++ A+
Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAER----- 216

Query: 190 DGRVQQLNAASGSA 203
+ ++N +
Sbjct: 217 LTVLARINRYENLS 230



Score = 73.7 bits (181), Expect = 2e-16
Identities = 32/175 (18%), Positives = 63/175 (36%), Gaps = 20/175 (11%)

Query: 104 LTRLASERAQAQAEIAAIQQSRAVASIDVARSRQLLAEGLAGRRDYELTQIKVAEADAKL 163
+E ++++ I+ A + QL K+ + +
Sbjct: 261 YVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF---------KNEILDKLRQTTDNI 311

Query: 164 AESRAKLTRIDIQLNRQSAQLVRAPRDGRVQQLNAASGSAMVSPGTVLAVIAPERVERAV 223
+L + + + ++RAP +VQQL + +V+ L VI PE V
Sbjct: 312 GLLTLELAKNEERQQAS---VIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEV 368

Query: 224 ELYIDGRDVPLIRPGRPVRLEFEGWPAIQFSGWPSVAHGMFDGRVRAIDPNAAPD 278
+ +D+ I G+ I+ +P +G G+V+ I+ +A D
Sbjct: 369 TALVQNKDIGFINVGQNAI--------IKVEAFPYTRYGYLVGKVKNINLDAIED 415


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS17325  RTXTOXIND     34     0.001    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 34.0 bits (78), Expect = 0.001
Identities = 25/196 (12%), Positives = 61/196 (31%), Gaps = 24/196 (12%)

Query: 287 RAVLTRIDQATARLMLAQNDLKPRLDVSVEVSKDLGPPGVGGPNRSLTDAIIGFRFSVPL 346
+ R++Q +++ +L ++ + + V ++I +FS
Sbjct: 142 SLLQARLEQTRYQILSRSIELNKLPELKL--PDEPYFQNVSEEEVLRLTSLIKEQFST-W 198

Query: 347 ENRAARGRV--AEARAEIEALDQRSRFLRDQISIEVESIVISLNAAERLAK--------I 396
+N+ + + + RAE + R + +E L+ L +
Sbjct: 199 QNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSR----LDDFSSLLHKQAIAKHAV 254

Query: 397 ADEERGLAD---RLAAAERRRFELGSG----DFFLVNQREETANDARVRLIDAQARIASA 449
++E + L + + ++ S + N+ +L I
Sbjct: 255 LEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLL 314

Query: 450 RAELAAATADRDALQL 465
ELA + A +
Sbjct: 315 TLELAKNEERQQASVI 330


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS17340  PF04183       32     0.002    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 32.2 bits (73), Expect = 0.002
Identities = 17/95 (17%), Positives = 29/95 (30%), Gaps = 11/95 (11%)

Query: 106 SLANGEAFADWLEQTLPQAPQLRYCLDPVIGDTHTGPYVEPGLERVFAERLLPHAWLVTP 165
+A G + WL+Q L ++G+ G G A P+ +
Sbjct: 291 YIAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYVSHEGYA---ALARAPYRY---- 343

Query: 166 NAFELG---RLTGLPSLQQGDAIVAARALLARGPQ 197
LG R L+ ++ V L+
Sbjct: 344 -QEMLGVIWRENPCRWLKPDESPVLMATLMECDEN 377


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS17350  SHAPEPROTEIN  137    5e-38    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.
Length = 347

Score = 137 bits (348), Expect = 5e-38
Identities = 82/392 (20%), Positives = 145/392 (36%), Gaps = 91/392 (23%)

Query: 5 IGIDLGTTNSCVSIMDGGKARVIENSEGDRTTPSIVAYTKDGE------VLVGASAKRQA 58
+ IDLGT N+ + + G + E PS+VA +D VG AK+
Sbjct: 13 LSIDLGTANTLIYVKGQGIV-LNE--------PSVVAIRQDRAGSPKSVAAVGHDAKQML 63

Query: 59 VTNPKNTFYAVKRLIGRKFTDGEVQKDISHVPYGILAHDNGDAWVQTSDAKRMAPQEISA 118
P N A++ + KD + + +++
Sbjct: 64 GRTPGN-IAAIRPM-----------KDGVIADFFV-------------------TEKMLQ 92

Query: 119 RVLEKMKKTAEDFLGEKVTEAVITVPAYFNDSQRQATKDAGRIAGLDVKRIINEPTAAAL 178
++++ + ++ VP +R+A +++ + AG +I EP AAA+
Sbjct: 93 HFIKQVHS---NSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAI 149

Query: 179 AYGLDKNGGDRKIAVYDLGGGTFDVSIIEIAEVDGEKQFEVLATNGDTFLGGEDFDNRVI 238
GL + V D+GGGT +V++I + V + +GG+ FD +I
Sbjct: 150 GAGLPVS-EATGSMVVDIGGGTTEVAVISLNGV---------VYSSSVRIGGDRFDEAII 199

Query: 239 EYLVDEFNKDQGIDLRKDPLALQRLKDAAERAKIELSSS----QQTEVNLPYVTADASGP 294
Y+ + G + AER K E+ S+ + E+ + P
Sbjct: 200 NYVRRNYGSLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVP 246

Query: 295 KHLNIKLTRAKLEALVE------DLVKKSIEPCRTALNDAGLRASDINE--VILVGGQTR 346
+ + + LEAL E V ++E C L ASDI+E ++L GG
Sbjct: 247 RGFTLN-SNEILEALQEPLTGIVSAVMVALEQCPPEL------ASDISERGMVLTGGGAL 299

Query: 347 MPKVQQAVADFFGKEPRKDVNPDEAVAVGAAI 378
+ + + + + G +P VA G
Sbjct: 300 LRNLDRLLMEETGIPVVVAEDPLTCVARGGGK 331


89  XADLMG695_RS17440  XADLMG695_RS17480  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS17440  7       31       -0.399984  hypothetical protein
XADLMG695_RS22695  0       21       0.216987   GGDEF domain-containing protein
XADLMG695_RS17450  -1      19       0.334405   efflux transporter outer membrane subunit
XADLMG695_RS17455  -1      16       -0.166003  SDR family oxidoreductase
XADLMG695_RS23380  -1      13       0.934165   efflux RND transporter permease subunit
XADLMG695_RS17465  -2      12       0.744693   efflux RND transporter periplasmic adaptor
XADLMG695_RS17470  -2      12       0.017020   hypothetical protein
XADLMG695_RS17475  -1      14       0.085036   LysR family transcriptional regulator
XADLMG695_RS17480  -1      12       0.811402   OmpA family protein
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS17475  STREPKINASE   29     0.004    Streptococcus streptokinase protein signature.
		>STREPKINASE#Streptococcus streptokinase protein signature.

Length = 440

Score = 28.5 bits (63), Expect = 0.004
Identities = 16/45 (35%), Positives = 21/45 (46%)

Query: 18 FWLSESAMPTREELATRLDALQEQLPKLSADEDADFDYLDFQARA 62
F AM + E A L A+QEQL D F+ +DF + A
Sbjct: 89 FATDSGAMSHKLEKADLLKAIQEQLIANVHSNDDYFEVIDFASDA 133


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS17490  DHBDHDRGNASE  92     2e-24    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 92.0 bits (228), Expect = 2e-24
Identities = 66/216 (30%), Positives = 93/216 (43%), Gaps = 17/216 (7%)

Query: 5 KIALVTGATRGIGLETVRQLAQAGVHTLLAGRKRDDAVAAALKLQAEGLPVEAIQLDVND 64
KIA +TGA +GIG R LA G H + L+AE EA DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 65 DISVAAAVGTVEQRHGHLDILINNAGIMLDDLQRTPSQQ-SLEVWKRTFDTNLFAVVEVT 123
++ +E+ G +DIL+N AG+ L+ S E W+ TF N V +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGV----LRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 124 KAFLPLLRRSLAGRIVNVSSLLGSLTLHSQPGSPIYDFKIPAYNISKSALNSWTVHLAHE 183
++ + +G IV V S + G P + AY SK+A +T L E
Sbjct: 125 RSVSKYMMDRRSGSIVTVGS--------NPAGVP--RTSMAAYASSKAAAVMFTKCLGLE 174

Query: 184 LRDTAIKVNAVHPGSVKTDMNGGGELEVEQGAASSV 219
L + I+ N V PGS +TDM L ++ A V
Sbjct: 175 LAEYNIRCNIVSPGSTETDMQ--WSLWADENGAEQV 208


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS17495  ACRIFLAVINRP  1050   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1050 bits (2716), Expect = 0.0
Identities = 434/1041 (41%), Positives = 642/1041 (61%), Gaps = 20/1041 (1%)

Query: 4 SRFFIDRPIFAAVLSIIIFAAGLIAMPLLPISEYPEVVPPSVQVRAVYPGANPKVIAETV 63
+ FFI RPIFA VL+II+ AG +A+ LP+++YP + PP+V V A YPGA+ + + +TV
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 64 ATPLEEAINGVENMMYMKSVAGSDGVLVVTVTFKPGTDPDQAQVQVQNRVSQAQARLPED 123
+E+ +NG++N+MYM S + S G + +T+TF+ GTDPD AQVQVQN++ A LP++
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 124 VRRQGVTTQKQSPTLTMVVHLTSPKGKYNSLYLSNYATLKVKDELSRLPGVGQIQIFGAG 183
V++QG++ +K S + MV S +S+Y VKD LSRL GVG +Q+FG
Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG-A 180

Query: 184 DYAMRIWLNPDKVAARGLTASDVVAAIREQNVQVSAGQLGAEPMPNKSDFLLSINAQGRL 243
YAMRIWL+ D + LT DV+ ++ QN Q++AGQLG P SI AQ R
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 244 TTEEEFGNIVIRSGNSGEIVRLSDVARLELGAGNYTLRSQLDNQNAVGMGVFQSPGANAI 303
EEFG + +R + G +VRL DVAR+ELG NY + ++++ + A G+G+ + GANA+
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 304 ELSDAVRAKMAELEKQFPQDMAWSAAYDPTVFVRDSISAVVHTLLEAVLLVVLVVILFLQ 363
+ + A++AK+AEL+ FPQ M YD T FV+ SI VV TL EA++LV LV+ LFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 364 TWRASIIPLLAVPVSVVGTFAALYLLGFSINTLSLFGLVLAIGIVVDDAIVVVENVER-N 422
RA++IP +AVPV ++GTFA L G+SINTL++FG+VLAIG++VDDAIVVVENVER
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 423 IEEGLSPLAAAHQAMREVSGPIIAIALVLCAVFVPMAFLSGVTGQFYKQFAVTIAISTVI 482
+E+ L P A ++M ++ G ++ IA+VL AVF+PMAF G TG Y+QF++TI + +
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 483 SAINSLTLSPALAAMLLKLHDAPKDGPSRLIDRLFGWLFRPFNRFFNTSSHKYQGAVSRA 542
S + +L L+PAL A LLK FGW FN F+ S + Y +V +
Sbjct: 481 SVLVALILTPALCATLLK---PVSAEHHENKGGFFGW----FNTTFDHSVNHYTNSVGKI 533

Query: 543 LGKRGAVFVVYLLLLVGTGFMFKLVPGGFIPTQDKLYLIAGTKLPEGSSLERTNEVIRQI 602
LG G ++Y L++ G +F +P F+P +D+ + +LP G++ ERT +V+ Q+
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593

Query: 603 TQIALQT--DGVDHAIAFPGLNPLQFTNTPNTGTVFLTLKPFSQRSR---TAAQINAEIN 657
T L+ V+ G + N G F++LKP+ +R+ +A +
Sbjct: 594 TDYYLKNEKANVESVFTVNGFSFS--GQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAK 651

Query: 658 ARISQIQQGFAFAFMPPPILGLGQGSGYSLYIQDRAGLGYGQLQSAVNAMSGAISQTPG- 716
+ +I+ GF F P I+ LG +G+ + D+AGLG+ L A N + G +Q P
Sbjct: 652 MELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPAS 711

Query: 717 MQFPIGTYQANVPQLDAKVDRDKAKAQGVPLTNLFDTLQTYLGSSYINDFNRFGRTYQVI 776
+ + Q +VD++KA+A GV L+++ T+ T LG +Y+NDF GR ++
Sbjct: 712 LVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLY 771

Query: 777 AQADGQFRDSVEDIANLRTRNANGDMVPIGSMVTLGQTYGPDPVIRYNGYPAADLIGEAD 836
QAD +FR ED+ L R+ANG+MVP + T YG + RYNG P+ ++ GEA
Sbjct: 772 VQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAA 831

Query: 837 PRVLSSTEAMQKLSSMAPQVLPNGMNIEWTDLSYQQSTQGNSALIVFPMAVLLAFLVLAA 896
P SS +AM + ++A + LP G+ +WT +SYQ+ GN A + ++ ++ FL LAA
Sbjct: 832 PGT-SSGDAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAA 889

Query: 897 LYESWTLPLAVILIVPMTLLSALFGVWLTGGDNNVFVQVGLVVLMGLACKNAILIVEFAR 956
LYESW++P++V+L+VP+ ++ L L N+V+ VGL+ +GL+ KNAILIVEFA+
Sbjct: 890 LYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAK 949

Query: 957 EL-EMHGKGIVEAALEACRLRLRPIVMTSIAFIAGTVPLVFGHGAGAEVRSVTGITVFAG 1015
+L E GKG+VEA L A R+RLRPI+MTS+AFI G +PL +GAG+ ++ GI V G
Sbjct: 950 DLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGG 1009

Query: 1016 MLGVTLFGLFLTPVFYVALRK 1036
M+ TL +F PVF+V +R+
Sbjct: 1010 MVSATLLAIFFVPVFFVVIRR 1030



Score = 101 bits (253), Expect = 8e-24
Identities = 60/321 (18%), Positives = 124/321 (38%), Gaps = 17/321 (5%)

Query: 185 YAMRIWLNPDKVAARGLTASDVVAAIREQNVQVSAGQLGAEPMPNKS-DFLLSINAQGRL 243
++ ++ +K A G++ SD+ I + + + + + +A+ R+
Sbjct: 724 AQFKLEVDQEKAQALGVSLSDINQTI---STALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 244 TTEEEFGNIVIRSGNSGEIVRLSDVARLELGAGNYTLRSQLDNQNAVGMGVFQSPGANAI 303
E+ + +RS +GE+V S G+ +L+ N + Q A
Sbjct: 781 LPED-VDKLYVRS-ANGEMVPFSAFTTSHWVYGS----PRLERYNGLPSMEIQGEAAPGT 834

Query: 304 ELSDAVRAKMAELEKQFPQD--MAWSAAYDPTVFVRDSISAVVHTLLEAVLLVVLVVILF 361
DA A M L + P W+ R S + + + ++V L +
Sbjct: 835 SSGDA-MALMENLASKLPAGIGYDWTGMSYQ---ERLSGNQAPALVAISFVVVFLCLAAL 890

Query: 362 LQTWRASIIPLLAVPVSVVGTFAALYLLGFSINTLSLFGLVLAIGIVVDDAIVVVENV-E 420
++W + +L VP+ +VG A L + + GL+ IG+ +AI++VE +
Sbjct: 891 YESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKD 950

Query: 421 RNIEEGLSPLAAAHQAMREVSGPIIAIALVLCAVFVPMAFLSGVTGQFYKQFAVTIAIST 480
+EG + A A+R PI+ +L +P+A +G + +
Sbjct: 951 LMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGM 1010

Query: 481 VISAINSLTLSPALAAMLLKL 501
V + + ++ P ++ +
Sbjct: 1011 VSATLLAIFFVPVFFVVIRRC 1031



Score = 89.9 bits (223), Expect = 3e-20
Identities = 90/514 (17%), Positives = 182/514 (35%), Gaps = 41/514 (7%)

Query: 548 AVFVVYLLLLVGTGFMFKLVPGGFIPTQDKLYLIAGTKLPEGSSLERTNEVIRQITQIAL 607
+V+ ++L++ +P PT + P + + V + I Q
Sbjct: 11 FAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVIEQNMN 70

Query: 608 QTDGVDHAIAFPGLNPLQFTNTPNTGTVFLTLKPFSQRSRTAAQINAEINARISQIQQGF 667
D + + + +++ + T+ LT + + Q+ ++ + Q
Sbjct: 71 GIDNLMYMSST--------SDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE- 121

Query: 668 AFAFMPPPILGLGQGSG----YSLYIQDRAGLGYGQLQS-AVNAMSGAISQTPGMQFPIG 722
+ + + + S + ++ D G + + + +S+ G +G
Sbjct: 122 ----VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNG----VG 173

Query: 723 TYQANVPQLDAKV--DRDKAKAQGVPLTNLFDTLQTYL----GSSYINDFNRFGRTYQVI 776
Q Q ++ D D + ++ + L+ G+
Sbjct: 174 DVQLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233

Query: 777 AQADGQFRDSVEDIANLRTR-NANGDMVPIGSMVTLGQTYGPDPVI-RYNGYPAADLI-- 832
A +F+ + E+ + R N++G +V + + + VI R NG PAA L
Sbjct: 234 IIAQTRFK-NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIK 292

Query: 833 ---GEADPRVLSSTEAMQKLSSMAPQVLPNGMNIEWT-DLSYQQSTQGNSALIVFPMAVL 888
G + KL+ + P P GM + + D + + + A++
Sbjct: 293 LATGANALDT--AKAIKAKLAELQP-FFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIM 349

Query: 889 LAFLVLAALYESWTLPLAVILIVPMTLLSALFGVWLTGGDNNVFVQVGLVVLMGLACKNA 948
L FLV+ ++ L + VP+ LL + G N G+V+ +GL +A
Sbjct: 350 LVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDA 409

Query: 949 ILIVE-FARELEMHGKGIVEAALEACRLRLRPIVMTSIAFIAGTVPLVFGHGAGAEVRSV 1007
I++VE R + EA ++ +V ++ A +P+ F G+ +
Sbjct: 410 IVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQ 469

Query: 1008 TGITVFAGMLGVTLFGLFLTPVFYVALRKWVTRR 1041
IT+ + M L L LTP L K V+
Sbjct: 470 FSITIVSAMALSVLVALILTPALCATLLKPVSAE 503


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS17500  RTXTOXIND     43     1e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 43.3 bits (102), Expect = 1e-06
Identities = 29/186 (15%), Positives = 61/186 (32%), Gaps = 44/186 (23%)

Query: 8 FRFPLRTVLAGAVLAVVLAGCGSKAAETGAPPPPSVSVAPVLMKQISQWDEFSGRIEPV- 66
R ++ V+A +L+ G VA +G++
Sbjct: 57 PRLVAYFIMGFLVIAFILSVLG-----------QVEIVATA-----------NGKLTHSG 94

Query: 67 ESVELRPRVSGYIDKVNYTEGAEVKKGDVLFTIDERSYRAEFARANASLVRARTQA---- 122
S E++P + + ++ EG V+KGDVL + A+ + +SL++AR +
Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154

Query: 123 -----------------TLARSEAARARKLSEQQAISTETWEQRRAAADQADADLQAAQA 165
+ ++ ++ E + + Q + +L +A
Sbjct: 155 ILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRA 214

Query: 166 AVDTAR 171
T
Sbjct: 215 ERLTVL 220



Score = 38.3 bits (89), Expect = 4e-05
Identities = 18/102 (17%), Positives = 37/102 (36%), Gaps = 7/102 (6%)

Query: 104 YRAEFARANASLVRARTQATLARSEAARARKLSEQ--QAISTETWEQRRAAADQADADLQ 161
++ A L ++Q SE A++ + Q E ++ R Q ++
Sbjct: 257 QENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLR----QTTDNIG 312

Query: 162 AAQAAVDTARLNLDWTRVRAPIDGRAGRAMV-TAGNLVTAGD 202
+ + +RAP+ + + V T G +VT +
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354



Score = 31.7 bits (72), Expect = 0.006
Identities = 12/73 (16%), Positives = 29/73 (39%)

Query: 99 IDERSYRAEFARANASLVRARTQATLARSEAARARKLSEQQAISTETWEQRRAAADQADA 158
++ RAE A + R + + +S L +QAI+ ++ +A
Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVN 266

Query: 159 DLQAAQAAVDTAR 171
+L+ ++ ++
Sbjct: 267 ELRVYKSQLEQIE 279


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS17520  OMPADOMAIN    114    2e-32    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 114 bits (287), Expect = 2e-32
Identities = 50/170 (29%), Positives = 78/170 (45%), Gaps = 22/170 (12%)

Query: 68 ERRQHAMVGAGIGALSGAAIGQYQDRQERALRERTANTGIEVQRQGDNISLNLPDGITFD 127
R + M+ G+ G + + EVQ + L + F+
Sbjct: 176 TRPDNGMLSLGVSYRFGQG-------EAAPVVAPAPAPAPEVQTK----HFTLKSDVLFN 224

Query: 128 FGKSALKPQFYTALNGVASTLREYN--QTMVEVVGHTDSVGSDAVNQRLSEERASAVAQY 185
F K+ LKP+ AL+ + S L + V V+G+TD +GSDA NQ LSE RA +V Y
Sbjct: 225 FNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDY 284

Query: 186 LTAQGVQRERMETMGAGKRYPIADNTTDAGR---------AKNRRVEIRL 226
L ++G+ +++ G G+ P+ NT D + A +RRVEI +
Sbjct: 285 LISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334


90  XADLMG695_RS17645  XADLMG695_RS17670  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS17645  -1      15       -2.672825  glycoside hydrolase family 3 C-terminal
XADLMG695_RS17650  -3      12       -1.696593  *hypothetical protein
XADLMG695_RS17655  -3      11       -1.070554  hypothetical protein
XADLMG695_RS17660  -2      12       -0.799785  DHA2 family efflux MFS transporter permease
XADLMG695_RS17665  -3      10       -0.909100  efflux RND transporter periplasmic adaptor
XADLMG695_RS17670  -2      10       -0.807463  efflux transporter outer membrane subunit
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS17680  PYOCINKILLER  35     9e-04    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 35.2 bits (80), Expect = 9e-04
Identities = 37/195 (18%), Positives = 63/195 (32%), Gaps = 13/195 (6%)

Query: 391 VMSGGGSSRVDYTINGGNAVPGITPTTWPGPVIIHPSSPLQALRAALPNVQIDYVDGKDR 450
+ G++ + I+ AV G + P + + +S + R A D R
Sbjct: 268 IQVAQGAASLAQAISDAIAVLGRVLASAPSVMAVGFASLTYSSRTA--EQWQDQTPDSVR 325

Query: 451 NAAARVAKAADVAIVFATQW-----AAESVDLPDMRLPDNQDALIDAVA-KANPKTAVVL 504
A AA + + + A+ +VDLP MRL + ++ + +V
Sbjct: 326 YALG--MDAAKLGLPPSVNLNAVAKASGTVDLP-MRLTNEARGNTTTLSVVSTDGVSVPK 382

Query: 505 ETNGPVRMPWAERVPAVLQAWYPGIGGGEAIANLLTGAVNPSGHLPVTWPVDESQLPRPS 564
PVRM + + P L +P G+ + P P
Sbjct: 383 AV--PVRMAAYNATTGLYEVTVPSTTAEAPPLILTWTPASPPGNQNPSSTTPVVPKPVPV 440

Query: 565 IPGLGFKPAKPGEDT 579
G P K +T
Sbjct: 441 YEGATLTPVKATPET 455


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS17700  TCRTETB       116    2e-30    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 116 bits (293), Expect = 2e-30
Identities = 95/402 (23%), Positives = 163/402 (40%), Gaps = 30/402 (7%)

Query: 33 LAMASFMQVLDTTIANVSLPTIAGNLGASSQQATWVITSFAVSTAIALPLTGWLSRRFGE 92
L + SF VL+ + NVSLP IA + WV T+F ++ +I + G LS + G
Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78

Query: 93 TKLFVWSTLAFTIASLLCGLAQSM-GMLVVARALQGFVAGPMYPITQSLLVSIY-PREKR 150
+L ++ + S++ + S +L++AR +QG +P ++V+ Y P+E R
Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQG-AGAAAFPALVMVVVARYIPKENR 137

Query: 151 GQALALLAMITVVAPIAGPILGGWITDNYSWEWIFLINVPLGIIASSIVGSQLRH--RPE 208
G+A L+ I + GP +GG I W +L+ +P + + I L + E
Sbjct: 138 GKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIP---MITIITVPFLMKLLKKE 192

Query: 209 QLEKPRMDYIGLILLVVGVGALQLVLDLGNDEDWFSSDKIVVLACIAAVALVVFVIWELT 268
K D G+IL+ VG+ L F++ + ++ ++ ++FV
Sbjct: 193 VRIKGHFDIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRK 242

Query: 269 DKDPIVDLKLFRHRNFRAGTLAMVVAYAAFFSVSLLIPQWLQRDMGYTAIWAGLATAPIG 328
DP VD L ++ F G L + + ++P ++ + G G
Sbjct: 243 VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302

Query: 329 ILPVLMT-PFVGKYALRFDLRMLATIAFIFMS---FTSFFRSNFNLQVDFGHVATIQLVM 384
+ V++ G R + I F+S T+ F TI +V
Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSW-----FMTIIIVF 357

Query: 385 GVGVALFFMPVLQ-ILLSDLDGREIAAGSGLATFLRTLGGSF 425
+G F V+ I+ S L +E AG L F L
Sbjct: 358 VLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS17705  RTXTOXIND     76     3e-17    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 76.0 bits (187), Expect = 3e-17
Identities = 49/295 (16%), Positives = 93/295 (31%), Gaps = 40/295 (13%)

Query: 82 VERGQLLVQLDPADTEVALQQAEANLAKTVRQVRGLYRTVEGAQAELSAREVTLRSARSD 141
V R L++ + + Q E NL K + A ++ E R +S
Sbjct: 184 VLRLTSLIKEQFSTWQNQKYQKELNLDKKRAER-------LTVLARINRYENLSRVEKSR 236

Query: 142 FARRKDLAATGAIS--------------NEELAHARDELAAAEAAVSGSRESLERNRAL- 186
L AI+ EL + +L E+ + ++E + L
Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296

Query: 187 ---VDDSAVANQPDVQTAAAQLRQAYLNHARTGVVAPVSGYVARRSAQ-VGQRVQPGSVL 242
+ D ++ +L + + + APVS V + G V L
Sbjct: 297 KNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL 356

Query: 243 MAVVPLEQV-WVEANFKETQLKHMRLGQEVELHSDLYGGGVDYT--GRIESLGLGTGSAF 299
M +VP + V A + + + +GQ + + + YT G + G
Sbjct: 357 MVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF----PYTRYGYLV----GK---V 405

Query: 300 SLLPAQNASGNWIKIVQRVPVRIAVDAKQLASNPLRIGLSMKVDVNLHDQQGSVL 354
+ + +V V + I + + + + M V + SV+
Sbjct: 406 KNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVI 460


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS17710  RTXTOXIND     32     0.004    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 32.5 bits (74), Expect = 0.004
Identities = 29/187 (15%), Positives = 63/187 (33%), Gaps = 20/187 (10%)

Query: 72 AQLDALIAEGLQHSPSLAAADARLQQAQARIGSAQAERG--PSLSVSGGYTGLQLPESMV 129
+L AL AE + ARL+Q + +I S E P L + + E V
Sbjct: 125 LKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEV 184

Query: 130 GEELGGSYGGSAQVVLDFRYGIDLWGGKRSAWEAAVDQAHAAEVDAQAARLNLSSAIAEG 189
L + W ++ E +D+ A + A +
Sbjct: 185 ----------LRLTSL-IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 190 YAQLAYAWSLHDLANDELSRAQKTLELTRQRRSAGIDSELQVRQAQARVPAAQQQLQSAQ 249
++L L + + LE + +++ ++R ++++ + ++ SA+
Sbjct: 234 KSRLD---DFSSLLHKQAIAKHAVLEQENKY----VEAVNELRVYKSQLEQIESEILSAK 286

Query: 250 QQIDEAR 256
++
Sbjct: 287 EEYQLVT 293


91  XADLMG695_RS18085  XADLMG695_RS18135  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS18085  0       11       2.393920   MFS transporter
XADLMG695_RS18090  1       12       3.004416   alkene reductase
XADLMG695_RS18095  1       12       2.955670   alkylphosphonate utilization protein
XADLMG695_RS18100  -1      13       2.526007   cation transporter
XADLMG695_RS18105  -1      11       1.366274   peptidylprolyl isomerase
XADLMG695_RS18110  -1      10       1.002951   Hsp70 family protein
XADLMG695_RS18115  -2      10       0.486775   hypothetical protein
XADLMG695_RS18120  -3      13       -0.115035  mechanosensitive ion channel family protein
XADLMG695_RS18125  -2      14       -0.137150  DksA/TraR family C4-type zinc finger protein
XADLMG695_RS18135  -2      12       0.503409   DUF2789 domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS18135  TCRTETA       62     9e-13    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 62.1 bits (151), Expect = 9e-13
Identities = 89/400 (22%), Positives = 146/400 (36%), Gaps = 31/400 (7%)

Query: 17 VLILLALAMGGFAIGISEFSTMGLMTQIAQGLQITEPQVGHVISAYALGVVVGAPLLAIL 76
++IL +A+ IG+ GL+ + +T G +++ YAL AP+L L
Sbjct: 8 IVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTA-HYGILLALYALMQFACAPVLGAL 66

Query: 77 GARWPRRTLLLMLMVFYALGNVASALAPSYHTMLLCRFIAGLPHGAYFGVASLVAASISP 136
R+ RR +LL+ + A+ A AP + + R +AG+ GA VA A I+
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITD 125

Query: 137 PNQRATAVGRVLLGLSVALLVGNPLATWLGQIVSWRWAYASVSVIALGTVAAV-AILLPP 195
++RA G + ++ G L +G +A+ AL + + L P
Sbjct: 126 GDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAA---ALNGLNFLTGCFLLP 182

Query: 196 QPEEPRQQPLRELRAFNQPQVWLALAIGAVGFSGMFCVF------SYLAPTLTAVTGVTA 249
+ + ++PLR P A G + + VF + L + G
Sbjct: 183 ESHKGERRPLRREAL--NPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDR 240

Query: 250 ARIPLAMVAF--GVGGVLGSILGGWLFDR-MQFRAVPVLLLWSMVVMLT--FPLAALSDV 304
+ G+L S+ + L+ M+ T LA +
Sbjct: 241 FHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRG 300

Query: 305 WVFVSIVAVGTMGALA-PALQTRL-MDVAAEAQTLAAASNHAAFNTANALGPWLGG---- 358
W+ I+ + G + PALQ L V E Q S A + + +GP L
Sbjct: 301 WMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA 360

Query: 359 --MAITAGWGWTSTGYVGAATALGGLLVYAAAVWQERHQQ 396
+ GW W GAA L L +W Q+
Sbjct: 361 ASITTWNGWAWI----AGAALYLLCLPALRRGLWSGAGQR 396


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS18150  FRAGILYSIN    28     0.009    Fragilysin metallopeptidase (M10C) enterotoxin signat...
		>FRAGILYSIN#Fragilysin metallopeptidase (M10C) enterotoxin signature.
Length = 405

Score = 27.7 bits (61), Expect = 0.009
Identities = 17/81 (20%), Positives = 32/81 (39%)

Query: 19 GATSICAECGFEWSAGDAAADTTVVRDSNGNVLQAGDTVTVIKDLKVKGSSIPLKQGTVI 78
G ++ A C E + + D V + + D T + D+ G I LK
Sbjct: 19 GTAALLAACSNEADSLTTSIDAPVTASIDLQSVSYTDLATQLNDVSDFGKMIILKDNGFN 78

Query: 79 RNIRLVEDDAEHIEGNSEKIK 99
R + + D I+ ++E ++
Sbjct: 79 RQVHVSMDKRTKIQLDNENVR 99


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS18160  INFPOTNTIATR  29     0.007    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 28.8 bits (64), Expect = 0.007
Identities = 19/64 (29%), Positives = 27/64 (42%), Gaps = 1/64 (1%)

Query: 8 VATIHYTLSDDNGQVLDRSTPDTPLSYLHGAGNIVPGLEQALEGKQLGDTLTADVVPEQG 67
T+ YT + +G V D ST ++PG +AL+ G T V +
Sbjct: 146 TVTVEYTGTLIDGTVFD-STEKAGKPATFQVSQVIPGWTEALQLMPAGSTWEVFVPADLA 204

Query: 68 YGPR 71
YGPR
Sbjct: 205 YGPR 208


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS18165  SHAPEPROTEIN  51     5e-09    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.
Length = 347

Score = 50.5 bits (121), Expect = 5e-09
Identities = 27/75 (36%), Positives = 37/75 (49%), Gaps = 5/75 (6%)

Query: 203 AAIAAGFDNVDFLEEPAAAAMHYHVSHDSRHDTVVVDIGGGTTDIAHASVGGSAAPQVHR 262
+A AG V +EEP AAA+ + ++VVDIGGGTT++A S+ G R
Sbjct: 129 SAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVR 188

Query: 263 AWGIARGGTDIDLAL 277
GG D A+
Sbjct: 189 I-----GGDRFDEAI 198


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS18190  PERTACTIN     28     0.004    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 27.8 bits (61), Expect = 0.004
Identities = 15/53 (28%), Positives = 25/53 (47%), Gaps = 1/53 (1%)

Query: 28 IAEHSPLPGDMRLEEAPVWTPAQSQLLREERLDDADWIVTIDQLNIALHTTAD 80
I S P D+ L WT A ++ + +D+A W++T + AL +D
Sbjct: 401 IPGASSGPLDVALASQARWTGA-TRAVDSLSIDNATWVMTDNSNVGALRLASD 452


92  XADLMG695_RS18295  XADLMG695_RS18335  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS18295  0       11       1.213947   response regulator
XADLMG695_RS18300  -1      10       1.081955   DNA repair protein RecO
XADLMG695_RS18305  -1      12       0.584956   GTPase Era
XADLMG695_RS18310  -1      13       1.216208   ribonuclease III
XADLMG695_RS18315  0       13       0.880527   DUF4845 domain-containing protein
XADLMG695_RS18320  1       10       0.365769   signal peptidase I
XADLMG695_RS18325  1       11       -0.053130  elongation factor 4
XADLMG695_RS18330  0       9        -0.833592  DegQ family serine endoprotease
XADLMG695_RS18335  1       9        -1.729971  sigma-E factor negative regulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS18330  HTHFIS        67     6e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.8 bits (163), Expect = 6e-15
Identities = 29/118 (24%), Positives = 44/118 (37%)

Query: 11 PRLLLVEDDPISRGFLQAVLESLPATVDCADSLSSALDRARERRHDLWLIDVNLPDGTGS 70
+L+ +DD R L L V + ++ DL + DV +PD
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 71 GLLRALRLLHPDVPALAHTADATMSMQHSLQSDGFLEMLVKPLTSERLLQAVRRGLAR 128
LL ++ PD+P L +A T G + L KP L+ + R LA
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS18340  TCRTETOQM     32     0.004    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.
Length = 639

Score = 31.7 bits (72), Expect = 0.004
Identities = 20/70 (28%), Positives = 34/70 (48%), Gaps = 10/70 (14%)

Query: 62 LVDTPGLHREQKRAMNRVMNRAARGSLEGVDAAVLVIEAGRWDEEDT-LAFRVLSDASVP 120
++DTPG H + + R SL +D A+L+I A + T + F L +P
Sbjct: 72 IIDTPG-HMDFLAEVYR--------SLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122

Query: 121 VVLVVNKVDR 130
+ +NK+D+
Sbjct: 123 TIFFINKIDQ 132


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS18360  TCRTETOQM     146    1e-39    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.
Length = 639

Score = 146 bits (371), Expect = 1e-39
Identities = 93/453 (20%), Positives = 181/453 (39%), Gaps = 85/453 (18%)

Query: 3 NIRNFSIIAHVDHGKSTLADRIIQLCGG---LQAREMEAQVLDSNPIERERGITIKAQSV 59
I N ++AHVD GK+TL + ++ G L + + D+ +ER+RGITI+
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 60 SLPYTAKDGQTYHLNFIDTPGHVDFSYEVSRSLAACEGALLVVDAAQGVEAQSVANCYTA 119
S + + +N IDTPGH+DF EV RSL+ +GA+L++ A GV+AQ+ +
Sbjct: 62 SFQW-----ENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116

Query: 120 VEQGLEVVPVLNK-----IDLP----------TADVDRAKA----------------EIE 148
+ G+ + +NK IDL +A++ + + +
Sbjct: 117 RKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176

Query: 149 AVIG--------------IDAEDAVAV----------------SAKTGLNIDLVLEAIVH 178
VI ++A + SAK + ID ++E I +
Sbjct: 177 TVIEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITN 236

Query: 179 RIPPPKPRDTDKLQALIIDSWFDNYLGVVSLVRVMQGEIKPGSKILVMSTGRTHLVDKVG 238
+ R +L + + ++ +R+ G + + + + + +
Sbjct: 237 KFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKIKITEMYT 296

Query: 239 VFTPKRKELPALGAGEVGWINASIKDVHGAPVGDTLTLAGDPAPHALPGFQEMQPRVFAG 298
+ ++ +GE+ + + + +GDT L + P +
Sbjct: 297 SINGELCKIDKAYSGEIVILQNEFLKL-NSVLGDTKLLPQRERI------ENPLPLLQTT 349

Query: 299 LFPVDAEDYPDLREALDKLRLNDAALRFE--PESSEAMGFGFRCGFLGMLHMEIVQERLE 356
+ P + L +AL ++ +D LR+ + E + FLG + ME+ L+
Sbjct: 350 VEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEII-----LSFLGKVQMEVTCALLQ 404

Query: 357 REYNLDLISTAPTVVY--EVLKTDGTIVNMDNP 387
+Y++++ PTV+Y LK ++++ P
Sbjct: 405 EKYHVEIEIKEPTVIYMERPLKKAEYTIHIEVP 437



Score = 33.7 bits (77), Expect = 0.003
Identities = 21/103 (20%), Positives = 38/103 (36%), Gaps = 18/103 (17%)

Query: 362 DLISTAPTVVYEVLKTDGTIVNMDNPAKLPQLNLVQEIREPIIRANVLTPEEYIGNIIKL 421
D AP V+ +VLK GT E+ EP + + P+EY+
Sbjct: 515 DFRMLAPIVLEQVLKKAGT-----------------ELLEPYLSFKIYAPQEYLSRAYTD 557

Query: 422 CEEKRGTQIGINYLGSQVQISYELPMAEVVLDFFDKLKSVSRG 464
+ + ++V +S E+P + ++ L + G
Sbjct: 558 APKYCANIVDTQLKNNEVILSGEIPARC-IQEYRSDLTFFTNG 599


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS18365  V8PROTEASE    73     4e-16    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 72.7 bits (178), Expect = 4e-16
Identities = 33/163 (20%), Positives = 58/163 (35%), Gaps = 28/163 (17%)

Query: 133 AGKSMGSGFIISADGYVLTNHHVVDGASEVTVKLTDRR-----------EFKA-KVVGSD 180
G + SG ++ +LTN HVVD L F A ++
Sbjct: 99 TGTFIASGVVV-GKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYS 157

Query: 181 EQYDVALLKIEA--------KGLPTVRLGDSNTLKPGQWVVAIGSPFGLDHSVTAGIVSA 232
+ D+A++K + + + ++ + Q + G P V+
Sbjct: 158 GEGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKP-------VAT 210

Query: 233 TGRSNPYADQRYVPFIQTDVAINQGNSGGPLLNTRGEVVGINS 275
S +Q D++ GNSG P+ N + EV+GI+
Sbjct: 211 MWESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW 253


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS18370  SURFACELAYER  32     0.004    Lactobacillus surface layer protein signature.
		>SURFACELAYER#Lactobacillus surface layer protein signature.

Length = 439

Score = 31.6 bits (71), Expect = 0.004
Identities = 25/90 (27%), Positives = 30/90 (33%), Gaps = 18/90 (20%)

Query: 170 AALAAAVPAAALASTRRGAATRNQQVARNAAARQQQAPTRMVAAAAPASTGAASAVAATP 229
AAL A P AA A A T N A N A+T A V TP
Sbjct: 13 AALLAVAPIAATAMPVNAATTINADSAIN------------------ANTNAKYDVDVTP 54

Query: 230 SNPFTHPDTTLQARPWPRAALSGAGESSLN 259
S P +L+G+ +S N
Sbjct: 55 SISAIAAVAKSDTMPAIPGSLTGSISASYN 84


93  XADLMG695_RS19880  XADLMG695_RS19915  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS19880  -1      13       2.085472   molybdenum ABC transporter ATP-binding protein
XADLMG695_RS19885  0       12       1.683516   hypothetical protein
XADLMG695_RS19890  0       12       1.551542   TonB family protein
XADLMG695_RS19895  0       12       1.485724   BlaI/MecI/CopY family transcriptional regulator
XADLMG695_RS19900  1       11       0.350146   two-component sensor histidine kinase
XADLMG695_RS19905  1       19       -1.513523  response regulator transcription factor
XADLMG695_RS19910  -1      16       -0.450337  hypothetical protein
XADLMG695_RS19915  -1      11       0.603376   SDR family oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS19950  PF05272       28     0.025    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.1 bits (62), Expect = 0.025
Identities = 10/22 (45%), Positives = 14/22 (63%)

Query: 25 VVALVGPSGAGKTTVLNAIAGL 46
V L G G GK+T++N + GL
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS19960  PF03544       65     6e-14    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 64.6 bits (157), Expect = 6e-14
Identities = 20/79 (25%), Positives = 41/79 (51%), Gaps = 5/79 (6%)

Query: 309 PPRYPPDAVAAGLAGFVELQIAVSATGAPEHIAIVRSTPTGVFDQTVLDAARHWRFTPAL 368
P+YP A A + G V+++ V+ G +++ I+ + P +F++ V +A R WR+ P
Sbjct: 164 QPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGK 223

Query: 369 EDGKAVASEVRVPVKFELD 387
+ V + F+++
Sbjct: 224 PGSG-----IVVNILFKIN 237


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS19970  PF06580       31     0.011    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 30.6 bits (69), Expect = 0.011
Identities = 20/79 (25%), Positives = 36/79 (45%), Gaps = 20/79 (25%)

Query: 348 SLLLRNLLENAVRY----TPPGGRILVS-THSAPSPTLVVEDSGPGIPEAARARVFHRFH 402
+L++ L+EN +++ P GG+IL+ T + TL VE++G + +
Sbjct: 257 PMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK-------- 308

Query: 403 RELGTGVEGSGLGLSIVHD 421
E +G GL V +
Sbjct: 309 -------ESTGTGLQNVRE 320


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS19975  HTHFIS        82     4e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.8 bits (202), Expect = 4e-20
Identities = 36/143 (25%), Positives = 58/143 (40%)

Query: 2 RILLVEDDLSLGEGIRTALRRAAYAVDWVHDGVSALMALQEETMDLVILDLGLPRMDGIE 61
IL+ +DD ++ + AL RA Y V + + + DLV+ D+ +P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VIRTARARAVDTPILVLSARERAADRALGLDVGADDYLGKPFDTNELLARTRALLRRSAG 121
++ + D P+LV+SA+ + GA DYL KPFD EL+ L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 RAQPAVQAGALRLDPAGMSVRWH 144
R + G S
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQ 147


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS19985  DHBDHDRGNASE  59     4e-12    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 58.5 bits (141), Expect = 4e-12
Identities = 40/187 (21%), Positives = 72/187 (38%), Gaps = 2/187 (1%)

Query: 3 LHGKCVIVTGATGGIGSVLCAGLVEAGSTVVAVGRTEQTLQRLAAAHAPGRVVP--VVAD 60
+ GK +TGA GIG + L G+ + AV + L+++ ++ AD
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 61 LASDSGRAVLLARTHEMRPAPSVLVLAHAQSHFGLLQDQDPADLAAVVHLNLTVPMLLVQ 120
+ + + AR +LV GL+ + A +N T +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 121 ALLPAFARQPEAAMVAVGSTFGSIGFAGFAGYSASKFGLRGLFEALAREHAGTSVRFQYL 180
++ + ++V VGS + A Y++SK + L E A ++R +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 181 SPRATAT 187
SP +T T
Sbjct: 186 SPGSTET 192


94  XADLMG695_RS19985  XADLMG695_RS20035  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS19985  -1      12       0.367748   VWA domain-containing protein
XADLMG695_RS19990  0       11       -0.780523  VWA domain-containing protein
XADLMG695_RS19995  -1      12       -0.647202  DUF4381 domain-containing protein
XADLMG695_RS20005  -1      11       -1.583335  DUF58 domain-containing protein
XADLMG695_RS20010  0       10       -1.713737  MoxR family ATPase
XADLMG695_RS20015  1       9        -1.910514  type IV pilus secretin PilQ family protein
XADLMG695_RS20020  0       9        -0.934459  pilus assembly protein PilP
XADLMG695_RS20025  0       10       -1.099121  type 4a pilus biogenesis protein PilO
XADLMG695_RS20030  -1      9        -0.366913  PilN domain-containing protein
XADLMG695_RS20035  -4      11       0.656697   pilus assembly protein PilM
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS20065  IGASERPTASE   34     0.002    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 34.3 bits (78), Expect = 0.002
Identities = 30/191 (15%), Positives = 68/191 (35%), Gaps = 13/191 (6%)

Query: 392 LYNLGNALARQGQYDAAIAAYDRALKQHPNQQDAIANRAAVDAARKRQQQNNKDGKGQSK 451
LYN Q I + P+ A VD A +
Sbjct: 980 LYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTET 1039

Query: 452 DQKPSGQDGKGQQQAGQNQQDKQSGQDGQNQQDSKSQPSEAQPPQDSRSQDAQSKNGQGE 511
+ S Q+ K ++ Q+ + + ++ + + Q + ++S +++K Q
Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG-SETKETQTT 1098

Query: 512 QRKQDTPPQSADTKAQQQADEAQRRKMQQAMAQAGDKQ---------ADGSDKPEAAVAS 562
+ K+ T + KA+ + ++ Q ++ + +Q KQ A+ + + + V
Sbjct: 1099 ETKE-TATVEKEEKAKVETEKTQ--EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155

Query: 563 ETPEQREQRQA 573
+ P+ + A
Sbjct: 1156 KEPQSQTNTTA 1166


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS20075  BCTERIALGSPF  27     0.025    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.
Length = 408

Score = 27.5 bits (61), Expect = 0.025
Identities = 6/34 (17%), Positives = 16/34 (47%), Gaps = 3/34 (8%)

Query: 25 GWWLVIAMVVLVVGSAFFWWWRRRQRQRRWLAAF 58
G W+++A++ + F R++++R
Sbjct: 227 GPWMLLALLAGFMA---FRVMLRQEKRRVSFHRR 257


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS20085  HTHFIS        32     0.002    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.5 bits (74), Expect = 0.002
Identities = 38/158 (24%), Positives = 59/158 (37%), Gaps = 24/158 (15%)

Query: 35 IVGQA----ALVERLLIALLADGHLLVEGAPGLAKTT---AIRALASRLEADFARVQ--- 84
+VG++ + L + D L++ G G K A+ R F +
Sbjct: 139 LVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAA 198

Query: 85 FTPDLLPADLTG------TEIWRPQDSRFEFMPGPIFHPILLADEINRAPAKVQSALLEA 138
DL+ ++L G T RFE G L DEI P Q+ LL
Sbjct: 199 IPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGT----LFLDEIGDMPMDAQTRLLRV 254

Query: 139 MGERQVT-VGRHTYALPQLFLVMATQNPIEQ---EGTF 172
+ + + T VG T + +V AT ++Q +G F
Sbjct: 255 LQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLF 292


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS20090  BCTERIALGSPD  222    7e-66    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.
Length = 660

Score = 222 bits (567), Expect = 7e-66
Identities = 94/430 (21%), Positives = 166/430 (38%), Gaps = 49/430 (11%)

Query: 230 VPWDQALDIVLRAKGLDKRRDGGVVWVAPQPELAKFEQDKEDARIAIENREDLITDYVQ- 288
+ W A D+V L+K + + + E+ N I ++
Sbjct: 199 LSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIKQ 258

Query: 289 ---------------INYHNAAVIFKALTEAKGIGGGGGGQGGQGGQGGAGQQDNGFLSP 333
+ Y A+ + + LT G Q + D +
Sbjct: 259 LDRQQATQGNTKVIYLKYAKASDLVEVLT-----GISSTMQSEKQAAKPVAALDKNII-- 311

Query: 334 RGRLVADERTNTLMISDIPKKVAQMRELISHIDRPVDQVLIESRIVIATDTFARDLGARF 393
+ A +TN L+++ P + + +I+ +D QVL+E+ I D +LG ++
Sbjct: 312 ---IKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQW 368

Query: 394 GITGATGRGILSGALESNVNFQNTAAQRANEIANTGTSTTLASHLFPSGLNVDLGASGFT 453
A + L + + ++ ++ L
Sbjct: 369 ANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASAL------------------- 409

Query: 454 NSRAAGLAYTLLGSNFNLDIELSAMQEEGRGEVVSNPRIVTANQREGVIKQGREIGYVTI 513
S G+A N+ + L+A+ + ++++ P IVT + E G+E+ +T
Sbjct: 410 -SSFNGIAAGFYQGNWAM--LLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTG 466

Query: 514 SGGGAAGSAAQANVQFKEVLLELKVTPTITNDNRVFLNMNVKKDEVARFIILEGYGTVPE 573
S + + V+ K V ++LKV P I + V L + + VA
Sbjct: 467 SQTTSGDNIFN-TVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGAT 525

Query: 574 INRREVNTAVLVGDGETVVIGGVYEFTDRESVSKVPFLGDIPFLGNLFKKRGRSKEKAEL 633
N R VN AVLVG GETVV+GG+ + + ++ KVP LGDIP +G LF+ + K L
Sbjct: 526 FNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNL 585

Query: 634 LVFVTPKVLR 643
++F+ P V+R
Sbjct: 586 MLFIRPTVIR 595



Score = 50.7 bits (121), Expect = 1e-08
Identities = 31/209 (14%), Positives = 75/209 (35%), Gaps = 30/209 (14%)

Query: 175 AAAQIAARGYSGRPVTFNFQDVPVRTVLQLIAEESNLNIVASDTVQGNVTLR----LMNV 230
A + R + + +F+ ++ + +++ N ++ +V+G +T+R L
Sbjct: 16 IFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEE 75

Query: 231 PWDQALDIVLRAKGLDK-RRDGGVVWVAPQPELAKFEQDKEDARIAIENREDLITDYVQI 289
+ Q VL G + GV+ V + AK + A ++++T V +
Sbjct: 76 QYYQFFLSVLDVYGFAVINMNNGVLKVVRSKD-AKTAAVPVASDAAPGIGDEVVTRVVPL 134

Query: 290 NYHNAAVIFKALTEAKGIGGGGGGQGGQGGQGGAGQQDNGFLSPRGRLVADERTNTLMIS 349
A + L + G G +V E +N L+++
Sbjct: 135 TNVAARDLAPLLRQLNDNAGV------------------------GSVVHYEPSNVLLMT 170

Query: 350 DIPKKVAQMRELISHIDRPVDQVLIESRI 378
+ ++ ++ +D D+ ++ +
Sbjct: 171 GRAAVIKRLLTIVERVDNAGDRSVVTVPL 199


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS20105  PF05272       28     0.039    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.1 bits (62), Expect = 0.039
Identities = 14/53 (26%), Positives = 17/53 (32%), Gaps = 1/53 (1%)

Query: 199 GTAPGAVDPAAPGTAAPGAAPAGATPAAPAAAPAPATPPAAAPAPTQAAPAPA 251
GTA + + TAA G A G P + T P P P
Sbjct: 378 GTARALLADVSSPTAAAGGAGGGEPPKKRDPSAGAGTDP-GGPGGGDDGEDPF 429


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS20110  SHAPEPROTEIN  34     7e-04    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.
Length = 347

Score = 34.0 bits (78), Expect = 7e-04
Identities = 52/210 (24%), Positives = 82/210 (39%), Gaps = 45/210 (21%)

Query: 153 RQSALELGGLTAKVMDVEAFAVENAFALVASELPVAADAVVALVDIGATMTTLSVLRSGR 212
R+SA G +++ E A A + + LPV+ +VDIG T ++V+
Sbjct: 127 RESAQGAGAREVFLIE-EPMA-----AAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNG 180

Query: 213 SLYSREQVFGGKQLTDEVM----RRYGL-----TYEEA----GLAKRQG----------- 248
+YS GG + + ++ R YG T E G A
Sbjct: 181 VVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRN 240

Query: 249 ---GLPESYEV---EVLEPFKE---ATVQQISRLLQFF---YAGSEFNRVDCIVLAGGCA 296
G+P + + E+LE +E V + L+ A R +VL GG A
Sbjct: 241 LAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISER--GMVLTGGGA 298

Query: 297 ALARLPEMVEEQLGVTTVVA-NPLAQMTLG 325
L L ++ E+ G+ VVA +PL + G
Sbjct: 299 LLRNLDRLLMEETGIPVVVAEDPLTCVARG 328


95  XADLMG695_RS20855  XADLMG695_RS20975  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS20855  -1      13       1.278289   TCR/Tet family MFS transporter
XADLMG695_RS20860  -2      11       2.541340   lytic murein transglycosylase
XADLMG695_RS20890  -2      13       3.273989   hypothetical protein
XADLMG695_RS20895  -2      20       3.755154   YchJ family protein
XADLMG695_RS20900  -2      16       3.926805   TSUP family transporter
XADLMG695_RS20905  0       10       2.810400   ATP-binding protein
XADLMG695_RS20915  3       10       1.988350   hypothetical protein
XADLMG695_RS20920  5       14       0.852165   DUF4194 domain-containing protein
XADLMG695_RS20925  4       11       1.012367   DUF3375 domain-containing protein
XADLMG695_RS20930  4       10       0.653537   GTP cyclohydrolase I FolE
XADLMG695_RS20935  5       10       0.991548   MarR family transcriptional regulator
XADLMG695_RS23575  3       16       0.182094   DUF1656 domain-containing protein
XADLMG695_RS20945  -1      14       2.052361   efflux RND transporter periplasmic adaptor
XADLMG695_RS20950  -1      14       2.920861   efflux transporter outer membrane subunit
XADLMG695_RS20955  1       11       2.881522   FUSC family protein
XADLMG695_RS20960  -1      11       2.036402   MFS transporter
XADLMG695_RS20965  -1      11       2.189549   DUF1905 domain-containing protein
XADLMG695_RS20970  0       10       2.414770   LysR family transcriptional regulator
XADLMG695_RS20975  -1      9        2.345552   SDR family oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS20905  TCRTETA       251    6e-82    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 251 bits (642), Expect = 6e-82
Identities = 156/398 (39%), Positives = 223/398 (56%), Gaps = 7/398 (1%)

Query: 17 ALIFIFITVLIDVLSFGVIIPVLPDLVRHFTGGDYVVAAGWIGWFGFLFAAIQFVCSPLQ 76
LI I TV +D + G+I+PVLP L+R + V A G L+A +QF C+P+
Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAH--YGILLALYALMQFACAPVL 63

Query: 77 GALSDRFGRRPVILLSCLGLGLDFVLMAIAHSLPMLLLARIISGVCSASFSTANAYIADV 136
GALSDRFGRRPV+L+S G +D+ +MA A L +L + RI++G+ A+ + A AYIAD+
Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADI 123

Query: 137 TPPDKRAGAFGMLGAAFGIGFVAGPLIGGWLGSIGLRWPFWFAAGLALLNVLYGWFVLPE 196
T D+RA FG + A FG G VAGP++GG +G PF+ AA L LN L G F+LPE
Sbjct: 124 TDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPE 183

Query: 197 SLPAERRTARLDWSHANPLGALKLLRRYPQVFGLASVVFLANLAHYVYPSTFVLFAGYQY 256
S ERR R + NPL + + R V L +V F+ L V + +V+F ++
Sbjct: 184 SHKGERRPLRREAL--NPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRF 241

Query: 257 HWGPREVSWVLAGVGVCNIIVNALLVGRLVRRLGERRALLLGLGCGVIGFIIYGLADSGT 316
HW + LA G+ + + A++ G + RLGERRAL+LG+ G+I+ A G
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 317 AFLVGVPISALWAIAAPSAQALITREVGADAQGRVQGALTGLVSLAGIAGPLLFANVFAW 376
+ + A I P+ QA+++R+V + QG++QG+L L SL I GPLLF ++A
Sbjct: 302 MAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361

Query: 377 FIGS--GAPLHLPGAPWLLAAVLLAAG-WGMAWKRAAR 411
I + G A +LL L G W A +RA R
Sbjct: 362 SITTWNGWAWIAGAALYLLCLPALRRGLWSGAGQRADR 399


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS20925  SECA          33     2e-04    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 32.9 bits (75), Expect = 2e-04
Identities = 10/16 (62%), Positives = 10/16 (62%)

Query: 8 DPCPCGRPANYAQCCG 23
DPCPCG Y QC G
Sbjct: 883 DPCPCGSGKKYKQCHG 898


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS20935  IGASERPTASE   34     0.004    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 34.3 bits (78), Expect = 0.004
Identities = 38/240 (15%), Positives = 74/240 (30%), Gaps = 32/240 (13%)

Query: 1116 RLKLPERARGDEPAVADAVDAAPSIEAAGEPAGVQGAVSADGMAI---DGAAVPTASPAT 1172
L PE + ++ + +I+A +V ++ I D A VP +PAT
Sbjct: 979 DLYNPEVEKRNQTVDTTNITTPNNIQAD------VPSVPSNNEEIARVDEAPVPPPAPAT 1032

Query: 1173 --DETLSAAPETQTGQRTPAV-----ATANKQNR------------NAKTAKTASSTRAA 1213
+ T + A ++ +T QNR N +T + A S
Sbjct: 1033 PSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSET 1092

Query: 1214 AQSQAMQTKKSTLRTPALASKAASDKRASGASAVPSAAASRSSVGKTTRSSKTPGKPIAA 1273
++Q +TK++ +K ++K + + ++ +
Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPT 1152

Query: 1274 TNASSATSGKGATAS----AAVKTAAAKPRATAASTGQPVRGAGKTSSKRAATTASPAKT 1329
N S TA A ++ + T ++T + T P
Sbjct: 1153 VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVN 1212


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS23575  INTIMIN       28     0.027    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 27.7 bits (61), Expect = 0.027
Identities = 10/53 (18%), Positives = 16/53 (30%)

Query: 98 TATATATATATATATATATATATATARVCISNTMLKAAKQQSSKAAKQQSSKA 150
T A T T+T + +ARV +KA + +
Sbjct: 709 TEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNI 761


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS20950  PHPHTRNFRASE  32     0.008    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.
Length = 572

Score = 31.7 bits (72), Expect = 0.008
Identities = 37/224 (16%), Positives = 74/224 (33%), Gaps = 44/224 (19%)

Query: 48 AVLHERLQRQLDALRADELSRELPRTAQAYLAHWLAQGWLERRLPEGATEEEYELSRATT 107
A +H ++ ++S E+ + A ++ + ++ E+ A
Sbjct: 19 AFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTEASMGADKAEIFAAHL 78

Query: 108 QAI-------RFIAGLRESSSSATESRLSLVIQQLVQLAGQTEADPEL--RLAALRDERA 158
+ + +A E L V V + + + + R A +RD
Sbjct: 79 LVLDDPELVDGIKGKIENEQMNA-EYALKEVSDMFVSMFESMD-NEYMKERAADIRDVSK 136

Query: 159 RIDAEIERVASGRVAALDGKRALERARDLIHLSDELAEDFHRVRDDFEQLNRQFRERIID 218
R+ + V +G +A + + + ++++L D QLN+QF +
Sbjct: 137 RVLGHLIGVETGSLATIA--------EETVIIAEDLTPS------DTAQLNKQFVKGFAT 182

Query: 219 DEGAR-------------------GDVLEQLFDGVDVIADSEAG 243
D G R +V E++ G VI D G
Sbjct: 183 DIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEG 226


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS20970  RTXTOXIND     56     9e-11    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 55.6 bits (134), Expect = 9e-11
Identities = 30/208 (14%), Positives = 71/208 (34%), Gaps = 17/208 (8%)

Query: 83 YEIALEQARAALAERQATLTQLRREIARDRSLQDLVAAEDAEVRRSNVQKAQAAVATAQS 142
E +A L ++ L Q+ EI + LV +++ +
Sbjct: 257 QENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTL 316

Query: 143 AVDLAQLNLDRTQVRSPAEGRVSDRTVR-VGDYVTAGCPVVAVL-DTGSFRVDGYFEETR 200
+ + + +R+P +V V G VT ++ ++ + + V +
Sbjct: 317 ELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKD 376

Query: 201 LQGVHPGQRVDVQLMGEPLTLHGHVQSIAAGIEDRYRSGSAGALPNVTPAFDWVRLAQRI 260
+ ++ GQ +++ P T +G++ I + A+ + L +
Sbjct: 377 IGFINVGQNAIIKVEAFPYTRYGYLVGKVKNI-------NLDAIEDQRLG-----LVFNV 424

Query: 261 PVRIVLDHVPA---HVQLIAGRTATVSI 285
+ I + + ++ L +G T I
Sbjct: 425 IISIEENCLSTGNKNIPLSSGMAVTAEI 452



Score = 47.5 bits (113), Expect = 3e-08
Identities = 25/168 (14%), Positives = 59/168 (35%), Gaps = 19/168 (11%)

Query: 10 PALLTLAMVMVAALVLQHLWRYYMQAPWTRDAHVGADVV------QVAPDVSGLVESVAV 63
+A ++ LV+ + A + ++ P + +V+ + V
Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVL--GQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIV 112

Query: 64 ADNQPVRRGQLLLVVDRARYEIALEQARAALAERQATLTQLRREIARD----RSLQDLVA 119
+ + VR+G +LL + E + +++L QA L Q R +I L +L
Sbjct: 113 KEGESVRKGDVLLKLTALGAEADTLKTQSSLL--QARLEQTRYQILSRSIELNKLPELKL 170

Query: 120 AEDAEVRRSNVQKAQAAVATAQSAVD-----LAQLNLDRTQVRSPAEG 162
++ + + ++ + + Q L+ + R+
Sbjct: 171 PDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLT 218


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS20975  RTXTOXIND     34     0.001    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 33.6 bits (77), Expect = 0.001
Identities = 16/117 (13%), Positives = 36/117 (30%), Gaps = 5/117 (4%)

Query: 357 TLPSSGARARVRATEAGADAALAQFDNTVLQA-LREVQTTLSRYAQDLDRLHLLEQA-QQ 414
LP V E +L + + Q + + L + + + +
Sbjct: 169 KLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYEN 228

Query: 415 QAELASSQN---RRLYQGGRTPYLSSLDAERTLASADMTLANAQAQVSQDQIQLFLA 468
+ + S+ L + L+ E A L ++Q+ Q + ++ A
Sbjct: 229 LSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSA 285


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS20985  TCRTETA       33     0.002    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 33.3 bits (76), Expect = 0.002
Identities = 22/85 (25%), Positives = 42/85 (49%), Gaps = 12/85 (14%)

Query: 69 AIFAMTFLMRPIGAWYFGRFADRYGRRLALTISVSVMALCSFVIAITPTVATIGIAAPII 128
A++A LM+ A G +DR+GRR L +S++ A+ ++A P + +
Sbjct: 50 ALYA---LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLW--------V 98

Query: 129 LLVARLLQGFATGGEYGTSATYMSE 153
L + R++ G TG + Y+++
Sbjct: 99 LYIGRIVAGI-TGATGAVAGAYIAD 122


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS21000  DHBDHDRGNASE  82     2e-20    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 81.6 bits (201), Expect = 2e-20
Identities = 50/193 (25%), Positives = 84/193 (43%), Gaps = 9/193 (4%)

Query: 3 KTWLITGASSGFGRLLAETVLARGDRIVATVRTPQALA------DLQARYGDAATVLQLD 56
K ITGA+ G G +A T+ ++G I A P+ L +AR+ +A D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA---FPAD 65

Query: 57 VRDFAAVHAAVAQAFAALGRIDVVVSNAGYGTLGAAEAATEAQVRAIIDTNLIGSIALIQ 116
VRD AA+ A+ +G ID++V+ AG G + ++ + A N G +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 117 AVLPRLRQQGGGHVVQVSSEGGQIAYPGFSLYHASKWGIEGFVEAVQQEVAGFGIHFTLA 176
+V + + G +V V S + + Y +SK F + + E+A + I +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 177 EPGPARTNFGAAL 189
PG T+ +L
Sbjct: 186 SPGSTETDMQWSL 198


96  XADLMG695_RS23500  XADLMG695_RS21110  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS23500  -1      18       -1.405011  TetR family transcriptional regulator
XADLMG695_RS22930  -1      19       -2.167916  TetR/AcrR family transcriptional regulator
XADLMG695_RS21105  -1      19       -2.185153  SDR family oxidoreductase
XADLMG695_RS21110  1       29       -4.505551  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS21140  HTHTETR       69     5e-17    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 69.3 bits (169), Expect = 5e-17
Identities = 35/190 (18%), Positives = 69/190 (36%), Gaps = 13/190 (6%)

Query: 7 DTQQKILATAEALIYQHGIHATGMDLLVKTSGVARKSIYRHFDNKDEVAAAALNARDVRW 66
+T+Q IL A L Q G+ +T + + K +GV R +IY HF +K ++ + +
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70

Query: 67 LAWFRQQCDK-----ADRPEARILRMFTVLKEWFQSEGYRGCAF--INTAGEVGDPDDPV 119
+ K ++ + + F GE+
Sbjct: 71 GELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130

Query: 120 RKIARHHKQKLLDYTLELTGQLGITQPDALARQLLLLMEGAIT---VSRVMGDE--DAAD 174
R + ++ L+ + + D + R+ ++M G I+ + + + D
Sbjct: 131 RNLCLESYDRIEQT-LKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKK 189

Query: 175 TARDIAQLLL 184
ARD +LL
Sbjct: 190 EARDYVAILL 199


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS21145  HTHTETR       57     2e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 56.9 bits (137), Expect = 2e-12
Identities = 23/93 (24%), Positives = 44/93 (47%), Gaps = 3/93 (3%)

Query: 6 PLRADAQRNRERLLAAAEQVFLERGAEA-SMEDVAKRAGVGIGTLYRRFPTRESLFAAAY 64
+ +AQ R+ +L A ++F ++G + S+ ++AK AGV G +Y F + LF+ +
Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63

Query: 65 SGRFLSLAAASHARASSL--DALAALRAYLEDL 95
++ + D L+ LR L +
Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHV 96


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS21150  DHBDHDRGNASE  104    3e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 104 bits (261), Expect = 3e-29
Identities = 65/254 (25%), Positives = 111/254 (43%), Gaps = 12/254 (4%)

Query: 2 GKRFGGKVVVVTGGTDGIGLVTAKAFSAEGAQVY---ITGRRQDRLDAAVAEIGGGAVGV 58
K GK+ +TG GIG A+ +++GA + + +++ +++ A
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF 62

Query: 59 QGDVGVPEDMDRLYACIQQEHGRLDVVFANAGVSESAALGEIDIAHLERLLATNIKGTVF 118
DV +D + A I++E G +D++ AGV + + E + N G
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 119 TVQNALPLMAS--GGAVILAGSVAGSKGIGALSVYSATKAAIRSFARTWTSDLKRRGIRV 176
++ M G+++ GS +++ Y+++KAA F + +L IR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 177 NVMSPGMVHTPAMQTYLDANAGAE-------DAFKQMIPFGRLGDAEEIAEAVLFLASDA 229
N++SPG T + GAE + FK IP +L +IA+AVLFL S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 230 SSFIAGHELFIDGG 243
+ I H L +DGG
Sbjct: 243 AGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS21155  PRTACTNFAMLY  27     0.044    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 26.9 bits (59), Expect = 0.044
Identities = 18/59 (30%), Positives = 25/59 (42%)

Query: 97 SEPGSERTAAGQSIPSQASELSGTWTNNGGDNLAPMVAHMQRLGTVSDAGAAGAGGTIT 155
S+PG RTA+G +I + G N L + G +SD G GT+T
Sbjct: 56 SDPGGVRTASGTTIKVSGRQAQGILLENPAAELQFRNGSVTSSGQLSDDGIRRFLGTVT 114


97  XADLMG695_RS21265  XADLMG695_RS21295  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS21265  -2      12       1.456304   glutathione S-transferase
XADLMG695_RS21270  -1      14       1.364917   hypothetical protein
XADLMG695_RS21275  -1      15       1.534827   amino acid permease
XADLMG695_RS21280  -1      14       1.577077   glycoside hydrolase family 92 protein
XADLMG695_RS21285  0       17       2.438580   DUF2628 domain-containing protein
XADLMG695_RS21290  1       14       2.298055   diguanylate cyclase
XADLMG695_RS21295  0       14       1.792792   helix-turn-helix domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS21320  DHBDHDRGNASE  27     0.037    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 27.3 bits (60), Expect = 0.037
Identities = 18/82 (21%), Positives = 30/82 (36%), Gaps = 8/82 (9%)

Query: 89 ARALVEQWMDWQATELNTAWRYAFMATVRGSAAH--------ADAQAIAASVEQWNRHMA 140
AR L Q A + N +++++ A H D+ AI + R M
Sbjct: 25 ARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAAIDEITARIEREMG 84

Query: 141 ILDAQLQRGGPFVLGARFTLAD 162
+D + G G +L+D
Sbjct: 85 PIDILVNVAGVLRPGLIHSLSD 106


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS21340  ACRIFLAVINRP  28     0.009    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 28.3 bits (63), Expect = 0.009
Identities = 9/43 (20%), Positives = 20/43 (46%), Gaps = 3/43 (6%)

Query: 71 GLIGVGLVVGIVASFL---PASIGNALSIPLALLAGMSANYAY 110
+ LV ++ FL A++ +++P+ LL + A+
Sbjct: 344 LFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAF 386


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS21345  TYPE3OMOPROT  30     0.029    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature.
Length = 303

Score = 30.0 bits (67), Expect = 0.029
Identities = 38/140 (27%), Positives = 48/140 (34%), Gaps = 22/140 (15%)

Query: 245 VITEAVDACVRDGTSWDLELPLTSATGRRL---------WVHSTGSVEHVDGRKRLIGAV 295
++ + C R G LE P RL W+ +EHV L GA
Sbjct: 14 LLAQTATECQRHGREATLEYPTRQGMWVRLSDAEKRWSAWIKPGDWLEHVS--PALAGAA 71

Query: 296 QDVTDRHRAVDALAASERKFRKMFQYSLGLICTHDMHGRLVSINPAAARSL--GRSVEQM 353
H V LAA+ER F L H RL NP +L G+ + M
Sbjct: 72 VSAGAEHLVVPWLAATERPFE--------LPVPHLSCRRLCVENPVPGSALPEGKLLHIM 123

Query: 354 EGRSLVEFVR-PERHAALRG 372
R + F PE A G
Sbjct: 124 SDRGGLWFEHLPELPAVGGG 143


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS21350  HTHFIS        29     0.030    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.4 bits (66), Expect = 0.030
Identities = 11/49 (22%), Positives = 21/49 (42%), Gaps = 3/49 (6%)

Query: 308 PLQALLAQDRRCQLLKTLSVWFGAGMRMAPTAKALGIHRNTLDYRMQRI 356
+LA+ +L L+ A LG++RNTL +++ +
Sbjct: 428 LYDRVLAEMEYPLILAALTA---TRGNQIKAADLLGLNRNTLRKKIREL 473


98  XADLMG695_RS21335  XADLMG695_RS21370  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag           DNBias  CDNBias  %GCBias    Product
XADLMG695_RS21335  0       15       2.994908   rhomboid family intramembrane serine protease
XADLMG695_RS21340  1       18       3.309728   glycerophosphodiester phosphodiesterase
XADLMG695_RS21345  2       18       3.599467   TonB-dependent receptor
XADLMG695_RS21350  0       19       2.967913   phosphatase PAP2 family protein
XADLMG695_RS21355  0       20       1.892985   tRNA uridine-5-carboxymethylaminomethyl(34)
XADLMG695_RS21360  1       21       1.180537   polysaccharide deacetylase family protein
XADLMG695_RS21370  -1      16       0.383342   membrane protein insertase YidC
ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS21390  ACRIFLAVINRP  36     1e-04    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 36.4 bits (84), Expect = 1e-04
Identities = 31/143 (21%), Positives = 56/143 (39%), Gaps = 28/143 (19%)

Query: 79 ANAAALLILGTLAGSV-YPRATVMALPLLWLGSGLGAWLLGEPGSRH-------LGASGV 130
+ L L L S P + ++ +PL +G L A L + +
Sbjct: 879 SFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSA 938

Query: 131 THGLMFLVFVLGLLR----------------RDRPAIATSMIAFLFYGGMLMTILPHEAG 174
+ ++ + F L+ R RP + TS +AF+ G+L + + AG
Sbjct: 939 KNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTS-LAFIL--GVLPLAISNGAG 995

Query: 175 VSWQSHLGGAV-AGLIAALLLRL 196
Q+ +G V G+++A LL +
Sbjct: 996 SGAQNAVGIGVMGGMVSATLLAI 1018


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS21410  PF05272       29     0.038    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.038
Identities = 35/123 (28%), Positives = 47/123 (38%), Gaps = 24/123 (19%)

Query: 220 VLIGPPNAGKSSLLNALAGSDRAIVTDV-AGTTRDTLREAIQLDGFELTLVDTAGLRDGG 278
VL G GKS+L+N L G D T GT +D+ + + +EL+
Sbjct: 600 VLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELS----------- 648

Query: 279 DAIEREGMRRARAELERADLALVVLDARDPQAARDAIGDAIDAVPRQLWI---HNKCDLL 335
E RRA AE A+ + R A G + PRQ+ I NK L
Sbjct: 649 ---EMTAFRRADAE------AVKAFFSSRKDRYRGAYGRYVQDHPRQVVIWCTTNKRQYL 699

Query: 336 ADA 338
D
Sbjct: 700 FDI 702


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS21415  SYCDCHAPRONE  31     0.011    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 31.1 bits (70), Expect = 0.011
Identities = 20/94 (21%), Positives = 36/94 (38%), Gaps = 7/94 (7%)

Query: 800 QRYADAAEQF-------AEALKLRPDFALAANNLGFVYYRQGRFAESARWLENTLKIDPS 852
Q Y A E F A ++ D +L F Y+ G++ ++ + + +D
Sbjct: 9 QEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHY 68

Query: 853 RAVAYLNLGDAYAKAGDRDKARKAYSTYLELQPQ 886
+ +L LG G D A +YS + +
Sbjct: 69 DSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIK 102


ORFs having significant similarity with Known Virulence factors
LocusTag           Hits          Score  E-value  Comments
XADLMG695_RS21420  60KDINNERMP   459    e-158    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 459 bits (1181), Expect = e-158
Identities = 207/572 (36%), Positives = 300/572 (52%), Gaps = 42/572 (7%)

Query: 1 MNQTRVFLIFAWLMVAALLWMEWGKDKAAANAPVVAATQSVPAARDLDAAAPSAPNVPSA 60
M+ R L+ A L V+ ++W W +DK P A Q+ +A +
Sbjct: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKN----PQPQAQQTT-------QTTTTAAGSAAD 49

Query: 61 QAIPQAGALGTVPATSSTAATPAAAGAAPVVTLTSDVLRLKLD--GRSVLDAELLQFPQT 118
Q +P A+G ++++ +DVL L ++ G V A L +P+
Sbjct: 50 QGVP-------------------ASGQGKLISVKTDVLDLTINTRGGDVEQALLPAYPKE 90

Query: 119 KDGTAPVSLLTEDPAHPYNATSGWASEHSPVPGVGGFRA--EQPGTTFELAKGQNTLVVP 176
+ T P LL P Y A SG P G R + LA+GQN L VP
Sbjct: 91 LNSTQPFQLLETSPQFIYQAQSGLTGRDGPDNPANGPRPLYNVEKDAYVLAEGQNELQVP 150

Query: 177 FVWNGPDGVSIRRTFTLERGRYAISIKDEVINKSGAPWNGYVFRKLSR---VPTILSRGM 233
+ G + +TF L+RG YA+++ V N P F +L + +P L G
Sbjct: 151 MTYTDAAGNTFTKTFVLKRGDYAVNVNYNVQNAGEKPLEISSFGQLKQSITLPPHLDTGS 210

Query: 234 TNPDSFSFNGATWYSPQEGYERRAFKDYMDDGGLNRQITGGWVALLQHHFFTAWIPQKDQ 293
+N +F GA + +P E YE+ F D+ LN GGWVA+LQ +F TAWIP D
Sbjct: 211 SNFALHTFRGAAYSTPDEKYEKYKFDTIADNENLNISSKGGWVAMLQQYFATAWIPHNDG 270

Query: 294 ASLYVLAQDGPRD-VAELRGPAFTVAPGQTASTEARLWVGPKLVSLIAKEDVKGLDRVVD 352
+ + A G + V PGQT + + LWVGP++ + LD VD
Sbjct: 271 TNNFYTANLGNGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKM-AAVAPHLDLTVD 329

Query: 353 YSRFSIMAIIGQGLFWVLSHLHSFLHNWGWAIIGLVVLLRLALYPLSAAQYKSGAKMRRF 412
Y I Q LF +L +HSF+ NWG++II + ++R +YPL+ AQY S AKMR
Sbjct: 330 YGWLWF---ISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRML 386

Query: 413 QPRLAQLKERYGDDRVKYQQATMELFKKEKINPMGGCLPLLIQMPIFFALYWVLVESVEL 472
QP++ ++ER GDD+ + Q M L+K EK+NP+GGC PLLIQMPIF ALY++L+ SVEL
Sbjct: 387 QPKIQAMRERLGDDKQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVEL 446

Query: 473 RQAPWLGWIQDLTARDPYFILPLLNISIMWATQKLTPTPGMDPMQAKMMQFMPLVFGVMM 532
RQAP+ WI DL+A+DPY+ILP+L M+ QK++PT DPMQ K+M FMP++F V
Sbjct: 447 RQAPFALWIHDLSAQDPYYILPILMGVTMFFIQKMSPTTVTDPMQQKIMTFMPVIFTVFF 506

Query: 533 AFMPAGLVLYWVVNGGLGLLIQWWMIRQHGEK 564
+ P+GLVLY++V+ + ++ Q + R ++
Sbjct: 507 LWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKR 538



 
Contact Sachin Pundhir for Bugs/Comments.