PredictBias

identification of genomic and pathogenicity islands in prokaryotic genome
A) Input parameters
Genome: AXO1947.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
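Each virulence-factor hit reported in section C carries a statistics line of the form `Score = 34.9 bits (80), Expect = 2e-04`. The sketch below shows one way such lines could be parsed and compared with the E-value cutoff listed above (0.05). It is an illustration only, not PredictBias code: the function names are hypothetical, and treating the cutoff as inclusive (`<=`) is an assumption. Legacy BLAST output abbreviates very small values as `e-166`, which is read here as `1e-166`.

```python
import re

# Matches legacy BLAST statistics lines such as
#   "Score = 34.9 bits (80), Expect = 2e-04"
#   "Score = 472 bits (1215), Expect = e-166"
STATS_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)


def parse_expect(text):
    """Convert an Expect string to float; 'e-166' is shorthand for '1e-166'."""
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_stats(line):
    """Return (bit_score, raw_score, e_value), or None for non-statistics lines."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    return float(match.group(1)), int(match.group(2)), parse_expect(match.group(3))


def passes_threshold(line, max_evalue=0.05):
    """True if the line is a statistics line whose E-value is within the cutoff.

    The inclusive comparison is an assumption, not documented tool behaviour.
    """
    stats = parse_stats(line)
    return stats is not None and stats[2] <= max_evalue
```

For example, `passes_threshold("Score = 27.6 bits (61), Expect = 0.033")` evaluates to `True` under the 0.05 cutoff, while a line such as `Length = 168` is ignored because it is not a statistics line.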
 
C) Potential GIs and PAIs in NZ_CP013666
S.No    Start    End    Bias    Virulence    Insertion elements    Prediction
1    AXO1947_RS00010    AXO1947_RS00035    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
AXO1947_RS00010213-0.768375DNA polymerase III subunit beta
AXO1947_RS00015214-0.574602DNA replication and repair protein RecF
AXO1947_RS00020219-0.737996DNA gyrase subunit B
AXO1947_RS00025319-1.648869CPBP family intramembrane metalloprotease
AXO1947_RS00030322-2.603859peptidase
AXO1947_RS00035221-2.389166hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS00035    SYCDCHAPRONE    35    2e-04    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 34.9 bits (80), Expect = 2e-04
Identities = 29/152 (19%), Positives = 53/152 (34%), Gaps = 3/152 (1%)

Query: 152 QLQLQDNQQVEGLATLDKYLEESKSQRPEDLILKGQALYQAERYKEAIPVLKQAIAASPE 211
+ QL ++G T+ E S S E L YQ+ +Y++A V +
Sbjct: 10 EYQLAMESFLKGGGTIAMLNEIS-SDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHY 68

Query: 212 PKDTWNQLLMASYAEAGQTGEAVAAAEALAAKTPNDKKAQLNLASMYMQADQMDKAAAVM 271
+ L A GQ A+ + A + + + A +Q ++ +A + +
Sbjct: 69 DSRFFLG-LGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGL 127

Query: 272 DKLRA-AGQLTEEKEYKQLYSIYANTENKEKD 302
+ TE KE S +K+
Sbjct: 128 FLAQELIADKTEFKELSTRVSSMLEAIKLKKE 159


2    AXO1947_RS00285    AXO1947_RS00330    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
AXO1947_RS002852192.384823two-component sensor histidine kinase
AXO1947_RS002902211.475125ammonia channel protein
AXO1947_RS002952211.055144nitrogen regulatory protein P-II 1
AXO1947_RS003001211.295958type I glutamate--ammonia ligase
AXO1947_RS003052220.787208undecaprenyl-diphosphatase
AXO1947_RS003100210.361814hypothetical protein
AXO1947_RS00315-2182.211484phosphatidylethanolamine-binding protein
AXO1947_RS003251143.745674adenylate cyclase
AXO1947_RS003300153.050051hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS00335    SYCDCHAPRONE    36    1e-04    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 36.4 bits (84), Expect = 1e-04
Identities = 20/102 (19%), Positives = 32/102 (31%), Gaps = 3/102 (2%)

Query: 96 DPNQFNAYVMQAHLAVARGDLDEAARLSRTAARLVPEHPQLLAVDGLVEMRRGQNDRALS 155
Q + + + G ++A ++ + L + G GQ D A+
Sbjct: 35 TLEQLYSLAFNQYQS---GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIH 91

Query: 156 LLTRAAEQLPDDARVLFSLGFAYLQKEHFAFAERAFERVIEL 197
+ A + R F LQK A AE EL
Sbjct: 92 SYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133


3    AXO1947_RS20330    AXO1947_RS00705    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
AXO1947_RS203304171.440916IS5/IS1182 family transposase
AXO1947_RS005101100.686730hypothetical protein
AXO1947_RS005200100.570125acetyl-CoA acetyltransferase
AXO1947_RS00530-290.134389hypothetical protein
AXO1947_RS00535-280.229287porphyrin biosynthesis protein
AXO1947_RS00545-290.159438uroporphyrin-III methyltransferase
AXO1947_RS00550-170.110917uroporphyrinogen-III synthase
AXO1947_RS005600101.103204glycosyl transferase
AXO1947_RS005654112.436538thioesterase
AXO1947_RS005756121.862091hypothetical protein
AXO1947_RS005805122.294282rhodanese-like domain-containing protein
AXO1947_RS005901121.100048protein-export chaperone SecB
AXO1947_RS006003200.179684glycerol-3-phosphate dehydrogenase
AXO1947_RS00605012-0.522455Ax21 family protein
AXO1947_RS00610-211-0.088018pyruvate oxidase
AXO1947_RS00620-1121.488160two-component sensor histidine kinase
AXO1947_RS006302162.202903sigma-54-dependent Fis family transcriptional
AXO1947_RS006402112.074183hypothetical protein
AXO1947_RS006453112.182170hypothetical protein
AXO1947_RS006504121.015350MFS transporter
AXO1947_RS006552100.615102IS5/IS1182 family transposase
AXO1947_RS00660-211-2.381437tRNA
AXO1947_RS00670-311-2.059825hypothetical protein
AXO1947_RS00675-211-2.7351793-oxoacyl-ACP synthase III
AXO1947_RS0068009-1.049437addiction module protein
AXO1947_RS00685113-1.053173alpha/beta hydrolase
AXO1947_RS006902110.574032hypothetical protein
AXO1947_RS203403140.612745zinc/iron-chelating domain-containing protein
AXO1947_RS006952140.520622peptide synthase
AXO1947_RS007003130.5573353-beta hydroxysteroid dehydrogenase
AXO1947_RS007052110.228265DUF1328 domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS00585    PF06580    31    0.009    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 30.6 bits (69), Expect = 0.009
Identities = 9/50 (18%), Positives = 21/50 (42%)

Query: 13 LAWLLLVVAVAAAGVALFLSWRAWQSYQAAQLQAAQAQQQRWDGTQQMLE 62
L+ + VV V L+ W +++Y+ A++ + + L+
Sbjct: 118 LSIIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALK 167


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS00615    SECBCHAPRONE    195    8e-67    Bacterial protein-transport SecB chaperone protein ...
		>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein signature.
Length = 170

Score = 195 bits (497), Expect = 8e-67
Identities = 64/162 (39%), Positives = 100/162 (61%), Gaps = 3/162 (1%)

Query: 1 MSDEILNGAAAPADAAAGPAFTIEKIYVKDVSFESPNAPAVFNDANQPELQLNLNQKVQR 60
MS+E AA A P I++IYVKDVSFE+PN P +F +P+L +L+ + ++
Sbjct: 1 MSEENQVNAAD-TQATQQPVLQIQRIYVKDVSFEAPNLPHIFQQDWEPKLSFDLSTEAKQ 59

Query: 61 LNDNAFEVVLAVTLTCTA--GGKTAYVAEVQQAGVFGLVGLDPQAIDVLLGTQCPNILFP 118
+ D+ +EV L +++ T G A++ EV+QAGVF + GL+ + L +QCPN+LFP
Sbjct: 60 VGDDLYEVCLNISVETTMESSGDVAFICEVKQAGVFTISGLEEMQMAHCLTSQCPNMLFP 119

Query: 119 YVRTLVSDLIQAGGFPPFYLQPINFEALYAETLRQRQNESAS 160
Y R LVS L+ G FP L P+NF+AL+ + L++++ +
Sbjct: 120 YARELVSSLVNRGTFPALNLSPVNFDALFMDYLQRQEQAEQT 161


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS00625    OUTRMMBRANEA    28    0.033    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 27.6 bits (61), Expect = 0.033
Identities = 15/95 (15%), Positives = 30/95 (31%), Gaps = 12/95 (12%)

Query: 49 KASYAIAPNFHVFGE----YSKQNADDNNNLFENTNSDFQQWGVGVGFNHEIATSTDFVA 104
K Y I + ++ + + N + + GV E A + +
Sbjct: 103 KLGYPITDDLDIYTRLGGMVWRADTKSNVYGKNHDTGVSPVFAGGV----EYAITPEIAT 158

Query: 105 RVAYRRL----DLDSPNINFDGYSVEAGLRNAFGE 135
R+ Y+ D + D + G+ FG+
Sbjct: 159 RLEYQWTNNIGDAHTIGTRPDNGMLSLGVSYRFGQ 193


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS00640    HTHFIS    472    e-166    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 472 bits (1215), Expect = e-166
Identities = 177/476 (37%), Positives = 257/476 (53%), Gaps = 37/476 (7%)

Query: 4 ILIIDDDAAFRTTLQATLRSFGHTVVAADNGPDGLARLSEGGIDMALVDFRMPGMDGIAV 63
IL+ DDDAA RT L L G+ V N ++ G D+ + D MP + +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 LRARLDDAQARQVPLVMLTAHVSSGNTIEAMTLGAFDHLVKPVGRADIVEVVERALLSRA 123
L R+ A+ +P+++++A + I+A GA+D+L KP +++ ++ RAL
Sbjct: 66 LP-RIKKARPD-LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 124 DAQAAATETPQAPHEDDDALVGHSPAMRTVHKRIGLAAASDLPVLITGETGTGKELAARA 183
+ + Q LVG S AM+ +++ + +DL ++ITGE+GTGKEL ARA
Sbjct: 124 RRPSKLEDDSQDGMP----LVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 184 LHRASPRAKAPFVAVNCAAIPLELMESELFGHRKGAFSGASSDRLGLIREADGGTLFLDE 243
LH R PFVA+N AAIP +L+ESELFGH KGAF+GA + G +A+GGTLFLDE
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239

Query: 244 IGDMPLPMQAKLLRFLQEGEVTPLGGSGPQKVDVRVLAATHRDLAACVADGRFRSDLRYR 303
IGDMP+ Q +LLR LQ+GE T +GG P + DVR++AAT++DL + G FR DL YR
Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299

Query: 304 LNVVPIELPPLRERGQDILLLAQHFLSTNAA---RAQSLSPAAQERLLAHRWPGNVRELR 360
LNVVP+ LPPLR+R +DI L +HF+ + A E + AH WPGNVREL
Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 361 NVMQRSQVMVRGASIDAADLD----------------------------EALAEAAEATP 392
N+++R + I ++ E A+
Sbjct: 360 NLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419

Query: 393 DVASPMTGTLPEAVARLEKQMIQSALEQSHGNRAEAARRLGIHRQLLYRKLEEYGL 448
A P +G +A +E +I +AL + GN+ +AA LG++R L +K+ E G+
Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS00655    TCRTETA    36    2e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 36.3 bits (84), Expect = 2e-04
Identities = 65/374 (17%), Positives = 123/374 (32%), Gaps = 12/374 (3%)

Query: 30 PFLSVFLQSKGWSVAAIGTVMSVGGIAGMLATTPAGALVDATRRKRAVVVVGCLAILLAT 89
P L L A G ++++ + GAL D R R V++V +
Sbjct: 29 PGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGR-RPVLLVSLAGAAVDY 87

Query: 90 ALIWLQPTSSGVVAAQIASALAAAGIGPALTGITLGLVHAHGFDHQLARNQVANHAGNVL 149
A++ P + +I + + A G + G V
Sbjct: 88 AIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDGDERARHFGFMSACFGFGMVA 146

Query: 150 AAVLAGWLGWRYGFAAVFLLTAFFGVLALVAVLAIPAAAIDHRAARGLASNDNGDALSGW 209
VL G +G + A F A L + + + H+ R + + L+ +
Sbjct: 147 GPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCFLLPES--HKGERRPLRREALNPLASF 203

Query: 210 RVLLTCRPLALLAVTLGLFHLGNAAMLPLYGMAIVAAHAGDPSALTATIIVVAQATMVVV 269
R +A L + L L+ + D + + ++ +
Sbjct: 204 RWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQ 263

Query: 270 ALLAMRWIRVHGHWWVLLVAFMALPLRALVAASVIHGWGVFPVQILDGLGAGLQSVVVPA 329
A++ G L++ +A ++ A GW FP+ +L G + +PA
Sbjct: 264 AMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGG----IGMPA 319

Query: 330 LVARLLQGTGRVNVG--QGAVMTVQGIGAALSPAFGGWL-AHAFGYRTAFLALGAIALLA 386
L A L + G QG++ + + + + P + A + + + AL
Sbjct: 320 LQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYL 379

Query: 387 VALWAGCRGMLQAA 400
+ L A RG+ A
Sbjct: 380 LCLPALRRGLWSGA 393


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS00705    NUCEPIMERASE    131    1e-37    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 131 bits (330), Expect = 1e-37
Identities = 74/362 (20%), Positives = 129/362 (35%), Gaps = 78/362 (21%)

Query: 1 MKLLVTGGGGFLGQALCRGLRARGHEVV-----------SFQRGDYPVLQSLGVGQIRGD 49
MK LVTG GF+G + + L GH+VV S ++ +L G + D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 50 LADPQAVRHALA--GIDAVFHNAAKAG---AWGSYDSYHQANVVGTQNVIEACRATGVPR 104
LAD + + A + VF + + + + +Y +N+ G N++E CR +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 105 LIYTSTPSVTHRATNPVEGLGADE-VPYGDNLRAA-----YAVTKAIAERAVLAANDA-Q 157
L+Y S+ SV G + +P+ + YA TK E +
Sbjct: 121 LLYASSSSV----------YGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170

Query: 158 LATVALRPRLIWGP-GD-NHLLPRLAARARAGR-LRMVGDGSNLVDSTYIDNAAQAHFDA 214
L LR ++GP G + L + G+ + + G D TYID+ A+A
Sbjct: 171 LPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRL 230

Query: 215 FEHLAVGAACA-------------GKAYFISNGEPLPMRELLNRLLAAVDAPAVTRSLSF 261
+ + + Y I N P+ + + + L A+ A L
Sbjct: 231 QDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPL 290

Query: 262 KTAYRIGAVCETLWPLLRLPGEVPLTRFLVEQLCTPHWYSMQPARRDFGYVPGISIEEGL 321
+ PG+V T + G+ P ++++G+
Sbjct: 291 Q------------------PGDVLET-----------SADTKALYEVIGFTPETTVKDGV 321

Query: 322 QR 323
+
Sbjct: 322 KN 323


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS20345    PF04335    24    0.029    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 24.0 bits (52), Expect = 0.029
Identities = 5/20 (25%), Positives = 9/20 (45%)

Query: 28 TNIAWILFVVFLILAVISMF 47
+AW++ V LA +
Sbjct: 32 KKLAWVVAGVAGALATAGVV 51


4    AXO1947_RS01260    AXO1947_RS01380    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
AXO1947_RS01260212-0.830994alkaline phosphatase
AXO1947_RS012651140.29216116S rRNA methyltransferase G
AXO1947_RS012700131.130418hypothetical protein
AXO1947_RS012800112.31532650S ribosomal protein L28
AXO1947_RS01285-2112.42290250S ribosomal protein L33
AXO1947_RS01295-2112.4439024-oxalomesaconate hydratase
AXO1947_RS01305-212-0.0588434-oxalomesaconate hydratase
AXO1947_RS01310-111-0.7804844-carboxy-4-hydroxy-2-oxoadipate
AXO1947_RS01315214-0.477346IS5/IS1182 family transposase
AXO1947_RS01320313-0.157890LysR family transcriptional regulator
AXO1947_RS01325111-0.064028cardiolipin synthase
AXO1947_RS013302131.542758pyridine nucleotide-disulfide oxidoreductase
AXO1947_RS204101142.226511DNA-dependent helicase II
AXO1947_RS013351142.196413hypothetical protein
AXO1947_RS013403172.392166maltose acetyltransferase
AXO1947_RS013450151.928888universal stress protein UspA
AXO1947_RS013500162.009649hypothetical protein
AXO1947_RS013551141.5340763-hydroxyisobutyrate dehydrogenase
AXO1947_RS013601141.406586hypothetical protein
AXO1947_RS013651141.339619hypothetical protein
AXO1947_RS01370-1120.370339hypothetical protein
AXO1947_RS013803141.109067DNA polymerase I
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS01345    PF05043    34    0.001    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 34.1 bits (78), Expect = 0.001
Identities = 20/85 (23%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS01380    cdtoxinb    28    0.021    Cytolethal distending toxin B signature.
		>cdtoxinb#Cytolethal distending toxin B signature.

Length = 269

Score = 28.0 bits (62), Expect = 0.021
Identities = 12/24 (50%), Positives = 16/24 (66%)

Query: 148 GVTIGDDALFGAGAVATRDVPAGA 171
G+ IG+DA F A A+A R+ A A
Sbjct: 142 GIRIGNDAFFTAHAIAMRNNDAPA 165


5    AXO1947_RS01620    AXO1947_RS20490    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
AXO1947_RS01620317-6.016746hypothetical protein
AXO1947_RS01625517-5.861089hypothetical protein
AXO1947_RS20465623-7.405734hypothetical protein
AXO1947_RS20470634-7.8489002-dehydropantoate 2-reductase
AXO1947_RS01640126-4.275392hypothetical protein
AXO1947_RS01645128-4.895596ABC transporter permease
AXO1947_RS01650-125-3.763106ABC transporter
AXO1947_RS01655232-5.481507ferredoxin--NADP(+) reductase
AXO1947_RS01665014-2.471505cation transporter
AXO1947_RS016701130.002158hypothetical protein
AXO1947_RS01675114-0.803982hypothetical protein
AXO1947_RS016802181.180574transcriptional regulator
AXO1947_RS016851161.087596cysteine protease
AXO1947_RS016901170.563432ABC transporter ATP-binding protein
AXO1947_RS017001140.287442sodium ABC transporter permease
AXO1947_RS204902140.095664energy transducer TonB
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS01680    ABC2TRNSPORT    65    1e-14    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.
Length = 262

Score = 65.3 bits (159), Expect = 1e-14
Identities = 57/247 (23%), Positives = 105/247 (42%), Gaps = 4/247 (1%)

Query: 12 NWIALATIVRREVQRILRIWGQTLVPPAITMTLYFLIFGGLIGSRVGEMGGYSYMQFIVP 71
NWIA + RR + +L+ +Y G +G VG +GG SY F+
Sbjct: 15 NWIA---VWRRNYIAWKKAALASLLGHLAEPLIYLFGLGAGLGVMVGRVGGVSYTAFLAA 71

Query: 72 GLVMMSVIQNS-YGNISSSFFGAKFGRHVEELLVSPMPNWVILWGYVSGAVLRGVMVGAI 130
G+V S + + + I ++F + R E +L + + I+ G ++ A + + GA
Sbjct: 72 GMVATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAG 131

Query: 131 VLIIAMFFTPVRIPHPIVTLTTVLLGATIFSLAGFVNAVYAKKFDDVAIVPTFILTPLTY 190
+ ++A + + L + L F+ G V A +D T ++TP+ +
Sbjct: 132 IGVVAAALGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILF 191

Query: 191 LGGVFYSVKLLPGWAEAATHANPIFYMVNAFRYGLLGSSDVPIWVAYALMLGFVAVLSAL 250
L G + V LP + A P+ + ++ R +LG V + + ++ + L
Sbjct: 192 LSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFL 251

Query: 251 ALWLLRR 257
+ LLRR
Sbjct: 252 STALLRR 258


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS01690    adhesinmafb    28    0.041    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 28.1 bits (62), Expect = 0.041
Identities = 14/56 (25%), Positives = 26/56 (46%), Gaps = 3/56 (5%)

Query: 15 IAPTVAHCQFVRDDGQPLDFQPGQFIQIHFDYADGTPTKRSYSLATIHD--HALGP 68
+A +A F+ D+ Q ++PG + F G+ + R+ + I D H +G
Sbjct: 26 LAADLAQDPFITDNAQRQHYEPGGKYHL-FGDPRGSVSDRTGKINVIQDYTHQMGN 80


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS01730    PF03544    70    4e-16    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 70.4 bits (172), Expect = 4e-16
Identities = 35/145 (24%), Positives = 63/145 (43%), Gaps = 1/145 (0%)

Query: 180 QVEVKAKQAAEQKRLGEQQTREAAAAQQIAAQQEAARQQTAEAERQAAARRQAEAPASTP 239
+ E + E + E+ + + + E ++ E + A+ + APA
Sbjct: 79 EPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPT 138

Query: 240 AAPLPAPAATAAAAPAAQSLRPISTPAPHYPPEALRAGTSGEVLVELTVGTDGSITASRV 299
++ A + + A+ R +S P YP A G+V V+ V DG + ++
Sbjct: 139 SSTATAATSKPVTSVAS-GPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQI 197

Query: 300 LRANPPRVFDREALNAVKHWRFEPV 324
L A P +F+RE NA++ WR+EP
Sbjct: 198 LSAKPANMFEREVKNAMRRWRYEPG 222


6    AXO1947_RS01770    AXO1947_RS02085    Y    Y    Y    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
AXO1947_RS01770-3143.087932shikimate dehydrogenase
AXO1947_RS017751143.141708hypothetical protein
AXO1947_RS017800122.356850ATP-dependent DNA helicase DinG
AXO1947_RS017850112.668283hypothetical protein
AXO1947_RS017901103.150820hemolysin D
AXO1947_RS017950102.457573IS5/IS1182 family transposase
AXO1947_RS018000112.256308catalase
AXO1947_RS018050111.983320hypothetical protein
AXO1947_RS01810-1101.751403hypothetical protein
AXO1947_RS01815-1111.007349hypothetical protein
AXO1947_RS20510011-1.551615tRNA dihydrouridine(20/20a) synthase DusA
AXO1947_RS20515011-3.329443hypothetical protein
AXO1947_RS20520113-3.848828DNA-binding response regulator
AXO1947_RS01830113-3.638847two-component sensor histidine kinase
AXO1947_RS01835010-2.838328hypothetical protein
AXO1947_RS01840114-2.397950hypothetical protein
AXO1947_RS01845119-1.419010hypothetical protein
AXO1947_RS018505230.749123hypothetical protein
AXO1947_RS018554230.682443bifunctional biotin--[acetyl-CoA-carboxylase]
AXO1947_RS018654221.343760pantothenate kinase
AXO1947_RS018703190.840743*thymidylate kinase
AXO1947_RS01880013-0.548172GTPase
AXO1947_RS20535-29-0.260519hypothetical protein
AXO1947_RS01885-39-0.269906arginase
AXO1947_RS01890-211-0.785450entericidin
AXO1947_RS01895-211-0.831233entericidin
AXO1947_RS01900-112-0.107832CsbD family protein
AXO1947_RS019101142.611691tryptophan--tRNA ligase
AXO1947_RS019152154.342800MBL fold metallo-hydrolase
AXO1947_RS019201105.719438peptidase M20
AXO1947_RS205401125.341818hypothetical protein
AXO1947_RS01925094.692303drug/metabolite exporter YedA
AXO1947_RS019300123.950740protein RarD
AXO1947_RS019350133.138874alpha/beta hydrolase
AXO1947_RS019450172.002384cytochrome b
AXO1947_RS205450160.813933catalase
AXO1947_RS01955-112-0.260163DNA-directed RNA polymerase sigma-70 factor
AXO1947_RS01960015-0.583225transmembrane regulator protein PrtR
AXO1947_RS01965018-0.677872cytochrome c oxidase subunit II
AXO1947_RS01970017-0.126288haloacid dehalogenase
AXO1947_RS019751170.173645AI-2E family transporter
AXO1947_RS019802180.676247hypothetical protein
AXO1947_RS019853171.463622hypothetical protein
AXO1947_RS019955131.778068hypothetical protein
AXO1947_RS020053122.029486serine endoprotease DegQ
AXO1947_RS020104121.880858histidine biosynthesis protein HisIE
AXO1947_RS205553142.474489hypothetical protein
AXO1947_RS020203142.567692epimerase
AXO1947_RS020251121.996257hypothetical protein
AXO1947_RS020302121.276365hypothetical protein
AXO1947_RS020351131.117069hypothetical protein
AXO1947_RS02045-1110.521306hypothetical protein
AXO1947_RS02050412-0.190615IS5/IS1182 family transposase
AXO1947_RS020555150.269014TetR family transcriptional regulator
AXO1947_RS020603161.248469hypothetical protein
AXO1947_RS020653171.113291acyl-CoA desaturase
AXO1947_RS020703140.833321cyclic diguanylate phosphodiesterase
AXO1947_RS020755190.086643hypothetical protein
AXO1947_RS020801180.352528hypothetical protein
AXO1947_RS02085317-0.560637ATP-dependent protease
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS01830    RTXTOXIND    66    1e-15    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 66.4 bits (162), Expect = 1e-15
Identities = 29/148 (19%), Positives = 58/148 (39%), Gaps = 6/148 (4%)

Query: 4 AEKGIVTKVQLLQQQDIAIQNQGQLEELQKHALDLRVEHCQLQLQLEQTPATLEA----K 59
K + K +L+Q++ ++ +L + + E + + + + K
Sbjct: 244 LHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK 303

Query: 60 RNDIARQIADLAQSLSETGARCS-VVLRAPTDGMVTNLLVHA-GQPVGAQQPLITLLSKD 117
I L L++ R V+RAP V L VH G V + L+ ++ +D
Sbjct: 304 LRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPED 363

Query: 118 IALRAELWVPSKAVGFVTCGDRVLLRYQ 145
L V +K +GF+ G +++ +
Sbjct: 364 DTLEVTALVQNKDIGFINVGQNAIIKVE 391



Score = 28.6 bits (64), Expect = 0.009
Identities = 7/32 (21%), Positives = 15/32 (46%)

Query: 82 SVVLRAPTDGMVTNLLVHAGQPVGAQQPLITL 113
S ++ + +V ++V G+ V L+ L
Sbjct: 96 SKEIKPIENSIVKEIIVKEGESVRKGDVLLKL 127


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS01895    HTHFIS    97    1e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 97.2 bits (242), Expect = 1e-25
Identities = 29/117 (24%), Positives = 58/117 (49%)

Query: 2 RILLVEDEAPLRETLAARLKREGFAVDAAQDGEEGLYMGREVPFDVGIIDLGLPKMSGME 61
IL+ +D+A +R L L R G+ V + D+ + D+ +P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 LIKALRDEGKKFPVLILTARSSWQDKVEGLKQGADDYLVKPFHVEELLARVNALLRR 118
L+ ++ PVL+++A++++ ++ ++GA DYL KPF + EL+ + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS01910    PF07201    30    0.014    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 30.2 bits (68), Expect = 0.014
Identities = 19/141 (13%), Positives = 42/141 (29%), Gaps = 11/141 (7%)

Query: 160 LTFQGGSDLPNVSLRSIGKGGLDARQAQILSQMYDSTPLAAAARDGVALRQQVTAQLRDE 219
L+ L + GK + Q ++L + D+ L +Q + +E
Sbjct: 110 LSNSPNISLSQLKAYLEGKSEEPSEQFKMLCGLRDALKGRPELAHLSHLVEQALVSMAEE 169

Query: 220 MEQAAR----GAASARTFADETRRMATLMRERYRLGFVDVGG----WDT-HANQGSVEGG 270
+ A + +R+ YR + G W + +
Sbjct: 170 QGETIVLGARITPEAYRESQSGVNPLQPLRDTYRDAVMGYQGIYAIWSDLQKRFPNGD-- 227

Query: 271 LANNLRNLGEGLAAYADALGP 291
+ + + L + L+A +
Sbjct: 228 IDSVILFLQKALSADLQSQQS 248


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS01930    PF03309    113    2e-32    Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 113 bits (284), Expect = 2e-32
Identities = 53/251 (21%), Positives = 92/251 (36%), Gaps = 23/251 (9%)

Query: 5 LFDLGNSRFKYAPLHGNRAGQ--VQAWAHGAE--------AMDATALAALPSGQIAHVA- 53
D+ N+ + G+ VQ W E A+ L + ++ +
Sbjct: 4 AIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTADELALTIDGLIGDDAERLTGASG 63

Query: 54 SVAAPALTQRMIACLQERFTQV-RIVRTAAECAGIRIAYADPSRFGVDRFLALLGARG-- 110
P++ + L++ + V ++ GI + +P G DR + L A
Sbjct: 64 LSTVPSVLHEVRVMLEQYWPNVPHVLIEPGVRTGIPLLVDNPKEVGADRIVNCLAAYHKY 123

Query: 111 DAPVLVAGVGTALTIDVLGDDGLHHGGRIAASPTTMREALHARAVQLPA---SGGDYVEL 167
+V G+++ +DV+ G GG IA +A AR+ L + V +
Sbjct: 124 GTAAIVVDFGSSICVDVVSAKGEFLGGAIAPGVQVSSDAAAARSAALRRVELTRPRSV-I 182

Query: 168 AIDTDDALTSG----CDGAAVALIERSLQHAQRSLGAPVRLLVHGGGAPPLLPLLPGA-T 222
+T + + +G G L+ R GA V ++ G AP +LP L
Sbjct: 183 GKNTVECMQAGAVFGFAGLVDGLVNRIRDDVDGFSGADVAVVATGHTAPLVLPDLRTVEH 242

Query: 223 FRAALVLDGLA 233
+ L LDGL
Sbjct: 243 YDRHLTLDGLR 253


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS02070    V8PROTEASE    82    3e-19    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 82.4 bits (203), Expect = 3e-19
Identities = 31/193 (16%), Positives = 70/193 (36%), Gaps = 40/193 (20%)

Query: 110 LGSGVIIDAQKGYVLTNHHVIENADDVQVTL------------GDGRTVKADFIGSDADT 157
+ SGV++ K +LTN HV++ L +G +
Sbjct: 103 IASGVVVG--KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEG 160

Query: 158 DIALIRIKAD--------NLTDIKLADSNALRVGDFVVAIGNPFG---FTQTVTSGIVSA 206
D+A+++ + + ++++ +V + G P T + G ++
Sbjct: 161 DLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMWESKGKITY 220

Query: 207 VGRSGIRGLGYQNFIQTDASINPGNSGGALVNLQGQLVGINTASFNPQGSMAGNIGLGLA 266
+ +Q D S GNSG + N + +++GI+ + N + +
Sbjct: 221 L---------KGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWGGVPNE----FNGAVFIN 267

Query: 267 --IPSNLARNVVE 277
+ + L +N+ +
Sbjct: 268 ENVRNFLKQNIED 280


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS02075    IGASERPTASE    32    0.003    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.3 bits (73), Expect = 0.003
Identities = 19/145 (13%), Positives = 39/145 (26%)

Query: 111 QKLVSTKDAAKHKLTATTDAAKQKLSSTSAAAKKKITDTKANTKRKLEIAKANAKAEAAA 170
+K T D A + S + + AE +
Sbjct: 986 EKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSK 1045

Query: 171 LSAKTAAKSAARKTAVATVNARTAAKKAAAKSAAAKKSVAKTPAKPVAKKAPVAKQTATK 230
+KT K+ T N A + + A + + + +
Sbjct: 1046 QESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETAT 1105

Query: 231 QAAVKKAPLKKAVTKTALKKAAKVT 255
+KA ++ T+ K ++V+
Sbjct: 1106 VEKEEKAKVETEKTQEVPKVTSQVS 1130


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS02085    NUCEPIMERASE    40    7e-06    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 40.2 bits (94), Expect = 7e-06
Identities = 19/85 (22%), Positives = 26/85 (30%), Gaps = 21/85 (24%)

Query: 1 MHLLITGGTGFIGQALCPALLQAGYQV----------SVLTRDVRRAQRTLPGVTAVET- 49
M L+TG GFIG + LL+AG+QV V + R PG +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 50 ----------LDGVRADAVINLAGE 64
+ V
Sbjct: 61 LADREGMTDLFASGHFERVFISPHR 85


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS02135    HTHTETR    48    3e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 48.5 bits (115), Expect = 3e-09
Identities = 24/171 (14%), Positives = 68/171 (39%), Gaps = 6/171 (3%)

Query: 12 PPSRKPAISREDLIAATLSLIGPHRSLSTLSLREVAREAGIAPNSFYRQFRDMDELAVAL 71
++ +R+ ++ L L + +S+ SL E+A+ AG+ + Y F+D +L +
Sbjct: 4 KTKQEAQETRQHILDVALRLFS-QQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 72 IDLAGRSLRTIIGQARQRATSTDRSVIRVSVETFMEQLRADDK---LLHVLLREGAVGSD 128
+L+ ++ + + + + SV+R + +E +++ L+ ++ + +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 129 AFKLAVERELSYFED-ELRVDLIRLAAADNAKLHAPALVSKAITRLVFAMG 178
+ + E + ++ L A + +A + +
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKM-LPADLMTRRAAIIMRGYIS 172


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS02155    BCTLIPOCALIN    102    4e-30    Bacterial lipocalin signature.
		>BCTLIPOCALIN#Bacterial lipocalin signature.

Length = 171

Score = 102 bits (255), Expect = 4e-30
Identities = 56/160 (35%), Positives = 85/160 (53%), Gaps = 11/160 (6%)

Query: 15 ACMSHHPELATVPSLDLNRYLGTWYEIARLPIHFEDADCTDVSAHYTLEGDGSVRVQNRC 74
C+ + V +LN YLG WYE+ARL FE + V+A Y + DG + V NR
Sbjct: 15 GCLGMPESVKPVSDFELNNYLGKWYEVARLDHSFERG-LSQVTAEYRVRNDGGISVLNRG 73

Query: 75 LTAE-GELEEAIGQARAIDD-THSRLEVTFLPEGLRWIPFTKEHYWVMRID-PDYTAALV 131
+ E GE +EA G+A ++ T L+V+F PF Y V +D +Y+ A V
Sbjct: 74 YSEEKGEWKEAEGKAYFVNGSTDGYLKVSFFG------PFYGS-YVVFELDRENYSYAFV 126

Query: 132 GSPDRKYLWLLARLPQLDENVAQAYLAHAREQGFDLAPLI 171
P+ +YLWLL+R P ++ + ++ ++E+GFD LI
Sbjct: 127 SGPNTEYLWLLSRTPTVERGILDKFIEMSKERGFDTNRLI 166


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS02165    HTHFIS    34    0.001    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.4 bits (79), Expect = 0.001
Identities = 41/242 (16%), Positives = 65/242 (26%), Gaps = 49/242 (20%)

Query: 127 LAAAQAGRRLIVPLANGAEAAIAGHVEAFTARTLLEVCATLDGSQKAPAAELVVQALGAR 186
L + R + L A+ ++A + D ++ + R
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 187 ALPDMADVRGQP----------HARRALEIAAAGGHHLLLIGSPGCGKTLLASRLPGLLP 236
D + R L L++ G G GK L+A L
Sbjct: 126 PSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGK 185

Query: 237 EASEA-EALETAAITSTSGRGLDLARWRQRPYRAPHHTASAVALVG-------GGTLPRP 288
+ A+ AAI L G G
Sbjct: 186 RRNGPFVAINMAAIPRD---------------------LIESELFGHEKGAFTGAQTRST 224

Query: 289 GEISLAHNGVLFLDEL----PEWQRQTLEVLREPLESGLVTISRAARSVDFPARFQLVAA 344
G A G LFLDE+ + Q + L VL++ + + ++VAA
Sbjct: 225 GRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQG------EYTTVGGRTPIRSDVRIVAA 278

Query: 345 MN 346
N
Sbjct: 279 TN 280


7    AXO1947_RS02205    AXO1947_RS02245    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
AXO1947_RS02205213-0.540107phosphomannomutase/phosphoglucomutase
AXO1947_RS022107251.582725hypothetical protein
AXO1947_RS206007261.393015membrane protein
AXO1947_RS022156231.310402dolichol-phosphate mannosyltransferase
AXO1947_RS206055211.767787hypothetical protein
AXO1947_RS022205191.969134mitomycin resistance protein
AXO1947_RS022254172.453124chromosome partitioning protein ParB
AXO1947_RS02235-2131.916124chromosome partitioning protein
AXO1947_RS02240-3163.027131hypothetical protein
AXO1947_RS02245-2153.064595orotate phosphoribosyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
AXO1947_RS02295    NUCEPIMERASE    470    e-170    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 470 bits (1211), Expect = e-170
Identities = 168/333 (50%), Positives = 212/333 (63%), Gaps = 15/333 (4%)

Query: 1 MTILVTGAAGFIGAYTCRALAARGEAVVGLDNYNRYYDPQLKHDRVAALC-PGVDIRTLD 59
M LVTGAAGFIG + + L G VVG+DN N YYD LK R+ L PG +D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 60 LTDRDGLAALFDEIQPTRAVHLAAQAGVRYSLENPSAYVDSNLVGFVNMLELCRHRGVQH 119
L DR+G+ LF R + VRYSLENP AY DSNL GF+N+LE CRH +QH
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 120 LVYASSSSVYGDSATPPFSEDQRVDQPRSLYAATKAANELMGYTYAQLYGLRATGLRFFT 179
L+YASSSSVYG + PFS D VD P SLYAATK ANELM +TY+ LYGL ATGLRFFT
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 180 VYGPWGRPDMAPLIFSRAVLAGRPIEVFNHGKMQRDFTFVDDIVAGVLGALD-------- 231
VYGPWGRPDMA F++A+L G+ I+V+N+GKM+RDFT++DDI ++ D
Sbjct: 181 VYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQ 240

Query: 232 ------TPSSEPVPHRVFNLGNHTPVELEYFIDVIAQAAGRPAEKVYRPMQPGDMIRTMA 285
TP++ P+RV+N+GN +PVEL +I + A G A+K P+QPGD++ T A
Sbjct: 241 WTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLETSA 300

Query: 286 DTRRAQAAFGFDPATPVERGLPQVVNWCRQYFG 318
DT+ GF P T V+ G+ VNW R ++
Sbjct: 301 DTKALYEVIGFTPETTVKDGVKNFVNWYRDFYK 333


8  AXO1947_RS02420  AXO1947_RS02445  Y  N  N  Genomic Island
LocusTag         DNBias  CDNBias  %GCBias     Product
AXO1947_RS02420  2       15       0.197395    glutamyl-tRNA amidotransferase
AXO1947_RS02425  3       15       -0.113824   30S ribosomal protein S21
AXO1947_RS02430  2       12       0.859298    tRNA
AXO1947_RS02435  2       11       0.714382    dihydroneopterin aldolase
AXO1947_RS02440  2       12       0.521023    beta-glucosidase
AXO1947_RS02445  2       11       0.260668    glucose dehydrogenase
9  AXO1947_RS02660  AXO1947_RS02695  Y  N  N  Genomic Island
LocusTag         DNBias  CDNBias  %GCBias     Product
AXO1947_RS02660  2       16       0.448121    hypothetical protein
AXO1947_RS20700  2       13       0.222509    hypothetical protein
AXO1947_RS02665  2       13       0.512309    zinc/iron-chelating domain-containing protein
AXO1947_RS02670  2       14       0.492409    glutathione-dependent formaldehyde
AXO1947_RS02675  3       13       0.461590    hypothetical protein
AXO1947_RS02680  2       13       0.609386    hypothetical protein
AXO1947_RS02685  2       13       0.750182    ubiquinol oxidase subunit II
AXO1947_RS02690  4       19       1.077893    cytochrome ubiquinol oxidase subunit I
AXO1947_RS02695  3       20       0.842602    hypothetical protein
10  AXO1947_RS02910  AXO1947_RS02965  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag         DNBias  CDNBias  %GCBias     Product
AXO1947_RS02910  0       13       3.175955    hypothetical protein
AXO1947_RS02915  0       12       3.314941    AsnC family transcriptional regulator
AXO1947_RS02920  -1      12       3.451544    D-amino acid dehydrogenase small subunit
AXO1947_RS02925  -2      12       3.374199    alanine racemase
AXO1947_RS02930  -2      13       0.986823    membrane protein
AXO1947_RS02935  0       16       1.360847    hypothetical protein
AXO1947_RS02940  1       17       1.216425    hypothetical protein
AXO1947_RS02945  0       16       1.659257    hybrid sensor histidine kinase/response
AXO1947_RS02950  0       14       1.360733    hypothetical protein
AXO1947_RS02960  0       14       3.515043    sorbosone dehydrogenase
AXO1947_RS02965  0       15       3.007367    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS02985  ALARACEMASE   388    e-137    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 388 bits (999), Expect = e-137
Identities = 155/357 (43%), Positives = 208/357 (58%), Gaps = 6/357 (1%)

Query: 2 RPAQASIDLEALRHNYRLAKRLGG-SKALAVVKADAYGHGAVRCAQALEPEADGFAVACI 60
RP QAS+DL+AL+ N + ++ ++ +VVKA+AYGHG R A+ DGFA+ +
Sbjct: 3 RPIQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIG-ATDGFALLNL 61

Query: 61 EEALELRQAGIRAPILLLEGFFEHDELRLIAEHDLWTVAATPQQVRALAAFQSPRPLRVW 120
EEA+ LR+ G + PIL+LEGFF +L + +H L T + Q++AL + PL ++
Sbjct: 62 EEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDIY 121

Query: 121 LKMDSGMHRLGLSPEDFRAAWLRLRGLPQIASLVLMTHLARADELDCSRTDEQAVAFALT 180
LK++SGM+RLG P+ W +LR + + + LM+H A A+ D
Sbjct: 122 LKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEHPD--GISGAMARIEQA 179

Query: 181 AGGMRAETSLRNSPGLLGWPALRNDWSRPGLMLYGANPFPQ-DTENTAQLRPVMTLRSRI 239
A G+ SL NS L P DW RPG++LYGA+P Q LRPVMTL S I
Sbjct: 180 AEGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMTLSSEI 239

Query: 240 ISVRDLPVGEPVGYGARFVAERPTRVGVVAMGYADGYPQFAPNGTPVLVDGQVCPLIGRV 299
I V+ L GE VGYG R+ A R+G+VA GYADGYP+ AP GTPVLVDG +G V
Sbjct: 240 IGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTMTVGTV 299

Query: 300 SMDMLTVDLTDHPQADIGATVQLWGQAPRVGPLATQCNISAYQLLCGL-KRVPRTYV 355
SMDML VDLT PQA IG V+LWG+ ++ +A Y+L+C L RVP V
Sbjct: 300 SMDMLAVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRVPVVTV 356


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03005  HTHFIS        70     1e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.9 bits (171), Expect = 1e-14
Identities = 26/121 (21%), Positives = 52/121 (42%), Gaps = 7/121 (5%)

Query: 506 RLLAVEDQPDMLDYLRRLLEEQGAEVVTAGSATDALALIDHRGHARFDLMLTDIGMPGMD 565
+L +D + L + L G +V +A I DL++TD+ MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA---AGDGDLVVTDVVMPDEN 61

Query: 566 GYGLIRTVRENLGLDATALPAVAVTALAREDDRKRALESGFQEHLAKPYSVAQLVTAVRA 625
+ L+ +++ LP + ++A +A E G ++L KP+ + +L+ +
Sbjct: 62 AFDLLPRIKK----ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 626 A 626
A
Sbjct: 118 A 118


11  AXO1947_RS03105  AXO1947_RS03180  Y  N  N  Genomic Island
LocusTag         DNBias  CDNBias  %GCBias     Product
AXO1947_RS03105  2       22       1.661731    ATP synthase subunit B
AXO1947_RS03115  1       20       -0.128975   F0F1 ATP synthase subunit delta
AXO1947_RS03120  1       21       -0.629976   ATP synthase subunit alpha
AXO1947_RS03125  2       18       -2.281201   F0F1 ATP synthase subunit gamma
AXO1947_RS03130  2       19       -2.064006   ATP synthase subunit beta
AXO1947_RS03135  2       23       -1.269439   ATP synthase epsilon chain
AXO1947_RS03140  2       24       -1.604940   chorismate mutase
AXO1947_RS03145  4       29       -1.101976   hypothetical protein
AXO1947_RS03150  4       28       -1.157294   membrane protein
AXO1947_RS03155  2       25       -0.549843   UDP-N-acetylglucosamine
AXO1947_RS03160  2       22       -1.031790   hypothetical protein
AXO1947_RS03165  -1      16       -1.132950   glutamine--fructose-6-phosphate
AXO1947_RS03170  -1      14       0.277282    hypothetical protein
AXO1947_RS03175  2       15       0.329077    lactoylglutathione lyase
AXO1947_RS03180  2       17       0.082471    copper resistance protein B
12  AXO1947_RS03340  AXO1947_RS03470  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag         DNBias  CDNBias  %GCBias     Product
AXO1947_RS03340  2       16       -0.377500   ABC transporter ATP-binding protein
AXO1947_RS03345  2       12       -1.985086   sugar ABC transporter permease
AXO1947_RS03350  1       16       -3.519358   electron transfer flavoprotein subunit alpha
AXO1947_RS03355  2       23       -5.915127   EtfB protein
AXO1947_RS03360  1       25       -6.269730   dTDP-glucose 4,6-dehydratase
AXO1947_RS03365  2       44       -9.345840   glucose-1-phosphate thymidylyltransferase
AXO1947_RS20800  5       61       -13.366535  dTDP-4-dehydrorhamnose 3,5-epimerase
AXO1947_RS03370  4       55       -11.568815  NAD(P)-dependent oxidoreductase
AXO1947_RS03375  4       47       -10.250651  mannose-1-phosphate
AXO1947_RS20805  2       38       -7.729317   phosphomannomutase/phosphoglucomutase
AXO1947_RS20810  2       36       -6.729549   succinyl-CoA--3-ketoacid-CoA transferase
AXO1947_RS03380  4       34       -6.617803   succinyl-CoA--3-ketoacid-CoA transferase
AXO1947_RS03390  3       30       -5.684674   polysaccharide biosynthesis protein
AXO1947_RS03395  3       26       -5.383577   electron transfer flavoprotein-ubiquinone
AXO1947_RS03400  2       24       -5.286112   DNA methylase
AXO1947_RS03405  3       24       -6.452790   ABC transporter
AXO1947_RS03410  3       25       -7.084605   ABC transporter substrate-binding protein
AXO1947_RS03415  2       19       -5.089864   ABC transporter ATP-binding protein
AXO1947_RS03420  0       16       -2.488596   ABC transporter permease
AXO1947_RS03425  0       15       -2.844689   membrane protein
AXO1947_RS03430  -1      14       -2.796651   DNA-binding protein
AXO1947_RS03435  -1      15       -2.446579   proline--tRNA ligase
AXO1947_RS03440  -2      13       -1.585193   DUF4124 domain-containing protein
AXO1947_RS03445  -1      16       -3.388292   CDP-diacylglycerol--serine
AXO1947_RS03450  0       12       -3.114858   alanine acetyltransferase
AXO1947_RS03455  0       12       -3.065201   ribosomal-protein-alanine N-acetyltransferase
AXO1947_RS03460  0       17       -3.817172   IS30 family transposase
AXO1947_RS03465  0       21       -4.209518   IS30 family transposase
AXO1947_RS03470  1       21       -3.893776   IS5/IS1182 family transposase
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03410  INTIMIN       29     0.045    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 28.9 bits (64), Expect = 0.045
Identities = 15/43 (34%), Positives = 25/43 (58%), Gaps = 1/43 (2%)

Query: 305 GDPLVEFNSSEDIRLTVVARLNKSLSYPAVGFMLKDRKGQYIL 347
G+ + + + S+DI L+ + LNK L Y + M+K GQ I+
Sbjct: 70 GETVADLSKSQDINLSTIWSLNKHL-YSSESEMMKAEPGQQII 111


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03415  ABC2TRNSPORT  37     5e-05    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 36.8 bits (85), Expect = 5e-05
Identities = 24/105 (22%), Positives = 45/105 (42%), Gaps = 6/105 (5%)

Query: 158 LAILPVVLF----MMGLAWLLSALGVFLRDTAQITAIITTAIMFLTPIFYPIDAIPPTFR 213
L LPV+ L +++AL ++ T I+FL+ +P+D +P F+
Sbjct: 148 LYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQ 207

Query: 214 PILNINPLAPVIAQIRNVLIWGHG--LKPLEYSLCLVISALVFIA 256
PL+ I IR +++ + +LC+ I F++
Sbjct: 208 TAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLS 252


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03430  NUCEPIMERASE  191    2e-60    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 191 bits (486), Expect = 2e-60
Identities = 88/346 (25%), Positives = 135/346 (39%), Gaps = 42/346 (12%)

Query: 5 LVTGGAGFIGGNFVLEAVARGIRVVNLDALT--YAGNLNTL-ASLEGNPDHVFVKGDIGD 61
LVTG AGFIG + + G +VV +D L Y +L L P F K D+ D
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLAD 63

Query: 62 GMLVARLLQEHQPDAVLNFAAESHVDRSIEGPGAFIHTNVVGTLALLEAVRDYWKSLPTA 121
+ L + V V S+E P A+ +N+ G L +LE R
Sbjct: 64 REGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN------- 116

Query: 122 RSDAFRFLHVSTDEVYGTLGETGKFTETTPYA-PNSPYSASKAASDHLVRAFHHTYGLPV 180
L+ S+ VYG L F+ P S Y+A+K A++ + + H YGLP
Sbjct: 117 --KIQHLLYASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPA 173

Query: 181 LTTNCSNNYGPYHFPEKLIPLVIAKALAGEPLPVYGDGKQVRDWLFVSDHCEAIRTVL-- 238
YGP+ P+ + L G+ + VY GK RD+ ++ D EAI +
Sbjct: 174 TGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDV 233

Query: 239 ----------------AKGKVGETYNVGGNSERQNIEVVQAICALLDQHRPRDDGKPRAS 282
A YN+G +S + ++ +QA+ L +
Sbjct: 234 IPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG----------IEA 283

Query: 283 QITYVTDRPGHDRRYAIDASKLKNELGWEPSYTFEQGIAQTVQWYL 328
+ + +PG + D L +G+ P T + G+ V WY
Sbjct: 284 KKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYR 329


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03445  NUCEPIMERASE  41     4e-06    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 40.9 bits (96), Expect = 4e-06
Identities = 36/159 (22%), Positives = 55/159 (34%), Gaps = 17/159 (10%)

Query: 1 MTTLVFGANGQVGTELLRALAVDG----AVQATT----------RSGQLP-DGSACETAD 45
M LV GA G +G + + L G + R L G D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 46 FDAPETLTALLDRIKPSRVVNAAAYTAVDRAEQDRERATRANATAPGVIAAWCASNRVP- 104
E +T L RV + AV + ++ +N T I C N++
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 105 LVHYSTDYVFDGQGTAPYREDAQTS-PLGVYGETKLAGE 142
L++ S+ V+ P+ D P+ +Y TK A E
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANE 159


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03495  PYOCINKILLER  32     0.002    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 32.5 bits (73), Expect = 0.002
Identities = 28/131 (21%), Positives = 44/131 (33%), Gaps = 5/131 (3%)

Query: 15 IVAGLALLLFGLWAAKYSSDRTWQQYRVVFREAVTGLSVGSPVQYNGIAVGSIT-----E 69
+ G A L + A+ D+T R L + V N +A S T
Sbjct: 299 MAVGFASLTYSSRTAEQWQDQTPDSVRYALGMDAAKLGLPPSVNLNAVAKASGTVDLPMR 358

Query: 70 LTLAPNDPRQVVAHVRVNSTTPIKSDTRAKLAITSLTGPSIIQLSGGTPEAPALTTIDKS 129
LT ++ V + + K+ A + TG + + T EAP L
Sbjct: 359 LTNEARGNTTTLSVVSTDGVSVPKAVPVRMAAYNATTGLYEVTVPSTTAEAPPLILTWTP 418

Query: 130 DAPIIQTTPSA 140
+P PS+
Sbjct: 419 ASPPGNQNPSS 429


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03540  SACTRNSFRASE  37     1e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.8 bits (85), Expect = 1e-05
Identities = 15/59 (25%), Positives = 25/59 (42%)

Query: 67 DEAHVLNVCIAPEAQSQGHGRVLLRALIKGACDRGARRAFLEVRPSNPSAIALYHSEGF 125
A + ++ +A + + +G G LL I+ A + LE + N SA Y F
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03570  PF05043       35     7e-04    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 34.9 bits (80), Expect = 7e-04
Identities = 20/85 (23%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


13  AXO1947_RS20825  AXO1947_RS03705  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag         DNBias  CDNBias  %GCBias     Product
AXO1947_RS20825  -2      20       -3.996311   hypothetical protein
AXO1947_RS03555  -2      19       -3.751341   phosphoribosylformylglycinamidine synthase
AXO1947_RS03560  -2      11       -1.652066   hypothetical protein
AXO1947_RS03565  -2      12       -0.862341   protease
AXO1947_RS20830  -2      8        -0.775089   type II secretion system protein GspE
AXO1947_RS03570  -2      10       -1.782098   hypothetical protein
AXO1947_RS20835  0       20       -4.644431   general secretion pathway protein GspF
AXO1947_RS03575  1       18       -4.140760   type II secretion system protein GspG
AXO1947_RS20840  1       22       -5.610551   type II secretion system protein GspH
AXO1947_RS03580  1       19       -3.916620   general secretion pathway protein GspI
AXO1947_RS20845  3       21       -3.895655   general secretion pathway protein GspJ
AXO1947_RS20850  0       15       -1.988801   general secretion pathway protein GspK
AXO1947_RS03590  0       17       1.792394    general secretion pathway protein GspL
AXO1947_RS03595  1       14       1.691945    general secretion pathway protein GspM
AXO1947_RS03600  1       12       1.982916    hypothetical protein
AXO1947_RS03605  3       8        1.281570    type II secretion system protein GspD
AXO1947_RS03610  2       10       1.372410    hypothetical protein
AXO1947_RS20855  -1      10       2.795675    GntR family transcriptional regulator
AXO1947_RS03615  -2      11       3.070339    aminotransferase
AXO1947_RS03620  -1      11       3.123683    TonB-dependent siderophore receptor
AXO1947_RS03625  0       13       2.880721    aminoglycoside phosphotransferase
AXO1947_RS20860  -1      13       2.303643    glycosyl transferase
AXO1947_RS03630  -1      13       2.268308    hypothetical protein
AXO1947_RS03635  0       16       1.621784    glycosyl transferase
AXO1947_RS03645  1       15       -0.508779   hypothetical protein
AXO1947_RS03650  1       14       0.519508    hypothetical protein
AXO1947_RS03655  2       16       1.206734    nicotinate phosphoribosyltransferase
AXO1947_RS03665  3       17       1.804215    serine protease
AXO1947_RS03675  -1      14       1.480854    hypothetical protein
AXO1947_RS03685  -1      13       1.251496    arsenate reductase (glutaredoxin)
AXO1947_RS03690  0       12       1.279126    IS5/IS1182 family transposase
AXO1947_RS03695  1       11       1.182765    peptide deformylase
AXO1947_RS03705  2       10       1.634111    cellulase
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03630  OMADHESIN     32     0.017    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 32.2 bits (72), Expect = 0.017
Identities = 46/187 (24%), Positives = 71/187 (37%), Gaps = 27/187 (14%)

Query: 634 PKMHRDAVHPAAPQWPVLQTASLDLQQAGLRVLA--HPTVASKSFLVTIGDRSVGGLTAR 691
P + +P P PV L+ G+ +A A+K V +G S+
Sbjct: 45 PALG--LEYPVRP--PVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIA-TGVN 99

Query: 692 EQMIGPWQLPLADCAITMAGFDTFEGEAMSIGERTPLALLNAAASARMAVGEAITNLCAA 751
IGP L D A+T T + + ++IG R + + +AVG
Sbjct: 100 SVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARA------STSDTGVAVG--------- 144

Query: 752 PVQRLDSIKLSANWMAAAGHAGEDALLYDAVRAVGMELCPALELSVPVGKDSLSMQAQWV 811
+S K A A GH+ A + A+G E SV +G +SL+ Q +
Sbjct: 145 ----FNS-KADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHL 199

Query: 812 EAGIGDS 818
AG D+
Sbjct: 200 AAGTKDT 206


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03635  OMADHESIN     61     2e-11    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 61.5 bits (148), Expect = 2e-11
Identities = 68/225 (30%), Positives = 109/225 (48%), Gaps = 14/225 (6%)

Query: 957 AMGVDSVARRDSDTAIGTESVADGGYSTALGANAQASYDSSTALGANAMAEDYYSVALGT 1016
A+G++ R A G + A G +S A+GA A+A+ ++ A+GA ++A SVA+G
Sbjct: 46 ALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGP 105

Query: 1017 YALATGTSAISLGGQSYA------------PGTESVALGWQSNASGTRSIGLGSGAVASA 1064
+ A G SA++ G S A VA+G+ S A S+ +G + +A
Sbjct: 106 LSKALGDSAVTYGAASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAA 165

Query: 1065 DN--SVALGAGSIADRANAVSVGAADNARQIANVAAGTEGTDAVNLNQLNAVAETAQTTG 1122
++ S+A+G S DR N+VS+G RQ+ ++AAGT+ TDAVN+ QL E Q
Sbjct: 166 NHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENT 225

Query: 1123 KYFKASGSPDNDAGAYVEGENALAAGEGANAAGTGTTALGAGAQA 1167
A + +A A + + L + + T A +A
Sbjct: 226 NKRSAELLANANAYADNKSSSVLGIANNYTDSKSAETLENARKEA 270



Score = 53.0 bits (126), Expect = 8e-09
Identities = 65/201 (32%), Positives = 93/201 (46%), Gaps = 22/201 (10%)

Query: 72 GRGASAPASKATAIGANSHASATGAVATGADSSASGVNSSAIGRQTNAIGENALAIGYNS 131
G ASA + AIGA + A+ AVA GA S A+GVNS AIG + A+G++A+ G S
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 132 FVRQSG----------ENGVALGANAGVSGANSVALGAGSRTYEDDVVSIGSGNGRGG-- 179
++ G + GVA+G N+ NSVA+G S + SI G+
Sbjct: 122 TAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDR 181

Query: 180 ---------PATRRITNVTAGVNATDAVNVAQL-RDVADVAENTAQFFKASPAEDSVGAY 229
R++T++ AG TDAVNVAQL +++ ENT + A + A
Sbjct: 182 ENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYAD 241

Query: 230 VEGDSALAAGEGANAVGTATT 250
+ S L +A T
Sbjct: 242 NKSSSVLGIANNYTDSKSAET 262



Score = 51.1 bits (121), Expect = 4e-08
Identities = 56/168 (33%), Positives = 82/168 (48%), Gaps = 5/168 (2%)

Query: 1775 SITPAATSTAVGTAAVANHVTGTAIGGSAYAHGPNDTAIGSNARVNADGSTAVGANTQIA 1834
SI AT+ A AAVA A G ++ A GP A+G +A STA I
Sbjct: 72 SIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIG 131

Query: 1835 AVATNA---VAMGEGAQVTAASGTAIGQGARATAQG--AVALGQGSVADRANTVSLGSVG 1889
A A+ + VA+G ++ A + AIG + A ++A+G S DR N+VS+G
Sbjct: 132 ARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHES 191

Query: 1890 GERQVANVAAGTRATDAVNKGQLDNGVAAANSYTDSRYNAMADSFESY 1937
RQ+ ++AAGT+ TDAVN QL + T+ R + + +Y
Sbjct: 192 LNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAY 239



Score = 49.1 bits (116), Expect = 1e-07
Identities = 58/165 (35%), Positives = 81/165 (49%), Gaps = 26/165 (15%)

Query: 371 GTQTRASGISSTAVGGPVVLIPGLGLFVQTQASGEASTALGAGAIASGAYATAVGTLSEA 430
G A GI S A+G +A+ A+ A+GAG+IA+G + A+G LS+A
Sbjct: 62 GLNASAKGIHSIAIGA------------TAEAAKGAAVAVGAGSIATGVNSVAIGPLSKA 109

Query: 431 SGTEATAVGYFAYAPGEG------------ATAVGPESSAIGELSTALGYFS--TARGAN 476
G A G + A +G AVG S A + S A+G+ S A
Sbjct: 110 LGDSAVTYGAASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGY 169

Query: 477 SVALGANSVATRANTVSVGAAGTERQITNVAAATDGTDAVNLDQL 521
S+A+G S R N+VS+G RQ+T++AA T TDAVN+ QL
Sbjct: 170 SIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQL 214



Score = 49.1 bits (116), Expect = 1e-07
Identities = 51/144 (35%), Positives = 76/144 (52%), Gaps = 3/144 (2%)

Query: 834 GANAAAADTGSIAVGTYANAYGPRAISLGGQSRATGDESIALGWEAQAESDQSIALGASS 893
G NA+A SIA+G A A A+++G S ATG S+A+G ++A D ++ GA+S
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 894 QAAAFSTAIGGYARASGAGATAVGNNSSAVDDRATALGSDS--MASGYFSTAVGSSSVAS 951
A AIG A S G AVG NS A + A+G S A+ +S A+G S
Sbjct: 122 TAQKDGVAIGARASTSDTG-VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTD 180

Query: 952 GRGATAMGVDSVARRDSDTAIGTE 975
+ ++G +S+ R+ + A GT+
Sbjct: 181 RENSVSIGHESLNRQLTHLAAGTK 204



Score = 47.6 bits (112), Expect = 4e-07
Identities = 69/250 (27%), Positives = 108/250 (43%), Gaps = 23/250 (9%)

Query: 1322 GLIPARASGTGAAAFGAGAWATADYTTAIGRDSYADSVNATALGQSAAALADNTLALGGG 1381
G + A A G + A GA A A A+G S A VN+ A+G + AL D+ + G
Sbjct: 61 GGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAA 120

Query: 1382 SRANAVGASVIGVNASATGINSTGVGRQVNVIGENAVSVGYNSFVRQSAVNGVALGANAG 1441
S A G + IG AS + + V+VG+NS + ++
Sbjct: 121 STAQKDGVA-IGARASTS---------------DTGVAVGFNSKADAKNSVAIGHSSHVA 164

Query: 1442 ATGADSVALGSGSRTYEADTVSIGSGNGRGGPATRRIVNVSDGQAATDAVNKGQLDAVSA 1501
A S+A+G S+T ++VSIG + R++ +++ G TDAVN QL
Sbjct: 165 ANHGYSIAIGDRSKTDRENSVSIGHES-----LNRQLTHLAAGTKDTDAVNVAQLKKEIE 219

Query: 1502 DVQKTASKFKATGDAVATATGDRSTAAGSGAAA--TGARSVAIASGSRALATGASAMGVD 1559
Q+ +K A A A A D +++ G A T ++S +R A S ++
Sbjct: 220 KTQENTNKRSAELLANANAYADNKSSSVLGIANNYTDSKSAETLENARKEAFAQSKDVLN 279

Query: 1560 SSASGVNSTA 1569
+ + NS A
Sbjct: 280 MAKAHSNSVA 289



Score = 44.9 bits (105), Expect = 3e-06
Identities = 44/132 (33%), Positives = 69/132 (52%), Gaps = 4/132 (3%)

Query: 764 GADSNASGYFSTAVGGTSIANGRGATAIGYESIGNGTASTALGFAGVAWGDGGTAIGTES 823
G +++A G S A+G T+ A A A+G SI G S A+G A GD G S
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 824 LAYGDNSTAVGANAAAADTGSIAVGTYANAYGPRAISLGGQSRATGDE--SIALGWEAQA 881
A D A+GA A+ +DTG +AVG + A ++++G S + SIA+G ++
Sbjct: 122 TAQKD-GVAIGARASTSDTG-VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKT 179

Query: 882 ESDQSIALGASS 893
+ + S+++G S
Sbjct: 180 DRENSVSIGHES 191



Score = 42.6 bits (99), Expect = 1e-05
Identities = 40/148 (27%), Positives = 71/148 (47%)

Query: 1131 PDNDAGAYVEGENALAAGEGANAAGTGTTALGAGAQAVVDNATAVGVGALASGIGAAALG 1190
P V A G A+A G + A+GA A+A A AVG G++A+G+ + A+G
Sbjct: 45 PALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIG 104

Query: 1191 NTAQALGENSSAVGSNAVASDIGATANGAGAQALSTYTTALGSKAVASDNQAIAAGFRST 1250
++ALG+++ G+ + A G + + + SKA A ++ AI
Sbjct: 105 PLSKALGDSAVTYGAASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVA 164

Query: 1251 ASNIGSAAFGGYSESSGRLSSALGYSAV 1278
A++ S A G S++ S ++G+ ++
Sbjct: 165 ANHGYSIAIGDRSKTDRENSVSIGHESL 192



Score = 41.0 bits (95), Expect = 5e-05
Identities = 47/149 (31%), Positives = 72/149 (48%), Gaps = 11/149 (7%)

Query: 552 AAGSNALADSDYSTALGSSSAASAQGATAVGSGANATTDNATAVGFNSTAIAQNTTALGG 611
A G NA A +S A+G+++ A+ A AVG+G+ AT N+ A+G S A+ + G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 612 KSSASGDGSTAVGGASQATASGATALGYESIANGADATALGVG---------SVAFGNTS 662
S+A DG GA +T+ A+G+ S A+ ++ A+G S+A G+ S
Sbjct: 120 ASTAQKDGVAI--GARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRS 177

Query: 663 TAVGGASVAFGADSAAFGANAAAGGTAST 691
SV+ G +S A GT T
Sbjct: 178 KTDRENSVSIGHESLNRQLTHLAAGTKDT 206



Score = 41.0 bits (95), Expect = 5e-05
Identities = 40/114 (35%), Positives = 57/114 (50%), Gaps = 2/114 (1%)

Query: 1516 AVATATGDRSTAAGSGAAATGARSVAIASGSRALATGASAMGVDSSASGVNSTAMGRQTN 1575
A A A + A G+G+ ATG SVAI S+AL A G S+A R +
Sbjct: 77 ATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARAST 136

Query: 1576 SIGENGVALGYNSFVRQSGSNAVALGAKAGASGADSVALGSGSRTYDANTVSVG 1629
S + GVA+G+NS S A+ + A+ S+A+G S+T N+VS+G
Sbjct: 137 S--DTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIG 188



Score = 40.7 bits (94), Expect = 6e-05
Identities = 38/103 (36%), Positives = 59/103 (57%), Gaps = 4/103 (3%)

Query: 1527 AAGSGAAATGARSVAIASGSRALATGASAMGVDSSASGVNSTAMGRQTNSIGENGVALGY 1586
A G A+A G S+AI + + A A A+G S A+GVNS A+G + ++G++ V G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 1587 NSFVRQSGSNAVALGAKAGASGADSVALGSGSRTYDANTVSVG 1629
S ++ G VA+GA+A S VA+G S+ N+V++G
Sbjct: 120 ASTAQKDG---VAIGARASTSDT-GVAVGFNSKADAKNSVAIG 158



Score = 37.6 bits (86), Expect = 5e-04
Identities = 46/144 (31%), Positives = 68/144 (47%), Gaps = 14/144 (9%)

Query: 677 AAFGANAAAGGTASTAIGANSSAFGERTVALGGASNASGEDSIALGASSQASALGTTAVG 736
A G NA+A G S AIGA + A VA+G S A+G +S+A+G S+A G
Sbjct: 59 GAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYG 118

Query: 737 SNANA------------SIANATAVGFNSSAGDDYATALGADSN--ASGYFSTAVGGTSI 782
+ + A + AVGFNS A + A+G S+ A+ +S A+G S
Sbjct: 119 AASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSK 178

Query: 783 ANGRGATAIGYESIGNGTASTALG 806
+ + +IG+ES+ A G
Sbjct: 179 TDRENSVSIGHESLNRQLTHLAAG 202



Score = 36.0 bits (82), Expect = 0.001
Identities = 44/187 (23%), Positives = 76/187 (40%)

Query: 557 ALADSDYSTALGSSSAASAQGATAVGSGANATTDNATAVGFNSTAIAQNTTALGGKSSAS 616
A AD ++ S A+G A G N++A ++ A+G + A+
Sbjct: 23 AFADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAA 82

Query: 617 GDGSTAVGGASQATASGATALGYESIANGADATALGVGSVAFGNTSTAVGGASVAFGADS 676
+ AVG S AT + A+G S A G A G S A + AS + +
Sbjct: 83 KGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARASTSDTGVA 142

Query: 677 AAFGANAAAGGTASTAIGANSSAFGERTVALGGASNASGEDSIALGASSQASALGTTAVG 736
F + A A + + ++ +A ++A+G S E+S+++G S L A G
Sbjct: 143 VGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAG 202

Query: 737 SNANASI 743
+ ++
Sbjct: 203 TKDTDAV 209



Score = 35.3 bits (80), Expect = 0.002
Identities = 47/160 (29%), Positives = 77/160 (48%), Gaps = 4/160 (2%)

Query: 1174 AVGVGALASGIGAAALGNTAQALGENSSAVGSNAVASDIGATANGAGAQALSTYTTALGS 1233
A+G+ A G A A G +S A+G+ A A+ A A GAG+ A + A+G
Sbjct: 46 ALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGP 105

Query: 1234 KAVASDNQAIAAGFRSTASNIGSAAFGGYSESSGRLSSALGYSAVASSDYSTAVGAVA-- 1291
+ A + A+ G STA G A G S+ A+G+++ A + S A+G +
Sbjct: 106 LSKALGDSAVTYGAASTAQKDGVAI--GARASTSDTGVAVGFNSKADAKNSVAIGHSSHV 163

Query: 1292 LASGASAVAVGQFSKATGDESVAVGGSAFFGLIPARASGT 1331
A+ ++A+G SK + SV++G + + A+GT
Sbjct: 164 AANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGT 203



Score = 33.3 bits (75), Expect = 0.010
Identities = 49/167 (29%), Positives = 70/167 (41%), Gaps = 10/167 (5%)

Query: 393 GLGLFVQTQASGEASTALGAGAIASGAYATAVGTLSEASGTEATAVGYFAYAPGEGATAV 452
G+ Q S A ALG A G + A G + A+G A A A AV
Sbjct: 30 GIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAV 89

Query: 453 GPESSAIGELSTALGYFSTARGANSVALGANSVATRANTVSVGAAGTERQITNVAAATDG 512
G S A G S A+G S A G ++V GA S A + + V++GA A+ +D
Sbjct: 90 GAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQK-DGVAIGAR---------ASTSDT 139

Query: 513 TDAVNLDQLTAVSDVASTTARSFVASGDGVAIAQGVDSVAAGSNALA 559
AV + + + S VA+ G +IA G S N+++
Sbjct: 140 GVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVS 186



Score = 32.2 bits (72), Expect = 0.027
Identities = 37/113 (32%), Positives = 56/113 (49%), Gaps = 8/113 (7%)

Query: 237 AAGEGANAVGTATTALGTGANAVAENATAVGADALASGQDSAAFGHNAQANGPASVAVGG 296
A G A+A G + A+G A A A AVGA ++A+G +S A GP S A+G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAI-------GPLSKALGD 112

Query: 297 AAVNEDGEPLITNGGVPVTTGATSAGVGATAVGASAKADGFAASSFGVGAYAA 349
+AV GV + A+++ G AVG ++KAD + + G ++ A
Sbjct: 113 SAVTYGAASTAQKDGVAIGARASTSDTG-VAVGFNSKADAKNSVAIGHSSHVA 164


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03640  SUBTILISIN    199    8e-61    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 199 bits (508), Expect = 8e-61
Identities = 100/359 (27%), Positives = 147/359 (40%), Gaps = 69/359 (19%)

Query: 156 PQLVPNDPLYAQYQWHLSNRNGGINAPGAWDLSQGAGVVVAVLDTGILPSHPDFAGNILQ 215
Q++ + + + I AP W+ ++G GV VAVLDTG HPD I+
Sbjct: 10 YQVIKQEQQVNEIPRGV----EMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIG 65

Query: 216 GYDFITDAEVSRRPTDARVPGALDYGDWQEADNVCYVGSTAQASTWHGTHVSSTVAEATN 275
G +F D E HGTHV+ T+A AT
Sbjct: 66 GRNFTDDDEGDPEIFKD--------------------------YNGHGTHVAGTIA-ATE 98

Query: 276 NGVGMAGVAPKATILPVRVVGRCG-GYTSDIVDAIVWASGGTVEGVPANTNPAEVINISL 334
N G+ GVAP+A +L ++V+ + G G I+ I +A ++I++SL
Sbjct: 99 NENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYA----------IEQKVDIISMSL 148

Query: 335 GGGGPCDSATQLAINGAVSRGTTVVVAAGNGGGDAAN----HSPAGCNNTITVGATRITG 390
GG A+ AV+ V+ AAGN G P N I+VGA
Sbjct: 149 GGPEDVP-ELHEAVKKAVASQILVMCAAGNEGDGDDRTDELGYPGCYNEVISVGAINFDR 207

Query: 391 GITYYSNYGSKVDLSGPGGGGSVDGNPGGYIWQAGYTGATTPTSGRYAYMGLGGTSMASP 450
+ +SN ++VDL PG I +T G+YA GTSMA+P
Sbjct: 208 HASEFSNSNNEVDLVAPGED----------IL-------STVPGGKYATFS--GTSMATP 248

Query: 451 HVAGVVALVQSAAIGLGKGPLTPAAVKALLKKTSRRFPVTPPASTPIGSGIVDAKAALK 509
HVAG +AL++ A + LT + A L K + +P G+G++ A +
Sbjct: 249 HVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNSPKME---GNGLLYLTAVEE 304


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03650  BCTERIALGSPF  433    e-153    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 433 bits (1114), Expect = e-153
Identities = 134/411 (32%), Positives = 212/411 (51%), Gaps = 12/411 (2%)

Query: 1 MPLYRYKALDAHGEMLDGQMEAASDAEVALRLQEQGHLPV---ETRLATGENDSPSLRML 57
M Y Y+ALDA G+ G EA S + L+E+G +P+ E R ++ S L L
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLS-L 59

Query: 58 LRKKPFDNAALVQFTQQLATLIGAGQPLDRALSILMDLPEDEKSRRVIGDVRDTVRGGAP 117
RK + L T+QLATL+ A PL+ AL + E +++ VR V G
Sbjct: 60 RRKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHS 119

Query: 118 LSSALERQHGLFSKLYINMVRAGEAGGSMQDTLQRLADYLERSRALRGKVINALIYPAIL 177
L+ A++ G F +LY MV AGE G + L RLADY E+ + +R ++ A+IYP +L
Sbjct: 120 LADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVL 179

Query: 178 LAVVGCALLFLLGYVVPQFAQMYESLDVALPWFTQAVLSVGLLVRDW--WLVLIVVPGVL 235
V + LL VVP+ + + + ALP T+ ++ + VR + W++L ++ G +
Sbjct: 180 TVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFM 239

Query: 236 G--LWLDRKRRNAAFRASLDEWLLRQKVVGSLIARLETARLTRTLGTLLRNGVPLLAAIG 293
+ L +++R +F LL ++G + L TAR RTL L + VPLL A+
Sbjct: 240 AFRVMLRQEKRRVSF----HRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMR 295

Query: 294 IARNVMSNLALVEDVANAADDVKNGHGLSMSLARGKRFPRLALQMIQVGEESGALDTMLL 353
I+ +VMSN ++ A D V+ G L +L + FP + MI GE SG LD+ML
Sbjct: 296 ISGDVMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLE 355

Query: 354 KTADTFELETAQAIDRALAALVPFITLVLASVVGLVIISVLVPLYDLTNAI 404
+ AD + E + + AL P + + +A+VV +++++L P+ L +
Sbjct: 356 RAADNQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03655  BCTERIALGSPG  136    3e-44    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 136 bits (343), Expect = 3e-44
Identities = 40/132 (30%), Positives = 60/132 (45%), Gaps = 18/132 (13%)

Query: 15 QAGMSLLEIIIVIVLIGAVLTLVGSRVLGGADRGKANLAKSQIQTLAGKIENFQLDTGKL 74
Q G +LLEI++VIV+IG + +LV ++G ++ A S I L ++ ++LD
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHY 66

Query: 75 PSKLDDLVTQPGDSSGWLGPYAKPAELN------------DPWGHAIEYRVPGDGQPFDL 122
P+ T G S P P N DPWG+ PG+ +DL
Sbjct: 67 PT------TNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120

Query: 123 MSLGKDGKPGGS 134
+S G DG+ G
Sbjct: 121 LSAGPDGEMGTE 132


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03660  BCTERIALGSPH  30     0.002    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 30.3 bits (68), Expect = 0.002
Identities = 21/74 (28%), Positives = 37/74 (50%), Gaps = 3/74 (4%)

Query: 21 RTRGTSLLEMLLVIALIAMAGVLAAAALNGGIDGMRLRTAGKAIASQLRYTRTQAIATGT 80
R RG +LLEM+L++ L+ ++ + A D +T + A QLR+ + + + TG
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEA-QLRFVQQRGLQTGQ 60

Query: 81 PQRFLIDPQQRRWE 94
+ P RW+
Sbjct: 61 FFGVSVHPD--RWQ 72


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03665  BCTERIALGSPG  34     5e-05    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 34.1 bits (78), Expect = 5e-05
Identities = 14/45 (31%), Positives = 26/45 (57%), Gaps = 4/45 (8%)

Query: 1 MKRQRGYTLIEVIVAFALLALALSL----LLGSLSGAARQVRAAD 41
+QRG+TL+E++V ++ + SL L+G+ A +Q +D
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSD 48


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03690  PERTACTIN     34     6e-04    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 33.9 bits (77), Expect = 6e-04
Identities = 19/52 (36%), Positives = 23/52 (44%)

Query: 192 AVPPPQQQPQPQPQPQPQSVPPAQQPGGQAPPTVQPQRSDGAQEAPRPSDDQ 243
A PP +P PQP PQP PP Q P QP + AP+P +
Sbjct: 565 AKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGR 616



Score = 31.2 bits (70), Expect = 0.004
Identities = 21/58 (36%), Positives = 22/58 (37%)

Query: 160 NGHGGQPPTANAAARGAGTATAPVPSPDAAAVAVPPPQQQPQPQPQPQPQSVPPAQQP 217
NG+G A A P P P P P Q PQP PQ Q PA QP
Sbjct: 555 NGNGQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQP 612



Score = 30.1 bits (67), Expect = 0.011
Identities = 18/48 (37%), Positives = 20/48 (41%)

Query: 200 PQPQPQPQPQSVPPAQQPGGQAPPTVQPQRSDGAQEAPRPSDDQMRAI 247
P+P PQP PQ P QP P PQ EAP P R +
Sbjct: 571 PKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGREL 618



Score = 29.7 bits (66), Expect = 0.014
Identities = 23/62 (37%), Positives = 26/62 (41%), Gaps = 1/62 (1%)

Query: 175 GAGTATAPVPSPDAAAVAVPPPQQQPQPQPQPQPQSVPPAQQPGGQAPPTVQPQRSDGAQ 234
GA AP P+P P P Q PQP PQP PP +QP AP + A
Sbjct: 564 GAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQ-PPQRQPEAPAPQPPAGRELSAAA 622

Query: 235 EA 236
A
Sbjct: 623 NA 624


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS03695 | BCTERIALGSPD | 260 | 1e-78 | Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D

signature.
Length = 660

Score = 260 bits (665), Expect = 1e-78
Identities = 124/535 (23%), Positives = 230/535 (42%), Gaps = 57/535 (10%)

Query: 254 GMSVGVFPIQSGKAEKVSADLEKVFGEQSKTPSAGMFRFMPLENANAVLVITPQPRYLDQ 313
+ V P+ + A DL + + + +AG+ + E +N VL++T + + +
Sbjct: 126 EVVTRVVPLTNVAA----RDLAPLLRQLN--DNAGVGSVVHYEPSN-VLLMTGRAAVIKR 178

Query: 314 IQQWLDRIDSAGGGVRLFSYELKYIKAKDLADRLAEVFGGHSSGG-------------DS 360
+ ++R+D+AG + + L + A D+ + E+ S +
Sbjct: 179 LLTIVERVDNAGDRS-VVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERT 237

Query: 361 NASLVPGSE------TSVLGGALGNRDSNMGGSSGMTGGSIGDSGDGSSSGSSFGGSSGS 414
NA LV G +++ +R G++ + + D + +
Sbjct: 238 NAVLVSGEPNSRQRIIAMIKQL--DRQQATQGNTKVIYLKYAKASDLVEVLTGISSTM-- 293

Query: 415 SGGLGNGSLQLSPRSNGNGAVTLDVAGDKVGVSAVAETNTLLVRSTPQAWSSIRDVIEKL 474
S + LD + + A +TN L+V + P + + VI +L
Sbjct: 294 ----------QSEKQAAKPVAALD---KNIIIKAHGQTNALIVTAAPDVMNDLERVIAQL 340

Query: 475 DVMPMQVHIEAQVAEVNLTGQLSYGVNWYFENAVNAATDSNS--NGPGFKGGAGLPSAAG 532
D+ QV +EA +AEV L+ G+ W +NA ++ G G
Sbjct: 341 DIRRPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKD-G 399

Query: 533 RNIWGDIAGKVTGDGVAWSFLGKNAAAIITALDKVTDVRLLQTPSVFVRNNAEATLNVGS 592
+ + +G+A F N A ++TAL T +L TPS+ +N EAT NVG
Sbjct: 400 TVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQ 459

Query: 593 RIAINSTSINTGIGVDSSYSSVQYIDTGVILKVRPRVTKDGMVFLDIVQEVSSPGDRPAA 652
+ + + S T D+ +++V+ G+ LKV+P++ + V L+I QEVSS D
Sbjct: 460 EVPVLTGSQTTS--GDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVAD---- 513

Query: 653 CTSATATVNAAACNVDINTRRVKTEAAVQSGDTIMLAGLIDDTTSDGSNGIPFLSKLPVV 712
A+ ++ NTR V V SG+T+++ GL+D + SD ++ +P L +PV+
Sbjct: 514 ----AASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVI 569

Query: 713 GALFGSKSRNSARREVIVLITPSIVHNPQEARNLTDEYGQKFKAMEPLKPSQKPQ 767
GALF S S+ ++R +++ I P+++ + E R + F + + ++
Sbjct: 570 GALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENN 624



Score = 191 bits (487), Expect = 5e-54
Identities = 73/292 (25%), Positives = 125/292 (42%), Gaps = 21/292 (7%)

Query: 89 ASSGSATFNFEGESVQAVVKAILGDMLGQNYVIAPGVQGTVTLATPNPVSPAQALNLLEM 148
A++ + +F+G +Q + + + L + +I P V+GT+T+ + + ++ Q
Sbjct: 25 AAAEEFSASFKGTDIQEFINTVSKN-LNKTVIIDPSVRGTITVRSYDMLNEEQYYQFFLS 83

Query: 149 VLG-WNNARMVFSGGRYNIVPA-DQALAGTVAPSTASPSAARGFEVRVVPLKFISASEMK 206
VL + A + + G +V + D A S A+P RVVPL ++A ++
Sbjct: 84 VLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLA 143

Query: 207 KVLEPYARPNAIVGTD---PARNVITLGGTRAELENYLRTVQIFDVDWLSGMSVGVFPIQ 263
+L NA VG+ NV+ + G A ++ L V+ VD SV P+
Sbjct: 144 PLLRQL-NDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVE--RVDNAGDRSVVTVPLS 200

Query: 264 SGKAEKVSADLEKVFGEQSKT--PSAGMFRFMPLENANAVLVI---TPQPRYLDQIQQWL 318
A V + ++ + SK+ P + + + E NAVLV + R + I+Q L
Sbjct: 201 WASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIKQ-L 259

Query: 319 DRIDSAGGGVRLFSYELKYIKAKDLADRLAEVFGGHSSGGDSNASLVPGSET 370
DR + G ++ LKY KA DL + L + SS S
Sbjct: 260 DRQQATQGNTKVIY--LKYAKASDLVEVLTGI----SSTMQSEKQAAKPVAA 305



Score = 39.9 bits (93), Expect = 3e-05
Identities = 35/236 (14%), Positives = 81/236 (34%), Gaps = 21/236 (8%)

Query: 187 ARGFEVRVVPLKFISASEMKKVLEPYAR---PNAIVGT-------DPARNVITLGGTRAE 236
A V VPL + SA+++ K++ + +A+ G+ D N + + G
Sbjct: 189 AGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNS 248

Query: 237 LENYLRTVQIFDVDWLSGMSVGVFPIQSGKAEKVSADLEKVFGEQSKTPSAGM------- 289
+ + ++ D + + V ++ KA + L + A
Sbjct: 249 RQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDK 308

Query: 290 -FRFMPLENANAVLVITPQPRYLDQIQQWLDRIDSAGGGVRLFS--YELKYIKAKDLADR 346
NA L++T P ++ +++ + ++D V + + E++ +L +
Sbjct: 309 NIIIKAHGQTNA-LIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQ 367

Query: 347 LAEVFGGHSSGGDSNASLVPGSETSVLGGALGNRDSNMGGSSGMTGGSIGDSGDGS 402
A G + +S + + G S++ + G G+
Sbjct: 368 WANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGN 423


14 | AXO1947_RS03980 | AXO1947_RS20970 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
AXO1947_RS03980 | 0 | 25 | -4.804536 | TetR family transcriptional regulator
AXO1947_RS03985 | 1 | 27 | -5.303376 | LysR family transcriptional regulator
AXO1947_RS03995 | -2 | 21 | -2.307545 | 3-isopropylmalate dehydratase large subunit
AXO1947_RS20945 | 1 | 23 | -2.429376 | 3-isopropylmalate dehydratase small subunit
AXO1947_RS20950 | 1 | 23 | -2.479937 | SAM-dependent methyltransferase
AXO1947_RS04000 | 3 | 28 | -4.156705 | 3-isopropylmalate dehydrogenase
AXO1947_RS04005 | 4 | 32 | -5.602757 | 2-isopropylmalate synthase
AXO1947_RS20955 | 5 | 36 | -6.479530 | serine/threonine dehydratase
AXO1947_RS04010 | 4 | 37 | -6.805543 | acetolactate synthase
AXO1947_RS20960 | 3 | 38 | -7.486137 | acetolactate synthase 2 catalytic subunit
AXO1947_RS20965 | 2 | 30 | -6.138230 | ketol-acid reductoisomerase
AXO1947_RS20970 | 0 | 23 | -3.932998 | phosphomethylpyrimidine synthase ThiC
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS04065 | HTHTETR | 72 | 1e-17 | TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 71.6 bits (175), Expect = 1e-17
Identities = 35/172 (20%), Positives = 58/172 (33%), Gaps = 7/172 (4%)

Query: 15 GPGRPKDLGKRAAILGAARAMFMELGYAGVSMDGIAARAGVSKLTVYSHFGDKESLFSEA 74
+ + R IL A +F + G + S+ IA AGV++ +Y HF DK LFSE
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 75 IRAQCQHM--MPDDLFDHAPKGALRDQLTEIAHAFFVMVSTESAISTHRMMM---APGTG 129
++ + + P G L EI TE ++ G
Sbjct: 63 WELSESNIGELELEYQAKFP-GDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 130 DVHIREMFWDAGPKRTQRALADFLSARVADGQLEIP-DVARAASQFFCLLKG 180
++ + + + + L + L RAA + G
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG 173


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS04090 | ISCHRISMTASE | 28 | 0.040 | Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 27.7 bits (61), Expect = 0.040
Identities = 11/46 (23%), Positives = 19/46 (41%)

Query: 85 GLTQISTLQGVAERLPFEAGSMDAVVSRYSAHHWSDLGQALREVRR 130
GL + + L E + RYSA ++L + +R+ R
Sbjct: 98 GLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGR 143


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS04125 | TYPE3OMGPROT | 30 | 0.032 | Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein

family signature.
Length = 607

Score = 29.9 bits (67), Expect = 0.032
Identities = 19/53 (35%), Positives = 28/53 (52%), Gaps = 3/53 (5%)

Query: 492 HLGLPNRQDVRDGIMAYK-IAAHAADLAKGHPGAQVRDNALSKARFEFRWDDQ 543
HL L N QD+R GI+ I+ + L K G+Q + L+KA+ +W Q
Sbjct: 516 HLALGNGQDLRTGILTVDEISNQSTTLNKLLGGSQCQ--PLNKAQEVQKWLSQ 566


15 | AXO1947_RS04890 | AXO1947_RS04960 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
AXO1947_RS04890 | 2 | 12 | -2.784464 | hypothetical protein
AXO1947_RS04905 | 2 | 10 | -3.151561 | membrane-bound PQQ-dependent dehydrogenase,
AXO1947_RS04910 | 1 | 10 | -3.066200 | trehalose-6-phosphate synthase
AXO1947_RS04915 | 1 | 12 | -2.955913 | glucoamylase
AXO1947_RS04920 | 0 | 13 | -2.039127 | trehalose-phosphatase
AXO1947_RS04930 | 1 | 18 | -4.663279 | hypothetical protein
AXO1947_RS04935 | 2 | 22 | -4.449430 | ligand-gated channel
AXO1947_RS04940 | 4 | 27 | -4.956730 | lipoprotein
AXO1947_RS04950 | 3 | 28 | -5.111607 | DUF4432 domain-containing protein
AXO1947_RS04960 | 1 | 19 | -3.123247 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS05055 | TYPE3IMPPROT | 29 | 0.021 | Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein

family signature.
Length = 224

Score = 29.4 bits (66), Expect = 0.021
Identities = 16/87 (18%), Positives = 34/87 (39%)

Query: 79 FANRTLWPLLHFRLDLVDYDRATREGYMRVNRLFAEKLAPLLKDSDTLWIHDYHMIPLGA 138
+ L + + D + ++ R + E+ + +D D + +
Sbjct: 91 HVDEGLDGYRDYLIKYSDRELVQFFENAQLKRQYGEETETVKRDKDEIEKPSIFALLPAY 150

Query: 139 MLRELGVGCKMGFFLHVPMPSADLVQA 165
L E+ K+GF+L++P DLV +
Sbjct: 151 ALSEIKSAFKIGFYLYLPFVVVDLVVS 177


16 | AXO1947_RS05480 | AXO1947_RS05595 | Y | Y | Y | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
AXO1947_RS05480 | 4 | 15 | 1.740162 | aspartate--tRNA ligase
AXO1947_RS05485 | 4 | 15 | 2.158340 | N-acetyltransferase
AXO1947_RS05490 | 4 | 15 | 2.694114 | cobalamin adenosyltransferase
AXO1947_RS05495 | 4 | 16 | 2.733367 | YebC/PmpR family DNA-binding transcriptional
AXO1947_RS05500 | 3 | 13 | 2.042381 | crossover junction endodeoxyribonuclease RuvC
AXO1947_RS05505 | 2 | 14 | 1.933937 | Holliday junction branch migration protein RuvA
AXO1947_RS05510 | 3 | 14 | 1.672459 | potassium transporter Kup
AXO1947_RS05515 | 4 | 15 | 1.346407 | Holliday junction branch migration DNA helicase
AXO1947_RS05520 | 2 | 13 | 0.551374 | tol-pal system-associated acyl-CoA thioesterase
AXO1947_RS05525 | 2 | 12 | 0.700795 | Tol-Pal system subunit TolQ
AXO1947_RS05530 | 1 | 11 | 0.972042 | protein TolR
AXO1947_RS05535 | 1 | 12 | 1.130822 | protein TolA
AXO1947_RS05540 | 0 | 12 | 1.496651 | translocation protein TolB
AXO1947_RS05545 | 1 | 14 | 2.276547 | peptidoglycan-associated lipoprotein
AXO1947_RS05555 | 2 | 16 | 3.666516 | tol-pal system protein YbgF
AXO1947_RS05560 | 1 | 14 | 3.355854 | 7-carboxy-7-deazaguanine synthase
AXO1947_RS05565 | 1 | 14 | 3.098736 | *7-cyano-7-deazaguanine synthase
AXO1947_RS05575 | 0 | 14 | 0.706472 | chemotaxis protein CheY
AXO1947_RS05585 | 1 | 16 | 0.411056 | amino acid transporter
AXO1947_RS05595 | 2 | 15 | 0.312827 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS05670 | SACTRNSFRASE | 37 | 1e-05 | Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.2 bits (86), Expect = 1e-05
Identities = 16/55 (29%), Positives = 26/55 (47%), Gaps = 1/55 (1%)

Query: 99 KDYQNSGWGSRLFETALQWLERDGPRTLWIGVWSENFGAQRLYARYGFEKVGKYD 153
KDY+ G G+ L A++W + + L + N A YA++ F +G D
Sbjct: 99 KDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF-IIGAVD 152


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS05700 | FERRIBNDNGPP | 29 | 0.031 | Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 28.8 bits (64), Expect = 0.031
Identities = 21/80 (26%), Positives = 31/80 (38%), Gaps = 17/80 (21%)

Query: 8 SSSIREDDAADASIRPKRLADYLGQQPVRE----QMEIYIQAAKAR-----------GEA 52
+ S + A A +AD L Q E Q E +I++ K R
Sbjct: 123 NFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTL 182

Query: 53 MD--HVLIFGPPGLGKTTLS 70
+D H+L+FGP L + L
Sbjct: 183 IDPRHMLVFGPNSLFQEILD 202


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS05720 | IGASERPTASE | 55 | 3e-10 | IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 54.7 bits (131), Expect = 3e-10
Identities = 37/220 (16%), Positives = 65/220 (29%), Gaps = 17/220 (7%)

Query: 39 LWSPE-----RSVEPAAGDPSMEASLDVSAADARVARQALKATPVETPPPAPLPEPAPE- 92
L++PE ++V+ DV + + A PP P E
Sbjct: 980 LYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTET 1039

Query: 93 --DSVPPPQPIPEPRPQDA--PTPQQAQAQERVAQPDKVDQDRVDALAISAEKAKQEQEA 148
++ E QDA T Q + K + V A + E A+ E
Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREV-------AKEAKSNVKANTQTNEVAQSGSET 1092

Query: 149 KRRQEQIDLTERKRQEEAEQKLRLAKQQEEADAKKKQAAAQQAAEDAERQKKIDDIRRQR 208
K Q ++E + K+ K QE + + Q+ +E + Q +
Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPT 1152

Query: 209 AQADKDMALAEQKLRQVAAARAQQSSAATATSAQPTAGQG 248
+ + A+ S+ + T G
Sbjct: 1153 VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTG 1192



Score = 51.6 bits (123), Expect = 3e-09
Identities = 35/224 (15%), Positives = 63/224 (28%), Gaps = 39/224 (17%)

Query: 59 LDVSAADARVARQALKATPVETPPPAPLPEPAPE---DSVPPPQPIPEPRPQDAPTPQQA 115
L+VS V A K L P E +V TP
Sbjct: 953 LNVSLVGNTVDLGAWKYKLRNVNGRYDLYNPEVEKRNQTVDTTNI---------TTPNNI 1003

Query: 116 QAQERVAQPDKVDQDRVDALAIS--------------AEKAKQEQ-----------EAKR 150
QA + + RVD + AE +KQE E
Sbjct: 1004 QADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTA 1063

Query: 151 RQEQIDLTERKRQEEAEQKLRLAKQQEEADAKKKQAAAQQAAEDAERQKKIDDIRRQRAQ 210
+ ++ + + Q +A+ E + + A + E + K++ + Q
Sbjct: 1064 QNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVP 1123

Query: 211 ADKDMALAEQKLRQVA--AARAQQSSAATATSAQPTAGQGGTST 252
+Q+ + A + + T +P + T+
Sbjct: 1124 KVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTAD 1167



Score = 35.0 bits (80), Expect = 4e-04
Identities = 28/203 (13%), Positives = 60/203 (29%), Gaps = 11/203 (5%)

Query: 47 EPAAGDPSMEASLDVSAAD-----ARVARQALKATPVETPPPAPLPEPAPEDSVPPPQPI 101
+ + EA +V A A+ + + ET A + E + V +
Sbjct: 1062 TAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATV-EKEEKAKVETEKTQ 1120

Query: 102 PEPRPQDAPTPQQAQAQERVAQPDKVDQDRVDALAISAEKAKQEQEAKRRQEQIDLTERK 161
P+ +P+Q Q++ Q + ++ + I +++ A Q + +
Sbjct: 1121 EVPKVTSQVSPKQEQSETVQPQAEP-ARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179

Query: 162 RQEEAEQKLRLAKQQEEADAKKKQAAAQQAAEDAERQKKIDD----IRRQRAQADKDMAL 217
Q E + + A Q ++E K + R +
Sbjct: 1180 EQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATT 1239

Query: 218 AEQKLRQVAAARAQQSSAATATS 240
+ VA ++ S
Sbjct: 1240 SSNDRSTVALCDLTSTNTNAVLS 1262


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS05730 | OMPADOMAIN | 106 | 3e-30 | OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 106 bits (266), Expect = 3e-30
Identities = 35/112 (31%), Positives = 51/112 (45%), Gaps = 11/112 (9%)

Query: 67 VYFDLDQDSLKPEFQAIMACHAKYLR--DRPSSRITLQGNADERGSREYNMGLGERRGNA 124
V F+ ++ +LKPE QA + L D + + G D GS YN GL ERR +
Sbjct: 221 VLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQS 280

Query: 125 VSSSLQAAGGSASQLTVVSYGEERPVCTESNE---------SCWSQNRRVEI 167
V L + G A +++ GE PV + + C + +RRVEI
Sbjct: 281 VVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEI 332


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS05735 | RTXTOXIND | 34 | 5e-04 | Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 34.0 bits (78), Expect = 5e-04
Identities = 23/88 (26%), Positives = 40/88 (45%), Gaps = 7/88 (7%)

Query: 30 RVAVLEQQQANSQANNDL---LNQLQQARSDLQALRSTVEQLQHD--NEQLKQ--QSKDQ 82
+ AVLEQ+ +A N+L +QL+Q S++ + + + + NE L + Q+ D
Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN 310

Query: 83 YLDLDGRLNRLEGAGGATPSLPPATGSV 110
L L + E A+ P + V
Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKV 338


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS05760 | HTHFIS | 44 | 3e-08 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 43.7 bits (103), Expect = 3e-08
Identities = 16/82 (19%), Positives = 38/82 (46%), Gaps = 3/82 (3%)

Query: 4 RVLLVEDESLVAMLLEDCLTELGYEVAATVADVDAALQAVHAGNLDLALPDVNLCGTLSF 63
+L+ +D++ + +L L+ GY+V ++ + + AG+ DL + DV + +F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDV-RITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 64 PIAEELDA--CGLPYIFVTGYA 83
+ + LP + ++
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQN 85


17 | AXO1947_RS06025 | AXO1947_RS06065 | Y | N | N | Genomic Island
LocusTag | DNBias | CDNBias | %GCBias | Product
AXO1947_RS06025 | 0 | 22 | -3.934674 | membrane protein
AXO1947_RS06030 | 2 | 33 | -5.769905 | hypothetical protein
AXO1947_RS06035 | 3 | 36 | -7.746731 | glycine cleavage system protein T
AXO1947_RS06045 | 5 | 42 | -9.346298 | glycine cleavage system protein H
AXO1947_RS06050 | 3 | 35 | -7.180636 | hypothetical protein
AXO1947_RS06055 | 0 | 29 | -6.000797 | histidine biosynthesis protein HisIE
AXO1947_RS06060 | 1 | 31 | -6.725845 | serine hydrolase
AXO1947_RS06065 | -1 | 22 | -4.093830 | membrane protein
18 | AXO1947_RS06150 | AXO1947_RS06205 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
AXO1947_RS06150 | 2 | 18 | -1.327346 | cystathionine gamma-synthase
AXO1947_RS06160 | 3 | 16 | -0.849708 | L-serine ammonia-lyase
AXO1947_RS06165 | 3 | 22 | -1.583866 | thioredoxin family protein
AXO1947_RS06170 | 4 | 24 | -1.836284 | cold-shock protein
AXO1947_RS21270 | 1 | 16 | -1.510250 | polyisoprenoid-binding protein
AXO1947_RS21275 | -1 | 15 | -0.963681 | cytochrome b
AXO1947_RS06180 | -1 | 12 | -0.318372 | polyisoprenoid-binding protein
AXO1947_RS06185 | 0 | 15 | -2.532735 | hybrid sensor histidine kinase/response
AXO1947_RS06190 | 1 | 13 | -2.105180 | histidine kinase
AXO1947_RS06195 | 2 | 12 | -1.794741 | histidine kinase
AXO1947_RS06200 | 2 | 13 | -1.931329 | histidine kinase
AXO1947_RS06205 | 3 | 12 | -2.019566 | alpha-L-fucosidase
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS06310 | HTHFIS | 75 | 7e-16 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.9 bits (184), Expect = 7e-16
Identities = 23/117 (19%), Positives = 48/117 (41%)

Query: 1062 RLLLVEDDATVAQVIVGLLQTRGHHVTHVVHGLAALAEVSTRRFDAGLCDLDLPGLDGVA 1121
+L+ +DDA + V+ L G+ V + ++ D + D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1122 LVAQLRARGVRFPIVAVTARADADAEPQAMAAGCNGFLRKPVTGDLLAQALARVLTE 1178
L+ +++ P++ ++A+ +A G +L KP L + R L E
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS06315 | HTHFIS | 75 | 8e-16 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.9 bits (184), Expect = 8e-16
Identities = 24/114 (21%), Positives = 49/114 (42%)

Query: 1054 RILLVEDEPTVAEVISGLLINRGHRVVHAAHGLAALAEAVDGGFDVALLDLDLPCLDGFA 1113
IL+ +D+ + V++ L G+ V ++ G D+ + D+ +P + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1114 LASQLRQLGHRFPLLAVTARADSAAEAQALAAGFDGFLRKPVTADLLVEAIAAA 1167
L ++++ P+L ++A+ +A G +L KP L+ I A
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS06320 | HTHFIS | 75 | 5e-16 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.3 bits (185), Expect = 5e-16
Identities = 28/115 (24%), Positives = 51/115 (44%)

Query: 1070 RILLVEDDPTIAEVIVGLLRAQGHSVVHAPHGLAALTEAADNTFDLALLDLDLPGLDGFA 1129
IL+ +DD I V+ L G+ V + A DL + D+ +P + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1130 LARQLRAFGYEMPLIAVTARSDEVAEPKAQDAGFDSFLRKPLTGDMLADTIAEAL 1184
L +++ ++P++ ++A++ + KA + G +L KP L I AL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS06325 | HTHFIS | 75 | 6e-16 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.9 bits (184), Expect = 6e-16
Identities = 28/115 (24%), Positives = 49/115 (42%)

Query: 1056 RILLVEDDPTIAEVIVGLLRAQGHSVVHAPHGLAALTEAADNTFDLALLDLDLPGLDGFA 1115
IL+ +DD I V+ L G+ V + A DL + D+ +P + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1116 LARQLRVFGYEMPLIAVTARSDEAAEPNAHEAGFDSFLRKPLTGDMLADTIAEAL 1170
L +++ ++P++ ++A++ A E G +L KP L I AL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119


19 | AXO1947_RS06425 | AXO1947_RS06515 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
AXO1947_RS06425 | 2 | 14 | -0.951697 | cell shape determination protein CcmA
AXO1947_RS06430 | 2 | 10 | -1.858391 | GGDEF domain-containing protein
AXO1947_RS06435 | 1 | 12 | -1.018008 | hypothetical protein
AXO1947_RS06440 | 1 | 11 | 0.048323 | hypothetical protein
AXO1947_RS06445 | 0 | 10 | 0.358177 | acetyl-CoA acetyltransferase
AXO1947_RS06455 | 0 | 13 | 0.472377 | serine protease
AXO1947_RS06460 | 1 | 12 | 0.280094 | leucine dehydrogenase
AXO1947_RS06470 | 2 | 12 | 0.952114 | metal-dependent hydrolase
AXO1947_RS06480 | 2 | 12 | 0.795746 | DUF1328 domain-containing protein
AXO1947_RS06485 | 2 | 12 | 1.427271 | hypothetical protein
AXO1947_RS06495 | 2 | 14 | 2.055174 | hypothetical protein
AXO1947_RS06500 | 2 | 13 | 2.051309 | mechanosensitive ion channel protein MscS
AXO1947_RS06510 | 2 | 14 | 2.711219 | RNA polymerase sigma24 factor
AXO1947_RS06515 | 2 | 15 | 2.572478 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS06570 | SUBTILISIN | 131 | 2e-35 | Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 131 bits (330), Expect = 2e-35
Identities = 78/316 (24%), Positives = 123/316 (38%), Gaps = 46/316 (14%)

Query: 70 LTNARAAQALGFTGAGYRIGVIDTGINANHPALQGRVSDSFIYVDPRINNTA-VGDVVGH 128
+ A A G G ++ V+DTG +A+HP L+ R+ + D + D GH
Sbjct: 28 MIQAPAVWNQT-RGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGH 86

Query: 129 GTVVAELAAGRAVGQWPGGIAPGAGLVSARIISDRAPVDDGTGNGNEIDGPLGLGPVHAD 188
GT VA A G+AP A L+ ++++ G+G + I +
Sbjct: 87 GTHVAGTIAATENENGVVGVAPEADLLIIKVLNK-----QGSGQYDWIIQGI------YY 135

Query: 189 LISAGVRIMNNSWGGLYWNDPTVTNQIAQEYRPFILSNNGLVVFASGNESRSQPSDTAAL 248
I V I++ S GG + P + + + + LV+ A+GNE
Sbjct: 136 AIEQKVDIISMSLGGPE-DVPELHEAVKKAVA-----SQILVMCAAGNEGDG-------- 181

Query: 249 PSQPGPNGTLPAADLERGWLVVGAVDTANPTQLASYSNACGVAMRYCLVAPGTSLFIDPD 308
+ G + + VGA++ + +SN+ LVAPG +
Sbjct: 182 DDRTDELGYPGCYN---EVISVGAINFDR--HASEFSNSNN---EVDLVAPGEDIL---- 229

Query: 309 ATAGNIRYFYGSGTSFAAPLVSGAAALVWQAFPY-FNNDL----VRQTLLGTATDLGAAG 363
+T +Y SGTS A P V+GA AL+ Q F DL + L+ LG
Sbjct: 230 STVPGGKYATFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLG--N 287

Query: 364 VDPVFGYGLLNVGKAV 379
+ G GLL +
Sbjct: 288 SPKMEGNGLLYLTAVE 303


20 | AXO1947_RS06660 | AXO1947_RS06730 | Y | N | N | Genomic Island
LocusTag | DNBias | CDNBias | %GCBias | Product
AXO1947_RS06660 | 2 | 12 | 2.788685 | methionine--tRNA ligase
AXO1947_RS06665 | 2 | 14 | 2.947401 | hypothetical protein
AXO1947_RS21350 | 0 | 13 | 2.485638 | VOC family protein
AXO1947_RS06675 | 2 | 14 | 2.392768 | amino acid transporter
AXO1947_RS06685 | 2 | 14 | 0.976397 | homocysteine S-methyltransferase
AXO1947_RS06690 | 2 | 13 | 0.910213 | DUF885 domain-containing protein
AXO1947_RS06695 | 2 | 13 | 0.995703 | membrane protein
AXO1947_RS06700 | 2 | 12 | 0.992843 | membrane protein
AXO1947_RS06705 | 0 | 11 | 0.794027 | hypothetical protein
AXO1947_RS06710 | 1 | 12 | 0.832062 | hypothetical protein
AXO1947_RS06715 | -1 | 11 | 1.110006 | polyhydroxyalkanoate depolymerase
AXO1947_RS06720 | -1 | 11 | 1.541992 | methyltransferase
AXO1947_RS06725 | 1 | 12 | 2.094575 | membrane protein
AXO1947_RS06730 | 3 | 12 | 1.663748 | hypothetical protein
21 | AXO1947_RS07155 | AXO1947_RS07180 | Y | Y | Y | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
AXO1947_RS07155 | 3 | 16 | -3.824527 | IS5/IS1182 family transposase
AXO1947_RS07160 | 1 | 15 | -4.368049 | glutamine synthetase
AXO1947_RS07165 | 1 | 22 | -5.021732 | homoserine O-succinyltransferase
AXO1947_RS07170 | 1 | 21 | -4.557527 | amino acid permease
AXO1947_RS07175 | 1 | 16 | -4.404674 | gamma-glutamyl-gamma-aminobutyrate hydrolase
AXO1947_RS07180 | 1 | 21 | -4.221784 | dihydrorhizobitoxine desaturase
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS07340 | PF05043 | 35 | 5e-04 | Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 35.3 bits (81), Expect = 5e-04
Identities = 20/86 (23%), Positives = 34/86 (39%), Gaps = 14/86 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAHM 140
+ L+ H NTAH+
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAHL 325


22 | AXO1947_RS07330 | AXO1947_RS07450 | Y | Y | Y | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
AXO1947_RS07330 | -1 | 21 | -4.004645 | ABC transporter ATP-binding protein
AXO1947_RS21430 | 2 | 36 | -8.972714 | IMPACT family protein
AXO1947_RS07340 | 2 | 45 | -11.408040 | Rossman fold protein, TIGR00730 family
AXO1947_RS07345 | 6 | 58 | -15.447073 | dihydrolipoyl dehydrogenase
AXO1947_RS21435 | 6 | 59 | -15.692525 | dihydrolipoamide succinyltransferase
AXO1947_RS21440 | 7 | 62 | -16.683998 | 2-oxoglutarate dehydrogenase subunit E1
AXO1947_RS07355 | 6 | 61 | -16.024701 | N-acetyltransferase
AXO1947_RS21445 | 5 | 55 | -14.225737 | transcriptional regulator
AXO1947_RS07360 | 4 | 47 | -11.646892 | hypothetical protein
AXO1947_RS07365 | 3 | 42 | -9.981133 | hypothetical protein
AXO1947_RS21450 | 3 | 43 | -10.475865 | adenylosuccinate lyase
AXO1947_RS07375 | 1 | 35 | -7.455938 | hypothetical protein
AXO1947_RS07380 | 1 | 27 | -4.787176 | IS5/IS1182 family transposase
AXO1947_RS07385 | 1 | 25 | -4.918000 | class II fumarate hydratase
AXO1947_RS07395 | 0 | 31 | -5.290772 | hypothetical protein
AXO1947_RS07400 | -1 | 30 | -5.286012 | hypothetical protein
AXO1947_RS21455 | -1 | 20 | -1.752916 | hypothetical protein
AXO1947_RS07405 | -2 | 15 | -1.157350 | hypothetical protein
AXO1947_RS07410 | -1 | 14 | -1.269665 | hypothetical protein
AXO1947_RS21460 | -2 | 13 | -0.597440 | multidrug ABC transporter ATP-binding protein
AXO1947_RS07420 | 0 | 12 | -0.832645 | GntR family transcriptional regulator
AXO1947_RS07425 | 0 | 10 | -0.989570 | hypothetical protein
AXO1947_RS07430 | 1 | 14 | -2.537086 | glutathione peroxidase
AXO1947_RS07435 | 2 | 17 | 1.644056 | peptidylprolyl isomerase
AXO1947_RS07440 | 0 | 15 | 2.380414 | UDP-glucose 6-dehydrogenase
AXO1947_RS07445 | 0 | 19 | 2.376770 | protein SlyX
AXO1947_RS07450 | 2 | 26 | 1.646988 | nucleoprotein/polynucleotide-associated enzyme
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS07530 | RTXTOXIND | 29 | 0.034 | Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 29.0 bits (65), Expect = 0.034
Identities = 15/87 (17%), Positives = 33/87 (37%), Gaps = 2/87 (2%)

Query: 47 EVPSPVDGVLKEIKFEAGSTVTSNQILAIIEEGAVAAAAPAEQKKAAAPAAAAPAAAPDA 106
E+ + ++KEI + G +V +L + A+ A A + +++ A
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLT--ALGAEADTLKTQSSLLQARLEQTRYQI 155

Query: 107 AAAPAPASKSAADSLPPGARFSAITQG 133
+ +K LP F +++
Sbjct: 156 LSRSIELNKLPELKLPDEPYFQNVSEE 182


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS07600 | TYPE3OMOPROT | 28 | 0.033 | Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein

family signature.
Length = 303

Score = 28.4 bits (63), Expect = 0.033
Identities = 18/46 (39%), Positives = 27/46 (58%), Gaps = 2/46 (4%)

Query: 213 ATSKPFLWAVLFPLLACVMLSILSAMPGVSLPIGWIWYIVGYRGLL 258
AT +PF V P L+C L + + +PG +LP G + +I+ RG L
Sbjct: 86 ATERPFELPV--PHLSCRRLCVENPVPGSALPEGKLLHIMSDRGGL 129


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS07620 | INFPOTNTIATR | 136 | 4e-41 | Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 136 bits (344), Expect = 4e-41
Identities = 67/178 (37%), Positives = 107/178 (60%), Gaps = 8/178 (4%)

Query: 127 EIDLSILMDAVRTVFAKGTTRLTQQEAMATMQAFAS---AKQGAAGAKNREE----GNAF 179
+I+ +L ++ + LT+++ + F AK+ A K EE G+AF
Sbjct: 52 DINPDVLAKGMQDGMSGAQLILTEEQMKDVLSKFQKDLMAKRSAEFNKKAEENKAKGDAF 111

Query: 180 LAKNKTEKGVITTASGLQYMVLRQGSGERPMRTNKVRVNYEGKLLNGQVFDSSYQRGQPA 239
L+ NK++ G++ SGLQY ++ G+G +P +++ V V Y G L++G VFDS+ + G+PA
Sbjct: 112 LSANKSKPGIVVLPSGLQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPA 171

Query: 240 EFGLDQVIPGWTEGVALMPVGSKYRFWIPSNLAYGPNGTQG-IGPDATLTFDVELMGI 296
F + QVIPGWTE + LMP GS + ++P++LAYGP G IGP+ TL F + L+ +
Sbjct: 172 TFQVSQVIPGWTEALQLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISV 229



Score = 41.5 bits (97), Expect = 2e-06
Identities = 18/72 (25%), Positives = 43/72 (59%)

Query: 15 ALNASSEKSTRDNVSYAIGMDVARSFEPIAQDIDVNAMQRAIENAFKGGKPLLSDEQTQA 74
A +A+S + +D +SY+IG D+ ++F+ DI+ + + + +++ G + +L++EQ +
Sbjct: 21 ATDATSLTTDKDKLSYSIGADLGKNFKNQGIDINPDVLAKGMQDGMSGAQLILTEEQMKD 80

Query: 75 TDTALRTALAAR 86
+ + L A+
Sbjct: 81 VLSKFQKDLMAK 92


23 | AXO1947_RS07605 | AXO1947_RS07720 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
AXO1947_RS07605 | 2 | 12 | -2.762708 | peptidylprolyl isomerase
AXO1947_RS07610 | 2 | 12 | -2.966557 | CoA pyrophosphatase
AXO1947_RS21485 | 1 | 13 | -2.110985 | sulfurtransferase
AXO1947_RS07615 | 0 | 13 | -1.643143 | hypothetical protein
AXO1947_RS07620 | -2 | 13 | -0.842569 | N-acetylmuramoyl-L-alanine amidase
AXO1947_RS07630 | -1 | 15 | -0.616332 | MOSC domain-containing protein
AXO1947_RS07635 | -2 | 14 | -1.027651 | alpha/beta hydrolase
AXO1947_RS07640 | -2 | 11 | -1.855915 | N-acetyltransferase
AXO1947_RS07645 | -1 | 11 | -2.114236 | hypothetical protein
AXO1947_RS07655 | 2 | 10 | -1.894822 | 23S rRNA
AXO1947_RS07660 | 2 | 11 | -2.014905 | NAD(+) kinase
AXO1947_RS07665 | 3 | 13 | -1.638953 | 5'-nucleotidase
AXO1947_RS07670 | 1 | 10 | -0.112420 | hypothetical protein
AXO1947_RS07675 | 2 | 13 | 0.460941 | oxidoreductase
AXO1947_RS07680 | 1 | 13 | 0.281992 | TIGR02453 family protein
AXO1947_RS07685 | -1 | 11 | 0.994133 | exodeoxyribonuclease I
AXO1947_RS21495 | -2 | 12 | 1.025804 | kynurenine 3-monooxygenase
AXO1947_RS07700 | 0 | 12 | 0.702325 | kynureninase
AXO1947_RS07705 | 1 | 15 | 0.747871 | 3-hydroxyanthranilate 3,4-dioxygenase
AXO1947_RS07710 | 1 | 14 | -0.030231 | carbonic anhydrase
AXO1947_RS07715 | 1 | 15 | -0.756538 | hypothetical protein
AXO1947_RS07720 | 2 | 17 | -0.988589 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS07795 | INFPOTNTIATR | 140 | 2e-43 | Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 140 bits (354), Expect = 2e-43
Identities = 91/239 (38%), Positives = 132/239 (55%), Gaps = 16/239 (6%)

Query: 1 MKLRSIAVAVAALALTGNALAQDTTS---EKGKLSYYFGYDYGNNLAELTGRGEQLDINS 57
MK++ + A+ LA++ A D TS +K KLSY G D G N +G ++ +
Sbjct: 1 MKMKLVTAAIMGLAMSTAMAATDATSLTTDKDKLSYSIGADLGKNFKN---QGIDINPDV 57

Query: 58 VVKGLQDAYAKKQPAITADQLKPAVEAFQKREQGRAQQAKAEYDKAAAANKTKSDAFLAK 117
+ KG+QD + Q +T +Q+K + FQK + AE++K A NK K DAFL+
Sbjct: 58 LAKGMQDGMSGAQLILTEEQMKDVLSKFQKDLMAKRS---AEFNKKAEENKAKGDAFLSA 114

Query: 118 NKGTAGVQTLPSGVQYRVIEAGKGAKPTQASTVQLEVAGPFPFGDREKARPAQQIPA-IK 176
NK G+ LPSG+QY++I+AG GAKP ++ TV +E G G + PA +
Sbjct: 115 NKSKPGIVVLPSGLQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQ 174

Query: 177 VSEVEMQAMRDTLLQMPAGSKWEVTLPPEKAYGADPRT---PFPPNVAVQFEIKLVSVK 232
VS+V + + L MPAGS WEV +P + AYG PR+ P PN + F+I L+SVK
Sbjct: 175 VSQV-IPGWTEALQLMPAGSTWEVFVPADLAYG--PRSVGGPIGPNETLIFKIHLISVK 230


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS07820 | CHLAMIDIAOMP | 27 | 0.043 | Chlamydia major outer membrane protein signature.
		>CHLAMIDIAOMP#Chlamydia major outer membrane protein signature.

Length = 393

Score = 26.9 bits (59), Expect = 0.043
Identities = 9/15 (60%), Positives = 11/15 (73%)

Query: 116 GTDPCDPCSRMEDAL 130
G DPCDPC+ DA+
Sbjct: 44 GGDPCDPCTTWCDAI 58


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS07830 | SACTRNSFRASE | 28 | 0.010 | Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.4 bits (63), Expect = 0.010
Identities = 9/31 (29%), Positives = 18/31 (58%)

Query: 83 LAVGPGHQRQGLGTRLVQAALATLRERGAAG 113
+AV ++++G+GT L+ A+ +E G
Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCG 125


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AXO1947_RS07905 | BCTERIALGSPF | 29 | 0.006 | Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 29.4 bits (66), Expect = 0.006
Identities = 10/39 (25%), Positives = 15/39 (38%), Gaps = 1/39 (2%)

Query: 134 PLKNIETDFPPVFDHFYRSLALRTCSQCGHLHPAPERYA 172
L + FP F+ Y ++ + GHL R A
Sbjct: 119 SLADAMKCFPGSFERLYCAM-VAAGETSGHLDAVLNRLA 156


24 | AXO1947_RS07925 | AXO1947_RS08080 | Y | Y | Y | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
AXO1947_RS07925 | 3 | 20 | -0.439776 | HNH endonuclease
AXO1947_RS07935 | 5 | 23 | -1.282973 | acyl-CoA dehydrogenase
AXO1947_RS21515 | 6 | 16 | 0.495736 | hypothetical protein
AXO1947_RS07940 | 5 | 17 | 0.609981 | membrane protein
AXO1947_RS07945 | 5 | 17 | 1.051281 | *hypothetical protein
AXO1947_RS07950 | 4 | 10 | -0.724618 | hypothetical protein
AXO1947_RS07955 | 4 | 10 | -0.277146 | hypothetical protein
AXO1947_RS07960 | 3 | 11 | -0.021815 | methyltransferase type 12
AXO1947_RS07965 | 0 | 14 | -0.221371 | hybrid sensor histidine kinase/response
AXO1947_RS07975 | -1 | 10 | -1.266320 | Fe(2+)-trafficking protein
AXO1947_RS07980 | -1 | 12 | 0.296630 | A/G-specific adenine glycosylase
AXO1947_RS07985 | 0 | 14 | -0.474804 | signal recognition particle-docking protein
AXO1947_RS07990 | 1 | 15 | -1.234839 | AraC family transcriptional regulator
AXO1947_RS07995 | 0 | 16 | -2.060459 | hydroxyproline-2-epimerase
AXO1947_RS08000 | 1 | 16 | -2.392290 | D-amino-acid oxidase
AXO1947_RS21525 | 2 | 17 | -2.381712 | (2Fe-2S)-binding protein
AXO1947_RS08005 | 0 | 12 | -0.818979 | oxidoreductase
AXO1947_RS08010 | 1 | 12 | -0.762564 | dihydrodipicolinate synthase family protein
AXO1947_RS08015 | -1 | 14 | -0.299053 | ketoglutarate semialdehyde dehydrogenase
AXO1947_RS08020 | -1 | 13 | -0.664412 | X-Pro dipeptidase
AXO1947_RS08025 | 0 | 15 | -1.633571 | hypothetical protein
AXO1947_RS08030 | 0 | 15 | -2.125007 | DUF885 domain-containing protein
AXO1947_RS08035 | 1 | 18 | -4.987318 | aspartate:proton symporter
AXO1947_RS08040 | -1 | 17 | -4.764178 | transposase
AXO1947_RS08050 | -2 | 17 | -4.365865 | IS5/IS1182 family transposase
AXO1947_RS08055 | -2 | 18 | -4.345925 | hypothetical protein
AXO1947_RS08060 | -1 | 17 | -3.515537 | hypothetical protein
AXO1947_RS08065 | -2 | 19 | -3.300921 | hypothetical protein
AXO1947_RS08070 | 0 | 20 | -2.702443 | hypothetical protein
AXO1947_RS08075 | -2 | 18 | -3.101712 | hypothetical protein
AXO1947_RS08080 | -1 | 18 | -3.032186 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS08145  BCTLIPOCALIN  92     7e-26    Bacterial lipocalin signature.
		>BCTLIPOCALIN#Bacterial lipocalin signature.

Length = 171

Score = 91.6 bits (227), Expect = 7e-26
Identities = 53/152 (34%), Positives = 82/152 (53%), Gaps = 11/152 (7%)

Query: 24 VRAVPQLDISRYAGQWHEIAHLPVSFQKKCRSDITASYTLRDDGLVGVRN-GCRIADGSL 82
V+ V +++ Y G+W+E+A L SF++ S +TA Y +R+DG + V N G G
Sbjct: 23 VKPVSDFELNNYLGKWYEVARLDHSFERGL-SQVTAEYRVRNDGGISVLNRGYSEEKGEW 81

Query: 83 TQAEGVARPVEGQP-GQLQVRFAPEWLGWLPLVWADYWVIALD-PDYQWAVVGEPDRKYL 140
+AEG A V G G L+V F + G Y V LD +Y +A V P+ +YL
Sbjct: 82 KEAEGKAYFVNGSTDGYLKVSFFGPFYG-------SYVVFELDRENYSYAFVSGPNTEYL 134

Query: 141 WILSRSPQMQRAQFERLKAQAAEMGYDLSPLI 172
W+LSR+P ++R ++ + E G+D + LI
Sbjct: 135 WLLSRTPTVERGILDKFIEMSKERGFDTNRLI 166


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS08180  HTHFIS        66     1e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.4 bits (162), Expect = 1e-13
Identities = 27/118 (22%), Positives = 45/118 (38%), Gaps = 2/118 (1%)

Query: 417 TILVVEDKQDVAVVARMFLENAGYRILSASSGREAVEVLEKNPAVDALFTDLIMPGGMNG 476
TILV +D + V L AGY + S+ + D + TD++MP N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMP-DENA 62

Query: 477 VMLAREARRMLPKIKILLTTGYADASIQRTDVGGAEFDVVNKPYTQKELLKRIRMLLD 534
L ++ P + +L+ + +D + KP+ EL+ I L
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS08200  IGASERPTASE   40     4e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 39.7 bits (92), Expect = 4e-05
Identities = 38/221 (17%), Positives = 59/221 (26%), Gaps = 28/221 (12%)

Query: 45 APSAAPPASPVEAMPHAEPRIDTAAAPAVAPGTPAAARTEAAAPAPVAATPP----AATP 100
+ P + +P + A AP P A T + VA
Sbjct: 994 TTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEK 1053

Query: 101 ADQLAQEIAARTGQAQSIAAVPSTPATPAAPAPTALQALPDPTPASAPSTPVTVV----- 155
+Q A E A+ + A + A + T + TV
Sbjct: 1054 NEQDATETTAQNREVAK-EAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKA 1112

Query: 156 ---------VPPVAGQ--PQTTPTPSDAAPAQPVTPSHSAAPTVLAPPVTAPPVVAPAVA 204
VP V Q P+ + + A+P + PTV +
Sbjct: 1113 KVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREN---DPTVNIKEPQSQTNTTADTE 1169

Query: 205 APAPIVPSAAAAPVAAAPANAAPTVATPAIVNTPLPTTQDS 245
PA S PV + ++V P TT +
Sbjct: 1170 QPAKETSSNVEQPV----TESTTVNTGNSVVENPENTTPAT 1206



Score = 31.2 bits (70), Expect = 0.014
Identities = 22/80 (27%), Positives = 28/80 (35%), Gaps = 14/80 (17%)

Query: 157 PPVAGQPQTTPTPSDAAPA-----QPVTPSHSAAPTVLAPPVTAPPVVAPAVAAPAPIVP 211
P V + QT T + P P PS++ +A AP V PAP P
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEE---IARVDEAP------VPPPAPATP 1033

Query: 212 SAAAAPVAAAPANAAPTVAT 231
S VA + TV
Sbjct: 1034 SETTETVAENSKQESKTVEK 1053


25  AXO1947_RS08460  AXO1947_RS08505  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS08460  0       18       -4.421027  glutamine synthetase
AXO1947_RS08465  4       28       -5.468675  IS5/IS1182 family transposase
AXO1947_RS21570  5       18       -1.523755  multidrug ABC transporter permease
AXO1947_RS08485  4       13       -0.378277  MFS transporter
AXO1947_RS08490  2       14       1.209710   hypothetical protein
AXO1947_RS08500  0       25       5.311469   polyamine ABC transporter ATP-binding protein
AXO1947_RS08505  -3      20       3.306210   spermidine/putrescine ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS08675  adhesinmafb   29     0.033    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 29.3 bits (65), Expect = 0.033
Identities = 37/167 (22%), Positives = 58/167 (34%), Gaps = 25/167 (14%)

Query: 13 KQPESALRRWLKERSITEVECLVPDITGNARG--KIIPADKFSHDYGTRLPEGIFATTVT 70
K A+ RW++E P+ + A K + P V+
Sbjct: 290 KNTREAVDRWIQEN---------PNAAETVEAVFNVAAAAKVAKLAKAAKPG---KAAVS 337

Query: 71 GDFPDDYYALTSPSDSDMHLRPDASTVRMVPWATDPTAQVIHDCYTKDGDPHEL-APRNV 129
GDF D Y + SDS L +A + + + D +K + E+ A N
Sbjct: 338 GDFADSYKKKLALSDSARQLYQNAKYREALDIHYEDLIRRKTDGSSKFINGREIDAVTN- 396

Query: 130 LRRVLDAYAQVK--LQPVVAPELEFFLVQKNTDPDFPLLPPAGRSGR 174
DA Q K + + P + FL QKN + A + G+
Sbjct: 397 -----DALIQAKRTISAIDKP--KNFLNQKNRKQIKATIEAANQQGK 436


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS08700  RTXTOXIND     96     6e-24    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 96.4 bits (240), Expect = 6e-24
Identities = 51/371 (13%), Positives = 113/371 (30%), Gaps = 83/371 (22%)

Query: 81 SVAVAPRVSGYVTKVLVSDNQIVEAGQPLLQIDDRTYQATLQQAEAAIAARQADIVAATA 140
S + P + V +++V + + V G LL++ +A + ++++ + +
Sbjct: 96 SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQI 155

Query: 141 NVSAQESALLQARTQVTAAAASLRFAQAEVKRFAPLAASGADTHEHQES-LQHDLARARA 199
+ E L + ++ EV R L T ++Q+ + +L + RA
Sbjct: 156 LSRSIELNKLPELK-LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRA 214

Query: 200 QYDAAQAQAKAGESHIQASRAQLE------------------------QAQAGVKQATAD 235
+ A+ E+ + +++L+ +A ++ +
Sbjct: 215 ERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQ 274

Query: 236 ADQARVAVEDTRLTSRIH------------------------------------------ 253
+Q + + ++
Sbjct: 275 LEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPV 334

Query: 254 -GRVGD-KTVQVGQFLGAGTRTMTIVPQQSLYLV-ANFKETQVGLMRPGQPVEIEVDALS 310
+V K G + M IVP+ V A + +G + GQ I+V+A
Sbjct: 335 SVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFP 394

Query: 311 GVK---LHGKVESLSPGTGSQFALLPPENATGNFTKVVQRVPVRIRVLAGDEARKVLVPG 367
+ L GKV++++ + G V+ + L G
Sbjct: 395 YTRYGYLVGKVKNINLDA-------IEDQRLGLVFNVIISIEENCLSTGNKNIP--LSSG 445

Query: 368 MSVEVTVDTRS 378
M+V + T
Sbjct: 446 MAVTAEIKTGM 456


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS08705  TCRTETB       101    3e-25    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 101 bits (254), Expect = 3e-25
Identities = 82/407 (20%), Positives = 165/407 (40%), Gaps = 20/407 (4%)

Query: 25 WLAVLAGTIGSFMATLDISIVNAALPTIQGEVGASGTEGTWISTAYLVAEIIMIPLTGWF 84
WL +L SF + L+ ++N +LP I + W++TA+++ I + G
Sbjct: 18 WLCIL-----SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKL 72

Query: 85 VRTLGLRNFLLICAVMFTAFSVVCGLSTS-LSMMIIGRVGQGLAGGALIPTALTIVATRL 143
LG++ LL ++ SV+ + S S++I+ R QG A + +VA +
Sbjct: 73 SDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYI 132

Query: 144 PPSQQTMGTALFGMTVIMGPVIGPLLGGWLTENVSWHYAFFINVPICVGLVALLLLGLKH 203
P + L G V MG +GP +GG + + W Y + +P+ + L+ L
Sbjct: 133 PKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSY--LLLIPMITIITVPFLMKLLK 190

Query: 204 EKGDWAGLLNADWLGIYGLTAGLGGLTVVLEEGQRERWFESSEINTLSLIALSGFIALVI 263
++ G D GI ++ G+ + +L F +S + ++++ F+ V
Sbjct: 191 KEVRIKGHF--DIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVK 238

Query: 264 SQFRRRPPVIRLSLLVQRSFGAVFIMVMAVGMILFGVMYMIPQFLAVISGYNTEQAGYVL 323
+ P + L F + + + G + M+P + + +T + G V+
Sbjct: 239 HIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVI 298

Query: 324 LLSGLPTVLLMPMMPKLLEMVDVRILVIAGLICFAAACFVNLTLTADTVGTHFVAGQLLQ 383
+ G +V++ + +L + V+ + F + F+ + +T +
Sbjct: 299 IFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFV 358

Query: 384 GCGLALAMMSLNQAAISSVPPELAGDASGLFNAGRNLGGSVGLALIS 430
GL+ ++ SS+ + AG L N L G+A++
Sbjct: 359 LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVG 405


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS08710  RTXTOXIND     41     9e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.0 bits (96), Expect = 9e-06
Identities = 32/217 (14%), Positives = 58/217 (26%), Gaps = 12/217 (5%)

Query: 73 TLTQLVTQALADSPNLRAAQARLRANRALAQRRRAERLPTLNASALYAYAEPPQTIVDTL 132
LT L +A QARL R R E L L
Sbjct: 126 KLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELN-KLPELKLPDEPYFQNV----- 179

Query: 133 GGLQQQGQAGQPPAAGNQALDLEKTQIYSAGFDASWELDVFGRRRRAAEGALAQAQ---A 189
++ + +K Q E R E +
Sbjct: 180 -SEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLD 238

Query: 190 SEAELADAQVQLAAEVGQVYLNYRGLQARLAIADANLDKIRQTLQLVQQRRGQGAASDLQ 249
+ L Q V + Y L + + L++I + ++ +
Sbjct: 239 DFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL-VTQLFK 297

Query: 250 VEQIATQVQQQQAQRLPLEMQSQEAQDQLALMVGRAP 286
+I +++Q L ++ + +++ V RAP
Sbjct: 298 -NEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAP 333


26  AXO1947_RS09335  AXO1947_RS09495  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS09335  0       12       3.022232   oligoribonuclease
AXO1947_RS09340  0       24       0.119834   mechanosensitive ion channel protein MscS
AXO1947_RS09345  2       24       -0.686876  phosphoenolpyruvate synthase
AXO1947_RS09350  2       22       -0.608265  phosphoenolpyruvate synthase regulatory protein
AXO1947_RS09355  2       20       -0.962092  hypothetical protein
AXO1947_RS09360  1       19       -1.331647  7,8-dihydro-8-oxoguanine triphosphatase
AXO1947_RS09365  1       16       -1.191988  3-hydroxybutyrate dehydrogenase
AXO1947_RS09370  -1      14       -0.645480  class III poly(R)-hydroxyalkanoic acid synthase
AXO1947_RS09375  -2      12       -0.231147  poly-beta-hydroxybutyrate polymerase
AXO1947_RS09385  -3      11       0.136769   stress-responsive transcriptional regulator
AXO1947_RS21665  1       12       0.658870   hypothetical protein
AXO1947_RS09400  1       13       0.372880   amino acid oxidase
AXO1947_RS09405  2       13       0.575475   hypothetical protein
AXO1947_RS09410  2       17       -0.558599  RNA-binding transcriptional accessory protein
AXO1947_RS09415  2       17       -1.475805  hypothetical protein
AXO1947_RS09420  5       20       -2.893008  histidine kinase
AXO1947_RS09425  4       18       -2.483184  response regulator
AXO1947_RS09430  1       15       -2.377042  *hypothetical protein
AXO1947_RS09435  1       14       -2.532870  **hypothetical protein
AXO1947_RS09440  -2      17       -2.631028  hypothetical protein
AXO1947_RS09445  -3      22       -1.780318  **cytochrome C biogenesis protein CcsA
AXO1947_RS09450  -2      21       -1.757436  cytochrome c
AXO1947_RS09455  -1      22       -1.298896  acriflavin resistance protein
AXO1947_RS09465  1       13       -0.844773  IS110 family transposase
AXO1947_RS21675  2       14       0.966240   2-dehydro-3-deoxy-phosphogluconate aldolase
AXO1947_RS09485  1       15       1.758964   phosphogluconate dehydratase
AXO1947_RS09490  2       15       1.864875   6-phosphogluconolactonase
AXO1947_RS09495  2       14       1.852532   glucokinase
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS09570  PHPHTRNFRASE  277    7e-86    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 277 bits (711), Expect = 7e-86
Identities = 139/574 (24%), Positives = 235/574 (40%), Gaps = 84/574 (14%)

Query: 260 KAIRMVYSDVPGERVRTEDTPVE---LRSTFSISDEDVQELSKQAL---------VIEKH 307
KA + +V E+ D E L + S E+++ + Q + H
Sbjct: 18 KAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTEASMGADKAEIFAAH 77

Query: 308 YGRPMDIEWAKDGVSGKLFIVQARPETVKSRSHATQIERFSLEAKDAKILVEGRAVGAKI 367
D E + GK+ Q E + F E+ D + + E RA A I
Sbjct: 78 LLVLDDPELVDG-IKGKIENEQMNAEYALKEVSDMFVSMF--ESMDNEYMKE-RA--ADI 131

Query: 368 GSGVARVVRSLDDMNRVQAGD-----VLIA-DMTDPDWEPVMK-RASAIVTNRGGRTCHA 420
RV+ L + V+IA D+T D + K T+ GGRT H+
Sbjct: 132 RDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSHS 191

Query: 421 AIIARELGVPAVVGSGNATDVLSDGQEVTVSCAEG---------DTGFIYEGLLPFERTT 471
AI++R L +PAVVG+ T+ + G V V EG + E FE+
Sbjct: 192 AIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYEEKRAAFEKQK 251

Query: 472 TDLGNMPPAP--------LKIMMNVANPERAFDFGQLPNAGIGLARLEMIIAAHIGIHPN 523
+ + P +++ N+ P+ GIGL R E + +
Sbjct: 252 QEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFLYMDR-----D 306

Query: 524 ALLEYDKQDADVRKKIDAKIAGYGDPVSFYVNRLAEGIATLTASVAPNTVIVRLSDFKSN 583
L ++Q ++ + G PV ++R D +
Sbjct: 307 QLPTEEEQFEAYKEVVQRM---DGKPV-----------------------VIRTLDIGGD 340

Query: 584 EYANLIGGSRYEPHEENPMIGFRGASRYVDPSFTKAFALECKAVLKVRNEMGLDNLWVMI 643
+ + + P E NP +GFR ++ F + +A+L+ NL VM
Sbjct: 341 KELSYL----QLPKELNPFLGFRAIRLCLE--KQDIFRTQLRALLRAS---TYGNLKVMF 391

Query: 644 PFVRTIEEGRKVIEVLEQNGLKQ-GDGADGKPGLKIIMMCELPSNALLADEFLDIFDGFS 702
P + T+EE R+ ++++ K +G D +++ +M E+PS A+ A+ F D FS
Sbjct: 392 PMIATLEELRQAKAIMQEEKDKLLSEGVDVSDSIEVGIMVEIPSTAVAANLFAKEVDFFS 451

Query: 703 IGSNDLTQLTLGLDRDSSIVAHLFDERNPAVKKLLSMAIKSARAKGKYVGICGQGPSDHP 762
IG+NDL Q T+ DR + V++L+ +PA+ +L+ M IK+A ++GK+VG+CG+ D
Sbjct: 452 IGTNDLIQYTMAADRMNERVSYLYQPYHPAILRLVDMVIKAAHSEGKWVGMCGEMAGD-E 510

Query: 763 ELAEWLMQEGIGSVSLNPDTVVDTWLRLAKLKSE 796
L+ G+ S++ +++ +L KL E
Sbjct: 511 VAIPLLLGLGLDEFSMSATSILPARSQLLKLSKE 544


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS09575  CLENTEROTOXN  32     0.003    Clostridium enterotoxin signature.
		>CLENTEROTOXN#Clostridium enterotoxin signature.

Length = 319

Score = 31.6 bits (71), Expect = 0.003
Identities = 14/64 (21%), Positives = 21/64 (32%), Gaps = 2/64 (3%)

Query: 3 TIRPVFYVSDGTGITAETIGHSLLTQF--SGFNFVTDRMSFIDDAEKARDAAMRVRAAGE 60
+ V+ G T+E I S+ F + T S A +V A
Sbjct: 78 SKEVSINVNFSVGFTSEFIQASVEYGFGITIGEQNTIERSVSTTAGPNEYVYYKVYATYR 137

Query: 61 RYQV 64
+YQ
Sbjct: 138 KYQA 141


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS09580  BACTRLTOXIN   28     0.012    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 28.3 bits (63), Expect = 0.012
Identities = 7/30 (23%), Positives = 14/30 (46%)

Query: 73 YDLCDPVTGEPDPSAYVRLYRDARQAETTH 102
YD+ + D S Y+ +Y D + ++
Sbjct: 225 YDMMPAPGDKFDQSKYLMMYNDNKTVDSKS 254


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS09590  DHBDHDRGNASE  105    1e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 105 bits (264), Expect = 1e-29
Identities = 73/255 (28%), Positives = 109/255 (42%), Gaps = 11/255 (4%)

Query: 2 RSILITGAGSGIGAGIATQLATDGHHLIVSDMELPAAERTAHALRQAGGSAEALALDVTD 61
+ ITGA GIG +A LA+ G H+ D E+ +L+ AEA DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 62 ADSIAQALASASRTPQ---VLVNNAGLQHVAALDEFPMRQWALLVDVMLTGAARLSRAVL 118
+ +I + A R +LVN AG+ + +W V TG SR+V
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 119 PGMRAAGYGRIVNIGSIHSLVASPYKSAYVAAKHGLVGLAKVIALETADCDITVNTLCPS 178
M G IV +GS + V +AY ++K V K + LE A+ +I N + P
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 179 YVRTPLVERQIADQARTSGIAEEAVIRDVMLK---PMPKGAFIDYDELAGTVAFLMSHAA 235
T + AD+ E VI+ + +P ++A V FL+S A
Sbjct: 189 STETDMQWSLWADEN-----GAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 236 RNITGQSIAIDGGWT 250
+IT ++ +DGG T
Sbjct: 244 GHITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS09600  RTXTOXIND     38     9e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 37.5 bits (87), Expect = 9e-05
Identities = 36/189 (19%), Positives = 60/189 (31%), Gaps = 19/189 (10%)

Query: 140 QEAAQTLQKWREENA-PWLDMPAFGLNRN----HQSRLQKLARAQ----QDFQAQSEAYG 190
Q Q L + E N P L +P +N RL L + Q Q+ + Q E
Sbjct: 150 QTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNL 209

Query: 191 EQLKAAIEQAFARFASKLSEHESSGSQLTSARALFD------LWIEAAEESYADVALSNQ 244
++ +A AR + S+L +L + E Y +
Sbjct: 210 DKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEA---VN 266

Query: 245 FREVYGGFANAHMRLRAALQEEIEQLSERIGMPTRSEMDAAHRRIAELE-RLVRRMLRTA 303
VY + +EE + +++ ++ I L L + R
Sbjct: 267 ELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQ 326

Query: 304 ASPARKPAA 312
AS R P +
Sbjct: 327 ASVIRAPVS 335


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS09635  HTHFIS        80     2e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.9 bits (197), Expect = 2e-17
Identities = 31/130 (23%), Positives = 60/130 (46%), Gaps = 4/130 (3%)

Query: 1002 RLLLVDDDQDSREAVMQFLTLAGAQVQAAGSVDAAEQCLANAHFDVLVSDIAMPLRDGYD 1061
+L+ DDD R + Q L+ AG V+ + + +A D++V+D+ MP + +D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1062 LIRTVRSGRADLPRQIPAIALTAYVREEDRDRAVVAGFDAHMGKPVEPPGLVDLIERLIL 1121
L+ ++ R DLP + ++A +A G ++ KP + L+ +I R +
Sbjct: 65 LLPRIKKARPDLPV----LVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 1122 PTRAVRDAVE 1131
+ +E
Sbjct: 121 EPKRRPSKLE 130


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS09695  ACRIFLAVINRP  634    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 634 bits (1637), Expect = 0.0
Identities = 257/1120 (22%), Positives = 468/1120 (41%), Gaps = 132/1120 (11%)

Query: 1 MLLFGLIALRSLKVNLLPDLSYPTLTVRTEYTGAAPAEIETLVTEPVEEAVGVVKNLRKL 60
+++ G +A+ L V P ++ P ++V Y GA ++ VT+ +E+ + + NL +
Sbjct: 19 LMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVIEQNMNGIDNLMYM 78

Query: 61 KSIS-RTGQSDVVLEFAWGTNMDQASLEVRDKMEAL--SLPLETRPPVLLRFNPSTEPIM 117
S S G + L F GT+ D A ++V++K++ LP E + + S+ +M
Sbjct: 79 SSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQQGISVEKSSSSYLM 138

Query: 118 RLALSPKQAPASDTDAIRQLTGLRRYADEDLKKKLEPVAGVAAVKVGGGLEDEIQVDIDQ 177
+ + Y ++K L + GV V++ G + +++ +D
Sbjct: 139 VAGFV-------SDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA-QYAMRIWLDA 190

Query: 178 QKLAQLNLPIDNVITRLKEENVNISGGRL------EEGSQRYLVRTVNQFVDLDEIRNML 231
L + L +VI +LK +N I+ G+L + +F + +E +
Sbjct: 191 DLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEEFGKVT 250

Query: 232 VTTQSSSGSAAEAAMLQMYAIAASTGSQAALAAAAEVQSTSSSSSSSIAGGMPVRLKDVA 291
+ S G VRLKDVA
Sbjct: 251 LRVNSD--------------------------------------------GSVVRLKDVA 266

Query: 292 QVSQGYKEREAIIRLGGKEAVELAIYKEGDANTVSTAAALRKRLQQLKATVPGDVEITTI 351
+V G + I R+ GK A L I AN + TA A++ +L +L+ P +++
Sbjct: 267 RVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYP 326

Query: 352 EDQSHFIEHAISDVKKDAVIGGVLAILIIFLFMRDGWSTFVISLSLPVSIITTF----FL 407
D + F++ +I +V K +L L+++LF+++ +T + ++++PV ++ TF
Sbjct: 327 YDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAF 386

Query: 408 GLSLNVMSLGGLALATGLGVDDSIVVLESIAKA-RERGLSVLDAAIAGTREVSMAVMAST 466
G S+N +++ G+ LA GL VDD+IVV+E++ + E L +A ++ A++
Sbjct: 387 GYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIA 446

Query: 467 LTTIAVFVPLVFVEGIAGQLFRDQALTVAIAIAISLVVSMTLIPMLSSLKGAPPMAFPDE 526
+ AVF+P+ F G G ++R ++T+ A+A+S++V++ L P L +
Sbjct: 447 MVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCA------------ 494

Query: 527 PSHPQWQPERRWLKPVAAGRRGAGASVRYAFFGAAWAVVKVWRGLSRVVGPVMRKASDLA 586
LKPV+A + FFG
Sbjct: 495 ----------TLLKPVSAEHHEN----KGGFFGWFNTT---------------------- 518

Query: 587 MAPYARAERGYLAMLPAALRRPGLVLGLAAAAFIGTVFLVPMLGADLIPQLAQDRFEMTV 646
+ + Y + L G L + A G V L L + +P+ Q F +
Sbjct: 519 ---FDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMI 575

Query: 647 KLPSGTPLAQTDALVRELQ--LAHDKDPGVASLYGVSGSGTRLDANPTESGENIGKLTVV 704
+LP+G +T ++ ++ ++ V S++ V+G +G L
Sbjct: 576 QLPAGATQERTQKVLDQVTDYYLKNEKANVESVFTVNGFSF--SGQAQNAGMAFVSLKPW 633

Query: 705 MAGCGSPAVEAAATERLRSSMVGHPSAQV-DFARPALFSF--STPLEVEL---RGQDLSE 758
G A R + + V F PA+ +T + EL G
Sbjct: 634 EERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDA 693

Query: 759 LERAGQKLAAMLRAN-GHYADVKSTGEEGFPEIQIRFDQERAGALGLTTRQIADVIVKKV 817
L +A +L M + V+ G E + ++ DQE+A ALG++ I I +
Sbjct: 694 LTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTAL 753

Query: 818 RGDVATRYSFRDRKIDVLVRAQQSDRASVDAIRQLIVNPGSSRPVRLAAVAKVLAATGPS 877
G + R R + V+A R + + +L V + V +A G
Sbjct: 754 GGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSP 813

Query: 878 QIHRADQTRVAIVSASL-KDIDLGGAVREVETMVRKAPLAAGVGMHIGGQGEELAQSVKS 936
++ R + + G A+ +E + K P AG+G G + S
Sbjct: 814 RLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASKLP--AGIGYDWTGMSYQERLSGNQ 871

Query: 937 LLFAFGLAIFLVYLVMASQFESLLHPFVILFTIPLAMVGAVLALLMTGKPISVVVFIGLI 996
++ +V+L +A+ +ES P ++ +PL +VG +LA + + V +GL+
Sbjct: 872 APALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLL 931

Query: 997 LLVGLVTKNAIILIDKVNQLRE-DGVPKREALIEGARSRLRPIIMTTLCTLFGFLPLAVA 1055
+GL KNAI++++ L E +G EA + R RLRPI+MT+L + G LPLA++
Sbjct: 932 TTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAIS 991

Query: 1056 MGEGAEVRAPMAITVIGGLLVSTLLTLLVIPVVYDLLDRR 1095
G G+ + + I V+GG++ +TLL + +PV + ++ R
Sbjct: 992 NGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS09705  OMADHESIN     30     0.013    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 29.9 bits (66), Expect = 0.013
Identities = 16/36 (44%), Positives = 22/36 (61%)

Query: 161 RQLQQQVAQLDARIAQQLAARAELQVLRQVTGVGPV 196
RQL ++ +LD R+ + LA+ A L L Q GVG V
Sbjct: 369 RQLDNRLDKLDTRVDKGLASSAALNSLFQPYGVGKV 404


27  AXO1947_RS09945  AXO1947_RS10185  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS09945  -1      21       -3.231858  DNA-binding response regulator
AXO1947_RS09950  -1      24       -3.212919  hypothetical protein
AXO1947_RS09955  -1      21       -2.890834  hypothetical protein
AXO1947_RS09960  0       14       -2.810885  hypothetical protein
AXO1947_RS21735  0       12       -3.186760  TetR family transcriptional regulator
AXO1947_RS21745  -1      12       -1.903771  RND transporter
AXO1947_RS09975  -2      10       -0.662731  multidrug efflux RND transporter permease
AXO1947_RS09985  -2      8        0.675981   metal-dependent hydrolase
AXO1947_RS09995  -1      8        -0.329965  mbth-like protein
AXO1947_RS10005  -3      11       -1.671987  ABC transporter ATP-binding protein
AXO1947_RS10010  -3      11       -2.122997  enoyl-CoA hydratase
AXO1947_RS10015  -1      24       -4.602913  GntR family transcriptional regulator
AXO1947_RS10020  -1      21       -3.593985  enoyl-CoA hydratase
AXO1947_RS10025  0       23       -4.417293  enoyl-CoA hydratase
AXO1947_RS10030  0       15       -2.767417  stilbene synthase
AXO1947_RS10035  0       15       -1.381130  hypothetical protein
AXO1947_RS10040  0       13       -0.212320  non-ribosomal peptide synthetase
AXO1947_RS10045  -2      10       1.071926   hypothetical protein
AXO1947_RS10050  -2      11       2.071912   hypothetical protein
AXO1947_RS10055  -1      11       3.277153   hypothetical protein
AXO1947_RS10060  1       12       3.960706   non-ribosomal peptide synthetase
AXO1947_RS10065  0       12       3.949313   non-ribosomal peptide synthetase
AXO1947_RS10070  1       11       4.413699   hypothetical protein
AXO1947_RS10075  1       11       4.831854   hypothetical protein
AXO1947_RS10080  1       11       4.438538   transcriptional activator feaR
AXO1947_RS10090  2       12       3.348089   IS5/IS1182 family transposase
AXO1947_RS10095  1       14       2.936444   avirulence protein
AXO1947_RS10105  1       14       1.172821   hypothetical protein
AXO1947_RS10115  0       19       -0.189882  avirulence protein
AXO1947_RS10120  2       23       0.524251   serine protease
AXO1947_RS10130  4       23       0.315576   cation transporter
AXO1947_RS10135  2       23       0.407263   CusA/CzcA family heavy metal efflux RND
AXO1947_RS10140  2       22       0.926595   histidine kinase
AXO1947_RS10145  2       19       1.091577   ATPase
AXO1947_RS10150  0       16       0.748396   DNA-binding response regulator
AXO1947_RS10155  -1      13       0.420054   hypothetical protein
AXO1947_RS10160  -2      10       0.175583   GGDEF domain-containing protein
AXO1947_RS10165  -3      8        -0.216569  peptidase
AXO1947_RS10170  -1      10       -0.334743  YciE/YciF family protein
AXO1947_RS10175  -1      10       -1.008479  IS5/IS1182 family transposase
AXO1947_RS10180  0       11       -2.551065  stress-induced protein
AXO1947_RS21755  2       13       -3.494064  hypothetical protein
AXO1947_RS10185  3       12       -3.769758  nitrate transport ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10125  HTHFIS        76     6e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 6e-18
Identities = 36/117 (30%), Positives = 61/117 (52%), Gaps = 4/117 (3%)

Query: 2 LVVDDDQAMAQVVMGHIRSHGMEAFVATNSSELAEALRRREPDILLLDLMLKHEDGLDLL 61
LV DDD A+ V+ + G + + +N++ L + + D+++ D+++ E+ DLL
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLL 66

Query: 62 RALRKE-SDIPVIIMTGHRRDEIDRVV-GLELGADDYLPKPFGLHELTARIRAVLRR 116
++K D+PV++M+ + + E GA DYLPKPF L EL I L
Sbjct: 67 PRIKKARPDLPVLVMSAQ--NTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10170  HTHTETR       35     7e-05    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 35.4 bits (81), Expect = 7e-05
Identities = 13/91 (14%), Positives = 39/91 (42%), Gaps = 4/91 (4%)

Query: 2 RQDDQRLIRLLAATLTRRPRSNLT--ELAAGAGISRATLYRFAPTRAAIVEKVTAEAWVR 59
++ Q ++ + +++ S+ + E+A AG++R +Y ++ + ++ +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 60 LQAALPG--GDASPDPMARLRRMTHALVEDL 88
+ DP++ LR + ++E
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLEST 100


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10175  RTXTOXIND     46     1e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 46.4 bits (110), Expect = 1e-07
Identities = 40/213 (18%), Positives = 70/213 (32%), Gaps = 28/213 (13%)

Query: 100 EAALARARGELTRSEAELENATAQFERSQQLVQRQVISRQDFDT-ARSNFKSTQAAVASA 158
E A EL +++LE ++ +++ + Q F + T +
Sbjct: 258 ENKYVEAVNELRVYKSQLEQIESEILSAKE---EYQLVTQLFKNEILDKLRQTTDNIGLL 314

Query: 159 RAALKTAQLDLGFATVRAPIDGRIGRALV-TEGALVGQGGDATEMALVQQLDPIFADFNR 217
L + + +RAP+ ++ + V TEG +V T M +V + D +
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA--ETLMVIVPEDDTLEVTALV 372

Query: 218 PVAEALKLRGRARKGDAPLKVVIDIPELGETREGDL------LFADMRVDETTDTV--SL 269
K G G +I + TR G L + D D+ V +
Sbjct: 373 QN----KDIGFINVG---QNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVI 425

Query: 270 RAQ------FDNRDNLLLPGMFVRVRTPNGTAS 296
+ N++ L GM V G S
Sbjct: 426 ISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRS 458



Score = 40.6 bits (95), Expect = 8e-06
Identities = 22/133 (16%), Positives = 46/133 (34%), Gaps = 8/133 (6%)

Query: 55 PGRVSPM-RVAQVRARVAGIVLARRFEEGSDVKAGQVLFQIDPAPFEAALARARGELTRS 113
G+++ R +++ IV +EG V+ G VL ++ EA + + L ++
Sbjct: 87 NGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQA 146

Query: 114 EAELENATAQFERSQQLVQRQVISRQDFDTARS----NFKSTQAAVASARAALK--TAQL 167
E RS +L + + D ++ + + + + Q
Sbjct: 147 RLEQTRYQIL-SRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK 205

Query: 168 DLGFATVRAPIDG 180
+L RA
Sbjct: 206 ELNLDKKRAERLT 218


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10180  ACRIFLAVINRP  1072   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1072 bits (2774), Expect = 0.0
Identities = 503/1030 (48%), Positives = 696/1030 (67%), Gaps = 10/1030 (0%)

Query: 1 MSRFFIDRPNFAWVVAIFISLAGVLALRTLPVEKYPEVAPPQISIMATYPGASAQVVNDA 60
M+ FFI RP FAWV+AI + +AG LA+ LPV +YP +APP +S+ A YPGA AQ V D
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTSVIEQELNGVRDMLYYDSSS-SNGSAQITIMFQPGTDPNIAQVDVQNRIRQSESRLPA 119
VT VIEQ +NG+ +++Y S+S S GS IT+ FQ GTDP+IAQV VQN+++ + LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 120 AVTQLGLQVEQTTAGFLMLYSLVYKDATAAQDVVRLNDYAARVVNDEIRRVPGVGRVQFF 179
V Q G+ VE++++ +LM+ V + QD ++DY A V D + R+ GVG VQ F
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQD--DISDYVASNVKDTLSRLNGVGDVQLF 178

Query: 180 GAEAAMRVWVDTQALRGYGLSIVDVNNAIRAQNLQVAAGSLGERPGAQDQELTTTLVVRG 239
GA+ AMR+W+D L Y L+ VDV N ++ QN Q+AAG LG P Q+L +++ +
Sbjct: 179 GAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 240 QMESPQEFGQIVLRAQANGAVVHLSDVAKLELGLENYQFDVQENGGPAAGAAVQLAPGGN 299
+ ++P+EFG++ LR ++G+VV L DVA++ELG ENY + NG PAAG ++LA G N
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 300 AVATVAAVRKRLQELSQSFPADIAYSVPFDSSTFVNVAIKKVLHTLLEAMALVFLVMFVF 359
A+ T A++ +L EL FP + P+D++ FV ++I +V+ TL EA+ LVFLVM++F
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 360 LQNIRYTLIPAIVVPVCLLGTFAVMKLLGFSVNMMSMFAMVLAIGILVDDAIVVVENVER 419
LQN+R TLIP I VPV LLGTFA++ G+S+N ++MF MVLAIG+LVDDAIVVVENVER
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 420 LMADEGLLPRDASMKAMTQVGGAIVGITLVLTAVFLPLAFMSGSVGVIYRQFSAVLAVSI 479
+M ++ L P++A+ K+M+Q+ GA+VGI +VL+AVF+P+AF GS G IYRQFS + ++
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 480 LFSGFLALTMTPALCATCLAPI--DGHQEKKGFFGWFDRNFNALTSRFDRLNHRLVHRAG 537
S +AL +TPALCAT L P+ + H+ K GFFGWF+ F+ + + +++ G
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 538 RCMLVYAVLLGVLGLAYVRLPEAFVPQEDEGYMIVDMQLPPGASYSRTRAVGQQVNDYL- 596
R +L+YA+++ + + ++RLP +F+P+ED+G + +QLP GA+ RT+ V QV DY
Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598

Query: 597 -AARPSMQDVTLVYGFSFSGSGANAAMAFPSLKDWSER-GDSESVANEVAAANVALGRIS 654
+ +++ V V GFSFSG NA MAF SLK W ER GD S + A + LG+I
Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 655 DVTIMAVMPPPIEGLGNSGGFSLRVQDRGNLGRDALMQAVNQLLRAANQSP-KLAYAMVE 713
D ++ P I LG + GF + D+ LG DAL QA NQLL A Q P L
Sbjct: 659 DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 714 GLADAPQLRLEVDRGKAEALGVSFQSAMDVLSSAFGSTIVNDFVNRGRLQRVVVQGAAGD 773
GL D Q +LEVD+ KA+ALGVS +S+A G T VNDF++RGR++++ VQ A
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 774 RATPQSLDTLHVTSSTGRQVPLTAFTTQRWEQGPVQIARYNGYASVNLTGEAAPGISSGD 833
R P+ +D L+V S+ G VP +AFTT W G ++ RYNG S+ + GEAAPG SSGD
Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838

Query: 834 ALAEMERLAAALPQGIGYAWSALSYQEKAAGTQAPMLLGLALLVVFLLLVALYESWAIPF 893
A+A ME LA+ LP GIGY W+ +SYQE+ +G QAP L+ ++ +VVFL L ALYESW+IP
Sbjct: 839 AMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPV 898

Query: 894 SVMLIVPIGAVGATAAVWVAGLSNDVYFKVGLITIIGLAAKNAILIVEFAKELHAR-GAR 952
SVML+VP+G VG A + NDVYF VGL+T IGL+AKNAILIVEFAK+L + G
Sbjct: 899 SVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKG 958

Query: 953 VPEAAMQAARLRFRPIVMTSLAFILGVIPLVIARGAGAASQNALGTGVIGGMLAASTLGV 1012
V EA + A R+R RPI+MTSLAFILGV+PL I+ GAG+ +QNA+G GV+GGM++A+ L +
Sbjct: 959 VVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAI 1018

Query: 1013 VFTPIFFTWV 1022
F P+FF +
Sbjct: 1019 FFVPVFFVVI 1028



Score = 61.4 bits (149), Expect = 1e-11
Identities = 85/509 (16%), Positives = 173/509 (33%), Gaps = 48/509 (9%)

Query: 540 MLVYAVLLGVLGL-AYVRLPEAFVPQEDEGYMIVDMQLPPGASYSRTRAVGQQVNDYL-A 597
V A++L + G A ++LP A P + V P GA + V V +
Sbjct: 12 AWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYP-GAD---AQTVQDTVTQVIEQ 67

Query: 598 ARPSMQDVTLVYGFSFSGSGANAAMAFPSLKDWSERGDSESVANEVAAANVALGRISDVT 657
+ ++ + S S + F D + +V L + +
Sbjct: 68 NMNGIDNLMYMSSTSDSAGSVTITLTFQ------SGTDPDIAQVQVQNK---LQLATPLL 118

Query: 658 IMAVMPPPIEGLGNSGGF---SLRVQDRGNLGRDALMQAVNQLLRAANQSPKLAYAMVEG 714
V I +S + + V D +D + V ++ L+ + G
Sbjct: 119 PQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVK-----DTLS--RLNG 171

Query: 715 LADAP------QLRLEVDRGKAEALGVSFQSAMDVLSS-----AFGSTIVNDFVNRGRLQ 763
+ D +R+ +D ++ ++ L A G + +L
Sbjct: 172 VGDVQLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLN 231

Query: 764 RVVVQGAAGDRATPQSLDTLHVTSST-GRQVPLTAFTTQRW-EQGPVQIARYNGYASVNL 821
++ A P+ + + ++ G V L + IAR NG + L
Sbjct: 232 ASII--AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGL 289

Query: 822 TGEAAPGISSGDAL----AEMERLAAALPQG--IGYAWSALSYQEKAAGTQAPMLLGLAL 875
+ A G ++ D A++ L PQG + Y + + + + L +
Sbjct: 290 GIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIM 349

Query: 876 LVVFLLLVALYESWAIPFSVMLIVPIGAVGATAAVWVAGLSNDVYFKVGLITIIGLAAKN 935
LV ++ + L ++ + VP+ +G A + G S + G++ IGL +
Sbjct: 350 LVFLVMYLFL-QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDD 408

Query: 936 AILIVE-FAKELHARGARVPEAAMQAARLRFRPIVMTSLAFILGVIPLVIARGAGAASQN 994
AI++VE + + EA ++ +V ++ IP+ G+ A
Sbjct: 409 AIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYR 468

Query: 995 ALGTGVIGGMLAASTLGVVFTPIFFTWVM 1023
++ M + + ++ TP ++
Sbjct: 469 QFSITIVSAMALSVLVALILTPALCATLL 497


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10265  ISCHRISMTASE  32     0.015    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 31.5 bits (71), Expect = 0.015
Identities = 22/101 (21%), Positives = 38/101 (37%), Gaps = 2/101 (1%)

Query: 736 KLDRAALPAPDTQALDLHSYVTPQGELEQMLATLWSGLLGAAQVGRNDDFFALGGHSLLA 795
+L A T A V + + +A L + +D G S+
Sbjct: 209 QLQNAPADVQKTSANTGKKNVFTCENIRKQIAELLQ--ETPEDITDQEDLLDRGLDSVRI 266

Query: 796 VKLIERLRRLGWQIDVRALFARPTLAGLAENLQAASTNMVP 836
+ L+E+ RR G ++ L RPT+ + L S ++P
Sbjct: 267 MTLVEQWRREGAEVTFVELAERPTIEEWQKLLTTRSQQVLP 307


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10325  SUBTILISIN    121    4e-32    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 121 bits (305), Expect = 4e-32
Identities = 72/325 (22%), Positives = 120/325 (36%), Gaps = 43/325 (13%)

Query: 78 NADLAQQAGAKGKGVKLAVLDDNLVQSYTPISGKVDSFNDYTASPGTPESSANALRGHGT 137
A +G+GVK+AVLD + + ++ ++T GHGT
Sbjct: 30 QAPAVWNQT-RGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGHGT 88

Query: 138 IVSALVLGSAQDGFAGGVAPDADLFYARICAENSCGTQATRRAAVDLAAA-GVRIANLSI 196
V+ + + + GVAP+ADL ++ + G + A V I ++S+
Sbjct: 89 HVAGTIAATENENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSL 148

Query: 197 GASYPDAAASANAALAWKYALTPLVQADALIVASTGNEGAAEAS-----YPAATPVQEAS 251
G A K A V + L++ + GNEG + YP
Sbjct: 149 GGPEDVPELHE----AVKKA----VASQILVMCAAGNEGDGDDRTDELGYPGCYN----- 195

Query: 252 VRNNWLAVGAINIDSAGNAAGLTSYSNHCGAAAQWCLVAPGSYTVPALAGSELGGQIAGT 311
++VGAIN D + +SN + LVAPG + + G + +GT
Sbjct: 196 ---EVISVGAINFDR-----HASEFSNSN---NEVDLVAPGEDILSTVPGGKY-ATFSGT 243

Query: 312 SFSTAAVSGVAAQVLGVYPW-----MTASQLQQTLLTTATDLGDPGVDALYGWGLVNAAK 366
S +T V+G A + + +T +L L+ LG + G GL+
Sbjct: 244 SMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLG--NSPKMEGNGLLYLTA 301

Query: 367 AIKGPGQFASNWAVNVTSGYDSTFS 391
+ + + +G ST S
Sbjct: 302 V----EELSRIFDTQRVAGILSTAS 322


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10330  RTXTOXIND     56     9e-11    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 56.0 bits (135), Expect = 9e-11
Identities = 38/205 (18%), Positives = 72/205 (35%), Gaps = 28/205 (13%)

Query: 114 SAELATAYSDAGKARAMLQQARLELARQKVLAADSIAAARDLQAAQQAFDSAQNDARAAS 173
EL S + + + A+ E L + I L+ +
Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILD--KLRQTTDNIGLLTLELAKNE 322

Query: 174 DRLAQLGVAAQATSHRRYVLRAPIAGRVVDLSAALGGFWNDTSAPLMTVADISQV-WLTA 232
+R + V+RAP++ +V L G T+ LM + +TA
Sbjct: 323 ERQ------------QASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTA 370

Query: 233 SVPEREIGQVFEGQQVTASLDAYPGQ---HFTGLVQHL--DDLLDPTTRTL-KVRVALNN 286
V ++IG + GQ ++A+P + G V+++ D + D + V +++
Sbjct: 371 LVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEE 430

Query: 287 HDGL-------LKPGMFARAQFQTR 304
+ L GM A+ +T
Sbjct: 431 NCLSTGNKNIPLSSGMAVTAEIKTG 455



Score = 36.0 bits (83), Expect = 2e-04
Identities = 27/136 (19%), Positives = 46/136 (33%), Gaps = 9/136 (6%)

Query: 76 VLPERLVRVVPPLAGRVVALPKTLGDTVRAGDVLCVLDSAELATAYSDAGKARAMLQQAR 135
R + P V + G++VR GDVL L + A +D K ++ L QAR
Sbjct: 91 THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTA---LGAEADTLKTQSSLLQAR 147

Query: 136 LELARQKVLAADSIAAARDLQAAQQAFDSAQNDARAASDRLAQLGVAAQATSHRR---YV 192
LE R S + + + D + + L + + S + Y
Sbjct: 148 LEQTR---YQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQ 204

Query: 193 LRAPIAGRVVDLSAAL 208
+ + + L
Sbjct: 205 KELNLDKKRAERLTVL 220


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10335  ACRIFLAVINRP  639    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 639 bits (1651), Expect = 0.0
Identities = 233/1034 (22%), Positives = 426/1034 (41%), Gaps = 43/1034 (4%)

Query: 11 QRRGIVWLVFVLIALYGTWSWTQLPVEAYPDIADVTSQVVTQVPGLGAEEVEQQITVPLE 70
+R W++ +++ + G + QLPV YP IA V PG A+ V+ +T +E
Sbjct: 7 RRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVIE 66

Query: 71 RALMGTPGLHVLRSRSLFA-LSLITLVFDDGTEGYFARQRVLERIQAVT--LPYGA-IPG 126
+ + G L + S S A ITL F GT+ A+ +V ++Q T LP G
Sbjct: 67 QNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQQG 126

Query: 127 LDPYTSPTGEIYRYTLES--KTRSLRELSDLQFWTVIPRLQKVQGVADVTNFGGLTTQFS 184
+ S + + S + ++SD V L ++ GV DV FG
Sbjct: 127 ISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA-QYAMR 185

Query: 185 LALEPDRLTRYGVSLQQVKSAITSNNAD------GGGSVMDRGEQSYVIRGIGLLGSLQD 238
+ L+ D L +Y ++ V + + N GG + + + I + ++
Sbjct: 186 IWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEE 245

Query: 239 IGDVVV-SNSNGVPVLVKDLGEVRYDNVERRGILGKDKNPDTIEGIALLLKDSNPSVALQ 297
G V + NS+G V +KD+ V I + P L +N +
Sbjct: 246 FGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKP-AAGLGIKLATGANALDTAK 304

Query: 298 GIHSAVEELNNSLLPKDVKVVPYLDRTALIDATLHTVSATLTEGMLLVCVVLLIFLGSPR 357
I + + EL P+ +KV+ D T + ++H V TL E ++LV +V+ +FL + R
Sbjct: 305 AIKAKLAELQPFF-PQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMR 363

Query: 358 AAAIVSLTIPLSLLIAFIFMHHLKIPANLLSLG--AIDFGILVDGAVVLVENVLRLREEN 415
A I ++ +P+ LL F + N L++ + G+LVD A+V+VENV R+ E+
Sbjct: 364 ATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMED 423

Query: 416 SERALTAGDAIDATLHVARPIFFGMAVIGCAYLPLLAFERIEYKLFSPMAYAVGAALIGA 475
A + + + V+ ++P+ F ++ + + +A+ +
Sbjct: 424 KLPPKEA--TEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 476 LLVALMLIPALAWLAFRKPRKMMH-----------NRVLEALGQRYRALLERSVGRRGWL 524
+LVAL+L PAL KP H N + Y + + +G G
Sbjct: 482 VLVALILTPALCAT-LLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 525 LACAALALCVLAVLGGSIGRDFLPYIDEGSLWLQVQMPPGITLDKAARMANALRTATL-- 582
L AL + + VL + FLP D+G +Q+P G T ++ ++ + + L
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 583 EFPEVSYVVTQTGRNDDGTDYWTPSHIEASVGLRPYKDWPS-GMDKQGLIAALGARYAQM 641
E V V T G + G + A V L+P+++ + +I ++
Sbjct: 601 EKANVESVFTVNGFSFSGQ---AQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKI 657

Query: 642 PGYTVSMMQPMIDGVQDKLSGAHSDLTIKVFGDDLQQVRGVAEQVATALHAVPGA-ADIA 700
V +G +L I G + Q+ P + +
Sbjct: 658 RDGFVIPFNMPAIVELGTATGFDFEL-IDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 701 VDVEPPLPNLQVRFDREAAARYGINAADVSDLIATGIGGSPIGQMYIGEKSYDLTVRFPQ 760
+ ++ D+E A G++ +D++ I+T +GG+ + + L V+
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADA 776

Query: 761 RYRNDPQAIGALRLRTAAGAEIPLSAVASITTTSGQSVIVREMGRRNIIVRLNVRGRDLS 820
++R P+ + L +R+A G +P SA + G + R G ++ ++
Sbjct: 777 KFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAP---G 833

Query: 821 SFLSDAQATLVRHVRIDPQHMQLVWGGQFENLQRAQARLLVVLPTTLCIMFVLLFGAFGN 880
+ DA A + P + W G + + + ++ + ++F+ L + +
Sbjct: 834 TSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYES 893

Query: 881 LRQPTLVLAAVPLAMIGGLAALHLRGMTLNVSSAVGFIALFGVAVLNAVLMLAQINRLRQ 940
P V+ VPL ++G L A L +V VG + G++ NA+L++ L +
Sbjct: 894 WSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLME 953

Query: 941 DPGMSLREAVVAGAVSRMRPVLMTATVAALGLTPAMLAAGLGSDVQRPLATVVVGGLITA 1000
G + EA + R+RP+LMT+ LG+ P ++ G GS Q + V+GG+++A
Sbjct: 954 KEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSA 1013

Query: 1001 TALTLLLLPSLYYL 1014
T L + +P + +
Sbjct: 1014 TLLAIFFVPVFFVV 1027



Score = 82.6 bits (204), Expect = 5e-18
Identities = 62/351 (17%), Positives = 139/351 (39%), Gaps = 29/351 (8%)

Query: 682 VAEQVATALHAVPGAADIAVDVEPPLPNLQVRFDREAAARYGINAADVSDLIATG----I 737
VA V L + G D+ + +++ D + +Y + DV + +
Sbjct: 158 VASNVKDTLSRLNGVGDVQLFGAQYA--MRIWLDADLLNKYKLTPVDVINQLKVQNDQIA 215

Query: 738 GGSPIGQMYIGEKSYDLTVRFPQRYRNDPQAIGALRLRTAA-GAEIPLSAVASITTTS-G 795
G G + + + ++ R++N P+ G + LR + G+ + L VA +
Sbjct: 216 AGQLGGTPALPGQQLNASIIAQTRFKN-PEEFGKVTLRVNSDGSVVRLKDVARVELGGEN 274

Query: 796 QSVIVREMGRRNIIVRLNVRG--------RDLSSFLSDAQATLVRHVRID-PQHMQLVWG 846
+VI R G+ + + + + + + L++ Q + +++ P
Sbjct: 275 YNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQ 334

Query: 847 GQFENLQRA--QARLLVVLPTTLCIMFVLLFGAFGNLRQPTLVLAAVPLAMIGGLAALHL 904
+ + +A +LV L +M++ L N+R + AVP+ ++G A L
Sbjct: 335 LSIHEVVKTLFEAIMLVFL-----VMYLFL----QNMRATLIPTIAVPVVLLGTFAILAA 385

Query: 905 RGMTLNVSSAVGFIALFGVAVLNAVLMLAQINRLRQDPGMSLREAVVAGAVSRMRPVLMT 964
G ++N + G + G+ V +A++++ + R+ + + +EA ++
Sbjct: 386 FGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGI 445

Query: 965 ATVAALGLTPAMLAAGLGSDVQRPLATVVVGGLITATALTLLLLPSLYYLM 1015
A V + P G + R + +V + + + L+L P+L +
Sbjct: 446 AMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATL 496


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10350  HTHFIS        96     2e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 96.1 bits (239), Expect = 2e-25
Identities = 28/153 (18%), Positives = 61/153 (39%)

Query: 5 APVVYLIDDDASMRAALEDLFASVGLQVCAFGSTDQFLAHRLQDAPACLVLDIRMPGQSG 64
+ + DDDA++R L + G V + +V D+ MP ++
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 65 MEFHRRMVESGFALPTIFITGHGDIAMGVEAMKNGAIEFLTKPFRDQALLDAIQDGIRRD 124
+ R+ ++ LP + ++ ++A + GA ++L KPF L+ I +
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 125 RTRRQSDAVAAELRARWESLSSGEQDVTRLVVQ 157
+ R ++ S+ Q++ R++ +
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10355  FLAGELLIN     33     0.027    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 32.7 bits (74), Expect = 0.027
Identities = 48/325 (14%), Positives = 84/325 (25%), Gaps = 12/325 (3%)

Query: 518 GNNTYSGGTTLGAGSVLLETSGALGTGTVTAAGGSLDTTAPLSLTNNFALTNTLGLGPSG 577
++G L + + GA T+T +D + N +G
Sbjct: 127 NQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLK 186

Query: 578 NALTLSGTLAGVGGVNKTGAGTLTLGGLNTYSGGTNLASGTLQLGTASALGTGALNVTGA 637
++ + G + T + + L T A
Sbjct: 187 SSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTA 246

Query: 638 SNLSTTAPLTVANAISLAAALNLPSTQALTLTGAISGAGSLIKSGAGDLTLTNANAYTGG 697
+L T T A + A A + G G K + N G
Sbjct: 247 VDLFKTTKSTAGTAEAKAI--------AGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGK 298

Query: 698 TTLSAGRLVVGSNAALGTGTLTASGGELDATTATTLGNAMALTGTMGVGSSGNALNLTGT 757
+ + G L +TA +DA T + N N +
Sbjct: 299 VSTTIN----GEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAK 354

Query: 758 ISGAGALNKLGTGTLTLGGLNTYSGGTSLNAGTLQVASGTALGTGALDVTGAATLQNTAA 817
+S A N + + Y+ + + TL + T + T A
Sbjct: 355 LSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAK 414

Query: 818 ATLNNAVTLSTGTLTLDGAQALTLG 842
+ N + L+ A +LG
Sbjct: 415 KSTANPLASIDSALSKVDAVRSSLG 439


28  AXO1947_RS21765  AXO1947_RS10390  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS21765  4       18       0.522282   membrane protein
AXO1947_RS10285  9       26       1.609896   type IV pilus biogenesis/stability protein PilW
AXO1947_RS10290  10      26       2.069378   23S rRNA (adenine(2503)-C(2))-methyltransferase
AXO1947_RS21770  11      27       2.426597   nucleoside-diphosphate kinase
AXO1947_RS10300  9       23       2.646307   TetR family transcriptional regulator
AXO1947_RS10305  7       20       2.221742   3-hydroxyacyl-CoA dehydrogenase
AXO1947_RS10310  6       17       2.202211   acetyl-CoA acetyltransferase
AXO1947_RS21775  0       10       0.275089   voltage-gated chloride channel protein
AXO1947_RS10325  1       8        0.930528   hypothetical protein
AXO1947_RS21780  2       11       0.420062   fluoride ion transporter CrcB
AXO1947_RS10335  3       14       0.091575   hypothetical protein
AXO1947_RS10340  4       15       0.309214   recombination factor protein RarA
AXO1947_RS10345  4       15       -0.079429  hypothetical protein
AXO1947_RS10350  4       14       -0.257598  outer-membrane lipoprotein carrier protein
AXO1947_RS10355  4       14       -0.451970  transglutaminase
AXO1947_RS10360  -2      13       -1.824616  DNA translocase FtsK
AXO1947_RS10365  -2      18       -1.702408  thioredoxin-disulfide reductase
AXO1947_RS10370  -1      19       -1.685965  hypothetical protein
AXO1947_RS10375  -2      19       -1.395272  cinnamoyl-CoA reductase
AXO1947_RS10380  1       24       -0.890517  leucyl/phenylalanyl-tRNA--protein transferase
AXO1947_RS10385  2       20       1.895883   translation initiation factor IF-1
AXO1947_RS10390  2       20       1.893961   ATP-dependent Clp protease ATP-binding subunit
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10525  HTHTETR       65     2e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.4 bits (159), Expect = 2e-15
Identities = 29/142 (20%), Positives = 51/142 (35%), Gaps = 10/142 (7%)

Query: 9 TKDRILGAAEELFAQHGFAGTSLRQLTTQADVNIAAVNYHFGSKENLVNEVFRRRMDEMT 68
T+ IL A LF+Q G + TSL ++ A V A+ +HF K +L +E++ +
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71

Query: 69 TARLQQLEAAKKSQPGELTAVLAAFVEPALALAQDRQNGGAFVRVIAR-----AYAEKND 123
L+ PG+ +VL + L + + +I
Sbjct: 72 ELELEYQAK----FPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127

Query: 124 NL-RTFLSDHYGHVLREFGKAI 144
R + Y + + I
Sbjct: 128 QAQRNLCLESYDRIEQTLKHCI 149


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10565  RTXTOXIND     36     8e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 36.3 bits (84), Expect = 8e-04
Identities = 31/211 (14%), Positives = 59/211 (27%), Gaps = 23/211 (10%)

Query: 182 AHAQTLQAAREQLALRARRAANLSDSISVLTRERAALLQQLHGCNVQTDAVSAAMQDLQA 241
A Q++ Q L R LS SI + L + + NV + V ++
Sbjct: 134 ADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKE 193

Query: 242 NILSARAALHALPVDPQLRAVVTHASKSAANTHQSADSSPLQLTCTRLENALRDARARGD 301
+ + K + A+ + R EN R ++R D
Sbjct: 194 QFSTWQNQK---------------YQKELNLDKKRAERLTVLARINRYENLSRVEKSRLD 238

Query: 302 ALAKAVFFGRGSAQA--------AEAEQALARTNAQIYQLQSAYAQARDALQSDQASSSA 353
+ + + A EA L +Q+ Q++S A++ Q
Sbjct: 239 DFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN 298

Query: 354 SAPASRTLASHLAAEQSQEIQALRTQLARDE 384
+ + E+ +
Sbjct: 299 EILDKLRQTTDNIGLLTLELAKNEERQQASV 329


29  AXO1947_RS10800  AXO1947_RS10885  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS10800  -2      14       -3.218271  hypothetical protein
AXO1947_RS10805  -1      17       -4.351341  hypothetical protein
AXO1947_RS10810  -1      18       -4.299226  IS5/IS1182 family transposase
AXO1947_RS10815  0       21       -5.085990  hypothetical protein
AXO1947_RS10820  -2      12       -2.739220  TonB-dependent receptor
AXO1947_RS10825  -2      12       -2.585506  hypothetical protein
AXO1947_RS10830  -1      10       -1.667125  TonB-dependent siderophore receptor
AXO1947_RS10835  -1      9        -0.524477  hypothetical protein
AXO1947_RS10840  -1      10       0.264925   superoxide dismutase
AXO1947_RS10850  2       30       3.080771   TonB dependent receptor
AXO1947_RS10855  1       28       3.809472   TonB dependent receptor
AXO1947_RS10860  1       29       4.016006   flagellar motor protein
AXO1947_RS10865  1       28       3.462805   flagellar motor protein MotD
AXO1947_RS10870  1       27       2.851678   chromosome partitioning protein ParA
AXO1947_RS10875  2       24       1.823321   chemotaxis protein
AXO1947_RS10885  2       20       -0.447755  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS11095  OMPADOMAIN    71     5e-16    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 71.1 bits (174), Expect = 5e-16
Identities = 33/118 (27%), Positives = 47/118 (39%), Gaps = 16/118 (13%)

Query: 162 INSDILFGTGSAALAGNARTTLSTLASVLRD---APNGVRVEGYTDNQPIATAQFPSNWE 218
+ SD+LF A L + L L S L + V V GYTD I + + N
Sbjct: 217 LKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDR--IGSDAY--NQG 272

Query: 219 LSAARAASVVHLFADDGVAPQRLAMVGYGEFRARADNSTEAGRNA---------NRRV 267
LS RA SVV G+ +++ G GE N+ + + +RRV
Sbjct: 273 LSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRV 330


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS11105  FLGHOOKFLIK   32     0.003    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 32.1 bits (72), Expect = 0.003
Identities = 37/160 (23%), Positives = 54/160 (33%), Gaps = 16/160 (10%)

Query: 28 PPAAVAVETAALAHADLALAEPPVQAAALPEAAAVSEAATSNAIAAVLS--------ADA 79
P + V A A+ + + E P + A + A+AAV AD
Sbjct: 63 PLISDIVSDAQQANLLIPVDETPPVINDEQSTSTPLTTAQTMALAAVADKNTTKDEKADD 122

Query: 80 IAADFLAEMDADPAFGPPVVAAPSAADIAADFLAEMDADPAFGTAPTAVDLLTADLLAEM 139
+ D A + A A P P D + L PT LT++ L
Sbjct: 123 LNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPT--------EKPTLFTKLTSEQLTTA 174

Query: 140 DADPAFGLETAPVAVPAPAPAPKPEPHAAPAPMRAAPAPT 179
D A G P+ K E + P+P+ AA +P
Sbjct: 175 QPDDAPGTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPL 214


30  AXO1947_RS10955  AXO1947_RS11110  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS10955  2       24       1.233765   methyl-accepting chemotaxis protein
AXO1947_RS10965  -1      14       0.776864   pilus assembly protein PilZ
AXO1947_RS10970  -1      15       0.311357   chemotaxis protein CheW
AXO1947_RS21830  1       22       -2.385529  hypothetical protein
AXO1947_RS21835  1       20       -2.545954  methyl-accepting chemotaxis protein
AXO1947_RS10975  1       20       -2.642129  chemotaxis protein CheR
AXO1947_RS10980  0       21       -3.388168  chemoreceptor glutamine deamidase CheD
AXO1947_RS10985  0       20       -2.140602  chemotaxis response regulator protein-glutamate
AXO1947_RS21840  -1      27       -2.406255  bifunctional diguanylate
AXO1947_RS11000  1       27       -1.055163  2-succinyl-6-hydroxy-2,
AXO1947_RS11005  -1      26       -1.375235  aconitate hydratase B
AXO1947_RS11010  -2      27       -1.705142  DNA-binding protein
AXO1947_RS11020  1       13       -0.869997  AbrB family transcriptional regulator
AXO1947_RS11025  3       14       -0.597296  aconitate hydratase
AXO1947_RS11035  3       15       0.263611   long-chain fatty acid--CoA ligase
AXO1947_RS11045  4       17       1.217318   enoyl-CoA hydratase
AXO1947_RS21845  2       19       0.907434   hybrid sensor histidine kinase/response
AXO1947_RS11050  3       22       0.729228   hypothetical protein
AXO1947_RS11055  4       22       0.232717   two-component system response regulator
AXO1947_RS21850  3       25       0.184499   lysine--tRNA ligase
AXO1947_RS11065  2       21       -0.046287  peptide chain release factor 2
AXO1947_RS11070  2       18       -0.411404  LytTR family transcriptional regulator
AXO1947_RS21855  2       16       0.735721   membrane protein
AXO1947_RS21860  1       13       1.039963   single-stranded-DNA-specific exonuclease RecJ
AXO1947_RS21865  0       14       1.065326   phosphoglycerate mutase
AXO1947_RS11090  0       15       0.695121   transcription elongation factor GreA
AXO1947_RS11100  0       12       0.765776   carbamoyl phosphate synthase large subunit
AXO1947_RS11105  2       14       -0.865565  hypothetical protein
AXO1947_RS11110  2       14       -2.148745  carbamoyl-phosphate synthase small subunit
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS11220  PYOCINKILLER  31     0.019    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 31.3 bits (70), Expect = 0.019
Identities = 20/76 (26%), Positives = 29/76 (38%), Gaps = 2/76 (2%)

Query: 709 QNAALVEEATAAARSMEEQAGHLAEAVSVFKLDQSAAPVAQTARVRPIASRPAAVKGAAA 768
AA AAA EQA AEA + + A + + + V AA
Sbjct: 207 LTAAKASIEAAAANKAREQAA--AEAKRKAEEQARQQAAIRAANTYAMPANGSVVATAAG 264

Query: 769 KPLARAAATAARPAKS 784
+ L + A AA A++
Sbjct: 265 RGLIQVAQGAASLAQA 280


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS11235  HTHFIS        68     8e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 68.3 bits (167), Expect = 8e-15
Identities = 31/105 (29%), Positives = 53/105 (50%), Gaps = 5/105 (4%)

Query: 8 RVLIVDDSAVVRQMLTEILSRDAGIEVVGSAADPLLAREKIKRLNPDVITLDVEMPRMDG 67
+L+ DD A +R +L + LSR AG +V ++ I + D++ DV MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSR-AGYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 68 LVFLENLMRLRP-TPVVMISSLTERGADTTLQALSLGAVDFVSKP 111
L + + RP PV+++S+ T ++A GA D++ KP
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNT--FMTAIKASEKGAYDYLPKP 105


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS11285  HTHFIS        90     4e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 90.3 bits (224), Expect = 4e-21
Identities = 38/150 (25%), Positives = 67/150 (44%), Gaps = 5/150 (3%)

Query: 462 RMLVADDHEANRMVLQRLLEKAGHRVMCVNGAEQVLDAMADEDFDAVIVDLHMPGMSGLD 521
+LVADD A R VL + L +AG+ V + A + +A D D V+ D+ MP + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 522 MLKQLRVMQASGMRYTPVVVLSADVTPEAIRACEQAGARAFLAKPVVAAKLLDTVAELAV 581
+L +++ + PV+V+SA T + GA +L KP +L+ + A+
Sbjct: 65 LLPRIKKARP----DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII-GRAL 119

Query: 582 STRPLATQAPVVQAPTSFEGVLDASVLDEL 611
+ + V ++ + E+
Sbjct: 120 AEPKRRPSKLEDDSQDGMPLVGRSAAMQEI 149


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS11295  HTHFIS        84     6e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.5 bits (209), Expect = 6e-20
Identities = 36/161 (22%), Positives = 64/161 (39%), Gaps = 11/161 (6%)

Query: 30 IVIVDDQMSARTMLRHVIEDIAPELKVYDFGDPLDALAWCEAGRVDLLLLDYRMPGMDGL 89
I++ DD + RT+L + V + W AG DL++ D MP +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 90 EFARRLRRLPSHRDIPIILITIVGDEPIRQAALEAGVIDFLVKPIRPRELRARCSNLLQL 149
+ R+++ D+P+++++ A E G D+L KP EL L
Sbjct: 64 DLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 150 RQQSESVKQRALSLEQRLL---ASMNEVEERERETLSRLAR 187
++ S + L+ A+M E+ L+RL +
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEI----YRVLARLMQ 158


31  AXO1947_RS11870  AXO1947_RS11945  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS11870  2       15       2.798807   inorganic phosphate transporter
AXO1947_RS11875  2       16       3.160491   pit accessory protein
AXO1947_RS11880  4       16       1.679193   hemolysin
AXO1947_RS11890  3       22       1.056514   hypothetical protein
AXO1947_RS11895  3       24       0.573329   general stress protein
AXO1947_RS11900  3       26       -0.513892  LLM class flavin-dependent oxidoreductase
AXO1947_RS11905  2       26       -1.612323  hypothetical protein
AXO1947_RS21965  1       15       -1.107715  MFS transporter
AXO1947_RS11920  2       12       -1.680318  ABC transporter ATP-binding protein
AXO1947_RS11930  3       12       -1.318523  serine kinase
AXO1947_RS11940  3       9        -0.195892  Mg-protoporphyrin IX monomethyl ester oxidative
AXO1947_RS21970  3       10       0.279889   hypothetical protein
AXO1947_RS11945  2       12       0.055127   hexosyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS12120  TCRTETA       38     5e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 38.3 bits (89), Expect = 5e-05
Identities = 55/306 (17%), Positives = 83/306 (27%), Gaps = 65/306 (21%)

Query: 73 FGMSDQAAFAAATFLGLF-----VGAALLSPFADRFGRRPVFTFALIWYTAATVAMGLQS 127
S+ L L+ A +L +DRFGRRPV +L M
Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAP 94

Query: 128 TAAGVIVLRFVVGIGLGIELVTIDTYLSELVPRHMRGAAFAF---AFFVQFLAVPTVALT 184
+ + R V GI G Y++++ R F F F +A P +
Sbjct: 95 FLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGL 153

Query: 185 AWLLVPHAPLGVSGWRWVVLLSGAFALAIWWLRRRLPESARWLAAHDRHAEADAVVTELE 244
PHAP + L F + L E
Sbjct: 154 MGGFSPHAPFFAAA----ALNGLNFLTGCFLLP--------------------------E 183

Query: 245 ARCMRDAGAPLPEPQTATSLPAGVVPTALLWRAPYRARIGMLVVFHVFQAIGFFG----- 299
+ S W ++ VF + Q +G
Sbjct: 184 SHKGERRPLRREALNPLAS---------FRWARGMTVVAALMAVFFIMQLVGQVPAALWV 234

Query: 300 -FG----NWLPALISAQGAGAGVTKSLAYSFAISLAYPLAPLLLLRFAQRWENKWQITAS 354
FG +W I A G+ SLA + + + R +R + A
Sbjct: 235 IFGEDRFHWDATTIGISLAAFGILHSLAQAM-------ITGPVAARLGERRALMLGMIAD 287

Query: 355 ALGAVL 360
G +L
Sbjct: 288 GTGYIL 293


32  AXO1947_RS12045  AXO1947_RS12230  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS12045  2       23       -0.388237  cytochrome c
AXO1947_RS12055  1       16       -0.800616  hypothetical protein
AXO1947_RS12065  -1      16       -0.802064  DNA-directed RNA polymerase sigma-70 factor
AXO1947_RS12070  0       15       -0.977258  hypothetical protein
AXO1947_RS12075  1       14       -0.907016  serine protease
AXO1947_RS12080  1       15       -0.858263  heme ABC transporter permease
AXO1947_RS12085  2       17       -0.954859  heme exporter protein CcmD
AXO1947_RS12090  0       16       -0.841592  cytochrome c biogenesis protein CcmE
AXO1947_RS21980  0       13       -0.862910  c-type cytochrome biogenesis protein CcmF
AXO1947_RS12095  -1      12       -1.887348  thiol:disulfide interchange protein
AXO1947_RS12105  0       15       -1.051973  cytochrome c biogenesis protein
AXO1947_RS12110  0       17       -0.559341  hypothetical protein
AXO1947_RS21985  0       16       -1.109468  hybrid sensor histidine kinase/response
AXO1947_RS21990  1       16       -0.718534  circadian clock protein KaiC
AXO1947_RS12120  2       15       0.098114   response regulator
AXO1947_RS12125  3       14       0.392748   hypothetical protein
AXO1947_RS12130  3       12       0.772255   methyl-accepting chemotaxis protein
AXO1947_RS12135  3       10       1.222668   hypothetical protein
AXO1947_RS12140  3       12       1.947267   hypothetical protein
AXO1947_RS12145  2       13       2.298336   hypothetical protein
AXO1947_RS12150  3       13       1.861239   integrase
AXO1947_RS12155  4       15       1.465516   *LysR family transcriptional regulator
AXO1947_RS12160  4       15       0.654196   FMN-dependent NADH-azoreductase
AXO1947_RS12165  3       15       -0.174167  serine--tRNA ligase
AXO1947_RS12170  3       17       -0.536809  energy transducer TonB
AXO1947_RS12175  3       18       -0.851811  IS5/IS1182 family transposase
AXO1947_RS12180  3       17       -1.223895  energy transducer TonB
AXO1947_RS12185  2       15       -1.529393  3-phosphoshikimate 1-carboxyvinyltransferase
AXO1947_RS12190  1       14       -0.978996  chorismate mutase
AXO1947_RS12200  -1      11       -0.683280  phosphoserine transaminase
AXO1947_RS12205  -1      15       -0.545054  mononuclear molybdenum enzyme YedY
AXO1947_RS12210  2       18       0.752826   sulfoxide reductase heme-binding subunit YedZ
AXO1947_RS12215  1       21       1.025173   FHA domain-containing protein
AXO1947_RS12220  1       19       1.179523   polyhydroxyalkanoic acid synthase
AXO1947_RS12225  1       17       2.092277   poly(hydroxyalcanoate) granule associated
AXO1947_RS22000  2       15       1.860883   hypothetical protein
AXO1947_RS12230  2       15       1.885947   histidine utilization repressor
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS12245  SUBTILISIN    152    1e-44    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 152 bits (386), Expect = 1e-44
Identities = 86/289 (29%), Positives = 126/289 (43%), Gaps = 37/289 (12%)

Query: 173 QRGFIDTDAASAQTVTQGRGVVIAVVDTGVDTNHPDLKARIRDVHDLVD----DKPVMTS 228
RG A + T+GRGV +AV+DTG D +HPDLKARI + D D +
Sbjct: 23 PRGVEMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKD 82

Query: 229 TDSHGTEVAGIIAAGSNNHQGIVGMAPKAMLSVYKACWYAPTVGATARCNTFTLAKALAA 288
+ HGT VAG IAA N + G+VG+AP+A L + K + + + +
Sbjct: 83 YNGHGTHVAGTIAATENEN-GVVGVAPEADLLIIKVLNKQGSGQYDW------IIQGIYY 135

Query: 289 INNSSARVINLSLGGPAD-PLLSKMLEQLVQQGRIVVAAM------PPNERLDGFPNDVP 341
+I++SLGGP D P L + +++ V +V+ A G+P
Sbjct: 136 AIEQKVDIISMSLGGPEDVPELHEAVKKAVASQILVMCAAGNEGDGDDRTDELGYPGCYN 195

Query: 342 GVLVV--------RSSSATPAMPGVLSAPGKDILTTQPNGRYDFTSGSSMATAHVSGMAA 393
V+ V S + L APG+DIL+T P G+Y SG+SMAT HV+G A
Sbjct: 196 EVISVGAINFDRHASEFSNSNNEVDLVAPGEDILSTVPGGKYATFSGTSMATPHVAGALA 255

Query: 394 LLLSLQPSMDAKALRELMQRTSKVS-----------NGQLQVNAGAAVQ 431
L+ L + + L E + G + A +
Sbjct: 256 LIKQLANASFERDLTEPELYAQLIKRTIPLGNSPKMEGNGLLYLTAVEE 304


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS12285  HTHFIS        68     5e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.9 bits (166), Expect = 5e-14
Identities = 34/121 (28%), Positives = 54/121 (44%), Gaps = 1/121 (0%)

Query: 568 QGQQVLVVEDDEQVRLLVTELLSELGYQADVVADADAALPILASPRRIDLLVTDVGLPGL 627
G +LV +DD +R ++ + LS GY + ++A +A+ DL+VTDV +P
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAA-GDGDLVVTDVVMPDE 60

Query: 628 NGRQLAEIARQSRRDLPVIFMTGYAETARDRGEFLGEGMSMIAKPFTLGEFSGKLHEVLG 687
N L +++R DLPV+ M+ + KPF L E G + L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 688 P 688

Sbjct: 121 E 121


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS12295  HTHFIS        74     5e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.5 bits (183), Expect = 5e-19
Identities = 31/114 (27%), Positives = 53/114 (46%), Gaps = 3/114 (2%)

Query: 2 LVEDDDAIREMAADILGDEGYHVVVSADAEQALTQLTEACPFDLLLSDICLPGMNGRDLA 61
+ +DD AIR + L GY V ++++A + DL+++D+ +P N DL
Sbjct: 8 VADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAG-DGDLVVTDVVMPDENAFDLL 66

Query: 62 DQARSICPALPIVFMTGYAGEIAKRADFLDTGMR-LLTKPFSLRDLLVIVQTAL 114
+ + P LP++ M+ + G L KPF L +L+ I+ AL
Sbjct: 67 PRIKKARPDLPVLVMSAQ-NTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS12380  PF03544       76     1e-18    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 76.2 bits (187), Expect = 1e-18
Identities = 34/156 (21%), Positives = 52/156 (33%), Gaps = 10/156 (6%)

Query: 87 PAQPTAGAPADDTIA---PLPAPLPAQDGASDMPQAKPPADTPTLVDTAPPAPPAPRPLT 143
+P P + + P P P P + Q K +P AP
Sbjct: 79 EPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAP---- 134

Query: 144 EDAAPGAPGSAAPSKAPVAGGDRPVPIEGQMPPPRYPSAALRRGDSGDVVVRVDVDATGN 203
A P + + A + PV P P+YP+ A G V V+ DV G
Sbjct: 135 --ARPTSSTATAATSKPVTSVASG-PRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGR 191

Query: 204 PGGVTLVQRSGSRDLDRAAMEAVRHWRFHPAQRNGQ 239
V ++ + +R A+R WR+ P +
Sbjct: 192 VDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSG 227


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS12390  PF03544       48     3e-09    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 47.7 bits (113), Expect = 3e-09
Identities = 24/120 (20%), Positives = 39/120 (32%), Gaps = 5/120 (4%)

Query: 26 AKDPQPLPAASRQILPTTDGPRAMTMVAHQPARPAVSSVEAKPLP---GNAMPSYPPAVA 82
P SR P + A + A + P P YP
Sbjct: 113 QPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQ 172

Query: 83 HAGVQGNRTARLQLDVQGRVSDVTIVDRGGSDDPRLDAAVVESLRQWRFEPATRDGHAVV 142
++G + + GRV +V I+ + + V ++R+WR+EP VV
Sbjct: 173 ALRIEGQVKVKFDVTPDGRVDNVQILSAKPA--NMFEREVKNAMRRWRYEPGKPGSGIVV 230


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS12450  PF03544       31     0.005    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 31.1 bits (70), Expect = 0.005
Identities = 15/72 (20%), Positives = 21/72 (29%)

Query: 174 SVIAGAALATLAVMAVLELRPPAPAPAALRTAVSSPAPAPARPVASNTPSATTAQPTPPP 233
SV A+ + + PAPA + P A P +P P P
Sbjct: 21 SVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEP 80

Query: 234 AAVPMPLAPVTP 245
+P P
Sbjct: 81 EPIPEPPKEAPV 92


33  AXO1947_RS12305  AXO1947_RS12355  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS12305  4  14  0.445468  type IV pilus modification protein PilV
AXO1947_RS12310  3  14  0.320605  pre-pilin like leader sequence
AXO1947_RS12315  3  16  0.419982  LOG family protein
AXO1947_RS12320  5  22  0.802529  IS5/IS1182 family transposase
AXO1947_RS12325  6  23  -0.212749  Oar protein
AXO1947_RS12330  6  23  -0.399041  membrane protein
AXO1947_RS12335  8  28  -4.459595  short-chain dehydrogenase
AXO1947_RS22005  8  24  -3.723322  monothiol glutaredoxin, Grx4 family
AXO1947_RS12345  6  22  -3.002252  superoxide dismutase
AXO1947_RS22010  5  16  -2.826731  5-(carboxyamino)imidazole ribonucleotide
AXO1947_RS12355  2  10  -0.839146  5-(carboxyamino)imidazole ribonucleotide mutase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS12545  BCTERIALGSPG  28  0.011  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 27.9 bits (62), Expect = 0.011
Identities = 9/22 (40%), Positives = 18/22 (81%), Gaps = 2/22 (9%)

Query: 12 RTKGFSLLEVLIAIVVLAFGLL 33
+ +GF+LLE+++ IV++ G+L
Sbjct: 6 KQRGFTLLEIMVVIVII--GVL 25


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS12550  BCTERIALGSPG  37  1e-05  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 36.8 bits (85), Expect = 1e-05
Identities = 11/31 (35%), Positives = 21/31 (67%)

Query: 4 RRFAGFTLVELMITIVVLAILLTIAFPSFRG 34
+ GFTL+E+M+ IV++ +L ++ P+ G
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMG 35


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS12575  DHBDHDRGNASE  71  7e-17  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 71.2 bits (174), Expect = 7e-17
Identities = 43/194 (22%), Positives = 82/194 (42%), Gaps = 3/194 (1%)

Query: 12 ALAGRVVLITGAAGGLGAAAAQACAAAGATVVLLGRKLRPLERVYDAVAALGSEPLLYPL 71
+ G++ ITGAA G+G A A+ A+ GA + + LE+V ++ A +P
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 72 DLAGATPDDYATLARRLQTELGGLHGLLQCAADFAGLTPAELAAPADFARTLHVNLTARA 131
D+ + R++ E+G + L+ A + ++ T VN T
Sbjct: 65 DV--RDSAAIDEITARIEREMGPIDILV-NVAGVLRPGLIHSLSDEEWEATFSVNSTGVF 121

Query: 132 WLTQACLPLLRQQHDAAVVFVVDDPARVGQAYWGAYGAAQHAQRGLIASLHHETAAGPVR 191
+++ + + ++V V +PA V + AY +++ A L E A +R
Sbjct: 122 NASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181

Query: 192 VSGLQPGPMRTALR 205
+ + PG T ++
Sbjct: 182 CNIVSPGSTETDMQ 195


34  AXO1947_RS12450  AXO1947_RS12565  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS12450  2  10  2.540146  NADH dehydrogenase (quinone) subunit G
AXO1947_RS12455  1  13  2.369859  NADH oxidoreductase (quinone) subunit F
AXO1947_RS12460  -1  11  1.137332  NADH-quinone oxidoreductase subunit E
AXO1947_RS12465  -1  11  0.448777  NADH dehydrogenase subunit D
AXO1947_RS12470  -2  12  -0.207570  NADH-quinone oxidoreductase subunit C
AXO1947_RS12475  -2  10  -1.314359  NADH-quinone oxidoreductase subunit B
AXO1947_RS12480  -1  11  -2.553243  NADH-quinone oxidoreductase subunit A
AXO1947_RS12485  2  18  -4.013866  *preprotein translocase subunit SecG
AXO1947_RS22035  3  20  -3.455253  triose-phosphate isomerase
AXO1947_RS22040  3  26  -5.517419  dehydrogenase
AXO1947_RS12500  3  29  -6.227008  hypothetical protein
AXO1947_RS12510  4  29  -6.906075  cyanoglobin
AXO1947_RS12515  4  36  -7.217941  hypothetical protein
AXO1947_RS12525  4  36  -7.507298  GGDEF domain-containing protein
AXO1947_RS12530  4  37  -7.293687  methylamine utilization protein
AXO1947_RS22045  3  21  -4.460855  flavonol synthase
AXO1947_RS12535  1  16  -3.376217  phosphoglucosamine mutase
AXO1947_RS12540  2  16  -5.090641  acetyl-CoA carboxylase carboxyl transferase
AXO1947_RS12545  0  13  -4.866531  tryptophan synthase subunit alpha
AXO1947_RS12550  0  11  -3.703568  tryptophan synthase subunit beta
AXO1947_RS12555  0  10  -3.796940  transcriptional regulator
AXO1947_RS12560  0  12  -4.270192  N-(5'-phosphoribosyl)anthranilate isomerase
AXO1947_RS12565  0  12  -3.288319  tRNA pseudouridine(38-40) synthase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS12730  SECGEXPORT  93  2e-27  Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 93.1 bits (231), Expect = 2e-27
Identities = 41/111 (36%), Positives = 63/111 (56%), Gaps = 9/111 (8%)

Query: 5 ILNVVYVLVALAMIALILMQRGAGAAAGSGFGAGASGTVFGSQGASNFLSKSTKWLAVVF 64
L VV+++VA+ ++ LI++Q+G GA G+ FGAGAS T+FGS G+ NF+++ T LA +F
Sbjct: 4 ALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLATLF 63

Query: 65 FSISLFMAWYATHGARPTDQNLGVMSQSATPAPAAAGELTQPLPQAPAAGA 115
F ISL + ++ + + APA + Q P APA
Sbjct: 64 FIISLVLGNINSNKTNKGSEWENL------SAPA---KTEQTQPAAPAKPT 105


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS12740  DHBDHDRGNASE  65  2e-14  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 64.7 bits (157), Expect = 2e-14
Identities = 51/179 (28%), Positives = 79/179 (44%), Gaps = 2/179 (1%)

Query: 8 ALITGASSGIGREIARAYAKRGVPLLLTARREDRLHALADELRSAVRV-EVLPADLADPA 66
A ITGA+ GIG +AR A +G + ++L + L++ R E PAD+ D A
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70

Query: 67 AAEALATQIQRNGWTVGTLVNNAGYGVPGRYLHNDWPTHARFLQVMVTAVCELTWRLLPM 126
A + + +I+R + LVN AG PG V T V + +
Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130

Query: 127 IRASGQGRILNVASFAALTPSADGQTLYAASKSFMLRFSESLALENADCGVKVCALCPG 185
+ G I+ V S A P YA+SK+ + F++ L LE A+ ++ + PG
Sbjct: 131 MMDRRSGSIVTVGSNPAGVPRT-SMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS22060  PF05272  27  0.012  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 27.3 bits (60), Expect = 0.012
Identities = 8/40 (20%), Positives = 15/40 (37%)

Query: 20 PILEQARKRTKPVTVDMYEVWCAVLYLLRTGRPWRALPSD 59
P+L R + +++ L+L G + P D
Sbjct: 710 PVLVPGRANLVWLQKFRGQLFAEALHLYLAGERYFPSPED 749


35  AXO1947_RS12610  AXO1947_RS12705  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS12610  4  16  -0.353476  lytic transglycosylase
AXO1947_RS12615  5  19  -0.584863  hypothetical protein
AXO1947_RS12625  3  16  -0.341395  hypothetical protein
AXO1947_RS12635  1  17  -1.711117  hypothetical protein
AXO1947_RS12640  0  16  -2.622946  IS30 family transposase
AXO1947_RS12655  1  17  -3.080631  transcription elongation factor GreB
AXO1947_RS12660  1  17  -3.642806  IS5/IS1182 family transposase
AXO1947_RS12665  1  17  -2.236677  30S ribosomal protein S12 methylthiotransferase
AXO1947_RS12670  1  18  -1.640862  carboxymethylenebutenolidase
AXO1947_RS12675  1  19  -1.518087  HIT family protein
AXO1947_RS12680  1  20  -2.078549  hypothetical protein
AXO1947_RS12690  1  22  -1.155035  IS5/IS1182 family transposase
AXO1947_RS12700  2  22  -2.263291  hypothetical protein
AXO1947_RS12705  2  20  -1.539064  deoxycytidine triphosphate deaminase
36  AXO1947_RS13055  AXO1947_RS13145  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS13055  4  11  0.142946  potassium transporter
AXO1947_RS13060  1  13  -1.126040  serine hydrolase
AXO1947_RS13065  -1  11  -0.292959  ribonuclease activity regulator protein RraA
AXO1947_RS22110  -1  12  -0.738493  hypothetical protein
AXO1947_RS13080  -3  15  1.161334  EamA family transporter
AXO1947_RS13085  -1  13  1.526012  23S rRNA pseudouridine synthase F
AXO1947_RS22115  -2  12  3.353674  GGDEF domain-containing protein
AXO1947_RS13095  -1  14  3.004981  RNA helicase
AXO1947_RS13100  -1  13  3.078858  DNA-directed RNA polymerase sigma-70 factor
AXO1947_RS13105  -1  13  3.043372  hypothetical protein
AXO1947_RS13110  -1  11  1.003138  RNA-binding protein
AXO1947_RS13115  -1  10  0.841241  hypothetical protein
AXO1947_RS13120  1  12  -1.308170  hypothetical protein
AXO1947_RS13125  0  13  -1.145923  hypothetical protein
AXO1947_RS13130  0  14  -1.534440  hypothetical protein
AXO1947_RS13135  0  13  -1.849396  hypothetical protein
AXO1947_RS13140  -1  13  1.303308  cysteine methyltransferase
AXO1947_RS13145  2  12  2.125683  IS5/IS1182 family transposase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS13300  BLACTAMASEA  34  0.001  Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 34.0 bits (78), Expect = 0.001
Identities = 19/78 (24%), Positives = 34/78 (43%), Gaps = 5/78 (6%)

Query: 66 ADTLFAIASNTKAFTAASLSILADEGKLSLEDKVI----DHLPWFRMSDPYVSGEMRIRD 121
AD F + S K ++ D G LE K+ D + + +S+ +++ M + +
Sbjct: 58 ADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGE 117

Query: 122 LLAHRSGLS-LGAGDLLF 138
L A +S A +LL
Sbjct: 118 LCAAAITMSDNSAANLLL 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS13305  TYPE3OMBPROT  30  0.006  Type III secretion system outer membrane B protein ...
		>TYPE3OMBPROT#Type III secretion system outer membrane B protein

family signature.
Length = 538

Score = 29.7 bits (66), Expect = 0.006
Identities = 15/36 (41%), Positives = 16/36 (44%), Gaps = 3/36 (8%)

Query: 70 HALLGDQIAANAVANGWAGVLIHG---CVRDVEMLA 102
+LLGD N V GWA I C DV LA
Sbjct: 363 CSLLGDNFLKNGVIGGWAAEAIEKNPPCKNDVIYLA 398


37  AXO1947_RS13335  AXO1947_RS13415  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS13335  4  15  0.966119  hypothetical protein
AXO1947_RS13340  5  15  1.349142  hypothetical protein
AXO1947_RS13345  0  12  2.142435  hypothetical protein
AXO1947_RS13350  -1  15  3.236300  hypothetical protein
AXO1947_RS13355  -1  14  3.369457  hypothetical protein
AXO1947_RS13360  -1  13  1.706961  IS5/IS1182 family transposase
AXO1947_RS13365  -1  11  2.236055  hypothetical protein
AXO1947_RS13370  -1  9  2.282995  hypothetical protein
AXO1947_RS13375  -1  11  1.527264  hypothetical protein
AXO1947_RS13385  -1  11  0.615609  hypothetical protein
AXO1947_RS13390  -1  8  1.095241  hypothetical protein
AXO1947_RS13395  0  8  0.145785  hypothetical protein
AXO1947_RS13400  2  7  -0.628515  type IV secretion protein Rhs
AXO1947_RS13405  1  9  -0.726873  pirin
AXO1947_RS13410  1  9  -0.897782  carbon starvation protein A
AXO1947_RS13415  2  9  -1.165704  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS13605  CHLAMIDIAOM6  30  0.010  Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6

signature.
Length = 547

Score = 29.7 bits (66), Expect = 0.010
Identities = 16/33 (48%), Positives = 21/33 (63%), Gaps = 3/33 (9%)

Query: 181 PCTQRLKFVDSSVSFQPTADQHRLIFVIDRLGQ 213
PC +FV S + PTAD +L++ IDRLGQ
Sbjct: 141 PC--EAEFVRSDPATTPTADG-KLVWKIDRLGQ 170


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS13640  PF05043  35  5e-04  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 35.3 bits (81), Expect = 5e-04
Identities = 20/86 (23%), Positives = 34/86 (39%), Gaps = 14/86 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAHM 140
+ L+ H NTAH+
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAHL 325


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS13695  ACRIFLAVINRP  37  4e-04  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 36.7 bits (85), Expect = 4e-04
Identities = 20/113 (17%), Positives = 40/113 (35%), Gaps = 15/113 (13%)

Query: 133 MVLFLSSRRNGRSLGD---LVREEMGQVPGTIAL------------FGAFLIMIIILAVL 177
+ G S GD L+ ++P I ++ I V+
Sbjct: 823 SMEIQGEAAPGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVV 882

Query: 178 AMVVVKALAESPWGMFTVIATMPIALMMGVYMRYIRVGKIGEISVVGLILLLG 230
+ + AL ES +V+ +P+ ++ + + K +VGL+ +G
Sbjct: 883 VFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIG 935


38  AXO1947_RS13640  AXO1947_RS13690  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS13640  -1  16  -3.408147  IS5/IS1182 family transposase
AXO1947_RS13645  2  20  -4.559175  HutD-family protein
AXO1947_RS13650  2  20  -4.396234  competence protein ComEA
AXO1947_RS13655  1  15  0.453778  peptidase M20
AXO1947_RS22205  1  18  2.219471  GlsB/YeaQ/YmgE family stress response membrane
AXO1947_RS13665  1  18  2.831426  aminoglycoside phosphotransferase
AXO1947_RS22210  1  20  2.815215  mannose-1-phosphate guanylyltransferase
AXO1947_RS13675  1  21  3.119529  DnaA regulatory inactivator Hda
AXO1947_RS13680  2  24  3.703163  AI-2E family transporter
AXO1947_RS13690  2  15  -0.164856  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS13945  FLGLRINGFLGH  28  0.043  Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 27.6 bits (61), Expect = 0.043
Identities = 11/34 (32%), Positives = 15/34 (44%)

Query: 77 FGNRDADVDTDTDTDTSASSSANVSANQTGTVAG 110
FGN ADV+ + AN S +GT+
Sbjct: 117 FGNARADVEASGGNTFNGKGGANASNTFSGTLTV 150


39  AXO1947_RS14315  AXO1947_RS14435  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS14315  2  13  -3.103394  septum site-determining protein MinD
AXO1947_RS14320  1  13  -2.838766  cell division topological specificity factor
AXO1947_RS14325  1  18  -3.851874  hypothetical protein
AXO1947_RS14330  2  18  -3.883650  two-component sensor histidine kinase
AXO1947_RS14340  0  15  0.807599  chemotaxis protein CheY
AXO1947_RS14355  0  19  2.878216  membrane protein
AXO1947_RS14360  1  23  3.812052  membrane protein
AXO1947_RS14365  0  26  4.285662  phosphoethanolamine transferase
AXO1947_RS14370  2  23  1.858313  peptidase M20
AXO1947_RS14375  3  22  0.952526  hemolysin D
AXO1947_RS14380  5  19  -0.813374  sulfotransferase
AXO1947_RS14385  6  19  -1.074505  hypothetical protein
AXO1947_RS14390  6  21  -1.627853  glycine dehydrogenase
AXO1947_RS14395  5  21  -0.727888  hypothetical protein
AXO1947_RS14400  4  21  0.188889  hypothetical protein
AXO1947_RS14405  0  25  4.520503  VRR-NUC domain-containing protein
AXO1947_RS14410  0  20  3.671365  oxidoreductase
AXO1947_RS14415  0  16  2.705951  catalase HPII
AXO1947_RS14420  0  12  1.501633  hypothetical protein
AXO1947_RS14425  1  11  1.237382  hypothetical protein
AXO1947_RS14430  0  10  0.713633  hypothetical protein
AXO1947_RS14435  -1  21  -3.834442  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS14655  HTHFIS  82  3e-20  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.2 bits (203), Expect = 3e-20
Identities = 32/136 (23%), Positives = 56/136 (41%), Gaps = 4/136 (2%)

Query: 2 RLLVIEDNRNMVANLFDYFEARGYTLDAAPDGITGLHLATTQHYDALILDWMMPRMDGPA 61
+LV +D+ + L GY + + T D ++ D +MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLRRLREQHQSELPVIMLTARDELPDKIAGFRAGADDYLTKPFALPE---LEVRIEALLA 118
+L R+++ + +LPV++++A++ I GA DYL KPF L E + R A
Sbjct: 65 LLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 119 RAHGRRRGKLLQVADL 134
R + L
Sbjct: 124 RRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS14695  RTXTOXIND  113  7e-30  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 113 bits (285), Expect = 7e-30
Identities = 66/395 (16%), Positives = 125/395 (31%), Gaps = 48/395 (12%)

Query: 51 GTVVPADGMIAITTPQSGVVANVGVVQGQRVAAGQVLFVL-AAEHRDDRGRPSQQAAAVL 109
G + + I ++ +V + V +G+ V G VL L A D +
Sbjct: 88 GKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQAR 147

Query: 110 AEQQRLTAEAM--------------------------VQLRAQGRLQQQAAARALAGLRN 143
EQ R + ++L + + Q
Sbjct: 148 LEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKEL 207

Query: 144 RLEQVDAEL-GVLRHRQQLTQSIE------QRYRTALTRGLVSQQFVDEKQADVLDQRAH 196
L++ AE VL + + + L + +++ V E++ ++
Sbjct: 208 NLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNE 267

Query: 197 ALELQRERLTLADALAQAQAELQQLPVSLRQQLA--LAGASLQADRRTAIEQAAA---SR 251
+ + + + A+ E Q + + ++ L + T
Sbjct: 268 LRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQA 327

Query: 252 WEVRAPRAGRVA-LRPLQRGQAVGQGQRLADLLPTSTATEVVLYAPSRAAGLIGPGIPVQ 310
+RAP + +V L+ G V + L ++P EV ++ G I G
Sbjct: 328 SVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAI 387

Query: 311 LRFDALPYQHYGQFAGRVVEIAA-APEPPRADAALVSEPLYRVRVRLAGDAALRAGHAAV 369
++ +A PY YG G+V I A E R ++ V + + +
Sbjct: 388 IKVEAFPYTRYGYLVGKVKNINLDAIEDQR------LGLVFNVIISIEENCLSTGNKNIP 441

Query: 370 LRPGMRVQGTLALEWRRFSQWAFEPLS-SLHGTLR 403
L GM V + R + PL S+ +LR
Sbjct: 442 LSSGMAVTAEIKTGMRSVISYLLSPLEESVTESLR 476


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS14705  RTXTOXIND  30  0.045  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 30.2 bits (68), Expect = 0.045
Identities = 20/145 (13%), Positives = 41/145 (28%), Gaps = 14/145 (9%)

Query: 422 ISLDETTTRADVVAL-AQLFGAMADVDALDAATADALPQGLLRSSAFLTHPVF------- 473
+ L AD + + L A + + ++ L P F
Sbjct: 125 LKLTALGAEADTLKTQSSLLQARLEQ-TRYQILSRSIELNKLPELKLPDEPYFQNVSEEE 183

Query: 474 ---NTHHSEHELLRYMRSLADKDLAMDRTMIPLGSCTMKLNATAEMIPVTWPEFGAIHPL 530
T + + + K+L +D+ + ++N + V L
Sbjct: 184 VLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSL 243

Query: 531 APAEQSAGYAQLIDELEAMLVECTG 555
+Q+ ++ E E VE
Sbjct: 244 L-HKQAIAKHAVL-EQENKYVEAVN 266


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS14760  IGASERPTASE  34  0.003  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.9 bits (77), Expect = 0.003
Identities = 31/173 (17%), Positives = 59/173 (34%), Gaps = 25/173 (14%)

Query: 4 IESTKQALPDPAQLVQTAPTPSGSASVAS------TSADSAVPSAQLGGLSIRPRLLGRA 57
+++T P+ Q + PS + +A A PS ++ + +
Sbjct: 992 VDTTNITTPNNIQADVPS-VPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKT 1050

Query: 58 STANPSAASSSNAQNVRAALFAAQSVPQGPDPSEVLVKASSRLKRFNDLAQKVVPTEPSD 117
N A+ + AQN A A +V +EV ++ Q E +
Sbjct: 1051 VEKNEQDATETTAQNREVAKEAKSNVKANTQTNEV-----AQSGSETKETQTTETKETAT 1105

Query: 118 IKALE------------ARLRSGTSALESARQALQALAELNIKKRMPVDNIEE 158
++ E ++ S S + + +Q AE ++ P NI+E
Sbjct: 1106 VEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA-RENDPTVNIKE 1157


40  AXO1947_RS15680  AXO1947_RS15900  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS15680  3  21  -2.480552  xylanase
AXO1947_RS15690  4  24  -3.478614  asparaginase
AXO1947_RS15695  4  25  -3.347757  aminoacyl-tRNA deacylase
AXO1947_RS15700  4  27  -3.990181  branched chain amino acid aminotransferase
AXO1947_RS15705  4  27  -3.513454  putative Fe-S cluster assembly protein SufT
AXO1947_RS15715  2  25  -2.987705  hypothetical protein
AXO1947_RS15720  6  31  -2.510303  hypothetical protein
AXO1947_RS15725  5  32  -3.354777  NAD(P) transhydrogenase subunit beta
AXO1947_RS15735  6  31  -4.449141  NAD(P)(+) transhydrogenase
AXO1947_RS15740  7  30  -4.992835  RNA polymerase sigma factor
AXO1947_RS15745  7  28  -5.938643  hypothetical protein
AXO1947_RS15750  7  28  -5.716803  hypothetical protein
AXO1947_RS15755  7  27  -5.160865  NAD(P) transhydrogenase subunit alpha
AXO1947_RS15760  4  24  -4.360067  hypothetical protein
AXO1947_RS15765  3  23  -4.654371  nitroreductase
AXO1947_RS15775  3  27  -3.715732  exodeoxyribonuclease IX
AXO1947_RS15780  4  27  -4.136287  DNA mismatch repair protein MutT
AXO1947_RS15785  5  30  -3.993617  N-formylglutamate amidohydrolase
AXO1947_RS15790  7  32  -3.368392  prolyl aminopeptidase
AXO1947_RS15795  8  31  -3.430642  hypothetical protein
AXO1947_RS15800  7  29  -2.596598  protein-(glutamine-N5) methyltransferase,
AXO1947_RS15810  7  21  -1.956554  peroxiredoxin
AXO1947_RS15820  6  21  -2.619497  alkyl hydroperoxide reductase subunit F
AXO1947_RS15830  7  27  -2.091992  transcriptional regulator
AXO1947_RS15840  7  30  -2.352538  nucleoside diphosphate kinase regulator
AXO1947_RS15845  7  31  -2.104728  transaldolase
AXO1947_RS15850  7  32  -1.774001  peptide-methionine (S)-S-oxide reductase
AXO1947_RS15855  6  31  -1.839319  hypothetical protein
AXO1947_RS15860  6  29  -2.037908  helix-turn-helix transcriptional regulator
AXO1947_RS15865  8  31  -2.089185  glutamine--tRNA ligase
AXO1947_RS15870  6  27  -2.852902  hypothetical protein
AXO1947_RS15875  5  19  -2.854166  hypothetical protein
AXO1947_RS15880  6  17  -3.202091  hypothetical protein
AXO1947_RS22425  6  15  -2.844860  tRNA-specific adenosine deaminase
AXO1947_RS15885  6  18  -3.027844  23S rRNA (cytidine(2498)-2'-O)-methyltransferase
AXO1947_RS15900  3  16  -1.103957  glucose-fructose oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS16075  DHBDHDRGNASE  33  0.001  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 33.5 bits (76), Expect = 0.001
Identities = 24/96 (25%), Positives = 36/96 (37%), Gaps = 19/96 (19%)

Query: 174 GAGVAGLQAIATAKRLGAQVEGFDVRPETREQIASLGARFLDLGVSAAGEGGYARQLTDD 233
G G A + +A+ GA + D PE E++ S S E +A D
Sbjct: 19 GIGEAVARTLASQ---GAHIAAVDYNPEKLEKVVS----------SLKAEARHAEAFPAD 65

Query: 234 ER-----AEQQRRLAEHLKGVDVVVCTAAVPGRPAP 264
R E R+ + +D++V A V RP
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVL-RPGL 100


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS16100  FLGMRINGFLIF  28  0.048  Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 28.0 bits (62), Expect = 0.048
Identities = 14/47 (29%), Positives = 21/47 (44%), Gaps = 6/47 (12%)

Query: 116 GNRQLMASDRQQRIDAIFAP------YHARITAELDARAKRNQPTIV 156
+ S Q+RI+AI +P HA++TA+LD K
Sbjct: 235 KFANDVESRIQRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHY 281


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS16135  STREPTOPAIN  31  0.011  Streptopain (C10) cysteine protease family signature.
		>STREPTOPAIN#Streptopain (C10) cysteine protease family signature.

Length = 398

Score = 31.2 bits (70), Expect = 0.011
Identities = 13/53 (24%), Positives = 27/53 (50%), Gaps = 1/53 (1%)

Query: 2 LDANLKTQLTAYLERVTRPIQINASIDDSP-GSREMLELLEELVLLSDTISLD 53
DAN K + +++E I+ N +D + G+ E+ + + + +L S I +
Sbjct: 109 FDANGKENIASFMESYVEQIKENKKLDTTYAGTAEIKQPVVKSLLDSKGIHYN 161


41  AXO1947_RS17600  AXO1947_RS22710  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS17600  4  18  0.464499  cytochrome c biogenesis protein
AXO1947_RS17605  3  17  0.255190  3-dehydroquinate dehydratase
AXO1947_RS22690  5  20  -0.423485  acetyl-CoA carboxylase biotin carboxyl carrier
AXO1947_RS17620  4  18  -0.955031  hypothetical protein
AXO1947_RS22695  2  22  -4.729348  acetyl-CoA carboxylase biotin carboxylase
AXO1947_RS22700  1  17  -2.593229  hypothetical protein
AXO1947_RS17635  0  15  -2.303707  hypothetical protein
AXO1947_RS17640  0  20  -2.267331  50S ribosomal protein L11 methyltransferase
AXO1947_RS17655  0  24  -2.614142  hypothetical protein
AXO1947_RS17665  1  15  -2.105634  transcriptional regulator
AXO1947_RS17670  2  17  -2.607735  hypothetical protein
AXO1947_RS17685  5  18  -3.695925  hypothetical protein
AXO1947_RS17690  3  20  -1.003634  Fis family transcriptional regulator
AXO1947_RS22705  4  18  -1.522722  IS481 family transposase
AXO1947_RS22710  2  16  0.935026  CDP-alcohol phosphatidyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18150  DNABINDNGFIS  115  1e-37  DNA-binding protein FIS signature.
		>DNABINDNGFIS#DNA-binding protein FIS signature.

Length = 98

Score = 115 bits (290), Expect = 1e-37
Identities = 39/74 (52%), Positives = 55/74 (74%)

Query: 16 KSPLREHVAQSVRRYLRDLDGSDADDVYEIVLREMEIPLFVEVLNHCEGNQSRAAAMLGI 75
+ PLR+ V Q+++ Y L+G D +D+YE+VL E+E PL V+ + GNQ+RAA M+GI
Sbjct: 24 QKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQPLLDMVMQYTRGNQTRAALMMGI 83

Query: 76 HRATLRKKLKEYGM 89
+R TLRKKLK+YGM
Sbjct: 84 NRGTLRKKLKKYGM 97


42  AXO1947_RS17855  AXO1947_RS17970  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS17855  2  17  1.395187  anthranilate phosphoribosyltransferase
AXO1947_RS17860  0  16  1.703003  Asp/Glu/hydantoin racemase
AXO1947_RS17865  1  19  1.052568  hypothetical protein
AXO1947_RS17870  1  15  0.726002  glutamine amidotransferase
AXO1947_RS17875  1  14  0.088807  SIMPL domain-containing protein
AXO1947_RS17880  1  14  -0.371203  amino acid lyase
AXO1947_RS17885  2  17  -2.451550  anthranilate synthase component I
AXO1947_RS17890  2  15  -2.804277  lipid kinase YegS
AXO1947_RS22735  2  16  -2.649684  N-acetyltransferase
AXO1947_RS17895  1  12  -1.199850  ribulose-phosphate 3-epimerase
AXO1947_RS17905  1  13  -1.166568  J domain-containing protein
AXO1947_RS17915  -1  14  0.721624  phosphoribosylaminoimidazolesuccinocarboxamide
AXO1947_RS17920  -1  14  1.203597  NYN domain-containing protein
AXO1947_RS17930  0  14  3.289467  membrane protein
AXO1947_RS17940  2  14  2.822297  monovalent cation/H+ antiporter subunit A
AXO1947_RS17945  1  16  2.411874  Na+/H+ antiporter subunit C
AXO1947_RS17955  1  17  3.278798  monovalent cation/H+ antiporter subunit D
AXO1947_RS17960  2  15  2.933266  Na+/H+ antiporter subunit E
AXO1947_RS17970  1  13  3.035460  K+/H+ antiporter subunit F
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18365  SACTRNSFRASE  41  3e-07  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 41.5 bits (97), Expect = 3e-07
Identities = 18/64 (28%), Positives = 24/64 (37%), Gaps = 2/64 (3%)

Query: 76 STWLGRNGLYLEDLFVRPEARGRGAGLALLRHLAQLAVQRGCGRFEWSVLDWNQPAIDFY 135
S W G +ED+ V + R +G G ALL + A + D N A FY
Sbjct: 84 SNWNGY--ALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFY 141

Query: 136 RAVG 139

Sbjct: 142 AKHH 145


43  AXO1947_RS18040  AXO1947_RS18185  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS18040  1  12  3.007180  hypothetical protein
AXO1947_RS18045  0  13  3.766551  IS5/IS1182 family transposase
AXO1947_RS18050  1  15  4.472905  hypothetical protein
AXO1947_RS18055  2  14  4.572962  RNA helicase
AXO1947_RS18060  2  18  5.403107  hypothetical protein
AXO1947_RS18065  1  19  5.470032  IS5/IS1182 family transposase
AXO1947_RS18070  1  20  4.974870  IS5/IS1182 family transposase
AXO1947_RS18080  0  20  3.946741  hypothetical protein
AXO1947_RS18085  -1  16  3.269961  pyridine nucleotide-disulfide oxidoreductase
AXO1947_RS18090  -1  18  2.302651  hypothetical protein
AXO1947_RS18095  3  14  -0.007203  3-oxoacyl-ACP reductase
AXO1947_RS18100  1  11  0.931585  IS5/IS1182 family transposase
AXO1947_RS18105  1  11  0.609289  NAD-dependent dehydratase
AXO1947_RS18110  -1  12  1.002621  glycogen debranching enzyme
AXO1947_RS18115  0  12  2.315326  hypothetical protein
AXO1947_RS18120  2  14  3.993983  malto-oligosyltrehalose synthase
AXO1947_RS18125  2  14  4.516584  4-alpha-glucanotransferase
AXO1947_RS22750  2  16  5.545300  malto-oligosyltrehalose trehalohydrolase
AXO1947_RS18130  1  18  5.259295  glycogen-branching enzyme
AXO1947_RS18135  3  17  4.680081  starch synthase
AXO1947_RS18140  3  15  4.036765  GGDEF domain-containing protein
AXO1947_RS22760  0  15  2.906100  membrane protein
AXO1947_RS22765  -2  17  3.135446  hypothetical protein
AXO1947_RS18150  -2  18  2.125005  ABC transporter substrate-binding protein
AXO1947_RS18155  -1  16  1.901398  IS5/IS1182 family transposase
AXO1947_RS18160  0  15  2.254855  hypothetical protein
AXO1947_RS18165  -1  14  2.310848  Ion channel protein
AXO1947_RS18175  2  13  1.705330  hypothetical protein
AXO1947_RS18180  2  12  1.067864  hypothetical protein
AXO1947_RS18185  2  11  0.408879  lytic transglycosylase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18520  PF05043  35  7e-04  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 34.9 bits (80), Expect = 7e-04
Identities = 20/85 (23%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18525  PF05043  33  0.002  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 33.4 bits (76), Expect = 0.002
Identities = 19/85 (22%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCLPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F C+ D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18540  DHBDHDRGNASE  113  1e-32  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 113 bits (284), Expect = 1e-32
Identities = 75/260 (28%), Positives = 113/260 (43%), Gaps = 9/260 (3%)

Query: 1 MSNTALRPQRVLIAGGSRGIGLAIAEGFVRNGAQVSICARTAAGLAQAAAALAAHGAPVH 60
M+ + + I G ++GIG A+A GA ++ L + ++L A
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 61 TRPCDLADATQIDAYVHAAAQALGGLDVVINNAS----GFGHGNDDASWQAGLDVDLMAA 116
P D+ D+ ID + +G +D+++N A G H D W+A V+
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 117 VRCNRAALPYLRSSDAAVILNISSINAQRPTPRAIAYSTAKAALNYYTTTLAAELARERI 176
+R+ Y+ + I+ + S A P AY+++KAA +T L ELA I
Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180

Query: 177 RVNAISPGSIE--FPDGLWDKRSREEPELY---ARIRDSIPFGGFGQVQHVADAALFLAS 231
R N +SPGS E LW + E + + IP + +ADA LFL S
Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240

Query: 232 PQASWITGQVLAVDGGQSLG 251
QA IT L VDGG +LG
Sbjct: 241 GQAGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18545  PF05043  33  0.002  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 33.4 bits (76), Expect = 0.002
Identities = 19/85 (22%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCLPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F C+ D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


44  AXO1947_RS18260  AXO1947_RS18405  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS18260  3  21  -0.855846  EscR/YscR/HrcR family type III secretion system
AXO1947_RS18265  2  21  0.177105  EscS/YscS/HrcS family type III secretion system
AXO1947_RS18270  5  19  -2.303900  hypothetical protein
AXO1947_RS18285  7  18  -2.533714  hypothetical protein
AXO1947_RS18290  6  15  -2.014400  hypothetical protein
AXO1947_RS18295  4  13  -1.461109  serine kinase
AXO1947_RS18300  4  11  -1.702122  4-hydroxyphenylacetate 3-monooxygenase
AXO1947_RS18305  3  11  -1.578555  hypothetical protein
AXO1947_RS18310  1  10  0.210323  hypothetical protein
AXO1947_RS18315  2  11  0.098200  hypothetical protein
AXO1947_RS18320  1  13  -0.491692  serine kinase
AXO1947_RS18325  0  13  -0.441040  HpaF protein
AXO1947_RS18330  -2  11  -0.545450  *4-hydroxybenzoate octaprenyltransferase
AXO1947_RS18335  -2  13  0.481195  amidophosphoribosyltransferase
AXO1947_RS18340  -2  13  0.726244  biotin synthase BioB
AXO1947_RS18345  -2  13  0.863653  8-amino-7-oxononanoate synthase
AXO1947_RS18350  -1  15  1.273033  hypothetical protein
AXO1947_RS18355  2  18  1.198007  pimeloyl-[acyl-carrier protein] methyl ester
AXO1947_RS18360  2  15  -0.584303  malonyl-[acyl-carrier protein]
AXO1947_RS18370  4  14  -3.114437  aspartyl beta-hydroxylase
AXO1947_RS18375  3  13  -3.130014  hypothetical protein
AXO1947_RS18380  2  12  -3.440637  hypothetical protein
AXO1947_RS18385  0  18  -3.524355  serine/threonine dehydratase
AXO1947_RS22780  1  12  -1.229907  hypothetical protein
AXO1947_RS22785  1  13  -1.205665  hypothetical protein
AXO1947_RS18390  1  13  -0.574729  tRNA uridine-5-carboxymethylaminomethyl(34)
AXO1947_RS18395  1  15  -0.566217  kinase
AXO1947_RS18400  2  15  -0.545562  IclR family transcriptional regulator
AXO1947_RS18405  2  16  -1.145234  4-carboxymuconolactone decarboxylase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18710  TYPE3IMPPROT  244  9e-85  Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 244 bits (625), Expect = 9e-85
Identities = 79/219 (36%), Positives = 129/219 (58%), Gaps = 8/219 (3%)

Query: 3 MPDVGSLLLVVIMLGLLPFAAMVVTSYTKIVVVLGLLRNAIGVQQVPPNMVLNGVALLVS 62
M + SL+ ++ LLPF T + K +V ++RNA+G+QQ+P NM LNGVALL+S
Sbjct: 1 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS 60

Query: 63 CFVMAPVGMEAFKA-AQNYGAGSDNSRIVVLLDACREPFRQFLLKHTREREKAFFMRSAQ 121
FVM P+ +A+ +D S + +D + +R +L+K++ FF +
Sbjct: 61 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL 120

Query: 122 QIWPKDKAAT-------LKSDDLLILAPAFTLGELTEAFRIGFLLYLVFIVIDLVVANAL 174
+ ++ T ++ + L PA+ L E+ AF+IGF LYL F+V+DLVV++ L
Sbjct: 121 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL 180

Query: 175 MAMGLSQVTPTNVAIPFKLLLFVAMDGWSMLIHGLVLSY 213
+A+G+ ++P ++ P KL+LFVA+DGW++L GL+L Y
Sbjct: 181 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18715  TYPE3IMQPROT  64  3e-17  Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 63.6 bits (155), Expect = 3e-17
Identities = 25/78 (32%), Positives = 44/78 (56%)

Query: 4 DDLVRFTSEALLLCLKVSLPVVGVAALTGLLIAFFQAVMSLQDASISFALKLVVVVAAIA 63
DDLV ++AL L L +S VA + GLL+ FQ V LQ+ ++ F +KL+ V +
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 VTAPWGASAIMQFGQALM 81
+ + W ++ +G+ ++
Sbjct: 62 LLSGWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18855  IGASERPTASE  31  0.002  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.8 bits (69), Expect = 0.002
Identities = 14/70 (20%), Positives = 18/70 (25%), Gaps = 7/70 (10%)

Query: 88 RPQYRPPRPNGSFYNGSRPADSRPQQPDQSPATGAQPSRPPPRIGAPPRVIREIQRQTP- 146
+PQ P R N N PQ + A QP++ P
Sbjct: 1140 QPQAEPARENDPTVNI-----KEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNS 1194

Query: 147 -PRNTREQIP 155
N P
Sbjct: 1195 VVENPENTTP 1204


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18895  PF06057  26  0.037  Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 26.3 bits (58), Expect = 0.037
Identities = 7/27 (25%), Positives = 16/27 (59%)

Query: 55 LPAHTRSLLTLAMMVALGHDEEFKLHV 81
+PA R + A++++ +F++HV
Sbjct: 138 MPARYRKNVLGAVLLSPSQSSDFEIHV 164


45  AXO1947_RS18655  AXO1947_RS18805  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS18655  2  15  0.883124  oxidoreductase
AXO1947_RS18660  2  15  0.654507  oxidoreductase
AXO1947_RS18665  -2  16  -1.266764  NAD-dependent deacetylase
AXO1947_RS18670  -1  16  -2.569238  MFS transporter
AXO1947_RS18675  -1  17  -2.805558  LysR family transcriptional regulator
AXO1947_RS18680  -1  18  -2.260198  hypothetical protein
AXO1947_RS18685  -1  17  -2.514320  preprotein translocase subunit TatD
AXO1947_RS18690  -1  16  -2.612843  hypothetical protein
AXO1947_RS18695  2  19  -1.834684  hypothetical protein
AXO1947_RS18700  0  20  -1.385775  hypothetical protein
AXO1947_RS18705  0  19  -1.844881  hypothetical protein
AXO1947_RS18710  0  20  -3.023092  IS5/IS1182 family transposase
AXO1947_RS18715  1  22  -3.369168  hypothetical protein
AXO1947_RS18720  -2  24  -2.522357  peptidase
AXO1947_RS18725  -1  25  -2.670824  hypothetical protein
AXO1947_RS18730  -1  25  -3.261948  hypothetical protein
AXO1947_RS18735  0  28  -3.340134  aminotransferase V
AXO1947_RS18740  -1  25  -2.606954  hypothetical protein
AXO1947_RS18745  -1  28  -1.874899  hydroxyisourate hydrolase
AXO1947_RS18750  4  24  -3.448296  hypothetical protein
AXO1947_RS18755  3  24  -3.127443  hypothetical protein
AXO1947_RS22820  4  24  -2.563930  ATP-dependent helicase HrpB
AXO1947_RS18765  2  20  -1.761802  hypothetical protein
AXO1947_RS18775  2  19  -1.086035  hypothetical protein
AXO1947_RS18780  1  12  -0.705880  pseudouridine synthase
AXO1947_RS18785  0  11  1.826145  methyltransferase
AXO1947_RS22825  2  12  2.969064  alcohol dehydrogenase
AXO1947_RS18795  2  11  2.933348  NADP-dependent oxidoreductase
AXO1947_RS18805  0  10  3.061939  IS630 family transposase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS19165  TCRTETA  54  5e-10  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 54.1 bits (130), Expect = 5e-10
Identities = 96/379 (25%), Positives = 142/379 (37%), Gaps = 43/379 (11%)

Query: 75 VQPVLPEFARAFHVDAATAS-LPLSLATGALALAIFC--TGAVSENLGRRGLMFASIAIA 131
+ PVLP R + + LA AL GA+S+ GRR ++ S+A A
Sbjct: 24 IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGA 83

Query: 132 AVLNLIAAFLPHWGALVVIRTLSGIALGGVPAVAMVYLGEELPANK-------MGAATGL 184
AV I A P L + R ++GI G AVA Y+ + ++ M A G
Sbjct: 84 AVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGF 142

Query: 185 -YVAGNAFGGMSGRIVMSVLTDHTDWRTALAVLSGFDLLCALAFFWLLPP---SRNFVRR 240
VAG GG+ G + H + A A L+G + L F L R +RR
Sbjct: 143 GMVAGPVLGGLMGGF-----SPHAPFFAA-AALNGLNFL--TGCFLLPESHKGERRPLRR 194

Query: 241 HGINLRFHLRAWAGHLRDRNLPFLFALPFLLM---GVFVCLYNYAGFRLGGPEFGLSQSQ 297
+N R WA + + L A+ F++ V L+ G F +
Sbjct: 195 EALNPLASFR-WARGM--TVVAALMAVFFIMQLVGQVPAALWVI----FGEDRFHWDATT 247

Query: 298 IGMIFSAYVFGIVSS----SVAGAASDRFGRGPVVTAGIVLCVLGVTLTLAHVLAVVVAG 353
IG+ + FGI+ S + G + R G + G++ G L +
Sbjct: 248 IGISLA--AFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFP 305

Query: 354 IVLLTIGFFIAHSAASAWVSRLGGAHRSHAASLYLLAYYAGASTIGALGGWFWQHGGWGA 413
I++L I A A +SR R L A + S +G L + A
Sbjct: 306 IMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAI----YAA 361

Query: 414 LVGMWLTLLAIAFAAAYIL 432
+ W IA AA Y+L
Sbjct: 362 SITTWNGWAWIAGAALYLL 380


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS19305  ACRIFLAVINRP  29  0.018  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.4 bits (66), Expect = 0.018
Identities = 29/169 (17%), Positives = 59/169 (34%), Gaps = 12/169 (7%)

Query: 12 MSVVVAM--SIPVAMAVTPADSATLPAPDTALQVAING---NWRDRVYVQR-DQYRHPGQ 65
+++V AM S+ VA+ +TPA ATL P +A G W + + + Y +
Sbjct: 472 ITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVG 531

Query: 66 TLAFFGIKPTQTVIEITPGGGW-YSEILAPYL-REKGKYVAAVVDPASAPEGRSRDYAQR 123
+ + I G + + + +L E ++ P G +++ Q+
Sbjct: 532 KILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQ---LPAGATQERTQK 588

Query: 124 ARDDLEKKLQAKPQVYGKPSFVSYVPKSPSFGVDNSADLVLTFRNVHNW 172
D + + S + S S N+ ++ +
Sbjct: 589 VLDQVTDYYLKNEKA-NVESVFTVNGFSFSGQAQNAGMAFVSLKPWEER 636


46  AXO1947_RS18855  AXO1947_RS18925  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS18855  2  18  0.009412  xylanase
AXO1947_RS18860  -1  12  0.920762  hypothetical protein
AXO1947_RS18870  -2  16  1.940551  TonB-dependent receptor
AXO1947_RS18875  -1  13  3.189153  MFS transporter
AXO1947_RS18880  0  11  3.356379  alpha-N-arabinofuranosidase
AXO1947_RS18885  3  11  3.754625  membrane protein
AXO1947_RS22840  1  10  3.235899  IS5/IS1182 family transposase
AXO1947_RS18900  1  9  2.940786  hypothetical protein
AXO1947_RS18905  1  10  2.695967  hypothetical protein
AXO1947_RS18910  1  10  1.723835  hypothetical protein
AXO1947_RS18915  1  11  1.679396  alcohol dehydrogenase
AXO1947_RS18920  1  11  1.454271  glycerol-3-phosphate 1-O-acyltransferase
AXO1947_RS18925  2  11  0.611407  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS19380  TCRTETB  46  2e-07  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 46.0 bits (109), Expect = 2e-07
Identities = 37/148 (25%), Positives = 59/148 (39%), Gaps = 3/148 (2%)

Query: 92 VDRQVLGVLAPFLQTQIGWNEIQYGYIVTAFQAAYALGLLCSGAVIDRFGTRLGYALAIG 151
++ VL V P + ++ TAF +++G G + D+ G + I
Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87

Query: 152 IWSLAAMGHALATSVVGFAI-ARFFLGLGESGNFPAALK-TVAEWFPRRERALATGIFNS 209
I ++ + S I ARF G G + FPA + VA + P+ R A G+ S
Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAG-AAAFPALVMVVVARYIPKENRGKAFGLIGS 146

Query: 210 GSNIGAVVAPLLVPLIATAWGWQSAFLF 237
+G V P + +IA W L
Sbjct: 147 IVAMGEGVGPAIGGMIAHYIHWSYLLLI 174


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS19400  BCTLIPOCALIN  67  2e-16  Bacterial lipocalin signature.
		>BCTLIPOCALIN#Bacterial lipocalin signature.

Length = 171

Score = 66.6 bits (162), Expect = 2e-16
Identities = 36/157 (22%), Positives = 65/157 (41%), Gaps = 12/157 (7%)

Query: 26 ALPRAETV----DVPRFMGDWYVIAHIPTRPERNAYDAVESYALRPDGRIQTT---FTFR 78
+P + ++ ++G WY +A + ER Y +R DG I ++
Sbjct: 18 GMPESVKPVSDFELNNYLGKWYEVARLDHSFERGLSQVTAEYRVRNDGGISVLNRGYSEE 77

Query: 79 KGSFQAPLKSMHPIGQVAKEGNGALWSMQFLWPFKAEYVIAWLD-AGYTQTIVARSKRDY 137
KG ++ + + G L + F PF YV+ LD Y+ V+ +Y
Sbjct: 78 KGEWKEAEGKAYFVNGSTD---GYL-KVSFFGPFYGSYVVFELDRENYSYAFVSGPNTEY 133

Query: 138 VWYMARTPQVSDADYQQAVQRIAAMGYDVSQLRRVPQ 174
+W ++RTP V + ++ G+D ++L V Q
Sbjct: 134 LWLLSRTPTVERGILDKFIEMSKERGFDTNRLIYVQQ 170


47  AXO1947_RS19010  AXO1947_RS19045  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS19010  3  16  0.840399  DNA-formamidopyrimidine glycosylase
AXO1947_RS19015  2  16  0.380637  IS5/IS1182 family transposase
AXO1947_RS19020  5  20  0.341844  **phytochrome
AXO1947_RS19025  4  21  -0.053875  heme oxygenase
AXO1947_RS19030  4  21  -0.400448  tetracycline resistance MFS efflux pump
AXO1947_RS19035  4  23  -0.692479  epimerase
AXO1947_RS22855  1  33  -5.919968  hypothetical protein
AXO1947_RS22860  -1  25  -4.058658  hypothetical protein
AXO1947_RS19040  -1  25  -4.023156  hypothetical protein
AXO1947_RS19045  -1  24  -3.324503  IS5/IS1182 family transposase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS19520  PF05043  35  7e-04  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 34.9 bits (80), Expect = 7e-04
Identities = 20/85 (23%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS19565  TCRTETA  243  6e-79  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 243 bits (622), Expect = 6e-79
Identities = 150/403 (37%), Positives = 222/403 (55%), Gaps = 19/403 (4%)

Query: 17 ALIFIFITVLIDVLSFGVIIPVLPGLVRHFTGGDYVQAAVWIGWFGFLFAAIQFVCSPLQ 76
LI I TV +D + G+I+PVLPGL+R + G L+A +QF C+P+
Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSN--DVTAHYGILLALYALMQFACAPVL 63

Query: 77 GALSDRFGRRPVILLSCLG--LDFILMAVAHSLPMLLLARVISGVCSASFSTATAYIADV 134
GALSDRFGRRPV+L+S G +D+ +MA A L +L + R+++G+ A+ + A AYIAD+
Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADI 123

Query: 135 TPADKRAGAFGMLGAAFGIGLVAGPLIGGWLGSMGLRWPFWFAAGLALLNVLYGWFVLPE 194
T D+RA FG + A FG G+VAGP++GG +G PF+ AA L LN L G F+LPE
Sbjct: 124 TDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPE 183

Query: 195 SLPVERRTARLDWSHANPLGALKLLRRYPQVFGLASAVFLANLAHYVYPSIFLLFAGYQY 254
S ERR R + NPL + + R V L + F+ L V +++++F ++
Sbjct: 184 SHKGERRPLRREAL--NPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRF 241

Query: 255 HWGPREVSWVLAGVGVCSIIVNVLLVGRLVRWLGERRALMLGLGCGVIGFVIYGLADSGA 314
HW + LA G+ + ++ G + LGERRALMLG+ G+++ A G
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 315 AFLIGVPISALWALAAPSAQALITREVGADAQGRVQGALTGLVSLAGIAGPLLFANVFAW 374
+ + A + P+ QA+++R+V + QG++QG+L L SL I GPLLF ++A
Sbjct: 302 MAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361

Query: 375 FIGS--------GAPLHLPGAPWLLAGVLLAAGWGMAWKRAGR 409
I + GA L+L P L G+ W A +RA R
Sbjct: 362 SITTWNGWAWIAGAALYLLCLPALRRGL-----WSGAGQRADR 399


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS19580  SECA  34  1e-04  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 33.7 bits (77), Expect = 1e-04
Identities = 10/17 (58%), Positives = 11/17 (64%)

Query: 7 NDPCPCGRPADYARCCG 23
NDPCPCG Y +C G
Sbjct: 882 NDPCPCGSGKKYKQCHG 898


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS19590  PF05043  33  0.002  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 33.4 bits (76), Expect = 0.002
Identities = 19/85 (22%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCLPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F C+ D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


48  AXO1947_RS19120  AXO1947_RS19400  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS19120  2  15  2.718319  hypothetical protein
AXO1947_RS19130  2  12  2.044492  hypothetical protein
AXO1947_RS19135  2  13  1.643096  plasmid stabilization protein ParE
AXO1947_RS19140  2  14  2.336133  hypothetical protein
AXO1947_RS19145  1  12  2.485105  peptidase
AXO1947_RS19150  1  12  2.077230  ATPase
AXO1947_RS19155  0  12  2.368278  thiol reductase thioredoxin
AXO1947_RS19165  0  14  2.585103  attachment protein
AXO1947_RS22890  2  19  2.030531  DNA mismatch repair protein MutT
AXO1947_RS19180  2  25  1.800886  hypothetical protein
AXO1947_RS19190  6  37  -1.542699  ATPase
AXO1947_RS22905  2  33  -3.996556  histidine kinase
AXO1947_RS19195  1  27  -4.251590  CsbD family protein
AXO1947_RS19200  0  27  -3.459340  hypothetical protein
AXO1947_RS19205  1  24  -3.534194  hypothetical protein
AXO1947_RS22910  0  12  -1.313416  hypothetical protein
AXO1947_RS19210  0  11  -1.024381  TonB-dependent receptor
AXO1947_RS19215  -1  12  -0.507228  2-keto-3-deoxygluconate kinase
AXO1947_RS19220  0  13  0.060245  trans-2-enoyl-CoA reductase
AXO1947_RS19230  1  14  1.229140  alpha-hydroxy-acid oxidizing enzyme
AXO1947_RS19240  3  21  2.498695  IS5/IS1182 family transposase
AXO1947_RS19245  3  19  3.029724  hypothetical protein
AXO1947_RS19250  0  18  2.648195  ligand-gated channel
AXO1947_RS19255  -2  22  2.215904  aromatic amino acid aminotransferase
AXO1947_RS19260  -1  16  3.920614  fructose-bisphosphatase class I
AXO1947_RS19265  -1  17  3.912470  hypothetical protein
AXO1947_RS22920  0  16  3.546157  IS5/IS1182 family transposase
AXO1947_RS19275  1  14  3.559107  hypothetical protein
AXO1947_RS19280  0  12  2.834058  IS5/IS1182 family transposase
AXO1947_RS19285  -1  11  2.369497  hypothetical protein
AXO1947_RS19290  -1  13  -0.513703  hypothetical protein
AXO1947_RS19295  -2  12  -1.015872  hypothetical protein
AXO1947_RS19300  -2  10  -0.651426  hypothetical protein
AXO1947_RS19305  0  13  -3.203749  IS630 family transposase
AXO1947_RS19310  -1  14  -3.127541  ABC transporter ATP-binding protein
AXO1947_RS19315  -1  15  -3.667580  IS5/IS1182 family transposase
AXO1947_RS19320  0  21  -4.391182  enterochelin esterase
AXO1947_RS19325  0  23  -5.124353  hypothetical protein
AXO1947_RS19330  1  30  -7.021657  hypothetical protein
AXO1947_RS19335  1  27  -4.093866  hypothetical protein
AXO1947_RS22925  1  24  -4.947003  pseudouridylate synthase
AXO1947_RS19340  -1  19  -3.384798  hypothetical protein
AXO1947_RS19345  -1  16  -3.364243  restriction endonuclease
AXO1947_RS19350  0  19  -3.384277  IS630 family transposase
AXO1947_RS19355  0  18  -2.739679  IS5/IS1182 family transposase
AXO1947_RS19360  2  17  -1.387688  IS630 family transposase
AXO1947_RS19365  0  13  -1.374801  hypothetical protein
AXO1947_RS22935  -1  12  -1.404547  IS5/IS1182 family transposase
AXO1947_RS19370  -1  11  -1.018008  hypothetical protein
AXO1947_RS19380  -2  11  -1.418867  IS5/IS1182 family transposase
AXO1947_RS19385  -2  11  -2.083033  hypothetical protein
AXO1947_RS19395  -2  14  -2.348928  hypothetical protein
AXO1947_RS19400  -1  18  -3.192351  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS19685  PF05616  40  1e-05  Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 39.7 bits (92), Expect = 1e-05
Identities = 41/128 (32%), Positives = 52/128 (40%), Gaps = 22/128 (17%)

Query: 132 PPQGSASGGRIKVDFVGDTSQPDQLVPSPTPVPPTPTPTPVQPPPAASPVQSTLVQQAKN 191
P Q A+ GR D G+T+ Q++P P P + QP P SP ++ A N
Sbjct: 288 PVQVVATFGR---DSQGNTTVDVQVIPRPDLTPGSAEAPNAQPLPEVSPAENP----ANN 340

Query: 192 PVPPQGNTAPGSLAERRQPRRQQRPTPP-QPPAPPAASAQP--RPDTWT--GRPPGMLEE 246
P P N PG+ R P P P A P QP RPD+ RP G +
Sbjct: 341 PAP---NENPGT-------RPNPEPDPDLNPDANPDTDGQPGTRPDSPAVPDRPNGRHRK 390

Query: 247 PADGAEDG 254
EDG
Sbjct: 391 ERKEGEDG 398


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS22990  PF05272  27  0.015  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 27.3 bits (60), Expect = 0.015
Identities = 8/44 (18%), Positives = 16/44 (36%)

Query: 20 PILEQARKRTKPVTVDMYEVWCAVLYLLRTGCPWRALPSDFPKW 63
P+L R + +++ L+L G + P D +
Sbjct: 710 PVLVPGRANLVWLQKFRGQLFAEALHLYLAGERYFPSPEDEEIY 753


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS19825  PF05043  35  4e-04  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 35.3 bits (81), Expect = 4e-04
Identities = 20/86 (23%), Positives = 34/86 (39%), Gaps = 14/86 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAHM 140
+ L+ H NTAH+
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAHL 325


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS19860  PF05043  35  5e-04  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 35.3 bits (81), Expect = 5e-04
Identities = 20/86 (23%), Positives = 34/86 (39%), Gaps = 14/86 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAHM 140
+ L+ H NTAH+
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAHL 325


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS19880  RTXTOXIND  132  4e-36  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 132 bits (333), Expect = 4e-36
Identities = 80/437 (18%), Positives = 148/437 (33%), Gaps = 75/437 (17%)

Query: 47 LAGVLIIALIVLFLATFSTSR---KVQWQGVVVPAGGTVSVIAPSAGSVSKVLVREGEQV 103
L I+ +V+ + G + +G + + V +++V+EGE V
Sbjct: 59 LVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESV 118

Query: 104 HAGQAMFMLSE------------SLGNARLEG------------------------GDGA 127
G + L+ SL ARLE
Sbjct: 119 RKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQN 178

Query: 128 IPELLDERRQSLKRETIERQRKAELLKNSMLERLGDYDEEIRASSQQEQLQVQIIALAEQ 187
+ E R SL +E + + K L++ E + + + +
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKK---RAERLTVLARINRYENLSRVEKS 235

Query: 188 TARTYSDLESSRYVSGIAVREKHL-------DLLGKKQQLVDIRRQIQQLRRERSSLASD 240
+S L + ++ AV E+ +L K QL I +I + E +
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 241 MQSQM------AQSKLDSLDLQRKTFELDQQLTENTGTRTVVVRALGKGVVGTINVAA-G 293
++++ + L L+ E QQ + +RA V + V G
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASV--------IRAPVSVKVQQLKVHTEG 347

Query: 294 QGVGALLSLAVLIPEGRPLEVEMYAPSQAVGLMREGMDVSLRYSAFPYQKFGQQKGRVKA 353
V +L V++PE LEV ++ +G + G + ++ AFPY ++G G+VK
Sbjct: 348 GVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKN 407

Query: 354 ISRSTMRVEEVRLPASLMKSGAEPLYRVRIDIGSPTVRVYGREVPLRPGMLVEGSVNLER 413
I+ + + + L + V I I + + +PL GM V +
Sbjct: 408 INLDAIEDQRLGLV-----------FNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGM 456

Query: 414 RKLYEWIIDPLFTVTGR 430
R + +++ PL
Sbjct: 457 RSVISYLLSPLEESVTE 473


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS19900  PF05043  35  8e-04  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 34.5 bits (79), Expect = 8e-04
Identities = 20/85 (23%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 96 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 155
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 156 E-------------ELLAHTINTAH 167
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS19930  PF06057  29  0.025  Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 28.7 bits (64), Expect = 0.025
Identities = 7/21 (33%), Positives = 12/21 (57%)

Query: 100 GGWRQFEQLVADAFRRQGYSV 120
GGW ++ V ++QG+ V
Sbjct: 61 GGWATLDKAVGGILQQQGWPV 81


49  AXO1947_RS19520  AXO1947_RS19625  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS19520  -1  15  3.180623  ABC transporter permease
AXO1947_RS19550  0  18  3.897556  outer membrane lipid asymmetry maintenance
AXO1947_RS19555  0  20  4.168880  organic solvent ABC transporter
AXO1947_RS19560  -1  16  4.041680  anti-sigma B factor antagonist
AXO1947_RS19565  -1  13  2.781639  hypothetical protein
AXO1947_RS19570  -1  12  3.536625  hypothetical protein
AXO1947_RS19575  0  11  1.937458  IS5/IS1182 family transposase
AXO1947_RS19580  1  12  2.024260  glutathione peroxidase
AXO1947_RS19590  2  10  1.561126  DNA recombination protein RmuC
AXO1947_RS19600  5  10  -0.428159  IS5/IS1182 family transposase
AXO1947_RS19605  4  11  -1.303160  PucR family transcriptional regulator
AXO1947_RS19610  4  12  -1.992523  glycerate kinase
AXO1947_RS19615  3  16  -3.335902  MFS transporter
AXO1947_RS19620  1  17  -1.131917  hypothetical protein
AXO1947_RS19625  2  19  1.977488  oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS20135  VACJLIPOPROT  243  1e-81  VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 243 bits (621), Expect = 1e-81
Identities = 83/249 (33%), Positives = 115/249 (46%), Gaps = 14/249 (5%)

Query: 94 FDALYGSTTPQAGANGAPAQPGAAPAYDPWERYNRGMHRFNMAV-DRGVARPLATAYTKV 152
AL TT G + DP E +NR M+ FN V D + RP+A A+
Sbjct: 5 LSALALGTTLLVGCASSGTD--QQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWRDY 62

Query: 153 VPSPARLGVTNFFDNLGTPLTMVNQLLQGHPVYAVQSLGRFVMNSTLGVAGLFDPASAAG 212
VP PAR G++NF NL P MVN LQG P + RF +N+ LG+ G D A A
Sbjct: 63 VPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGMAN 122

Query: 213 IPRR---SEDFGQTLGAWGWRNSRYFELPLFGPRTVRDTFGLAGDI---PLSWIRHVDDG 266
+ FG TLG +G Y +LP +G T+RD G D LSW+
Sbjct: 123 PKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPVLSWL----TW 178

Query: 267 GTRFALQGLQLVDTRAQLMSLDSLRDQAPDEYALTRDAWMQRRNYQITRDLRSHNEKKNN 326
L+ ++TRAQL+ D L Q+ D Y + R+A+ QR ++ E N
Sbjct: 179 PMSVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNA 238

Query: 327 E-LPDYLRE 334
+ + D L++
Sbjct: 239 QAIQDDLKD 247


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS20145  PF05043  33  0.002  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 33.4 bits (76), Expect = 0.002
Identities = 19/85 (22%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCLPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F C+ D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS20180  HTHFIS  30  0.004  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.004
Identities = 12/65 (18%), Positives = 23/65 (35%), Gaps = 3/65 (4%)

Query: 67 LADDLRQAWQAEQLRKPLQALLAQDRRGQLLKTLSVWSVAGMRMAPTAKALGIHRNTLSY 126
+ A +LA+ +L L+ A LG++RNTL
Sbjct: 412 MRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTA---TRGNQIKAADLLGLNRNTLRK 468

Query: 127 RMQRI 131
+++ +
Sbjct: 469 KIREL 473


50  AXO1947_RS19680  AXO1947_RS19745  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS19680  2  11  2.080087  50S ribosomal protein L34
AXO1947_RS19685  1  10  1.249382
AXO1947_RS19695  1  10  0.168861
AXO1947_RS19700  3  13  0.424417
AXO1947_RS19705  1  14  0.376034
AXO1947_RS19710  1  14  0.065063
AXO1947_RS19715  1  13  0.599717
AXO1947_RS19720  0  19  2.415770
AXO1947_RS19725  1  15  2.160132
AXO1947_RS22970  2  14  2.840440
AXO1947_RS22975  4  12  2.295330
AXO1947_RS19735  3  13  3.060883
AXO1947_RS19745  2  16  2.283820
51  AXO1947_RS19845  AXO1947_RS20085  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS19845  -3  19  -3.051906
AXO1947_RS19850  0  30  -6.725858
AXO1947_RS19855  1  32  -7.080546
AXO1947_RS19860  3  35  -7.142816
AXO1947_RS23010  5  40  -8.498166
AXO1947_RS19865  4  36  -7.636023
AXO1947_RS19875  3  34  -6.328332
AXO1947_RS19880  2  25  -4.768908
AXO1947_RS19885  0  20  -4.136820
AXO1947_RS19890  0  17  -3.633638
AXO1947_RS19895  1  18  -3.970494
AXO1947_RS19900  2  10  0.497557
AXO1947_RS19905  2  13  0.636823
AXO1947_RS19910  1  14  0.480042
AXO1947_RS19915  1  11  0.158345
AXO1947_RS23015  0  14  -1.592451
AXO1947_RS19920  0  14  -1.844973
AXO1947_RS19930  3  20  -3.433948
AXO1947_RS19935  3  22  -4.258580
AXO1947_RS23020  3  24  -5.341440
AXO1947_RS19945  0  21  -3.659894
AXO1947_RS19950  0  21  -3.739830
AXO1947_RS19960  1  22  -4.331848
AXO1947_RS19975  1  19  -2.862668
AXO1947_RS19985  3  24  -2.830757
AXO1947_RS19995  2  24  -1.399385
AXO1947_RS20000  3  23  -2.670669
AXO1947_RS20005  0  19  -0.274389
AXO1947_RS20010  -1  17  0.016037
AXO1947_RS23025  -1  17  0.251773
AXO1947_RS23030  0  23  4.352921
AXO1947_RS20020  -2  22  4.588141
AXO1947_RS20030  -2  15  3.510036
AXO1947_RS20035  -1  14  2.829510
AXO1947_RS20045  -1  13  0.559473
AXO1947_RS20055  0  32  -6.403502
AXO1947_RS20060  -1  29  -4.104339
AXO1947_RS23035  0  19  -3.181516
AXO1947_RS20075  2  17  -0.407216
AXO1947_RS20080  2  14  0.604471
AXO1947_RS20085  2  14  0.460501
52  AXO1947_RS01015  AXO1947_RS01045  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS01015  -2  13  2.125257  hypothetical protein
AXO1947_RS01020  -3  14  2.415576  twin-arginine translocase subunit TatA
AXO1947_RS01025  -2  16  2.571915  twin-arginine translocase subunit TatB
AXO1947_RS01030  -1  18  2.407726  twin-arginine translocase subunit TatC
AXO1947_RS01035  -2  15  2.120962  hypothetical protein
AXO1947_RS01040  0  10  1.094484  GMP synthase
AXO1947_RS01045  1  13  1.880165  IS5/IS1182 family transposase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS01035  PERTACTIN  30  0.013  Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 30.1 bits (67), Expect = 0.013
Identities = 25/86 (29%), Positives = 31/86 (36%), Gaps = 5/86 (5%)

Query: 207 NERPSTDVIAFRDRLEEATYTARANRGTDAAAAGAPPAPRPQTPPPAQAQQPTTVPPANE 266
N+ D+ +R RL A N A APPAP+P P Q PP
Sbjct: 538 NKDGKVDIGTYRYRL-----AANGNGQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPP 592

Query: 267 ASTVPMQPSATPPAKQGFQPVSEGEI 292
P QP P QP + E+
Sbjct: 593 QPPQPPQPPQRQPEAPAPQPPAGREL 618


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS01040  TATBPROTEIN  31  2e-04  Bacterial sec-independent translocation TatB protein...
		>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature.

Length = 171

Score = 31.1 bits (70), Expect = 2e-04
Identities = 10/41 (24%), Positives = 18/41 (43%)

Query: 1 MGGFSIWHWLIVLVIVLLVFGTKRLTSGAKDLGSAVKEFKK 41
M L+V +I L+V G +RL K + ++ +
Sbjct: 1 MFDIGFSELLLVFIIGLVVLGPQRLPVAVKTVAGWIRALRS 41


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS01045  TATBPROTEIN  82  3e-22  Bacterial sec-independent translocation TatB protein...
		>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature.

Length = 171

Score = 82.4 bits (203), Expect = 3e-22
Identities = 36/89 (40%), Positives = 54/89 (60%), Gaps = 1/89 (1%)

Query: 1 MFDIGVGELTLIAVVALVVLGPERLPKAARFAGLWVRRARMQWDSVKQELERELEAEELK 60
MFDIG EL L+ ++ LVVLGP+RLP A + W+R R +V+ EL +EL+ +E +
Sbjct: 1 MFDIGFSELLLVFIIGLVVLGPQRLPVAVKTVAGWIRALRSLATTVQNELTQELKLQEFQ 60

Query: 61 RSLQDVQ-ASLREAEDQLRNKQQQVEQGA 88
SL+ V+ ASL +L+ ++ Q A
Sbjct: 61 DSLKKVEKASLTNLTPELKASMDELRQAA 89


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS01065  PF05043  35  8e-04  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 34.5 bits (79), Expect = 8e-04
Identities = 20/85 (23%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


53  AXO1947_RS02550  AXO1947_RS02575  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AXO1947_RS02550  2  12  -0.234208  DUF1338 domain-containing protein
AXO1947_RS02560  0  13  0.433513  hypothetical protein
AXO1947_RS02565  -1  16  -0.125693  acriflavin resistance protein
AXO1947_RS02570  1  14  0.369849  acriflavine resistance protein B
AXO1947_RS02575  0  11  0.680506  membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS02605  YERSSTKINASE  30  0.019  Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 30.1 bits (67), Expect = 0.019
Identities = 20/75 (26%), Positives = 36/75 (48%), Gaps = 4/75 (5%)

Query: 117 DEAALSANPFRVFTSLLRLELIEDATLRAQAEQILQQRQIFTAGVLQLIERYEQQGGLDA 176
D ++ R + LLR L AT + +L +L +++ E++GG+D
Sbjct: 436 DVRRITPKKLRELSDLLRTHLSSAATKQLDMGGVLSDLDT----MLVALDKAEREGGVDK 491

Query: 177 DQARQFVAEALETFR 191
DQ + F + L+T+R
Sbjct: 492 DQLKSFNSLILKTYR 506


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS02615  ACRIFLAVINRP  732  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 732 bits (1890), Expect = 0.0
Identities = 282/1035 (27%), Positives = 483/1035 (46%), Gaps = 28/1035 (2%)

Query: 3 ISAPFIKRPIGTSLLAIGLFVIGLMCYLRLGVASLPNIQIPIIFVHATQSGTDASTMAST 62
++ FI+RPI +LAI L + G + L+L VA P I P + V A G DA T+ T
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 VTAPLERHLGQLPGIDRMRSSSSEN-SSVVVLVFQSSRNIDSAAQDIQTAINASQSDLPS 121
VT +E+++ + + M S+S S + L FQS + D A +Q + + LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 122 GLGTPMYSKANPNEDPVIAIALTSET--QSADELYNVADSLLAQRLRQITGISLVDIAGA 179
+ S + ++ S+ + D++ + S + L ++ G+ V + GA
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 180 STPAVRVDVDLRALNALGLTTDNLRNAVRAANVTSPTGFL------SDGNTTMAIIANDS 233
A+R+ +D LN LT ++ N ++ N G L +IIA
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 234 VSKAADFAQLAIATQSNGRIVRLGDVATVYDGQQDAYQAAWFNGKPAVVMYAFTRAGANI 293
+F ++ + S+G +VRL DVA V G ++ A NGKPA + GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 294 VETVDQVKAQIPELRSYLQPGTTLTPYFDRTPTIRASLHEVQAALMISLAMVILTMALFL 353
++T +KA++ EL+ + G + +D TP ++ S+HEV L ++ +V L M LFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 354 RRLAPTLIAAIAVPLSLAGSALVMYVLGFTLNNLSLLALVIAIGFVVDDAIVVIENVMRH 413
+ + TLI IAVP+ L G+ ++ G+++N L++ +V+AIG +VDDAIVV+ENV R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 414 L-DEGMSRMEAALAGAREIGFTIVSITASLVAVFIPMLFASGMIGAFFREFTVTLVAAIV 472
+ ++ + EA +I +V I L AVFIPM F G GA +R+F++T+V+A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 473 VSMLVSLTLTPALCSRFLSAHAEP--EKPGRLGAWLDRMHERMLRVYTVALDFSLRHALL 530
+S+LV+L LTPALC+ L + E G W + + + YT ++ L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 531 LSLTPLLLIAATIFLVGAVKKGSFPAQDTGLIWGRANSSATVPFADMVSRQRRITDMLMA 590
L L++A + L + P +D G+ A ++TD +
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 591 DPA------VKTVGARLGSSRQGSSASFNIELKKRDE--GRRDTTADVVARLSAKADRYP 642
+ G Q + +F + LK +E G ++ V+ R + +
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAF-VSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 643 DLDLRLRAIQDLPSDGGGGTSQGAQYRVSLQGNDLAQLQEWLPKLQAALKKNP-RLRNVG 701
D + G + + G L + +L ++P L +V
Sbjct: 659 --DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 702 TDVDTSGLRQNIVIDRAKAARLGVSVGAIDGALYGAFGQRSISTIYSDLNQYSVVVNALP 761
+ + + +D+ KA LGVS+ I+ + A G ++ + V A
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADA 776

Query: 762 SQTATPKALDQIFVPNRAGRMVPITAVATQVPGLAPPQIIHENQYTTMDLSYNLAPGVNT 821
P+ +D+++V + G MVP +A T P++ N +M++ APG ++
Sbjct: 777 KFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSS 836

Query: 822 GEADLIIKSTVDGLRMPDGIRLS-GGDSFNVQLSPNSMGILLLAAVLTVYIVLGMLYESL 880
G+A ++++ ++P GI G S+ +LS N L+ + + V++ L LYES
Sbjct: 837 GDAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESW 894

Query: 881 IHPVTILSTLPAAGVGALLALFITNTELSVISMIALVLLIGIVKKNAIMMIDFALVAQRV 940
PV+++ +P VG LLA + N + V M+ L+ IG+ KNAI++++FA
Sbjct: 895 SIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEK 954

Query: 941 HGMDARAAAREASIVRFRPIMMTTMVAILAAVPLAVGLGEGAELRRPLGIAMIGGLMFSQ 1000
G A A +R RPI+MT++ IL +PLA+ G G+ + +GI ++GG++ +
Sbjct: 955 EGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSAT 1014

Query: 1001 SLTLLSTPALYVIFS 1015
L + P +V+
Sbjct: 1015 LLAIFFVPVFFVVIR 1029



Score = 106 bits (266), Expect = 3e-25
Identities = 79/506 (15%), Positives = 163/506 (32%), Gaps = 31/506 (6%)

Query: 2 NISAPFIKRPIGTSLLAIGLFVIGLMCYLRLGVASLPNIQIPIIFVHA-TQSGTDASTMA 60
N + L+ + ++ +LRL + LP + +G
Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQ 587

Query: 61 STVT----------APLERHLGQLPGIDRMRSSSSENSSVVVLVFQSSRNIDS-AAQDIQ 109
+ + + G + + + V L RN D +A+ +
Sbjct: 588 KVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVI 647

Query: 110 TAINASQSDLPSGLGTPMYSKANPNEDPVIAIALTSETQSA-----DELYNVADSLLAQR 164
+ G P + A E D L + LL
Sbjct: 648 HRAKMELGKIRDGFVIPF--NMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMA 705

Query: 165 LRQITGISLVDIAG-ASTPAVRVDVDLRALNALGLTTDNLRNAVRAANVTSPTGFLSDGN 223
+ + V G T +++VD ALG++ ++ + A + D
Sbjct: 706 AQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRG 765

Query: 224 TTMAIIA---NDSVSKAADFAQLAIATQSNGRIVRLGDVATVYDGQQDAYQAAWFNGKPA 280
+ D +L + + +NG +V T + + + +NG P+
Sbjct: 766 RVKKLYVQADAKFRMLPEDVDKLYVRS-ANGEMVPFSAFTTSHWVYG-SPRLERYNGLPS 823

Query: 281 VVMYAFTRAGANIVETVDQVKAQIPELRSYLQPGTTLTPYFDRTPTIRASLHEVQAALMI 340
+ + G + A + L S L G + + R S ++ A + I
Sbjct: 824 MEIQGEAAPGTS----SGDAMALMENLASKLPAGIGYD-WTGMSYQERLSGNQAPALVAI 878

Query: 341 SLAMVILTMALFLRRLAPTLIAAIAVPLSLAGSALVMYVLGFTLNNLSLLALVIAIGFVV 400
S +V L +A + + + VPL + G L + + ++ L+ IG
Sbjct: 879 SFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSA 938

Query: 401 DDAIVVIENVM-RHLDEGMSRMEAALAGAREIGFTIVSITASLVAVFIPMLFASGMIGAF 459
+AI+++E EG +EA L R I+ + + + +P+ ++G
Sbjct: 939 KNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGA 998

Query: 460 FREFTVTLVAAIVVSMLVSLTLTPAL 485
+ ++ +V + L+++ P
Sbjct: 999 QNAVGIGVMGGMVSATLLAIFFVPVF 1024


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS02620  ACRIFLAVINRP  732    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 732 bits (1892), Expect = 0.0
Identities = 298/1072 (27%), Positives = 497/1072 (46%), Gaps = 65/1072 (6%)

Query: 4 STIFIRRPIATSLLMAGVLLLGILGYRQLPVSALPEIDAPSLVVTTQYPGANATTMASLV 63
+ FIRRPI +L +++ G L QLPV+ P I P++ V+ YPGA+A T+ V
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 64 TTPLERQFGQISGLKMMTSDS-SAGLSTIILQFSMERDINIASQDVQAAIRQAT--LPSS 120
T +E+ I L M+S S SAG TI L F D +IA VQ ++ AT LP
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 121 LPYQPVYNRVNPADAAILTLKLTSDS--LPLREVNRYADAILAQRLSQVPGVGLVSIAGN 178
+ Q + + + ++ SD+ +++ Y + + LS++ GVG V + G
Sbjct: 122 VQQQGIS-VEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 179 VRPAVRIQVNPAQLSNMGLTMESLRSALTQTNVSAPKGSLN------GKTQSYSIGTNDQ 232
A+RI ++ L+ LT + + L N G L G+ + SI +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 233 LTDAAEYRQTII-SYKDGRPVRLADVANVVDGVENDQLAAWADNKPAVLLEIRRQPSANI 291
+ E+ + + DG VRL DVA V G EN + A + KPA L I+ AN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 292 VQTVEQIRSILPQLQSVLPADVHLEVLSDRTETIRASVHEVKFTLVLTIALVVAVIFVFL 351
+ T + I++ L +LQ P + + D T ++ S+HEV TL I LV V+++FL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 352 RRLWATIIPSVAVPLSLAGTFAVMAFAGMSLDNLSLMALVVATGFVVDDAIVMIENIVRY 411
+ + AT+IP++AVP+ L GTFA++A G S++ L++ +V+A G +VDDAIV++EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 412 IEQGKSGP-EAAEIGAKQIGFTVLSLTVSLVAVFLPLLLMPGVTGRLFHEFAWVLSIAVV 470
+ + K P EA E QI ++ + + L AVF+P+ G TG ++ +F+ + A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 471 TSMLVSLTLTPMMCAYLLKPDALPEGEDAHERATAAGKRNLWTRTVGTYERSLDWVLAHQ 530
S+LV+L LTP +CA LLKP + E ++ + +V Y S+ +L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHE--NKGGFFGWFNTTFDHSVNHYTNSVGKILGST 537

Query: 531 PLTLAVAIGAVALTVVLYVAIPKGLLPEQDTGLITGVVQADQNVAFPQMEQRTQAVAAAL 590
L + VA VVL++ +P LPE+D G+ ++Q + ++ V
Sbjct: 538 GRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYY 597

Query: 591 QKDPA--VTGVAAFIGAGTMNPTLNQGQLSIVLKTRSDRDG----LDEVLPRLQKAVAGI 644
K+ V V G N G + LK +R+G + V+ R + + I
Sbjct: 598 LKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKI 657

Query: 645 PGVALFLKPVQDV-TLDTRVAATEYQYSISDVDSSELATWAGRMAEAMRKLP-ELADVDN 702
+ + + L T + + L ++ + P L V
Sbjct: 658 RDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 703 NLANQGRALELSIDRDKASMLGVPMQTIDDTLYDAFGQRQISTIFTELNQYRVVLDVAPE 762
N +L +D++KA LGV + I+ T+ A G ++ ++ + +
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 763 FRSSTALMNQLAVASNGSGALTGTNATSFGQVTSSNSSTATGVGAQNTGIVVGAGSIIPL 822
FR +++L V S G ++P
Sbjct: 778 FRMLPEDVDKLYVRSA-------------------------------------NGEMVPF 800

Query: 823 AALAEAKVTNTPLVVSHQQQLPAVTISFNLAPGHSLSQAVAAIEKARQELNIPTQVHAQF 882
+A + + LP++ I APG S A+A +E +L P + +
Sbjct: 801 SAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASKL--PAGIGYDW 858

Query: 883 VGKAAEFTGSQTDIIWLLLASIVVIYIVLGVLYESYIHPLTIISTLPPAGVGALLALMMC 942
G + + S L+ S VV+++ L LYES+ P++++ +P VG LLA +
Sbjct: 859 TGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLF 918

Query: 943 GLSLSVDGIVGIVLLIGIVKKNAIMMIDFAIDA-RREGANAHDAIRRACLLRFRPIMMTT 1001
V +VG++ IG+ KNAI++++FA D +EG +A A +R RPI+MT+
Sbjct: 919 NQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTS 978

Query: 1002 AAAMLGALPLALGTGIGSELRRPLGIAIVGGLLLSQLVTLYTTPVIYLYMER 1053
A +LG LPLA+ G GS + +GI ++GG++ + L+ ++ PV ++ + R
Sbjct: 979 LAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030



Score = 75.3 bits (185), Expect = 9e-16
Identities = 76/460 (16%), Positives = 160/460 (34%), Gaps = 52/460 (11%)

Query: 611 TLNQGQLSIVLKTRSDRD---GLDEVLPRLQKAVAGIPGVALFLKPVQDVTLDTRVAATE 667
+ + G ++I L +S D +V +LQ A +P + + + +
Sbjct: 82 SDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQQGISVEKSSSSYLMVAG 141

Query: 668 YQYSISDVDSSELATWAGR-MAEAMRKLPELADVDNNLANQGRALELSIDRDKASMLGVP 726
+ +++ + + + + +L + DV L A+ + +D D
Sbjct: 142 FVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDV--QLFGAQYAMRIWLDADL------- 192

Query: 727 MQTIDDTLYDAFGQRQISTIFTELNQYRVVL-DVAPEFRSSTALMNQLAVASNGS-GALT 784
LN+Y++ DV L Q + G G
Sbjct: 193 -----------------------LNKYKLTPVDV------INQLKVQNDQIAAGQLGGTP 223

Query: 785 GTNATSFGQVTSSNSSTATGVGAQNTGIVVGA-GSIIPLAALAEAKVT--NTPLVVSHQQ 841
+ + + V + GS++ L +A ++ N ++
Sbjct: 224 ALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARING 283

Query: 842 QLPAVTISFNLAPGHSLSQAVAAIEKARQELN--IPTQVHAQFVGKAAEF-TGSQTDIIW 898
+ PA + LA G + AI+ EL P + + F S +++
Sbjct: 284 K-PAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVK 342

Query: 899 LLLASIVVIYIVLGVLYESYIHPLTIISTLPPAGVGALLALMMCGLSLSVDGIVGIVLLI 958
L +I+++++V+ + ++ L +P +G L G S++ + G+VL I
Sbjct: 343 TLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAI 402

Query: 959 GIVKKNAIMMIDFAIDARRE-GANAHDAIRRACLLRFRPIMMTTAAAMLGALPLALGTGI 1017
G++ +AI++++ E +A ++ ++ +P+A G
Sbjct: 403 GLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGS 462

Query: 1018 GSELRRPLGIAIVGGLLLSQLVTLYTTPVIYLYMERGGER 1057
+ R I IV + LS LV L TP + + +
Sbjct: 463 TGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSA 502


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS02625  RTXTOXIND  51     6e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 50.6 bits (121), Expect = 6e-09
Identities = 30/149 (20%), Positives = 58/149 (38%), Gaps = 22/149 (14%)

Query: 64 ASALGTVTAL-NTVTVSPQVSGQLMSLNFKEGQEVKKGDLLAQIDPRT-------LQASY 115
A+A G +T + + P + + + KEG+ V+KGD+L ++ Q+S
Sbjct: 84 ATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSL 143

Query: 116 DQALAAKRQNQALLA---TSRVNYQRSNDPAYKQYVS-----------RTDLDTQRNQVA 161
QA + + Q L +++ + D Y Q VS + T +NQ
Sbjct: 144 LQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKY 203

Query: 162 QYEAAVSANDAQMRSAQVQLQFTRITAPI 190
Q E + A+ + ++ + +
Sbjct: 204 QKELNLDKKRAERLTVLARINRYENLSRV 232



Score = 37.1 bits (86), Expect = 1e-04
Identities = 23/178 (12%), Positives = 63/178 (35%), Gaps = 29/178 (16%)

Query: 93 EGQEVKKGDLLAQIDPRTLQASYDQ-------ALAAKR----------QNQALLATSRVN 135
+ + ++ +LA+I+ + ++ +L K+ +N+ + A + +
Sbjct: 210 DKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELR 269

Query: 136 YQRSNDPAYKQYVSRTDLD-TQRNQVAQYEAAVSANDAQMRSAQV---------QLQFTR 185
+S + + + Q+ + E + + Q +
Sbjct: 270 VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASV 329

Query: 186 ITAPIDGIAGIRGV-DVGNIVSSTSTLVTLT-QIRPIYVSFNLPERELQAVRAGQAAT 241
I AP+ V G +V++ TL+ + + + V+ + +++ + GQ A
Sbjct: 330 IRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAI 387


54  AXO1947_RS02705  AXO1947_RS02725  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias     Product
AXO1947_RS02705  -1      18       -0.920920   sigma-54-dependent Fis family transcriptional
AXO1947_RS02710  -2      21       -2.523384   hypothetical protein
AXO1947_RS02715  -2      20       -2.548928   response regulator
AXO1947_RS02720  -1      19       -2.692320   IS5/IS1182 family transposase
AXO1947_RS02725  -1      15       -2.627411   methyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS02765  HTHFIS  359    e-123    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 359 bits (924), Expect = e-123
Identities = 146/470 (31%), Positives = 226/470 (48%), Gaps = 49/470 (10%)

Query: 7 MYRLSCAIIDDDVEFCDQVVELATDSGFRAKGIHTLGEASRWLDSNFPDLLVVDVGLPDG 66
M + + DDD + + + +G+ + RW+ + DL+V DV +PD
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 67 SGFDLIERL-DPDHTPQIVVVSGDYARETQGRAQQFGVSEFLTKPFAPER---------- 115
+ FDL+ R+ ++V+S T +A + G ++L KPF
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 116 -LERVLGGLREAQQGNLGIVGNSDSIAMLRKEIVRVAPTDLNVLVTGETGTGKDLVARAI 174
+R L + Q + +VG S ++ + + + R+ TDL +++TGE+GTGK+LVARA+
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 175 HRVSGRSGR-FVPVNCGAIPEELLASQLFGHERGSFTGADRRHAGFLEQAAGGTLFLDEI 233
H R FV +N AIP +L+ S+LFGHE+G+FTGA R G EQA GGTLFLDEI
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 234 GEMPKRLQVYLLRAIESRSFMRVGGNEEIALDARVVAATHQHVQRE--QAVLREDLFYRL 291
G+MP Q LLR ++ + VGG I D R+VAAT++ +++ Q + REDL+YRL
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 292 NEYPLQVPPLRERRGDARLLGLRVIDELNVKYGKRKLPTKALLRYLACHAWPGNVRELRS 351
N PL++PPLR+R D L + + + K + L + H WPGNVREL +
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELEN 360

Query: 352 FIHYLYLRSDGDLLSAPDVEQAVPQ----------------------------------A 377
+ L D+++ +E +
Sbjct: 361 LVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFG 420

Query: 378 DEDGLLIPAGWTMRQAEDAMIESALARTRFNKKAAARELGISVRTLHNRL 427
D + + E +I +AL TR N+ AA LG++ TL ++
Sbjct: 421 DALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKI 470


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS02770  FERRIBNDNGPP  26     0.013    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 26.1 bits (57), Expect = 0.013
Identities = 10/35 (28%), Positives = 16/35 (45%)

Query: 20 NETSKPLETLRTAHQSAVELLNQLGEAERALQQAD 54
++ +PL R + +LLN AE L Q +
Sbjct: 125 SDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYE 159


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS02775  HTHFIS  35     3e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 35.2 bits (81), Expect = 3e-05
Identities = 22/86 (25%), Positives = 33/86 (38%), Gaps = 4/86 (4%)

Query: 6 LTGRRILVVEDDFLLAESLNDLLVEAGVRVLGPVGNVPDALSLVASGQAIDGALLDVNVR 65
+TG ILV +DD + LN L AG V N +A+G D + DV +
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGD-GDLVVTDVVMP 58

Query: 66 GHAVFPVADALMER--GVPFSFCSGY 89
F + + + +P S
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQ 84


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits        Score  E-value  Comments
AXO1947_RS02785  GPOSANCHOR  35     0.001    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 35.4 bits (81), Expect = 0.001
Identities = 15/113 (13%), Positives = 36/113 (31%), Gaps = 2/113 (1%)

Query: 630 QEVEARELMELPAPRDHEHDDSRVHV--LESELRITRERLQSMIEEIESTNEQLKSSNEE 687
E M + LE+E + Q + +S L +S E
Sbjct: 265 LEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREA 324

Query: 688 LQSANEDLETSKEELQSVNEEVTTVNGELAHRVQELAHANSDLKNLLESTQIA 740
+ + + +E+ + ++ +L + ++ + L E +I+
Sbjct: 325 KKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKIS 377



Score = 30.4 bits (68), Expect = 0.049
Identities = 11/85 (12%), Positives = 34/85 (40%)

Query: 656 LESELRITRERLQSMIEEIESTNEQLKSSNEELQSANEDLETSKEELQSVNEEVTTVNGE 715
LE+ + L+ + + + ++K+ E + + + + Q +N ++ +
Sbjct: 258 LEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRD 317

Query: 716 LAHRVQELAHANSDLKNLLESTQIA 740
L + ++ + L E +I+
Sbjct: 318 LDASREAKKQLEAEHQKLEEQNKIS 342


55  AXO1947_RS02840  AXO1947_RS02900  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias     Product
AXO1947_RS02840  -1      14       0.132931    sugar ABC transporter permease
AXO1947_RS02845  -1      16       0.019793    glutathione peroxidase
AXO1947_RS02855  -1      18       0.764377    hypothetical protein
AXO1947_RS02860  0       15       0.786955    protease
AXO1947_RS02865  -1      13       0.663116    adhesin
AXO1947_RS02870  -1      15       0.616242    two-component sensor histidine kinase
AXO1947_RS02880  -1      13       -0.316253   PhoP family transcriptional regulator
AXO1947_RS02885  -2      15       -0.641515   hypothetical protein
AXO1947_RS02890  -2      14       -1.153966   hypothetical protein
AXO1947_RS20735  -2      15       -0.880077   flagellar motor protein MotB
AXO1947_RS02895  -1      13       0.218578    flagellar motor stator protein MotA
AXO1947_RS02900  0       13       2.247658    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS02905  ABC2TRNSPORT  73     1e-17    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 73.4 bits (180), Expect = 1e-17
Identities = 55/241 (22%), Positives = 105/241 (43%), Gaps = 1/241 (0%)

Query: 7 AAIYRFEMARAFRTLTQSIASPVLSTSLYFVVFGAAIGARMGDIDGISYGAFIIPGLVML 66
A++R + S+ + +Y GA +G +G + G+SY AF+ G+V
Sbjct: 17 IAVWRRNYIAWKKAALASLLGHLAEPLIYLFGLGAGLGVMVGRVGGVSYTAFLAAGMVAT 76

Query: 67 SLLNESISNASFGIYMPRWA-GTIYEVLSAPVAWWEIVIGYVGAAATKSVMLGLLILLTA 125
S + + + + T +L + +IV+G + AATK+ + G I + A
Sbjct: 77 SAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVA 136

Query: 126 RLFVPYQIAHPVWMLGFLVLTALTFSLFGFIIGIWADGFEKLQVIPLMVVTPLTFLGGSF 185
Q ++ L + LT L F+ G ++ A ++ +V+TP+ FL G+
Sbjct: 137 AALGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAV 196

Query: 186 YSINMLPPLWQKVTLFNPVVYLISGFRWSFYGKADVHIAVSTGMTFLFLVVCLGVVAAIF 245
+ ++ LP ++Q F P+ + I R G V + G +++V+ + A+
Sbjct: 197 FPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALL 256

Query: 246 R 246
R
Sbjct: 257 R 257


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits        Score  E-value  Comments
AXO1947_RS02920  SUBTILISIN  194    7e-59    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 194 bits (495), Expect = 7e-59
Identities = 109/367 (29%), Positives = 150/367 (40%), Gaps = 68/367 (18%)

Query: 175 YQWHMQDSAGGIRAPKAWETSTGGGVVVAVIDTGILPDHPDLKNNDHILQGYDFITNAIV 234
+ I+AP W + G GV VAV+DTG DHPDLK I+ G +F
Sbjct: 18 QVNEIPRGVEMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKAR--IIGGRNF------ 69

Query: 235 SRRATDDRVPGALDYGDWIDDDNTCLQRARASSWHGTHTTGTIGELTNNGIGGVGAAHDA 294
DDD + + + HGTH GTI T N G VG A +A
Sbjct: 70 ------------------TDDDEGDPEIFKDYNGHGTHVAGTIA-ATENENGVVGVAPEA 110

Query: 295 QILPIRALGQCG-GMSSDIADAIVWASGGHVDGVPDNTHPAEVISMSLGGFGSCDSNTQQ 353
+L I+ L + G G I I +A VD +ISMSLGG +
Sbjct: 111 DLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVD----------IISMSLGGPEDVPE-LHE 159

Query: 354 AINTAVANGTTVVVAAGNDGIDAAE----SSPASCSNVITVGATRITGGIAFYSNFGSVV 409
A+ AVA+ V+ AAGN+G P + VI+VGA + +SN + V
Sbjct: 160 AVKKAVASQILVMCAAGNEGDGDDRTDELGYPGCYNEVISVGAINFDRHASEFSNSNNEV 219

Query: 410 DLAGPGGGQDQDTGHGGWDGLVLSTGYSGKTTPTSGQYKYLGYAGTSMASPHVAAVAALV 469
DL PG +LST G KY ++GTSMA+PHVA AL+
Sbjct: 220 DLVAPGED-------------ILSTVPGG---------KYATFSGTSMATPHVAGALALI 257

Query: 470 QSALASTGKTPLNPSQLQAVLKQTARAFPVPPPTATPIGTGIVDATAAMDYVRTNCSGSS 529
+ ++ + L +L A L + P G G++ TA + R +
Sbjct: 258 KQLANASFERDLTEPELYAQLIKRTIPLGNSPKME---GNGLLYLTAVEELSRIFDTQRV 314

Query: 530 CKPVSTA 536
+STA
Sbjct: 315 AGILSTA 321


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS02925  OMADHESIN  55     1e-09    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 54.9 bits (131), Expect = 1e-09
Identities = 50/140 (35%), Positives = 74/140 (52%), Gaps = 5/140 (3%)

Query: 736 VAATSTATGSTAVANDVTGTAIGGSAYAHGPNDTAIGSNARVNADGSTAVGANTQIAAVA 795
+ AT+ A AVA A G ++ A GP A+G +A STA I A A
Sbjct: 75 IGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARA 134

Query: 796 TNA---VAMGEGAQVSAASATAIGQGARATAQG--AVAVGQGAVADRANTVSVGSVGAER 850
+ + VA+G ++ A ++ AIG + A ++A+G + DR N+VS+G R
Sbjct: 135 STSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNR 194

Query: 851 QITNVAAGRSDTDAANVAQV 870
Q+T++AAG DTDA NVAQ+
Sbjct: 195 QLTHLAAGTKDTDAVNVAQL 214



Score = 39.1 bits (90), Expect = 7e-05
Identities = 63/254 (24%), Positives = 101/254 (39%), Gaps = 23/254 (9%)

Query: 312 SNYAIALGYNANVFPNLPGNTDSVAIGHSAGSFAPNTVSLGAYALASDQDGIAVGHNSWA 371
++ A+ L Y G ++ A G + + + A+A IA G NS A
Sbjct: 43 ADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVA 102

Query: 372 LRANSVVLGSGAISSWFNPNSTALGAATRTDGVDATSIGYGAKVGSWVDDAWNRAPVSAV 431
+ S LG A+ + + + DGV + + G V ++V
Sbjct: 103 IGPLSKALGDSAV-------TYGAASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSV 155

Query: 432 ALGAFSH--ATRNYSVAVGDVA------------SGLTRQITSVAAGTEATDAVNKGQLD 477
A+G SH A YS+A+GD + L RQ+T +AAGT+ TDAVN QL
Sbjct: 156 AIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLK 215

Query: 478 ALAADVQATSGVLQANGEGTASATGEH--STAAGAGASASGARSAAVAAGSRASAAGASA 535
Q + A A+A ++ S+ G + + ++SA +R A S
Sbjct: 216 KEIEKTQENTNKRSAELLANANAYADNKSSSVLGIANNYTDSKSAETLENARKEAFAQSK 275

Query: 536 LGVDSSANGVNSTA 549
++ + NS A
Sbjct: 276 DVLNMAKAHSNSVA 289



Score = 39.1 bits (90), Expect = 8e-05
Identities = 39/105 (37%), Positives = 55/105 (52%), Gaps = 4/105 (3%)

Query: 491 QANGEGTASATGEHSTAAGAGASASGARSAAVAAGSRASAAGASALGVDSSANGVNSTAM 550
G ASA G HS A GA A A+ + AV AGS A+ + A+G S A G ++
Sbjct: 58 PGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTY 117

Query: 551 GYNSFVRQSGVNGVALGANAGASGADSVALGSGSRTYEANTVSVG 595
G S ++ +GVA+GA A S VA+G S+ N+V++G
Sbjct: 118 GAASTAQK---DGVAIGARASTSDT-GVAVGFNSKADAKNSVAIG 158



Score = 38.3 bits (88), Expect = 1e-04
Identities = 52/175 (29%), Positives = 83/175 (47%), Gaps = 18/175 (10%)

Query: 192 ALGGGAKATAALATAVGSGSEARNVQSTALGYRARAFEDGATAVGGLSVASGYLSTANGY 251
ALG + A G + A+ + S A+G A A + A AVG S+A+G S A G
Sbjct: 46 ALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGP 105

Query: 252 FARATGTSSVALGNTALASGIDSVAIGGVSTAAAAAGNSSSLTAATGVGSVALGAGAVTQ 311
++A G S+V G + A D VAIG A+T VA+G +
Sbjct: 106 LSKALGDSAVTYGAASTAQK-DGVAIGA--------------RASTSDTGVAVGFNSKAD 150

Query: 312 SNYAIALGYNANVFPNLPGNTDSVAIGHSAGSFAPNTVSLGAYALASDQDGIAVG 366
+ ++A+G++++V N + S+AIG + + N+VS+G +L +A G
Sbjct: 151 AKNSVAIGHSSHVAAN---HGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAG 202



Score = 32.6 bits (73), Expect = 0.009
Identities = 51/178 (28%), Positives = 81/178 (45%), Gaps = 12/178 (6%)

Query: 94 LQIVGGSDPAEDVGAFAAEPYAVAIGEASNALGEGGIALGAGATVTAKHAIATGYAAAAS 153
+QI +DPA + P A G ++A G IA+GA A A+A G + A+
Sbjct: 37 VQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIAT 96

Query: 154 GESAVAIGGTTEIFDYNDAGEIIGSHQDSTEASQFGAVALGGGAKATAALATAVGSGSEA 213
G ++VAIG ++ D+ G + +Q VA+G A +T+ AVG S+A
Sbjct: 97 GVNSVAIGPLSKAL--GDSAVTYG----AASTAQKDGVAIGARA-STSDTGVAVGFNSKA 149

Query: 214 RNVQSTALGYRARAFEDGATAVGGLSVASGYLSTANGYFARATGTSSVALGNTALASG 271
S A+G+ + + G S+A G S + + + G S+ T LA+G
Sbjct: 150 DAKNSVAIGHSSHVAAN-----HGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAG 202


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS02935  HTHFIS  61     4e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 4e-13
Identities = 34/121 (28%), Positives = 58/121 (47%), Gaps = 2/121 (1%)

Query: 15 SVAVLEDDALLREDILIPGLREFGFRVSGAGTAGELYRLMLQQAFDLVVLDLGLPDESGL 74
++ V +DDA +R L L G+ V A L+R + DLVV D+ +PDE+
Sbjct: 5 TILVADDDAAIRTV-LNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 75 SVVTYLRSLFAGLGIVVLTGNCGRSDHARALHGGADAFLRKPTD-PEILALTLRNLAQRL 133
++ ++ L ++V++ +A GA +L KP D E++ + R LA+
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 134 R 134
R
Sbjct: 124 R 124


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS02945  cloacin  29     0.010    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 29.3 bits (65), Expect = 0.010
Identities = 18/61 (29%), Positives = 27/61 (44%)

Query: 126 QNANADEDGDADAEDAADAAGTDADAEDAAAENATASADDSAADAEASDDAEGDSAGKKA 185
Q A D + A DAA +DADA ++A + +D AE + + E + K
Sbjct: 398 QRAQTDVNNKQAAFDAAAKEKSDADAALSSAMESRKKKEDKKRSAENNLNDEKNKPRKGF 457

Query: 186 K 186
K
Sbjct: 458 K 458


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits        Score  E-value  Comments
AXO1947_RS02950  OMPADOMAIN  39     2e-05    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 39.1 bits (91), Expect = 2e-05
Identities = 31/116 (26%), Positives = 50/116 (43%), Gaps = 17/116 (14%)

Query: 174 FDIGRDQLKPYTVAILHELSNFINQV-PNHISIT--GHTDTTAYSSDAGYTNWELSADRA 230
F+ + LKP A L +L + ++ + P S+ G+TD SDA N LS RA
Sbjct: 223 FNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIG--SDA--YNQGLSERRA 278

Query: 231 NAARRALVGGGMSDAKVTRV-VGLSSSVLFDKTDPQNP---------INRRISIVV 276
+ L+ G+ K++ +G S+ V + D +RR+ I V
Sbjct: 279 QSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits        Score  E-value  Comments
AXO1947_RS02960  GPOSANCHOR  27     0.044    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 27.0 bits (59), Expect = 0.044
Identities = 19/68 (27%), Positives = 23/68 (33%), Gaps = 3/68 (4%)

Query: 13 LATALAGCSAGSPPPTTEAAKPAGTEPATANSADTRP-PAVGTDASVAGPLPRPGRT--P 69
LA AG ++ S P + A A A T+P LP G T P
Sbjct: 455 LAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTGETANP 514

Query: 70 AHTAPAAA 77
TA A
Sbjct: 515 FFTAAALT 522


56  AXO1947_RS03320  AXO1947_RS03370  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias     Product
AXO1947_RS03320  -1      18       1.821166    NAD-dependent dehydratase
AXO1947_RS20790  0       19       1.106844    hypothetical protein
AXO1947_RS03325  -1      19       1.077596    hypothetical protein
AXO1947_RS20795  1       14       0.837023    glycosyl transferase
AXO1947_RS03330  1       14       0.355913    acyltransferase
AXO1947_RS03335  1       15       -0.469593   IS5/IS1182 family transposase
AXO1947_RS03340  2       16       -0.377500   ABC transporter ATP-binding protein
AXO1947_RS03345  2       12       -1.985086   sugar ABC transporter permease
AXO1947_RS03350  1       16       -3.519358   electron transfer flavoprotein subunit alpha
AXO1947_RS03355  2       23       -5.915127   EtfB protein
AXO1947_RS03360  1       25       -6.269730   dTDP-glucose 4,6-dehydratase
AXO1947_RS03365  2       44       -9.345840   glucose-1-phosphate thymidylyltransferase
AXO1947_RS20800  5       61       -13.366535  dTDP-4-dehydrorhamnose 3,5-epimerase
AXO1947_RS03370  4       55       -11.568815  NAD(P)-dependent oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03370  NUCEPIMERASE  149    3e-44    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 149 bits (378), Expect = 3e-44
Identities = 80/367 (21%), Positives = 131/367 (35%), Gaps = 62/367 (16%)

Query: 5 VIVTGGAGFIGCALSGQLKAFGLPVVAIDNLHP--QIHAESKRPEAL-DEAAHLHIGDVT 61
+VTG AGFIG +S +L G VV IDNL+ + + R E L H D+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLA 62

Query: 62 EENTWGQVLENWQPTVVVHLAAETGTGQSLTEATRHAHVNVVGTTAMLDAFSARKLVPEH 121
+ + + V SL +A N+ G +L+ R +H
Sbjct: 63 DREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEG--CRHNKIQH 120

Query: 122 VLLASSRAVYGEGAWLDANGTTFYPPPRSHEVLARSQWNPLSPSGGGAASPLSHRADTVF 181
+L ASS +VYG P S + D
Sbjct: 121 LLYASSSSVYGLNR----------KMPFSTD----------------------DSVDH-- 146

Query: 182 PNPTSVYGATKLAQEHILAAWCGAMQVPLSVFRLQNVYGPGQSPFNSYTGIITLFHRMAR 241
P S+Y ATK A E + + +P + R VYGP P + F +
Sbjct: 147 --PVSLYAATKKANELMAHTYSHLYGLPATGLRFFTVYGPWGRPDMAL----FKFTKAML 200

Query: 242 KAQTLEIYEDGEIGRDFVFIDDVVVALMAGLRQPP-----------------AGLRTLDV 284
+ +++++Y G++ RDF +IDD+ A++ P A R ++
Sbjct: 201 EGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNI 260

Query: 285 GSGVVTTIAEAAKSIAAMHGAPDPQISGKFRDGDVRWAVADGAPLEQSLGVQARINFQEG 344
G+ + + +++ G + + GDV AD L + +G ++G
Sbjct: 261 GNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDG 320

Query: 345 ANRVGEW 351
W
Sbjct: 321 VKNFVNW 327


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS03400  PF05043  35     7e-04    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 34.9 bits (80), Expect = 7e-04
Identities = 20/85 (23%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS03410  INTIMIN  29     0.045    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 28.9 bits (64), Expect = 0.045
Identities = 15/43 (34%), Positives = 25/43 (58%), Gaps = 1/43 (2%)

Query: 305 GDPLVEFNSSEDIRLTVVARLNKSLSYPAVGFMLKDRKGQYIL 347
G+ + + + S+DI L+ + LNK L Y + M+K GQ I+
Sbjct: 70 GETVADLSKSQDINLSTIWSLNKHL-YSSESEMMKAEPGQQII 111


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03415  ABC2TRNSPORT  37     5e-05    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 36.8 bits (85), Expect = 5e-05
Identities = 24/105 (22%), Positives = 45/105 (42%), Gaps = 6/105 (5%)

Query: 158 LAILPVVLF----MMGLAWLLSALGVFLRDTAQITAIITTAIMFLTPIFYPIDAIPPTFR 213
L LPV+ L +++AL ++ T I+FL+ +P+D +P F+
Sbjct: 148 LYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQ 207

Query: 214 PILNINPLAPVIAQIRNVLIWGHG--LKPLEYSLCLVISALVFIA 256
PL+ I IR +++ + +LC+ I F++
Sbjct: 208 TAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLS 252


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03430  NUCEPIMERASE  191    2e-60    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 191 bits (486), Expect = 2e-60
Identities = 88/346 (25%), Positives = 135/346 (39%), Gaps = 42/346 (12%)

Query: 5 LVTGGAGFIGGNFVLEAVARGIRVVNLDALT--YAGNLNTL-ASLEGNPDHVFVKGDIGD 61
LVTG AGFIG + + G +VV +D L Y +L L P F K D+ D
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLAD 63

Query: 62 GMLVARLLQEHQPDAVLNFAAESHVDRSIEGPGAFIHTNVVGTLALLEAVRDYWKSLPTA 121
+ L + V V S+E P A+ +N+ G L +LE R
Sbjct: 64 REGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN------- 116

Query: 122 RSDAFRFLHVSTDEVYGTLGETGKFTETTPYA-PNSPYSASKAASDHLVRAFHHTYGLPV 180
L+ S+ VYG L F+ P S Y+A+K A++ + + H YGLP
Sbjct: 117 --KIQHLLYASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPA 173

Query: 181 LTTNCSNNYGPYHFPEKLIPLVIAKALAGEPLPVYGDGKQVRDWLFVSDHCEAIRTVL-- 238
YGP+ P+ + L G+ + VY GK RD+ ++ D EAI +
Sbjct: 174 TGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDV 233

Query: 239 ----------------AKGKVGETYNVGGNSERQNIEVVQAICALLDQHRPRDDGKPRAS 282
A YN+G +S + ++ +QA+ L +
Sbjct: 234 IPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG----------IEA 283

Query: 283 QITYVTDRPGHDRRYAIDASKLKNELGWEPSYTFEQGIAQTVQWYL 328
+ + +PG + D L +G+ P T + G+ V WY
Sbjct: 284 KKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYR 329


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03445  NUCEPIMERASE  41     4e-06    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 40.9 bits (96), Expect = 4e-06
Identities = 36/159 (22%), Positives = 55/159 (34%), Gaps = 17/159 (10%)

Query: 1 MTTLVFGANGQVGTELLRALAVDG----AVQATT----------RSGQLP-DGSACETAD 45
M LV GA G +G + + L G + R L G D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 46 FDAPETLTALLDRIKPSRVVNAAAYTAVDRAEQDRERATRANATAPGVIAAWCASNRVP- 104
E +T L RV + AV + ++ +N T I C N++
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 105 LVHYSTDYVFDGQGTAPYREDAQTS-PLGVYGETKLAGE 142
L++ S+ V+ P+ D P+ +Y TK A E
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANE 159


57  AXO1947_RS03555  AXO1947_RS03605  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias     Product
AXO1947_RS03555  -2      19       -3.751341   phosphoribosylformylglycinamidine synthase
AXO1947_RS03560  -2      11       -1.652066   hypothetical protein
AXO1947_RS03565  -2      12       -0.862341   protease
AXO1947_RS20830  -2      8        -0.775089   type II secretion system protein GspE
AXO1947_RS03570  -2      10       -1.782098   hypothetical protein
AXO1947_RS20835  0       20       -4.644431   general secretion pathway protein GspF
AXO1947_RS03575  1       18       -4.140760   type II secretion system protein GspG
AXO1947_RS20840  1       22       -5.610551   type II secretion system protein GspH
AXO1947_RS03580  1       19       -3.916620   general secretion pathway protein GspI
AXO1947_RS20845  3       21       -3.895655   general secretion pathway protein GspJ
AXO1947_RS20850  0       15       -1.988801   general secretion pathway protein GspK
AXO1947_RS03590  0       17       1.792394    general secretion pathway protein GspL
AXO1947_RS03595  1       14       1.691945    general secretion pathway protein GspM
AXO1947_RS03600  1       12       1.982916    hypothetical protein
AXO1947_RS03605  3       8        1.281570    type II secretion system protein GspD
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS03630  OMADHESIN  32     0.017    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 32.2 bits (72), Expect = 0.017
Identities = 46/187 (24%), Positives = 71/187 (37%), Gaps = 27/187 (14%)

Query: 634 PKMHRDAVHPAAPQWPVLQTASLDLQQAGLRVLA--HPTVASKSFLVTIGDRSVGGLTAR 691
P + +P P PV L+ G+ +A A+K V +G S+
Sbjct: 45 PALG--LEYPVRP--PVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIA-TGVN 99

Query: 692 EQMIGPWQLPLADCAITMAGFDTFEGEAMSIGERTPLALLNAAASARMAVGEAITNLCAA 751
IGP L D A+T T + + ++IG R + + +AVG
Sbjct: 100 SVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARA------STSDTGVAVG--------- 144

Query: 752 PVQRLDSIKLSANWMAAAGHAGEDALLYDAVRAVGMELCPALELSVPVGKDSLSMQAQWV 811
+S K A A GH+ A + A+G E SV +G +SL+ Q +
Sbjct: 145 ----FNS-KADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHL 199

Query: 812 EAGIGDS 818
AG D+
Sbjct: 200 AAGTKDT 206


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS03635  OMADHESIN  61     2e-11    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 61.5 bits (148), Expect = 2e-11
Identities = 68/225 (30%), Positives = 109/225 (48%), Gaps = 14/225 (6%)

Query: 957 AMGVDSVARRDSDTAIGTESVADGGYSTALGANAQASYDSSTALGANAMAEDYYSVALGT 1016
A+G++ R A G + A G +S A+GA A+A+ ++ A+GA ++A SVA+G
Sbjct: 46 ALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGP 105

Query: 1017 YALATGTSAISLGGQSYA------------PGTESVALGWQSNASGTRSIGLGSGAVASA 1064
+ A G SA++ G S A VA+G+ S A S+ +G + +A
Sbjct: 106 LSKALGDSAVTYGAASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAA 165

Query: 1065 DN--SVALGAGSIADRANAVSVGAADNARQIANVAAGTEGTDAVNLNQLNAVAETAQTTG 1122
++ S+A+G S DR N+VS+G RQ+ ++AAGT+ TDAVN+ QL E Q
Sbjct: 166 NHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENT 225

Query: 1123 KYFKASGSPDNDAGAYVEGENALAAGEGANAAGTGTTALGAGAQA 1167
A + +A A + + L + + T A +A
Sbjct: 226 NKRSAELLANANAYADNKSSSVLGIANNYTDSKSAETLENARKEA 270



Score = 53.0 bits (126), Expect = 8e-09
Identities = 65/201 (32%), Positives = 93/201 (46%), Gaps = 22/201 (10%)

Query: 72 GRGASAPASKATAIGANSHASATGAVATGADSSASGVNSSAIGRQTNAIGENALAIGYNS 131
G ASA + AIGA + A+ AVA GA S A+GVNS AIG + A+G++A+ G S
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 132 FVRQSG----------ENGVALGANAGVSGANSVALGAGSRTYEDDVVSIGSGNGRGG-- 179
++ G + GVA+G N+ NSVA+G S + SI G+
Sbjct: 122 TAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDR 181

Query: 180 ---------PATRRITNVTAGVNATDAVNVAQL-RDVADVAENTAQFFKASPAEDSVGAY 229
R++T++ AG TDAVNVAQL +++ ENT + A + A
Sbjct: 182 ENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYAD 241

Query: 230 VEGDSALAAGEGANAVGTATT 250
+ S L +A T
Sbjct: 242 NKSSSVLGIANNYTDSKSAET 262



Score = 51.1 bits (121), Expect = 4e-08
Identities = 56/168 (33%), Positives = 82/168 (48%), Gaps = 5/168 (2%)

Query: 1775 SITPAATSTAVGTAAVANHVTGTAIGGSAYAHGPNDTAIGSNARVNADGSTAVGANTQIA 1834
SI AT+ A AAVA A G ++ A GP A+G +A STA I
Sbjct: 72 SIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIG 131

Query: 1835 AVATNA---VAMGEGAQVTAASGTAIGQGARATAQG--AVALGQGSVADRANTVSLGSVG 1889
A A+ + VA+G ++ A + AIG + A ++A+G S DR N+VS+G
Sbjct: 132 ARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHES 191

Query: 1890 GERQVANVAAGTRATDAVNKGQLDNGVAAANSYTDSRYNAMADSFESY 1937
RQ+ ++AAGT+ TDAVN QL + T+ R + + +Y
Sbjct: 192 LNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAY 239



Score = 49.1 bits (116), Expect = 1e-07
Identities = 58/165 (35%), Positives = 81/165 (49%), Gaps = 26/165 (15%)

Query: 371 GTQTRASGISSTAVGGPVVLIPGLGLFVQTQASGEASTALGAGAIASGAYATAVGTLSEA 430
G A GI S A+G +A+ A+ A+GAG+IA+G + A+G LS+A
Sbjct: 62 GLNASAKGIHSIAIGA------------TAEAAKGAAVAVGAGSIATGVNSVAIGPLSKA 109

Query: 431 SGTEATAVGYFAYAPGEG------------ATAVGPESSAIGELSTALGYFS--TARGAN 476
G A G + A +G AVG S A + S A+G+ S A
Sbjct: 110 LGDSAVTYGAASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGY 169

Query: 477 SVALGANSVATRANTVSVGAAGTERQITNVAAATDGTDAVNLDQL 521
S+A+G S R N+VS+G RQ+T++AA T TDAVN+ QL
Sbjct: 170 SIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQL 214



Score = 49.1 bits (116), Expect = 1e-07
Identities = 51/144 (35%), Positives = 76/144 (52%), Gaps = 3/144 (2%)

Query: 834 GANAAAADTGSIAVGTYANAYGPRAISLGGQSRATGDESIALGWEAQAESDQSIALGASS 893
G NA+A SIA+G A A A+++G S ATG S+A+G ++A D ++ GA+S
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 894 QAAAFSTAIGGYARASGAGATAVGNNSSAVDDRATALGSDS--MASGYFSTAVGSSSVAS 951
A AIG A S G AVG NS A + A+G S A+ +S A+G S
Sbjct: 122 TAQKDGVAIGARASTSDTG-VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTD 180

Query: 952 GRGATAMGVDSVARRDSDTAIGTE 975
+ ++G +S+ R+ + A GT+
Sbjct: 181 RENSVSIGHESLNRQLTHLAAGTK 204



Score = 47.6 bits (112), Expect = 4e-07
Identities = 69/250 (27%), Positives = 108/250 (43%), Gaps = 23/250 (9%)

Query: 1322 GLIPARASGTGAAAFGAGAWATADYTTAIGRDSYADSVNATALGQSAAALADNTLALGGG 1381
G + A A G + A GA A A A+G S A VN+ A+G + AL D+ + G
Sbjct: 61 GGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAA 120

Query: 1382 SRANAVGASVIGVNASATGINSTGVGRQVNVIGENAVSVGYNSFVRQSAVNGVALGANAG 1441
S A G + IG AS + + V+VG+NS + ++
Sbjct: 121 STAQKDGVA-IGARASTS---------------DTGVAVGFNSKADAKNSVAIGHSSHVA 164

Query: 1442 ATGADSVALGSGSRTYEADTVSIGSGNGRGGPATRRIVNVSDGQAATDAVNKGQLDAVSA 1501
A S+A+G S+T ++VSIG + R++ +++ G TDAVN QL
Sbjct: 165 ANHGYSIAIGDRSKTDRENSVSIGHES-----LNRQLTHLAAGTKDTDAVNVAQLKKEIE 219

Query: 1502 DVQKTASKFKATGDAVATATGDRSTAAGSGAAA--TGARSVAIASGSRALATGASAMGVD 1559
Q+ +K A A A A D +++ G A T ++S +R A S ++
Sbjct: 220 KTQENTNKRSAELLANANAYADNKSSSVLGIANNYTDSKSAETLENARKEAFAQSKDVLN 279

Query: 1560 SSASGVNSTA 1569
+ + NS A
Sbjct: 280 MAKAHSNSVA 289



Score = 44.9 bits (105), Expect = 3e-06
Identities = 44/132 (33%), Positives = 69/132 (52%), Gaps = 4/132 (3%)

Query: 764 GADSNASGYFSTAVGGTSIANGRGATAIGYESIGNGTASTALGFAGVAWGDGGTAIGTES 823
G +++A G S A+G T+ A A A+G SI G S A+G A GD G S
Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121

Query: 824 LAYGDNSTAVGANAAAADTGSIAVGTYANAYGPRAISLGGQSRATGDE--SIALGWEAQA 881
A D A+GA A+ +DTG +AVG + A ++++G S + SIA+G ++
Sbjct: 122 TAQKD-GVAIGARASTSDTG-VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKT 179

Query: 882 ESDQSIALGASS 893
+ + S+++G S
Sbjct: 180 DRENSVSIGHES 191



Score = 42.6 bits (99), Expect = 1e-05
Identities = 40/148 (27%), Positives = 71/148 (47%)

Query: 1131 PDNDAGAYVEGENALAAGEGANAAGTGTTALGAGAQAVVDNATAVGVGALASGIGAAALG 1190
P V A G A+A G + A+GA A+A A AVG G++A+G+ + A+G
Sbjct: 45 PALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIG 104

Query: 1191 NTAQALGENSSAVGSNAVASDIGATANGAGAQALSTYTTALGSKAVASDNQAIAAGFRST 1250
++ALG+++ G+ + A G + + + SKA A ++ AI
Sbjct: 105 PLSKALGDSAVTYGAASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVA 164

Query: 1251 ASNIGSAAFGGYSESSGRLSSALGYSAV 1278
A++ S A G S++ S ++G+ ++
Sbjct: 165 ANHGYSIAIGDRSKTDRENSVSIGHESL 192



Score = 41.0 bits (95), Expect = 5e-05
Identities = 47/149 (31%), Positives = 72/149 (48%), Gaps = 11/149 (7%)

Query: 552 AAGSNALADSDYSTALGSSSAASAQGATAVGSGANATTDNATAVGFNSTAIAQNTTALGG 611
A G NA A +S A+G+++ A+ A AVG+G+ AT N+ A+G S A+ + G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 612 KSSASGDGSTAVGGASQATASGATALGYESIANGADATALGVG---------SVAFGNTS 662
S+A DG GA +T+ A+G+ S A+ ++ A+G S+A G+ S
Sbjct: 120 ASTAQKDGVAI--GARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRS 177

Query: 663 TAVGGASVAFGADSAAFGANAAAGGTAST 691
SV+ G +S A GT T
Sbjct: 178 KTDRENSVSIGHESLNRQLTHLAAGTKDT 206



Score = 41.0 bits (95), Expect = 5e-05
Identities = 40/114 (35%), Positives = 57/114 (50%), Gaps = 2/114 (1%)

Query: 1516 AVATATGDRSTAAGSGAAATGARSVAIASGSRALATGASAMGVDSSASGVNSTAMGRQTN 1575
A A A + A G+G+ ATG SVAI S+AL A G S+A R +
Sbjct: 77 ATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARAST 136

Query: 1576 SIGENGVALGYNSFVRQSGSNAVALGAKAGASGADSVALGSGSRTYDANTVSVG 1629
S + GVA+G+NS S A+ + A+ S+A+G S+T N+VS+G
Sbjct: 137 S--DTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIG 188



Score = 40.7 bits (94), Expect = 6e-05
Identities = 38/103 (36%), Positives = 59/103 (57%), Gaps = 4/103 (3%)

Query: 1527 AAGSGAAATGARSVAIASGSRALATGASAMGVDSSASGVNSTAMGRQTNSIGENGVALGY 1586
A G A+A G S+AI + + A A A+G S A+GVNS A+G + ++G++ V G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119

Query: 1587 NSFVRQSGSNAVALGAKAGASGADSVALGSGSRTYDANTVSVG 1629
S ++ G VA+GA+A S VA+G S+ N+V++G
Sbjct: 120 ASTAQKDG---VAIGARASTSDT-GVAVGFNSKADAKNSVAIG 158



Score = 37.6 bits (86), Expect = 5e-04
Identities = 46/144 (31%), Positives = 68/144 (47%), Gaps = 14/144 (9%)

Query: 677 AAFGANAAAGGTASTAIGANSSAFGERTVALGGASNASGEDSIALGASSQASALGTTAVG 736
A G NA+A G S AIGA + A VA+G S A+G +S+A+G S+A G
Sbjct: 59 GAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYG 118

Query: 737 SNANA------------SIANATAVGFNSSAGDDYATALGADSN--ASGYFSTAVGGTSI 782
+ + A + AVGFNS A + A+G S+ A+ +S A+G S
Sbjct: 119 AASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSK 178

Query: 783 ANGRGATAIGYESIGNGTASTALG 806
+ + +IG+ES+ A G
Sbjct: 179 TDRENSVSIGHESLNRQLTHLAAG 202



Score = 36.0 bits (82), Expect = 0.001
Identities = 44/187 (23%), Positives = 76/187 (40%)

Query: 557 ALADSDYSTALGSSSAASAQGATAVGSGANATTDNATAVGFNSTAIAQNTTALGGKSSAS 616
A AD ++ S A+G A G N++A ++ A+G + A+
Sbjct: 23 AFADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAA 82

Query: 617 GDGSTAVGGASQATASGATALGYESIANGADATALGVGSVAFGNTSTAVGGASVAFGADS 676
+ AVG S AT + A+G S A G A G S A + AS + +
Sbjct: 83 KGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARASTSDTGVA 142

Query: 677 AAFGANAAAGGTASTAIGANSSAFGERTVALGGASNASGEDSIALGASSQASALGTTAVG 736
F + A A + + ++ +A ++A+G S E+S+++G S L A G
Sbjct: 143 VGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAG 202

Query: 737 SNANASI 743
+ ++
Sbjct: 203 TKDTDAV 209



Score = 35.3 bits (80), Expect = 0.002
Identities = 47/160 (29%), Positives = 77/160 (48%), Gaps = 4/160 (2%)

Query: 1174 AVGVGALASGIGAAALGNTAQALGENSSAVGSNAVASDIGATANGAGAQALSTYTTALGS 1233
A+G+ A G A A G +S A+G+ A A+ A A GAG+ A + A+G
Sbjct: 46 ALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGP 105

Query: 1234 KAVASDNQAIAAGFRSTASNIGSAAFGGYSESSGRLSSALGYSAVASSDYSTAVGAVA-- 1291
+ A + A+ G STA G A G S+ A+G+++ A + S A+G +
Sbjct: 106 LSKALGDSAVTYGAASTAQKDGVAI--GARASTSDTGVAVGFNSKADAKNSVAIGHSSHV 163

Query: 1292 LASGASAVAVGQFSKATGDESVAVGGSAFFGLIPARASGT 1331
A+ ++A+G SK + SV++G + + A+GT
Sbjct: 164 AANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGT 203



Score = 33.3 bits (75), Expect = 0.010
Identities = 49/167 (29%), Positives = 70/167 (41%), Gaps = 10/167 (5%)

Query: 393 GLGLFVQTQASGEASTALGAGAIASGAYATAVGTLSEASGTEATAVGYFAYAPGEGATAV 452
G+ Q S A ALG A G + A G + A+G A A A AV
Sbjct: 30 GIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAV 89

Query: 453 GPESSAIGELSTALGYFSTARGANSVALGANSVATRANTVSVGAAGTERQITNVAAATDG 512
G S A G S A+G S A G ++V GA S A + + V++GA A+ +D
Sbjct: 90 GAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQK-DGVAIGAR---------ASTSDT 139

Query: 513 TDAVNLDQLTAVSDVASTTARSFVASGDGVAIAQGVDSVAAGSNALA 559
AV + + + S VA+ G +IA G S N+++
Sbjct: 140 GVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVS 186



Score = 32.2 bits (72), Expect = 0.027
Identities = 37/113 (32%), Positives = 56/113 (49%), Gaps = 8/113 (7%)

Query: 237 AAGEGANAVGTATTALGTGANAVAENATAVGADALASGQDSAAFGHNAQANGPASVAVGG 296
A G A+A G + A+G A A A AVGA ++A+G +S A GP S A+G
Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAI-------GPLSKALGD 112

Query: 297 AAVNEDGEPLITNGGVPVTTGATSAGVGATAVGASAKADGFAASSFGVGAYAA 349
+AV GV + A+++ G AVG ++KAD + + G ++ A
Sbjct: 113 SAVTYGAASTAQKDGVAIGARASTSDTG-VAVGFNSKADAKNSVAIGHSSHVA 164


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits        Score  E-value  Comments
AXO1947_RS03640  SUBTILISIN  199    8e-61    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 199 bits (508), Expect = 8e-61
Identities = 100/359 (27%), Positives = 147/359 (40%), Gaps = 69/359 (19%)

Query: 156 PQLVPNDPLYAQYQWHLSNRNGGINAPGAWDLSQGAGVVVAVLDTGILPSHPDFAGNILQ 215
Q++ + + + I AP W+ ++G GV VAVLDTG HPD I+
Sbjct: 10 YQVIKQEQQVNEIPRGV----EMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIG 65

Query: 216 GYDFITDAEVSRRPTDARVPGALDYGDWQEADNVCYVGSTAQASTWHGTHVSSTVAEATN 275
G +F D E HGTHV+ T+A AT
Sbjct: 66 GRNFTDDDEGDPEIFKD--------------------------YNGHGTHVAGTIA-ATE 98

Query: 276 NGVGMAGVAPKATILPVRVVGRCG-GYTSDIVDAIVWASGGTVEGVPANTNPAEVINISL 334
N G+ GVAP+A +L ++V+ + G G I+ I +A ++I++SL
Sbjct: 99 NENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYA----------IEQKVDIISMSL 148

Query: 335 GGGGPCDSATQLAINGAVSRGTTVVVAAGNGGGDAAN----HSPAGCNNTITVGATRITG 390
GG A+ AV+ V+ AAGN G P N I+VGA
Sbjct: 149 GGPEDVP-ELHEAVKKAVASQILVMCAAGNEGDGDDRTDELGYPGCYNEVISVGAINFDR 207

Query: 391 GITYYSNYGSKVDLSGPGGGGSVDGNPGGYIWQAGYTGATTPTSGRYAYMGLGGTSMASP 450
+ +SN ++VDL PG I +T G+YA GTSMA+P
Sbjct: 208 HASEFSNSNNEVDLVAPGED----------IL-------STVPGGKYATFS--GTSMATP 248

Query: 451 HVAGVVALVQSAAIGLGKGPLTPAAVKALLKKTSRRFPVTPPASTPIGSGIVDAKAALK 509
HVAG +AL++ A + LT + A L K + +P G+G++ A +
Sbjct: 249 HVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNSPKME---GNGLLYLTAVEE 304


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03650  BCTERIALGSPF  433    e-153    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 433 bits (1114), Expect = e-153
Identities = 134/411 (32%), Positives = 212/411 (51%), Gaps = 12/411 (2%)

Query: 1 MPLYRYKALDAHGEMLDGQMEAASDAEVALRLQEQGHLPV---ETRLATGENDSPSLRML 57
M Y Y+ALDA G+ G EA S + L+E+G +P+ E R ++ S L L
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLS-L 59

Query: 58 LRKKPFDNAALVQFTQQLATLIGAGQPLDRALSILMDLPEDEKSRRVIGDVRDTVRGGAP 117
RK + L T+QLATL+ A PL+ AL + E +++ VR V G
Sbjct: 60 RRKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHS 119

Query: 118 LSSALERQHGLFSKLYINMVRAGEAGGSMQDTLQRLADYLERSRALRGKVINALIYPAIL 177
L+ A++ G F +LY MV AGE G + L RLADY E+ + +R ++ A+IYP +L
Sbjct: 120 LADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVL 179

Query: 178 LAVVGCALLFLLGYVVPQFAQMYESLDVALPWFTQAVLSVGLLVRDW--WLVLIVVPGVL 235
V + LL VVP+ + + + ALP T+ ++ + VR + W++L ++ G +
Sbjct: 180 TVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFM 239

Query: 236 G--LWLDRKRRNAAFRASLDEWLLRQKVVGSLIARLETARLTRTLGTLLRNGVPLLAAIG 293
+ L +++R +F LL ++G + L TAR RTL L + VPLL A+
Sbjct: 240 AFRVMLRQEKRRVSF----HRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMR 295

Query: 294 IARNVMSNLALVEDVANAADDVKNGHGLSMSLARGKRFPRLALQMIQVGEESGALDTMLL 353
I+ +VMSN ++ A D V+ G L +L + FP + MI GE SG LD+ML
Sbjct: 296 ISGDVMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLE 355

Query: 354 KTADTFELETAQAIDRALAALVPFITLVLASVVGLVIISVLVPLYDLTNAI 404
+ AD + E + + AL P + + +A+VV +++++L P+ L +
Sbjct: 356 RAADNQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03655  BCTERIALGSPG  136    3e-44    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 136 bits (343), Expect = 3e-44
Identities = 40/132 (30%), Positives = 60/132 (45%), Gaps = 18/132 (13%)

Query: 15 QAGMSLLEIIIVIVLIGAVLTLVGSRVLGGADRGKANLAKSQIQTLAGKIENFQLDTGKL 74
Q G +LLEI++VIV+IG + +LV ++G ++ A S I L ++ ++LD
Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHY 66

Query: 75 PSKLDDLVTQPGDSSGWLGPYAKPAELN------------DPWGHAIEYRVPGDGQPFDL 122
P+ T G S P P N DPWG+ PG+ +DL
Sbjct: 67 PT------TNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120

Query: 123 MSLGKDGKPGGS 134
+S G DG+ G
Sbjct: 121 LSAGPDGEMGTE 132


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03660  BCTERIALGSPH  30     0.002    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 30.3 bits (68), Expect = 0.002
Identities = 21/74 (28%), Positives = 37/74 (50%), Gaps = 3/74 (4%)

Query: 21 RTRGTSLLEMLLVIALIAMAGVLAAAALNGGIDGMRLRTAGKAIASQLRYTRTQAIATGT 80
R RG +LLEM+L++ L+ ++ + A D +T + A QLR+ + + + TG
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEA-QLRFVQQRGLQTGQ 60

Query: 81 PQRFLIDPQQRRWE 94
+ P RW+
Sbjct: 61 FFGVSVHPD--RWQ 72


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03665  BCTERIALGSPG  34     5e-05    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 34.1 bits (78), Expect = 5e-05
Identities = 14/45 (31%), Positives = 26/45 (57%), Gaps = 4/45 (8%)

Query: 1 MKRQRGYTLIEVIVAFALLALALSL----LLGSLSGAARQVRAAD 41
+QRG+TL+E++V ++ + SL L+G+ A +Q +D
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSD 48


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS03690  PERTACTIN  34     6e-04    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 33.9 bits (77), Expect = 6e-04
Identities = 19/52 (36%), Positives = 23/52 (44%)

Query: 192 AVPPPQQQPQPQPQPQPQSVPPAQQPGGQAPPTVQPQRSDGAQEAPRPSDDQ 243
A PP +P PQP PQP PP Q P QP + AP+P +
Sbjct: 565 AKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGR 616



Score = 31.2 bits (70), Expect = 0.004
Identities = 21/58 (36%), Positives = 22/58 (37%)

Query: 160 NGHGGQPPTANAAARGAGTATAPVPSPDAAAVAVPPPQQQPQPQPQPQPQSVPPAQQP 217
NG+G A A P P P P P Q PQP PQ Q PA QP
Sbjct: 555 NGNGQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQP 612



Score = 30.1 bits (67), Expect = 0.011
Identities = 18/48 (37%), Positives = 20/48 (41%)

Query: 200 PQPQPQPQPQSVPPAQQPGGQAPPTVQPQRSDGAQEAPRPSDDQMRAI 247
P+P PQP PQ P QP P PQ EAP P R +
Sbjct: 571 PKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGREL 618



Score = 29.7 bits (66), Expect = 0.014
Identities = 23/62 (37%), Positives = 26/62 (41%), Gaps = 1/62 (1%)

Query: 175 GAGTATAPVPSPDAAAVAVPPPQQQPQPQPQPQPQSVPPAQQPGGQAPPTVQPQRSDGAQ 234
GA AP P+P P P Q PQP PQP PP +QP AP + A
Sbjct: 564 GAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQ-PPQRQPEAPAPQPPAGRELSAAA 622

Query: 235 EA 236
A
Sbjct: 623 NA 624


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS03695  BCTERIALGSPD  260    1e-78    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 260 bits (665), Expect = 1e-78
Identities = 124/535 (23%), Positives = 230/535 (42%), Gaps = 57/535 (10%)

Query: 254 GMSVGVFPIQSGKAEKVSADLEKVFGEQSKTPSAGMFRFMPLENANAVLVITPQPRYLDQ 313
+ V P+ + A DL + + + +AG+ + E +N VL++T + + +
Sbjct: 126 EVVTRVVPLTNVAA----RDLAPLLRQLN--DNAGVGSVVHYEPSN-VLLMTGRAAVIKR 178

Query: 314 IQQWLDRIDSAGGGVRLFSYELKYIKAKDLADRLAEVFGGHSSGG-------------DS 360
+ ++R+D+AG + + L + A D+ + E+ S +
Sbjct: 179 LLTIVERVDNAGDRS-VVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERT 237

Query: 361 NASLVPGSE------TSVLGGALGNRDSNMGGSSGMTGGSIGDSGDGSSSGSSFGGSSGS 414
NA LV G +++ +R G++ + + D + +
Sbjct: 238 NAVLVSGEPNSRQRIIAMIKQL--DRQQATQGNTKVIYLKYAKASDLVEVLTGISSTM-- 293

Query: 415 SGGLGNGSLQLSPRSNGNGAVTLDVAGDKVGVSAVAETNTLLVRSTPQAWSSIRDVIEKL 474
S + LD + + A +TN L+V + P + + VI +L
Sbjct: 294 ----------QSEKQAAKPVAALD---KNIIIKAHGQTNALIVTAAPDVMNDLERVIAQL 340

Query: 475 DVMPMQVHIEAQVAEVNLTGQLSYGVNWYFENAVNAATDSNS--NGPGFKGGAGLPSAAG 532
D+ QV +EA +AEV L+ G+ W +NA ++ G G
Sbjct: 341 DIRRPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKD-G 399

Query: 533 RNIWGDIAGKVTGDGVAWSFLGKNAAAIITALDKVTDVRLLQTPSVFVRNNAEATLNVGS 592
+ + +G+A F N A ++TAL T +L TPS+ +N EAT NVG
Sbjct: 400 TVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQ 459

Query: 593 RIAINSTSINTGIGVDSSYSSVQYIDTGVILKVRPRVTKDGMVFLDIVQEVSSPGDRPAA 652
+ + + S T D+ +++V+ G+ LKV+P++ + V L+I QEVSS D
Sbjct: 460 EVPVLTGSQTTS--GDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVAD---- 513

Query: 653 CTSATATVNAAACNVDINTRRVKTEAAVQSGDTIMLAGLIDDTTSDGSNGIPFLSKLPVV 712
A+ ++ NTR V V SG+T+++ GL+D + SD ++ +P L +PV+
Sbjct: 514 ----AASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVI 569

Query: 713 GALFGSKSRNSARREVIVLITPSIVHNPQEARNLTDEYGQKFKAMEPLKPSQKPQ 767
GALF S S+ ++R +++ I P+++ + E R + F + + ++
Sbjct: 570 GALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENN 624



Score = 191 bits (487), Expect = 5e-54
Identities = 73/292 (25%), Positives = 125/292 (42%), Gaps = 21/292 (7%)

Query: 89 ASSGSATFNFEGESVQAVVKAILGDMLGQNYVIAPGVQGTVTLATPNPVSPAQALNLLEM 148
A++ + +F+G +Q + + + L + +I P V+GT+T+ + + ++ Q
Sbjct: 25 AAAEEFSASFKGTDIQEFINTVSKN-LNKTVIIDPSVRGTITVRSYDMLNEEQYYQFFLS 83

Query: 149 VLG-WNNARMVFSGGRYNIVPA-DQALAGTVAPSTASPSAARGFEVRVVPLKFISASEMK 206
VL + A + + G +V + D A S A+P RVVPL ++A ++
Sbjct: 84 VLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLA 143

Query: 207 KVLEPYARPNAIVGTD---PARNVITLGGTRAELENYLRTVQIFDVDWLSGMSVGVFPIQ 263
+L NA VG+ NV+ + G A ++ L V+ VD SV P+
Sbjct: 144 PLLRQL-NDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVE--RVDNAGDRSVVTVPLS 200

Query: 264 SGKAEKVSADLEKVFGEQSKT--PSAGMFRFMPLENANAVLVI---TPQPRYLDQIQQWL 318
A V + ++ + SK+ P + + + E NAVLV + R + I+Q L
Sbjct: 201 WASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIKQ-L 259

Query: 319 DRIDSAGGGVRLFSYELKYIKAKDLADRLAEVFGGHSSGGDSNASLVPGSET 370
DR + G ++ LKY KA DL + L + SS S
Sbjct: 260 DRQQATQGNTKVIY--LKYAKASDLVEVLTGI----SSTMQSEKQAAKPVAA 305



Score = 39.9 bits (93), Expect = 3e-05
Identities = 35/236 (14%), Positives = 81/236 (34%), Gaps = 21/236 (8%)

Query: 187 ARGFEVRVVPLKFISASEMKKVLEPYAR---PNAIVGT-------DPARNVITLGGTRAE 236
A V VPL + SA+++ K++ + +A+ G+ D N + + G
Sbjct: 189 AGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNS 248

Query: 237 LENYLRTVQIFDVDWLSGMSVGVFPIQSGKAEKVSADLEKVFGEQSKTPSAGM------- 289
+ + ++ D + + V ++ KA + L + A
Sbjct: 249 RQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDK 308

Query: 290 -FRFMPLENANAVLVITPQPRYLDQIQQWLDRIDSAGGGVRLFS--YELKYIKAKDLADR 346
NA L++T P ++ +++ + ++D V + + E++ +L +
Sbjct: 309 NIIIKAHGQTNA-LIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQ 367

Query: 347 LAEVFGGHSSGGDSNASLVPGSETSVLGGALGNRDSNMGGSSGMTGGSIGDSGDGS 402
A G + +S + + G S++ + G G+
Sbjct: 368 WANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGN 423


58  AXO1947_RS03820  AXO1947_RS03855  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias   Product
AXO1947_RS03820  0       12       1.233196  DNA-binding response regulator
AXO1947_RS03825  1       13       1.305225  histidine kinase
AXO1947_RS03830  1       13       2.102519  hypothetical protein
AXO1947_RS03835  1       16       2.482271  rhizopine catabolism protein mocA
AXO1947_RS03840  0       17       2.105792  LysR family transcriptional regulator
AXO1947_RS03845  -1      16       1.089966  molybdenum ABC transporter substrate-binding
AXO1947_RS03850  0       14       0.701015  MFS transporter
AXO1947_RS03855  -1      12       0.462195  hybrid sensor histidine kinase/response
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS03920  HTHFIS  78     1e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 77.6 bits (191), Expect = 1e-18
Identities = 31/123 (25%), Positives = 60/123 (48%), Gaps = 1/123 (0%)

Query: 2 RLLLVEDNADLADAIVRRMRRSGHAVDWQADGLAAASVLRYQSFDLVVLDIGLPKLDGLR 61
+L+ +D+A + + + + R+G+ V ++ + DLVV D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLAGMRERGDTTPVLMLTARDGIEDRVQALDVGADDYLGKPFDFREF-EARCRVLLRRNR 120
+L +++ PVL+++A++ ++A + GA DYL KPFD E R L R
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 GQA 123
+
Sbjct: 125 RPS 127


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS20925  HTHFIS  62     3e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.8 bits (150), Expect = 3e-12
Identities = 38/110 (34%), Positives = 56/110 (50%), Gaps = 7/110 (6%)

Query: 364 MADLTILVADDHPLFRAAVLHVLQQTLPQAN--VVEASSAATLSAMLRSHPQAELVLLDL 421
M TILVADD R VL Q L +A V S+AATL + + +LV+ D+
Sbjct: 1 MTGATILVADDDAAIRT----VLNQALSRAGYDVRITSNAATLWRWIAAGDG-DLVVTDV 55

Query: 422 AMPGARGFSALLHVRGEHPDIPVVVISSNDHPRVIRRAQQFGAAGFIPKS 471
MP F L ++ PD+PV+V+S+ + +A + GA ++PK
Sbjct: 56 VMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP 105


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS03955  TCRTETA  34     8e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.4 bits (79), Expect = 8e-04
Identities = 71/369 (19%), Positives = 127/369 (34%), Gaps = 62/369 (16%)

Query: 76 LMRPLGAVILGAYIDDVGRRKGLIVTL-------AIMASGTVLIVLVPGYASIGLWAPAL 128
LM+ A +LGA D GRR L+V+L AIMA+ L VL
Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVL-------------- 99

Query: 129 VLLGRLLQGFSAGAEMGGVSVYLAEMATPGRRGFYASWQSASQQLAIVAAAAIGFALNQL 188
+GR++ G + GA Y+A++ R + + SA +VA +G
Sbjct: 100 -YIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGL---- 153

Query: 189 MAPQDLAQWGWRIPFGICCVIIPFIFLLRRSLEETAEFAQRRQPVTMKQVMRGLANNASI 248
+ + PF + FL L + +RR +++ +
Sbjct: 154 -----MGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERR---PLRREALNPLASFRW 205

Query: 249 VIAGGLMVALTTTAFYLI-------TVYAPTFGKTVLKLSTGDALIVTLLVGVSN-FMWL 300
++ AL F + ++ FG+ I G+ +
Sbjct: 206 ARGMTVVAALMAVFFIMQLVGQVPAALWV-IFGEDRFHWDATTIGISLAAFGILHSLAQA 264

Query: 301 PIGGTLSDRFGRKPLLVTMSVMCILTAYPVLAFLAAAPSFAHMLQALLWLSFIYGLYNGA 360
I G ++ R G + L + ++ T Y +LAF + + G
Sbjct: 265 MITGPVAARLGERRAL-MLGMIADGTGYILLAFATRGWMA--------FPIMVLLASGGI 315

Query: 361 MIPALTEQMPAHV------RVAGFSLAYSLATAVFGGFTPVISTWLIHVTGDKAAPGYWL 414
+PAL + V ++ G A + T++ G P++ T + + W+
Sbjct: 316 GMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVG---PLLFTAIYAASITTWNGWAWI 372

Query: 415 IFASVCALA 423
A++ L
Sbjct: 373 AGAALYLLC 381



Score = 30.6 bits (69), Expect = 0.015
Identities = 13/27 (48%), Positives = 18/27 (66%)

Query: 291 LVGVSNFMWLPIGGTLSDRFGRKPLLV 317
L + F P+ G LSDRFGR+P+L+
Sbjct: 51 LYALMQFACAPVLGALSDRFGRRPVLL 77


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS03960  HTHFIS  56     4e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 56.0 bits (135), Expect = 4e-10
Identities = 22/120 (18%), Positives = 42/120 (35%), Gaps = 3/120 (2%)

Query: 762 RVWCVDDEPLVCEATRTLLERWECRVDFAGGPDEALSAANAEEVPELLLLDVRMGAHYGP 821
+ DD+ + L R V A + +L++ DV M
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDENAF 63

Query: 822 MLLPQLVERWRREPRLILVTAEPDPALREHALDLG-WGLLSKPVRPPALRALVTQMLMRR 880
LLP++ + P ++++A+ A + G + L KP L ++ + L
Sbjct: 64 DLLPRIKKARPDLPV-LVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


59  AXO1947_RS04340  AXO1947_RS04390  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias   Product
AXO1947_RS04340  -1      14       0.658830  IS5/IS1182 family transposase
AXO1947_RS04345  -1      14       1.510931  peptidase
AXO1947_RS04350  -2      14       0.855531  pilus assembly protein PilM
AXO1947_RS04355  -3      14       1.302809  fimbrial protein
AXO1947_RS04365  -1      12       1.238366  fimbrial protein
AXO1947_RS04375  -1      13       0.895868  fimbrial protein
AXO1947_RS04385  0       13       1.296403  fimbrial protein
AXO1947_RS04390  -1      13       0.463474  ATPase AAA
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS04460  PF05043  33     0.002    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 33.4 bits (76), Expect = 0.002
Identities = 19/85 (22%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCLPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F C+ D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS04470  SHAPEPROTEIN  34     9e-04    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 33.6 bits (77), Expect = 9e-04
Identities = 52/210 (24%), Positives = 82/210 (39%), Gaps = 45/210 (21%)

Query: 153 RQSALELGGLTAKVMDVEAFAVENAFALVASELPVAADAVVALVDIGATMTTLSVLRSGR 212
R+SA G +++ E A A + + LPV+ +VDIG T ++V+
Sbjct: 127 RESAQGAGAREVFLIE-EPMA-----AAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNG 180

Query: 213 SLYSREQVFGGKQLTDEVM----RRYGL-----TYEEA----GLAKRQG----------- 248
+YS GG + + ++ R YG T E G A
Sbjct: 181 VVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRN 240

Query: 249 ---GLPESYEV---EVLEPFKE---ATVQQISRLLQFF---YAGSEFNRVDCIVLAGGCA 296
G+P + + E+LE +E V + L+ A R +VL GG A
Sbjct: 241 LAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISER--GMVLTGGGA 298

Query: 297 ALACLPEMVEEQLGVTTVVA-NPLAQMTLG 325
L L ++ E+ G+ VVA +PL + G
Sbjct: 299 LLRNLDRLLMEETGIPVVVAEDPLTCVARG 328


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits         Score  E-value  Comments
AXO1947_RS04475  FLGHOOKFLIK  29     0.019    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 29.0 bits (64), Expect = 0.019
Identities = 21/98 (21%), Positives = 30/98 (30%), Gaps = 4/98 (4%)

Query: 166 AGQPGASSMDTKTLPYVFTLKVKLANPNEADKNGTAPGAVDPAAPGTAAPG---AAPAGA 222
P K + +L D GT + P + + P+
Sbjct: 148 TDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAPGTPAQPLTPLVAEAQSKAEVISTPSPV 207

Query: 223 TPAA-PAAAPAPATPPAAAPAPTQAAPAPANRPQQGAS 259
T AA P P P AP +AP ++ QQ S
Sbjct: 208 TAAASPLITPHQTQPLPTVAAPVLSAPLGSHEWQQSLS 245


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS04490  BCTERIALGSPD  222    6e-66    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 222 bits (568), Expect = 6e-66
Identities = 99/436 (22%), Positives = 169/436 (38%), Gaps = 49/436 (11%)

Query: 230 VPWDQALDIVLRAKGLDKRRDGGVVWVAPQPELAKFEQDKEDARIAIENREDLITDYVQ- 288
+ W A D+V L+K + + + E+ N I ++
Sbjct: 199 LSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIKQ 258

Query: 289 ---------------INYHNAAVIFKALTEAKGIGGGGGSGAGQGGQGGAGQQDNGFLSP 333
+ Y A+ + + LT G + + D +
Sbjct: 259 LDRQQATQGNTKVIYLKYAKASDLVEVLT-----GISSTMQSEKQAAKPVAALDKNII-- 311

Query: 334 RGRLVADERTNTLMISDIPKKVAQMRELISHIDRPVDQVLIESRIVIATDTFARDLGARF 393
+ A +TN L+++ P + + +I+ +D QVL+E+ I D +LG ++
Sbjct: 312 ---IKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQW 368

Query: 394 GITGATGRGILSGSLDSNTNYLNTSAKRASEIANGGTSTTLPAHLFPSGLNVNLGAGGGF 453
A + L +T A +G S++L + L G GF
Sbjct: 369 ANKNAGMTQFTNSGLPISTAI----AGANQYNKDGTVSSSLASALSSFN-----GIAAGF 419

Query: 454 TTNTPGGLAYTLLGSNFNLDIELSAMQQEGRGEVVSNPRIVTANQREGVIKQGREIGYVT 513
N + L+A+ + ++++ P IVT + E G+E+ +T
Sbjct: 420 --------------YQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLT 465

Query: 514 ISGGGAGGGSQANVQFKEVLLELKVTPTITNDNRVFLNMNVKKDEVARLIDLPLYGTVPE 573
S +G V+ K V ++LKV P I + V L + + VA
Sbjct: 466 GSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGAT 525

Query: 574 INRREINTAVLVGDGETVVIGGVYEFTDRESVAKVPFLGDIPFLGNLFKKRGRSKEKAEL 633
N R +N AVLVG GETVV+GG+ + + ++ KVP LGDIP +G LF+ + K L
Sbjct: 526 FNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNL 585

Query: 634 LVFVTPKVLRVASATR 649
++F+ P V+R R
Sbjct: 586 MLFIRPTVIRDRDEYR 601



Score = 50.7 bits (121), Expect = 1e-08
Identities = 31/209 (14%), Positives = 75/209 (35%), Gaps = 30/209 (14%)

Query: 175 AAAQIAARGYSGRPVTFNFQDVPVRTVLQLIAEESNLNIVASDTVQGNVTLR----LMNV 230
A + R + + +F+ ++ + +++ N ++ +V+G +T+R L
Sbjct: 16 IFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEE 75

Query: 231 PWDQALDIVLRAKGLDK-RRDGGVVWVAPQPELAKFEQDKEDARIAIENREDLITDYVQI 289
+ Q VL G + GV+ V + AK + A ++++T V +
Sbjct: 76 QYYQFFLSVLDVYGFAVINMNNGVLKVVRSKD-AKTAAVPVASDAAPGIGDEVVTRVVPL 134

Query: 290 NYHNAAVIFKALTEAKGIGGGGGSGAGQGGQGGAGQQDNGFLSPRGRLVADERTNTLMIS 349
A + L + G G +V E +N L+++
Sbjct: 135 TNVAARDLAPLLRQLNDNAGV------------------------GSVVHYEPSNVLLMT 170

Query: 350 DIPKKVAQMRELISHIDRPVDQVLIESRI 378
+ ++ ++ +D D+ ++ +
Sbjct: 171 GRAAVIKRLLTIVERVDNAGDRSVVTVPL 199


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS04495  HTHFIS  34     7e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.0 bits (78), Expect = 7e-04
Identities = 39/158 (24%), Positives = 59/158 (37%), Gaps = 24/158 (15%)

Query: 35 IVGQS----ALVERLLIALLADGHLLVEGAPGLAKTT---AIRALASRLEADFARVQ--- 84
+VG+S + L + D L++ G G K A+ R F +
Sbjct: 139 LVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAA 198

Query: 85 FTPDLLPADLTG------TEIWRPQDSRFEFMPGPIFHPILLADEINRAPAKVQSALLEA 138
DL+ ++L G T RFE G L DEI P Q+ LL
Sbjct: 199 IPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGT----LFLDEIGDMPMDAQTRLLRV 254

Query: 139 MGERQVT-VGRHTYALPQLFLVMATQNPIEQ---EGTF 172
+ + + T VG T + +V AT ++Q +G F
Sbjct: 255 LQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLF 292


60  AXO1947_RS04490  AXO1947_RS04525  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS04490  -1      17       0.923537   short chain dehydrogenase
AXO1947_RS04495  1       18       2.458326   hypothetical protein
AXO1947_RS04500  1       19       2.060994   DNA-binding response regulator
AXO1947_RS04505  0       16       1.110869   two-component sensor histidine kinase
AXO1947_RS04510  -2      12       0.787395   transcriptional regulator
AXO1947_RS04515  -3      11       0.526647   energy transducer TonB
AXO1947_RS04520  -1      11       -0.159947  hypothetical protein
AXO1947_RS04525  0       11       -1.322513  molybdenum ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS04585  DHBDHDRGNASE  61     4e-13    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 61.2 bits (148), Expect = 4e-13
Identities = 43/188 (22%), Positives = 72/188 (38%), Gaps = 2/188 (1%)

Query: 3 LHGKCIILTGATGGIGSALCAGLVEAGATVVAVGRTEETLRRLSAAHAPGGVVP--VVAD 60
+ GK +TGA GIG A+ L GA + AV E L ++ ++ AD
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 61 LASESGRAVLLARAHEMRPAPSVLVLAHAQSHFGLLQDQDPAALTALVHLNLTVPMLLVQ 120
+ + + AR +LV GL+ A +N T +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 121 ALLPAFVRQPDAAMVAVGSTFGSIGFAGFAGYSASKFGLRGLFEALAREHAGTSVRFQYL 180
++ + + ++V VGS + A Y++SK + L E A ++R +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 181 SPRATATA 188
SP +T T
Sbjct: 186 SPGSTETD 193


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS04595  HTHFIS  83     2e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.0 bits (205), Expect = 2e-20
Identities = 36/143 (25%), Positives = 58/143 (40%)

Query: 2 RILLVEDDLSLGEGIRTALRRAAYAVDWVHDGVSALMALQEQTMDLVILDLGLPRMDGIE 61
IL+ +DD ++ + AL RA Y V + + + DLV+ D+ +P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VIRTARARAVDTPILVLSARERAADRALGLDVGADDYLGKPFDTNELLARTRALLRRSAG 121
++ + D P+LV+SA+ + GA DYL KPFD EL+ L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 RAQPALQAGALRLDPAGMSVRWH 144
R + G S
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQ 147


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS04600  PF06580  35     5e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.8 bits (80), Expect = 5e-04
Identities = 24/104 (23%), Positives = 45/104 (43%), Gaps = 21/104 (20%)

Query: 348 SLLLRNLLENAVRY----TPAGGRIRV-GTHNAPQPTLVVEDSGPGIPEAARARVFHRFH 402
+L++ L+EN +++ P GG+I + GT + TL VE++G + +
Sbjct: 257 PMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK-------- 308

Query: 403 RELGTGVEGSGLGLSIVHD-IAVAHAARVQLDESPALGGLRVRV 445
E +G GL V + + + + Q+ S G + V
Sbjct: 309 -------ESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS04610  PF03544  67     8e-15    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 66.9 bits (163), Expect = 8e-15
Identities = 25/79 (31%), Positives = 45/79 (56%), Gaps = 5/79 (6%)

Query: 309 PPKYPADAIAAGLAGFVELQVAVSPNGTPDHIAIVRSTPAGVFDRAVLDAARHWRFAPAV 368
P+YPA A A + G V+++ V+P+G D++ I+ + PA +F+R V +A R WR+ P
Sbjct: 164 QPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGK 223

Query: 369 VDGEAVASDVRVPVKFELD 387
+ V + F+++
Sbjct: 224 PGSG-----IVVNILFKIN 237


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS04620  PF05272  28     0.023    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.023
Identities = 10/22 (45%), Positives = 14/22 (63%)

Query: 25 VVALVGPSGAGKTTVLNAIAGL 46
V L G G GK+T++N + GL
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGL 619


61  AXO1947_RS04675  AXO1947_RS21090  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS04675  -1      15       2.270462   adenylyl-sulfate kinase
AXO1947_RS04680  0       12       2.888498   MexH family multidrug efflux RND transporter
AXO1947_RS04685  0       13       2.959137   multidrug transporter AcrB
AXO1947_RS04690  -1      13       2.501545   aminopeptidase N
AXO1947_RS04695  1       10       2.547290   hypothetical protein
AXO1947_RS04700  0       12       0.824811   RNA methyltransferase
AXO1947_RS04705  3       15       0.583724   IS630 family transposase
AXO1947_RS21090  2       12       0.610872   large-conductance mechanosensitive channel
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS04750  TCRTETOQM  61     1e-11    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 60.6 bits (147), Expect = 1e-11
Identities = 45/156 (28%), Positives = 74/156 (47%), Gaps = 19/156 (12%)

Query: 61 VDDGKSTLIGRLLYDSKRLFDDQLAALESDSRRHGTQGESIDYALLMDGLAAEREQGITI 120
VD GK+TL LLY+S + +L +++ + R D ER++GITI
Sbjct: 12 VDAGKTTLTESLLYNSGAI--TELGSVDKGTTR-------------TDNTLLERQRGITI 56

Query: 121 DVAYRYFDTDRRKFIVADCPGHEQYTRNMATGASTADVAVVLVDARKGLLAQTRRHSYIV 180
F + K + D PGH + + S D A++L+ A+ G+ AQTR + +
Sbjct: 57 QTGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116

Query: 181 SLLGIGHVVLAVNKMDL--VDYDAQVFADIAQRYGA 214
+GI + +NK+D +D V+ DI ++ A
Sbjct: 117 RKMGIPTIFF-INKIDQNGIDLS-TVYQDIKEKLSA 150


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS04755  RTXTOXIND  39     2e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 39.4 bits (92), Expect = 2e-05
Identities = 34/181 (18%), Positives = 70/181 (38%), Gaps = 24/181 (13%)

Query: 99 QAALTAAQATFEETDQLYRRQLSLVGQQLVAKSTVDTQRALRDAAQARVQQMRAEITDRE 158
++ + +A+ ++ QL++ ++ +Q + T A+ +Q + I
Sbjct: 279 ESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLEL----AKNEERQQASVI---- 330

Query: 159 VRAPFSG-VLGMRQISPGALITS-STVIATLDDVARMYVDFKVPESQFGLVQVGNAVSGS 216
RAP S V ++ + G ++T+ T++ + + + V V G + VG
Sbjct: 331 -RAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIK 389

Query: 217 AAAYPGAQF---EGEVVTI--DSRIDETTRSVT-VRADFP-------NDDRRLRPGMLLD 263
A+P ++ G+V I D+ D+ V V N + L GM +
Sbjct: 390 VEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVT 449

Query: 264 V 264

Sbjct: 450 A 450



Score = 35.2 bits (81), Expect = 4e-04
Identities = 14/88 (15%), Positives = 34/88 (38%), Gaps = 6/88 (6%)

Query: 72 VVEQVYFDSGDEVKAGQLLLRLRGNSQQAALTAAQATF------EETDQLYRRQLSLVGQ 125
+V+++ G+ V+ G +LL+L +A Q++ + Q+ R + L
Sbjct: 106 IVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKL 165

Query: 126 QLVAKSTVDTQRALRDAAQARVQQMRAE 153
+ + + + R+ + E
Sbjct: 166 PELKLPDEPYFQNVSEEEVLRLTSLIKE 193


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS04760  ACRIFLAVINRP  841    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 841 bits (2173), Expect = 0.0
Identities = 337/1033 (32%), Positives = 544/1033 (52%), Gaps = 27/1033 (2%)

Query: 3 LSDLSITRPVMAVVMSLLLIVLGVTSFTRLTLRELPAIDPPIVSVNVEYTGASAAVVESR 62
+++ I RP+ A V++++L++ G + +L + + P I PP VSV+ Y GA A V+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 ITQVLEDGLAGIEGISTIEAHS-RNGSSDISIEFVQSRDVEAAANDVRDAVSRVSDRMPD 121
+TQV+E + GI+ + + + S GS I++ F D + A V++ + + +P
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 122 QARAPEISKVEADADPILWLNMSSSTMDTLQ--LSDYAERYVVDRFSSLDGVAQVRIGGR 179
+ + IS ++ + ++ S T Q +SDY V D S L+GV V++ G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 180 QRYAMRIWLDRDQLAARELTVADVEAALQNENVEVPAGSIESA------QRDFTLRVERS 233
Q YAMRIWLD D L +LT DV L+ +N ++ AG + Q + ++ +
Sbjct: 181 Q-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 234 YLKPEDFAKLPLNKGEGGYVVRLGDVARVELSSAERRAYFQSNGVPNVGLGIVRNSTANA 293
+ PE+F K+ L G VVRL DVARVEL + NG P GLGI + ANA
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 294 LDVAREARARAIEVQKSLPQGTNIFVAFDTTTFIDAAVERVYHTLVEAVVLVLVVIWVFL 353
LD A+ +A+ E+Q PQG + +DTT F+ ++ V TL EA++LV +V+++FL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 354 GSARAASIPAVTVPVCLIASFIALYAFDFSINLLTLLALVLCIGLVVDDAIVVVENVQRR 413
+ RA IP + VPV L+ +F L AF +SIN LT+ +VL IGL+VDDAIVVVENV+R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 414 I-DLGEPPLVAAKRGTGQVAFAVIATTAVLVAVFLPVGFLQGDTGRLFRELAVALAAAVA 472
+ + PP A ++ Q+ A++ VL AVF+P+ F G TG ++R+ ++ + +A+A
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 473 ISAFVALTLTPMMSSKLLR---AHGQAKPNRFHQWFDGRMQAVSGAYGRSLERHVHRTWV 529
+S VAL LTP + + LL+ A F WF+ Y S+ + + T
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 530 FALLMLLALGASAWLMGRIPSEVAPAEDRGNFQIMIDGPEGAGFDYTVGQMHQVEDILRP 589
+ L+ L + L R+PS P ED+G F MI P GA + T + QV D
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDY--- 596

Query: 590 FVGPDKPIVRANPRVPGSFGSSEEMHTGRVSVFLQDWKKRTRPTTEVADEVQQKLNALSG 649
++ +K V + V G S + + G V L+ W++R + + L
Sbjct: 597 YLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGK 656

Query: 650 VR-ARTQ------VSGGLVRSRGQPFQLVLGGPDYAEIAQWRDRIMQRMEANPG-LVGPD 701
+R + + + G + + Q R++++ +P LV
Sbjct: 657 IRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 702 SDYKETRPQMRVNIDRVRAADLGVPVTAIGGALEALMGSRRVTTFVDNGEEYDVMLQAGR 761
+ E Q ++ +D+ +A LGV ++ I + +G V F+D G + +QA
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADA 776

Query: 762 EGRMSPEDLTAIRVRSNRGELIPLSNLVTLSEVAEAGTLNRFNRLRAITIMAGLAPGYPL 821
+ RM PED+ + VRS GE++P S T V + L R+N L ++ I APG
Sbjct: 777 KFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSS 836

Query: 822 GDAIAWAQQAAQEELPEYAQVDWKGESREYQQSGSAVLFTFGMALLVVYLLLAAQFESFA 881
GDA+A + A +LP DW G S + + SG+ ++ +VV+L LAA +ES++
Sbjct: 837 GDAMALMENLA-SKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 882 HPLVIMLTVPLAVLGALVGLWLTGGTLNLFSQIGIVMLVGLAAKNGILIVEFANQLRD-E 940
P+ +ML VPL ++G L+ L +++ +G++ +GL+AKN ILIVEFA L + E
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 941 GRSVHAAIVASASVRLRPILMTSIATVVGAIPLVVAGGPGSASRATIGVVVIFGVSLSTV 1000
G+ V A + + +RLRPILMTS+A ++G +PL ++ G GS ++ +G+ V+ G+ +T+
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 1001 LSLYVVPAFYSLI 1013
L+++ VP F+ +I
Sbjct: 1016 LAIFFVPVFFVVI 1028


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits         Score  E-value  Comments
AXO1947_RS04805  MECHCHANNEL  142    6e-47    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 142 bits (360), Expect = 6e-47
Identities = 74/136 (54%), Positives = 97/136 (71%), Gaps = 7/136 (5%)

Query: 1 MGMVSEFKQFAIRGNVIDLAVGVVIGAAFGKIVTALVEKIIMPPIGWAIGNVDFSRLAWV 60
M ++ EF++FA+RGNV+DLAVGV+IGAAFGKIV++LV IIMPP+G IG +DF + A
Sbjct: 1 MSIIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVT 60

Query: 61 LKPAGVDATGKDIPAVAIGYGDFINTVVQFVIVAFAIFLLVKLINRVTNRK--PDAPKGP 118
L+ A DIPAV + YG FI V F+IVAFAIF+ +KLIN++ +K P A P
Sbjct: 61 LRDA-----QGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRKKEEPAAAPAP 115

Query: 119 SEEVLLLREIRDSLKN 134
++E +LL EIRD LK
Sbjct: 116 TKEEVLLTEIRDLLKE 131


62  AXO1947_RS04785  AXO1947_RS04840  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS04785  -3      19       -1.945326  DNA-binding response regulator
AXO1947_RS04790  -1      15       -3.066576  histidine kinase
AXO1947_RS04795  0       14       -3.092711  dephospho-CoA kinase
AXO1947_RS04800  0       18       -2.680987  prepilin peptidase
AXO1947_RS04805  -1      9        -2.150708  type II secretory pathway protein
AXO1947_RS04810  -2      10       -1.212271  pilin
AXO1947_RS21120  -1      11       -1.111788  hypothetical protein
AXO1947_RS04820  0       13       -0.139821  hypothetical protein
AXO1947_RS04825  -1      15       0.669667   type IV-A pilus assembly ATPase PilB
AXO1947_RS04830  -1      12       0.304691   cell filamentation protein Fic
AXO1947_RS04835  0       11       -0.314893  sigma-54-dependent Fis family transcriptional
AXO1947_RS04840  0       13       -0.698391  ATPase
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS04930  HTHFIS  88     2e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.3 bits (219), Expect = 2e-22
Identities = 32/119 (26%), Positives = 60/119 (50%), Gaps = 2/119 (1%)

Query: 2 RILVIEDNSDIAANLGDYLEDRGHTVDFAADGVTGLHLAVVHEFDAIVLDLNLPGMDGIE 61
ILV +D++ I L L G+ V ++ T + D +V D+ +P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VCRKLRNEARKQTPVLMLTARDSLDNKLAGFDSGADDYLIKPFALQE-VEVRLNALSRR 119
+ +++ +AR PVL+++A+++ + + GA DYL KPF L E + + AL+
Sbjct: 65 LLPRIK-KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS04945  PREPILNPTASE  329    e-116    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 329 bits (845), Expect = e-116
Identities = 130/282 (46%), Positives = 175/282 (62%), Gaps = 1/282 (0%)

Query: 1 MAFLDQHPGLGFPAAAGLGLLIGSFLNVVILRLPRRMEWQWRRDAREILELPDI-YEPPP 59
+ P L F L+IGSFLNVVI RLP +E +W+ + R D + PP
Sbjct: 5 LELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEPP 64

Query: 60 PGIVVEPSHDPVTGDQLKWWENIPLFSWLMLRGKSRYSGKPISIQYPLVELLTSILCVAS 119
++V S P + ENIPL SWL LRG+ R PIS +YPLVELLT++L VA
Sbjct: 65 YNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVAV 124

Query: 120 VWRFGFGWQGFGAIVLSCFLVAMSGIDLRHKLLPDQLTLPLMWLGLVGSMDNLYMPAKPA 179
GW A++L+ LVA++ IDL LLPDQLTLPL+W GL+ ++ ++ A
Sbjct: 125 AMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDA 184

Query: 180 LLGAAVGYVSLWTVWWLFKQLTGKEGMGHGDFKLLAALGAWCGLKGILPIILFSSLVGAI 239
++GA GY+ LW+++W FK LTGKEGMG+GDFKLLAALGAW G + + ++L SSLVGA
Sbjct: 185 VIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAF 244

Query: 240 LGSAWLVAKGRDRATPIPFGPYLAIAGWVVFFWGNDLVDGYL 281
+G ++ + ++ PIPFGPYLAIAGW+ WG+ + YL
Sbjct: 245 MGIGLILLRNHHQSKPIPFGPYLAIAGWIALLWGDSITRWYL 286


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS04950  BCTERIALGSPF  382    e-133    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 382 bits (982), Expect = e-133
Identities = 115/405 (28%), Positives = 212/405 (52%), Gaps = 9/405 (2%)

Query: 22 TFVWEGADKRGVKMKGEQQARNANMLRAELRRQGIVPSMV-------KQKPKPLLGAA-G 73
+ ++ D +G K +G Q+A +A R LR +G+VP V ++ L
Sbjct: 3 QYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRK 62

Query: 74 KKITPKDIAFFSRQMATMMKSGVPIVSSLEIIGEGHKNPRMKKMVGQIRTDIEGGSSLYE 133
+++ D+A +RQ+AT++ + +P+ +L+ + + + P + +++ +R+ + G SL +
Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122

Query: 134 SISKHPVQFDELYRNLVRAGEGAGVLETVLETVATYKENIEALKGKIKKAMFYPATVVAV 193
++ P F+ LY +V AGE +G L+ VL +A Y E + ++ +I++AM YP + V
Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182

Query: 194 AIIVSAILLIFVVPQFEQVFKSFGAELPAFTQLLVNASRFMVSYWWLTLMVTVGSVVGFI 253
AI V +ILL VVP+ + F LP T++L+ S + ++ L+ + + F
Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242

Query: 254 FAYKRSPRMQHGLDRLILKVPVIGKIMHNSAIARFARTTAVTFKAGVPLVEALGIVAGAT 313
R + + R +L +P+IG+I AR+ART ++ + VPL++A+ I
Sbjct: 243 VML-RQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVM 301

Query: 314 GNKLYEEAVFRMRDDVSVGYPVNMAMKQVNLFPHMVIQMTAIGEEAGALDAMLFKVAEYF 373
N + D V G ++ A++Q LFP M+ M A GE +G LD+ML + A+
Sbjct: 302 SNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQ 361

Query: 374 EEEVNNAVDALSSLLEPLIMVFIGTIVGGIVIGMYLPIFKLGAVV 418
+ E ++ + L EPL++V + +V IV+ + PI +L ++
Sbjct: 362 DREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS04955  BCTERIALGSPG  43     3e-08    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 43.3 bits (102), Expect = 3e-08
Identities = 21/73 (28%), Positives = 35/73 (47%), Gaps = 10/73 (13%)

Query: 1 MKKQQGFTLIELMIVVAIIAILAAIALPAYQDYTTRAKLSEALTMSAPAKLAVTETSSSL 60
KQ+GFTL+E+M+V+ II +LA++ +P +A + AV++ +
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKAD----------KQKAVSDIVALE 53

Query: 61 GGLTNVTLANSGY 73
L L N Y
Sbjct: 54 NALDMYKLDNHHY 66


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS04985  HTHFIS  510    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 510 bits (1316), Expect = 0.0
Identities = 170/479 (35%), Positives = 255/479 (53%), Gaps = 18/479 (3%)

Query: 4 MSEPKSALVVDDERDIRELLVLTLGRMGLRISTAANLAEARELLANNPYDLCLTDMRLPD 63
M+ LV DD+ IR +L L R G + +N A +A DL +TD+ +PD
Sbjct: 1 MTGAT-ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 64 GNGIELVTEIARQYPQTPVAMITAFGSMDLAVEALKAGAFDFVSKPVDIGVLRGLVKHAL 123
N +L+ I + P PV +++A + A++A + GA+D++ KP D+ L G++ AL
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 124 ELNNRDRPAPPPPPPEQASSLLGDSSAMESLRATIGKVARSQAPVYIVGESGVGKELVAR 183
R RP+ + L+G S+AM+ + + ++ ++ + I GESG GKELVAR
Sbjct: 120 AEPKR-RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVAR 178

Query: 184 TIHEQGARAAGPFVPVNCGAIPAELMESEFFGHKKGSFTGAHADKPGLFQAAHGGTLFLD 243
+H+ G R GPFV +N AIP +L+ESE FGH+KG+FTGA G F+ A GGTLFLD
Sbjct: 179 ALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLD 238

Query: 244 EVAELPLQMQVKLLRAIQEKSVRPVGASGETLVDVRILSATHKDLGDLVSDGRFRHDLYY 303
E+ ++P+ Q +LLR +Q+ VG DVRI++AT+KDL ++ G FR DLYY
Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYY 298

Query: 304 RINVIELRVPPLRERGGDLPQLAAAIIARLAHSHGRPIPLLTQSALDALDHYGFPGNVRE 363
R+NV+ LR+PPLR+R D+P L + + A G + Q AL+ + + +PGNVRE
Sbjct: 299 RLNVVPLRLPPLRDRAEDIPDLVRHFVQQ-AEKEGLDVKRFDQEALELMKAHPWPGNVRE 357

Query: 364 LENILERALALAEDDQISATDLRLPAHGGHRLAATPGSAAVEPRE--------------- 408
LEN++ R AL D I+ + + +AA
Sbjct: 358 LENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFA 417

Query: 409 AVVDIDPASAALPSYIEQLERAAIQKALEENRWNKTKTAAQLGITFRALRYKLKKLGME 467
+ D P S + ++E I AL R N+ K A LG+ LR K+++LG+
Sbjct: 418 SFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS04990  PF06580  34     0.001    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 33.7 bits (77), Expect = 0.001
Identities = 13/70 (18%), Positives = 28/70 (40%), Gaps = 13/70 (18%)

Query: 431 ILTALVHNALKYG-RVMDEPARVKLRVERLERMAVIDVMDRGPGIPKAVAAQLFRPFYTT 489
++ LV N +K+G + + ++ L+ + ++V + G K
Sbjct: 259 LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK------------N 306

Query: 490 SEHGTGLGLY 499
++ TG GL
Sbjct: 307 TKESTGTGLQ 316


63  AXO1947_RS05515  AXO1947_RS05575  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS05515  4       15       1.346407   Holliday junction branch migration DNA helicase
AXO1947_RS05520  2       13       0.551374   tol-pal system-associated acyl-CoA thioesterase
AXO1947_RS05525  2       12       0.700795   Tol-Pal system subunit TolQ
AXO1947_RS05530  1       11       0.972042   protein TolR
AXO1947_RS05535  1       12       1.130822   protein TolA
AXO1947_RS05540  0       12       1.496651   translocation protein TolB
AXO1947_RS05545  1       14       2.276547   peptidoglycan-associated lipoprotein
AXO1947_RS05555  2       16       3.666516   tol-pal system protein YbgF
AXO1947_RS05560  1       14       3.355854   7-carboxy-7-deazaguanine synthase
AXO1947_RS05565  1       14       3.098736   *7-cyano-7-deazaguanine synthase
AXO1947_RS05575  0       14       0.706472   chemotaxis protein CheY
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS05700  FERRIBNDNGPP  29     0.031    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 28.8 bits (64), Expect = 0.031
Identities = 21/80 (26%), Positives = 31/80 (38%), Gaps = 17/80 (21%)

Query: 8 SSSIREDDAADASIRPKRLADYLGQQPVRE----QMEIYIQAAKAR-----------GEA 52
+ S + A A +AD L Q E Q E +I++ K R
Sbjct: 123 NFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTL 182

Query: 53 MD--HVLIFGPPGLGKTTLS 70
+D H+L+FGP L + L
Sbjct: 183 IDPRHMLVFGPNSLFQEILD 202


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits         Score  E-value  Comments
AXO1947_RS05720  IGASERPTASE  55     3e-10    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 54.7 bits (131), Expect = 3e-10
Identities = 37/220 (16%), Positives = 65/220 (29%), Gaps = 17/220 (7%)

Query: 39 LWSPE-----RSVEPAAGDPSMEASLDVSAADARVARQALKATPVETPPPAPLPEPAPE- 92
L++PE ++V+ DV + + A PP P E
Sbjct: 980 LYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTET 1039

Query: 93 --DSVPPPQPIPEPRPQDA--PTPQQAQAQERVAQPDKVDQDRVDALAISAEKAKQEQEA 148
++ E QDA T Q + K + V A + E A+ E
Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREV-------AKEAKSNVKANTQTNEVAQSGSET 1092

Query: 149 KRRQEQIDLTERKRQEEAEQKLRLAKQQEEADAKKKQAAAQQAAEDAERQKKIDDIRRQR 208
K Q ++E + K+ K QE + + Q+ +E + Q +
Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPT 1152

Query: 209 AQADKDMALAEQKLRQVAAARAQQSSAATATSAQPTAGQG 248
+ + A+ S+ + T G
Sbjct: 1153 VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTG 1192



Score = 51.6 bits (123), Expect = 3e-09
Identities = 35/224 (15%), Positives = 63/224 (28%), Gaps = 39/224 (17%)

Query: 59 LDVSAADARVARQALKATPVETPPPAPLPEPAPE---DSVPPPQPIPEPRPQDAPTPQQA 115
L+VS V A K L P E +V TP
Sbjct: 953 LNVSLVGNTVDLGAWKYKLRNVNGRYDLYNPEVEKRNQTVDTTNI---------TTPNNI 1003

Query: 116 QAQERVAQPDKVDQDRVDALAIS--------------AEKAKQEQ-----------EAKR 150
QA + + RVD + AE +KQE E
Sbjct: 1004 QADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTA 1063

Query: 151 RQEQIDLTERKRQEEAEQKLRLAKQQEEADAKKKQAAAQQAAEDAERQKKIDDIRRQRAQ 210
+ ++ + + Q +A+ E + + A + E + K++ + Q
Sbjct: 1064 QNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVP 1123

Query: 211 ADKDMALAEQKLRQVA--AARAQQSSAATATSAQPTAGQGGTST 252
+Q+ + A + + T +P + T+
Sbjct: 1124 KVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTAD 1167



Score = 35.0 bits (80), Expect = 4e-04
Identities = 28/203 (13%), Positives = 60/203 (29%), Gaps = 11/203 (5%)

Query: 47 EPAAGDPSMEASLDVSAAD-----ARVARQALKATPVETPPPAPLPEPAPEDSVPPPQPI 101
+ + EA +V A A+ + + ET A + E + V +
Sbjct: 1062 TAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATV-EKEEKAKVETEKTQ 1120

Query: 102 PEPRPQDAPTPQQAQAQERVAQPDKVDQDRVDALAISAEKAKQEQEAKRRQEQIDLTERK 161
P+ +P+Q Q++ Q + ++ + I +++ A Q + +
Sbjct: 1121 EVPKVTSQVSPKQEQSETVQPQAEP-ARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179

Query: 162 RQEEAEQKLRLAKQQEEADAKKKQAAAQQAAEDAERQKKIDD----IRRQRAQADKDMAL 217
Q E + + A Q ++E K + R +
Sbjct: 1180 EQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATT 1239

Query: 218 AEQKLRQVAAARAQQSSAATATS 240
+ VA ++ S
Sbjct: 1240 SSNDRSTVALCDLTSTNTNAVLS 1262


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits        Score  E-value  Comments
AXO1947_RS05730  OMPADOMAIN  106    3e-30    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 106 bits (266), Expect = 3e-30
Identities = 35/112 (31%), Positives = 51/112 (45%), Gaps = 11/112 (9%)

Query: 67 VYFDLDQDSLKPEFQAIMACHAKYLR--DRPSSRITLQGNADERGSREYNMGLGERRGNA 124
V F+ ++ +LKPE QA + L D + + G D GS YN GL ERR +
Sbjct: 221 VLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQS 280

Query: 125 VSSSLQAAGGSASQLTVVSYGEERPVCTESNE---------SCWSQNRRVEI 167
V L + G A +++ GE PV + + C + +RRVEI
Sbjct: 281 VVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEI 332


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS05735  RTXTOXIND  34     5e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 34.0 bits (78), Expect = 5e-04
Identities = 23/88 (26%), Positives = 40/88 (45%), Gaps = 7/88 (7%)

Query: 30 RVAVLEQQQANSQANNDL---LNQLQQARSDLQALRSTVEQLQHD--NEQLKQ--QSKDQ 82
+ AVLEQ+ +A N+L +QL+Q S++ + + + + NE L + Q+ D
Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN 310

Query: 83 YLDLDGRLNRLEGAGGATPSLPPATGSV 110
L L + E A+ P + V
Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKV 338


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS05760  HTHFIS  44     3e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 43.7 bits (103), Expect = 3e-08
Identities = 16/82 (19%), Positives = 38/82 (46%), Gaps = 3/82 (3%)

Query: 4 RVLLVEDESLVAMLLEDCLTELGYEVAATVADVDAALQAVHAGNLDLALPDVNLCGTLSF 63
+L+ +D++ + +L L+ GY+V ++ + + AG+ DL + DV + +F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDV-RITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 64 PIAEELDA--CGLPYIFVTGYA 83
+ + LP + ++
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQN 85


64  AXO1947_RS05760  AXO1947_RS05815  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS05760  0       21       -1.621331  hypothetical protein
AXO1947_RS05765  -1      21       -1.468129  helicase
AXO1947_RS05770  -1      21       -2.339281  tRNA
AXO1947_RS21220  -2      21       -2.350074  ADP-ribosylglycohydrolase
AXO1947_RS05785  0       15       -1.579883  energy transducer TonB
AXO1947_RS05790  0       14       -0.044950  glutathione synthetase
AXO1947_RS21225  0       16       -0.010256  response regulator
AXO1947_RS05795  -1      15       0.362844   response regulator
AXO1947_RS05805  2       12       1.904458   chemotaxis protein CheW
AXO1947_RS05810  -1      12       0.601158   chemotaxis protein
AXO1947_RS05815  -1      11       0.207338   hybrid sensor histidine kinase/response
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits         Score  E-value  Comments
AXO1947_RS05900  BACINVASINC  29     0.012    Salmonella/Shigella invasin protein C signature.
		>BACINVASINC#Salmonella/Shigella invasin protein C signature.

Length = 409

Score = 28.7 bits (63), Expect = 0.012
Identities = 25/97 (25%), Positives = 39/97 (40%), Gaps = 6/97 (6%)

Query: 69 RDTAKSKRQAGDLAGAAAALDQALGLVSGDPAILQERAEVSVLQADWPAAERFAKQAIAL 128
R A+ + GDL + + S A QER+E + Q + A + +A
Sbjct: 315 RIDARKMQMTGDLIMKNSVTVGGIAGASRQYAATQERSEQQISQVNNRVASTASDEARES 374

Query: 129 GSKTGPLCRRHWATIEQSRLARGEKENAASARAQIAG 165
K+ L + T+E ++ ASA A IAG
Sbjct: 375 SRKSTSLIQEMLKTMESI------NQSKASALAAIAG 405


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits  Score  E-value  Comments
AXO1947_RS05905  SECA  30     0.039    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.8 bits (67), Expect = 0.039
Identities = 31/121 (25%), Positives = 45/121 (37%), Gaps = 25/121 (20%)

Query: 10 EALSEGGALARQLDAFAPRAAQLR-----LTGAIAEAFEQRDVLLAEAGTGTGKTYAYLV 64
+ E A+ R+ + R +R L G + +R + AE TG GKT +
Sbjct: 62 NLIPEAFAVVREA---SKRVFGMRHFDVQLLGGMV--LNERCI--AEMRTGEGKTLTATL 114

Query: 65 PALLSGLKTIVSTGTH--ALQDQLFHRD---LPRVRAALG--VGLRSALL---KGRANYL 114
PA L+ L G H + D L RD + LG VG+ + R Y
Sbjct: 115 PAYLNALT---GKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYA 171

Query: 115 C 115

Sbjct: 172 A 172


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS05920  PF03544  133    6e-40    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 133 bits (336), Expect = 6e-40
Identities = 40/262 (15%), Positives = 86/262 (32%), Gaps = 37/262 (14%)

Query: 11 MDDGRRLMMTLVISLLLHGVLILGVVFAVSEDAPLVPTLDVIFSQTSTPLTPKQADFLAQ 70
+D RR ++S+ +HG ++ G+++ +P P P +A
Sbjct: 8 LDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPA----------PAQPISVTMVAP 57

Query: 71 ANQQGGGNHDTAQRPRDSQPGVVPQDRTGLAPQAQRATSVNAPTPTQTRVVTSRRGEQAV 130
A D P P+ P+ + P +
Sbjct: 58 A--------DLEPPQAVQPP---PEPVVEPEPEPEPIPEPPKEAPVV------------I 94

Query: 131 PTPQPNPQTDPLTPADAQRLQHDAEMARLAAEVHLRSEQYAKRPNRKFVSASTREYAYAN 190
P+P P+ P ++ + D + + A+ + +A+++
Sbjct: 95 EKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVA 154

Query: 191 YLRAWVDRAERVGNLNYPDDARRRRLGGKVVISVGVRRDGSVESSRVLVSSGVPLLDDAA 250
+ R YP A+ R+ G+V + V DG V++ ++L + + +
Sbjct: 155 SGPRALSRN----QPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREV 210

Query: 251 LRVVQLAQPFPPLPKTKDDVDI 272
++ + P P + V+I
Sbjct: 211 KNAMRRWRYEPGKPGSGIVVNI 232


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS05930  HTHFIS  73     2e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.3 bits (180), Expect = 2e-18
Identities = 28/115 (24%), Positives = 49/115 (42%), Gaps = 2/115 (1%)

Query: 15 KVMVIDDSKTIRRTAETLLKREGCEVVTATDGFEALAKIADQQPQIIFVDIMMPRLDGYQ 74
++V DD IR L R G +V ++ IA ++ D++MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 75 TCALIKGNQLFKSTPVIMLSSKDGLFDKARGRIVGSEQYLTKPFTREELLSAIRT 129
IK + PV+++S+++ + G+ YL KPF EL+ I
Sbjct: 65 LLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS05935  HTHFIS  85     9e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 85.3 bits (211), Expect = 9e-23
Identities = 34/116 (29%), Positives = 56/116 (48%), Gaps = 2/116 (1%)

Query: 2 ARIILIEDSPTDSAVFSQWLEKAGHTVVATDNAEEGLELVRSQVPDLVLMDVVLPGMSGF 61
A I++ +D V +Q L +AG+ V T NA + + DLV+ DVV+P + F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 QATRALARDQATKDIPVLLVSTKGMETDRAWGLRQGASDYIVKPPREDDLIARIRQ 117
+ + + D+PVL++S + +GA DY+ KP +LI I +
Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS05950  HTHFIS  66     7e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.4 bits (162), Expect = 7e-13
Identities = 24/117 (20%), Positives = 53/117 (45%), Gaps = 4/117 (3%)

Query: 1949 QVPLVMVVDDSLTMRKVTGRVLERHNLDVTTARDGVEALELLEERVPDLMLLDIEMPRMD 2008
++V DD +R V + L R DV + + DL++ D+ MP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 2009 GYELATAMRA-DPRFKAVPIVMITSRSGEKHRQRAFQIGVERYLGKPYQELDLMRNV 2064
++L ++ P +P++++++++ +A + G YL KP+ +L+ +
Sbjct: 62 AFDLLPRIKKARPD---LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115


65  AXO1947_RS05940  AXO1947_RS05970  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS05940   1      13        1.450461  IS5/IS1182 family transposase
AXO1947_RS05945   1      13        1.304974  IS5/IS1182 family transposase
AXO1947_RS05955  -3       9        1.140238  rhomboid family intramembrane serine protease
AXO1947_RS05960  -1      10        0.438303  ABC transporter
AXO1947_RS05965  -4      15       -0.921555  glycoside hydrolase family 3
AXO1947_RS05970  -3      16       -0.792147  beta-mannosidase
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS06070  PF05043  35     5e-04    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 35.3 bits (81), Expect = 5e-04
Identities = 20/86 (23%), Positives = 34/86 (39%), Gaps = 14/86 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAHM 140
+ L+ H NTAH+
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAHL 325


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS06075  PF05043  35     8e-04    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 34.5 bits (79), Expect = 8e-04
Identities = 20/85 (23%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS06090  BINARYTOXINB  32     0.011    Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 32.0 bits (72), Expect = 0.011
Identities = 20/104 (19%), Positives = 37/104 (35%), Gaps = 14/104 (13%)

Query: 49 SREEKVAQAMNDAPAIPRLGIP-AYEWWSEGLHGIARNGYATVFPQAIGLAASWNTHLMQ 107
SR+++ A P GIP + E GY + W +++ +
Sbjct: 192 SRKKRSTSAGPTVPDRDNDGIPDSLE----------VEGYTVDVKNKRTFLSPWISNIHE 241

Query: 108 QVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWSPNINI-FRDP 150
+ G + K++ A P D ++ G N++ R P
Sbjct: 242 KKGLTKYKSSPEKWSTASDPYSDFEKVTGR--IDKNVSPEARHP 283


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS06095  TYPE3IMRPROT  30     0.045    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 29.7 bits (67), Expect = 0.045
Identities = 9/23 (39%), Positives = 13/23 (56%)

Query: 1 MPLFRPRSVPAPARLGLALALAI 23
P+ RSVP +LGLA+ +
Sbjct: 30 APILSERSVPKRVKLGLAMMITF 52


66  AXO1947_RS06185  AXO1947_RS06225  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS06185   0      15       -2.532735  hybrid sensor histidine kinase/response
AXO1947_RS06190   1      13       -2.105180  histidine kinase
AXO1947_RS06195   2      12       -1.794741  histidine kinase
AXO1947_RS06200   2      13       -1.931329  histidine kinase
AXO1947_RS06205   3      12       -2.019566  alpha-L-fucosidase
AXO1947_RS06215   0      11        0.074814  hypothetical protein
AXO1947_RS06225  -2      13       -0.563966  IS5/IS1182 family transposase
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS06310  HTHFIS  75     7e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.9 bits (184), Expect = 7e-16
Identities = 23/117 (19%), Positives = 48/117 (41%)

Query: 1062 RLLLVEDDATVAQVIVGLLQTRGHHVTHVVHGLAALAEVSTRRFDAGLCDLDLPGLDGVA 1121
+L+ +DDA + V+ L G+ V + ++ D + D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1122 LVAQLRARGVRFPIVAVTARADADAEPQAMAAGCNGFLRKPVTGDLLAQALARVLTE 1178
L+ +++ P++ ++A+ +A G +L KP L + R L E
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS06315  HTHFIS  75     8e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.9 bits (184), Expect = 8e-16
Identities = 24/114 (21%), Positives = 49/114 (42%)

Query: 1054 RILLVEDEPTVAEVISGLLINRGHRVVHAAHGLAALAEAVDGGFDVALLDLDLPCLDGFA 1113
IL+ +D+ + V++ L G+ V ++ G D+ + D+ +P + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1114 LASQLRQLGHRFPLLAVTARADSAAEAQALAAGFDGFLRKPVTADLLVEAIAAA 1167
L ++++ P+L ++A+ +A G +L KP L+ I A
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS06320  HTHFIS  75     5e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.3 bits (185), Expect = 5e-16
Identities = 28/115 (24%), Positives = 51/115 (44%)

Query: 1070 RILLVEDDPTIAEVIVGLLRAQGHSVVHAPHGLAALTEAADNTFDLALLDLDLPGLDGFA 1129
IL+ +DD I V+ L G+ V + A DL + D+ +P + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1130 LARQLRAFGYEMPLIAVTARSDEVAEPKAQDAGFDSFLRKPLTGDMLADTIAEAL 1184
L +++ ++P++ ++A++ + KA + G +L KP L I AL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS06325  HTHFIS  75     6e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.9 bits (184), Expect = 6e-16
Identities = 28/115 (24%), Positives = 49/115 (42%)

Query: 1056 RILLVEDDPTIAEVIVGLLRAQGHSVVHAPHGLAALTEAADNTFDLALLDLDLPGLDGFA 1115
IL+ +DD I V+ L G+ V + A DL + D+ +P + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1116 LARQLRVFGYEMPLIAVTARSDEAAEPNAHEAGFDSFLRKPLTGDMLADTIAEAL 1170
L +++ ++P++ ++A++ A E G +L KP L I AL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS06355  PF05043  35     0.001    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 34.5 bits (79), Expect = 0.001
Identities = 20/85 (23%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


67  AXO1947_RS21280  AXO1947_RS06325  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS21280  -2      12       -0.054139  peptidase S1
AXO1947_RS06295  -2      17       -1.427477  elongation factor 4
AXO1947_RS06300  -2      18       -1.374735  S26 family signal peptidase
AXO1947_RS06305  -2      19       -1.417675  DUF4845 domain-containing protein
AXO1947_RS06310  -2      18       -1.259676  ribonuclease III
AXO1947_RS06315  -1      19       -1.501504  GTPase Era
AXO1947_RS06320  -1      18       -1.693736  DNA repair protein RecO
AXO1947_RS06325  -2      13       -0.245926  transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits        Score  E-value  Comments
AXO1947_RS06425  V8PROTEASE  72     6e-16    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 72.3 bits (177), Expect = 6e-16
Identities = 33/163 (20%), Positives = 58/163 (35%), Gaps = 28/163 (17%)

Query: 136 AGKSMGSGFIISADGYVLTNHHVVDGASEVTVKLTDRR-----------EFKA-KVVGSD 183
G + SG ++ +LTN HVVD L F A ++
Sbjct: 99 TGTFIASGVVV-GKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYS 157

Query: 184 EQYDVALLKIEA--------KGLPTVRLGDSNTLKPGQWVVAIGSPFGLDHSVTAGIVSA 235
+ D+A++K + + + ++ + Q + G P V+
Sbjct: 158 GEGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKP-------VAT 210

Query: 236 TGRSNPYADQRYVPFIQTDVAINQGNSGGPLLNTRGEVVGINS 278
S +Q D++ GNSG P+ N + EV+GI+
Sbjct: 211 MWESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW 253


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS06430  TCRTETOQM  146    2e-39    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 146 bits (370), Expect = 2e-39
Identities = 95/455 (20%), Positives = 179/455 (39%), Gaps = 85/455 (18%)

Query: 8 NIRNFSIIAHVDHGKSTLADRIIQLCGG---LQAREMEAQVLDSNPIERERGITIKAQSV 64
I N ++AHVD GK+TL + ++ G L + + D+ +ER+RGITI+
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 65 SLPYTAKDGQVYHLNFIDTPGHVDFSYEVSRSLAACEGALLVVDAAQGVEAQSVANCYTA 124
S + +N IDTPGH+DF EV RSL+ +GA+L++ A GV+AQ+ +
Sbjct: 62 SFQWEN-----TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116

Query: 125 VEQGLEVVPVLNK-----IDLP----------TADVDRAKA----------------EIE 153
+ G+ + +NK IDL +A++ + + +
Sbjct: 117 RKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176

Query: 154 AVIG--------------IDAEDAVAV----------------SAKTGLNIDLVLEAIVH 183
VI ++A + SAK + ID ++E I +
Sbjct: 177 TVIEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITN 236

Query: 184 RIPPPTPRDTDKLQALIIDSWFDNYLGVVSLVRVMQGEIKPGSKILVMSTGRTHLVDKVG 243
+ T R +L + + ++ +R+ G + + + + + +
Sbjct: 237 KFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKIKITEMYT 296

Query: 244 VFTPKRKELSALGAGEVGWINASIKDVHGAPVGDTLTLAVDPAPHALPGFQEMQPRVFAG 303
+ ++ +GE+ + + + +GDT L + P +
Sbjct: 297 SINGELCKIDKAYSGEIVILQNEFLKL-NSVLGDTKLLPQRERI------ENPLPLLQTT 349

Query: 304 LFPVDAEYYPDLREALDKLRLNDAALRFE--PESSEAMGFGFRCGFLGMLHMEIVQERLE 361
+ P + L +AL ++ +D LR+ + E + FLG + ME+ L+
Sbjct: 350 VEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEII-----LSFLGKVQMEVTCALLQ 404

Query: 362 REYNLNLISTAPTVVY--EVLKTDGSVIPMDNPSK 394
+Y++ + PTV+Y LK I ++ P
Sbjct: 405 EKYHVEIEIKEPTVIYMERPLKKAEYTIHIEVPPN 439


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS06450  TCRTETOQM  32     0.004    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 31.7 bits (72), Expect = 0.004
Identities = 20/70 (28%), Positives = 34/70 (48%), Gaps = 10/70 (14%)

Query: 62 LVDTPGLHREQKRAMNRVMNRAARGSLEGVDAAVLVIEAGRWDEEDT-LAFRVLSDAEVP 120
++DTPG H + + R SL +D A+L+I A + T + F L +P
Sbjct: 72 IIDTPG-HMDFLAEVYR--------SLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122

Query: 121 VVLVVNKVDR 130
+ +NK+D+
Sbjct: 123 TIFFINKIDQ 132


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS06460  HTHFIS  66     1e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.0 bits (161), Expect = 1e-14
Identities = 27/127 (21%), Positives = 46/127 (36%)

Query: 7 SHPRLLLVEDDPISRGFLQAVLEGLPATLDCADSLSSALDRARERRHDLWLIDVNLPDGT 66
+ +L+ +DD R L L + + ++ DL + DV +PD
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 67 GSGLLRALRLLHPDVPALAHTADTSTAMQSGLQSDGFLQVLIKPLTSERLLQAVRRGLAR 126
LL ++ PD+P L +A + G L KP L+ + R LA
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 127 GRSGVAP 133
+ +
Sbjct: 122 PKRRPSK 128


68  AXO1947_RS06905  AXO1947_RS06945  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS06905   0      25       -0.465649  bacterioferritin
AXO1947_RS06910   0      22        0.170607  DNA topoisomerase IV subunit A
AXO1947_RS06920  -2      18        1.010056  AraC family transcriptional regulator
AXO1947_RS06925   0      14        1.353947  MarR family transcriptional regulator
AXO1947_RS06930   2      16        1.571510  multidrug RND transporter
AXO1947_RS06935   3      15        0.737822  multidrug transporter
AXO1947_RS06940   1      11        1.008136  MFS transporter
AXO1947_RS06945  -1      12        1.042002  *glycosyl hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits         Score  E-value  Comments
AXO1947_RS07015  HELNAPAPROT  45     1e-08    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 44.9 bits (106), Expect = 1e-08
Identities = 20/111 (18%), Positives = 44/111 (39%), Gaps = 10/111 (9%)

Query: 38 MALYERINHEMEEETEHADALLRRILFLEGDPDMRPAEFA---------PGKTVVEMLER 88
L+E+ + E D + R+L + G P E+ + EM++
Sbjct: 44 FTLHEKFEELYDHAAETVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQA 103

Query: 89 DLVVEYEVRAALAAGMKLCEDHGDYVSRDILLKQLQDTEEDHAWWLEQQLG 139
+ ++ + + L E++ D + D+ + +++ E+ W L LG
Sbjct: 104 LVNDYKQISSESKFVIGLAEENQDNATADLFVGLIEEVEK-QVWMLSSYLG 153


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS07040  RTXTOXIND  74     1e-16    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 74.5 bits (183), Expect = 1e-16
Identities = 48/293 (16%), Positives = 95/293 (32%), Gaps = 36/293 (12%)

Query: 82 VERGQLLVQLDPADTEAAMQQAEANLAKTVRQVRGLYRSVEGAQAELSAREVALRRARSD 141
V R L++ + + Q E NL K + A ++ E R +S
Sbjct: 184 VLRLTSLIKEQFSTWQNQKYQKELNLDKKRAER-------LTVLARINRYENLSRVEKSR 236

Query: 142 FARRKDLAATGAIS--------------NEELAHARDELAAAEAAVSGSRESFERNRAL- 186
L AI+ EL + +L E+ + ++E ++ L
Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296

Query: 187 ---VDDSAVANQPDVQTAAAQLRQAYLNHARTGVVAPVSGYVARRAAQ-VGQRVQPGSVL 242
+ D ++ +L + + + APVS V + G V L
Sbjct: 297 KNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL 356

Query: 243 MVVVPLEQV-WVEANFKETQLKHMRLGQEVELHSDLYGGGVSYTGRIQSLGLGTGSAFSL 301
MV+VP + V A + + + +GQ + + + + G + G
Sbjct: 357 MVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPY--TRYGYL------VGK-VKN 407

Query: 302 LPAQNASGNWIKIVQRVPVRIAVDSKQLASNPLRIGLSMKVEVKLHDQQGSVL 354
+ + +V V + I + + + + M V ++ SV+
Sbjct: 408 INLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVI 460


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS07045  TCRTETB  117    1e-30    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 117 bits (295), Expect = 1e-30
Identities = 93/400 (23%), Positives = 164/400 (41%), Gaps = 26/400 (6%)

Query: 33 LAMASFMQVLDSTIANVSLPTIAGNLGASSQQATWVITSFAVSTAIALPLTGWLSRRFGE 92
L + SF VL+ + NVSLP IA + WV T+F ++ +I + G LS + G
Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78

Query: 93 TKLFVWSTLAFTIASLLCGLAQSM-GMLVVARALQGFVAGPMYPITQSLLVSIY-PREKR 150
+L ++ + S++ + S +L++AR +QG +P ++V+ Y P+E R
Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQG-AGAAAFPALVMVVVARYIPKENR 137

Query: 151 GHALALLSMITVVAPIAGPILGGWITDNYSWEWIFLINVPLGIIASSIVGSQLRH--RPE 208
G A L+ I + GP +GG I W +L+ +P + + I L + E
Sbjct: 138 GKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIP---MITIITVPFLMKLLKKE 192

Query: 209 QLEKPRMDYIGLILLVAGVGALQLVLDLGNDEDWFSSDKIVVLACIAAVALVVFVIWELT 268
K D G+IL+ G+ L F++ + ++ ++ ++FV
Sbjct: 193 VRIKGHFDIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRK 242

Query: 269 DKDPIVDLKLFRHRNFRTGTLAMVVAYAAFFSVSLLIPQWLQRDMGYTAIWAGLATAPIG 328
DP VD L ++ F G L + + ++P ++ + G G
Sbjct: 243 VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302

Query: 329 ILPVLMT-PFVGKYALRFDLRLLATIAFIFMSFTSFFRSNFNLQVDFSHVATIQLVMGVG 387
+ V++ G R + I F+S SF ++F L + + +++ V
Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLS-VSFLTASFLL--ETTSWFMTIIIVFVL 359

Query: 388 VALFFMPVL--HILLSDLDGREIAAGSGLATFLRTLGGSF 425
L F + I+ S L +E AG L F L
Sbjct: 360 GGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS07070  PYOCINKILLER  38     2e-04    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 37.9 bits (87), Expect = 2e-04
Identities = 46/263 (17%), Positives = 83/263 (31%), Gaps = 20/263 (7%)

Query: 410 VMSGGGSSRVDVTINGGNAVPGITPTTWPGPVIIHPSSPLQALRAALPNAQIDYVDGKDR 469
+ G++ + I+ AV G + P + + +S + R A D
Sbjct: 268 IQVAQGAASLAQAISDAIAVLGRVLASAPSVMAVGFASLTYSSRTAEQWQDQT----PDS 323

Query: 470 AAAARAAKAADVAIVFATQWST-----ESVDLPDMRLPDNQDALIEAVA-KANPKTTVVL 523
A AA + + + + +VDLP MRL + ++ + +V
Sbjct: 324 VRYALGMDAAKLGLPPSVNLNAVAKASGTVDLP-MRLTNEARGNTTTLSVVSTDGVSVPK 382

Query: 524 ETNGPVRMPWAERVPAVLQAWYPGIGGGEAIANLLTGAVNPSGHLPVTWPVDESQLPRPS 583
PVRM + + P L +P G+ + P P
Sbjct: 383 AV--PVRMAAYNATTGLYEVTVPSTTAEAPPLILTWTPASPPGNQNPSSTTPVVPKPVPV 440

Query: 584 IPGLGFKPVEPGEDTIDYAIEGANVG-YKWFAARKLTPRYPFGHGLSYTQFRMGGLRVEA 642
G PV+ +T I + A + P Y + + R
Sbjct: 441 YEGATLTPVKATPETYPGVITLPEDLIIGFPADSGIKPIY-----VMFRDPRDVPGAATG 495

Query: 643 NGSQLTASFEVENIGQREGAAVP 665
G ++ ++ + Q EGA +P
Sbjct: 496 KGQPVSGNW-LGAASQGEGAPIP 517


69  AXO1947_RS08225  AXO1947_RS08260  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS08225   0      13        1.319447  AsnC family transcriptional regulator
AXO1947_RS08230  -1      14        0.312655  S-adenosylmethionine:tRNA
AXO1947_RS08235  -1      12        0.189080  tRNA-guanine(34) transglycosylase
AXO1947_RS08245  -2      13       -1.764483  preprotein translocase subunit YajC
AXO1947_RS08250  -2      22       -1.286918  protein translocase subunit SecD
AXO1947_RS08255  -2      27       -1.426799  protein translocase subunit SecF
AXO1947_RS08260  -3      31       -1.290981  IS5/IS1182 family transposase
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS08435  HTHFIS  27     0.033    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 26.7 bits (59), Expect = 0.033
Identities = 9/28 (32%), Positives = 16/28 (57%)

Query: 16 EDARASTAQIARRLGLSRTTVQSRIEKL 43
R + + A LGL+R T++ +I +L
Sbjct: 446 TATRGNQIKAADLLGLNRNTLRKKIREL 473


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS08455  SECFTRNLCASE  88     6e-21    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 87.6 bits (217), Expect = 6e-21
Identities = 36/175 (20%), Positives = 83/175 (47%), Gaps = 3/175 (1%)

Query: 439 VIGPSLGAENVERGVTAVVYSFLFTLVFFTIYYRVFGAITSV-ALLFNLLIVVAVMSLFG 497
+GP + E V V +++ + + + + + + A+ +V AL+ ++L+ V + ++
Sbjct: 142 SVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTVGLFAVLQ 201

Query: 498 ATMTLPGFAGLALSVGLSVDANVLINERIREELRL--GVPAKSAIAAGYEKAGGTILDAN 555
L A L G S++ V++ +R+RE L +P + + + +
Sbjct: 202 LKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNETLSRTVMTG 261

Query: 556 LTGLIVAVALYAFGTGPLKGFALTMMIGIFASMFTAITVSRALAVLIYGSRKKLK 610
+T L+ V + +G ++GF M+ G+F ++++ V++ + + I R K K
Sbjct: 262 MTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLDRNKEK 316


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS08460  SECFTRNLCASE  283    1e-96    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 283 bits (725), Expect = 1e-96
Identities = 98/320 (30%), Positives = 160/320 (50%), Gaps = 10/320 (3%)

Query: 4 FPLHLIPNDTKIDFMRWRKPVLILMLVLAVVSVGIIVGKGFNYALEFTGGTLVQTSFQKT 63
F L L+P T DF RW+ +V+ + SV + + G N+ ++F GGT ++T
Sbjct: 3 FRLKLVPEKTNFDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTA 62

Query: 64 VDVDQVREKLSKAGFENAQVQNAR------GGNEVMIRLQPHGQSNNRDDAAR---TVAE 114
+DV R L + + R + MIR+Q + +
Sbjct: 63 IDVGVYRAALEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVN 122

Query: 115 DVRKAVTSDENPATVQPGEFVGPQVGKDLALNGVYATVFMLVGFLIYIAFRFEWKFAVVA 174
V A+T+ + + E VGP+V +L V++ + V + YI RFEW+FA+ A
Sbjct: 123 KVETALTAVDPALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGA 182

Query: 175 SLTALFDLLVTVAFVSLTGREFDLTVLAGLLSVMGFAINDIIVVFDRVRENFRALRVEPL 234
+ + D+L+TV ++ +FDLT +A LL++ G++IND +VVFDR+REN + PL
Sbjct: 183 VVALVHDVLLTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPL 242

Query: 235 -EVLNRSINQTLSRTVITAVMFFLSALALYIYGGESMEGLAETHMIGAVIVVISSVIVAV 293
+V+N S+N+TLSRTV+T + L+ + + I+GG+ + G + G SSV VA
Sbjct: 243 RDVMNLSVNETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAK 302

Query: 294 PMLSIGPFAVTKQDLLPKAK 313
++ K+ P K
Sbjct: 303 NIVLFIGLDRNKEKKDPSDK 322


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS08480  PF05043  35     7e-04    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 34.9 bits (80), Expect = 7e-04
Identities = 20/85 (23%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


70  AXO1947_RS08295  AXO1947_RS08350  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS08295   2       8        1.237659  PTS fructose transporter subunit IIBC
AXO1947_RS08300   1       8       -0.358681  1-phosphofructokinase
AXO1947_RS08310   2       8        0.006018  phosphoenolpyruvate--protein phosphotransferase
AXO1947_RS21545   2       8       -0.134253  LacI family transcriptional regulator
AXO1947_RS08315   1       8       -0.167021  multidrug efflux RND transporter permease
AXO1947_RS08325   1       8       -0.515014  GntR family transcriptional regulator
AXO1947_RS08335  -2       8       -1.452155  TetR family transcriptional regulator
AXO1947_RS21550   1      10       -2.757260  glutamate dehydrogenase
AXO1947_RS08340   0      10       -1.489167  IS5/IS1182 family transposase
AXO1947_RS08345   1       9       -0.643000  hypothetical protein
AXO1947_RS08350   0      11       -0.794544  IS5/IS1182 family transposase
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS08510  RTXTOXINA  31     0.016    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 31.1 bits (70), Expect = 0.016
Identities = 25/103 (24%), Positives = 44/103 (42%), Gaps = 7/103 (6%)

Query: 53 LASGITQILVVGDADADTARFGDAQLVRLSLGAVLDDPAAALNQLAAP--AAATASTGAG 110
+ S I+ ++ +ADADT A V L+ + + + A A +++ A
Sbjct: 248 ILSAISASFILSNADADTRTKAAAG-VELTTKVLGNVGKGISQYIIAQRAAQGLSTSAAA 306

Query: 111 GESASSKRIVAITSCP-TGIAHTFMAAEGLQQAA---KKLGYQ 149
+S +AI+ IA F A +++ + KKLGY
Sbjct: 307 AGLIASAVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYD 349


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS08520  PHPHTRNFRASE  580    0.0      Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 580 bits (1496), Expect = 0.0
Identities = 211/568 (37%), Positives = 324/568 (57%), Gaps = 11/568 (1%)

Query: 274 AIVGIGASPGVAIGIVHRLRAAQTEVADQPV-GLGDGGAQLHDALTRTRQQLAAIQDDTQ 332
I GI AS GVAI ++ + + +L AL +++++L AI+D T+
Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTE 63

Query: 333 RQLGASDAAIFKAQAELLNDTDLITR-TCQLMVEGHGVAWSWHQAVEQIASGLAALGNPV 391
+GA A IF A +L+D +L+ ++ E ++ + + S ++ N
Sbjct: 64 ASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNEY 123

Query: 392 LAGRAADLRDVGRRVLAQLDPAAAGAGLTDLPAQPCILLASDLSPSDTANLDTARVLGLA 451
+ RAAD+RDV +RVL L G+ L + + +++A DL+PSDTA L+ V G A
Sbjct: 124 MKERAADIRDVSKRVLGHLIGVETGS-LATIA-EETVIIAEDLTPSDTAQLNKQFVKGFA 181

Query: 452 TAQGGPTSHTAILSRTLGLPALVAVGGQLLDIEDGVTAIIDGSSGRLYLNPSELDLDAAR 511
T GG TSH+AI+SR+L +PA+V I+ G I+DG G + +NP+E ++ A
Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241

Query: 512 THIAEQQAIRQREAAQRALPAETTDGHHIDIGANVNLPDQVAMALTQGAEGVGLMRTEFL 571
A + +Q A P+ T DG H+++ AN+ P V L G EG+GL RTEFL
Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301

Query: 572 FLESGRTPSEDEQHATYLAMAQVLDGRPLIVRALDIGGDKQVAHLELPHEENPFLGVRGA 631
+++ + P+E+EQ Y + Q +DG+P+++R LDIGGDK++++L+LP E NPFLG R
Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361

Query: 632 RLLLRRPDLLEPQLRALYRAVKDGARLSIMFPMITSVPELIALRAICARIRAELDA---- 687
RL L + D+ QLRAL RA G L +MFPMI ++ EL +AI + +L +
Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYG-NLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVD 420

Query: 688 --PEVPIGIMIEVPAAAAQADVLARHADFFSIGTNDLTQYVLAIDRQNPELAAEADSLHP 745
+ +GIM+E+P+ A A++ A+ DFFSIGTNDL QY +A DR N ++ HP
Sbjct: 421 VSDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHP 480

Query: 746 AVLRMIRSTIEGARTHGRWVGVCGGLAGDAFGASLLAGLGVQELSMTPNDIPAVKARLRG 805
A+LR++ I+ A + G+WVG+CG +AGD LL GLG+ E SM+ I +++L
Sbjct: 481 AILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLK 540

Query: 806 AALHQLQELAEQALACETAEQVRALEAK 833
+ +L+ A++AL +TAE+V L K
Sbjct: 541 LSKEELKPFAQKALMLDTAEEVEQLVKK 568


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS08540  ACRIFLAVINRP  1080   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1080 bits (2794), Expect = 0.0
Identities = 512/1038 (49%), Positives = 705/1038 (67%), Gaps = 17/1038 (1%)

Query: 1 MPKFFIEHPVFAWVVAILISLAGVISILNLGIESYPTIAPPQVTVTANFPGASADTAEKS 60
M FFI P+FAWV+AI++ +AG ++IL L + YPTIAPP V+V+AN+PGA A T + +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQQLTGIDHLLYFNSSSAANGRVTITLTFETGTDADIAQVQVQNKVSLATPRLPS 120
VTQVIEQ + GID+L+Y +S+S + G VTITLTF++GTD DIAQVQVQNK+ LATP LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVTQQGVVVAKANAGFLMVAALRSDNPSINRDALNDIVGSRLLEQISRVPGVGSTNQFGA 180
EV QQG+ V K+++ +LMVA SDNP +D ++D V S + + +SR+ GVG FGA
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 EYAMNIWLNPEKLQGYNLSATQVLTAVRNQNVQFAAGSVGADPTPEGISFTATVSAEGRF 240
+YAM IWL+ + L Y L+ V+ ++ QN Q AAG +G P G A++ A+ RF
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 SSPEQFENIILRTDNNGATVRLKDVARVTVGPSSYGFDTQYNGKPTGAFGIQLLPGANAL 300
+PE+F + LR +++G+ VRLKDVARV +G +Y + NGKP GI+L GANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 NVSDAVGAKLDELQPTFPQGVTWFAPYESTTFVRISIEEVIHTLVEAIVLVFLVMLLFLQ 360
+ + A+ AKL ELQP FPQG+ PY++T FV++SI EV+ TL EAI+LVFLVM LFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATVIPTLVIPVALLGTFFGMYVIGFTINQLTLFAMVLAIGIVVDDAIVVIENVERIM 420
N RAT+IPT+ +PV LLGTF + G++IN LT+F MVLAIG++VDDAIVV+ENVER+M
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 SEEHLEPKAATQKAMTQITGAVVAITVVLAAVFIPSSLQPGASGAIYKQFALTIAMSMGF 480
E+ L PK AT+K+M+QI GA+V I +VL+AVFIP + G++GAIY+QF++TI +M
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SAFLALSFTPALCGAFL---TSTHSTKKNWVYRTFDKYYDKLAHRYVGVVGHTLKRSPPW 537
S +AL TPALC L ++ H K + F+ +D + Y VG L + +
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 538 MIAFVALMVLCGFLFTRMPGSFLPDEDQGFAVAIVQLPPGATKIRTNEAFAQMRAVLEKQ 597
++ + ++ LF R+P SFLP+EDQG + ++QLP GAT+ RT + Q+ K
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 598 PA--VEGLLQIAGFSFLGSGENVGMGFIRLKPWEERDV---TAEQLIQQLNGAFYGIKGA 652
VE + + GFSF G +N GM F+ LKPWEER+ +AE +I + I+
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 653 QIFVVNLPTVQGLGQFGGFDMWLQDRSGAGQEALINARNIVLGKAAKKQDTLVGVRPNGL 712
+ N+P + LG GFD L D++G G +AL ARN +LG AA+ +LV VRPNGL
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 713 ENSPQLQLHVDRVQAQSMGLNVSDIYNSIQLMLAPVYVNDYFAEGRIKRVNMRADDQFRA 772
E++ Q +L VD+ +AQ++G+++SDI +I L YVND+ GR+K++ ++AD +FR
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 773 GPESLRNFFTPSTTATGADGQPAMIPLSNVVKAEWNYASPALNRYNGYSAVNIVGNPAPG 832
PE + + S A+G+ M+P S + W Y SP L RYNG ++ I G APG
Sbjct: 781 LPEDVDKLYVRS-----ANGE--MVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPG 833

Query: 833 GSSGQAMSAMEEIINNDLPPGFGFDWSGMSYQEIIAGNAATLLLALSVVVVFLCLAALYE 892
SSG AM+ ME + + LP G G+DW+GMSYQE ++GN A L+A+S VVVFLCLAALYE
Sbjct: 834 TSSGDAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYE 892

Query: 893 SWSIPVAVLMVVPIGVLGAITFSMLRGLPNDLYFKIGMITVIGLAAKNAILIVEFALE-Q 951
SWSIPV+V++VVP+G++G + + L ND+YF +G++T IGL+AKNAILIVEFA +
Sbjct: 893 SWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLM 952

Query: 952 RAAGKTLREATLEAAHLRFRPILMTSFAFILGVLPLAISTGAGANARHSIGTGVIGGMVF 1011
GK + EATL A +R RPILMTS AFILGVLPLAIS GAG+ A++++G GV+GGMV
Sbjct: 953 EKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVS 1012

Query: 1012 ATVLGLIFIPLFFVVVRR 1029
AT+L + F+P+FFVV+RR
Sbjct: 1013 ATLLAIFFVPVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS08545  RTXTOXIND  42     3e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 42.1 bits (99), Expect = 3e-06
Identities = 16/108 (14%), Positives = 42/108 (38%), Gaps = 7/108 (6%)

Query: 59 RSADVRARVDGVVLKRLYTEGANVKEGQPLFQIDPSQLKATLLQAQGQLAAAEATYTNAK 118
RS +++ + +V + + EG +V++G L ++ L A+ +++ A+
Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTA-------LGAEADTLKTQSSLLQAR 147

Query: 119 IAATRARSLAPQQYVSRADIDTAEANERSSGASVQQARGAVEAARIQL 166
+ TR + L+ +++ S ++ + Q
Sbjct: 148 LEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQF 195



Score = 31.0 bits (70), Expect = 0.010
Identities = 12/51 (23%), Positives = 23/51 (45%), Gaps = 4/51 (7%)

Query: 59 RSADVRARVDGVVLK-RLYTEGANVKEGQPLFQIDPSQLKATLLQAQGQLA 108
+++ +RA V V + +++TEG V + L I P L+ +
Sbjct: 326 QASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPED---DTLEVTALVQ 373


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS08550  HTHTETR  54     4e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 53.9 bits (129), Expect = 4e-11
Identities = 30/170 (17%), Positives = 59/170 (34%), Gaps = 9/170 (5%)

Query: 7 RAARRSDCDRRIHAAVHALLAERGMR-LSMDAVAERAGCSKQTLYSYYGCKENLLRDVLQ 65
+ + I L +++G+ S+ +A+ AG ++ +Y ++ K +L ++ +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 66 DHVH----LAVGPLGTVSGDLRADLLAFARGHLDRLNNPDV---LQTCRLVEAESHRFPD 118
L + GD + L L+ + L + E
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMA 124

Query: 119 QSQQIFHDGVVGMQQRLAHRFEQAIEAGQLRHD-DPRFMAELLLSMIVGL 167
QQ + + R+ + IEA L D R A ++ I GL
Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL 174


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS08575  PF05043  35     7e-04    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 34.9 bits (80), Expect = 7e-04
Identities = 20/85 (23%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


71  AXO1947_RS08430  AXO1947_RS08490  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS08430  -1      16        1.196334  MFS transporter
AXO1947_RS08435   0      17        0.149261  diguanylate cyclase response regulator
AXO1947_RS08445  -2      13       -1.187624  FAD-dependent oxidoreductase
AXO1947_RS08455  -1       8       -1.661737  gamma-glutamylputrescine synthetase
AXO1947_RS08460   0      18       -4.421027  glutamine synthetase
AXO1947_RS08465   4      28       -5.468675  IS5/IS1182 family transposase
AXO1947_RS21570   5      18       -1.523755  multidrug ABC transporter permease
AXO1947_RS08485   4      13       -0.378277  MFS transporter
AXO1947_RS08490   2      14        1.209710  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS08650  PREPILNPTASE  31     0.008    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 30.9 bits (70), Expect = 0.008
Identities = 43/180 (23%), Positives = 72/180 (40%), Gaps = 25/180 (13%)

Query: 180 AALDDAQFMQWGWRVPFLASALLVLTGLWVRLSITETPDFQKTLDSKTRVTLPLGTVVSQ 239
A L A M LA L+LT + V L+ + D D T L G + +
Sbjct: 118 ALLSVAVAMTLAPGWGTLA--ALLLTWVLVALTFIDL-DKMLLPDQLTLPLLWGGLLFNL 174

Query: 240 HGP--ALVIGTLGAFATFVLFYLMT-VFALGYGTKTLGYDKEQFLLLQMAGIVFFVLGIP 296
G +L +GA A +++ + + F L G + +GY F LL G LG
Sbjct: 175 LGGFVSLGDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYG--DFKLLAALG---AWLGWQ 229

Query: 297 LSAKFGDRHGAPLAMLLASIAIVAFGLAFAPLFQANHPLQVLAF---LSLGFFFMGLTYG 353
P+ +LL+S+ G+ L + +H + + F L++ ++ L +G
Sbjct: 230 ---------ALPIVLLLSSLVGAFMGIGLI-LLRNHHQSKPIPFGPYLAIA-GWIALLWG 278


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS08655  HTHFIS  73     4e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 72.9 bits (179), Expect = 4e-16
Identities = 28/111 (25%), Positives = 43/111 (38%), Gaps = 3/111 (2%)

Query: 120 RIAALVVDDSLSARTYAGALLSMYGYRVVLAADGPAGLEAIERDPSIRLTIVDQEMPGMD 179
LV DD + RT LS GY V + ++ I L + D MP +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDG-DLVVTDVVMPDEN 61

Query: 180 GVEFTRRLRAIRSRDKVALIGISGNSDSSLIPRFLKNGANDFLRKPFSREE 230
+ R++ R V + +S + + + GA D+L KPF E
Sbjct: 62 AFDLLPRIKKARPDLPVLV--MSAQNTFMTAIKASEKGAYDYLPKPFDLTE 110


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits         Score  E-value  Comments
AXO1947_RS08675  adhesinmafb  29     0.033    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 29.3 bits (65), Expect = 0.033
Identities = 37/167 (22%), Positives = 58/167 (34%), Gaps = 25/167 (14%)

Query: 13 KQPESALRRWLKERSITEVECLVPDITGNARG--KIIPADKFSHDYGTRLPEGIFATTVT 70
K A+ RW++E P+ + A K + P V+
Sbjct: 290 KNTREAVDRWIQEN---------PNAAETVEAVFNVAAAAKVAKLAKAAKPG---KAAVS 337

Query: 71 GDFPDDYYALTSPSDSDMHLRPDASTVRMVPWATDPTAQVIHDCYTKDGDPHEL-APRNV 129
GDF D Y + SDS L +A + + + D +K + E+ A N
Sbjct: 338 GDFADSYKKKLALSDSARQLYQNAKYREALDIHYEDLIRRKTDGSSKFINGREIDAVTN- 396

Query: 130 LRRVLDAYAQVK--LQPVVAPELEFFLVQKNTDPDFPLLPPAGRSGR 174
DA Q K + + P + FL QKN + A + G+
Sbjct: 397 -----DALIQAKRTISAIDKP--KNFLNQKNRKQIKATIEAANQQGK 436


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS08700  RTXTOXIND  96     6e-24    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 96.4 bits (240), Expect = 6e-24
Identities = 51/371 (13%), Positives = 113/371 (30%), Gaps = 83/371 (22%)

Query: 81 SVAVAPRVSGYVTKVLVSDNQIVEAGQPLLQIDDRTYQATLQQAEAAIAARQADIVAATA 140
S + P + V +++V + + V G LL++ +A + ++++ + +
Sbjct: 96 SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQI 155

Query: 141 NVSAQESALLQARTQVTAAAASLRFAQAEVKRFAPLAASGADTHEHQES-LQHDLARARA 199
+ E L + ++ EV R L T ++Q+ + +L + RA
Sbjct: 156 LSRSIELNKLPELK-LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRA 214

Query: 200 QYDAAQAQAKAGESHIQASRAQLE------------------------QAQAGVKQATAD 235
+ A+ E+ + +++L+ +A ++ +
Sbjct: 215 ERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQ 274

Query: 236 ADQARVAVEDTRLTSRIH------------------------------------------ 253
+Q + + ++
Sbjct: 275 LEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPV 334

Query: 254 -GRVGD-KTVQVGQFLGAGTRTMTIVPQQSLYLV-ANFKETQVGLMRPGQPVEIEVDALS 310
+V K G + M IVP+ V A + +G + GQ I+V+A
Sbjct: 335 SVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFP 394

Query: 311 GVK---LHGKVESLSPGTGSQFALLPPENATGNFTKVVQRVPVRIRVLAGDEARKVLVPG 367
+ L GKV++++ + G V+ + L G
Sbjct: 395 YTRYGYLVGKVKNINLDA-------IEDQRLGLVFNVIISIEENCLSTGNKNIP--LSSG 445

Query: 368 MSVEVTVDTRS 378
M+V + T
Sbjct: 446 MAVTAEIKTGM 456


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS08705  TCRTETB  101    3e-25    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 101 bits (254), Expect = 3e-25
Identities = 82/407 (20%), Positives = 165/407 (40%), Gaps = 20/407 (4%)

Query: 25 WLAVLAGTIGSFMATLDISIVNAALPTIQGEVGASGTEGTWISTAYLVAEIIMIPLTGWF 84
WL +L SF + L+ ++N +LP I + W++TA+++ I + G
Sbjct: 18 WLCIL-----SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKL 72

Query: 85 VRTLGLRNFLLICAVMFTAFSVVCGLSTS-LSMMIIGRVGQGLAGGALIPTALTIVATRL 143
LG++ LL ++ SV+ + S S++I+ R QG A + +VA +
Sbjct: 73 SDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYI 132

Query: 144 PPSQQTMGTALFGMTVIMGPVIGPLLGGWLTENVSWHYAFFINVPICVGLVALLLLGLKH 203
P + L G V MG +GP +GG + + W Y + +P+ + L+ L
Sbjct: 133 PKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSY--LLLIPMITIITVPFLMKLLK 190

Query: 204 EKGDWAGLLNADWLGIYGLTAGLGGLTVVLEEGQRERWFESSEINTLSLIALSGFIALVI 263
++ G D GI ++ G+ + +L F +S + ++++ F+ V
Sbjct: 191 KEVRIKGHF--DIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVK 238

Query: 264 SQFRRRPPVIRLSLLVQRSFGAVFIMVMAVGMILFGVMYMIPQFLAVISGYNTEQAGYVL 323
+ P + L F + + + G + M+P + + +T + G V+
Sbjct: 239 HIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVI 298

Query: 324 LLSGLPTVLLMPMMPKLLEMVDVRILVIAGLICFAAACFVNLTLTADTVGTHFVAGQLLQ 383
+ G +V++ + +L + V+ + F + F+ + +T +
Sbjct: 299 IFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFV 358

Query: 384 GCGLALAMMSLNQAAISSVPPELAGDASGLFNAGRNLGGSVGLALIS 430
GL+ ++ SS+ + AG L N L G+A++
Sbjct: 359 LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVG 405


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS08710  RTXTOXIND  41     9e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.0 bits (96), Expect = 9e-06
Identities = 32/217 (14%), Positives = 58/217 (26%), Gaps = 12/217 (5%)

Query: 73 TLTQLVTQALADSPNLRAAQARLRANRALAQRRRAERLPTLNASALYAYAEPPQTIVDTL 132
LT L +A QARL R R E L L
Sbjct: 126 KLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELN-KLPELKLPDEPYFQNV----- 179

Query: 133 GGLQQQGQAGQPPAAGNQALDLEKTQIYSAGFDASWELDVFGRRRRAAEGALAQAQ---A 189
++ + +K Q E R E +
Sbjct: 180 -SEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLD 238

Query: 190 SEAELADAQVQLAAEVGQVYLNYRGLQARLAIADANLDKIRQTLQLVQQRRGQGAASDLQ 249
+ L Q V + Y L + + L++I + ++ +
Sbjct: 239 DFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL-VTQLFK 297

Query: 250 VEQIATQVQQQQAQRLPLEMQSQEAQDQLALMVGRAP 286
+I +++Q L ++ + +++ V RAP
Sbjct: 298 -NEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAP 333


72  AXO1947_RS08830  AXO1947_RS08845  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias   Product
AXO1947_RS08830  0       13       0.022426  N-ethylammeline chlorohydrolase
AXO1947_RS08835  -1      15       0.846088  bifunctional 3-demethylubiquinone
AXO1947_RS08840  0       14       1.201514  phosphoglycolate phosphatase
AXO1947_RS08845  2       16       1.182042  phytoene synthase
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS09050  UREASE  32     0.006    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 31.6 bits (72), Expect = 0.006
Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 356 TLGGARALGFGDTIGSIEIGKQADLI 381
T+ A A G IGS+E+GK+ADL+
Sbjct: 410 TINPAIAHGLSHEIGSLEVGKRADLV 435


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS09055  DHBDHDRGNASE  27     0.047    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 27.3 bits (60), Expect = 0.047
Identities = 20/101 (19%), Positives = 40/101 (39%), Gaps = 10/101 (9%)

Query: 54 LAGARVLDVGCGGGL---LSESMARLGAQVTAIDLAPELVKVARLHGLESGVQVDYRVQS 110
+ G G G+ ++ ++A GA + A+D PE +L + S ++ + R
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPE-----KLEKVVSSLKAEARHAE 60

Query: 111 VEDLAAEQPGSFDAVTCMEMLEHVPDPTAIIRACARLLKPG 151
+ D + + + P I+ A +L+PG
Sbjct: 61 AFPADVRDSAAIDEI-TARIEREM-GPIDILVNVAGVLRPG 99


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits         Score  E-value  Comments
AXO1947_RS09060  BACINVASINB  32     0.003    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 31.6 bits (71), Expect = 0.003
Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 4/77 (5%)

Query: 49 APIALASLRPVVSKGARAMLGVAFAELDAEACVALVPEFLQRYEDVIGTQSQLF-DGVEE 107
A +A+ + VV KGA A LG A +++ E LVP L++ S+LF G++
Sbjct: 418 AMVAVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQ---NGSKLFTQGMQR 474

Query: 108 LLVRLENAGCVWGIVTN 124
+ L N G G+ TN
Sbjct: 475 ITSGLGNVGSKMGLQTN 491


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS09065  PHAGEIV  30     0.011    Gene IV protein signature.
		>PHAGEIV#Gene IV protein signature.

Length = 426

Score = 29.5 bits (66), Expect = 0.011
Identities = 20/72 (27%), Positives = 28/72 (38%), Gaps = 14/72 (19%)

Query: 114 QQALGTLHAFAEAV------VAVEAVLFARTQPGDAAAVAVQWLDARQRVAGDSAAPGGV 167
+ L L F V + +E ++F Q GDA + R VA GGV
Sbjct: 178 KDILDNLPQFLSTVDLPTDQILIEGLIF-EVQQGDALDFSFAAGSQRGTVA------GGV 230

Query: 168 ATAAWLTQLLQA 179
T LT +L +
Sbjct: 231 NTDR-LTSVLSS 241


73  AXO1947_RS09345  AXO1947_RS09370  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS09345  2       24       -0.686876  phosphoenolpyruvate synthase
AXO1947_RS09350  2       22       -0.608265  phosphoenolpyruvate synthase regulatory protein
AXO1947_RS09355  2       20       -0.962092  hypothetical protein
AXO1947_RS09360  1       19       -1.331647  7,8-dihydro-8-oxoguanine triphosphatase
AXO1947_RS09365  1       16       -1.191988  3-hydroxybutyrate dehydrogenase
AXO1947_RS09370  -1      14       -0.645480  class III poly(R)-hydroxyalkanoic acid synthase
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS09570  PHPHTRNFRASE  277    7e-86    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 277 bits (711), Expect = 7e-86
Identities = 139/574 (24%), Positives = 235/574 (40%), Gaps = 84/574 (14%)

Query: 260 KAIRMVYSDVPGERVRTEDTPVE---LRSTFSISDEDVQELSKQAL---------VIEKH 307
KA + +V E+ D E L + S E+++ + Q + H
Sbjct: 18 KAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTEASMGADKAEIFAAH 77

Query: 308 YGRPMDIEWAKDGVSGKLFIVQARPETVKSRSHATQIERFSLEAKDAKILVEGRAVGAKI 367
D E + GK+ Q E + F E+ D + + E RA A I
Sbjct: 78 LLVLDDPELVDG-IKGKIENEQMNAEYALKEVSDMFVSMF--ESMDNEYMKE-RA--ADI 131

Query: 368 GSGVARVVRSLDDMNRVQAGD-----VLIA-DMTDPDWEPVMK-RASAIVTNRGGRTCHA 420
RV+ L + V+IA D+T D + K T+ GGRT H+
Sbjct: 132 RDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSHS 191

Query: 421 AIIARELGVPAVVGSGNATDVLSDGQEVTVSCAEG---------DTGFIYEGLLPFERTT 471
AI++R L +PAVVG+ T+ + G V V EG + E FE+
Sbjct: 192 AIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYEEKRAAFEKQK 251

Query: 472 TDLGNMPPAP--------LKIMMNVANPERAFDFGQLPNAGIGLARLEMIIAAHIGIHPN 523
+ + P +++ N+ P+ GIGL R E + +
Sbjct: 252 QEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFLYMDR-----D 306

Query: 524 ALLEYDKQDADVRKKIDAKIAGYGDPVSFYVNRLAEGIATLTASVAPNTVIVRLSDFKSN 583
L ++Q ++ + G PV ++R D +
Sbjct: 307 QLPTEEEQFEAYKEVVQRM---DGKPV-----------------------VIRTLDIGGD 340

Query: 584 EYANLIGGSRYEPHEENPMIGFRGASRYVDPSFTKAFALECKAVLKVRNEMGLDNLWVMI 643
+ + + P E NP +GFR ++ F + +A+L+ NL VM
Sbjct: 341 KELSYL----QLPKELNPFLGFRAIRLCLE--KQDIFRTQLRALLRAS---TYGNLKVMF 391

Query: 644 PFVRTIEEGRKVIEVLEQNGLKQ-GDGADGKPGLKIIMMCELPSNALLADEFLDIFDGFS 702
P + T+EE R+ ++++ K +G D +++ +M E+PS A+ A+ F D FS
Sbjct: 392 PMIATLEELRQAKAIMQEEKDKLLSEGVDVSDSIEVGIMVEIPSTAVAANLFAKEVDFFS 451

Query: 703 IGSNDLTQLTLGLDRDSSIVAHLFDERNPAVKKLLSMAIKSARAKGKYVGICGQGPSDHP 762
IG+NDL Q T+ DR + V++L+ +PA+ +L+ M IK+A ++GK+VG+CG+ D
Sbjct: 452 IGTNDLIQYTMAADRMNERVSYLYQPYHPAILRLVDMVIKAAHSEGKWVGMCGEMAGD-E 510

Query: 763 ELAEWLMQEGIGSVSLNPDTVVDTWLRLAKLKSE 796
L+ G+ S++ +++ +L KL E
Sbjct: 511 VAIPLLLGLGLDEFSMSATSILPARSQLLKLSKE 544


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS09575  CLENTEROTOXN  32     0.003    Clostridium enterotoxin signature.
		>CLENTEROTOXN#Clostridium enterotoxin signature.

Length = 319

Score = 31.6 bits (71), Expect = 0.003
Identities = 14/64 (21%), Positives = 21/64 (32%), Gaps = 2/64 (3%)

Query: 3 TIRPVFYVSDGTGITAETIGHSLLTQF--SGFNFVTDRMSFIDDAEKARDAAMRVRAAGE 60
+ V+ G T+E I S+ F + T S A +V A
Sbjct: 78 SKEVSINVNFSVGFTSEFIQASVEYGFGITIGEQNTIERSVSTTAGPNEYVYYKVYATYR 137

Query: 61 RYQV 64
+YQ
Sbjct: 138 KYQA 141


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits         Score  E-value  Comments
AXO1947_RS09580  BACTRLTOXIN  28     0.012    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 28.3 bits (63), Expect = 0.012
Identities = 7/30 (23%), Positives = 14/30 (46%)

Query: 73 YDLCDPVTGEPDPSAYVRLYRDARQAETTH 102
YD+ + D S Y+ +Y D + ++
Sbjct: 225 YDMMPAPGDKFDQSKYLMMYNDNKTVDSKS 254


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS09590  DHBDHDRGNASE  105    1e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 105 bits (264), Expect = 1e-29
Identities = 73/255 (28%), Positives = 109/255 (42%), Gaps = 11/255 (4%)

Query: 2 RSILITGAGSGIGAGIATQLATDGHHLIVSDMELPAAERTAHALRQAGGSAEALALDVTD 61
+ ITGA GIG +A LA+ G H+ D E+ +L+ AEA DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 62 ADSIAQALASASRTPQ---VLVNNAGLQHVAALDEFPMRQWALLVDVMLTGAARLSRAVL 118
+ +I + A R +LVN AG+ + +W V TG SR+V
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 119 PGMRAAGYGRIVNIGSIHSLVASPYKSAYVAAKHGLVGLAKVIALETADCDITVNTLCPS 178
M G IV +GS + V +AY ++K V K + LE A+ +I N + P
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 179 YVRTPLVERQIADQARTSGIAEEAVIRDVMLK---PMPKGAFIDYDELAGTVAFLMSHAA 235
T + AD+ E VI+ + +P ++A V FL+S A
Sbjct: 189 STETDMQWSLWADEN-----GAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 236 RNITGQSIAIDGGWT 250
+IT ++ +DGG T
Sbjct: 244 GHITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS09600  RTXTOXIND  38     9e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 37.5 bits (87), Expect = 9e-05
Identities = 36/189 (19%), Positives = 60/189 (31%), Gaps = 19/189 (10%)

Query: 140 QEAAQTLQKWREENA-PWLDMPAFGLNRN----HQSRLQKLARAQ----QDFQAQSEAYG 190
Q Q L + E N P L +P +N RL L + Q Q+ + Q E
Sbjct: 150 QTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNL 209

Query: 191 EQLKAAIEQAFARFASKLSEHESSGSQLTSARALFD------LWIEAAEESYADVALSNQ 244
++ +A AR + S+L +L + E Y +
Sbjct: 210 DKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEA---VN 266

Query: 245 FREVYGGFANAHMRLRAALQEEIEQLSERIGMPTRSEMDAAHRRIAELE-RLVRRMLRTA 303
VY + +EE + +++ ++ I L L + R
Sbjct: 267 ELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQ 326

Query: 304 ASPARKPAA 312
AS R P +
Sbjct: 327 ASVIRAPVS 335


74  AXO1947_RS09625  AXO1947_RS09735  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS09625  0       10       -0.351930  ***4'-phosphopantetheinyl transferase
AXO1947_RS21705  1       14       -0.879294  *UDP-N-acetyl-D-galactosamine dehydrogenase
AXO1947_RS09640  0       19       -0.814565  aminoacetone oxidase family FAD-binding enzyme
AXO1947_RS09650  0       20       -0.670480  peptidylprolyl isomerase
AXO1947_RS09665  0       13       -0.644488  hypothetical protein
AXO1947_RS09680  0       13       -0.245518  sulfurtransferase
AXO1947_RS09685  0       13       0.007317   DNA-binding response regulator
AXO1947_RS09690  0       12       0.497725   ribonuclease E
AXO1947_RS09695  0       12       0.637389   23S rRNA pseudouridylate synthase
AXO1947_RS09700  0       11       1.099691   hypothetical protein
AXO1947_RS09705  -1      16       2.735936   zinc transporter ZupT domain protein
AXO1947_RS09710  -1      16       2.300693   energy transducer protein TonB
AXO1947_RS09715  -2      13       1.790280   4a-hydroxytetrahydrobiopterin dehydratase
AXO1947_RS09725  -1      10       0.442666   Fe-S biogenesis protein NfuA
AXO1947_RS09735  -1      11       -1.550011  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS09865  ENTSNTHTASED  71     7e-17    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 70.8 bits (173), Expect = 7e-17
Identities = 50/147 (34%), Positives = 78/147 (53%), Gaps = 14/147 (9%)

Query: 65 RSVRKRQAEYFFGRLAARHALHQQGLVVHPDTVQIATGNAREPIWPKTAVGSISHTHRLA 124
+ RKR+AE+ GR+AA HAL + G+ P G+ R+P+WP GSISH A
Sbjct: 41 SAGRKRKAEHLAGRIAAVHALREVGVRTVP-----GMGDKRQPLWPDGLFGSISHCATTA 95

Query: 125 MSAVAPADRWRGIGIDLEHLADPDAQAALRATVVNASELALLQTLHDAGDATLDALLTLV 184
++ ++ + IGID+E + L +++++ E +LQ LTL
Sbjct: 96 LAVISR----QRIGIDIEKIMSQHTATELAPSIIDSDERQILQASLL----PFPLALTLA 147

Query: 185 FSAKESLFKASFAAVRRYFDFSAAQVT 211
FSAKES++KA F+ F++A+VT
Sbjct: 148 FSAKESVYKA-FSDRVTLPGFNSAKVT 173


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS09885  INFPOTNTIATR  61     3e-14    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 61.2 bits (148), Expect = 3e-14
Identities = 37/104 (35%), Positives = 50/104 (48%), Gaps = 9/104 (8%)

Query: 38 GTGAEATPGAMVTVHYTGWLYDENAADKHGKKFDSSLDRAEPFQFLLGGHQVIRGWDDGV 97
GTGA+ VTV YTG L D G FDS+ +P F + QVI GW + +
Sbjct: 136 GTGAKPGKSDTVTVEYTGTLID-------GTVFDSTEKAGKPATFQVS--QVIPGWTEAL 186

Query: 98 AGMRVGGKRTLMIPPDYGYGDNGAGGVIPPGASLVFDVELLGVQ 141
M G + +P D YG GG I P +L+F + L+ V+
Sbjct: 187 QLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISVK 230


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS09900  HTHFIS  63     5e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.3 bits (154), Expect = 5e-14
Identities = 29/122 (23%), Positives = 49/122 (40%), Gaps = 3/122 (2%)

Query: 3 IRVFLIDDHALVRTGMKMILSKEVDVDVVGEAESGEAALPQIRQLKPDIVLCDLHLPGVS 62
+ + DD A +RT + LS+ DV + I D+V+ D+ +P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GLEITERIVKGDYGTRVIIVSVLEDGPLPKRLLEAGASGYVGKGGDAHELLRAVREVALG 122
++ RI K V+++S + E GA Y+ K D EL+ + AL
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR-ALA 120

Query: 123 RR 124

Sbjct: 121 EP 122


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits         Score  E-value  Comments
AXO1947_RS09905  IGASERPTASE  41     4e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.8 bits (95), Expect = 4e-05
Identities = 41/278 (14%), Positives = 68/278 (24%), Gaps = 16/278 (5%)

Query: 878 TVRAKPEPTAQAAAKPRPVPKERAEPQVGNDTSSTTAPVSNTVPASQPPVATPVAK---- 933
TV T P E ++ + P + P+ +K
Sbjct: 991 TVDTTNITTPNNIQADVPSVPSNNE-EIARVDEAPVPPPAPATPSETTETVAENSKQESK 1049

Query: 934 -ADTDHERPAAPTV-------AADAPAKPAPAATASTAVNADVAATQAVEQRPAPVAQHA 985
+ + + T A + K ++ TQ E + +
Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKE 1109

Query: 986 PAAPAPVAAPAPVASTPVPAVAVVAPVAETASTAPVAAPSAPAPTVASPVSNVAATSTQQ 1045
A V A A + P + P S T+ +
Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTE 1169

Query: 1046 QPLGSVSARAAASDTDKAAAPNADRQQRAAAADVATPATTQAAPVQTTPVKQTADLSAPV 1105
QP S+ T+ + + TPATTQ + K V
Sbjct: 1170 QPAKETSSNVEQPVTESTTVNTGNSVVE--NPENTTPATTQPTVNSESSNKPKNRHRRSV 1227

Query: 1106 AAMPAAQAEVVTATSPHAEPPAAQAPSSEVAVTATSTL 1143
++P E T +S A +S S
Sbjct: 1228 RSVP-HNVEPATTSSNDRSTVALCDLTSTNTNAVLSDA 1264



Score = 39.7 bits (92), Expect = 8e-05
Identities = 43/277 (15%), Positives = 78/277 (28%), Gaps = 20/277 (7%)

Query: 519 NIPAPPAVTSIKPSQPAPVREETPAPMAPVAAPAPVVTVPIPAPVTGVVGWL-------- 570
NI P + + PS P+ E APV PAP P+ T V
Sbjct: 996 NITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPAT----PSETTETVAENSKQESKTV 1051

Query: 571 -KRIFGGVEPLAPAPESIPRPRQNDAGRNHRNERGERGGQRRDGRDARNGGHSGNQQRGN 629
K E A E + N NE + G + ++ + + ++
Sbjct: 1052 EKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEK 1111

Query: 630 GNGANKERRDERRQPASGQGAQNGAQAQQQAQTPKPPRNEAQAPKQ-QQPQQAQQQKPKP 688
++ ++ + + Q ++ Q P + K+ Q +P
Sbjct: 1112 AKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQP 1171

Query: 689 QNQTPRPPRTP--AQQDGAQAERQPRPARQDEGMAAAQTVTSTAAMATTS----SVVAAI 742
+T P TV S ++ + SV +
Sbjct: 1172 AKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVP 1231

Query: 743 TDAAAPATAQTNANEAAQAHAVDVTVSASTADPGADA 779
+ T+ + + A +A +D A A
Sbjct: 1232 HNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKA 1268



Score = 33.5 bits (76), Expect = 0.007
Identities = 37/280 (13%), Positives = 68/280 (24%), Gaps = 32/280 (11%)

Query: 840 NDLDSDSEGDDAAAQAHASAAPRAGQPEFDFDDDAPAPTVRAKPEPTAQAAAKPRPVPKE 899
+S E + A E + + E + E
Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099

Query: 900 RAEPQVGNDTSSTTAPVSNTVPASQPPVATPVAKADTDHERPAAPTVAADAPAKPAPAAT 959
E T + + ++ +P A + P
Sbjct: 1100 TKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQ 1159

Query: 960 ASTAVNADVAATQAVEQRPAPVAQHAPAAPAPVAAPAPVASTPVPAVAVVAPVAETASTA 1019
+ T AD P + + PV V + V P T +T
Sbjct: 1160 SQTNTTADTE---------QPAKETSSNVEQPVTESTTVNTGNSV---VENPENTTPATT 1207

Query: 1020 PVAAPSAPAPTVASPVSNVAATSTQQQPLGSVSARAAASDTDKAAAPNADRQQRAAAADV 1079
PTV S SN ++ SV + + A +++ + A D+
Sbjct: 1208 --------QPTVNSESSNKPKNRHRR----SVRSVPH---NVEPATTSSNDRSTVALCDL 1252

Query: 1080 ATPATTQA-----APVQTTPVKQTADLSAPVAAMPAAQAE 1114
+ T A Q + +S ++ +
Sbjct: 1253 TSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEG 1292


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits         Score  E-value  Comments
AXO1947_RS09915  IGASERPTASE  38     2e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 37.7 bits (87), Expect = 2e-04
Identities = 25/151 (16%), Positives = 40/151 (26%), Gaps = 11/151 (7%)

Query: 444 ASVTPLERVKDMQAAQQRMAQLHADSRAAQEKAAAANRAHAAMPPRDAFASRDARDSPFR 503
A+V E+ K Q + ++ + QE++ D + S
Sbjct: 1104 ATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTN 1163

Query: 504 SQA---QPARWKPPVQEQ---RQLPPRQQFAFAASPRGEHAQPSQPRYEPRPVMKPEPQQ 557
+ A QPA+ EQ + +P +QP KP+
Sbjct: 1164 TTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPK--- 1220

Query: 558 HMQRMASFTPPRAAPARPADTHQQHPHPAAQ 588
R PA T A
Sbjct: 1221 --NRHRRSVRSVPHNVEPATTSSNDRSTVAL 1249



Score = 31.6 bits (71), Expect = 0.013
Identities = 20/142 (14%), Positives = 37/142 (26%), Gaps = 16/142 (11%)

Query: 451 RVKDMQAAQQRMAQLHADSRAAQEKAAAANRAHAAMPPRDAFAS-RDARDSPFRSQAQPA 509
+ Q + +EKA +P + S + + + QA+PA
Sbjct: 1087 QSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA 1146

Query: 510 RWKPPVQEQRQLPPRQQFAFAASPRGEHAQPSQPRYEPRPVMKPEPQQHMQRMASFTPPR 569
R P ++ S A QP E ++ + +
Sbjct: 1147 RENDPTVNIKE---------PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVE 1197

Query: 570 ------AAPARPADTHQQHPHP 585
A +P + P
Sbjct: 1198 NPENTTPATTQPTVNSESSNKP 1219



Score = 29.6 bits (66), Expect = 0.041
Identities = 22/177 (12%), Positives = 51/177 (28%), Gaps = 27/177 (15%)

Query: 417 AVPLTTVAHAMQAPPAIHSPHVQAPAFASVTPLERVKDMQAAQQRMAQLHADSRAAQEKA 476
+TT + P++ S + + A P+ ++ + A++ + K
Sbjct: 994 TTNITTPNNIQADVPSVPSNN-EEIARVDEAPVPPPAPATPSET--TETVAENSKQESKT 1050

Query: 477 AAANRAHAAMPPRDA--FASRDARDSPF-RSQAQPARWKPPVQEQRQLPPRQQFAFAASP 533
N A A + + A+ +E +
Sbjct: 1051 VEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQ-------------- 1096

Query: 534 RGEHAQPSQPRYEPRPVMKPEPQQHMQRMASFTPPRAAPARPADTHQQHPHPAAQPQ 590
E + + E + ++ E Q + ++ S P+ + P A+P
Sbjct: 1097 TTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETV-------QPQAEPA 1146


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS09925  PF03544  54     1e-11    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 53.8 bits (129), Expect = 1e-11
Identities = 15/96 (15%), Positives = 33/96 (34%), Gaps = 5/96 (5%)

Query: 25 QQAAAPTVAPTELAAVKTPPPEYAPQLACAGIGGTTVLRVVVGTQGTPTDVLVAQSSGQP 84
+ T + A+ P+Y + I G ++ V G +V + +
Sbjct: 145 ATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPAN 204

Query: 85 VLDEAARTRVREWQFKAATRNGQAVPQTIQVPVSFK 120
+ + + +R W+++ V V + FK
Sbjct: 205 MFEREVKNAMRRWRYEPGKPGSGIV-----VNILFK 235


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits         Score  E-value  Comments
AXO1947_RS09940  IGASERPTASE  44     2e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 43.9 bits (103), Expect = 2e-06
Identities = 45/282 (15%), Positives = 78/282 (27%), Gaps = 37/282 (13%)

Query: 153 QPATEASTQAASAAVSSPAQAGASAAKSEPAPSPTPTPTPARPARPAADLVERPDTDQAP 212
T + QA +V S + + +P P P PA P+ + E +
Sbjct: 996 NITTPNNIQADVPSVPS-----NNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKT 1050

Query: 213 ENAPEPVQAASEPVTADVPQVTVQVPPVTIESPLQVTETPVATNDFVVPPPPTITLTPPA 272
E Q A+E TA +V + + +
Sbjct: 1051 VEKNE--QDATET-TAQNREVAKEAKSNVKANTQTNEVAQSGSE---------------- 1091

Query: 273 IERAAPQVQVRQRDIQTVTERPQVRQLQRPATEVAVRSAAAPAVRERDIVIPERPQRTAL 332
+ ++ TV + + + EV ++ +E+ E Q A
Sbjct: 1092 TKETQTTET---KETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQ----SETVQPQAE 1144

Query: 333 AARPREISPKVRMPDVAVRTAALPSVPDPAPAPVAVPPAVPAASATANPTPTAAAAQPAQ 392
AR + P V + + +T PA + S T N + P
Sbjct: 1145 PAREND--PTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGN-SVVENPEN 1201

Query: 393 PAPQPAQSQANPAPPERSSNASAAASSAAKPAASGPKPADRS 434
P Q N + N + + +PA S
Sbjct: 1202 TTPATTQPTVNSESSNKPKNRHR---RSVRSVPHNVEPATTS 1240



Score = 38.9 bits (90), Expect = 7e-05
Identities = 29/201 (14%), Positives = 55/201 (27%), Gaps = 16/201 (7%)

Query: 234 TVQVPPVTIESPLQVTETPVATNDFVVPPPPTITLTPPAIERAAPQVQVRQRDIQTVTER 293
TV +T + +Q V +N+ + + PPA + + + + ++
Sbjct: 991 TVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKT 1050

Query: 294 PQVRQLQRPATEVAVRSAAAPAVRERDIVIPERPQRTALAARPREISPKVRMPDVAVRTA 353
+ + Q A A + + + + +E T
Sbjct: 1051 VEKNE-QDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE--------TK 1101

Query: 354 ALPSVPDPAPAPVAVPPAVPAASATANPTPTAAAAQPAQPAPQPAQSQANPAPPERSSNA 413
+V A V T+ P Q + Q QA PA +
Sbjct: 1102 ETATVEKEEKAKVETEKTQEVPKVTSQ-------VSPKQEQSETVQPQAEPARENDPTVN 1154

Query: 414 SAAASSAAKPAASGPKPADRS 434
S A +PA +
Sbjct: 1155 IKEPQSQTNTTADTEQPAKET 1175



Score = 35.4 bits (81), Expect = 8e-04
Identities = 22/164 (13%), Positives = 45/164 (27%), Gaps = 12/164 (7%)

Query: 153 QPATEASTQAASAAVSSPAQAGASAAKSEPAPSPTPTPTPARPARPAADLVERPDTDQAP 212
+ AT + A + ++ P + T P D Q+
Sbjct: 1102 ETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQ 1161

Query: 213 ENAPEPVQAASEPVTADVPQVTVQVPPVTIESPLQVTETPVATNDFVVPPPPTITLTPPA 272
N + ++ +++V PVT + + N V P T T
Sbjct: 1162 TNTTADTEQPAKETSSNVE------QPVTESTTVN------TGNSVVENPENTTPATTQP 1209

Query: 273 IERAAPQVQVRQRDIQTVTERPQVRQLQRPATEVAVRSAAAPAV 316
+ + + R ++V P + ++ A
Sbjct: 1210 TVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLT 1253


75  AXO1947_RS09930  AXO1947_RS09975  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS09930  -2      12       -0.537371  two-component sensor histidine kinase
AXO1947_RS09935  -2      13       -0.721435  cytochrome-c peroxidase
AXO1947_RS09945  -1      21       -3.231858  DNA-binding response regulator
AXO1947_RS09950  -1      24       -3.212919  hypothetical protein
AXO1947_RS09955  -1      21       -2.890834  hypothetical protein
AXO1947_RS09960  0       14       -2.810885  hypothetical protein
AXO1947_RS21735  0       12       -3.186760  TetR family transcriptional regulator
AXO1947_RS21745  -1      12       -1.903771  RND transporter
AXO1947_RS09975  -2      10       -0.662731  multidrug efflux RND transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS10115  HTHFIS  32     0.013    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.7 bits (72), Expect = 0.013
Identities = 26/121 (21%), Positives = 48/121 (39%), Gaps = 4/121 (3%)

Query: 688 ASLLLLCDDAAELDRLEEMLAALGHEPVALLELPAAVAMATSDPMRFDGVLLK-RDRAGD 746
A++L+ DDAA L + L+ G++ + D V+ +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD--GDLVVTDVVMPDEN 61

Query: 747 AEHAIDALHAAAPKLPLILATRAMSLATR-KGLGGAITEIIAQPFDLSALALALERALGR 805
A + + A P LP+++ + + T K + + +PFDL+ L + RAL
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 806 T 806

Sbjct: 122 P 122


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS10125  HTHFIS  76     6e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 6e-18
Identities = 36/117 (30%), Positives = 61/117 (52%), Gaps = 4/117 (3%)

Query: 2 LVVDDDQAMAQVVMGHIRSHGMEAFVATNSSELAEALRRREPDILLLDLMLKHEDGLDLL 61
LV DDD A+ V+ + G + + +N++ L + + D+++ D+++ E+ DLL
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLL 66

Query: 62 RALRKE-SDIPVIIMTGHRRDEIDRVV-GLELGADDYLPKPFGLHELTARIRAVLRR 116
++K D+PV++M+ + + E GA DYLPKPF L EL I L
Sbjct: 67 PRIKKARPDLPVLVMSAQ--NTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS10170  HTHTETR  35     7e-05    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 35.4 bits (81), Expect = 7e-05
Identities = 13/91 (14%), Positives = 39/91 (42%), Gaps = 4/91 (4%)

Query: 2 RQDDQRLIRLLAATLTRRPRSNLT--ELAAGAGISRATLYRFAPTRAAIVEKVTAEAWVR 59
++ Q ++ + +++ S+ + E+A AG++R +Y ++ + ++ +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 60 LQAALPG--GDASPDPMARLRRMTHALVEDL 88
+ DP++ LR + ++E
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLEST 100


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS10175  RTXTOXIND  46     1e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 46.4 bits (110), Expect = 1e-07
Identities = 40/213 (18%), Positives = 70/213 (32%), Gaps = 28/213 (13%)

Query: 100 EAALARARGELTRSEAELENATAQFERSQQLVQRQVISRQDFDT-ARSNFKSTQAAVASA 158
E A EL +++LE ++ +++ + Q F + T +
Sbjct: 258 ENKYVEAVNELRVYKSQLEQIESEILSAKE---EYQLVTQLFKNEILDKLRQTTDNIGLL 314

Query: 159 RAALKTAQLDLGFATVRAPIDGRIGRALV-TEGALVGQGGDATEMALVQQLDPIFADFNR 217
L + + +RAP+ ++ + V TEG +V T M +V + D +
Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA--ETLMVIVPEDDTLEVTALV 372

Query: 218 PVAEALKLRGRARKGDAPLKVVIDIPELGETREGDL------LFADMRVDETTDTV--SL 269
K G G +I + TR G L + D D+ V +
Sbjct: 373 QN----KDIGFINVG---QNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVI 425

Query: 270 RAQ------FDNRDNLLLPGMFVRVRTPNGTAS 296
+ N++ L GM V G S
Sbjct: 426 ISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRS 458



Score = 40.6 bits (95), Expect = 8e-06
Identities = 22/133 (16%), Positives = 46/133 (34%), Gaps = 8/133 (6%)

Query: 55 PGRVSPM-RVAQVRARVAGIVLARRFEEGSDVKAGQVLFQIDPAPFEAALARARGELTRS 113
G+++ R +++ IV +EG V+ G VL ++ EA + + L ++
Sbjct: 87 NGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQA 146

Query: 114 EAELENATAQFERSQQLVQRQVISRQDFDTARS----NFKSTQAAVASARAALK--TAQL 167
E RS +L + + D ++ + + + + Q
Sbjct: 147 RLEQTRYQIL-SRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK 205

Query: 168 DLGFATVRAPIDG 180
+L RA
Sbjct: 206 ELNLDKKRAERLT 218


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10180  ACRIFLAVINRP  1072   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1072 bits (2774), Expect = 0.0
Identities = 503/1030 (48%), Positives = 696/1030 (67%), Gaps = 10/1030 (0%)

Query: 1 MSRFFIDRPNFAWVVAIFISLAGVLALRTLPVEKYPEVAPPQISIMATYPGASAQVVNDA 60
M+ FFI RP FAWV+AI + +AG LA+ LPV +YP +APP +S+ A YPGA AQ V D
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTSVIEQELNGVRDMLYYDSSS-SNGSAQITIMFQPGTDPNIAQVDVQNRIRQSESRLPA 119
VT VIEQ +NG+ +++Y S+S S GS IT+ FQ GTDP+IAQV VQN+++ + LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 120 AVTQLGLQVEQTTAGFLMLYSLVYKDATAAQDVVRLNDYAARVVNDEIRRVPGVGRVQFF 179
V Q G+ VE++++ +LM+ V + QD ++DY A V D + R+ GVG VQ F
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQD--DISDYVASNVKDTLSRLNGVGDVQLF 178

Query: 180 GAEAAMRVWVDTQALRGYGLSIVDVNNAIRAQNLQVAAGSLGERPGAQDQELTTTLVVRG 239
GA+ AMR+W+D L Y L+ VDV N ++ QN Q+AAG LG P Q+L +++ +
Sbjct: 179 GAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238

Query: 240 QMESPQEFGQIVLRAQANGAVVHLSDVAKLELGLENYQFDVQENGGPAAGAAVQLAPGGN 299
+ ++P+EFG++ LR ++G+VV L DVA++ELG ENY + NG PAAG ++LA G N
Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 300 AVATVAAVRKRLQELSQSFPADIAYSVPFDSSTFVNVAIKKVLHTLLEAMALVFLVMFVF 359
A+ T A++ +L EL FP + P+D++ FV ++I +V+ TL EA+ LVFLVM++F
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 360 LQNIRYTLIPAIVVPVCLLGTFAVMKLLGFSVNMMSMFAMVLAIGILVDDAIVVVENVER 419
LQN+R TLIP I VPV LLGTFA++ G+S+N ++MF MVLAIG+LVDDAIVVVENVER
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 420 LMADEGLLPRDASMKAMTQVGGAIVGITLVLTAVFLPLAFMSGSVGVIYRQFSAVLAVSI 479
+M ++ L P++A+ K+M+Q+ GA+VGI +VL+AVF+P+AF GS G IYRQFS + ++
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 480 LFSGFLALTMTPALCATCLAPI--DGHQEKKGFFGWFDRNFNALTSRFDRLNHRLVHRAG 537
S +AL +TPALCAT L P+ + H+ K GFFGWF+ F+ + + +++ G
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 538 RCMLVYAVLLGVLGLAYVRLPEAFVPQEDEGYMIVDMQLPPGASYSRTRAVGQQVNDYL- 596
R +L+YA+++ + + ++RLP +F+P+ED+G + +QLP GA+ RT+ V QV DY
Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598

Query: 597 -AARPSMQDVTLVYGFSFSGSGANAAMAFPSLKDWSER-GDSESVANEVAAANVALGRIS 654
+ +++ V V GFSFSG NA MAF SLK W ER GD S + A + LG+I
Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 655 DVTIMAVMPPPIEGLGNSGGFSLRVQDRGNLGRDALMQAVNQLLRAANQSP-KLAYAMVE 713
D ++ P I LG + GF + D+ LG DAL QA NQLL A Q P L
Sbjct: 659 DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 714 GLADAPQLRLEVDRGKAEALGVSFQSAMDVLSSAFGSTIVNDFVNRGRLQRVVVQGAAGD 773
GL D Q +LEVD+ KA+ALGVS +S+A G T VNDF++RGR++++ VQ A
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 774 RATPQSLDTLHVTSSTGRQVPLTAFTTQRWEQGPVQIARYNGYASVNLTGEAAPGISSGD 833
R P+ +D L+V S+ G VP +AFTT W G ++ RYNG S+ + GEAAPG SSGD
Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838

Query: 834 ALAEMERLAAALPQGIGYAWSALSYQEKAAGTQAPMLLGLALLVVFLLLVALYESWAIPF 893
A+A ME LA+ LP GIGY W+ +SYQE+ +G QAP L+ ++ +VVFL L ALYESW+IP
Sbjct: 839 AMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPV 898

Query: 894 SVMLIVPIGAVGATAAVWVAGLSNDVYFKVGLITIIGLAAKNAILIVEFAKELHAR-GAR 952
SVML+VP+G VG A + NDVYF VGL+T IGL+AKNAILIVEFAK+L + G
Sbjct: 899 SVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKG 958

Query: 953 VPEAAMQAARLRFRPIVMTSLAFILGVIPLVIARGAGAASQNALGTGVIGGMLAASTLGV 1012
V EA + A R+R RPI+MTSLAFILGV+PL I+ GAG+ +QNA+G GV+GGM++A+ L +
Sbjct: 959 VVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAI 1018

Query: 1013 VFTPIFFTWV 1022
F P+FF +
Sbjct: 1019 FFVPVFFVVI 1028



Score = 61.4 bits (149), Expect = 1e-11
Identities = 85/509 (16%), Positives = 173/509 (33%), Gaps = 48/509 (9%)

Query: 540 MLVYAVLLGVLGL-AYVRLPEAFVPQEDEGYMIVDMQLPPGASYSRTRAVGQQVNDYL-A 597
V A++L + G A ++LP A P + V P GA + V V +
Sbjct: 12 AWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYP-GAD---AQTVQDTVTQVIEQ 67

Query: 598 ARPSMQDVTLVYGFSFSGSGANAAMAFPSLKDWSERGDSESVANEVAAANVALGRISDVT 657
+ ++ + S S + F D + +V L + +
Sbjct: 68 NMNGIDNLMYMSSTSDSAGSVTITLTFQ------SGTDPDIAQVQVQNK---LQLATPLL 118

Query: 658 IMAVMPPPIEGLGNSGGF---SLRVQDRGNLGRDALMQAVNQLLRAANQSPKLAYAMVEG 714
V I +S + + V D +D + V ++ L+ + G
Sbjct: 119 PQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVK-----DTLS--RLNG 171

Query: 715 LADAP------QLRLEVDRGKAEALGVSFQSAMDVLSS-----AFGSTIVNDFVNRGRLQ 763
+ D +R+ +D ++ ++ L A G + +L
Sbjct: 172 VGDVQLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLN 231

Query: 764 RVVVQGAAGDRATPQSLDTLHVTSST-GRQVPLTAFTTQRW-EQGPVQIARYNGYASVNL 821
++ A P+ + + ++ G V L + IAR NG + L
Sbjct: 232 ASII--AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGL 289

Query: 822 TGEAAPGISSGDAL----AEMERLAAALPQG--IGYAWSALSYQEKAAGTQAPMLLGLAL 875
+ A G ++ D A++ L PQG + Y + + + + L +
Sbjct: 290 GIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIM 349

Query: 876 LVVFLLLVALYESWAIPFSVMLIVPIGAVGATAAVWVAGLSNDVYFKVGLITIIGLAAKN 935
LV ++ + L ++ + VP+ +G A + G S + G++ IGL +
Sbjct: 350 LVFLVMYLFL-QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDD 408

Query: 936 AILIVE-FAKELHARGARVPEAAMQAARLRFRPIVMTSLAFILGVIPLVIARGAGAASQN 994
AI++VE + + EA ++ +V ++ IP+ G+ A
Sbjct: 409 AIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYR 468

Query: 995 ALGTGVIGGMLAASTLGVVFTPIFFTWVM 1023
++ M + + ++ TP ++
Sbjct: 469 QFSITIVSAMALSVLVALILTPALCATLL 497


76  AXO1947_RS10120  AXO1947_RS10155  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS10120   2      23        0.524251  serine protease
AXO1947_RS10130   4      23        0.315576  cation transporter
AXO1947_RS10135   2      23        0.407263  CusA/CzcA family heavy metal efflux RND
AXO1947_RS10140   2      22        0.926595  histidine kinase
AXO1947_RS10145   2      19        1.091577  ATPase
AXO1947_RS10150   0      16        0.748396  DNA-binding response regulator
AXO1947_RS10155  -1      13        0.420054  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits        Score  E-value  Comments
AXO1947_RS10325  SUBTILISIN  121    4e-32    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 121 bits (305), Expect = 4e-32
Identities = 72/325 (22%), Positives = 120/325 (36%), Gaps = 43/325 (13%)

Query: 78 NADLAQQAGAKGKGVKLAVLDDNLVQSYTPISGKVDSFNDYTASPGTPESSANALRGHGT 137
A +G+GVK+AVLD + + ++ ++T GHGT
Sbjct: 30 QAPAVWNQT-RGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGHGT 88

Query: 138 IVSALVLGSAQDGFAGGVAPDADLFYARICAENSCGTQATRRAAVDLAAA-GVRIANLSI 196
V+ + + + GVAP+ADL ++ + G + A V I ++S+
Sbjct: 89 HVAGTIAATENENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSL 148

Query: 197 GASYPDAAASANAALAWKYALTPLVQADALIVASTGNEGAAEAS-----YPAATPVQEAS 251
G A K A V + L++ + GNEG + YP
Sbjct: 149 GGPEDVPELHE----AVKKA----VASQILVMCAAGNEGDGDDRTDELGYPGCYN----- 195

Query: 252 VRNNWLAVGAINIDSAGNAAGLTSYSNHCGAAAQWCLVAPGSYTVPALAGSELGGQIAGT 311
++VGAIN D + +SN + LVAPG + + G + +GT
Sbjct: 196 ---EVISVGAINFDR-----HASEFSNSN---NEVDLVAPGEDILSTVPGGKY-ATFSGT 243

Query: 312 SFSTAAVSGVAAQVLGVYPW-----MTASQLQQTLLTTATDLGDPGVDALYGWGLVNAAK 366
S +T V+G A + + +T +L L+ LG + G GL+
Sbjct: 244 SMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLG--NSPKMEGNGLLYLTA 301

Query: 367 AIKGPGQFASNWAVNVTSGYDSTFS 391
+ + + +G ST S
Sbjct: 302 V----EELSRIFDTQRVAGILSTAS 322


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS10330  RTXTOXIND  56     9e-11    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 56.0 bits (135), Expect = 9e-11
Identities = 38/205 (18%), Positives = 72/205 (35%), Gaps = 28/205 (13%)

Query: 114 SAELATAYSDAGKARAMLQQARLELARQKVLAADSIAAARDLQAAQQAFDSAQNDARAAS 173
EL S + + + A+ E L + I L+ +
Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILD--KLRQTTDNIGLLTLELAKNE 322

Query: 174 DRLAQLGVAAQATSHRRYVLRAPIAGRVVDLSAALGGFWNDTSAPLMTVADISQV-WLTA 232
+R + V+RAP++ +V L G T+ LM + +TA
Sbjct: 323 ERQ------------QASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTA 370

Query: 233 SVPEREIGQVFEGQQVTASLDAYPGQ---HFTGLVQHL--DDLLDPTTRTL-KVRVALNN 286
V ++IG + GQ ++A+P + G V+++ D + D + V +++
Sbjct: 371 LVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEE 430

Query: 287 HDGL-------LKPGMFARAQFQTR 304
+ L GM A+ +T
Sbjct: 431 NCLSTGNKNIPLSSGMAVTAEIKTG 455



Score = 36.0 bits (83), Expect = 2e-04
Identities = 27/136 (19%), Positives = 46/136 (33%), Gaps = 9/136 (6%)

Query: 76 VLPERLVRVVPPLAGRVVALPKTLGDTVRAGDVLCVLDSAELATAYSDAGKARAMLQQAR 135
R + P V + G++VR GDVL L + A +D K ++ L QAR
Sbjct: 91 THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTA---LGAEADTLKTQSSLLQAR 147

Query: 136 LELARQKVLAADSIAAARDLQAAQQAFDSAQNDARAASDRLAQLGVAAQATSHRR---YV 192
LE R S + + + D + + L + + S + Y
Sbjct: 148 LEQTR---YQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQ 204

Query: 193 LRAPIAGRVVDLSAAL 208
+ + + L
Sbjct: 205 KELNLDKKRAERLTVL 220


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10335  ACRIFLAVINRP  639    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 639 bits (1651), Expect = 0.0
Identities = 233/1034 (22%), Positives = 426/1034 (41%), Gaps = 43/1034 (4%)

Query: 11 QRRGIVWLVFVLIALYGTWSWTQLPVEAYPDIADVTSQVVTQVPGLGAEEVEQQITVPLE 70
+R W++ +++ + G + QLPV YP IA V PG A+ V+ +T +E
Sbjct: 7 RRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVIE 66

Query: 71 RALMGTPGLHVLRSRSLFA-LSLITLVFDDGTEGYFARQRVLERIQAVT--LPYGA-IPG 126
+ + G L + S S A ITL F GT+ A+ +V ++Q T LP G
Sbjct: 67 QNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQQG 126

Query: 127 LDPYTSPTGEIYRYTLES--KTRSLRELSDLQFWTVIPRLQKVQGVADVTNFGGLTTQFS 184
+ S + + S + ++SD V L ++ GV DV FG
Sbjct: 127 ISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA-QYAMR 185

Query: 185 LALEPDRLTRYGVSLQQVKSAITSNNAD------GGGSVMDRGEQSYVIRGIGLLGSLQD 238
+ L+ D L +Y ++ V + + N GG + + + I + ++
Sbjct: 186 IWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEE 245

Query: 239 IGDVVV-SNSNGVPVLVKDLGEVRYDNVERRGILGKDKNPDTIEGIALLLKDSNPSVALQ 297
G V + NS+G V +KD+ V I + P L +N +
Sbjct: 246 FGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKP-AAGLGIKLATGANALDTAK 304

Query: 298 GIHSAVEELNNSLLPKDVKVVPYLDRTALIDATLHTVSATLTEGMLLVCVVLLIFLGSPR 357
I + + EL P+ +KV+ D T + ++H V TL E ++LV +V+ +FL + R
Sbjct: 305 AIKAKLAELQPFF-PQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMR 363

Query: 358 AAAIVSLTIPLSLLIAFIFMHHLKIPANLLSLG--AIDFGILVDGAVVLVENVLRLREEN 415
A I ++ +P+ LL F + N L++ + G+LVD A+V+VENV R+ E+
Sbjct: 364 ATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMED 423

Query: 416 SERALTAGDAIDATLHVARPIFFGMAVIGCAYLPLLAFERIEYKLFSPMAYAVGAALIGA 475
A + + + V+ ++P+ F ++ + + +A+ +
Sbjct: 424 KLPPKEA--TEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 476 LLVALMLIPALAWLAFRKPRKMMH-----------NRVLEALGQRYRALLERSVGRRGWL 524
+LVAL+L PAL KP H N + Y + + +G G
Sbjct: 482 VLVALILTPALCAT-LLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 525 LACAALALCVLAVLGGSIGRDFLPYIDEGSLWLQVQMPPGITLDKAARMANALRTATL-- 582
L AL + + VL + FLP D+G +Q+P G T ++ ++ + + L
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 583 EFPEVSYVVTQTGRNDDGTDYWTPSHIEASVGLRPYKDWPS-GMDKQGLIAALGARYAQM 641
E V V T G + G + A V L+P+++ + +I ++
Sbjct: 601 EKANVESVFTVNGFSFSGQ---AQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKI 657

Query: 642 PGYTVSMMQPMIDGVQDKLSGAHSDLTIKVFGDDLQQVRGVAEQVATALHAVPGA-ADIA 700
V +G +L I G + Q+ P + +
Sbjct: 658 RDGFVIPFNMPAIVELGTATGFDFEL-IDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 701 VDVEPPLPNLQVRFDREAAARYGINAADVSDLIATGIGGSPIGQMYIGEKSYDLTVRFPQ 760
+ ++ D+E A G++ +D++ I+T +GG+ + + L V+
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADA 776

Query: 761 RYRNDPQAIGALRLRTAAGAEIPLSAVASITTTSGQSVIVREMGRRNIIVRLNVRGRDLS 820
++R P+ + L +R+A G +P SA + G + R G ++ ++
Sbjct: 777 KFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAP---G 833

Query: 821 SFLSDAQATLVRHVRIDPQHMQLVWGGQFENLQRAQARLLVVLPTTLCIMFVLLFGAFGN 880
+ DA A + P + W G + + + ++ + ++F+ L + +
Sbjct: 834 TSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYES 893

Query: 881 LRQPTLVLAAVPLAMIGGLAALHLRGMTLNVSSAVGFIALFGVAVLNAVLMLAQINRLRQ 940
P V+ VPL ++G L A L +V VG + G++ NA+L++ L +
Sbjct: 894 WSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLME 953

Query: 941 DPGMSLREAVVAGAVSRMRPVLMTATVAALGLTPAMLAAGLGSDVQRPLATVVVGGLITA 1000
G + EA + R+RP+LMT+ LG+ P ++ G GS Q + V+GG+++A
Sbjct: 954 KEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSA 1013

Query: 1001 TALTLLLLPSLYYL 1014
T L + +P + +
Sbjct: 1014 TLLAIFFVPVFFVV 1027



Score = 82.6 bits (204), Expect = 5e-18
Identities = 62/351 (17%), Positives = 139/351 (39%), Gaps = 29/351 (8%)

Query: 682 VAEQVATALHAVPGAADIAVDVEPPLPNLQVRFDREAAARYGINAADVSDLIATG----I 737
VA V L + G D+ + +++ D + +Y + DV + +
Sbjct: 158 VASNVKDTLSRLNGVGDVQLFGAQYA--MRIWLDADLLNKYKLTPVDVINQLKVQNDQIA 215

Query: 738 GGSPIGQMYIGEKSYDLTVRFPQRYRNDPQAIGALRLRTAA-GAEIPLSAVASITTTS-G 795
G G + + + ++ R++N P+ G + LR + G+ + L VA +
Sbjct: 216 AGQLGGTPALPGQQLNASIIAQTRFKN-PEEFGKVTLRVNSDGSVVRLKDVARVELGGEN 274

Query: 796 QSVIVREMGRRNIIVRLNVRG--------RDLSSFLSDAQATLVRHVRID-PQHMQLVWG 846
+VI R G+ + + + + + + L++ Q + +++ P
Sbjct: 275 YNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQ 334

Query: 847 GQFENLQRA--QARLLVVLPTTLCIMFVLLFGAFGNLRQPTLVLAAVPLAMIGGLAALHL 904
+ + +A +LV L +M++ L N+R + AVP+ ++G A L
Sbjct: 335 LSIHEVVKTLFEAIMLVFL-----VMYLFL----QNMRATLIPTIAVPVVLLGTFAILAA 385

Query: 905 RGMTLNVSSAVGFIALFGVAVLNAVLMLAQINRLRQDPGMSLREAVVAGAVSRMRPVLMT 964
G ++N + G + G+ V +A++++ + R+ + + +EA ++
Sbjct: 386 FGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGI 445

Query: 965 ATVAALGLTPAMLAAGLGSDVQRPLATVVVGGLITATALTLLLLPSLYYLM 1015
A V + P G + R + +V + + + L+L P+L +
Sbjct: 446 AMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATL 496


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS10350  HTHFIS  96     2e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 96.1 bits (239), Expect = 2e-25
Identities = 28/153 (18%), Positives = 61/153 (39%)

Query: 5 APVVYLIDDDASMRAALEDLFASVGLQVCAFGSTDQFLAHRLQDAPACLVLDIRMPGQSG 64
+ + DDDA++R L + G V + +V D+ MP ++
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 65 MEFHRRMVESGFALPTIFITGHGDIAMGVEAMKNGAIEFLTKPFRDQALLDAIQDGIRRD 124
+ R+ ++ LP + ++ ++A + GA ++L KPF L+ I +
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 125 RTRRQSDAVAAELRARWESLSSGEQDVTRLVVQ 157
+ R ++ S+ Q++ R++ +
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS10355  FLAGELLIN  33     0.027    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 32.7 bits (74), Expect = 0.027
Identities = 48/325 (14%), Positives = 84/325 (25%), Gaps = 12/325 (3%)

Query: 518 GNNTYSGGTTLGAGSVLLETSGALGTGTVTAAGGSLDTTAPLSLTNNFALTNTLGLGPSG 577
++G L + + GA T+T +D + N +G
Sbjct: 127 NQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLK 186

Query: 578 NALTLSGTLAGVGGVNKTGAGTLTLGGLNTYSGGTNLASGTLQLGTASALGTGALNVTGA 637
++ + G + T + + L T A
Sbjct: 187 SSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTA 246

Query: 638 SNLSTTAPLTVANAISLAAALNLPSTQALTLTGAISGAGSLIKSGAGDLTLTNANAYTGG 697
+L T T A + A A + G G K + N G
Sbjct: 247 VDLFKTTKSTAGTAEAKAI--------AGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGK 298

Query: 698 TTLSAGRLVVGSNAALGTGTLTASGGELDATTATTLGNAMALTGTMGVGSSGNALNLTGT 757
+ + G L +TA +DA T + N N +
Sbjct: 299 VSTTIN----GEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAK 354

Query: 758 ISGAGALNKLGTGTLTLGGLNTYSGGTSLNAGTLQVASGTALGTGALDVTGAATLQNTAA 817
+S A N + + Y+ + + TL + T + T A
Sbjct: 355 LSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAK 414

Query: 818 ATLNNAVTLSTGTLTLDGAQALTLG 842
+ N + L+ A +LG
Sbjct: 415 KSTANPLASIDSALSKVDAVRSSLG 439


77  AXO1947_RS10445  AXO1947_RS10610  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS10445   0      24       -2.431141  DNA-binding protein
AXO1947_RS10450   1      22       -3.386317  c-di-GMP phosphodiesterase A
AXO1947_RS21805   1      18       -2.370055  sensor histidine kinase
AXO1947_RS10460  -1      16       -1.187951  flagella protein
AXO1947_RS10465  -2      14       -1.023747  flagellar biosynthesis anti-sigma factor FlgM
AXO1947_RS10470  -2      12       -0.298558  flagella basal body P-ring formation protein
AXO1947_RS10480  -2      10       -0.126241  chemotaxis protein CheW
AXO1947_RS10485  -1      12       -0.229951  flagellar biosynthesis protein FlgB
AXO1947_RS10490   1      12       -1.103058  flagellar basal body rod protein FlgC
AXO1947_RS10495   1      12       -1.068572  flagellar basal body rod modification protein
AXO1947_RS10500   1      13       -1.088276  flagellar hook protein FlgE
AXO1947_RS10505   1      14       -0.610883  flagellar basal body rod protein FlgF
AXO1947_RS10510   1      17       -0.428531  flagellar basal body rod protein FlgG
AXO1947_RS10515   0      17       -0.290583  flagellar basal body L-ring protein
AXO1947_RS10520   0      16        0.005565  flagellar P-ring protein FlgI
AXO1947_RS10525  -1      14       -0.080359  flagellar assembly peptidoglycan hydrolase FlgJ
AXO1947_RS10530  -1      13       -0.469400  flagellar hook-associated protein FlgK
AXO1947_RS10535  -1      11       -0.904030  flagellar hook protein FlgL
AXO1947_RS10540   0      17       -1.748087  flagellin
AXO1947_RS10545   0      20       -1.671985  flagellar protein
AXO1947_RS10550   0      19       -1.726499  flagellar protein FliS
AXO1947_RS10555  -1      14       -1.327007  hypothetical protein
AXO1947_RS21815  -2       9       -0.448093  PilZ domain-containing protein
AXO1947_RS10565  -2       7       -0.252907  DNA-binding response regulator
AXO1947_RS10570  -1      14        0.338368  RNA polymerase sigma-54 factor
AXO1947_RS10575  -1      13        0.627399  response regulator
AXO1947_RS10580   1      15        0.547023  sigma-54-dependent Fis family transcriptional
AXO1947_RS10585   1      15       -0.629788  aminotransferase
AXO1947_RS10590   2      13       -1.156897  acyl carrier protein
AXO1947_RS10600   2      14       -1.414578  ketoacyl-ACP synthase III
AXO1947_RS10605   1      13       -0.797590  3-oxoacyl-ACP reductase
AXO1947_RS10610   0      13       -0.783602  3-oxoacyl-ACP reductase
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS10665  PF06580  39     4e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.1 bits (91), Expect = 4e-05
Identities = 18/84 (21%), Positives = 30/84 (35%), Gaps = 10/84 (11%)

Query: 609 NALRHA---CAGEVHLRLHSI-DSDSFRLEVSDDGDGFEPEGPR--GLGLIVMRERAQTV 662
N ++H + L D+ + LEV + G G GL +RER Q +
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQML 325

Query: 663 GG---TLAIESAPGAGTRVTLRLP 683
G + + G + +P
Sbjct: 326 YGTEAQIKLSEKQG-KVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS10670  HTHFIS  99     2e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 99 bits (249), Expect = 2e-24
Identities = 34/115 (29%), Positives = 56/115 (48%), Gaps = 1/115 (0%)

Query: 447 TLLLLDDEENVLRSLVRLFRRDGYRILAAGNVRDAFDLLATNDVQVILSDQRMSDMSGTE 506
T+L+ DD+ + L + R GY + N + +A D ++++D M D + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 507 FLGRVKMLYPDTVRLVLSGYTDLATVTEAINRGAIYRFLTKPWNDDELREHIRQA 561
L R+K PD LV+S T +A +GA Y +L KP++ EL I +A
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGA-YDYLPKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10680  PYOCINKILLER  28     0.009    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 27.8 bits (61), Expect = 0.009
Identities = 16/70 (22%), Positives = 30/70 (42%), Gaps = 4/70 (5%)

Query: 34 QDKLSALHALEAAMPAGEEERLRELAEANRANGALLARRRREVNWALRHLGRTESAPSYD 93
Q +++ L A +A++ A + RE A A A R++ A T + P+
Sbjct: 201 QIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQAAIRA----ANTYAMPANG 256

Query: 94 AKGQSSVLRG 103
+ ++ RG
Sbjct: 257 SVVATAAGRG 266


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits    Score  E-value  Comments
AXO1947_RS10695  HTHFIS  38     3e-05    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 38.3 bits (89), Expect = 3e-05
Identities = 15/75 (20%), Positives = 29/75 (38%), Gaps = 9/75 (12%)

Query: 184 VLVVDDSRVARQQIRSVLDQLGVSATLLSDGRQALDHLLQVAASGENPADRYAMVISDIE 243
+LV DD R + L + G + S+ + A +V++D+
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA---------AGDGDLVVTDVV 56

Query: 244 MPAMDGYTLTTEIRR 258
MP + + L I++
Sbjct: 57 MPDENAFDLLPRIKK 71


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits        Score  E-value  Comments
AXO1947_RS10715  FLGHOOKAP1  44     8e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.8 bits (103), Expect = 8e-07
Identities = 24/67 (35%), Positives = 37/67 (55%), Gaps = 3/67 (4%)

Query: 4 NTSLSGISAANADLNVTSNNIANVNTTGFKESRAEFADMFQSTSYGLSRNAVGSGVRVSN 63
N ++SG++AA A LN SNNI++ N G+ A Q+ S + VG+GV VS
Sbjct: 5 NNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA---QANSTLGAGGWVGNGVYVSG 61

Query: 64 VAQQFSQ 70
V +++
Sbjct: 62 VQREYDA 68



Score = 42.6 bits (100), Expect = 2e-06
Identities = 27/155 (17%), Positives = 61/155 (39%), Gaps = 15/155 (9%)

Query: 264 MQLNVSGSTQYGEQFALRDTRQDGYASGKLNEISIDTSGVVFARYSNGADKPLGQVALSS 323
++L +G+ + F L+ A ++ + D + + A + D +
Sbjct: 396 LELTFTGTPAVNDSFTLKPVSD---AIVNMDVLITDEAKIAMASEEDAGDSDNRNGQ-AL 451

Query: 324 FVNPQGLQSQGNNMWA-ESY----------TSGAARTGAPNTSDLGQIESGSLESSTVDL 372
++ G ++Y T+ + A + + Q+ + S V+L
Sbjct: 452 LDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNL 511

Query: 373 TEQLVNMIVAQRNFQANSQMISTQDQVTQTIINIR 407
E+ N+ Q+ + AN+Q++ T + + +INIR
Sbjct: 512 DEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits        Score  E-value  Comments
AXO1947_RS10720  FLGHOOKAP1  30     0.009    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 30.3 bits (68), Expect = 0.009
Identities = 9/31 (29%), Positives = 19/31 (61%)

Query: 5 LYVAMTGARASLQAQSTVSHNLANVDTVGFK 35
+ AM+G A+ A +T S+N+++ + G+
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYT 34


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits        Score  E-value  Comments
AXO1947_RS10725  FLGHOOKAP1  40     6e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 39.9 bits (93), Expect = 6e-06
Identities = 13/41 (31%), Positives = 20/41 (48%)

Query: 219 LEGSNVNTVEELVSMIETQRAYEMNAKAISTTDAMLGYLNN 259
S VN EE ++ Q+ Y NA+ + T +A+ L N
Sbjct: 504 QSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 37.6 bits (87), Expect = 4e-05
Identities = 11/34 (32%), Positives = 20/34 (58%)

Query: 5 LWVAKTGLDAQQTRMSVISNNLANTNTTGFKRDR 38
+ A +GL+A Q ++ SNN+++ N G+ R
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10730  FLGLRINGFLGH  147    3e-46    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 147 bits (373), Expect = 3e-46
Identities = 78/196 (39%), Positives = 108/196 (55%), Gaps = 9/196 (4%)

Query: 39 VPVVAPVA-----QPTAGAIYAAGPGLNLYGDRRARDVGDLLTVNLVESTTASSTANTSI 93
VP PVA Q Y P L+ DRR R++GD LT+ L E+ +AS +++ +
Sbjct: 40 VPGPTPVANGSIFQSAQPINYGYQP---LFEDRRPRNIGDTLTIVLQENVSASKSSSANA 96

Query: 94 SKKDATTMGAPTLLGAPLTVGGLNVLENSTSGDRSFAGKGNTAQSNRMQGSVTVTVMQRL 153
S+ T G T+ + G + SG +F GKG SN G++TVTV Q L
Sbjct: 97 SRDGKTNFGFDTVPRYLQGLFGNARADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVL 156

Query: 154 PNGNLVIQGQKNLRLTQGDELVQVQGIVRAADIAPDNTVPSSKVADARIAYGGRGAIAQS 213
NGNL + G+K + + QG E ++ G+V I+ NTVPS++VADARI Y G G I ++
Sbjct: 157 VNGNLHVVGEKQIAINQGTEFIRFSGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEA 216

Query: 214 NAMGWLSRFFNSRLSP 229
MGWL RFF + LSP
Sbjct: 217 QNMGWLQRFFLN-LSP 231


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10735  FLGPRINGFLGI  360    e-125    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 360 bits (926), Expect = e-125
Identities = 156/368 (42%), Positives = 221/368 (60%), Gaps = 9/368 (2%)

Query: 6 LSFRLLATAVALCAIAAPASAERIKDLAQVGGVRGNALVGYGLVVGLDGSGDRTSQAPFT 65
+ + + L A A RIKD+A + R N L+GYGLVVGL G+GD +PFT
Sbjct: 8 AAALVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFT 67

Query: 66 VQSLKNLLGELGVNVPANVNPQLKNVAAVAIHAELPPFAKPGQPIDITVSSIANAVSLRG 125
QS++ +L LG+ KN+AAV + A LPPFA PG +D+TVSS+ +A SLRG
Sbjct: 68 EQSMRAMLQNLGITTQGG-QSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRG 126

Query: 126 GSLLMAPLKGADGQVYAMAQGNLVVGGFGAQGKDGSRVSVNVPSVGRIPNGAIVERALPD 185
G+L+M L GADGQ+YA+AQG L+V GF AQG D + ++ V + R+PNGAI+ER LP
Sbjct: 127 GNLIMTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPS 185

Query: 186 VFAGTGEITLNLHQNDFTTVSRMVAAIDS----SFGAGTARAVDGVTVAVRSPTDPGARI 241
F + + L L DF+T R+ +++ +G A D +AV+ P
Sbjct: 186 KFKDSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKP-RVADLT 244

Query: 242 GLLSRLENVELSPGDAPAKVVVNARTGTVVIGQLVRVMPAAIAHGSLTVTISENTNVSQP 301
L++ +EN+ + D PAKVV+N RTGT+VIG VR+ A+++G+LTV ++E+ V QP
Sbjct: 245 RLMAEIENLTVET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP 303

Query: 302 GAFSGGRTAVTQQSTITATSEGSRMFKFEGGTTLDQIVRAVNEVGAAPGDLVAILEALKQ 361
FS G+TAV Q+ I A EGS++ E G L +V +N +G ++AIL+ +K
Sbjct: 304 APFSRGQTAVQPQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQGIKS 362

Query: 362 AGALSAEL 369
AGAL AEL
Sbjct: 363 AGALQAEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS10740  FLGFLGJ  130    5e-37    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 130 bits (328), Expect = 5e-37
Identities = 64/140 (45%), Positives = 83/140 (59%), Gaps = 4/140 (2%)

Query: 218 FVAKIWTHAQKAARELGVDPRALVAQAALETGWGRRGI--GNGGDSNNLFGIKATG-WSG 274
F+A++ AQ A+++ GV ++AQAALE+GWG+R I NG S NLFG+KA+G W G
Sbjct: 152 FLAQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKG 211

Query: 275 DKVTTGTHEYVNGVKTTETADFRVYGSAEESFADYVRLLKNNSRYQPALQAGTDIKGFAR 334
T EY NG A FRVY S E+ +DYV LL N RY A+ + A+
Sbjct: 212 PVTEITTTEYENGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRYA-AVTTAASAEQGAQ 270

Query: 335 GLQQAGYATDPGYAAKIAAI 354
LQ AGYATDP YA K+ +
Sbjct: 271 ALQDAGYATDPHYARKLTNM 290



Score = 71.3 bits (174), Expect = 5e-16
Identities = 48/137 (35%), Positives = 70/137 (51%), Gaps = 16/137 (11%)

Query: 4 AASPIDLNPSTKADPA-KIDKVSRQLEGQFAQMLVKSMRNASSGDPMFPGENQ-MFREMY 61
A S +L DPA I V+RQ+EG F QM++KSMR+A D +F E+ ++ MY
Sbjct: 15 AQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDGLFSSEHTRLYTSMY 74

Query: 62 DQQMAKALTQGKGLGLSAMISKQLSGDTGGPALNTSL--------------NTAEAAKAY 107
DQQ+A+ +T GKGLGL+ M+ KQ++ + P +T N A +
Sbjct: 75 DQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETVVRYQNQALSQLVQ 134

Query: 108 SLVAGKRDASLPLPTRD 124
V D SLP ++
Sbjct: 135 KAVPRNYDDSLPGDSKA 151


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits        Score  E-value  Comments
AXO1947_RS10745  FLGHOOKAP1  221    1e-66    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 221 bits (564), Expect = 1e-66
Identities = 140/437 (32%), Positives = 219/437 (50%), Gaps = 8/437 (1%)

Query: 2 SIMSTGTSALIAFQRALSTVSHNVANINTEGYSRQRVEFATRTPTDMGYAFVGNGAKITD 61
S+++ S L A Q AL+T S+N+++ N GY+RQ A T +VGNG ++
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSG 61

Query: 62 VGRVADQLAISRLLDSGGELSRLQQLSSLSNRVDALYSNTATNVAGLWSNFFDSTSAVSS 121
V R D ++L + + S L +++D + S + +++A +FF S + S
Sbjct: 62 VQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVS 121

Query: 122 NASSTAERQSMLDSGNSLATRFKQLNGQMDGLSNEVNSGLTSSVDEVNRLTQQIAKLNGT 181
NA A RQ+++ L +FK + + +VN + +SVD++N +QIA LN
Sbjct: 122 NAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQ 181

Query: 182 I----GSSAQNAAPDMLDQRDALVSKLVGYTGGTAVMQDGGFINVFTAGGQALVVATTSS 237
I G A + ++LDQRD LVS+L G +QDGG N+ A G +LV +T+
Sbjct: 182 ISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTAR 241

Query: 238 KLTTVADPYQPSKLQVAMQTQGQNVSLSANSL--GGQIGGLLEFRSSVLEPTQAELGRLA 295
+L V PS+ VA L G +GG+L FRS L+ T+ LG+LA
Sbjct: 242 QLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLA 301

Query: 296 VGMASTFNAGHRQGMDLYGAMGGNFFNIGSPTTAANPSNTGSASLSASFSNMSAVDGQNV 355
+ A FN H+ G D G G +FF IG P N N G ++ A+ ++ SAV +
Sbjct: 302 LAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATDY 361

Query: 356 TLSFDGTNWKATNASTGSAVPMTGTGTAANPLVLNGVSMVVGGTPASGDKFLLQPTAGLA 415
+SFD W+ T ++ + T T A + +G+ + GTPA D F L+P +
Sbjct: 362 KISFDNNQWQVTRLASNTT--FTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAI 419

Query: 416 GSLSVAITDPSRIAAAT 432
++ V ITD ++IA A+
Sbjct: 420 VNMDVLITDEAKIAMAS 436



Score = 82.7 bits (204), Expect = 1e-18
Identities = 38/105 (36%), Positives = 55/105 (52%)

Query: 517 AGSSDNGNAKLLAKIDDAKALSGGTVTLNGALSGLTTSVGSAARAANYSADAQKVINDQA 576
AG SDN N + L + GG + N A + L + +G+ S+ Q + Q
Sbjct: 440 AGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQL 499

Query: 577 QASRDSISGVNLDEEAANMLKLQQAYQAAAQMISTADTIFQAILG 621
+ SISGVNLDEE N+ + QQ Y A AQ++ TA+ IF A++
Sbjct: 500 SNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS10750  FLAGELLIN  53     9e-10    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 53.1 bits (127), Expect = 9e-10
Identities = 56/349 (16%), Positives = 107/349 (30%), Gaps = 6/349 (1%)

Query: 4 RISTSMMYSQSVASMGAKQARLSQIEAQLASGQRLVTAKDDPVAAGTAVGLDRALAAITR 63
I+T+ + + ++ Q+ LS +L+SG R+ +AKDD A + +T+
Sbjct: 3 VINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQ 62

Query: 64 FGENANNVQNRLGLQENALSQAGDKMARVTELAVQANNSSLSPDDRKAIAAELTALRDSM 123
NAN+ + E AL++ + + RV EL+VQA N + S D K+I E+ + +
Sbjct: 63 ASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEI 122

Query: 124 VSLANSTDGTGRYLFGGTADGSAPFIKSSG---NVTYNGDQTQKQVEVAPDTFVSDTLPG 180
++N T G + + G + + +
Sbjct: 123 DRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATV 182

Query: 181 SEIFMRIRTGDGTVDAHANAANTGTGLLLDFSRDTSTGSWNGASYSVQFTAANTYEVRDS 240
++ + G A + +T V
Sbjct: 183 GDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAE 242

Query: 241 TNALVSTGTYKEG--QDINAAGVRMRISGAPAVGDSFQIGASGSKDVFSTID-DMVGALN 297
N V + A + I G G + + D + D + +
Sbjct: 243 NNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTT 302

Query: 298 SDTLTAPQKASMINTLQSSMRDIAQASSKMIDARASGGAQLSAIDNANS 346
+ + I +++ SSK + G N
Sbjct: 303 INGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNE 351



Score = 36.2 bits (83), Expect = 2e-04
Identities = 44/269 (16%), Positives = 82/269 (30%), Gaps = 1/269 (0%)

Query: 127 ANSTDGTGRYLFGGTADGSAPFIKSSGNVTYNGDQTQKQVEVAPDTFVSDTLPGSEIFMR 186
AN T D ++G + DTF + +
Sbjct: 232 ANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKT 291

Query: 187 IRTGDGTVDAHANAANTGTGLLLDFSRDTSTGSWNGASYSVQFTAANTYEVRDSTNALVS 246
G+G V N + + + + S +T+ +
Sbjct: 292 GNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNE 351

Query: 247 TGTYKEGQDINAAGVRMRISGAPAVGDSFQIGASGSKDVFSTIDDMVGALNSDTLTAPQK 306
+ + + NA +I+ A + G + + D + TL
Sbjct: 352 SAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTAS-GVSTLINEDA 410

Query: 307 ASMINTLQSSMRDIAQASSKMIDARASGGAQLSAIDNANSLLESNEVTLKTTLSSIRDLD 366
A+ + + + I A SK+ R+S GA + D+A + L + L + S I D D
Sbjct: 411 AAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDAD 470

Query: 367 YASAIGQYQLEKASLQAAQTIFQQMQSSS 395
YA+ + + QA ++ Q
Sbjct: 471 YATEVSNMSKAQILQQAGTSVLAQANQVP 499


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10755  FLAGELLIN     124    1e-33    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 124 bits (313), Expect = 1e-33
Identities = 120/400 (30%), Positives = 182/400 (45%), Gaps = 10/400 (2%)

Query: 2 AQVINTNVMSLNAQRNLNTNSSSLALSIQQLSSGKRITSFAVDAAGGAIAERFTTQIRGL 61
AQVINTN +SL Q NLN + SSL+ +I++LSSG RI S DAAG AIA RFT+ I+GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVASRNANDGISLSQTAEGAMQEIGNNLQRIRELSVQSANATNSSTDREALNSEVKQLTS 121
ASRNANDGIS++QT EGA+ EI NNLQR+RELSVQ+ N TNS +D +++ E++Q
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 EIDRVANQTSFNGTKLLDGSFSGALFQVGADAGQTIGINSIADANIDTLGRANFAAAVSG 181
EIDRV+NQT FNG K+L QVGA+ G+TI I + ++ +LG F
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQ-MKIQVGANDGETITI-DLQKIDVKSLGLDGFNVNGPK 178

Query: 182 AGVTGTATASGSVSGISLSFKDASGSAKSITIADVKVGAGDTAADVNKKVASAINDKLDQ 241
G +S ++ + + + V +K +A N +L
Sbjct: 179 EATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTT 238

Query: 242 TGMYASIKSDGSVQIESLKAGQDFTSLSAG--------TSSAAGITVGAGITTASAASGS 293
+ D +S + +++ T G+T T + +G
Sbjct: 239 DDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGK 298

Query: 294 TASTLGSLDISTFSGAQQALEIVDKALTTVNSSRADMGAVQNRFTSTIANLSATSENLSA 353
++T+ ++ A A T +S V +FT + +++
Sbjct: 299 VSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDL 358

Query: 354 SRSRIADTDYAKTTAELTRTQILQQAGTAMLAQAKSVPQN 393
+ + T T + + + +
Sbjct: 359 EANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKT 398



Score = 93.2 bits (231), Expect = 9e-23
Identities = 67/301 (22%), Positives = 121/301 (40%), Gaps = 3/301 (0%)

Query: 99 SANATNSSTDREALNSEVKQLTSEIDRVANQTSFNGTKLLDGSFSGALFQVGADAGQTIG 158
++ A + T + +V + + N L + A A
Sbjct: 210 NSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269

Query: 159 INSIADANIDTLGRANFAAAVSGAGVTGTATASGSVSGISLSFKDASGSAKSITIADVKV 218
D G +G G + + + ++L+ D + A ++ A ++
Sbjct: 270 KGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQS 329

Query: 219 GAGDTAADVNKKVASAINDKLDQTGMYASIKSDGSVQIESLKAGQDFTSLSAGTSSAAGI 278
+ VN + K + + ++ + + ++ +
Sbjct: 330 SKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVN---GAEYTANAAGDKV 386

Query: 279 TVGAGITTASAASGSTASTLGSLDISTFSGAQQALEIVDKALTTVNSSRADMGAVQNRFT 338
T+ + ++ + + L +D AL+ V++ R+ +GA+QNRF
Sbjct: 387 TLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFD 446

Query: 339 STIANLSATSENLSASRSRIADTDYAKTTAELTRTQILQQAGTAMLAQAKSVPQNVLSLL 398
S I NL T NL+++RSRI D DYA + +++ QILQQAGT++LAQA VPQNVLSLL
Sbjct: 447 SAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLL 506

Query: 399 Q 399
+
Sbjct: 507 R 507


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10780  HTHFIS        72     6e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.8 bits (176), Expect = 6e-17
Identities = 35/160 (21%), Positives = 66/160 (41%), Gaps = 9/160 (5%)

Query: 2 RVIIVDDHTLVRAGLSRLLQTFAGIDVVGEASNAQQALDMTSLHRPDLVLMDLSLPGRSG 61
+++ DD +R L++ L + AG DV SNA + DLV+ D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQAL-SRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 LDAMTDVLRAAPRTHVVMMSMHDDPVHVRDALDRGAVGFVVKDAAPLELELALRAAAAGQ 121
D + + +A P V++MS + + A ++GA ++ K EL + A
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA---- 118

Query: 122 VFLSPQISSKMIAPMLGREKPVGIAALSPRQREILREIGR 161
+ + + + + S +EI R + R
Sbjct: 119 ---LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10790  HTHFIS        56     2e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 55.6 bits (134), Expect = 2e-12
Identities = 20/118 (16%), Positives = 44/118 (37%), Gaps = 2/118 (1%)

Query: 1 MSKLTVLLVDDHEGFINAAMRHFRKVEWLNIVGSAANGLEAIERSESLRPNVVLMDLAMP 60
M+ T+L+ DD + + + V +N + ++V+ D+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 EMGGLQATRLIKTQDDPPYIVIASHFDDAEHREHALRAGADNFVSKLSYIQEVMPILE 118
+ IK +++ S + A GA +++ K + E++ I+
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10795  HTHFIS        434    e-151    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 434 bits (1117), Expect = e-151
Identities = 175/489 (35%), Positives = 252/489 (51%), Gaps = 16/489 (3%)

Query: 1 MSESRILLIDSDAVRAERTVSLLEFMDFNPRWVTDGADINPGRHRHDEWMAVMVGSAQDA 60
M+ + IL+ D DA L ++ R ++ A + R ++V
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLW--RWIAAGDGDLVVTDVVMP 58

Query: 61 -AQADKFFDWLADAKLPPPVLLMEGSPTAFAQAHGLHEANVWALDTPLRHAQLEALLRRA 119
A + A+ PVL+M T + L P +L ++ RA
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 120 S--LKRLDAEHQAGVQQDSGPTGNSEAVTRLRRLIDQVAAFDTTVLVLGESGTGKEVVAR 177
KR ++ + Q G S A+ + R++ ++ D T+++ GESGTGKE+VAR
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVAR 178

Query: 178 AIHQHSPRRDGPFVAINCGAIPPDLLESELFGHEKGAFTGALTTRKGRFEMAEGGTLLLD 237
A+H + RR+GPFVAIN AIP DL+ESELFGHEKGAFTGA T GRFE AEGGTL LD
Sbjct: 179 ALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLD 238

Query: 238 EIGDMSLPMQVKLLRVLQERSFERVGGGQTIRCNVRVIAATHRNLETRISDGQFREDLFY 297
EIGDM + Q +LLRVLQ+ + VGG IR +VR++AAT+++L+ I+ G FREDL+Y
Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYY 298

Query: 298 RLNVFPIEMPALRERVDDLAMLVQTIAVQLARTGRGEVRFAEEALQALRSYNWPGNVREL 357
RLNV P+ +P LR+R +D+ LV+ Q + G RF +EAL+ ++++ WPGNVREL
Sbjct: 299 RLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVREL 358

Query: 358 TNLVERLAVLHPGGLVRVQDLPARYRGDFASAIPVELPPEPALLAAPVQVTDLPSNVVTL 417
NLV RL L+P ++ + + R + P ++
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEI-----------PDSPIEKAAARSGSLSISQA 407

Query: 418 PPKTADAEPATAASLPDDGIDLRGHMANIELALINEALERTQGVVAHAAQLLGLRRTTLV 477
+ A+ +A +E LI AL T+G AA LLGL R TL
Sbjct: 408 VEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLR 467

Query: 478 EKLRKYGID 486
+K+R+ G+
Sbjct: 468 KKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10815  DHBDHDRGNASE  104    4e-29    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 104 bits (261), Expect = 4e-29
Identities = 66/254 (25%), Positives = 116/254 (45%), Gaps = 15/254 (5%)

Query: 13 LQGKRILVTGASSGIGRQIALSCAQIGAQLVITGRNEGR--LAETFALLEGTGHAQVIAN 70
++GK +TGA+ GIG +A + A GA + N + + E A+
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 71 LDKQEDIDHLVT----SVGVLDGVAHAAGIARLAPFRMINRAHLDETFASNVYAPLLLTR 126
+ ID + +G +D + + AG+ R ++ + TF+ N +R
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 127 GLLAKKRINANGSILFISAVGSHIGPVATAAYSASKAALLGAMRTLALEVAKHGIRANCI 186
+ +GSI+ + + + + + AAY++SKAA + + L LE+A++ IR N +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 187 VPGYVRTPMLEGL--KQSG------GSIDEHAKLTPLG-LGEPEDVAYAAVFYLSDASRW 237
PG T M L ++G GS++ PL L +P D+A A +F +S +
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 238 VTRNYFLVDGGLTV 251
+T + VDGG T+
Sbjct: 246 ITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10820  DHBDHDRGNASE  100    1e-27    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 100 bits (250), Expect = 1e-27
Identities = 66/257 (25%), Positives = 117/257 (45%), Gaps = 14/257 (5%)

Query: 8 AFSLEGKTILITGASSGLGQEIALTCARRGGRLVISGRDSERLQQTHAQLAGNGHVQVQ- 66
A +EGK ITGA+ G+G+ +A T A +G + + E+L++ + L
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF 62

Query: 67 -ADL----TVSEDRERLVQASQRIDGVVHCFGGQMLSPIRQLKEELMTRMYQVHFLAPVM 121
AD+ + E R+ + ID +V+ G I L +E + V+
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 122 LTQRLLQANAINAQGSIVFMLSTSAHIGTRGVGPYSAMKSGLLGIIRCLALEQAKHRVRV 181
++ + + GSIV + S A + + Y++ K+ + +CL LE A++ +R
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 182 NGISPSAVPTP---RLW----GEDNRLNEMLNQQRARHPLG-LGTPHDVANAAVYLLADA 233
N +SP + T LW G + + L + PL L P D+A+A ++L++
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 234 SRWVTGTSLVMDGGAVL 250
+ +T +L +DGGA L
Sbjct: 243 AGHITMHNLCVDGGATL 259


78  AXO1947_RS10645  AXO1947_RS10735  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS10645  1       10       -0.415767  flagellar hook-basal body complex protein FliE
AXO1947_RS21820  1       10       -1.113316  flagellar M-ring protein FliF
AXO1947_RS10655  2       11       -0.731869  flagellar motor switch protein FliG
AXO1947_RS10660  2       12       -0.682846  flagellar assembly protein FliH
AXO1947_RS10665  1       10       -0.147962  flagellar protein FliI
AXO1947_RS10670  -1      14       -0.636063  flagellar export protein FliJ
AXO1947_RS10675  0       16       -0.352377  flagellar protein
AXO1947_RS10680  0       17       0.803163   flagellar basal body protein FliL
AXO1947_RS10685  0       19       -0.203193  flagellar motor switch protein FliM
AXO1947_RS10690  1       19       -0.297224  flagellar motor switch protein FliN
AXO1947_RS10695  0       19       -1.027103  flagellar protein
AXO1947_RS10700  0       19       -0.996468  flagellar biosynthetic protein FliP
AXO1947_RS10705  1       20       -0.927786  flagellar biogenesis protein
AXO1947_RS10710  2       21       -0.306653  flagellar biosynthetic protein FliR
AXO1947_RS10720  0       20       1.209777   GGDEF domain-containing protein
AXO1947_RS10725  -1      22       1.148805   GGDEF domain-containing protein
AXO1947_RS10730  -1      23       0.802674   bifunctional diguanylate
AXO1947_RS10735  -1      22       0.367192   flagellar biosynthesis protein FlhB
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10850  FLGHOOKFLIE   62     3e-16    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE

signature.
Length = 103

Score = 62.0 bits (150), Expect = 3e-16
Identities = 28/84 (33%), Positives = 48/84 (57%)

Query: 22 AGAQGTPATQAPSFSETLRGAIGGVNEAQQKAGALSKAFEMGDPNADLARVMVASQQSQV 81
A AQ + SF+ L A+ +++ Q A ++ F +G+P L VM Q++ V
Sbjct: 20 ARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASV 79

Query: 82 AFRATVEVRNRLVQAYQDVMNMPL 105
+ + ++VRN+LV AYQ+VM+M +
Sbjct: 80 SMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10855  FLGMRINGFLIF  351    e-117    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 351 bits (903), Expect = e-117
Identities = 189/574 (32%), Positives = 302/574 (52%), Gaps = 43/574 (7%)

Query: 16 KAGQWFDRVRSLQITRKLTMMAMIALAVAAGLAVFFWSQKPGYQSLYTGLDEKGNAEAAD 75
K +W +R+R+ ++ ++ + AVA +A+ W++ P Y++L++ L ++
Sbjct: 11 KPLEWLNRLRANP---RIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVA 67

Query: 76 LLRIAQIPYKIDQGTGAISVPQDRLYDARLKLAGSGLTGKETGGGFELMEKDPGFGVSQF 135
L IPY+ G+GAI VP D++++ RL+LA GL K GFEL++++ FG+SQF
Sbjct: 68 QLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLP-KGGAVGFELLDQEK-FGISQF 125

Query: 136 VENARYQHALETELSRTIGTLRPVREARVHLAIPKPSAFTRQRDVASASVVLELRGGQGL 195
E YQ ALE EL+RTI TL PV+ ARVHLA+PKPS F R++ SASV + L G+ L
Sbjct: 126 SEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRAL 185

Query: 196 ERNQVDAIVNLVASSIPDMTPERVTVVDQSGRMLSIADPNSDAAQHAAQFEQVRRQESSY 255
+ Q+ A+V+LV+S++ + P VT+VDQSG +L+ ++ + AQ + ES
Sbjct: 186 DEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLN-DAQLKFANDVESRI 244

Query: 256 NQRIRELLEPMTGPGRVNPETSVDMDFSVVEEARELYN----GEPAKLRSEQVND-TSTT 310
+RI +L P+ G G V+ + + +DF+ E+ E Y+ A LRS Q+N
Sbjct: 245 QRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVG 304

Query: 311 ATGPQGPPGATSNSPGQP-------PAPAAAGAPGTPAAANGQATAAAAPTESSKSATRN 363
A P G PGA SN P P P A TP + + +A P + ++ T N
Sbjct: 305 AGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSN 364

Query: 364 YELDRTLQHTRQPAGRIKRVSVAVLLDNVPRPGAKGKMVEQPLTAAELTRIEGLVKQAVG 423
YE+DRT++HT+ G I+R+SVAV+++ K PLTA ++ +IE L ++A+G
Sbjct: 365 YEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKP----LPLTADQMKQIEDLTREAMG 420

Query: 424 FDAARGDTVSVMNAPFVREAVAGEEGPKWWEDPRVQNGLRLLVGAVVVLALLF----GVV 479
F RGDT++V+N+PF G E P W + + L ++VL + + V
Sbjct: 421 FSDKRGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAG-RWLLVLVVAWILWRKAV 479

Query: 480 RPTLRQLTGVTPIKEKQRKGGNDGTPQNADVRMVDDEDSLLPQMGEDTASIGQERKPAIA 539
RP L + ++Q + + + +VR+ DE Q+R+
Sbjct: 480 RPQLTRRVEEAKAAQEQAQVRQETE-EAVEVRLSKDEQL-------------QQRRANQR 525

Query: 540 LPDAYEERMRVAREAVKADSKRVAQVVKGWVASE 573
L E + RE D + VA V++ W++++
Sbjct: 526 LG--AEVMSQRIREMSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10860  FLGMOTORFLIG  307    e-106    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 307 bits (789), Expect = e-106
Identities = 104/329 (31%), Positives = 200/329 (60%)

Query: 1 MSGVQRAAVLLLSLGESDAAEVLKHMDPKEVQKIGIAMATMTGISRDQVEKVMDDFNGEL 60
++G Q+AA+LL+S+G +++V K++ +E++ + +A + I+ + + V+ +F +
Sbjct: 15 LTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKELM 74

Query: 61 AGKTSLGVGADDYIRNVLIQALGADKAGGLIDRILLGRNTTGLDTLKWMDPRAVADLVRN 120
+ + G DY R +L ++LG KA +I+ + + + ++ DP + + ++
Sbjct: 75 MAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQQ 134

Query: 121 EHPQIIAIVMAHLDSDQAAEALKLLPERTRADVLLRIATLDGIPPNALSELNDIMERQFA 180
EHPQ IA+++++LD +A+ L LP + +V RIA +D P + E+ ++E++ A
Sbjct: 135 EHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKLA 194

Query: 181 GNQNLKSSNVGGIKVAANILNFLDTGSDQGVLGEIGKIDADLAGKIQDLMFVFDNLVDLD 240
+ ++ GG+ I+N D +++ ++ + + D +LA +I+ MFVF+++V LD
Sbjct: 195 SLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLLD 254

Query: 241 DRGLQTLLREVSGERLGLALRGADVKVREKITRNMSQRAAEILLEDMEARGPVRLADVEA 300
DR +Q +LRE+ G+ L AL+ D+ V+EKI +NMS+RAA +L EDME GP R DVE
Sbjct: 255 DRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVEE 314

Query: 301 AQKEILTIVRRLADEGAISLGGAGAEAMV 329
+Q++I++++R+L ++G I + G E ++
Sbjct: 315 SQQKIVSLIRKLEEQGEIVISRGGEEDVL 343


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10865  FLGFLIH       46     3e-08    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 45.6 bits (107), Expect = 3e-08
Identities = 37/159 (23%), Positives = 78/159 (49%), Gaps = 7/159 (4%)

Query: 51 QEGFARGHAEGFAQGQSEVRRLTAQIDGILDNFTRPLARLENEVVGALGELAVRIAGQLV 110
QEG A+G +G A+ +S+ + A++ ++ F L L++ + L ++A+ A Q++
Sbjct: 73 QEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVI 132

Query: 111 GRAYQADPQLLAELVGEAVDAVGGAGREVEVRLHPDDITALLPHLAPSSTT---RVAPDM 167
G+ D L + + + + + ++R+HPDD+ + L + + R+ D
Sbjct: 133 GQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDP 192

Query: 168 SLSRGDLRVHAESVRIDGTLDARLRAALETVMRKSGAGL 206
+L G +V A+ +G LDA + + + R + G+
Sbjct: 193 TLHPGGCKVSAD----EGDLDASVATRWQELCRLAAPGV 227


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10875  FLGFLIJ       26     0.045    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 26.3 bits (57), Expect = 0.045
Identities = 33/140 (23%), Positives = 58/140 (41%), Gaps = 4/140 (2%)

Query: 1 MMQSKRIDPLLRRAQEQEDKVARDLAERQRTLETNQSRLEELRRYVEEYANSQMAGTSAV 60
M + + L A+++ + AR L E +R + + +L+ L Y EY N+ + SA
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 ALTNR----RAFLDRLDSAVLQQAQTVQSNIAKVEAERTRLLLASREKQVLEQLAASYRA 116
+NR + F+ L+ A+ Q Q + KV+ + Q + L
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 117 QENKVIERRDQREMDDLGAR 136
R DQ++MD+ R
Sbjct: 121 AALLAENRLDQKKMDEFAQR 140


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10880  FLGHOOKFLIK   51     3e-09    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 51.4 bits (122), Expect = 3e-09
Identities = 41/176 (23%), Positives = 79/176 (44%), Gaps = 6/176 (3%)

Query: 257 AAKALEPAADDSTAPAAPDAPAFALPTTTAPALSRLQDAAPIFSASPTPTPDLGSDNFDD 316
A+ L P ++ + A + + +P ++ Q A+P + LGS +
Sbjct: 183 PAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEWQQ 242

Query: 317 AIGARMSWLADQKIGHAHIKVTPNEMGPVEVRLHLDGDRVNASFTAANADTRQALEQSLP 376
++ +S Q A +++ P ++G V++ L +D ++ + + R ALE +LP
Sbjct: 243 SLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALP 302

Query: 377 RLREMLGQNGFQLGQADV------GQQQQHPSGNRTGGNGNGNGLTLDDSPPVGIP 426
LR L ++G QLGQ+++ GQQQ ++ N L +D + +P
Sbjct: 303 VLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVP 358


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10890  FLGMOTORFLIM  255    2e-85    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 255 bits (653), Expect = 2e-85
Identities = 91/327 (27%), Positives = 162/327 (49%), Gaps = 14/327 (4%)

Query: 3 VSDLLSQDEIDALLHGVDSGAVNTEPEPLPGEARQ-----YDLSSQDRIIRGRMPTLEMV 57
++++LSQDEID LL + SG + E + YD D+ + +M TL ++
Sbjct: 1 MTEVLSQDEIDQLLTAISSG--DASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLM 58

Query: 58 NERFARLWRIGLFNLIRRSADLSVRGIDLVKFNEYMHSLYVPTNLNLIRFKPLRGTGLIV 117
+E FARL L +R + V +D + + E++ S+ P+ L +I PL+G ++
Sbjct: 59 HETFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLE 118

Query: 118 FEPTLVFTVVDNFFGGDGRYHTRIEGREFTATEMRVVQLMLKQTFADLKEAWAPVMDVDL 177
+P++ F+++D FGG G+ R+ T E V++ ++ + A+++E+W V+D+
Sbjct: 119 VDPSITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRP 176

Query: 178 EYINSEINPHFANIVTPREYVVVCRFHVELEGGGGEIHITLPYSMLEPIRELLDAG--IQ 235
E NP FA IV P E VV+ ++ G ++ +PY +EPI L +
Sbjct: 177 RLGQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFS 236

Query: 236 SDRNDRDDSWNVMLREQLDTAEVTLSSVLASKRMSLRQLTGLKVGDIL---PIDLPAQVP 292
S R + +LR++L T ++ + + + S R+S+R + GL+VGDI+ +
Sbjct: 237 SVRRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFV 296

Query: 293 LCVEEIPLFTGEFGVSNGNNAVKITAV 319
L + F + GV A +I
Sbjct: 297 LSIGNRKKFLCQPGVVGKKIAAQILER 323


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10895  FLGMOTORFLIN  114    3e-36    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 114 bits (286), Expect = 3e-36
Identities = 50/90 (55%), Positives = 74/90 (82%)

Query: 22 DQNAADLNLDVILDVPVTLSLEVGRARIPIRNLLQLNQGSVVELERGAGEPLDVYVNGTL 81
D + A ++D+I+D+PV L++E+GR R+ I+ LL+L QGSVV L+ AGEPLD+ +NG L
Sbjct: 46 DVSGAMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYL 105

Query: 82 IAHGEVVVINDRFGIRLTDVVSPSERIRRL 111
IA GEVVV+ D++G+R+TD+++PSER+RRL
Sbjct: 106 IAQGEVVVVADKYGVRITDIITPSERMRRL 135


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10905  FLGBIOSNFLIP  240    7e-82    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP

signature.
Length = 245

Score = 240 bits (615), Expect = 7e-82
Identities = 123/228 (53%), Positives = 162/228 (71%), Gaps = 1/228 (0%)

Query: 51 PAGSNQLPSLPNVSVGRIGDQPVSLPLQTLLLMTAITLLPSMLLVLTAFTRITIVLGLLR 110
P QLP + + + G Q SLP+QTL+ +T++T +P++LL++T+FTRI IV GLLR
Sbjct: 17 PLAFAQLPGITSQPL-PGGGQSWSLPVQTLVFITSLTFIPAILLMMTSFTRIIIVFGLLR 75

Query: 111 QALGTGQTPSNQVLLGLSMFLTALVMMPVWQKMWGAGLQPYLNNQIDFSTAWTLTTQPLR 170
ALGT P NQVLLGL++FLT +M PV K++ QP+ +I A QPLR
Sbjct: 76 NALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKISMQEALEKGAQPLR 135

Query: 171 AFMLAQIRETDLMTFAGMAGDGKYVGPDAVPFPVLVASFVTSELKTAFEIGFLIFIPFVI 230
FML Q RE DL FA +A G GP+AVP +L+ ++VTSELKTAF+IGF IFIPF+I
Sbjct: 136 EFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKTAFQIGFTIFIPFLI 195

Query: 231 IDLVVASVLMSMGMMMLSPMLISAPFKILLFILVDGWVLVVGTLAASF 278
IDLV+ASVLM++GMMM+ P I+ PFK++LF+LVDGW L+VG+LA SF
Sbjct: 196 IDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10910  TYPE3IMQPROT  44     1e-09    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein

family signature.
Length = 86

Score = 44.0 bits (104), Expect = 1e-09
Identities = 17/69 (24%), Positives = 32/69 (46%)

Query: 13 GLVTVLWIAGPMLLAVLVVGVVIGVVQAATQLNEPTISFVAKAVALTATLFATGSMLLGH 72
L VL ++G + ++G+++G+ Q TQL E T+ F K + + LF
Sbjct: 11 ALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLSGWYGEV 70

Query: 73 LVEFTIALF 81
L+ + +
Sbjct: 71 LLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10915  TYPE3IMRPROT  123    2e-36    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 123 bits (311), Expect = 2e-36
Identities = 79/239 (33%), Positives = 129/239 (53%), Gaps = 2/239 (0%)

Query: 23 WTMLRTGALLTAMPLIGTRTVPGRVRVMLAGTLAMVLAPILPPVPEWDGFTAQAVLSIAR 82
W +LR AL++ P++ R+VP RV++ LA + +AP LP L++ +
Sbjct: 18 WPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFFALWLAV-Q 76

Query: 83 ELAVGASMGFMLKLIFEAGALAGELVSQSTGLSFAQMSDPMRGVTSGVIAQWFYIGFGLL 142
++ +G ++GF ++ F A AGE++ GLSFA DP + V+A+ + LL
Sbjct: 77 QILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLALLL 136

Query: 143 FFAANGHLAVIALLVDSYKALPIGTALPDAGAFAEVAPTLFLQILRGGLTLALPMMVAML 202
F NGHL +I+LLVD++ LPIG ++ AF + I GL LALP++ +L
Sbjct: 137 FLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALT-KAGSLIFLNGLMLALPLITLLL 195

Query: 203 AVNLAFGALAKAAPALNPVQLGLPLTVLLGLFLLSSFASEFAPPVQRMFDTAFDAARKL 261
+NLA G L + AP L+ +G PLT+ +G+ L+++ AP + +F F+ +
Sbjct: 196 TLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFNLLADI 254


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10920  GPOSANCHOR    35     0.002    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 35.0 bits (80), Expect = 0.002
Identities = 33/125 (26%), Positives = 49/125 (39%), Gaps = 14/125 (11%)

Query: 762 LLLAVVAVIALLRWRTAKLLRRKRELEQLVAERTAELEQDKRDLEAARAEL-SLKATHDE 820
A A I L A L K +LE A + +RDL+A+R L+A H +
Sbjct: 275 FSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQK 334

Query: 821 LTGLLNRAGILAALREML---LHAARSGGPLAVVLIDLDHFKLVNDQHGHLAGDAVLAGV 877
L I A R+ L L A+R A ++ +H KL + +A +
Sbjct: 335 LEEQN---KISEASRQSLRRDLDASRE----AKKQLEAEHQKLEEQ---NKISEASRQSL 384

Query: 878 GRRMD 882
R +D
Sbjct: 385 RRDLD 389


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10935  TYPE3IMSPROT  349    e-121    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 349 bits (898), Expect = e-121
Identities = 104/344 (30%), Positives = 184/344 (53%), Gaps = 2/344 (0%)

Query: 8 GERRELPTEKRLREAREQGNIPQSRELSTAAVFGAGVFALMALARGIGDGATAWMKTALS 67
GE+ E PT K++R+AR++G + +S+E+ + A+ A LM L+ + + M +
Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLM--LIP 60

Query: 68 PDPTMRENPMALFGHFGDLLLQLLWVMLPLIGICLAAGLAGPLLMSGLHFSGKAIMPDLT 127
+ + AL ++LL+ ++ PL+ + +A ++ G SG+AI PD+
Sbjct: 61 AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIK 120

Query: 128 KLNPANGLKRMWGSNSLAELVKSVLRLLFVGLAASFCISKSLPGLRSLVSQPLEQAVGNG 187
K+NP G KR++ SL E +KS+L+++ + + I +L L L + +E
Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLL 180

Query: 188 LDFTKSLLFYTAGALVLLAAIDAPYQKWNWMRKLKMTREEIKREMKESEGSPEVKGRIRQ 247
+ L+ V+++ D ++ + ++++LKM+++EIKRE KE EGSPE+K + RQ
Sbjct: 181 GQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQ 240

Query: 248 MQMQMSQRQMMEAVPKADVVLMNPTHYAVALKYEGGKMRAPIVVAKGVDEMAFRIREAGE 307
++ R M E V ++ VV+ NPTH A+ + Y+ G+ P+V K D +R+ E
Sbjct: 241 FHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAE 300

Query: 308 QHRVAIVTAPPLARALYREAQIGKEIPVRLYSVVAQVLSYVYQL 351
+ V I+ PLARALY +A + IP A+VL ++ +
Sbjct: 301 EEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQ 344


79  AXO1947_RS10760  AXO1947_RS10780  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS10760  -1      17       0.461078   response regulator
AXO1947_RS10765  0       15       0.040752   chemotaxis protein
AXO1947_RS10770  0       15       0.699037   chemotaxis protein CheA
AXO1947_RS10775  0       14       0.111025   IS5/IS1182 family transposase
AXO1947_RS10780  0       13       -0.300176  IS5/IS1182 family transposase
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10960  HTHFIS        92     4e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 4e-25
Identities = 31/105 (29%), Positives = 50/105 (47%), Gaps = 3/105 (2%)

Query: 2 RILIVDDFSTMRRIVKNLLGDLGFTNTAEAEDGNSALAALRAGPFDFVVTDWNMPGMTGI 61
IL+ DD + +R ++ L G+ + + + AG D VVTD MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 DLLRNIRADAKLKHLPVMMVTAEAKREQIIEAAQCGVNGYIIKPF 106
DLL I+ LPV++++A+ I+A++ G Y+ KPF
Sbjct: 64 DLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10970  PF06580       44     1e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 44.1 bits (104), Expect = 1e-06
Identities = 24/136 (17%), Positives = 44/136 (32%), Gaps = 53/136 (38%)

Query: 284 LVRNAIDHGIESPALREATGKPRSGHVRLSAQQEGDYVSIEIQDDGAGIDPERLREIARN 343
LV N I HGI P+ G + L ++ V++E+++ G+
Sbjct: 263 LVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEVENTGSLALKN-------- 306

Query: 344 KGLIDAEAAARLSTDECLHLIFMPGFSTKVEVTDISGRGVGMDVVQSRIRELSG---QIQ 400
G G+ V+ R++ L G QI+
Sbjct: 307 ---------------------------------TKESTGTGLQNVRERLQMLYGTEAQIK 333

Query: 401 IQSELGRGSRFMIRVP 416
+ + G+ M+ +P
Sbjct: 334 LSEKQGKV-NAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10975  PF05043       33     0.002    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 33.4 bits (76), Expect = 0.002
Identities = 19/85 (22%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCLPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F C+ D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS10985  PF05043       33     0.002    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 33.0 bits (75), Expect = 0.002
Identities = 19/85 (22%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCLPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F C+ D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


80  AXO1947_RS10865  AXO1947_RS10895  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS10865  1       28       3.462805   flagellar motor protein MotD
AXO1947_RS10870  1       27       2.851678   chromosome partitioning protein ParA
AXO1947_RS10875  2       24       1.823321   chemotaxis protein
AXO1947_RS10885  2       20       -0.447755  hypothetical protein
AXO1947_RS10890  1       17       0.080384   Fis family transcriptional regulator
AXO1947_RS10895  0       12       1.800868   chemotaxis protein CheA
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS11095  OMPADOMAIN    71     5e-16    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 71.1 bits (174), Expect = 5e-16
Identities = 33/118 (27%), Positives = 47/118 (39%), Gaps = 16/118 (13%)

Query: 162 INSDILFGTGSAALAGNARTTLSTLASVLRD---APNGVRVEGYTDNQPIATAQFPSNWE 218
+ SD+LF A L + L L S L + V V GYTD I + + N
Sbjct: 217 LKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDR--IGSDAY--NQG 272

Query: 219 LSAARAASVVHLFADDGVAPQRLAMVGYGEFRARADNSTEAGRNA---------NRRV 267
LS RA SVV G+ +++ G GE N+ + + +RRV
Sbjct: 273 LSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRV 330


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS11105  FLGHOOKFLIK   32     0.003    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 32.1 bits (72), Expect = 0.003
Identities = 37/160 (23%), Positives = 54/160 (33%), Gaps = 16/160 (10%)

Query: 28 PPAAVAVETAALAHADLALAEPPVQAAALPEAAAVSEAATSNAIAAVLS--------ADA 79
P + V A A+ + + E P + A + A+AAV AD
Sbjct: 63 PLISDIVSDAQQANLLIPVDETPPVINDEQSTSTPLTTAQTMALAAVADKNTTKDEKADD 122

Query: 80 IAADFLAEMDADPAFGPPVVAAPSAADIAADFLAEMDADPAFGTAPTAVDLLTADLLAEM 139
+ D A + A A P P D + L PT LT++ L
Sbjct: 123 LNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPT--------EKPTLFTKLTSEQLTTA 174

Query: 140 DADPAFGLETAPVAVPAPAPAPKPEPHAAPAPMRAAPAPT 179
D A G P+ K E + P+P+ AA +P
Sbjct: 175 QPDDAPGTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPL 214


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS11115  HTHFIS        85     8e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 85.3 bits (211), Expect = 8e-23
Identities = 33/119 (27%), Positives = 60/119 (50%), Gaps = 2/119 (1%)

Query: 3 ARILVVDDSASMRQMVSFALTSAGFAVEEAEDGAVALGRAKGQRFNAVVTDVNMPNMDGI 62
A ILV DD A++R +++ AL+ AG+ V + A + VVTDV MP+ +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 SLIRELRQLPDYKFTPMLMLTTESAADKKSEGKAAGATGWLVKPFNPEQLIATVQKVLG 121
L+ +++ P+L+++ ++ + GA +L KPF+ +LI + + L
Sbjct: 64 DLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS11120  PF06580       45     6e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 44.9 bits (106), Expect = 6e-07
Identities = 23/133 (17%), Positives = 37/133 (27%), Gaps = 50/133 (37%)

Query: 397 LVRNSIDHGLEMPDARRASGKDETGTITLAASHQGGHIVIEVSDDGRGLNRAKILEKAAE 456
LV N I HG+ + G I L + G + +EV + G +
Sbjct: 263 LVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK------ 308

Query: 457 RGIAVPDNPTDAQVWDLIFAPGFSTADAVTDLSGRGVGMDVVRRNIQGLGGE---VQLES 513
G G+ VR +Q L G ++L
Sbjct: 309 --------------------------------ESTGTGLQNVRERLQMLYGTEAQIKLSE 336

Query: 514 NAGSGTRVLIRLP 526
G ++ +P
Sbjct: 337 KQG-KVNAMVLIP 348


81  AXO1947_RS12265  AXO1947_RS12335  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS12265  -2      13       0.007528   hypothetical protein
AXO1947_RS12270  -1      12       -0.783812  *excinuclease ABC subunit B
AXO1947_RS12275  -2      13       -0.689683  fimbrial protein
AXO1947_RS12280  -1      13       -0.767069  *pilus assembly protein PilE
AXO1947_RS12285  -1      12       -1.268101  pilus assembly protein
AXO1947_RS12290  0       14       -2.057474  hypothetical protein
AXO1947_RS12295  0       16       -1.304893  pilus assembly protein
AXO1947_RS12300  0       16       -1.186703  pilus assembly protein PilW
AXO1947_RS12305  4       14       0.445468   type IV pilus modification protein PilV
AXO1947_RS12310  3       14       0.320605   pre-pilin like leader sequence
AXO1947_RS12315  3       16       0.419982   LOG family protein
AXO1947_RS12320  5       22       0.802529   IS5/IS1182 family transposase
AXO1947_RS12325  6       23       -0.212749  Oar protein
AXO1947_RS12330  6       23       -0.399041  membrane protein
AXO1947_RS12335  8       28       -4.459595  short-chain dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS22035  ABC2TRNSPORT  27     0.029    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein

signature.
Length = 262

Score = 26.8 bits (59), Expect = 0.029
Identities = 9/23 (39%), Positives = 13/23 (56%)

Query: 99 DTFQRLSHDVPLSEATDAIRSII 121
FQ + +PLS + D IR I+
Sbjct: 204 IVFQTAARFLPLSHSIDLIRPIM 226


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS12515  BCTERIALGSPH  31  0.002  Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 30.7 bits (69), Expect = 0.002
Identities = 14/75 (18%), Positives = 33/75 (44%), Gaps = 8/75 (10%)

Query: 10 RGYTAVQLLIVMAIVGIGAAIGIPSFKSLIEWQRATTRVHVLTAHLAMARSFAVTQGAPV 69
RG+T +++++++ ++G+ A + + +F + + A + A L + + G
Sbjct: 4 RGFTLLEMMLILLLMGVSAGMVLLAFPASRD-DSAAQTLARFEAQLRFVQQRGLQTGQFF 62

Query: 70 SICPSTDGVRCRTDR 84
GV DR
Sbjct: 63 -------GVSVHPDR 70


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS12525  BCTERIALGSPG  54  4e-12  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 53.7 bits (129), Expect = 4e-12
Identities = 22/72 (30%), Positives = 40/72 (55%), Gaps = 4/72 (5%)

Query: 11 LSRQLRQRAGTGGFTLIELMIVVAIIGILAAVAYPSYADYVRKSRRAQAKADLVEYSQLL 70
+ +QR GFTL+E+M+V+ IIG+LA++ P+ K+ + +A +D+V L
Sbjct: 1 MRATDKQR----GFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENAL 56

Query: 71 ERSHTTNNTYAS 82
+ N+ Y +
Sbjct: 57 DMYKLDNHHYPT 68


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS12540  BCTERIALGSPH  29  0.020  Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 29.1 bits (65), Expect = 0.020
Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 1/60 (1%)

Query: 13 GISLVEMMIAMVIGLVLMLGVIQVFSASRTASMLAEGSARAQENGRFAMDFLQRDIRMAG 72
G +L+EMM+ +++ V V+ F ASR S A+ AR + RF + + G
Sbjct: 5 GFTLLEMMLILLLMGVSAGMVLLAFPASRDDS-AAQTLARFEAQLRFVQQRGLQTGQFFG 63


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS12545  BCTERIALGSPG  28  0.011  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 27.9 bits (62), Expect = 0.011
Identities = 9/22 (40%), Positives = 18/22 (81%), Gaps = 2/22 (9%)

Query: 12 RTKGFSLLEVLIAIVVLAFGLL 33
+ +GF+LLE+++ IV++ G+L
Sbjct: 6 KQRGFTLLEIMVVIVII--GVL 25


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS12550  BCTERIALGSPG  37  1e-05  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 36.8 bits (85), Expect = 1e-05
Identities = 11/31 (35%), Positives = 21/31 (67%)

Query: 4 RRFAGFTLVELMITIVVLAILLTIAFPSFRG 34
+ GFTL+E+M+ IV++ +L ++ P+ G
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMG 35


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS12575  DHBDHDRGNASE  71  7e-17  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 71.2 bits (174), Expect = 7e-17
Identities = 43/194 (22%), Positives = 82/194 (42%), Gaps = 3/194 (1%)

Query: 12 ALAGRVVLITGAAGGLGAAAAQACAAAGATVVLLGRKLRPLERVYDAVAALGSEPLLYPL 71
+ G++ ITGAA G+G A A+ A+ GA + + LE+V ++ A +P
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 72 DLAGATPDDYATLARRLQTELGGLHGLLQCAADFAGLTPAELAAPADFARTLHVNLTARA 131
D+ + R++ E+G + L+ A + ++ T VN T
Sbjct: 65 DV--RDSAAIDEITARIEREMGPIDILV-NVAGVLRPGLIHSLSDEEWEATFSVNSTGVF 121

Query: 132 WLTQACLPLLRQQHDAAVVFVVDDPARVGQAYWGAYGAAQHAQRGLIASLHHETAAGPVR 191
+++ + + ++V V +PA V + AY +++ A L E A +R
Sbjct: 122 NASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181

Query: 192 VSGLQPGPMRTALR 205
+ + PG T ++
Sbjct: 182 CNIVSPGSTETDMQ 195


82  AXO1947_RS13020  AXO1947_RS13065  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS13020  -1      10       -0.120304  acriflavin resistance protein
AXO1947_RS13025  0       12       1.376359   RND transporter
AXO1947_RS13035  -1      13       1.554212   membrane protein
AXO1947_RS13040  0       12       2.007134   outer membrane channel protein
AXO1947_RS13045  -1      10       1.112167   DNA-binding response regulator
AXO1947_RS13050  1       12       1.609625   two-component sensor histidine kinase
AXO1947_RS13055  4       11       0.142946   potassium transporter
AXO1947_RS13060  1       13       -1.126040  serine hydrolase
AXO1947_RS13065  -1      11       -0.292959  ribonuclease activity regulator protein RraA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS13265  ACRIFLAVINRP  441  e-140  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 441 bits (1137), Expect = e-140
Identities = 231/1044 (22%), Positives = 426/1044 (40%), Gaps = 73/1044 (6%)

Query: 13 LTLFAAALILIGGIVAFVGFPSQEEPSVTVRDTLVSVAYPGMPSEQVENLLARPVEAQLR 72
A ++++ G +A + P + P++ VS YPG ++ V++ + + +E +
Sbjct: 11 FAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVIEQNMN 70

Query: 73 ELAGIKRIV-TTVRPGSAIVQLTAYDDVRDLPALWQRVRAKAAEAGAQLPAGTLGPFVDD 131
+ + + T+ GS + LT D +V+ K A LP +
Sbjct: 71 GIDNLMYMSSTSDSAGSVTITLTFQSGT-DPDIAQVQVQNKLQLATPLLPQEVQQQGISV 129

Query: 132 DFGRVS---VASIAVTAPGFSMSEMRGPL-RRMREQLYGVPGVEQVKVFGLQDERVYVSF 187
+ S VA PG + ++ + +++ L + GV V++FG + +
Sbjct: 130 EKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG-AQYAMRIWL 188

Query: 188 DRARLLASGLTPSSVMAQLRAQNVVGSGGQV----AVSG--LALTVATSGEIRTPEQLRD 241
D L LTP V+ QL+ QN + GQ+ A+ G L ++ + PE+
Sbjct: 189 DADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEEFGK 248

Query: 242 VLLSVPGASVGGAREVTLGELAQVQVMQADPPQSAAVYQGQPAVVVSVSMQPGSNIADVG 301
V L V V L ++A+V + + A G+PA + + + G+N D
Sbjct: 249 VTLRV----NSDGSVVRLKDVARV-ELGGENYNVIARINGKPAAGLGIKLATGANALDTA 303

Query: 302 KALRAKLDDTARQLPVGFTQHVVSFQADVVEREMGKMHHVMGETIVIVMAVVMLFLG-WR 360
KA++AKL + P G V+ + ++ + E I++V V+ LFL R
Sbjct: 304 KAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMR 363

Query: 361 TGLIVGAIVPLTIFASLIVMRVLDVELQTVSIAAIILALGLLVDNGIVIAEDIERRLV-A 419
LI VP+ + + ++ + T+++ ++LA+GLLVD+ IV+ E++ER ++
Sbjct: 364 ATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMED 423

Query: 420 GEERRQACIDAGRTLATPLLTSSLVIVLAFSPFFFGQTSTNEYLRSLAIVLGVTLLGSWL 479
++A + + L+ ++V+ F P F ST R +I + + S L
Sbjct: 424 KLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVL 483

Query: 480 LSITVTPLLCMYFAKVHVTKRDEAESRFYR-----------GYRRVIERVLQHKALFIGA 528
+++ +TP LC K + E + F+ Y + ++L ++
Sbjct: 484 VALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLI 543

Query: 529 MAAMLAVAITVLVSIPYDFLPKSDRLQFQMPVTLQAGSDARETLRTVSELSRW-LGDRRA 587
A ++A + + + +P FLP+ D+ F + L AG+ T + + +++ + L + +A
Sbjct: 544 YALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKA 603

Query: 588 NPEVVDSIGYVADGGPRIVLGLNPPLPAANQAYFTVSVRPGTD-------IDAVIARVRT 640
N E V ++ + G A N VS++P + +AVI R +
Sbjct: 604 NVESVFTVNGFSFSGQ-----------AQNAGMAFVSLKPWEERNGDENSAEAVIHRAKM 652

Query: 641 H---VRSHFPALRAEPKRFSLG-ATEAGMAVYRVVGPDEAVLRSSAAAIARALRAVPGTV 696
+R F P LG AT + G L + + P ++
Sbjct: 653 ELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASL 712

Query: 697 -DVQDDWQARIPRYVVQVDQLKARRAGVSSEDIAQALQARYSGVDATLIRDDGTDVPVIV 755
V+ + ++ ++VDQ KA+ GVS DI Q + G D G + V
Sbjct: 713 VSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYV 772

Query: 756 RGSAQERAANGNPAD--TLVYPQVGSAPVPLAAIATVLRDSEPSAIQRRNLSRAITVTAR 813
+ A+ R P D L VP +A T ++R N ++ +
Sbjct: 773 QADAKFR---MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGE 829

Query: 814 NLQ----LTATEIVERLSAPIAALKLPPGYSVEIGGELEDSAEANQALLHYMPHALGAIL 869
A ++E L++ KLP G + G + + + +
Sbjct: 830 AAPGTSSGDAMALMENLAS-----KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVF 884

Query: 870 LLFVWQFNSFRKLCIVLSAVPFVLIGAALALVLTGYPFGFMATFGLLALAGIIVNNAVLL 929
L + S+ V+ VP ++G LA L GLL G+ NA+L+
Sbjct: 885 LCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILI 944

Query: 930 LERI-EAELADGLPRREAVVAAAVKRLRPIVMTKLTCIVGLVPLMLFAGP---LWTGMAI 985
+E + +G EA + A RLRPI+MT L I+G++PL + G + I
Sbjct: 945 VEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGI 1004

Query: 986 TMIGGLALGTLVTLGLIPILYDLL 1009
++GG+ TL+ + +P+ + ++
Sbjct: 1005 GVMGGMVSATLLAIFFVPVFFVVI 1028



Score = 99.1 bits (247), Expect = 4e-23
Identities = 88/423 (20%), Positives = 159/423 (37%), Gaps = 30/423 (7%)

Query: 618 QAYFTVSVRPGTDIDAVIARVR---THVRSHFPALRAEPKRFSLGATEAGMAVYRVVGPD 674
T++ + GTD D +V+ P + ++ + + V V +
Sbjct: 87 SVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQQGISVEKSSSSYLMVAGFVSDN 146

Query: 675 EAVLRSS-----AAAIARALRAVPGTVDVQDDWQARIPRYVVQVDQLKARRAGVSSEDIA 729
+ A+ + L + G DVQ R + D L ++ D+
Sbjct: 147 PGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQYAMRIWLDADLLNKY--KLTPVDVI 204

Query: 730 QALQARYSGVDATLIRDDGTDVPVIVRGSAQERAANGNPAD----TLVYPQVGSAPVPLA 785
L+ + + A + + S + NP + TL GS V L
Sbjct: 205 NQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGS-VVRLK 263

Query: 786 AIATVLRDSEP-SAIQRRNLSRAITVTARNLQLTAT-EIVERLSAPIAALK--LPPGYSV 841
+A V E + I R N A + + + + + A +A L+ P G V
Sbjct: 264 DVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKV 323

Query: 842 EIGGELEDSAEANQALLHYMPHAL-GAILLLFVWQF---NSFRKLCIVLSAVPFVLIGAA 897
D+ Q +H + L AI+L+F+ + + R I AVP VL+G
Sbjct: 324 LY---PYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTF 380

Query: 898 LALVLTGYPFGFMATFGLLALAGIIVNNAVLLLERIEAELAD-GLPRREAVVAAAVKRLR 956
L GY + FG++ G++V++A++++E +E + + LP +EA + +
Sbjct: 381 AILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQG 440

Query: 957 PIVMTKLTCIVGLVPLMLFAG---PLWTGMAITMIGGLALGTLVTLGLIPILYDLLFGLR 1013
+V + +P+ F G ++ +IT++ +AL LV L L P L L
Sbjct: 441 ALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPV 500

Query: 1014 MRR 1016

Sbjct: 501 SAE 503


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS13270  RTXTOXIND  43  2e-06  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 42.5 bits (100), Expect = 2e-06
Identities = 16/134 (11%), Positives = 37/134 (27%), Gaps = 3/134 (2%)

Query: 67 GRLSAVLVDVGDRVTRGQVLARLDDEPLRLREQQADAHVRAALAQSGERQLQLRQQQAMF 126
+ ++V G+ V +G VL +L + + + A + Q+ R +
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNK 164

Query: 127 DDGASSNATLTAARAAADAASAQLQAARADLAMARRGTRLGELRAPFDGSVVARLQQPQA 186
+ + + + + EL A A
Sbjct: 165 LPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNL---DKKRAERLTVLA 221

Query: 187 DVAAGQTVLQVEGQ 200
+ + + +VE
Sbjct: 222 RINRYENLSRVEKS 235



Score = 33.3 bits (76), Expect = 0.002
Identities = 14/136 (10%), Positives = 34/136 (25%), Gaps = 9/136 (6%)

Query: 94 LRLREQQADAHVRAALAQSGERQLQLRQQQAMFDDGASSNATLTAARAAADAASAQLQAA 153
+ + + +S + Q L + +L
Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKN 321

Query: 154 RADLAMARRGTRLGELRAPFDGSVVA-RLQQPQADVAAGQTVLQVEGQGHVQLV-ATLPA 211
+ +RAP V ++ V +T++ + + V A +
Sbjct: 322 EERQQAS-------VIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQN 374

Query: 212 AAGADLVPGQTVRARL 227
+ GQ ++
Sbjct: 375 KDIGFINVGQNAIIKV 390


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS13285  HTHFIS  92  8e-24  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 8e-24
Identities = 35/130 (26%), Positives = 62/130 (47%), Gaps = 1/130 (0%)

Query: 1 MTGKKVLLVEDDADSASILEAYLRRDGFDVAMAGDGERAIQLHRQWAPDLVLLDVMLPKL 60
MTG +L+ +DDA ++L L R G+DV + + + DLV+ DV++P
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 SGIEVLSAIR-RASDTPVIMVTAIGDEPEKLGALRYGADDYVVKPYSPKEVVARVHAVLR 119
+ ++L I+ D PV++++A + A GA DY+ KP+ E++ + L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RSVAVRAPGE 129
+ E
Sbjct: 121 EPKRRPSKLE 130


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS13290  PF04335  29  0.040  VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 28.6 bits (64), Expect = 0.040
Identities = 7/32 (21%), Positives = 11/32 (34%)

Query: 6 RFHAWRNQAPLWWWVGLRMSVLAVLTMMVIAF 37
+ A L W V LA ++ +A
Sbjct: 24 KLAAAERSKKLAWVVAGVAGALATAGVVAVAA 55


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS13300  BLACTAMASEA  34  0.001  Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 34.0 bits (78), Expect = 0.001
Identities = 19/78 (24%), Positives = 34/78 (43%), Gaps = 5/78 (6%)

Query: 66 ADTLFAIASNTKAFTAASLSILADEGKLSLEDKVI----DHLPWFRMSDPYVSGEMRIRD 121
AD F + S K ++ D G LE K+ D + + +S+ +++ M + +
Sbjct: 58 ADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGE 117

Query: 122 LLAHRSGLS-LGAGDLLF 138
L A +S A +LL
Sbjct: 118 LCAAAITMSDNSAANLLL 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS13305  TYPE3OMBPROT  30  0.006  Type III secretion system outer membrane B protein ...
		>TYPE3OMBPROT#Type III secretion system outer membrane B protein

family signature.
Length = 538

Score = 29.7 bits (66), Expect = 0.006
Identities = 15/36 (41%), Positives = 16/36 (44%), Gaps = 3/36 (8%)

Query: 70 HALLGDQIAANAVANGWAGVLIHG---CVRDVEMLA 102
+LLGD N V GWA I C DV LA
Sbjct: 363 CSLLGDNFLKNGVIGGWAAEAIEKNPPCKNDVIYLA 398


83  AXO1947_RS14150  AXO1947_RS14190  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS14150  0       14       -1.663732  IS5/IS1182 family transposase
AXO1947_RS14160  -2      11       -0.690434  IS630 family transposase
AXO1947_RS14170  -1      10       -0.588629  IS5/IS1182 family transposase
AXO1947_RS14180  1       14       -1.451887  IS630 family transposase
AXO1947_RS14185  0       13       -1.300565  IS5/IS1182 family transposase
AXO1947_RS22265  -1      13       -1.065106  IS5/IS1182 family transposase
AXO1947_RS14190  -2      10       -1.104421  IS5/IS1182 family transposase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS14455  PF05043  35  4e-04  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 35.3 bits (81), Expect = 4e-04
Identities = 20/86 (23%), Positives = 34/86 (39%), Gaps = 14/86 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAHM 140
+ L+ H NTAH+
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAHL 325


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS14475  PF05043  35  8e-04  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 34.5 bits (79), Expect = 8e-04
Identities = 20/85 (23%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS14490  PF05043  33  0.002  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 33.4 bits (76), Expect = 0.002
Identities = 19/85 (22%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCLPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F C+ D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS14495  PF05043  35  5e-04  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 35.3 bits (81), Expect = 5e-04
Identities = 20/86 (23%), Positives = 34/86 (39%), Gaps = 14/86 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAHM 140
+ L+ H NTAH+
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAHL 325


84  AXO1947_RS17060  AXO1947_RS17085  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS17060  0       8        0.487268   membrane protein
AXO1947_RS17065  -1      7        -0.457993  transcriptional regulator
AXO1947_RS17070  -1      7        -0.408196  membrane protein
AXO1947_RS17075  -2      7        -0.593971  hydroxymethylbilane synthase
AXO1947_RS17080  -1      7        -0.676713  DNA-binding response regulator
AXO1947_RS17085  -2      10       -0.957294  histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS17395  BCTLIPOCALIN  52  1e-10  Bacterial lipocalin signature.
		>BCTLIPOCALIN#Bacterial lipocalin signature.

Length = 171

Score = 51.5 bits (123), Expect = 1e-10
Identities = 31/120 (25%), Positives = 56/120 (46%), Gaps = 7/120 (5%)

Query: 33 DLSKIMGTWYVIARMPNAVERGHVTSRDEYTLVEDGKVAVRYLYRDGFGEPE---KEVNA 89
+L+ +G WY +AR+ ++ ERG EY + DG ++V G+ E + KE
Sbjct: 30 ELNNYLGKWYEVARLDHSFERGLSQVTAEYRVRNDGGISV---LNRGYSEEKGEWKEAEG 86

Query: 90 RASVDADSGNRDWRVWFYKVIPAKQRILEIAPDG-SWMLISYPGRDLAWIFARTPDMSRD 148
+A S + +V F+ + E+ + S+ +S P + W+ +RTP + R
Sbjct: 87 KAYFVNGSTDGYLKVSFFGPFYGSYVVFELDRENYSYAFVSGPNTEYLWLLSRTPTVERG 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS17405  PF00577  28  0.048  Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 27.9 bits (62), Expect = 0.048
Identities = 20/98 (20%), Positives = 38/98 (38%), Gaps = 6/98 (6%)

Query: 31 DPAAALMPSPW-SGSSGEFGYAAANGNSTTDSLNGRVRLRYTDGDWIHSLDATALRSSSE 89
+ P W G + +GNS + + G Y + ++ A LR ++
Sbjct: 168 RARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTT 227

Query: 90 YTNTNEDGSTTRERQ-----TTAERYTGSVGSALQLGE 122
++ + D S+ + + T ER + S L LG+
Sbjct: 228 WSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGD 265


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS17415  HTHFIS  61  9e-13  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 60.6 bits (147), Expect = 9e-13
Identities = 29/122 (23%), Positives = 51/122 (41%), Gaps = 8/122 (6%)

Query: 1 MTATQVRVLIADDEPLARERLRMLLG-EHPQVEVIGEAENGQQVVQQCEHLHPDLVLLDI 59
MT +L+ADD+ R L L V + N + + DLV+ D+
Sbjct: 1 MTGA--TILVADDDAAIRTVLNQALSRAGYDVRI---TSNAATLWRWIAAGDGDLVVTDV 55

Query: 60 AMPGVDGLETARLLRQSQPPPAVVFCTAYD--QHALSAFDAAALDYLMKPVRPERLASAL 117
MP + + +++++P V+ +A + A+ A + A DYL KP L +
Sbjct: 56 VMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115

Query: 118 EK 119
+
Sbjct: 116 GR 117


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS17420  PF06580  155  2e-46  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 155 bits (393), Expect = 2e-46
Identities = 71/305 (23%), Positives = 130/305 (42%), Gaps = 22/305 (7%)

Query: 59 LWLALAVSVLLCVLRPRLSRLPPRLGGLAALLIAAVVAMLG---AGIVHGLYAVLGQAPL 115
+ L L + + R +L L L V+ M+ + L A + P+
Sbjct: 51 MGLVLTHAYRSFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTKPV 110

Query: 116 GALVGFWRFTLGSAGTVVLIT-ALALRYFYVS----------DRWEAQVQANARAEADAL 164
L VV++T +L YF D+W+ A A+ AL
Sbjct: 111 AFT---LPLALSIIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQ-EAQLMAL 166

Query: 165 QARIRPHFLFNSMNLIASLLRRDPVVAEQAVLDLSDLFRAALGAGEG-VSTLRAECELAE 223
+A+I PHF+FN++N I +L+ DP A + + LS+L R +L +L E + +
Sbjct: 167 KAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVD 226

Query: 224 RYLAIESLRLGERLQVRWHRQEPLPWELPMPRLVLQPLVENAVLHGVSRMPEGGTLYLSL 283
YL + S++ +RLQ + ++ +P +++Q LVEN + HG++++P+GG + L
Sbjct: 227 SYLQLASIQFEDRLQFENQINPAI-MDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKG 285

Query: 284 RQRGSQLQIRIVNPAPQPGTQLPLLAGAGHAQASISHRLAFQFGAGARMAASWAEGYYAC 343
+ + + + N G ++ RL +G A++ S +G
Sbjct: 286 TKDNGTVTLEVENTGSLALKNTKE--STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNA 343

Query: 344 EITLP 348
+ +P
Sbjct: 344 MVLIP 348


85  AXO1947_RS17760  AXO1947_RS17785  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS17760  2       13       1.803586   hypothetical protein
AXO1947_RS17770  -1      16       0.263571   membrane protein
AXO1947_RS17775  -1      17       -0.480571  response regulator receiver protein
AXO1947_RS17780  -2      17       -0.679904  hybrid sensor histidine kinase/response
AXO1947_RS17785  -2      21       -1.259041  bacterioferritin
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18240  GPOSANCHOR  37  4e-05  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 37.4 bits (86), Expect = 4e-05
Identities = 20/79 (25%), Positives = 29/79 (36%), Gaps = 1/79 (1%)

Query: 55 EAALQQAQRSQAQQRRQIEQLQQRQVNLAMSDKISRAANTEVQASLAERDEQIAALRADV 114
A Q +R R +QL+ L +KIS A+ ++ L E L A+
Sbjct: 308 NANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEH 367

Query: 115 AFYERLVG-STAQRKGLNA 132
E S A R+ L
Sbjct: 368 QKLEEQNKISEASRQSLRR 386


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18250  HTHFIS  66  1e-13  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.4 bits (162), Expect = 1e-13
Identities = 26/123 (21%), Positives = 54/123 (43%), Gaps = 2/123 (1%)

Query: 65 RVLIVEDDRSQALFAQSVLHGAGMHAQVEMTPASVPQAIQDYHPDLILMDLHMPELDGIR 124
+L+ +DD + L AG ++ A++ + I DL++ D+ MP+ +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 125 LTTLIRQQPGQQLLPIVFLTGDPDPERQFEVLDSGADDFLTKPIRPRHLIAAVSNRIRRA 184
L I++ + LP++ ++ + + GA D+L KP LI + +
Sbjct: 65 LLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 185 RRQ 187
+R+
Sbjct: 123 KRR 125


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18255  HTHFIS  57  2e-10  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 57.1 bits (138), Expect = 2e-10
Identities = 24/129 (18%), Positives = 48/129 (37%), Gaps = 5/129 (3%)

Query: 498 RILLVEDNPVNLLIAQKLLAVLGFEADTATDGEAALTRMESICYDMVFMDCQMPVLDGYA 557
IL+ +D+ + + L+ G++ ++ + + D+V D MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 558 ATRRWRAMETESGGRPIPIVAMTANAMAGDRERCLAAGMDDYLSKPVAREQLDACLQRWL 617
R + + +P++ M+A + G DYL KP +L + R L
Sbjct: 65 LLPRIKKARPD-----LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 618 PRQTLLPGP 626
P
Sbjct: 120 AEPKRRPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18260  HELNAPAPROT  29  0.005  Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family

signature.
Length = 153

Score = 29.1 bits (65), Expect = 0.005
Identities = 21/110 (19%), Positives = 44/110 (40%), Gaps = 11/110 (10%)

Query: 37 LKELAEREYKESIDEMKHADKLSDRILFLEGLPNF---QALGKLRIGENP-----TEMFR 88
L E E Y + + D +++R+L + G P + I + +EM +
Sbjct: 46 LHEKFEELYDHAAE---TVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQ 102

Query: 89 CDLTLEREAVVVLREAVAYAETVKDYVSRQLLVDILESEEEHIDWLETQL 138
+ ++ + + AE +D + L V ++E E+ + L + L
Sbjct: 103 ALVNDYKQISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWMLSSYL 152


86  AXO1947_RS18065  AXO1947_RS18100  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS18065  1       19       5.470032   IS5/IS1182 family transposase
AXO1947_RS18070  1       20       4.974870   IS5/IS1182 family transposase
AXO1947_RS18080  0       20       3.946741   hypothetical protein
AXO1947_RS18085  -1      16       3.269961   pyridine nucleotide-disulfide oxidoreductase
AXO1947_RS18090  -1      18       2.302651   hypothetical protein
AXO1947_RS18095  3       14       -0.007203  3-oxoacyl-ACP reductase
AXO1947_RS18100  1       11       0.931585   IS5/IS1182 family transposase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18520  PF05043  35  7e-04  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 34.9 bits (80), Expect = 7e-04
Identities = 20/85 (23%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCVPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F CV D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18525  PF05043  33  0.002  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 33.4 bits (76), Expect = 0.002
Identities = 19/85 (22%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCLPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F C+ D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18540  DHBDHDRGNASE  113  1e-32  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 113 bits (284), Expect = 1e-32
Identities = 75/260 (28%), Positives = 113/260 (43%), Gaps = 9/260 (3%)

Query: 1 MSNTALRPQRVLIAGGSRGIGLAIAEGFVRNGAQVSICARTAAGLAQAAAALAAHGAPVH 60
M+ + + I G ++GIG A+A GA ++ L + ++L A
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 61 TRPCDLADATQIDAYVHAAAQALGGLDVVINNAS----GFGHGNDDASWQAGLDVDLMAA 116
P D+ D+ ID + +G +D+++N A G H D W+A V+
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 117 VRCNRAALPYLRSSDAAVILNISSINAQRPTPRAIAYSTAKAALNYYTTTLAAELARERI 176
+R+ Y+ + I+ + S A P AY+++KAA +T L ELA I
Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180

Query: 177 RVNAISPGSIE--FPDGLWDKRSREEPELY---ARIRDSIPFGGFGQVQHVADAALFLAS 231
R N +SPGS E LW + E + + IP + +ADA LFL S
Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240

Query: 232 PQASWITGQVLAVDGGQSLG 251
QA IT L VDGG +LG
Sbjct: 241 GQAGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18545  PF05043  33  0.002  Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 33.4 bits (76), Expect = 0.002
Identities = 19/85 (22%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCLPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F C+ D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


87  AXO1947_RS18200  AXO1947_RS18265  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag         DNBias  CDNBias  %GCBias    Product
AXO1947_RS18200  1       10       -0.214996  EscC/YscC/HrcC family type III secretion system
AXO1947_RS18205  0       17       -1.307863  EscT/YscT/HrcT family type III secretion system
AXO1947_RS18210  0       15       -1.789416  type III secretion protein HrpB7
AXO1947_RS22775  1       18       -0.896500  EscN/YscN/HrcN family type III secretion system
AXO1947_RS18215  -1      14       0.000611   ATP-dependent helicase HrpB
AXO1947_RS18220  -1      15       0.318222   type III secretion protein HrpB4
AXO1947_RS18225  -1      10       1.065875   EscJ/YscJ/HrcJ family type III secretion inner
AXO1947_RS18230  -1      15       2.736068   type III secretion protein HrpB2
AXO1947_RS18235  -1      15       2.278992   HPr kinase
AXO1947_RS18240  0       18       2.234459   EscU/YscU/HrcU family type III secretion system
AXO1947_RS18245  -1      18       2.199871   hypersensitivity response secretion protein
AXO1947_RS18250  -2      20       1.663413   type III secretion protein HpaP
AXO1947_RS18255  -1      20       0.397023   aldolase
AXO1947_RS18260  3       21       -0.855846  EscR/YscR/HrcR family type III secretion system
AXO1947_RS18265  2       21       0.177105   EscS/YscS/HrcS family type III secretion system
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18645  TYPE3OMGPROT  333  e-108  Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein

family signature.
Length = 607

Score = 333 bits (855), Expect = e-108
Identities = 100/299 (33%), Positives = 158/299 (52%), Gaps = 13/299 (4%)

Query: 309 SKARRDESNPIDAGGGAELASDAPVIEADPRTNAILIRDRPERMQSYGTLIQQLDNRPKL 368
+ ++ + A AS +EADP NAI++RD PERM Y LI LD
Sbjct: 222 ATIQQVTVDNQRIPQAATRASAQARVEADPSLNAIIVRDSPERMPMYQRLIHALDKPSAR 281

Query: 369 LQIDATIIEIRDGAMQDLGVDWRFHSQHTDIQTGDGRGGQLGFNGVLSGAATDGATTPVG 428
+++ +I++I + +LGVDWR I+TG+ + G S A++GA
Sbjct: 282 IEVALSIVDINADQLTELGVDWR-----VGIRTGNNHQVVIKTTGDQSNIASNGAL---- 332

Query: 429 GTLTAVLGDAGRYLMTRVSALETTNKAKIVSSPQVATLDNVEAVMDHKQQAFVRVSGYAS 488
G+L G YL+ RV+ LE A++VS P + T +N +AV+DH + +V+V+G
Sbjct: 333 GSLVDARGL--DYLLARVNLLENEGSAQVVSRPTLLTQENAQAVIDHSETYYVKVTGKEV 390

Query: 489 ADLYNLSAGVSLRVLPSVVPGSPNGQMRLDVRIEDGQLGSNT--VDGIPVITSSEIKTQA 546
A+L ++ G LR+ P V+ ++ L++ IEDG N+ ++GIP I+ + + T A
Sbjct: 391 AELKGITYGTMLRMTPRVLTQGDKSEISLNLHIEDGNQKPNSSGIEGIPTISRTVVDTVA 450

Query: 547 FVNEGQSLLIAGYAYDADETDLNAVPGLSKIPLLGNLFKHRQKSGSRMQRLFLLTPHVV 605
V GQSL+I G D L+ VP L IP +G LF+ + + R RLF++ P ++
Sbjct: 451 RVGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGALFRRKSELTRRTVRLFIIEPRII 509



Score = 239 bits (611), Expect = 6e-73
Identities = 69/212 (32%), Positives = 110/212 (51%), Gaps = 6/212 (2%)

Query: 15 LAAVLLLSLLPLFSPQADAAQVPWHSRTFKYVADNKDLKEVLRDLSASQSIATWISPEVT 74
VL +LL L S + A ++ W + YVA + L+++L D A+ +S ++
Sbjct: 9 FKRVLTGTLL-LLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKIN 67

Query: 75 GTLSGKFE-TSPQKFLDDLAATYGFVWYYDGAVLRIWGANESKSATLSLGTASTKSLRDA 133
+SG+FE +PQ FL +A+ Y VWYYDG VL I+ +E S + L + L+ A
Sbjct: 68 DKVSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQA 127

Query: 134 LARMRLDDSRFPVRYDEAAHVAVVSGPPGYVDTVSAIARQVEQGARQR----DATEVQVF 189
L R + + RF R D + + VSGPP Y++ V A +EQ + R A +++F
Sbjct: 128 LQRSGIWEPRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEIF 187

Query: 190 QLHYAQAADHMTRIGGQDVQIPGMASLLRSIY 221
L YA A+D +V PG+A++L+ +
Sbjct: 188 PLKYASASDRTIHYRDDEVAAPGVATILQRVL 219


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18650  TYPE3IMRPROT  174  1e-55  Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 174 bits (442), Expect = 1e-55
Identities = 51/240 (21%), Positives = 105/240 (43%), Gaps = 3/240 (1%)

Query: 8 LLALSSQGVSLLTLLALCGVRVFVMFIVLPATAQDSLPGIARNGVIYVLSSFIAYGQPAD 67
L S Q +S L L +RV + P ++ S+P + G+ +++ IA PA+
Sbjct: 2 LQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPAN 61

Query: 68 ALAKIQTVGLVGVVFKEAFIGLLMGFAASTVFWIAESVGLLIDDLAGYNNVQMTNPLSGQ 127
+ L + ++ IG+ +GF F + G +I G + +P S
Sbjct: 62 DVPVFSFFAL-WLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120

Query: 128 QSTPVSTVLLQLAIVSFYALGGMLMLLGALFESFRWWPLTQLGPNMGAVAESFVIQQYDS 187
++ ++ LA++ F G L L+ L ++F P+ + + A + +
Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGG--EPLNSNAFLALTKAGSL 178

Query: 188 MMAAVVKLSAPVMLVLVLVDLAIGLVARAADKLEPSNLSQPIRGVLALLLLALLTSVFIA 247
+ + L+ P++ +L+ ++LA+GL+ R A +L + P+ + + L+A L +
Sbjct: 179 IFLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAP 238


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AXO1947_RS18655  IGASERPTASE  28  0.014  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.5 bits (63), Expect = 0.014
Identities = 11/62 (17%), Positives = 23/62 (37%)

Query: 93 AEQAQAAADQSLQSARDELASVQQALSKLQAQAQVYADKAASARRARQAQRDAAEEEDAI 152
+E + A+ S Q ++ + Q A +V + ++ + Q A +
Sbjct: 1034 SETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETK 1093

Query: 153 ET 154
ET
Sbjct: 1094 ET 1095


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS18665  FLGFLIH  31     0.002    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 31.3 bits (70), Expect = 0.002
Identities = 55/239 (23%), Positives = 88/239 (36%), Gaps = 47/239 (19%)

Query: 4 WLRSTPDAIGMDCDVIPREALASVLALDAVTV--EVHARCEQALSQAQARAQTLIDEAQQ 61
W TPD D+ P +A + T+ E EQ L+Q Q +A +Q
Sbjct: 7 WKTWTPD------DLAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAH------EQ 54

Query: 62 QAEAILHDARQKAERSARLGYAAGLRRQLDEWNESGLRHAFAAETAAHRARERLAEIVAR 121
+A + + RQ+ + GY GL + L E GL A + + H ++L
Sbjct: 55 GYQAGIAEGRQQGHKQ---GYQEGLAQGL----EQGLAEAKSQQAPIHARMQQLVSEFQT 107

Query: 122 TCEHI------------------ILGHDPA----ALYARAAQALEGALDEAKALRVSVHP 159
T + + ++G P AL + Q L+ + ++ VHP
Sbjct: 108 TLDALDSVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHP 167

Query: 160 DALDAARRAFDAAATEAGWTLQVELCGDADLAVGACVCEWDTGVFETDLRDQLRSLRRV 218
D L A + GW L+ GD L G C D G + + + + L R+
Sbjct: 168 DDLQRVDDMLGATLSLHGWRLR----GDPTLHPGGCKVSADEGDLDASVATRWQELCRL 222


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS18675  FLGMRINGFLIF  80     3e-19    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 79.6 bits (196), Expect = 3e-19
Identities = 42/188 (22%), Positives = 81/188 (43%), Gaps = 11/188 (5%)

Query: 3 ALRYLVVLLLALLLSACSQQ---LYSGLTENDANDMLEVLLHAGVDASKVTPDDGKTWAV 59
A V +++A++L A + L+S L++ D ++ L + + G A+
Sbjct: 30 AGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPY-RFANGSG---AI 85

Query: 60 NAPHDQVSYSLEALRAHGLPHERHANLG-EMFKKDGLISTPTEERVRFIYGVSQQLSQTL 118
P D+V L GLP + +G E+ ++ + E+V + + +L++T+
Sbjct: 86 EVPADKVHELRLRLAQQGLP--KGGAVGFELLDQEKFGISQFSEQVNYQRALEGELARTI 143

Query: 119 SNIDGVISADVEIVLPNNDPLATSVKPSSAAVFIKFRVGSDLT-SLVPNIKTMVMHSVEG 177
+ V SA V + +P K SA+V + G L + + +V +V G
Sbjct: 144 ETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAG 203

Query: 178 LTYENVSV 185
L NV++
Sbjct: 204 LPPGNVTL 211


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS18690  TYPE3IMSPROT  331    1e-114   Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 331 bits (850), Expect = e-114
Identities = 112/345 (32%), Positives = 191/345 (55%), Gaps = 2/345 (0%)

Query: 1 MSEEKTEKPTEKKLRDARKDGEVPVSPDVTAAAVLFGALLVMKSAGDYFSDHMRALTRIG 60
MS EKTE+PT KK+RDARK G+V S +V + A++ ++ DY+ +H L I
Sbjct: 1 MSGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIP 60

Query: 61 FDFPENTRDATAINRALAHIGIQGLLLMLPFLAACLVAGLVGGAFQTGLNASLKPVAPKF 120
+ + A++ + ++ ++ L P L + + Q G S + + P
Sbjct: 61 AE-QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDI 119

Query: 121 DSLNPANGVKKLFSLRSLINLLKLIIKAVLIGVVLWVGIRALMPMIIGLAYETPLDISQI 180
+NP G K++FS++SL+ LK I+K VL+ +++W+ I+ + ++ L I+ +
Sbjct: 120 KKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPL 179

Query: 181 AWRTLSMLFALGVLLFILVGAADWSVQHWLFIRDKRMSKDEQKREHKESEGDPEIKGKRK 240
+ L L + + F+++ AD++ +++ +I++ +MSKDE KRE+KE EG PEIK KR+
Sbjct: 180 LGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRR 239

Query: 241 EFAKELVFGDPRERVAKAKVMVVNPTHYAVALAYEPDDFGVPQVVAKGVDDGALELRAFA 300
+F +E+ + RE V ++ V+V NPTH A+ + Y+ + +P V K D +R A
Sbjct: 240 QFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIA 299

Query: 301 HNQGIPIVANPPLARALY-QVELGDAIPEQLFETVAVVLRWVDEL 344
+G+PI+ PLARALY + IP + E A VLRW++
Sbjct: 300 EEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQ 344


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS18705  TYPE3OMOPROT  65     3e-14    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature.

Length = 303

Score = 65.0 bits (158), Expect = 3e-14
Identities = 40/167 (23%), Positives = 73/167 (43%), Gaps = 16/167 (9%)

Query: 144 PAQLPAWLAALRVNTRLRIGGRTASAALLQSLRPGDVLLHCTAAAAVTSGEVLWG----I 199
PA LR R IG +LL + GDVLL T+ A V G +
Sbjct: 138 PAVGGGRPKMLRWPLRFVIGSSDTQRSLLGRIGIGDVLLIRTSRAEVYCYAKKLGHFNRV 197

Query: 200 AGGAVLRAPVRLNMQQMILEASPTMQHDTFEPEVAPSTSNVAELELPVQLEVDQLALSLS 259
GG ++ L++Q + E + T E A + + +L + ++ + + ++L+
Sbjct: 198 EGGIIVET---LDIQHIEEENNTT--------ETAETLPGLNQLPVKLEFVLYRKNVTLA 246

Query: 260 TLSGLQPGQILELSVPVDQADIRLVVYGQTIGTGRLLAVGEHLGVQI 306
L + Q+L L + ++ ++ G +G G L+ + + LGV+I
Sbjct: 247 ELEAMGQQQLLSLPTNAEL-NVEIMANGVLLGNGELVQMNDTLGVEI 292


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS18710  TYPE3IMPPROT  244    9e-85    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 244 bits (625), Expect = 9e-85
Identities = 79/219 (36%), Positives = 129/219 (58%), Gaps = 8/219 (3%)

Query: 3 MPDVGSLLLVVIMLGLLPFAAMVVTSYTKIVVVLGLLRNAIGVQQVPPNMVLNGVALLVS 62
M + SL+ ++ LLPF T + K +V ++RNA+G+QQ+P NM LNGVALL+S
Sbjct: 1 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS 60

Query: 63 CFVMAPVGMEAFKA-AQNYGAGSDNSRIVVLLDACREPFRQFLLKHTREREKAFFMRSAQ 121
FVM P+ +A+ +D S + +D + +R +L+K++ FF +
Sbjct: 61 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL 120

Query: 122 QIWPKDKAAT-------LKSDDLLILAPAFTLGELTEAFRIGFLLYLVFIVIDLVVANAL 174
+ ++ T ++ + L PA+ L E+ AF+IGF LYL F+V+DLVV++ L
Sbjct: 121 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL 180

Query: 175 MAMGLSQVTPTNVAIPFKLLLFVAMDGWSMLIHGLVLSY 213
+A+G+ ++P ++ P KL+LFVA+DGW++L GL+L Y
Sbjct: 181 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits          Score  E-value  Comments
AXO1947_RS18715  TYPE3IMQPROT  64     3e-17    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 63.6 bits (155), Expect = 3e-17
Identities = 25/78 (32%), Positives = 44/78 (56%)

Query: 4 DDLVRFTSEALLLCLKVSLPVVGVAALTGLLIAFFQAVMSLQDASISFALKLVVVVAAIA 63
DDLV ++AL L L +S VA + GLL+ FQ V LQ+ ++ F +KL+ V +
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 VTAPWGASAIMQFGQALM 81
+ + W ++ +G+ ++
Sbjct: 62 LLSGWYGEVLLSYGRQVI 79


88  AXO1947_RS19030  AXO1947_RS19065  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
AXO1947_RS19030421-0.400448tetracycline resistance MFS efflux pump
AXO1947_RS19035423-0.692479epimerase
AXO1947_RS22855133-5.919968hypothetical protein
AXO1947_RS22860-125-4.058658hypothetical protein
AXO1947_RS19040-125-4.023156hypothetical protein
AXO1947_RS19045-124-3.324503IS5/IS1182 family transposase
AXO1947_RS19050-216-0.637863hypothetical protein
AXO1947_RS22865-130-0.080341ATP-binding protein
AXO1947_RS19060-1230.920877hypothetical protein
AXO1947_RS19065-2210.581646hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS19565  TCRTETA  243    6e-79    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 243 bits (622), Expect = 6e-79
Identities = 150/403 (37%), Positives = 222/403 (55%), Gaps = 19/403 (4%)

Query: 17 ALIFIFITVLIDVLSFGVIIPVLPGLVRHFTGGDYVQAAVWIGWFGFLFAAIQFVCSPLQ 76
LI I TV +D + G+I+PVLPGL+R + G L+A +QF C+P+
Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSN--DVTAHYGILLALYALMQFACAPVL 63

Query: 77 GALSDRFGRRPVILLSCLG--LDFILMAVAHSLPMLLLARVISGVCSASFSTATAYIADV 134
GALSDRFGRRPV+L+S G +D+ +MA A L +L + R+++G+ A+ + A AYIAD+
Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADI 123

Query: 135 TPADKRAGAFGMLGAAFGIGLVAGPLIGGWLGSMGLRWPFWFAAGLALLNVLYGWFVLPE 194
T D+RA FG + A FG G+VAGP++GG +G PF+ AA L LN L G F+LPE
Sbjct: 124 TDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPE 183

Query: 195 SLPVERRTARLDWSHANPLGALKLLRRYPQVFGLASAVFLANLAHYVYPSIFLLFAGYQY 254
S ERR R + NPL + + R V L + F+ L V +++++F ++
Sbjct: 184 SHKGERRPLRREAL--NPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRF 241

Query: 255 HWGPREVSWVLAGVGVCSIIVNVLLVGRLVRWLGERRALMLGLGCGVIGFVIYGLADSGA 314
HW + LA G+ + ++ G + LGERRALMLG+ G+++ A G
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 315 AFLIGVPISALWALAAPSAQALITREVGADAQGRVQGALTGLVSLAGIAGPLLFANVFAW 374
+ + A + P+ QA+++R+V + QG++QG+L L SL I GPLLF ++A
Sbjct: 302 MAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361

Query: 375 FIGS--------GAPLHLPGAPWLLAGVLLAAGWGMAWKRAGR 409
I + GA L+L P L G+ W A +RA R
Sbjct: 362 SITTWNGWAWIAGAALYLLCLPALRRGL-----WSGAGQRADR 399


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits  Score  E-value  Comments
AXO1947_RS19580  SECA  34     1e-04    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 33.7 bits (77), Expect = 1e-04
Identities = 10/17 (58%), Positives = 11/17 (64%)

Query: 7 NDPCPCGRPADYARCCG 23
NDPCPCG Y +C G
Sbjct: 882 NDPCPCGSGKKYKQCHG 898


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits     Score  E-value  Comments
AXO1947_RS19590  PF05043  33     0.002    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 33.4 bits (76), Expect = 0.002
Identities = 19/85 (22%), Positives = 33/85 (38%), Gaps = 14/85 (16%)

Query: 68 IAGLLYLKHAYDLSDEAVCERWLENPYWQFFTGEVVFQTCLPCDPSSLTRWRQRLGEAGM 127
+A ++ L +E VC+ ++ FF E +F C+ D S + + L +
Sbjct: 241 VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKD-SYVEKSYHLLSDFID 299

Query: 128 E-------------ELLAHTINTAH 139
+ L+ H NTAH
Sbjct: 300 QISVKYQIEIENKDNLIWHLHNTAH 324


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS19600  RTXTOXIND  36     7e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 36.3 bits (84), Expect = 7e-04
Identities = 15/130 (11%), Positives = 32/130 (24%), Gaps = 8/130 (6%)

Query: 306 REQRRLALLDARLHALDVNDQGLAGEEGQRRAAVDNHQQRLSDLEAQRRSQGGERIDALE 365
Q + + L + + + RL D + Q + LE
Sbjct: 197 TWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLE 256

Query: 366 REQ--LQVQGELARRSDKRAKAEQACRQLDQSLADNAHGFAEQ------SAQARAALEDG 417
+E ++ EL + + E + F + L
Sbjct: 257 QENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTL 316

Query: 418 QRLAAEQDEA 427
+ E+ +
Sbjct: 317 ELAKNEERQQ 326


ORFs having significant similarity with Known Virulence factors
LocusTag         Hits       Score  E-value  Comments
AXO1947_RS19610  FLAGELLIN  30     0.031    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 29.6 bits (66), Expect = 0.031
Identities = 23/80 (28%), Positives = 37/80 (46%), Gaps = 5/80 (6%)

Query: 104 TQAIQAIRFADGLEQ-ARGAATESRLAL-VMQQLSQLAALTETNPDARLSALRDERDRID 161
TQA + + Q GA E L +++LS + A TN D+ L +++DE +
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELS-VQATNGTNSDSDLKSIQDEIQQRL 119

Query: 162 AEIARVAAGKVASLDGKRAL 181
EI RV+ +G + L
Sbjct: 120 EEIDRVSNQ--TQFNGVKVL 137



 
Contact Sachin Pundhir for Bugs/Comments.