PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: 2755.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
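The E-value cutoff among the input parameters can be read as a simple filter on RPS-BLAST hits. The following is a minimal Python sketch, assuming that hits at or below the cutoff are kept (this page does not state whether the comparison is strict or inclusive). The function name `filter_hits` and the tuple layout are hypothetical; the first three sample hits are taken from results reported further down this page, and the last entry is a made-up placeholder used only to show a rejected hit.

```python
# Hypothetical helper: keep virulence-factor hits whose E-value passes the cutoff.
E_VALUE_CUTOFF = 0.05  # the "E-value (RPSBlast)" input parameter on this page


def filter_hits(hits, cutoff=E_VALUE_CUTOFF):
    """Return the (locus_tag, hit_name, e_value) tuples with e_value <= cutoff.

    The inclusive comparison is an assumption, not documented tool behaviour.
    """
    return [hit for hit in hits if hit[2] <= cutoff]


hits = [
    ("VP0057", "YERSSTKINASE", 0.002),   # reported on this page
    ("VP0058", "PF04335", 0.047),        # reported on this page
    ("VP0190", "LPSBIOSNTHSS", 5e-73),   # reported on this page
    ("XX0000", "PLACEHOLDER", 0.2),      # made-up entry, illustration only
]

kept = filter_hits(hits)
for locus_tag, hit_name, e_value in kept:
    print(locus_tag, hit_name, e_value)
```

All hits listed on this page have E-values at or below 0.05, which is consistent with such a filter.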
 
C) Potential GIs and PAIs in NC_004603
S.No    Start     End       Bias    Virulence    Insertion elements    Prediction
1       VP0057    VP0071    Y       Y            N                     Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VP0057    2       21       3.627546    hypothetical protein
VP0058    3       23       4.071868    gluconate utilization system Gnt-I
VP0059    2       22       4.253270    LysR family transcriptional regulator
VP0060    2       22       4.402069    multidrug transmembrane resistance signal
VP0061    2       23       4.633329    multidrug transmembrane resistance signal
VP0062    1       21       4.148259    phosphogluconate dehydratase
VP0063    -1      21       2.897539    thermoresistant gluconokinase
VP0064    0       23       3.270019    gluconate permease
VP0065    -1      26       3.757664    keto-hydroxyglutarate-aldolase/keto-deoxy-
VP0066    -2      25       3.457308    purine nucleoside phosphorylase
VP0067    -1      24       3.150633    LysR family transcriptional regulator
VP0068    0       25       3.862979    glutathione reductase
VP0069    1       27       3.966247    hypothetical protein
VP0070    1       26       3.679867    oligopeptidase A
VP0072    2       26       2.973545    DNA-binding transcriptional regulator AsnC
VP0071    2       23       2.562812    sensory box/GGDEF family protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0057    YERSSTKINASE    34    0.002    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 33.6 bits (76), Expect = 0.002
Identities = 23/74 (31%), Positives = 37/74 (50%), Gaps = 9/74 (12%)

Query: 192 HAHHHHVLHADLKPENILID-PNKRPKLLDFNLTQKVQSNGGESPPALIAFSQNFASPEQ 250
H V+H D+KP N++ D + P ++D L S GE P F+++F +PE
Sbjct: 260 HLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGL----HSRSGEQPK---GFTESFKAPEL 312

Query: 251 QTGQF-LTAQSDVY 263
G + +SDV+
Sbjct: 313 GVGNLGASEKSDVF 326
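
Each alignment on this page is preceded by a `Score = ... Expect = ...` line and an `Identities = ...` line. The following is a minimal Python sketch of how those two lines could be parsed to recover the bit score, E-value, and percent identity. The function name `parse_alignment_stats` and the returned dictionary keys are hypothetical, and the regular expressions only cover the line layout seen on this page.

```python
import re

# Patterns for the two statistics lines shown above each alignment on this page.
SCORE_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)")


def parse_alignment_stats(text):
    """Extract score, E-value and identity figures from a BLAST-style stats block."""
    score = SCORE_RE.search(text)
    ident = IDENT_RE.search(text)
    if score is None or ident is None:
        raise ValueError("statistics lines not found")
    evalue_text = score["evalue"]
    # Legacy BLAST sometimes prints values such as "e-104" without a leading digit.
    if evalue_text.lower().startswith("e"):
        evalue_text = "1" + evalue_text
    matches = int(ident["ident"])
    length = int(ident["length"])
    return {
        "bits": float(score["bits"]),
        "raw_score": int(score["raw"]),
        "evalue": float(evalue_text),
        "identities": matches,
        "aligned_length": length,
        "percent_identity": 100.0 * matches / length,
    }


# Statistics lines copied from the VP0057 / YERSSTKINASE hit on this page.
example = """Score = 33.6 bits (76), Expect = 0.002
Identities = 23/74 (31%), Positives = 37/74 (50%), Gaps = 9/74 (12%)"""

stats = parse_alignment_stats(example)
print(stats)
```

Computing the percentage from the raw counts (23/74) rather than reading the rounded figure in parentheses keeps the full precision.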


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0058    PF04335    28    0.047    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 28.3 bits (63), Expect = 0.047
Identities = 15/76 (19%), Positives = 26/76 (34%), Gaps = 10/76 (13%)

Query: 3 LLWILCLAFGGLSASCVFVIMLHVTL------LI---NHIFMAQQNKKTRTTLQDVADQV 53
L W++ G L+ + V + L +I + A K D+
Sbjct: 34 LAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRNTGEASIAAKLHGDATITYDEA 93

Query: 54 GVTKMTVSRYMRNPES 69
V K ++ Y+R E
Sbjct: 94 -VRKYFLATYVRYREG 108


2       VP0186    VP0233    Y       Y            N                     Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VP0186    -1      15       -3.129453   50S ribosomal protein L33
VP0187    -2      12       -2.718518   Dca
VP0188    -2      14       -2.281373   hypothetical protein
VP0189    -2      15       -1.856943   formamidopyrimidine-DNA glycosylase
VP0190    -2      17       -2.253525   phosphopantetheine adenylyltransferase
VP0191    -2      17       -2.352630   lipopolysaccharide A protein
VP0192    -1      15       -2.998645   hypothetical protein
VP0193    2       20       -3.621781   lipopolysaccharide biosynthesis
VP0194    1       23       -3.734094   lipopolysaccharide biosynthesis protein
VP0195    1       25       -4.406433   diacylglycerol kinase
VP0196    1       26       -4.709837   3-deoxy-D-manno-octulosonic-acid kinase
VP0197    1       27       -5.071425   capsular polysaccharide biosynthesis protein D
VP0198    2       28       -4.748024   aminotransferase
VP0199    2       27       -4.787989   NeuC protein
VP0200    1       25       -4.894810   N-acetylneuraminic acid synthetase
VP0201    1       25       -4.902046   acetyltransferase
VP0202    0       24       -4.922409   sugar-phosphate nucleotide transferase
VP0203    1       28       -7.077313   CMP-N-acetlyneuraminic acid synthetase
VP0204    1       29       -7.850921   3-chlorobenzoate-3,4-dioxygenase
VP0205    1       33       -9.281448   glutamate-1-semialdehyde 2,1-aminomutase
VP0206    2       33       -9.128508   amidohydrolase
VP0207    3       29       -8.458035   flagellin modification protein A
VP0208    3       29       -8.006030   integral membrane protein
VP0209    0       22       -5.626783   citrate synthase
VP0210    -1      14       -3.508297   Lex2B
VP0211    0       11       -2.289245   3-deoxy-D-manno-octulosonic-acid transferase
VP0212    1       12       -2.154964   ADP-heptose-LPS heptosyltransferase II
VP0213    2       13       -2.136286   lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA
VP0214    2       14       -1.811310   ADP-L-glycero-D-manno-heptose-6-epimerase
VP0215    1       13       -1.251039   OtnG protein
VP0216    1       14       -1.926322   hypothetical protein
VP0217    0       15       -1.971610   regulator
VP0218    0       18       -2.417455   hypothetical protein
VP0219    0       19       -4.121164   hypothetical protein
VP0220    1       23       -6.394571   OtnA protein
VP0221    5       33       -10.221195  OtnB protein
VP0222    7       39       -12.562121  dTDP-glucose 4,6 dehydratase
VP0223    9       45       -15.290155  D-glucose-1-phosphate thymidylyltransferase
VP0224    10      46       -15.917051  dTDP-4-dehydrorhamnose reductase
VP0225    10      45       -14.655431  capsular polysaccharide biosynthesis protein
VP0226    10      44       -13.128233  rhamnosyl transferase
VP0227    8       41       -11.594223  hypothetical protein
VP0228    8       39       -9.972963   integral membrane protein
VP0229    5       29       -5.548299   dTDP-4-dehydrorhamnose 3,5-epimerase
VP0230    3       29       -5.148159   glycosyltransferase
VP0231    2       26       -4.189290   UDP-galactose phosphate transferase
VP0232    1       24       -4.012602   carbamoyl phosphate synthase-like protein
VP0233    1       20       -3.025083   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0190    LPSBIOSNTHSS    210    5e-73    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 210 bits (537), Expect = 5e-73
Identities = 72/157 (45%), Positives = 107/157 (68%)

Query: 8 MKVIYPGTFDPVTNGHLNLIERTHEMFDEVVIGVAASPSKNTMFTLEERVALMEEVVAHL 67
M IYPG+FDP+T GHL++IER +FD+V + V +P+K MF+++ER+ + + +AHL
Sbjct: 1 MNAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHL 60

Query: 68 PGVTVKGFSGLLVDFARQEQAKVLIRGLRTTVDFEYEFGLTNMYRKLLPGIESVFLTPEE 127
P V F GL V++ARQ QA ++RGLR DFE E + N + L +E+VFLT
Sbjct: 61 PNAQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTST 120

Query: 128 EFAFLSSTIVREVAIHGGSIEQFVPAAVANAIEKKVN 164
E++FLSS++V+EVA GG++E FVP+ VA A+ + +
Sbjct: 121 EYSFLSSSLVKEVARFGGNVEHFVPSHVAAALYDQFH 157


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0197    NUCEPIMERASE    46    2e-07    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 45.5 bits (108), Expect = 2e-07
Identities = 41/214 (19%), Positives = 77/214 (35%), Gaps = 26/214 (12%)

Query: 34 RFLVLGGAGSIGQAVTKEIFKRNPKKLHVVDISENNMVELVRDIRSSFGYIDGDFQTFAL 93
++LV G AG IG V+K + + + + + ++++ V L + FQ +
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLA--QPGFQFHKI 59

Query: 94 DIGSLEYDAFIKADGKFDYVLNLSALKHVR-SEKDPYTLMRMIDVNVFNTDKTIQQSIDA 152
D+ E + A G F+ V VR S ++P+ D N+ ++
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAY---ADSNLTGFLNILEGCRHN 116

Query: 153 GAKKYFCVST---------------DKAANPVNMMGASKRIMEMFLMRKSEQIAISTA-- 195
+ S+ D +PV++ A+K+ E+ S +
Sbjct: 117 KIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGL 176

Query: 196 RFANVAFSDGS---LLHGFNQRIQKRQPIVAPND 226
RF V G L F + + + + I N
Sbjct: 177 RFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNY 210


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0198    OMS28PORIN    30    0.013    OMS28 porin signature.
		>OMS28PORIN#OMS28 porin signature.

Length = 257

Score = 30.1 bits (67), Expect = 0.013
Identities = 21/80 (26%), Positives = 36/80 (45%), Gaps = 14/80 (17%)

Query: 48 VSSVGKFVDDFERKIE---------VYTGTAKAVATVNGTAALHAALYMADVQRGDLVIT 98
+ ++ K +D K+E V + A V G+ +L M+DV +G +V +
Sbjct: 60 LDTINKVTEDVSSKLEGVRESSLELVESNDAGVVKKFVGSMSL-----MSDVAKGTVVAS 114

Query: 99 QALTFVATCNALYHMGAEPI 118
Q T VA C+ + GA +
Sbjct: 115 QEATIVAKCSGMVAEGANKV 134


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0207    DHBDHDRGNASE    62    2e-13    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 61.6 bits (149), Expect = 2e-13
Identities = 56/270 (20%), Positives = 103/270 (38%), Gaps = 34/270 (12%)

Query: 2 LKDKKIVIAGAGGLLGASVVKSILEAGGSVVATDVSLEHLKARLSSVGVNLADTHLTMHE 61
++ K I GA +G +V +++ G + A D + E L+ +SS+ H
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSL--KAEARHAEAFP 63

Query: 62 LDITRSDAL----TRFWQEAEGVTGAVNCTYPRTKSYGAKFFDVTLDSFNENVSIHLGSA 117
D+ S A+ R +E + VN ++ + + S++
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVA---GVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 118 FLFSQQCAAYFVNKEQPFSLVNISSIYGVIAPKFSVYDNTPMTMPVEYAAIKSAIVHLNK 177
F S+ + Y +++ S+V + S P T YA+ K+A V K
Sbjct: 121 FNASRSVSKYMMDRRSG-SIVTVGSNPA----------GVPRTSMAAYASSKAAAVMFTK 169

Query: 178 YVVSYINDSRFRVNSVSPGGI-LDGQPEAFLDAYRKNTHGAGMLNV-------------E 223
+ + + R N VSPG D Q + D G L
Sbjct: 170 CLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPS 229

Query: 224 EMTGSIVYLLSDQSKYVTGQNIIVDDGFSL 253
++ ++++L+S Q+ ++T N+ VD G +L
Sbjct: 230 DIADAVLFLVSGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0214    NUCEPIMERASE    87    1e-21    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 86.8 bits (215), Expect = 1e-21
Identities = 77/358 (21%), Positives = 130/358 (36%), Gaps = 88/358 (24%)

Query: 2 IIVTGGAGMIGSNIVKALNEAGINDILVVDNLKN--------------GKKFKNLVDLDI 47
+VTG AG IG ++ K L EAG + ++ +DNL + + +D+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 48 TDYMDRDDFLTQIMAGDDFGPIEAIFHEGACSATTEWDGKYMMLNNYEYSK-------EL 100
D + +T + A F E +F A +Y + N + Y+ +
Sbjct: 62 ADR----EGMTDLFASGHF---ERVFISPHRLAV-----RYSLENPHAYADSNLTGFLNI 109

Query: 101 LHYCLDREIP-FLYASSAATYGETET--FVEEREYEGALNVYGYSKQQFDNYVRRLWKDA 157
L C +I LYASS++ YG F + + +++Y +K K
Sbjct: 110 LEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATK-----------KAN 158

Query: 158 EEHGEQLSQI-----TGFRYFNVYGPREDHKGSMASVAFHLNNQINAGENPKLFEGSGHF 212
E S + TG R+F VYGP + MA F + G++ ++ G
Sbjct: 159 ELMAHTYSHLYGLPATGLRFFTVYGPWG--RPDMA--LFKFTKAMLEGKSIDVY-NYGKM 213

Query: 213 KRDFVYVGDVCKVNL------------WFLENGVSG-------IFNCGTGRAESFEEVAK 253
KRDF Y+ D+ + + W +E G ++N G + +
Sbjct: 214 KRDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQ 273

Query: 254 AVVKHHN-KGEIQTIPFPDHLKGAYQEFTQADLTKLRAAGCDVEFK---TVAEGVAEY 307
A+ + + +P T AD L + F TV +GV +
Sbjct: 274 ALEDALGIEAKKNMLPLQP----GDVLETSADTKALYE---VIGFTPETTVKDGVKNF 324


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0218    PF06057    25    0.033    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 24.8 bits (54), Expect = 0.033
Identities = 12/50 (24%), Positives = 22/50 (44%)

Query: 6 LAILMALSASTAFAAEEAGSAGAASASATGTTAVAVGAAAAVTVVAVAAS 55
L++L+ S + AFA E A + G +T V ++ + + S
Sbjct: 9 LSVLLLCSTANAFADEFADNLGLTLLPVEPSTQVNAASSHTKPPLVIFLS 58


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0219    OMPADOMAIN    45    5e-08    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 44.9 bits (106), Expect = 5e-08
Identities = 45/187 (24%), Positives = 71/187 (37%), Gaps = 26/187 (13%)

Query: 26 ALSAISLGLCSFTASA----DFYAGALVSYSNAEYHHS---STSSVTEGNPFLLQAQAGY 78
A++ G + +A +Y GA + + ++YH + + + T N A GY
Sbjct: 7 AIAVALAGFATVAQAAPKDNTWYTGAKLGW--SQYHDTGFINNNGPTHENQLGAGAFGGY 64

Query: 79 FFNDYVALEARY---GTSVQRESGLAIDSLASGF---VKLNMPVSERFAFYGLAGYSSVQ 132
N YV E Y G + S A G KL P+++ Y G +
Sbjct: 65 QVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLGGMVWR 124

Query: 133 IDQQN--VGSNKDQGFS--FGLGMHYALDKHNAVVFEFVD-------NTSEDQVRLNALT 181
D ++ G N D G S F G+ YA+ A E+ +T + L+
Sbjct: 125 ADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNIGDAHTIGTRPDNGMLS 184

Query: 182 LGFQHRF 188
LG +RF
Sbjct: 185 LGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0222    NUCEPIMERASE    178    2e-55    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 178 bits (454), Expect = 2e-55
Identities = 78/358 (21%), Positives = 146/358 (40%), Gaps = 48/358 (13%)

Query: 1 MKILVTGGAGFIGSALVRHIIKNTSDSVVNVDCLT--YAGNL-ESLGSVIQSERYVFEQV 57
MK LVTG AGFIG + + +++ VV +D L Y +L ++ ++ + F ++
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 58 NICDRAELNRVFEAHKPDAVMHLAAESHVDRSITGPAAFIETNVVGTYTMLEATREYWSK 117
++ DR + +F + + V V S+ P A+ ++N+ G +LE R +
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 118 LDDNAKAAFRFHHISTDEVYGDLPHPDEVSDGKELPMFLETTPYEPSSPYSASKASSDHL 177
+ S+ VYG +P + + P S Y+A+K +++ +
Sbjct: 120 ---------HLLYASSSSVYGLNRK---------MPFSTDDSVDHPVSLYAATKKANELM 161

Query: 178 VRAWLRTYGLPTMVTNCSNNYGPYHFPEKLIPLVILNALEGKDLPIYGKGDQIRDWLFVE 237
+ YGLP YGP+ P+ + LEGK + +Y G RD+ +++
Sbjct: 162 AHTYSHLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYID 221

Query: 238 DHARALYKVI------------------TEGKVGETYNIGGHNEKKNIEVVNTICEILDT 279
D A A+ ++ YNIG + + ++ + + + L
Sbjct: 222 DIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGI 281

Query: 280 LVPKQTAYSEQITYVQDRPGHDRRYAIDSSKMQRELNWTPEETFETGLRKTVQWYLDN 337
K + +PG + D+ + + +TPE T + G++ V WY D
Sbjct: 282 EAKKN--------MLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0224    NUCEPIMERASE    46    1e-07    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 45.5 bits (108), Expect = 1e-07
Identities = 46/213 (21%), Positives = 85/213 (39%), Gaps = 33/213 (15%)

Query: 3 NVLLLGESGFVGSVVYNSL----HCVCKVYTIAPNKKITI--DNVFEVANDVFKSIEIN- 55
L+ G +GF+G V L H V + + +++ + +A F+ +I+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 56 -------------NIDVVINCIAMANLDQCENNKLDCELVNTTFVTHIVDYLKDKDIK-L 101
+ + V + N N T +I++ + I+ L
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 102 VHISSNAVYDGLNA--PYSE-NSLREPINYYGICKSNADYYIESNLNNYAIA----RPIT 154
++ SS++VY GLN P+S +S+ P++ Y K + + + Y + R T
Sbjct: 122 LYASSSSVY-GLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 155 VYGPRKIEQR-DNPVSFIVKKILSGESFDLVDD 186
VYGP R D + K +L G+S D+ +
Sbjct: 181 VYGPW---GRPDMALFKFTKAMLEGKSIDVYNY 210


3       VP0372    VP0402    Y       Y            Y                     Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VP0372    1       19       -4.118738   hemolysin
VP0373    2       21       -6.518792   hypothetical protein
VP0374    2       20       -5.394538   hypothetical protein
VP0375    2       22       -5.565303   lipoprotein
VP0376    3       24       -7.021413   hypothetical protein
VP0377    3       26       -5.184747   CFA/I fimbrial subunit D
VP0378    2       24       -3.476576   hypothetical protein
VP0379    3       26       -4.121884   ABC transporter substrate binding protein
VP0380    3       29       -5.121936   integrase
VP0381    4       24       -3.382822   hypothetical protein
VP0382    5       23       -4.297385   hypothetical protein
VP0383    3       16       -2.775476   hypothetical protein
VP0384    2       16       -2.853883   hypothetical protein
VP0385    1       16       -1.926875   hypothetical protein
VP0386    1       16       -1.739105   inner membrane protein
VP0387    2       15       -2.478383   HsdS polypeptide
VP0388    0       17       -0.494934   type I restriction enzyme M protein
VP0389    2       26       -2.408536   hypothetical protein
VP0390    2       18       -0.798409   hypothetical protein
VP0391    3       18       -1.068306   hypothetical protein
VP0392    3       18       -1.002402   hypothetical protein
VP0393    1       18       -1.643919   hypothetical protein
VP0394    0       18       -1.633711   haemagglutinin associated protein
VP0395    -1      17       -1.548896   type I restriction enzyme R protein
VP0396    2       24       -4.439812   hypothetical protein
VP0397    2       23       -3.684200   hypothetical protein
VP0398    2       24       -3.584249   hypothetical protein
VP0399    3       17       -2.656077   transcriptional regulator
VP0400    2       13       -2.195843   transmembrane protein
VP0401    2       13       -2.131912   hypothetical protein
VP0402    2       15       -0.714100   Y4mE
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0372    PHPHTRNFRASE    30    0.028    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 30.1 bits (68), Expect = 0.028
Identities = 15/106 (14%), Positives = 40/106 (37%), Gaps = 15/106 (14%)

Query: 83 LGCVEGVILAELLLMVRDDIQILANQYLKTVPELDQLFIGVDVFEGKDAVKSNMKALRAA 142
+ GV +A+ + + ++ I E+++L + K ++A++
Sbjct: 8 IAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLT------AALEKSKEELRAIK-- 59

Query: 143 NKHLANGGLLLVFPAGEVSQLVDAKQQRLEDKEWSRSVSALIRKNK 188
++ A+ G + +++ A L+D E + I +
Sbjct: 60 DQTEASMG-------ADKAEIFAAHLLVLDDPELVDGIKGKIENEQ 98


4       VP0435    VP0440    Y       N            N                     Genomic Island
LocusTag  DNBias  CDNBias  %GCBias     Product
VP0435    3       24       -2.293360   hypothetical protein
VP0436    2       23       -1.341713   hypothetical protein
VP0437    4       25       -1.058830   hypothetical protein
VP0438    4       25       -0.743529   50S ribosomal protein L13
VP0439    3       24       -1.043018   30S ribosomal protein S9
VP0440    2       20       -1.276335   hypothetical protein
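
The Bias, Virulence, and Insertion elements flags in the island summary rows appear to determine the Prediction label. The following Python sketch encodes only the pattern visible in the rows shown on this page; it is an inference from this output, not the tool's documented decision rule. The function name `label_island` is hypothetical, and combinations that do not occur on this page return `None` rather than a guess.

```python
PAI_LABEL = "Pathogenicity Island (biased-composition)"
GI_LABEL = "Genomic Island"


def label_island(bias, virulence, insertion_elements):
    """Map Y/N flags to the Prediction label, as inferred from this page's rows.

    insertion_elements is accepted but unused: in the rows shown, islands with
    insertion elements "Y" and "N" receive the same label when bias and
    virulence are both "Y".
    """
    if bias == "Y" and virulence == "Y":
        return PAI_LABEL
    if bias == "Y" and virulence == "N":
        return GI_LABEL
    return None  # combination not present on this page


# Flag combinations that occur in the island summary rows on this page.
rows = [("Y", "Y", "N"), ("Y", "Y", "Y"), ("Y", "N", "N")]
labels = [label_island(*row) for row in rows]
for row, label in zip(rows, labels):
    print(row, "->", label)
```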
5       VP0628    VP0642    Y       Y            N                     Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VP0628    2       15       1.547057    hypothetical protein
VP0629    1       17       1.699934    homocysteine synthase
VP0630    2       19       1.385251    hypothetical protein
VP0631    0       17       0.849550    hypothetical protein
VP0632    0       16       0.253417    Na+/H+ antiporter
VP0633    -1      19       -0.058433   integral membrane protein
VP0634    -1      19       0.065826    hypothetical protein
VP0635    3       26       -0.604802   LysR family transcriptional regulator
VP0636    5       27       -0.221150   outer membrane protein A
VP0637    5       33       1.292084    hypothetical protein
VP0638    7       31       1.468659    resolvase
VP0639    6       26       0.821507    hypothetical protein
VP0640    4       23       0.414408    hypothetical protein
VP0641    4       22       0.110548    hypothetical protein
VP0642    2       20       -0.531385   ribonuclease H
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0628    UREASE    35    6e-04    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 35.5 bits (82), Expect = 6e-04
Identities = 18/40 (45%), Positives = 22/40 (55%), Gaps = 2/40 (5%)

Query: 512 RVDTYTALQAMTIWPAYQHFEESYKGSIEVGKNADLIILD 551
RV Y A TI PA H GS+EVGK ADL++ +
Sbjct: 401 RVKRYIA--KYTINPAIAHGLSHEIGSLEVGKRADLVLWN 438


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0636    OMPADOMAIN    78    1e-19    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 78.4 bits (193), Expect = 1e-19
Identities = 56/201 (27%), Positives = 82/201 (40%), Gaps = 24/201 (11%)

Query: 1 MKSNVVTLVGLFSLTTFSTLSYAADSKDHGVYVGANYGY------LKVDGQDDFDDDSDA 54
MK + + +L F+T++ AA KD+ Y GA G+ ++ ++
Sbjct: 1 MKKTAIAIA--VALAGFATVAQAAP-KDNTWYTGAKLGWSQYHDTGFINNNGPTHENQLG 57

Query: 55 MQALVGYRFNRYLALEGGYIDFGSY---GNNLANA-ETDGYTAALKVTAPITDRVDVYAK 110
A GY+ N Y+ E GY G G+ A + G K+ PITD +D+Y +
Sbjct: 58 AGAFGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIYTR 117

Query: 111 GGQMWYSTDYNVAGFHGNKDDEGV--FAGAGVGFKVTDNFLVNAEYTWYDVELNAENVFD 168
G M + D + +G D GV GV + +T EY W N+ D
Sbjct: 118 LGGMVWRADTK-SNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQW------TNNIGD 170

Query: 169 GA--NTNTDFKQASLGVEYRF 187
T D SLGV YRF
Sbjct: 171 AHTIGTRPDNGMLSLGVSYRF 191


6       VP0651    VP0659    Y       Y            N                     Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VP0651    2       25       -0.938920   heat shock protein GrpE
VP0652    1       22       -0.797914   hypothetical protein
VP0653    2       23       -1.314747   molecular chaperone DnaK
VP0654    -1      18       -2.198869   molecular chaperone DnaJ
VP0655    2       27       -3.782753   hypothetical protein
VP0656    2       28       -4.090550   fimbrial assembly protein PilE
VP0657    2       19       -2.693760   type IV pilin
VP0658    2       19       -2.139663   hypothetical protein
VP0659    3       20       -1.898554   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0651    PHPHTRNFRASE    28    0.021    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 28.2 bits (63), Expect = 0.021
Identities = 24/119 (20%), Positives = 52/119 (43%), Gaps = 6/119 (5%)

Query: 8 VTEEELDQIIEEAEKVEAAAQEAEAELEEIGDEKDAKIAQLEAALLSSETKVKDQQDAVL 67
+ + + + E EK+ AA ++++ EL I D+ +A + +A + ++ V D + V
Sbjct: 29 IEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTEASMGADKAEIFAAHLLVLDDPELVD 88

Query: 68 RAKAEVENMRRRTEQEIDKARKYALNKFAEELLPVIDNL--ERAIQAADTENEVIKPIL 124
K ++EN + E + + + F + + ERA D V+ ++
Sbjct: 89 GIKGKIENEQMNAEYALKEVS----DMFVSMFESMDNEYMKERAADIRDVSKRVLGHLI 143


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0653    SHAPEPROTEIN    139    2e-38    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 139 bits (351), Expect = 2e-38
Identities = 81/385 (21%), Positives = 145/385 (37%), Gaps = 81/385 (21%)

Query: 5 IGIDLGTTNSCVAVLDG----DKPRVIE-NAEGERTTASVIAYTDGETLVGQPAKRQAVT 59
+ IDLGT N+ + V ++P V+ + + SV A VG AK+
Sbjct: 13 LSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAA-------VGHDAKQMLGR 65

Query: 60 NPTNTLFAIKRLIGRRFEDEEVQRDIEIMPYKIVKADNGDAWVEAKGQKMAAPQVSAEVL 119
P N + AI+ + D V + ++L
Sbjct: 66 TPGN-IAAIRPMKDGVIADFFV---------------------------------TEKML 91

Query: 120 KK-MKKTAEDFLGEEVTGAVITVPAYFNDAQRQATKDAGRIAGLEVKRIINEPTAAALAY 178
+ +K+ + ++ VP +R+A +++ + AG +I EP AAA+
Sbjct: 92 QHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGA 151

Query: 179 GLDKKGGDRTIAVYDLGGGTFDISIIEIDEVEGEKTFEVLATNGDTHLGGEDFDNRLINY 238
GL V D+GGGT ++++I ++ V + +GG+ FD +INY
Sbjct: 152 GLPVS-EATGSMVVDIGGGTTEVAVISLNGV---------VYSSSVRIGGDRFDEAIINY 201

Query: 239 LVDEFKKEQGIDLKNDPLAMQRVKEAAEKAKIELSSTSQTD----VNLPYVTADATGPKH 294
+ + G + AE+ K E+ S D + + P+
Sbjct: 202 VRRNYGSLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRG 248

Query: 295 MNIKVTRAKLESLVEDLVQRSLEPLKVALADA--DLSVNDITD--VILVGGQTRMPMVQA 350
+ + LE+L E L + + VAL +L+ +DI++ ++L GG + +
Sbjct: 249 FTLN-SNEILEALQEPLTG-IVSAVMVALEQCPPELA-SDISERGMVLTGGGALLRNLDR 305

Query: 351 KVAEFFGKEARRDVNPDEAVAMGAA 375
+ E G +P VA G
Sbjct: 306 LLMEETGIPVVVAEDPLTCVARGGG 330


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0654    PF07132    30    0.016    Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 30.0 bits (67), Expect = 0.016
Identities = 16/38 (42%), Positives = 19/38 (50%)

Query: 76 QGGGGFGGGFGGGGADFGDIFGDVFGDIFGGGRRGGGG 113
GGG GGG GG G+ G + G + G GGG G
Sbjct: 64 MMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLG 101



Score = 29.7 bits (66), Expect = 0.020
Identities = 17/43 (39%), Positives = 19/43 (44%)

Query: 78 GGGFGGGFGGGGADFGDIFGDVFGDIFGGGRRGGGGHRAQRGA 120
G GGG GGG G G + G + GGG GG G G
Sbjct: 62 GSMMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLGSGL 104


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0656    BCTERIALGSPG    46    4e-09    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 45.6 bits (108), Expect = 4e-09
Identities = 14/53 (26%), Positives = 34/53 (64%)

Query: 12 KRLRGMTLIELLIAVTIVGIIAAIAYPSYTNHVIKSHRTVALSDLSRIQLELE 64
+ RG TL+E+++ + I+G++A++ P+ + K+ + A+SD+ ++ L+
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALD 57


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0657    BCTERIALGSPG    34    8e-05    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 34.5 bits (79), Expect = 8e-05
Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 17 RGFTLLELMITVAVLSALLATAAPS 41
RGFTLLE+M+ + ++ L + P+
Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPN 32


7       VP0787    VP0810    Y       Y            N                     Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VP0787    2       19       0.601834    hypothetical protein
VP0788    2       21       0.111775    flagellin
VP0789    3       27       -0.601552   hypothetical protein
VP0790    4       30       -0.433125   flagellin
VP0791    5       29       -1.262395   flagellin
VP0792    6       43       -1.553644   hypothetical protein
VP0793    4       35       -1.646319   PTS system glucose-specific transporter subunit
VP0794    3       25       -1.408966   phosphoenolpyruvate-protein phosphotransferase
VP0795    1       11       -0.608528   phosphocarrier protein HPr
VP0796    0       11       -0.772311   hypothetical protein
VP0797    0       17       -0.218071   cysteine synthase A
VP0798    0       17       -0.432275   sulfate transport protein CysZ
VP0799    1       18       -0.078006   cell division protein ZipA
VP0800    1       20       -0.093288   NAD-dependent DNA ligase LigA
VP0801    3       22       -0.678844   hypothetical protein
VP0802    3       23       -0.531345   hypothetical protein
VP0803    0       14       -0.871613   hypothetical protein
VP0804    -1      15       -0.128760   hypothetical protein
VP0805    0       18       0.117884    hypothetical protein
VP0806    2       21       0.827027    hypothetical protein
VP0807    2       26       1.310419    hypothetical protein
VP0808    2       27       1.420571    short chain dehydrogenase
VP0809    2       29       1.395248    sugar nucleotide epimerase
VP0811    3       30       1.131679    hypothetical protein
VP0810    2       28       1.190745    PTS system mannose-specific, factor IIC
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0788    FLAGELLIN    185    7e-56    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 185 bits (470), Expect = 7e-56
Identities = 90/370 (24%), Positives = 162/370 (43%), Gaps = 10/370 (2%)

Query: 2 AVTVSTNVSAMTAQRYLNKATNELNTSMERLSSGHKINSAKDDAAGLQISNRLTAQSRGL 61
A ++TN ++ Q LNK+ + L++++ERLSSG +INSAKDDAAG I+NR T+ +GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVAMRNANDGISIAQTAEGAMNEATSVMQRMRDLAIQSSNGTNSPAERQAINEESMALVD 121
A RNANDGISIAQT EGA+NE + +QR+R+L++Q++NGTNS ++ ++I +E ++
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELNRIAETTSFGGRRLLNGSFGEAAFQIGASSGEAMIMGLTSIRADDTRMGGVTFFSEVG 181
E++R++ T F G ++L+ Q+GA+ GE + + L I + G
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQ-MKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 KGKDWGVDPTKADLKITLPGMGEDEDGNVDDLEININAKAGDDIEELATYINGQSDMINA 241
+ + +++N+ A T +
Sbjct: 180 ATVGD------LKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAAN 233

Query: 242 SVSEDGKLQIFVAHPNVQGDISISGGLASELGLSDEPVRTSVQDIDMTTVQGSQNAISVL 301
+ A + S +G ++ D V + + +
Sbjct: 234 GQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGN 293

Query: 302 DSALK---YVDSQRADLGAKQNRLSHSINNLANIQENVDASNSRIKDTDFAKETTQMTKA 358
D K ++ ++ L + + A +Q + + S + + T+ A
Sbjct: 294 DGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESA 353

Query: 359 QILQQAGTSI 368
++ +
Sbjct: 354 KLSDLEANNA 363



Score = 137 bits (346), Expect = 3e-38
Identities = 70/243 (28%), Positives = 114/243 (46%), Gaps = 24/243 (9%)

Query: 160 GLTSIRADDTRMGGVTFFSEVGKGKDWGVDPTKADLKITLPGMGEDEDGNVDDLEININA 219
G D + T ++ G + V T K+TL ++ N++A
Sbjct: 271 GGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTL------TVADITAGAANVDA 324

Query: 220 KAGDDIEEL-ATYINGQSDMINASVSEDGKLQIFVAHPNVQGDISISGGLASELGLSDEP 278
+ + + +NGQ + + +E KL A+ V+G+ I+ A +
Sbjct: 325 ATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGD 384

Query: 279 VRT-----------------SVQDIDMTTVQGSQNAISVLDSALKYVDSQRADLGAKQNR 321
T + + + + N ++ +DSAL VD+ R+ LGA QNR
Sbjct: 385 KVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNR 444

Query: 322 LSHSINNLANIQENVDASNSRIKDTDFAKETTQMTKAQILQQAGTSILAQAKQLPNSAMS 381
+I NL N N++++ SRI+D D+A E + M+KAQILQQAGTS+LAQA Q+P + +S
Sbjct: 445 FDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLS 504

Query: 382 LLQ 384
LL+
Sbjct: 505 LLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0789    PF03944    26    0.010    delta endotoxin
		>PF03944#delta endotoxin

Length = 633

Score = 25.8 bits (56), Expect = 0.010
Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 28 PFSFERTGARLVARAYVEIKRNEH 51
PFSF+ V + + E K+N H
Sbjct: 23 PFSFQHKSLDTVQKEWTEWKKNNH 46


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0790    FLAGELLIN    201    6e-62    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 201 bits (511), Expect = 6e-62
Identities = 90/297 (30%), Positives = 144/297 (48%), Gaps = 1/297 (0%)

Query: 2 AVNVNTNVSAMTAQRYLNNANSAQQTSMERLSSGFKINSAKDDAAGLQISNRLNVQSRGL 61
A +NTN ++ Q LN + S+ +++ERLSSG +INSAKDDAAG I+NR +GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVAVRNANDGISIAQTAEGAMNETTNILQRMRDLSLQSANGSNSKAERVAIQEEVTALND 121
A RNANDGISIAQT EGA+NE N LQR+R+LS+Q+ NG+NS ++ +IQ+E+ +
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELNRIAETTSFGGNKLLNGTHGAKSFQIGADNGEAVMLELKDMRSDNKMMGGVSYQAESG 181
E++R++ T F G K+L+ Q+GA++GE + ++L+ + + + G +
Sbjct: 121 EIDRVSNQTQFNGVKVLSQD-NQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 KGKDWNVAQGKNDLKISLTDSFGQEQEININAKAGDDIEELATYINGQTDLVKASVDQDG 241
+ KN + +++N+ A T +
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 242 KLQIFAGNNKVEGEVSFSGGLSGELGLGDDKKNVTVDTIDVTSVGGAQESVAIIDAA 298
+ + + S +G + G K DT D V ++ D
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGN 296



Score = 135 bits (342), Expect = 1e-37
Identities = 82/377 (21%), Positives = 144/377 (38%), Gaps = 20/377 (5%)

Query: 19 NNANSAQQTSMERLSSGFKINSAKDDAAGLQISNRLNVQSRGLDVAVRNANDGISIAQTA 78
N Q + ++ G L + ++ + +
Sbjct: 132 NGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKN 191

Query: 79 EGAMNETTNILQRMRDLSLQSANGSNSKAERVAIQEEVTALNDELNRIAETTSFGGNKLL 138
+ + R A +++ A V + V A N +L + +
Sbjct: 192 VTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFK 251

Query: 139 NGTHGAKSFQIGADNGEAVMLELKDMRSDNKMMGGVSYQAESGKGKDWNVAQGKNDLKIS 198
A + + A G + D + ++G + V+ N K++
Sbjct: 252 TTKSTAGTAEAKAIAGAIKGGKEGDTFDYKG--VTFTIDTKTGNDGNGKVSTTINGEKVT 309

Query: 199 LTDSFGQEQEININAKAGDDIEEL-ATYINGQTDLVKASVDQDGKLQIFAGNNKVEGEVS 257
LT + N++A + + + +NGQ + ++ KL NN V+GE
Sbjct: 310 LTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESK 369

Query: 258 FSGGLSGELGLGDDKK-----------------NVTVDTIDVTSVGGAQESVAIIDAALK 300
+ + K + ++ + +A ID+AL
Sbjct: 370 ITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALS 429

Query: 301 YVDSHRAELGAFQNRFNHAISNLDNINENVNASKSRIKDTDFAKETTAMTKSQILSQASS 360
VD+ R+ LGA QNRF+ AI+NL N N+N+++SRI+D D+A E + M+K+QIL QA +
Sbjct: 430 KVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGT 489

Query: 361 SILAQAKQAPNSALSLL 377
S+LAQA Q P + LSLL
Sbjct: 490 SVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0791    FLAGELLIN    149    2e-42    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 149 bits (376), Expect = 2e-42
Identities = 80/354 (22%), Positives = 132/354 (37%), Gaps = 17/354 (4%)

Query: 5 NTNVAAMMTQRHLSQAADQNVESQRNLSSGYRINSASDDAAGLQISNTLHVQTRGIDVAL 64
NTN +++TQ +L+++ + LSSG RINSA DDAAG I+N +G+ A
Sbjct: 5 NTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQAS 64

Query: 65 RNAHDAYSVAQTAEGALHESSDILQRLRSLGLQAANGSHEQDDRKSLQQEVIALQDELDR 124
RNA+D S+AQT EGAL+E ++ LQR+R L +QA NG++ D KS+Q E+ +E+DR
Sbjct: 65 RNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDR 124

Query: 125 VAITTTFADKNLFNGSYGSQSFHIGANANS-ISLALRNMRTHIPEMGGQHYLGDSL-DKD 182
V+ T F + + +GAN I++ L+ + + G + G
Sbjct: 125 VSNQTQFNGVKVLSQD-NQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVG 183

Query: 183 WRVTRDNQQFAFEYQDNEGQAQSKVLTLKVGDNLEEVATYINAQQSVVDASVTQDHQLQF 242
+ ++ + T + +
Sbjct: 184 DLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAEN 243

Query: 243 FTSTLNAPEGITWKGNFADEMDIGSGELVTVDDLDMSTVGGAQLAIGVVDAAIKYVDSHR 302
T+ + G + G+ + D G I
Sbjct: 244 NTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEG--DTFDYKGVTFTIDTKTGNDG------ 295

Query: 303 SEIGGFQNRVSGTIDNLNTINRSVSESKGRIRDTDFARESTVMVRSQVLQDATT 356
+VS TI+ + G +S+ V + V+ T
Sbjct: 296 ------NGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFT 343



Score = 100 bits (250), Expect = 2e-25
Identities = 46/216 (21%), Positives = 86/216 (39%), Gaps = 19/216 (8%)

Query: 177 DSLDKDWRVTRDNQQFAFEYQDNEGQAQSKVLTLKVGDNLEE-VATYINAQQSVVDASVT 235
D + +V+ + A + + + + + +N Q + D +
Sbjct: 291 TGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKN 350

Query: 236 QDHQLQFFTSTLNAPEGITWKGNFADEMDIGSGELVTVD------------------DLD 277
+ +L + N A+ +G+ VT+ +
Sbjct: 351 ESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDA 410

Query: 278 MSTVGGAQLAIGVVDAAIKYVDSHRSEIGGFQNRVSGTIDNLNTINRSVSESKGRIRDTD 337
+ + +D+A+ VD+ RS +G QNR I NL +++ ++ RI D D
Sbjct: 411 AAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDAD 470

Query: 338 FARESTVMVRSQVLQDATTALLAQAKQRPSSALGLL 373
+A E + M ++Q+LQ A T++LAQA Q P + L LL
Sbjct: 471 YATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
VP0794    PHPHTRNFRASE    754    0.0    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 754 bits (1949), Expect = 0.0
Identities = 284/571 (49%), Positives = 409/571 (71%), Gaps = 2/571 (0%)

Query: 1 MISGILASPGIAIGKALLLQEDEIVLNTNTISDDQVEAEVARFFDARNKSAAQLETIKQK 60
I+GI AS G+AI KA + E + + +I+D V E+ + A KS +L IK +
Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSITD--VSTEIEKLTAALEKSKEELRAIKDQ 61

Query: 61 ALETFGEEKEAIFEGHIMLLEDEELEEEILALIKNDKMTADHAIHSVIEEQACALESLDD 120
+ G +K IF H+++L+D EL + I I+N++M A++A+ V + ES+D+
Sbjct: 62 TEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDN 121

Query: 121 EYLKERATDIRDIGSRFVKNALGINIVSLSDINEEVILVAYDLTPSETAQINLDYVLGFA 180
EY+KERA DIRD+ R + + +G+ SL+ I EE +++A DLTPS+TAQ+N +V GFA
Sbjct: 122 EYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFA 181

Query: 181 CDIGGRTSHTSIMARSLELPAIVGTNDITKKVKNGDMLILDAMNNKIVVNPSEAEVEEAK 240
DIGGRTSH++IM+RSLE+PA+VGT ++T+K+++GDM+I+D + ++VNP+E EV+ +
Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241

Query: 241 AVKAAFLAEKEELAKLKDLHAETTDGHRVEVCGNIGTVKDCDGIIRNGGEGVGLYRTEFL 300
+AAF +K+E AKL + T DG VE+ NIGT KD DG++ NGGEG+GLYRTEFL
Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301

Query: 301 FMDRTALPTEEEQYQAYKEVAEAMNGQAVIIRTMDIGGDKDLPYMDLPQEMNPFLGWRAV 360
+MDR LPTEEEQ++AYKEV + M+G+ V+IRT+DIGGDK+L Y+ LP+E+NPFLG+RA+
Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361

Query: 361 RISLDRREILRDQLRGILRASAHGKLRIMFPMIISVEEIRELKNAIEEYKAELRAEGLAF 420
R+ L++++I R QLR +LRAS +G L++MFPMI ++EE+R+ K ++E K +L +EG+
Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDV 421

Query: 421 DENIEIGVMVETPAAAAIAHHLAKEVSFFSIGTNDLTQYTLAVDRGNEMISHLYNPLSPA 480
++IE+G+MVE P+ A A+ AKEV FFSIGTNDL QYT+A DR NE +S+LY P PA
Sbjct: 422 SDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPA 481

Query: 481 VLTVIKQVIDASHAEGKWTGMCGELAGDERATLLLLGMGLDEFSMSGISIPKVKKVIRNS 540
+L ++ VI A+H+EGKW GMCGE+AGDE A LLLG+GLDEFSMS SI + +
Sbjct: 482 ILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKL 541

Query: 541 NFAEVKAMAEEALSLPTAAEIEAVVEKFIAE 571
+ E+K A++AL L TA E+E +V+K +
Sbjct: 542 SKEELKPFAQKALMLDTAEEVEQLVKKTYLK 572


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP0799    TONBPROTEIN   30     0.008    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 30.3 bits (68), Expect = 0.008
Identities = 12/56 (21%), Positives = 20/56 (35%), Gaps = 1/56 (1%)

Query: 123 EDVAIQPSEVEEPMQEVVEEEIMPSAFDAPKQEMEMVEEVAP-AVVEQPEEPKPEP 177
+ V P V EP E P ++ + + P V + E+PK +
Sbjct: 59 QAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDV 114



Score = 28.4 bits (63), Expect = 0.034
Identities = 13/53 (24%), Positives = 23/53 (43%), Gaps = 5/53 (9%)

Query: 133 EEPMQEVVEEEIMPSAFDAPKQEMEMVEEVAPAVVEQPE-----EPKPEPEMQ 180
EP Q V + + + + AP V+E+P+ +PKP ++Q
Sbjct: 55 LEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQ 107


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP0808    DHBDHDRGNASE  86     5e-22    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 85.9 bits (212), Expect = 5e-22
Identities = 56/217 (25%), Positives = 83/217 (38%), Gaps = 13/217 (5%)

Query: 3 KSILITGCSTGIGYVCAHALHKQGFHVIA----SCRKLDDVQRLQSEGLTCIQLDLD--D 56
K ITG + GIG A L QG H+ A + V L++E D D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 57 PISITSGAKQAIELAQGQLYALFNNGAYGQPGALEDLPTEALKAQFQTNFFGWHQLVQEV 116
+I + IE G + L N +PG + L E +A F N G + V
Sbjct: 69 SAAIDEITAR-IEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 117 LPTMRAQGEGRIIQNSSVLGFAAMKYRGAYNASKFALEGWTDTLRLELADTDIKVALIEP 176
M + G I+ S AY +SK A +T L LELA+ +I+ ++ P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 177 GPIETQFRHNALLAFEQWIDVDSSPHRIAYDGQKKRL 213
G ET + + W D + + I + +
Sbjct: 188 GSTETDMQWSL------WADENGAEQVIKGSLETFKT 218


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP0809    NUCEPIMERASE  46     6e-08    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 46.3 bits (110), Expect = 6e-08
Identities = 31/144 (21%), Positives = 52/144 (36%), Gaps = 18/144 (12%)

Query: 1 MKILLTGGTGFIGSELLKLLT--THQIV---LLTRSPDQAKQHLQYADL---------GN 46
MK L+TG GFIG + K L HQ+V L D + + + L +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 47 IEYLASLDELADLNNIDAVINLAGEPIADKRWTDQQKEKICDSRWKITEQIVELIHASTE 106
+ + +L + + V R++ + DS I+E +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHR--LAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 107 PPSVFLSGSAVGYYGDQQDHPFDE 130
++ S S+V YG + PF
Sbjct: 119 QHLLYASSSSV--YGLNRKMPFST 140


8  VP1067  VP1086  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VP1067    2       22       -3.392676   hypothetical protein
VP1068    3       25       -3.696951   hypothetical protein
VP1069    3       28       -4.170516   sensor histidine kinase
VP1070    6       35       -5.647452   hypothetical protein
VP1071    8       36       -6.141578*  hypothetical protein
VP1072    6       35       -4.571065   helicase
VP1073    5       29       -4.408787   immunity repressor protein
VP1074    5       23       -4.394762   hypothetical protein
VP1075    4       22       -3.337548   hypothetical protein
VP1076    4       23       -3.554621   hypothetical protein
VP1077    5       24       -3.591299   phage family integrase
VP1078    6       26       -4.129653   hypothetical protein
VP1079    6       29       -3.758166   hypothetical protein
VP1080    7       33       -3.752254   hypothetical protein
VP1081    8       36       -4.482851   hypothetical protein
VP1082    7       36       -5.570887   hypothetical protein
VP1083    4       30       -4.548259   ATP-dependent DNA helicase
VP1084    3       30       -4.940114   hypothetical protein
VP1085    2       29       -5.236669   hypothetical protein
VP1086    0       22       -3.708640   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1069    HTHFIS        58     1e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 57.5 bits (139), Expect = 1e-10
Identities = 22/133 (16%), Positives = 46/133 (34%), Gaps = 6/133 (4%)

Query: 599 KALIVEDNRTNAIIMETFLRAKGFECSSVENGQLAVNKIAVEPFDLILMDNHMPVLDGVG 658
L+ +D+ ++ L G++ N IA DL++ D MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 659 AISAIRSMSSAAKSVLIFG-CTADVFKETQERMLGVGADHIIAKPIVESELDDALYRHAD 717
+ I+ +++ T + E+ GA + KP +EL + A
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEK----GAYDYLPKPFDLTELIG-IIGRAL 119

Query: 718 LLYQYQTKQNQQA 730
+ + + +
Sbjct: 120 AEPKRRPSKLEDD 132


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1072    PF07299       30     0.027    Fibronectin-binding protein (FBP)
		>PF07299#Fibronectin-binding protein (FBP)

Length = 219

Score = 29.8 bits (67), Expect = 0.027
Identities = 14/72 (19%), Positives = 30/72 (41%), Gaps = 1/72 (1%)

Query: 453 LRPTIENDDKFVIKDQSEVSIDKSALIDDPKALEALRYLNSIGVTGDIDELLQQVDKIPV 512
+ I +D IK Q+ + + A +D ++AL+ L + + L + ++ +
Sbjct: 7 MEAFIRSDQYNFIKSQAYILANGHATANDRGVIQALKSLAIEKIIHVFENLTDEQKEL-I 65

Query: 513 TKVRKRQAQRQA 524
V Q + A
Sbjct: 66 DTVLTVQNREDA 77


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1083    ISCHRISMTASE  30     0.039    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 30.4 bits (68), Expect = 0.039
Identities = 31/107 (28%), Positives = 44/107 (41%), Gaps = 16/107 (14%)

Query: 538 DQAVVLAETIKDMQESGYSYRDQAVLCSGNERLSQLAEQLEL----LGVPVLYLGSLFER 593
++AV+L I DMQ S LS +L+ LG+PV+Y +
Sbjct: 29 NRAVLL---IHDMQNYFVDAFTAG--ASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQ 83

Query: 594 EEVKDMFSFLSLLTDSWGSGLVRIGRWPEF--ELSLEDLDVILNHLR 638
+LLTD WG GL + EL+ ED D++L R
Sbjct: 84 NPDDR-----ALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWR 125


9  VP1313  VP1320  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VP1313    -1      15       -3.027719   hypothetical protein
VP1314    0       19       -4.464356   hypothetical protein
VP1315    0       20       -4.929007   multidrug resistance protein
VP1316    -2      22       -6.237752   LysR family transcriptional regulator
VP1317    -2      23       -7.249807   glutaredoxin
VP1318    -1      24       -7.023326   flippase
VP1319    -1      21       -4.834899   hypothetical protein
VP1320    -1      19       -3.352050   CDP-ribitol pyrophosphorylase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1315    TCRTETB       60     6e-12    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 59.9 bits (145), Expect = 6e-12
Identities = 77/376 (20%), Positives = 132/376 (35%), Gaps = 46/376 (12%)

Query: 9 YKRITLSLALGSFLVFSNLYLLQPMLPTFATLFSISETQVNWLFAASTLALSFSLVPMAV 68
+ +I + L + SF N +L LP A F+ NW+ A L S
Sbjct: 12 HNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGK 71

Query: 69 LSESIGRKPVMMVGLFSIPAISALMLLGDSFIF-LVACRALIGIALAAFAAVAVAYMAEE 127
LS+ +G K +++ G+ S + +G SF L+ R + G AAF A+ + +A
Sbjct: 72 LSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARY 131

Query: 128 LDKHAFSMAIGTYIAANSLGGIAGRISGGLLADNFSVDVAIEVMMVVTLIGVICVHYLLP 187
+ K A G + ++G G GG++A + + M +T+I V + LL
Sbjct: 132 IPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLK 190

Query: 188 KQRN----------------------FTPSSSSL-------------RHQNRAI-----I 207
K+ FT S S +H +
Sbjct: 191 KEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDP 250

Query: 208 GHFRNQRVWFAMLIGGLNFALFVNLYSVMGFRLVSAPHNVPVGLAS-LIFICYLGGTFSS 266
G +N +L GG+ F S++ + + + S +IF +
Sbjct: 251 GLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFG 310

Query: 267 RCAGHWSKRYSSILGMFLGAVVSMAGMWIAAF---ESLAAMLISLLLISFGAFFTHTLAY 323
G R + + +G A+F + M I ++ + G FT T+
Sbjct: 311 YIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVIS 370

Query: 324 GWVGQNATQAKATATA 339
V + Q +A A
Sbjct: 371 TIVSSSLKQQEAGAGM 386
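
Note: each hit in this report carries its statistics on two lines ("Score = ..., Expect = ..." and "Identities = ..."). The following is a minimal Python sketch, not part of PredictBias, of how those two lines could be parsed and checked against the RPSBlast E-value threshold of 0.05 listed under the input parameters. The function name parse_hsp and the returned field names are hypothetical; the line layout is assumed to match the lines shown in this report.

```python
import re

# Patterns assume the exact layout of the statistic lines in this report.
STATS = re.compile(
    r"Score = (?P<bits>[\d.]+) bits \((?P<raw>\d+)\), Expect = (?P<evalue>\S+)"
)
IDENT = re.compile(r"Identities = (?P<same>\d+)/(?P<length>\d+)")

# Threshold taken from the "E-value (RPSBlast)" input parameter of the report.
E_VALUE_CUTOFF = 0.05


def parse_hsp(score_line, identity_line):
    """Return bit score, raw score, E-value and percent identity for one hit."""
    s = STATS.search(score_line)
    i = IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("line does not match the assumed report layout")
    same, length = int(i["same"]), int(i["length"])
    return {
        "bits": float(s["bits"]),
        "raw_score": int(s["raw"]),
        "evalue": float(s["evalue"]),
        # Percent identity is identical positions over alignment length.
        "identity_pct": 100.0 * same / length,
    }


# Statistic lines copied from the VP1315 / TCRTETB hit.
hsp = parse_hsp(
    "Score = 59.9 bits (145), Expect = 6e-12",
    "Identities = 77/376 (20%), Positives = 132/376 (35%), Gaps = 46/376 (12%)",
)
passes_cutoff = hsp["evalue"] <= E_VALUE_CUTOFF
```

For the VP1315 hit, 77 identical positions over an alignment length of 376 gives about 20% identity, matching the rounded figure printed in the report, and the E-value of 6e-12 is well below the 0.05 threshold.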


10  VP1402  VP1419  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias     Product
VP1402    2       14       -2.125690   hypothetical protein
VP1403    1       12       -3.013213   hypothetical protein
VP1404    0       14       -3.617041   hypothetical protein
VP1405    0       15       -3.635936   hypothetical protein
VP1406    1       16       -4.001284   hypothetical protein
VP1407    1       14       -3.845278   transcriptional regulator
VP1408    0       13       -3.023371   IcmF-like protein
VP1409    0       17       -2.009380   hypothetical protein
VP1410    0       17       -2.356408   hypothetical protein
VP1411    0       17       -2.832889   hypothetical protein
VP1412    1       19       -3.302886   hypothetical protein
VP1413    3       22       -4.272457   hypothetical protein
VP1414    5       29       -5.961797   hypothetical protein
VP1415    7       30       -7.275497   hypothetical protein
VP1416    9       30       -8.387220   hypothetical protein
VP1417    6       24       -6.953020   hypothetical protein
VP1418    2       17       -4.668765   hypothetical protein
VP1419    0       17       -3.558673   hypothetical protein
11  VP1509  VP1531  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VP1509    2       20       1.343884    hypothetical protein
VP1510    1       18       1.370092    (Fe-S)-binding protein
VP1511    1       20       1.255636    formate dehydrogenase-specific chaperone
VP1512    1       13       -1.236512   hypothetical protein
VP1513    1       13       -1.729046   formate dehydrogenase large subunit
VP1514    2       19       -3.544460   formate dehydrogenase, iron-sulfur subunit
VP1515    3       22       -4.279285   formate dehydrogenase, cytochrome b556 subunit
VP1516    5       26       -5.225562   hypothetical protein
VP1517    5       27       -5.281229   Rhs-family protein
VP1518    4       27       -4.640095   hypothetical protein
VP1519    2       27       -3.533584   hypothetical protein
VP1520    2       20       -3.289631   hypothetical protein
VP1521    2       19       -2.979956   hypothetical protein
VP1522    1       17       -2.691370   hypothetical protein
VP1523    1       17       -2.736287   ammonium transporter Amt
VP1524    -1      18       -2.946827   NAD-dependent deacetylase
VP1525    -1      18       -3.408046   spermidine/putrescine ABC transporter
VP1526    -2      14       -3.480518   spermidine/putrescine ABC transporter
VP1527    -2      14       -3.888628   spermidine/putrescine ABC transporter membrane
VP1528    -3      12       -3.851168   spermidine/putrescine ABC transporter membrane
VP1529    -2      14       -3.860138   putrescine/spermidine ABC transporter ATPase
VP1530    -2      16       -3.628887   hypothetical protein
VP1531    -2      14       -3.204616   Bax protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1526    MYCMG045      39     3e-05    Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 38.5 bits (89), Expect = 3e-05
Identities = 25/93 (26%), Positives = 44/93 (47%), Gaps = 4/93 (4%)

Query: 23 SACALSLFSGSAAADDKELVFMNWGPYINSNILEQFTKETGIKVIYSTYESNETLYAKLK 82
+ +SL S ++ V N+ YI+ +LE+ + + + TY SNE L
Sbjct: 10 FSLFVSLSSILSSCGSTTFVLANFESYISPLLLER--VQEKHPLTFLTYPSNEKLINGFA 67

Query: 83 THNQGYDLVVPSTYFVAKMRDEGMLQKIDKTKL 115
N Y + V STY V+++ + +L ID ++
Sbjct: 68 --NNTYSVAVASTYAVSELIERDLLSPIDWSQF 98


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1531    FLGFLGJ       29     0.029    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 28.5 bits (63), Expect = 0.029
Identities = 24/91 (26%), Positives = 36/91 (39%), Gaps = 12/91 (13%)

Query: 146 LPEALVLTQAANESAWGTSRFATQ----ANNLFGQWCYKQGCGIVPAQRAAGKTHEVQK- 200
+P L+L QAA ES WG + + + NLFG G V + K
Sbjct: 169 VPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKK 228

Query: 201 -------FDSVQQSIHGYFMNVNRNPAYADL 224
+ S +++ Y + RNP YA +
Sbjct: 229 VKAKFRVYSSYLEALSDYVGLLTRNPRYAAV 259


12  VP1540  VP1589  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VP1540    4       21       -2.013656   hypothetical protein
VP1541    3       19       -2.235744   cytochrome c oxidase subunit CcoP
VP1542    2       20       -4.107016   cytochrome c oxidase subunit CcoQ
VP1543    2       21       -4.248617   cbb3-type cytochrome c oxidase subunit II
VP1544    1       21       -4.660029   cbb3-type cytochrome c oxidase subunit I
VP1545    2       26       -5.874475   hypothetical protein
VP1546    -1      20       -3.349243   hypothetical protein
VP1547    -2      18       -2.794495   sensor histidine kinase/response regulator
VP1549    1       22       0.066423    bacteriophage f237 ORF10
VP1548    2       23       1.478108    hypothetical protein
VP1550    2       22       2.123161    hypothetical protein
VP1551    3       20       1.881403    bacteriophage f237 ORF1
VP1552    2       20       1.259231    bacteriophage f237 ORF2
VP1553    1       20       1.883704    bacteriophage f237 ORF3
VP1554    1       20       1.799464    bacteriophage f237 ORF4
VP1555    1       22       1.852950    hypothetical protein
VP1556    1       22       -1.461229   bacteriophage f237 ORF5
VP1557    1       27       -3.039901   bacteriophage f237 ORF6
VP1558    2       27       -3.122848   bacteriophage f237 ORF7
VP1559    6       32       -6.954126   hypothetical protein
VP1560    6       33       -6.867613   hypothetical protein
VP1561    4       30       -6.035528   bacteriophage f237 ORF8
VP1562    1       19       -1.587433   bacteriophage f237 ORF9
VP1563    2       20       0.390982    hypothetical protein
VP1564    2       21       0.942193    hypothetical protein
VP1565    6       33       5.254388    hypothetical protein
VP1566    6       33       6.029803    structural protein P5
VP1567    6       31       6.018958    hypothetical protein
VP1568    8       34       6.190584    hypothetical protein
VP1569    6       37       6.012092    phage-like protein
VP1570    5       33       6.879508    hypothetical protein
VP1571    4       34       6.793185    hypothetical protein
VP1572    5       34       6.687179    hypothetical protein
VP1573    6       33       6.584656    hypothetical protein
VP1574    7       33       6.681933    hypothetical protein
VP1575    6       32       6.083130    major phage capsid protein
VP1576    7       34       5.290989    hypothetical protein
VP1577    7       36       5.232309    hypothetical protein
VP1578    5       34       3.787721    hypothetical protein
VP1579    5       30       3.570465    hypothetical protein
VP1580    5       28       2.709085    hypothetical protein
VP1582    3       27       2.144031    hypothetical protein
VP1581    4       24       0.441392    hypothetical protein
VP1583    3       23       -0.214119   phage replication initiation protein
VP1584    1       27       -5.007842   hypothetical protein
VP1585    1       26       -5.303188   hypothetical protein
VP1586    0       28       -5.573443   hypothetical protein
VP1587    0       25       -4.941043   hypothetical protein
VP1588    1       26       -3.946484   hypothetical protein
VP1589    -1      23       -3.044974   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1547    HTHFIS        71     5e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 70.6 bits (173), Expect = 5e-15
Identities = 23/118 (19%), Positives = 52/118 (44%), Gaps = 2/118 (1%)

Query: 458 LLIVEDTQSNQLVIKLILNKLGHNVHIASHGAEALTFLEENDTRIDMILMDVSMPVMDGI 517
+L+ +D + + V+ L++ G++V I S+ A ++ D+++ DV MP +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA--AGDGDLVVTDVVMPDENAF 63

Query: 518 TATRLIRKKGITIPIVALTAHALESDKDKCLDAGMDSFVSKPVRRQDIYEAIQSLIET 575
I+K +P++ ++A K + G ++ KP ++ I +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1579    TONBPROTEIN   27     0.045    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 26.9 bits (59), Expect = 0.045
Identities = 15/65 (23%), Positives = 22/65 (33%), Gaps = 11/65 (16%)

Query: 81 VEQLEVSDAVTIDVELPTVEPVPEIVASTDMHSIAVAEPVEAPSVETEPPTEPEPVKKPP 140
+E + +++ + P P+ V P P VE EP EP P
Sbjct: 36 IELPAPAQPISVTMVTPADLEPPQAVQ-----------PPPEPVVEPEPEPEPIPEPPKE 84

Query: 141 FPVKK 145
PV
Sbjct: 85 APVVI 89


13  VP1625  VP1633  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VP1625    5       19       1.715911    hypothetical protein
VP1626    3       18       1.480165    sulfite reductase, gamma subunit-like protein
VP1627    4       18       1.469555    acylphosphatase
VP1628    4       13       0.382192    methyl-accepting chemotaxis protein
VP1629    4       12       0.297907    SAM-dependent methyltransferase
VP1630    5       13       0.341227    calcium-binding outer membrane-like protein
VP1631    2       12       -1.189800   agglutination protein
VP1632    2       11       -1.252946   outer membrane protein
VP1633    2       11       -1.234410   RTX toxin
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1630    RTXTOXINA     58     4e-10    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 58.1 bits (140), Expect = 4e-10
Identities = 38/139 (27%), Positives = 59/139 (42%), Gaps = 14/139 (10%)

Query: 2082 NLIMGTDGDDILVGTDGNDMIIGGLGNDILTGGEGDDVFKWTEMQSATDTVTDFSDGDQL 2141
+ + G +GDD L G DGND +IG GN+ L GG+GDD F+ A + L
Sbjct: 765 DTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAK---------NVL 815

Query: 2142 DFTDVFDDMTGTDISALLDDLGSGDY--RGRVDDITVEVTESGGNSTLTINKDGQQL--- 2196
D + G++ + LLD D G +DI ++ G + +L
Sbjct: 816 FGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDGGKEDKLSLA 875

Query: 2197 EVNFDGASAADIANSLISN 2215
+++F + N LI
Sbjct: 876 DIDFRDVAFKREGNDLIMY 894



Score = 49.6 bits (118), Expect = 1e-07
Identities = 24/81 (29%), Positives = 43/81 (53%), Gaps = 1/81 (1%)

Query: 2072 YQTMALNSELNLIMGTDGDDILVGTDGNDMIIGGLGNDILTGGEGDDVFKWTEMQSATDT 2131
+Q + N++ G G+D L G++G D++ GG G+D+L GG G+D++++
Sbjct: 803 FQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHII 862

Query: 2132 VTDFSDGDQLDFTDV-FDDMT 2151
D D+L D+ F D+
Sbjct: 863 DDDGGKEDKLSLADIDFRDVA 883



Score = 38.4 bits (89), Expect = 4e-04
Identities = 16/34 (47%), Positives = 21/34 (61%)

Query: 2086 GTDGDDILVGTDGNDMIIGGLGNDILTGGEGDDV 2119
G+ DI G DG+D+I G GND L G +G+D
Sbjct: 733 GSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDT 766


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1632    OMPADOMAIN    82     8e-21    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 82.3 bits (203), Expect = 8e-21
Identities = 33/123 (26%), Positives = 53/123 (43%), Gaps = 11/123 (8%)

Query: 87 QMQIRVLFANDSDEINPVFAKQIRELSDFLKEY--PSTSIELQGYASRTGGSEHNLDLSK 144
++ VLF + + P + +L L S+ + GY R G +N LS+
Sbjct: 216 TLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSE 275

Query: 145 RRAENVRKALLQNGITPDRVTIVGYGDT-VLATTGTDEVSH--------ALNRRVTATVV 195
RRA++V L+ GI D+++ G G++ + D V A +RRV V
Sbjct: 276 RRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVK 335

Query: 196 GHK 198
G K
Sbjct: 336 GIK 338


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1633    CABNDNGRPT    73     7e-15    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 73.5 bits (180), Expect = 7e-15
Identities = 38/142 (26%), Positives = 54/142 (38%), Gaps = 13/142 (9%)

Query: 3085 DQADTIYGGAGNDILFGQGGNDKLFGGADNDILIGGLGSDILTGGDGEDIFKWID----V 3140
+ GG+GNDIL G ++ L GGA ND+L GG G+D L GG G D F +
Sbjct: 338 VTIENAIGGSGNDILVGNSADNILQGGAGNDVLYGGAGADTLYGGAGRDTFVYGSGQDST 397

Query: 3141 ANERDTVTDFSSSEDSLDFSDL-------FDDLSKDEVGDLLSDLQSGSHTGDAGGYHVE 3193
D + DF D +D S F G + + +
Sbjct: 398 VAAYDWIADFQKGIDKIDLSAFRNEGQLSFVQDQFTGKGQEV--MLQWDAANSITNLWLH 455

Query: 3194 VSQDGSTDTNLSITKGSSTLDI 3215
+ S D + I ++ DI
Sbjct: 456 EAGHSSVDFLVRIVGQAAQSDI 477


14  VP1654  VP1671  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VP1654    2       16       1.290513    hypothetical protein
VP1655    2       16       1.268240    hypothetical protein
VP1656    4       17       1.267115    translocator protein PopD
VP1657    2       18       1.823689    translocator protein PopB
VP1658    2       18       1.636007    low calcium response locus protein H
VP1659    3       19       1.592781    hypothetical protein
VP1660    1       21       3.048679    type III secretion regulator
VP1661    1       21       2.768353    LcrR
VP1662    2       22       2.561563    low calcium response protein
VP1663    4       21       3.926610    YscY
VP1664    4       21       3.424525    type III secretion protein
VP1665    4       19       2.836256    type III secretion protein
VP1666    2       19       2.729520    hypothetical protein
VP1667    3       19       2.780298    outer membrane protein PopN
VP1668    2       19       3.042342    type III secretion system ATPase
VP1669    2       19       1.429021    type III secretion protein YscO
VP1670    2       18       1.429230    translocation protein in type III secretion
VP1671    2       20       1.354709    type III secretion system protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1656    PF05844       227    4e-75    YopD protein
		>PF05844#YopD protein

Length = 295

Score = 227 bits (580), Expect = 4e-75
Identities = 74/238 (31%), Positives = 135/238 (56%), Gaps = 5/238 (2%)

Query: 97 IKSPSDAVSQSLSLLTLLYQVSKLSREQQVLQREIAVEANVASLQSQAAELNNSASAMIA 156
+++ + S+ LL +L+++++ +RE VLQR+ +A + + ++Q E+ + A+ MIA
Sbjct: 63 MEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQAIIHAQKAQVDEMRSGATLMIA 122

Query: 157 MAVVSGVLAGATAIIGALGSFKAGKEIKTEMASNNVLKTQKAGFDQVEELMNNGNLSKTQ 216
MAV++GV A A+A++G+LG+ K GK I E + + D + + +
Sbjct: 123 MAVIAGVGALASAVVGSLGALKNGKAISQEKTLQKNIDGRNELIDAKMQALGK----TSD 178

Query: 217 QDQVKRAHSLAKDSIADTTAQLTSGGRKFDKLMSSNQAKNAILQALGQMANSASNVEQTK 276
+D+ A D D+ A L + GR F+ + Q N ++Q+ QMAN++ V Q +
Sbjct: 179 EDRKIVGKVWAADQAQDSVA-LRAAGRAFESRNGALQVANTVIQSFVQMANASVQVRQGE 237

Query: 277 AQARSKDDEVQATRAQAAKQKADENIGFQEGLLKELRELFRSISDSQNQAWRASIPTV 334
+QA ++++EV AT Q+ KQK ++ + F G +K++ +L + + S NQAWRA+ V
Sbjct: 238 SQASAREEEVNATIGQSQKQKVEDQMSFDAGFMKDVLQLIQQYTQSHNQAWRAAAGVV 295


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1657    BACINVASINB   32     0.006    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 31.6 bits (71), Expect = 0.006
Identities = 67/381 (17%), Positives = 139/381 (36%), Gaps = 47/381 (12%)

Query: 56 KVQLDAPNAAVSDKVTDLTLKAMAQLQKIVDTIAKALHAVADQTGSIAVKIIAGSADDFE 115
K LD A TD KA + A A +Q ++ A
Sbjct: 199 KEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQNQVSQGEQDNLSNVARLTM 258

Query: 116 VELAAITDKLKSAQNELKIQEVKVAKAKHEQEMAENQEKIKESEAAAKEAQKS----GLA 171
+ I K+ + L+ ++ + A E AE ++K E + ++A+++ G
Sbjct: 259 LMAMFIEIVGKNTEESLQ-NDLALFNALQEGRQAEMEKKSAEFQEETRKAEETNRIMGCI 317

Query: 172 AKIFGWISAVVSIVVGAIMVATGVGAAAG--ALMIAGGVMGAVS---------------- 213
K+ G + +VS+V + AA A+M+A ++ A +
Sbjct: 318 GKVLGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAATGVSFIQQALNPIMEHV 377

Query: 214 ----MALQEPAVQDALKEAGVN---VDVLNKVVMALEIAVAVIGAIVTFGGAAAGGIAKL 266
M L A+ AL+ GV+ ++ +V A+ A+A++ IV G AKL
Sbjct: 378 LKPLMELIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMVAVIVVVAVVGKGAAAKL 437

Query: 267 AAKSASKIAQKVTDIATKAAANMAKVAD-------------MGSKAATTTAKAIRYGAET 313
+ + + + + +A+ +G+ + + E
Sbjct: 438 GNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLGNVGSKMGLQTNALSKEL 497

Query: 314 VDLTVN----IGKGATDSVHAANNANVTEIQADITDLRAKMTLSQAVIDKLKEEIGKLME 369
V T+N + + +A + ++ A L++ +D++++ + + +E
Sbjct: 498 VGNTLNKVALGMEVTNTAAQSAGGVAEGVFIKNASEALADFMLARFAMDQIQQWLKQSVE 557

Query: 370 DFQELMSIIMQMIQAKSETMQ 390
F E + ++ +A S +Q
Sbjct: 558 IFGENQKVTAELQKAMSSAVQ 578


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1658    SYCDCHAPRONE  153    1e-50    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 153 bits (389), Expect = 1e-50
Identities = 76/154 (49%), Positives = 112/154 (72%)

Query: 7 TDPSQMQAEELLSFLEEGGTLKMLHDVSADTIEHIYAVGYNFFQSGKIEQAAKVFQLLSM 66
T +Q + SFL+ GGT+ ML+++S+DT+E +Y++ +N +QSGK E A KVFQ L +
Sbjct: 5 TTDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCV 64

Query: 67 LDHYQARFFIGLGAARQELGEYLQAIDAYSYAALVDINDPRPPFHSAECHLKLEQLTEAE 126
LDHY +RFF+GLGA RQ +G+Y AI +YSY A++DI +PR PFH+AEC L+ +L EAE
Sbjct: 65 LDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAE 124

Query: 127 SGFYSAKEMSAGKSQYADLHQRAGIMLEAVRNKR 160
SG + A+E+ A K+++ +L R MLEA++ K+
Sbjct: 125 SGLFLAQELIADKTEFKELSTRVSSMLEAIKLKK 158


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1659    LCRVANTIGEN   114    3e-30    Low calcium response V antigen signature.
		>LCRVANTIGEN#Low calcium response V antigen signature.

Length = 326

Score = 114 bits (287), Expect = 3e-30
Identities = 76/233 (32%), Positives = 127/233 (54%), Gaps = 20/233 (8%)

Query: 392 GALKDRLLNITEQEKKDLEVRAEHSLTARDLLAVVESSI-GDRFDEQVLFALNERRVNRL 450
G ++L N ++ K+ LE R +AV+ S+ DR D+ +L + + +
Sbjct: 88 GHYDNQLQNGIKRVKEFLESSPNTQWELRAFMAVMHFSLTADRIDDDILKVIVDSMNHHG 147

Query: 451 EKRNEQKEALQDLTVQLKIFGVVQSKIHSTQSVDGTYKPDDNAFSASDFNYNSVTD---F 507
+ R++ +E L +LT +LKI+ V+Q++I+ S GT D + + D N TD F
Sbjct: 148 DARSKLREELAELTAELKIYSVIQAEINKHLSSSGTINIHDKSINLMDKNLYGYTDEEIF 207

Query: 508 QNSPEYKYL------------TDNGITTHTDFL----KKQGVTVADGASFKDEEKTKKLS 551
+ S EYK L ++ I + DFL K+ G S+ + +LS
Sbjct: 208 KASAEYKILEKMPQTTIQVDGSEKKIVSIKDFLGSENKRTGALGNLKNSYSYNKDNNELS 267

Query: 552 NFSSSVSDKSKLLNDEVQIKTTELNDISSQYNSTVEAMNKFVQKYHSILQEIL 604
+F+++ SDKS+ LND V KTT+L+DI+S++NS +EA+N+F+QKY S++Q +L
Sbjct: 268 HFATTCSDKSRPLNDLVSQKTTQLSDITSRFNSAIEALNRFIQKYDSVMQRLL 320


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1667    PF07201       269    5e-92    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 269 bits (688), Expect = 5e-92
Identities = 96/298 (32%), Positives = 163/298 (54%), Gaps = 6/298 (2%)

Query: 1 MSIINSQIATNTKFDASVRNGLESSRADSAVKGSYRGETVRVHNAT-QSLFDAMEELTSL 59
M+ +++ NT R + SS+ + G +RGE+V++ + T QS+ D EE+T +
Sbjct: 1 MTTLHNLSYGNTPLHNE-RPEIASSQIVNQTLGQFRGESVQIVSGTLQSIADMAEEVTFV 59

Query: 60 GSEKAEKDLTKRKIKDGGVRVNEAHELVSDYLRKVPDLEKNQKIKDLAAKMAGGNISTIA 119
SE+ E L KRK+ D RV++ E V+ YL KVP+LE+ Q + +L + ++ +++
Sbjct: 60 FSERKELSLDKRKLSDSQARVSDVEEQVNQYLSKVPELEQKQNVSELLSLLSNSPNISLS 119

Query: 120 QLQAYLNGFSEEKSHQYLALKAVKKYLSANPESKHLLALIDQAILKIEQNPDSWSQIDTE 179
QL+AYL G SEE S Q+ L ++ L PE HL L++QA++ + + +
Sbjct: 120 QLKAYLEGKSEEPSEQFKMLCGLRDALKGRPELAHLSHLVEQALVSMAEEQGETIVLGAR 179

Query: 180 IRVSHFADEFSKEQEFSSLHQLRGFYRDTVHSYQGLGSAYQDVVERFGEQEVSTAVDFML 239
I + + + + L LR YRD V YQG+ + + D+ +RF ++ + + F+
Sbjct: 180 ITPEAYRES---QSGVNPLQPLRDTYRDAVMGYQGIYAIWSDLQKRFPNGDIDSVILFLQ 236

Query: 240 QGMSADLSVQGSNIDSVKLQLLMSDMQKLKTLNTLQDQVGRLFQMFKPERMSHGLSGF 297
+ +SADL Q S KL +++SD+QKLK ++ DQV +Q F E ++G+ F
Sbjct: 237 KALSADLQSQQSGSGREKLGIVISDLQKLKEFGSVSDQVKGFWQFFS-EGKTNGVRPF 293


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1669    IGASERPTASE   29     0.007    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.3 bits (65), Expect = 0.007
Identities = 18/137 (13%), Positives = 37/137 (27%), Gaps = 7/137 (5%)

Query: 13 ADRADKAVQRQEYRVANVAAELQKAERSVADYHVWRQEEEERRFAKAKQQTVLLKELETL 72
+++ Q VA +E ++ + + ++EE+ + K Q + +
Sbjct: 1071 EAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQ----EVPKVT 1126

Query: 73 RQEIALLREREAELKQRVAE---VKVTLEQERTLLKQKQQEALQAHKTKEKFVQLQQQEI 129
Q + E Q +E + Q K V+ E
Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTES 1186

Query: 130 AEQSRQQQYQEELEQEE 146
+ E E
Sbjct: 1187 TTVNTGNSVVENPENTT 1203


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1670    IGASERPTASE   33     0.002    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.5 bits (76), Expect = 0.002
Identities = 39/245 (15%), Positives = 92/245 (37%), Gaps = 6/245 (2%)

Query: 9 TPSTQPHSPQSPMPIDDHAMLQSRFERALKENPDQTKPQPNETNNQAALEAKKPFTENYS 68
T T P++ Q+ +P ++ + E A + P P + A+ E+ +
Sbjct: 995 TNITTPNNIQADVP----SVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKT 1050

Query: 69 ERSLASLHSTGSNQRKTAEKSAFAQNNDVVTENINNESDTSSPIALNTQDKMPSDMDANM 128
+ + Q + K A + N +S + + T+ K + ++
Sbjct: 1051 VEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEE 1110

Query: 129 KPDIRIPTTGDK-KLPMEPSNKREKNADEQDAFERLVEDHDDSKTELAAAKTAESHESTQ 187
K + T + K+ + S K+E++ Q E E+ + ++T + ++ Q
Sbjct: 1111 KAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQ 1170

Query: 188 SPKDT-KHIPTAGTKPNTVETVSDALASTLASTTVASATSASLASHPVTSTVHKTATKTE 246
K+T ++ T+ TV T + + + +T + + + S H+ + ++
Sbjct: 1171 PAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSV 1230

Query: 247 VNKHE 251
+ E
Sbjct: 1231 PHNVE 1235



Score = 29.3 bits (65), Expect = 0.044
Identities = 29/201 (14%), Positives = 59/201 (29%), Gaps = 5/201 (2%)

Query: 33 FERALKENPDQTKPQPNETNNQAALEAKKPFTENYSERSLASLHSTGSNQRKTAEKSAFA 92
+ + QT + +AK + + S S Q +T + A
Sbjct: 1086 AQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEP 1145

Query: 93 QNNDVVTENINNESDTSSPIALNTQDKMPSDMDANMKPDIRIPTTGDKKLPMEPSNKREK 152
+ T NI ++ A Q + +N++ + TT + + + +
Sbjct: 1146 ARENDPTVNIKEPQSQTNTTADTEQPA--KETSSNVEQPVTESTTVNTGNSVVENPENTT 1203

Query: 153 NADEQDAFERLVEDHDDSKTELAAAKTAESHESTQSPKDTKHIPTAGTKPNTVETVSDAL 212
A Q E + K + + H + + T T + L
Sbjct: 1204 PATTQP--TVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVL 1261

Query: 213 ASTLAST-TVASATSASLASH 232
+ A VA +++ H
Sbjct: 1262 SDARAKAQFVALNVGKAVSQH 1282


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1671    TYPE3OMOPROT  84     1e-20    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature.

Length = 303

Score = 83.9 bits (207), Expect = 1e-20
Identities = 44/155 (28%), Positives = 61/155 (39%), Gaps = 12/155 (7%)

Query: 170 TQHIALPVWLSLGKTHLDLNQFHSLELGDVIFFDQCYIAQQQAIVQVSNKNLWRCQLEDN 229
L + T L + +GDV+ A V K L +
Sbjct: 147 MLRWPLRFVIGSSDTQRSL--LGRIGIGDVLLIRTSR-----AEVYCYAKKLGHFNRVEG 199

Query: 230 -----TLYIIEKETNMNDVNTSETLTDHQQLPVELTFDIGHQTVTLEQLNQLQPGYVFEL 284
TL I E N T+ETL QLPV+L F + + VTL +L + + L
Sbjct: 200 GIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYRKNVTLAELEAMGQQQLLSL 259

Query: 285 NQPVSKPVTLRANGKIIGECELVNVNDHLGVRVLE 319
V + ANG ++G ELV +ND LGV + E
Sbjct: 260 PTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHE 294


15  VP1682  VP1697  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VP1682    2       20       0.241622    hypothetical protein
VP1681    3       21       0.060828    hypothetical protein
VP1683    3       23       0.428648    hypothetical protein
VP1684    3       21       0.487528    hypothetical protein
VP1685    4       21       0.981928    hypothetical protein
VP1686    3       22       1.236953    effector protein
VP1687    4       22       1.174182    type III chaperone
VP1688    3       22       1.631544    type III secretion system protein
VP1689    3       24       1.618096    type III secretion protein
VP1690    1       23       2.107280    type III secretion lipoprotein
VP1691    1       19       2.972163    type III export protein
VP1692    0       17       2.493023    type III export protein
VP1693    1       19       2.126782    type III secretion protein
VP1694    1       20       1.670627    type III export protein YscF
VP1695    -1      20       1.121393    type III export protein PscD
VP1696    -1      20       -0.052342   type III secretion protein YscC
VP1697    2       23       -2.460475   type III export apparatus protein NosA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1682    PF05932       46     4e-09    Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 45.6 bits (108), Expect = 4e-09
Identities = 29/124 (23%), Positives = 47/124 (37%), Gaps = 5/124 (4%)

Query: 3 TIQPLLDEFCRLNELPPLILEDGNRCQLLVDDRFVLYFTATEDDALMLSVAFGGLEKSGE 62
+ LLD+F R E+ PL+ +D C +++D+ F L + D A + G LE +
Sbjct: 5 FYKTLLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTL--SCDYARERLLLIGLLEPHKD 62

Query: 63 LRVRGLELLARANYQRVGSGNLALSLAPNGRQLVLAGRQPTEHLNSANLTVWFHEIIEQT 122
+ + L L N L L P E L+ L ++E
Sbjct: 63 IPQQCL-LAGALNPLLNAGP--GLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWM 119

Query: 123 ELWQ 126
W+
Sbjct: 120 RGWR 123


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1686    YERSSTKINASE  32     0.006    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 31.6 bits (71), Expect = 0.006
Identities = 23/70 (32%), Positives = 37/70 (52%), Gaps = 4/70 (5%)

Query: 25 GKLSIGGKEYTINAATQEFTRANPTSGAVARFFEATGKLFREGSTQ-SVAKAITKAVFDN 83
G+L+IGGK Y I + R NP SG + F E GK+F S+A+ +T +
Sbjct: 32 GELNIGGKRYRI--IDNQVLRLNPHSG-FSLFREGVGKIFSGKMFNFSIARNLTDTLHAA 88

Query: 84 EQGQAQRLQT 93
++ +Q L++
Sbjct: 89 QKTTSQELRS 98


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1687    PF05932       49     1e-10    Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 49.0 bits (117), Expect = 1e-10
Identities = 32/121 (26%), Positives = 59/121 (48%), Gaps = 6/121 (4%)

Query: 1 MANGFITALMTDFAHRCQIEQLFFDGDDCCHLLIDQDTAITVRAE--DDRLTLIGLISGD 58
M+N F L+ DF+ +++ L FD C+++ID A+T+ + +RL LIGL+
Sbjct: 1 MSNLFYKTLLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTLSCDYARERLLLIGLLEPH 60

Query: 59 K--PEHDVMLQYMKASLTQGSPAVYWDEEVG-FVGFVHLSQQWLDAAILDESLGNFIEWL 115
K P+ +L L P + DE+ G + + + ++ L L + +EW+
Sbjct: 61 KDIPQQC-LLAGALNPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWM 119

Query: 116 K 116
+
Sbjct: 120 R 120


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1688    FLGFLIH       37     2e-05    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 37.1 bits (85), Expect = 2e-05
Identities = 41/161 (25%), Positives = 68/161 (42%), Gaps = 22/161 (13%)

Query: 51 QQAYETEKQRGYQDGLEQAKIENAQAMVATLARCNEYYLQ-----------VEHKMTNVV 99
QQ ++ Q G GLEQ E AR + + + ++ +
Sbjct: 65 QQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMA 124

Query: 100 LDAVRKIIDTFDDVDTT--ISVVREALQ---LVSNQKQVILHVHPEQVVDVREKVAGVLS 154
L+A R++I VD + I +++ LQ L S + Q L VHP+ + V + + LS
Sbjct: 125 LEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQ--LRVHPDDLQRVDDMLGATLS 182

Query: 155 DFPEVGYVDVVADARLKNGGCILETEVGIIDASIDGQIQAL 195
+ + D L GGC + + G +DAS+ + Q L
Sbjct: 183 ----LHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQEL 219


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1690    FLGMRINGFLIF  65     4e-14    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 64.6 bits (157), Expect = 4e-14
Identities = 33/168 (19%), Positives = 73/168 (43%), Gaps = 8/168 (4%)

Query: 22 TELYTNVSQKEGNEMLSILLSEGVVATKEPDKDNKVKLMVDSSQIAFAVDALKRKGYPRE 81
L++N+S ++G +++ L + + V + ++ L ++G P+
Sbjct: 51 RTLFSNLSDQDGGAIVAQLTQMNIPYRFANGSGA---IEVPADKVHELRLRLAQQGLPKG 107

Query: 82 QFSTLKEVFPKDDLISSPLAERARLVYAKSQELSSTLSQIDGVLVARVHVVL-EDQDLRP 140
+ E+ ++ S +E+ A EL+ T+ + V ARVH+ + +
Sbjct: 108 G-AVGFELLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVR 166

Query: 141 GERPTPASASVFIKHAADVALD-SYVPQIKLLVNNSIEGLNYDRISVV 187
++ SASV + ALD + + LV++++ GL +++V
Sbjct: 167 EQK--SPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVTLV 212


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1692    PF09025       62     1e-14    YopR Core
		>PF09025#YopR Core

Length = 143

Score = 62.0 bits (150), Expect = 1e-14
Identities = 34/119 (28%), Positives = 47/119 (39%), Gaps = 1/119 (0%)

Query: 89 PQTQKEALWYAFHQAKSAKGTDDAVPELLSVLKQELLGDFAGQLMAEPPTDRAALKAMLA 148
P + + + L + LL FA L DR LKAML
Sbjct: 24 PAVDQVLAFEQALGGEPPAAGRRLAGLENGALGERLLQRFAQPLQG-LEADRLELKAMLR 82

Query: 149 QSFPLGAQKEQALWHCWAELKSLPEMTSTVDLVREELSFVIQKNAMVKNIMTHSHKLDL 207
PLG Q++ L ++ P L R EL +I N M+ N++ +SHKLDL
Sbjct: 83 AELPLGRQQQTFLLQLLGAVEHAPGGEYLAQLARRELQVLIPLNGMLDNLVRNSHKLDL 141


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1696    TYPE3OMGPROT  577    0.0      Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.
Length = 607

Score = 577 bits (1489), Expect = 0.0
Identities = 288/594 (48%), Positives = 406/594 (68%), Gaps = 21/594 (3%)

Query: 34 ATELNWPEQPFRYYADNDSLKDLLNNFGANYRVSVSVSDKVNDRVSGRFTPEDPAEFLDY 93
A EL+W P+ Y A +SL+DLL +FGANY +V VSDK+ND+VSG+F ++P +FL +
Sbjct: 26 AQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKINDKVSGQFEHDNPQDFLQH 85

Query: 94 LAQVYNLMWYFDGAVLHVYKATETRSRLLQLELLTARELRSTLISTGVWDARYGWRAAEN 153
+A +YNL+WY+DG VL+++K +E SRL++L+ A EL+ L +G+W+ R+GWR +
Sbjct: 86 IASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQALQRSGIWEPRFGWRPDAS 145

Query: 154 KGLVYLAGPPRYVELVVQTAEALESRLLQKSNSTDELFVELIPLKYASATDRSISYRDQS 213
LVY++GPPRY+ELV QTA ALE + +S T L +E+ PLKYASA+DR+I YRD
Sbjct: 146 NRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEIFPLKYASASDRTIHYRDDE 205

Query: 214 ITVPGIASVLSRVVGGVQTQITDSASVQTSSVNGLPAEAAKPRGKTASVHGGATVEAEPG 273
+ PG+A++L RV+ A++Q +V+ A R A VEA+P
Sbjct: 206 VAAPGVATILQRVL--------SDATIQQVTVDNQRIPQAATRAS-----AQARVEADPS 252

Query: 274 LNAIIVRDTQARLPLYRKLVAQLDQPQSRIEVALSIVDISANDLRQLGVDWRAGVSVGNN 333
LNAIIVRD+ R+P+Y++L+ LD+P +RIEVALSIVDI+A+ L +LGVDWR G+ GNN
Sbjct: 253 LNAIIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDINADQLTELGVDWRVGIRTGNN 312

Query: 334 RIVDIKTTGDVDNGDVTLGSGQSFKSLLDSTNLNYLLAQIRLLESKGSAQVVSRPTLLTQ 393
V IKTTGD N + S + SL+D+ L+YLLA++ LLE++GSAQVVSRPTLLTQ
Sbjct: 313 HQVVIKTTGDQSN----IASNGALGSLVDARGLDYLLARVNLLENEGSAQVVSRPTLLTQ 368

Query: 394 ENVEAVLNNSSTFYVKLVGKETAALEEVTYGTLLRIVPRIVGDRFATRPEINLSLHLEDG 453
EN +AV+++S T+YVK+ GKE A L+ +TYGT+LR+ PR++ + EI+L+LH+EDG
Sbjct: 369 ENAQAVIDHSETYYVKVTGKEVAELKGITYGTMLRMTPRVLTQ--GDKSEISLNLHIEDG 426

Query: 454 AKIPDG-GVDDLPSVRKTEISTLATVKQGQSLLIGGVYRDEVSHQLRKVPLLGDIPYLGA 512
+ P+ G++ +P++ +T + T+A V GQSL+IGG+YRDE+S L KVPLLGDIPY+GA
Sbjct: 427 NQKPNSSGIEGIPTISRTVVDTVARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGA 486

Query: 513 LFRSNTNTTRRTVRMFIIEPRIVVDGIGDSVLIGNEHDLRPSIGQLNNISNNSAEFKSVV 572
LFR + TRRTVR+FIIEPRI+ +GI + +GN DLR I ++ ISN S ++
Sbjct: 487 LFRRKSELTRRTVRLFIIEPRIIDEGIAHHLALGNGQDLRTGILTVDEISNQSTTLNKLL 546

Query: 573 EVFSCTSKTQAERYQQDLLSQQKSSLLTQCQLPSGQVGWRVKVAECDLSQAECV 626
C +A+ Q+ L KSS LTQC++ +GWRV C +Q+ CV
Sbjct: 547 GGSQCQPLNKAQEVQKWLSQNNKSSYLTQCKM-DKSLGWRVVEGACTPAQSWCV 599


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1697    PF05932       39     1e-06    Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 39.0 bits (91), Expect = 1e-06
Identities = 15/124 (12%), Positives = 40/124 (32%), Gaps = 8/124 (6%)

Query: 3 DKMMKSLAETLGIGPFIAGENGAYTIEVD-QLTLTIKQHSSWILWETALPLRFNEHLDYQ 61
++ + +L + P + ++G + +D LT+ + L+
Sbjct: 7 KTLLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTLSCDYARERLLLIGL------LEPH 60

Query: 62 QEQALKRCMQLSLKTLRDTPSVLTTNADQQLILQGKAM-IENTSNDQFAELLAQHANVCE 120
++ + + +L L + L + L +++ E S +A
Sbjct: 61 KDIPQQCLLAGALNPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWMR 120

Query: 121 RYME 124
+ E
Sbjct: 121 GWRE 124


16  VP1786  VP1864  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP1786    2       30       -4.457148  hypothetical protein
VP1787    4       32       -5.055475  transposase
VP1788    4       34       -7.358776  transposase
VP1789    4       32       -7.718395  hypothetical protein
VP1790    3       31       -7.775061  hypothetical protein
VP1791    7       33       -8.170848  hypothetical protein
VP1792    5       29       -6.586632  hypothetical protein
VP1793    6       28       -6.014992  hypothetical protein
VP1794    7       29       -5.791317  hypothetical protein
VP1795    7       32       -5.999836  hypothetical protein
VP1796    4       32       -6.594445  hypothetical protein
VP1797    0       33       -7.650187  hypothetical protein
VP1798    1       34       -7.664994  hypothetical protein
VP1799    2       33       -7.370772  hypothetical protein
VP1800    3       33       -7.867491  hypothetical protein
VP1801    3       32       -7.969385  hypothetical protein
VP1802    4       33       -7.878946  hypothetical protein
VP1803    4       27       -6.486789  hypothetical protein
VP1804    3       27       -6.832082  hypothetical protein
VP1805    3       27       -6.678314  hypothetical protein
VP1806    0       25       -5.588930  hypothetical protein
VP1807    -1      24       -5.239306  hypothetical protein
VP1808    -1      26       -5.914854  hypothetical protein
VP1809    0       27       -5.859383  hypothetical protein
VP1810    1       25       -5.397386  hypothetical protein
VP1811    -1      26       -6.078276  hypothetical protein
VP1812    0       30       -6.308289  hypothetical protein
VP1813    2       31       -6.360794  hypothetical protein
VP1814    3       31       -6.429933  hypothetical protein
VP1815    2       31       -6.356229  hypothetical protein
VP1816    3       35       -7.005020  Hit protein involved in cell-cycle regulation
VP1817    2       35       -6.381771  hypothetical protein
VP1818    2       39       -9.398192  hypothetical protein
VP1819    3       36       -9.913710  hypothetical protein
VP1820    1       32       -8.919461  hypothetical protein
VP1821    1       31       -8.860033  YefM protein
VP1822    1       30       -9.008137  hypothetical protein
VP1823    1       31       -8.939761  hypothetical protein
VP1824    0       27       -7.236585  hypothetical protein
VP1825    -1      28       -6.685690  hypothetical protein
VP1826    0       28       -7.220747  hypothetical protein
VP1827    0       27       -6.665009  spermine/spermidine acetyltransferase BltD
VP1828    -1      27       -5.569976  PnuC protein
VP1829    0       26       -6.116528  RelB protein/Vco27A protein
VP1830    3       28       -7.314022  hypothetical protein
VP1831    1       30       -6.978130  hypothetical protein
VP1832    2       32       -5.613988  hypothetical protein
VP1833    3       34       -4.839730  hypothetical protein
VP1834    2       34       -5.686691  hypothetical protein
VP1835    4       37       -7.221708  hypothetical protein
VP1836    3       43       -7.656886  hypothetical protein
VP1837    3       39       -7.597560  hypothetical protein
VP1838    4       36       -7.784682  hypothetical protein
VP1839    4       36       -8.025270  hypothetical protein
VP1840    3       35       -8.335032  hypothetical protein
VP1841    2       36       -7.112075  hypothetical protein
VP1842    2       32       -6.433030  RelB protein
VP1843    2       32       -7.349170  hypothetical protein
VP1844    0       34       -7.185974  hypothetical protein
VP1845    -1      32       -6.740640  hypothetical protein
VP1846    1       30       -6.056334  hypothetical protein
VP1847    0       29       -6.844191  hypothetical protein
VP1848    0       29       -7.191340  hypothetical protein
VP1849    1       31       -6.910167  hypothetical protein
VP1850    1       30       -7.164660  acetyltransferase
VP1851    2       31       -7.394877  hypothetical protein
VP1852    1       30       -7.496615  hypothetical protein
VP1853    0       28       -6.542858  hypothetical protein
VP1854    0       25       -5.288187  hypothetical protein
VP1855    0       23       -5.272267  hypothetical protein
VP1856    0       28       -3.765338  hypothetical protein
VP1857    2       26       -4.321026  hypothetical protein
VP1858    1       24       -4.905251  hypothetical protein
VP1859    1       24       -5.537930  hypothetical protein
VP1860    0       23       -4.728675  hypothetical protein
VP1861    0       21       -4.312118  hypothetical protein
VP1862    0       21       -4.442927  threonine efflux protein
VP1864    0       19       -3.726730  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1787    PilS_PF08805  29     0.018    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 28.7 bits (64), Expect = 0.018
Identities = 13/67 (19%), Positives = 26/67 (38%), Gaps = 2/67 (2%)

Query: 76 NVKTIAASMKRQDLTPKAARKFKCTTDSKHKMPVAPNLLAQDFNAAAPNQKWAGDITYVA 135
NV T+ A+MK + + + P+ + D A+ W G +T
Sbjct: 66 NVLTVIANMKSLKFQ--GRYTDSNYIKTLYAQGLLPSDMIADTTGASAKNPWGGSVTITT 123

Query: 136 TSEGWLY 142
+S+ + +
Sbjct: 124 SSDKYSF 130


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1794    SACTRNSFRASE  36     2e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.5 bits (84), Expect = 2e-05
Identities = 20/79 (25%), Positives = 29/79 (36%), Gaps = 4/79 (5%)

Query: 101 VDVSCRKQGIGQRLIDAVINFCQEKAEIDWLDLCVLSENFPAKNLYLKAGFDVVGEFKDQ 160
V RK+G+G L+ I + +E L L N A + Y K F + D
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKEN-HFCGLMLETQDINISACHFYAKHHFIIGA--VDT 153

Query: 161 YRIDGLSVS-ETAMTKYVK 178
+ E A+ Y K
Sbjct: 154 MLYSNFPTANEIAIFWYYK 172


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1808    SYCDCHAPRONE  31     0.003    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 30.7 bits (69), Expect = 0.003
Identities = 16/98 (16%), Positives = 27/98 (27%), Gaps = 14/98 (14%)

Query: 15 LSFLIFTEPNNYKHYYERGILYGGEHIKSTECRESDYHGDYSQAIEDFDKVIELQPNFAN 74
L + + + + G G Y AI + +
Sbjct: 59 FQALCVLDHYDSRFFLGLGACR---QAM----------GQYDLAIHSYSYGAIMDIKEPR 105

Query: 75 AYYHKACIYFDLGIDDDEAESLLEKALALEPNNSAYTI 112
+H A G + EAES L A L + + +
Sbjct: 106 FPFHAAECLLQKG-ELAEAESGLFLAQELIADKTEFKE 142


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1819    PF03544       36     2e-05    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 36.1 bits (83), Expect = 2e-05
Identities = 12/62 (19%), Positives = 25/62 (40%), Gaps = 5/62 (8%)

Query: 85 GYVLINYLIDSNGEIFNPTIVESSPKGEWDLIALKALSKVEYVPSESNSLNIPVYVTTEI 144
G V + + + +G + N I+ + P ++ A+ + Y P + S + I
Sbjct: 178 GQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSG-----IVVNI 232

Query: 145 KF 146
F
Sbjct: 233 LF 234


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1827    SACTRNSFRASE  33     4e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 32.6 bits (74), Expect = 4e-04
Identities = 27/156 (17%), Positives = 57/156 (36%), Gaps = 37/156 (23%)

Query: 4 LIPQWESKNLVFTEFSSDEAELAKSIFDSNLNVKAMDPTFREWSIIEYEKLIAESNKSKL 63
+IP +E+ +TE K D +++V ++
Sbjct: 26 MIPAFENGVWTYTEERF-SKPYFKQYEDDDMDVSYVE----------------------- 61

Query: 64 SDEQGAFYLRKISTKKGQVVGYIQMEFHAPSTGTLWLPMLTILPSFKGKGLGSEIVSSVI 123
+E A +L + +G I + + G + + + ++ KG+G ++++
Sbjct: 62 -EEGKAAFLYYLENN---CIGRI--KIRSNWNGYALIEDIAVAKDYRKKGVG----TALL 111

Query: 124 AVACEYA---NLQNVGLNVYAENISAFRFWYRQGFT 156
A E+A + + L NISA F+ + F
Sbjct: 112 HKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1850    SACTRNSFRASE  53     2e-11    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 52.6 bits (126), Expect = 2e-11
Identities = 26/112 (23%), Positives = 48/112 (42%), Gaps = 5/112 (4%)

Query: 17 WSTTEAML----LRDADSKENIGKYLKRNPNLSFVALDGDNIIGAILVGTD-GRRGYVQH 71
W+ TE + + + Y++ +F+ +N IG I + ++ ++
Sbjct: 35 WTYTEERFSKPYFKQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIED 94

Query: 72 LAVDSTFRGKGVGAKLISSAVEALSKVGIAKTHLFVANENINAQSFYEKLGW 123
+AV +R KGVG L+ A+E + L + NI+A FY K +
Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1861    SACTRNSFRASE  41     4e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 41.1 bits (96), Expect = 4e-07
Identities = 20/79 (25%), Positives = 40/79 (50%), Gaps = 5/79 (6%)

Query: 74 FISFVDNKPAGFAVCFESFSTYRAQRVMNVHDFMVSDNFRGKGVGKAQLNGIEQYCRDND 133
F+ +++N G +++ Y + D V+ ++R KGVG A L+ ++ ++N
Sbjct: 68 FLYYLENNCIGRIKIRSNWNGY-----ALIEDIAVAKDYRKKGVGTALLHKAIEWAKENH 122

Query: 134 YLKITLEVGDDNVAAKKLY 152
+ + LE D N++A Y
Sbjct: 123 FCGLMLETQDINISACHFY 141


17  VP1881  VP1887  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
VP1881    -2      19       -3.757734  hypothetical protein
VP1882    -1      23       -5.141889  hypothetical protein
VP1883    1       28       -7.298589  6-pyruvoyl tetrahydrobiopterin synthase
VP1884    2       32       -8.242025  hypothetical protein
VP1885    0       24       -6.312813  hypothetical protein
VP1886    1       22       -5.628645  hypothetical protein
VP1887    0       15       -3.336798  hypothetical protein
18  VP1921  VP1934  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
VP1921    2       21       -1.724347  GTP cyclohydrolase II
VP1922    3       21       -1.533243  hypothetical protein
VP1923    3       19       -1.427364  NrfF protein
VP1924    1       16       -0.272528  thiol:disulfide interchange protein DsbE
VP1925    0       15       0.049556   NrfE protein
VP1926    -2      14       -0.536274  formate dependent nitrate reductase NrfD
VP1927    -1      11       0.138806   nitrite reductase Fe-S protein NrfC
VP1928    0       12       0.142833   cytochrome c nitrite reductase pentaheme
VP1929    4       20       0.288443   cytochrome c552
VP1930    6       28       -0.238706  hypothetical protein
VP1931    6       29       -0.232153  YfrE protein
VP1932    4       26       -0.055644  DNA gyrase subunit A
VP1933    2       22       -0.292061  3-demethylubiquinone-9 3-methyltransferase
VP1934    2       22       -0.408138  ribonucleotide-diphosphate reductase subunit
19  VP1992  VP2029  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP1992    2       22       1.001069   hypothetical protein
VP1993    1       26       1.671720   transcriptional regulator
VP1994    1       23       2.503168   isochorismatase-like protein
VP1995    1       23       2.656134   ABC transporter ATP-binding protein
VP1996    0       23       2.142874   hypothetical protein
VP1997    1       22       1.194302   hypothetical protein
VP1998    0       20       0.636243   outer membrane protein TolC
VP1999    1       17       0.191037   hypothetical protein
VP2000    2       20       -1.021352  ribosomal-protein-alanine N-acetyltransferase
VP2001    1       20       0.827848   hypothetical protein
VP2002    1       20       0.868237   hypothetical protein
VP2003    0       20       1.609523   hypothetical protein
VP2004    1       21       1.911473   hypothetical protein
VP2005    1       25       2.177415   thioredoxin
VP2006    1       25       2.633283   suppressor for copper-sensitivity B
VP2007    3       25       2.799491   membrane protein, suppressor for
VP2008    2       24       3.287249   LysR family transcriptional regulator
VP2009    3       21       3.967937   tetrathionate reductase complex: response
VP2010    3       20       3.967937   tetrathionate reductase complex: sensory
VP2012    3       20       4.154828   hypothetical protein
VP2011    3       20       3.456015   tetrathionate reductase subunit B
VP2013    1       17       2.648795   tetrathionate reductase complex subunit C
VP2014    1       16       2.296620   tetrathionate reductase subunit A
VP2015    1       15       0.275919   cytochrome c
VP2016    1       15       -0.259388  hypothetical protein
VP2017    1       17       -0.688555  paraquat-inducible protein A
VP2018    1       20       -2.473053  paraquat-inducible protein B
VP2019    2       25       -3.751483  hypothetical protein
VP2020    2       22       -4.029665  hypothetical protein
VP2021    1       20       -4.466513  hypothetical protein
VP2022    0       19       -3.797313  glycosyl transferase family protein
VP2023    -1      19       -4.619359  nucleotide sugar epimerase
VP2024    -2      16       -3.102588  hypothetical protein
VP2025    0       19       -2.317405  hypothetical protein
VP2026    4       26       -1.039578  orotidine 5'-phosphate decarboxylase
VP2027    4       26       -1.167757  tetratricopeptide repeat protein
VP2028    4       31       -0.761572  hypothetical protein
VP2029    2       27       -0.281594  integration host factor subunit beta
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1994    ISCHRISMTASE  45     2e-08    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 45.4 bits (107), Expect = 2e-08
Identities = 43/182 (23%), Positives = 73/182 (40%), Gaps = 13/182 (7%)

Query: 2 SNSALLVIDIQN---DYFPNGRFPLWNTDATLDNIKQLMARAKAQDIPI-FLVQHVSSAP 57
+ + LL+ D+QN D F G P+ NI++L + IP+ + Q S P
Sbjct: 29 NRAVLLIHDMQNYFVDAFTAGASPV---TELSANIRKLKNQCVQLGIPVVYTAQPGSQNP 85

Query: 58 KGKA---PFFEEGSVGVEIHPDIIS-ICPDAEIIQ--KQHADSFYQTDLEQALERNGVDE 111
+A F+ G II+ + P+ + + K +F +T+L + + + G D+
Sbjct: 86 DDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQ 145

Query: 112 LLICGMMTQNCVTHTAISKAAEKYNVSIIEDCCTTTDQMIHNIALSAVSIRVPLLASSDV 171
L+I G+ TA E + D H +AL + R +D
Sbjct: 146 LIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCAFTVMTDS 205

Query: 172 LL 173
LL
Sbjct: 206 LL 207


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1999    RTXTOXIND     60     6e-12    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 59.8 bits (145), Expect = 6e-12
Identities = 37/193 (19%), Positives = 76/193 (39%), Gaps = 14/193 (7%)

Query: 40 GTIEKQAVAVGKIVPA-HSVSIKSQIDGIVGEIYAKVGEKVKQGQPLIKVRPNPTPQALT 98
G +E A A GK+ + S IK + IV EI K GE V++G L+K+
Sbjct: 78 GQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTA-------L 130

Query: 99 DASAELMRSEADLESAKQKLSNLESLVKQDIIPSNYDEYVSARSAVKSAQADVLQKRQNL 158
A A+ +++++ L A+ + + + L I N + + +
Sbjct: 131 GAEADTLKTQSSLLQARLEQTRYQILS--RSIELNKLPELKLPDEPYFQNVSEEEVLRLT 188

Query: 159 ELIRSGEASIGDARLTSTIYAPIDGTVLNQKVEVGEPIISTQSSQAATEMMSLADMNSLI 218
LI+ ++ + + ++ + I+ + + E L D +SL+
Sbjct: 189 SLIKEQFSTWQNQKYQKE----LNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLL 244

Query: 219 FKGSVSEHDAAQL 231
K ++++H +
Sbjct: 245 HKQAIAKHAVLEQ 257



Score = 49.1 bits (117), Expect = 2e-08
Identities = 27/153 (17%), Positives = 55/153 (35%), Gaps = 26/153 (16%)

Query: 95 QALTDASAELMRSEADLESAKQKLSNLESLVKQDIIPSNYDEYVSARSAVKSAQADVLQK 154
L ++L + E+++ SAK++ + L K +I ++ ++
Sbjct: 266 NELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI-----------LDKLRQTTDNIGLL 314

Query: 155 RQNLELIRSGEASIGDARLTSTIYAPIDGTVLNQKV-EVGEPIISTQSSQAATEMMSLA- 212
L A + + S I AP+ V KV G + A +M +
Sbjct: 315 TLEL-------AKNEERQQASVIRAPVSVKVQQLKVHTEGGVV------TTAETLMVIVP 361

Query: 213 DMNSLIFKGSVSEHDAAQLSPGMPVMLTVAPYP 245
+ ++L V D ++ G ++ V +P
Sbjct: 362 EDDTLEVTALVQNKDIGFINVGQNAIIKVEAFP 394


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2001    SACTRNSFRASE  35     8e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 34.5 bits (79), Expect = 8e-05
Identities = 16/75 (21%), Positives = 34/75 (45%), Gaps = 4/75 (5%)

Query: 74 ENQLVAFARLITDRTTFAYLADVFVVEAHRGKGISKWLISEIVAHPELQGLRRMMLATRD 133
EN + ++ ++ +A + D+ V + +R KG+ L+ + + + +ML T+D
Sbjct: 73 ENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQD 132

Query: 134 ----AHGLYEQFGFT 144
A Y + F
Sbjct: 133 INISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2009    HTHFIS        83     7e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.0 bits (205), Expect = 7e-21
Identities = 24/110 (21%), Positives = 45/110 (40%)

Query: 10 VYVVDDDESVRDSLAFMLEEHDFNVTTFADGQSFLDEVNIHQAGCVILDSRMPNLRGQQV 69
+ V DDD ++R L L ++V ++ + + V+ D MP+ +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 70 HQFLNEAHSPLAVIYLTGHGDVPMAVDALQAGAVNFFQKPVKGDELAQAI 119
+ +A L V+ ++ A+ A + GA ++ KP EL I
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2010    PF06580       30     0.032    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.8 bits (67), Expect = 0.032
Identities = 45/330 (13%), Positives = 102/330 (30%), Gaps = 71/330 (21%)

Query: 253 PAIIAAGASGWTSPVSLLRIDKLYQALDLHPLQQPWWSEALRWLRSHQEWAWALFMFVIV 312
PA + G + + S+ R+ A AL + + + + +
Sbjct: 82 PACVVIGMVWFVANTSIWRL----LAFINTKPVAFTLPLALSII-------FNVVVVTFM 130

Query: 313 LNAYHFGLEYRFSKSKQALELTSLRLKEKSEQLEHSQRVAIVGEIGSSLAHELNQPLAAI 372
+ +FG + + + ++ + + QL + +I H + L I
Sbjct: 131 WSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALK-----AQINP---HFMFNALNNI 182

Query: 373 RNYSEGGLLRLAKKRPHEDIVPVLEKIQGQVERADAIIQRLRTLIRKRSVDKTPCDIQAL 432
R I +A ++ L L+R S+ + +L
Sbjct: 183 RAL-----------------------ILEDPTKAREMLTSLSELMRY-SLRYSNARQVSL 218

Query: 433 ------IADTIELLHFRMQKQNVAIVTSVEGEIRPLLADSVGVQQVLVNVINNAIDACAL 486
+ ++L + + + + + I + + VQ ++ N I + I
Sbjct: 219 ADELTVVDSYLQLASIQFEDR-LQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLP- 276

Query: 487 FQEKYHSSGYQGKIALHCDYQANQLSIRILDNGTGLQQENPTQAFVSSKAEGLGLGLAIC 546
GKI L +++ + + G+ + E G GL
Sbjct: 277 ---------QGGKILLKGTKDNGTVTLEVENTGSLALKNTK---------ESTGTGLQNV 318

Query: 547 RDVME-MHGGEF-LIASTTPHGCLVELVFP 574
R+ ++ ++G E + S ++ P
Sbjct: 319 RERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2023    NUCEPIMERASE  522    0.0      Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 522 bits (1346), Expect = 0.0
Identities = 198/335 (59%), Positives = 251/335 (74%), Gaps = 2/335 (0%)

Query: 1 MKYLVTGAAGFIGSATIRKLNSLGYEVIGIDNINDYYDVELKYARLNFIKNPLFRFFNMD 60
MKYLVTGAAGFIG ++L G++V+GIDN+NDYYDV LK ARL + P F+F +D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 61 ISNKNKNEIERLFEKEKFDRVIHLAAQAGVRYSLVNPHCYAESNLSGFLNVLEACRKSHI 120
++++ + LF F+RV + VRYSL NPH YA+SNL+GFLN+LE CR + I
Sbjct: 61 LADREG--MTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 121 KHFIYASSSSVYGLNKKVPFSTSDNVDHPVSLYAATKKSNELMAHSYSHLYQLPTTGLRF 180
+H +YASSSSVYGLN+K+PFST D+VDHPVSLYAATKK+NELMAH+YSHLY LP TGLRF
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRF 178

Query: 181 FTVYGSWGRPDMAPFIFTEKIINGQSIDINNNGDMWRDFTHINDIVEGIVRISDVIPRIN 240
FTVYG WGRPDMA F FT+ ++ G+SID+ N G M RDFT+I+DI E I+R+ DVIP +
Sbjct: 179 FTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHAD 238

Query: 241 QRWQFENSTPADSSAPYSIYNIGYGSPICLMDFIKAIENELGIEAKKNYREMQPGDVYQT 300
+W E TPA S APY +YNIG SP+ LMD+I+A+E+ LGIEAKKN +QPGDV +T
Sbjct: 239 TQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLET 298

Query: 301 YADTTAFYQATGYRPSVSVEEGIAEFVAWYRNFYN 335
ADT A Y+ G+ P +V++G+ FV WYR+FY
Sbjct: 299 SADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYK 333


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2027    SYCDCHAPRONE  28     0.044    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.
Length = 168

Score = 28.0 bits (62), Expect = 0.044
Identities = 19/90 (21%), Positives = 26/90 (28%), Gaps = 11/90 (12%)

Query: 196 GNSNKAIQHFKKALSEDPKCVRASISLGRIYLESEDYKQTIK-YLTGVLEQDKDFVSDVL 254
G A + F+ D R + LG Y I Y G + K+
Sbjct: 50 GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKE------ 103

Query: 255 PT----IAECYHHLGQEDELVEFLRACIDK 280
P AEC G+ E L +
Sbjct: 104 PRFPFHAAECLLQKGELAEAESGLFLAQEL 133


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2029    DNABINDINGHU  119    2e-39    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 119 bits (301), Expect = 2e-39
Identities = 33/89 (37%), Positives = 57/89 (64%), Gaps = 1/89 (1%)

Query: 2 TKSELIERLCAEQTHLSAKEVEDAVKDILEHMASTLESGDRIEIRGFGSFSLHYREPRVG 61
K +LI ++ AE T L+ K+ AV + ++S L G+++++ GFG+F + R R G
Sbjct: 3 NKQDLIAKV-AEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKG 61

Query: 62 RNPKTGDKVELEGKYVPHFKPGKELRERV 90
RNP+TG++++++ VP FK GK L++ V
Sbjct: 62 RNPQTGEEIKIKASKVPAFKAGKALKDAV 90


20  VP2116  VP2143  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP2116    4       34       0.021916   hypothetical protein
VP2117    4       36       0.384867   glutaredoxin protein
VP2118    4       29       0.264007   manganese superoxide dismutase Mn-SOD
VP2119    2       24       0.405187   hypothetical protein
VP2120    1       26       0.276825   short chain dehydrogenase
VP2121    1       24       -0.179931  bifunctional acetaldehyde-CoA/alcohol
VP2122    -2      10       -0.357795  hypothetical protein
VP2123    -2      9        -0.869035  potassium channel
VP2124    0       10       -1.147281  aspartate-semialdehyde dehydrogenase
VP2125    -3      10       -1.556011  Na+/H+ antiporter
VP2126    -1      13       -2.539168  hypothetical protein
VP2127    0       14       -3.123655  hypothetical protein
VP2128    -1      14       -3.577089  nucleoid-associated protein NdpA
VP2129    1       23       -5.308455  hypothetical protein
VP2130    3       26       -7.103229  hypothetical protein
VP2131    4       33       -8.174896  *hypothetical protein
VP2132    3       31       -8.467850  hypothetical protein
VP2133    4       30       -8.136792  hypothetical protein
VP2134    3       29       -7.933891  hypothetical protein
VP2135    3       27       -7.407681  phage-like protein
VP2136    4       25       -5.347751  pore-forming cytotoxin integrase
VP2137    6       27       -6.283671  hypothetical protein
VP2138    8       30       -6.437722  hypothetical protein
VP2139    7       29       -6.193914  hypothetical protein
VP2140    7       27       -5.707492  hypothetical protein
VP2141    5       26       -5.078535  hypothetical protein
VP2142    6       25       -5.174617  hypothetical protein
VP2143    3       23       -4.045368  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2137    GPOSANCHOR    29     0.044    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 29.3 bits (65), Expect = 0.044
Identities = 17/104 (16%), Positives = 43/104 (41%), Gaps = 7/104 (6%)

Query: 320 ERRTKQRKEMDQQAKLPKSQRKNNLNNYLLKRQAIQRKELASDIEAGRSELDDIKTQVAI 379
+ +T + ++ +A+ + ++ + + K L ++ A +E D++ Q +
Sbjct: 247 KIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQV 306

Query: 380 STGENEMINELKRQNSRDISAEKKEIVQLRAEKHALEKLVQSLK 423
+ + RD+ A ++ QL AE LE+ + +
Sbjct: 307 LNANRQSL-------RRDLDASREAKKQLEAEHQKLEEQNKISE 343


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2138    GPOSANCHOR    37     9e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 37.0 bits (85), Expect = 9e-05
Identities = 31/188 (16%), Positives = 63/188 (33%), Gaps = 7/188 (3%)

Query: 121 RIREAEAISQHAIKQKEQSEGEALDIEQTLQRATDANDQLEEKIEALQSEVVESQIEKSE 180
A + +K D+E+ L+ A + + KI+ L++E + ++E
Sbjct: 135 FSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAE 194

Query: 181 LLKNISQLTSSLNQTNDRLENQQDTICGLQASLSKVEKMNSALTVQLEHALNDAKKLESQ 240
L K + + + +++ + L A + +EK K LE++
Sbjct: 195 LEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAE 254

Query: 241 LTELSERYESTCIKLTESSSKLQSMTEALDNSEHKLSTVQDEYTDLSVNYRILESKLHES 300
L + E L+ K+ T++ E L LE +
Sbjct: 255 KAALEA-------RQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVL 307

Query: 301 QSNISELK 308
+N L+
Sbjct: 308 NANRQSLR 315



Score = 33.9 bits (77), Expect = 8e-04
Identities = 32/192 (16%), Positives = 77/192 (40%), Gaps = 7/192 (3%)

Query: 134 KQKEQSEGEALDIEQTLQRATDANDQLEEKIEALQSEVVESQIEKSELLKNISQLTSSLN 193
+K D+E+ L+ A + + KI+ L++E + ++EL K + +
Sbjct: 218 AEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFST 277

Query: 194 QTNDRLENQQDTICGLQASLSKVEKMNSALTV-------QLEHALNDAKKLESQLTELSE 246
+ +++ + L+A + +E + L L+ + K+LE++ +L E
Sbjct: 278 ADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEE 337

Query: 247 RYESTCIKLTESSSKLQSMTEALDNSEHKLSTVQDEYTDLSVNYRILESKLHESQSNISE 306
+ + + L + EA E + ++++ + + L L S+ +
Sbjct: 338 QNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQ 397

Query: 307 LKLANHELEKQL 318
++ A E +L
Sbjct: 398 VEKALEEANSKL 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2142    CHANLCOLICIN  38     2e-04    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 37.7 bits (87), Expect = 2e-04
Identities = 32/149 (21%), Positives = 68/149 (45%), Gaps = 25/149 (16%)

Query: 320 AKAEEKKIQQKLQEQKAQIAAKEQEITQIIENVALKRREAQ---DEIKRIAAETEDEVRQ 376
A A +Q + +++ ++A E++ + E +EA+ EI+R AETE +++
Sbjct: 116 AHANNAAMQAE--DERLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLKL 173

Query: 377 TLKLKQQELSENIAALEDEQNSLE--KQIIDKVQHLELVNEIEKLKEECNYYEKHSKRLE 434
+++ AAL +E ++E ++ + Q +E+ K+ E + RL
Sbjct: 174 AEAEEKRL-----AALSEEAKAVEIAQKKLSAAQ-----SEVVKMDGEIK---TLNSRLS 220

Query: 435 STVKGFEATLKNNDTEELAKKIGEMEAIN 463
S++ +A + + LA K E+ +
Sbjct: 221 SSIHARDA-----EMKTLAGKRNELAQAS 244


21  VP2235  VP2246  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP2235    2       22       2.440738   flagellar biosynthesis protein FlhA
VP2236    2       23       1.996022   flagellar biosynthesis protein FlhB
VP2237    2       21       1.745560   flagellar biosynthesis protein FliR
VP2238    1       19       1.882146   flagellar biosynthesis protein FliQ
VP2239    1       18       2.493785   flagellar biosynthesis protein FliP
VP2240    2       16       2.440920   polar flagellar assembly protein FliO
VP2241    1       17       3.275172   flagellar motor switch protein
VP2242    1       17       3.159414   flagellar motor switch protein FliM
VP2243    1       17       3.267662   flagellar basal body protein FliL
VP2244    0       16       3.362831   polar flagellar hook-length control protein
VP2245    0       17       3.241562   flagellar biosynthesis chaperone
VP2246    -1      16       3.336050   flagellum-specific ATP synthase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2236    TYPE3IMSPROT  367    e-129    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.
Length = 354

Score = 367 bits (945), Expect = e-129
Identities = 114/354 (32%), Positives = 190/354 (53%), Gaps = 14/354 (3%)

Query: 8 ERTEEATPRRLQQAREKGQVARSKELASASVLIVGAIALMWFGESLARSLFSIMSRLFDL 67
E+TE+ TP++++ AR+KGQVA+SKE+ S ++++ + LM L+ F S+L +
Sbjct: 4 EKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLM----GLSDYYFEHFSKLMLI 59

Query: 68 KRDEIFDTTKLFDIALGAMTDLLFPLFLI--LITLFVAATIGAAG---VGGISFSAEAAM 122
++ + F AL + D + F L VAA + A G S EA
Sbjct: 60 PAEQSYLP---FSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIK 116

Query: 123 PKLSKMNPLSGLKRMVGMQSWVELIKSILKVVLVTGVAMYLIQASQADLIQLSMDVYPQN 182
P + K+NP+ G KR+ ++S VE +KSILKVVL++ + +I+ + L+QL +
Sbjct: 117 PDIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQL-PTCGIEC 175

Query: 183 IFHAL-DILLNFILLISCSLLIVVAIDIPFQIWQHADQLKMTKQEVKDEYKETEGKPEVK 241
I L IL +++ + +++ D F+ +Q+ +LKM+K E+K EYKE EG PE+K
Sbjct: 176 ITPLLGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIK 235

Query: 242 GRIRMLQREAAQRRMMADVPQADVIVTNPEHYSVALRYKQKTDRAPVVIAKGTDHMAMKI 301
+ R +E R M +V ++ V+V NP H ++ + YK+ P+V K TD +
Sbjct: 236 SKRRQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTV 295

Query: 302 REVAREHDITIVPAPPLARALYHTTELEQEIPDGLFTAVAQVLAFVFQLKQYRK 355
R++A E + I+ PLARALY ++ IP A A+VL ++ + ++
Sbjct: 296 RKIAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQ 349


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2237  TYPE3IMRPROT  121  1e-35  Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.
Length = 261

Score = 121 bits (306), Expect = 1e-35
Identities = 81/221 (36%), Positives = 129/221 (58%), Gaps = 2/221 (0%)

Query: 9 LDWIANYFWPYVRISSMLMVMTVTGARFVSPRIRLYLGLAITFAVMPAIPAVPQDIQLLS 68
L W+ YFWP +R+ +++ + R V R++L L + ITFA+ P++PA D+ + S
Sbjct: 10 LSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA--NDVPVFS 67

Query: 69 FRGFMTIAEQMIIGVAMGMVTQFMIQTFVLLGQILGMQSSLGFASMVDPANGQNTPLLGQ 128
F +Q++IG+A+G QF G+I+G+Q L FA+ VDPA+ N P+L +
Sbjct: 68 FFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLAR 127

Query: 129 LFMFLTTMFFLATDGHLKMLQLVVFSFKTLPIGSGTLTAVDFRDMAGWLGIMFKTALSMS 188
+ L + FL +GHL ++ L+V +F TLPIG L + F + ++F L ++
Sbjct: 128 IMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLA 187

Query: 189 LSGIIALLTINLSFGVMTRAAPQLNIFSLGFAFALMVGLLI 229
L I LLT+NL+ G++ R APQL+IF +GF L VG+ +
Sbjct: 188 LPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISL 228


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2238  TYPE3IMQPROT  55  8e-14  Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.
Length = 86

Score = 54.8 bits (132), Expect = 8e-14
Identities = 26/70 (37%), Positives = 40/70 (57%)

Query: 7 VELFREALWMVLIMVCAIIIPSLLIGLVVAIFQAATSINEQTLSFLPRLIVTLLALMLFG 66
V +AL++VLI+ I + +IGL+V +FQ T + EQTL F +L+ L L L
Sbjct: 5 VFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLS 64

Query: 67 HWMTQMLMEY 76
W ++L+ Y
Sbjct: 65 GWYGEVLLSY 74


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2239  FLGBIOSNFLIP  285  2e-99  Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.
Length = 245

Score = 285 bits (732), Expect = 2e-99
Identities = 116/229 (50%), Positives = 166/229 (72%), Gaps = 1/229 (0%)

Query: 60 MSVGNGAGIPAFTMTTNPDGSEDYSVTLQILALMTMLGFLPAMVILMTSFTRIVVVMSIL 119
++ A +P T P G + +S+ +Q L +T L F+PA++++MTSFTRI++V +L
Sbjct: 15 ITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMMTSFTRIIIVFGLL 74

Query: 120 RQAMGLQQTPSNQVIIGIALFLTFFVMSPVLNEINDTAIQPYLNEQVTAREAFDAAQVPM 179
R A+G P NQV++G+ALFLTFF+MSPV+++I A QP+ E+++ +EA + P+
Sbjct: 75 RNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKISMQEALEKGAQPL 134

Query: 180 KAFMLKQTRIKDLETFVNMSGE-QVTNPEDVSMAVLIPAFITSELKTAFQIGFMLFLPFL 238
+ FML+QTR DL F ++ + PE V M +L+PA++TSELKTAFQIGF +F+PFL
Sbjct: 135 REFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKTAFQIGFTIFIPFL 194

Query: 239 IIDLVVASVLMAMGMMMLSPMIVSLPFKLMLFVLVDGWNLILSTLAGSF 287
IIDLV+ASVLMA+GMMM+ P ++LPFKLMLFVLVDGW L++ +LA SF
Sbjct: 195 IIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2241  FLGMOTORFLIN  113  1e-35  Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 113 bits (285), Expect = 1e-35
Identities = 63/135 (46%), Positives = 91/135 (67%), Gaps = 8/135 (5%)

Query: 3 PSDDQK--LADEWAAALGEDPSAPSIDVDEVLAAPLEELKDTSRPITDDERRKLDTIMDI 60
PSD+ L D WA AL E + + + + L D S + D +D IMDI
Sbjct: 7 PSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGG-GDVSGAMQD-----IDLIMDI 60

Query: 61 PVTISMEVGRSQISIRNLLQLNQGSVVELDRLAGESLDVLVNGTLIAHGEVVVVNDKFGI 120
PV +++E+GR++++I+ LL+L QGSVV LD LAGE LD+L+NG LIA GEVVVV DK+G+
Sbjct: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120

Query: 121 RLTDVISQTERIKKL 135
R+TD+I+ +ER+++L
Sbjct: 121 RITDIITPSERMRRL 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2242  FLGMOTORFLIM  241  8e-80  Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 241 bits (617), Expect = 8e-80
Identities = 86/327 (26%), Positives = 163/327 (49%), Gaps = 9/327 (2%)

Query: 1 MTDLLSQDEIDALLHGVD--DVDDIDEPLDNDTEGAVSFDFSSQDRIVRGRMPTLELINE 58
MT++LSQDEID LL + D D +DT +DF D+ + +M TL L++E
Sbjct: 1 MTEVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHE 60

Query: 59 RFARHMRISLFNMLRKTAEVSINGVQMMKFGEYQNTLYVPTSLNMVRFRPLKGTALITME 118
FAR SL LR V + V + + E+ ++ P++L ++ PLKG A++ ++
Sbjct: 61 TFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVD 120

Query: 119 ARLVFILVENFFGGDGRFHAKIEGREFTPTERRIIQLLLKIVFEDYKEAWSPVMGVEFEY 178
+ F +++ FGG G+ R+ T E +++ ++ + + +E+W+ V+ +
Sbjct: 121 PSITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRL 178

Query: 179 LDSEVNPSMANIVSPTEVIVVSSFHIEVDGGGGDFHVVMPYSMVEPIRELLDAG--VQSD 236
E NP A IV P+E++V+ + +V G + +PY +EPI L + S
Sbjct: 179 GQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSV 238

Query: 237 KMETDVRWSSALREEIMDCPVNFRVNLLEKDISLRDLMELQPGDIIPIE---MPEHATMF 293
+ + ++ LR+++ ++ + +S+RD++ L+ GDII + + + +
Sbjct: 239 RRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLS 298

Query: 294 IEDLPTYRVKMGRSEDKLAVQVSQEIE 320
I + + + G K+A Q+ + IE
Sbjct: 299 IGNRKKFLCQPGVVGKKIAAQILERIE 325


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2244  FLGHOOKFLIK  48  9e-08  Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 47.5 bits (112), Expect = 9e-08
Identities = 28/120 (23%), Positives = 63/120 (52%)

Query: 509 EQVAEKVQMMMSKNLKNLDIRLDPPELGRMQIRMTMNNDLANVHFTVTNPQARDIIEQTL 568
+ +++ + + + ++ ++RL P +LG +QI + ++++ A + + R +E L
Sbjct: 242 QSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAAL 301

Query: 569 PRLREMLAQQGMQLADSSVQQQSSGQQQSGYAAAEQNGQGTSGRGFSGQSDENFDADVNL 628
P LR LA+ G+QL S++ +S QQ + +Q+ + + +G+ D+ V+L
Sbjct: 302 PVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVSL 361


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2245  FLGFLIJ  38  3e-06  Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 38.3 bits (88), Expect = 3e-06
Identities = 29/142 (20%), Positives = 76/142 (53%)

Query: 2 NNAMEFLLEQTKEREDQAVLALNKARSELEDYYRQVEQIEKYRLDYCQQLVDRGQAGLTA 61
+ A+ L + ++ + A L + R + Q++ + Y+ +Y L AG+T+
Sbjct: 4 HGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAGITS 63

Query: 62 SQYGHLNRFLCQLDETLSKQKQAEHHFKEQVENCKDYWLKMRQERMSYEWMIEKKAKEKQ 121
+++ + +F+ L++ +++ +Q + + ++V+ + W + +Q +++ + E+++
Sbjct: 64 NRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQSTAAL 123

Query: 122 IAEAKREQKQMDEFSTLLFSRK 143
+AE + +QK+MDEF+ RK
Sbjct: 124 LAENRLDQKKMDEFAQRAAMRK 145


22  VP2342  VP2347  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
VP2342  2  22  -0.582547  DNA polymerase IV
VP2343  4  33  -0.604945  hypothetical protein
VP2344  4  37  -0.487787  hypothetical protein
VP2345  3  39  -0.274210  thiamin biosynthesis lipoprotein ApbE
VP2346  5  42  0.183190  Na(+)-translocating NADH-quinone reductase
VP2347  5  39  -0.111688  Na(+)-translocating NADH-quinone reductase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2342  DPTHRIATOXIN  30  0.015  Diphtheria toxin signature.
		>DPTHRIATOXIN#Diphtheria toxin signature.

Length = 567

Score = 30.1 bits (67), Expect = 0.015
Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 10/72 (13%)

Query: 249 VERTFSQNISTYDECWQVIEEKLYPELEKRLERA--------SPDKSIIKQGIKVKFADF 300
V R+ ++S + W VI +K ++E E SP+K++ ++ K +F
Sbjct: 223 VRRSVGSSLSCINLDWDVIRDKTKTKIESLKEHGPIKNKMSESPNKTVSEEKAKQYLEEF 282

Query: 301 QLTTIEHIHPQL 312
T +E HP+L
Sbjct: 283 HQTALE--HPEL 292


23  VP2389  VP2400  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
VP2389  2  21  0.735557  membrane transport protein
VP2390  2  22  1.047623  hypothetical protein
VP2391  0  18  0.938559  hypothetical protein
VP2392  -1  17  1.001396  hypothetical protein
VP2393  -1  16  1.826056  LacI family transcription regulator
VP2394  0  16  2.168588  sodium:galactoside symporter family protein
VP2395  0  17  3.245524  hypothetical protein
VP2396  1  20  3.556368  LacI-family regulatory protein
VP2397  0  20  3.042009  aldose 1-epimerase
VP2398  2  25  3.673335  galactokinase
VP2399  3  26  3.244809  galactose-1-phosphate uridylyltransferase
VP2400  2  27  2.905063  UDP-glucose 4-epimerase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2400  NUCEPIMERASE  182  5e-57  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 182 bits (463), Expect = 5e-57
Identities = 82/345 (23%), Positives = 142/345 (41%), Gaps = 35/345 (10%)

Query: 1 MKVLVTGGMGYIGSHTCVQMIEAGIEPIIVDNLCNAKLEVLN--RIEALTGKQPAFHQGD 58
MK LVTG G+IG H +++EAG + + +DNL + L R+E L FH+ D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 59 IRDEAFLDTVFAQHDIQAVIHFAGLKAVGESVAKPLEYYDNNVNGSLVLARSMRKSGVKS 118
+ D + +FA + V AV S+ P Y D+N+ G L + R + ++
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 119 IVFSSSATVYGDPEIVPITEDSPTGATTNPYGRSKYMVEQCFSDLFHAENDWSITLLRYF 178
++++SS++VYG +P + D + Y +K E + + T LR+F
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANE-LMAHTYSHLYGLPATGLRFF 179

Query: 179 NPVGAHSSGSMGEDPQGIPNNLMPFIAQVAVGRREKLAVFGSDYPTPDGTGVRDYIHVMD 238
G P G P ++ F A+ + + V+ G RD+ ++ D
Sbjct: 180 TVYG----------PWGRP-DMALFKFTKAMLEGKSIDVYN------YGKMKRDFTYIDD 222

Query: 239 LADGHIAALKSVGKTSG---------------LHIYNLGTGKGSSVLEMVDAFAAACGKP 283
+A+ I + +YN+G +++ + A A G
Sbjct: 223 IAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIE 282

Query: 284 VPYELCPRRPGDIAECWASTEKAERELGWKATRTVAEMTADTWNW 328
+ P +PGD+ E A T+ +G+ TV + + NW
Sbjct: 283 AKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNW 327


24  VP2450  VP2456  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
VP2450  4  30  -0.305156  MarR family transcriptional regulator
VP2451  5  35  -0.057391  lipoprotein NlpI
VP2452  8  42  -0.141614  polynucleotide phosphorylase
VP2453  7  37  -0.524410  30S ribosomal protein S15
VP2454  6  34  -0.611522  tRNA pseudouridine synthase B
VP2455  5  35  -0.959334  ribosome-binding factor A
VP2456  4  34  -0.924204  translation initiation factor IF-2
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2456  TCRTETOQM  78  1e-16  Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.
Length = 639

Score = 77.6 bits (191), Expect = 1e-16
Identities = 71/313 (22%), Positives = 112/313 (35%), Gaps = 77/313 (24%)

Query: 411 IMGHVDHGKTSTLDYIRRTHVASGEAG------------------GITQHIGAYHVETEN 452
++ HVD GKT+ + + A E G GIT G + EN
Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67

Query: 453 GMITFLDTPGHAAFTAMRARGAQATDIVVLVVAADDGVMPQTVEAIQHAKAAGVPLIVAV 512
+ +DTPGH F A R D +L+++A DGV QT + G+P I +
Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127

Query: 513 NKIDKEDANPD----NVKNELAQYDVI-----------------PEEWG----------- 540
NKID+ + ++K +L+ VI E+W
Sbjct: 128 NKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGNDDLLE 187

Query: 541 ---------------GENMFV---------HISAKQGTNIDGLLEAILLQSEVLELTAVK 576
E++ H SAK ID L+E I ++ T
Sbjct: 188 KYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVIT--NKFYSSTHRG 245

Query: 577 EGMASGVVVESRLDKGRGPVATVLVQSGTLHKGDIVL-CGQEYGRVRAMRDELGQEITEA 635
+ G V + + R +A + + SG LH D V +E ++ M + E+ +
Sbjct: 246 QSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKIKITEMYTSINGELCKI 305

Query: 636 GPSIPVEILGLSG 648
+ EI+ L
Sbjct: 306 DKAYSGEIVILQN 318


25  VP2514  VP2519  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
VP2514  5  41  0.662320  carbonic anhydrase
VP2515  5  39  0.859964  hypoxanthine-guanine phosphoribosyltransferase
VP2516  5  37  0.937260  OpaR protein
VP2517  4  36  1.111768  dihydrolipoamide dehydrogenase
VP2518  4  32  1.181812  dihydrolipoamide acetyltransferase
VP2519  2  19  0.033064  pyruvate dehydrogenase subunit E1
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2516  HTHTETR  65  4e-15  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 64.6 bits (157), Expect = 4e-15
Identities = 28/208 (13%), Positives = 71/208 (34%), Gaps = 20/208 (9%)

Query: 4 IAKRPRTRLSPLKRKQQLMEIALEVFARRGIGRGGHADIAEIAQVSVATVFNYFPTREDL 63
+A++ + +Q ++++AL +F+++G+ +IA+ A V+ ++ +F + DL
Sbjct: 1 MARKTKQEAQE--TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDL 58

Query: 64 VDEVLNHVVRQFSNFLSDNIDLDIHARENIANITNAMIELVSQDCH------WLKVWFEW 117
E+ + + ++ + +I ++ +++ F
Sbjct: 59 FSEIWELSESNIGELELEYQAKF--PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHK 116

Query: 118 SASTRDEVWPLFVSTNRTNQLLVQNMFI----KAIERGEVCDQHDSEHLANLFHGICYSL 173
+ + R L + IE + + A + G L
Sbjct: 117 CEFVGE--MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL 174

Query: 174 FVQANRFKGEAELKE----LVSAYLDML 197
+LK+ V+ L+M
Sbjct: 175 MENWLFAPQSFDLKKEARDYVAILLEMY 202


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2518  RTXTOXIND  38  8e-05  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 38.3 bits (89), Expect = 8e-05
Identities = 41/281 (14%), Positives = 90/281 (32%), Gaps = 35/281 (12%)

Query: 26 DKVEEEQSLITVEGDKASMEVPASQAGIVKEIKVAEGDKVSTGSLIMIFE---AEGAADA 82
+ V +T G S E+ + IVKEI V EG+ V G +++ AE
Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138

Query: 83 APAPAAEAAPAAAPAPAAAAELKEVHVPDIG------------GDEVEVTEIMVAIGDSI 130
+ +A + ++ +P++ + + +T ++ +
Sbjct: 139 TQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTW 198

Query: 131 EEEQSLITVEGDKASMEVPAPFAGTLKEIKVAAGDKVSTGSLIMVFETAGSGAPAAPAAV 190
+ ++ + DK E A + ++ +K + + A A
Sbjct: 199 QNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLH-KQAIAKHAVLEQ 257

Query: 191 EAPAAAAPAASAAKEVNVPDIGGDEVEV----------------TEIMVAVGDTVEEEQS 234
E A + + I + + ++ +
Sbjct: 258 ENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLE 317

Query: 235 LITVEGDKASMEVPAPFAGTVKEIKIAA-GDKVSTGSLIMV 274
L E + + + AP + V+++K+ G V+T +MV
Sbjct: 318 LAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358


26  VP2689  VP2694  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
VP2689  2  18  -2.124633  rod shape-determining protein MreD
VP2690  2  18  -2.080295  rod shape-determining protein MreC
VP2691  2  19  -2.127958  rod shape-determining protein MreB
VP2692  2  22  -2.906932  hypothetical protein
VP2693  4  22  -2.109715  MshP protein
VP2694  2  21  -2.038837  type IV prepilin, MshO
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2691  SHAPEPROTEIN  567  0.0  Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.
Length = 347

Score = 567 bits (1463), Expect = 0.0
Identities = 318/347 (91%), Positives = 334/347 (96%)

Query: 1 MFKKLRGMFSNDLSIDLGTANTLIYVKGQGIVLDEPSVVAIRQDRVGSAKSVAAVGHAAK 60
M KK RGMFSNDLSIDLGTANTLIYVKGQGIVL+EPSVVAIRQDR GS KSVAAVGH AK
Sbjct: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAK 60

Query: 61 QMLGRTPGNISAIRPMKDGVIADFYVTEKMLQHFIKQVHDNSILKPSPRVLVCVPCGSTQ 120
QMLGRTPGNI+AIRPMKDGVIADF+VTEKMLQHFIKQVH NS ++PSPRVLVCVP G+TQ
Sbjct: 61 QMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQ 120

Query: 121 VERRAIRESALGAGAREVYLIDEPMAAAIGAGLRVSEPTGSMVVDIGGGTTEVAVISLNG 180
VERRAIRESA GAGAREV+LI+EPMAAAIGAGL VSE TGSMVVDIGGGTTEVAVISLNG
Sbjct: 121 VERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNG 180

Query: 181 VVYSSSVRIGGDRFDEAVINYVRRNYGSLIGEATAEKIKHEIGSAYPGDEVQEIEVRGRN 240
VVYSSSVRIGGDRFDEA+INYVRRNYGSLIGEATAE+IKHEIGSAYPGDEV+EIEVRGRN
Sbjct: 181 VVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRN 240

Query: 241 LAEGVPRSFSLNSNEILEALQEPLSGIVSAVMVALEQCPPELASDISENGMVLTGGGALL 300
LAEGVPR F+LNSNEILEALQEPL+GIVSAVMVALEQCPPELASDISE GMVLTGGGALL
Sbjct: 241 LAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALL 300

Query: 301 KDLDRLLMEETGIPVVIAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347
++LDRLLMEETGIPVV+AEDPLTCVARGGGKALEMIDMHGGDLFSEE
Sbjct: 301 RNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2694  BCTERIALGSPG  32  8e-04  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.
Length = 145

Score = 32.2 bits (73), Expect = 8e-04
Identities = 17/50 (34%), Positives = 27/50 (54%), Gaps = 6/50 (12%)

Query: 2 KTRGFTLMEMIVTIVIGSFIMLGI-AGYVQLGMKGYADTIDRQRMQTQAQ 50
K RGFTL+E++V IVI +G+ A V + G + D+Q+ +
Sbjct: 6 KQRGFTLLEIMVVIVI-----IGVLASLVVPNLMGNKEKADKQKAVSDIV 50


27  VP2854  VP2875  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
VP2854  1  20  3.188662  hypothetical protein
VP2853  1  19  3.446541  hypothetical protein
VP2855  1  21  3.178444  6-phosphofructokinase
VP2856  3  25  3.429415  ferrous iron efflux protein F
VP2857  3  23  3.348812  periplasmic repressor CpxP
VP2858  -2  20  2.351943  transcriptional regulator CpxR
VP2859  0  22  2.626945  two-component sensor protein
VP2860  0  22  2.547785  superoxide dismutase, Mn
VP2861  0  20  2.367403  rRNA methylase
VP2862  -1  16  2.740635  FxsA protein
VP2863  -2  17  2.600204  aspartate ammonia-lyase
VP2864  0  18  3.617305  anaerobic C4-dicarboxylate transporter
VP2865  -1  19  3.326892  thiol:disulfide interchange protein
VP2866  -1  19  3.136187  LuxR family transcriptional regulator
VP2867  -1  18  2.900016  potassium/proton antiporter
VP2868  -2  21  3.232969  hypothetical protein
VP2869  0  23  4.320945  sodium/solute symporter
VP2870  3  25  4.463008  hypothetical protein
VP2871  2  25  4.064696  hypothetical protein
VP2872  2  25  3.957328  hypothetical protein
VP2873  1  23  4.157238  fumarate hydratase
VP2874  1  21  3.887900  sensor histidine kinase
VP2875  -1  19  3.173282  3-phenylpropionic acid transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2858  HTHFIS  96  3e-25  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 96.1 bits (239), Expect = 3e-25
Identities = 39/125 (31%), Positives = 63/125 (50%), Gaps = 2/125 (1%)

Query: 2 ANILLIDDDIELTSLLKEVLSFEGFEVSEANDGEAGLAAINSD-IDLILLDVMMPKLNGM 60
A IL+ DDD + ++L + LS G++V ++ I + DL++ DV+MP N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 61 ETLKRLRENWE-TPVLMLTAKGEEIDRVIGLELGADDYLPKPFSDRELLARIRAILRRTS 119
+ L R+++ PVL+++A+ + + E GA DYLPKPF EL+ I L
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 120 NTQKN 124

Sbjct: 124 RRPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2859  PF06580  42  3e-06  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 42.2 bits (99), Expect = 3e-06
Identities = 29/189 (15%), Positives = 63/189 (33%), Gaps = 40/189 (21%)

Query: 287 IDTEAQRLEQMISELLELSRMQVNSHMTRETQPIESLWEEI-IKDAQFEAEQMGKTLRYS 345
I + + +M++ L EL R + SL +E+ + D+ + + ++
Sbjct: 186 ILEDPTKAREMLTSLSELMRYS----LRYSNARQVSLADELTVVDSYLQLASI----QFE 237

Query: 346 EIPERHISGNPKLLMSAVEN-----IIRNAIYYG------KDEVQIDIQDKTDTLTITVD 394
+ + NP ++ V ++ N I +G ++ + T+T+ V+
Sbjct: 238 DRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVE 297

Query: 395 DNGDGVPDEELEAIFRPFYRVSTARDRHSGGTGLGLTITESAIRQHSGTI--VASRSPLG 452
+ G E TG GL ++ GT + G
Sbjct: 298 NTGSLALKNTKE------------------STGTGLQNVRERLQMLYGTEAQIKLSEKQG 339

Query: 453 GLRVSITLP 461
+ + +P
Sbjct: 340 KVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2866  HTHFIS  81  3e-20  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.4 bits (201), Expect = 3e-20
Identities = 41/169 (24%), Positives = 76/169 (44%), Gaps = 16/169 (9%)

Query: 1 MDSTYTIIIADDHPLFRNALFQSVHMAVSGANLLEADSLDALLALLAKEEEPDLVLLDLK 60
M TI++ADD R L + ++ +G ++ + L +A + DLV+ D+
Sbjct: 1 MTGA-TILVADDDAAIRTVL--NQALSRAGYDVRITSNAATLWRWIA-AGDGDLVVTDVV 56

Query: 61 MPGANGMSGLIHLRAEYPDLPIVVVSA-SEEPTVVSQVKSHGAFGFIPKSSDMRELVNAL 119
MP N L ++ PDLP++V+SA + T + + GA+ ++PK D+ EL+ +
Sbjct: 57 MPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEK-GAYDYLPKPFDLTELIGII 115

Query: 120 NQVL----------NGDPYFPEGLITNNAACNDLAEKIAALTPQQYKVL 158
+ L D L+ +AA ++ +A L ++
Sbjct: 116 GRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLM 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2874  HTHFIS  69  5e-14  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.1 bits (169), Expect = 5e-14
Identities = 20/116 (17%), Positives = 50/116 (43%), Gaps = 4/116 (3%)

Query: 1027 RILCVDNEPDILVGMENLLSRWGCDIRVATDLIESLQALEGDWVPDVIFSDYRLDNGRTG 1086
IL D++ I + LSR G D+R+ ++ + + D++ +D +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMP-DENA 62

Query: 1087 LEVLQQCRLRLGNHFEGVIISADRT-DDMMEGIKANGFSFIAKPVKPLKLRSVLNR 1141
++L + + + +++SA T ++ + + ++ KP +L ++ R
Sbjct: 63 FDLLPRIK-KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


28  VP2954  VP2991  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
VP2954  3  18  0.727916  cell division protein FtsX
VP2955  1  16  0.650466  cell division ATP-binding protein FtsE
VP2956  2  15  0.630724  cell division protein FtsY
VP2957  5  23  -0.699458  hypothetical protein
VP2958  5  24  -1.944456  hypothetical protein
VP2959  2  22  -1.538266  hypothetical protein
VP2960  2  19  -1.646496  hypothetical protein
VP2961  3  24  -0.784550  hypothetical protein
VP2962  3  25  -1.309366  hypothetical protein
VP2963  2  27  0.723574  hypothetical protein
VP2964  2  24  1.673195  hypothetical protein
VP2965  3  26  2.391325  hypothetical protein
VP2966  2  26  2.235764  inner membrane protein
VP2967  0  24  2.284285  hypothetical protein
VP2968  1  23  2.408199  transporter
VP2969  0  22  2.080508  phosphatase
VP2970  1  20  1.754228  glyceraldehyde-3-phosphate dehydrogenase
VP2971  -1  21  1.234767  ArsR family transcriptional regulator
VP2972  0  19  2.085967  hypothetical protein
VP2973  2  18  2.265427  homoserine/homoserine lactone efflux protein
VP2974  -1  18  2.106687  lysophospholipase L2
VP2975  0  19  2.002542  hypothetical protein
VP2976  0  21  2.621497  hypothetical protein
VP2977  0  20  2.617630  hypothetical protein
VP2978  0  22  2.531360  hypothetical protein
VP2979  0  21  2.611165  GGDEF family protein
VP2980  1  19  3.030644  hypothetical protein
VP2981  0  19  3.101800  site-specific tyrosine recombinase XerC
VP2982  -1  19  2.040199  hypothetical protein
VP2983  -1  20  2.710027  diaminopimelate epimerase
VP2984  -2  16  2.521360  diaminopimelate decarboxylase
VP2985  -2  16  1.639116  lipoprotein L
VP2986  -2  14  1.036490  frataxin-like protein
VP2987  -2  14  0.969039  adenylate cyclase
VP2988  1  18  0.458995  porphobilinogen deaminase
VP2989  3  18  -1.032748  uroporphyrinogen-III synthase
VP2990  3  18  -1.919264  uroporphyrin-III C-methyltransferase
VP2991  3  20  -1.948713  HemY protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2956  IGASERPTASE  40  2e-05  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 39.7 bits (92), Expect = 2e-05
Identities = 30/130 (23%), Positives = 49/130 (37%), Gaps = 29/130 (22%)

Query: 18 EEQSPKPKTEETVTEEAVEAPSE--------------EQQTEEVAEQAQQTQELEQEPEK 63
E + KT E ++A E ++ QT EVA+ +T+E Q E
Sbjct: 1042 ENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKE-TQTTET 1100

Query: 64 AEAAPAEAEAEAEAEAE--AEEPQV------------PVAPRIQEQEKPTESFFARLKRS 109
E A E E +A+ E E E P+V V P+ + + + + +S
Sbjct: 1101 KETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQS 1160

Query: 110 LSRTKANIGA 119
+ T A+
Sbjct: 1161 QTNTTADTEQ 1170



Score = 34.3 bits (78), Expect = 0.001
Identities = 20/82 (24%), Positives = 30/82 (36%), Gaps = 7/82 (8%)

Query: 17 DEEQSPKPKTEETVTEEAVEAP----SEEQQTEEVAEQAQQTQELEQEPEKAEAAPAEAE 72
D P E +EA P + + TE VAE ++Q + EK E E
Sbjct: 1006 DVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQE---SKTVEKNEQDATETT 1062

Query: 73 AEAEAEAEAEEPQVPVAPRIQE 94
A+ A+ + V + E
Sbjct: 1063 AQNREVAKEAKSNVKANTQTNE 1084



Score = 30.8 bits (69), Expect = 0.011
Identities = 16/85 (18%), Positives = 29/85 (34%), Gaps = 1/85 (1%)

Query: 17 DEEQSPKPKTEETVTEEAVEAPSEEQQTEEVAEQAQQTQELEQEPEKAEAAP-AEAEAEA 75
E+ K K E T+E + S+ +E +E Q E +E + +++
Sbjct: 1105 TVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNT 1164

Query: 76 EAEAEAEEPQVPVAPRIQEQEKPTE 100
A+ E + E T
Sbjct: 1165 TADTEQPAKETSSNVEQPVTESTTV 1189


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2958  RTXTOXIND  26  0.023  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 25.9 bits (57), Expect = 0.023
Identities = 10/33 (30%), Positives = 14/33 (42%), Gaps = 1/33 (3%)

Query: 3 ALLILAKAAIAFVWLVLI-LNIVMPFPGNAAVA 34
A I+ IAF+ VL + IV G +
Sbjct: 61 AYFIMGFLVIAFILSVLGQVEIVATANGKLTHS 93


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2986  MALTOSEBP  28  0.005  Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 28.2 bits (62), Expect = 0.005
Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 13/76 (17%)

Query: 39 LEFEDRSQIIINRQEPMHEIWLASKSGGFHFKLVEDKWTCSKTGME----------LFEM 88
L+ + +S ++ N QEP L + GG+ FK K+ G++ L ++
Sbjct: 165 LKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDL 224

Query: 89 VKQECEKHAGEEIDWA 104
+K KH + D++
Sbjct: 225 IKN---KHMNADTDYS 237


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2990  RTXTOXIND  30  0.020  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 29.8 bits (67), Expect = 0.020
Identities = 8/67 (11%), Positives = 29/67 (43%), Gaps = 2/67 (2%)

Query: 90 QQADYQAQIAQLQNQLEKAQASMKQELNQVKEETIEKATTVTHKAEVVLGQQQKSIESLQ 149
+ Y++Q+ Q+++++ A+ + K E ++K T ++ + + +
Sbjct: 267 ELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLL--TLELAKNEER 324

Query: 150 LAVADVK 156
+ ++
Sbjct: 325 QQASVIR 331


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2991  MICOLLPTASE  31  0.012  Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 30.8 bits (69), Expect = 0.012
Identities = 14/99 (14%), Positives = 30/99 (30%), Gaps = 8/99 (8%)

Query: 49 IATLAGLFLLEYLIKKLVYASSST--WNYFSVRKM------RRSRRYTNEGIIKLLEGDW 100
I ++ + L ++K + ++ Y R + N+ + +L + W
Sbjct: 672 IKEVSNIKDLSSNVEKSQFFTTYDMRGTYVGGRSQGEENDWKDMNSKLNDILKELSKKSW 731

Query: 101 KGAEKKVTRWANHHDMPLLCYLVASEAAQGQGDKAKRDH 139
G + + NH Y+ D H
Sbjct: 732 NGYKTVTAYFVNHKVDGNGNYVYDVVFHGMNTDTNTDVH 770


29  VP3006  VP3011  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
VP3006  1  22  3.095597  hypothetical protein
VP3007  1  22  3.317095  ATP-dependent DNA helicase RecQ
VP3008  1  18  3.226269  RarD protein
VP3009  0  19  3.129264  AraC family transcriptional regulator
VP3010  0  18  3.372959  hypothetical protein
VP3011  -1  18  3.388121  gamma-glutamyltranspeptidase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP3008  SECYTRNLCASE  31  0.004  Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 31.3 bits (71), Expect = 0.004
Identities = 16/120 (13%), Positives = 42/120 (35%), Gaps = 3/120 (2%)

Query: 147 LIVFGSVPIVAMALAMSFGFYGLLRKKVAVDAQTGLFVETLILLPAAAVYLLFIASSPTA 206
L + +VA A + + ++ D + +I + A ++++ T
Sbjct: 124 LAILQGTGLVATARSAPLFGRCSVGGQIVPDQSIFTTITMVICMTAGTCVVMWLGELITD 183

Query: 207 NMIENPWQLNTLLIAAGVVTTLPLLCFTGAATRLKLSTLGFFQYIGPSLMFLLAVLIYGE 266
I N ++L+ + T P + F + + ++A++++ E
Sbjct: 184 RGIGNG---MSILMFISIAATFPSALWAIKKQGTLAGGWIEFGTVIAVGLIMVALVVFVE 240


30  VP3040  VP3046  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
VP3040  -2  15  3.629893  Smf protein
VP3041  0  22  4.245061  hypothetical protein
VP3042  1  25  4.905264  peptide deformylase
VP3043  1  25  4.786620  methionyl-tRNA formyltransferase
VP3044  1  24  4.752041  Sun protein
VP3045  0  27  4.845300  potassium transporter peripheral membrane
VP3046  -1  26  3.762696  potassium uptake protein TrkH
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP3045  NUCEPIMERASE  30  0.022  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.8 bits (67), Expect = 0.022
Identities = 16/82 (19%), Positives = 34/82 (41%), Gaps = 12/82 (14%)

Query: 1 MKIIILG-AGQVGGTLAENLVGENNDITIVDNNADRLRELQDKYDLRVVNGHA---SHPD 56
MK ++ G AG +G +++ L+ + + +DN L D YD+ + + P
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDN-------LNDYYDVSLKQARLELLAQPG 53

Query: 57 V-LHEAGAQDADMLVAVTNTDE 77
H+ D + + + +
Sbjct: 54 FQFHKIDLADREGMTDLFASGH 75


31  VP0132  VP0139  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
VP0132  -2  15  1.062073  general secretion pathway protein C
VP0133  -2  17  1.189682  general secretion pathway protein D
VP0134  -1  21  2.091741  general secretion pathway protein E
VP0135  0  18  1.196685  general secretion pathway protein F
VP0136  0  21  1.860601  general secretion pathway protein G
VP0137  1  22  2.033335  general secretion pathway protein H
VP0138  1  19  2.842647  general secretion pathway protein I
VP0139  0  18  2.976910  general secretion pathway protein J
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0132  BCTERIALGSPC  220  6e-73  Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.
Length = 272

Score = 220 bits (561), Expect = 6e-73
Identities = 78/281 (27%), Positives = 134/281 (47%), Gaps = 29/281 (10%)

Query: 30 LSLLLTCLFIAITGWILGKVVW-LAIPQTSDVPKWRPSAGNVIASNKSNTIDFNALQSAN 88
+ +L L + + L + W + +P + P A + T L
Sbjct: 14 IRRILFYLLMLLFCQQLAMIFWRIGLPDNA--PVSSVQITPAQARQQPVT-----LNDFT 66

Query: 89 LFGKYTEQK-PVVVEQPVVKDAPKTRLNLTLVGAVASSNPQTSLAVIANRGKQATYGISE 147
LFG E+ ++ + + P + LNL+L G +A + S+A+I+ +Q + G++E
Sbjct: 67 LFGVSPEKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGVNE 126

Query: 148 EIEGTRATLKAVLVDRVIIDNEGRDETLMLEGVEYKKLSESPQATRVQAAQKEPASDVSD 207
E+ G A + ++ DRV++ +GR E L L E P A
Sbjct: 127 EVPGYNAKIVSIRPDRVVLQYQGRYEVLGLYSQEDSGSDGVPGA---------------- 170

Query: 208 KLEQIREEI-AKDPQSVFKYITISPVKKDDAIIGYRVSPGRDAALFNDVGLEPGDIAVQL 266
Q+ E++ + ++ Y++ SP+ D+ + GYR++PG + F VGL+ D+AV L
Sbjct: 171 ---QVNEQLQQRASTTMSDYVSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDNDMAVAL 227

Query: 267 NGIDLSDPSSSVQLMQVMSDPQELNLTVERDGQQYDIYIQL 307
NG+DL D + + M+ M+D LTVERDGQ+ DIY++
Sbjct: 228 NGLDLRDAEQAKKAMERMADVHNFTLTVERDGQRQDIYMEF 268


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0133  BCTERIALGSPD  641  0.0  Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.
Length = 660

Score = 641 bits (1655), Expect = 0.0
Identities = 325/665 (48%), Positives = 453/665 (68%), Gaps = 32/665 (4%)

Query: 11 LLAGSLLCVPGAMANEFSASFKGTDIQEFINIVGRNLEKTIIVDPSVRGKIDVRSYDVLN 70
L+ +LL P A A EFSASFKGTDIQEFIN V +NL KT+I+DPSVRG I VRSYD+LN
Sbjct: 15 LIFAALLFRP-AAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLN 73

Query: 71 EEQYYSFFLNVLEVYGYAVVEMDNGVLKVIKAKDSKTSAIPVMGDGS-AKGDSVITRVVA 129
EEQYY FFL+VL+VYG+AV+ M+NGVLKV+++KD+KT+A+PV D + GD V+TRVV
Sbjct: 74 EEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVP 133

Query: 130 VRNVSVRELSPLLRQLIDNAGAGNVVHYDPANIILITGRAAVVNRLAEIIKRVDQAGDKE 189
+ NV+ R+L+PLLRQL DNAG G+VVHY+P+N++L+TGRAAV+ RL I++RVD AGD+
Sbjct: 134 LTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRS 193

Query: 190 IELVELRNASAAEMVRIVEALNKTTNQKSTPEFLEPKIVADERTNSILISGDPKVRARLK 249
+ V L ASAA++V++V LNK T++ + P + +VADERTN++L+SG+P R R+
Sbjct: 194 VVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRII 253

Query: 250 RLIRQLDVEMATKGNNRVVYLKYAKAEDLVDVLKGVSDNLQAEKQAGQKGASSAQRGDVV 309
+I+QLD + AT+GN +V+YLKYAKA DLV+VL G+S +Q+EKQA +A +++
Sbjct: 254 AMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAA--KPVAALDKNII 311

Query: 310 IAAHEATNSLVLTAPPDIMMALQDVISQLDIRRAQVLIEALIVEMSEGDGINLGVQWGSL 369
I AH TN+L++TA PD+M L+ VI+QLDIRR QVL+EA+I E+ + DG+NLG+QW +
Sbjct: 312 IKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANK 371

Query: 370 ETGAVIQYGNAGAPIGQVMVGLEEAKDTVEKKPIRDSDTGAIKYYEETTTKGDYSTLASA 429
G + Q+ N+G PI + GA +Y ++ T S+LASA
Sbjct: 372 NAG-MTQFTNSGLPISTAI-------------------AGANQYNKDGTVS---SSLASA 408

Query: 430 LKNVNGAAMSIVMGDWTALVSAVASDSNSNILSSPSITVMDNGEASFIVGEEVPVITGST 489
L + NG A G+W L++A++S + ++IL++PSI +DN EA+F VG+EVPV+TGS
Sbjct: 409 LSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQ 468

Query: 490 AGSNNDNPFQTVDRKEVGIKLKVVPQINEGDSVQLNIEQEVSNVL----GANGAVDVRFA 545
S DN F TV+RK VGIKLKV PQINEGDSV L IEQEVS+V + + F
Sbjct: 469 TTS-GDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFN 527

Query: 546 KRQLNTSVMIQDGQMLVLGGLVDERALESESKVPLLGDIPVLGHLFKSTSTQTQKRNLMV 605
R +N +V++ G+ +V+GGL+D+ ++ KVPLLGDIPV+G LF+STS + KRNLM+
Sbjct: 528 TRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLML 587

Query: 606 FIKPTIIRDGMTADGITQRKYNFIRAEQLYKADQGLKLMSDDKIPVMPAFGQDRKHPAEI 665
FI+PT+IRD + +Y Q + + ++ + QD ++
Sbjct: 588 FIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENNDAMLNQDLLEIYPRQDTAAFRQV 647

Query: 666 QAFID 670
A ID
Sbjct: 648 SAAID 652


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0135  BCTERIALGSPF  522  0.0  Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 522 bits (1347), Expect = 0.0
Identities = 219/407 (53%), Positives = 305/407 (74%), Gaps = 3/407 (0%)

Query: 1 MAAFEYKALDAKGKQKKGTIEGDNARQVRQRLKEQGMIPVEVVEAKAKAAKS-SGSVGFK 59
MA + Y+ALDA+GK+ +GT E D+ARQ RQ L+E+G++P+ V E + KS S + +
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 60 RGIK--TAELALITRQLSTLVQSGMPLEECLRAVSEQAEKPRIRTMIAAVRSKVTEGYPL 117
R I+ T++LAL+TRQL+TLV + MPLEE L AV++Q+EKP + ++AAVRSKV EG+ L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 118 ADSLGDYPHVFDELFRSMVAAGEKSGHLDTVLERLAEYVENRQKMRSKLLQAMIYPVVLV 177
AD++ +P F+ L+ +MVAAGE SGHLD VL RLA+Y E RQ+MRS++ QAMIYP VL
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 178 VFAVAIVSFLLATVVPKIIEPIIQMGQELPQSTQFLLAASEFVQEWGLIIFAVLVVCFYG 237
V A+A+VS LL+ VVPK++E I M Q LP ST+ L+ S+ V+ +G + L+ F
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 238 LKLALQKPDFRLSWDRKIISLPLVGKISKGLNTARFARTLSICTSSAIPILEGMRVAVDV 297
++ L++ R+S+ R+++ LPL+G+I++GLNTAR+ARTLSI +SA+P+L+ MR++ DV
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 298 MSNRYVKQQVLIAADNVREGASLRKALDQTRLFPPMMLHMIASGEQSGELESMLTRAADN 357
MSN Y + ++ +A D VREG SL KAL+QT LFPPMM HMIASGE+SGEL+SML RAADN
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 358 QDQNFESTVNIALGVFTPALIALMAGLVLFIVMATLMPMLEMNNLMS 404
QD+ F S + +ALG+F P L+ MA +VLFIV+A L P+L++N LMS
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLMS 407


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0136  BCTERIALGSPG  218  9e-77  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 218 bits (558), Expect = 9e-77
Identities = 87/141 (61%), Positives = 107/141 (75%), Gaps = 4/141 (2%)

Query: 4 KRSKQHGFTLLEVMVVVVILGILASFVVPNLLGNKEKADQQKAITDIVALENALDMYKLD 63
KQ GFTLLE+MVV+VI+G+LAS VVPNL+GNKEKAD+QKA++DIVALENALDMYKLD
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62

Query: 64 NSVYPTTDQGLDALVTKPS-NPEPRNYRDGGYIKRLPNDPWGNAYQYLSPGDNGTIDIFT 122
N YPTT+QGL++LV P+ P NY GYIKRLP DPWGN Y ++PG++G D+ +
Sbjct: 63 NHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLS 122

Query: 123 LGADGQEGGEGPAADIGNWNM 143
G DG+ G E DI NW +
Sbjct: 123 AGPDGEMGTED---DITNWGL 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0137  BCTERIALGSPH  100  3e-29  Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 100 bits (251), Expect = 3e-29
Identities = 43/185 (23%), Positives = 72/185 (38%), Gaps = 37/185 (20%)

Query: 3 KHNGFTLIEILLVLVLLSLTAVAVITTLPTSQKDLSKQYAQSFFQRLQLLNEEAVLSGKD 62
+ GFTL+E++L+L+L+ ++A V+ P S+ D + Q F +L+ + + + +G+
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQF 61

Query: 63 FGVRVDDTKKTYTLLSLTAEG-------------WKPLEMKQIPSKTKLEDDIALQLDLG 109
FGV V + + +L W PL ++ + +
Sbjct: 62 FGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAGRVATSGSIAGGKLNLAFAQ 121

Query: 110 GGAWDKDDRLFEPGSLFEEMFADETEEKKVKPPQVFIFSSAEVTPFTLSFFPQKGDAFN- 168
G AW D P V IF E+TPF L+ G AFN
Sbjct: 122 GEAWTPGDN-----------------------PDVLIFPGGEMTPFRLTLGEAPGIAFNA 158

Query: 169 DGWRV 173
G +
Sbjct: 159 RGESL 163


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0139  BCTERIALGSPH  29  0.012  Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 28.8 bits (64), Expect = 0.012
Identities = 20/91 (21%), Positives = 35/91 (38%), Gaps = 11/91 (12%)

Query: 17 GFTLIE---VLVSIAIFASLSVAAYQVVSQVQRSNELSQERTQRLNELQRAMVMMDNDF- 72
GFTL+E +L+ + + A + + A+ + + +L +Q+ + F
Sbjct: 5 GFTLLEMMLILLLMGVSAGMVLLAFPASRD-DSAAQTLARFEAQLRFVQQRGLQTGQFFG 63

Query: 73 ------RQMALRQTRTNGEDPAGQLIFWSDY 97
R L +G DPA WS Y
Sbjct: 64 VSVHPDRWQFLVLEARDGADPAPADDGWSGY 94


32  VP0218  VP0224  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VP0218    0       18       -2.417455   hypothetical protein
VP0219    0       19       -4.121164   hypothetical protein
VP0220    1       23       -6.394571   OtnA protein
VP0221    5       33       -10.221195  OtnB protein
VP0222    7       39       -12.562121  dTDP-glucose 4,6 dehydratase
VP0223    9       45       -15.290155  D-glucose-1-phosphate thymidylyltransferase
VP0224    10      46       -15.917051  dTDP-4-dehydrorhamnose reductase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0218  PF06057  25  0.033  Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 24.8 bits (54), Expect = 0.033
Identities = 12/50 (24%), Positives = 22/50 (44%)

Query: 6 LAILMALSASTAFAAEEAGSAGAASASATGTTAVAVGAAAAVTVVAVAAS 55
L++L+ S + AFA E A + G +T V ++ + + S
Sbjct: 9 LSVLLLCSTANAFADEFADNLGLTLLPVEPSTQVNAASSHTKPPLVIFLS 58


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0219  OMPADOMAIN  45  5e-08  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 44.9 bits (106), Expect = 5e-08
Identities = 45/187 (24%), Positives = 71/187 (37%), Gaps = 26/187 (13%)

Query: 26 ALSAISLGLCSFTASA----DFYAGALVSYSNAEYHHS---STSSVTEGNPFLLQAQAGY 78
A++ G + +A +Y GA + + ++YH + + + T N A GY
Sbjct: 7 AIAVALAGFATVAQAAPKDNTWYTGAKLGW--SQYHDTGFINNNGPTHENQLGAGAFGGY 64

Query: 79 FFNDYVALEARY---GTSVQRESGLAIDSLASGF---VKLNMPVSERFAFYGLAGYSSVQ 132
N YV E Y G + S A G KL P+++ Y G +
Sbjct: 65 QVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLGGMVWR 124

Query: 133 IDQQN--VGSNKDQGFS--FGLGMHYALDKHNAVVFEFVD-------NTSEDQVRLNALT 181
D ++ G N D G S F G+ YA+ A E+ +T + L+
Sbjct: 125 ADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNIGDAHTIGTRPDNGMLS 184

Query: 182 LGFQHRF 188
LG +RF
Sbjct: 185 LGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0222  NUCEPIMERASE  178  2e-55  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 178 bits (454), Expect = 2e-55
Identities = 78/358 (21%), Positives = 146/358 (40%), Gaps = 48/358 (13%)

Query: 1 MKILVTGGAGFIGSALVRHIIKNTSDSVVNVDCLT--YAGNL-ESLGSVIQSERYVFEQV 57
MK LVTG AGFIG + + +++ VV +D L Y +L ++ ++ + F ++
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 58 NICDRAELNRVFEAHKPDAVMHLAAESHVDRSITGPAAFIETNVVGTYTMLEATREYWSK 117
++ DR + +F + + V V S+ P A+ ++N+ G +LE R +
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 118 LDDNAKAAFRFHHISTDEVYGDLPHPDEVSDGKELPMFLETTPYEPSSPYSASKASSDHL 177
+ S+ VYG +P + + P S Y+A+K +++ +
Sbjct: 120 ---------HLLYASSSSVYGLNRK---------MPFSTDDSVDHPVSLYAATKKANELM 161

Query: 178 VRAWLRTYGLPTMVTNCSNNYGPYHFPEKLIPLVILNALEGKDLPIYGKGDQIRDWLFVE 237
+ YGLP YGP+ P+ + LEGK + +Y G RD+ +++
Sbjct: 162 AHTYSHLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYID 221

Query: 238 DHARALYKVI------------------TEGKVGETYNIGGHNEKKNIEVVNTICEILDT 279
D A A+ ++ YNIG + + ++ + + + L
Sbjct: 222 DIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGI 281

Query: 280 LVPKQTAYSEQITYVQDRPGHDRRYAIDSSKMQRELNWTPEETFETGLRKTVQWYLDN 337
K + +PG + D+ + + +TPE T + G++ V WY D
Sbjct: 282 EAKKN--------MLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0224  NUCEPIMERASE  46  1e-07  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 45.5 bits (108), Expect = 1e-07
Identities = 46/213 (21%), Positives = 85/213 (39%), Gaps = 33/213 (15%)

Query: 3 NVLLLGESGFVGSVVYNSL----HCVCKVYTIAPNKKITI--DNVFEVANDVFKSIEIN- 55
L+ G +GF+G V L H V + + +++ + +A F+ +I+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 56 -------------NIDVVINCIAMANLDQCENNKLDCELVNTTFVTHIVDYLKDKDIK-L 101
+ + V + N N T +I++ + I+ L
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 102 VHISSNAVYDGLNA--PYSE-NSLREPINYYGICKSNADYYIESNLNNYAIA----RPIT 154
++ SS++VY GLN P+S +S+ P++ Y K + + + Y + R T
Sbjct: 122 LYASSSSVY-GLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 155 VYGPRKIEQR-DNPVSFIVKKILSGESFDLVDD 186
VYGP R D + K +L G+S D+ +
Sbjct: 181 VYGPW---GRPDMALFKFTKAMLEGKSIDVYNY 210


33  VP0361  VP0367  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VP0361    0       20       1.047255    two component response regulator transcription
VP0362    -1      17       0.506268    two component sensor protein
VP0363    -1      14       0.489414    glycerol dehydrogenase
VP0364    -1      14       0.535336    dihydroxyacetone kinase subunit DhaK
VP0365    -1      12       0.869134    dihydroxyacetone kinase subunit DhaL
VP0366    -1      13       0.805409    phosphoenolpyruvate-protein phosphotransferase
VP0367    -1      16       0.629472    DNA-binding transcriptional regulator DhaR
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0361  HTHFIS  79  3e-19  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.1 bits (195), Expect = 3e-19
Identities = 27/153 (17%), Positives = 62/153 (40%), Gaps = 9/153 (5%)

Query: 2 KILIVEDEHKAGEYLQKGLIESGYVVDLVHDGVDGLYHATSEEYDLILLDIMLPKLDGWQ 61
IL+ +D+ L + L +GY V + + + + DL++ D+++P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VLNTLRSSGIHTPVIMLTAKEQVEDRVRGFELGANDYVVKPYAFAELLARVQNVFRHHIA 121
+L ++ + PV++++A+ ++ E GA DY+ KP+ EL+ +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 AQVVASPQTLRVADLE---------LDMIKRVA 145
+ L ++ R+
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLM 157


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0362  PF06580  34  0.001  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 33.7 bits (77), Expect = 0.001
Identities = 25/165 (15%), Positives = 60/165 (36%), Gaps = 35/165 (21%)

Query: 298 LIHRHDEVFAVQEEIDRLVAYFEILAEDAEVRIVSSGDGQLYMDKNMFERAVGNLL---- 353
L + + ++ +E+ + +Y ++ ++ + ++ + + V +L
Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLA----SIQFEDRLQFENQINPAIMDVQVPPMLVQTL 263

Query: 354 -SNAIRHAYADS----TIVIDVQEVDEQVTVSVSNQGDTIAEQNLPYLFDRFYRADKSRQ 408
N I+H A I++ + + VT+ V N G +
Sbjct: 264 VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE-------------- 309

Query: 409 HVGSVGAGLGLS-ITQSIVQAY--EGQIKVTSDKDKTRFVMMLPS 450
G GL + + + Y E QIK++ + K ++++P
Sbjct: 310 -----STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0366  PHPHTRNFRASE  540  0.0  Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 540 bits (1393), Expect = 0.0
Identities = 202/542 (37%), Positives = 324/542 (59%), Gaps = 14/542 (2%)

Query: 9 AVVGIRVNDGIAAAPVVLFTHEMPAVPERDFQSEQGEIERVKRAIGVVVQHLQ------- 61
+ GI + G+A A + + + EIE++ A+ + L+
Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTE 63

Query: 62 EQAKQPKGEIFSAHSMMLSDPELWASVESRIQT-GMIAEQAWIESLQTLADEFRQAESQY 120
K EIF+AH ++L DPEL ++ +I+ M AE A E F +++Y
Sbjct: 64 ASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNEY 123

Query: 121 MREREADVHDIARQVMVEMTGV-TPNAIDIQEPSILLARDLMPSDVAGLDKSKVLGICLS 179
M+ER AD+ D++++V+ + GV T + I E ++++A DL PSD A L+K V G
Sbjct: 124 MKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFATD 183

Query: 180 EGGKTSHSAILARAMGIPAMVKAQGCLDAVRAGQVVTIDGFRGHLWFSPSDAIQQELEAQ 239
GG+TSHSAI++R++ IPA+V + + ++ G +V +DG G + +P++ + E +
Sbjct: 184 IGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYEEK 243

Query: 240 QIEWQSTRQSALASAQQAAATCDGVHIPVFANIGGPKDIDDALTSGAEGVGLFRTEFLFQ 299
+ ++ +Q + + T DG H+ + ANIG PKD+D L +G EG+GL+RTEFL+
Sbjct: 244 RAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFLYM 303

Query: 300 NSDELPTEEAQYQVYRDIAAALGDKPLTIRSLDVGGDKPLAAYPMPAEDNPFLGLRGVRL 359
+ D+LPTEE Q++ Y+++ + KP+ IR+LD+GGDK L+ +P E NPFLG R +RL
Sbjct: 304 DRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAIRL 363

Query: 360 CLQHESLFTAQLRAILRAFHEQPNIQLMIPMVAQVEEVRKVKALLAHQANQL---GLDAT 416
CL+ + +F QLRA+LRA N+++M PM+A +EE+R+ KA++ + ++L G+D +
Sbjct: 364 CLEKQDIFRTQLRALLRA-STYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDVS 422

Query: 417 H-LPVGIMIEVPAAVLNADALAQEVDFFSIGTNDLTQYVMAADRGNAAVAELVNYFEPSV 475
+ VGIM+E+P+ + A+ A+EVDFFSIGTNDL QY MAADR N V+ L + P++
Sbjct: 423 DSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPAI 482

Query: 476 LKAIELTCAAGDRAGIPVSMCGEMAGDPNATETLLRVGLQKFSASPSLLPGLKAQIRQLS 535
L+ +++ A G V MCGEMAGD A LL +GL +FS S + + ++Q+ +LS
Sbjct: 483 LRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKLS 542

Query: 536 VD 537
+
Sbjct: 543 KE 544


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0367  HTHFIS  224  3e-68  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 224 bits (572), Expect = 3e-68
Identities = 86/353 (24%), Positives = 144/353 (40%), Gaps = 40/353 (11%)

Query: 323 AQQQIGGSARYTFSTLPVISRKMKHVITVAKRAIKSKSPILITGEEGVGKATLAMAIHNE 382
+ L S M+ + V R +++ ++ITGE G GK +A A+H+
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183

Query: 383 STYKEGPFITLNCRSINTEQLIIETLGYDEG------QGMPSKFELAHGGTLFLEKVEYL 436
+ GPF+ +N +I + + E G+++G +FE A GGTLFL+++ +
Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243

Query: 437 SPDLQAVLLKLLKTGLVSRSDSLRLIPVDFQLITSTASEISEYVTQRSFGRQLYYEISSN 496
D Q LL++L+ G + I D +++ +T ++ + + Q F LYY ++
Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303

Query: 497 ELHIPPLRKRKEDIEFLVQQLISHYERRHNVTISIEPVALEALMNFRWSGNNSELRNRTE 556
L +PPLR R EDI LV+ + E+ + ALE + W GN EL N
Sbjct: 304 PLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVR 363

Query: 557 RILLNRSSNLIKLNDIPEDIK------------LNSRHTAENTPV--------------- 589
R+ ++I I +++ S + + V
Sbjct: 364 RLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDAL 423

Query: 590 -------ITLEEAERRAIVQAWNQCDGKMHDMAKALQIGRTTLWRKINKFGLQ 635
L E E I+ A G A L + R TL +KI + G+
Sbjct: 424 PPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


34  VP0413  VP0426  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VP0413    0       13       1.231682    undecaprenyl pyrophosphate phosphatase
VP0414    0       12       1.075689    multifunctional tRNA nucleotidyl
VP0415    -1      13       1.123242    general secretion pathway protein A
VP0416    -1      12       0.357925    general secretion pathway protein B
VP0417    -1      11       0.271783    hypothetical protein
VP0418    -1      9        0.949963    pho4 family protein
VP0419    -1      11       1.058135    hypothetical protein
VP0420    -1      10       0.988410    hypothetical protein
VP0422    -1      10       0.842344    potassium channel protein
VP0421    -1      9        0.997487    methyl-accepting chemotaxis protein
VP0423    -1      9        0.750151    bifunctional glutamine-synthetase
VP0424    0       11       -0.147861   bifunctional heptose 7-phosphate kinase/heptose
VP0425    -1      11       0.233219    outer membrane channel protein
VP0426    -1      11       1.067553    MutT/nudix family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0413  MICOLLPTASE  31  0.006  Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 30.8 bits (69), Expect = 0.006
Identities = 13/55 (23%), Positives = 21/55 (38%), Gaps = 1/55 (1%)

Query: 125 GLLLWWVDKNAKLVADEYQTGWKKAVFIGIAQALAMIPGTSRSGATITAALYLGF 179
G+L W+ + A+ A +T K + Q LA S + A Y +
Sbjct: 527 GVLTWYEEGTAEFFAGSTRTDGIKPR-KSVTQGLAYDRNNRMSLYGVLHAKYGSW 580


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0415  HTHFIS  32  0.008  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.7 bits (72), Expect = 0.008
Identities = 16/43 (37%), Positives = 23/43 (53%)

Query: 47 MLTGEVGTGKTTVAKAMLANLDESTKAGLILNPTFSSRDLLEA 89
M+TGE GTGK VA+A+ + +N RDL+E+
Sbjct: 164 MITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIES 206


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0417  RTXTOXIND  30  0.008  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.8 bits (67), Expect = 0.008
Identities = 14/76 (18%), Positives = 27/76 (35%), Gaps = 5/76 (6%)

Query: 91 RLPRIEKELSEVKEQLANAR-QTSDTEKAGLVTSLETRNQQITDLEKKYSEISDQLTSVE 149
+ E + E +L + Q E + S + Q +T + +EI D+L
Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESE--ILSAKEEYQLVT--QLFKNEILDKLRQTT 308

Query: 150 TENRELRAKLDTQKDD 165
L +L ++
Sbjct: 309 DNIGLLTLELAKNEER 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0419  ANTHRAXTOXNA  29  0.015  Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 29.3 bits (65), Expect = 0.015
Identities = 35/182 (19%), Positives = 82/182 (45%), Gaps = 22/182 (12%)

Query: 29 ECCSHLVKFFEVSSKGDWEKASEIRAQISHLEK-EADVLK--REIRLKLPRGLFMPVDRT 85
+ ++LVK E ++ E +I+ L+K DVL+ E+ ++ F +D
Sbjct: 61 DSINNLVKT-EFTN----ETLDKIQQTQDLLKKIPKDVLEIYSELGGEI---YFTDIDLV 112

Query: 86 DMLEL--LTQQDKLANLAKDIAGR---VYGRQLVIPEALQPNFLAYVQRCLDAANQAQKV 140
+ EL L++++K + + G R + + P + ++ + Q+++V
Sbjct: 113 EHKELQDLSEEEKNS---MNSRGEKVPFASRFVFEKKRETPKLIINIKDYAINSEQSKEV 169

Query: 141 INELDELLETGFKGREVTLVAEMIHQLDVIEDDTDAMQIELRQQLMAIESEMNP--IDVM 198
E+ + + ++ +L E ++ + + DD+D+ + Q+ + E+N ID+
Sbjct: 170 YYEIGKGISLDIISKDKSLDPEFLNLIKSLSDDSDSSDLLFSQKFKE-KLELNNKSIDIN 228

Query: 199 FL 200
F+
Sbjct: 229 FI 230


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0421  RTXTOXINA  34  0.002  Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 33.8 bits (77), Expect = 0.002
Identities = 14/93 (15%), Positives = 43/93 (46%)

Query: 191 SAQEEFNEIDQLATAMSEMSSTVQTVADHAQTASSLTEQASTQAVTGQQFLQSTVAKMSE 250
A+ + + + +S + + T + + +Q S V+ + ++++ +++
Sbjct: 131 GAENIGDNLGKAGGILSTFQNFLGTALSSMKIDELIKKQKSGGNVSSSELAKASIELINQ 190

Query: 251 LSSDIASSAQAVNQVEERVESIGSVVGTIQGIS 283
L +AS VN +++ ++GSV+ + ++
Sbjct: 191 LVDTVASLNNNVNSFSQQLNTLGSVLSNTKHLN 223


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0424  LPSBIOSNTHSS  30  0.015  Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 29.8 bits (67), Expect = 0.015
Identities = 10/28 (35%), Positives = 15/28 (53%)

Query: 347 GCFDILHAGHVSYLNHAAELGDRLIVAV 374
G FD + GH+ + L D++ VAV
Sbjct: 7 GSFDPITFGHLDIIERGCRLFDQVYVAV 34


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0426  FRAGILYSIN  28  0.036  Fragilysin metallopeptidase (M10C) enterotoxin signat...
		>FRAGILYSIN#Fragilysin metallopeptidase (M10C) enterotoxin signature.

Length = 405

Score = 27.7 bits (61), Expect = 0.036
Identities = 29/139 (20%), Positives = 53/139 (38%), Gaps = 29/139 (20%)

Query: 63 LPYDPKTDQVVIIEQIRIGALEHEHPWQLEIVAGMIDRDESAEEVIRREAEEEAGITVGR 122
+P +PKT V+ + + +E Q++ A + A ++R +
Sbjct: 221 VPSEPKTVYVICLRENGSTIYPNEVSAQMQDAANSV----YAVHGLKRY------VNFHF 270

Query: 123 VASVTSYYPSSGGCSEKLDVFVGEVDAS-KAHGIHGLDYEDEDIRVHVLSREQAYQWVKD 181
V T Y SG E L+ F + ++ KA G Y+D Q Y ++
Sbjct: 271 VLYTTEYSCPSGDAKEGLEGFTASLKSNPKAEG-----YDD-----------QIYFLIRW 314

Query: 182 GIFENGASIIALQWLQLNH 200
G ++N I+ + W +
Sbjct: 315 GTWDN--KILGMSWFNSYN 331


35  VP0536  VP0542  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VP0536    -1      14       0.197840    FKBP-type peptidylprolyl isomerase
VP0537    -1      14       0.069912    4-hydroxy-3-methylbut-2-enyl diphosphate
VP0538    -1      13       0.297654    two-component response-regulatory protein YehT
VP0539    0       12       1.190208    hypothetical protein
VP0540    1       12       1.588647    carbon starvation protein A
VP0541    0       13       1.933541    hypothetical protein
VP0542    -1      13       1.853920    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0536  INFPOTNTIATR  34  1e-04  Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 33.8 bits (77), Expect = 1e-04
Identities = 14/32 (43%), Positives = 18/32 (56%)

Query: 5 TNDSVVTLHFTIKMKDGSVADSTHNMGKPAKF 36
VT+ +T + DG+V DST GKPA F
Sbjct: 142 GKSDTVTVEYTGTLIDGTVFDSTEKAGKPATF 173


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0538  HTHFIS  64  4e-14  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.5 bits (157), Expect = 4e-14
Identities = 34/116 (29%), Positives = 52/116 (44%), Gaps = 7/116 (6%)

Query: 3 TALVIDDEQFAREELAELLDETG-QVEVVGDASNAILGLKKINELKPDVVFLDIQMPQVT 61
T LV DD+ R L + L G V + +A+ + I D+V D+ MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATL---WRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GIELLGML-DPETMPYVVFVTAYDQY--AIQAFEDNAFDYLLKPVDPCRLNKTVKR 114
+LL + V+ ++A + + AI+A E A+DYL KP D L + R
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0539  PF06580  222  1e-69  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 222 bits (566), Expect = 1e-69
Identities = 67/185 (36%), Positives = 109/185 (58%), Gaps = 1/185 (0%)

Query: 355 DYQQQQALLAQAEIKLLHAQVNPHFLFNALNTISAITRRDPDKARELIQNLSHFFRSNLK 414
D + ++ +A++ L AQ+NPHF+FNALN I A+ DP KARE++ +LS R +L+
Sbjct: 150 DQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLR 209

Query: 415 -QNINTVTLKEELAHVNSYLSIEKARFTDRLEVEIDIDPELLDIKLPSFTLQPLVENAIK 473
N V+L +EL V+SYL + +F DRL+ E I+P ++D+++P +Q LVEN IK
Sbjct: 210 YSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIK 269

Query: 474 HGISNMLEGGKVKIYSEMHPQGHLITVEDNAGSFQPPKDNHSGLGLEIVDKRLTNQFGRD 533
HGI+ + +GGK+ + + VE+ +G GL+ V +RL +G +
Sbjct: 270 HGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTE 329

Query: 534 SALKI 538
+ +K+
Sbjct: 330 AQIKL 334


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0542  NEISSPPORIN  29  0.004  Neisseria sp. porin signature.
		>NEISSPPORIN#Neisseria sp. porin signature.

Length = 348

Score = 28.8 bits (64), Expect = 0.004
Identities = 22/72 (30%), Positives = 31/72 (43%), Gaps = 17/72 (23%)

Query: 1 MKKYFLVALVL--LPLTASAGNVLKDTIKA----KRKYDHNKHQITNTLNNTK------- 47
MKK L+AL L LP+ A A L IKA R +H +++ ++
Sbjct: 1 MKKS-LIALTLAALPVAAMADVTLYGAIKAGVQTYRSVEHTDGKVSKVETGSEIADFGSK 59

Query: 48 ---RDIEDLTNG 56
+ EDL NG
Sbjct: 60 IGFKGQEDLGNG 71


36  VP0648  VP0661  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VP0648    1       17       -0.872627   recombination and repair protein
VP0649    0       20       -0.627992   hypothetical protein
VP0650    1       22       -0.653346   inorganic polyphosphate/ATP-NAD kinase
VP0651    2       25       -0.938920   heat shock protein GrpE
VP0652    1       22       -0.797914   hypothetical protein
VP0653    2       23       -1.314747   molecular chaperone DnaK
VP0654    -1      18       -2.198869   molecular chaperone DnaJ
VP0655    2       27       -3.782753   hypothetical protein
VP0656    2       28       -4.090550   fimbrial assembly protein PilE
VP0657    2       19       -2.693760   type IV pilin
VP0658    2       19       -2.139663   hypothetical protein
VP0659    3       20       -1.898554   hypothetical protein
VP0660    -1      15       -0.247125   type IV pilin
VP0661    -1      22       1.484922    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0648  RTXTOXIND  33  0.002  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 33.3 bits (76), Expect = 0.002
Identities = 29/163 (17%), Positives = 57/163 (34%), Gaps = 27/163 (16%)

Query: 148 QYAGHLNLLKSTRSAYQHWRQADNNLKQLKENSQQNQAQKQLLEYQIKELNELSLGEEE- 206
+ +L+K S +Q N Q + N + +A++ + +I LS E+
Sbjct: 183 EVLRLTSLIKEQFSTWQ------NQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSR 236

Query: 207 -------FAE--------LEQEHKRLSNSGELAATCQQALELIYEGEEVNALGILQSA-- 249
+ LEQE+K + EL Q ++ E L +
Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296

Query: 250 -NHSLIQLAELDEKLAELPNMLADAMIQLEETKNELRSYLDGI 291
N L +L + + + L LA + + + +R+ +
Sbjct: 297 KNEILDKLRQTTDNIGLLTLELAKNEERQQAS--VIRAPVSVK 337


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0651  PHPHTRNFRASE  28  0.021  Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 28.2 bits (63), Expect = 0.021
Identities = 24/119 (20%), Positives = 52/119 (43%), Gaps = 6/119 (5%)

Query: 8 VTEEELDQIIEEAEKVEAAAQEAEAELEEIGDEKDAKIAQLEAALLSSETKVKDQQDAVL 67
+ + + + E EK+ AA ++++ EL I D+ +A + +A + ++ V D + V
Sbjct: 29 IEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTEASMGADKAEIFAAHLLVLDDPELVD 88

Query: 68 RAKAEVENMRRRTEQEIDKARKYALNKFAEELLPVIDNL--ERAIQAADTENEVIKPIL 124
K ++EN + E + + + F + + ERA D V+ ++
Sbjct: 89 GIKGKIENEQMNAEYALKEVS----DMFVSMFESMDNEYMKERAADIRDVSKRVLGHLI 143


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0653  SHAPEPROTEIN  139  2e-38  Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 139 bits (351), Expect = 2e-38
Identities = 81/385 (21%), Positives = 145/385 (37%), Gaps = 81/385 (21%)

Query: 5 IGIDLGTTNSCVAVLDG----DKPRVIE-NAEGERTTASVIAYTDGETLVGQPAKRQAVT 59
+ IDLGT N+ + V ++P V+ + + SV A VG AK+
Sbjct: 13 LSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAA-------VGHDAKQMLGR 65

Query: 60 NPTNTLFAIKRLIGRRFEDEEVQRDIEIMPYKIVKADNGDAWVEAKGQKMAAPQVSAEVL 119
P N + AI+ + D V + ++L
Sbjct: 66 TPGN-IAAIRPMKDGVIADFFV---------------------------------TEKML 91

Query: 120 KK-MKKTAEDFLGEEVTGAVITVPAYFNDAQRQATKDAGRIAGLEVKRIINEPTAAALAY 178
+ +K+ + ++ VP +R+A +++ + AG +I EP AAA+
Sbjct: 92 QHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGA 151

Query: 179 GLDKKGGDRTIAVYDLGGGTFDISIIEIDEVEGEKTFEVLATNGDTHLGGEDFDNRLINY 238
GL V D+GGGT ++++I ++ V + +GG+ FD +INY
Sbjct: 152 GLPVS-EATGSMVVDIGGGTTEVAVISLNGV---------VYSSSVRIGGDRFDEAIINY 201

Query: 239 LVDEFKKEQGIDLKNDPLAMQRVKEAAEKAKIELSSTSQTD----VNLPYVTADATGPKH 294
+ + G + AE+ K E+ S D + + P+
Sbjct: 202 VRRNYGSLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRG 248

Query: 295 MNIKVTRAKLESLVEDLVQRSLEPLKVALADA--DLSVNDITD--VILVGGQTRMPMVQA 350
+ + LE+L E L + + VAL +L+ +DI++ ++L GG + +
Sbjct: 249 FTLN-SNEILEALQEPLTG-IVSAVMVALEQCPPELA-SDISERGMVLTGGGALLRNLDR 305

Query: 351 KVAEFFGKEARRDVNPDEAVAMGAA 375
+ E G +P VA G
Sbjct: 306 LLMEETGIPVVVAEDPLTCVARGGG 330


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0654  PF07132  30  0.016  Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 30.0 bits (67), Expect = 0.016
Identities = 16/38 (42%), Positives = 19/38 (50%)

Query: 76 QGGGGFGGGFGGGGADFGDIFGDVFGDIFGGGRRGGGG 113
GGG GGG GG G+ G + G + G GGG G
Sbjct: 64 MMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLG 101



Score = 29.7 bits (66), Expect = 0.020
Identities = 17/43 (39%), Positives = 19/43 (44%)

Query: 78 GGGFGGGFGGGGADFGDIFGDVFGDIFGGGRRGGGGHRAQRGA 120
G GGG GGG G G + G + GGG GG G G
Sbjct: 62 GSMMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLGSGL 104


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0656  BCTERIALGSPG  46  4e-09  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 45.6 bits (108), Expect = 4e-09
Identities = 14/53 (26%), Positives = 34/53 (64%)

Query: 12 KRLRGMTLIELLIAVTIVGIIAAIAYPSYTNHVIKSHRTVALSDLSRIQLELE 64
+ RG TL+E+++ + I+G++A++ P+ + K+ + A+SD+ ++ L+
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALD 57


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0657  BCTERIALGSPG  34  8e-05  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 34.5 bits (79), Expect = 8e-05
Identities = 10/25 (40%), Positives = 17/25 (68%)

Query: 17 RGFTLLELMITVAVLSALLATAAPS 41
RGFTLLE+M+ + ++ L + P+
Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPN 32


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0660  BCTERIALGSPG  29  0.002  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 29.5 bits (66), Expect = 0.002
Identities = 20/58 (34%), Positives = 34/58 (58%), Gaps = 5/58 (8%)

Query: 4 KQKGFNLLEVLISFLLIGV-GALGLTKLNVYLEQ-ESDYAIESIEALRLAENKLEWFR 59
KQ+GF LLE+++ ++IGV +L + L E+ + A+ I AL EN L+ ++
Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVAL---ENALDMYK 60


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP0661  BACINVASINB  27  0.004  Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 27.0 bits (59), Expect = 0.004
Identities = 9/19 (47%), Positives = 16/19 (84%), Gaps = 1/19 (5%)

Query: 34 GVAFLQN-MSPIMEKIIEP 51
GV+F+Q ++PIME +++P
Sbjct: 362 GVSFIQQALNPIMEHVLKP 380


37  VP0776  VP0794  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VP0776    2       16       0.981222    flagellar basal body rod protein FlgC
VP0777    1       16       1.348697    flagellar basal body rod modification protein
VP0778    2       17       2.042085    flagellar hook protein FlgE
VP0779    0       17       2.917665    hypothetical protein
VP0780    0       16       2.330483    flagellar basal body rod protein FlgF
VP0781    -1      16       1.802000    flagellar basal body rod protein FlgG
VP0782    0       15       1.626767    flagellar basal body L-ring protein
VP0783    0       14       1.308769    flagellar basal body P-ring biosynthesis protein
VP0784    0       14       0.392659    flagellar rod assembly protein/muramidase FlgJ
VP0785    0       17       0.377438    flagellar hook-associated protein FlgK
VP0786    0       16       0.319749    flagellar hook-associated protein FlgL
VP0787    2       19       0.601834    hypothetical protein
VP0788    2       21       0.111775    flagellin
VP0789    3       27       -0.601552   hypothetical protein
VP0790    4       30       -0.433125   flagellin
VP0791    5       29       -1.262395   flagellin
VP0792    6       43       -1.553644   hypothetical protein
VP0793    4       35       -1.646319   PTS system glucose-specific transporter subunit
VP0794    3       25       -1.408966   phosphoenolpyruvate-protein phosphotransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VP0776    FLGHOOKAP1  31     0.001    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 31.1 bits (70), Expect = 0.001
Identities = 9/32 (28%), Positives = 15/32 (46%)

Query: 98 NVNVMEEMANMISASRAYQTNVQVADSSKQML 129
VN+ EE N+ + Y N QV ++ +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIF 539


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VP0778    FLGHOOKAP1  39     3e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 39.2 bits (91), Expect = 3e-05
Identities = 15/33 (45%), Positives = 21/33 (63%)

Query: 3 YVSLSGLSAAQLDLNTTSNNIANANTYGFKESR 35
++SGL+AAQ LNT SNNI++ N G+
Sbjct: 5 NNAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37



Score = 35.7 bits (82), Expect = 3e-04
Identities = 11/49 (22%), Positives = 26/49 (53%)

Query: 389 TINNGMLEQSNIDMTQELVDLISAQRNFQANSRSLEVHNQLQQNILQIR 437
++N S +++ +E +L Q+ + AN++ L+ N + ++ IR
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VP0781    FLGHOOKAP1  44     4e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.8 bits (103), Expect = 4e-07
Identities = 10/47 (21%), Positives = 22/47 (46%)

Query: 214 EIRQSMLEASNVNVTEELVNMIEAQRVYEMNSKVISSVDKMMSFVNQ 260
++ S VN+ EE N+ Q+ Y N++V+ + + + +
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 39.6 bits (92), Expect = 9e-06
Identities = 17/77 (22%), Positives = 34/77 (44%), Gaps = 14/77 (18%)

Query: 5 LWVSKTGLDAQQTNIATISNNLANASTVGYKKSRAVFEDLFYQNINQPGGQSSQNTELPS 64
+ + +GL+A Q + T SNN+++ + GY + + + N+ L +
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTLGA 49

Query: 65 GLMLGAGSKVVATQKVH 81
G +G G V Q+ +
Sbjct: 50 GGWVGNGVYVSGVQREY 66


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP0782    FLGLRINGFLGH  147    9e-46    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 147 bits (372), Expect = 9e-46
Identities = 75/204 (36%), Positives = 108/204 (52%), Gaps = 13/204 (6%)

Query: 65 AWAPIHPKQK--------PEHYAAATGSLF-SPEHIT----DLYDDSKPRGIGDIITVTL 111
AW P P + P A GS+F S + I L++D +PR IGD +T+ L
Sbjct: 23 AWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINYGYQPLFEDRRPRNIGDTLTIVL 82

Query: 112 DETTSATKSANADLSKTNEAQMDPLQVGGEELKVGGKYNFSYDLNNTNTFAGDSSAKQSN 171
E SA+KS++A+ S+ + V + G + + NTF G A SN
Sbjct: 83 QENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNARADVEASGGNTFNGKGGANASN 142

Query: 172 SISGYITVEVIEVLANGNLVIRGEKWMTLNTGDEYIRLSGTIRPDDINFDNTIASNRVSN 231
+ SG +TV V +VL NGNL + GEK + +N G E+IR SG + P I+ NT+ S +V++
Sbjct: 143 TFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRTISGSNTVPSTQVAD 202

Query: 232 ARIQYSGTGLSQDMQEPGFLARFF 255
ARI+Y G G + Q G+L RFF
Sbjct: 203 ARIEYVGNGYINEAQNMGWLQRFF 226


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP0783    FLGPRINGFLGI  415    e-147    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 415 bits (1068), Expect = e-147
Identities = 166/367 (45%), Positives = 226/367 (61%), Gaps = 15/367 (4%)

Query: 6 LLLMSVALFSTA---AQAARIKDVAQVAGVRSNQLVGYGLVSGLPGTGE---ANPFTEQS 59
L+ ++ ST A +RIKD+A + R NQL+GYGLV GL GTG+ ++PFTEQS
Sbjct: 11 LVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQS 70

Query: 60 FAAMLQNFGIQLPPGTKPKIKNVAAVMVTAELPPFSKPGQQVDVTVSSIGSAKSLRGGTL 119
AMLQN GI G KN+AAVMVTA LPPF+ PG +VDVTVSS+G A SLRGG L
Sbjct: 71 MRAMLQNLGITTQGGQ-SNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNL 129

Query: 120 LQTFLKGLDGQVYAVAQGNLVVSGFSAEGADGSKIVGNNPTVGLISSGATVEREIPNPFG 179
+ T L G DGQ+YAVAQG L+V+GFSA+G D + + T + +GA +ERE+P+ F
Sbjct: 130 IMTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFK 188

Query: 180 RGDYITFNLLESDFTTAQRMADAVNNF----LGPQMASAVDATSVRVRAPRDVSQRVAFL 235
+ L DF+TA R+AD VN F G +A D+ + V+ PR V+ +
Sbjct: 189 DSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPR-VADLTRLM 247

Query: 236 SAIENLEFDPADGAAKIIVNSRTGTIVVGKHVRLKPAAVTHGGMTVAIKENLNVSQPNSF 295
+ IENL + D AK+++N RTGTIV+G VR+ AV++G +TV + E+ V QP F
Sbjct: 248 AEIENLTVET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPF 306

Query: 296 SGGQTVVVPDSDIEVTEEKGKMFKFEPGLTLDDLVRAVNEVGAAPSDLMAILQALKQAGA 355
S GQT V P +DI +E K+ E G L LV +N +G ++AILQ +K AGA
Sbjct: 307 SRGQTAVQPQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQGIKSAGA 365

Query: 356 IEGQLII 362
++ +L++
Sbjct: 366 LQAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VP0784    FLGFLGJ  270    6e-92    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 270 bits (690), Expect = 6e-92
Identities = 98/299 (32%), Positives = 156/299 (52%), Gaps = 18/299 (6%)

Query: 13 DISSLDSLRQKAVKEGKDGEQEA--LHAAARQFESIFTSMMLKSMREANEGFESNIMNSQ 70
D SL+ L+ KA GE A + ARQ E +F MMLKSMR+A + + +S+
Sbjct: 14 DAQSLNELKAKA------GEDPAANIRPVARQVEGMFVQMMLKSMRDALP--KDGLFSSE 65

Query: 71 NEKFYRQMLDEQMASELSANGSMGLADMIVAQLTAGQGNDKSETAMRDAANSAVEYRRVD 130
+ + Y M D+Q+A +++A +GLA+M+V Q+T Q + T R
Sbjct: 66 HTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETVVRYQ 125

Query: 131 PKKAREIEKRLIESGELSRTSHTPAKFDSPESFVNSMKPYAEKAAKALGVEPSLLLAQAA 190
+ ++ ++ + DS ++F+ + A+ A++ GV L+LAQAA
Sbjct: 126 NQALSQLVQKAVPRNYDDSLP-----GDS-KAFLAQLSLPAQLASQQSGVPHHLILAQAA 179

Query: 191 LETGWGQKVVQNARGS-SNNLFNIKADRSWQGDKVTTQTLEFHDNTPVKETAAFRSYSNY 249
LE+GWGQ+ ++ G S NLF +KA +W+G T E+ + K A FR YS+Y
Sbjct: 180 LESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSY 239

Query: 250 QDSFNDYVRFLNDNPRYETALQQRGDSESFIRGIHRAGYATDPTYADKVLQVKQKIESM 308
++ +DYV L NPRY A+ +E + + AGYATDP YA K+ + Q+++S+
Sbjct: 240 LEALSDYVGLLTRNPRY-AAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSI 297


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VP0785    FLGHOOKAP1  539    0.0      Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 539 bits (1390), Expect = 0.0
Identities = 113/446 (25%), Positives = 207/446 (46%), Gaps = 17/446 (3%)

Query: 3 SDLLNVGTQSVLTAQRQLNTTGHNISNVNTEGYSRQSVIQGTNDPRMFGGSTYGMGVHVE 62
S L+N + AQ LNT +NIS+ N GY+RQ+ I + + G G GV+V
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 63 NVRRSWDQFAVNELNLSSTSNANKTDTQDNLDMLSSMLSSVASKKIPENLNEWFDAVKTM 122
V+R +D F N+L + T ++ T + + + +MLS+ ++ + + ++F +++T+
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLST-STSSLATQMQDFFTSLQTL 119

Query: 123 ADTPNDLGARKVVLEKAKIFSDTLNDFHETVRLQSDVTNKKLDMGIERINQLALEIRDIH 182
D AR+ ++ K++ + + +R Q N + +++IN A +I ++
Sbjct: 120 VSNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLN 179

Query: 183 RLMMRTPG-----PHNDLMDQHEKLITELSEYTKVTVTPRKNAEGFNVHIGNGHTLVSGT 237
+ R G N+L+DQ ++L++EL++ V V+ + +N+ + NG++LV G+
Sbjct: 180 DQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGT-YNITMANGYSLVQGS 238

Query: 238 EASQLKMIDGYPDVHQRRLAMVEGDG--IKAIKADDMDGKIGALLDMRDKHIPQLLDEMG 295
A QL + D + +A V+G I+ + G +G +L R + + Q + +G
Sbjct: 239 TARQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLG 298

Query: 296 RLATGFSYKVNQLQSQGLDLNGKIGKDVFTDVNSELVAKSRVFTAPNSKADVAV--YVDD 353
+LA F+ N G D NG G+D F + K V +K DVA+ V D
Sbjct: 299 QLALAFAEAFNTQHKAGFDANGDAGEDFFA------IGKPAVLQNTKNKGDVAIGATVTD 352

Query: 354 ISALKGGEYALRFDGDRYSVTTPKGEQVQIDIDQSDSTFVLDGMRVQIGEGLAAGERVLL 413
SA+ +Y + FD +++ VT ++ DG+ + A + L
Sbjct: 353 ASAVLATDYKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTL 412

Query: 414 RPTRSGAAIIKMETNDAKSIAAQSYE 439
+P + + D IA S E
Sbjct: 413 KPVSDAIVNMDVLITDEAKIAMASEE 438



Score = 143 bits (363), Expect = 9e-39
Identities = 34/105 (32%), Positives = 60/105 (57%)

Query: 542 EGGNGNLRKMQQLQTNKMMDGKSSTLIDVYHNLNTEVGLKSATANRLASVARLEHEAAQE 601
+ N N + + LQ+N G + + D Y +L +++G K+AT ++
Sbjct: 442 DSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLSN 501

Query: 602 RVASIAGVNLDEEAANMMKFQQAYMASSRVMQAANDTFNTILQLR 646
+ SI+GVNLDEE N+ +FQQ Y+A+++V+Q AN F+ ++ +R
Sbjct: 502 QQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VP0786    FLAGELLIN  37     1e-04    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 36.9 bits (85), Expect = 1e-04
Identities = 26/183 (14%), Positives = 57/183 (31%), Gaps = 3/183 (1%)

Query: 202 QSGSDLLLSKATNVDAKDTASYRVTFVDMNNGKFGYQLERNGKVVDADEFSPEKGIEYKG 261
+ + + + + + V+ + K+ D + + KG
Sbjct: 312 VADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKIT 371

Query: 262 LKVHVKGQITPGDSIGIEKRESFSIFDTFKEAMSWSDKSVSDTSATAKLHQMTEEFQAAF 321
+ GD + + ++F + + + +A +A
Sbjct: 372 VNGAEYTANAAGDKVTLA---GKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSAL 428

Query: 322 IHLNKARTDVGARLSTLDIQEQNHEDFNLSLAKAKSNFEDLDYSKAVIEFSENSRALQAS 381
++ R+ +GA + D N + +L A+S ED DY+ V S+ QA
Sbjct: 429 SKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAG 488

Query: 382 QQA 384

Sbjct: 489 TSV 491



Score = 31.6 bits (71), Expect = 0.006
Identities = 34/227 (14%), Positives = 75/227 (33%), Gaps = 13/227 (5%)

Query: 9 HNYQSV--QNDLRRQENKIHHNQEQLASGKKLLKPSDDPLAAHYIQNIGQQQEQLKQYLS 66
N S+ QN+L + ++ + E+L+SG ++ DD + L Q
Sbjct: 6 TNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASR 65

Query: 67 SIVLVRNRLENHEVNIANAESFADESKRLTMEMINGAFSAEDRQAKKRELEEIANNFLNL 126
+ + + E + + + L+++ NG S D ++ + E+++ +
Sbjct: 66 NANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRV 125

Query: 127 VNAQDESGNYVFAGTKPKSQPFYRDKDGSVQYAGDDYQR-KMKVSSMLDMPMNDPGSKLF 185
N +G V + +Q +D + + + + + G +
Sbjct: 126 SNQTQFNGVKVLSQDNQM----------KIQVGANDGETITIDLQKIDVKSLGLDGFNVN 175

Query: 186 MEIPNPFGDYQPSYDLQSGSDLLLSKATNVDAKDTASYRVTFVDMNN 232
GD + S+ +G D A + VT
Sbjct: 176 GPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPT 222


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VP0788    FLAGELLIN  185    7e-56    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 185 bits (470), Expect = 7e-56
Identities = 90/370 (24%), Positives = 162/370 (43%), Gaps = 10/370 (2%)

Query: 2 AVTVSTNVSAMTAQRYLNKATNELNTSMERLSSGHKINSAKDDAAGLQISNRLTAQSRGL 61
A ++TN ++ Q LNK+ + L++++ERLSSG +INSAKDDAAG I+NR T+ +GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVAMRNANDGISIAQTAEGAMNEATSVMQRMRDLAIQSSNGTNSPAERQAINEESMALVD 121
A RNANDGISIAQT EGA+NE + +QR+R+L++Q++NGTNS ++ ++I +E ++
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELNRIAETTSFGGRRLLNGSFGEAAFQIGASSGEAMIMGLTSIRADDTRMGGVTFFSEVG 181
E++R++ T F G ++L+ Q+GA+ GE + + L I + G
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQ-MKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 KGKDWGVDPTKADLKITLPGMGEDEDGNVDDLEININAKAGDDIEELATYINGQSDMINA 241
+ + +++N+ A T +
Sbjct: 180 ATVGD------LKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAAN 233

Query: 242 SVSEDGKLQIFVAHPNVQGDISISGGLASELGLSDEPVRTSVQDIDMTTVQGSQNAISVL 301
+ A + S +G ++ D V + + +
Sbjct: 234 GQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGN 293

Query: 302 DSALK---YVDSQRADLGAKQNRLSHSINNLANIQENVDASNSRIKDTDFAKETTQMTKA 358
D K ++ ++ L + + A +Q + + S + + T+ A
Sbjct: 294 DGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESA 353

Query: 359 QILQQAGTSI 368
++ +
Sbjct: 354 KLSDLEANNA 363



Score = 137 bits (346), Expect = 3e-38
Identities = 70/243 (28%), Positives = 114/243 (46%), Gaps = 24/243 (9%)

Query: 160 GLTSIRADDTRMGGVTFFSEVGKGKDWGVDPTKADLKITLPGMGEDEDGNVDDLEININA 219
G D + T ++ G + V T K+TL ++ N++A
Sbjct: 271 GGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTL------TVADITAGAANVDA 324

Query: 220 KAGDDIEEL-ATYINGQSDMINASVSEDGKLQIFVAHPNVQGDISISGGLASELGLSDEP 278
+ + + +NGQ + + +E KL A+ V+G+ I+ A +
Sbjct: 325 ATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGD 384

Query: 279 VRT-----------------SVQDIDMTTVQGSQNAISVLDSALKYVDSQRADLGAKQNR 321
T + + + + N ++ +DSAL VD+ R+ LGA QNR
Sbjct: 385 KVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNR 444

Query: 322 LSHSINNLANIQENVDASNSRIKDTDFAKETTQMTKAQILQQAGTSILAQAKQLPNSAMS 381
+I NL N N++++ SRI+D D+A E + M+KAQILQQAGTS+LAQA Q+P + +S
Sbjct: 445 FDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLS 504

Query: 382 LLQ 384
LL+
Sbjct: 505 LLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VP0789    PF03944  26     0.010    delta endotoxin
		>PF03944#delta endotoxin

Length = 633

Score = 25.8 bits (56), Expect = 0.010
Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 28 PFSFERTGARLVARAYVEIKRNEH 51
PFSF+ V + + E K+N H
Sbjct: 23 PFSFQHKSLDTVQKEWTEWKKNNH 46


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VP0790    FLAGELLIN  201    6e-62    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 201 bits (511), Expect = 6e-62
Identities = 90/297 (30%), Positives = 144/297 (48%), Gaps = 1/297 (0%)

Query: 2 AVNVNTNVSAMTAQRYLNNANSAQQTSMERLSSGFKINSAKDDAAGLQISNRLNVQSRGL 61
A +NTN ++ Q LN + S+ +++ERLSSG +INSAKDDAAG I+NR +GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVAVRNANDGISIAQTAEGAMNETTNILQRMRDLSLQSANGSNSKAERVAIQEEVTALND 121
A RNANDGISIAQT EGA+NE N LQR+R+LS+Q+ NG+NS ++ +IQ+E+ +
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELNRIAETTSFGGNKLLNGTHGAKSFQIGADNGEAVMLELKDMRSDNKMMGGVSYQAESG 181
E++R++ T F G K+L+ Q+GA++GE + ++L+ + + + G +
Sbjct: 121 EIDRVSNQTQFNGVKVLSQD-NQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 KGKDWNVAQGKNDLKISLTDSFGQEQEININAKAGDDIEELATYINGQTDLVKASVDQDG 241
+ KN + +++N+ A T +
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 242 KLQIFAGNNKVEGEVSFSGGLSGELGLGDDKKNVTVDTIDVTSVGGAQESVAIIDAA 298
+ + + S +G + G K DT D V ++ D
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGN 296



Score = 135 bits (342), Expect = 1e-37
Identities = 82/377 (21%), Positives = 144/377 (38%), Gaps = 20/377 (5%)

Query: 19 NNANSAQQTSMERLSSGFKINSAKDDAAGLQISNRLNVQSRGLDVAVRNANDGISIAQTA 78
N Q + ++ G L + ++ + +
Sbjct: 132 NGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKN 191

Query: 79 EGAMNETTNILQRMRDLSLQSANGSNSKAERVAIQEEVTALNDELNRIAETTSFGGNKLL 138
+ + R A +++ A V + V A N +L + +
Sbjct: 192 VTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFK 251

Query: 139 NGTHGAKSFQIGADNGEAVMLELKDMRSDNKMMGGVSYQAESGKGKDWNVAQGKNDLKIS 198
A + + A G + D + ++G + V+ N K++
Sbjct: 252 TTKSTAGTAEAKAIAGAIKGGKEGDTFDYKG--VTFTIDTKTGNDGNGKVSTTINGEKVT 309

Query: 199 LTDSFGQEQEININAKAGDDIEEL-ATYINGQTDLVKASVDQDGKLQIFAGNNKVEGEVS 257
LT + N++A + + + +NGQ + ++ KL NN V+GE
Sbjct: 310 LTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESK 369

Query: 258 FSGGLSGELGLGDDKK-----------------NVTVDTIDVTSVGGAQESVAIIDAALK 300
+ + K + ++ + +A ID+AL
Sbjct: 370 ITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALS 429

Query: 301 YVDSHRAELGAFQNRFNHAISNLDNINENVNASKSRIKDTDFAKETTAMTKSQILSQASS 360
VD+ R+ LGA QNRF+ AI+NL N N+N+++SRI+D D+A E + M+K+QIL QA +
Sbjct: 430 KVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGT 489

Query: 361 SILAQAKQAPNSALSLL 377
S+LAQA Q P + LSLL
Sbjct: 490 SVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VP0791    FLAGELLIN  149    2e-42    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 149 bits (376), Expect = 2e-42
Identities = 80/354 (22%), Positives = 132/354 (37%), Gaps = 17/354 (4%)

Query: 5 NTNVAAMMTQRHLSQAADQNVESQRNLSSGYRINSASDDAAGLQISNTLHVQTRGIDVAL 64
NTN +++TQ +L+++ + LSSG RINSA DDAAG I+N +G+ A
Sbjct: 5 NTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQAS 64

Query: 65 RNAHDAYSVAQTAEGALHESSDILQRLRSLGLQAANGSHEQDDRKSLQQEVIALQDELDR 124
RNA+D S+AQT EGAL+E ++ LQR+R L +QA NG++ D KS+Q E+ +E+DR
Sbjct: 65 RNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDR 124

Query: 125 VAITTTFADKNLFNGSYGSQSFHIGANANS-ISLALRNMRTHIPEMGGQHYLGDSL-DKD 182
V+ T F + + +GAN I++ L+ + + G + G
Sbjct: 125 VSNQTQFNGVKVLSQD-NQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVG 183

Query: 183 WRVTRDNQQFAFEYQDNEGQAQSKVLTLKVGDNLEEVATYINAQQSVVDASVTQDHQLQF 242
+ ++ + T + +
Sbjct: 184 DLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAEN 243

Query: 243 FTSTLNAPEGITWKGNFADEMDIGSGELVTVDDLDMSTVGGAQLAIGVVDAAIKYVDSHR 302
T+ + G + G+ + D G I
Sbjct: 244 NTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEG--DTFDYKGVTFTIDTKTGNDG------ 295

Query: 303 SEIGGFQNRVSGTIDNLNTINRSVSESKGRIRDTDFARESTVMVRSQVLQDATT 356
+VS TI+ + G +S+ V + V+ T
Sbjct: 296 ------NGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFT 343



Score = 100 bits (250), Expect = 2e-25
Identities = 46/216 (21%), Positives = 86/216 (39%), Gaps = 19/216 (8%)

Query: 177 DSLDKDWRVTRDNQQFAFEYQDNEGQAQSKVLTLKVGDNLEE-VATYINAQQSVVDASVT 235
D + +V+ + A + + + + + +N Q + D +
Sbjct: 291 TGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKN 350

Query: 236 QDHQLQFFTSTLNAPEGITWKGNFADEMDIGSGELVTVD------------------DLD 277
+ +L + N A+ +G+ VT+ +
Sbjct: 351 ESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDA 410

Query: 278 MSTVGGAQLAIGVVDAAIKYVDSHRSEIGGFQNRVSGTIDNLNTINRSVSESKGRIRDTD 337
+ + +D+A+ VD+ RS +G QNR I NL +++ ++ RI D D
Sbjct: 411 AAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDAD 470

Query: 338 FARESTVMVRSQVLQDATTALLAQAKQRPSSALGLL 373
+A E + M ++Q+LQ A T++LAQA Q P + L LL
Sbjct: 471 YATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP0794    PHPHTRNFRASE  754    0.0      Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 754 bits (1949), Expect = 0.0
Identities = 284/571 (49%), Positives = 409/571 (71%), Gaps = 2/571 (0%)

Query: 1 MISGILASPGIAIGKALLLQEDEIVLNTNTISDDQVEAEVARFFDARNKSAAQLETIKQK 60
I+GI AS G+AI KA + E + + +I+D V E+ + A KS +L IK +
Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSITD--VSTEIEKLTAALEKSKEELRAIKDQ 61

Query: 61 ALETFGEEKEAIFEGHIMLLEDEELEEEILALIKNDKMTADHAIHSVIEEQACALESLDD 120
+ G +K IF H+++L+D EL + I I+N++M A++A+ V + ES+D+
Sbjct: 62 TEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDN 121

Query: 121 EYLKERATDIRDIGSRFVKNALGINIVSLSDINEEVILVAYDLTPSETAQINLDYVLGFA 180
EY+KERA DIRD+ R + + +G+ SL+ I EE +++A DLTPS+TAQ+N +V GFA
Sbjct: 122 EYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFA 181

Query: 181 CDIGGRTSHTSIMARSLELPAIVGTNDITKKVKNGDMLILDAMNNKIVVNPSEAEVEEAK 240
DIGGRTSH++IM+RSLE+PA+VGT ++T+K+++GDM+I+D + ++VNP+E EV+ +
Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241

Query: 241 AVKAAFLAEKEELAKLKDLHAETTDGHRVEVCGNIGTVKDCDGIIRNGGEGVGLYRTEFL 300
+AAF +K+E AKL + T DG VE+ NIGT KD DG++ NGGEG+GLYRTEFL
Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301

Query: 301 FMDRTALPTEEEQYQAYKEVAEAMNGQAVIIRTMDIGGDKDLPYMDLPQEMNPFLGWRAV 360
+MDR LPTEEEQ++AYKEV + M+G+ V+IRT+DIGGDK+L Y+ LP+E+NPFLG+RA+
Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361

Query: 361 RISLDRREILRDQLRGILRASAHGKLRIMFPMIISVEEIRELKNAIEEYKAELRAEGLAF 420
R+ L++++I R QLR +LRAS +G L++MFPMI ++EE+R+ K ++E K +L +EG+
Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDV 421

Query: 421 DENIEIGVMVETPAAAAIAHHLAKEVSFFSIGTNDLTQYTLAVDRGNEMISHLYNPLSPA 480
++IE+G+MVE P+ A A+ AKEV FFSIGTNDL QYT+A DR NE +S+LY P PA
Sbjct: 422 SDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPA 481

Query: 481 VLTVIKQVIDASHAEGKWTGMCGELAGDERATLLLLGMGLDEFSMSGISIPKVKKVIRNS 540
+L ++ VI A+H+EGKW GMCGE+AGDE A LLLG+GLDEFSMS SI + +
Sbjct: 482 ILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKL 541

Query: 541 NFAEVKAMAEEALSLPTAAEIEAVVEKFIAE 571
+ E+K A++AL L TA E+E +V+K +
Sbjct: 542 SKEELKPFAQKALMLDTAEEVEQLVKKTYLK 572


38  VP0914  VP0925  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP0914    -1      13       -0.784958  C4-dicarboxylate transport transcriptional
VP0915    0       17       -1.194555  C4-dicarboxylate transport sensor protein
VP0916    3       25       -1.072847  trigger factor
VP0917    2       21       -1.258013  ATP-dependent Clp protease proteolytic subunit
VP0918    2       17       -1.343365  ATP-dependent protease ATP-binding subunit ClpX
VP0919    1       16       -1.462737  ATP-dependent protease LA
VP0920    1       11       -1.091797  DNA-binding protein HU-beta
VP0921    1       10       -1.204771  peptidyl-prolyl cis-trans isomerase D
VP0922    -3      13       -0.621846  hypothetical protein
VP0924    -3      14       0.176756   hypothetical protein
VP0923    -2      17       0.742708   hypothetical protein
VP0925    -1      16       0.763932   deoxyguanosinetriphosphate
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VP0914    HTHFIS  457    e-161    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 457 bits (1177), Expect = e-161
Identities = 166/479 (34%), Positives = 245/479 (51%), Gaps = 50/479 (10%)

Query: 4 VFFIDDEADLRFAIEQTFELADIEATFFSDAESALLAMKQSDEAGVIVTDICLPGISGMD 63
+ DD+A +R + Q A + S+A + + + + ++VTD+ +P + D
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWI-AAGDGDLVVTDVVMPDENAFD 64

Query: 64 LLTTLVQRDPNLPVIMITGHGDISMAVNALHQGAYDFIEKPFAPEHLVETVKRAIERRQL 123
LL + + P+LPV++++ A+ A +GAYD++ KPF L+ + RA+ +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK- 123

Query: 124 TNENEKLRQSLKASQTLGPRIIGETTSIQTLRETITHIADTDADILLFGETGTGKELIAR 183
+ L+ G ++G + ++Q + + + TD +++ GE+GTGKEL+AR
Sbjct: 124 -----RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVAR 178

Query: 184 SLHEQSSRRNNNFVAVNCGAVPENLIESELYGHEKGAFTGADSRRIGKFEFAQGGTLFLD 243
+LH+ RRN FVA+N A+P +LIESEL+GHEKGAFTGA +R G+FE A+GGTLFLD
Sbjct: 179 ALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLD 238

Query: 244 EIESMPIQAQIRLLRVLQERVIERVGSNELLPLDIRVVAATKVDLKRAAEQGTFRQDLYY 303
EI MP+ AQ RLLRVLQ+ VG + D+R+VAAT DLK++ QG FR+DLYY
Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYY 298

Query: 304 RLNVVTLDLPPLRERKEDIPALFHHFLLVAASRYGKSSPSLSAQDLAQLMAHDWPGNVRE 363
RLNVV L LPPLR+R EDIP L HF+ A + G + L + AH WPGNVRE
Sbjct: 299 RLNVVPLRLPPLRDRAEDIPDLVRHFVQ-QAEKEGLDVKRFDQEALELMKAHPWPGNVRE 357

Query: 364 LRNAAERFVLL----------------------------GKLIQLGEPSSIPASTSS--- 392
L N R L + L ++ +
Sbjct: 358 LENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFA 417

Query: 393 -----------LAEQVAEFEKAAIENALLENNGSIKQTMEQLNVPRKTLYDKMQRYQID 440
+AE E I AL G+ + + L + R TL K++ +
Sbjct: 418 SFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VP0915    PF06580  37     2e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.2 bits (86), Expect = 2e-04
Identities = 36/192 (18%), Positives = 76/192 (39%), Gaps = 24/192 (12%)

Query: 349 QRTETEQTLRLTQDELIQAAKLAVLGQMSASIS-HELNNPLAAIRSFAENGKLFLQKEKY 407
+ + + Q ++ A+ A L + A I+ H + N L IR+
Sbjct: 139 HFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDP-------- 190

Query: 408 DRVEDNLTRISALTDRMANISQQLRSFAKKASGNELTQTRLLPVIASSKELMKPAFKSAR 467
+ + LT +S L +R + ++ +++ L V+ S +L F+ R
Sbjct: 191 TKAREMLTSLSEL----------MRYSLRYSNARQVSLADELTVVDSYLQLASIQFED-R 239

Query: 468 VLLATELPTDDIEVQINTIQLEQVLVNLLTNAIEAVKEQQDKQVWLLVETDTDENKVMIH 527
+ ++ ++VQ+ + + Q LV N I+ Q + +L++ D V +
Sbjct: 240 LQFENQINPAIMDVQVPPMLV-QTLVE---NGIKHGIAQLPQGGKILLKGTKDNGTVTLE 295

Query: 528 VDDNGPGLGSHT 539
V++ G +T
Sbjct: 296 VENTGSLALKNT 307


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VP0919    HTHFIS  32     0.009    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.1 bits (73), Expect = 0.009
Identities = 40/215 (18%), Positives = 76/215 (35%), Gaps = 45/215 (20%)

Query: 261 KMPQEAREKTEQELQKLKMMSP---MSAEATV----------VRSYIDWMVSVPWTKRSK 307
MP E ++K + P MSA+ T Y+ P+ ++
Sbjct: 56 VMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYL----PKPF-DLTE 110

Query: 308 VKKNLAKAEEILNEDHYGLERVKERILEYL-------AVQNRINKL---KGPILCLVGPP 357
+ + +A LE + + + + + +L ++ + G
Sbjct: 111 LIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLM-ITGES 169

Query: 358 GVGKTSLGRSIASATGRK---YVRMALGGVRD---EAEIRGHRRTYIGSLPGKLIQKMSK 411
G GK + R++ R+ +V + + + E+E+ GH + G+ G + +
Sbjct: 170 GTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEK---GAFTGAQTRSTGR 226

Query: 412 VGVKN--PLFLLDEIDKMSSDMRGDPASALLEVLD 444
LF LDEI M D + + LL VL
Sbjct: 227 FEQAEGGTLF-LDEIGDMPMDAQ----TRLLRVLQ 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP0920    DNABINDINGHU  122    2e-40    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 122 bits (308), Expect = 2e-40
Identities = 50/87 (57%), Positives = 64/87 (73%)

Query: 2 NKTQLVEQIAENADISKASAGRALDAFIEAVSGTLQSGDQVALVGFGTFSVRTRAARTGR 61
NK L+ ++AE +++K + A+DA AVS L G++V L+GFG F VR RAAR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPKTGEEIQIAEAKVPAFKAGKALKDA 88
NP+TGEEI+I +KVPAFKAGKALKDA
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDA 89


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP0922    OUTRMMBRANEA  26     0.018    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 26.4 bits (58), Expect = 0.018
Identities = 14/35 (40%), Positives = 22/35 (62%), Gaps = 3/35 (8%)

Query: 54 GIGEKKAQSIVDYRTEHGPFKTAADLTNVKGIGEA 88
G+ E++AQS+VDY G AD + +G+GE+
Sbjct: 272 GLSERRAQSVVDYLISKG---IPADKISARGMGES 303


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VP0925    PF08280  30     0.018    M protein trans-acting positive regulator
		>PF08280#M protein trans-acting positive regulator

Length = 530

Score = 30.2 bits (68), Expect = 0.018
Identities = 12/46 (26%), Positives = 20/46 (43%), Gaps = 2/46 (4%)

Query: 115 GEVALNYMMRDHGGFEGNAQTFRIVTKLEPYTEHFGMNLSRRTLLG 160
L R H F N+ +R+ L P +F + LS+ ++G
Sbjct: 134 HSRPLTDFARSH--FLSNSSAYRMREALIPLLRNFELKLSKNKIVG 177


39  VP0938  VP0944  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP0938    -2      12       -0.649454  hypothetical protein
VP0939    -2      12       -0.484058  hypothetical protein
VP0940    -1      13       -0.546901  ******hypothetical protein
VP0941    -1      12       -0.270810  multidrug resistance protein
VP0942    -3      16       -2.605957  fimbrial protein
VP0943    -3      15       -1.585202  hypothetical protein
VP0944    -3      15       -1.150839  outer membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VP0938    HTHTETR  35     1e-04    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 34.6 bits (79), Expect = 1e-04
Identities = 16/106 (15%), Positives = 32/106 (30%), Gaps = 5/106 (4%)

Query: 1 MARRNDHTREELINLTLTTVKNFLDENSYHELSLRKIANMIGYVPSTLVNIFGNYNLLLL 60
+ TR+ ++++ L + SL +IA G + F + + L
Sbjct: 5 TKQEAQETRQHILDVALR----LFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 HAVAQTLDELSQ-EALNATKSSKDAHQALFELAYCYHDFAQKHPYR 105
+ + + E K D L E+ + R
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERR 106


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VP0940    RTXTOXIND  51     5e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 50.6 bits (121), Expect = 5e-09
Identities = 31/206 (15%), Positives = 74/206 (35%), Gaps = 21/206 (10%)

Query: 71 VKQIAVKANQNVQQNQLLIQLDDDKAQ--AALVEAKAYLKDEERKLKEFQRLVKRNAITQ 128
+ + A+ + ++Q ++ ++ + L + ++ + + + + +L K + +
Sbjct: 244 LHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK 303

Query: 129 TEIDAQKASVEIAQARLDAAKANLADLHITAPFSGTVGFID-FSRGKMVSAGTELLTL-D 186
+ L + I AP S V + + G +V+ L+ +
Sbjct: 304 LR-QTTDNIGLL-TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVP 361

Query: 187 DLSVMELDLQIPERYLSMLSVGMEVAAKTSAWGEQRFSGKVTGIDTRISA-----ETLNL 241
+ +E+ + + + ++VG K A+ R+ G + G I+ + L L
Sbjct: 362 EDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRY-GYLVGKVKNINLDAIEDQRLGL 420

Query: 242 --RVRIEFD-------NPENQLKPGM 258
V I + N L GM
Sbjct: 421 VFNVIISIEENCLSTGNKNIPLSSGM 446



Score = 39.4 bits (92), Expect = 2e-05
Identities = 15/65 (23%), Positives = 33/65 (50%), Gaps = 1/65 (1%)

Query: 44 EINQSLSLIGKLKAA-ESVVVASEVAGKVKQIAVKANQNVQQNQLLIQLDDDKAQAALVE 102
++ + GKL + S + VK+I VK ++V++ +L++L A+A ++
Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138

Query: 103 AKAYL 107
++ L
Sbjct: 139 TQSSL 143


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP0941    ACRIFLAVINRP  780    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 780 bits (2016), Expect = 0.0
Identities = 308/1032 (29%), Positives = 517/1032 (50%), Gaps = 28/1032 (2%)

Query: 3 LSDVSVKRPVAALVLSMLLCVFGFVSFTKLAVREMPDIESPVVSISTRYEGASATIIESQ 62
+++ ++RP+ A VL+++L + G ++ +L V + P I P VS+S Y GA A ++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 ITSVLEDQLSGISGIDEISSTTRNSMS-RITITFELGYDLNTGVSDVRDAVARAQRSLPD 121
+T V+E ++GI + +SST+ ++ S IT+TF+ G D + V++ + A LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 122 EADDPIVYKNNGSGEASLYINLSSSEMDRTQ--LTDYAERVLMDRFSLITGVSSIDLSGG 179
E + S + S TQ ++DY + D S + GV + L G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 180 LYKVMYVKLKPELMAGRAVTASDITSALRSENLESPGGEVRNDSTV------MSVRTART 233
Y + + L +L+ +T D+ + L+ +N + G++ + S+
Sbjct: 181 QYAMR-IWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 234 YNTPEDFQYLVIKRASDNTPIYLKDVADVFIGAENENSTFKSDGVVNISLGVVPQSDANP 293
+ PE+F + ++ SD + + LKDVA V +G EN N + +G LG+ + AN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 294 LEVAKLVRSEVDNIQKFLPEGTRLAIDYDATVFIERSIEEVYSTLFITGGLVILVLYIFI 353
L+ AK +++++ +Q F P+G ++ YD T F++ SI EV TLF LV LV+Y+F+
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 354 GQARATLIPAVTVPVSLISSFIAAYYFGFSINLITLMALILSIGLVVDDAIVVVENIFHH 413
RATLIP + VPV L+ +F FG+SIN +T+ ++L+IGL+VDDAIVVVEN+
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 414 I-ERGESPLLAAYKGTREVGFAVIATTLVLVMVFLPISFMDGMVGLLFTEFSVLLAMSVI 472
+ E P A K ++ A++ +VL VF+P++F G G ++ +FS+ + ++
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 473 FSSLIALTLTPVLGSQILKANVK-----PNRFNEVVDRLFSKLENGYRSLLKGALKARLA 527
S L+AL LTP L + +LK F + F N Y + + L +
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 528 APLVILACMGGSYFLMNQVPAQLTPQEDRGVIFAFVRGADATSYNRMSANMDIVEDRLMP 587
L+ + G L ++P+ P+ED+GV ++ + R +D V D +
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 588 LLGQGYLKSFSIQTPAFGGQAGDQTGFVIMILEDWNERDV---TAQEALNKVRGALAGIP 644
F++ +F GQ G + L+ W ER+ +A+ +++ + L I
Sbjct: 600 NEKANVESVFTVNGFSFSGQ-AQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658

Query: 645 DVRVFPF-MPGFQG-GSSEPVQFVL---GGSDYDELLVWAELLKNKAEESP-MMEGAEID 698
D V PF MP G++ F L G +D L L A + P + +
Sbjct: 659 DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 699 YSEKTPELLVTVDKQRAAELGVSVKDISDTLEIMLGGKSETTYVERGEEYDVYLRGDENS 758
E T + + VD+++A LGVS+ DI+ T+ LGG +++RG +Y++ D
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 759 FNNAADLSQIYLRTNSGELVTLDTVTKIEEVAASIRLSHYNKQKSITITANLSEGYTLGE 818
D+ ++Y+R+ +GE+V T V S RL YN S+ I + G + G+
Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838

Query: 819 ALNYLDQQAIDNLPGDISVSYSGESKDFKENQASVAVVFALALLVAYLVLAAQFESFVNP 878
A+ ++ A LP I ++G S + + + A++ +V +L LAA +ES+ P
Sbjct: 839 AMALMENLA-SKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIP 897

Query: 879 LVVMFTVPMGVFGGFLGLVVMGQGMNIYSQIGMIMLIGMVTKNGILIVEFANQLRDR-GV 937
+ VM VP+G+ G L + Q ++Y +G++ IG+ KN ILIVEFA L ++ G
Sbjct: 898 VSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGK 957

Query: 938 EFEKAIIDAAARRLRPILMTAFTTLAGAIPLIVSTGAGYESRIAVGTVIFFGMGFATLVT 997
+A + A RLRPILMT+ + G +PL +S GAG ++ AVG + GM ATL+
Sbjct: 958 GVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLA 1017

Query: 998 LFVIPAMYRLIS 1009
+F +P + +I
Sbjct: 1018 IFFVPVFFVVIR 1029


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VP0944    OMPADOMAIN  39     6e-06    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 38.8 bits (90), Expect = 6e-06
Identities = 38/194 (19%), Positives = 66/194 (34%), Gaps = 9/194 (4%)

Query: 2 KNAILLLSALTISVPTFAAQEQTHQSYFYLGTGNVGF-DTPYFGSDRDNEWSTPHITVGW 60
K AI + AL A + + Y G + DT + ++ +
Sbjct: 3 KTAIAIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGFINNNGPTHENQLGAGAFG 62

Query: 61 GYYVNQYLAFE-GVLRYSQNELKNDTLKTDLDLHYYQAGMSAVLTSDNLGDTPLSLFGRV 119
GY VN Y+ FE G + K + + Q + D L ++ R+
Sbjct: 63 GYQVNPYVGFEMGYDWLGRMPYKG---SVENGAYKAQGVQLTAKLGYPITDD-LDIYTRL 118

Query: 120 TALGTQAEMYVPNAQKVTDDSGALFNVGAGVHWDMSKDIWLRAEYIY--NVADMGFEDFY 177
+ +A+ N D+G GV + ++ +I R EY + N+ D
Sbjct: 119 GGMVWRAD-TKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNIGDAHTIGTR 177

Query: 178 DSYEGLQISLGKRF 191
L + + RF
Sbjct: 178 PDNGMLSLGVSYRF 191


40  VP1172  VP1178  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP1172    1       19       -0.196566  psp operon transcriptional activator
VP1173    0       18       0.136636   phage shock protein A
VP1174    0       19       0.364340   phage shock protein B
VP1175    0       18       -0.001768  phage shock protein C
VP1176    0       17       0.623499   multidrug resistance protein
VP1177    -1      15       0.895906   periplasmic linker protein
VP1178    -2      14       0.824073   AcrB/AcrD/AcrF family transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VP1172    HTHFIS  349    e-120    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 349 bits (897), Expect = e-120
Identities = 122/341 (35%), Positives = 187/341 (54%), Gaps = 13/341 (3%)

Query: 3 QNLIGESPAFLAVLDKVSQLAPIERPVLIIGERGTGKELIAQRLHYLSKRWDKPLLSLNC 62
L+G S A + +++L + ++I GE GTGKEL+A+ LH KR + P +++N
Sbjct: 137 MPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINM 196

Query: 63 ATLSEGLIDSELFGHESGSFTGSKGKHKGRFERAEGGTLFLDELATAPLLVQEKLLRVIE 122
A + LI+SELFGHE G+FTG++ + GRFE+AEGGTLFLDE+ P+ Q +LLRV++
Sbjct: 197 AAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQ 256

Query: 123 YGEYERVGGHTALNADVRLVCATNADLPRLAEQGDFRADLLDRLAFDVIMLPPLRERKED 182
GEY VGG T + +DVR+V ATN DL + QG FR DL RL + LPPLR+R ED
Sbjct: 257 QGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAED 316

Query: 183 ILSLAEHYAMKMCRELQLEYFVGFTHQAQQALLDYSWPGNVRELKNVIERAIYQHGLNAE 242
I L H+ + +E F +A + + + WPGNVREL+N++ R + +
Sbjct: 317 IPDLVRHFVQQAEKEGLDV--KRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVI 374

Query: 243 PIDELIFNPFATGWNNALGHTAANEEPQEEASSQTTSIHFPL-----------DYKQWQE 291
+ + + ++ + AA + + ++ Y +
Sbjct: 375 TREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLA 434

Query: 292 EQDINLLNRALEEAKFNQRQAAELLGLSYHQLRGMVRKYGL 332
E + L+ AL + NQ +AA+LLGL+ + LR +R+ G+
Sbjct: 435 EMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475
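
The "Score" and "Identities" lines that precede each alignment follow the usual BLAST text layout, so they can be read programmatically. The following is a minimal Python sketch, not part of PredictBias; the function name `parse_hit_summary` and the returned field names are hypothetical. It assumes the two summary lines appear exactly as printed in this report, including the abbreviated form `Expect = e-120`, which has no leading digit and is read here as 1e-120.

```python
import re

# Hypothetical helper (an assumption, not a PredictBias function): parse the
# "Score = ..." and "Identities = ..." lines printed above each alignment.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def parse_hit_summary(text):
    """Return bit score, raw score, E-value and identity counts for one hit."""
    score = SCORE_RE.search(text)
    ident = IDENT_RE.search(text)
    if score is None or ident is None:
        raise ValueError("no BLAST summary lines found")

    evalue = score.group(3)
    # Very small E-values are printed as "e-120"; float() needs "1e-120".
    if evalue.startswith("e"):
        evalue = "1" + evalue

    identical, aligned = int(ident.group(1)), int(ident.group(2))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(evalue),
        "identical": identical,
        "aligned": aligned,
        "identity_fraction": identical / aligned,
    }


if __name__ == "__main__":
    summary = (
        "Score = 349 bits (897), Expect = e-120\n"
        "Identities = 122/341 (35%), Positives = 187/341 (54%), "
        "Gaps = 13/341 (3%)"
    )
    print(parse_hit_summary(summary))
```

Comparing the parsed E-value against the RPSBlast threshold set in the input parameters (0.05 in this run) is one way to filter hits from a saved report.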


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VP1174    FLGHOOKAP1  26     0.015    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 26.1 bits (57), Expect = 0.015
Identities = 7/29 (24%), Positives = 16/29 (55%)

Query: 37 SSQDLERLQVLSEKAEAMQSRVDTLERIL 65
+++D Q L K+E + ++ T ++ L
Sbjct: 122 NAEDPAARQALIGKSEGLVNQFKTTDQYL 150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VP1176    RTXTOXIND  45     2e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 45.2 bits (107), Expect = 2e-07
Identities = 27/182 (14%), Positives = 57/182 (31%), Gaps = 27/182 (14%)

Query: 92 EYSLLEKQARANFALADVQYKRYKKLRKDQVVSEQDFDEAKANHNSAKAQWDQAKANLRY 151
E +L + + + KLR + N + + + +
Sbjct: 279 ESEILSAKEEYQLVTQLFKNEILDKLR-----------QTTDNIGLLTLELAKNEERQQA 327

Query: 152 TQLVAPYDGTISYLPAENH-EYVAAKEGVMNI-QTNQVMKVIFQLPDYLLNRYTQGVNVS 209
+ + AP + L V E +M I + ++V + + + G N
Sbjct: 328 SVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNA- 386

Query: 210 AKMIFDAFPERSF---DLTFQEI--DTEADPKTG-SYKVTMVMERP------PELGILPG 257
+ +AFP + + I D D + G + V + +E + + G
Sbjct: 387 -IIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSG 445

Query: 258 MS 259
M+
Sbjct: 446 MA 447



Score = 38.7 bits (90), Expect = 3e-05
Identities = 22/114 (19%), Positives = 41/114 (35%), Gaps = 17/114 (14%)

Query: 54 AGDRAVLAFRVPGLLQSIEVNEGQVVKKGDVLAVLNPDEYSLLEKQARANFALADVQYKR 113
+G + +++ I V EG+ V+KGDVL L + +++ A ++ R
Sbjct: 93 SGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTR 152

Query: 114 YKKLRKD-----------------QVVSEQDFDEAKANHNSAKAQWDQAKANLR 150
Y+ L + Q VSE++ + + W K
Sbjct: 153 YQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKE 206


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VP1177    RTXTOXIND  44     7e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 43.7 bits (103), Expect = 7e-07
Identities = 25/132 (18%), Positives = 42/132 (31%), Gaps = 30/132 (22%)

Query: 67 GEVSRVMVREGDKVQKGAVIATLDPTDYQLDVDNATARF-----------SVVDSQYRRS 115
V ++V+EG+ V+KG V+ L + D + + S
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNK 164

Query: 116 RPLVD-----------------KGLLAKSQFDEIAAQRQIALAELELAKLRLSFTTLKAP 158
P + L K QF Q Q EL L K R T+ A
Sbjct: 165 LPELKLPDEPYFQNVSEEEVLRLTSLIKEQFS--TWQNQKYQKELNLDKKRAERLTVLAR 222

Query: 159 VDGIISRVNIDQ 170
++ + +++
Sbjct: 223 INRYENLSRVEK 234



Score = 37.5 bits (87), Expect = 6e-05
Identities = 35/194 (18%), Positives = 65/194 (33%), Gaps = 25/194 (12%)

Query: 92 TDYQLDVDNATARFSVVDSQYRRSRPLVD--KGLLAKSQFDEIA-AQRQIALAELELAKL 148
+ ++ ++ ++S+ ++ L D++ I L LELAK
Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKN 321

Query: 149 --RLSFTTLKAPVDGIISRVNI-DQFENVQVGQHIVNIHSLDR---VEVLIQ--LPDRLY 200
R + ++APV + ++ + + V + ++ I D V L+Q +
Sbjct: 322 EERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFIN 381

Query: 201 VKQPPTEERLTAINAVVRVPSGNEYKATVKEFT--TEPDPTTGT-FTVTLSLPMP----- 252
V Q ++ A VK D G F V +S+
Sbjct: 382 VGQ-NAIIKVEAFPY----TRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTG 436

Query: 253 -KEEYILDGMAVEV 265
K + GMAV
Sbjct: 437 NKNIPLSSGMAVTA 450


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1178    ACRIFLAVINRP  513    e-167    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 513 bits (1323), Expect = e-167
Identities = 223/1053 (21%), Positives = 442/1053 (41%), Gaps = 64/1053 (6%)

Query: 17 IAAYFIRNRVISWMVSLIFLIGGIAAFFGLGRLEDPAFTIKDAMVVTSYPGATPQQVEEE 76
+A +FIR + +W++++I ++ G A L + P V +YPGA Q V++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 77 VTYPLEKAIQQLTYVDEVNSISNR-GLSQITVTMKNNYGPDDLPQIWDELRRKVNDLKVT 135
VT +E+ + + + ++S S+ G IT+T ++ PD +++ K+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQV---QVQNKLQLATPL 117

Query: 136 LPPGVNEPQV-IDDFGDVYGILLAVTGDGYSY--KELLDYVD-YLRRELELVDGVSKVSV 191
LP V + + ++ Y ++ D ++ DYV ++ L ++GV V +
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 192 SGQQQEQVFIEVSMKKLSSIGLSPNTVFNLLSTQNIVSDAGAIRIGDEYI-------RIQ 244
G Q + I + L+ L+P V N L QN AG + G + I
Sbjct: 178 FGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQL-GGTPALPGQQLNASII 235

Query: 245 PTGEFQSVDELGDLLITESGAQGLIFLKDVAEIKRGYVEVPSNIINFNGSLALNVGVSFA 304
F++ +E G + + + ++ LKDVA ++ G E + I NG A +G+ A
Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELG-GENYNVIARINGKPAAGLGIKLA 294

Query: 305 QGVNVVEVGKAFDRRLAELKYQQPVGVEISEIYSQPKEVDKSVSGFVISLAQAVGIVIIV 364
G N ++ KA +LAEL+ P G+++ Y V S+ V +L +A+ +V +V
Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354

Query: 365 LLFFMG-LRSGLLIGLILLLTVLGTFIFMKYLAIDLQRISLGALVIALGMLVDNAIVVVE 423
+ F+ +R+ L+ + + + +LGTF + + +++ +V+A+G+LVD+AIVVVE
Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414

Query: 424 GILIGTQKGRTRLQAAT-DIVTQTKWPLLGATVIAVTAFAPIGLSEDSTGEYCGTLFTVL 482
+ + + + AT ++Q + L+G ++ F P+ STG +
Sbjct: 415 NVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITI 474

Query: 483 LISLMLSWFTAISLTPFFADIFFKGQKIKQGEGEENDPYNGIIFVAYKKFLEF------- 535
+ ++ LS A+ LTP K + E + G + +
Sbjct: 475 VSAMALSVLVALILTPALCATLLKP--VSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGK 532

Query: 536 CMRRAWLTVVVLIVGLGASVYGFTLVKQSFFPSSTTPIFQLDVWLPEGTDIRATNDKLKE 595
+ +++ + + V F + SF P +F + LP G T L +
Sbjct: 533 ILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQ 592

Query: 596 LESWL--AEQEHVDHITTTAGKGLQRFMLTYAPEKSYAAYGEIT-----TRVDNYEALAP 648
+ + E+ +V+ + T G +++ + A ++ R + +
Sbjct: 593 VTDYYLKNEKANVESVFTVNG-------FSFSGQAQNAGMAFVSLKPWEERNGDENSAEA 645

Query: 649 LMARFRDHLKANYPEINYKLK---QIELGPGGGAKIE-ARIIGSDPTVLRTIAAQVMDIM 704
++ R + L +ELG G E G L Q++ +
Sbjct: 646 VIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMA 705

Query: 705 YADPSA-TNIRHDWRERTQVLEPQFNESQARRYGITKSDVDDFLSMSFSGMTIGLYRDGT 763
P++ ++R + E T + + ++ +A+ G++ SD++ +S + G + + D
Sbjct: 706 AQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRG 765

Query: 764 TLMPIVARLPEDERIDIRNIEGMKIWSPAQSEFIPLQQVTMGYDMRWED--PIIVRKNRK 821
+ + + R+ +++ + + S A E +P T W P + R N
Sbjct: 766 RVKKLYVQADAKFRMLPEDVDKLYVRS-ANGEMVPFSAFT---TSHWVYGSPRLERYNGL 821

Query: 822 RMLTVMADPDILGEETASTLQKRLQPQIEAIQMPPGYSLEWGGEYESSGDAQESLFTTMP 881
+ + + S+ + A ++P G +W G + +
Sbjct: 822 PSMEIQGEA----APGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVA 877

Query: 882 MGYLFMFLITVFLFNSIKEPLIVWLTVPLALIGVTTGLLALNTPFGFMALLGFLSLSGMV 941
+ ++ +FL L+ S P+ V L VPL ++GV N ++G L+ G+
Sbjct: 878 ISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLS 937

Query: 942 LKNGIVLLDQI-EIEMKSGKEAYDAVVDAAVSRVRPVCMAAITTILGMIPLLPDI----- 995
KN I++++ ++ K GK +A + A R+RP+ M ++ ILG++PL
Sbjct: 938 AKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSG 997

Query: 996 FFKPMAVTIMFGLGFATILTLIVVPVLYRLFHK 1028
+ + +M G+ AT+L + VPV + + +
Sbjct: 998 AQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030


41  VP1386  VP1394  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP1386    0       17       -1.166715  hypothetical protein
VP1387    -1      14       -0.846667  hypothetical protein
VP1388    1       13       -1.885526  hypothetical protein
VP1389    -1      11       -1.653018  hypothetical protein
VP1390    -1      10       -1.265032  hypothetical protein
VP1391    -1      8        -0.150782  transcriptional regulator
VP1392    0       9        -0.468344  ClpA/B-type protease
VP1393    0       11       -1.082945  BfdA protein
VP1394    0       10       -1.087215  VgrG protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1386    FLGMOTORFLIM  31     0.007    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 31.4 bits (71), Expect = 0.007
Identities = 23/109 (21%), Positives = 42/109 (38%), Gaps = 13/109 (11%)

Query: 51 STSSNAFGDYKVEFESSSMPLYVVAQQGSYTDPLTKEIVSSSVGKTLRLEGVVNFSEGSN 110
S+++ G + + + M VVA+ GS + ++I+ VG +RL +
Sbjct: 241 SSTTQYMGVLRDKLSTVDMD--VVAEVGSLRLSV-RDILGLRVGDIIRL----------H 287

Query: 111 QKLMLTPLTNIVSGFAKYKIAGGVSGPDAVSQALDSINGMYGFDVNETT 159
+ P + K+ GV G +Q L+ I D E +
Sbjct: 288 DTHVGDPFVLSIGNRKKFLCQPGVVGKKIAAQILERIESTSQEDFEELS 336


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VP1390    OMPADOMAIN  55     8e-10    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 55.3 bits (133), Expect = 8e-10
Identities = 31/125 (24%), Positives = 54/125 (43%), Gaps = 16/125 (12%)

Query: 604 KIEFAFNSREKELKDHAS-IFKQLAASLANVLEDEPNLRVEIEGHACQVGTEKANMQVSA 662
+ F FN LK QL + L+N+ D + V + G+ ++G++ N +S
Sbjct: 220 DVLFNFNK--ATLKPEGQAALDQLYSQLSNL--DPKDGSVVVLGYTDRIGSDAYNQGLSE 275

Query: 663 ERAEYAKKLLMKEFPVFDGRVSIAAFGESKPV------YVPKQGEKVDRNNPNLRQNRRV 716
RA+ L+ + + ++S GES PV V ++ +D L +RRV
Sbjct: 276 RRAQSVVDYLISK-GIPADKISARGMGESNPVTGNTCDNVKQRAALID----CLAPDRRV 330

Query: 717 VIRIY 721
I +
Sbjct: 331 EIEVK 335


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VP1391    HTHFIS  396    e-135    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 396 bits (1019), Expect = e-135
Identities = 133/369 (36%), Positives = 195/369 (52%), Gaps = 14/369 (3%)

Query: 184 KSLSADNQALVRENTQLKKRTQKAYQGP--IAESEEMLNVLNRLDKVLSLPVDVLLRGET 241
+ + +AL + K + G + S M + L +++ + +++ GE+
Sbjct: 110 ELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGES 169

Query: 242 GAGKEVIAKYIHENSNRSEQPLIVQNCAAIPEQLLESELFGHKKGSFTGADKDKVGLFEA 301
G GKE++A+ +H+ R P + N AAIP L+ESELFGH+KG+FTGA G FE
Sbjct: 170 GTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQ 229

Query: 302 ANGGTLFLDEIGDMPMLLQAKLLRVLQERKVRPIGTSKEIEVDVRVIAATHCNLMQQIKD 361
A GGTLFLDEIGDMPM Q +LLRVLQ+ + +G I DVR++AAT+ +L Q I
Sbjct: 230 AEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQ 289

Query: 362 GGFRADLFYRLNVFPITLPPLRARKSDIIPLAEHFVQHTTNTLGLPQAPGLSANVRKQLL 421
G FR DL+YRLNV P+ LPPLR R DI L HFVQ GL + +
Sbjct: 290 GLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQ-AEKEGLD-VKRFDQEALELMK 347

Query: 422 AYQYPGNVRELKNIIERSVLLSDFETITHIEFGEQIPEDVPNIDMKAASPTPDQ------ 475
A+ +PGNVREL+N++ R L + IT ++ ++P+ ++ A+
Sbjct: 348 AHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQA 407

Query: 476 -PAYERQAQADLEDA---SKSLKDVVSQYERTVIIDCLNACNWHTKKAAEQLALPMSTLN 531
RQ A DA S V+++ E +I+ L A + KAA+ L L +TL
Sbjct: 408 VEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLR 467

Query: 532 HKMKKYDIS 540
K+++ +S
Sbjct: 468 KKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VP1392    HTHFIS  36     9e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 35.6 bits (82), Expect = 9e-04
Identities = 58/327 (17%), Positives = 104/327 (31%), Gaps = 64/327 (19%)

Query: 579 LQDEAETTLKLKESLTQSIKGQEYAIDALSEGIQTAK---AGLGNPDAPTGVFLLVGPSG 635
+ A K + S + + S +Q A L D ++ G SG
Sbjct: 114 IIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL---MITGESG 170

Query: 636 VGKTETARAIADQMFGGERFMTTINMSEFQEKHTVSRLIGSPPGYVGYGEGGMLTEAVRQ 695
GK ARA+ D INM+ S L G E G T A +
Sbjct: 171 TGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFTGAQTR 222

Query: 696 RPYSV-------VLLDEVEKADPEVLNLFYQVFDKGTLND-GEGRTIDFKNTLIIMTSNL 747
+ LDE+ + +V +G G I + I+
Sbjct: 223 STGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRS-DVRIV----A 277

Query: 748 ATHE-IESLVHQSKEIDANIIAEAIRPTLNQHFKPALLARMSVLPFV--PLSD--EAMTE 802
AT++ ++ ++Q F+ L R++V+P PL D E + +
Sbjct: 278 ATNKDLKQSINQ------------------GLFREDLYYRLNVVPLRLPPLRDRAEDIPD 319

Query: 803 IIHHKLNKVSQRLHSHHKLSLSYEESLVEFVL-----GNCR-----LAETGARNIDAVIN 852
++ H + Q+ +++ +E + GN R + A VI
Sbjct: 320 LVRHFV----QQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVIT 375

Query: 853 RQLLPQLSTQLLVHDKDDSHTQITVSV 879
R+++ + + + S+
Sbjct: 376 REIIENELRSEIPDSPIEKAAARSGSL 402


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1394    ICENUCLEATIN  40     4e-05    Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 40.1 bits (93), Expect = 4e-05
Identities = 35/141 (24%), Positives = 51/141 (36%), Gaps = 9/141 (6%)

Query: 555 TETVEQDVTVTYNANETQYVKNNSDLEIGDNRTTKIGKNDDLDVGENSNLTVGASKSSDI 614
T+T T+T TQ +N SDL G T+ G N L G S T +
Sbjct: 503 TQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTA 562

Query: 615 GADDNQT--VGGNLTVSVKGNTSYKAD-------GATQIISGDKIVLKTGGSSLVMNSDG 665
G QT G +LT + +D G+TQ S + GS+
Sbjct: 563 GYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQS 622

Query: 666 SIKLSGSSITIEGSDKVVVKG 686
+ S + G+D ++ G
Sbjct: 623 VLTTGYGSTSTAGADSSLIAG 643



Score = 39.0 bits (90), Expect = 7e-05
Identities = 28/125 (22%), Positives = 47/125 (37%)

Query: 555 TETVEQDVTVTYNANETQYVKNNSDLEIGDNRTTKIGKNDDLDVGENSNLTVGASKSSDI 614
T+T + +T TQ + NSDL G T+ G + L G S T G +
Sbjct: 839 TQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTA 898

Query: 615 GADDNQTVGGNLTVSVKGNTSYKADGATQIISGDKIVLKTGGSSLVMNSDGSIKLSGSSI 674
G QT N ++ ++ A + +I+G S +M GS + +
Sbjct: 899 GYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQS 958

Query: 675 TIEGS 679
++
Sbjct: 959 SLTAG 963



Score = 38.6 bits (89), Expect = 9e-05
Identities = 38/160 (23%), Positives = 58/160 (36%), Gaps = 16/160 (10%)

Query: 536 GDHKETIDGHKTTQVNSTFTETVEQDVTVTYNANETQYVKNNSDLEIGDNRTTKIGKNDD 595
GD I G+ +TQ T +D ++T TQ + SDL G T+ G
Sbjct: 443 GDDSSLIAGYGSTQ-------TAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESS 495

Query: 596 LDVGENSNLTVGASKSSDIGADDNQT--VGGNLTVSVKGNTSYKAD-------GATQIIS 646
L G S T G + G QT +L ++ A+ G+TQ S
Sbjct: 496 LIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTAS 555

Query: 647 GDKIVLKTGGSSLVMNSDGSIKLSGSSITIEGSDKVVVKG 686
+ ++ GS+ + S GSD ++ G
Sbjct: 556 YNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAG 595



Score = 38.2 bits (88), Expect = 1e-04
Identities = 38/160 (23%), Positives = 56/160 (35%), Gaps = 16/160 (10%)

Query: 536 GDHKETIDGHKTTQVNSTFTETVEQDVTVTYNANETQYVKNNSDLEIGDNRTTKIGKNDD 595
GD I G+ +TQ T +D ++T TQ + SDL G T G +
Sbjct: 347 GDDSSLIAGYGSTQ-------TAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSS 399

Query: 596 LDVGENSNLTVGASKSSDIGADDNQT--VGGNLTVSVKGNTSYKAD-------GATQIIS 646
L G S T G + G QT G +LT + D G+TQ
Sbjct: 400 LIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAG 459

Query: 647 GDKIVLKTGGSSLVMNSDGSIKLSGSSITIEGSDKVVVKG 686
D + GS+ + S + G + ++ G
Sbjct: 460 EDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAG 499



Score = 37.8 bits (87), Expect = 2e-04
Identities = 38/164 (23%), Positives = 59/164 (35%), Gaps = 14/164 (8%)

Query: 526 VARHQTNEVHGDHKETIDGHKTTQV---NSTF------TETVEQDVTVTYNANETQYVKN 576
+A + + E GD I G+ +T +ST T+T ++ + TQ
Sbjct: 177 IAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMK 236

Query: 577 NSDLEIGDNRTTKIGKNDDLDVGENSNLTVGASKSSDIGADDNQTVGGNLTVSVKGNTSY 636
SDL G T G + L G S T G S G QT ++ ++
Sbjct: 237 GSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTG 296

Query: 637 KADGATQIISGDKIVLKTGGSSLVMNSDGSIKLSGSSITIEGSD 680
A + +I+G G S GS + + +GSD
Sbjct: 297 TAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQ-----KGSD 335



Score = 37.4 bits (86), Expect = 2e-04
Identities = 38/142 (26%), Positives = 56/142 (39%), Gaps = 11/142 (7%)

Query: 555 TETVEQDVTVTYNANETQYVKNNSDLEIGDNRTTKIGKNDDLDVGENSNLTVGASKSSDI 614
T+T ++T TQ + S L G T+ G + L G S T G +
Sbjct: 599 TQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTA 658

Query: 615 GADDNQTV--GGNLTVSVKGNTSYKAD-------GATQIISGDKIVLKTGGSSLVMNSDG 665
G QT G +LT ++ AD G+TQ +G +L G S +G
Sbjct: 659 GYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQ-TAGYNSILTAGYGSTQTAQEG 717

Query: 666 SIKLSG-SSITIEGSDKVVVKG 686
S SG S + G+D ++ G
Sbjct: 718 SDLTSGYGSTSTAGADSSLIAG 739



Score = 37.0 bits (85), Expect = 3e-04
Identities = 28/122 (22%), Positives = 47/122 (38%)

Query: 555 TETVEQDVTVTYNANETQYVKNNSDLEIGDNRTTKIGKNDDLDVGENSNLTVGASKSSDI 614
T+T + +T TQ + NSDL G T+ G L G S T +
Sbjct: 887 TQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMA 946

Query: 615 GADDNQTVGGNLTVSVKGNTSYKADGATQIISGDKIVLKTGGSSLVMNSDGSIKLSGSSI 674
G +QT +++ ++ A + +I+G G S + GS + + S
Sbjct: 947 GYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHSS 1006

Query: 675 TI 676
T+
Sbjct: 1007 TL 1008



Score = 37.0 bits (85), Expect = 3e-04
Identities = 34/141 (24%), Positives = 50/141 (35%), Gaps = 9/141 (6%)

Query: 555 TETVEQDVTVTYNANETQYVKNNSDLEIGDNRTTKIGKNDDLDVGENSNLTVGASKSSDI 614
T+T +D ++T TQ + SDL G T G + L G S T G +
Sbjct: 263 TQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTA 322

Query: 615 GADDNQT--VGGNLTVSVKGNTSYKAD-------GATQIISGDKIVLKTGGSSLVMNSDG 665
G QT G +LT + D G+TQ D + GS+
Sbjct: 323 GYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGS 382

Query: 666 SIKLSGSSITIEGSDKVVVKG 686
+ S G+D ++ G
Sbjct: 383 DLTAGYGSTGTAGADSSLIAG 403



Score = 36.7 bits (84), Expect = 4e-04
Identities = 35/141 (24%), Positives = 48/141 (34%), Gaps = 9/141 (6%)

Query: 555 TETVEQDVTVTYNANETQYVKNNSDLEIGDNRTTKIGKNDDLDVGENSNLTVGASKSSDI 614
T+T ++ T T TQ + SDL G T G + L G S T G S
Sbjct: 311 TQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTA 370

Query: 615 GADDNQT--VGGNLTVSVKGNTSYKAD-------GATQIISGDKIVLKTGGSSLVMNSDG 665
G QT G +LT + AD G+TQ + GS+
Sbjct: 371 GYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGS 430

Query: 666 SIKLSGSSITIEGSDKVVVKG 686
+ S G D ++ G
Sbjct: 431 DLTAGYGSTGTAGDDSSLIAG 451



Score = 35.9 bits (82), Expect = 7e-04
Identities = 36/142 (25%), Positives = 52/142 (36%), Gaps = 11/142 (7%)

Query: 555 TETVEQDVTVTYNANETQYVKNNSDLEIGDNRTTKIGKNDDLDVGENSNLTVGASKSSDI 614
T+T + +T TQ + SDL G T G + + G S T S
Sbjct: 551 TQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTA 610

Query: 615 GADDNQT--VGGNLTVSVKGNTSYKAD-------GATQIISGDKIVLKTGGSSLVMNSDG 665
G QT LT ++ AD G+TQ + I L G S +G
Sbjct: 611 GYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSI-LTAGYGSTQTAQEG 669

Query: 666 SIKLSG-SSITIEGSDKVVVKG 686
S +G S + G+D ++ G
Sbjct: 670 SDLTAGYGSTSTAGADSSLIAG 691



Score = 35.9 bits (82), Expect = 7e-04
Identities = 32/141 (22%), Positives = 52/141 (36%), Gaps = 9/141 (6%)

Query: 555 TETVEQDVTVTYNANETQYVKNNSDLEIGDNRTTKIGKNDDLDVGENSNLTVGASKSSDI 614
T+T ++ T T TQ + SDL G T G + L G S T G S
Sbjct: 407 TQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTA 466

Query: 615 GADDNQT--VGGNLTVSVKGNTSYKAD-------GATQIISGDKIVLKTGGSSLVMNSDG 665
G QT G +LT ++ + G+TQ + GS+ ++
Sbjct: 467 GYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNES 526

Query: 666 SIKLSGSSITIEGSDKVVVKG 686
+ S + G++ ++ G
Sbjct: 527 DLITGYGSTSTAGANSSLIAG 547



Score = 34.3 bits (78), Expect = 0.002
Identities = 32/141 (22%), Positives = 50/141 (35%), Gaps = 9/141 (6%)

Query: 555 TETVEQDVTVTYNANETQYVKNNSDLEIGDNRTTKIGKNDDLDVGENSNLTVGASKSSDI 614
T+T ++T TQ + S L G T+ G + L G S T G
Sbjct: 743 TQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTA 802

Query: 615 GADDNQTV--GGNLTVSVKGNTSYKAD-------GATQIISGDKIVLKTGGSSLVMNSDG 665
G QT +LT ++ AD G+TQ + I+ GS+ +
Sbjct: 803 GYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENS 862

Query: 666 SIKLSGSSITIEGSDKVVVKG 686
+ S + G D ++ G
Sbjct: 863 DLTTGYGSTSTAGYDSSLIAG 883



Score = 34.3 bits (78), Expect = 0.002
Identities = 36/151 (23%), Positives = 56/151 (37%), Gaps = 16/151 (10%)

Query: 552 STFTETVEQDVTVTYNANE--------------TQYVKNNSDLEIGDNRTTKIGKNDDLD 597
S+ +T Y +N+ TQ N S L G + G L
Sbjct: 1062 SSLISGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLI 1121

Query: 598 VGENSNLTVGASKSSDIGADDNQTVGGNLTVSVKGNTSYKADG-ATQIISGDKIVLKTGG 656
G +S G GAD QT G+ + + GN SY G +++ +G+ +L G
Sbjct: 1122 SGADSVQMAGERGKLIAGADSTQT-AGDRSKLLAGNNSYLTAGDRSKLTAGNDCILMAGD 1180

Query: 657 SSLVMNSDGSIKLSGSSITIEGSDKVVVKGG 687
S + SI +G + GS+ + G
Sbjct: 1181 RSKLTAGINSILTAGCRSKLIGSNGSTLTAG 1211



Score = 33.2 bits (75), Expect = 0.005
Identities = 28/118 (23%), Positives = 43/118 (36%)

Query: 555 TETVEQDVTVTYNANETQYVKNNSDLEIGDNRTTKIGKNDDLDVGENSNLTVGASKSSDI 614
T T D ++ TQ +S L G T + L G S T GA S
Sbjct: 583 TGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIA 642

Query: 615 GADDNQTVGGNLTVSVKGNTSYKADGATQIISGDKIVLKTGGSSLVMNSDGSIKLSGS 672
G QT G N ++ ++ A + + +G G S ++ GS + +G
Sbjct: 643 GYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGY 700



Score = 32.8 bits (74), Expect = 0.005
Identities = 34/152 (22%), Positives = 59/152 (38%), Gaps = 5/152 (3%)

Query: 525 KVARHQTNEVHGDHKETIDGHKTTQV---NSTFTETVEQDVTVTYNANETQYVKNNSDLE 581
+ AR Q+ G + G ++ + ST T +T Y + TQ + SDL
Sbjct: 760 QTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGS--TQTAQERSDLT 817

Query: 582 IGDNRTTKIGKNDDLDVGENSNLTVGASKSSDIGADDNQTVGGNLTVSVKGNTSYKADGA 641
G T+ G + L G S T G + G QT N ++ ++ A
Sbjct: 818 TGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYD 877

Query: 642 TQIISGDKIVLKTGGSSLVMNSDGSIKLSGSS 673
+ +I+G G +S++ GS + + +
Sbjct: 878 SSLIAGYGSTQTAGYNSILTAGYGSTQTAQEN 909



Score = 32.0 bits (72), Expect = 0.009
Identities = 41/177 (23%), Positives = 63/177 (35%), Gaps = 17/177 (9%)

Query: 519 GQDEFLKVARHQTNEVHGDHKETIDGHKTTQVNSTFTETVEQDVTVTYNANETQYVKNNS 578
G D + +A + + + H G+ +TQ T + +T Y + T +S
Sbjct: 587 GSDSSI-IAGYGSTQTASYHSSLTAGYGSTQ-----TAREQSVLTTGYGSTST--AGADS 638

Query: 579 DLEIGDNRTTKIGKNDDLDVG--------ENSNLTVGASKSSDIGADDNQTVGGNLTVSV 630
L G T G N L G E S+LT G +S GAD + G T +
Sbjct: 639 SLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTA 698

Query: 631 KGNTSYKAD-GATQIISGDKIVLKTGGSSLVMNSDGSIKLSGSSITIEGSDKVVVKG 686
N+ A G+TQ + GS+ +D S+ S + G
Sbjct: 699 GYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAG 755



Score = 31.3 bits (70), Expect = 0.019
Identities = 33/141 (23%), Positives = 49/141 (34%), Gaps = 9/141 (6%)

Query: 555 TETVEQDVTVTYNANETQYVKNNSDLEIGDNRTTKIGKNDDLDVGENSNLTVGASKSSDI 614
T+T + +T TQ + SDL G T+ G + L G S T S
Sbjct: 695 TQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTA 754

Query: 615 GADDNQTV--GGNLTVSVKGNTSYKAD-------GATQIISGDKIVLKTGGSSLVMNSDG 665
G QT LT ++ AD G+TQ I+ GS+
Sbjct: 755 GYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERS 814

Query: 666 SIKLSGSSITIEGSDKVVVKG 686
+ S + G+D ++ G
Sbjct: 815 DLTTGYGSTSTAGADSSLIAG 835



Score = 30.5 bits (68), Expect = 0.032
Identities = 37/160 (23%), Positives = 59/160 (36%), Gaps = 18/160 (11%)

Query: 519 GQDEFLKVARHQTNEVHGDHKETIDGHKTTQV---NSTFT------ETVEQDVTVTYNAN 569
Q+ + + G I G+ +TQ ST +T + ++T
Sbjct: 906 AQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYG 965

Query: 570 ETQYVKNNSDLEIGDNRTTKIGKNDDLDVG--------ENSNLTVGASKSSDIGADDNQT 621
T +S L G T G L G +S LT G ++ GAD +
Sbjct: 966 STSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHSSTLTAGYGSTATAGADSSLI 1025

Query: 622 VGGNLTVSVKGNTSYKADGATQIISGDKIVLKTG-GSSLV 660
G +++ + A + +ISG + VL G GSSL+
Sbjct: 1026 AGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTAGYGSSLI 1065


42  VP1630  VP1635  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP1630    5       13       0.341227   calcium-binding outer membrane-like protein
VP1631    2       12       -1.189800  agglutination protein
VP1632    2       11       -1.252946  outer membrane protein
VP1633    2       11       -1.234410  RTX toxin
VP1634    -1      10       -1.495532  agglutination protein
VP1635    -1      12       -1.514423  outer membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VP1630    RTXTOXINA  58     4e-10    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 58.1 bits (140), Expect = 4e-10
Identities = 38/139 (27%), Positives = 59/139 (42%), Gaps = 14/139 (10%)

Query: 2082 NLIMGTDGDDILVGTDGNDMIIGGLGNDILTGGEGDDVFKWTEMQSATDTVTDFSDGDQL 2141
+ + G +GDD L G DGND +IG GN+ L GG+GDD F+ A + L
Sbjct: 765 DTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAK---------NVL 815

Query: 2142 DFTDVFDDMTGTDISALLDDLGSGDY--RGRVDDITVEVTESGGNSTLTINKDGQQL--- 2196
D + G++ + LLD D G +DI ++ G + +L
Sbjct: 816 FGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDGGKEDKLSLA 875

Query: 2197 EVNFDGASAADIANSLISN 2215
+++F + N LI
Sbjct: 876 DIDFRDVAFKREGNDLIMY 894



Score = 49.6 bits (118), Expect = 1e-07
Identities = 24/81 (29%), Positives = 43/81 (53%), Gaps = 1/81 (1%)

Query: 2072 YQTMALNSELNLIMGTDGDDILVGTDGNDMIIGGLGNDILTGGEGDDVFKWTEMQSATDT 2131
+Q + N++ G G+D L G++G D++ GG G+D+L GG G+D++++
Sbjct: 803 FQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHII 862

Query: 2132 VTDFSDGDQLDFTDV-FDDMT 2151
D D+L D+ F D+
Sbjct: 863 DDDGGKEDKLSLADIDFRDVA 883



Score = 38.4 bits (89), Expect = 4e-04
Identities = 16/34 (47%), Positives = 21/34 (61%)

Query: 2086 GTDGDDILVGTDGNDMIIGGLGNDILTGGEGDDV 2119
G+ DI G DG+D+I G GND L G +G+D
Sbjct: 733 GSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDT 766


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VP1632    OMPADOMAIN  82     8e-21    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 82.3 bits (203), Expect = 8e-21
Identities = 33/123 (26%), Positives = 53/123 (43%), Gaps = 11/123 (8%)

Query: 87 QMQIRVLFANDSDEINPVFAKQIRELSDFLKEY--PSTSIELQGYASRTGGSEHNLDLSK 144
++ VLF + + P + +L L S+ + GY R G +N LS+
Sbjct: 216 TLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSE 275

Query: 145 RRAENVRKALLQNGITPDRVTIVGYGDT-VLATTGTDEVSH--------ALNRRVTATVV 195
RRA++V L+ GI D+++ G G++ + D V A +RRV V
Sbjct: 276 RRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVK 335

Query: 196 GHK 198
G K
Sbjct: 336 GIK 338


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VP1633    CABNDNGRPT  73     7e-15    NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 73.5 bits (180), Expect = 7e-15
Identities = 38/142 (26%), Positives = 54/142 (38%), Gaps = 13/142 (9%)

Query: 3085 DQADTIYGGAGNDILFGQGGNDKLFGGADNDILIGGLGSDILTGGDGEDIFKWID----V 3140
+ GG+GNDIL G ++ L GGA ND+L GG G+D L GG G D F +
Sbjct: 338 VTIENAIGGSGNDILVGNSADNILQGGAGNDVLYGGAGADTLYGGAGRDTFVYGSGQDST 397

Query: 3141 ANERDTVTDFSSSEDSLDFSDL-------FDDLSKDEVGDLLSDLQSGSHTGDAGGYHVE 3193
D + DF D +D S F G + + +
Sbjct: 398 VAAYDWIADFQKGIDKIDLSAFRNEGQLSFVQDQFTGKGQEV--MLQWDAANSITNLWLH 455

Query: 3194 VSQDGSTDTNLSITKGSSTLDI 3215
+ S D + I ++ DI
Sbjct: 456 EAGHSSVDFLVRIVGQAAQSDI 477


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VP1635    OMPADOMAIN  83     3e-21    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 83.4 bits (206), Expect = 3e-21
Identities = 35/122 (28%), Positives = 58/122 (47%), Gaps = 11/122 (9%)

Query: 78 MQVRVLFANDSDEINPVFRRQIRELSDFLKDY--PTTSIELQGYASKTGGSKHNQDLSER 135
++ VLF + + P + + +L L + S+ + GY + G +NQ LSER
Sbjct: 217 LKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSER 276

Query: 136 RAKNVRDALLSYGIEPNRVRIIGFGDTH-LAEQGTDQVSH--------ALNRRVTASVVG 186
RA++V D L+S GI +++ G G+++ + D V A +RRV V G
Sbjct: 277 RAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVKG 336

Query: 187 YK 188
K
Sbjct: 337 IK 338


43  VP1656  VP1659  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
VP1656    4       17       1.267115  translocator protein PopD
VP1657    2       18       1.823689  translocator protein PopB
VP1658    2       18       1.636007  low calcium response locus protein H
VP1659    3       19       1.592781  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VP1656    PF05844  227    4e-75    YopD protein
		>PF05844#YopD protein

Length = 295

Score = 227 bits (580), Expect = 4e-75
Identities = 74/238 (31%), Positives = 135/238 (56%), Gaps = 5/238 (2%)

Query: 97 IKSPSDAVSQSLSLLTLLYQVSKLSREQQVLQREIAVEANVASLQSQAAELNNSASAMIA 156
+++ + S+ LL +L+++++ +RE VLQR+ +A + + ++Q E+ + A+ MIA
Sbjct: 63 MEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQAIIHAQKAQVDEMRSGATLMIA 122

Query: 157 MAVVSGVLAGATAIIGALGSFKAGKEIKTEMASNNVLKTQKAGFDQVEELMNNGNLSKTQ 216
MAV++GV A A+A++G+LG+ K GK I E + + D + + +
Sbjct: 123 MAVIAGVGALASAVVGSLGALKNGKAISQEKTLQKNIDGRNELIDAKMQALGK----TSD 178

Query: 217 QDQVKRAHSLAKDSIADTTAQLTSGGRKFDKLMSSNQAKNAILQALGQMANSASNVEQTK 276
+D+ A D D+ A L + GR F+ + Q N ++Q+ QMAN++ V Q +
Sbjct: 179 EDRKIVGKVWAADQAQDSVA-LRAAGRAFESRNGALQVANTVIQSFVQMANASVQVRQGE 237

Query: 277 AQARSKDDEVQATRAQAAKQKADENIGFQEGLLKELRELFRSISDSQNQAWRASIPTV 334
+QA ++++EV AT Q+ KQK ++ + F G +K++ +L + + S NQAWRA+ V
Sbjct: 238 SQASAREEEVNATIGQSQKQKVEDQMSFDAGFMKDVLQLIQQYTQSHNQAWRAAAGVV 295


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VP1657    BACINVASINB  32     0.006    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 31.6 bits (71), Expect = 0.006
Identities = 67/381 (17%), Positives = 139/381 (36%), Gaps = 47/381 (12%)

Query: 56 KVQLDAPNAAVSDKVTDLTLKAMAQLQKIVDTIAKALHAVADQTGSIAVKIIAGSADDFE 115
K LD A TD KA + A A +Q ++ A
Sbjct: 199 KEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQNQVSQGEQDNLSNVARLTM 258

Query: 116 VELAAITDKLKSAQNELKIQEVKVAKAKHEQEMAENQEKIKESEAAAKEAQKS----GLA 171
+ I K+ + L+ ++ + A E AE ++K E + ++A+++ G
Sbjct: 259 LMAMFIEIVGKNTEESLQ-NDLALFNALQEGRQAEMEKKSAEFQEETRKAEETNRIMGCI 317

Query: 172 AKIFGWISAVVSIVVGAIMVATGVGAAAG--ALMIAGGVMGAVS---------------- 213
K+ G + +VS+V + AA A+M+A ++ A +
Sbjct: 318 GKVLGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAATGVSFIQQALNPIMEHV 377

Query: 214 ----MALQEPAVQDALKEAGVN---VDVLNKVVMALEIAVAVIGAIVTFGGAAAGGIAKL 266
M L A+ AL+ GV+ ++ +V A+ A+A++ IV G AKL
Sbjct: 378 LKPLMELIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMVAVIVVVAVVGKGAAAKL 437

Query: 267 AAKSASKIAQKVTDIATKAAANMAKVAD-------------MGSKAATTTAKAIRYGAET 313
+ + + + + +A+ +G+ + + E
Sbjct: 438 GNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLGNVGSKMGLQTNALSKEL 497

Query: 314 VDLTVN----IGKGATDSVHAANNANVTEIQADITDLRAKMTLSQAVIDKLKEEIGKLME 369
V T+N + + +A + ++ A L++ +D++++ + + +E
Sbjct: 498 VGNTLNKVALGMEVTNTAAQSAGGVAEGVFIKNASEALADFMLARFAMDQIQQWLKQSVE 557

Query: 370 DFQELMSIIMQMIQAKSETMQ 390
F E + ++ +A S +Q
Sbjct: 558 IFGENQKVTAELQKAMSSAVQ 578


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1658    SYCDCHAPRONE  153    1e-50    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD

chaperone signature.
Length = 168

Score = 153 bits (389), Expect = 1e-50
Identities = 76/154 (49%), Positives = 112/154 (72%)

Query: 7 TDPSQMQAEELLSFLEEGGTLKMLHDVSADTIEHIYAVGYNFFQSGKIEQAAKVFQLLSM 66
T +Q + SFL+ GGT+ ML+++S+DT+E +Y++ +N +QSGK E A KVFQ L +
Sbjct: 5 TTDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCV 64

Query: 67 LDHYQARFFIGLGAARQELGEYLQAIDAYSYAALVDINDPRPPFHSAECHLKLEQLTEAE 126
LDHY +RFF+GLGA RQ +G+Y AI +YSY A++DI +PR PFH+AEC L+ +L EAE
Sbjct: 65 LDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAE 124

Query: 127 SGFYSAKEMSAGKSQYADLHQRAGIMLEAVRNKR 160
SG + A+E+ A K+++ +L R MLEA++ K+
Sbjct: 125 SGLFLAQELIADKTEFKELSTRVSSMLEAIKLKK 158


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VP1659    LCRVANTIGEN  114    3e-30    Low calcium response V antigen signature.
		>LCRVANTIGEN#Low calcium response V antigen signature.

Length = 326

Score = 114 bits (287), Expect = 3e-30
Identities = 76/233 (32%), Positives = 127/233 (54%), Gaps = 20/233 (8%)

Query: 392 GALKDRLLNITEQEKKDLEVRAEHSLTARDLLAVVESSI-GDRFDEQVLFALNERRVNRL 450
G ++L N ++ K+ LE R +AV+ S+ DR D+ +L + + +
Sbjct: 88 GHYDNQLQNGIKRVKEFLESSPNTQWELRAFMAVMHFSLTADRIDDDILKVIVDSMNHHG 147

Query: 451 EKRNEQKEALQDLTVQLKIFGVVQSKIHSTQSVDGTYKPDDNAFSASDFNYNSVTD---F 507
+ R++ +E L +LT +LKI+ V+Q++I+ S GT D + + D N TD F
Sbjct: 148 DARSKLREELAELTAELKIYSVIQAEINKHLSSSGTINIHDKSINLMDKNLYGYTDEEIF 207

Query: 508 QNSPEYKYL------------TDNGITTHTDFL----KKQGVTVADGASFKDEEKTKKLS 551
+ S EYK L ++ I + DFL K+ G S+ + +LS
Sbjct: 208 KASAEYKILEKMPQTTIQVDGSEKKIVSIKDFLGSENKRTGALGNLKNSYSYNKDNNELS 267

Query: 552 NFSSSVSDKSKLLNDEVQIKTTELNDISSQYNSTVEAMNKFVQKYHSILQEIL 604
+F+++ SDKS+ LND V KTT+L+DI+S++NS +EA+N+F+QKY S++Q +L
Sbjct: 268 HFATTCSDKSRPLNDLVSQKTTQLSDITSRFNSAIEALNRFIQKYDSVMQRLL 320


44  VP1667  VP1675  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP1667    3       19       2.780298   outer membrane protein PopN
VP1668    2       19       3.042342   type III secretion system ATPase
VP1669    2       19       1.429021   type III secretion protein YscO
VP1670    2       18       1.429230   translocation protein in type III secretion
VP1671    2       20       1.354709   type III secretion system protein
VP1672    1       16       1.084443   type III secretion system protein
VP1673    0       15       1.140308   translocation protein in type III secretion
VP1674    1       17       0.611338   translocation protein in type III secretion
VP1675    0       16       0.934123   translocation protein in type III secretion
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VP1667    PF07201  269    5e-92    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 269 bits (688), Expect = 5e-92
Identities = 96/298 (32%), Positives = 163/298 (54%), Gaps = 6/298 (2%)

Query: 1 MSIINSQIATNTKFDASVRNGLESSRADSAVKGSYRGETVRVHNAT-QSLFDAMEELTSL 59
M+ +++ NT R + SS+ + G +RGE+V++ + T QS+ D EE+T +
Sbjct: 1 MTTLHNLSYGNTPLHNE-RPEIASSQIVNQTLGQFRGESVQIVSGTLQSIADMAEEVTFV 59

Query: 60 GSEKAEKDLTKRKIKDGGVRVNEAHELVSDYLRKVPDLEKNQKIKDLAAKMAGGNISTIA 119
SE+ E L KRK+ D RV++ E V+ YL KVP+LE+ Q + +L + ++ +++
Sbjct: 60 FSERKELSLDKRKLSDSQARVSDVEEQVNQYLSKVPELEQKQNVSELLSLLSNSPNISLS 119

Query: 120 QLQAYLNGFSEEKSHQYLALKAVKKYLSANPESKHLLALIDQAILKIEQNPDSWSQIDTE 179
QL+AYL G SEE S Q+ L ++ L PE HL L++QA++ + + +
Sbjct: 120 QLKAYLEGKSEEPSEQFKMLCGLRDALKGRPELAHLSHLVEQALVSMAEEQGETIVLGAR 179

Query: 180 IRVSHFADEFSKEQEFSSLHQLRGFYRDTVHSYQGLGSAYQDVVERFGEQEVSTAVDFML 239
I + + + + L LR YRD V YQG+ + + D+ +RF ++ + + F+
Sbjct: 180 ITPEAYRES---QSGVNPLQPLRDTYRDAVMGYQGIYAIWSDLQKRFPNGDIDSVILFLQ 236

Query: 240 QGMSADLSVQGSNIDSVKLQLLMSDMQKLKTLNTLQDQVGRLFQMFKPERMSHGLSGF 297
+ +SADL Q S KL +++SD+QKLK ++ DQV +Q F E ++G+ F
Sbjct: 237 KALSADLQSQQSGSGREKLGIVISDLQKLKEFGSVSDQVKGFWQFFS-EGKTNGVRPF 293


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VP1669    IGASERPTASE  29     0.007    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.3 bits (65), Expect = 0.007
Identities = 18/137 (13%), Positives = 37/137 (27%), Gaps = 7/137 (5%)

Query: 13 ADRADKAVQRQEYRVANVAAELQKAERSVADYHVWRQEEEERRFAKAKQQTVLLKELETL 72
+++ Q VA +E ++ + + ++EE+ + K Q + +
Sbjct: 1071 EAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQ----EVPKVT 1126

Query: 73 RQEIALLREREAELKQRVAE---VKVTLEQERTLLKQKQQEALQAHKTKEKFVQLQQQEI 129
Q + E Q +E + Q K V+ E
Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTES 1186

Query: 130 AEQSRQQQYQEELEQEE 146
+ E E
Sbjct: 1187 TTVNTGNSVVENPENTT 1203


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VP1670    IGASERPTASE  33     0.002    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.5 bits (76), Expect = 0.002
Identities = 39/245 (15%), Positives = 92/245 (37%), Gaps = 6/245 (2%)

Query: 9 TPSTQPHSPQSPMPIDDHAMLQSRFERALKENPDQTKPQPNETNNQAALEAKKPFTENYS 68
T T P++ Q+ +P ++ + E A + P P + A+ E+ +
Sbjct: 995 TNITTPNNIQADVP----SVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKT 1050

Query: 69 ERSLASLHSTGSNQRKTAEKSAFAQNNDVVTENINNESDTSSPIALNTQDKMPSDMDANM 128
+ + Q + K A + N +S + + T+ K + ++
Sbjct: 1051 VEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEE 1110

Query: 129 KPDIRIPTTGDK-KLPMEPSNKREKNADEQDAFERLVEDHDDSKTELAAAKTAESHESTQ 187
K + T + K+ + S K+E++ Q E E+ + ++T + ++ Q
Sbjct: 1111 KAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQ 1170

Query: 188 SPKDT-KHIPTAGTKPNTVETVSDALASTLASTTVASATSASLASHPVTSTVHKTATKTE 246
K+T ++ T+ TV T + + + +T + + + S H+ + ++
Sbjct: 1171 PAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSV 1230

Query: 247 VNKHE 251
+ E
Sbjct: 1231 PHNVE 1235



Score = 29.3 bits (65), Expect = 0.044
Identities = 29/201 (14%), Positives = 59/201 (29%), Gaps = 5/201 (2%)

Query: 33 FERALKENPDQTKPQPNETNNQAALEAKKPFTENYSERSLASLHSTGSNQRKTAEKSAFA 92
+ + QT + +AK + + S S Q +T + A
Sbjct: 1086 AQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEP 1145

Query: 93 QNNDVVTENINNESDTSSPIALNTQDKMPSDMDANMKPDIRIPTTGDKKLPMEPSNKREK 152
+ T NI ++ A Q + +N++ + TT + + + +
Sbjct: 1146 ARENDPTVNIKEPQSQTNTTADTEQPA--KETSSNVEQPVTESTTVNTGNSVVENPENTT 1203

Query: 153 NADEQDAFERLVEDHDDSKTELAAAKTAESHESTQSPKDTKHIPTAGTKPNTVETVSDAL 212
A Q E + K + + H + + T T + L
Sbjct: 1204 PATTQP--TVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVL 1261

Query: 213 ASTLAST-TVASATSASLASH 232
+ A VA +++ H
Sbjct: 1262 SDARAKAQFVALNVGKAVSQH 1282


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1671    TYPE3OMOPROT  84     1e-20    Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein

family signature.
Length = 303

Score = 83.9 bits (207), Expect = 1e-20
Identities = 44/155 (28%), Positives = 61/155 (39%), Gaps = 12/155 (7%)

Query: 170 TQHIALPVWLSLGKTHLDLNQFHSLELGDVIFFDQCYIAQQQAIVQVSNKNLWRCQLEDN 229
L + T L + +GDV+ A V K L +
Sbjct: 147 MLRWPLRFVIGSSDTQRSL--LGRIGIGDVLLIRTSR-----AEVYCYAKKLGHFNRVEG 199

Query: 230 -----TLYIIEKETNMNDVNTSETLTDHQQLPVELTFDIGHQTVTLEQLNQLQPGYVFEL 284
TL I E N T+ETL QLPV+L F + + VTL +L + + L
Sbjct: 200 GIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYRKNVTLAELEAMGQQQLLSL 259

Query: 285 NQPVSKPVTLRANGKIIGECELVNVNDHLGVRVLE 319
V + ANG ++G ELV +ND LGV + E
Sbjct: 260 PTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHE 294


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1672    TYPE3IMPPROT  251    2e-87    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein

family signature.
Length = 224

Score = 251 bits (642), Expect = 2e-87
Identities = 98/217 (45%), Positives = 145/217 (66%), Gaps = 7/217 (3%)

Query: 6 DELNLIVSLALLALIPFIAMMATSFVKLAVVFSLLRNALGVQQIPPNMALYGLAIILSIF 65
++++LI LA L+PFI T FVK ++VF ++RNALG+QQIP NM L G+A++LS+F
Sbjct: 3 NDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMF 62

Query: 66 IMAPVGFETYDYVKQHDISLEDSASVEGLIESGLQPYREFLKKHIRETEAIFFTDAARTL 125
+M P+ + Y Y + D++ D +S+ ++ GL YR++L K+ FF +A
Sbjct: 63 VMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQLKR 122

Query: 126 WP-------QKYVDRLESDSLLLLLPAFTVSELTRAFEIGFLLYLPFIAIDLIVSNILLA 178
++ D +E S+ LLPA+ +SE+ AF+IGF LYLPF+ +DL+VS++LLA
Sbjct: 123 QYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVLLA 182

Query: 179 MGMMMVSPMTISLPFKLLLFVLLDGWTKLTHGLVLSY 215
+GMMM+SP+TIS P KL+LFV LDGWT L+ GL+L Y
Sbjct: 183 LGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1673    TYPE3IMQPROT  62     1e-16    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein

family signature.
Length = 86

Score = 62.1 bits (151), Expect = 1e-16
Identities = 31/80 (38%), Positives = 48/80 (60%)

Query: 5 EIIHFTTQALTLVLFLSLPPILVAALVGTLVSLIQALTQVQEQTLGFVVKLIAVIITLFI 64
+++ +AL LVL LS P +VA ++G LV L Q +TQ+QEQTL F +KL+ V + LF+
Sbjct: 3 DLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFL 62

Query: 65 TTQWLGAELHAFASLALDKI 84
+ W G L ++ +
Sbjct: 63 LSGWYGEVLLSYGRQVIFLA 82


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1674    TYPE3IMRPROT  156    8e-49    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein

family signature.
Length = 261

Score = 156 bits (395), Expect = 8e-49
Identities = 53/240 (22%), Positives = 103/240 (42%), Gaps = 5/240 (2%)

Query: 10 LFLYSLTLPRLMACFIFLPILSKQMLGGAMIRNGVLCSLALFIFPVVNEQALPAETDGLW 69
L LY L R++A PILS++ + ++ G+ + I P + +P +
Sbjct: 13 LNLYFWPLLRVLALISTAPILSERSVPK-RVKLGLAMMITFAIAPSLPANDVPVFSFFAL 71

Query: 70 LIVILGKEVLLGMLIGFVAAIPFWAIEATGFLVDNQRGAAMASMFNPTLGSQSTPTAVLL 129
+ + +++L+G+ +GF F A+ G ++ Q G + A+ +P A ++
Sbjct: 72 WLAV--QQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIM 129

Query: 130 TQTLITLFFSGGGFVAFIYALFKSYTTWPILGFFPMVTDAWVSFFYDQFQQLMWLGVLMS 189
+ LF + G + I L ++ T PI G + G++++
Sbjct: 130 DMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPL--NSNAFLALTKAGSLIFLNGLMLA 187

Query: 190 APLVLAMFLAEFGLALISRFAPQLNVFFLAMPIKSAIASVLLIVYLGLMMDHFEALFYGI 249
PL+ + L L++R APQL++F + P+ + L+ + L+ E LF I
Sbjct: 188 LPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEI 247


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1675    TYPE3IMSPROT  416    e-148    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein

family signature.
Length = 354

Score = 416 bits (1072), Expect = e-148
Identities = 217/347 (62%), Positives = 281/347 (80%)

Query: 1 MSGEKTEQPTAKKLRDARKKGQVAKSQEIVSSALILALIAVLFAFADYYMSHISALLLLP 60
MSGEKTEQPT KK+RDARKKGQVAKS+E+VS+ALI+AL A+L +DYY H S L+L+P
Sbjct: 1 MSGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIP 60

Query: 61 SELAYQGFQDALIDVAIAIAKEIAYLLAPIILVAALIAIFSNMGQFGFLFSGESIKPDIK 120
+E +Y F AL V + E YL P++ VAAL+AI S++ Q+GFL SGE+IKPDIK
Sbjct: 61 AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIK 120

Query: 121 KINPVEGAKRIFSLKSVIEFIKSILKVSLLSCIIWVTLRGNINTLMQIPTCGLECVPAVT 180
KINP+EGAKRIFS+KS++EF+KSILKV LLS +IW+ ++GN+ TL+Q+PTCG+EC+ +
Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLL 180

Query: 181 GVMIKQLMIISSVGFVVIAAADFAYQKFDHTKKLKMSKDEVKREYKEMEGSPEIKSKRRQ 240
G +++QLM+I +VGFVVI+ AD+A++ + + K+LKMSKDE+KREYKEMEGSPEIKSKRRQ
Sbjct: 181 GQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQ 240

Query: 241 LHQELQASNQRENVKRSNVLVTNPTHIAVGLYYKKGETPLPVITLMETDAMAKRMIAIAR 300
HQE+Q+ N RENVKRS+V+V NPTHIA+G+ YK+GETPLP++T TDA + + IA
Sbjct: 241 FHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAE 300

Query: 301 EEGVPVMQKVPLARALYADGNVDQYIPSELIEATAEVLRWLASLESD 347
EEGVP++Q++PLARALY D VD YIP+E IEATAEVLRWL +
Sbjct: 301 EEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIE 347


45  VP1682  VP1697  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP1682    2       20       0.241622   hypothetical protein
VP1681    3       21       0.060828   hypothetical protein
VP1683    3       23       0.428648   hypothetical protein
VP1684    3       21       0.487528   hypothetical protein
VP1685    4       21       0.981928   hypothetical protein
VP1686    3       22       1.236953   effector protein
VP1687    4       22       1.174182   type III chaperone
VP1688    3       22       1.631544   type III secretion system protein
VP1689    3       24       1.618096   type III secretion protein
VP1690    1       23       2.107280   type III secretion lipoprotein
VP1691    1       19       2.972163   type III export protein
VP1692    0       17       2.493023   type III export protein
VP1693    1       19       2.126782   type III secretion protein
VP1694    1       20       1.670627   type III export protein YscF
VP1695    -1      20       1.121393   type III export protein PscD
VP1696    -1      20       -0.052342  type III secretion protein YscC
VP1697    2       23       -2.460475  type III export apparatus protein NosA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VP1682    PF05932  46     4e-09    Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 45.6 bits (108), Expect = 4e-09
Identities = 29/124 (23%), Positives = 47/124 (37%), Gaps = 5/124 (4%)

Query: 3 TIQPLLDEFCRLNELPPLILEDGNRCQLLVDDRFVLYFTATEDDALMLSVAFGGLEKSGE 62
+ LLD+F R E+ PL+ +D C +++D+ F L + D A + G LE +
Sbjct: 5 FYKTLLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTL--SCDYARERLLLIGLLEPHKD 62

Query: 63 LRVRGLELLARANYQRVGSGNLALSLAPNGRQLVLAGRQPTEHLNSANLTVWFHEIIEQT 122
+ + L L N L L P E L+ L ++E
Sbjct: 63 IPQQCL-LAGALNPLLNAGP--GLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWM 119

Query: 123 ELWQ 126
W+
Sbjct: 120 RGWR 123


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1686    YERSSTKINASE  32     0.006    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 31.6 bits (71), Expect = 0.006
Identities = 23/70 (32%), Positives = 37/70 (52%), Gaps = 4/70 (5%)

Query: 25 GKLSIGGKEYTINAATQEFTRANPTSGAVARFFEATGKLFREGSTQ-SVAKAITKAVFDN 83
G+L+IGGK Y I + R NP SG + F E GK+F S+A+ +T +
Sbjct: 32 GELNIGGKRYRI--IDNQVLRLNPHSG-FSLFREGVGKIFSGKMFNFSIARNLTDTLHAA 88

Query: 84 EQGQAQRLQT 93
++ +Q L++
Sbjct: 89 QKTTSQELRS 98


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VP1687    PF05932  49     1e-10    Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 49.0 bits (117), Expect = 1e-10
Identities = 32/121 (26%), Positives = 59/121 (48%), Gaps = 6/121 (4%)

Query: 1 MANGFITALMTDFAHRCQIEQLFFDGDDCCHLLIDQDTAITVRAE--DDRLTLIGLISGD 58
M+N F L+ DF+ +++ L FD C+++ID A+T+ + +RL LIGL+
Sbjct: 1 MSNLFYKTLLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTLSCDYARERLLLIGLLEPH 60

Query: 59 K--PEHDVMLQYMKASLTQGSPAVYWDEEVG-FVGFVHLSQQWLDAAILDESLGNFIEWL 115
K P+ +L L P + DE+ G + + + ++ L L + +EW+
Sbjct: 61 KDIPQQC-LLAGALNPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWM 119

Query: 116 K 116
+
Sbjct: 120 R 120


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VP1688    FLGFLIH  37     2e-05    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 37.1 bits (85), Expect = 2e-05
Identities = 41/161 (25%), Positives = 68/161 (42%), Gaps = 22/161 (13%)

Query: 51 QQAYETEKQRGYQDGLEQAKIENAQAMVATLARCNEYYLQ-----------VEHKMTNVV 99
QQ ++ Q G GLEQ E AR + + + ++ +
Sbjct: 65 QQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMA 124

Query: 100 LDAVRKIIDTFDDVDTT--ISVVREALQ---LVSNQKQVILHVHPEQVVDVREKVAGVLS 154
L+A R++I VD + I +++ LQ L S + Q L VHP+ + V + + LS
Sbjct: 125 LEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQ--LRVHPDDLQRVDDMLGATLS 182

Query: 155 DFPEVGYVDVVADARLKNGGCILETEVGIIDASIDGQIQAL 195
+ + D L GGC + + G +DAS+ + Q L
Sbjct: 183 ----LHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQEL 219


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1690    FLGMRINGFLIF  65     4e-14    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 64.6 bits (157), Expect = 4e-14
Identities = 33/168 (19%), Positives = 73/168 (43%), Gaps = 8/168 (4%)

Query: 22 TELYTNVSQKEGNEMLSILLSEGVVATKEPDKDNKVKLMVDSSQIAFAVDALKRKGYPRE 81
L++N+S ++G +++ L + + V + ++ L ++G P+
Sbjct: 51 RTLFSNLSDQDGGAIVAQLTQMNIPYRFANGSGA---IEVPADKVHELRLRLAQQGLPKG 107

Query: 82 QFSTLKEVFPKDDLISSPLAERARLVYAKSQELSSTLSQIDGVLVARVHVVL-EDQDLRP 140
+ E+ ++ S +E+ A EL+ T+ + V ARVH+ + +
Sbjct: 108 G-AVGFELLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVR 166

Query: 141 GERPTPASASVFIKHAADVALD-SYVPQIKLLVNNSIEGLNYDRISVV 187
++ SASV + ALD + + LV++++ GL +++V
Sbjct: 167 EQK--SPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVTLV 212


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VP1692    PF09025  62     1e-14    YopR Core
		>PF09025#YopR Core

Length = 143

Score = 62.0 bits (150), Expect = 1e-14
Identities = 34/119 (28%), Positives = 47/119 (39%), Gaps = 1/119 (0%)

Query: 89 PQTQKEALWYAFHQAKSAKGTDDAVPELLSVLKQELLGDFAGQLMAEPPTDRAALKAMLA 148
P + + + L + LL FA L DR LKAML
Sbjct: 24 PAVDQVLAFEQALGGEPPAAGRRLAGLENGALGERLLQRFAQPLQG-LEADRLELKAMLR 82

Query: 149 QSFPLGAQKEQALWHCWAELKSLPEMTSTVDLVREELSFVIQKNAMVKNIMTHSHKLDL 207
PLG Q++ L ++ P L R EL +I N M+ N++ +SHKLDL
Sbjct: 83 AELPLGRQQQTFLLQLLGAVEHAPGGEYLAQLARRELQVLIPLNGMLDNLVRNSHKLDL 141


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1696    TYPE3OMGPROT  577    0.0      Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein

family signature.
Length = 607

Score = 577 bits (1489), Expect = 0.0
Identities = 288/594 (48%), Positives = 406/594 (68%), Gaps = 21/594 (3%)

Query: 34 ATELNWPEQPFRYYADNDSLKDLLNNFGANYRVSVSVSDKVNDRVSGRFTPEDPAEFLDY 93
A EL+W P+ Y A +SL+DLL +FGANY +V VSDK+ND+VSG+F ++P +FL +
Sbjct: 26 AQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKINDKVSGQFEHDNPQDFLQH 85

Query: 94 LAQVYNLMWYFDGAVLHVYKATETRSRLLQLELLTARELRSTLISTGVWDARYGWRAAEN 153
+A +YNL+WY+DG VL+++K +E SRL++L+ A EL+ L +G+W+ R+GWR +
Sbjct: 86 IASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQALQRSGIWEPRFGWRPDAS 145

Query: 154 KGLVYLAGPPRYVELVVQTAEALESRLLQKSNSTDELFVELIPLKYASATDRSISYRDQS 213
LVY++GPPRY+ELV QTA ALE + +S T L +E+ PLKYASA+DR+I YRD
Sbjct: 146 NRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEIFPLKYASASDRTIHYRDDE 205

Query: 214 ITVPGIASVLSRVVGGVQTQITDSASVQTSSVNGLPAEAAKPRGKTASVHGGATVEAEPG 273
+ PG+A++L RV+ A++Q +V+ A R A VEA+P
Sbjct: 206 VAAPGVATILQRVL--------SDATIQQVTVDNQRIPQAATRAS-----AQARVEADPS 252

Query: 274 LNAIIVRDTQARLPLYRKLVAQLDQPQSRIEVALSIVDISANDLRQLGVDWRAGVSVGNN 333
LNAIIVRD+ R+P+Y++L+ LD+P +RIEVALSIVDI+A+ L +LGVDWR G+ GNN
Sbjct: 253 LNAIIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDINADQLTELGVDWRVGIRTGNN 312

Query: 334 RIVDIKTTGDVDNGDVTLGSGQSFKSLLDSTNLNYLLAQIRLLESKGSAQVVSRPTLLTQ 393
V IKTTGD N + S + SL+D+ L+YLLA++ LLE++GSAQVVSRPTLLTQ
Sbjct: 313 HQVVIKTTGDQSN----IASNGALGSLVDARGLDYLLARVNLLENEGSAQVVSRPTLLTQ 368

Query: 394 ENVEAVLNNSSTFYVKLVGKETAALEEVTYGTLLRIVPRIVGDRFATRPEINLSLHLEDG 453
EN +AV+++S T+YVK+ GKE A L+ +TYGT+LR+ PR++ + EI+L+LH+EDG
Sbjct: 369 ENAQAVIDHSETYYVKVTGKEVAELKGITYGTMLRMTPRVLTQ--GDKSEISLNLHIEDG 426

Query: 454 AKIPDG-GVDDLPSVRKTEISTLATVKQGQSLLIGGVYRDEVSHQLRKVPLLGDIPYLGA 512
+ P+ G++ +P++ +T + T+A V GQSL+IGG+YRDE+S L KVPLLGDIPY+GA
Sbjct: 427 NQKPNSSGIEGIPTISRTVVDTVARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGA 486

Query: 513 LFRSNTNTTRRTVRMFIIEPRIVVDGIGDSVLIGNEHDLRPSIGQLNNISNNSAEFKSVV 572
LFR + TRRTVR+FIIEPRI+ +GI + +GN DLR I ++ ISN S ++
Sbjct: 487 LFRRKSELTRRTVRLFIIEPRIIDEGIAHHLALGNGQDLRTGILTVDEISNQSTTLNKLL 546

Query: 573 EVFSCTSKTQAERYQQDLLSQQKSSLLTQCQLPSGQVGWRVKVAECDLSQAECV 626
C +A+ Q+ L KSS LTQC++ +GWRV C +Q+ CV
Sbjct: 547 GGSQCQPLNKAQEVQKWLSQNNKSSYLTQCKM-DKSLGWRVVEGACTPAQSWCV 599


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VP1697    PF05932  39     1e-06    Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 39.0 bits (91), Expect = 1e-06
Identities = 15/124 (12%), Positives = 40/124 (32%), Gaps = 8/124 (6%)

Query: 3 DKMMKSLAETLGIGPFIAGENGAYTIEVD-QLTLTIKQHSSWILWETALPLRFNEHLDYQ 61
++ + +L + P + ++G + +D LT+ + L+
Sbjct: 7 KTLLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTLSCDYARERLLLIGL------LEPH 60

Query: 62 QEQALKRCMQLSLKTLRDTPSVLTTNADQQLILQGKAM-IENTSNDQFAELLAQHANVCE 120
++ + + +L L + L + L +++ E S +A
Sbjct: 61 KDIPQQCLLAGALNPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWMR 120

Query: 121 RYME 124
+ E
Sbjct: 121 GWRE 124


46  VP1711  VP1714  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP1711    0       17       0.862278   response regulator
VP1712    1       16       0.770814   sensor kinase CitA
VP1713    1       15       0.273562   hypothetical protein
VP1714    0       16       0.406696   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VP1711    HTHFIS  80     1e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.3 bits (198), Expect = 1e-19
Identities = 32/140 (22%), Positives = 60/140 (42%), Gaps = 6/140 (4%)

Query: 4 ATRVMIIEDDIAIAELHHKYLSQLAGLDVVGIATTRLEAEMQLEVLKPDLLLMDVYLPDG 63
+++ +DD AI + ++ LS+ AG DV + + DL++ DV +PD
Sbjct: 3 GATILVADDDAAIRTVLNQALSR-AGYDVRITSNAA-TLWRWIAAGDGDLVVTDVVMPDE 60

Query: 64 TGLEILNTLRSNNQTCDVILITAARDVDTLQQAMRGGVVDYLLKPV----MFPRLETALK 119
++L ++ V++++A T +A G DYL KP + + AL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 KYITQRQQLDVAKSLDQGLV 139
+ + +L+ LV
Sbjct: 121 EPKRRPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VP1712    PF06580  32     0.006    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.8 bits (72), Expect = 0.006
Identities = 33/234 (14%), Positives = 74/234 (31%), Gaps = 56/234 (23%)

Query: 320 EQLSQTKEYADL--LRSQTHEH--RNKLNTISGLVQMGELEAVQKLIGQETAHYQAMIEF 375
+++ + A L L++Q + H N LN I L+ +A + L ++ E
Sbjct: 152 WKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREML--------TSLSEL 203

Query: 376 LRDTIKDPLIAGMLLGK----------TERAR---ELGLQLVVEEGSRLEPLTEWLNSED 422
+R +++ + L + L + + ++ +
Sbjct: 204 MRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINP--------AIMDVQV 255

Query: 423 LVTILGNLIDNAFDATLSVIRDESNVASERRNIEVSVSDYGNEVILEVSDHGCGLPENIE 482
++ L++N ++ + I + + V LEV + G +N
Sbjct: 256 PPMLVQTLVENGIKHGIAQLPQGG-------KILLKGTKDNGTVTLEVENTGSLALKN-- 306

Query: 483 PQTLFKKGISTKSRQNRGVGLHLVNQLATRYHG--HVEMLPNTGHGTRITVYLP 534
++++ G GL V + +G L V +P
Sbjct: 307 ------------TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1713    ARGDEIMINASE  30     0.025    Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 29.8 bits (67), Expect = 0.025
Identities = 25/171 (14%), Positives = 61/171 (35%), Gaps = 37/171 (21%)

Query: 186 VEQGLKKHRDFAIKLEEQG-QIARVERLQAEA--------------SLDKAKVETR---- 226
+E ++H FA L+ +I +E L +E + +A+++T
Sbjct: 48 LEVARQEHEVFASILKNNLVEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTIN 107

Query: 227 ------KAASDLSIAQAALGKLLAQE--------DSVEPAETLFVNDNLPPLSAFIDQTL 272
+ + ++ + ++ +E D + LF+ D +P + D
Sbjct: 108 LLKDYFSSLTIDNMISKMISGVVTEELKNYTSSLDDLVNGANLFIIDPMPNVLFTRDPFA 167

Query: 273 LTYPGLDI--LDAKHKQASSLIKAEKGKYYPEVYLYGDYSLYEDDSLASQM 321
G+ I + K +Q ++ KY+P + ++ + + +
Sbjct: 168 SIGNGVTINKMFTKVRQRETIFAEYIFKYHPV--YKENVPIWLNRWEEASL 216


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VP1714    RTXTOXIND  41     4e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 41.0 bits (96), Expect = 4e-06
Identities = 18/132 (13%), Positives = 45/132 (34%), Gaps = 9/132 (6%)

Query: 48 ISSKVPGRIDEVMVRKGDDVEKGQLIFTLHSPEIEAKLEQAKAGEKAADALAQEAEKGAR 107
I + E++V++G+ V KG ++ L + EA + ++ A + +R
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR 158

Query: 108 EQQIQAAKDQWLKAKAAADLMEKTYNRVNNLYKDGVVAEQKRDEALTQWQASKYTESAAF 167
++ + L + + + ++ + E + WQ KY +
Sbjct: 159 SIELNKLPELKLPDEPYFQNVSE---------EEVLRLTSLIKEQFSTWQNQKYQKELNL 209

Query: 168 QMYEMAKEGARS 179
+ +
Sbjct: 210 DKKRAERLTVLA 221


47  VP1981  VP1994  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP1981    -1      18       1.012527   methyl-accepting chemotaxis protein
VP1982    -1      19       0.625746   hypothetical protein
VP1983    -1      21       0.870393   two-component response regulator
VP1984    -2      20       1.130955   two-component sensor
VP1985    0       20       0.939236   hypothetical protein
VP1986    0       20       1.296266   hypothetical protein
VP1987    0       19       1.123600   hypothetical protein
VP1988    0       19       0.590790   LysR family transcriptional regulator
VP1989    0       16       0.179474   secretion protein
VP1990    1       19       0.141888   hypothetical protein
VP1991    0       19       0.795797   5-methyltetrahydropteroyltriglutamate--
VP1992    2       22       1.001069   hypothetical protein
VP1993    1       26       1.671720   transcriptional regulator
VP1994    1       23       2.503168   isochorismatase-like protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VP1981    RTXTOXIND  35     0.001    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 34.8 bits (80), Expect = 0.001
Identities = 31/232 (13%), Positives = 72/232 (31%), Gaps = 35/232 (15%)

Query: 419 LLTVMATADQVDKNASEGQARAAASRDQLNVQVNEVNSLATAINEMSATAQEVANSAVQA 478
L + A AD + +S QAR +R Q L+ +I ++ +
Sbjct: 127 LTALGAEADTLKTQSSLLQARLEQTRYQ---------ILSRSIELNKLPELKLPDEPYFQ 177

Query: 479 AAAASQVQSNSANGMSRMDNAASAVDNLASQVNDAQHQTQNLVASSTAIQGILSEIGGIA 538
+ +V ++ + + ++ + + ++A I +
Sbjct: 178 NVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLA---RINRYENLSRVEK 234

Query: 539 DQTNL---LALNAAIEAARAGEAGRGFAVVADEVRNLATRTQGSTEEIRAMLARLEQETQ 595
+ + L AI E + E N + E+I + + ++E Q
Sbjct: 235 SRLDDFSSLLHKQAIAKHAVLEQENKYV----EAVNELRVYKSQLEQIESEILSAKEEYQ 290

Query: 596 SIVVLMEQSQKQAVDTKEETQAAQLALAEINQAIEVINDMNNQIASAAEEQS 647
+ + + + L ++ Q + I + ++A E Q
Sbjct: 291 LVT---QLFKNE-------------ILDKLRQTTDNIGLLTLELAKNEERQQ 326


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VP1983    HTHFIS  86     1e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.0 bits (213), Expect = 1e-21
Identities = 30/120 (25%), Positives = 58/120 (48%), Gaps = 1/120 (0%)

Query: 3 RVLLVEDNREIAGMLFDYFECAGMQLDYADNGELGLKLAMDNAFDIILLDLMLPRMDGLT 62
+L+ +D+ I +L AG + N + D+++ D+++P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 LCNKLRDAGNNTPVLMLTALDSRDDMLKGFQHGADDYLTKPFDLD-ILEARMTALIKRYR 121
L +++ A + PVL+++A ++ +K + GA DYL KPFDL ++ AL + R
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1985    YERSSTKINASE  30     0.022    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 30.1 bits (67), Expect = 0.022
Identities = 46/146 (31%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 246 SDGYPHDELEACLQ----AGHHNNL--VKSIAQVDDG--KELALVMELIPNSYYNLGLPP 297
++G+ ELEA AG H NL V +A V G KE AL+M+ +
Sbjct: 169 AEGHLFAELEAYKHIYKTAGKHPNLANVHGMAVVPYGNRKEEALLMDEVDG----WRCSD 224

Query: 298 TLETCTRDTFPQGFTLTVEQVNS---------IVEQMVDVFNHLHDNKVCHGDLYAHNTL 348
TL T D++ QG ++NS I +++DV NHL V H D+ N +
Sbjct: 225 TLRTLA-DSWKQG------KINSEAYWGTIKFIAHRLLDVTNHLAKAGVVHNDIKPGNVV 277

Query: 349 VNE-QGEMIFGDFGAASIYGYLSGEQ 373
+ GE + D G S SGEQ
Sbjct: 278 FDRASGEPVVIDLGLHS----RSGEQ 299


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP1987    ISCHRISMTASE  31     0.003    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 30.8 bits (69), Expect = 0.003
Identities = 32/137 (23%), Positives = 48/137 (35%), Gaps = 19/137 (13%)

Query: 28 DALIENITKLVKGAKALDLPILWL----EQNPE-------------RLGPTAEPIREVL- 69
L NI KL L +P+++ QNP+ GP E I L
Sbjct: 54 TELSANIRKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELA 113

Query: 70 -ESTHLPITKYTFDGCKEATFKVAVENAKVDTWLVCGIESHICVYQTAVSLRQSGYRVEL 128
E L +TK+ + K + D ++ GI +HI TA +
Sbjct: 114 PEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFF 173

Query: 129 VTDCVSSRTAANKALAL 145
V D V+ + +AL
Sbjct: 174 VGDAVADFSLEKHQMAL 190


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VP1989    RTXTOXIND  57     5e-11    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 56.8 bits (137), Expect = 5e-11
Identities = 24/196 (12%), Positives = 63/196 (32%), Gaps = 9/196 (4%)

Query: 1 MTADQKFKHWMRTLIVLFIVLFLYIIIADRHAPLTTEGRVQGYVV------QVAPEVSGK 54
T + + I+ F+V+ + + + G + ++ P +
Sbjct: 50 ETPVSRRPRLVAYFIMGFLVIAFILSVL---GQVEIVATANGKLTHSGRSKEIKPIENSI 106

Query: 55 VTDVLIENNQSVQKGDVLFTIDDRKYKIALEQAELSLQSAYEKEATLYSQREAALANIAR 114
V +++++ +SV+KGDVL + + + + SL A ++ + N
Sbjct: 107 VKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLP 166

Query: 115 AQATFDNAHREYTRLLTLSKQKVISQSTLDNAFAQNQVSRAALKAERQNLKVIEAQLGDQ 174
D + + + + + + Q L +R + A++
Sbjct: 167 ELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRY 226

Query: 175 KGQSTAVRIAKNGIEK 190
+ S + +
Sbjct: 227 ENLSRVEKSRLDDFSS 242



Score = 52.9 bits (127), Expect = 7e-10
Identities = 38/227 (16%), Positives = 80/227 (35%), Gaps = 29/227 (12%)

Query: 64 QSVQKGDVLFTIDDRKYKIALEQAELSLQSAYEKEATLY---SQREAALANIARAQATFD 120
Q+V + +VL K + + Q + Y+KE L ++R LA I R +
Sbjct: 177 QNVSEEEVLRLTSLIKEQFSTWQNQK-----YQKELNLDKKRAERLTVLARINRYENLSR 231

Query: 121 NAHREYTRLLTLSKQKVISQ----------STLDNAFAQNQVSRAALKAERQNLKVIEAQ 170
+L ++ I++ N + +++E + K
Sbjct: 232 VEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL 291

Query: 171 LGDQKGQSTAVRIAKNG---------IEKAQLDLANTAVLAPSDGVVTNLQL-EVGTMAN 220
+ ++ + + K + + + AP V L++ G +
Sbjct: 292 VTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVT 351

Query: 221 TNMPLLTFVPTG-SLWVAADFREKSVANVDKTFHALVTFDANPGSVY 266
T L+ VP +L V A + K + ++ +A++ +A P + Y
Sbjct: 352 TAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRY 398


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP1990  ACRIFLAVINRP  29  0.024  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.4 bits (66), Expect = 0.024
Identities = 15/67 (22%), Positives = 31/67 (46%), Gaps = 4/67 (5%)

Query: 173 MGWVVAMAAFIVFQV-ADLYDSLSAQASILIILTPMTLAGSLAMAKIRIIGTALGCIAGM 231
+VA++ +VF A LY+S S S+++++ P+ + G L A + +
Sbjct: 872 APALVAISFVVVFLCLAALYESWSIPVSVMLVV-PLGIVGVLLAATLF--NQKNDVYFMV 928

Query: 232 AVQLILG 238
+ +G
Sbjct: 929 GLLTTIG 935


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP1994  ISCHRISMTASE  45  2e-08  Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 45.4 bits (107), Expect = 2e-08
Identities = 43/182 (23%), Positives = 73/182 (40%), Gaps = 13/182 (7%)

Query: 2 SNSALLVIDIQN---DYFPNGRFPLWNTDATLDNIKQLMARAKAQDIPI-FLVQHVSSAP 57
+ + LL+ D+QN D F G P+ NI++L + IP+ + Q S P
Sbjct: 29 NRAVLLIHDMQNYFVDAFTAGASPV---TELSANIRKLKNQCVQLGIPVVYTAQPGSQNP 85

Query: 58 KGKA---PFFEEGSVGVEIHPDIIS-ICPDAEIIQ--KQHADSFYQTDLEQALERNGVDE 111
+A F+ G II+ + P+ + + K +F +T+L + + + G D+
Sbjct: 86 DDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQ 145

Query: 112 LLICGMMTQNCVTHTAISKAAEKYNVSIIEDCCTTTDQMIHNIALSAVSIRVPLLASSDV 171
L+I G+ TA E + D H +AL + R +D
Sbjct: 146 LIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCAFTVMTDS 205

Query: 172 LL 173
LL
Sbjct: 206 LL 207


48  VP2023  VP2033  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP2023    -1      19       -4.619359  nucleotide sugar epimerase
VP2024    -2      16       -3.102588  hypothetical protein
VP2025    0       19       -2.317405  hypothetical protein
VP2026    4       26       -1.039578  orotidine 5'-phosphate decarboxylase
VP2027    4       26       -1.167757  tetratricopeptide repeat protein
VP2028    4       31       -0.761572  hypothetical protein
VP2029    2       27       -0.281594  integration host factor subunit beta
VP2030    0       19       -0.159227  30S ribosomal protein S1
VP2031    -2      12       0.146374   cytidylate kinase
VP2032    -3      11       -0.238744  periplasmic protease
VP2033    -2      13       -0.074595  short chain dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2023  NUCEPIMERASE  522  0.0  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 522 bits (1346), Expect = 0.0
Identities = 198/335 (59%), Positives = 251/335 (74%), Gaps = 2/335 (0%)

Query: 1 MKYLVTGAAGFIGSATIRKLNSLGYEVIGIDNINDYYDVELKYARLNFIKNPLFRFFNMD 60
MKYLVTGAAGFIG ++L G++V+GIDN+NDYYDV LK ARL + P F+F +D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 61 ISNKNKNEIERLFEKEKFDRVIHLAAQAGVRYSLVNPHCYAESNLSGFLNVLEACRKSHI 120
++++ + LF F+RV + VRYSL NPH YA+SNL+GFLN+LE CR + I
Sbjct: 61 LADREG--MTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 121 KHFIYASSSSVYGLNKKVPFSTSDNVDHPVSLYAATKKSNELMAHSYSHLYQLPTTGLRF 180
+H +YASSSSVYGLN+K+PFST D+VDHPVSLYAATKK+NELMAH+YSHLY LP TGLRF
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRF 178

Query: 181 FTVYGSWGRPDMAPFIFTEKIINGQSIDINNNGDMWRDFTHINDIVEGIVRISDVIPRIN 240
FTVYG WGRPDMA F FT+ ++ G+SID+ N G M RDFT+I+DI E I+R+ DVIP +
Sbjct: 179 FTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHAD 238

Query: 241 QRWQFENSTPADSSAPYSIYNIGYGSPICLMDFIKAIENELGIEAKKNYREMQPGDVYQT 300
+W E TPA S APY +YNIG SP+ LMD+I+A+E+ LGIEAKKN +QPGDV +T
Sbjct: 239 TQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLET 298

Query: 301 YADTTAFYQATGYRPSVSVEEGIAEFVAWYRNFYN 335
ADT A Y+ G+ P +V++G+ FV WYR+FY
Sbjct: 299 SADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYK 333


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2027  SYCDCHAPRONE  28  0.044  Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 28.0 bits (62), Expect = 0.044
Identities = 19/90 (21%), Positives = 26/90 (28%), Gaps = 11/90 (12%)

Query: 196 GNSNKAIQHFKKALSEDPKCVRASISLGRIYLESEDYKQTIK-YLTGVLEQDKDFVSDVL 254
G A + F+ D R + LG Y I Y G + K+
Sbjct: 50 GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKE------ 103

Query: 255 PT----IAECYHHLGQEDELVEFLRACIDK 280
P AEC G+ E L +
Sbjct: 104 PRFPFHAAECLLQKGELAEAESGLFLAQEL 133


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2029  DNABINDINGHU  119  2e-39  Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 119 bits (301), Expect = 2e-39
Identities = 33/89 (37%), Positives = 57/89 (64%), Gaps = 1/89 (1%)

Query: 2 TKSELIERLCAEQTHLSAKEVEDAVKDILEHMASTLESGDRIEIRGFGSFSLHYREPRVG 61
K +LI ++ AE T L+ K+ AV + ++S L G+++++ GFG+F + R R G
Sbjct: 3 NKQDLIAKV-AEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKG 61

Query: 62 RNPKTGDKVELEGKYVPHFKPGKELRERV 90
RNP+TG++++++ VP FK GK L++ V
Sbjct: 62 RNPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2030  SUBTILISIN  29  0.032  Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 29.4 bits (66), Expect = 0.032
Identities = 12/85 (14%), Positives = 32/85 (37%), Gaps = 11/85 (12%)

Query: 437 ENDPFNAYVADNKKGTLVNATVTAVDAKGATVEIAEGVE--GYIRASEVSRDRVEDASLI 494
+ + N GT V T+ A + + V +A + +V +
Sbjct: 73 DEGDPEIFKDYNGHGTHVAGTIAATENENGVVGVAPEADLLII---------KVLNKQGS 123

Query: 495 LSAGDVVEAKFTGVDRKNRVINLSI 519
+++ + +++K +I++S+
Sbjct: 124 GQYDWIIQGIYYAIEQKVDIISMSL 148


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2033  DHBDHDRGNASE  89  2e-23  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 89.3 bits (221), Expect = 2e-23
Identities = 52/204 (25%), Positives = 95/204 (46%), Gaps = 4/204 (1%)

Query: 9 ALEGKVILVTGAGNGIGRQAALSYAKHGATVILLGRNVKNLESIYDEIEAAGYPQAAIIP 68
+EGK+ +TGA GIG A + A GA + + N + LE + ++A A P
Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF-P 63

Query: 69 LDLKGATKQNYIDMAETIEGQFGQLDGLLHNAGVLSALCPFEQINEEDFDDIMQINVKAE 128
D++ + + ++ IE + G +D L++ AGVL +++E+++ +N
Sbjct: 64 ADVRDSAAID--EITARIEREMGPIDILVNVAGVLR-PGLIHSLSDEEWEATFSVNSTGV 120

Query: 129 VLMTQALLPVMRKSEAGRIIFTSSTVGHSGRAFWGPYAISKFAVEGMMQVLADELSDTPM 188
++++ M +G I+ S R YA SK A + L EL++ +
Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180

Query: 189 RVNAINPGATRTRMREKAYPGEDA 212
R N ++PG+T T M+ + E+
Sbjct: 181 RCNIVSPGSTETDMQWSLWADENG 204


49  VP2213  VP2217  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP2213    -1      19       0.242798   long-chain fatty acid transport protein
VP2214    -1      20       0.250714   VacJ lipoprotein
VP2215    -2      20       0.551771   cytochrome c-type biogenesis protein
VP2216    0       19       0.970662   cytochrome c-type biogenesis protein
VP2217    1       18       0.769179   thiol:disulfide interchange protein DsbE
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2213  OMPADOMAIN  29  0.036  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 29.1 bits (65), Expect = 0.036
Identities = 24/94 (25%), Positives = 36/94 (38%), Gaps = 7/94 (7%)

Query: 290 QLALHASFNWTD-WSSFEKLEAHLETAGTHMVKVENWEDN---YRFAVGATYQLQPKVAL 345
QL + TD + +L + A T D FA G Y + P++A
Sbjct: 99 QLTAKLGYPITDDLDIYTRLGGMVWRADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIAT 158

Query: 346 RTGIAYDTSAVSDKNRTITIPETDRTWLSIGATY 379
R T+ + D + T P D LS+G +Y
Sbjct: 159 RLEY-QWTNNIGDAHTIGTRP--DNGMLSLGVSY 189


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2214  VACJLIPOPROT  269  2e-93  VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 269 bits (690), Expect = 2e-93
Identities = 101/249 (40%), Positives = 144/249 (57%), Gaps = 15/249 (6%)

Query: 12 LLALGLVGCSSAPEEAVTSEGETNQTTSDVYDPLEGFNRTMWEINYEYLDPYLVRPVSIA 71
L LVGC+S+ DPLEGFNRTM+ N+ LDPY+VRPV++A
Sbjct: 10 LGTTLLVGCASSGT-----------DQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVA 58

Query: 72 YVEYTPVPIRSGIANFLSNLDEPSSMVNNLLMGNGSKAVDHFNRFWINSTFGILGVFDIA 131
+ +Y P P R+G++NF NL+EP+ MVN L G+ + + HF RF++N+ G+ G D+A
Sbjct: 59 WRDYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVA 118

Query: 132 TAAGI--TKYDNKEFSSAVGHYGVGNGPYFMIPGYGPYTLR-EVTDTVDGMYLPLSYLNI 188
A + + F S +GHYGVG GPY +P YG +TLR + D D +Y LS+L
Sbjct: 119 GMANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPVLSWLTW 178

Query: 189 WAGLGKWALEGLEKRALLVPQEAQLDSSPDPYVLTRDVYIQRQNFKAEIDTVEEV-NPEE 247
+GKW LEG+E RA L+ + L S DPY++ R+ Y QR +F A ++ NP
Sbjct: 179 PMSVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNA 238

Query: 248 EALLDEYLD 256
+A+ D+ D
Sbjct: 239 QAIQDDLKD 247


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2215  PF00577  29  0.036  Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 29.0 bits (65), Expect = 0.036
Identities = 11/55 (20%), Positives = 21/55 (38%), Gaps = 4/55 (7%)

Query: 313 NAVLIVSIHRADGSPMPVAAARYPLGSFPRTVVLDDGNAMMQGQKLSSLEKLIVR 367
+L+ + P+P A S +V D+G + G + K+ V+
Sbjct: 796 IKLLM--TLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSG--MPLAGKVQVK 846


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2217  SECA  27  0.048  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 27.1 bits (60), Expect = 0.048
Identities = 9/16 (56%), Positives = 12/16 (75%)

Query: 88 YLNELAGKGVKIIGMN 103
YLN L GKGV ++ +N
Sbjct: 117 YLNALTGKGVHVVTVN 132


50  VP2226  VP2261  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP2226    -1      12       2.325457   hypothetical protein
VP2227    -1      13       1.831853   Soj-like protein
VP2228    0       11       1.423189   chemotaxis-specific methylesterase
VP2229    0       10       0.660796   chemotaxis protein CheA
VP2230    -2      11       0.492141   chemotaxis protein CheZ
VP2231    -2      13       1.038351   chemotaxis protein CheY
VP2232    -1      15       1.486700   flagellar biosynthesis sigma factor
VP2233    -1      16       1.812856   flagellar biosynthesis protein FlhG
VP2234    0       19       2.212563   flagellar biosynthesis regulator FlhF
VP2235    2       22       2.440738   flagellar biosynthesis protein FlhA
VP2236    2       23       1.996022   flagellar biosynthesis protein FlhB
VP2237    2       21       1.745560   flagellar biosynthesis protein FliR
VP2238    1       19       1.882146   flagellar biosynthesis protein FliQ
VP2239    1       18       2.493785   flagellar biosynthesis protein FliP
VP2240    2       16       2.440920   polar flagellar assembly protein FliO
VP2241    1       17       3.275172   flagellar motor switch protein
VP2242    1       17       3.159414   flagellar motor switch protein FliM
VP2243    1       17       3.267662   flagellar basal body protein FliL
VP2244    0       16       3.362831   polar flagellar hook-length control protein
VP2245    0       17       3.241562   flagellar biosynthesis chaperone
VP2246    -1      16       3.336050   flagellum-specific ATP synthase
VP2247    -1      14       2.749005   flagellar assembly protein H
VP2248    -1      15       2.576895   flagellar motor switch protein G
VP2249    0       13       2.383799   flagellar MS-ring protein
VP2250    0       12       1.296834   flagellar hook-basal body protein FliE
VP2251    0       11       1.731688   FlaM
VP2252    1       11       1.315970   FlaL
VP2253    1       11       0.759784   polar flagellar protein FlaK
VP2254    1       17       0.723116   flagellar protein FliS
VP2255    2       17       0.189987   polar flagellar rod protein FlaI
VP2256    1       15       0.725497   flagellar capping protein
VP2257    0       19       -0.052295  flagellar protein FlaG
VP2258    -1      18       -0.359198  flagellin
VP2259    -1      15       -0.524633  flagellin
VP2260    0       15       -1.232359  hypothetical protein
VP2261    0       14       -0.511051  flagellin
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2226  TONBPROTEIN  46  7e-08  Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 46.1 bits (109), Expect = 7e-08
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 1/44 (2%)

Query: 36 LLAQELSSVEPEPEPEPEPEPEPEPEPEPEPE-PEPEPEPEPEP 78
+ +V+P PEP EPEPEPEP PEP E P +P+P+P
Sbjct: 53 ADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKP 96



Score = 44.6 bits (105), Expect = 2e-07
Identities = 25/89 (28%), Positives = 40/89 (44%), Gaps = 9/89 (10%)

Query: 25 VEVTDLEHELDLLAQELSSVEPEPEPEPEPEPE---------PEPEPEPEPEPEPEPEPE 75
V DLE + VEPEPEPEP PEP P+P+P+P+P+P + + +
Sbjct: 50 VTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQ 109

Query: 76 PEPELQMQYAESDSYQFRENYAPQLEAPN 104
P+ +++ + S A +
Sbjct: 110 PKRDVKPVESRPASPFENTAPARLTSSTA 138



Score = 44.6 bits (105), Expect = 2e-07
Identities = 17/40 (42%), Positives = 20/40 (50%), Gaps = 1/40 (2%)

Query: 40 ELSSVEPEPEPEPEPEPEPEPEPEPEPEPEPEPEPEPEPE 79
++ V P P+ P P EPEPEPEP PEP E
Sbjct: 46 SVTMVTPADLEPPQAVQPPPE-PVVEPEPEPEPIPEPPKE 84



Score = 43.4 bits (102), Expect = 5e-07
Identities = 17/54 (31%), Positives = 24/54 (44%), Gaps = 1/54 (1%)

Query: 26 EVTDLEHELDLLAQELSSVEPEPEPEPEPEPEPEPEPEPEPEPEPEPEPEPEPE 79
+V +L ++ + + P+ P P EPEPEPEP PEP E
Sbjct: 34 QVIELPAPAQPISVTMVTPADLEPPQAVQPPPE-PVVEPEPEPEPIPEPPKEAP 86


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2228  HTHFIS  66  8e-14  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 65.6 bits (160), Expect = 8e-14
Identities = 32/165 (19%), Positives = 65/165 (39%), Gaps = 8/165 (4%)

Query: 2 AIKVLVVDDSSFFRRRVSEIINSESRLEVIDVAVNGREAVEKAKALKPDVITMDIEMPVM 61
+LV DD + R +++ ++ + + N A D++ D+ MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 62 DGITAVREIMAASP-TPILMFSSLTHDGAKATLDALDAGALDFLPKKF--EDIARNRDEA 118
+ + I A P P+L+ S+ + + A + GA D+LPK F ++ A
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSA--QNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 119 VSLLQQRVIQIASKRAFMRRPVARPAAATSSARPLASRTAAPAAS 163
++ ++R ++ V R +AA + +R +
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGR-SAAMQEIYRVLARLMQTDLT 162


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2229  PF06580  44  2e-06  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 43.7 bits (103), Expect = 2e-06
Identities = 18/104 (17%), Positives = 41/104 (39%), Gaps = 18/104 (17%)

Query: 449 ETDLDKNLVEALADPLI--HLVRNSVDHGIEMPEDRVKAGKSRTGKVILSASQEGDHIEL 506
E ++ +++ P++ LV N + HGI + GK++L +++ + L
Sbjct: 243 ENQINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTL 294

Query: 507 AIVDDGGGMDPNKLRGIAVK--------RGMMDEDAAARLSDKE 542
+ + G N + + +A +LS+K+
Sbjct: 295 EVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQ 338


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2231  HTHFIS  91  1e-24  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 90.7 bits (225), Expect = 1e-24
Identities = 34/121 (28%), Positives = 54/121 (44%), Gaps = 3/121 (2%)

Query: 6 KILIVDDFSTMRRIVKNLLRDLGFNNTQEADDGLTALPMLKKGDFDFVVTDWNMPGMQGI 65
IL+ DD + +R ++ L G++ + T + GD D VVTD MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 66 DLLKHIRADAELKHLPVLMITAEAKREQIIEAAQAGVNGYIVKPFTAATLKEKLEKIFER 125
DLL I+ LPVL+++A+ I+A++ G Y+ KPF L + +
Sbjct: 64 DLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 126 L 126

Sbjct: 122 P 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2234  PF05272  29  0.048  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.048
Identities = 24/99 (24%), Positives = 36/99 (36%), Gaps = 26/99 (26%)

Query: 219 VSGLMWQEVERREPLRAMLIKRLERMGVSPELADQMACYIPEDTKPAR--AWKALLSLVA 276
V W EV R L L+ L G +P+ Y P + + L+ VA
Sbjct: 539 VKAQQWDEVPR---LEKWLVHVL---GKTPD------DYKPRRLRYLQLVGKYILMGHVA 586

Query: 277 DQINIPKQDILKRG----GVVALLGPTGVGKTTTVAKLA 311
+++ G V L G G+GK+T + L
Sbjct: 587 R--------VMEPGCKFDYSVVLEGTGGIGKSTLINTLV 617


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2236  TYPE3IMSPROT  367  e-129  Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 367 bits (945), Expect = e-129
Identities = 114/354 (32%), Positives = 190/354 (53%), Gaps = 14/354 (3%)

Query: 8 ERTEEATPRRLQQAREKGQVARSKELASASVLIVGAIALMWFGESLARSLFSIMSRLFDL 67
E+TE+ TP++++ AR+KGQVA+SKE+ S ++++ + LM L+ F S+L +
Sbjct: 4 EKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLM----GLSDYYFEHFSKLMLI 59

Query: 68 KRDEIFDTTKLFDIALGAMTDLLFPLFLI--LITLFVAATIGAAG---VGGISFSAEAAM 122
++ + F AL + D + F L VAA + A G S EA
Sbjct: 60 PAEQSYLP---FSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIK 116

Query: 123 PKLSKMNPLSGLKRMVGMQSWVELIKSILKVVLVTGVAMYLIQASQADLIQLSMDVYPQN 182
P + K+NP+ G KR+ ++S VE +KSILKVVL++ + +I+ + L+QL +
Sbjct: 117 PDIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQL-PTCGIEC 175

Query: 183 IFHAL-DILLNFILLISCSLLIVVAIDIPFQIWQHADQLKMTKQEVKDEYKETEGKPEVK 241
I L IL +++ + +++ D F+ +Q+ +LKM+K E+K EYKE EG PE+K
Sbjct: 176 ITPLLGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIK 235

Query: 242 GRIRMLQREAAQRRMMADVPQADVIVTNPEHYSVALRYKQKTDRAPVVIAKGTDHMAMKI 301
+ R +E R M +V ++ V+V NP H ++ + YK+ P+V K TD +
Sbjct: 236 SKRRQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTV 295

Query: 302 REVAREHDITIVPAPPLARALYHTTELEQEIPDGLFTAVAQVLAFVFQLKQYRK 355
R++A E + I+ PLARALY ++ IP A A+VL ++ + ++
Sbjct: 296 RKIAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQ 349


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2237  TYPE3IMRPROT  121  1e-35  Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 121 bits (306), Expect = 1e-35
Identities = 81/221 (36%), Positives = 129/221 (58%), Gaps = 2/221 (0%)

Query: 9 LDWIANYFWPYVRISSMLMVMTVTGARFVSPRIRLYLGLAITFAVMPAIPAVPQDIQLLS 68
L W+ YFWP +R+ +++ + R V R++L L + ITFA+ P++PA D+ + S
Sbjct: 10 LSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA--NDVPVFS 67

Query: 69 FRGFMTIAEQMIIGVAMGMVTQFMIQTFVLLGQILGMQSSLGFASMVDPANGQNTPLLGQ 128
F +Q++IG+A+G QF G+I+G+Q L FA+ VDPA+ N P+L +
Sbjct: 68 FFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLAR 127

Query: 129 LFMFLTTMFFLATDGHLKMLQLVVFSFKTLPIGSGTLTAVDFRDMAGWLGIMFKTALSMS 188
+ L + FL +GHL ++ L+V +F TLPIG L + F + ++F L ++
Sbjct: 128 IMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLA 187

Query: 189 LSGIIALLTINLSFGVMTRAAPQLNIFSLGFAFALMVGLLI 229
L I LLT+NL+ G++ R APQL+IF +GF L VG+ +
Sbjct: 188 LPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISL 228


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2238  TYPE3IMQPROT  55  8e-14  Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 54.8 bits (132), Expect = 8e-14
Identities = 26/70 (37%), Positives = 40/70 (57%)

Query: 7 VELFREALWMVLIMVCAIIIPSLLIGLVVAIFQAATSINEQTLSFLPRLIVTLLALMLFG 66
V +AL++VLI+ I + +IGL+V +FQ T + EQTL F +L+ L L L
Sbjct: 5 VFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLS 64

Query: 67 HWMTQMLMEY 76
W ++L+ Y
Sbjct: 65 GWYGEVLLSY 74


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2239  FLGBIOSNFLIP  285  2e-99  Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 285 bits (732), Expect = 2e-99
Identities = 116/229 (50%), Positives = 166/229 (72%), Gaps = 1/229 (0%)

Query: 60 MSVGNGAGIPAFTMTTNPDGSEDYSVTLQILALMTMLGFLPAMVILMTSFTRIVVVMSIL 119
++ A +P T P G + +S+ +Q L +T L F+PA++++MTSFTRI++V +L
Sbjct: 15 ITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMMTSFTRIIIVFGLL 74

Query: 120 RQAMGLQQTPSNQVIIGIALFLTFFVMSPVLNEINDTAIQPYLNEQVTAREAFDAAQVPM 179
R A+G P NQV++G+ALFLTFF+MSPV+++I A QP+ E+++ +EA + P+
Sbjct: 75 RNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKISMQEALEKGAQPL 134

Query: 180 KAFMLKQTRIKDLETFVNMSGE-QVTNPEDVSMAVLIPAFITSELKTAFQIGFMLFLPFL 238
+ FML+QTR DL F ++ + PE V M +L+PA++TSELKTAFQIGF +F+PFL
Sbjct: 135 REFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKTAFQIGFTIFIPFL 194

Query: 239 IIDLVVASVLMAMGMMMLSPMIVSLPFKLMLFVLVDGWNLILSTLAGSF 287
IIDLV+ASVLMA+GMMM+ P ++LPFKLMLFVLVDGW L++ +LA SF
Sbjct: 195 IIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2241  FLGMOTORFLIN  113  1e-35  Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 113 bits (285), Expect = 1e-35
Identities = 63/135 (46%), Positives = 91/135 (67%), Gaps = 8/135 (5%)

Query: 3 PSDDQK--LADEWAAALGEDPSAPSIDVDEVLAAPLEELKDTSRPITDDERRKLDTIMDI 60
PSD+ L D WA AL E + + + + L D S + D +D IMDI
Sbjct: 7 PSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGG-GDVSGAMQD-----IDLIMDI 60

Query: 61 PVTISMEVGRSQISIRNLLQLNQGSVVELDRLAGESLDVLVNGTLIAHGEVVVVNDKFGI 120
PV +++E+GR++++I+ LL+L QGSVV LD LAGE LD+L+NG LIA GEVVVV DK+G+
Sbjct: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120

Query: 121 RLTDVISQTERIKKL 135
R+TD+I+ +ER+++L
Sbjct: 121 RITDIITPSERMRRL 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2242  FLGMOTORFLIM  241  8e-80  Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 241 bits (617), Expect = 8e-80
Identities = 86/327 (26%), Positives = 163/327 (49%), Gaps = 9/327 (2%)

Query: 1 MTDLLSQDEIDALLHGVD--DVDDIDEPLDNDTEGAVSFDFSSQDRIVRGRMPTLELINE 58
MT++LSQDEID LL + D D +DT +DF D+ + +M TL L++E
Sbjct: 1 MTEVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHE 60

Query: 59 RFARHMRISLFNMLRKTAEVSINGVQMMKFGEYQNTLYVPTSLNMVRFRPLKGTALITME 118
FAR SL LR V + V + + E+ ++ P++L ++ PLKG A++ ++
Sbjct: 61 TFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVD 120

Query: 119 ARLVFILVENFFGGDGRFHAKIEGREFTPTERRIIQLLLKIVFEDYKEAWSPVMGVEFEY 178
+ F +++ FGG G+ R+ T E +++ ++ + + +E+W+ V+ +
Sbjct: 121 PSITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRL 178

Query: 179 LDSEVNPSMANIVSPTEVIVVSSFHIEVDGGGGDFHVVMPYSMVEPIRELLDAG--VQSD 236
E NP A IV P+E++V+ + +V G + +PY +EPI L + S
Sbjct: 179 GQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSV 238

Query: 237 KMETDVRWSSALREEIMDCPVNFRVNLLEKDISLRDLMELQPGDIIPIE---MPEHATMF 293
+ + ++ LR+++ ++ + +S+RD++ L+ GDII + + + +
Sbjct: 239 RRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLS 298

Query: 294 IEDLPTYRVKMGRSEDKLAVQVSQEIE 320
I + + + G K+A Q+ + IE
Sbjct: 299 IGNRKKFLCQPGVVGKKIAAQILERIE 325


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2244  FLGHOOKFLIK  48  9e-08  Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 47.5 bits (112), Expect = 9e-08
Identities = 28/120 (23%), Positives = 63/120 (52%)

Query: 509 EQVAEKVQMMMSKNLKNLDIRLDPPELGRMQIRMTMNNDLANVHFTVTNPQARDIIEQTL 568
+ +++ + + + ++ ++RL P +LG +QI + ++++ A + + R +E L
Sbjct: 242 QSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAAL 301

Query: 569 PRLREMLAQQGMQLADSSVQQQSSGQQQSGYAAAEQNGQGTSGRGFSGQSDENFDADVNL 628
P LR LA+ G+QL S++ +S QQ + +Q+ + + +G+ D+ V+L
Sbjct: 302 PVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVSL 361


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2245  FLGFLIJ  38  3e-06  Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 38.3 bits (88), Expect = 3e-06
Identities = 29/142 (20%), Positives = 76/142 (53%)

Query: 2 NNAMEFLLEQTKEREDQAVLALNKARSELEDYYRQVEQIEKYRLDYCQQLVDRGQAGLTA 61
+ A+ L + ++ + A L + R + Q++ + Y+ +Y L AG+T+
Sbjct: 4 HGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAGITS 63

Query: 62 SQYGHLNRFLCQLDETLSKQKQAEHHFKEQVENCKDYWLKMRQERMSYEWMIEKKAKEKQ 121
+++ + +F+ L++ +++ +Q + + ++V+ + W + +Q +++ + E+++
Sbjct: 64 NRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQSTAAL 123

Query: 122 IAEAKREQKQMDEFSTLLFSRK 143
+AE + +QK+MDEF+ RK
Sbjct: 124 LAENRLDQKKMDEFAQRAAMRK 145


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2247  FLGFLIH  64  2e-14  Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 64.4 bits (156), Expect = 2e-14
Identities = 46/208 (22%), Positives = 99/208 (47%), Gaps = 7/208 (3%)

Query: 47 SWVPNFDEPEEEQALELTEEQIELIKQG--AYQEGLYQGQEAGFKQGYDKGKEEGLQAGH 104
+W P+ P + + + + E + +I++ + ++ L Q Q +QGY G EG Q GH
Sbjct: 9 TWTPDDLAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGH 68

Query: 105 AEGLELGKAEGVSAGQEFIQ-QQVEV---FMNLANQFAQPLELMNAQVEKQLVDMVLCLV 160
+G + G A+G+ G + QQ + L ++F L+ +++ + +L+ M L
Sbjct: 69 KQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAA 128

Query: 161 KEVVHVEVQTNPQIILDTVKQSVEALPISGHPITLHLNPEDVAIIRSAYGEEDLDCRNWT 220
++V+ + ++ ++Q ++ P+ L ++P+D+ + G L W
Sbjct: 129 RQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT-LSLHGWR 187

Query: 221 LVSEPSLNRGDVQIEAGESSINYRMEER 248
L +P+L+ G ++ A E ++ + R
Sbjct: 188 LRGDPTLHPGGCKVSADEGDLDASVATR 215


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2248  FLGMOTORFLIG  290  1e-98  Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 290 bits (743), Expect = 1e-98
Identities = 108/330 (32%), Positives = 199/330 (60%)

Query: 20 DASTITGEEKAAILLLSLNEQDAAGIIRHLEPKQVQRVGSAMARAKDLSQEKVSAVHRTF 79
D S +TG++KAAILL+S+ + ++ + ++L ++++ + +A+ + ++ E V F
Sbjct: 11 DVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEF 70

Query: 80 LEDIQKYTNIGMGSEDFMRNALVAALGEDKANNLVDQILLGTGSKGLDSLKWMDPRQVAS 139
E + I G D+ R L +LG KA ++++ + S+ + ++ DP + +
Sbjct: 71 KELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILN 130

Query: 140 IIVNEHPQIQTIVLSYLEADQSAEILSQFPERVRLDLMMRIANLEEVQPSALAELNEIME 199
I EHPQ ++LSYL+ +++ ILS P V+ ++ RIA ++ P + E+ ++E
Sbjct: 131 FIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLE 190

Query: 200 KQFAGQAGAQAAKIGGLKAAAEIMNYLDNNVEGILMEQIRDQDEDMATQIQDLMFVFENL 259
K+ A + GG+ EI+N D E ++E + ++D ++A +I+ MFVFE++
Sbjct: 191 KKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDI 250

Query: 260 VEVDDQGIQKLLRDVPQDVLQKALKGADDSLREKVFKNMSKRAAEMMRDDIEAMPPVRVA 319
V +DD+ IQ++LR++ L KALK D ++EK+FKNMSKRAA M+++D+E + P R
Sbjct: 251 VLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRK 310

Query: 320 DVEAAQKEILAIARRMADAGEIMLSGGADE 349
DVE +Q++I+++ R++ + GEI++S G +E
Sbjct: 311 DVEESQQKIVSLIRKLEEQGEIVISRGGEE 340


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2249  FLGMRINGFLIF  276  1e-87  Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 276 bits (706), Expect = 1e-87
Identities = 143/557 (25%), Positives = 254/557 (45%), Gaps = 40/557 (7%)

Query: 47 GDLDLLRQVVLVLSISICVALIVMLFFWVKEPEMRPL-GAYDTEELIPVLDYLDQQKINY 105
L ++ L+++ S VA++V + W K P+ R L ++ ++ L Q I Y
Sbjct: 17 NRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPY 76

Query: 106 KL--DGNTVSVESSEYNSIKLGMVRSGVNQATEAGDDILLQDMGFGVSQRLEQERLKLSR 163
+ + V + + + ++L + + G+ + G + LL FG+SQ EQ + +
Sbjct: 77 RFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFE-LLDQEKFGISQFSEQVNYQRAL 135

Query: 164 ERQLAQAIEEMKQVRKARVLLALPKHSVFVRHNQEASASVFLTLSTGANLKQQEVDSIVD 223
E +LA+ IE + V+ ARV LA+PK S+FVR + SASV +TL G L + ++ ++V
Sbjct: 136 EGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVH 195

Query: 224 MVASAVPGMKTSRITVTDQHGRLLSSGSQDPASAARRKEQELERSQEQALREKIDSVLLP 283
+V+SAV G+ +T+ DQ G LL+ + + + E ++ +I+++L P
Sbjct: 196 LVSSAVAGLPPGNVTLVDQSGHLLTQSN-TSGRDLNDAQLKFANDVESRIQRRIEAILSP 254

Query: 284 ILGFGNYTAQVDIQMDFSAVEQTRKRFDPNTPSTRSEYALEDYNNGNMVA-----GIPGA 338
I+G GN AQV Q+DF+ EQT + + PN ++++ N V G+PGA
Sbjct: 255 IVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGA 314

Query: 339 LSNQPPADASIP-----------QDVAQ---MKDGSVMGQGSVRKESTRNFELDTTISHE 384
LSNQP P Q+ Q + + G S ++ T N+E+D TI H
Sbjct: 315 LSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRHT 374

Query: 385 RKQTGTVARQTVSVAIKDRRQVNPDTGEVTYTPMSEGEINAIRQVLIGTVGFDQSRGDLL 444
+ G + R +V+V + + + P++ ++ I + +GF RGD L
Sbjct: 375 KMNVGDIERLSVAVVVNYKTLA-----DGKPLPLTADQMKQIEDLTREAMGFSDKRGDTL 429

Query: 445 NVLSVKFAEPETEQLVDQPIWEHPNFNDWVRWFASALVIIVVVLVLVRPAMKKLLNPAGD 504
NV++ F+ + + P W+ +F D + L+++VV +L R K + P
Sbjct: 430 NVVNSPFSAVD-NTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWR----KAVRPQLT 484

Query: 505 DDDEMYGPDGLPIGADGETSLIGSDIESSELFEFGSSIDLPN--LHKDEDVLKAVRALVA 562
E E + L + E + + +R +
Sbjct: 485 RRVEEAKAAQEQAQVRQE----TEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSD 540

Query: 563 NEPELAAQVVKNWMNEN 579
N+P + A V++ WM+ +
Sbjct: 541 NDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2250  FLGHOOKFLIE  60  3e-15  Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 59.7 bits (144), Expect = 3e-15
Identities = 28/101 (27%), Positives = 55/101 (54%), Gaps = 3/101 (2%)

Query: 3 VDGIQAEMRAMMVEATNTTPTGTGAKVGADFNDLLTKAINNVNSLQKSSGDLQTRFDRGD 62
++G+ ++++A + A F L A++ ++ Q ++ +F G+
Sbjct: 6 IEGVISQLQATAMSARAQESLPQPT---ISFAGQLHAALDRISDTQTAARTQAEKFTLGE 62

Query: 63 ADVSLSDVMIARNKSSVAFEATVQIRNKLVEAYKDLMNMPV 103
V+L+DVM K+SV+ + +Q+RNKLV AY+++M+M V
Sbjct: 63 PGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VP2251  HTHFIS  489  e-173  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 489 bits (1261), Expect = e-173
Identities = 172/484 (35%), Positives = 265/484 (54%), Gaps = 20/484 (4%)

Query: 1 MAQSKVLIVEDDEGLREALVDTLALAGYEWLEADSAEDALVKLKSNAVDIVVSDVQMAGM 60
M + +L+ +DD +R L L+ AGY+ +A + + D+VV+DV M
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 GGLALLRNIKQNWPNLPVLLMTAYANIEDAVSAMKDGAIDYMAKPFAPEVLLNMVSR--- 117
LL IK+ P+LPVL+M+A A+ A + GA DY+ KPF L+ ++ R
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 118 ------YAPIKSDDNGDAVVADEKSLR-LLALADKVARTDANVMILGPSGSGKEVMSRYI 170
+G +V +++ + + ++ +TD +MI G SG+GKE+++R +
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 171 HKASNRKDGPFVAINCAAIPDNMLEATLFGYEKGAFTGAVQACPGKFEQAQGGTILLDEI 230
H R++GPFVAIN AAIP +++E+ LFG+EKGAFTGA G+FEQA+GGT+ LDEI
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 231 SEMDLNLQAKLLRVLQEREVERLGSRKSIKLDVRVLATSNRDLKQYVSEGNFREDLYYRL 290
+M ++ Q +LLRVLQ+ E +G R I+ DVR++A +N+DLKQ +++G FREDLYYRL
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 291 NVFPIAWPALNERKGDIAPLAKHLAERHCSKMGMPVPQFSPVAVEKLLQYPWPGNVRELD 350
NV P+ P L +R DI L +H ++ K G+ V +F A+E + +PWPGNVREL+
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 351 NVVQRALILSENGDIGAEHILLEGVDWQDANSLQYV-----VQSAETLVPDVKPVAQAES 405
N+V+R L I E I E + ++ S V + A
Sbjct: 360 NLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419

Query: 406 VNRVNTGGEGLGGELRDQEYAIILETLVECNGRRKEMAEKLGISPRTLRYKLAKMRDAGI 465
+ + G L + EY +IL L G + + A+ LG++ TLR K+ ++ G+
Sbjct: 420 GDALPPSGL-YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL---GV 475

Query: 466 DIPS 469
+
Sbjct: 476 SVYR 479


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VP2252    PF06580  31     0.007    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.0 bits (70), Expect = 0.007
Identities = 19/126 (15%), Positives = 47/126 (37%), Gaps = 24/126 (19%)

Query: 216 IDYFLEVEEEQTELLGNANAIASALSNLVMNAIQ--MSGKES--QIDIFFRPVNGELRIS 271
+ + ++ + + + LV N I+ ++ +I + NG + +
Sbjct: 240 LQFENQINPA----IMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLE 295

Query: 272 VQDSGPGVPQELQAKIMEPFFTTRSQGTGLGLAVVQMVCRA---HDGRLELISEQGDGAC 328
V+++G + + + TG GL V+ + + +++L +QG
Sbjct: 296 VENTGSLALKNTK------------ESTGTGLQNVRERLQMLYGTEAQIKLSEKQG-KVN 342

Query: 329 FTMCIP 334
+ IP
Sbjct: 343 AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VP2253    HTHFIS  523    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 523 bits (1349), Expect = 0.0
Identities = 178/497 (35%), Positives = 268/497 (53%), Gaps = 31/497 (6%)

Query: 1 MQGLAKLLVIDDDASSRLNLSNILEFVGESCEAVGSEQLGDVDWSSVWSGCIVGNIS-AG 59
M G A +LV DDDA+ R L+ L G + ++ +V ++
Sbjct: 1 MTG-ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 60 RAATAVMARLNDAY-HIPLLVLGSFPLPVDDLPNFVGELEQ--------PLNYPQLSEAL 110
A ++ R+ A +P+LV+ + + E+ P + +L +
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQN----TFMTAIKASEKGAYDYLPKPFDLTELIGII 115

Query: 111 RHCKDFLGRKGVNVVASARKNTLFRSLVGQSRGIQEVRHLIEQVSGTEANVLILGESGTG 170
R+ + ++ LVG+S +QE+ ++ ++ T+ ++I GESGTG
Sbjct: 116 GRALAEPKRRPSKLEDDSQD---GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTG 172

Query: 171 KEVVARNIHYHSSYRNGAFVPINCGAIPPELLESELFGHEKGAFTGALTARKGRFELADG 230
KE+VAR +H + RNG FV IN AIP +L+ESELFGHEKGAFTGA T GRFE A+G
Sbjct: 173 KELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEG 232

Query: 231 GTIFLDEIGDMPMSMQVKLLRVLQERCFERVGGNSTIKVNVRVVAATHRNLESMIEEGTF 290
GT+FLDEIGDMPM Q +LLRVLQ+ + VGG + I+ +VR+VAAT+++L+ I +G F
Sbjct: 233 GTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLF 292

Query: 291 REDLFYRLNVFPIEMPALKERKQDIPLLLQELMTRLEAEGGQPICFTPRAINSLMEHHWP 350
REDL+YRLNV P+ +P L++R +DIP L++ + + E EG F A+ + H WP
Sbjct: 293 REDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWP 352

Query: 351 GNVRELANLVERMIILYPNSLVDVNHLPTKYRYSDIPEFQPEGNPFTSIEEQERDVFQDI 410
GNVREL NLV R+ LYP ++ + + R S+IP+ E S ++
Sbjct: 353 GNVRELENLVRRLTALYPQDVITREIIENELR-SEIPDSPIEKAAARSGSLSISQAVEEN 411

Query: 411 FSEDFSFDEQSDLDHNMNAPQALPPEGVNLKELLADLEVNMISQALEAQGGVVARAADML 470
+ F+ + ALPP G+ +LA++E +I AL A G +AAD+L
Sbjct: 412 MRQYFA-----------SFGDALPPSGL-YDRVLAEMEYPLILAALTATRGNQIKAADLL 459

Query: 471 GMRRTTLVEKMRKYNLQ 487
G+ R TL +K+R+ +
Sbjct: 460 GLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VP2256    IGASERPTASE  33     0.003    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.5 bits (76), Expect = 0.003
Identities = 46/260 (17%), Positives = 83/260 (31%), Gaps = 17/260 (6%)

Query: 244 NPLPPEAQKAADNA--QDDAQDDASQEPISAAGAEAAKAGQEAIDKANQRSSLRPEERIP 301
N P +A + ++ + E A A + N + + E+
Sbjct: 996 NITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKN- 1054

Query: 302 GWTETASGTLLDSYEEPELELDEKAIEKAPDVPGWNNAASGTLTDSYVTTKEAKQLLEQE 361
+ A+ T + E + ++A N A T E K+ E
Sbjct: 1055 --EQDATETTAQNRE-----VAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVE 1107

Query: 362 KAEIEQKIADEKQELDAKVERGELSEEQAKQIHRAKLDPQERERLEKIDEAEAKIAKAQS 421
K E + K+ EK + KV S+ KQ + PQ E K ++Q+
Sbjct: 1108 KEE-KAKVETEKTQEVPKVT----SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQT 1162

Query: 422 SFEEYLGMTEVQAGQDSEVLLDGVAKLSSHNNVIEDAIEGVDLTLKGKSEPNKPPAEIGV 481
+ + + E + +++ N+V+E+ + N +
Sbjct: 1163 NTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPEN--TTPATTQPTVNSESSNKPK 1220

Query: 482 EYDRQSVRSDIENFVSAYNS 501
R+SVRS N A S
Sbjct: 1221 NRHRRSVRSVPHNVEPATTS 1240


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VP2258    FLAGELLIN  182    5e-55    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 182 bits (463), Expect = 5e-55
Identities = 85/295 (28%), Positives = 134/295 (45%), Gaps = 8/295 (2%)

Query: 2 AINVNTNVSAMTAQRYLNHAAEGQQKSMERLSSGYKINSAKDDAAGLQISNRLNAQSRGL 61
A +NTN ++ Q LN + ++ERLSSG +INSAKDDAAG I+NR + +GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DMAVKNANDGISIAQVAEGAMNESTNILQRMRDLSLQSANGSNSKAERVAIQEEVTALND 121
A +NANDGISIAQ EGA+NE N LQR+R+LS+Q+ NG+NS ++ +IQ+E+ +
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELNRIAETTSFGGNKLLNGTYGTQSFQIGADSGEAVMLSMGSLRSDTSAMGGKSYSAEEG 181
E++R++ T F G K+L+ Q+GA+ GE + + + + + + G + +
Sbjct: 121 EIDRVSNQTQFNGVKVLSQD-NQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNG--- 176

Query: 182 KDASWTVGDKTELKMSYTNKQGEEKELTIKAKQGDDIEQLATYINGQSEDVKASVGEDGK 241
+LK S+ N G + K D+ A + + V V +
Sbjct: 177 ----PKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAA 232

Query: 242 LQVFASTQKVNGEVEFSGNLAGEIGFGDAKDVTVKDIDVTTVAGSQEAVAVIDGA 296
+ N I + + V
Sbjct: 233 NGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTI 287



Score = 125 bits (314), Expect = 6e-34
Identities = 76/338 (22%), Positives = 127/338 (37%), Gaps = 24/338 (7%)

Query: 57 QSRGLDMAVKNANDGISIAQVAEGAMNESTNILQRMRDLSLQSANGSNSKAERVAIQEEV 116
+ + V + + + + ++ + + + ++V +
Sbjct: 174 VNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNA-A 232

Query: 117 TALNDELNRIAETTSFGGNKLLNGTYGTQSFQIGADSGEAVMLSMGSLRSDTSAMGGKSY 176
+ T + ++ I + T + K+
Sbjct: 233 NGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTG 292

Query: 177 SAEEGKDASWTVGDKTELKMSYTNKQGEEKELTIKAKQGDDIEQL-ATYINGQSEDVKAS 235
+ GK ++ G K++ T + A + + + +NGQ +
Sbjct: 293 NDGNGKVSTTING----EKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKT 348

Query: 236 VGEDGKLQVFASTQKVNGEVEFSGNLAGEIGFGDAKDVTV------------------KD 277
E KL + V GE + + N A VT+ +
Sbjct: 349 KNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINE 408

Query: 278 IDVTTVAGSQEAVAVIDGALKSVDSQRASLGAFQNRFNHAISNLDNINENVNASNSRIKD 337
+ +A ID AL VD+ R+SLGA QNRF+ AI+NL N N+N++ SRI+D
Sbjct: 409 DAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIED 468

Query: 338 TDYAKETTAMTKSQILQQASTSILAQAKQSPSAALSLL 375
DYA E + M+K+QILQQA TS+LAQA Q P LSLL
Sbjct: 469 ADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VP2259    FLAGELLIN  201    6e-62    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 201 bits (511), Expect = 6e-62
Identities = 90/297 (30%), Positives = 144/297 (48%), Gaps = 1/297 (0%)

Query: 2 AVNVNTNVSAMTAQRYLNNANSAQQTSMERLSSGFKINSAKDDAAGLQISNRLNVQSRGL 61
A +NTN ++ Q LN + S+ +++ERLSSG +INSAKDDAAG I+NR +GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVAVRNANDGISIAQTAEGAMNETTNILQRMRDLSLQSANGSNSKAERVAIQEEVTALND 121
A RNANDGISIAQT EGA+NE N LQR+R+LS+Q+ NG+NS ++ +IQ+E+ +
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELNRIAETTSFGGNKLLNGTHGAKSFQIGADNGEAVMLELKDMRSDNKMMGGVSYQAESG 181
E++R++ T F G K+L+ Q+GA++GE + ++L+ + + + G +
Sbjct: 121 EIDRVSNQTQFNGVKVLSQD-NQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 KGKDWNVAQGKNDLKISLTDSFGQEQEININAKAGDDIEELATYINGQTDLVKASVDQDG 241
+ KN + +++N+ A T +
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 242 KLQIFAGNNKVEGEVSFSGGLSGELGLGDDKKNVTVDTIDVTSVGGAQESVAIIDAA 298
+ + + S +G + G K DT D V ++ D
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGN 296



Score = 135 bits (342), Expect = 1e-37
Identities = 82/377 (21%), Positives = 144/377 (38%), Gaps = 20/377 (5%)

Query: 19 NNANSAQQTSMERLSSGFKINSAKDDAAGLQISNRLNVQSRGLDVAVRNANDGISIAQTA 78
N Q + ++ G L + ++ + +
Sbjct: 132 NGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKN 191

Query: 79 EGAMNETTNILQRMRDLSLQSANGSNSKAERVAIQEEVTALNDELNRIAETTSFGGNKLL 138
+ + R A +++ A V + V A N +L + +
Sbjct: 192 VTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFK 251

Query: 139 NGTHGAKSFQIGADNGEAVMLELKDMRSDNKMMGGVSYQAESGKGKDWNVAQGKNDLKIS 198
A + + A G + D + ++G + V+ N K++
Sbjct: 252 TTKSTAGTAEAKAIAGAIKGGKEGDTFDYKG--VTFTIDTKTGNDGNGKVSTTINGEKVT 309

Query: 199 LTDSFGQEQEININAKAGDDIEEL-ATYINGQTDLVKASVDQDGKLQIFAGNNKVEGEVS 257
LT + N++A + + + +NGQ + ++ KL NN V+GE
Sbjct: 310 LTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESK 369

Query: 258 FSGGLSGELGLGDDKK-----------------NVTVDTIDVTSVGGAQESVAIIDAALK 300
+ + K + ++ + +A ID+AL
Sbjct: 370 ITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALS 429

Query: 301 YVDSHRAELGAFQNRFNHAISNLDNINENVNASKSRIKDTDFAKETTAMTKSQILSQASS 360
VD+ R+ LGA QNRF+ AI+NL N N+N+++SRI+D D+A E + M+K+QIL QA +
Sbjct: 430 KVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGT 489

Query: 361 SILAQAKQAPNSALSLL 377
S+LAQA Q P + LSLL
Sbjct: 490 SVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VP2261    FLAGELLIN  199    3e-61    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 199 bits (506), Expect = 3e-61
Identities = 91/297 (30%), Positives = 142/297 (47%), Gaps = 2/297 (0%)

Query: 2 AITVNTNVAALVAQRHLTSATDMLNQSMERLSSGKRINSAKDDAAGLQISNRLQSQMSGL 61
A +NTN +L+ Q +L + L+ ++ERLSSG RINSAKDDAAG I+NR S + GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVAVRNANDGISIMQTAEGAMNEVTNIMQRMRDLSLQSANGSNSQVERTALQEEVTALND 121
A RNANDGISI QT EGA+NE+ N +QR+R+LS+Q+ NG+NS + ++Q+E+ +
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELNRIAETTSFGGRKLLNGAFGKSSFQIGAASGEAVQIELKSMRTDGLEMGGFSYVAQGR 181
E++R++ T F G K+L+ Q+GA GE + I+L+ + L + GF+
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQM-KIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 ADSDWQVKENANDLTMSFINRSGETEKIQINAKSGDDIEELATYINGQTDKVTASVNEKG 241
A N ++ +N+ + T +
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 242 QLQIFMAGEDTAGTISFSGDL-ASELGMSLKGYDAVNNLNITTVGGAQQAVAVLDTA 297
+ A + T S +G A + ++KG + + V D
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGN 296



Score = 126 bits (318), Expect = 2e-34
Identities = 59/213 (27%), Positives = 90/213 (42%), Gaps = 19/213 (8%)

Query: 183 DSDWQVKENANDLTMSFINRSGETEKIQINAKSGDDIEEL-ATYINGQTDKVTASVNEKG 241
D + +V N ++ ++A + + + + +NGQ + NE
Sbjct: 294 DGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESA 353

Query: 242 QLQIFMAGEDTAGTISFSGDLASELGMSLKGYDAV------------------NNLNITT 283
+L A G + + A + + N
Sbjct: 354 KLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAA 413

Query: 284 VGGAQQAVAVLDTAMKFVDSQRAELGAYQNRFNHAINNLDNIHENLAASNSRIQDTDYAK 343
+A +D+A+ VD+ R+ LGA QNRF+ AI NL N NL ++ SRI+D DYA
Sbjct: 414 KKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYAT 473

Query: 344 ETTQMVKQQILQQVSTTILAQAKQAPNLALTLL 376
E + M K QILQQ T++LAQA Q P L+LL
Sbjct: 474 EVSNMSKAQILQQAGTSVLAQANQVPQNVLSLL 506


51  VP2467  VP2473  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP2467    -3      11       -0.499900  outer membrane protein OmpU
VP2468    -1      12       -0.275844  D-alanyl-D-alanine
VP2469    -2      11       0.137403   hypothetical protein
VP2470    -2      9        0.487864   tyrosyl-tRNA synthetase
VP2471    -2      11       0.205053   hypothetical protein
VP2472    -2      9        0.186821   multidrug resistance protein
VP2473    -3      13       0.086037   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VP2467    ECOLIPORIN  104    2e-27    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 104 bits (260), Expect = 2e-27
Identities = 104/399 (26%), Positives = 157/399 (39%), Gaps = 78/399 (19%)

Query: 1 MKKTLIALSVSAAAMATGVNAAELYNQDGTSLEMGGRAEARLSMKDGDAQ--DNSRIRLN 58
MK+ ++AL + A A +AAE+YN+DG L++ G+ + D ++ D + +R+
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60

Query: 59 FLGTQAINDNLYGVGFWEGEFTTNEQGGVDGDVNKDSSNLDTRYAYAGLG-GAWGEFTYG 117
F G IND L G G WE N G + +N TR A+AGL G +G F YG
Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEG-------EGANSWTRLAFAGLKFGDYGSFDYG 113

Query: 118 KNEGALGVITDFTDIMAYHGNSA--------------------ADKLAVADRSDNMMSYK 157
+N G L + +TD++ G + D + D + + Y+
Sbjct: 114 RNYGVLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQ 173

Query: 158 GQFENLSVKASYRFADRKLNDAGTEYTDNGQDGYSLSAIYAVADTGLELGAGYADQDEAN 217
G+ E+ S + + N Y DNG DG+ +S Y + G GA Y D N
Sbjct: 174 GKNESQSADDVNIGTNNRNNGDDIRY-DNG-DGFGISTTYDI-GMGFSAGAAYTTSDRTN 230

Query: 218 E----------------YMLAASYTMGDLYFAGIFTDGEKAKTEGDYTGYELAGAYTLGQ 261
E + Y ++Y A ++++ G G Q
Sbjct: 231 EQVNAGGTIAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQ 290

Query: 262 TVFTT-----------------------TYNNAETNNETSANNFAVDASYYFKPNFRGYV 298
T TYNN +++ V A+YYF NF YV
Sbjct: 291 NFEVTAQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYV 350

Query: 299 SYNFNLIDSGDKLGKVGGNTTASKADAEDELALGLRYDF 337
Y NL+D D K G +T +D +ALG+ Y F
Sbjct: 351 DYKINLLDDDDPFYKDAGIST------DDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VP2468    BLACTAMASEA  32     0.004    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 32.1 bits (73), Expect = 0.004
Identities = 26/98 (26%), Positives = 39/98 (39%), Gaps = 12/98 (12%)

Query: 7 LLISSLIFPLLSYAYPHQEVLPE--------GARISLVAEKLTESSTLDGIRPTDQLFPP 58
L I SL+ L + + L + R+ ++ L TL R D+ FP
Sbjct: 6 LCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRA-DERFPM 64

Query: 59 ASTLKIVTALA--AKLELGDSFAFRTKLETSSSDAVIY 94
ST K+V A A+++ GD K+ D V Y
Sbjct: 65 MSTFKVVLCGAVLARVDAGDEQ-LERKIHYRQQDLVDY 101


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2472    ACRIFLAVINRP  510    e-166    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 510 bits (1315), Expect = e-166
Identities = 220/1045 (21%), Positives = 435/1045 (41%), Gaps = 54/1045 (5%)

Query: 8 ALSRSRTMLTLLVMILIAGVITYVTIPKESSPDITIPIIYVSVGHQGISPTDAERLLVRP 67
+ R L +++++AG + + +P P I P + VS + G + + +
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 68 IEQELRSIEGVKEMTSVA-SEGHASVTLEFSVGVDLDKAMADVRDAVDLAKPKLPADSDE 126
IEQ + I+ + M+S + S G ++TL F G D D A V++ + LA P LP + +
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 127 PTVNEVTFASEEPVLTVVLYGTVPERTIVQI----ARTLRDRLESFRQILEVDIAGDRED 182
++ V +S ++ P T I A ++D L + +V + G +
Sbjct: 125 QGIS-VEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG-AQY 182

Query: 183 IVEIVVDPLLMESYGLDQADIYNLIALNNRVVAAGFVDTG------YGRFSVKVPSVFDS 236
+ I +D L+ Y L D+ N + + N +AAG + S+ + F +
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 237 LKDVLELPIKVNGK-EVITFGDVATVRRAFRDPESFARLDGEPAIVLDVKKRAGENIIET 295
++ ++ ++VN V+ DVA V + AR++G+PA L +K G N ++T
Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302

Query: 296 VALVKEVIAQGQLRAEWPSNLQVKYTWDQSDDVKLMLSDLQNNILSAIILVVIVIIAILG 355
+K +A+ L+ +P ++V Y +D + V+L + ++ + AI+LV +V+ L
Sbjct: 303 AKAIKAKLAE--LQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 356 -VRTALLVGISIPGSFLTGLLVLSVFGLTVNIVVLFALIMAVGMLVDGAIVVTEFADRRM 414
+R L+ I++P L +L+ FG ++N + +F +++A+G+LVD AIVV E +R M
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 415 QE-GTPRKEAYRDAAKRMAWPITASTATTLAAFAPLLFWPDITGEFMKYLPLTLIATLAA 473
E P KEA + ++ + A F P+ F+ TG + +T+++ +A
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 474 SLVMALLFVPVIGGIIGKPQVINPKAQQEMVELHNGNFEKATGITKLYYKTLFVAIQHPW 533
S+++AL+ P + + KP + + Y ++ +
Sbjct: 481 SVLVALILTPALCATLLKPVS--AEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 534 KVLLSAVLLAGGVGFTYSKAGLGAEFFPEVDPPFFTVKVRSYGDLSINEKDIVMREIEKV 593
+ LL L+ G+ + + L + F PE D F ++ + V+ ++
Sbjct: 539 RYLLIYALIVAGMVVLFLR--LPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDY 596

Query: 594 MLGH--DEFESVYTRTG--SSDNGDEIGQIQITPVDWQYR-RKVKTIIEELKATTDQFYG 648
L + ESV+T G S G ++ W+ R + + +
Sbjct: 597 YLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGK 656

Query: 649 VELEYKFPDAGPPVE-------HDLVIEVSSRSRSIGELDDAAKLVRLWADSNPALTNLS 701
+ + P P + D + + +L+ + A +L ++
Sbjct: 657 IRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716

Query: 702 DTTNKVGIDWQIDIRRDDASRFAADATLVGNTVQFVTNGLKIGDYLPDDADEEVDILVRY 761
+ +++++ ++ A + + T+ G + D++ + V+
Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDR--GRVKKLYVQA 774

Query: 762 PQEKR-DIGRFDQLRVKTAAG-LVPITNFAQIKPDHKQDTIRRVDGHRVINIKADMVEGY 819
+ R D+L V++A G +VP + F + + R +G + I+ + G
Sbjct: 775 DAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGT 834

Query: 820 NLALELPKIAAEMEQLGLPEGVEYKIRGQNEEQENSSAFLQNAFVVALGVMALILITQFN 879
+ + + + LP G+ Y G + ++ S ++ V+ L L +
Sbjct: 835 SSGDAMALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYE 892

Query: 880 SFYQAFLILSAVLFSTVGVFAGLLIFQKPFGIIMSGIGVIALAGIVVNNNIVLIDTYNQ- 938
S+ ++ V VGV +F + + +G++ G+ N I++++
Sbjct: 893 SWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVY-FMVGLLTTIGLSAKNAILIVEFAKDL 951

Query: 939 MRKRGLDKAEAILRTGVQRLRPVLLTTITTILGLLPMVLEMNIDLVNQKVEFGAPSTQWW 998
M K G EA L RLRP+L+T++ ILG+LP+ + GA S
Sbjct: 952 MEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAI-----------SNGAGSGAQN 1000

Query: 999 SQLATAVAGGLAFATVLTLVLTPCL 1023
+ + V GG+ AT+L + P
Sbjct: 1001 A-VGIGVMGGMVSATLLAIFFVPVF 1024


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VP2473    RTXTOXIND  41     5e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.0 bits (96), Expect = 5e-06
Identities = 35/226 (15%), Positives = 79/226 (34%), Gaps = 28/226 (12%)

Query: 53 LAKVMFDTFTAKPTSKTIELYGRTAPNRQARLGAEVAGKIVSLSINKGQ------LVKQG 106
L K F T+ + K + L + A + + A + + K + L+ +
Sbjct: 190 LIKEQFSTWQNQKYQKELNLDKKRA--ERLTVLARINRYENLSRVEKSRLDDFSSLLHKQ 247

Query: 107 QVIANIDKRDLDSQLKRAQAMLRVKEKEFNAAKS------LKSRGLQGEV------AFAT 154
IA + +++ A LRV + + +S + + +
Sbjct: 248 -AIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQ 306

Query: 155 AEAALVDARANLNNVQTALKNTEVKAPFDGIVDHHFV-EVGDFVGVGDPIATVI-DLETL 212
+ L + + + ++AP V V G V + + ++ + +TL
Sbjct: 307 TTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTL 366

Query: 213 VIEADVSERHIQYLKEGLQADVR--TINGQHHLGTLRYIGRVSSVS 256
+ A V + I ++ G A ++ + G L G+V +++
Sbjct: 367 EVTALVQNKDIGFINVGQNAIIKVEAFPYTRY-GYLV--GKVKNIN 409



Score = 30.6 bits (69), Expect = 0.010
Identities = 24/104 (23%), Positives = 36/104 (34%), Gaps = 19/104 (18%)

Query: 177 EVKAPFDGIVDHHFVEVGDFVGVGDPIATVIDLETLVIEADVSERHIQYLKEGLQADVRT 236
E+K + IV V+ G+ V GD + L L EAD + L+ L+
Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLK---LTALGAEADTLKTQSSLLQARLEQ---- 150

Query: 237 INGQHHLGTLRYIGRVSSVSTNTFPIEIEIDNRNSLIPAGISAE 280
RY S+ N P E+ + +S E
Sbjct: 151 ---------TRYQILSRSIELNKLP---ELKLPDEPYFQNVSEE 182


52  VP2513  VP2526  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP2513    0       23       1.022202   sulfate permease
VP2514    5       41       0.662320   carbonic anhydrase
VP2515    5       39       0.859964   hypoxanthine-guanine phosphoribosyltransferase
VP2516    5       37       0.937260   OpaR protein
VP2517    4       36       1.111768   dihydrolipoamide dehydrogenase
VP2518    4       32       1.181812   dihydrolipoamide acetyltransferase
VP2519    2       19       0.033064   pyruvate dehydrogenase subunit E1
VP2520    -1      16       -1.051571  transcriptional regulator PdhR
VP2521    0       18       -1.442775  N-acetyl-anhydromuranmyl-L-alanine amidase
VP2522    -1      19       -1.551574  quinolinate phosphoribosyltransferase
VP2523    0       20       -2.259795  type IV pilin PilA
VP2524    1       21       -2.355592  type IV pilin assembly protein PilB
VP2525    0       18       -2.526358  type IV pilin biogenesis protein PilC
VP2526    1       20       -2.478264  type IV prepilin-like proteins leader peptide
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VP2513    MECHCHANNEL  29     0.021    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 29.0 bits (65), Expect = 0.021
Identities = 27/107 (25%), Positives = 40/107 (37%), Gaps = 7/107 (6%)

Query: 335 LTEPIPMAVLAGIAVYVGFNILDWSFIQRAHKVSFSGMAIMYGVMLLTVFVDLIVAVGLG 394
L I M L + + F + + M YGV + VF LIVA +
Sbjct: 36 LVADIIMPPLGLLIGGIDFKQFAVTLRDAQGDIPAVVMH--YGVFIQNVFDFLIVAFAIF 93

Query: 395 VFVSNIMIIERLSREQARQVKAISDADEDDVPLTDSERGLLDRANGR 441
+ I +I +L+R++ A A + L R LL N R
Sbjct: 94 MA---IKLINKLNRKKEEPAAA--PAPTKEEVLLTEIRDLLKEQNNR 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VP2516    HTHTETR  65     4e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 64.6 bits (157), Expect = 4e-15
Identities = 28/208 (13%), Positives = 71/208 (34%), Gaps = 20/208 (9%)

Query: 4 IAKRPRTRLSPLKRKQQLMEIALEVFARRGIGRGGHADIAEIAQVSVATVFNYFPTREDL 63
+A++ + +Q ++++AL +F+++G+ +IA+ A V+ ++ +F + DL
Sbjct: 1 MARKTKQEAQE--TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDL 58

Query: 64 VDEVLNHVVRQFSNFLSDNIDLDIHARENIANITNAMIELVSQDCH------WLKVWFEW 117
E+ + + ++ + +I ++ +++ F
Sbjct: 59 FSEIWELSESNIGELELEYQAKF--PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHK 116

Query: 118 SASTRDEVWPLFVSTNRTNQLLVQNMFI----KAIERGEVCDQHDSEHLANLFHGICYSL 173
+ + R L + IE + + A + G L
Sbjct: 117 CEFVGE--MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL 174

Query: 174 FVQANRFKGEAELKE----LVSAYLDML 197
+LK+ V+ L+M
Sbjct: 175 MENWLFAPQSFDLKKEARDYVAILLEMY 202


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VP2518    RTXTOXIND  38     8e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 38.3 bits (89), Expect = 8e-05
Identities = 41/281 (14%), Positives = 90/281 (32%), Gaps = 35/281 (12%)

Query: 26 DKVEEEQSLITVEGDKASMEVPASQAGIVKEIKVAEGDKVSTGSLIMIFE---AEGAADA 82
+ V +T G S E+ + IVKEI V EG+ V G +++ AE
Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138

Query: 83 APAPAAEAAPAAAPAPAAAAELKEVHVPDIG------------GDEVEVTEIMVAIGDSI 130
+ +A + ++ +P++ + + +T ++ +
Sbjct: 139 TQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTW 198

Query: 131 EEEQSLITVEGDKASMEVPAPFAGTLKEIKVAAGDKVSTGSLIMVFETAGSGAPAAPAAV 190
+ ++ + DK E A + ++ +K + + A A
Sbjct: 199 QNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLH-KQAIAKHAVLEQ 257

Query: 191 EAPAAAAPAASAAKEVNVPDIGGDEVEV----------------TEIMVAVGDTVEEEQS 234
E A + + I + + ++ +
Sbjct: 258 ENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLE 317

Query: 235 LITVEGDKASMEVPAPFAGTVKEIKIAA-GDKVSTGSLIMV 274
L E + + + AP + V+++K+ G V+T +MV
Sbjct: 318 LAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VP2520    PF08280  29     0.022    M protein trans-acting positive regulator
		>PF08280#M protein trans-acting positive regulator

Length = 530

Score = 29.0 bits (65), Expect = 0.022
Identities = 27/116 (23%), Positives = 43/116 (37%), Gaps = 11/116 (9%)

Query: 82 FSDPLLNLLSSHSETQLDLLESRHAMEGISAYFAALRGTEEDFARIQGCLERISKEQANN 141
F L + +T L L+ H E ++A+F I SKE
Sbjct: 54 FKTSSLPITEVAEKTGLTFLQLNHYCEELNAFFPDSLSMTIQKRMISCQFTHPSKETYLY 113

Query: 142 DIEAESAEVMQFL-IAITEAAHNVVL------LHIVRSLAPLLEQNI---LQNFKL 187
+ A S V+Q L I +H+ L + S A + + + L+NF+L
Sbjct: 114 QLYASS-NVLQLLAFLIKNGSHSRPLTDFARSHFLSNSSAYRMREALIPLLRNFEL 168


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2523    BCTERIALGSPG  50     1e-10    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 49.9 bits (119), Expect = 1e-10
Identities = 19/62 (30%), Positives = 36/62 (58%)

Query: 5 KQKKQQGFTLIELMIVVAIIGVLAAAAIPAYQNYVTRSEVTSGLATVKALITPAELHYQE 64
KQ+GFTL+E+M+V+ IIGVLA+ +P +++ ++ + AL +++ +
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62

Query: 65 NG 66
N
Sbjct: 63 NH 64


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2525    BCTERIALGSPF  435    e-154    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 435 bits (1121), Expect = e-154
Identities = 105/407 (25%), Positives = 213/407 (52%), Gaps = 9/407 (2%)

Query: 8 LKNFRWKGVNSSGKKTSGQTLAMSEIEVRERLDAQHI--------KIKKLKKSSISFLTK 59
+ + ++ +++ GKK G A S + R+ L + + + + K S +
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 60 LSHRVKGKDITVFTRQISTMLVTGVPLVQALKLVSDNHKKAEMKSILMSVTRAVEAGTPM 119
R+ D+ + TRQ++T++ +PL +AL V+ +K + ++ +V V G +
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 120 SKAMRTASNHFDPLYTDLIATGEQSGNLAEVFERLATYREKNEQLRAKVIKALIYPAMVI 179
+ AM+ F+ LY ++A GE SG+L V RLA Y E+ +Q+R+++ +A+IYP ++
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 180 LVALGVSFIMLTKVIPEFEKMFVGFGAELPWFTRQVLDLSAWTQNWSPFIALGSISVFIS 239
+VA+ V I+L+ V+P+ + F+ LP TR ++ +S + + P++ L ++ F++
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 240 ARVLSKRSDSFRLMLNRSVLKFPVLGAVLSKAAIAKFSRTLATSFTAGIPILTSLKTTSK 299
RV+ R + R+ +R +L P++G + A+++RTL+ + +P+L +++ +
Sbjct: 241 FRVM-LRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGD 299

Query: 300 TSGNMHYQLAIEEVYRDTAAGMPMYVAMRNCNVFPELVLQMVMIGEESGRLDDMLNKVAT 359
N + + + G+ ++ A+ +FP ++ M+ GE SG LD ML + A
Sbjct: 300 VMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAAD 359

Query: 360 IYEFEVDNTVDNLSKILEPLIIVFLGIVVGGLVTAMYLPIFNLMSVL 406
+ E + + + EPL++V + VV +V A+ PI L +++
Sbjct: 360 NQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2526    PREPILNPTASE  359    e-128    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 359 bits (924), Expect = e-128
Identities = 155/287 (54%), Positives = 195/287 (67%), Gaps = 1/287 (0%)

Query: 1 MEVFQYYPWLFVVFASIFGLIVGSFLNVVIYRLPKIMELEWRRECAESFPEYKIKPPQEV 60
+E+ PWL+ +F L++GSFLNVVI+RLP ++E EW+ E F +
Sbjct: 5 LELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEPP 64

Query: 61 LTLSVPRSSCQNCATPIRIRDNIPVISWLLLKGKCHHCHTAISPRYPLIELLTAACAGFV 120
L VPRS C +C PI +NIP++SWL L+G+C C IS RYPL+ELLTA + V
Sbjct: 65 YNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVAV 124

Query: 121 AYHFGFSYFTVALIFFTFFLIAATFIDLDTMLLPDQLTLPLTWAGIALALTEISPVSLQD 180
A + T+A + T+ L+A TFIDLD MLLPDQLTLPL W G+ L VSL D
Sbjct: 125 AMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLG-GFVSLGD 183

Query: 181 AVIGAIAGYLCLWSVYWGFKLLTGKEGMGYGDFKLLAALGAWLGWQSLPMIILLSSVVGV 240
AVIGA+AGYL LWS+YW FKLLTGKEGMGYGDFKLLAALGAWLGWQ+LP+++LLSS+VG
Sbjct: 184 AVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGA 243

Query: 241 IFGLVQLRLQKQGIERAFPFGPYLAIAGWVSLIWGDQILSWYFTSIL 287
G+ + L+ + PFGPYLAIAGW++L+WGD I WY T+ L
Sbjct: 244 FMGIGLILLRNHHQSKPIPFGPYLAIAGWIALLWGDSITRWYLTNFL 290


53  VP2691  VP2704  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP2691    2       19       -2.127958  rod shape-determining protein MreB
VP2692    2       22       -2.906932  hypothetical protein
VP2693    4       22       -2.109715  MshP protein
VP2694    2       21       -2.038837  type IV prepilin, MshO
VP2695    -2      18       -1.498241  MSHA pilin protein MshD
VP2696    -3      14       0.143909   MSHA pilin protein MshC
VP2697    -4      14       0.496364   MSHA pilin protein MshA
VP2698    -4      13       0.485188   V10 pilin
VP2699    -2      14       0.854960   MSHA biogenesis protein MshF
VP2700    -2      13       0.900582   MSHA biogenesis protein MshG
VP2701    -3      13       0.724873   MSHA biogenesis protein MshE
VP2702    -1      15       0.008167   MSHA biogenesis protein MshN
VP2703    -1      15       -0.434886  MSHA biogenesis protein MshM
VP2704    -1      14       -0.053779  MSHA biogenesis protein MshL
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2691    SHAPEPROTEIN  567    0.0      Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 567 bits (1463), Expect = 0.0
Identities = 318/347 (91%), Positives = 334/347 (96%)

Query: 1 MFKKLRGMFSNDLSIDLGTANTLIYVKGQGIVLDEPSVVAIRQDRVGSAKSVAAVGHAAK 60
M KK RGMFSNDLSIDLGTANTLIYVKGQGIVL+EPSVVAIRQDR GS KSVAAVGH AK
Sbjct: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAK 60

Query: 61 QMLGRTPGNISAIRPMKDGVIADFYVTEKMLQHFIKQVHDNSILKPSPRVLVCVPCGSTQ 120
QMLGRTPGNI+AIRPMKDGVIADF+VTEKMLQHFIKQVH NS ++PSPRVLVCVP G+TQ
Sbjct: 61 QMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQ 120

Query: 121 VERRAIRESALGAGAREVYLIDEPMAAAIGAGLRVSEPTGSMVVDIGGGTTEVAVISLNG 180
VERRAIRESA GAGAREV+LI+EPMAAAIGAGL VSE TGSMVVDIGGGTTEVAVISLNG
Sbjct: 121 VERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNG 180

Query: 181 VVYSSSVRIGGDRFDEAVINYVRRNYGSLIGEATAEKIKHEIGSAYPGDEVQEIEVRGRN 240
VVYSSSVRIGGDRFDEA+INYVRRNYGSLIGEATAE+IKHEIGSAYPGDEV+EIEVRGRN
Sbjct: 181 VVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRN 240

Query: 241 LAEGVPRSFSLNSNEILEALQEPLSGIVSAVMVALEQCPPELASDISENGMVLTGGGALL 300
LAEGVPR F+LNSNEILEALQEPL+GIVSAVMVALEQCPPELASDISE GMVLTGGGALL
Sbjct: 241 LAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALL 300

Query: 301 KDLDRLLMEETGIPVVIAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347
++LDRLLMEETGIPVV+AEDPLTCVARGGGKALEMIDMHGGDLFSEE
Sbjct: 301 RNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2694    BCTERIALGSPG  32     8e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 32.2 bits (73), Expect = 8e-04
Identities = 17/50 (34%), Positives = 27/50 (54%), Gaps = 6/50 (12%)

Query: 2 KTRGFTLMEMIVTIVIGSFIMLGI-AGYVQLGMKGYADTIDRQRMQTQAQ 50
K RGFTL+E++V IVI +G+ A V + G + D+Q+ +
Sbjct: 6 KQRGFTLLEIMVVIVI-----IGVLASLVVPNLMGNKEKADKQKAVSDIV 50


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2696    BCTERIALGSPG  35     4e-05    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 34.9 bits (80), Expect = 4e-05
Identities = 11/27 (40%), Positives = 19/27 (70%)

Query: 9 GFTLMELILVIVLLSILSLYAASRFMG 35
GFTL+E+++VIV++ +L+ MG
Sbjct: 9 GFTLLEIMVVIVIIGVLASLVVPNLMG 35


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2697    BCTERIALGSPG  35     6e-06    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 35.2 bits (81), Expect = 6e-06
Identities = 11/35 (31%), Positives = 21/35 (60%)

Query: 1 MVHMRKSSGLSAIEFVVVLIILGVVGYVVLPRFIS 35
M K G + +E +VV++I+GV+ +V+P +
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMG 35


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2698    BCTERIALGSPG  51     6e-11    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 51.0 bits (122), Expect = 6e-11
Identities = 20/54 (37%), Positives = 32/54 (59%), Gaps = 4/54 (7%)

Query: 1 MKRQGGFTLIELVVVIVILGILAVTAAPRFLNLQSDARE----SALQGLKGAID 50
+Q GFTL+E++VVIVI+G+LA P + + A + S + L+ A+D
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALD 57


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2700    BCTERIALGSPF  278    2e-92    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 278 bits (712), Expect = 2e-92
Identities = 114/399 (28%), Positives = 197/399 (49%), Gaps = 2/399 (0%)

Query: 1 MPTFRYQGRTLDGSSTSGKVDAVNSEAAAEALMNKGIIPLNLRLEKEGVKNHVSLSKLLV 60
M + YQ G G +A ++ A + L +G++PL++ + + S L
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 61 PAIPLEV--IILFSRQLFSLTKAGVPLLRSMRGLLQNCENKQLKEALEDVVSELSNGRGL 118
I L + L +RQL +L A +PL ++ + + E L + + V S++ G L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 119 SSAMQPHNKVFSPLFVSMINVGENTGRLDEALLQLANYYEQELETRKRIKAAMRYPTFVI 178
+ AM+ F L+ +M+ GE +G LD L +LA+Y EQ + R RI+ AM YP +
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 179 VFITIAMFILNILVIPEFASMFTRFGVELPLPTRILIATSNFFVHYWGLLIAAMVGAFFV 238
V + IL +V+P+ F LPL TR+L+ S+ + ++ A++ F
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 239 FKAWVATAGGREKFDKFRLRLPIVGDIVNRAQLSRFARTFSLMLKSGVPLNQSLALAGEA 298
F+ + R F + L LP++G I +R+ART S++ S VPL Q++ ++G+
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 299 LGNRFLENRILEMKAAIEAGSTISVTAINSNIFTPLVIQMIAVGEETGRIDELLLEVSDF 358
+ N + +R+ A+ G ++ + +F P++ MIA GE +G +D +L +D
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 359 YDREVDYDLKTLTARIEPLLLVIVAGMVMVLALGIFLPM 397
DRE + EPLL+V +A +V+ + L I P+
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPI 399



Score = 67.9 bits (166), Expect = 1e-14
Identities = 31/126 (24%), Positives = 57/126 (45%), Gaps = 1/126 (0%)

Query: 71 FSRQLFSLTKAGVPLLRSMRGLLQNCENKQLKEALEDVVSELSNGRGLSSAMQPHNKVFS 130
++R L L + VPLL++MR N + L + G L A++ +F
Sbjct: 276 YARTLSILNASAVPLLQAMRISGDVMSNDYARHRLSLATDAVREGVSLHKALE-QTALFP 334

Query: 131 PLFVSMINVGENTGRLDEALLQLANYYEQELETRKRIKAAMRYPTFVIVFITIAMFILNI 190
P+ MI GE +G LD L + A+ ++E ++ + + P V+ + +FI+
Sbjct: 335 PMMRHMIASGERSGELDSMLERAADNQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLA 394

Query: 191 LVIPEF 196
++ P
Sbjct: 395 ILQPIL 400


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2704    BCTERIALGSPD  182    5e-52    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 182 bits (462), Expect = 5e-52
Identities = 76/314 (24%), Positives = 138/314 (43%), Gaps = 26/314 (8%)

Query: 222 QQAVANLIGSGKGQSVVVTPQAGVITVRAFPDDIREVREFLGVSQERMQRQVILEAKILE 281
+QA + K + Q + V A PD + ++ + + + QV++EA I E
Sbjct: 297 KQAAKPVAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIA-QLDIRRPQVLVEAIIAE 355

Query: 282 VTLSDGYQQGINWSNLSASIGN--SGSIIVNRPASALPPLDAIGTLLGGQTN-------- 331
V +DG GI W+N +A + + + ++ + + GT+ +
Sbjct: 356 VQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGI 415

Query: 332 -VTISDGNFEAVLNFMSTQGDLNVLSSPRITAANNQKSVIKVGTDQYFVTELSSNAGNGE 390
GN+ +L +S+ ++L++P I +N ++ VG + +T S +G+
Sbjct: 416 AAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLT--GSQTTSGD 473

Query: 391 NSNAVPEVELTPFFSGISLDVTPQIDNKGNVFLHVHPAVIEVTEEVKQLNLGGDFQNIQL 450
N E + GI L V PQI+ +V L + V V + +
Sbjct: 474 NIFNTVERKTV----GIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLG------ 523

Query: 451 PLAKSSIRESDSVIRAKDGDVVVIGGLMKQQNVEQVSKVPFLGDVPALGHLFRNTSNVTQ 510
A + R ++ + G+ VV+GGL+ + + KVP LGD+P +G LFR+TS
Sbjct: 524 --ATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVS 581

Query: 511 KTELVILLKPTVVG 524
K L++ ++PTV+
Sbjct: 582 KRNLMLFIRPTVIR 595



Score = 34.9 bits (80), Expect = 8e-04
Identities = 21/90 (23%), Positives = 41/90 (45%), Gaps = 6/90 (6%)

Query: 68 SKRFRIQANAVEARSFFASLVKGTEYSVAIHPAVQGNITVNLSDVT----LDEVLSVVQN 123
++ F + + F ++ K +V I P+V+G ITV D+ + V +
Sbjct: 27 AEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEEQYYQFFLSVLD 86

Query: 124 MYGYDVMKSGK-VIQVYPA-GMRTVTIPVD 151
+YG+ V+ V++V + +T +PV
Sbjct: 87 VYGFAVINMNNGVLKVVRSKDAKTAAVPVA 116


54  VP2880  VP2887  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VP2880    -1      29       1.326296   acetyl-CoA carboxylase biotin carboxyl carrier
VP2881    -1      18       1.738369   acetyl-CoA carboxylase biotin carboxylase
VP2882    1       15       0.450391   hypothetical protein
VP2883    0       13       -0.477371  ribosomal protein L11 methyltransferase
VP2884    0       18       -1.128346  NifR3/Smm1 family protein
VP2885    0       13       -1.493486  DNA-binding protein Fis
VP2886    0       14       -0.891564  MFS transporter
VP2887    1       19       -1.953590  ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VP2880    RTXTOXIND  32     0.001    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.7 bits (72), Expect = 0.001
Identities = 7/27 (25%), Positives = 16/27 (59%)

Query: 123 IEADKSGVVTAILVEDGQPVEFDQPLV 149
I+ ++ +V I+V++G+ V L+
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLL 125


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VP2885    DNABINDNGFIS  148    3e-50    DNA-binding protein FIS signature.
		>DNABINDNGFIS#DNA-binding protein FIS signature.

Length = 98

Score = 148 bits (374), Expect = 3e-50
Identities = 81/98 (82%), Positives = 89/98 (90%)

Query: 1 MFEQNLTSEALTVTTVTSQDQITQKPLRDSVKASLKNYLAQLNGQEVTELYELVLAEVEQ 60
MFEQ + S+ LTV+TV SQDQ+TQKPLRDSVK +LKNY AQLNGQ+V +LYELVLAEVEQ
Sbjct: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60

Query: 61 PLLDTIMQYTRGNQTRAATMMGINRGTLRKKLKKYGMN 98
PLLD +MQYTRGNQTRAA MMGINRGTLRKKLKKYGMN
Sbjct: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VP2886    TCRTETA  77     1e-17    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 76.8 bits (189), Expect = 1e-17
Identities = 79/348 (22%), Positives = 135/348 (38%), Gaps = 23/348 (6%)

Query: 41 LFITAYSLAIAFCAPYLGRWSDRVGRLRLMLPACFVFGISSILTGLVGQFEWALVTRVVT 100
+ + Y+L CAP LG SDR GR ++L + + + + R+V
Sbjct: 47 ILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVA 106

Query: 101 GIASAGMLPIAFALAGDAKGGRSMRQIVMVQAGLTLGMITSPAIGALITQLLSWRAAFII 160
GI A +A G R + A GM+ P +G L+ S A F
Sbjct: 107 GITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFA 165

Query: 161 LGMAALAVGALVWGC--MREPHDQNERLMTL--PPVIEAFRLPGALGAIAAMCFGLGGGI 216
AAL + GC + E H R + + +FR + +AA+ + +
Sbjct: 166 --AAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMA-VFFIM 222

Query: 217 GVFNLIGLNM-----RDVAGLSISWIGMMYAALGIVSVLGN-LLTTRASHRLGSGRAVMR 270
+ + + D + IG+ AA GI+ L ++T + RLG RA
Sbjct: 223 QLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRA--- 279

Query: 271 IALMVCMPCSIWVFACGASQFVAYLPILAIWSLAGGIGSPALQAYIA-SLSDEYRGVLMS 329
+ L + + ++ A++ PI+ + +GGIG PALQA ++ + +E +G L
Sbjct: 280 LMLGMIADGTGYILLAFATRGWMAFPIM-VLLASGGIGMPALQAMLSRQVDEERQGQLQG 338

Query: 330 SAMSMMHFGVAIWSALAGFAYDVGSE----WVAVLAVLLFGFAIAALK 373
S ++ + L Y W + L+ + AL+
Sbjct: 339 SLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALR 386


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VP2887    PF05272  32     0.009    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.6 bits (71), Expect = 0.009
Identities = 12/21 (57%), Positives = 17/21 (80%)

Query: 349 LVIEGFNGVGKSTLLKTLMGE 369
+V+EG G+GKSTL+ TL+G
Sbjct: 599 VVLEGTGGIGKSTLINTLVGL 619
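
The Score and Identities lines that precede each alignment above follow a regular pattern, so they can be pulled out of the page text programmatically. The following is a minimal Python sketch, not part of PredictBias itself: the function name parse_hsp_stats and the dictionary keys are hypothetical, and the 0.05 cutoff is the E-value (RPSBlast) setting from the input parameters, with the comparison rule (less than or equal) assumed.

```python
import re

# Patterns for the two statistics lines that precede each alignment.
SCORE_RE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = ([\d.eE+-]+)")
IDENT_RE = re.compile(r"Identities = (\d+)/(\d+) \((\d+)%\)")

# E-value (RPSBlast) from the input parameters; how the tool applies it is assumed.
EVALUE_CUTOFF = 0.05


def parse_hsp_stats(text):
    """Return one dict per Score line, with the following Identities line attached."""
    hsps = []
    for line in text.splitlines():
        m = SCORE_RE.search(line)
        if m:
            hsps.append({
                "bits": float(m.group(1)),
                "raw": int(m.group(2)),
                "evalue": float(m.group(3)),
            })
            continue
        m = IDENT_RE.search(line)
        if m and hsps:
            # Attach identity counts to the most recent Score line.
            hsps[-1]["identical"] = int(m.group(1))
            hsps[-1]["aligned"] = int(m.group(2))
    return hsps


# Statistics copied from the VP2887 hit above.
sample = """Score = 31.6 bits (71), Expect = 0.009
Identities = 12/21 (57%), Positives = 17/21 (80%)"""

hsps = parse_hsp_stats(sample)
for h in hsps:
    h["passes"] = h["evalue"] <= EVALUE_CUTOFF
    print(h)
```

Entries with several Score lines, such as VP2700 and VP2704, yield one dictionary per Score line in order of appearance.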



 
Contact Sachin Pundhir for Bugs/Comments.