PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: prueba.gb
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
 
 
C) Potential GIs and PAIs in AE003852
S.No  Start    End      Bias  Virulence  Insertion elements  Prediction
1     VC_0021  VC_0043  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VC_0021   2       17       1.305102    glycyl-tRNA synthetase, alpha chain
VC_0022   0       17       2.304296    conserved hypothetical protein
VC_0023   -1      20       4.223960    NADH dehydrogenase subunit II-related protein
VC_0024   -1      21       4.629693    conserved hypothetical protein
VC_0025   0       21       4.439260    hypothetical protein
VC_0026   1       22       5.206679    zinc-binding alcohol dehydrogenase
VC_0027   1       19       5.406216    threonine dehydratase
VC_0028   1       17       5.236991    dihydroxy-acid dehydratase
VC_0029   1       16       4.104272    branched-chain amino acid aminotransferase
VC_0030   0       16       3.383566    acetolactate synthase II, small subunit
VC_0031   -1      17       3.082867    acetolactate synthase II, large subunit
VC_0032   -2      17       1.526471    ComM-related protein
VC_0033   -1      17       0.321549    conserved hypothetical protein
VC_0034   -1      16       0.558023    thiol:disulfide interchange protein
VC_0035   -2      15       0.693359    conserved hypothetical protein
VC_0036   -1      16       0.773127    FixG-related protein
VC_0037   -2      17       1.579956    conserved hypothetical protein
VC_0038   -3      17       2.620906    hypothetical protein
VC_0039   -2      15       3.309433    SpoOM-related protein
VC_0040   -1      16       3.338064    hemolysin, putative
VC_0041   -2      17       3.502061    conserved hypothetical protein
VC_0042   -2      17       3.387164    potassium uptake protein TrkH
VC_0043   -2      13       3.410713    potassium uptake protein TrkA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0024   PF01206       96     3e-30    SirA family protein
>PF01206#SirA family protein

Length = 76

Score = 96.0 bits (239), Expect = 3e-30
Identities = 27/71 (38%), Positives = 42/71 (59%)

Query: 10 HTLEAEGLRCPEPVMMVRKTIRTMLDGEVLLVTADDPSTTRDIPSFCRFMDHQLLGAQID 69
+L+A GL CP P++ +KT+ TM GEVL V A DP + +D SF + H+LL + +
Sbjct: 6 QSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKEE 65

Query: 70 QLPYQYLIKKG 80
Y + +K+
Sbjct: 66 DGTYHFRLKRA 76
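
The percentage printed after each fraction in the BLAST statistics lines can be checked by hand. On this page the values appear to be truncated rather than rounded (for example, 24/37 is about 64.9% but is reported as 64%). The Python sketch below parses two statistics lines copied from this report and recomputes the percentages. The helper names `parse_stats` and `truncated_percent` are illustrative and are not part of PredictBias or BLAST, and the truncation rule is an assumption inferred from the numbers shown here.

```python
import re

# Matches fragments such as "Identities = 27/71 (38%)".
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Map each statistic name to (count, alignment_length, reported_percent)."""
    return {
        name: (int(count), int(length), int(pct))
        for name, count, length, pct in STAT_RE.findall(line)
    }


def truncated_percent(count, length):
    """Integer percentage with the fractional part dropped (assumed behaviour)."""
    return (100 * count) // length


# Two statistics lines copied verbatim from the hits for VC_0024 and VC_0035.
LINES = [
    "Identities = 27/71 (38%), Positives = 42/71 (59%)",
    "Identities = 13/37 (35%), Positives = 24/37 (64%), Gaps = 4/37 (10%)",
]

for line in LINES:
    for name, (count, length, reported) in parse_stats(line).items():
        recomputed = truncated_percent(count, length)
        status = "ok" if recomputed == reported else "mismatch"
        print(f"{name}: {count}/{length} -> {recomputed}% (reported {reported}%) {status}")
```

Rounding instead of truncating would give 65% for 24/37, which does not match the report, so the floor-based check is the one used here.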


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
VC_0026   NUCEPIMERASE  31     0.007    Nucleotide sugar epimerase signature.
>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 30.9 bits (70), Expect = 0.007
Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 151 ILVTGASGGVGSVAVTLLAQLGYKVVAV 178
LVTGA+G +G L + G++VV +
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGI 30


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
VC_0032   HTHFIS        37     2e-04    FIS bacterial regulatory protein HTH signature.
>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 36.7 bits (85), Expect = 2e-04
Identities = 37/178 (20%), Positives = 61/178 (34%), Gaps = 51/178 (28%)

Query: 189 RDLQDIIGQ----QQGKRALEIAAAGNHNLLFLGPPGTGKTMLASRLCDLLPEMSDEEAM 244
+D ++G+ Q+ R L + L+ G GTGK ++A L D
Sbjct: 134 QDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGK-------- 185

Query: 245 ETASIASLTQQEINQHNWKLRPFRAPH-----HSSSMAALVG-------GGTIPRPGEIS 292
+ PF A + + L G G G
Sbjct: 186 -----------------RRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFE 228

Query: 293 LAHNGLLFLDEM----PEFERKVLDSLREPLESGEIVISRAQGKTRFPARFQLVGALN 346
A G LFLDE+ + + ++L L++ GE + G+T + ++V A N
Sbjct: 229 QAEGGTLFLDEIGDMPMDAQTRLLRVLQQ----GE--YTTVGGRTPIRSDVRIVAATN 280


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
VC_0035   MALTOSEBP     29     0.022    Maltose binding protein signature.
>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 29.3 bits (65), Expect = 0.022
Identities = 13/37 (35%), Positives = 24/37 (64%), Gaps = 4/37 (10%)

Query: 25 PDLIWYALESIGIRAESGFL----PLNSYENRVYQFT 57
PD+I++A + G A+SG L P ++++++Y FT
Sbjct: 83 PDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFT 119


2     VC_0123  VC_0129  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VC_0123   0       18       3.081498    cyaY protein
VC_0125   -2      22       3.465918    diaminopimelate decarboxylase
VC_0126   -1      24       4.020099    diaminopimelate epimerase
VC_0127   -2      22       4.113370    conserved hypothetical protein
VC_0128   -3      21       3.947032    integrase/recombinase XerC
VC_0129   -3      20       3.451027    conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
VC_0123   MALTOSEBP     26     0.024    Maltose binding protein signature.
>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 26.2 bits (57), Expect = 0.024
Identities = 13/40 (32%), Positives = 21/40 (52%)

Query: 45 SQIIINRQEPMREIWLASKSGGYHFKSIDGEWICSKTGLE 84
S ++ N QEP L + GGY FK +G++ G++
Sbjct: 171 SALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVD 210


3     VC_0451  VC_0469  Y     N          N                   Genomic Island
LocusTag  DNBias  CDNBias  %GCBias     Product
VC_0451   2       20       3.147618    conserved hypothetical protein
VC_0452   2       21       3.131919    A/G-specific adenine glycosylase
VC_0453   1       23       3.261712    conserved hypothetical protein
VC_0454   1       23       3.224413    glutaminase family protein
VC_0455   2       25       3.971797    oxygen-independent coproporphyrinogen III
VC_0456   3       22       3.444239    HAM1 protein
VC_0457   1       18       2.482494    hypothetical protein
VC_0458   3       17       2.777069    conserved hypothetical protein
VC_0459   3       18       2.738495    conserved hypothetical protein
VC_0460   4       19       2.632382    pyrroline-5-carboxylate reductase
VC_0461   4       20       2.124376    conserved hypothetical protein
VC_0462   3       22       2.341061    twitching motility protein PilT
VC_0463   2       21       2.475822    twitching motility protein PilT
VC_0464   0       19       2.586708    transcriptional regulator, LuxR family
VC_0465   0       18       2.309784    tyrosyl-tRNA synthetase
VC_0466   -1      17       2.254523    conserved hypothetical protein
VC_0467   -2      16       2.093623    conserved hypothetical protein
VC_0468   -2      22       3.064761    glutathione synthetase
VC_0469   -1      20       3.325769    conserved hypothetical protein
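
Comparing the summary rows for islands 1 to 3 suggests how the Prediction column relates to the Bias and Virulence flags: regions with both flags set are labelled "Pathogenicity Island (biased-composition)", while the region with compositional bias but no virulence hits is labelled "Genomic Island". The Python sketch below encodes only that observed pattern. It is an inference from the rows on this page, not PredictBias's documented decision procedure, and the function name `predict_label` is hypothetical.

```python
def predict_label(bias: bool, virulence: bool) -> str:
    """Label a region from its flags, following the pattern seen in this report."""
    if bias and virulence:
        return "Pathogenicity Island (biased-composition)"
    if bias:
        return "Genomic Island"
    # No row in this report covers regions without compositional bias.
    return "undetermined"


# (S.No, Bias, Virulence, Insertion elements, Prediction) for islands 1 to 3.
ROWS = [
    (1, "Y", "Y", "N", "Pathogenicity Island (biased-composition)"),
    (2, "Y", "Y", "Y", "Pathogenicity Island (biased-composition)"),
    (3, "Y", "N", "N", "Genomic Island"),
]

for number, bias, virulence, _insertion, reported in ROWS:
    label = predict_label(bias == "Y", virulence == "Y")
    print(number, label, "matches" if label == reported else "differs")
```

In the rows shown, the Insertion elements flag does not change the label, so the sketch ignores it.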
4     VC_0479  VC_0515  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VC_0479   0       18       -3.071491   hypothetical protein
VC_0480   1       20       -2.072306   conserved hypothetical protein
VC_0481   3       17       -2.062882   LysE/YggA family protein
VC_0482   1       17       -0.907316   chromosome initiation inhibitor
VC_0483   0       17       -0.298038   conserved hypothetical protein
VC_0484   -2      14       -0.054862   hypothetical protein
VC_0485   0       15       -2.337381   pyruvate kinase I
VC_0486   1       16       -3.399575   transcriptional regulator, DeoR family
VC_0487   2       20       -4.458999   glucosamine--fructose-6-phosphate
VC_0488   5       26       -6.403944   extracellular solute-binding protein, putative
VC_0489   8       31       -7.538401   hemolysin, putative
VC_0490   12      34       -9.304069   conserved hypothetical protein
VC_0491   11      29       -8.217272   hypothetical protein
VC_0492   10      28       -7.265687   hypothetical protein
VC_0493   9       26       -5.381089   hypothetical protein
VC_0494   10      28       -4.616948   conserved hypothetical protein
VC_0495   7       25       -4.528903   conserved hypothetical protein
VC_0496   5       26       -4.724720   hypothetical protein
VC_0497   4       27       -3.686238   transcriptional regulator
VC_0498   3       24       -2.991958   ribonuclease HI, putative
VC_0502   3       24       -3.216066   type IV pilin, putative
VC_0503   3       21       -2.427415   conserved hypothetical protein
VC_0504   5       23       -1.183397   hypothetical protein
VC_0505   4       25       -0.875691   hypothetical protein
VC_0506   4       22       -1.918024   hypothetical protein
VC_0507   5       25       -3.624044   hypothetical protein
VC_0508   4       30       -7.208836   hypothetical protein
VC_0509   5       37       -9.711169   hypothetical protein
VC_0510   6       41       -11.237815  DNA repair protein RadC-related protein
VC_0511   6       43       -11.390594  hypothetical protein
VC_0512   4       35       -9.047828   methyl-accepting chemotaxis protein
VC_0513   5       35       -9.386154   transcriptional regulator, AraC/XylS family
VC_0514   5       33       -7.668911   methyl-accepting chemotaxis protein
VC_0515   3       23       -4.634780   conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0492   LIPPROTEIN48  32     0.005    Mycoplasma P48 major surface lipoprotein signature.
>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 31.5 bits (71), Expect = 0.005
Identities = 30/131 (22%), Positives = 55/131 (41%), Gaps = 10/131 (7%)

Query: 151 TDACRLISKFTYIYGSGSAPHDLRESYKLHR-LGALEEHLDEIMYEILGWVSDVLTLAAE 209
+ RL +K Y+ G S D R L ++ +H+ + +YE L D++ E
Sbjct: 279 FETVRLANKGQYVIGVDS---DQGMIQDKDRILTSVLKHIKQAVYETL---LDLILEKEE 332

Query: 210 KRQPTIVRAKDFGARLGEIESKYRQ--KTILNYFCNRSSESEDVQNTIKDAPNYIKQLNL 267
+P +V+ K + ++ + N+F N + E + N IK+A K+L
Sbjct: 333 GYKPYVVKDKKADKKWSHFGTQKEKWIGVAENHFSN-TEEQAKINNKIKEAIKMFKELPE 391

Query: 268 INVDDSELEEA 278
V ++A
Sbjct: 392 DFVKYINSDKA 402


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0502   BCTERIALGSPG  46     5e-09    Bacterial general secretion pathway protein G signa...
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 46.4 bits (110), Expect = 5e-09
Identities = 18/53 (33%), Positives = 31/53 (58%)

Query: 1 MSKNQGFTLIELIIAIVILGVIAVIAAPRFINISKDAKANTMLSVAAGMESAL 53
K +GFTL+E+++ IVI+GV+A + P + + A +S +E+AL
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENAL 56


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0514   CHANLCOLICIN  38     1e-04    Channel forming colicin signature.
>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 38.1 bits (88), Expect = 1e-04
Identities = 49/265 (18%), Positives = 93/265 (35%), Gaps = 19/265 (7%)

Query: 362 AEKTANEAEVSRLVVEQQLQELEQLATAMNEMAMTASEVANSAQVAADAAKEGESASLEG 421
AEK EAE R +E++ E E+ + ++ A+ A K+ +A E
Sbjct: 146 AEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIAQKKLSAAQSEV 205

Query: 422 SSVVHETTDAIQRLSIRIGSSVEDVKELVKATDRIETVLDVINDIADQTNLLALNAAIEA 481
+ E RLS I + ++K L + L + + + L + A
Sbjct: 206 VKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNE----LAQASAKYKELDELVKKLSPRA 261

Query: 482 ARAGESGRGFAVVADEVRTLAQRTQQSTMQISEIIEQLQEGAKNVSRSMDESKLETDIVV 541
++ F V R ++ +Q+ ++R + +
Sbjct: 262 NDPLQNRPFFEATRRRVGAGKIREEKQ--------KQVTASETRINRINADITQIQKAIS 313

Query: 542 EKTNQVNEKISLVQQAIHRISDMNLQIASAAEEQSLVAEEINNNTVNIKDLSIKLSEAAS 601
+ +N N I+ V +A NL+ A S + + ++ + L+ K E S
Sbjct: 314 QVSNNRNAGIARVHEAEE-----NLKKAQNNLLNSQIKDAVDATVSFYQTLTEKYGEKYS 368

Query: 602 NAGTEM--NAQVSKVKEQNELLNEF 624
E+ ++ K+ NE L F
Sbjct: 369 KMAQELADKSKGKKIGNVNEALAAF 393


5     VC_0596  VC_0601  Y     N          N                   Genomic Island
LocusTag  DNBias  CDNBias  %GCBias     Product
VC_0596   1       17       3.849394    dnaK suppressor protein
VC_0597   0       17       3.936882    sugar fermentation stimulation protein
VC_0598   -1      16       3.650018    hypothetical protein
VC_0599   -2      13       3.557954    hypothetical protein
VC_0600   -2      13       3.422994    hypothetical protein
VC_0601   -2      13       3.253859    ATP-dependent helicase HrpB
6     VC_0630  VC_0644  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VC_0630   2       18       -1.873283   conserved hypothetical protein
VC_0631   2       21       -1.856067   tyrosyl-tRNA synthetase
VC_0632   2       23       -1.610466   D-alanyl-D-alanine
VC_0633   1       27       -2.017272   outer membrane protein OmpU
VC_0634   -1      19       -1.588381   transcription elongation factor GreA
VC_0635   0       20       -1.498688   conserved hypothetical protein
VC_0636   0       21       -1.469684   cell division protein FtsJ
VC_0637   1       21       -1.012346   cell division protein FtsH
VC_0638   2       25       -0.387363   dihydropteroate synthase
VC_0639   3       29       -0.089556   phosphoglucomutase/phosphomannomutase family
VC_0640   5       30       0.538568    preprotein translocase, SecG subunit
VC_0641   6       32       0.685874    **conserved hypothetical protein
VC_0642   6       37       1.130050    N utilization substance protein A
VC_0643   6       32       1.478257    initiation factor IF-2
VC_0644   4       26       1.302838    ribosome-binding factor A
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0632   BLACTAMASEA   30     0.024    Beta-lactamase class A signature.
>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 29.8 bits (67), Expect = 0.024
Identities = 33/203 (16%), Positives = 68/203 (33%), Gaps = 33/203 (16%)

Query: 259 VVYRLLSQLNIELKGKIKVGKANTKQAQKIASHH---SQPLPVLLKTMLQESDNLIADTL 315
V + + +L+ KI + + ++ H + L + SDN A+ L
Sbjct: 75 AVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSDNSAANLL 134

Query: 316 TKALGHRFYSQPGSFTNGTQAIKQIFYSRTGISLEDTQLADGSGLSRNNRMRPQVMLETL 375
+G P T ++QI + T + +T+L + + P M TL
Sbjct: 135 LATVG-----GPAGLT---AFLRQIGDNVTRLDRWETELNEALPGDARDTTTPASMAATL 186

Query: 376 RYLYQHEAELGLIAMLPSAGESGTLQYRRSMRAPQISGQ-IKA----------KSGSL-Y 423
R L + Q + M +++G I++ K+G+
Sbjct: 187 RKLLTSQR----------LSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGER 236

Query: 424 GTYNMAGFVMDENQRPKTLFVQF 446
G + + N+ + + +
Sbjct: 237 GARGIVALLGPNNKAERIVVIYL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0633   ECOLIPORIN    83     1e-19    E.coli/Salmonella-type porin signature.
>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 82.7 bits (204), Expect = 1e-19
Identities = 91/397 (22%), Positives = 151/397 (38%), Gaps = 70/397 (17%)

Query: 10 MNKTLIALAVSAAAVATGAYADGINQSGDKAGSTVYSAKGTSLEVGGRAEAR--LSLKDG 67
M + ++AL + A A A+A + +Y+ G L++ G+ + S
Sbjct: 1 MKRKVLALVIPALLAAGAAHA-----------AEIYNKDGNKLDLYGKVDGLHYFSDDSS 49

Query: 68 KAQDNSRVRLNFLGKAEINDSLYGVGFYEGEFTTNDQGKNASNNSLDNRYTYAGIG-GTY 126
K D + +R+ F G+ +IND L G G +E N +N+ R +AG+ G Y
Sbjct: 50 KDGDQTYMRVGFKGETQINDQLTGYGQWEYNVQANTTEGEGANSW--TRLAFAGLKFGDY 107

Query: 127 GEVTYGKNDGALGVITDFTDIMSYHG--NTAAEKIAVADRVDNMLAYKGQ--FG-----D 177
G YG+N G L + +TD++ G + + R + + Y+ FG +
Sbjct: 108 GSFDYGRNYGVLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLN 167

Query: 178 LGVKASYRFADRNAVDAMGNVVTETNAAKYSDNGEDGYSLSAIYTFGDTGFNVGAGYADQ 237
++ + ++A D N + DG+ +S Y G GF+ GA Y
Sbjct: 168 FALQYQGKNESQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIG-MGFSAGAAYTTS 226

Query: 238 DDQNE----------------YMLAASYRMENLYFAGLFTDGELAKDVDYTGYELAAGYK 281
D NE + Y N+Y A ++++ T G
Sbjct: 227 DRTNEQVNAGGTIAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVA 286

Query: 282 LGQAAFTAT------------------------YNNAETAKETSADNFAIDATYYFKPNF 317
F T YNN + + ATYYF NF
Sbjct: 287 NKTQNFEVTAQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNF 346

Query: 318 RSYISYQFNLLDSD----KVGKVASEDELAIGLRYDF 350
+Y+ Y+ NLLD D K ++++D +A+G+ Y F
Sbjct: 347 STYVDYKINLLDDDDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0637   HTHFIS        36     6e-04    FIS bacterial regulatory protein HTH signature.
>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 35.6 bits (82), Expect = 6e-04
Identities = 22/82 (26%), Positives = 32/82 (39%), Gaps = 18/82 (21%)

Query: 192 VLMVGPPGTGKTLLAKAI---AGEAKVPFFT-----ISGSDFVEMFVGV------GASRV 237
+++ G GTGK L+A+A+ PF I G GA
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTR 222

Query: 238 RD-MFEQAKKASPCIIFIDEID 258
FEQA+ + +F+DEI
Sbjct: 223 STGRFEQAEGGT---LFLDEIG 241


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0640   SECGEXPORT    137    1e-45    Protein-export SecG membrane protein signature.
>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 137 bits (346), Expect = 1e-45
Identities = 62/112 (55%), Positives = 88/112 (78%), Gaps = 5/112 (4%)

Query: 1 MFTVLLVIYLLAAVAIIGLVLIQQGKGADMGASFGAGASNTVFGASGSGNFLTRMTAIFA 60
M+ LLV++L+ A+ ++GL+++QQGKGADMGASFGAGAS T+FG+SGSGNF+TRMTA+ A
Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLA 60

Query: 61 TVFFVISLVLGNMSTHKTE--SQWVDPSQGQVIQQAGDSVSEAPAKSSDEIP 110
T+FF+ISLVLGN++++KT S+W + S +Q + APAK + +IP
Sbjct: 61 TLFFIISLVLGNINSNKTNKGSEWENLSAPAKTEQTQPA---APAKPTSDIP 109


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0643   TCRTETOQM     74     1e-15    Tetracycline resistance protein TetO/TetQ/TetM family ...
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 74.1 bits (182), Expect = 1e-15
Identities = 68/313 (21%), Positives = 107/313 (34%), Gaps = 77/313 (24%)

Query: 404 IMGHVDHGKTSTLDYIRRTHVASGEAG------------------GITQHIGAYHVETPN 445
++ HVD GKT+ + + A E G GIT G + N
Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67

Query: 446 GMITFLDTPGHAAFTAMRARGAQATDIVVLVVAADDGVMPQTVEAIQHAKAAGVPLIVAV 505
+ +DTPGH F A R D +L+++A DGV QT + G+P I +
Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127

Query: 506 NKIDKDTANPDNV--------------KTELSQY-------NVMPEEWG----GDNMFV- 539
NKID++ + V K ++ Y E+W G++ +
Sbjct: 128 NKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGNDDLLE 187

Query: 540 ------------------------------HISAKQGTNIDGLLEAILLQAEVLELKAVK 569
H SAK ID L+E I +
Sbjct: 188 KYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVIT--NKFYSSTHRG 245

Query: 570 QGMASGVVIESRLDKGRGPVATVLVQSGTLRKGDIVL-CGQEYGRVRAMRDEVGNEVEEA 628
Q G V + + R +A + + SG L D V +E ++ M + E+ +
Sbjct: 246 QSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKIKITEMYTSINGELCKI 305

Query: 629 GPSIPVEILGLSG 641
+ EI+ L
Sbjct: 306 DKAYSGEIVILQN 318


7     VC_0788  VC_0845  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VC_0788   2       17       1.599214    DOPA-dioxygenase-related protein
VC_0789   1       17       1.419066    hypothetical protein
VC_0790   2       15       0.934092    transcriptional regulator CitB
VC_0791   2       16       1.300413    sensor kinase citA
VC_0792   2       16       2.110205    oxaloacetate decarboxylase, beta subunit
VC_0794   2       17       2.239947    hypothetical protein
VC_0795   2       16       2.690313    citrate/sodium symporter
VC_0796   3       17       3.278853    citrate (pro-3S)-lyase ligase
VC_0797   2       19       4.509895    citrate lyase, gamma subunit
VC_0798   3       20       4.377289    citrate lyase, beta subunit
VC_0799   2       20       3.866247    citrate lyase, alpha subunit
VC_0800   3       21       3.604082    citX protein
VC_0801   1       20       3.546504    citG protein
VC_0802   0       24       2.794643    hypothetical protein
VC_0803   1       24       2.675491    RNA methyltransferase, TrmH family
VC_0804   -1      18       1.565329    ferredoxin
VC_0805   -1      14       -0.165659   hypothetical protein
VC_0806   -1      11       -2.186602   conserved hypothetical protein
VC_0807   -1      17       -4.755570   hypothetical protein
VC_0808   -1      21       -5.120199   hypothetical protein
VC_0809   -1      21       -5.022136   hypothetical protein
VC_0810   2       28       -6.666582   hypothetical protein
VC_0811   2       28       -6.161855   hypothetical protein
VC_0812   3       26       -4.731851   helicase-related protein
VC_0813   6       33       -7.215587   tellurite resistance protein-related protein
VC_0814   8       41       -11.047405  transcriptional regulator, putative
VC_0815   9       43       -12.662084  hypothetical protein
VC_0816   9       47       -14.436741  hypothetical protein
VC_0817   10      48       -14.354400  transposase, putative
VC_0819   9       50       -15.139624  aldehyde dehydrogenase
VC_0820   9       47       -14.673290  ToxR-activated gene A protein
VC_0821   7       45       -13.848971  hypothetical protein
VC_0822   6       43       -12.590953  inner membrane protein, putative
VC_0823   2       39       -9.721847   hypothetical protein
VC_0824   1       36       -8.750347   tagD protein
VC_0825   0       35       -9.037530   toxin co-regulated pilus biosynthesis protein I
VC_0826   2       38       -9.656725   toxin co-regulated pilus biosynthesis protein P
VC_0827   3       38       -10.389866  toxin co-regulated pilus biosynthesis protein H
VC_0828   4       38       -11.902766  toxin co-regulated pilin
VC_0829   5       38       -13.320171  toxin co-regulated pilus biosynthesis protein B
VC_0830   5       41       -14.372237  toxin co-regulated pilus biosynthesis protein Q
VC_0831   6       40       -14.221059  toxin co-regulated pilus biosynthesis outer
VC_0832   6       42       -14.516971  toxin co-regulated pilus biosynthesis protein R
VC_0833   6       43       -14.862562  toxin co-regulated pilus biosynthesis protein D
VC_0834   5       44       -14.665095  toxin co-regulated pilus biosynthesis protein S
VC_0835   6       43       -14.324360  toxin co-regulated pilus biosynthesis protein T
VC_0836   5       44       -14.568744  toxin co-regulated pilus biosynthesis protein E
VC_0837   5       46       -15.194678  toxin co-regulated pilus biosynthesis protein F
VC_0838   6       43       -15.252667  TCP pilus virulence regulatory protein
VC_0839   5       43       -14.730220  leader peptidase TcpJ
VC_0840   7       42       -12.116948  accessory colonization factor AcfB
VC_0841   6       37       -10.213520  accessory colonization factor AcfC
VC_0842   5       33       -9.452941   conserved hypothetical protein
VC_0843   4       31       -8.493174   tagE protein
VC_0844   4       29       -7.566229   accessory colonization factor AcfA
VC_0845   3       26       -6.517032   conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0790   HTHFIS        80     1e-19    FIS bacterial regulatory protein HTH signature.
>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.3 bits (198), Expect = 1e-19
Identities = 28/144 (19%), Positives = 52/144 (36%), Gaps = 8/144 (5%)

Query: 2 MELIDVLIVEDETSIADVHSFYLKQTARFR--PVGVAQSINEARNMVRILKPKLIFLDNY 59
M +L+ +D+ +I V L Q V + + + L+ D
Sbjct: 1 MTGATILVADDDAAIRTV----LNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVV 56

Query: 60 LPDGRGIEFLKELT-HQPQAPDVIFITAASDMETVREAVRCGVFDYLLKPIAYDRVQNSL 118
+PD + L + +P P V+ ++A + T +A G +DYL KP + +
Sbjct: 57 MPDENAFDLLPRIKKARPDLP-VLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115

Query: 119 ERYLKYISSLRANDSVNQRHVDEL 142
R L + + + L
Sbjct: 116 GRALAEPKRRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0791   PF06580       37     1e-04    Sensor histidine kinase
>PF06580#Sensor histidine kinase

Length = 349

Score = 37.2 bits (86), Expect = 1e-04
Identities = 17/100 (17%), Positives = 33/100 (33%), Gaps = 17/100 (17%)

Query: 451 LLDNAFEATLKNPHSNKTISLLLTDNGAELVIEVADNGIGISADIAQTLFLKGVSSKNQE 510
L++N + + I L T + + +EV + G +E
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG-------------SLALKNTKE 309

Query: 511 GHGIGLYLVHQFVTQAHG---SILIDSAEPQGTIFSIFIP 547
G GL V + + +G I + + + + IP
Sbjct: 310 STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0796   LPSBIOSNTHSS  46     3e-08    Lipopolysaccharide core biosynthesis protein signat...
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 46.0 bits (109), Expect = 3e-08
Identities = 21/67 (31%), Positives = 33/67 (49%), Gaps = 2/67 (2%)

Query: 160 NPFTLGHQYLIEQACEQCDWVHLFVVKAENK--DFSYADRMAMIKAGSKHLLNLTIHSGS 217
+P T GH +IE+ C D V++ V++ NK FS +R+ I HL N + S
Sbjct: 10 DPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLPNAQVDSFE 69

Query: 218 DYIISRA 224
++ A
Sbjct: 70 GLTVNYA 76


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0828   PF05946       312    e-111    Toxin-coregulated pilus subunit TcpA
>PF05946#Toxin-coregulated pilus subunit TcpA

Length = 199

Score = 312 bits (801), Expect = e-111
Identities = 161/199 (80%), Positives = 178/199 (89%)

Query: 26 MTLLEVIIVLGIMGVVSAGVVTLAQRAIDSQNMTKAAQNLNSVQIAMTQTYRSLGNYPAT 85
MTLLEVIIVLGIMGVVSAGVVTLAQRAIDSQNMTKAAQ+LNS+Q+A+TQTYR LGNYPAT
Sbjct: 1 MTLLEVIIVLGIMGVVSAGVVTLAQRAIDSQNMTKAAQSLNSIQVALTQTYRGLGNYPAT 60

Query: 86 ANANAATQLANGLVSLGKVSADEAKNPFTGTAMGIFSFPRNSAANKAFAITVGGLTQAQC 145
A+A AA++L +GLVSLGK+S+DEAKNPF GT M IFSFPRN+AANKAFAI+V GLTQAQC
Sbjct: 61 ADATAASKLTSGLVSLGKISSDEAKNPFIGTNMNIFSFPRNAAANKAFAISVDGLTQAQC 120

Query: 146 KTLVTSVGDMFPFINVKEGAFAAVADLGDFETSVADAATGAGVIKSIAPGSANLNLTNIT 205
KTL+TSVGDMFP+I +K G A+ADLGDFE S A A TG GVIKSIAP S NL+LTNIT
Sbjct: 121 KTLITSVGDMFPYIAIKAGGAVALADLGDFENSAAAAETGVGVIKSIAPASKNLDLTNIT 180

Query: 206 HVEKLCTGTAPFTVAFGNS 224
HVEKLC GTAPF VAFGNS
Sbjct: 181 HVEKLCKGTAPFGVAFGNS 199
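
The Expect value for this hit is printed as `e-111`, a shorthand with no mantissa, so it cannot be passed to a numeric parser as-is. Such strings need normalising before they can be compared with the E-value cutoff of 0.05 listed under the input parameters. The Python sketch below does this for a few Expect values copied from this page. `parse_evalue` is a hypothetical helper, not a BLAST or PredictBias function, and reading the missing mantissa as 1 is an assumption.

```python
E_VALUE_CUTOFF = 0.05  # "E-value (RPSBlast)" from the input parameters


def parse_evalue(text: str) -> float:
    """Convert a BLAST Expect string to float, accepting the 'e-111' shorthand."""
    text = text.strip()
    if text.startswith("e"):
        # Assumption: a missing mantissa is read as 1, so "e-111" becomes 1e-111.
        text = "1" + text
    return float(text)


# Expect values copied from hits on this page.
EXPECT = {
    "VC_0024": "3e-30",
    "VC_0026": "0.007",
    "VC_0828": "e-111",
}

for locus, raw in EXPECT.items():
    value = parse_evalue(raw)
    print(locus, raw, "passes" if value <= E_VALUE_CUTOFF else "fails")
```

Prefixing the string rather than special-casing the value keeps ordinary inputs such as `0.007` and `3e-30` on the plain `float` path.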


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0829   FLGBIOSNFLIP  29     0.033    Escherichia coli: Flagellar biosynthetic protein Fl...
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 29.0 bits (65), Expect = 0.033
Identities = 16/46 (34%), Positives = 24/46 (52%), Gaps = 3/46 (6%)

Query: 198 FESNVISYEEFIENPSA--RENFLLKATKDRTLALAVSLAQTGEIA 241
F IS +E +E + RE F+L+ T++ L L LA TG +
Sbjct: 116 FSEEKISMQEALEKGAQPLRE-FMLRQTREADLGLFARLANTGPLQ 160


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0831   BCTERIALGSPD  41     8e-06    Bacterial general secretion pathway protein D signa...
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 41.4 bits (97), Expect = 8e-06
Identities = 48/271 (17%), Positives = 91/271 (33%), Gaps = 36/271 (13%)

Query: 211 SASLGETGKFTIFEDYSLVTVKARPDKFLLLHTFFDKLINESKMQIAVDYRVVSLSEERL 270
A+L + + + V A PD L +L + + Q+ V+ + + +
Sbjct: 303 VAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQL-DIRRPQVLVEAIIAEVQDADG 361

Query: 271 NQLAAKFGIENAGKYSITSDMV--------------DAISLSQVGGGL----GASYRSAS 312
L ++ +NAG T+ + D S + L G +
Sbjct: 362 LNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQ 421

Query: 313 ARLDAVVNELSQE----VMHEGHFIGIPNRVMPLNVTTNSKYISSIETTKDTN--TDEET 366
++ LS ++ + + N NV ++ +TT N E
Sbjct: 422 GNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVER 481

Query: 367 RTVKVSDLVTGFSMMVMPKILDDGRI--QISSGFSRKQLVSIGTAQGITLPTVDENESMN 424
+TV G + V P+I + + +I S + T+ + T + N
Sbjct: 482 KTV-------GIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDL-GATFNTRTVNN 533

Query: 425 TVTMNPGE-VRLAMLFKDNYIQNSNGVQLLG 454
V + GE V + L + ++ V LLG
Sbjct: 534 AVLVGSGETVVVGGLLDKSVSDTADKVPLLG 564


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0836   BCTERIALGSPF  66     4e-14    Bacterial general secretion pathway protein F signa...
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 66.0 bits (161), Expect = 4e-14
Identities = 58/325 (17%), Positives = 130/325 (40%), Gaps = 15/325 (4%)

Query: 16 LVDLLNDNIPLYDALNKIQNEGVGIYDKNFIKS-IELIKDRMKSNSSLTDAL---TGLIP 71
L L+ ++PL +AL+ + + +K + + ++ ++ SL DA+ G
Sbjct: 77 LATLVAASMPLEEALDAVAKQS----EKPHLSQLMAAVRSKVMEGHSLADAMKCFPGSFE 132

Query: 72 DKEVLMINVAENSGKISSGIAAIRKNIIDADEIKSKAISSMITPSVMLIVTMVVIAGYSV 131
M+ E SG + + + + +++S+ +MI P V+ +V + V++
Sbjct: 133 RLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVVAIAVVSILLS 192

Query: 132 KVFPTFESVLPVSR--WPGVTQALYNLGFSLYE-GLWIKVLIFVAIFITILVFMSKNITG 188
V P + P T+ L + ++ G W+ + + ++ +
Sbjct: 193 VVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFRVMLRQEKRRV 252

Query: 189 NFRDGFLDKLPPFNFVKHIAAT-EFLANMSMLLDSRVPFKEGLDIV-DHKTTRWLSSHLQ 246
+F L LP + T + +S+L S VP + + I D + + L
Sbjct: 253 SF-HRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVMSNDYARHRLS 311

Query: 247 RMKANMQEGLDYKQALDTNLLDKKMLLTM-AVYSELPNFSDVMQKLAIEANINLHKKIAT 305
++EG+ +AL+ L M+ M A ++++ A + ++
Sbjct: 312 LATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQDREFSSQMTL 371

Query: 306 LAGVMKNISLITLALSVIWIFGAIF 330
G+ + + ++++A V++I AI
Sbjct: 372 ALGLFEPLLVVSMAAVVLFIVLAIL 396


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0837   PF06340       568    0.0      Vibrio cholerae toxin co-regulated pilus biosynthesis pr...
>PF06340#Vibrio cholerae toxin co-regulated pilus biosynthesis protein F (TcpF)

Length = 338

Score = 568 bits (1466), Expect = 0.0
Identities = 335/338 (99%), Positives = 335/338 (99%)

Query: 1 MRYKKTLMLSIMITSFNSFAFNDNYSSTSTVYATSNEATDSRGSEHLRYPYLECIKIGMS 60
MRYKKTLMLSIMITSFNSFAFNDNYSSTSTVYATSNEATDSRGSEHLRYPYLECIKIGMS
Sbjct: 1 MRYKKTLMLSIMITSFNSFAFNDNYSSTSTVYATSNEATDSRGSEHLRYPYLECIKIGMS 60

Query: 61 RDYLENCVKVSFPTSQDMFYDAYPSTESDGAKTRTKEDFSARLLAGDYDSLQKLYIDFYL 120
RDYLENCVKVSFPTSQDMFYDAY STESDGAKTRTKEDFSARLLAGDYDSLQKLYIDFYL
Sbjct: 61 RDYLENCVKVSFPTSQDMFYDAYSSTESDGAKTRTKEDFSARLLAGDYDSLQKLYIDFYL 120

Query: 121 AQTTFDWEIPTRDQIETLVNYANEGKLSTALNQEYITGRFLTKENGRYDIVNVGGVPDNT 180
AQTTFDWEIPTRDQIETLVNYANEGKLSTALNQEYITGRFLTKENGRYDIVNVGGVPDNT
Sbjct: 121 AQTTFDWEIPTRDQIETLVNYANEGKLSTALNQEYITGRFLTKENGRYDIVNVGGVPDNT 180

Query: 181 PVKLPAIVSKRGLMGTTSVVNAIPNEIYPHIKVYEGTLSRLKPGGAMIAVLEYDVSELSK 240
PVKLPAIVSKRGLMGTTSVVNAIPNEIYPHIKVYEGTLSRLKPGGAMIAVLEYDVSELSK
Sbjct: 181 PVKLPAIVSKRGLMGTTSVVNAIPNEIYPHIKVYEGTLSRLKPGGAMIAVLEYDVSELSK 240

Query: 241 HGYTNLWDVQFKVLVGVPHAETGVIYDPVYEETVKPYQPSGNLTGKKLYNVSTNDMHNGY 300
HGYTNLWDVQFKVLVGVPHAETGVIYDPVYEETVKPYQPS NLTGKKLYNVSTNDM NGY
Sbjct: 241 HGYTNLWDVQFKVLVGVPHAETGVIYDPVYEETVKPYQPSDNLTGKKLYNVSTNDMRNGY 300

Query: 301 KWSNTMFSNSNYKTQILLTKGDGSGVKLYSKAYSENFK 338
KWSNTMFSNSNYKTQILLTKGDGSGVKLYSKAYSENFK
Sbjct: 301 KWSNTMFSNSNYKTQILLTKGDGSGVKLYSKAYSENFK 338


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0839   PREPILNPTASE  227    1e-76    Type IV prepilin cysteine protease (C20) family sig...
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 227 bits (581), Expect = 1e-76
Identities = 77/268 (28%), Positives = 130/268 (48%), Gaps = 22/268 (8%)

Query: 1 MEYVYLILFSIVSLILGSFSNVVIYRLPRKILLKNHFF------------------YDID 42
+ ++Y L + SL++GSF NVVI+RLP I+L+ + Y++
Sbjct: 11 LPWLYFSLVFLFSLMIGSFLNVVIHRLP--IMLEREWQAEYRSYFNPDDEGVDEPPYNLM 68

Query: 43 SNRSMCPKCGNKISWYDNVPLLSYLLLHGKCRHCDEKISLSYFIVELSFFIIAFPIYWLS 102
RS CP C + I+ +N+PLLS+L L G+CR C IS Y +VEL +++ +
Sbjct: 69 VPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVAVAMTL 128

Query: 103 TDWVDSFVLLGLYFILFNLFVIDFKSMLLPNLLTYPIFMLAFIYVQQNPALTVESSIIGG 162
+ L L ++L L ID MLLP+ LT P+ ++ +++ ++IG
Sbjct: 129 APGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGA 188

Query: 163 FAAFIISYVSNFIVRLFKRIDVMGGGDIKLYTAIGTLIGVEFVPYLFLLSSIIAFIHWFF 222
A +++ + + +L + MG GD KL A+G +G + +P + LLSS++
Sbjct: 189 MAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIG 248

Query: 223 ARVSCRYCL--YIPLGPSIIISFVIVFF 248
+ + IP GP + I+ I
Sbjct: 249 LILLRNHHQSKPIPFGPYLAIAGWIALL 276


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0844   OMPADOMAIN    51     7e-10    OMPA domain signature.
>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 51.1 bits (122), Expect = 7e-10
Identities = 49/231 (21%), Positives = 80/231 (34%), Gaps = 56/231 (24%)

Query: 1 MQKTLSAI---FLFTTLSANAAP-----YIGLELGIGTANHSFETNYQSDAVSLNPNMED 52
M+KT AI A AAP Y G +LG + +T + ++ + N
Sbjct: 1 MKKTAIAIAVALAGFATVAQAAPKDNTWYTGAKLG---WSQYHDTGFINNNGPTHEN--Q 55

Query: 53 MFLGGLIGYKFNDNFSFEINYSSYKLEDQYSKFIGIESLNNQQYRKEHEWSSEIEAKQLA 112
+ G GY+ N FE+ Y D + S+ N Y + + QL
Sbjct: 56 LGAGAFGGYQVNPYVGFEMGY------DWLGRMPYKGSVENGAY--------KAQGVQLT 101

Query: 113 LISVYSYPIHQQLKTNIKVGVTHTQYEAYSGKYEELELLLNDDIEIRKALLETGLKKNRF 172
YPI L ++G + + S Y + N D
Sbjct: 102 AKL--GYPITDDLDIYTRLGGMVWRADTKSNVYGK-----NHD--------------TGV 140

Query: 173 GALFSLGLDYNLIPELSVGTELRYQ---FDKYN-----DVLSISLNSKYYF 215
+F+ G++Y + PE++ E ++ D + D +SL Y F
Sbjct: 141 SPVFAGGVEYAITPEIATRLEYQWTNNIGDAHTIGTRPDNGMLSLGVSYRF 191


8     VC_0917  VC_0929  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VC_0917   -1      21       -3.018109   UDP-N-acetylglucosamine 2-epimerase
VC_0918   0       24       -3.634247   UDP-N-acetyl-D-mannosaminuronic acid
VC_0919   2       26       -3.981385   serine acetyltransferase-related protein
VC_0920   0       25       -3.899636   exopolysaccharide biosynthesis protein EpsF,
VC_0921   0       24       -3.100418   polysaccharide export protein, putative
VC_0922   -1      24       -2.571078   hypothetical protein
VC_0923   -1      22       -2.657714   serine acetyltransferase-related protein
VC_0924   -1      21       -4.635375   capK protein, putative
VC_0925   -1      19       -3.780728   polysaccharide biosynthesis protein, putative
VC_0926   0       15       -4.006978   hypothetical protein
VC_0927   1       16       -4.146482   UDP-N-acetyl-D-mannosamine transferase
VC_0928   0       16       -4.296643   hypothetical protein
VC_0929   0       13       -3.684125   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0918   NUCEPIMERASE  29     0.046    Nucleotide sugar epimerase signature.
>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 28.6 bits (64), Expect = 0.046
Identities = 12/30 (40%), Positives = 15/30 (50%), Gaps = 1/30 (3%)

Query: 4 KVSVIGL-GYIGLPTAAVLASRGLDVVGVD 32
K V G G+IG + L G VVG+D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGID 31


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0926   PF06917       29     0.029    Periplasmic pectate lyase
>PF06917#Periplasmic pectate lyase

Length = 555

Score = 29.5 bits (66), Expect = 0.029
Identities = 9/33 (27%), Positives = 15/33 (45%)

Query: 79 LFILGLIEEYQVTKDPALLTEAEQLAQWLVTQR 111
+L L+E + + P L T A Q+ L +
Sbjct: 448 YLLLALVELAEHCQCPTLFTLAWQIGDDLFKRH 480


9     VC_0961  VC_0973  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias     Product
VC_0961   2       22       0.001913    phoH family protein
VC_0962   2       24       -0.549402   conserved hypothetical protein
VC_0963   1       23       -1.543244   VisC-related protein
VC_0964   2       29       -2.444952   PTS system, glucose-specific IIA component
VC_0965   2       20       -1.980681   phosphoenolpyruvate-protein phosphotransferase
VC_0966   1       14       -1.179920   phosphocarrier protein HPr
VC_0967   -1      17       -1.504798   hypothetical protein
VC_0968   -1      17       -2.083754   cysteine synthase A
VC_0969   -1      14       -1.870437   cysZ protein
VC_0970   -2      15       -1.801877   cell division protein ZipA
VC_0971   -1      18       -2.215164   DNA ligase
VC_0972   0       20       -3.222211   porin, putative
VC_0973   0       21       -3.503063   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0965   PHPHTRNFRASE  762    0.0      Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 762 bits (1969), Expect = 0.0
Identities = 287/568 (50%), Positives = 408/568 (71%), Gaps = 2/568 (0%)

Query: 1 MISGILASPGIAIGKALLLQEDEIVLNTNTISDDQVEAEVERFYTARDKSSAQLEVIKQK 60
I+GI AS G+AI KA + E + + +I+D V E+E+ A +KS +L IK +
Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSITD--VSTEIEKLTAALEKSKEELRAIKDQ 61

Query: 61 ALETFGEEKEAIFEGHIMLLEDEELEEEILALIKKEKMSADNAIHTVIEEQATALESLDD 120
+ G +K IF H+++L+D EL + I I+ E+M+A+ A+ V + + ES+D+
Sbjct: 62 TEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDN 121

Query: 121 EYLKERATDIRDIGTRFVKNALGMHIVSLSEIDQEVVLVAYDLTPSETAQINLNYVLGFA 180
EY+KERA DIRD+ R + + +G+ SL+ I +E V++A DLTPS+TAQ+N +V GFA
Sbjct: 122 EYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFA 181

Query: 181 CDIGGRTSHTSIMARSLELPAIVGTNDITKKVKNGDTLILDAMNNKIIVNPTQAQIEEAK 240
DIGGRTSH++IM+RSLE+PA+VGT ++T+K+++GD +I+D + +IVNPT+ +++ +
Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241

Query: 241 AVKAAFLAEKEELAKLKDLPAETLDGHRVEVCGNIGTVKDCDGIIRNGGEGVGLYRTEFL 300
+AAF +K+E AKL P+ T DG VE+ NIGT KD DG++ NGGEG+GLYRTEFL
Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301

Query: 301 FMDRDALPTEEEQYQAYKEVAEAMNGQAVIIRTMDIGGDKDLPYMDLPKEMNPFLGWRAV 360
+MDRD LPTEEEQ++AYKEV + M+G+ V+IRT+DIGGDK+L Y+ LPKE+NPFLG+RA+
Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361

Query: 361 RISLDRREILRDQLRGILRASAHGKLRVMFPMIISVEEIRELKNAIEEYKAELRTEGHAF 420
R+ L++++I R QLR +LRAS +G L+VMFPMI ++EE+R+ K ++E K +L +EG
Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDV 421

Query: 421 DENIEIGVMVETPAAAAIAHHLAKEVSFFSIGTNDLTQYTLAVDRGNEMISHLYNPLSPA 480
++IE+G+MVE P+ A A+ AKEV FFSIGTNDL QYT+A DR NE +S+LY P PA
Sbjct: 422 SDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPA 481

Query: 481 VLTVIKQVIDASHAEGKWTGMCGELAGDERATLLLLGMGLDEFSMSGISIPKVKKVIRNA 540
+L ++ VI A+H+EGKW GMCGE+AGDE A LLLG+GLDEFSMS SI + +
Sbjct: 482 ILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKL 541

Query: 541 NYAEIKAMAEEALALPTAAEIEACVDKF 568
+ E+K A++AL L TA E+E V K
Sbjct: 542 SKEELKPFAQKALMLDTAEEVEQLVKKT 569


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0969   ACRIFLAVINRP  29     0.018    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.0 bits (65), Expect = 0.018
Identities = 11/39 (28%), Positives = 18/39 (46%), Gaps = 3/39 (7%)

Query: 208 FGMLVAFFTS--IPIVNLFIVPVAVCGA-TAMWVMEFKI 243
F L A + S IP+ + +VP+ + G A + K
Sbjct: 884 FLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKN 922


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VC_0970   TONBPROTEIN  32     0.002    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 31.9 bits (72), Expect = 0.002
Identities = 11/63 (17%), Positives = 23/63 (36%)

Query: 92 ELDEEEDEEARIPVQPQSQPQPRKVQPQVEMPRVAPNVPMAKVQPEVVTEIEVQEPQEEK 151
D E + + P +P +P+P + K +P+ + + ++ K
Sbjct: 52 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPK 111

Query: 152 LDV 154
DV
Sbjct: 112 RDV 114


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0972   ECOLNEIPORIN  59     8e-12    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 58.7 bits (142), Expect = 8e-12
Identities = 38/143 (26%), Positives = 66/143 (46%), Gaps = 5/143 (3%)

Query: 50 YQEDSNGYDYENESRIGFRASKDMFDNVNVFMQIESGYVGEDGKGSTLGARDTFLGLQGD 109
++ + S+IGF+ +D+ + + Q+E G S G R +F+GL+G
Sbjct: 45 ASVETGTGIVDLGSKIGFKGQEDLGNGLKAIWQVEQK-ASIAGTDSGWGNRQSFIGLKGG 103

Query: 110 WGKVRFGRMLTPLYEIVDWPYSNPGLGRVFDWGGDVAGHYDRKGDIARYDSPAFGGLTFN 169
+GK+R GR+ + L D NP + G + + + RYDSP F GL+ +
Sbjct: 104 FGKLRVGRLNSVLK---DTGDINPWDSKSDYLGVNKIAEPEARLISVRYDSPEFAGLSGS 160

Query: 170 IS-AGRGDVGTKSSNHFGAAAHY 191
+ A + G +S + A +Y
Sbjct: 161 VQYALNDNAGRHNSESYHAGFNY 183


10  VC_0993  VC_0998  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_0993   3       16       -0.115805  N-acetylglucosamine repressor
VC_0994   3       16       0.242138   N-acetylglucosamine-6-phosphate deacetylase
VC_0995   3       16       0.674080   PTS system, N-acetylglucosamine-specific IIABC
VC_0996   2       15       0.931808   hypothetical protein
VC_0997   2       14       0.896065   glutaminyl-tRNA synthetase
VC_0998   3       15       1.252011   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VC_0998   IGASERPTASE  45     3e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 45.1 bits (106), Expect = 3e-06
Identities = 50/315 (15%), Positives = 95/315 (30%), Gaps = 29/315 (9%)

Query: 867 DLELPEENDEPQLAEVTPSSAFDEQQVETEIEP-ESEPLAAEASNDESDLTALNELDLPE 925
DL PE Q + T + + Q + P +E +A + E
Sbjct: 979 DLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE 1038

Query: 926 YTEEDVLADVQLEPAAESEVEPDLELVNEPVTEEAFTELDELDLPEYTEEDALADAQLEP 985
E+ + + E + + N V +EA + + + ++
Sbjct: 1039 TVAENSKQESKTVEKNEQDAT-ETTAQNREVAKEAKSNV----------KANTQTNEVAQ 1087

Query: 986 VAESEVEPELDLASEPAEEEAFTELNKLDLPEYTEEDALADAQLESATESEVESELELVS 1045
E + E A E E K++ + E + + + E ++ +
Sbjct: 1088 SGSETKETQTTETKETATVEK-EEKAKVETEKTQEV---PKVTSQVSPKQEQSETVQPQA 1143

Query: 1046 EPAAEEAFTELDELDLPEYTEEDALADSQLEPAAESEVEPELELVSEPVTEEAFTELDEL 1105
EPA E T + E + +PA E+ E + V+E T + E
Sbjct: 1144 EPARENDPTVN----IKEPQSQTNTTADTEQPAKETSSNVE-QPVTESTTVNTGNSVVEN 1198

Query: 1106 DLPEYTEEDALADAQLEPAVESEVEPELELASEPAEEEASTE-------LNELDLPEYTE 1158
T E + + + + S P E +T + DL
Sbjct: 1199 PENT-TPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNT 1257

Query: 1159 EDALADAQLEPAAES 1173
L+DA+ + +
Sbjct: 1258 NAVLSDARAKAQFVA 1272



Score = 40.4 bits (94), Expect = 7e-05
Identities = 27/138 (19%), Positives = 49/138 (35%), Gaps = 15/138 (10%)

Query: 145 NSTQDAVNIMASHQAKLNQTPDTPVRPVAPPRPAPVATPKVEAVAQTPPQVTPTTAPQEK 204
N+ Q V + S+ ++ + + PV P AP P+ E VA+ Q + T E+
Sbjct: 1001 NNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETT----ETVAENSKQESKTVEKNEQ 1056

Query: 205 APTELKTPAKPSQSTDAEVMALEEKNHTLRLMLSQVQSEVSTLKEELGDENRIRSEVERL 264
TE A+ K +T +EV+ E + ++
Sbjct: 1057 DATE----TTAQNREVAKEAKSNVKANT-------QTNEVAQSGSETKETQTTETKETAT 1105

Query: 265 LEEERRKAEEASRLAPSA 282
+E+E + E +
Sbjct: 1106 VEKEEKAKVETEKTQEVP 1123



Score = 35.0 bits (80), Expect = 0.003
Identities = 44/239 (18%), Positives = 77/239 (32%), Gaps = 37/239 (15%)

Query: 812 AEDDLPEQTTATNETADELLADLAAQPQSNTVDTSDDALAPDGLSQSVEEPLTLND---- 867
E D E T E A E +++ A Q+N V S + + + +E T+
Sbjct: 1054 NEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG-SETKETQTTETKETATVEKEEKA 1112

Query: 868 -LELPEENDEPQLAEVTPSSAFDEQQVETEIEPESEPLAAEASNDESDLTALNELDLPEY 926
+E + + P++ + V+ + EP E + E
Sbjct: 1113 KVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVN--------------IKEP 1158

Query: 927 TEEDVLADVQLEPAAESEVEPDLELVNEPVTEEAFTELDELDLPEYTEEDALADAQLEPV 986
+ +PA E+ + +PVTE + E E A Q
Sbjct: 1159 QSQTNTTADTEQPAKETSSNVE-----QPVTESTTVNTGN-SVVENPENTTPATTQPTVN 1212

Query: 987 AESEVEPE----LDLASEPAEEEAFTELNKLDLPEYTEEDALADAQLESATESEVESEL 1041
+ES +P+ + S P E T + + +A L S + V S+
Sbjct: 1213 SESSNKPKNRHRRSVRSVPHNVEPATTSSN-------DRSTVALCDLTSTNTNAVLSDA 1264


11  VC_1274  VC_1280  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_1274   2       11       -0.156409  conserved hypothetical protein
VC_1275   3       11       -0.600830  conserved hypothetical protein
VC_1276   2       12       -1.505228  sensor histidine kinase
VC_1277   3       14       -1.715570  transcriptional regulator, LuxR family
VC_1278   3       15       -2.037191  transcriptional regulator, MarR family
VC_1279   2       16       -2.291971  transporter, BCCT family
VC_1280   1       18       -3.161760  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_1276   PF06580  38     8e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.5 bits (87), Expect = 8e-05
Identities = 16/77 (20%), Positives = 34/77 (44%), Gaps = 4/77 (5%)

Query: 369 IEKHAKAEKVTVLLQQMGDMLQLMVRDDGVGFSSKEALQKRGIGLRNMRERVEFIGGE-- 426
I + + K+ + + + L V + G K + G GL+N+RER++ + G
Sbjct: 272 IAQLPQGGKILLKGTKDNGTVTLEVENTGSLAL-KNTKESTGTGLQNVRERLQMLYGTEA 330

Query: 427 -FELSSEPQLGTEITVL 442
+LS + + ++
Sbjct: 331 QIKLSEKQGKVNAMVLI 347


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_1277   HTHFIS  69     5e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 69.1 bits (169), Expect = 5e-16
Identities = 24/120 (20%), Positives = 45/120 (37%), Gaps = 3/120 (2%)

Query: 5 MDKPIRVVMVDDHQVVLDGFIARLEQEPEIEVVATASNGLEALELVKLHQPDVVLMDVSM 64
M +++ DD + L + V SN + D+V+ DV M
Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYD--VRITSNAATLWRWIAAGDGDLVVTDVVM 57

Query: 65 PIMNGIEATRLIKEEVPHTKVLMLTMHDNREYIMQVMQAGAMGYMLKEISALKMVQAIKT 124
P N + IK+ P VL+++ + ++ + GA Y+ K +++ I
Sbjct: 58 PDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


12  VC_1321  VC_1338  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_1321   2       13       -0.533006  hypothetical protein
VC_1322   1       15       -1.847351  conserved hypothetical protein
VC_1323   1       15       -1.539326  hypothetical protein
VC_1324   0       15       -2.777078  hypothetical protein
VC_1325   2       13       -3.626192  galactoside ABC transporter, periplasmic
VC_1326   3       13       -4.976122  hypothetical protein
VC_1327   2       14       -2.531960  galactoside ABC transporter, ATP-binding
VC_1328   1       18       -2.376011  galactoside ABC transporter, permease protein
VC_1329   1       18       -1.359208  opacity protein-related protein
VC_1330   1       18       0.243291   hypothetical protein
VC_1331   -1      20       2.816103   hypothetical protein
VC_1332   -1      18       3.338936   conserved hypothetical protein
VC_1333   1       19       4.382728   hypothetical protein
VC_1334   2       20       4.980529   conserved hypothetical protein
VC_1335   3       21       5.054112   transcriptional regulator, GntR family
VC_1336   2       21       4.971790   carboxyphosphonoenolpyruvate phosphonomutase
VC_1337   0       18       4.216635   methylcitrate synthase
VC_1338   0       16       3.758404   aconitate hydratase 1
13  VC_1377  VC_1403  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_1377   2       16       1.627878   conserved hypothetical protein
VC_1378   2       17       1.744519   hypothetical protein
VC_1379   1       18       1.868637   hypothetical protein
VC_1380   1       18       1.763981   hypothetical protein
VC_1381   2       17       2.367291   hypothetical protein
VC_1382   2       18       2.354568   ATP-dependent helicase HrpA
VC_1383   -1      18       2.514183   hypothetical protein
VC_1384   -1      18       2.594877   hypothetical protein
VC_1385   0       21       3.400243   hypothetical protein
VC_1386   -1      20       3.683307   heat shock protein 70 family protein
VC_1387   1       19       3.892497   hypothetical protein
VC_1388   1       19       3.842338   lipoate-protein ligase A
VC_1389   0       17       1.983727   hypothetical protein
VC_1390   0       17       1.707266   transcriptional regulator, LysR family
VC_1391   -3      15       0.035149   multidrug transporter, putative
VC_1392   0       17       -1.189751  deoxyribodipyrimidine photolyase, putative
VC_1393   1       19       -4.014276  sugE protein
VC_1394   2       19       -4.162151  methyl-accepting chemotaxis protein
VC_1396   0       20       -2.946260  hypothetical protein
VC_1397   1       19       -3.038518  chemotaxis protein CheA
VC_1398   0       20       -3.442485  chemotaxis protein CheY
VC_1399   -1      20       -3.748288  chemotaxis protein methyltransferase CheR
VC_1400   1       20       -3.218057  hypothetical protein
VC_1401   0       20       -3.326279  protein-glutamate methylesterase CheB
VC_1402   -1      17       -3.232744  purine-binding chemotaxis protein Chew,
VC_1403   1       18       -3.211288  methyl-accepting chemotaxis protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VC_1384   OMPADOMAIN  58     6e-13    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 58.4 bits (141), Expect = 6e-13
Identities = 47/192 (24%), Positives = 69/192 (35%), Gaps = 34/192 (17%)

Query: 1 MKKTLLAL--ALLGASSTAMA----DSWIYGGASVGQSDY------EGKHGT-----AYS 43
MKKT +A+ AL G ++ A A ++W Y GA +G S Y T
Sbjct: 1 MKKTAIAIAVALAGFATVAQAAPKDNTW-YTGAKLGWSQYHDTGFINNNGPTHENQLGAG 59

Query: 44 VHAGTGILPFIGLEAGYVNHGDFEINATQE---LSASSLYFAVKPSMDFGP-LHVYAKGG 99
G + P++G E GY G + E A + K L +Y + G
Sbjct: 60 AFGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLG 119

Query: 100 LHSWDKD----INGGKIDDGIDVMYGIGAEYFIIGPFSVGASYM------NYTMDST--D 147
W D + G D G+ ++ G EY I + Y + T D
Sbjct: 120 GMVWRADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNIGDAHTIGTRPD 179

Query: 148 VGTLSFNATFHF 159
G LS ++ F
Sbjct: 180 NGMLSLGVSYRF 191


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_1386   SHAPEPROTEIN  49     1e-08    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 49.4 bits (118), Expect = 1e-08
Identities = 26/105 (24%), Positives = 47/105 (44%), Gaps = 19/105 (18%)

Query: 152 AVIGRPVNFHGRGGEDSNQQAENILRRAATRAGFRHLEFQFEPVAAGLEYEATLTEDKTV 211
++ PV ++ +R +A AG R + EP+AA + ++E
Sbjct: 110 VLVCVPVGA------TQVER--RAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGS 161

Query: 212 LVVDIGGGTTDCSLIQMGPSWRGKADRTQSLIAHTGQRVGGNDLD 256
+VVDIGGGTT+ ++I + ++ + R+GG+ D
Sbjct: 162 MVVDIGGGTTEVAVISLN-----------GVVYSSSVRIGGDRFD 195


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_1391   TCRTETB  121    5e-32    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 121 bits (304), Expect = 5e-32
Identities = 98/419 (23%), Positives = 169/419 (40%), Gaps = 21/419 (5%)

Query: 27 LGSLEKSIVTTPLALIGQDLSA-GTALTWVITAYLLAATAVLPVYGKLSDLFGRVRMLNI 85
L + ++ L I D + + WV TA++L + VYGKLSD G R+L
Sbjct: 25 FSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLF 84

Query: 86 SIGIFIVGSAMCTFA-VDLPTLIGARVVQGIGGGGLIALAFTVIADSIPAREVGKYQGYI 144
I I GS + LI AR +QG G AL V+A IP GK G I
Sbjct: 85 GIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144

Query: 145 SAVYAVSSVAGPLLGGYFADHLSWRWVFGINLPLGMVALYMVNRHLRHLNQKRHSRFDWL 204
++ A+ GP +GG A ++ W ++ +P+ + L + FD
Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVRIKGHFDIK 202

Query: 205 GAGLLMLTTTLLLLQLSSHSFLPAGWGAFALLLCLVLLILVERQ--VSDPILPARLARLP 262
G L+ + +L +S+S F ++ L LI V+ V+DP + L +
Sbjct: 203 GIILMSVGIVFFMLFTTSYSIS------FLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNI 256

Query: 263 SYLTAIGLIMASQMLMFALLVYMPLQLQWQKGFSPSQSGT-VMVIFMFSITTGAYLGGKW 321
++ + + + +P ++ S ++ G+ ++ S+ Y+GG
Sbjct: 257 PFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGIL 316

Query: 322 VARSGRYK------ALVVSGFLLAAVAIWQIHYDLWVHLSLGIGGLGLGFTLPSLNVVVQ 375
V R G + FL A+ + ++ + + GL FT ++ +V
Sbjct: 317 VDRRGPLYVLNIGVTFLSVSFLTASFLLETTS--WFMTIIIVFVLGGLSFTKTVISTIVS 374

Query: 376 SVLPARDRGIGMSLFNFGRELGGALGVAFCSALFYLRVPQSVTVSEHGSQASSVTPDVL 434
S L ++ G GMSL NF L G+A L + + + Q++ + ++L
Sbjct: 375 SSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYLYSNLL 433


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VC_1392   TACYTOLYSIN  30     0.019    Bacterial thiol-activated pore-forming cytolysin sig...
		>TACYTOLYSIN#Bacterial thiol-activated pore-forming cytolysin signature.

Length = 574

Score = 30.3 bits (68), Expect = 0.019
Identities = 9/29 (31%), Positives = 16/29 (55%)

Query: 120 QHDIVWHEFPYAAVIRGAQTRKNWDEHWQ 148
Q++I+W E Y + T++ WD +W
Sbjct: 479 QYEILWDEINYDDKGKEVITKRRWDNNWY 507


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_1397   PF06580  33     0.005    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.9 bits (75), Expect = 0.005
Identities = 11/48 (22%), Positives = 19/48 (39%), Gaps = 8/48 (16%)

Query: 490 LIRNALDHGIESPDIRREQGKNPTGKITLSAFTLDDSVIIKMSDDGKG 537
L+ N + HGI GKI L + +V +++ + G
Sbjct: 263 LVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEVENTGSL 302


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_1398   HTHFIS  68     1e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 68.3 bits (167), Expect = 1e-16
Identities = 25/113 (22%), Positives = 45/113 (39%), Gaps = 2/113 (1%)

Query: 4 KVMVVDDASTVRMYHKALLEEIGIFILEASNGVEALERALEMPVDLFLVDINMPKMDGFT 63
++V DD + +R L G + SN DL + D+ MP + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 LVREIRCRPELAGIPTVMISTESQESDRQQGIHMGANLYMVKPVNPEELQQTV 116
L+ I+ +P +++S ++ + GA Y+ KP + EL +
Sbjct: 65 LLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_1401   HTHFIS  72     7e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.8 bits (176), Expect = 7e-16
Identities = 27/104 (25%), Positives = 52/104 (50%), Gaps = 4/104 (3%)

Query: 2 KILVVDDSALMRTTISDILQNIPNAEIKTARDGMDAIDKVMKWQPDVMTLDINMPNMDGL 61
ILV DD A +RT ++ L + +++ + + D++ D+ MP+ +
Sbjct: 5 TILVADDDAAIRTVLNQAL-SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 TCLTQIMVERP-LPIVMLSSLTHEGAITTLEALYLGAVDFVAKP 104
L +I RP LP++++S+ +T ++A GA D++ KP
Sbjct: 64 DLLPRIKKARPDLPVLVMSA--QNTFMTAIKASEKGAYDYLPKP 105


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_1402   BINARYTOXINB  30     0.027    Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 30.0 bits (67), Expect = 0.027
Identities = 15/96 (15%), Positives = 35/96 (36%), Gaps = 7/96 (7%)

Query: 113 DPEMIKPTNTAGGKIDPTLITNTIHSDNQIIQVLNCHRLLDNSVEEELELSTQRFNHLSS 172
P ++ + T I + + I S+NQ Q + +E +T NH++
Sbjct: 60 APMVVTSSTTGDLSIPSSEL-ENIPSENQYFQSAIWSGFIKVKKSDEYTFATSADNHVTM 118

Query: 173 QSHFGDVIDEERDDDEDMRQLVCCMVDGQEYAFPLE 208
+D++ ++ + G+ Y ++
Sbjct: 119 W------VDDQEVINKASNSNKIRLEKGRLYQIKIQ 148


14  VC_1414  VC_1441  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_1414   -1      14       -3.027824  thermostable carboxypeptidase 1
VC_1415   1       18       -4.056886  hcp protein
VC_1416   1       19       -4.355368  vgrG protein
VC_1417   2       24       -6.135968  hypothetical protein
VC_1418   2       24       -6.166712  hypothetical protein
VC_1419   2       17       -4.554096  hypothetical protein
VC_1420   1       16       -3.116917  hypothetical protein
VC_1421   0       16       -2.443981  conserved hypothetical protein
VC_1422   -1      18       -2.456980  sodium/alanine symporter
VC_1423   1       20       -3.638549  hypothetical protein
VC_1424   -1      20       -3.630835  spermidine/putrescine ABC transporter,
VC_1425   -1      18       -3.839495  spermidine/putrescine ABC transporter,
VC_1426   -2      13       -3.997099  spermidine/putrescine ABC transporter, permease
VC_1427   -2      11       -4.087681  spermidine/putrescine ABC transporter, permease
VC_1428   -1      13       -3.776897  spermidine/putrescine ABC transporter,
VC_1429   -1      12       -4.048502  hypothetical protein
VC_1430   0       13       -3.050333  bax protein, putative
VC_1431   0       13       -2.879972  hypothetical protein
VC_1432   -1      15       -2.174823  conserved hypothetical protein
VC_1433   -1      16       -2.151496  conserved hypothetical protein
VC_1434   -2      16       -2.210930  fumarate and nitrate reduction regulatory
VC_1435   -1      17       -1.881954  conserved hypothetical protein
VC_1436   -1      19       -2.591760  FixS-related protein
VC_1437   1       19       -2.027122  cation transport ATPase, E1-E2 family
VC_1438   3       22       -2.262752  hypothetical protein
VC_1439   2       20       -2.700606  cytochrome c oxidase, subunit CcoP
VC_1440   2       21       -4.874766  cytochrome c oxidase, subunit CcoQ
VC_1441   0       16       -3.092269  cytochrome c oxidase, subunit CcoO
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_1416   PF03544  39     6e-05    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 39.2 bits (91), Expect = 6e-05
Identities = 19/104 (18%), Positives = 28/104 (26%), Gaps = 11/104 (10%)

Query: 639 PALPGGLEPA-------VALAPPQTISYQALLQAEQANVPAVKVCPLAAQEATPAVNS-- 689
LP +P L PPQ + E P P +EA +
Sbjct: 41 IELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPI--PEPPKEAPVVIEKPK 98

Query: 690 ITPPPPPPIAPPMAPPQPIMNPQPTANAQPNLGRSTKATPDFPT 733
P P P + P+ + P + A P +
Sbjct: 99 PKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTA 142


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_1423   cloacin  30     0.010    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.1 bits (67), Expect = 0.010
Identities = 18/63 (28%), Positives = 31/63 (49%), Gaps = 5/63 (7%)

Query: 80 QNQEEAWQHKMD-----QALEKQREEWQAEAVQRDKYVAELEKQNLNLEQQLNEQKMALE 134
Q++E Q + D +A E+ E +AE Q ++ VA +++ Q N +K L+
Sbjct: 300 QDEENRRQQEWDATHPVEAAERNYERARAELNQANEDVARNQERQAKAVQVYNSRKSELD 359

Query: 135 LAN 137
AN
Sbjct: 360 AAN 362


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VC_1424   MALTOSEBP  28     0.048    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 28.2 bits (62), Expect = 0.048
Identities = 31/109 (28%), Positives = 48/109 (44%), Gaps = 9/109 (8%)

Query: 1 MKNKLFASALCAAAL----FTTNAMAKDQELYFYNWSEYIP-----SEVLEDFTKETGIK 51
MK K A L +AL F+ +A+AK +E W +EV + F K+TGIK
Sbjct: 1 MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK 60

Query: 52 VIYSTYESNESMYAKLKTQGAGYDLVVPSTYFVSKMRKEGMLQEIDHSK 100
V + E + ++ G G D++ + + G+L EI K
Sbjct: 61 VTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDK 109


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits      Score  E-value  Comments
VC_1425   MYCMG045  40     9e-06    Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 40.5 bits (94), Expect = 9e-06
Identities = 26/94 (27%), Positives = 45/94 (47%), Gaps = 4/94 (4%)

Query: 34 SACALSLLSGTAAAEDKELVFMNWGPYINSGILEQFTKETGIKVIYSTYESNETLYAKLK 93
+ +SL S ++ V N+ YI+ +LE+ + + + TY SNE L
Sbjct: 10 FSLFVSLSSILSSCGSTTFVLANFESYISPLLLER--VQEKHPLTFLTYPSNEKLINGFA 67

Query: 94 THNQGYDLVVPSTYFVAKMRDEGMLQKIDKSKLS 127
N Y + V STY V+++ + +L ID S+ +
Sbjct: 68 --NNTYSVAVASTYAVSELIERDLLSPIDWSQFN 99


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_1430   FLGFLGJ  29     0.025    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 28.5 bits (63), Expect = 0.025
Identities = 15/34 (44%), Positives = 19/34 (55%), Gaps = 4/34 (11%)

Query: 134 VPEALVLTQAANESAWGTSRFAKE----ANNYFG 163
VP L+L QAA ES WG + +E + N FG
Sbjct: 169 VPHHLILAQAALESGWGQRQIRRENGEPSYNLFG 202


15  VC_1452  VC_1486  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_1452   6       33       -7.445087  RstC protein
VC_1453   3       27       -4.844479  RstB1 protein
VC_1454   4       28       -4.938306  RstA1 protein
VC_1455   5       27       -3.894152  transcriptional repressor RstR
VC_1456   3       26       -3.177454  cholera enterotoxin, B subunit
VC_1457   3       24       -2.080501  cholera enterotoxin, A subunit
VC_1458   1       25       -0.878016  zona occludens toxin
VC_1459   3       26       -2.361074  accessory cholera enterotoxin
VC_1460   2       24       -1.918216  hypothetical protein
VC_1461   0       21       -2.971996  colonization factor
VC_1462   -1      20       -2.054330  RstB2 protein
VC_1463   -1      23       -1.839495  RstA2 protein
VC_1464   1       24       -0.551045  transcriptional repressor RstR
VC_1465   2       25       -0.042878  hypothetical protein
VC_1466   2       27       -0.135791  hypothetical protein
VC_1467   3       25       0.373779   hypothetical protein
VC_1468   2       28       -0.177878  conserved hypothetical protein
VC_1469   2       28       0.188658   phage replication protein Cri
VC_1471   2       26       -0.616346  hypothetical protein
VC_1472   1       22       -0.922194  hypothetical protein
VC_1473   1       22       -1.767248  hypothetical protein
VC_1474   1       23       -2.735449  conserved hypothetical protein
VC_1475   0       22       -3.497742  phage replication protein Cri
VC_1477   3       25       -5.463972  transposase OrfAB, subunit A
VC_1478   1       24       -5.273921  transposase OrfAB, subunit B
VC_1479   3       23       -5.500294  hypothetical protein
VC_1480   0       21       -4.111306  hypothetical protein
VC_1481   0       20       -3.854612  conserved hypothetical protein
VC_1482   -1      20       -3.639957  ATP-dependent protease LA-related protein
VC_1483   -1      18       -2.881124  3-hydroxydecanoyl-(acyl-carrier-protein)
VC_1484   -1      18       -3.044708  ribosome modulation factor
VC_1485   -1      17       -3.088744  hypothetical protein
VC_1486   0       16       -3.203618  ABC transporter, ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_1456   ENTEROTOXINB  239    7e-86    Heat labile enterotoxin B chain signature.
		>ENTEROTOXINB#Heat labile enterotoxin B chain signature.

Length = 124

Score = 239 bits (612), Expect = 7e-86
Identities = 124/124 (100%), Positives = 124/124 (100%)

Query: 1 MIKLKFGVFFTVLLSSAYAHGTPQNITDLCAEYHNTQIYTLNDKIFSYTESLAGKREMAI 60
MIKLKFGVFFTVLLSSAYAHGTPQNITDLCAEYHNTQIYTLNDKIFSYTESLAGKREMAI
Sbjct: 1 MIKLKFGVFFTVLLSSAYAHGTPQNITDLCAEYHNTQIYTLNDKIFSYTESLAGKREMAI 60

Query: 61 ITFKNGAIFQVEVPGSQHIDSQKKAIERMKDTLRIAYLTEAKVEKLCVWNNKTPHAIAAI 120
ITFKNGAIFQVEVPGSQHIDSQKKAIERMKDTLRIAYLTEAKVEKLCVWNNKTPHAIAAI
Sbjct: 61 ITFKNGAIFQVEVPGSQHIDSQKKAIERMKDTLRIAYLTEAKVEKLCVWNNKTPHAIAAI 120

Query: 121 SMAN 124
SMAN
Sbjct: 121 SMAN 124


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_1457   ENTEROTOXINA  471    e-173    Heat-labile enterotoxin A chain signature.
		>ENTEROTOXINA#Heat-labile enterotoxin A chain signature.

Length = 258

Score = 471 bits (1214), Expect = e-173
Identities = 206/258 (79%), Positives = 232/258 (89%)

Query: 1 MVKIIFVFFIFLSSFSYANDDKLYRADSRPPDEIKQSGGLMPRGQSEYFDRGTQMNINLY 60
M I F+FFI L+S YAN D+LYRADSRPPDEIK+SGGLMPRG +EYFDRGTQMNINLY
Sbjct: 1 MKNITFIFFILLASPLYANGDRLYRADSRPPDEIKRSGGLMPRGHNEYFDRGTQMNINLY 60

Query: 61 DHARGTQTGFVRHDDGYVSTSISLRSAHLVGQTILSGHSTYYIYVIATAPNMFNVNDVLG 120
DHARGTQTGFVR+DDGYVSTS+SLRSAHL GQ+ILSG+STYYIYVIATAPNMFNVNDVLG
Sbjct: 61 DHARGTQTGFVRYDDGYVSTSLSLRSAHLAGQSILSGYSTYYIYVIATAPNMFNVNDVLG 120

Query: 121 AYSPHPDEQEVSALGGIPYSQIYGWYRVHFGVLDEQLHRNRGYRDRYYSNLDIAPAADGY 180
YSPHP EQEVSALGGIPYSQIYGWYRV+FGV+DE+LHRNR YRDRYY NL+IAPA DGY
Sbjct: 121 VYSPHPYEQEVSALGGIPYSQIYGWYRVNFGVIDERLHRNREYRDRYYRNLNIAPAEDGY 180

Query: 181 GLAGFPPEHRAWREEPWIHHAPPGCGNAPRSSMSNTCDEKTQSLGVKFLDEYQSKVKRQI 240
LAGFPP+H+AWREEPWIHHAP GCGN+ R+ +TC+E+TQ+L +L EYQSKVKRQI
Sbjct: 181 RLAGFPPDHQAWREEPWIHHAPQGCGNSSRTITGDTCNEETQNLSTIYLREYQSKVKRQI 240

Query: 241 FSGYQSDIDTHNRIKDEL 258
FS YQS++D +NRI+DEL
Sbjct: 241 FSDYQSEVDIYNRIRDEL 258


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_1486   PF05272  30     0.050    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.050
Identities = 12/32 (37%), Positives = 16/32 (50%), Gaps = 4/32 (12%)

Query: 337 GFSFNIMRGDRIALIGPNGCGKSTLLKILLGD 368
G F+ + L G G GKSTL+ L+G
Sbjct: 592 GCKFDYS----VVLEGTGGIGKSTLINTLVGL 619


16  VC_1567  VC_1573  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_1567   2       19       1.736920   conserved hypothetical protein
VC_1568   2       19       1.960859   ABC transporter, ATP-binding protein
VC_1569   3       20       1.541541   hypothetical protein
VC_1570   3       20       1.321404   quinol oxidase, subunit II
VC_1571   2       20       0.684650   quinol oxidase, subunit I
VC_1572   2       18       -0.337272  hypothetical protein
VC_1573   2       19       -0.668853  fumarate hydratase, class II
17  VC_1747  VC_1808  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_1747   1       19       -3.347518  hypothetical protein
VC_1748   0       17       -3.061474  hypothetical protein
VC_1749   0       17       -3.130837  hypothetical protein
VC_1750   1       18       -2.784381  hypothetical protein
VC_1751   -1      15       -1.411160  conserved hypothetical protein
VC_1752   -1      16       1.126317   hypothetical protein
VC_1753   -1      14       0.481493   paraquat-inducible protein A
VC_1754   -1      11       -1.151219  paraquat-inducible protein B
VC_1755   -1      12       -1.366021  conserved hypothetical protein
VC_1756   -1      14       -2.355632  periplasmic linker protein, putative
VC_1757   0       19       -3.361670  transporter, AcrB/D/F family
VC_1758   3       33       -6.940356  *integrase, phage family
VC_1760   3       29       -6.688972  helicase, putative
VC_1761   3       26       -6.211379  hypothetical protein
VC_1762   2       24       -5.731537  hypothetical protein
VC_1763   2       22       -5.723999  chemotaxis protein MotB-related protein
VC_1764   2       19       -5.327445  hypothetical protein
VC_1765   1       20       -5.250386  type I restriction enzyme HsdR, putative
VC_1766   2       20       -5.658193  conserved hypothetical protein
VC_1767   1       19       -5.606399  conserved hypothetical protein
VC_1768   2       22       -6.499162  conserved hypothetical protein
VC_1769   2       22       -6.367183  DNA methylase HsdM, putative
VC_1770   2       28       -6.702756  hypothetical protein
VC_1771   1       29       -6.570125  hypothetical protein
VC_1772   2       30       -5.892457  hypothetical protein
VC_1773   2       32       -6.365798  conserved hypothetical protein
VC_1774   0       27       -4.811054  conserved hypothetical protein
VC_1775   -1      25       -4.950606  conserved hypothetical protein
VC_1776   -2      23       -4.408566  N-acetylneuraminate lyase, putative
VC_1777   -2      22       -4.054410  conserved hypothetical protein
VC_1778   -1      23       -4.187179  conserved hypothetical protein
VC_1779   0       22       -3.542990  C4-dicarboxylate-binding periplasmic protein
VC_1780   0       23       -3.532656  hypothetical protein
VC_1781   0       23       -2.656367  conserved hypothetical protein
VC_1782   1       22       -2.867511  ROK family protein
VC_1783   2       24       -3.117568  N-acetylglucosamine-6-phosphate deacetylase
VC_1784   2       24       -3.195673  neuraminidase
VC_1785   5       31       -3.510547  transcriptional regulator
VC_1786   5       29       -3.718113  DNA repair protein RadC, putative
VC_1787   5       31       -4.495929  hypothetical protein
VC_1788   4       31       -5.039964  hypothetical protein
VC_1789   3       28       -5.890203  transposase OrfAB, subunit B
VC_1790   3       31       -6.848754  transposase OrfAB, subunit A
VC_1791   3       30       -7.255779  conserved hypothetical protein
VC_1792   4       30       -8.687979  conserved hypothetical protein
VC_1793   4       28       -7.298325  hypothetical protein
VC_1794   2       25       -4.599842  hypothetical protein
VC_1795   2       25       -3.819653  transcriptional regulator, putative
VC_1796   1       25       -4.167449  middle operon regulator-related protein
VC_1797   1       24       -4.088955  hypothetical protein
VC_1798   1       25       -4.048502  eha protein
VC_1799   1       23       -3.823262  hypothetical protein
VC_1800   3       25       -5.723138  hypothetical protein
VC_1801   3       25       -6.021399  hypothetical protein
VC_1802   5       29       -8.545020  hypothetical protein
VC_1803   5       26       -7.955472  hypothetical protein
VC_1804   4       23       -7.755886  hypothetical protein
VC_1805   4       24       -7.550167  hypothetical protein
VC_1806   3       23       -7.518601  hypothetical protein
VC_1808   3       26       -8.601399  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VC_1748   IGASERPTASE  30     0.011    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.0 bits (67), Expect = 0.011
Identities = 12/35 (34%), Positives = 18/35 (51%), Gaps = 4/35 (11%)

Query: 235 QTSWNPLVGM----QYQLNDSWYLLGEFGFGDRQS 265
TS N L + +Y ++ WYL + G+G QS
Sbjct: 1352 ATSKNTLAQVNFYSKYYADNHWYLGIDLGYGKFQS 1386


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VC_1756   RTXTOXIND  49     2e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 48.7 bits (116), Expect = 2e-08
Identities = 29/166 (17%), Positives = 58/166 (34%), Gaps = 6/166 (3%)

Query: 77 SGRLTDIAVKEGDQVKKGQLLASLDSRDAKTALEAAQLELKNTEQEYRRAKAIFEKTQAI 136
+ + +I VKEG+ V+KG +L L + A+ Q L E R + + +I
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR---SI 160

Query: 137 SKAELDKVTNRYDLAKNRVEEAKRKLEYTQITAPFDGVISEKTIENFAQVQANQVIMILQ 196
+L ++ + V E + + I F ++K ++ ++
Sbjct: 161 ELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQ---KELNLDKKRAERL 217

Query: 197 DLNDLEVAIEIPHRVMLSGVRNTRAIAELSAIPNQQFDLKLRTYST 242
+ E RV S + + ++ AI + Y
Sbjct: 218 TVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVE 263



Score = 35.6 bits (82), Expect = 3e-04
Identities = 28/196 (14%), Positives = 68/196 (34%), Gaps = 19/196 (9%)

Query: 95 QLLASLDSRDAKTALEAAQLE--LKNTEQEYRRAKAIFEKTQAISKAELDKVTNRYDLAK 152
+ + Q+E + + ++EY+ +F+ +L + T+ L
Sbjct: 259 NKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL---DKLRQTTDNIGLLT 315

Query: 153 NRVEEAKRKLEYTQITAPFDGVISE-KTIENFAQVQANQVIMILQDLND-LEVAIEIPHR 210
+ + + + + + I AP + + K V + +M++ +D LEV + ++
Sbjct: 316 LELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNK 375

Query: 211 VMLSGVRNTRAIAELSAIPNQQF---DLKLRTYSTQPSSDSQTYSVVL---------GFE 258
+ AI ++ A P ++ K++ + D + V
Sbjct: 376 DIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLST 435

Query: 259 DLKGFRVMPGMSAKVI 274
K + GM+
Sbjct: 436 GNKNIPLSSGMAVTAE 451


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_1757   ACRIFLAVINRP  471    e-152    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 471 bits (1214), Expect = e-152
Identities = 244/1049 (23%), Positives = 445/1049 (42%), Gaps = 60/1049 (5%)

Query: 3 IARYTLAKRTSVWVLIALTLIGGYISYLKLGRFEDPEFVIRQAVIVTPYPGATAQEVSDE 62
+A + + + WVL + ++ G ++ L+L + P + YPGA AQ V D
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 VTDVIEGAVQALQELKEVKSVSMQ-GRSEVTVEIKLEFAKSSAQLQQVWDKLRRKVADAQ 121
VT VIE + + L + S S G +T+ + AQ QV +KL+ A
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQ-VQVQNKLQL----AT 115

Query: 122 RQLPPGA-GASIVNDDFSDVYALFYAV--TGEGFSDKQLQDYVD-TLRRELVLVPGVAKA 177
LP I + S Y + G + + DYV ++ L + GV
Sbjct: 116 PLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDV 175

Query: 178 ATLAEQQEAIFIEMSSERMAEFGLSVERVLQVLQKQSLVTVAGSVDA------QQMRIPV 231
L Q A+ I + ++ + ++ L+ V+ L+ Q+ AG + QQ+ +
Sbjct: 176 -QLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASI 234

Query: 232 IPKSNISSLADLTNLQVAVGSNNAVVRLGDIANISRGYTEPASMLMRYNGQRAIGFGISN 291
I ++ + + + + V S+ +VVRL D+A + G E +++ R NG+ A G GI
Sbjct: 235 IAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELG-GENYNVIARINGKPAAGLGIKL 293

Query: 292 VTGGNVVEMGDAVKARLAELESQRPLGMDLHVISMQSDSVRASVANFIDNLIAAVAIVFV 351
TG N ++ A+KA+LAEL+ P GM + + V+ S+ + L A+ +VF+
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353

Query: 352 VLLLFMG-VRSGVIIGFVLLLTVAGTLCVMLIDDIAMQRISLGALIIALGMLVDNAIVVT 410
V+ LF+ +R+ +I + + + GT ++ ++ +++ +++A+G+LVD+AIVV
Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413

Query: 411 DGVLVRFQQEPNADKQQVVSEVVNATKWPLLGGTVVGIFAFSAIGLSPSDMGEYAGSLFW 470
+ V R E ++ + ++ + L+G +V F + G
Sbjct: 414 ENV-ERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSI 472

Query: 471 VILYSMFLSWVFAVTVTPMLCHDFLRVKAPTKEAKPSK-----------LVTGYKAVLQW 519
I+ +M LS + A+ +TP LC L+ + V Y +
Sbjct: 473 TIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGK 532

Query: 520 VLSHRVVSCAMLLGTLVAAVWGAQFIPPGFMPESQRPQFVVDVYLPQGSDIRRTEQVVAS 579
+L + + V +P F+PE + F+ + LP G+ RT++V+
Sbjct: 533 ILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQ 592

Query: 580 IEKDVTQKDGITNITSFIGGGGLRFMLTYSPEARNPSYGQL-LIDIDDYTKIAPLVGELQ 638
+ D K+ N+ S G F S +A+N + L ++ +
Sbjct: 593 VT-DYYLKNEKANVESVFTVNGFSF----SGQAQNAGMAFVSLKPWEERNGDENSAEAVI 647

Query: 639 NELDAKY---PDASIKVWKFM----LGRGGGKKIE-AGFKGPDSHVLRQLAEQ-AKAIMH 689
+ + D + + LG G E G L Q Q
Sbjct: 648 HRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQ 707

Query: 690 NDPNLIAVQDDWRQQVPVLQPVYSAQEAQRLGLTTQEISAAIAQTLNGRNVGVYREGNDL 749
+ +L++V+ + + + ++AQ LG++ +I+ I+ L G V + + +
Sbjct: 708 HPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRV 767

Query: 750 IPLMVRAPENERHHERAIENSEVFSAQAGRYIPVSQLVDSVDTVYQDALLRRINRMPTIL 809
L V+A R ++ V SA G +P S VY L R N +P++
Sbjct: 768 KKLYVQADAKFRMLPEDVDKLYVRSA-NGEMVPFSAFTT-SHWVYGSPRLERYNGLPSME 825

Query: 810 VQADPAPGVMTADAFNNVREKIEQI--ELPAGYELIWYGEYKASKDANEGLALSAPYGFA 867
+Q + APG + DA +E + +LPAG W G + + F
Sbjct: 826 IQGEAAPGTSSGDA----MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFV 881

Query: 868 AMILAVVFMFNALRQPLVIWMTAPFAVVGVTIGLIAFQTPFEFMAILGFLSLIGMMVKNA 927
+ L + ++ + P+ + + P +VGV + F + ++G L+ IG+ KNA
Sbjct: 882 VVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNA 941

Query: 928 IVLVDQA-DAEIRAGKEAYFAIIDAAVSRARPVLLGAFTTILGVAPLLVDP-----FFKS 981
I++V+ A D + GK A + A R RP+L+ + ILGV PL + +
Sbjct: 942 ILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNA 1001

Query: 982 MAVTIMFGLLFATILTLVVIPLFYAVLFR 1010
+ + +M G++ AT+L + +P+F+ V+ R
Sbjct: 1002 VGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_1763  OMPADOMAIN  30  0.007  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 30.3 bits (68), Expect = 0.007
Identities = 21/78 (26%), Positives = 34/78 (43%), Gaps = 15/78 (19%)

Query: 98 STLTFASGKSSIPNDATIQQAVKDIGVVLHSAIQKKDRFQYLDTIFIEGHTDSDSIHYRG 157
S + F K+++ + Q A+ + L S + KD ++ + G+TD G
Sbjct: 219 SDVLFNFNKATLKPEG--QAALDQLYSQL-SNLDPKDG-----SVVVLGYTDR-----IG 265

Query: 158 KG--NWGLSTDRAISVWN 173
N GLS RA SV +
Sbjct: 266 SDAYNQGLSERRAQSVVD 283


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_1764  TYPE4SSCAGA  32  0.008  Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 32.4 bits (73), Expect = 0.008
Identities = 61/278 (21%), Positives = 106/278 (38%), Gaps = 37/278 (13%)

Query: 420 SSMESTIKDLVENVSAQSQVLTDFVQNQVVQLTQTFSERDGMA--SQMEKERNDIFVNQT 477
S +E+++KD++ N Q +TD V N L Q S S++E+ D
Sbjct: 765 SDLENSVKDVIIN-----QKVTDKVDN----LNQAVSVAKATGDFSRVEQALAD------ 809

Query: 478 QAMKAGTDELLAQVKAATESQQITTNSIIEQGKQLQNSIDSS-VSASARATESMQQSANE 536
+K + E LAQ ES S I Q ++N ++ + V E+ S N
Sbjct: 810 --LKNFSKEQLAQQAQKNESLNARKKSEIYQ--SVKNGVNGTLVGNGLSQAEATTLSKNF 865

Query: 537 LRVAADSMNVFGSNIKDAGNKLSGAVTEAVNSTKDLAEQNHLG-------AVKVQALREQ 589
+ + G+ + N L A + K + L A KV A ++
Sbjct: 866 SDIKKELNAKLGNFNNNNNNGLKNEPIYAKVNKKKAGQAASLEEPIYAQVAKKVNAKIDR 925

Query: 590 LLEDTSKFSAIADQINNMLISAEQ--SFSTLRTTQNEFLAEQKGNLNELTSEMKRNVKEL 647
L + S + L ++ S + ++N+ LA++ NLN+ SE K
Sbjct: 926 LNQIASGLGVVGQAAGFPLKRHDKVDDLSKVGLSRNQELAQKIDNLNQAVSEAKAGFFGN 985

Query: 648 TEQMAQLLEDYAEQANGQTAEHLKVWANSSTQYAESMN 685
EQ L+D + + +W S+ + S++
Sbjct: 986 LEQTIDKLKDSTKH------NPMNLWVESAKKVPASLS 1017


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_1782  PF03309  28  0.034  Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 28.2 bits (63), Expect = 0.034
Identities = 8/32 (25%), Positives = 18/32 (56%), Gaps = 4/32 (12%)

Query: 4 LAIDIGGTKIALAIV----EEGTIIQRYQIAT 31
LAID+ T + ++ + ++Q+++I T
Sbjct: 3 LAIDVRNTHTVVGLISGSGDHAKVVQQWRIRT 34


18  VC_1832  VC_1839  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
VC_1832        2       20  -1.924216  hypothetical protein
VC_1833        2       20  -2.054808  quinolinate synthetase A
VC_1834        2       21  -2.163517  conserved hypothetical protein
VC_1835        2       21  -2.560835  peptidoglycan-associated lipoprotein
VC_1836        0       20  -2.609199  tolB protein
VC_1837        3       20  -2.523421  tolA protein
VC_1838        2       18  -2.529820  tolR membrane protein
VC_1839        2       18  -1.773918  tolQ protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_1835  OMPADOMAIN  103  3e-29  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 103 bits (259), Expect = 3e-29
Identities = 35/124 (28%), Positives = 57/124 (45%), Gaps = 4/124 (3%)

Query: 49 NAQGQLTEQELKEQALRENQTIYFAFDNATIASDYEAMLAAHAAYL--VKNPSLRVTIEG 106
A E++ + + F F+ AT+ + +A L + L + V + G
Sbjct: 200 VAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLG 259

Query: 107 HADERGTPEYNIALGERRAQAVAKYLEALGVQAGQLSIVSYGEEKPLVLGQSEEAYAKNR 166
+ D G+ YN L ERRAQ+V YL + G+ A ++S GE P V G + + K R
Sbjct: 260 YTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNP-VTGNTCD-NVKQR 317

Query: 167 RAVL 170
A++
Sbjct: 318 AALI 321


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_1837  IGASERPTASE  65  1e-13  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 65.1 bits (158), Expect = 1e-13
Identities = 36/197 (18%), Positives = 70/197 (35%), Gaps = 6/197 (3%)

Query: 72 AKKEQERLDKLRRESEQLEKNRQAEEERIRQLKEQQAKEAKAAREAEKLREQKEQERLAA 131
++ + + ++ES+ +EKN Q E Q +E AKEAK+ +A + Q
Sbjct: 1034 SETTETVAENSKQESKTVEKNEQDATETTAQNREV-AKEAKSNVKANTQTNEVAQSGSET 1092

Query: 132 EQKAREEKERAAKAEAERKVKEEAAKKAEQERVAKEAAAAKAEQQRIEREKEAKLAEEKA 191
++ E + A E E K K E K E +V + + ++ + E AE
Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSP-----KQEQSETVQPQAEPAR 1147

Query: 192 KREKEVAAKAEQERLAKEKAAKEAADKAKKEKERAAKAEAERKAQEAALNDIFGSLSEES 251
+ + V K Q + ++ A + E+ + + + + +
Sbjct: 1148 ENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATT 1207

Query: 252 QQNNAARQQFVTSEVGR 268
Q + R
Sbjct: 1208 QPTVNSESSNKPKNRHR 1224



Score = 63.9 bits (155), Expect = 4e-13
Identities = 37/196 (18%), Positives = 66/196 (33%), Gaps = 4/196 (2%)

Query: 76 QERLDKLRRESEQLEKNRQAEEERIRQLKEQQAKEAKAAREAEKLREQKEQERLAAEQKA 135
++R + + N QA+ + E+ A+ +A E AE
Sbjct: 986 EKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSK 1045

Query: 136 REEKE---RAAKAEAERKVKEEAAKKAEQERVAKEAAAAKAEQQRIEREKEAKLAEEKAK 192
+E K A E AK+A+ A A+ +E + +E A
Sbjct: 1046 QESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETAT 1105

Query: 193 REKEVAAKAEQERLAKEKAAKEAADKAKKEKERAAKAEAERKAQEAALNDIFGSLSEESQ 252
EKE AK E E+ + + K+E+ + +AE + +I S+ +
Sbjct: 1106 VEKEEKAKVETEKTQEV-PKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNT 1164

Query: 253 QNNAARQQFVTSEVGR 268
+ + TS
Sbjct: 1165 TADTEQPAKETSSNVE 1180



Score = 58.2 bits (140), Expect = 3e-11
Identities = 31/143 (21%), Positives = 50/143 (34%), Gaps = 3/143 (2%)

Query: 96 EEERIRQLKEQQAKEAKAAREAEKLREQKEQERLAAEQKAREEKERAAKAEAERKVKEEA 155
E E+ Q + +A+ E +A +A A + E
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043

Query: 156 AKKAEQERVAKEAAAAKAEQQRIEREKEAKLAEEKAKREKEVAAKAEQERLAKEKAAKEA 215
+K+ + E A + Q E KEAK + + EVA + + + KE
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103

Query: 216 ADKAKKEKERAAKAEAERKAQEA 238
A K+EK AK E E+ +
Sbjct: 1104 ATVEKEEK---AKVETEKTQEVP 1123



Score = 52.8 bits (126), Expect = 1e-09
Identities = 27/203 (13%), Positives = 64/203 (31%), Gaps = 4/203 (1%)

Query: 38 SDPEPTGQMIEAVVIDPQLVRQQAQQIRSQREEAAKKEQERLDKLRRE-SEQLEKNRQAE 96
S+ E ++ EA V P E +K+E + ++K ++ +E +NR+
Sbjct: 1012 SNNEEIARVDEAPVPPPAPATPSETT--ETVAENSKQESKTVEKNEQDATETTAQNREVA 1069

Query: 97 EERIRQLKEQQAKEAKAAREAEKLREQKEQERLAAEQKAREEKERAAKAEAERKVKEEAA 156
+E +K + + A+ + +E + E +EEK + + + K +
Sbjct: 1070 KEAKSNVKAN-TQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQ 1128

Query: 157 KKAEQERVAKEAAAAKAEQQRIEREKEAKLAEEKAKREKEVAAKAEQERLAKEKAAKEAA 216
+QE+ A+ ++ + + E ++ +
Sbjct: 1129 VSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTT 1188

Query: 217 DKAKKEKERAAKAEAERKAQEAA 239
+ Q
Sbjct: 1189 VNTGNSVVENPENTTPATTQPTV 1211



Score = 43.9 bits (103), Expect = 9e-07
Identities = 38/208 (18%), Positives = 67/208 (32%), Gaps = 4/208 (1%)

Query: 63 QIRSQREEAAKKEQERLDKLRRESEQLEKNRQAEEERIRQLKEQQAKEAKAAREAEKLRE 122
Q +E A +++E+ K+ E Q ++ ++ E +A+ ARE +
Sbjct: 1096 QTTETKETATVEKEEK-AKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVN 1154

Query: 123 QKEQERLAAEQKAREEKERAAKAEAERKVKEEAAKKAEQERVAKEAAAAKAEQQRIEREK 182
KE + E+ + + E+ V E V A Q +
Sbjct: 1155 IKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSE 1214

Query: 183 EAKLAEEKAKRE-KEVAAKAEQERLAKEKAAKEAADKAKKEKERAAKAEAERKAQEAALN 241
+ + + +R + V E + + A A ++A KAQ ALN
Sbjct: 1215 SSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALN 1274

Query: 242 DIFGSLSEESQ--QNNAARQQFVTSEVG 267
SQ NN + S
Sbjct: 1275 VGKAVSQHISQLEMNNEGQYNVWVSNTS 1302


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_1838  adhesinmafb  27  0.028  Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 27.3 bits (60), Expect = 0.028
Identities = 12/40 (30%), Positives = 19/40 (47%), Gaps = 1/40 (2%)

Query: 37 FVTQGVDVELP-KTHSAKSAQDLAGDSDSSFIIVEIDKEG 75
F G + P H+A SA + G+ D F + ++ EG
Sbjct: 98 FSGHGHEEHAPFDNHAADSASEEKGNVDEGFTVYRLNWEG 137


19  VC_1848  VC_1853  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
VC_1848        0       19  -4.485959  cysteinyl-tRNA synthetase
VC_1849        1       25  -5.756678  peptidyl-prolyl cis-trans isomerase B
VC_1850        0       23  -4.639198  conserved hypothetical protein
VC_1851        1       20  -3.871015  conserved hypothetical protein
VC_1852        0       21  -3.482394  conserved hypothetical protein
VC_1853        0       21  -3.246331  conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_1852  SECA  52  9e-11  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 52.2 bits (125), Expect = 9e-11
Identities = 17/21 (80%), Positives = 18/21 (85%)

Query: 131 KQGRNDPCACGSGKKYKKCCG 151
K GRNDPC CGSGKKYK+C G
Sbjct: 878 KVGRNDPCPCGSGKKYKQCHG 898


20  VC_1907  VC_1919  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
VC_1907       -1       14  -3.342091  cys regulon transcriptional activator
VC_1908       -2       14  -3.925761  hypothetical protein
VC_1909       -1       14  -3.630796  hypothetical protein
VC_1910        4       27  -2.540349  tRNA-(MS[2]IO[6]A)-hydroxylase
VC_1911        4       29  -1.625627  orotidine 5'-phosphate decarboxylase
VC_1912        6       30  -1.894293  conserved hypothetical protein
VC_1913        6       31  -1.817508  conserved hypothetical protein
VC_1914        7       33  -1.420615  integration host factor, beta subunit
VC_1915        5       24  -0.774521  ribosomal protein S1
VC_1916        2       15  -0.504686  cytidylate kinase
VC_1917        2       15  -0.765254  conserved hypothetical protein
VC_1918        3       21  -0.666751  peptidyl-prolyl cis-trans isomerase D
VC_1919        3       21  -0.463426  DNA-binding protein HU-beta
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_1914  DNABINDINGHU  118  1e-38  Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 118 bits (297), Expect = 1e-38
Identities = 34/89 (38%), Positives = 57/89 (64%), Gaps = 1/89 (1%)

Query: 2 TKSELIERLCAEQTHLSAKEIEDAVKNILEHMASTLEAGERIEIRGFGSFSLHYREPRVG 61
K +LI ++ AE T L+ K+ AV + ++S L GE++++ GFG+F + R R G
Sbjct: 3 NKQDLIAKV-AEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKG 61

Query: 62 RNPKTGDKVELEGKYVPHFKPGKELRERV 90
RNP+TG++++++ VP FK GK L++ V
Sbjct: 62 RNPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_1919  DNABINDINGHU  120  1e-39  Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 120 bits (303), Expect = 1e-39
Identities = 49/87 (56%), Positives = 63/87 (72%)

Query: 2 NKTQLVEQIAANADISKASAGRALDAFIEAVSGTLQSGDQVALVGFGTFSVRTRAARTGR 61
NK L+ ++A +++K + A+DA AVS L G++V L+GFG F VR RAAR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPKTGEEIKIAEAKVPSFKAGKALKDA 88
NP+TGEEIKI +KVP+FKAGKALKDA
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDA 89


21  VC_1968  VC_1985  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias    %GCBias  Product
VC_1968       -2       11   3.227666  transcriptional regulator, HTH_3 family
VC_1969       -1       12   3.940144  hypothetical protein
VC_1970       -1       11   4.362699  benzoate transport protein
VC_1971        0       13   4.519537  o-succinylbenzoic acid--CoA ligase
VC_1972       -1       14   3.942138  o-succinylbenzoate-CoA synthase
VC_1973       -1       13   3.551131  naphthoate synthase
VC_1974        1       15   3.487603  conserved hypothetical protein
VC_1975        0       16   3.558205  2-succinyl-6-hydroxy-2,
VC_1976        1       17   3.184376  menaquinone-specific isochorismate synthase
VC_1977        1       18   2.559122  aspartate aminotransferase, putative
VC_1978        2       15   1.991779  conserved hypothetical protein
VC_1979        1       13   2.075521  deoxyguanosinetriphosphate triphosphohydrolase
VC_1980        1       13   1.699608  conserved hypothetical protein
VC_1981        1       12   1.779252  hypothetical protein
VC_1982        0       12   1.556571  hypothetical protein
VC_1983       -1       12   1.816602  peptidase, putative
VC_1984       -1       13   2.682109  ribonuclease D
VC_1985       -1       14   3.021970  long-chain-fatty-acid--CoA ligase
22  VC_2284  VC_2327  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
VC_2284       -2       18  -3.105219  hypothetical protein
VC_2285       -2       13  -2.486398  GGDEF family protein
VC_2286        1       18  -2.091754  conserved hypothetical protein
VC_2287        2       22  -1.727973  DNA-damage-inducible protein P
VC_2288        2       30  -1.752969  conserved hypothetical protein
VC_2289        3       32  -1.158116  thiamin biosynthesis lipoprotein ApbE
VC_2290        4       35  -0.638275  NADH:ubiquinone oxidoreductase, Na
VC_2291        3       30  -0.464811  NADH:ubiquinone oxidoreductase, Na
VC_2292        2       23  -0.831710  NADH:ubiquinone oxidoreductase, Na
VC_2293        2       20  -0.730748  NADH:ubiquinone oxidoreductase, Na
VC_2294        0       17  -0.549578  NADH:ubiquinone oxidoreductase, Na
VC_2295        0       15  -0.965724  NADH:ubiquinone oxidoreductase, Na
VC_2296       -3       14  -1.089220  bolA protein
VC_2297       -2       12  -1.237773  conserved hypothetical protein
VC_2298       -2       14  -1.018578  lipoprotein, putative
VC_2299       -2       16  -1.449462  peptidyl-prolyl cis-trans isomerase A
VC_2300       -1       16  -2.400476  ampG protein, putative
VC_2301       -1       19  -3.465726  transcriptional activator, putative
VC_2302       -3       17  -2.866876  RNA polymerase sigma-70 factor, ECF subfamily
VC_2303       -1       17  -1.171572  conserved hypothetical protein
VC_2304        0       17   0.812683  hypothetical protein
VC_2305        1       16   1.631581  outer membrane protein OmpK
VC_2306        4       17   2.918810  hypothetical protein
VC_2307        2       14   2.738074  2-dehydropantoate 2-reductase
VC_2308        3       15   2.737417  4-methyl-5(B-hydroxyethyl)-thiazole
VC_2309        2       15   1.435231  aminotransferase, class V
VC_2310        2       17  -0.031251  conserved hypothetical protein
VC_2311       -1       14   0.182319  HesA/MoeB/ThiF family protein
VC_2312        0       13  -0.787779  membrane-bound lytic murein transglycosylase A
VC_2313        0       17  -2.011445  hypothetical protein
VC_2314        1       16   1.140555  hypothetical protein
VC_2315        3       20   3.658963  hypothetical protein
VC_2316        3       20   3.829109  N-acetylglutamate synthase
VC_2317        3       22   4.390190  hypothetical protein
VC_2318        3       23   4.777676  hypothetical protein
VC_2319        3       22   4.636193  exodeoxyribonuclease V, 67 kDa subunit
VC_2320        3       23   4.687384  exodeoxyribonuclease V, 135 kDa subunit
VC_2321        3       21   3.894431  hypothetical protein
VC_2322        3       21   3.784437  exodeoxyribonuclease V, 125 kDa subunit
VC_2323        2       18   2.524740  conserved hypothetical protein
VC_2324        2       19   2.524375  transcriptional regulator, LysR family
VC_2325        3       20   3.252946  hypothetical protein
VC_2326        3       20   2.690420  conserved hypothetical protein
VC_2327        2       17   2.201498  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2305  CHANNELTSX  81  6e-20  Nucleoside-specific channel-forming protein Tsx signa...
		>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx signature.

Length = 294

Score = 81.2 bits (200), Expect = 6e-20
Identities = 82/298 (27%), Positives = 131/298 (43%), Gaps = 36/298 (12%)

Query: 31 MRKSLLALS-LLAATSAPVLAADYSDGDIHKNDYKWMQFNLMGAFDEL--PGKSSHDYLE 87
M+K+LLA ++A ++ A +D + +D+ N++G++ P + YLE
Sbjct: 1 MKKTLLAAGAVVALSTTFAAGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLE 60

Query: 88 MEFGGRSGIFDLYGYVDVFNLATNKSSDK----VGDPKIFMKFAPRMSLDAFTGVDMSFG 143
E + FD YGY+D S+ K G P +FM+ PR S+D T D+SFG
Sbjct: 61 YEAFAKKDWFDFYGYIDAPVFFGGNSTAKGIWNKGSP-LFMEIEPRFSIDKLTNTDLSFG 119

Query: 144 PVQEVYVASLFEWDGTDFKTNAFSVNNQKIGLGSDVMVPWFGKVGLNLYGTYD------G 197
P +E Y A+ + +D + S +GLG+D+ + LN+Y Y
Sbjct: 120 PFKEWYFANNYIYDMGRNDSQEQS--TWYMGLGTDIDTGLPMSLSLNVYAKYQWQNYGAS 177

Query: 198 NRKDWNGFQISTNWFKPFYFFENGSFISYQGYIDYQFG--LKDEYSTASNGGA------- 248
N +W+G++ +F P GS +SY G+ ++ +G L D+ NG
Sbjct: 178 NENEWDGYRFKVKYFVPLTDLWGGS-LSYIGFTNFDWGSDLGDDNFYDLNGKHARTSNSI 236

Query: 249 ------MFNGIYWHSDRFAVGYGLKG-YKDVYGIKDTDG---FKSTGFGHYVAVTYKF 296
N +WH A + G + D + DG +STG+G Y V Y F
Sbjct: 237 ASSHILALNYAHWHYSIVARYFHNGGQWADDAKLNFGDGPFSVRSTGWGGYFVVGYNF 294


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2316  CARBMTKINASE  38  6e-05  Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 37.9 bits (88), Expect = 6e-05
Identities = 18/64 (28%), Positives = 27/64 (42%), Gaps = 11/64 (17%)

Query: 35 GKTMVVMLGGEAIAD-----------KNFPNIISDIALLHSLGVKVVLVHGARPQINQIL 83
GK +V+ LGG A+ N IA + + G +VV+ HG PQ+ +L
Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61

Query: 84 EKNQ 87

Sbjct: 62 LHMD 65


23  VC_2371  VC_2386  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
VC_2371        0       18   3.693173  conserved hypothetical protein
VC_2372        0       20   3.917474  hypothetical protein
VC_2373        0       20   4.058122  glutamate synthase, large subunit
VC_2374        1       19   4.043808  glutamate synthase, small subunit
VC_2375        0       19   4.006799  hypothetical protein
VC_2376        1       19   3.978453  glutamate synthase, large subunit
VC_2377        2       20   3.299592  glutamate synthase, small subunit
VC_2378        2       20   2.972454  conserved hypothetical protein
VC_2379        3       19   3.098074  MTA/SAH nucleosidase
VC_2380        4       19  -0.391667  cobalamin biosynthesis protein CbiB, putative
VC_2381        4       23  -5.905206  conserved hypothetical protein
VC_2382        3       28  -8.516914  conserved hypothetical protein
VC_2383        4       30  -9.410647  transcriptional regulator, LysR family
VC_2384       -1       21  -5.408275  conserved hypothetical protein
VC_2385        1       22  -5.207207  RNA-directed DNA polymerase
VC_2386       -1       18  -3.059398  conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2381  FERRIBNDNGPP  31  0.003  Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 31.5 bits (71), Expect = 0.003
Identities = 29/170 (17%), Positives = 60/170 (35%), Gaps = 17/170 (10%)

Query: 2 LVIRLIACTFLFITPSLLAKPFP------AERIISLAPHATEIAYAAGLGDKLVAVSEYS 55
L+ R T + ++P L RI++L E+ A LG V++
Sbjct: 6 LISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLA--LGIVPYGVADTI 63

Query: 56 DY------PPQALELERVANHQTINIEKILTLKPDLIIAWPAGNP-PRELAKL-RQLGFT 107
+Y PP + V N+E + +KP ++ P P LA++ GF
Sbjct: 64 NYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFN 123

Query: 108 IYDSQTKTLDEIADNIEALSHYSANPEVGQKAAHDFRQRLQDLRTQYASN 157
D + L ++ ++ + + ++ ++ ++
Sbjct: 124 FSDG-KQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKR 172


24  VC_2408  VC_2429  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
VC_2408        2       30   2.077258  cell division protein FtsL
VC_2409        4       39   2.265817  conserved hypothetical protein
VC_2410        5       40   2.222842  hypothetical protein
VC_2411        3       33   1.261928  hypothetical protein
VC_2412        2       29   1.735240  pyruvate dehydrogenase, E3 component, lipoamide
VC_2413        1       25   1.864733  pyruvate dehydrogenase, E2 component,
VC_2414       -1       18   1.453324  pyruvate dehydrogenase, E1 component
VC_2415       -2       11   0.908886  pyruvate dehydrogenase complex repressor
VC_2416       -1       13   1.228822  2',3'-cyclic-nucleotide 2'-phosphodiesterase,
VC_2417        0       18   3.211200  single-stranded-DNA-specific exonuclease RecJ
VC_2418        1       18   1.811299  thiol:disulfide interchange protein DsbC
VC_2419        1       20   2.631170  integrase/recombinase XerD
VC_2420        2       21   2.284451  flavodoxin 2
VC_2421        2       20   2.736469  ampD protein
VC_2422        2       19   2.393707  nicotinate-nucleotide pyrophosphorylase,
VC_2423        1       19   1.672039  fimbrial protein
VC_2424        1       23   2.600111  type IV pilus assembly protein PilB
VC_2425       -1       21   2.536832  type IV pilin biogenesis protein PilC
VC_2426       -1       20   3.406882  leader peptidase PilD
VC_2427       -1       18   3.199647  conserved hypothetical protein
VC_2428       -2       19   3.259129  conserved hypothetical protein
VC_2429       -2       18   3.219518  conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2413  RTXTOXIND  30  0.032  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.2 bits (68), Expect = 0.032
Identities = 19/57 (33%), Positives = 27/57 (47%), Gaps = 2/57 (3%)

Query: 32 DKVAEEQSLITVEGDKASMEVPASQAGIVKEIKVVAGDKVSTGSLIMVFEAEGAAAA 88
+ VA +T G S E+ + IVKEI V G+ V G +++ A GA A
Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEAD 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2423  BCTERIALGSPG  60  3e-14  Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 59.9 bits (145), Expect = 3e-14
Identities = 22/77 (28%), Positives = 42/77 (54%)

Query: 19 KNKQQKGFTLIELMIVVAVIGVLAAIAIPQYQNYVKKSAIGVGLANITALKTNIEDYIAT 78
+Q+GFTL+E+M+V+ +IGVLA++ +P +K+ +++I AL+ ++ Y
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62

Query: 79 EGSFPATTAGTAAGFTR 95
+P T G +
Sbjct: 63 NHHYPTTNQGLESLVEA 79


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2425  BCTERIALGSPF  447  e-158  Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 447 bits (1151), Expect = e-158
Identities = 114/408 (27%), Positives = 212/408 (51%), Gaps = 11/408 (2%)

Query: 9 LKNYRWKGINSNGKKVSGQMLAISEIEVRDKLKDQHI--------QIKKLKKGSVSLLAR 60
+ Y ++ +++ GKK G A S + R L+++ + + + K GS L R
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 61 LTHRVKSKDITILTRQLATMLTTGVPIVQALKLVGDNHRKAEMKSILAQITKSVEAGTPL 120
R+ + D+ +LTRQLAT++ +P+ +AL V K + ++A + V G L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 121 SKAMRTASAHFDTLYVDLVETGEMSGNLPEVFERLATYREKSEQLRAKVIKALIYPSMVV 180
+ AM+ F+ LY +V GE SG+L V RLA Y E+ +Q+R+++ +A+IYP ++
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 181 LVALGVSYLMLTMVIPEFESMFKGFGAELPWFTQQVLKLSHWVQAYSLWAFIAIAAAIFG 240
+VA+ V ++L++V+P+ F LP T+ ++ +S V+ + W +A+ A
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 241 LK-ALRKNSFQIRLKTSRLGLKFPIIGNVLAKASIAKFSRTLATSFAAGIPILASLKTTA 299
+ LR+ R+ R L P+IG + + A+++RTL+ A+ +P+L +++ +
Sbjct: 241 FRVMLRQEKR--RVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISG 298

Query: 300 KTSGNVHFETAINEVYRDTAAGMPMYIAMRNTEAFPEMVLQMVMIGEESGQLDDMLNKVA 359
N + ++ G+ ++ A+ T FP M+ M+ GE SG+LD ML + A
Sbjct: 299 DVMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAA 358

Query: 360 TIYEFEVDNTVDNLGKILEPLIIVFLGTVVGGLVVAMYLPIFNLMSVL 407
+ E + + + EPL++V + VV +V+A+ PI L +++
Sbjct: 359 DNQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2426  PREPILNPTASE  359  e-127  Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 359 bits (922), Expect = e-127
Identities = 172/287 (59%), Positives = 208/287 (72%), Gaps = 1/287 (0%)

Query: 1 MELFYFYPWLFPVLATLFGLIVGSFLNVVIYRLPKIMEREWRAECAASFPEYGITPPEGK 60
+EL + PWL+ L LF L++GSFLNVVI+RLP ++EREW+AE + F E
Sbjct: 5 LELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEPP 64

Query: 61 LTLSLPRSTCPHCQTPIRVIDNIPLLSWLALRGQCSHCKAPISARYPLIELLTALMSLVI 120
L +PRS CPHC PI ++NIPLLSWL LRG+C C+APISARYPL+ELLTAL+S+ +
Sbjct: 65 YNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVAV 124

Query: 121 ATHFPFGVFAVALLFFSYVLIAATFIDFDTLLLPDQLTLPLLWGGIALALLGFSPVSLSD 180
A G +A L ++VL+A TFID D +LLPDQLTLPLLWGG+ LLG VSL D
Sbjct: 125 AMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLG-GFVSLGD 183

Query: 181 AVIGAMAGYLSLWSIYWLFKLLTGKEGMGYGDFKLLAALGAWLGWQQLPVIVLLSSVVGV 240
AVIGAMAGYL LWS+YW FKLLTGKEGMGYGDFKLLAALGAWLGWQ LP+++LLSS+VG
Sbjct: 184 AVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGA 243

Query: 241 IFGLIQLRQQKKGIDMAFPFGPYLAIAGWFALLWGDKVIDWYFTTWV 287
G+ + + PFGPYLAIAGW ALLWGD + WY T ++
Sbjct: 244 FMGIGLILLRNHHQSKPIPFGPYLAIAGWIALLWGDSITRWYLTNFL 290


25  VC_2527  VC_2538  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
VC_2527        2       23   0.444679  conserved hypothetical protein
VC_2528        1       22   1.137955  ABC transporter, ATP-binding protein
VC_2529        1       22   1.981962  RNA polymerase sigma-54 factor
VC_2530        0       17   2.272708  sigma-54 modulation protein, putative
VC_2531       -1       18   2.772471  PTS system, nitrogen regulatory IIA component
VC_2532       -1       18   3.236495  conserved hypothetical protein
VC_2533       -1       16   3.692312  phosphocarrier protein NPr
VC_2534       -2       15   3.584725  magnesium transporter
VC_2535        0       15   3.311806  pmbA protein
VC_2536        2       17   4.027497  conserved hypothetical protein
VC_2537        1       16   3.854161  thiamine ABC transporter, ATP-binding protein,
VC_2538        0       15   3.483014  thiamine ABC transporter, permease protein,
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2534  SECYTRNLCASE  33  0.002  Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 32.8 bits (75), Expect = 0.002
Identities = 24/114 (21%), Positives = 40/114 (35%), Gaps = 10/114 (8%)

Query: 336 QTVALVIRGLALGHIGDSNKRELLMKEAAIGLLNGITWALIIGAIVVVWKGE----WMLG 391
Q LV + G + ++ + +I + + G VV+W GE +G
Sbjct: 128 QGTGLVATARSAPLFGRCSVGGQIVPDQSIFTTITMVICMTAGTCVVMWLGELITDRGIG 187

Query: 392 GIISAAMMTNLIVAGIAGVSIPILLKKMNIDPALAGGMALTTVTDVIGLSVFLG 445
+S L+ IA + P L + LAGG +GL +
Sbjct: 188 NGMSI-----LMFISIAA-TFPSALWAIKKQGTLAGGWIEFGTVIAVGLIMVAL 235


26  VC_2686  VC_2729  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
VC_2686        2       17   2.446473  conserved hypothetical protein
VC_2687        1       16   2.797078  hypothetical protein
VC_2688        0       17   2.979927  glpX protein
VC_2689       -1       15   2.646294  6-phosphofructokinase, isozyme I
VC_2690        0       18   3.084301  conserved hypothetical protein
VC_2691        0       18   3.162244  periplasmic protein cpxP, putative
VC_2692       -1       17   2.693925  transcriptional regulator CpxR
VC_2693       -1       14   1.900631  sensor protein CpxA
VC_2694       -1       18   2.004219  superoxide dismutase, Mn
VC_2695        0       16   2.307186  rRNA methylase, SpoU family
VC_2696       -1       16   2.166106  fxsA protein
VC_2697       -3       14   2.384746  GGDEF family protein
VC_2698       -2       16   2.405336  aspartate ammonia-lyase
VC_2699       -2       15   3.088449  C4-dicarboxylate transporter, anaerobic
VC_2701       -2       14   2.702655  thiol:disulfide interchange protein DsbD
VC_2702       -1       17   2.290079  transcriptional regulator, LuxR family
VC_2703       -1       17   1.765812  conserved hypothetical protein
VC_2704       -3       18   0.984079  hypothetical protein
VC_2705       -2       17   2.391612  sodium/solute symporter, putative
VC_2706        1       18   3.049730  conserved hypothetical protein
VC_2707        0       17   3.335194  hypothetical protein
VC_2708       -1       16   3.332223  guanylate kinase
VC_2709       -1       16   3.759315  DNA-directed RNA polymerase, omega subunit
VC_2710       -1       18   3.565192  guanosine-3',5'-bis(diphosphate)
VC_2711        0       17   3.986073  ATP-dependent DNA helicase RecG
VC_2712        0       16   3.815045  xanthine/uracil permease family protein
VC_2713        1       18   4.340797  osmolarity sensor protein EnvZ
VC_2714        0       19   4.884395  transcriptional regulator OmpR
VC_2715       -1       19   4.934328  transcription elongation factor GreB
VC_2716       -1       17   4.729591  conserved hypothetical protein
VC_2717        0       14   4.277465  hypothetical protein
VC_2718        0       16   4.361564  bioH protein
VC_2719        0       15   4.080127  ComF-related protein
VC_2720        1       14   3.490500  conserved hypothetical protein
VC_2721        1       19   3.649873  MutT/nudix family protein
VC_2722        0       21   4.192872  cysQ protein
VC_2723        0       24   4.052717  general secretion pathway protein N
VC_2724        0       24   3.570472  cholera toxin secretion protein EpsM
VC_2725        0       23   3.035846  general secretion pathway protein L
VC_2726       -1       25   3.632845  general secretion pathway protein K
VC_2727        0       23   3.777162  general secretion pathway protein J
VC_2728       -1       19   3.316141  general secretion pathway protein I
VC_2729        0       18   3.188102  general secretion pathway protein H
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2686  IGASERPTASE  28  0.003  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.1 bits (62), Expect = 0.003
Identities = 15/73 (20%), Positives = 28/73 (38%), Gaps = 2/73 (2%)

Query: 7 EKLEAKIQTAVDTIALLQMEVEEL--KEEKQQLQNEAQELREAREALEQRAQQVQQEHAA 64
K K T + +A E +E E K+ E +E + Q +V + +
Sbjct: 1072 AKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSP 1131

Query: 65 WQERIRSLLGKME 77
QE+ ++ + E
Sbjct: 1132 KQEQSETVQPQAE 1144


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2692  HTHFIS  95  8e-25  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 95.3 bits (237), Expect = 8e-25
Identities = 37/136 (27%), Positives = 67/136 (49%), Gaps = 2/136 (1%)

Query: 2 AHILLIDDDTELTSLLTEVLQYEGFEISQANDGEAGLAAVSDE-IDLILLDVMMPKLNGM 60
A IL+ DDD + ++L + L G+++ ++ ++ DL++ DV+MP N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 61 ETLKRLREKWA-TPVLMLTAKGEEIDRVIGLELGADDYLPKPFSDRELLARIRAILRRTQ 119
+ L R+++ PVL+++A+ + + E GA DYLPKPF EL+ I L +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 120 NGLTPKNSDVIECQDI 135
+ D + +
Sbjct: 124 RRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2701  TCRTETOQM  31  0.020  Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 30.6 bits (69), Expect = 0.020
Identities = 15/37 (40%), Positives = 19/37 (51%)

Query: 535 LSGFVLLQADVTKNQPQDIELLKALNVLGLPTIEFWN 571
L G +LL + Q Q L AL +G+PTI F N
Sbjct: 92 LDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFIN 128


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_2702   HTHFIS  74     1e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.7 bits (181), Expect = 1e-17
Identities = 37/144 (25%), Positives = 67/144 (46%), Gaps = 6/144 (4%)

Query: 1 MDSTYTIIIADDHPLFRNALFQSVHMAISGANLLEADSLDALLTLLNKESEPDLLLLDLK 60
M TI++ADD R L + ++ +G ++ + L + + DL++ D+
Sbjct: 1 MTGA-TILVADDDAAIRTVL--NQALSRAGYDVRITSNAATLWRWIAAG-DGDLVVTDVV 56

Query: 61 MPGANGMSGLIHLRSEYPDLPIVVISA-SEEPSVVSQVKSHGAFGFIPKSSDMRSLIAAL 119
MP N L ++ PDLP++V+SA + + + + GA+ ++PK D+ LI +
Sbjct: 57 MPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEK-GAYDYLPKPFDLTELIGII 115

Query: 120 NQVLNGEPYFPAHLLVDGFVGNDL 143
+ L P+ L D G L
Sbjct: 116 GRALAEPKRRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2711   SECA  34     0.002    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 34.1 bits (78), Expect = 0.002
Identities = 20/66 (30%), Positives = 29/66 (43%), Gaps = 5/66 (7%)

Query: 290 MRLVQGDV-----GSGKTLVAALAAVRAIEHGYQVALMAPTELLAEQHALNFAQWLEPMG 344
M L + + G GKTL A L A G V ++ + LA++ A N E +G
Sbjct: 92 MVLNERCIAEMRTGEGKTLTATLPAYLNALTGKGVHVVTVNDYLAQRDAENNRPLFEFLG 151

Query: 345 IQVGWL 350
+ VG
Sbjct: 152 LTVGIN 157


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_2713   PF06580  38     5e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.3 bits (89), Expect = 5e-05
Identities = 28/134 (20%), Positives = 47/134 (35%), Gaps = 36/134 (26%)

Query: 308 YEVQIETELQAGLAPALGNPIAIKRSLSNLVVNALRYG------NGWVKVSSGMTADKKL 361
+E QI + P + + LV N +++G G + + T D
Sbjct: 242 FENQINPAIMDVQVPPM--------LVQTLVENGIKHGIAQLPQGGKILLKG--TKDNGT 291

Query: 362 VWLSVEDNGPGIDPSQVNKVFEPFTRGDTARGSEGTGLGLAIVKRIVSQHHG---AVSVS 418
V L VE+ G + E TG GL V+ + +G + +S
Sbjct: 292 VTLEVENTGSLALKNT----------------KESTGTGLQNVRERLQMLYGTEAQIKLS 335

Query: 419 NRSQGGLRAQISFP 432
+ QG + A + P
Sbjct: 336 EK-QGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_2714   HTHFIS  99     5e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 98.8 bits (246), Expect = 5e-26
Identities = 40/136 (29%), Positives = 72/136 (52%), Gaps = 3/136 (2%)

Query: 6 KVLVVDDDARLRALLERYLSEQGFQVRSVANGEQMDRLLTRENFHLMVLDLMLPGEDGLS 65
+LV DDDA +R +L + LS G+ VR +N + R + + L+V D+++P E+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 66 ICRRLRNSNNMIPILMLTAKGDEIDRIVGLEVGADDYLPKPFNPRELLARIKAVL---RR 122
+ R++ + +P+L+++A+ + I E GA DYLPKPF+ EL+ I L +R
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 123 QVIEAPGAPSAEETII 138
+ + ++
Sbjct: 125 RPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_2727   PilS_PF08805  32     8e-04    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 32.2 bits (73), Expect = 8e-04
Identities = 15/54 (27%), Positives = 30/54 (55%), Gaps = 3/54 (5%)

Query: 3 RTNQVSSRQNM---AGFTLIEVLVAIAIFASLSVGAYQVLNQVQRSNEISAERT 53
+ +S+R+ G TL+EVL+ + + L+ AY++ + VQ + + S E+
Sbjct: 12 VFSSLSARRKKEQDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQN 65


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_2728   BCTERIALGSPH  30     0.001    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 29.9 bits (67), Expect = 0.001
Identities = 10/29 (34%), Positives = 18/29 (62%)

Query: 3 SKRGFTLLEVLVALAIFATAAISVIRSVS 31
+RGFTLLE+++ L + +A V+ +
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFP 30


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_2729   BCTERIALGSPH  126    3e-39    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 126 bits (317), Expect = 3e-39
Identities = 48/173 (27%), Positives = 71/173 (41%), Gaps = 43/173 (24%)

Query: 5 RGFTLLEILLVLVLVSASAVAVIATFPVSVKDEAKISAQSFYQRLLLLNEEAILSGQDFG 64
RGFTLLE++L+L+L+ SA V+ FP S D A + F +L + + + +GQ FG
Sbjct: 4 RGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQFFG 63

Query: 65 VRIDVDTRRLTFLQLTADKG--------------WQKWQNDKMTNQTTLKEG-LQLDFEL 109
V + D R FL L A G W + ++ ++ G L L F
Sbjct: 64 VSVHPD--RWQFLVLEARDGADPAPADDGWSGYRWLPLRAGRVATSGSIAGGKLNLAF-A 120

Query: 110 GGGAWQKDDRLFNPGSLFDEEMFADEKKEQKQEPAPQLFVLSSGEVTPFTLSI 162
G AW D P + + GE+TPF L++
Sbjct: 121 QGEAWTPGDN-------------------------PDVLIFPGGEMTPFRLTL 148


27  VC_0402  VC_0415  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_0402   -2      19       0.878151   MSHA biogenesis protein MshL
VC_0403   -1      19       1.100625   MSHA biogenesis protein MshM
VC_0404   -2      17       0.409731   MSHA biogenesis protein MshN
VC_0405   0       18       0.182807   MSHA biogenesis protein MshE
VC_0406   1       20       -0.690493  MSHA biogenesis protein MshG
VC_0407   3       21       -0.875888  MSHA biogenesis protein MshF
VC_0408   4       21       -0.673216  MSHA pilin protein MshB
VC_0409   3       22       -1.677878  MSHA pilin protein MshA
VC_0410   2       17       -0.795547  MSHA pilin protein MshC
VC_0411   2       16       -0.874164  MSHA pilin protein MshD
VC_0412   1       15       -0.748060  hypothetical protein
VC_0413   1       15       -0.572063  hypothetical protein
VC_0414   -1      14       -0.309087  hypothetical protein
VC_0415   -2      16       1.751826   rod shape-determining protein MreB
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0402   BCTERIALGSPD  214    7e-64    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D

signature.
Length = 660

Score = 214 bits (547), Expect = 7e-64
Identities = 80/305 (26%), Positives = 146/305 (47%), Gaps = 29/305 (9%)

Query: 241 TVIVNPQAGVLTLRAYPDEIRQVNEFLGISQQRMHR-QVILEAKILEVTLSDGYQQGINW 299
+ + Q L + A PD + + I+Q + R QV++EA I EV +DG GI W
Sbjct: 311 IIKAHGQTNALIVTAAPDVMNDLERV--IAQLDIRRPQVLVEAIIAEVQDADGLNLGIQW 368

Query: 300 SKA------FSSNGANYKIG-SGSITQDSNGNPITSVLPGLDAIGNLLGGQSNVVISSGS 352
+ F+++G +G+ + +G +S+ L + + G G+
Sbjct: 369 ANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAG-----FYQGN 423

Query: 353 FDAVISFMATQGDLNVLSSPRVTASNNQKAVIKVGTDEYYVTDLSSVVGTGDNAQASPDI 412
+ +++ +++ ++L++P + +N +A VG + +T S +GDN + +
Sbjct: 424 WAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLT--GSQTTSGDNIFNTVER 481

Query: 413 TLTPFFSGISLDVTPQIDDQGNVLLHVHPAVIEVEQQTKKILYRSEEIELPL-ARSSIRE 471
GI L V PQI++ +VLL E+EQ+ + + L A + R
Sbjct: 482 KTV----GIKLKVKPQINEGDSVLL-------EIEQEVSSVADAASSTSSDLGATFNTRT 530

Query: 472 SDSVIRAKDGDVVVIGGLMKSNTVDQVSKVPFLGDVPALGHLFRNTTKLTQKTELVILLK 531
++ + G+ VV+GGL+ + D KVP LGD+P +G LFR+T+K K L++ ++
Sbjct: 531 VNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIR 590

Query: 532 PTVVG 536
PTV+
Sbjct: 591 PTVIR 595


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0406   BCTERIALGSPF  282    5e-94    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 282 bits (723), Expect = 5e-94
Identities = 113/407 (27%), Positives = 198/407 (48%), Gaps = 9/407 (2%)

Query: 1 MATFYYQGRNADGSKASGLVEAATEELAAEMLLNKGIVP-----TSIAQGAAEKSAFDFN 55
MA ++YQ +A G K G EA + A ++L +G+VP Q + +
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 56 WKALLTPSVPLEVLVIFCRQMFSLTKAGVPLLRSMRGLAQNCHNKQLKAALDSVCNELTN 115
K L+ S L + RQ+ +L A +PL ++ +A+ L + +V +++
Sbjct: 61 RKIRLSTSD----LALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVME 116

Query: 116 GRNLSASMQLHPAIFSPLFVSMIQVGENTGRLDQALLQLAGYYEQEVETRKRIKTAMRYP 175
G +L+ +M+ P F L+ +M+ GE +G LD L +LA Y EQ + R RI+ AM YP
Sbjct: 117 GHSLADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYP 176

Query: 176 TFVITFVLLAMFILNVKVIPQFTSMFSRFGVDLPLPTRILITTSDFFVNYWGLLLGIIVG 235
+ + + IL V+P+ F LPL TR+L+ SD + +L ++
Sbjct: 177 CVLTVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLA 236

Query: 236 LLFAFRAWVNTTNGRIRWDHLRLRMPIVGDIVNRAQLSRFARTFSLMLSAGVPLNQSLAL 295
AFR + R+ + L +P++G I +R+ART S++ ++ VPL Q++ +
Sbjct: 237 GFMAFRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRI 296

Query: 296 SAEAIDNKFLEQRILEMKSQIESGVAVSATAINANIFTPLVIQMMSVGEETGRIDELLLE 355
S + + N + R+ + GV++ +F P++ M++ GE +G +D +L
Sbjct: 297 SGDVMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLER 356

Query: 356 VSDFYDREVDYDLKTLTARIEPILLVFVAAMVLVLALGIFLPMWGMM 402
+D DRE + EP+L+V +AA+VL + L I P+ +
Sbjct: 357 AADNQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLN 403


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0408   BCTERIALGSPG  42     3e-07    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 41.8 bits (98), Expect = 3e-07
Identities = 15/54 (27%), Positives = 31/54 (57%), Gaps = 6/54 (11%)

Query: 13 MKKM--QQGFSLVELVIVIVVVGLLAVAALPRFLDVTDEAK----KASIEGVAG 60
M+ Q+GF+L+E+++VIV++G+LA +P + ++A + I +
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALEN 54


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0409   BCTERIALGSPG  48     9e-10    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 48.3 bits (115), Expect = 9e-10
Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 4/56 (7%)

Query: 1 MVIMKRQGGFTLIELVVVIVILGILAVTAAPRFLNLQGDARE----ASLEGLRGAV 52
M +Q GFTL+E++VVIVI+G+LA P + + A + + + L A+
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENAL 56


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0410   BCTERIALGSPG  41     3e-07    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 41.0 bits (96), Expect = 3e-07
Identities = 11/28 (39%), Positives = 22/28 (78%)

Query: 6 KGFTLIELVVVIILLGILSAYAASRFLG 33
+GFTL+E++VVI+++G+L++ +G
Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMG 35


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0411   BCTERIALGSPG  41     4e-07    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 41.4 bits (97), Expect = 4e-07
Identities = 26/95 (27%), Positives = 43/95 (45%), Gaps = 14/95 (14%)

Query: 17 VKPMIAKRGFTLVEMIIVIVVLGVALVGVTTSLYPRSKQSAEQVLSVKAAELGRAVLD-E 75
++ +RGFTL+E+++VIV++GV V +L +++ +Q AV D
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQK----------AVSDIV 50

Query: 76 VLGRAFDQHSGPNGGLPECVITETAGRTLCSAPSA 110
L A D + N P T +L AP+
Sbjct: 51 ALENALDMYKLDNHHYPT---TNQGLESLVEAPTL 82


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0412   BCTERIALGSPH  31     0.003    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 31.1 bits (70), Expect = 0.003
Identities = 14/48 (29%), Positives = 24/48 (50%), Gaps = 7/48 (14%)

Query: 3 RGFTLIEMVITIILLGIVGLFLGNIAGQAMGIYVDTTAREALIQQGRF 50
RGFTL+EM++ ++L+G+ AG + + + A RF
Sbjct: 4 RGFTLLEMMLILLLMGVS-------AGMVLLAFPASRDDSAAQTLARF 44


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VC_0414   TYPE4SSCAGA  31     0.042    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 30.8 bits (69), Expect = 0.042
Identities = 34/139 (24%), Positives = 56/139 (40%), Gaps = 20/139 (14%)

Query: 677 NQTVTGTIKAVRKDNASQQCLPSFGNVQK----SVAFWSEYLNPTANNSGFQSVSVGVNG 732
NQ V+ KA + +Q L N K A +E LN + +QSV GVNG
Sbjct: 788 NQAVS-VAKATGDFSRVEQALADLKNFSKEQLAQQAQKNESLNARKKSEIYQSVKNGVNG 846

Query: 733 TPIGQ--SANNATSISLNFNQNGEASFPISYREVGSLALHARFTGSGDEQDLLLEGQDSF 790
T +G S AT++S NF+ + L+A+ + + L+ + +
Sbjct: 847 TLVGNGLSQAEATTLSKNFSDIKK-------------ELNAKLGNFNNNNNNGLKNEPIY 893

Query: 791 IRVPRALVLSANNPYNPTH 809
+V + A + P +
Sbjct: 894 AKVNKKKAGQAASLEEPIY 912


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0415   SHAPEPROTEIN  567    0.0      Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 567 bits (1462), Expect = 0.0
Identities = 313/347 (90%), Positives = 332/347 (95%)

Query: 1 MFKKLRGMFSNDLSIDLGTANTLIYVKGQGIVLDEPSVVAIRQDKGRGGKTVAAVGHAAK 60
M KK RGMFSNDLSIDLGTANTLIYVKGQGIVL+EPSVVAIRQD+ K+VAAVGH AK
Sbjct: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAK 60

Query: 61 QMLGRTPGNISAIRPMKDGVIADFYVTEKMLQHFIRQVHDNSVLKPSPRVLVCVPCGSTQ 120
QMLGRTPGNI+AIRPMKDGVIADF+VTEKMLQHFI+QVH NS ++PSPRVLVCVP G+TQ
Sbjct: 61 QMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQ 120

Query: 121 VERRAIRESALGAGAREVYLIDEPMAAAIGAGLRVSEPTGSMVIDIGGGTTEVAVISLNG 180
VERRAIRESA GAGAREV+LI+EPMAAAIGAGL VSE TGSMV+DIGGGTTEVAVISLNG
Sbjct: 121 VERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNG 180

Query: 181 VVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAEKIKHEIGSAYPGDDVQEIEVRGRN 240
VVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAE+IKHEIGSAYPGD+V+EIEVRGRN
Sbjct: 181 VVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRN 240

Query: 241 LAEGVPRSFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISENGMVLTGGGALL 300
LAEGVPR FTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISE GMVLTGGGALL
Sbjct: 241 LAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALL 300

Query: 301 KDLDRLLMEETGIPVVIADDPLTCVARGGGKALEMIDMHGGDLFSEE 347
++LDRLLMEETGIPVV+A+DPLTCVARGGGKALEMIDMHGGDLFSEE
Sbjct: 301 RNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347


28  VC_0628  VC_0633  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_0628   -1      10       -0.459716  conserved hypothetical protein
VC_0629   0       10       -0.799446  multidrug resistance protein, putative
VC_0630   2       18       -1.873283  conserved hypothetical protein
VC_0631   2       21       -1.856067  tyrosyl-tRNA synthetase
VC_0632   2       23       -1.610466  D-alanyl-D-alanine
VC_0633   1       27       -2.017272  outer membrane protein OmpU
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VC_0628   RTXTOXIND  49     2e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 48.7 bits (116), Expect = 2e-08
Identities = 29/189 (15%), Positives = 65/189 (34%), Gaps = 30/189 (15%)

Query: 114 DKGDLDAQLERAKAQLKVRQQEFNAASALKNKGLQGEVA--FTNAAAALTDAQSSLSTVQ 171
+++++ AK + ++ Q F + E+ + L+ +
Sbjct: 274 QLEQIESEILSAKEEYQLVTQLF-----------KNEILDKLRQTTDNIGLLTLELAKNE 322

Query: 172 RLLDNTQVRAPFDGVVETLPI-EKGDFVGIGDPVASII-DLHKLVIEADVSERHIQHVQL 229
+ +RAP V+ L + +G V + + I+ + L + A V + I + +
Sbjct: 323 ERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINV 382

Query: 230 EQAARIRF--IDGTQT---QGKVRYIS--RLSSPATNTFAI--------EVEVDNPQQAI 274
Q A I+ T+ GKV+ I+ + + N +
Sbjct: 383 GQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPL 442

Query: 275 PAGVSAEVE 283
+G++ E
Sbjct: 443 SSGMAVTAE 451



Score = 33.3 bits (76), Expect = 0.001
Identities = 15/114 (13%), Positives = 41/114 (35%), Gaps = 4/114 (3%)

Query: 65 TQTDKVIELYGRTAPDRQAKIGAEIA-GRIAEVKIAKGQMVTKNQIIALIDKGDLDAQLE 123
Q + V G+ ++K I + E+ + +G+ V K ++ + +A
Sbjct: 78 GQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTL 137

Query: 124 RAKAQL---KVRQQEFNAASALKNKGLQGEVAFTNAAAALTDAQSSLSTVQRLL 174
+ ++ L ++ Q + S E+ + ++ + + L+
Sbjct: 138 KTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLI 191


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0629   ACRIFLAVINRP  488    e-157    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 488 bits (1258), Expect = e-157
Identities = 217/1055 (20%), Positives = 443/1055 (41%), Gaps = 65/1055 (6%)

Query: 42 ALSRTRTMLTLLVFILVAGVATYLTIPKESSPDVTIPIIYVSVSHQGISPSDAERLIVRP 101
+ R L + +++AG L +P P + P + VS ++ G + + +
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64

Query: 102 LEQEMRSIEGVKEMTATA-SEGHASVVLEFNVGVDLAKAMADVRDAVDLAKPKLPADSDE 160
+EQ M I+ + M++T+ S G ++ L F G D A V++ + LA P LP + +
Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124

Query: 161 PTVNEVTLAAEQPVLSVVLYGTVPERT----TVLIARQLRDKLESFRQILSVDIAGDREE 216
++ ++ ++ P T + +A ++D L + V + G +
Sbjct: 125 QGISVEK-SSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQYA 183

Query: 217 IVEIIVDPLLMESYGLDQSDIYNLIALNNRVVAAGFVDTG------YGRFSVKVPSVFES 270
+ I +D L+ Y L D+ N + + N +AAG + S+ + F++
Sbjct: 184 -MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 271 LKDVLELPIKVEGK-QVITFGDVATVRKSFRDPESFARLDGKPAIVLDIKKRSGENIIET 329
++ ++ ++V V+ DVA V + AR++GKPA L IK +G N ++T
Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302

Query: 330 VELVKAVLGEAQARDDWPNNLLVKTIWDESEDVKLMLNDLQNNILSAIILVVVVIIAILG 389
+ +KA L E Q +P + V +D + V+L ++++ + AI+LV +V+ L
Sbjct: 303 AKAIKAKLAELQP--FFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 390 -VRTSLLVGVSIPGSFLTGLLVLAVFGLTVNIVVLFALIMAVGMLVDGAIVVTEFADRRM 448
+R +L+ +++P L +LA FG ++N + +F +++A+G+LVD AIVV E +R M
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 449 QE-GMHRSEAYRDAAKRMAWPITASTATTLAAFAPLLFWPDVTGEFMKYLPLTLMATLTA 507
E + EA + ++ + A F P+ F+ TG + +T+++ +
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 508 SLVMALLFVPVLGSLFGRPQKVTQANQARMVALHNGDFSQATGITKAYYSTLAIAIRHPI 567
S+++AL+ P L + +P + + Y +++ +
Sbjct: 481 SVLVALILTPALCATLLKPVS--AEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538

Query: 568 KILCGALLMSAAIAFAYGKAGLGAEFFPEVDPPFFSVKVRSYGDLSLNEKDRIMSDIEQV 627
+ L L+ A + + + L + F PE D F ++ + +++ +
Sbjct: 539 RYLLIYALIVAGMVVLFLR--LPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDY 596

Query: 628 MLGH--DEFESVYTRTGGD-----DEIGVVQITPVDWQYR----RSVKAIIEELEQVTDT 676
L + ESV+T G G+ ++ W+ R S +A+I +
Sbjct: 597 YLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKM---E 653

Query: 677 FPGVEIEYKFPDAGPPV-----EHDLEIEISARVADDLDKAA----QQVRLWAEANPALT 727
+ + P P + + E+ + D Q + + A+ +L
Sbjct: 654 LGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLV 713

Query: 728 NLSDNGSKPGIDWKIDIRRDDASRFAADATLVGNTVQFVTNGLKIGDYLPDDSDEEVDIL 787
++ NG + +K+++ ++ A + + T+ G + D++ +
Sbjct: 714 SVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDR--GRVKKLY 771

Query: 788 VRYPSEYR-DIGRFDQLRVKTAQG-LVPLTNFAQIIPEQKQDTIHRVDGRRVISVMADLK 845
V+ +++R D+L V++A G +VP + F + R +G + + +
Sbjct: 772 VQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAA 831

Query: 846 EGYNLALELPAIEQALRELNLPSSVEFRIRGQNEEQEHSAVFLEKAFMVALAAMAIILIT 905
G + + +E + LP+ + + G + ++ S ++ + + L
Sbjct: 832 PGTSSGDAMALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAA 889

Query: 906 QFNSFYQAFLILTAVIFSTVGVFAGLLIFQKPFGIIMSGIGVIALAGIVVNNNIVLIDTY 965
+ S+ ++ V VGV +F + + +G++ G+ N I++++
Sbjct: 890 LYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVY-FMVGLLTTIGLSAKNAILIVEFA 948

Query: 966 NQLL-KRGLSREEAILRTGVQRLRPVLLTTVTTILGLLPMVLEMNIDIINQKIEFGAPST 1024
L+ K G EA L RLRP+L+T++ ILG+LP+ I GA S
Sbjct: 949 KDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLA-----------ISNGAGSG 997

Query: 1025 QWWSQLATAVAGGLAFATVLTLVLTPCLLMLGRRR 1059
+ + V GG+ AT+L + P ++ RR
Sbjct: 998 AQNA-VGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031



Score = 79.5 bits (196), Expect = 5e-17
Identities = 90/531 (16%), Positives = 199/531 (37%), Gaps = 55/531 (10%)

Query: 561 IAIRHPIKILCGALLMSAAIAFAYGKAGLGAEFFPEVDPPFFSVKVRSYGDLSLNEKDRI 620
IR PI A+++ A A A + L +P + PP SV G + +D +
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQ--LPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 621 MSDIEQVMLGHDEFESVYTRTGGDDEIGVVQI----TPVDWQYRRSVKAIIEELEQVTDT 676
IEQ M G D + + + + + T D + + +L+ T
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQ----VQNKLQLATPL 117

Query: 677 FP-GVEIEYKFPDAGPPVEHDLEIEISARVADDLDKAAQQVRLWAEAN--PALTNLSD-- 731
P V+ + + + ++ V+D+ + + +N L+ L+
Sbjct: 118 LPQEVQQQGISVEKSSSS----YLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVG 173

Query: 732 ----NGSKPGIDWKIDIRRDDASRFAADATLVGNTVQFVTNGLKIG----DYLPDDSDEE 783
G++ + +I + D +++ V N ++ +I P ++
Sbjct: 174 DVQLFGAQYAM--RIWLDADLLNKYKLTPVDVINQLK--VQNDQIAAGQLGGTPALPGQQ 229

Query: 784 VDILVRYPSEYRDIGRFDQLRVKTAQ--GLVPLTNFAQI-IPEQKQDTIHRVDGRRVISV 840
++ + + +++ F ++ ++ +V L + A++ + + + I R++G+ +
Sbjct: 230 LNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGL 289

Query: 841 MADLKEGYNLALELPAIEQALRELN--LPSSVEFRI-RGQNEEQEHSAVFLEKAFMVALA 897
L G N AI+ L EL P ++ + S + K A+
Sbjct: 290 GIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIM 349

Query: 898 AMAIILITQFNSFYQAFLILTAVIFSTVGVFAGLLIFQKPFGIIMSGIGVIALA-GIVVN 956
+ +++ + + AV +G FA L F + I + + LA G++V+
Sbjct: 350 LVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFG--YSINTLTMFGMVLAIGLLVD 407

Query: 957 NNIVLIDT-YNQLLKRGLSREEAILRTGVQRLRPVLLTTVTTILGLLPMVLEMNIDIINQ 1015
+ IV+++ +++ L +EA ++ Q ++ + +PM
Sbjct: 408 DAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAF--------- 458

Query: 1016 KIEFGAPSTQWWSQLATAVAGGLAFATVLTLVLTP--CLLMLGRRRKGVSE 1064
FG + + Q + + +A + ++ L+LTP C +L E
Sbjct: 459 ---FGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHE 506


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VC_0632   BLACTAMASEA  30     0.024    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 29.8 bits (67), Expect = 0.024
Identities = 33/203 (16%), Positives = 68/203 (33%), Gaps = 33/203 (16%)

Query: 259 VVYRLLSQLNIELKGKIKVGKANTKQAQKIASHH---SQPLPVLLKTMLQESDNLIADTL 315
V + + +L+ KI + + ++ H + L + SDN A+ L
Sbjct: 75 AVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSDNSAANLL 134

Query: 316 TKALGHRFYSQPGSFTNGTQAIKQIFYSRTGISLEDTQLADGSGLSRNNRMRPQVMLETL 375
+G P T ++QI + T + +T+L + + P M TL
Sbjct: 135 LATVG-----GPAGLT---AFLRQIGDNVTRLDRWETELNEALPGDARDTTTPASMAATL 186

Query: 376 RYLYQHEAELGLIAMLPSAGESGTLQYRRSMRAPQISGQ-IKA----------KSGSL-Y 423
R L + Q + M +++G I++ K+G+
Sbjct: 187 RKLLTSQR----------LSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGER 236

Query: 424 GTYNMAGFVMDENQRPKTLFVQF 446
G + + N+ + + +
Sbjct: 237 GARGIVALLGPNNKAERIVVIYL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VC_0633   ECOLIPORIN  83     1e-19    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 82.7 bits (204), Expect = 1e-19
Identities = 91/397 (22%), Positives = 151/397 (38%), Gaps = 70/397 (17%)

Query: 10 MNKTLIALAVSAAAVATGAYADGINQSGDKAGSTVYSAKGTSLEVGGRAEAR--LSLKDG 67
M + ++AL + A A A+A + +Y+ G L++ G+ + S
Sbjct: 1 MKRKVLALVIPALLAAGAAHA-----------AEIYNKDGNKLDLYGKVDGLHYFSDDSS 49

Query: 68 KAQDNSRVRLNFLGKAEINDSLYGVGFYEGEFTTNDQGKNASNNSLDNRYTYAGIG-GTY 126
K D + +R+ F G+ +IND L G G +E N +N+ R +AG+ G Y
Sbjct: 50 KDGDQTYMRVGFKGETQINDQLTGYGQWEYNVQANTTEGEGANSW--TRLAFAGLKFGDY 107

Query: 127 GEVTYGKNDGALGVITDFTDIMSYHG--NTAAEKIAVADRVDNMLAYKGQ--FG-----D 177
G YG+N G L + +TD++ G + + R + + Y+ FG +
Sbjct: 108 GSFDYGRNYGVLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLN 167

Query: 178 LGVKASYRFADRNAVDAMGNVVTETNAAKYSDNGEDGYSLSAIYTFGDTGFNVGAGYADQ 237
++ + ++A D N + DG+ +S Y G GF+ GA Y
Sbjct: 168 FALQYQGKNESQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIG-MGFSAGAAYTTS 226

Query: 238 DDQNE----------------YMLAASYRMENLYFAGLFTDGELAKDVDYTGYELAAGYK 281
D NE + Y N+Y A ++++ T G
Sbjct: 227 DRTNEQVNAGGTIAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVA 286

Query: 282 LGQAAFTAT------------------------YNNAETAKETSADNFAIDATYYFKPNF 317
F T YNN + + ATYYF NF
Sbjct: 287 NKTQNFEVTAQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNF 346

Query: 318 RSYISYQFNLLDSD----KVGKVASEDELAIGLRYDF 350
+Y+ Y+ NLLD D K ++++D +A+G+ Y F
Sbjct: 347 STYVDYKINLLDDDDPFYKDAGISTDDIVALGMVYQF 383


29  VC_0852  VC_0864  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_0852   1       17       0.205010   DNA repair protein RecN
VC_0853   1       19       -0.278164  conserved hypothetical protein
VC_0854   1       20       -0.955550  heat shock protein GrpE
VC_0855   -2      20       -2.997401  dnaK protein
VC_0856   -2      20       -4.346609  dnaJ protein
VC_0857   0       29       -7.313440  fimbrial assembly protein PilE, putative
VC_0858   0       24       -5.104517  type IV pilin, putative
VC_0859   -1      22       -4.363604  hypothetical protein
VC_0860   -2      21       -3.855002  hypothetical protein
VC_0861   -1      13       -0.950606  type IV pilin, putative
VC_0862   -2      14       -0.910688  hypothetical protein
VC_0863   -1      14       -0.627503  conserved hypothetical protein
VC_0864   -1      11       2.121446   yfhC protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_0852   PF06580  33     0.003    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.9 bits (75), Expect = 0.003
Identities = 12/34 (35%), Positives = 21/34 (61%), Gaps = 2/34 (5%)

Query: 161 HAYQNWRQASNQLKQLRENSQQNQAQLQLLEYQI 194
H ++N++QA ++ Q + S +AQL L+ QI
Sbjct: 139 HFFKNYKQA--EIDQWKMASMAQEAQLMALKAQI 170


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0855   SHAPEPROTEIN  141    4e-39    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 141 bits (356), Expect = 4e-39
Identities = 83/385 (21%), Positives = 149/385 (38%), Gaps = 81/385 (21%)

Query: 5 IGIDLGTTNSCVAVLDG----DKPRVIE-NAEGERTTPSVIAYTDGETLVGQPAKRQAVT 59
+ IDLGT N+ + V ++P V+ + + SV A VG AK+
Sbjct: 13 LSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAA-------VGHDAKQMLGR 65

Query: 60 NPQNTLFAIKRLIGRRFEDEEVQRDIKIMPYKIVKADNGDAWVEAKGQKMAAPQVSAEVL 119
P N + AI+ + D V + ++L
Sbjct: 66 TPGN-IAAIRPMKDGVIADFFV---------------------------------TEKML 91

Query: 120 KK-MKKTAEDFLGEPVTAAVITVPAYFNDAQRQATKDAGRIAGLEVKRIINEPTAAALAY 178
+ +K+ + P ++ VP +R+A +++ + AG +I EP AAA+
Sbjct: 92 QHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGA 151

Query: 179 GLDKQGGDRTIAVYDLGGGTFDISIIEIDEVEGEKTFEVLSTNGDTHLGGEDFDNRMINY 238
GL V D+GGGT ++++I ++ V + +GG+ FD +INY
Sbjct: 152 GLPVS-EATGSMVVDIGGGTTEVAVISLNGV---------VYSSSVRIGGDRFDEAIINY 201

Query: 239 LVDEFKKDQGIDLRNDPLAMQRLKEAAEKAKIELSSA----QQTDVNLPYITADATGPKH 294
+ + G + AE+ K E+ SA + ++ + P+
Sbjct: 202 VRRNYGSLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRG 248

Query: 295 MNIKVTRAKLEALVEDLVQRSLEPLKVALADA--DLSVNDITD--VILVGGQTRMPMVQK 350
+ + LEAL E L + + VAL +L+ +DI++ ++L GG + + +
Sbjct: 249 FTLN-SNEILEALQEPLTG-IVSAVMVALEQCPPELA-SDISERGMVLTGGGALLRNLDR 305

Query: 351 KVAEFFGKEPRKDVNPDEAVAVGAA 375
+ E G +P VA G
Sbjct: 306 LLMEETGIPVVVAEDPLTCVARGGG 330


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0857   BCTERIALGSPG  48     4e-10    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 48.3 bits (115), Expect = 4e-10
Identities = 17/52 (32%), Positives = 36/52 (69%)

Query: 15 QQQGMTLIELMIAVVIVGVLASIAYPAYTNYVKEGHRKQAMADMAKIQLYLE 66
+Q+G TL+E+M+ +VI+GVLAS+ P ++ +++A++D+ ++ L+
Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALD 57


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0858   BCTERIALGSPH  35     5e-05    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 35.3 bits (81), Expect = 5e-05
Identities = 16/68 (23%), Positives = 26/68 (38%), Gaps = 7/68 (10%)

Query: 17 RGFTLLELLIT---VAVLTTMLLFAAPNFSKVSQQTKMTNLANELQGFLIQAKSEAVFRN 73
RGFTLLE+++ + V M+L A P S+ + L + +
Sbjct: 4 RGFTLLEMMLILLLMGVSAGMVLLAFP----ASRDDSAAQTLARFEAQLRFVQQRGLQTG 59

Query: 74 QDLWVHIQ 81
Q V +
Sbjct: 60 QFFGVSVH 67


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0859   BCTERIALGSPG  30     0.004    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 29.9 bits (67), Expect = 0.004
Identities = 21/96 (21%), Positives = 43/96 (44%), Gaps = 1/96 (1%)

Query: 8 KSANQQGNTLIEFMVAALVGAMALAIVGTVFLSNQKAAAQRSKEIMLLQQVSSVMQQMKE 67
+ Q+G TL+E MV ++ + ++V + N K A + K + + + + + K
Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGN-KEKADKQKAVSDIVALENALDMYKL 61

Query: 68 DIQRAGFDDVGNQSMRLSGAVGVIYRASNKIGYVYR 103
D + G +S+ + + + NK GY+ R
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKR 97


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0861   BCTERIALGSPG  37     4e-06    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 37.2 bits (86), Expect = 4e-06
Identities = 22/60 (36%), Positives = 33/60 (55%), Gaps = 7/60 (11%)

Query: 3 NKQSGFSLIEVMISFVLIGVGALGLV--KLQAYIEQ-RADYAMHSIEALNLAEQKLEWFR 59
+KQ GF+L+E+M+ V+IGV A LV L E+ A+ I AL E L+ ++
Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLA-SLVVPNLMGNKEKADKQKAVSDIVAL---ENALDMYK 60


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_0864   cloacin  30     0.004    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.5 bits (68), Expect = 0.004
Identities = 22/65 (33%), Positives = 32/65 (49%), Gaps = 6/65 (9%)

Query: 128 DLKAGAAGTVMDLFSSQAAYHYATVEKGLLEEECRAQLQAFFQRRRKEIKAKRDAERKNA 187
LKA A T D+ + QAA+ A EK + A L + + R+K+ KR AE
Sbjct: 394 GLKAQRAQT--DVNNKQAAFDAAAKEK----SDADAALSSAMESRKKKEDKKRSAENNLN 447

Query: 188 ERKNE 192
+ KN+
Sbjct: 448 DEKNK 452


30  VC_0965  VC_0976  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_0965   2       20       -1.980681  phosphoenolpyruvate-protein phosphotransferase
VC_0966   1       14       -1.179920  phosphocarrier protein HPr
VC_0967   -1      17       -1.504798  hypothetical protein
VC_0968   -1      17       -2.083754  cysteine synthase A
VC_0969   -1      14       -1.870437  cysZ protein
VC_0970   -2      15       -1.801877  cell division protein ZipA
VC_0971   -1      18       -2.215164  DNA ligase
VC_0972   0       20       -3.222211  porin, putative
VC_0973   0       21       -3.503063  hypothetical protein
VC_0974   0       19       -1.659634  transcriptional regulator, MerR family
VC_0975   1       19       -1.683481  conserved hypothetical protein
VC_0976   0       19       -1.693964  conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0965   PHPHTRNFRASE  762    0.0      Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase

signature.
Length = 572

Score = 762 bits (1969), Expect = 0.0
Identities = 287/568 (50%), Positives = 408/568 (71%), Gaps = 2/568 (0%)

Query: 1 MISGILASPGIAIGKALLLQEDEIVLNTNTISDDQVEAEVERFYTARDKSSAQLEVIKQK 60
I+GI AS G+AI KA + E + + +I+D V E+E+ A +KS +L IK +
Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSITD--VSTEIEKLTAALEKSKEELRAIKDQ 61

Query: 61 ALETFGEEKEAIFEGHIMLLEDEELEEEILALIKKEKMSADNAIHTVIEEQATALESLDD 120
+ G +K IF H+++L+D EL + I I+ E+M+A+ A+ V + + ES+D+
Sbjct: 62 TEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDN 121

Query: 121 EYLKERATDIRDIGTRFVKNALGMHIVSLSEIDQEVVLVAYDLTPSETAQINLNYVLGFA 180
EY+KERA DIRD+ R + + +G+ SL+ I +E V++A DLTPS+TAQ+N +V GFA
Sbjct: 122 EYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFA 181

Query: 181 CDIGGRTSHTSIMARSLELPAIVGTNDITKKVKNGDTLILDAMNNKIIVNPTQAQIEEAK 240
DIGGRTSH++IM+RSLE+PA+VGT ++T+K+++GD +I+D + +IVNPT+ +++ +
Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241

Query: 241 AVKAAFLAEKEELAKLKDLPAETLDGHRVEVCGNIGTVKDCDGIIRNGGEGVGLYRTEFL 300
+AAF +K+E AKL P+ T DG VE+ NIGT KD DG++ NGGEG+GLYRTEFL
Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301

Query: 301 FMDRDALPTEEEQYQAYKEVAEAMNGQAVIIRTMDIGGDKDLPYMDLPKEMNPFLGWRAV 360
+MDRD LPTEEEQ++AYKEV + M+G+ V+IRT+DIGGDK+L Y+ LPKE+NPFLG+RA+
Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361

Query: 361 RISLDRREILRDQLRGILRASAHGKLRVMFPMIISVEEIRELKNAIEEYKAELRTEGHAF 420
R+ L++++I R QLR +LRAS +G L+VMFPMI ++EE+R+ K ++E K +L +EG
Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDV 421

Query: 421 DENIEIGVMVETPAAAAIAHHLAKEVSFFSIGTNDLTQYTLAVDRGNEMISHLYNPLSPA 480
++IE+G+MVE P+ A A+ AKEV FFSIGTNDL QYT+A DR NE +S+LY P PA
Sbjct: 422 SDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPA 481

Query: 481 VLTVIKQVIDASHAEGKWTGMCGELAGDERATLLLLGMGLDEFSMSGISIPKVKKVIRNA 540
+L ++ VI A+H+EGKW GMCGE+AGDE A LLLG+GLDEFSMS SI + +
Sbjct: 482 ILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKL 541

Query: 541 NYAEIKAMAEEALALPTAAEIEACVDKF 568
+ E+K A++AL L TA E+E V K
Sbjct: 542 SKEELKPFAQKALMLDTAEEVEQLVKKT 569


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0969   ACRIFLAVINRP  29     0.018    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.0 bits (65), Expect = 0.018
Identities = 11/39 (28%), Positives = 18/39 (46%), Gaps = 3/39 (7%)

Query: 208 FGMLVAFFTS--IPIVNLFIVPVAVCGA-TAMWVMEFKI 243
F L A + S IP+ + +VP+ + G A + K
Sbjct: 884 FLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKN 922


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VC_0970   TONBPROTEIN  32     0.002    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 31.9 bits (72), Expect = 0.002
Identities = 11/63 (17%), Positives = 23/63 (36%)

Query: 92 ELDEEEDEEARIPVQPQSQPQPRKVQPQVEMPRVAPNVPMAKVQPEVVTEIEVQEPQEEK 151
D E + + P +P +P+P + K +P+ + + ++ K
Sbjct: 52 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPK 111

Query: 152 LDV 154
DV
Sbjct: 112 RDV 114


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_0972   ECOLNEIPORIN  59     8e-12    E.coli/Neisseria porin superfamily signature.
		>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature.

Length = 331

Score = 58.7 bits (142), Expect = 8e-12
Identities = 38/143 (26%), Positives = 66/143 (46%), Gaps = 5/143 (3%)

Query: 50 YQEDSNGYDYENESRIGFRASKDMFDNVNVFMQIESGYVGEDGKGSTLGARDTFLGLQGD 109
++ + S+IGF+ +D+ + + Q+E G S G R +F+GL+G
Sbjct: 45 ASVETGTGIVDLGSKIGFKGQEDLGNGLKAIWQVEQK-ASIAGTDSGWGNRQSFIGLKGG 103

Query: 110 WGKVRFGRMLTPLYEIVDWPYSNPGLGRVFDWGGDVAGHYDRKGDIARYDSPAFGGLTFN 169
+GK+R GR+ + L D NP + G + + + RYDSP F GL+ +
Sbjct: 104 FGKLRVGRLNSVLK---DTGDINPWDSKSDYLGVNKIAEPEARLISVRYDSPEFAGLSGS 160

Query: 170 IS-AGRGDVGTKSSNHFGAAAHY 191
+ A + G +S + A +Y
Sbjct: 161 VQYALNDNAGRHNSESYHAGFNY 183


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VC_0976   IGASERPTASE  33     0.001    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.5 bits (76), Expect = 0.001
Identities = 16/81 (19%), Positives = 29/81 (35%), Gaps = 2/81 (2%)

Query: 166 VQPPADLTAAMNAQMKAERNKRAEVLEAEGVRQAQILRAEGQKQSEILKAEGEKQAAILQ 225
V PPA T + + AE +K+ + + Q + +A+ +A
Sbjct: 1025 VPPPAPATPSETTETVAENSKQES--KTVEKNEQDATETTAQNREVAKEAKSNVKANTQT 1082

Query: 226 AEARERAAEAEAKATTMVSEA 246
E + +E + TT E
Sbjct: 1083 NEVAQSGSETKETQTTETKET 1103


31  VC_1081  VC_1089  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_1081   0       16       -2.691522  response regulator
VC_1082   1       16       -2.731956  response regulator
VC_1083   1       14       -2.852794  hypothetical protein
VC_1084   0       15       -2.622524  sensory box sensor histidine kinase
VC_1085   0       15       -2.725452  sensor histidine kinase
VC_1086   0       10       -2.767083  response regulator
VC_1087   1       10       -2.440929  response regulator
VC_1088   1       13       -1.906405  sensor histidine kinase
VC_1089   0       15       -0.975105  periplasmic binding protein-related protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_1081   HTHFIS  55     2e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 54.8 bits (132), Expect = 2e-10
Identities = 23/108 (21%), Positives = 49/108 (45%), Gaps = 12/108 (11%)

Query: 7 KVVCVDDDDFMLKALGRMIRRMRPDWEIELVEDANH-WVVDVEHAPSVVISDLLMPGKNG 65
++ DDD + L + + R +++ + +A W +V++D++MP +N
Sbjct: 5 TILVADDDAAIRTVLNQALSRA--GYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 66 EALLTELRARCPETMRVLLTGDTT-----QELPRKAHTYAQFVLPKPF 108
LL ++ P+ ++++ T + + A+ Y LPKPF
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDY----LPKPF 106


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_1082   HTHFIS  58     4e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 57.9 bits (140), Expect = 4e-13
Identities = 26/117 (22%), Positives = 48/117 (41%), Gaps = 1/117 (0%)

Query: 2 AIRLTVADDSKMSRKSVIRAIPAGWDVEITEAQNGKEAVENYNNGLADVMFLDLTMPEMD 61
+ VADD R + +A+ + ++ N G D++ D+ MP+ +
Sbjct: 3 GATILVADDDAAIRTVLNQAL-SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GIQVLEHFHRIDAKCLVIVISADIQPLVQQRVRELGALNFLQKPLDPAQLEQTLHEA 118
+L + V+V+SA + + E GA ++L KP D +L + A
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_1086   HTHFIS  89     5e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.1 bits (221), Expect = 5e-21
Identities = 32/129 (24%), Positives = 56/129 (43%), Gaps = 2/129 (1%)

Query: 1 MQSNREILLVDDEPAVLNALKRELRPYFPHLHTATCAQEALELLKQHPVQMVISDYRMPG 60
M IL+ DD+ A+ L + L + + A + +V++D MP
Sbjct: 1 MTGAT-ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 61 MNGADLVIEINRRYPHIMSLILSGQADMNGLSRALNEGDLYKFLLKPWERNYLLQTILQC 120
N DL+ I + P + L++S Q +A +G Y +L KP++ L+ I +
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKG-AYDYLPKPFDLTELIGIIGRA 118

Query: 121 FSEKERREE 129
+E +RR
Sbjct: 119 LAEPKRRPS 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_1087   HTHFIS  97     6e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 97.2 bits (242), Expect = 6e-24
Identities = 31/124 (25%), Positives = 61/124 (49%), Gaps = 2/124 (1%)

Query: 13 VLLLDDENDILKALNRVL-RMDYNVVTFDNGADALEYLQENPIHIIISDMRMPEMDGADF 71
+L+ DD+ I LN+ L R Y+V N A ++ ++++D+ MP+ + D
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 72 LAKAREMQPDTVRLLLTGYADIQSTVRAVNAGGIHTYISKPWDNENLKLIVGKAAEFYRL 131
L + ++ +PD L+++ + ++A G + Y+ KP+D L I+G+A +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEK-GAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 132 SRDK 135
K
Sbjct: 125 RPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_1089   TYPE3IMSPROT  30     0.011    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 29.7 bits (67), Expect = 0.011
Identities = 11/52 (21%), Positives = 22/52 (42%), Gaps = 5/52 (9%)

Query: 74 DLVYMNPYHYT---FFHQQ--PGYVALAKQKDQKLQGIIVVAQENPIKSIQD 120
+V NP H + + P + K D ++Q + +A+E + +Q
Sbjct: 258 SVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQR 309


32  VC_1311  VC_1318  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_1311   -2      10       0.749775   conserved hypothetical protein
VC_1312   -2      10       0.489024   alanine racemase, putative
VC_1313   -2      9        0.129479   methyl-accepting chemotaxis protein
VC_1314   -2      12       -0.289011  transporter, putative
VC_1315   -2      13       -0.211867  sensor histidine kinase
VC_1316   0       10       -0.332894  chemotaxis protein CheY, putative
VC_1317   0       10       -0.459092  conserved hypothetical protein
VC_1318   1       12       -0.680105  outer membrane protein OmpV
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_1311   VACCYTOTOXIN  30     0.016    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 29.6 bits (66), Expect = 0.016
Identities = 16/45 (35%), Positives = 23/45 (51%), Gaps = 3/45 (6%)

Query: 139 GVDALDPIGVALALIAGGFWA-GYIWFGQRAGSVGSGGMTVSIGM 182
GVDA + + I GGF + GY F +A S+ SG + G+
Sbjct: 1049 GVDAY--LNGEVEAIVGGFGSYGYSSFSNQANSLNSGANNTNFGV 1091


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VC_1312   ALARACEMASE  190    7e-59    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 190 bits (484), Expect = 7e-59
Identities = 81/378 (21%), Positives = 158/378 (41%), Gaps = 40/378 (10%)

Query: 64 SWLEISLGQFQSNIEQFKSHMNANTKICAIMKADAYGNGIRGLMPTIIAQGIPCVGVASN 123
+ L + N+ + + ++ +++KA+AYG+GI + I A + +
Sbjct: 5 IQASLDLQALKQNLSIVRQAA-THARVWSVVKANAYGHGIERIWSAIGATD--GFALLNL 61

Query: 124 AEARAVRESGFKGELIRVRSA-SLSEMSSALDLNIEELIGTHQQALDLAELAKQSGKTLK 182
EA +RE G+KG ++ + ++ + + ++ Q L + A+ L
Sbjct: 62 EEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQ-LKALQNARL-KAPLD 119

Query: 183 VHIALNDGGMGRNGIDMTTEAGKKEAVSIATQ----PSLSVVGIMTHFPNY-NADEVRAK 237
+++ + + GM R G +++ Q ++ + +M+HF + D +
Sbjct: 120 IYLKV-NSGMNRLGF------QPDRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGISGA 172

Query: 238 LAQFKESSTWLMQQANLKREEITLHVANSYTALNVPEAQLDMVRPGGVLFGDLPTNPEYP 297
+A+ ++++ L + ++NS L PEA D VRPG +L+G P + ++
Sbjct: 173 MARIEQAAEGLECR---------RSLSNSAATLWHPEAHFDWVRPGIILYGASP-SGQWR 222

Query: 298 SIVSF--------KTRVSSLHHLPKDSTVGYDSTFTTSRDSVLANLPVGYSDGYPRKMGN 349
I + + + + L VGY +T + + + GY+DGYPR
Sbjct: 223 DIANTGLRPVMTLSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPT 282

Query: 350 KAEVLINGQRAKVVGVTSMNTTVVDVTEIKGVLPGQEVVLFGQQQKQSIAVSEMENNAEL 409
VL++G R VG SM+ VD+T G V L+G++ I + ++ A
Sbjct: 283 GTPVLVDGVRTMTVGTVSMDMLAVDLTPCPQAGIGTPVELWGKE----IKIDDVAAAAGT 338

Query: 410 IFPELYTLWGTSNPRFYV 427
+ EL P V
Sbjct: 339 VGYELMCALALRVPVVTV 356


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_1316   HTHFIS  62     3e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 62.2 bits (151), Expect = 3e-14
Identities = 26/140 (18%), Positives = 47/140 (33%), Gaps = 16/140 (11%)

Query: 1 MEKLNIICVDDQ---REVLSAVLQDLEPLSRWINIEDCESADEALELMDDLDAQGEWVAV 57
M I+ DD R VL+ L ++ +A + +
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAG-----YDVRITSNAATLWRWIAA-----GDGDL 50

Query: 58 VISDHVMPGKSGVELLSEISADPRFIHTKKVLLTGQATHTDTINAINTAGIHHYFDKPWS 117
V++D VMP ++ +LL I ++++ Q T I A G + Y KP+
Sbjct: 51 VVTDVVMPDENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASE-KGAYDYLPKPFD 107

Query: 118 AKILVDCVRSLVTHYVFDQR 137
L+ + +
Sbjct: 108 LTELIGIIGRALAEPKRRPS 127


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VC_1318   ECOLIPORIN  30     0.007    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 30.3 bits (68), Expect = 0.007
Identities = 36/109 (33%), Positives = 52/109 (47%), Gaps = 13/109 (11%)

Query: 1 MKK--IALFITASLIAGNALAAQTYIRNGNIYTHEGQWAAEVGAFGSTDLLKDQDKSYGA 58
MK+ +AL I A L AG A AA+ Y ++GN G+ + S D KD D++Y
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGL--HYFSDDSSKDGDQTY-- 56

Query: 59 LLNFGYHGE-DFNADLSGL---NYRFFGNT--GDIVNLGTYLTGSGVAY 101
+ G+ GE N L+G Y NT G+ N T L +G+ +
Sbjct: 57 -MRVGFKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKF 104


33  VC_1391  VC_1411  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_1391   -3      15       0.035149   multidrug transporter, putative
VC_1392   0       17       -1.189751  deoxyribodipyrimidine photolyase, putative
VC_1393   1       19       -4.014276  sugE protein
VC_1394   2       19       -4.162151  methyl-accepting chemotaxis protein
VC_1396   0       20       -2.946260  hypothetical protein
VC_1397   1       19       -3.038518  chemotaxis protein CheA
VC_1398   0       20       -3.442485  chemotaxis protein CheY
VC_1399   -1      20       -3.748288  chemotaxis protein methyltransferase CheR
VC_1400   1       20       -3.218057  hypothetical protein
VC_1401   0       20       -3.326279  protein-glutamate methylesterase CheB
VC_1402   -1      17       -3.232744  purine-binding chemotaxis protein CheW,
VC_1403   1       18       -3.211288  methyl-accepting chemotaxis protein
VC_1404   0       19       -2.280853  hypothetical protein
VC_1405   0       17       -1.385021  methyl-accepting chemotaxis protein
VC_1406   0       18       -0.891092  methyl-accepting chemotaxis protein
VC_1407   0       17       -0.324422  ATP-dependent RNA helicase RhlE
VC_1408   -1      16       -1.350366  transcriptional regulator, TetR family
VC_1409   -1      13       -0.855592  multidrug resistance protein, putative
VC_1410   -2      9        -1.005788  multidrug resistance protein VceA
VC_1411   -2      8        -0.478681  multidrug resistance protein VceB
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_1391   TCRTETB  121    5e-32    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 121 bits (304), Expect = 5e-32
Identities = 98/419 (23%), Positives = 169/419 (40%), Gaps = 21/419 (5%)

Query: 27 LGSLEKSIVTTPLALIGQDLSA-GTALTWVITAYLLAATAVLPVYGKLSDLFGRVRMLNI 85
L + ++ L I D + + WV TA++L + VYGKLSD G R+L
Sbjct: 25 FSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLF 84

Query: 86 SIGIFIVGSAMCTFA-VDLPTLIGARVVQGIGGGGLIALAFTVIADSIPAREVGKYQGYI 144
I I GS + LI AR +QG G AL V+A IP GK G I
Sbjct: 85 GIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144

Query: 145 SAVYAVSSVAGPLLGGYFADHLSWRWVFGINLPLGMVALYMVNRHLRHLNQKRHSRFDWL 204
++ A+ GP +GG A ++ W ++ +P+ + L + FD
Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVRIKGHFDIK 202

Query: 205 GAGLLMLTTTLLLLQLSSHSFLPAGWGAFALLLCLVLLILVERQ--VSDPILPARLARLP 262
G L+ + +L +S+S F ++ L LI V+ V+DP + L +
Sbjct: 203 GIILMSVGIVFFMLFTTSYSIS------FLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNI 256

Query: 263 SYLTAIGLIMASQMLMFALLVYMPLQLQWQKGFSPSQSGT-VMVIFMFSITTGAYLGGKW 321
++ + + + +P ++ S ++ G+ ++ S+ Y+GG
Sbjct: 257 PFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGIL 316

Query: 322 VARSGRYK------ALVVSGFLLAAVAIWQIHYDLWVHLSLGIGGLGLGFTLPSLNVVVQ 375
V R G + FL A+ + ++ + + GL FT ++ +V
Sbjct: 317 VDRRGPLYVLNIGVTFLSVSFLTASFLLETTS--WFMTIIIVFVLGGLSFTKTVISTIVS 374

Query: 376 SVLPARDRGIGMSLFNFGRELGGALGVAFCSALFYLRVPQSVTVSEHGSQASSVTPDVL 434
S L ++ G GMSL NF L G+A L + + + Q++ + ++L
Sbjct: 375 SSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYLYSNLL 433


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VC_1392   TACYTOLYSIN  30     0.019    Bacterial thiol-activated pore-forming cytolysin sig...
		>TACYTOLYSIN#Bacterial thiol-activated pore-forming cytolysin signature.

Length = 574

Score = 30.3 bits (68), Expect = 0.019
Identities = 9/29 (31%), Positives = 16/29 (55%)

Query: 120 QHDIVWHEFPYAAVIRGAQTRKNWDEHWQ 148
Q++I+W E Y + T++ WD +W
Sbjct: 479 QYEILWDEINYDDKGKEVITKRRWDNNWY 507


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_1397   PF06580  33     0.005    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.9 bits (75), Expect = 0.005
Identities = 11/48 (22%), Positives = 19/48 (39%), Gaps = 8/48 (16%)

Query: 490 LIRNALDHGIESPDIRREQGKNPTGKITLSAFTLDDSVIIKMSDDGKG 537
L+ N + HGI GKI L + +V +++ + G
Sbjct: 263 LVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEVENTGSL 302


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_1398   HTHFIS  68     1e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 68.3 bits (167), Expect = 1e-16
Identities = 25/113 (22%), Positives = 45/113 (39%), Gaps = 2/113 (1%)

Query: 4 KVMVVDDASTVRMYHKALLEEIGIFILEASNGVEALERALEMPVDLFLVDINMPKMDGFT 63
++V DD + +R L G + SN DL + D+ MP + F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 LVREIRCRPELAGIPTVMISTESQESDRQQGIHMGANLYMVKPVNPEELQQTV 116
L+ I+ +P +++S ++ + GA Y+ KP + EL +
Sbjct: 65 LLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_1401   HTHFIS  72     7e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 71.8 bits (176), Expect = 7e-16
Identities = 27/104 (25%), Positives = 52/104 (50%), Gaps = 4/104 (3%)

Query: 2 KILVVDDSALMRTTISDILQNIPNAEIKTARDGMDAIDKVMKWQPDVMTLDINMPNMDGL 61
ILV DD A +RT ++ L + +++ + + D++ D+ MP+ +
Sbjct: 5 TILVADDDAAIRTVLNQAL-SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 TCLTQIMVERP-LPIVMLSSLTHEGAITTLEALYLGAVDFVAKP 104
L +I RP LP++++S+ +T ++A GA D++ KP
Sbjct: 64 DLLPRIKKARPDLPVLVMSA--QNTFMTAIKASEKGAYDYLPKP 105


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_1402   BINARYTOXINB  30     0.027    Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 30.0 bits (67), Expect = 0.027
Identities = 15/96 (15%), Positives = 35/96 (36%), Gaps = 7/96 (7%)

Query: 113 DPEMIKPTNTAGGKIDPTLITNTIHSDNQIIQVLNCHRLLDNSVEEELELSTQRFNHLSS 172
P ++ + T I + + I S+NQ Q + +E +T NH++
Sbjct: 60 APMVVTSSTTGDLSIPSSEL-ENIPSENQYFQSAIWSGFIKVKKSDEYTFATSADNHVTM 118

Query: 173 QSHFGDVIDEERDDDEDMRQLVCCMVDGQEYAFPLE 208
+D++ ++ + G+ Y ++
Sbjct: 119 W------VDDQEVINKASNSNKIRLEKGRLYQIKIQ 148


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VC_1405   FRAGILYSIN  30     0.024    Fragilysin metallopeptidase (M10C) enterotoxin signat...
		>FRAGILYSIN#Fragilysin metallopeptidase (M10C) enterotoxin signature.

Length = 405

Score = 30.0 bits (67), Expect = 0.024
Identities = 15/55 (27%), Positives = 29/55 (52%), Gaps = 6/55 (10%)

Query: 512 ITAAKKQLDSINLALLELTSANTQVAAASEEQSVAADEISHNMTDIRDAGETIML 566
+ A + DS+ TS + V A+ + QSV+ +++ + D+ D G+ I+L
Sbjct: 24 LAACSNEADSLT------TSIDAPVTASIDLQSVSYTDLATQLNDVSDFGKMIIL 72


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_1406   BACSURFANTGN  29     0.049    Yersinia/Haemophilus virulence surface antigen sign...
		>BACSURFANTGN#Yersinia/Haemophilus virulence surface antigen signature.

Length = 322

Score = 28.5 bits (63), Expect = 0.049
Identities = 21/108 (19%), Positives = 42/108 (38%), Gaps = 26/108 (24%)

Query: 254 ASDVTDQVLKAHATKEASQMA----QQTSAETVRVAESGREMIDAAATIASGITESIAGA 309
++ +T+ V+ AH K + ++ Q+ + T++ +SGR M+D
Sbjct: 23 SATLTEGVIGAHRVKVETALSHSNLQKKLSATIKHNQSGRSMLD---------------- 66

Query: 310 NALMTDLSSQSQRITQIVTTINKIAEQTNLLALNAAIEAARAG--EYG 355
L+S + + T + I + L+ + A R YG
Sbjct: 67 ----RKLTSDGKANQRSSFTFSMIMYRMIHFVLSTRVPAVRESVANYG 110


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_1408   HTHTETR  62     3e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.3 bits (151), Expect = 3e-14
Identities = 36/169 (21%), Positives = 71/169 (42%), Gaps = 8/169 (4%)

Query: 1 MRVKSEEKRQAILEIAKDSFTKQGFEQTSMSHIAKMLGGSKATLYNYFSSKEEIFTAVME 60
+ +++E RQ IL++A F++QG TS+ IAK G ++ +Y +F K ++F+ + E
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 SSATEQIALSFLSLDH-NRELKPALLEFGYNFLNSVLTQDAM-----SIYRMAIHEADRS 114
S + L + L E + L S +T++ I+ + +
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMA 124

Query: 115 AIGRHFYENGPKRGWARLSRYITCQIECGSLKE-CDPWIAAMHFKALLS 162
+ + N + R+ + + IE L AA+ + +S
Sbjct: 125 VVQQA-QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYIS 172


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VC_1410   RTXTOXIND  111    4e-29    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 111 bits (280), Expect = 4e-29
Identities = 70/405 (17%), Positives = 136/405 (33%), Gaps = 65/405 (16%)

Query: 37 LAAAIVVAGGSYALYWHFIG---SRYISTDNAYAAAEIAEVTPAVGGIIAQVNVVDTEYV 93
L A ++ A +G + + E+ P I+ ++ V + E V
Sbjct: 59 LVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESV 118

Query: 94 KQGDVLVQLDDTDARLALLQAEADLALAKRRVRSYLANDEGLS----------------- 136
++GDVL++L A L+ ++ L A+ Y +
Sbjct: 119 RKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQN 178

Query: 137 ----------AMVEAQEANEQRVKAQLKAA------------------QADFERAKIDLS 168
++++ Q + Q K Q + + K L
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLD 238

Query: 169 RREDLVRSGSVSGEELTNAKTGFAQAQANLNAAKAAMAQAQATKLSTIGSQKANAALTDN 228
L+ +++ + + + +A L K+ + Q ++ LS + L N
Sbjct: 239 DFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN 298

Query: 229 TTVDSNPEVLL----AKARYEQAKIDLERTVIRAPISGIVAKRQVQ-VGRRVQVGMPLMT 283
+D + + + + +VIRAP+S V + +V G V LM
Sbjct: 299 EILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358

Query: 284 VVPTDHIY-VDANFKEVELRDVKVGQPVTLTADLYGDDVTYHGVVAGFSGGTGSAFSMIP 342
+VP D V A + ++ + VGQ + + + T +G + G I
Sbjct: 359 IVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF--PYTRYGYLVG-------KVKNIN 409

Query: 343 AQNATGNWIKVVQRLPIRIELD--PKDLQAYPLQVGLSMVATIDT 385
+ +V + I IE + + PL G+++ A I T
Sbjct: 410 LDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKT 454


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_1411   TCRTETB  102    2e-25    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 102 bits (256), Expect = 2e-25
Identities = 82/395 (20%), Positives = 159/395 (40%), Gaps = 16/395 (4%)

Query: 24 LAMANFLAILDTTIANVSVSNIAGSLGTSTSQGTYVITSYAVAEAISVPLTGWLASRFGS 83
L + +F ++L+ + NVS+ +IA + +V T++ + +I + G L+ + G
Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78

Query: 84 IRVFVTCFLLFGVFSLLCGLANSM-STLVMFRVLLGFVGGPLMPLSQTLMIRIFPKNKSH 142
R+ + ++ S++ + +S S L+M R + G L ++ R PK
Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138

Query: 143 AAIGIWSMTTLVAPIMGPILGGVLCDQLSWPYIFFIKMPFAIAAALLCWKLLKKFETKTT 202
A G+ + +GP +GG++ + W Y+ I M I KLLKK
Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLKKEVRIKG 197

Query: 203 HSKIDKVGLALLVVWVAALQLMLDEGKDHDWFESSRIVFLAVIAVIGFIAFLIWELTERN 262
H D G+ L+ V + L F +S + +++V+ F+ F+ +
Sbjct: 198 H--FDIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVTD 245

Query: 263 PVVDLKVFRHRGYSISMVTLSLAFGAFFSISVVTPLWLQIYMGYTATISGHATASMGILA 322
P VD + ++ + I ++ + FG + P ++ + G G ++
Sbjct: 246 PFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMS 305

Query: 323 V-FLAPIVANLSSKFDPRPFVFAGVMWLGLWTFMRGFNTVDMTFSQISWPLFFQGIGMPL 381
V I L + P + GV +L + F ++ T ++ + F G+
Sbjct: 306 VIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASF-LLETTSWFMTIIIVFVLGGLSF 364

Query: 382 FFVPLTAIALGSVKPHEMESAAGLMNFIRTLSGAF 416
++ I S+K E + L+NF LS
Sbjct: 365 TKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399


34  VC_1423  VC_1430  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_1423   1       20       -3.638549  hypothetical protein
VC_1424   -1      20       -3.630835  spermidine/putrescine ABC transporter,
VC_1425   -1      18       -3.839495  spermidine/putrescine ABC transporter,
VC_1426   -2      13       -3.997099  spermidine/putrescine ABC transporter, permease
VC_1427   -2      11       -4.087681  spermidine/putrescine ABC transporter, permease
VC_1428   -1      13       -3.776897  spermidine/putrescine ABC transporter,
VC_1429   -1      12       -4.048502  hypothetical protein
VC_1430   0       13       -3.050333  bax protein, putative
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_1423   cloacin  30     0.010    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.1 bits (67), Expect = 0.010
Identities = 18/63 (28%), Positives = 31/63 (49%), Gaps = 5/63 (7%)

Query: 80 QNQEEAWQHKMD-----QALEKQREEWQAEAVQRDKYVAELEKQNLNLEQQLNEQKMALE 134
Q++E Q + D +A E+ E +AE Q ++ VA +++ Q N +K L+
Sbjct: 300 QDEENRRQQEWDATHPVEAAERNYERARAELNQANEDVARNQERQAKAVQVYNSRKSELD 359

Query: 135 LAN 137
AN
Sbjct: 360 AAN 362


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VC_1424   MALTOSEBP  28     0.048    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 28.2 bits (62), Expect = 0.048
Identities = 31/109 (28%), Positives = 48/109 (44%), Gaps = 9/109 (8%)

Query: 1 MKNKLFASALCAAAL----FTTNAMAKDQELYFYNWSEYIP-----SEVLEDFTKETGIK 51
MK K A L +AL F+ +A+AK +E W +EV + F K+TGIK
Sbjct: 1 MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK 60

Query: 52 VIYSTYESNESMYAKLKTQGAGYDLVVPSTYFVSKMRKEGMLQEIDHSK 100
V + E + ++ G G D++ + + G+L EI K
Sbjct: 61 VTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDK 109


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits      Score  E-value  Comments
VC_1425   MYCMG045  40     9e-06    Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 40.5 bits (94), Expect = 9e-06
Identities = 26/94 (27%), Positives = 45/94 (47%), Gaps = 4/94 (4%)

Query: 34 SACALSLLSGTAAAEDKELVFMNWGPYINSGILEQFTKETGIKVIYSTYESNETLYAKLK 93
+ +SL S ++ V N+ YI+ +LE+ + + + TY SNE L
Sbjct: 10 FSLFVSLSSILSSCGSTTFVLANFESYISPLLLER--VQEKHPLTFLTYPSNEKLINGFA 67

Query: 94 THNQGYDLVVPSTYFVAKMRDEGMLQKIDKSKLS 127
N Y + V STY V+++ + +L ID S+ +
Sbjct: 68 --NNTYSVAVASTYAVSELIERDLLSPIDWSQFN 99


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_1430   FLGFLGJ  29     0.025    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 28.5 bits (63), Expect = 0.025
Identities = 15/34 (44%), Positives = 19/34 (55%), Gaps = 4/34 (11%)

Query: 134 VPEALVLTQAANESAWGTSRFAKE----ANNYFG 163
VP L+L QAA ES WG + +E + N FG
Sbjct: 169 VPHHLILAQAALESGWGQRQIRRENGEPSYNLFG 202


35  VC_1445  VC_1457  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_1445   -1      14       -2.320681  sensor histidine kinase/response regulator
VC_1446   -1      11       0.104752   toxin secretion transporter, putative
VC_1447   -1      11       -0.172846  RTX toxin transporter
VC_1448   0       11       -0.421582  RTX toxin transporter
VC_1449   0       11       -0.203093  hypothetical protein
VC_1450   0       11       -0.276760  RTX toxin activating protein
VC_1451   0       12       -0.670225  RTX toxin RtxA
VC_1452   6       33       -7.445087  RstC protein
VC_1453   3       27       -4.844479  RstB1 protein
VC_1454   4       28       -4.938306  RstA1 protein
VC_1455   5       27       -3.894152  transcriptional repressor RstR
VC_1456   3       26       -3.177454  cholera enterotoxin, B subunit
VC_1457   3       24       -2.080501  cholera enterotoxin, A subunit
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_1445   HTHFIS  79     1e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 78.7 bits (194), Expect = 1e-17
Identities = 26/118 (22%), Positives = 50/118 (42%), Gaps = 1/118 (0%)

Query: 455 HILVVEDTKTNQMVIQLLLNKLGYDVTIAENGLQAVELLEKNHVFDLVLMDISMPIMDGI 514
ILV +D + V+ L++ GYDV I N + DLV+ D+ MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDENAF 63

Query: 515 AATKILRDKHIEIPIIALTAHTAGSDKQNCIDAGMNDIVLKPIRSKDIMEVVKRFLNP 572
++ ++P++ ++A + G D + KP +++ ++ R L
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VC_1447   RTXTOXIND  380    e-130    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 380 bits (977), Expect = e-130
Identities = 151/448 (33%), Positives = 235/448 (52%), Gaps = 16/448 (3%)

Query: 32 TEHHYEFLPAHLALAQRPPSPFARVTALTLSLGVLAALLWAYWGKLDVQATATGRLVVSG 91
+ EFLPAHL L + P S R+ A + ++ A + + G++++ ATA G+L SG
Sbjct: 35 EKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSG 94

Query: 92 RSQVIQSYEQSRLLSIHVRDGQRVEKGAPLLTLDTLGVNQDITRLVSQAEFQTNELIRYR 151
RS+ I+ E S + I V++G+ V KG LL L LG D + S E RY+
Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154

Query: 152 TL--------LNDQLLTHDPMFTALPAEQ----QALIHENYLSEKNEFDSTLASITAEMK 199
L L + L +P F + E+ +LI E + + +N+ +
Sbjct: 155 ILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQ----KYQKELNLD 210

Query: 200 VNRTSQAARQSDIHALLQLTENISQRLAARKKLNQVKVIGHVEYLEQEKELLEAQRQVAQ 259
R + + I+ L+ RL L + I LEQE + +EA ++
Sbjct: 211 KKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRV 270

Query: 260 QRAELEVLKSQYESLEERLTGFKAQKQREWLEKRRQARLQLASLNQELSKVREREQLEII 319
+++LE ++S+ S +E + E L+K RQ + L EL+K ER+Q +I
Sbjct: 271 YKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVI 330

Query: 320 RSPVDGTVQQLSVYTLGAVLQPAQNLMIIVPENRVQQAEVQILNKDVGFVYPGQSVTVKV 379
R+PV VQQL V+T G V+ A+ LM+IVPE+ + + NKD+GF+ GQ+ +KV
Sbjct: 331 RAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKV 390

Query: 380 DAFPYTRYGTIDAELLSISRDSTTDEQLGLVFPAQIQLKTNHIMIDGQTVELTPGMSVVA 439
+AFPYTRYG + ++ +I+ D+ D++LGLVF I ++ N + + + L+ GM+V A
Sbjct: 391 EAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTA 450

Query: 440 EIKTDKRRVIDYLLSPIQEYQAEALRER 467
EIKT R VI YLLSP++E E+LRER
Sbjct: 451 EIKTGMRSVISYLLSPLEESVTESLRER 478


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VC_1450   RTXTOXINC  70     4e-18    Gram-negative bacterial RTX toxin-activating protein C...
		>RTXTOXINC#Gram-negative bacterial RTX toxin-activating protein C signature.

Length = 170

Score = 70.3 bits (172), Expect = 4e-18
Identities = 32/103 (31%), Positives = 56/103 (54%), Gaps = 1/103 (0%)

Query: 18 MIGGVMLLSQHSPLHRRYVVAEWLQRILPAFELNQFCYYEDEHGRPIAFCNWAFVSEQIR 77
++G V L SPLHR + V+ + +LPA + NQ+ + P+A+C+WA +S +
Sbjct: 9 ILGHVSWLWASSPLHRNWPVSLFAINVLPAIQANQYVLLTRD-DYPVAYCSWANLSLENE 67

Query: 78 DELLSGVREISPSDWRSGQQIYIPEMIAPFGHGREVVNDLRRR 120
+ L+ V + DW SG + + + IAPFG + +R++
Sbjct: 68 IKYLNDVTSLVAEDWTSGDRKWFIDWIAPFGDNGALYKYMRKK 110


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VC_1451   RTXTOXINA  65     9e-12    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 64.6 bits (157), Expect = 9e-12
Identities = 60/261 (22%), Positives = 99/261 (37%), Gaps = 72/261 (27%)

Query: 4109 GGRGDDVFYATGKSNIFTGGEGNDMGVLMGRENMMFGGDGNDTAVVAGRINHVFLGAGDD 4168
G D F+ + ++IF G +G+D ++ G DGND ++ G+D
Sbjct: 724 GTTRADKFFGSKFTDIFHGADGDD---------LIEGNDGND---------RLYGDKGND 765

Query: 4169 QSFVFGEGGEIDTGSGRDYVVTSGNFNRVDTGDDQDYSVTIGNNNQVELGAGNDFANIFG 4228
+ G+G D ++ GD D + + NN + G G+D + G
Sbjct: 766 ---------TLSGGNGDD---------QLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQG 807

Query: 4229 N---YNRINAGAGNDVVKLMGYHAVLNGGDGDDHLIATAISKFSQFNGGEGRDLMVLGGY 4285
N N + G GND + +L+GG+GDD L GG G D+
Sbjct: 808 NSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLL-----------KGGYGNDIYRYL-- 854

Query: 4286 QNTFKGGTDVDSFVVSGDVIDNLVEDIRSEDNIVFNGIDWQKLWFERSGYDLKLSILRDP 4345
SG + +D ED + ID++ + F+R G DL +
Sbjct: 855 ---------------SGYGHHIIDDDGGKEDKLSLADIDFRDVAFKREGNDLIMY----- 894

Query: 4346 SNDSDQSKFEHIGSVTFSDYF 4366
+ + H +TF ++F
Sbjct: 895 KGEGNVLSIGHKNGITFRNWF 915



Score = 47.7 bits (113), Expect = 1e-06
Identities = 36/135 (26%), Positives = 55/135 (40%), Gaps = 9/135 (6%)

Query: 4091 GGDGKDLGAYLGDNNNFWGGRGDDVFYATGKSNIFTGGEGNDMGVLMGRENMMFGGDGND 4150
G DG DL N+ +G +G+D + GG+GND + + N + GGDG+D
Sbjct: 742 GADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDD 801

Query: 4151 TAVVAG---RINHVFLGAGDDQSFVFGEGGEIDTGSGRDYVVTSGNFNRVDTGDDQDYSV 4207
V G N +F G G+D+ + +D G G D + D Y
Sbjct: 802 EFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLL------KGGYGNDIYRYLS 855

Query: 4208 TIGNNNQVELGAGND 4222
G++ + G D
Sbjct: 856 GYGHHIIDDDGGKED 870



Score = 43.4 bits (102), Expect = 2e-05
Identities = 38/129 (29%), Positives = 57/129 (44%), Gaps = 6/129 (4%)

Query: 4075 GDGDINISLGNYN-FNWGGDGKDLGAYLGDNNNFWGGRGDDVFYATG---KSNIFTGGEG 4130
G+ +S GN + +GGDG D + NN GG GDD F G N+ GG+G
Sbjct: 761 DKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKG 820

Query: 4131 NDMGVLMGRENMMFGGDGNDTAVVAGRINHVFL-GAGDDQSFVFGEGGEIDTGSGRDYVV 4189
ND +++ GG+G+D + G N ++ +G + +GG+ D S D
Sbjct: 821 NDKLYGSEGADLLDGGEGDDL-LKGGYGNDIYRYLSGYGHHIIDDDGGKEDKLSLADIDF 879

Query: 4190 TSGNFNRVD 4198
F R
Sbjct: 880 RDVAFKREG 888


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_1456   ENTEROTOXINB  239    7e-86    Heat labile enterotoxin B chain signature.
		>ENTEROTOXINB#Heat labile enterotoxin B chain signature.

Length = 124

Score = 239 bits (612), Expect = 7e-86
Identities = 124/124 (100%), Positives = 124/124 (100%)

Query: 1 MIKLKFGVFFTVLLSSAYAHGTPQNITDLCAEYHNTQIYTLNDKIFSYTESLAGKREMAI 60
MIKLKFGVFFTVLLSSAYAHGTPQNITDLCAEYHNTQIYTLNDKIFSYTESLAGKREMAI
Sbjct: 1 MIKLKFGVFFTVLLSSAYAHGTPQNITDLCAEYHNTQIYTLNDKIFSYTESLAGKREMAI 60

Query: 61 ITFKNGAIFQVEVPGSQHIDSQKKAIERMKDTLRIAYLTEAKVEKLCVWNNKTPHAIAAI 120
ITFKNGAIFQVEVPGSQHIDSQKKAIERMKDTLRIAYLTEAKVEKLCVWNNKTPHAIAAI
Sbjct: 61 ITFKNGAIFQVEVPGSQHIDSQKKAIERMKDTLRIAYLTEAKVEKLCVWNNKTPHAIAAI 120

Query: 121 SMAN 124
SMAN
Sbjct: 121 SMAN 124


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_1457   ENTEROTOXINA  471    e-173    Heat-labile enterotoxin A chain signature.
		>ENTEROTOXINA#Heat-labile enterotoxin A chain signature.

Length = 258

Score = 471 bits (1214), Expect = e-173
Identities = 206/258 (79%), Positives = 232/258 (89%)

Query: 1 MVKIIFVFFIFLSSFSYANDDKLYRADSRPPDEIKQSGGLMPRGQSEYFDRGTQMNINLY 60
M I F+FFI L+S YAN D+LYRADSRPPDEIK+SGGLMPRG +EYFDRGTQMNINLY
Sbjct: 1 MKNITFIFFILLASPLYANGDRLYRADSRPPDEIKRSGGLMPRGHNEYFDRGTQMNINLY 60

Query: 61 DHARGTQTGFVRHDDGYVSTSISLRSAHLVGQTILSGHSTYYIYVIATAPNMFNVNDVLG 120
DHARGTQTGFVR+DDGYVSTS+SLRSAHL GQ+ILSG+STYYIYVIATAPNMFNVNDVLG
Sbjct: 61 DHARGTQTGFVRYDDGYVSTSLSLRSAHLAGQSILSGYSTYYIYVIATAPNMFNVNDVLG 120

Query: 121 AYSPHPDEQEVSALGGIPYSQIYGWYRVHFGVLDEQLHRNRGYRDRYYSNLDIAPAADGY 180
YSPHP EQEVSALGGIPYSQIYGWYRV+FGV+DE+LHRNR YRDRYY NL+IAPA DGY
Sbjct: 121 VYSPHPYEQEVSALGGIPYSQIYGWYRVNFGVIDERLHRNREYRDRYYRNLNIAPAEDGY 180

Query: 181 GLAGFPPEHRAWREEPWIHHAPPGCGNAPRSSMSNTCDEKTQSLGVKFLDEYQSKVKRQI 240
LAGFPP+H+AWREEPWIHHAP GCGN+ R+ +TC+E+TQ+L +L EYQSKVKRQI
Sbjct: 181 RLAGFPPDHQAWREEPWIHHAPQGCGNSSRTITGDTCNEETQNLSTIYLREYQSKVKRQI 240

Query: 241 FSGYQSDIDTHNRIKDEL 258
FS YQS++D +NRI+DEL
Sbjct: 241 FSDYQSEVDIYNRIRDEL 258


36  VC_1602  VC_1607  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_1602   -2      12       -0.461882  chemotaxis protein CheV
VC_1603   -2      13       0.265151   hypothetical protein
VC_1604   -1      13       0.920398   response regulator
VC_1605   -1      12       0.798229   sensor kinase citA, putative
VC_1606   1       13       0.886353   conserved hypothetical protein
VC_1607   0       12       0.750669   conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_1602   HTHFIS  69     5e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 68.7 bits (168), Expect = 5e-15
Identities = 33/116 (28%), Positives = 52/116 (44%), Gaps = 15/116 (12%)

Query: 197 TIMVVDDSAFIRSLIQDTLSSAGYNIIACKDGGEAHEKLMELKQAAKEENVPVSELIDAV 256
TI+V DD A IR+++ LS AGY++ + + + D V
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA-------------AGDGDLV 51

Query: 257 VTDVEMPRMDGMHLIKRLRDDDSYSSMPIVMFSSLMSDDNRAKALALGANDTLTKP 312
VTDV MP + L+ R++ +P+++ S+ + KA GA D L KP
Sbjct: 52 VTDVVMPDENAFDLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKP 105


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_1604   HTHFIS  82     4e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.8 bits (202), Expect = 4e-20
Identities = 35/129 (27%), Positives = 63/129 (48%), Gaps = 6/129 (4%)

Query: 2 KTRLLIIEDDAAIAQLHHRYLSQLEGFEVVGIAMSQADARLQMEVLNPDLVLLDVYLPDG 61
+L+ +DDAAI + ++ LS+ G++V + + A + + DLV+ DV +PD
Sbjct: 3 GATILVADDDAAIRTVLNQALSR-AGYDVRITS-NAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 62 TGLELLQWIRGRNVHCDVILITAAREVETLQQAMRGGVVDYLLKPV----MFPRLETALR 117
+LL I+ V++++A T +A G DYL KP + + AL
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 118 KYQMRQAEL 126
+ + R ++L
Sbjct: 121 EPKRRPSKL 129


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_1605   PF06580  38     9e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.5 bits (87), Expect = 9e-05
Identities = 39/224 (17%), Positives = 82/224 (36%), Gaps = 40/224 (17%)

Query: 321 EQLAQTREYAEMLRSQTHEH--RNKLNTISGLLQMGELDAVQQLIGQETEHYQVLIEFLR 378
+AQ + L++Q + H N LN I L+ ++++ L E +R
Sbjct: 155 ASMAQEAQL-MALKAQINPHFMFNALNNIRALILEDP-TKAREMLTS-------LSELMR 205

Query: 379 ETIKDPLIAGMLLGKTERARELGLELM-VEDGARLE-TLPIHIKAEDVVT---ILGNLID 433
+++ + L + L+L ++ RL+ I+ DV ++ L++
Sbjct: 206 YSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVE 265

Query: 434 NAFEATLTAIRTLSPVPPERRVIEVSISDFGNEIILEVDDQGCGLPKELEHWQLTEKGVS 493
N + + + + I + + + LEV++ G K +
Sbjct: 266 NGIKHGIAQLP-------QGGKILLKGTKDNGTVTLEVENTGSLALKNTK---------- 308

Query: 494 SKAVQNRGVGLFLVKQ-LADRY--QGQLDMHSKAQGTRMTVYLP 534
++ G GL V++ L Y + Q+ + K V +P
Sbjct: 309 ----ESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VC_1607   RTXTOXIND  46     2e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 45.6 bits (108), Expect = 2e-07
Identities = 22/165 (13%), Positives = 56/165 (33%), Gaps = 15/165 (9%)

Query: 33 VTLQGQI--ESQQYSISSKVPGRIDQVLVRKGDQVEKGQLIFTLLSPEIDAKLEQAIAGQ 90
T G++ + I + +++V++G+ V KG ++ L + +A + +
Sbjct: 84 ATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSL 143

Query: 91 KAAGALAQEAENGAREQQIQAAKDQWLKAQAAAELAEKTYLRVNNLYNDGVVSEQKRDEA 150
A + +R ++ + L + + + + + E
Sbjct: 144 LQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSE---------EEVLRLTSLIKEQ 194

Query: 151 QTQWQAAKYTESAALQMYQLAKEGARE----ETKQAALEKARMAA 191
+ WQ KY + L + + + +EK+R+
Sbjct: 195 FSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDD 239


37  VC_1643  VC_1653  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_1643   -1      15       -0.664891  methyl-accepting chemotaxis protein
VC_1644   -1      19       -2.032196  hypothetical protein
VC_1645   0       16       -1.666207  conserved hypothetical protein
VC_1646   -1      15       -1.968150  hypothetical protein
VC_1647   -1      14       -1.856492  conserved hypothetical protein
VC_1648   -1      16       -3.013547  hypothetical protein
VC_1649   -1      16       -3.083903  trypsin, putative
VC_1650   -2      13       -2.077298  collagenase
VC_1651   -3      14       -2.326816  response regulator VieB
VC_1652   -2      15       -2.209343  response regulator VieA
VC_1653   -3      14       -2.100041  sensory box sensor histidine kinase/response
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VC_1643   FLAGELLIN  31     0.015    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 31.2 bits (70), Expect = 0.015
Identities = 16/82 (19%), Positives = 36/82 (43%), Gaps = 3/82 (3%)

Query: 494 RDSANLIEQGVSGASKAVEKARLAGTALSQITANVDRISTMNTQIATAS---EEQSAVTE 550
+ + Q A+ + A+ AL++I N+ R+ ++ Q + + ++ +
Sbjct: 54 TSNIKGLTQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQD 113

Query: 551 EINKNIITISEISNQTALGAQQ 572
EI + + I +SNQT +
Sbjct: 114 EIQQRLEEIDRVSNQTQFNGVK 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_1645   BONTOXILYSIN  28     0.029    Bontoxilysin signature.
		>BONTOXILYSIN#Bontoxilysin signature.

Length = 1196

Score = 28.3 bits (63), Expect = 0.029
Identities = 9/32 (28%), Positives = 17/32 (53%)

Query: 93 LQLMDDLLKAGYRLYALTDNVNEIVAYLKKQY 124
++L L+K+ Y LY + + N +V Y +
Sbjct: 220 MELTKCLIKSLYFLYGIKPSDNLVVPYRLRTE 251


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VC_1649   V8PROTEASE  38     5e-05    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 38.4 bits (89), Expect = 5e-05
Identities = 37/240 (15%), Positives = 68/240 (28%), Gaps = 44/240 (18%)

Query: 30 SSRIIGGEQATAGEWPYMVAL-TARNSSHVFCGGSYLGGRYVLTAAHCVDKEDPAKGDVL 88
++ T G + + + + G +G +LT H VD +
Sbjct: 73 NNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVVDATHGDPHALK 132

Query: 89 LGAFDMNDVNTAERIHVRQIYVHNSYITASMGNDIAVLELERDPL---PRRSVQISDSSD 145
+N N + S D+A+++ + V+ + S+
Sbjct: 133 AFPSAINQDNYPN-----GGFTAEQITKYSGEGDLAIVKFSPNEQNKHIGEVVKPATMSN 187

Query: 146 FNELTKDSPMTVIGFGNRKEVDGEKSDPATILHQVQVPFVPLPECKTKGSDQDAKNNYSQ 205
E + +TV G+ K V I K Q
Sbjct: 188 NAETQVNQNITVTGYPGDKPVATMWESKGKI--------------------TYLKGEAMQ 227

Query: 206 LTNNAFCAGSFGKDACSGDSGGPIFFDSNNGRKQMGVVSWGDGCGRANSPGVYTNLSVFN 265
+ G+SG P+F N + +G+ G + V+ N +V N
Sbjct: 228 YDLSTTG----------GNSGSPVF---NEKNEVIGIHWG--GVPNEFNGAVFINENVRN 272


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VC_1650   MICOLLPTASE  522    e-173    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 522 bits (1346), Expect = e-173
Identities = 111/598 (18%), Positives = 212/598 (35%), Gaps = 65/598 (10%)

Query: 41 DLPAQIAVATQACYNSWFYAPTATLDNLYSEASLAHLQTVLDAEIARYTGEAQQARRLEN 100
DL I + F + + + + L+ YT + + +
Sbjct: 105 DLVELIKTISYENVPDLFNFNDGSYTFFSNRDRVQAIIYGLEDSGRTYTAD--DDKGIPT 162

Query: 101 YGEFIRAAYYVRYNAGR--EPYSQALSQRFAQSIDRFLRHPHAFDQGREQVGAMKSLSLM 158
EF+RA YY+ + + + L ++ + + + Q G +++L +
Sbjct: 163 LVEFLRAGYYLGFYNKQLSYLNTPQLKNECLPAMKAIQYNSNFRLGTKAQDGVVEALGRL 222

Query: 159 VDNVKQLPLTMDAMILALHRFNRETAQDTQWVDGLNNLFRAMSG---------------H 203
+ N P ++ I L F + N +F M G
Sbjct: 223 IGNASADPEVINNCIYVLSDFKDNIDKYGSNYSKGNAVFNLMKGIDYYTNSVIYNTKGYD 282

Query: 204 VGNSEFYRYLAANTQHIDTL----YRFALDNEWALETDAEFLVYNALRETGRLL-ISPDA 258
N+EFY + + +++L + DN W LV NAL TGR+ D
Sbjct: 283 AKNTEFYNRIDPYMERLESLCTIGDKLNNDNAW--------LVNNALYYTGRMGKFREDP 334

Query: 259 ITKQKARHVMRQVIARYPLGSKHDKLWLAAVEMLHYYAPEVLQQLGIDLDAAKRDLAARI 318
Q+A + + + YP S +++ ID + K D +
Sbjct: 335 SISQRA---LERAMKEYPYLSYQYIEAANDLDLNFGGKNSSGN--DIDFNKIKADAREKY 389

Query: 319 LPNRFECQ-GPAIIRSQD-LSDAQAAQACDVLDKKEQDFHQVANTGLAPVADDYNTRVEV 376
LP + G ++++ D +++ + + + + F +V A + + + V
Sbjct: 390 LPKTYTFDDGKFVVKAGDKVTEEKIKRLYWASKEVKAQFMRVVQNDKALEEGNPDDILTV 449

Query: 377 VVFANNSSYVNYSSFLFGNTTDNGGQYLEGNPADQNNQARFVAYRYANDADLSILN--LE 434
V++ + Y + G +TDNGG Y+E N F Y + + L
Sbjct: 450 VIYNSPEEYKLN-RIINGFSTDNGGIYIE-------NIGTFFTYERTPEESIYTLEELFR 501

Query: 435 HEYTHYLDARFNQYGSFSDN--LAHGHIVWWLEGFAEYM--HYKQGYQAAVKLISQG--- 487
HE+THYL R+ G + G + W+ EG AE+ + K ++QG
Sbjct: 502 HEFTHYLQGRYVVPGMWGQGEFYQEGVLTWYEEGTAEFFAGSTRTDGIKPRKSVTQGLAY 561

Query: 488 ----KLSLSDVFATTYSNDTNRIYRWGYLAVRFMLEKHPQDVESLLALSRTGQFDQWAQS 543
++SL V Y + Y +G+ +M + + + +
Sbjct: 562 DRNNRMSLYGVLHAKY--GSWDFYNYGFALSNYMYNNNMGMFNKMTNYIKNNDVSGYKDY 619

Query: 544 VKLLGERY--NTEFSAWLDTLQRDNPDNPDNPDNPDNPDNPDNPEQPNPEPNAVTQLA 599
+ + Y N ++ ++D+L +N DN D P D N + N N + +++
Sbjct: 620 IASMSSDYGLNDKYQDYMDSL-LNNIDNLDVPLVSDEYVNGHEAKDINEITNDIKEVS 676



Score = 68.6 bits (167), Expect = 6e-14
Identities = 26/187 (13%), Positives = 45/187 (24%), Gaps = 29/187 (15%)

Query: 404 LEGNPADQNN------QARFVAYRYANDADLSILNLEHEYTHYLDARFNQYGSFSDNLAH 457
N N Y+ + S L +Y Y+D+ N + L
Sbjct: 594 YNNNMGMFNKMTNYIKNNDVSGYKDYIASMSSDYGLNDKYQDYMDSLLNNIDNLDVPLVS 653

Query: 458 GHIVWWLEGFAEYMHYKQGYQAAVKLISQGKL-SLSDVFATTYSNDTNRIYRWGYLAVRF 516
+ A+ ++ V I F TTY R Y+ R
Sbjct: 654 --DEYVNGHEAKDINEITNDIKEVSNIKDLSSNVEKSQFFTTYD------MRGTYVGGRS 705

Query: 517 MLEKHPQ-----DVESLLALSRTGQFD-------QWAQSVKLLGER--YNTEFSAWLDTL 562
E++ + +L ++ + Y+ F
Sbjct: 706 QGEENDWKDMNSKLNDILKELSKKSWNGYKTVTAYFVNHKVDGNGNYVYDVVFHGMNTDT 765

Query: 563 QRDNPDN 569
D N
Sbjct: 766 NTDVHVN 772


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_1651   HTHFIS  48     9e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 47.5 bits (113), Expect = 9e-08
Identities = 26/146 (17%), Positives = 53/146 (36%), Gaps = 8/146 (5%)

Query: 10 KVLIIDDAPIVIASLRSMLLKLGFTEPNIVWSKSPRAAVFMAGRQRFDIFICDYNFGKGL 69
+L+ DD + L L + G+ + + + D+ + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYD---VRITSNAATLWRWIAAGDGDLVVTDVVMP-DE 60

Query: 70 NGKQVFEELKHYKLIKQDAVFVLVTGENSAYVVHSILELKPDEYILKPFNIMTLQERLTN 129
N + L K + D ++++ +N+ E +Y+ KPF++ L +
Sbjct: 61 NAFDL---LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117

Query: 130 AIS-RKHALKALYQAERDGDAELGLS 154
A++ K L +DG +G S
Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPLVGRS 143


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_1652   HTHFIS  62     2e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 62.2 bits (151), Expect = 2e-12
Identities = 32/121 (26%), Positives = 51/121 (42%), Gaps = 12/121 (9%)

Query: 2 KIMIVEDDRIQATSLKLKLRQLGLDDVILAENGFAALELCNKVGIDLMFCDIRMPQMDGI 61
I++ +DD T L L + G D V + N DL+ D+ MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYD-VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 SLLSQLSLQAPKLGVVILSA---VEDTILELTHNMCSLAGFAYVDRLPKPYEVEDLQRVV 118
LL ++ P L V+++SA I G AY D LPKP+++ +L ++
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKAS------EKG-AY-DYLPKPFDLTELIGII 115

Query: 119 E 119

Sbjct: 116 G 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_1653   HTHFIS  59     5e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 59.5 bits (144), Expect = 5e-11
Identities = 38/153 (24%), Positives = 71/153 (46%), Gaps = 11/153 (7%)

Query: 918 GHVLVADDDAINRLLIKKQLSELGLSATLVSDGLQAFEKLSQHPEQYDLLITDCHMPHLD 977
+LVADDDA R ++ + LS G + S+ + ++ DL++TD MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAA--GDGDLVVTDVVMPDEN 61

Query: 978 GFALTRKVKQEISLFKGAVVGCTAEDSRLAAEQALQAGMDKVIYKPYTLANLRKVLSRYL 1037
F L ++K+ V+ +A+++ + A +A + G + KP+ L L ++ R
Sbjct: 62 AFDLLPRIKKARPDLP--VLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR-- 117

Query: 1038 TTQWVALPEQSWLDAYQEEEREEMAMVVAESLA 1070
AL E + E++ ++ +V S A
Sbjct: 118 -----ALAEPKRRPSKLEDDSQDGMPLVGRSAA 145


38  VC_1673  VC_1679  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
VC_1673   -1      14       1.533579  transporter, AcrB/D/F family
VC_1674   -1      14       1.637919  periplasmic linker protein, putative
VC_1675   -3      16       1.455644  multidrug resistance protein, putative
VC_1676   -2      16       0.832453  phage shock protein C
VC_1677   -2      17       0.585175  phage shock protein B
VC_1678   -2      15       0.757646  phage shock protein A
VC_1679   -1      17       1.062779  psp operon transcriptional activator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_1673   ACRIFLAVINRP  492    e-159    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 492 bits (1268), Expect = e-159
Identities = 219/1051 (20%), Positives = 436/1051 (41%), Gaps = 63/1051 (5%)

Query: 22 VAAYFINNRVISWMVSLIFLIGGTAAFFNLGRLEDPAFTIKDAMVVTSYPGATPQQVEEE 81
+A +FI + +W++++I ++ G A L + P V +YPGA Q V++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 82 VTYPLEKAIQQLTYVDEVNSISSR-GLSQITVTMKNNYGPDDLPQIWDELRRKVNDLKGA 140
VT +E+ + + + ++S S G IT+T ++ PD +++ K+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQV---QVQNKLQLATPL 117

Query: 141 LPPGVNPP-LVIDDFGDVYGILLAVTGEGYSY--KELLDYVD-YLRRELELIDGVSKVSV 196
LP V + ++ Y ++ + ++ DYV ++ L ++GV V +
Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177

Query: 197 SGQQQEQVFIEISMKRISTLGISPQTVFNLLSTQNLVSDAGAIRIGSEYI-------RIH 249
G Q + I + ++ ++P V N L QN AG + G+ + I
Sbjct: 178 FGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQL-GGTPALPGQQLNASII 235

Query: 250 PTGEFDDVEKLGDLILSERGAQGLIYLRDVAEVKRGYVEVPSNVITFNGKLALNVGVSFA 309
F + E+ G + L ++ L+DVA V+ G E + + NGK A +G+ A
Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELG-GENYNVIARINGKPAAGLGIKLA 294

Query: 310 QGVNVVEVGQRFDRRLAELKYQQPIGIDIAEVYSQPKEVDKSVSGFVVSLGQAVAIVIIV 369
G N ++ + +LAEL+ P G+ + Y V S+ V +L +A+ +V +V
Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354

Query: 370 LLFFMG-LRSGLLIGLILLLTVLGTFIFMQYFKIDLQRISLGALVIALGMLVDNAIVVVE 428
+ F+ +R+ L+ + + + +LGTF + F + +++ +V+A+G+LVD+AIVVVE
Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414

Query: 429 GILIGTQKGRTRLQAAT-DIVTQTKWPLLGATVIAVTAFAPIGLSEDATGEYCGTLFTVL 487
+ + + + AT ++Q + L+G ++ F P+ +TG +
Sbjct: 415 NVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITI 474

Query: 488 LISLMLSWFTAISLTPFFADLFFRGQKAPASGEESDPYQGFIFVVYRRFLEFC------M 541
+ ++ LS A+ LTP + A + + F + +
Sbjct: 475 VSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKIL 534

Query: 542 RRAWLTMGVLVLGLAASLYGFTKVKQAFFPSSTTPMFMVDVWMPEGTDIRATDAILLELE 601
+ + L +A + F ++ +F P +F+ + +P G T +L ++
Sbjct: 535 GSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVT 594

Query: 602 KWLSAQES--VDSVTTTAGKGLQRFMLTYSPEKSYAAYGEIT-----TRVTDYQQLAALM 654
+ E V+SV T G ++S + A ++ R D A++
Sbjct: 595 DYYLKNEKANVESVFTVNG-------FSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVI 647

Query: 655 ARFRAHLDARYPQINYKLK---QIELGPGGGAKIE-ARIVGSDPTVLRSIAAQVMDVMYA 710
R + L +ELG G E G L Q++ +
Sbjct: 648 HRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQ 707

Query: 711 DPGA-YNIRHDWRERTKVLEPQFNESQARRYGITKADVDEFLAMSFSGKTIGVYRDGTTL 769
P + ++R + E T + + ++ +A+ G++ +D+++ ++ + G + + D +
Sbjct: 708 HPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRV 767

Query: 770 MPIVARLPEEERVDIRNIEGMKIWSPALSEYIPLQQVTLGYEMRWED--PLIVRKNRKRM 827
+ + + R+ +++ + + S A E +P T W P + R N
Sbjct: 768 KKLYVQADAKFRMLPEDVDKLYVRS-ANGEMVPFSAFT---TSHWVYGSPRLERYNGLPS 823

Query: 828 LTVMADPDL-LGEETAATLQQRLQPQIEAIPLPPGYFLEWGGEYESSGDAKASLFKTMPL 886
+ + + A L + L + LP G +W G + + +
Sbjct: 824 MEIQGEAAPGTSSGDAMALMENLASK-----LPAGIGYDWTGMSYQERLSGNQAPALVAI 878

Query: 887 GYLFMFLITVFLFNSVKESLIVWLTVPLAVIGVTTGLLALNTPFGFMALLGFLSLSGMLL 946
++ +FL L+ S + V L VPL ++GV N ++G L+ G+
Sbjct: 879 SFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSA 938

Query: 947 KNGIVLLDQI-EIEMHSGKDPYLAVVDASLSRVRPVCMAAVTTILGMIPLLPDI-----F 1000
KN I++++ ++ GK A + A R+RP+ M ++ ILG++PL
Sbjct: 939 KNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGA 998

Query: 1001 FRPMAVTIMFGLGFATVLTLIVVPVLYRLFH 1031
+ + +M G+ AT+L + VPV + +
Sbjct: 999 QNAVGIGVMGGMVSATLLAIFFVPVFFVVIR 1029


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VC_1674   RTXTOXIND  41     4e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.4 bits (97), Expect = 4e-06
Identities = 19/133 (14%), Positives = 46/133 (34%), Gaps = 19/133 (14%)

Query: 76 GEISKFHVKEGARVKQGDILAEIEPTDYRLAVDNAQARF-----------SVIDSQYRRS 124
+ + VKEG V++GD+L ++ Q+ + S
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNK 164

Query: 125 QPLV----EKGLLAKSQFDEIAAQR----QIARAELELAKLRLSFTQLRAPMDGIISRVS 176
P + E S+ + + Q + + + + L+ + RA +++R++
Sbjct: 165 LPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARIN 224

Query: 177 AEQFESVQVGQQI 189
+ S ++
Sbjct: 225 RYENLSRVEKSRL 237



Score = 31.3 bits (71), Expect = 0.006
Identities = 20/114 (17%), Positives = 37/114 (32%), Gaps = 12/114 (10%)

Query: 110 AQARFSVIDSQYRRSQPLVEKGLLAKSQFDEIA-AQRQIARAELELAKL--RLSFTQLRA 166
Q ++ ++ L D++ I LELAK R + +RA
Sbjct: 276 EQIESEILSAKEEYQL---VTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRA 332

Query: 167 PMDGIISRVSAEQFESV-QVGQQIVNIHNIDS---VEVLIQ--LPDRLYATQPT 214
P+ + ++ V + ++ I D V L+Q + Q
Sbjct: 333 PVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNA 386



Score = 30.6 bits (69), Expect = 0.012
Identities = 17/113 (15%), Positives = 40/113 (35%), Gaps = 5/113 (4%)

Query: 106 AVDNAQARFSVIDSQYRRSQPLVEKGLLAKSQFDEIAAQRQIARAELELAKLRLSFTQLR 165
++ + V S+ L+ K +AK + + + A EL + Q+
Sbjct: 222 RINRYENLSRVEKSRLDDFSSLLHKQAIAKHAV--LEQENKYVEAVNELRVYKSQLEQIE 279

Query: 166 APMDGIISRVS--AEQFESVQVGQQIVNIHNIDSVEV-LIQLPDRLYATQPTA 215
+ + + F++ + + NI + + L + +R A+ A
Sbjct: 280 SEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRA 332


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VC_1675   RTXTOXIND  44     7e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 43.7 bits (103), Expect = 7e-07
Identities = 37/211 (17%), Positives = 74/211 (35%), Gaps = 34/211 (16%)

Query: 95 ALLEQQ---ASANFQLADVQFQRAQ------RLRQDKVVSEQDFD--------QAQANHN 137
A+LEQ+ A +L + Q Q +++ + Q F Q N
Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312

Query: 138 SAKATWEQAKANLRYTQLIAPYDGTIS-LIPAEQYEYVAAKQGVMNI-QTNQLLKVQFLL 195
+ + + + + AP + L + V + +M I + L+V L+
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALV 372

Query: 196 PDHLLNRFSRDGVEAHMVFDSFPNRQYPLQFQEI-----DTEADSKT-------RSYKVT 243
+ + F G A + ++FP +Y ++ D D + S +
Sbjct: 373 QNKDIG-FINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEEN 431

Query: 244 MVMERPNDVGILPGMAGRVSLMAPSGSATVI 274
+ ++ + GMA V+ +G +VI
Sbjct: 432 CLSTGNKNIPLSSGMA--VTAEIKTGMRSVI 460



Score = 36.7 bits (85), Expect = 1e-04
Identities = 23/136 (16%), Positives = 47/136 (34%), Gaps = 24/136 (17%)

Query: 33 AKLTTVSVGNNTFSRQFPATTAAGDRAVLAFRVPGQLQTIDVLAGQEVKKGEVLARLNPD 92
++ V+ N T +G + ++ I V G+ V+KG+VL +L
Sbjct: 78 GQVEIVATANGKL-------THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTAL 130

Query: 93 EYALLEQQASANFQLADVQFQRAQRLRQD-----------------KVVSEQDFDQAQAN 135
+ ++ A ++ R Q L + + VSE++ + +
Sbjct: 131 GAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL 190

Query: 136 HNSAKATWEQAKANLR 151
+TW+ K
Sbjct: 191 IKEQFSTWQNQKYQKE 206


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_1679   HTHFIS  359    e-124    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 359 bits (923), Expect = e-124
Identities = 130/339 (38%), Positives = 182/339 (53%), Gaps = 18/339 (5%)

Query: 3 QSLLGESPAFLAVLDKVSRLAPIDRPVLVIGERGTGKELIAQRLHYLSKRWEQPLVSLNC 62
L+G S A + ++RL D +++ GE GTGKEL+A+ LH KR P V++N
Sbjct: 137 MPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINM 196

Query: 63 STLSEGLIDSELFGHESGSFTGAKGRHQGRFERAEGGTLFLDELATAPLSVQEKLLRVIE 122
+ + LI+SELFGHE G+FTGA+ R GRFE+AEGGTLFLDE+ P+ Q +LLRV++
Sbjct: 197 AAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQ 256

Query: 123 YGQYERVGGNQVLNANVRLVCATNANLPQMAQQGTFRADLLDRLAFDVIHLPALRHRPED 182
G+Y VGG + ++VR+V ATN +L Q QG FR DL RL + LP LR R ED
Sbjct: 257 QGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAED 316

Query: 183 IALLAEHFAIRMCRELQLPLFVGFSPNALHQLQAYSWPGNVRELKNVVERAV--YQHGLN 240
I L HF + +E F AL ++A+ WPGNVREL+N+V R Y +
Sbjct: 317 IPDLVRHFVQQAEKEGLDVK--RFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVI 374

Query: 241 SQPIDQLVF------NPFQSSDIVTDE-SEQVDVRHTTRTDFHLPLD-------YKAWQQ 286
++ I + +P + + + S V R F D Y
Sbjct: 375 TREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLA 434

Query: 287 QCDIELLQAALTQAKFNQKHAAQLLGLSYHQFRGMMRKY 325
+ + L+ AALT + NQ AA LLGL+ + R +R+
Sbjct: 435 EMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL 473


39  VC_1741  VC_1748  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_1741   1       19       0.578081   transcriptional regulator, TetR family
VC_1742   1       19       0.229861   hypothetical protein
VC_1743   0       16       0.133569   hypothetical protein
VC_1744   0       16       -0.904214  conserved hypothetical protein
VC_1745   0       16       -1.631828  succinate-semialdehyde dehydrogenase
VC_1746   0       16       -2.929416  transcriptional regulator, TetR family
VC_1747   1       19       -3.347518  hypothetical protein
VC_1748   0       17       -3.061474  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_1741   HTHTETR  66     2e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.8 bits (160), Expect = 2e-15
Identities = 22/88 (25%), Positives = 36/88 (40%)

Query: 20 RSSTKEKILDVAEGLFAEYGFNDTSLRTITSKAGVNLASVNYHFGDKKTLVRAVLNRYLE 79
T++ ILDVA LF++ G + TSL I AGV ++ +HF DK L +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 80 AFMPEMKQSLERLNERDDYDMAEVFEAL 107
+ + + E+ +
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHV 96


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_1745   VACCYTOTOXIN  29     0.049    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 29.2 bits (65), Expect = 0.049
Identities = 17/48 (35%), Positives = 23/48 (47%)

Query: 134 ELIPSHKPDARIMVSRQPVGVVAAITPWNFPAAMITRKCAPAFAAGCA 181
E+ +H+ R +VS VG + +ITP AA T PA G A
Sbjct: 2 EIQQTHRKINRPLVSLALVGALVSITPQQSHAAFFTTVIIPAIVGGIA 49


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_1746   HTHTETR  62     4e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.0 bits (150), Expect = 4e-14
Identities = 31/187 (16%), Positives = 72/187 (38%), Gaps = 8/187 (4%)

Query: 4 SEQKRLALIEAAKEEFTQFGFHAANMDRVCERAGTSKRTLYRHFTSKELLFIEVINLLVA 63
+++ R +++ A F+Q G + ++ + + AG ++ +Y HF K LF E+ L +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 64 Q--PHKVGFEYQSTRSLADQLHDYFAAKIDLLYRTIGLDVLRMIV---GEFVRDPALTQQ 118
++ ++ + L + ++ +L I+ EFV + A+ QQ
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128

Query: 119 YLALMGTQDTALTAW-LQAAIKDGKLIEKEVAPMATTLMNLFHGQFL--WPQLLAAVELP 175
+ + L+ I+ L + A +M + + W + +L
Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLK 188

Query: 176 DAKQQQI 182
+ +
Sbjct: 189 KEARDYV 195


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VC_1748   IGASERPTASE  30     0.011    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.0 bits (67), Expect = 0.011
Identities = 12/35 (34%), Positives = 18/35 (51%), Gaps = 4/35 (11%)

Query: 235 QTSWNPLVGM----QYQLNDSWYLLGEFGFGDRQS 265
TS N L + +Y ++ WYL + G+G QS
Sbjct: 1352 ATSKNTLAQVNFYSKYYADNHWYLGIDLGYGKFQS 1386


40  VC_1756  VC_1764  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_1756   -1      14       -2.355632  periplasmic linker protein, putative
VC_1757   0       19       -3.361670  transporter, AcrB/D/F family
VC_1758   3       33       -6.940356  *integrase, phage family
VC_1760   3       29       -6.688972  helicase, putative
VC_1761   3       26       -6.211379  hypothetical protein
VC_1762   2       24       -5.731537  hypothetical protein
VC_1763   2       22       -5.723999  chemotaxis protein MotB-related protein
VC_1764   2       19       -5.327445  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VC_1756   RTXTOXIND  49     2e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 48.7 bits (116), Expect = 2e-08
Identities = 29/166 (17%), Positives = 58/166 (34%), Gaps = 6/166 (3%)

Query: 77 SGRLTDIAVKEGDQVKKGQLLASLDSRDAKTALEAAQLELKNTEQEYRRAKAIFEKTQAI 136
+ + +I VKEG+ V+KG +L L + A+ Q L E R + + +I
Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR---SI 160

Query: 137 SKAELDKVTNRYDLAKNRVEEAKRKLEYTQITAPFDGVISEKTIENFAQVQANQVIMILQ 196
+L ++ + V E + + I F ++K ++ ++
Sbjct: 161 ELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQ---KELNLDKKRAERL 217

Query: 197 DLNDLEVAIEIPHRVMLSGVRNTRAIAELSAIPNQQFDLKLRTYST 242
+ E RV S + + ++ AI + Y
Sbjct: 218 TVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVE 263



Score = 35.6 bits (82), Expect = 3e-04
Identities = 28/196 (14%), Positives = 68/196 (34%), Gaps = 19/196 (9%)

Query: 95 QLLASLDSRDAKTALEAAQLE--LKNTEQEYRRAKAIFEKTQAISKAELDKVTNRYDLAK 152
+ + Q+E + + ++EY+ +F+ +L + T+ L
Sbjct: 259 NKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL---DKLRQTTDNIGLLT 315

Query: 153 NRVEEAKRKLEYTQITAPFDGVISE-KTIENFAQVQANQVIMILQDLND-LEVAIEIPHR 210
+ + + + + + I AP + + K V + +M++ +D LEV + ++
Sbjct: 316 LELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNK 375

Query: 211 VMLSGVRNTRAIAELSAIPNQQF---DLKLRTYSTQPSSDSQTYSVVL---------GFE 258
+ AI ++ A P ++ K++ + D + V
Sbjct: 376 DIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLST 435

Query: 259 DLKGFRVMPGMSAKVI 274
K + GM+
Sbjct: 436 GNKNIPLSSGMAVTAE 451


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_1757   ACRIFLAVINRP  471    e-152    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 471 bits (1214), Expect = e-152
Identities = 244/1049 (23%), Positives = 445/1049 (42%), Gaps = 60/1049 (5%)

Query: 3 IARYTLAKRTSVWVLIALTLIGGYISYLKLGRFEDPEFVIRQAVIVTPYPGATAQEVSDE 62
+A + + + WVL + ++ G ++ L+L + P + YPGA AQ V D
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 VTDVIEGAVQALQELKEVKSVSMQ-GRSEVTVEIKLEFAKSSAQLQQVWDKLRRKVADAQ 121
VT VIE + + L + S S G +T+ + AQ QV +KL+ A
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQ-VQVQNKLQL----AT 115

Query: 122 RQLPPGA-GASIVNDDFSDVYALFYAV--TGEGFSDKQLQDYVD-TLRRELVLVPGVAKA 177
LP I + S Y + G + + DYV ++ L + GV
Sbjct: 116 PLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDV 175

Query: 178 ATLAEQQEAIFIEMSSERMAEFGLSVERVLQVLQKQSLVTVAGSVDA------QQMRIPV 231
L Q A+ I + ++ + ++ L+ V+ L+ Q+ AG + QQ+ +
Sbjct: 176 -QLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASI 234

Query: 232 IPKSNISSLADLTNLQVAVGSNNAVVRLGDIANISRGYTEPASMLMRYNGQRAIGFGISN 291
I ++ + + + + V S+ +VVRL D+A + G E +++ R NG+ A G GI
Sbjct: 235 IAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELG-GENYNVIARINGKPAAGLGIKL 293

Query: 292 VTGGNVVEMGDAVKARLAELESQRPLGMDLHVISMQSDSVRASVANFIDNLIAAVAIVFV 351
TG N ++ A+KA+LAEL+ P GM + + V+ S+ + L A+ +VF+
Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353

Query: 352 VLLLFMG-VRSGVIIGFVLLLTVAGTLCVMLIDDIAMQRISLGALIIALGMLVDNAIVVT 410
V+ LF+ +R+ +I + + + GT ++ ++ +++ +++A+G+LVD+AIVV
Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413

Query: 411 DGVLVRFQQEPNADKQQVVSEVVNATKWPLLGGTVVGIFAFSAIGLSPSDMGEYAGSLFW 470
+ V R E ++ + ++ + L+G +V F + G
Sbjct: 414 ENV-ERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSI 472

Query: 471 VILYSMFLSWVFAVTVTPMLCHDFLRVKAPTKEAKPSK-----------LVTGYKAVLQW 519
I+ +M LS + A+ +TP LC L+ + V Y +
Sbjct: 473 TIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGK 532

Query: 520 VLSHRVVSCAMLLGTLVAAVWGAQFIPPGFMPESQRPQFVVDVYLPQGSDIRRTEQVVAS 579
+L + + V +P F+PE + F+ + LP G+ RT++V+
Sbjct: 533 ILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQ 592

Query: 580 IEKDVTQKDGITNITSFIGGGGLRFMLTYSPEARNPSYGQL-LIDIDDYTKIAPLVGELQ 638
+ D K+ N+ S G F S +A+N + L ++ +
Sbjct: 593 VT-DYYLKNEKANVESVFTVNGFSF----SGQAQNAGMAFVSLKPWEERNGDENSAEAVI 647

Query: 639 NELDAKY---PDASIKVWKFM----LGRGGGKKIE-AGFKGPDSHVLRQLAEQ-AKAIMH 689
+ + D + + LG G E G L Q Q
Sbjct: 648 HRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQ 707

Query: 690 NDPNLIAVQDDWRQQVPVLQPVYSAQEAQRLGLTTQEISAAIAQTLNGRNVGVYREGNDL 749
+ +L++V+ + + + ++AQ LG++ +I+ I+ L G V + + +
Sbjct: 708 HPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRV 767

Query: 750 IPLMVRAPENERHHERAIENSEVFSAQAGRYIPVSQLVDSVDTVYQDALLRRINRMPTIL 809
L V+A R ++ V SA G +P S VY L R N +P++
Sbjct: 768 KKLYVQADAKFRMLPEDVDKLYVRSA-NGEMVPFSAFTT-SHWVYGSPRLERYNGLPSME 825

Query: 810 VQADPAPGVMTADAFNNVREKIEQI--ELPAGYELIWYGEYKASKDANEGLALSAPYGFA 867
+Q + APG + DA +E + +LPAG W G + + F
Sbjct: 826 IQGEAAPGTSSGDA----MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFV 881

Query: 868 AMILAVVFMFNALRQPLVIWMTAPFAVVGVTIGLIAFQTPFEFMAILGFLSLIGMMVKNA 927
+ L + ++ + P+ + + P +VGV + F + ++G L+ IG+ KNA
Sbjct: 882 VVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNA 941

Query: 928 IVLVDQA-DAEIRAGKEAYFAIIDAAVSRARPVLLGAFTTILGVAPLLVDP-----FFKS 981
I++V+ A D + GK A + A R RP+L+ + ILGV PL + +
Sbjct: 942 ILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNA 1001

Query: 982 MAVTIMFGLLFATILTLVVIPLFYAVLFR 1010
+ + +M G++ AT+L + +P+F+ V+ R
Sbjct: 1002 VGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_1763  OMPADOMAIN  30  0.007  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 30.3 bits (68), Expect = 0.007
Identities = 21/78 (26%), Positives = 34/78 (43%), Gaps = 15/78 (19%)

Query: 98 STLTFASGKSSIPNDATIQQAVKDIGVVLHSAIQKKDRFQYLDTIFIEGHTDSDSIHYRG 157
S + F K+++ + Q A+ + L S + KD ++ + G+TD G
Sbjct: 219 SDVLFNFNKATLKPEG--QAALDQLYSQL-SNLDPKDG-----SVVVLGYTDR-----IG 265

Query: 158 KG--NWGLSTDRAISVWN 173
N GLS RA SV +
Sbjct: 266 SDAYNQGLSERRAQSVVD 283


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_1764  TYPE4SSCAGA  32  0.008  Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 32.4 bits (73), Expect = 0.008
Identities = 61/278 (21%), Positives = 106/278 (38%), Gaps = 37/278 (13%)

Query: 420 SSMESTIKDLVENVSAQSQVLTDFVQNQVVQLTQTFSERDGMA--SQMEKERNDIFVNQT 477
S +E+++KD++ N Q +TD V N L Q S S++E+ D
Sbjct: 765 SDLENSVKDVIIN-----QKVTDKVDN----LNQAVSVAKATGDFSRVEQALAD------ 809

Query: 478 QAMKAGTDELLAQVKAATESQQITTNSIIEQGKQLQNSIDSS-VSASARATESMQQSANE 536
+K + E LAQ ES S I Q ++N ++ + V E+ S N
Sbjct: 810 --LKNFSKEQLAQQAQKNESLNARKKSEIYQ--SVKNGVNGTLVGNGLSQAEATTLSKNF 865

Query: 537 LRVAADSMNVFGSNIKDAGNKLSGAVTEAVNSTKDLAEQNHLG-------AVKVQALREQ 589
+ + G+ + N L A + K + L A KV A ++
Sbjct: 866 SDIKKELNAKLGNFNNNNNNGLKNEPIYAKVNKKKAGQAASLEEPIYAQVAKKVNAKIDR 925

Query: 590 LLEDTSKFSAIADQINNMLISAEQ--SFSTLRTTQNEFLAEQKGNLNELTSEMKRNVKEL 647
L + S + L ++ S + ++N+ LA++ NLN+ SE K
Sbjct: 926 LNQIASGLGVVGQAAGFPLKRHDKVDDLSKVGLSRNQELAQKIDNLNQAVSEAKAGFFGN 985

Query: 648 TEQMAQLLEDYAEQANGQTAEHLKVWANSSTQYAESMN 685
EQ L+D + + +W S+ + S++
Sbjct: 986 LEQTIDKLKDSTKH------NPMNLWVESAKKVPASLS 1017


41  VC_1831  VC_1838  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_1831   0       15       -2.502665  sensor histidine kinase
VC_1832   2       20       -1.924216  hypothetical protein
VC_1833   2       20       -2.054808  quinolinate synthetase A
VC_1834   2       21       -2.163517  conserved hypothetical protein
VC_1835   2       21       -2.560835  peptidoglycan-associated lipoprotein
VC_1836   0       20       -2.609199  tolB protein
VC_1837   3       20       -2.523421  tolA protein
VC_1838   2       18       -2.529820  tolR membrane protein
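
Each gene row of an island table carries a locus tag followed by DNBias, CDNBias, %GCBias and Product. The following is a minimal Python sketch for splitting such a row, whether or not the columns are separated by spaces. It rests on assumptions inferred from the rows in this report, not on anything the report states: DNBias is a single digit that may be negative, CDNBias has two digits, and %GCBias has one integer digit and six decimals. The function name `parse_gene_row` is hypothetical.

```python
import re

# Field widths below are ASSUMPTIONS inferred from the rows in this report:
#   DNBias  : one digit, optionally negative
#   CDNBias : exactly two digits
#   %GCBias : one integer digit, six decimals, optionally negative
ROW_RE = re.compile(
    r"^(VC_\d{4})\s*(-?\d)\s*(\d{2})\s*(-?\d\.\d{6})\s*(.*)$"
)


def parse_gene_row(line):
    """Split one island-table row into (tag, dn_bias, cdn_bias, gc_bias, product)."""
    match = ROW_RE.match(line.strip())
    if match is None:
        raise ValueError(f"row does not fit the assumed layout: {line!r}")
    tag, dn, cdn, gc, product = match.groups()
    return tag, int(dn), int(cdn), float(gc), product.strip()


if __name__ == "__main__":
    # Works on a row with fused columns and on a space-separated row.
    print(parse_gene_row("VC_2141-1100.933319flagellin FlaG"))
    print(parse_gene_row("VC_1831  0  15  -2.502665  sensor histidine kinase"))
```

A row that does not fit the assumed widths (for example a codon bias of 100 or more) raises `ValueError` instead of being split silently at the wrong position.
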
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_1831  HTHFIS  61  7e-12  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.4 bits (149), Expect = 7e-12
Identities = 23/134 (17%), Positives = 46/134 (34%), Gaps = 3/134 (2%)

Query: 603 RVLIVEDNRTNIMILEAFMRNKGFECHSVMDGVQAITALQESSFDLVLMDNHMPLKDGIQ 662
+L+ +D+ +L + G++ + + DLV+ D MP ++
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 663 ATREIRQLPLPQAKILLFGCTADVFKDTRDKMLSAGADDIIAKPIAEHELDMALEQHSER 722
I++ P +L+ T K GA D + KP EL + +
Sbjct: 65 LLPRIKKA-RPDLPVLVMSAQNTF--MTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 723 LYQFHREPSLPSVE 736
+ + S +
Sbjct: 122 PKRRPSKLEDDSQD 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_1835  OMPADOMAIN  103  3e-29  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 103 bits (259), Expect = 3e-29
Identities = 35/124 (28%), Positives = 57/124 (45%), Gaps = 4/124 (3%)

Query: 49 NAQGQLTEQELKEQALRENQTIYFAFDNATIASDYEAMLAAHAAYL--VKNPSLRVTIEG 106
A E++ + + F F+ AT+ + +A L + L + V + G
Sbjct: 200 VAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLG 259

Query: 107 HADERGTPEYNIALGERRAQAVAKYLEALGVQAGQLSIVSYGEEKPLVLGQSEEAYAKNR 166
+ D G+ YN L ERRAQ+V YL + G+ A ++S GE P V G + + K R
Sbjct: 260 YTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNP-VTGNTCD-NVKQR 317

Query: 167 RAVL 170
A++
Sbjct: 318 AALI 321


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_1837  IGASERPTASE  65  1e-13  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 65.1 bits (158), Expect = 1e-13
Identities = 36/197 (18%), Positives = 70/197 (35%), Gaps = 6/197 (3%)

Query: 72 AKKEQERLDKLRRESEQLEKNRQAEEERIRQLKEQQAKEAKAAREAEKLREQKEQERLAA 131
++ + + ++ES+ +EKN Q E Q +E AKEAK+ +A + Q
Sbjct: 1034 SETTETVAENSKQESKTVEKNEQDATETTAQNREV-AKEAKSNVKANTQTNEVAQSGSET 1092

Query: 132 EQKAREEKERAAKAEAERKVKEEAAKKAEQERVAKEAAAAKAEQQRIEREKEAKLAEEKA 191
++ E + A E E K K E K E +V + + ++ + E AE
Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSP-----KQEQSETVQPQAEPAR 1147

Query: 192 KREKEVAAKAEQERLAKEKAAKEAADKAKKEKERAAKAEAERKAQEAALNDIFGSLSEES 251
+ + V K Q + ++ A + E+ + + + + +
Sbjct: 1148 ENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATT 1207

Query: 252 QQNNAARQQFVTSEVGR 268
Q + R
Sbjct: 1208 QPTVNSESSNKPKNRHR 1224



Score = 63.9 bits (155), Expect = 4e-13
Identities = 37/196 (18%), Positives = 66/196 (33%), Gaps = 4/196 (2%)

Query: 76 QERLDKLRRESEQLEKNRQAEEERIRQLKEQQAKEAKAAREAEKLREQKEQERLAAEQKA 135
++R + + N QA+ + E+ A+ +A E AE
Sbjct: 986 EKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSK 1045

Query: 136 REEKE---RAAKAEAERKVKEEAAKKAEQERVAKEAAAAKAEQQRIEREKEAKLAEEKAK 192
+E K A E AK+A+ A A+ +E + +E A
Sbjct: 1046 QESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETAT 1105

Query: 193 REKEVAAKAEQERLAKEKAAKEAADKAKKEKERAAKAEAERKAQEAALNDIFGSLSEESQ 252
EKE AK E E+ + + K+E+ + +AE + +I S+ +
Sbjct: 1106 VEKEEKAKVETEKTQEV-PKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNT 1164

Query: 253 QNNAARQQFVTSEVGR 268
+ + TS
Sbjct: 1165 TADTEQPAKETSSNVE 1180



Score = 58.2 bits (140), Expect = 3e-11
Identities = 31/143 (21%), Positives = 50/143 (34%), Gaps = 3/143 (2%)

Query: 96 EEERIRQLKEQQAKEAKAAREAEKLREQKEQERLAAEQKAREEKERAAKAEAERKVKEEA 155
E E+ Q + +A+ E +A +A A + E
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043

Query: 156 AKKAEQERVAKEAAAAKAEQQRIEREKEAKLAEEKAKREKEVAAKAEQERLAKEKAAKEA 215
+K+ + E A + Q E KEAK + + EVA + + + KE
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103

Query: 216 ADKAKKEKERAAKAEAERKAQEA 238
A K+EK AK E E+ +
Sbjct: 1104 ATVEKEEK---AKVETEKTQEVP 1123



Score = 52.8 bits (126), Expect = 1e-09
Identities = 27/203 (13%), Positives = 64/203 (31%), Gaps = 4/203 (1%)

Query: 38 SDPEPTGQMIEAVVIDPQLVRQQAQQIRSQREEAAKKEQERLDKLRRE-SEQLEKNRQAE 96
S+ E ++ EA V P E +K+E + ++K ++ +E +NR+
Sbjct: 1012 SNNEEIARVDEAPVPPPAPATPSETT--ETVAENSKQESKTVEKNEQDATETTAQNREVA 1069

Query: 97 EERIRQLKEQQAKEAKAAREAEKLREQKEQERLAAEQKAREEKERAAKAEAERKVKEEAA 156
+E +K + + A+ + +E + E +EEK + + + K +
Sbjct: 1070 KEAKSNVKAN-TQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQ 1128

Query: 157 KKAEQERVAKEAAAAKAEQQRIEREKEAKLAEEKAKREKEVAAKAEQERLAKEKAAKEAA 216
+QE+ A+ ++ + + E ++ +
Sbjct: 1129 VSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTT 1188

Query: 217 DKAKKEKERAAKAEAERKAQEAA 239
+ Q
Sbjct: 1189 VNTGNSVVENPENTTPATTQPTV 1211



Score = 43.9 bits (103), Expect = 9e-07
Identities = 38/208 (18%), Positives = 67/208 (32%), Gaps = 4/208 (1%)

Query: 63 QIRSQREEAAKKEQERLDKLRRESEQLEKNRQAEEERIRQLKEQQAKEAKAAREAEKLRE 122
Q +E A +++E+ K+ E Q ++ ++ E +A+ ARE +
Sbjct: 1096 QTTETKETATVEKEEK-AKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVN 1154

Query: 123 QKEQERLAAEQKAREEKERAAKAEAERKVKEEAAKKAEQERVAKEAAAAKAEQQRIEREK 182
KE + E+ + + E+ V E V A Q +
Sbjct: 1155 IKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSE 1214

Query: 183 EAKLAEEKAKRE-KEVAAKAEQERLAKEKAAKEAADKAKKEKERAAKAEAERKAQEAALN 241
+ + + +R + V E + + A A ++A KAQ ALN
Sbjct: 1215 SSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALN 1274

Query: 242 DIFGSLSEESQ--QNNAARQQFVTSEVG 267
SQ NN + S
Sbjct: 1275 VGKAVSQHISQLEMNNEGQYNVWVSNTS 1302


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_1838  adhesinmafb  27  0.028  Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 27.3 bits (60), Expect = 0.028
Identities = 12/40 (30%), Positives = 19/40 (47%), Gaps = 1/40 (2%)

Query: 37 FVTQGVDVELP-KTHSAKSAQDLAGDSDSSFIIVEIDKEG 75
F G + P H+A SA + G+ D F + ++ EG
Sbjct: 98 FSGHGHEEHAPFDNHAADSASEEKGNVDEGFTVYRLNWEG 137


42  VC_1919  VC_1926  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_1919   3       21       -0.463426  DNA-binding protein HU-beta
VC_1920   1       14       -0.120043  ATP-dependent protease LA
VC_1921   1       12       -0.116011  ATP-dependent Clp protease, ATP-binding subunit
VC_1922   0       12       0.484751   ATP-dependent Clp protease, proteolytic subunit
VC_1923   0       11       0.472927   trigger factor
VC_1924   -1      11       0.635184   hypothetical protein
VC_1925   -1      12       0.802057   C4-dicarboxylate transport sensor protein
VC_1926   -2      13       1.052050   C4-dicarboxylate transport transcriptional
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_1919  DNABINDINGHU  120  1e-39  Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 120 bits (303), Expect = 1e-39
Identities = 49/87 (56%), Positives = 63/87 (72%)

Query: 2 NKTQLVEQIAANADISKASAGRALDAFIEAVSGTLQSGDQVALVGFGTFSVRTRAARTGR 61
NK L+ ++A +++K + A+DA AVS L G++V L+GFG F VR RAAR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPKTGEEIKIAEAKVPSFKAGKALKDA 88
NP+TGEEIKI +KVP+FKAGKALKDA
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDA 89


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_1920  HTHFIS  31  0.018  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.0 bits (70), Expect = 0.018
Identities = 26/102 (25%), Positives = 43/102 (42%), Gaps = 16/102 (15%)

Query: 351 LCLVGPPGVGKTSLGRSIAAATGRQ---YVRMALGGVRD---EAEIRGHRRTYIGSMPGK 404
L + G G GK + R++ R+ +V + + + E+E+ GH + G+ G
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEK---GAFTGA 219

Query: 405 LIQKMAKVGVKN--PLFLLDEIDKMSSDMRGDPASALLEVLD 444
+ + LF LDEI M D + + LL VL
Sbjct: 220 QTRSTGRFEQAEGGTLF-LDEIGDMPMDAQ----TRLLRVLQ 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_1925  ANTHRAXTOXNA  31  0.018  Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 30.9 bits (69), Expect = 0.018
Identities = 17/54 (31%), Positives = 25/54 (46%), Gaps = 2/54 (3%)

Query: 83 DITNRYLEQVNEVIQAADT--YLIDRFGNTIASSNWNLDRSFIGRNFAWRPYFY 134
D+ N EQ NE D ++I+ G I + NW + FI +N + Y Y
Sbjct: 573 DVVNHGTEQDNEEFPEKDNEIFIINPEGEFILTKNWEMTGRFIEKNITGKDYLY 626


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_1926  HTHFIS  447  e-157  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 447 bits (1152), Expect = e-157
Identities = 167/483 (34%), Positives = 245/483 (50%), Gaps = 50/483 (10%)

Query: 4 VFLIDDESDIRIALAQSFELADLNAQFFASVEEALLAIKAQGLPLVIVSDICLPGLSGQN 63
+ + DD++ IR L Q+ A + + ++ I A G ++V+D+ +P + +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWI-AAGDGDLVVTDVVMPDENAFD 64

Query: 64 LLSSVLHQDSELPVILITGHGDISMAVKAMHDGAYDFIEKPFAPERLIDTVHRAIEKRRL 123
LL + +LPV++++ A+KA GAYD++ KPF LI + RA+ + +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 124 TLENRQLKRSLKVSQTLGPRIIGETAAIQTLRDTIAQVADTQADILLFGETGTGKELVAR 183
+ G ++G +AA+Q + +A++ T +++ GE+GTGKELVAR
Sbjct: 125 RPSKLEDDSQD------GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVAR 178

Query: 184 SLHEQSSRREQNFVAINCGAVPENLIESELYGHEKGAFTGADSRRVGKFEHAQGGTLFLD 243
+LH+ RR FVAIN A+P +LIESEL+GHEKGAFTGA +R G+FE A+GGTLFLD
Sbjct: 179 ALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLD 238

Query: 244 EIESMPMQAQIRLLRVLQERVIERIGSNELIPLDIRVIAATKIDLKQAAAEGKFRQDLYY 303
EI MPM AQ RLLRVLQ+ +G I D+R++AAT DLKQ+ +G FR+DLYY
Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYY 298

Query: 304 RLNIVTLTIPPLRERREDIPALFHHFLLVAAARYGKAATALTASDVQSLLSHDWPGNVRE 363
RLN+V L +PPLR+R EDIP L HF+ A + G ++ + +H WPGNVRE
Sbjct: 299 RLNVVPLRLPPLRDRAEDIPDLVRHFVQ-QAEKEGLDVKRFDQEALELMKAHPWPGNVRE 357

Query: 364 LRNAAERYVLLGK--------LAQFAESHEPKSKLLG----------------------- 392
L N R L + S P S +
Sbjct: 358 LENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFA 417

Query: 393 -----------LNEQVAEFEKSLLEQTLIECGGSIKATMERLHLPRKTLYDKMQKYQLDK 441
+ +AE E L+ L G+ + L L R TL K+++ +
Sbjct: 418 SFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSV 477

Query: 442 ESY 444

Sbjct: 478 YRS 480
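
Each alignment in this report is preceded by a `Score = ... bits (...), Expect = ...` line and an `Identities = n/m (...)` line. As a minimal sketch for reading those two lines programmatically, the Python below extracts the bit score, raw score, E-value and fractional identity. The names `parse_hsp` and `parse_expect` are hypothetical helpers, not part of PredictBias or BLAST. The only format knowledge assumed is that legacy BLAST abbreviates very small E-values such as `1e-157` to `e-157`, which `float()` cannot read until a leading `1` is restored.

```python
import re

SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def parse_expect(text):
    """Convert an Expect field to float, restoring the '1' that legacy
    BLAST drops from values such as 'e-157'."""
    text = text.rstrip(",")
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hsp(score_line, identities_line):
    """Return bit score, raw score, E-value and fractional identity."""
    score = SCORE_RE.search(score_line)
    ident = IDENT_RE.search(identities_line)
    if score is None or ident is None:
        raise ValueError("lines do not look like BLAST statistics lines")
    matches, length = int(ident.group(1)), int(ident.group(2))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "expect": parse_expect(score.group(3)),
        "identity": matches / length,
    }


if __name__ == "__main__":
    hsp = parse_hsp(
        "Score = 447 bits (1152), Expect = e-157",
        "Identities = 167/483 (34%), Positives = 245/483 (50%), Gaps = 50/483 (10%)",
    )
    print(hsp)
```

The fractional identity is recomputed from the `n/m` counts instead of being read from the printed percentage, because the printed value is a whole number.
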


43  VC_2062  VC_2067  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_2062   1       17       -0.783143  protein-glutamate methylesterase CheB
VC_2063   2       15       -0.423782  chemotaxis protein CheA
VC_2064   -1      14       -0.104404  chemotaxis protein CheZ
VC_2065   0       13       0.328532   chemotaxis protein CheY
VC_2066   0       13       0.604950   RNA polymerase sigma factor for flagellar operon
VC_2067   -1      11       0.462123   MinD-related protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2062  HTHFIS  66  7e-14  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.0 bits (161), Expect = 7e-14
Identities = 26/125 (20%), Positives = 50/125 (40%), Gaps = 6/125 (4%)

Query: 2 AIKVLVVDDSSFFRRRVSEIINSESRLEVIDVAVNGKEAVEKAARLKPDVITMDIEMPVM 61
+LV DD + R +++ ++ + + N A D++ D+ MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 62 DGISAVREIMANNP-VPILMFSSLTHDGAKATLDALDAGALDFLPKKFEDIARNRDEAVT 120
+ + I P +P+L+ S+ + + A + GA D+LPK F D+
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSA--QNTFMTAIKASEKGAYDYLPKPF-DLTELIGIIGR 117

Query: 121 LLQQR 125
L +
Sbjct: 118 ALAEP 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2063  PF06580  41  9e-06  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.4 bits (97), Expect = 9e-06
Identities = 13/79 (16%), Positives = 32/79 (40%), Gaps = 10/79 (12%)

Query: 490 ETDLDKNLVEALADPLI--HLVRNSVDHGIEMPDERAKNGKSRTGKVILSASQEGDHIQL 547
E ++ +++ P++ LV N + HGI + GK++L +++ + L
Sbjct: 243 ENQINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTL 294

Query: 548 AIVDDGAGMDPDKLRGIAV 566
+ + G+ +
Sbjct: 295 EVENTGSLALKNTKESTGT 313


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2065  HTHFIS  88  1e-23  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.3 bits (219), Expect = 1e-23
Identities = 33/105 (31%), Positives = 52/105 (49%), Gaps = 3/105 (2%)

Query: 10 KILIVDDFSTMRRIVKNLLRDLGFNNTQEADDGLTALPMLKKGDFDFVVTDWNMPGMQGI 69
IL+ DD + +R ++ L G++ + T + GD D VVTD MP
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 70 DLLKNIRADEELKHLPVLMITAEAKREQIIEAAQAGVNGYIVKPF 114
DLL I+ + LPVL+++A+ I+A++ G Y+ KPF
Sbjct: 64 DLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2067  CABNDNGRPT  29  0.025  NodO calcium binding signature.
		>CABNDNGRPT#NodO calcium binding signature.

Length = 479

Score = 29.2 bits (65), Expect = 0.025
Identities = 15/68 (22%), Positives = 27/68 (39%), Gaps = 5/68 (7%)

Query: 141 IRAFGTLEDEMDILLIDTAAGISDMVVSFSRAAQDVVVVVCDEPTSITDAYALIKLLSKE 200
I F D++D+ +S + F+ Q+V++ D SIT+ + L
Sbjct: 404 IADFQKGIDKIDLSAFRNEGQLSFVQDQFTGKGQEVMLQW-DAANSITN----LWLHEAG 458

Query: 201 HQVQRFKI 208
H F +
Sbjct: 459 HSSVDFLV 466


44  VC_2120  VC_2147  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_2120   -1      18       1.112103   flagellar biosynthetic protein FlhB
VC_2121   1       16       0.692439   flagellar biosynthetic protein FliR
VC_2122   1       17       0.852283   flagellar biosynthetic protein FliQ
VC_2123   -1      12       2.548448   flagellar biosynthetic protein FliP
VC_2124   -1      13       2.607272   flagellar protein FliO
VC_2125   0       13       3.066947   flagellar motor switch protein FliN
VC_2126   -1      12       3.238536   flagellar motor switch protein FliM
VC_2127   -1      13       3.247331   flagellar protein FliL, putative
VC_2128   -1      12       2.985710   flagellar hook-length control protein FliK,
VC_2129   0       14       2.029213   flagellar protein FliJ, putative
VC_2130   -2      13       2.336112   flagellum-specific ATP synthase FliI
VC_2131   -2      13       1.926047   flagellar assembly protein FliH, putative
VC_2132   -2      13       1.624292   flagellar motor switch protein FliG
VC_2133   -2      12       1.522172   flagellar M-ring protein FliF
VC_2134   -2      13       0.829430   flagellar hook-basal body complex protein FliE
VC_2135   -2      13       1.731009   sigma-54 dependent response regulator
VC_2136   -1      13       1.540991   sensory box sensor histidine kinase
VC_2137   -1      14       1.465499   sigma-54 dependent transcriptional activator
VC_2138   -1      15       1.202676   flagellar protein FliS
VC_2139   -1      13       1.424799   flagellar rod protein FlaI, putative
VC_2140   -1      11       1.671774   flagellar hook-associated protein FliD
VC_2141   -1      10       0.933319   flagellin FlaG
VC_2142   0       11       0.712749   flagellin FlaB
VC_2143   0       10       0.338348   flagellin FlaD
VC_2144   1       14       0.379655   flagellin FlaE
VC_2145   0       16       -0.511465  tyrA protein
VC_2146   4       20       -1.429148  conserved hypothetical protein
VC_2147   1       18       0.534626   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2120  TYPE3IMSPROT  346  e-120  Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 346 bits (890), Expect = e-120
Identities = 107/350 (30%), Positives = 184/350 (52%), Gaps = 4/350 (1%)

Query: 8 ERTEEATPRRLQQAKEKGQVARSKELASVSVLVVGAVSLMWFGEILAKGLMVAMQRLFEL 67
E+TE+ TP++++ A++KGQVA+SKE+ S +++V + LM + + M E
Sbjct: 4 EKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQ 63

Query: 68 SREEIFDLGKLFDIIGGSLVNLLLPLLMILSTLFIAALIGAAGVGGISFSAEAAMPKLSK 127
S + L ++ L+ +L+ + A+ G S EA P + K
Sbjct: 64 SY--LPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKK 121

Query: 128 MNPLSGLKRMFGMQSWVELLKSILKVLLVSGVAFYLIEASQKDLFQLSLDVYPQNIFHAL 187
+NP+ G KR+F ++S VE LKSILKV+L+S + + +I+ + L QL + I L
Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPT-CGIECITPLL 180

Query: 188 -DILLNFVLLISCSLLVVVAIDIPFQIWQHAEQLKMTKQEVKDEYKDTEGKPEVKGRIRM 246
IL +++ + +V+ D F+ +Q+ ++LKM+K E+K EYK+ EG PE+K + R
Sbjct: 181 GQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQ 240

Query: 247 LQREAAQRRMMAALPQADVIITNPEHFSVALRYKQNTDKAPVVIAKGVDHMALKIREIAR 306
+E R M + ++ V++ NP H ++ + YK+ P+V K D +R+IA
Sbjct: 241 FHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAE 300

Query: 307 EYDIAIVPAPPLARALYHTTELEQQIPDGLFVAVAQVLAFVFQLKQYRRK 356
E + I+ PLARALY ++ IP A A+VL ++ + ++
Sbjct: 301 EEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQH 350


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2121  TYPE3IMRPROT  121  2e-35  Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 121 bits (305), Expect = 2e-35
Identities = 84/215 (39%), Positives = 127/215 (59%), Gaps = 2/215 (0%)

Query: 15 YFWPYTRIAAMLMVMTVTGARFVPARVRLYLGLALTFAVMPAIPAVPSDIALLSLQGFMI 74
YFWP R+ A++ + R VP RV+L L + +TFA+ P++PA + + S +
Sbjct: 16 YFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPAND--VPVFSFFALWL 73

Query: 75 TFEQIVIGVAMGMVTQFLVQIFVMLGQILGMQSSLGFASMVDPANGQNTPLLGQLFMLLA 134
+QI+IG+A+G QF G+I+G+Q L FA+ VDPA+ N P+L ++ +LA
Sbjct: 74 AVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLA 133

Query: 135 TLFFLVSDGHLKMIQLVVFSFKSLPIGSGSLTTVDFRELALWLGIMFKASLAVSLSGIIA 194
L FL +GHL +I L+V +F +LPIG L + F L ++F L ++L I
Sbjct: 134 LLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALPLITL 193

Query: 195 LLTVNLSFGVMTRAAPQLNIFSLGFSFALLVGLLL 229
LLT+NL+ G++ R APQL+IF +GF L VG+ L
Sbjct: 194 LLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISL 228


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2122  TYPE3IMQPROT  55  8e-14  Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 54.8 bits (132), Expect = 8e-14
Identities = 24/75 (32%), Positives = 41/75 (54%)

Query: 7 VELFKESLWLVLIMVCAIIIPSLLVGLVVAIFQAATSINEQTLSFLPRLIITLLALMFFG 66
V ++L+LVLI+ I + ++GL+V +FQ T + EQTL F +L+ L L
Sbjct: 5 VFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLS 64

Query: 67 HWMTQILMDFFYSMI 81
W ++L+ + +I
Sbjct: 65 GWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2123  FLGBIOSNFLIP  280  3e-97  Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 280 bits (718), Expect = 3e-97
Identities = 116/232 (50%), Positives = 165/232 (71%), Gaps = 1/232 (0%)

Query: 67 IALGSSSGGGGIPAFTMTTNSDGSEDYSINLQILALMTMLGFLPAMVILMTSFTRIVVVM 126
+ L + +P T G + +S+ +Q L +T L F+PA++++MTSFTRI++V
Sbjct: 12 LWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMMTSFTRIIIVF 71

Query: 127 SILRQAMGLQQTPSNQVIIGIALFLTFFIMAPVFNQINEQAVQPYLNEQITARQAFDLAQ 186
+LR A+G P NQV++G+ALFLTFFIM+PV ++I A QP+ E+I+ ++A +
Sbjct: 72 GLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKISMQEALEKGA 131

Query: 187 EPLKAFMLKQTRVNDLETFVEMSGA-QVTAPEQVSMAVLIPAFITSELKTAFQIGFMLFL 245
+PL+ FML+QTR DL F ++ + PE V M +L+PA++TSELKTAFQIGF +F+
Sbjct: 132 QPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKTAFQIGFTIFI 191

Query: 246 PFLIIDLVVASVLMAMGMMMLSPMIVSLPFKLMLFVLVDGWNLILSTLAGSF 297
PFLIIDLV+ASVLMA+GMMM+ P ++LPFKLMLFVLVDGW L++ +LA SF
Sbjct: 192 PFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQSF 243


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2125  FLGMOTORFLIN  108  1e-33  Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 108 bits (272), Expect = 1e-33
Identities = 47/84 (55%), Positives = 72/84 (85%)

Query: 54 RKLDTIMDIPVTISMEVGRSKISIRNLLQLNQGSVVELDRIAGESLDVMVNGTLIAHGEV 113
+ +D IMDIPV +++E+GR++++I+ LL+L QGSVV LD +AGE LD+++NG LIA GEV
Sbjct: 52 QDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEV 111

Query: 114 VVVNDKFGIRLTDVISQTERIKKL 137
VVV DK+G+R+TD+I+ +ER+++L
Sbjct: 112 VVVADKYGVRITDIITPSERMRRL 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2126  FLGMOTORFLIM  250  2e-83  Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 250 bits (641), Expect = 2e-83
Identities = 89/326 (27%), Positives = 168/326 (51%), Gaps = 9/326 (2%)

Query: 1 MTDLLSQDEIDALLHGVDSVDEEEEELPKSGP--SATNFDFSSQDRIVRGRMPTLELINE 58
MT++LSQDEID LL + S D E+ T +DF D+ + +M TL L++E
Sbjct: 1 MTEVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHE 60

Query: 59 RFARHLRISLFNMLRKTAEVSINGVQMMKFGEYQNTLYVPTSLNMVRFRPLKGTALVTME 118
FAR SL LR V + V + + E+ ++ P++L ++ PLKG A++ ++
Sbjct: 61 TFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVD 120

Query: 119 ARLVFILVENFFGGDGRYHARIEGREFTPTERRVIQLLLKIVFADYKEAWSPVMGVEFEY 178
+ F +++ FGG G+ R+ T E V++ ++ + A+ +E+W+ V+ +
Sbjct: 121 PSITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRL 178

Query: 179 LDSEVNPSMANIVSPTEVIVVSSFHVEVDGGGGDFHMVMPYSMVEPIRELLDAG--VQSD 236
E NP A IV P+E++V+ + +V G + +PY +EPI L + S
Sbjct: 179 GQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSV 238

Query: 237 KMETDVRWSTALREEIMDVPVNFRVNLLEMDIALRDLMELQVGDVI---PIKMPEHAVMF 293
+ + ++ LR+++ V ++ + + +++RD++ L+VGD+I + + V+
Sbjct: 239 RRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLS 298

Query: 294 VEELPTYRVKMGRSGEKLAVQVSEKI 319
+ + + G G+K+A Q+ E+I
Sbjct: 299 IGNRKKFLCQPGVVGKKIAAQILERI 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2128  FLGHOOKFLIK  56  2e-10  Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 56.0 bits (134), Expect = 2e-10
Identities = 79/351 (22%), Positives = 139/351 (39%), Gaps = 16/351 (4%)

Query: 323 AEVSEALAASSQALKATPLTQSALNPASIMADEGLNQSTSATETGKVVIPWGTQMPTDNE 382
A +SEALA + KA P A + + + ++ S + ++IP P N+
Sbjct: 31 ALLSEALAGETTTDKAAPQLLVATDKPTTKGEPLISDIVSDAQQANLLIPVDETPPVIND 90

Query: 383 LAALTPELKAMLEEGGKAKAPVPNALAQSVAQGLTPAHLAAQQTGTTALPMNPTAATPID 442
+ + L A A N A L A+ LP D
Sbjct: 91 EQSTSTPLTTAQTMALAAVAD-KNTTKDEKADDLNEDVTASLSALFAMLPGFDNTPKVTD 149

Query: 443 IAALAPQTVAVNPMLNPAATVNPELAASSAMLAALGGRALAGSDERRAVSESGQEDLAQQ 502
AP TV L S + A A + + + A+
Sbjct: 150 ----APSTV-----LPTEKPTLFTKLTSEQLTTAQPDDAPGTPAQPLTPLVAEAQSKAEV 200

Query: 503 IAAAAGQGTAQNQALNRAESQLVQTNATPV---PLNKEMAADQLAERVQMMMSKNLKNID 559
I+ + A + + ++Q + T A PV PL L++ + + + ++ +
Sbjct: 201 ISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEWQQSLSQHISLFTRQGQQSAE 260

Query: 560 IRLDPPELGRMHIRMNMQGDGATVHFTVANQHAREALEQTMPRLREMLAQQGVQLGDTSV 619
+RL P +LG + I + + + A + +QH R ALE +P LR LA+ G+QLG +++
Sbjct: 261 LRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPVLRTQLAESGIQLGQSNI 320

Query: 620 QQQS-AGQQQRYTGQEQSGFGQSARNERLNSEENLDTDIKLDLNVATKRDG 669
+S +GQQQ + Q+QS ++A +E L E++ + + L +
Sbjct: 321 SGESFSGQQQAASQQQQS--QRTANHEPLAGEDDDTLPVPVSLQGRVTGNS 369


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2129  FLGFLIJ  40  1e-06  Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 39.8 bits (92), Expect = 1e-06
Identities = 31/133 (23%), Positives = 68/133 (51%)

Query: 21 ALDFLLEQAQESEDKAVLALSKARSELDSYYHQLRQIEQYRLEYCQQLVERGKSGLTASQ 80
AL L + A++ + A L + R QL+ + Y+ EY L +G+T+++
Sbjct: 6 ALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAGITSNR 65

Query: 81 YGHLNRFLTQLDETLAKQKSAEQHFRLQVENCEQHWMNMRQKRKSYQWLMEKKQTERQLL 140
+ + +F+ L++ + + + + +V+ W +Q+ +++Q L E++ T L
Sbjct: 66 WINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQSTAALLA 125

Query: 141 QEKREQKQMDEFS 153
+ + +QK+MDEF+
Sbjct: 126 ENRLDQKKMDEFA 138


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2131  FLGFLIH  67  2e-15  Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 66.7 bits (162), Expect = 2e-15
Identities = 48/190 (25%), Positives = 95/190 (50%), Gaps = 13/190 (6%)

Query: 69 EEEIALIRTAAQQEGFEAGQAEGYQQGFEQGKAEGFQAGHQEGQTQGYQDGVAEGQALIQ 128
E+++A ++ A ++G++AG AEG QQG +Q G+QEG QG + G+AE ++
Sbjct: 41 EQQLAQLQMQAHEQGYQAGIAEGRQQGHKQ--------GYQEGLAQGLEQGLAEAKSQQA 92

Query: 129 EQVKTFMALANQFAQPLDLLNAQVEKQLVDMVLALTKEVVHVEVQTNPQVILDTVKASVE 188
L ++F LD L++ + +L+ M L ++V+ + ++ ++ ++
Sbjct: 93 PIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQ 152

Query: 189 ALPIAGHAITLKLNPEDVEIIRQAYGEQEIETRNWTLLSEPALSRGDVQIEAGE----SS 244
P+ L+++P+D++ + G + W L +P L G ++ A E +S
Sbjct: 153 QEPLFSGKPQLRVHPDDLQRVDDMLGAT-LSLHGWRLRGDPTLHPGGCKVSADEGDLDAS 211

Query: 245 VSYRMEERIR 254
V+ R +E R
Sbjct: 212 VATRWQELCR 221


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2132  FLGMOTORFLIG  283  3e-96  Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 283 bits (726), Expect = 3e-96
Identities = 107/330 (32%), Positives = 195/330 (59%)

Query: 17 DISEIPGEEKAAILLLSLNEEDAAGIIRHLEPKQVQRVGSAMARAKDLSQTKVSAVHRAF 76
D+S + G++KAAILL+S+ E ++ + ++L ++++ + +A+ + ++ V F
Sbjct: 11 DVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEF 70

Query: 77 LEDIQKYTNIGMGSEDFLRNALVAALGADKANNLVDQILLGTGSKGLDSLKWMDPRQVAS 136
E + I G D+ R L +LG KA ++++ + S+ + ++ DP + +
Sbjct: 71 KELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILN 130

Query: 137 IIINEHPQIQTIVLSYLEPDQSAEILAQFAQRDALDLLMRIANLEEVQPSALAELNEIME 196
I EHPQ ++LSYL+P +++ IL+ ++ RIA ++ P + E+ ++E
Sbjct: 131 FIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLE 190

Query: 197 KQFAGQAGAQAAKIGGLKAAADIMNYLDNNIESVLMEGMREKDEDLATQIQDLMFVFENL 256
K+ A + GG+ +I+N D E ++E + E+D +LA +I+ MFVFE++
Sbjct: 191 KKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDI 250

Query: 257 VEVDDQGIQKLLRDVPQDVLQKALKGADDTLREKIFKNMSKRAAEMMKDDLEAMPPIKVS 316
V +DD+ IQ++LR++ L KALK D ++EKIFKNMSKRAA M+K+D+E + P +
Sbjct: 251 VLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRK 310

Query: 317 DVEAAQKEILSIARRMADNGEIMLGGGADE 346
DVE +Q++I+S+ R++ + GEI++ G +E
Sbjct: 311 DVEESQQKIVSLIRKLEEQGEIVISRGGEE 340


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2133  FLGMRINGFLIF  290  3e-93  Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 290 bits (744), Expect = 3e-93
Identities = 159/556 (28%), Positives = 257/556 (46%), Gaps = 40/556 (7%)

Query: 48 GDLDLLRQVVLVLSISICVALIVMLFFWVREPDMRPL-GAYETEELIPVLDYLDQQKQQY 106
L ++ L+++ S VA++V + W + PD R L ++ ++ L Q Y
Sbjct: 17 NRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPY 76

Query: 107 KL--DGNTILVPVSDYNSLKLSMVRAGLNQNRQAGDEILMQDMGFGVSQRLEQERLKLSR 164
+ I VP + L+L + + GL + G E+L FG+SQ EQ + +
Sbjct: 77 RFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELL-DQEKFGISQFSEQVNYQRAL 135

Query: 165 ERQLAKAIEEMRQVNKARVLLALPKQSVFVRHNQEASASVFLTVRTGANLKQEEIDAVVD 224
E +LA+ IE + V ARV LA+PK S+FVR + SASV +T+ G L + +I AVV
Sbjct: 136 EGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVH 195

Query: 225 MVASAVPGMKPSRVTVTDQHGRLLSSGSQDPVSAARRKEQELEKQQEEALRGKIDSVLIP 284
+V+SAV G+ P VT+ DQ G LL+ + + + E ++ +I+++L P
Sbjct: 196 LVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDA-QLKFANDVESRIQRRIEAILSP 254

Query: 285 ILGLGNYTAQVDIELDFSAVEQTRKVFDPNTPATRSEYTLEDYNNGNVVA-----GVPGA 339
I+G GN AQV +LDF+ EQT + + PN A+++ N V GVPGA
Sbjct: 255 IVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGA 314

Query: 340 LSNQPPADASIP-----------QDVAQ---MKDGSVMGQGSVHKEATRNYELDTTISHE 385
LSNQP P Q+ Q + + G S + T NYE+D TI H
Sbjct: 315 LSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRHT 374

Query: 386 RKQSGVINRQTVAVAVKSRSSVNPDTGEVTYTPLSEADLNSIRQVLIGTVGYSENRGDLL 445
+ G I R +VAV V D PL+ + I + +G+S+ RGD L
Sbjct: 375 KMNVGDIERLSVAVVV--NYKTLADG---KPLPLTADQMKQIEDLTREAMGFSDKRGDTL 429

Query: 446 NVLSMPFAEPEMEQIVDVPIWEHPNFNDWVRWFASALVIIIVILVLVRPAMKKLINPAAD 505
NV++ PF+ + ++P W+ +F D + L++++V +L R K + P
Sbjct: 430 NVVNSPFSAVD-NTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWR----KAVRPQLT 484

Query: 506 NDDQMYGPDGMPIGA--DGETSLIGSDIDGGELFEFGSGIDLPNLHKDEDVLKAVRALVA 563
+ + E ++ +L + + L E + + +R +
Sbjct: 485 RRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGA----EVMSQRIREMSD 540

Query: 564 NEPELAAQVVKNWMQN 579
N+P + A V++ WM N
Sbjct: 541 NDPRVVALVIRQWMSN 556


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2134  FLGHOOKFLIE  62  3e-16  Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 62.4 bits (151), Expect = 3e-16
Identities = 32/101 (31%), Positives = 55/101 (54%), Gaps = 3/101 (2%)

Query: 9 LDGLNNEMQAMMFEAMNTQPASTGQKVGADFGAMLTKAINNVNGLQKTSSDMQTRFDRGD 68
++G+ +++QA A + F L A++ ++ Q + +F G+
Sbjct: 6 IEGVISQLQATAMSARAQESLPQPT---ISFAGQLHAALDRISDTQTAARTQAEKFTLGE 62

Query: 69 EGISLSDVMIARNKSSVAFEATIQVRNKLVEAYKELMNMPV 109
G++L+DVM K+SV+ + IQVRNKLV AY+E+M+M V
Sbjct: 63 PGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
VC_2135  HTHFIS  498  e-176  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 498 bits (1285), Expect = e-176
Identities = 169/486 (34%), Positives = 257/486 (52%), Gaps = 18/486 (3%)

Query: 3 MAQSKVLIVEDDEGLREALIDTLALAGYEWLEADCAEDALLKLKSHSVDIVVSDVQMAGM 62
M + +L+ +DD +R L L+ AGY+ A + + D+VV+DV M
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 63 GGLALLRSIKQHWPNLPVLLMTAYANIQDAVSAMKDGAIDYMAKPFAPEVLLNMVSR--- 119
LL IK+ P+LPVL+M+A A+ A + GA DY+ KPF L+ ++ R
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 -------YAPVKSDDNGDAVVADTKSLKLLALADKVAKTDANVMILGPSGSGKEVMSRYI 172
S D V ++ + ++ +TD +MI G SG+GKE+++R +
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 173 HNASPRKEGPFIAINCAAIPDNMLEATLFGYEKGAFTGAVQACPGKFEQAQGGTILLDEI 232
H+ R+ GPF+AIN AAIP +++E+ LFG+EKGAFTGA G+FEQA+GGT+ LDEI
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 233 SEMDLNLQAKLLRVLQEREVERLGSRKSIKLDVRVLATSNRDLKQYVQAGHFREDLYYRL 292
+M ++ Q +LLRVLQ+ E +G R I+ DVR++A +N+DLKQ + G FREDLYYRL
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 293 NVFPLTWPALCERKDDIEPLANHLIERHCKKLGLPVPSIAPNAITKLLNYPWPGNVRELD 352
NV PL P L +R +DI L H +++ K GL V A+ + +PWPGNVREL+
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELE 359

Query: 353 NVVQRALILSENGHIQSEHILLEGVDWHDASSLQQAVAGESMAAPQIKPVAEPEVFKPLV 412
N+V+R L I E I E + + + + + E
Sbjct: 360 NLVRRLTALYPQDVITREIIENE----LRSEIPDSPIEKAAARSGSLSISQAVEENMRQY 415

Query: 413 QGGGQNSSSQSSGLGGELRDQEFAIILDTLAECQGRRKEMAEKLGISPRTLRYKLAKMRD 472
++ S L + E+ +IL L +G + + A+ LG++ TLR K+ ++
Sbjct: 416 FASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL-- 473

Query: 473 AGIDIP 478
G+ +
Sbjct: 474 -GVSVY 478


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_2136   PF06580  32     0.003    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.1 bits (73), Expect = 0.003
Identities = 22/99 (22%), Positives = 43/99 (43%), Gaps = 20/99 (20%)

Query: 246 LVMNALQ--IAGK--GSQIDVFFRPVNGELKISVQDNGPGVPESLQHKIMEPFFTTRSQG 301
LV N ++ IA G +I + NG + + V++ G ++ + +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK------------ES 310

Query: 302 TGLGLA-VVQMVCRAHGG--RLELISKEGEGACFTMCIP 337
TG GL V + + +G +++L K+G+ + IP
Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQGKV-NAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_2137   HTHFIS  507    e-180    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 507 bits (1306), Expect = e-180
Identities = 189/494 (38%), Positives = 273/494 (55%), Gaps = 25/494 (5%)

Query: 1 MQSLAKLLVIEDDAAIRLNLSVILEFVGEQCEVIESTQIDQINWSAVWGGCILGSLR-GQ 59
M A +LV +DDAAIR L+ L G + + +A G ++ +
Sbjct: 1 MTG-ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 60 ALSEQLIQSLTKAN-HIPLLVANKQPYSLEEFPNYV--GELDF---PLNYPQLSDALRHC 113
+ L+ + KA +P+LV + Q + G D+ P + +L +
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQN-TFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 114 KEFLGRKGFQVLATARKNTLFRSLVGQSMGIQEVRHLIEQVSTTEANVLILGESGTGKEV 173
R+ ++ ++ LVG+S +QE+ ++ ++ T+ ++I GESGTGKE+
Sbjct: 119 LAEPKRRPSKLEDDSQD---GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKEL 175

Query: 174 VARNIHYHSGRRNGPFVPINCGAIPAELLESELFGHEKGAFTGAITARKGRFELAEGGTL 233
VAR +H + RRNGPFV IN AIP +L+ESELFGHEKGAFTGA T GRFE AEGGTL
Sbjct: 176 VARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTL 235

Query: 234 FLDEIGDMPMSMQVKLLRVLQERCFERVGGNSTIKANVRVIAATHRNLEEMIDGQKFRED 293
FLDEIGDMPM Q +LLRVLQ+ + VGG + I+++VR++AAT+++L++ I+ FRED
Sbjct: 236 FLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFRED 295

Query: 294 LYYRLNVFPIEMPALRDRIDDIPLLLQELMTRMEAEGAQPICFTPRAINSMMEHDWPGNV 353
LYYRLNV P+ +P LRDR +DIP L++ + + E EG F A+ M H WPGNV
Sbjct: 296 LYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNV 355

Query: 354 RELANLVERMVILYPNSLVDVNHLPTKYRYSDIPEFQPEPSRFSSVEEQERDVLEGIFAE 413
REL NLV R+ LYP ++ + + R S+IP+ E + S +E +
Sbjct: 356 RELENLVRRLTALYPQDVITREIIENELR-SEIPDSPIEKAAARSGSLSISQAVEENMRQ 414

Query: 414 DFNFEEPQEFVPDIDAPQALPPEGVNLKELLADLEVNLINQALEAQGGVVARAADMLGMR 473
F ALPP G+ +LA++E LI AL A G +AAD+LG+
Sbjct: 415 YFA-----------SFGDALPPSGL-YDRVLAEMEYPLILAALTATRGNQIKAADLLGLN 462

Query: 474 RTTLVEKMRKYNMQ 487
R TL +K+R+ +
Sbjct: 463 RNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VC_2140   GPOSANCHOR  37     2e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 37.0 bits (85), Expect = 2e-04
Identities = 46/214 (21%), Positives = 81/214 (37%), Gaps = 20/214 (9%)

Query: 199 KQLEYKTLEQRVRDLEKARAQAQQLIAPLTPEQQKVAAKVAEKIGDAARLVDQEVAQEIR 258
+ E LE R +LEKA A + + + + A+ A + A L Q
Sbjct: 251 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNAN 310

Query: 259 SAAQSAQGAAGEALNAGELTESAVKAAANAASEAKKYIRPEDRIPGWTETASGTLLDSYW 318
+ A E N SEA + D LD+
Sbjct: 311 RQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRD-------------LDASR 357

Query: 319 EPEEELDAQGQKKAADVPGWSNTASGTLLDSYVTPQEAQQKLEQKLAQEKAQIEA----- 373
E +++L+A+ QK S + +L +EA++++E+ L + +++ A
Sbjct: 358 EAKKQLEAEHQKLEEQN-KISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLN 416

Query: 374 -AIRSGKMTPEEAKAQARAKLSPEERAYIEQVEK 406
+ K E+ KA+ +AKL E +A E++ K
Sbjct: 417 KELEESKKLTEKEKAELQAKLEAEAKALKEKLAK 450


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VC_2142   FLAGELLIN  185    4e-56    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 185 bits (471), Expect = 4e-56
Identities = 82/302 (27%), Positives = 138/302 (45%), Gaps = 5/302 (1%)

Query: 2 AINVNTNVSAMTAQRYLNGAADGMQKSMERLSSGYKINSARDDAAGLQISNRLTSQSRGL 61
A +NTN ++ Q LN + + ++ERLSSG +INSA+DDAAG I+NR TS +GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DMAVKNANDGISIAQTAEGAMNETTNILQRMRDLALQSSNGSNSSSERRAIQEEVSALND 121
A +NANDGISIAQT EGA+NE N LQR+R+L++Q++NG+NS S+ ++IQ+E+ +
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELNRIAETTSFGGNKLLNGSFGSKSFQIGADSGEAVMLSMGSMRSDTQAMGGKSYRAQEG 181
E++R++ T F G K+L+ Q+GA+ GE + + + + + + G + +
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDN-QMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 KAADWRVGA-ATDLTLSYTNKQGEAREVTINAKQGDDLEELATYINGQTEDVKASVGEDG 240
+ V +N+ T + +
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 241 KLQLFASSQKVNGDVTIGGGLGGEIGFDAGRNVTVA---DVNVSTVAGSQEAVSILDGAL 297
+ + + G + A + D T + + +G +
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKV 299

Query: 298 KA 299

Sbjct: 300 ST 301



Score = 124 bits (313), Expect = 8e-34
Identities = 71/218 (32%), Positives = 101/218 (46%), Gaps = 20/218 (9%)

Query: 178 AQEGKAADWRVGA-ATDLTLSYTNKQGEAREVTINAKQGDDLEEL-ATYINGQTEDVKAS 235
+ G + +V ++ T A ++A + + + +NGQ +
Sbjct: 289 TKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKT 348

Query: 236 VGEDGKLQLFASSQKVNGDVTIGGGLGGEIGFDAGRNVTVA------------------D 277
E KL ++ V G+ I AG VT+A +
Sbjct: 349 KNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINE 408

Query: 278 VNVSTVAGSQEAVSILDGALKAVDSQRASLGAFQNRFGHAISNLDNVNENVNASRSRIRD 337
+ + ++ +D AL VD+ R+SLGA QNRF AI+NL N N+N++RSRI D
Sbjct: 409 DAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIED 468

Query: 338 TDYARETTAMTKAQILQQASTSVLAQAKQSPSAALSLL 375
DYA E + M+KAQILQQA TSVLAQA Q P LSLL
Sbjct: 469 ADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VC_2143   FLAGELLIN  191    3e-58    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 191 bits (486), Expect = 3e-58
Identities = 89/297 (29%), Positives = 141/297 (47%), Gaps = 2/297 (0%)

Query: 2 AVNVNTNVAAMTAQRYLTGATNAQQTSMERLSSGFKINSAKDDAAGLQISNRLNVQSRGL 61
A +NTN ++ Q L + ++ +++ERLSSG +INSAKDDAAG I+NR +GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 DVAVRNANDGISIAQTAEGAMNETTNILQRMRDLSLQSANGSNSKSERVAIQEEITALND 121
A RNANDGISIAQT EGA+NE N LQR+R+LS+Q+ NG+NS S+ +IQ+EI +
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELNRIAETTSFGGNKLLNGTFSTKSFQIGADNGEAVMLTLKDMRSDNRMMGGTSYVAAEG 181
E++R++ T F G K+L+ Q+GA++GE + + L+ + + + G + +
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQ-MKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 KDKDWKVQAGANDITFTLKDIDGNDQTITVNAKEGDDIEEVATYINGQTDMVKASVNEKG 241
+ N + + N + VN+ T +
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 242 QLQIFAGNNKVTGDVAFSGGL-AGALNMQAGTAETVDTIDVTSVGGAQQSVAVIDSA 297
+ + + +G A A+ + DT D V + D
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGN 296



Score = 125 bits (316), Expect = 3e-34
Identities = 81/326 (24%), Positives = 125/326 (38%), Gaps = 21/326 (6%)

Query: 70 DGISIAQTAEGAMNETTNILQRMRDLSLQSANGSNSKSERVAIQEEITALNDELNRIAET 129
+ + + + R A +++ + V + + A N +L
Sbjct: 183 GDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAE 242

Query: 130 TSFGGNKLLNGTFSTKSFQIGADNGEAVMLTLKDMRSDNRMMGGTSYVAAEGKDKDWKVQ 189
+ + + + + A G D + G D + KV
Sbjct: 243 NNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKG--VTFTIDTKTGNDGNGKVS 300

Query: 190 AGANDITFTLKDIDGNDQTITVNAKEGDDIEE-VATYINGQTDMVKASVNEKGQLQIFAG 248
N TL D V+A + + +NGQ + NE +L
Sbjct: 301 TTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEA 360

Query: 249 NNKVTGDVAFSGGLAGALNMQAGTAETVD------------------TIDVTSVGGAQQS 290
NN V G+ + A AG T+ +
Sbjct: 361 NNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANP 420

Query: 291 VAVIDSALKYVDSHRAELGAFQNRFNHAISNLDNINENVNASKSRIKDTDFAKETTALTK 350
+A IDSAL VD+ R+ LGA QNRF+ AI+NL N N+N+++SRI+D D+A E + ++K
Sbjct: 421 LASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSK 480

Query: 351 SQILSQASSSVLAQAKQAPNAALSLL 376
+QIL QA +SVLAQA Q P LSLL
Sbjct: 481 AQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VC_2144   FLAGELLIN  206    5e-64    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 206 bits (525), Expect = 5e-64
Identities = 89/297 (29%), Positives = 142/297 (47%), Gaps = 2/297 (0%)

Query: 2 AMTVNTNVSALVAQRHLNSASEMLNQSLERLSSGNRINSAKDDAAGLQISNRLETQMRGL 61
A +NTN +L+ Q +LN + L+ ++ERLSSG RINSAKDDAAG I+NR + ++GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 GIAVRNANDGISIMQTAEGAMQETTQLLQRMRDLSLQSANGSNSAAERVALQEEMAALND 121
A RNANDGISI QT EGA+ E LQR+R+LS+Q+ NG+NS ++ ++Q+E+ +
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELNRIAETTSFAGRKLLNGQFMKASFQIGASSGEAVQLSLRNMRSDSLEMGGFSYVAAAL 181
E++R++ T F G K+L+ + Q+GA+ GE + + L+ + SL + GF+
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDN-QMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 ADKQWQVTKGKQQLNISYVNAQGENENIQIQAKEGDDIEELATYINGKTDKVSASVNEKG 241
A + K + + + T + +
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 242 QLQLYIAGKETSGTLSFSGSL-ANELQMNLLGYEAVDNLDISSAGGAQRAVSVIDTA 297
+ A T S +G+ A + + G + D D + D
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGN 296



Score = 129 bits (325), Expect = 2e-35
Identities = 63/219 (28%), Positives = 91/219 (41%), Gaps = 19/219 (8%)

Query: 178 AAALADKQWQVTKGKQQLNISYVNAQGENENIQIQAKEGDDIEEL-ATYINGKTDKVSAS 236
D +V+ ++ A + A + + + +NG+ +
Sbjct: 289 TKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKT 348

Query: 237 VNEKGQLQLYIAGKETSGTLSFSGSLANELQMNLLGYEAVD------------------N 278
NE +L A G + + A +
Sbjct: 349 KNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINE 408

Query: 279 LDISSAGGAQRAVSVIDTALKYVDGHRSELGAMQNRFQHAISNLDNVHENLAASNSRIKD 338
++ ++ ID+AL VD RS LGA+QNRF AI+NL N NL ++ SRI+D
Sbjct: 409 DAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIED 468

Query: 339 ADYAKETTQMIKQQILQQVSTSVLAQAKRQPKFVLFLLR 377
ADYA E + M K QILQQ TSVLAQA + P+ VL LLR
Sbjct: 469 ADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
VC_2147   IGASERPTASE  30     0.002    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.6 bits (66), Expect = 0.002
Identities = 17/82 (20%), Positives = 34/82 (41%), Gaps = 8/82 (9%)

Query: 4 TTLIPSEQTQQEALKIAKATQRPGQTKEQTKLITQGIEKGIALYKKQQKEKHRQADKLRK 63
TT +E ++QE+ + K Q +T Q + +A K + + Q +++ +
Sbjct: 1036 TTETVAENSKQESKTVEKNEQDATETTAQNRE--------VAKEAKSNVKANTQTNEVAQ 1087

Query: 64 KALKAKQSSTEEIHEADDYAAE 85
+ K++ T E E E
Sbjct: 1088 SGSETKETQTTETKETATVEKE 1109


45  VC_2187  VC_2202  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
VC_2187   -1      14       -0.295700  flagellin FlaC
VC_2188   0       14       0.063426   flagellin core protein A
VC_2189   0       14       0.082398   hypothetical protein
VC_2190   0       15       0.344041   flagellar hook-associated protein FlgL
VC_2191   1       13       0.936945   flagellar hook-associated protein FlgM
VC_2192   1       13       0.946830   flagellar protein FlgJ
VC_2193   1       15       0.886415   flagellar P-ring protein FlgI
VC_2194   0       15       0.338564   flagellar L-ring protein FlgH
VC_2195   0       16       0.284217   flagellar basal-body rod protein FlgG
VC_2196   0       15       -0.191296  flagellar basal-body rod protein FlgF
VC_2197   1       15       -1.591077  flagellar hook protein FlgE
VC_2198   1       16       -2.017866  basal-body rod modification protein FlgD
VC_2199   2       16       -2.136398  flagellar basal-body rod protein FlgC
VC_2200   2       15       -2.297544  flagellar basal-body rod protein FlgB
VC_2201   1       16       -2.144579  chemotaxis protein methyltransferase CheR
VC_2202   1       13       -1.853360  chemotaxis protein CheV
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VC_2187   FLAGELLIN  191    4e-58    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 191 bits (485), Expect = 4e-58
Identities = 87/297 (29%), Positives = 137/297 (46%), Gaps = 2/297 (0%)

Query: 2 AVNVNTNVSAMTAQRYLTSATNAQQSSMERLSSGYKINSAKDDAAGLQISNRLNVQSRGL 61
A +NTN ++ Q L + ++ S++ERLSSG +INSAKDDAAG I+NR +GL
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 GVAVRNANDGISMAQTAEGAMKETTNILQRMRDLSLQSANGSNSKADRVAIQEEITALND 121
A RNANDGIS+AQT EGA+ E N LQR+R+LS+Q+ NG+NS +D +IQ+EI +
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 ELNRVAETTSFGGNKLLNGTFATKSFQIGADNGEAVMLNIKDMRSDNALMGGKTYQAANG 181
E++RV+ T F G K+L+ Q+GA++GE + ++++ + + + G
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDN-QMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 182 KDKNWGVEAGKTDLTITLKDKREGDVTISINAKEGDDIEELATYINGQTDMIKASVDEEG 241
+ K + +N+ T +
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 242 KLQLFTDNNRIDGAATFGGALAGELGIGAAQDVTV-DTLDVTTVGGAQESVAIVDAA 297
+ T + + G + GA + DT D V ++ D
Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGN 296



Score = 127 bits (320), Expect = 1e-34
Identities = 76/339 (22%), Positives = 123/339 (36%), Gaps = 25/339 (7%)

Query: 57 QSRGLGVAVRNANDGISMAQTAEGAMKETTNILQRMRDLSLQSANGSNSKADRVAIQEEI 116
+ V + + + ++ + + + D+V +
Sbjct: 174 VNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKV----YV 229

Query: 117 TALNDELNRVAETTSFGGNKLLNGTFATKSFQIGADNGEAVMLNIKDMRSDNALMGGKTY 176
A N +L + + + + A G D T
Sbjct: 230 NAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKG--VTFTI 287

Query: 177 QAANGKDKNWGVEAGKTDLTITLKDKREGDVTISINAKEGDDIEEL-ATYINGQTDMIKA 235
G D N V +TL +++A + + + +NGQ
Sbjct: 288 DTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDK 347

Query: 236 SVDEEGKLQLFTDNNRIDGAATFGGALAGELGIGAAQDVTVD------------------ 277
+ +E KL NN + G + A A VT+
Sbjct: 348 TKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLIN 407

Query: 278 TLDVTTVGGAQESVAIVDAALKYVDSHRAELGAFQNRFNHAINNLDNINENVNASKSRIK 337
+A +D+AL VD+ R+ LGA QNRF+ AI NL N N+N+++SRI+
Sbjct: 408 EDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIE 467

Query: 338 DTDFAKETTALTKAQILSQASSSVLAQAKQAPNSALALL 376
D D+A E + ++KAQIL QA +SVLAQA Q P + L+LL
Sbjct: 468 DADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VC_2188   FLAGELLIN  194    3e-59    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 194 bits (493), Expect = 3e-59
Identities = 85/297 (28%), Positives = 135/297 (45%), Gaps = 2/297 (0%)

Query: 3 INVNTNVSAMTAQRYLTKATGELNTSMERLSSGNRINSAKDDAAGLQISNRLTAQSRGLD 62
+NTN ++ Q L K+ L++++ERLSSG RINSAKDDAAG I+NR T+ +GL
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 63 VAMRNANDGISIAQTAEGAMNESTSILQRMRDLALQSANGTNSASERQALNEESVALQDE 122
A RNANDGISIAQT EGA+NE + LQR+R+L++Q+ NGTNS S+ +++ +E +E
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121

Query: 123 LNRIAETTSFGGRKLLNGSFGEASFQIGSSSGEAIIMGLTSVRADDFRMGGQSFIAEQPK 182
++R++ T F G K+L+ Q+G++ GE I + L + + G + +
Sbjct: 122 IDRVSNQTQFNGVKVLSQDNQ-MKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEA 180

Query: 183 TKEWGVPPTARDLKFEFTKKDGEAVVLDIIAKDGDDIEELATYINGQTDLFKASVDQEGK 242
T ++ +D+ + T +
Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDD 240

Query: 243 LQIFVAEPNIEGNFNISGGLATELGLNGGPGVKTTVQDIDITSVGGSQNAVGIIDAA 299
+ A + + G A + G D V + + D
Sbjct: 241 AENNTAVDLFKTT-KSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGN 296



Score = 140 bits (353), Expect = 3e-39
Identities = 65/219 (29%), Positives = 103/219 (47%), Gaps = 17/219 (7%)

Query: 178 AEQPKTKEWGVPPTARDLKFEFTKKDGEAVVLDIIAKDGDDIEEL-ATYINGQTDLFKAS 236
+ V T K T D A ++ A + + + +NGQ +
Sbjct: 289 TKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKT 348

Query: 237 VDQEGKLQIFVAEPNIEGNFNISGGLATELGLNGG----------------PGVKTTVQD 280
++ KL A ++G I+ A G GV T + +
Sbjct: 349 KNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINE 408

Query: 281 IDITSVGGSQNAVGIIDAALKYVDSQRADLGAKQNRLSHSISNLSNIQENVEASKSRIKD 340
+ + N + ID+AL VD+ R+ LGA QNR +I+NL N N+ +++SRI+D
Sbjct: 409 DAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIED 468

Query: 341 TDFAKETTQLTKSQILQQAGTSILAQAKQLPNSAISLLQ 379
D+A E + ++K+QILQQAGTS+LAQA Q+P + +SLL+
Sbjct: 469 ADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
VC_2190   FLAGELLIN  29     0.031    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 29.2 bits (65), Expect = 0.031
Identities = 24/133 (18%), Positives = 48/133 (36%), Gaps = 3/133 (2%)

Query: 243 GSVVQAGEFDAKTGIQFEELNIQVKGQISKGDAIELTPRTTFSVFDTFRDAIRYAEGSVS 302
+ +A G +N + GD + L +T F + E + +
Sbjct: 353 AKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAA 412

Query: 303 DASNTAKLHQLVEEFHTAFIHLNKTRTDIGARLSTLDIQEEQHEDFKISLAKSKSTFEDL 362
+TA ++ +A ++ R+ +GA + D + +L ++S ED
Sbjct: 413 AKKSTANPLASID---SALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDA 469

Query: 363 DYAEAVIEFNENS 375
DYA V ++
Sbjct: 470 DYATEVSNMSKAQ 482


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VC_2191   FLGHOOKAP1  537    0.0      Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 537 bits (1384), Expect = 0.0
Identities = 122/455 (26%), Positives = 211/455 (46%), Gaps = 19/455 (4%)

Query: 3 SDLLNVGTQSVLTAQRQLNTTGHNISNVNTEGYSRQSVIQGTNAPRQYGGETYGMGVHVE 62
S L+N + AQ LNT +NIS+ N GY+RQ+ I G G GV+V
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 63 NVRRSWDQFAVKELNIASTDYAFKRDTEENLDMLSKLLSSVASKKIPENLNEWFDSVKSL 122
V+R +D F +L A T + E + + +LS+ ++ + + ++F S+++L
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLST-STSSLATQMQDFFTSLQTL 119

Query: 123 ADSPNDLGARKVVLEKSKLISQNLNDFHETVRLQKDITNKGLALGVERINQLALEIRDLQ 182
+ D AR+ ++ KS+ + + +R Q N + V++IN A +I L
Sbjct: 120 VSNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLN 179

Query: 183 RLMMRVPG-----PHNDLMDKHEKLVSELSQYTKVTVTQRKHGEGFNIHIGNGHTLVSGT 237
+ R+ G N+L+D+ ++LVSEL+Q V V+ + G +NI + NG++LV G+
Sbjct: 180 DQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGT-YNITMANGYSLVQGS 238

Query: 238 EASQLRVIDGFPDTQQHRLAMVEGKALKAISARDI--GGKMEAILDMRDEHIPYLMDEVG 295
A QL + D + +A V+G A + G + IL R + + + +G
Sbjct: 239 TARQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLG 298

Query: 296 RLALSFSHEVNTLQSQGLDLRGNVGSALFTDVNLDVIARSRVVTNSNSKADMAV--FIED 353
+LAL+F+ NT G D G+ G F I + V+ N+ +K D+A+ + D
Sbjct: 299 QLALAFAEAFNTQHKAGFDANGDAGEDFFA------IGKPAVLQNTKNKGDVAIGATVTD 352

Query: 354 VSQLKGGEYSMQYNGSEFVVTLPSGQ--QTVLPVVKGNVYVDGLRVEVRNPPQVGERILV 411
S + +Y + ++ +++ VT + TV P G V DGL + P V + +
Sbjct: 353 ASAVLATDYKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTL 412

Query: 412 RPTRNGAAAIRLATEDATKIAAQSFEASTTFAQGK 446
+P + + + D KIA S E +
Sbjct: 413 KPVSDAIVNMDVLITDEAKIAMASEEDAGDSDNRN 447



Score = 145 bits (367), Expect = 3e-39
Identities = 32/105 (30%), Positives = 61/105 (58%)

Query: 520 EGDNGNLRKMINIQTAKRMNDNESTIIDLYHNLNTDVGLKMATMTRLTDVARLEKEAAQS 579
+ DN N + ++++Q+ + + D Y +L +D+G K AT+ + +
Sbjct: 442 DSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLSN 501

Query: 580 RIASISGVNLDEEAANMMKFQQAYMASSRIIQASNDTFNTILALR 624
+ SISGVNLDEE N+ +FQQ Y+A+++++Q +N F+ ++ +R
Sbjct: 502 QQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
VC_2192   FLGFLGJ  401    e-143    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 401 bits (1031), Expect = e-143
Identities = 105/299 (35%), Positives = 157/299 (52%), Gaps = 15/299 (5%)

Query: 13 DIAGLDKLRQKAVNGDENAGQSALTAAARQFESIFTSMMLKSMRDANSDFKSDLMSSQNE 72
D L++L+ KA G++ A + ARQ E +F MMLKSMRDA K L SS++
Sbjct: 14 DAQSLNELKAKA--GEDPAAN--IRPVARQVEGMFVQMMLKSMRDALP--KDGLFSSEHT 67

Query: 73 DLYRQMLDEQMASEFSSSGSLGLADMIVAQLSTGQTASEQKGEDGFQEAMRRVEHARKTA 132
LY M D+Q+A + ++ LGLA+M+V Q++ Q E+ ++ +T
Sbjct: 68 RLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEE------STPAAPMKFPLETV 121

Query: 133 SERSNEDLVAAVYPLRKTQAVQSTQFDSRHSFVTKLKPYADKAARMLGVDSSLLIAQAAL 192
N+ L V S DS+ +F+ +L A A++ GV L++AQAAL
Sbjct: 122 VRYQNQALSQLVQKAVPRNYDDSLPGDSK-AFLAQLSLPAQLASQQSGVPHHLILAQAAL 180

Query: 193 ETGWGQKMVKNARGN-SNNLFNIKADRSWQGDKVATQTLEYHNNVPVVEKAAFRSYASFD 251
E+GWGQ+ ++ G S NLF +KA +W+G T EY N KA FR Y+S+
Sbjct: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240

Query: 252 ESFNDYVRFLENNPRYTNALDHGGNSERFIHGIHRAGYATDPQYADKVLRVKAQIDQMN 310
E+ +DYV L NPRY A+ ++E+ + AGYATDP YA K+ + Q+ ++
Sbjct: 241 EALSDYVGLLTRNPRYA-AVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSIS 298


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_2193   FLGPRINGFLGI  425    e-151    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 425 bits (1094), Expect = e-151
Identities = 155/360 (43%), Positives = 221/360 (61%), Gaps = 12/360 (3%)

Query: 8 LMLLLASSAQAARIKDVAQVAGVRNNQLVGYGLVTGLPGTGES---TPFTDQSFNAMLQS 64
+ + A +RIKD+A + R+NQL+GYGLV GL GTG+S +PFT+QS AMLQ+
Sbjct: 18 FLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMRAMLQN 77

Query: 65 FGIQLPPGTKPKTKNVAAVIVTADLPAFSKQGQTIDITVSSIGSAKSLRGGTLMQTFLKG 124
GI G KN+AAV+VTA+LP F+ G +D+TVSS+G A SLRGG L+ T L G
Sbjct: 78 LGITTQGGQ-SNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIMTSLSG 136

Query: 125 LDGQVYAVAQGNLVVSGFSATGADGSKIVGNNPTVGMISSGAIVEREVPNPFGRGDYITF 184
DGQ+YAVAQG L+V+GFSA G D + + T + +GAI+ERE+P+ F +
Sbjct: 137 ADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFKDSVNLVL 195

Query: 185 NLFESDFTTAQRLADAVNQF----LGPQMASAVDAASIKVRAPRDLSQRVAFLSAIENLE 240
L DF+TA R+AD VN F G +A D+ I V+ PR ++ ++ IENL
Sbjct: 196 QLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPR-VADLTRLMAEIENLT 254

Query: 241 FNPADSAAKIIVNSRTGTIVVGQNVRLKPAAVTHGGMTVAIKENLNVSQPNALGGGQTVV 300
D+ AK+++N RTGTIV+G +VR+ AV++G +TV + E+ V QP GQT V
Sbjct: 255 VET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFSRGQTAV 313

Query: 301 VPNTEIEVTEKQGKMFKLEPGVTLDDLVRAVNEVGAAPSDLMAILQALKQAGAIEGQLII 360
P T+I ++ K+ +E G L LV +N +G ++AILQ +K AGA++ +L++
Sbjct: 314 QPQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQAELVL 372


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_2194   FLGLRINGFLGH  136    1e-41    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 136 bits (344), Expect = 1e-41
Identities = 69/205 (33%), Positives = 106/205 (51%), Gaps = 16/205 (7%)

Query: 68 AWAPIHP--------KGKPEHYAAETGSLFNLASSS-----SMYDDSKPRGVGDIITVTL 114
AW P P + P GS+F A +++D +PR +GD +T+ L
Sbjct: 23 AWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINYGYQPLFEDRRPRNIGDTLTIVL 82

Query: 115 NESTKAAKSADADLNKKNDASMDPLAVGGKDLTIGDYNFSYALK--NDNKFSGSAAANQS 172
E+ A+KS+ A+ ++ + V + L N ++ N F+G AN S
Sbjct: 83 QENVSASKSSSANASRDGKTNFGFDTVP-RYLQGLFGNARADVEASGGNTFNGKGGANAS 141

Query: 173 NSMSGSITVEVIEVLANGNLVIRGEKWLTLNTGDEYIRLSGTIRPDDIDFDNTIASNRIS 232
N+ SG++TV V +VL NGNL + GEK + +N G E+IR SG + P I NT+ S +++
Sbjct: 142 NTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRTISGSNTVPSTQVA 201

Query: 233 NARIQYSGTGTNQDMQEPGFLARFF 257
+ARI+Y G G + Q G+L RFF
Sbjct: 202 DARIEYVGNGYINEAQNMGWLQRFF 226


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VC_2195   FLGHOOKAP1  43     5e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.4 bits (102), Expect = 5e-07
Identities = 10/46 (21%), Positives = 20/46 (43%)

Query: 215 IRQSMLETSNVNVTEELVNMIEAQRVYEMNSKVISAVDKMMSFVNQ 260
+ S VN+ EE N+ Q+ Y N++V+ + + +
Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544



Score = 38.4 bits (89), Expect = 2e-05
Identities = 16/77 (20%), Positives = 34/77 (44%), Gaps = 14/77 (18%)

Query: 5 LWVSKTGLDAQQTNIATISNNLANASTIGFKKGRAVFEDLFYQNINQPGAQSSQNTRLPS 64
+ + +GL+A Q + T SNN+++ + G+ + + + N+ L +
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTLGA 49

Query: 65 GLMLGAGSKVVATQKVH 81
G +G G V Q+ +
Sbjct: 50 GGWVGNGVYVSGVQREY 66


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VC_2197   FLGHOOKAP1  40     2e-05    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 39.6 bits (92), Expect = 2e-05
Identities = 15/33 (45%), Positives = 21/33 (63%)

Query: 3 YVSLSGLSAAQMDLNTTSNNIANANTFGFKESR 35
++SGL+AAQ LNT SNNI++ N G+
Sbjct: 5 NNAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37



Score = 36.1 bits (83), Expect = 3e-04
Identities = 12/49 (24%), Positives = 26/49 (53%)

Query: 386 SISNGSLEQSNIDMTQELVDLISAQRNFQANSRALEVHNGLQQNILQIR 434
+SN S +++ +E +L Q+ + AN++ L+ N + ++ IR
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
VC_2199   FLGHOOKAP1  32     5e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 32.2 bits (73), Expect = 5e-04
Identities = 10/33 (30%), Positives = 15/33 (45%)

Query: 98 SNVNVMEEMANMISASRSYQTNVQVADASKQML 130
S VN+ EE N+ + Y N QV + +
Sbjct: 507 SGVNLDEEYGNLQRFQQYYLANAQVLQTANAIF 539


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
VC_2202   HTHFIS  65     7e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.9 bits (158), Expect = 7e-14
Identities = 30/125 (24%), Positives = 56/125 (44%), Gaps = 11/125 (8%)

Query: 183 RILIADDSSVARKQVQRAIESIGFEVVTTKDGKDAYEKLLEMLSEGPISNQISLVISDIE 242
IL+ADD + R + +A+ G++V T + ++ G LV++D+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATL----WRWIAAG----DGDLVVTDVV 56

Query: 243 MPEMDGYTLTAEIRRHNELKDLYVILHSSLSGVFNQAMVERVGANAFIAK-FNPDELGNA 301
MP+ + + L I++ DL V++ S+ + GA ++ K F+ EL
Sbjct: 57 MPDENAFDLLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114

Query: 302 VKAAL 306
+ AL
Sbjct: 115 IGRAL 119


46  VC_2727  VC_2734  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
VC_2727   0       23       3.777162  general secretion pathway protein J
VC_2728   -1      19       3.316141  general secretion pathway protein I
VC_2729   0       18       3.188102  general secretion pathway protein H
VC_2730   -1      16       2.917850  general secretion pathway protein G
VC_2731   -1      15       2.996597  general secretion pathway protein F
VC_2732   -1      13       1.934879  general secretion pathway protein E
VC_2734   -2      12       1.553917  general secretion pathway protein C
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_2727   PilS_PF08805  32     8e-04    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 32.2 bits (73), Expect = 8e-04
Identities = 15/54 (27%), Positives = 30/54 (55%), Gaps = 3/54 (5%)

Query: 3 RTNQVSSRQNM---AGFTLIEVLVAIAIFASLSVGAYQVLNQVQRSNEISAERT 53
+ +S+R+ G TL+EVL+ + + L+ AY++ + VQ + + S E+
Sbjct: 12 VFSSLSARRKKEQDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQN 65


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_2728   BCTERIALGSPH  30     0.001    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 29.9 bits (67), Expect = 0.001
Identities = 10/29 (34%), Positives = 18/29 (62%)

Query: 3 SKRGFTLLEVLVALAIFATAAISVIRSVS 31
+RGFTLLE+++ L + +A V+ +
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFP 30


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_2729   BCTERIALGSPH  126    3e-39    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 126 bits (317), Expect = 3e-39
Identities = 48/173 (27%), Positives = 71/173 (41%), Gaps = 43/173 (24%)

Query: 5 RGFTLLEILLVLVLVSASAVAVIATFPVSVKDEAKISAQSFYQRLLLLNEEAILSGQDFG 64
RGFTLLE++L+L+L+ SA V+ FP S D A + F +L + + + +GQ FG
Sbjct: 4 RGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQFFG 63

Query: 65 VRIDVDTRRLTFLQLTADKG--------------WQKWQNDKMTNQTTLKEG-LQLDFEL 109
V + D R FL L A G W + ++ ++ G L L F
Sbjct: 64 VSVHPD--RWQFLVLEARDGADPAPADDGWSGYRWLPLRAGRVATSGSIAGGKLNLAF-A 120

Query: 110 GGGAWQKDDRLFNPGSLFDEEMFADEKKEQKQEPAPQLFVLSSGEVTPFTLSI 162
G AW D P + + GE+TPF L++
Sbjct: 121 QGEAWTPGDN-------------------------PDVLIFPGGEMTPFRLTL 148


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_2730   BCTERIALGSPG  229    4e-81    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 229 bits (586), Expect = 4e-81
Identities = 92/143 (64%), Positives = 110/143 (76%), Gaps = 4/143 (2%)

Query: 1 MKKMRKQTGFTLLEVMVVVVILGILASFVVPNLLGNKEKADQQKAVTDIVALENALDMYK 60
M+ KQ GFTLLE+MVV+VI+G+LAS VVPNL+GNKEKAD+QKAV+DIVALENALDMYK
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60

Query: 61 LDNSVYPTTDQGLEALVTKPT-NPEPRNYREGGYIKRLPKDPWGNDYQYLSPGDKGTIDV 119
LDN YPTT+QGLE+LV PT P NY + GYIKRLP DPWGNDY ++PG+ G D+
Sbjct: 61 LDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120

Query: 120 FTLGADGQEGGEGTGADIGNWNI 142
+ G DG+ G E DI NW +
Sbjct: 121 LSAGPDGEMGTED---DITNWGL 140


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_2731   BCTERIALGSPF  564    0.0      Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 564 bits (1455), Expect = 0.0
Identities = 223/407 (54%), Positives = 300/407 (73%), Gaps = 2/407 (0%)

Query: 1 MAAFEYKALDAKGRHKKGVIEGDNARQVRQRLKEQSLVPMEVVETQVKAARSRSQGFAFK 60
MA + Y+ALDA+G+ +G E D+ARQ RQ L+E+ LVP+ V E + +S S G + +
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 61 RG--ISTPDLALITRQLATLVQSGMPLEECLRAVAEQSEKPRIRTMLVAVRAKVTEGYTL 118
R +ST DLAL+TRQLATLV + MPLEE L AVA+QSEKP + ++ AVR+KV EG++L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 119 SDSLGDYPHVFDELFRSMVAAGEKSGHLDSVLERLADYAENRQKMRSKLQQAMIYPVVLV 178
+D++ +P F+ L+ +MVAAGE SGHLD+VL RLADY E RQ+MRS++QQAMIYP VL
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 179 VFAVGIVAFLLAAVVPKIVGQFVQMGQALPASTQFLLDASDFLQHWGISLLVGLLMLIYL 238
V A+ +V+ LL+ VVPK+V QF+ M QALP ST+ L+ SD ++ +G +L+ LL
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 239 VRWLLTKPDIRLRWDRRVISLPVIGKIARGLNTARFARTLSICTSSAIPILDGMRVAVDV 298
R +L + R+ + RR++ LP+IG+IARGLNTAR+ARTLSI +SA+P+L MR++ DV
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 299 MTNQFVKQQVLAAAENVREGSSLRKALEQTKLFPPMMLHMIASGEQSGELEGMLTRAADN 358
M+N + + ++ A + VREG SL KALEQT LFPPMM HMIASGE+SGEL+ ML RAADN
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 359 QDNSFESTVNIALGIFTPALIALMAGMVLFIVMATLMPILEMNNLMS 405
QD F S + +ALG+F P L+ MA +VLFIV+A L PIL++N LMS
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLMS 407


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
VC_2734   BCTERIALGSPC  341    e-120    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 341 bits (875), Expect = e-120
Identities = 93/305 (30%), Positives = 146/305 (47%), Gaps = 37/305 (12%)

Query: 1 MEFKQLPPLAAWPRLLSQNTLRWQKPISEGLTLLLLVASAWTLGKMVWVVSAEQTPVPTW 60
M +LPPL+ I L LL++ L + W + P
Sbjct: 1 MNISKLPPLSP-------------SVIRRILFYLLMLLFCQQLAMIFWRIGLPDN-APVS 46

Query: 61 SPTLSGLKAERQPLDISVLQKGELFGVFTEPKEAPVVEQPVVVDAPKTRLSLVLSGVVAS 120
S ++ +A +QP+ L LFGV E +A ++ + + P + L+L L+GV+A
Sbjct: 47 SVQITPAQARQQPV---TLNDFTLFGVSPEKNKAGALDASQMSNLPPSTLNLSLTGVMAG 103

Query: 121 NDAQKSLAVIANRGVQATYGINEVIEGTQAKLKAVMPDRVIISNSGRDETLMLEGLDYTA 180
+D +S+A+I+ Q + G+NE + G AK+ ++ PDRV++ GR E L L Y+
Sbjct: 104 DDDSRSIAIISKDNEQFSRGVNEEVPGYNAKIVSIRPDRVVLQYQGRYEVLGL----YSQ 159

Query: 181 PATASVSNPPRPRPNQPNAVPQFEDKVDAIREAIARNPQEIFQYVRLSQVKRDDKVLGYR 240
+ + VP + + R + YV S + D+K+ GYR
Sbjct: 160 EDSG------------SDGVPGAQVN----EQLQQRASTTMSDYVSFSPIMNDNKLQGYR 203

Query: 241 VSPGKDPVLFESIGLQDGDMAVALNGLDLTDPNVMNTLFQSMNEMTEMSLTVERDGQQHD 300
++PG F +GLQD DMAVALNGLDL D + M ++ +LTVERDGQ+ D
Sbjct: 204 LNPGPKSDSFYRVGLQDNDMAVALNGLDLRDAEQAKKAMERMADVHNFTLTVERDGQRQD 263

Query: 301 VYIQF 305
+Y++F
Sbjct: 264 IYMEF 268



 
Contact Sachin Pundhir for Bugs/Comments.