PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome                        LMA28_genomic.gbk
Threshold dinucleotide bias   2
Threshold codon bias          4
Threshold %GC bias            3
E-value (RPSBlast)            0.05
Genome (non-pathogenic)
 
 
C) Potential GIs and PAIs in HE999757
S.No  Start      End        Bias  Virulence  Insertion elements  Prediction
1     BN424_1    BN424_17   Y     N          Y                   Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
BN424_1    2       15       0.965870   chromosomal replication initiator protein DnaA
BN424_2    2       14       1.222420   DNA polymerase III, beta subunit
BN424_3    3       12       2.137972   S4 domain protein YaaA
BN424_4    3       13       2.246072   DNA replication and repair RecF family protein
BN424_5    3       12       1.685853   DNA gyrase, B subunit
BN424_6    1       10       0.832884   DNA gyrase, A subunit
BN424_7    2       12       -0.716344  ribosomal protein S6
BN424_8    0       12       -0.745448  single-stranded DNA-binding protein ssb
BN424_9    2       11       -1.489831  ribosomal protein S18
BN424_10   2       11       -1.307055  DHH family protein
BN424_11   5       16       -1.806569  ribosomal protein L9
BN424_12   3       17       -2.363707  putative uncharacterized protein
BN424_13   2       19       -2.521603  putative uncharacterized domain protein
BN424_14   0       18       -2.544241  transposase family protein
BN424_16   0       18       -2.383006  integrase core domain protein
BN424_15   1       16       -4.063543  putative membrane protein
BN424_17   0       16       -3.603448  putative membrane protein
2     BN424_51   BN424_56   Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BN424_51   -3      13       3.167021   alpha-Glycerophosphate Oxidase
BN424_52   0       17       4.199687   hypothetical protein
BN424_53   0       14       4.787796   hypothetical protein
BN424_54   0       12       4.340787   MIP channel s family protein
BN424_55   0       14       3.237370   S-ribosylhomocysteine lyase
BN424_56   -1      14       3.054293   methyltransferase domain protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_55   LUXSPROTEIN   209    1e-72    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS

signature.
Length = 171

Score = 209 bits (534), Expect = 1e-72
Identities = 59/144 (40%), Positives = 88/144 (61%), Gaps = 7/144 (4%)

Query: 6 VESFNLDHTKVKAPYVRLAGRKDGENGDVILKYDVRFKQPNKEHMEMKSLHSLEHLTAEL 65
++SF +DHT++ AP VR+A GD I +D+RF PNK+ + K +H+LEHL A
Sbjct: 3 LDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYAGF 62

Query: 66 IRNHAD----YVVDWSPMGCQTGFYLTVINHDNYDDILSVLEATMKDVLVA---TEVPAS 118
+RNH + ++D SPMGC+TGFY+++I + + A M+DVL ++P
Sbjct: 63 MRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIPEL 122

Query: 119 NEVQCGWAASHTLEGAQQLATEFL 142
NE QCG AA H+L+ A+Q+A L
Sbjct: 123 NEYQCGTAAMHSLDEAKQIAKNIL 146


3     BN424_78   BN424_115  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BN424_78   3       20       -1.085973  putative uncharacterized protein
BN424_79   4       19       -2.149800  hypothetical protein
BN424_80   6       20       -1.631969  hypothetical protein
BN424_81   7       21       -1.181088  putative uncharacterized protein
BN424_82   10      20       -4.244711  putative uncharacterized protein
BN424_83   9       21       -2.683347  putative uncharacterized protein
BN424_84   6       22       -0.334337  hypothetical protein
BN424_85   6       22       -0.677221  putative uncharacterized protein
BN424_86   4       26       0.993029   conserved hypothetical protein
BN424_87   3       26       2.174261   hypothetical protein
BN424_88   2       30       4.051181   hypothetical protein
BN424_89   3       31       4.185119   putative uncharacterized domain protein
BN424_90   3       32       5.114493   hypothetical protein
BN424_91   3       31       5.453862   resolvase, N terminal domain protein
BN424_92   4       28       5.041336   resolvase, N terminal domain protein
BN424_93   4       25       3.322783   hypothetical protein
BN424_94   5       22       0.334554   hypothetical protein
BN424_95   4       21       -1.176683  conserved hypothetical protein
BN424_96   5       20       -2.843300  hypothetical protein
BN424_97   5       19       -2.754296  putative lipoprotein
BN424_98   5       20       -3.836926  conserved domain protein
BN424_99   5       19       -5.265848  hypothetical protein
BN424_100  5       18       -4.845041  ATPase associated with various cellular
BN424_101  5       17       -5.399463  conserved hypothetical protein
BN424_102  5       18       -5.042600  hypothetical protein
BN424_103  6       19       -6.798711  hypothetical protein
BN424_104  5       18       -6.904782  putative endonuclease
BN424_105  5       19       -8.230962  mcrBC 5-methylcytosine restriction system
BN424_106  6       20       -7.925619  hypothetical protein
BN424_107  5       22       -7.325097  putative membrane protein
BN424_108  3       15       -7.174616  putative membrane protein
BN424_109  2       14       -6.944572  putative membrane protein
BN424_110  3       16       -6.418490  hypothetical protein
BN424_111  2       17       -4.151777  helix-turn-helix family protein
BN424_112  2       15       -2.653642  NUDIX domain protein
BN424_113  1       14       -1.224176  mga helix-turn-helix domain protein
BN424_114  3       18       1.572880   hypothetical protein
BN424_115  3       18       2.346303   LPXTG-motif cell wall anchor domain protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_78   OMADHESIN     31     0.006    Yersinia outer membrane adhesin signature.
		>OMADHESIN#Yersinia outer membrane adhesin signature.

Length = 455

Score = 31.4 bits (70), Expect = 0.006
Identities = 26/71 (36%), Positives = 35/71 (49%), Gaps = 6/71 (8%)

Query: 78 IATDYKTAETKAVGSGVAGAGAGVAVVAIGP-SVAMG-LATTFGVAST----GTAISALS 131
I + A+ AV G GV VAIGP S A+G A T+G AST G AI A +
Sbjct: 75 IGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARA 134

Query: 132 GAAATNASLAW 142
+ T ++ +
Sbjct: 135 STSDTGVAVGF 145


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_81   RTXTOXIND     29     0.028    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 29.4 bits (66), Expect = 0.028
Identities = 13/99 (13%), Positives = 36/99 (36%), Gaps = 23/99 (23%)

Query: 100 LQDLQRRRDELQREKTNWLHQIAEQASKIPGLGHLFDDYSVLQAEKKV-QLMEDFRDFIH 158
+ + + + E E + S++ + +L A+++ + + F+
Sbjct: 254 VLEQENKYVEAVNE-------LRVYKSQLEQIES-----EILSAKEEYQLVTQLFK---- 297

Query: 159 RHDSDFSELNGLIEQVLLGLEELGKQRSFNGDKQGYSPI 197
+E+ + Q + L + + N ++Q S I
Sbjct: 298 ------NEILDKLRQTTDNIGLLTLELAKNEERQQASVI 330


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_82   PF06917       26     0.032    Periplasmic pectate lyase
		>PF06917#Periplasmic pectate lyase

Length = 555

Score = 26.4 bits (58), Expect = 0.032
Identities = 14/44 (31%), Positives = 19/44 (43%)

Query: 18 FIANENKLIIWNGYFETILDNLLDCDVEKKGIVKEYFNHEGWYD 61
+ NE+ L W G+ LD L K V E +H +YD
Sbjct: 106 GVHNESGLFYWGGHRFLNLDTLKTEGPASKDQVHELKHHLPYYD 149


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_113  PF05043       33     0.002    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 33.4 bits (76), Expect = 0.002
Identities = 31/139 (22%), Positives = 60/139 (43%), Gaps = 13/139 (9%)

Query: 1 MDTLLMKKDLAKLTLFRELVYNQPKELSLDYFSELLNISKRSTLRTVEELAHDLEKDFED 60
M LL KK +L L EL++ + +ELLN ++R+ V++ ++ F D
Sbjct: 1 MRDLLSKKSHRQLELL-ELLFEHKRWFHRSELAELLNCTERA----VKDDLSHVKSAFPD 55

Query: 61 MEIKKNKYSYSIMNNSLMNNEYFIVSLQLFY--LKNSIQFNIIYSLLTKYFDSMTQLSEY 118
+ +S N + N +++ K+S F+I+ + + +
Sbjct: 56 L------IFHSSTNGIRIINTDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKE 109

Query: 119 LYISTPHLYRQMPEIKRFL 137
YIS+ LYR + +I + +
Sbjct: 110 FYISSSSLYRIISQINKVI 128


4     BN424_148  BN424_155  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BN424_148  4       22       -1.930566  LPXTG-motif cell wall anchor domain protein
BN424_149  3       25       -2.104687  NUDIX domain protein
BN424_150  4       25       -2.001107  CAAX amino terminal protease family protein
BN424_151  4       26       -2.371686  thiF family protein
BN424_152  3       27       -3.106102  protein
BN424_153  4       28       -3.029632  ABC transporter family protein
BN424_154  3       29       -2.816056  ABC-2 type transporter family protein
BN424_155  2       25       -2.594808  ABC transporter family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_154  ABC2TRNSPORT  33     6e-04    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein

signature.
Length = 262

Score = 33.4 bits (76), Expect = 6e-04
Identities = 23/78 (29%), Positives = 40/78 (51%), Gaps = 2/78 (2%)

Query: 176 QVIMIGGLLF-SPITYPTDRLPSLLVRFFEILPFVPSSNLIRSMFYDQGIVNI-YNIIVI 233
Q ++I +LF S +P D+LP + LP S +LIR + +V++ ++ +
Sbjct: 182 QTLVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGAL 241

Query: 234 CFWLVLNMLLSLVSLSRR 251
C ++V+ LS L RR
Sbjct: 242 CIYIVIPFFLSTALLRRR 259


5     BN424_216  BN424_247  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BN424_216  3       14       -1.759256  peptide methionine sulfoxide reductase MsrA
BN424_217  5       15       -1.657392  mga helix-turn-helix domain protein
BN424_218  4       19       -0.537522  cyclic nucleotide-binding domain protein
BN424_219  3       20       -1.027006  hypothetical protein
BN424_220  2       20       -0.848685  LPXTG-motif cell wall anchor domain protein
BN424_221  2       18       -3.643260  cell surface protein
BN424_222  3       17       -3.345712  hypothetical protein
BN424_223  3       15       -3.151146  putative uncharacterized protein
BN424_224  4       19       -2.349703  conserved hypothetical protein
BN424_225  6       22       -0.073692  hypothetical protein
BN424_226  4       21       -0.242783  mga helix-turn-helix domain protein
BN424_227  3       19       5.445193   phosphoglycerate mutase family protein
BN424_229  1       15       3.973488   putative uncharacterized protein
BN424_230  1       12       3.337038   putative uncharacterized protein
BN424_231  2       14       2.888100   putative uncharacterized domain protein
BN424_234  1       11       1.648780   hypothetical protein
BN424_235  1       11       1.834915   ykud domain protein
BN424_236  0       10       2.016850   glycosyl hydrolase 1 family protein
BN424_237  0       10       2.276266   glycosyl hydrolases 18 family protein
BN424_238  0       8        2.255043   chitinase
BN424_239  1       11       2.088486   PRD domain protein
BN424_240  0       13       3.463393   PTS system, Lactose/Cellobiose specific IIB
BN424_241  0       11       3.875776   sugar-specific permease, SgaT/UlaA family
BN424_242  0       12       4.052370   transaldolase
BN424_243  0       11       3.807816   peptidase M23 family protein
BN424_244  -1      11       3.761364   shikimate 5-dehydrogenase
BN424_245  -2      11       3.857127   phospho-2-dehydro-3-deoxyheptonate aldolase
BN424_246  -2      12       3.979671   3-dehydroquinate synthase
BN424_247  -2      12       3.076958   chorismate synthase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_217  PF05043       53     2e-09    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 52.6 bits (126), Expect = 2e-09
Identities = 32/169 (18%), Positives = 66/169 (39%), Gaps = 7/169 (4%)

Query: 1 MKRLLDPNFIPILSLLKQLNKDYPSRSITFFSEQLKLDRRTILKTIHTLQLDISRNHWEN 60
M+ LL L LL+ L + + +E L R + + ++ + +
Sbjct: 1 MRDLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIFHS 60

Query: 61 MLTIEIIDKSVYTTISPFFSIEVFFSHYMSESFAVRLFLSLFKYPSDSIDEICEYLYVSK 120
+ + IE+ + H+ S + +F + IC+ Y+S
Sbjct: 61 ------STNGIRIINTDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISS 114

Query: 121 ATFYRRIKYSKEVLDDFNLSLDFTDSKNKLVGSETQIRYFFSTLFWEVF 169
++ YR I +V+ + + + +++G+E IRYFF+ F E +
Sbjct: 115 SSLYRIISQINKVIKR-QFQFEVSLTPVQIIGNERDIRYFFAQYFSEKY 162


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_222  SALSPVBPROT   28     0.025    Salmonella virulence plasmid 65kDa B protein signature.
		>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature.

Length = 591

Score = 28.2 bits (62), Expect = 0.025
Identities = 26/110 (23%), Positives = 41/110 (37%), Gaps = 14/110 (12%)

Query: 121 LKGKKVPTNILKASISIPKGKISTTADGDMQAPAVALPLTLS--KAAAPVMTAKKNNGLG 178
L G T L +PKG + + G ++ LPL +S + AP + ++G G
Sbjct: 4 LNGFSSATLALITPPFLPKGGKALSQSGPDGLASITLPLPISAERGFAPALALHYSSGGG 63

Query: 179 IWANDFNGLAGTVTIKVPTNAYIDQYS------------ANITWSLQDAP 216
T++I T+ + QY+ T S DAP
Sbjct: 64 NGPFGVGWSCATMSIARSTSHGVPQYNDSDEFLGPDGEVLVQTLSTGDAP 113


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_223  V8PROTEASE    28     0.024    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 28.4 bits (63), Expect = 0.024
Identities = 16/36 (44%), Positives = 22/36 (61%), Gaps = 4/36 (11%)

Query: 48 EDPMGPLDPLNPDNPN----PPSPVDPMDPENPGTG 79
+P P +P NPDNPN P +P +P +P+NP G
Sbjct: 290 NNPDNPDNPNNPDNPNNPDEPNNPDNPNNPDNPDNG 325


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_226  PF05043       34     0.001    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 33.8 bits (77), Expect = 0.001
Identities = 17/95 (17%), Positives = 43/95 (45%), Gaps = 7/95 (7%)

Query: 80 TSILEIRQYFLEESIAFKLLINLYQKQFIKLKNFSLTYYYSPSVVY---KKVNELKVKLK 136
+ I + +F + S F +L ++ + + ++ +Y S S +Y ++N++ +
Sbjct: 73 SDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSSSLYRIISQINKVIKRQF 132

Query: 137 GYGLTIKSEHGSIYLGGEEEKKRYFYSEIFYYVYG 171
+ +++ + G E RYF+++ F Y
Sbjct: 133 QFEVSLTPVQ----IIGNERDIRYFFAQYFSEKYY 163


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_243  IGASERPTASE   42     4e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 42.4 bits (99), Expect = 4e-06
Identities = 46/228 (20%), Positives = 77/228 (33%), Gaps = 25/228 (10%)

Query: 87 QQENQELTVKIDQREDQLEKQARVVQVNGDTQNYIDFVLEAKSMSDIIGRVDVVAQMVSA 146
+ E + TV QA V V + E + D S
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNN--------EEIARVDEAPVPPPAPATPSE 1035

Query: 147 NRAMVKQQADDKAQVVKQEKEVAKKSDEQKVLAADLAKTQEKLQTQKLEKESIVAQIAAD 206
V + + +++ V++ ++ A ++ Q A AK+ K TQ E VAQ ++
Sbjct: 1036 TTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNE----VAQSGSE 1091

Query: 207 TATAEGDKNKFLAQKAAAEKEAEDLR----IAKVAADKKASED--------AEAARV-VQ 253
T + + K A EK + + KV + ++ AE AR
Sbjct: 1092 TKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDP 1151

Query: 254 LANAKAAEAAKNTTVAAAPPAENPQGGGTPPVSTGAYGRPTNAPVSSS 301
N K ++ NTT PA+ PV+ N+ V +
Sbjct: 1152 TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENP 1199


6     BN424_266  BN424_289  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BN424_266  6       17       0.210203   maltose O-acetyltransferase
BN424_267  4       16       0.945926   hypothetical protein
BN424_268  4       15       0.223338   glyoxalase/Bleomycin resistance /Dioxygenase
BN424_269  3       13       0.341278   chorismate mutase
BN424_270  0       13       0.499978   hypothetical protein
BN424_271  -1      10       0.851111   hypothetical protein
BN424_272  -1      11       0.581762   arginine deiminase
BN424_273  2       12       -0.104397  ornithine carbamoyltransferase, catabolic domain
BN424_274  1       13       0.008266   ornithine carbamoyltransferase, catabolic
BN424_275  1       14       -0.015196  arginine/ornithine antiporter
BN424_276  3       17       0.321834   carbamate kinase
BN424_277  3       17       0.305484   crp family arginine deiminase pathway
BN424_278  3       15       0.522028   hypothetical protein
BN424_279  -1      13       -0.352581  putative uncharacterized protein
BN424_280  0       10       -0.425000  chitin binding domain protein
BN424_281  0       10       -0.662754  phosphoglycerate mutase family protein
BN424_282  1       9        -1.339655  bacterial regulatory helix-turn-helix, lysR
BN424_283  1       10       -1.842057  ATPase, P-type (transporting), HAD super, subIC
BN424_284  -1      14       -2.941987  mga helix-turn-helix domain protein
BN424_285  -1      11       -3.506314  threonine synthase
BN424_286  3       15       -5.352077  hypothetical protein
BN424_287  2       13       -4.757847  cyclic nucleotide-binding domain protein
BN424_288  0       12       -3.208266  hypothetical protein
BN424_289  -1      13       -3.066975  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_272  ARGDEIMINASE  513    0.0      Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 513 bits (1324), Expect = 0.0
Identities = 190/410 (46%), Positives = 271/410 (66%), Gaps = 8/410 (1%)

Query: 3 NPIHIMSEIGKLKTVLLKRPGEEVENLTPDIMGRLLFDDIPYLPIIQEEHDYFAKALTDN 62
NPI+I SEIG+LK VLL RPGEE+ENLTP IM LFDDIPYL + ++EH+ FA L +N
Sbjct: 6 NPINIFSEIGRLKKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQEHEVFASILKNN 65

Query: 63 GTEVLYLEKLTAEAI-DAGGIREVFIDRMLSESEISSPKIASALREYLLSMETFPMVTKI 121
E+ Y+E L +E + + + FI + + E+EI + + L++Y S+ M++K+
Sbjct: 66 LVEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTINLLKDYFSSLTIDNMISKM 125

Query: 122 MAGVRTRDIDVTTSNLVDISNKEHYPFFMDPMPNLYFTRDPAASLGNGLTINSMHYTARR 181
++GV T ++ TS+L D+ N F +DPMPN+ FTRDP AS+GNG+TIN M R+
Sbjct: 126 ISGVVTEELKNYTSSLDDLVN-GANLFIIDPMPNVLFTRDPFASIGNGVTINKMFTKVRQ 184

Query: 182 RESMFMEIIIQYHPRFANKGVEVWLDRDHPESIEGGDELVLNERVVAIGISQRTSAKAIE 241
RE++F E I +YHP + + V +WL+R S+EGGDELVLN+ ++ IGIS+RT AK++E
Sbjct: 185 RETIFAEYIFKYHPVYK-ENVPIWLNRWEEASLEGGDELVLNKGLLVIGISERTEAKSVE 243

Query: 242 ALAKALFSRNSNFEKVVAIKIPNVRAMMHLDTVFTMVDYDKFTIHPGIQADGGKVDTYII 301
LA +LF ++F+ ++A +IP R+ MHLDTVFT +DY FT Y++
Sbjct: 244 KLAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQIDYSVFTSFTSD---DMYFSIYVL 300

Query: 302 EPSTVPGEIRMTERN-DLQEVLREVLNVPELILIPCGNGDEIVAPREQWNDGSNTLAIAP 360
+ +I + + +++VL L ++ +I C GD I REQWNDG+N LAIAP
Sbjct: 301 TYNPSSSKIHIKKEKARIKDVLSFYLG-RKIDIIKCAGGDLIHGAREQWNDGANVLAIAP 359

Query: 361 GVVVTYNRNYVSNELLRSYGVKVIEVISSELSRGRGGPRCMSMPLIREDL 410
G ++ Y+RN+V+N+L G+KV + SSELSRGRGGPRCMSMPLIRED+
Sbjct: 360 GEIIAYSRNHVTNKLFEENGIKVHRIPSSELSRGRGGPRCMSMPLIREDI 409


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_276  CARBMTKINASE  393    e-140    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 393 bits (1011), Expect = e-140
Identities = 136/313 (43%), Positives = 194/313 (61%), Gaps = 4/313 (1%)

Query: 3 KRKIVVALGGNAIL--STDASDKAQKEALKATAAYLVEIIKQGNELIISHGNGPQVGNLV 60
+++V+ALGGNA+ S + + ++ TA + EII +G E++I+HGNGPQVG+L+
Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61

Query: 61 LQQQAAAS-KSNPAMPLDTCVAMTQGSIGYWLQNALENAFKKEGIEKSVISVVSQVVVDQ 119
L A + PA P+D AM+QG IGY +Q AL+N +K G+EK V+++++Q +VD+
Sbjct: 62 LHMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDK 121

Query: 120 NDVAFIHPTKPIGPFLTQSEAHEQMLLSDDTYQEDAGRGWRKVVPSPKPVSILEYPIINQ 179
ND AF +PTKP+GPF + A +ED+GRGWR+VVPSP P +E I +
Sbjct: 122 NDPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKK 181

Query: 180 LVENGVVTISVGGGGIPVIEAENEFVGVEAVIDKDFASQKLAELVEADLLVILTGVEQVY 239
LVE GV+ I+ GGGG+PVI + E GVEAVIDKD A +KLAE V AD+ +ILT V
Sbjct: 182 LVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAA 241

Query: 240 INYNQPNQKALTTVTTKELYQYIQENQFAPGSMLPKIEAAISFVEHNPKGKAVITSLENL 299
+ Y ++ L V +EL +Y +E F GSM PK+ AAI F+E + +A+I LE
Sbjct: 242 LYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGE-RAIIAHLEKA 300

Query: 300 GNFNTENAGTTIV 312
GT ++
Sbjct: 301 VEALEGKTGTQVL 313


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_278  IGASERPTASE   43     6e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 42.7 bits (100), Expect = 6e-06
Identities = 52/313 (16%), Positives = 95/313 (30%), Gaps = 35/313 (11%)

Query: 77 PFQVTQKTNPNPSQTAE------SEYKARLQELETNFKEKQLHFEQEMLEKEKEAQQNRQ 130
T T PN Q +E AR+ E E E Q+++
Sbjct: 991 TVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKT 1050

Query: 131 DLERNLKEKLEQQYNEQVATIKEEKEQNREQLQKQFNEQIAEIEHLSLEKRIEIETKYQE 190
EK EQ E A +E ++ + + N Q E+ E ET+ E
Sbjct: 1051 ------VEKNEQDATETTAQNREVAKE--AKSNVKANTQTNEVAQSGSE---TKETQTTE 1099

Query: 191 KLKMLEGIYAEQQDLSEAKYAEKEANLEKAYQDKQAQFDKQQQAKIAEIEAQKRKQDEEY 250
E + + E++A +E + + Q K + E + + +
Sbjct: 1100 T--------KETATVEK----EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAR 1147

Query: 251 ERKYLEKIEKLDLFYKERQTVLEKDIAEKEKEIIRNAETLAEEKLDRAAREAEQLLSEIK 310
E I++ + QT D + KE N E E + E
Sbjct: 1148 ENDPTVNIKE-----PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENT 1202

Query: 311 AKVALQQQNLEHSTIETEEKNSKAIED-AKKMASNIINSANGEAKKKLEKAELQAKTALE 369
Q S+ + + ++ +++ + +S + + L
Sbjct: 1203 TPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLS 1262

Query: 370 SARSDSQLLIENT 382
AR+ +Q + N
Sbjct: 1263 DARAKAQFVALNV 1275



Score = 35.0 bits (80), Expect = 0.001
Identities = 33/265 (12%), Positives = 73/265 (27%), Gaps = 8/265 (3%)

Query: 115 EQEMLEKEKEAQQNRQDLERNLKEKLEQQYNEQVATIKEEKEQNREQLQKQFNEQIAEIE 174
E E + + NE++A + E +
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043

Query: 175 HLSLEKRIEIETKYQEKLKMLEGIYAEQQDLSEAKYAEKEANLEKAYQDKQAQFDKQQQA 234
K +E + + A+ + S K + E A + + + +
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAK-EAKSNVKANTQTN--EVAQSGSETKETQTTET 1100

Query: 235 KIAEIEAQKRKQDEEYERKYLEKIEKLDLFYKERQTVLEKDIAEKEKEIIRNAETLAEEK 294
K ++ K E E+ + K+ Q+ + AE +E + +E
Sbjct: 1101 KETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARE--NDPTVNIKEP 1158

Query: 295 LDRAAREAEQLLSEIKAKVALQQQNLEHSTIET---EEKNSKAIEDAKKMASNIINSANG 351
+ A+ + ++Q E +T+ T +N + A + S+N
Sbjct: 1159 QSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNK 1218

Query: 352 EAKKKLEKAELQAKTALESARSDSQ 376
+ + S +
Sbjct: 1219 PKNRHRRSVRSVPHNVEPATTSSND 1243


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_284  PF05043       62     2e-12    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 62.3 bits (151), Expect = 2e-12
Identities = 80/423 (18%), Positives = 165/423 (39%), Gaps = 53/423 (12%)

Query: 1 MEELLSSSEQRELKIIHLLYQEEKLWTVEQLANYLQCSIDTCYRYIDRIKQIFYDHGNEF 60
M +LLS R+L+++ LL++ ++ + +LA L C+ + +K F D
Sbjct: 1 MRDLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPD----- 55

Query: 61 ELISKKTKGVLLKKTEHASLSKYESIYIEETIDFKLLSELFHSTYLTTEKLADHLFISKS 120
+ T G+ + T+ + + + + + F +L +F + E + +IS S
Sbjct: 56 LIFHSSTNGIRIINTDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSS 115

Query: 121 TLYRKLKKIAILLRKN-GIHLNISTLQLTGNEVWIREFFYLVYWSTSDSGFWPFE----- 174
+LYR + +I ++++ ++++ +Q+ GNE IR FF + WPFE
Sbjct: 116 SLYRIISQINKVIKRQFQFEVSLTPVQIIGNERDIRYFFAQYFSEKYYFLEWPFENFSSE 175

Query: 175 -------------SVPKHVLTHRVENIISSQN------SYFSTIEKLKLTYR-----MAI 210
S P ++ THR+ ++ N +F ++K + M
Sbjct: 176 PLSQLLELVYKETSFPMNLSTHRMLKLLLVTNLYRIKFGHFMEVDKDSFNDQSLDFLMQA 235

Query: 211 SFIRVQQKNFITH--------SIGDSFIDPFKEEYFEFITLNLMKTVPSNYQKNEKDYLS 262
I ++F + + F+ F++ +F +L + +Y + LS
Sbjct: 236 EGIEGVAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKDSYVEKSYHLLS 295

Query: 263 LIFCTYPYLDKSDLNFCGIVSWHSINNTIPYQLTDKLLSSLSTIYPTQNLLTNKKLFYQL 322
+ ++ + WH N Y+ L T + + N +Q
Sbjct: 296 DFIDQISVKYQIEIENKDNLIWHLHNTAHLYR------QELFTEFILFDQKGNTIRNFQN 349

Query: 323 LCISIYATYFQASFSKTSEFLKL---STLLKQTHTCFYLNTKHALATICQEEPFKRILLQ 379
+ + + S E L++ S ++ F +TKH + + Q +P ++L+
Sbjct: 350 IFPKFVSD-VKKELSHYLETLEVCSSSMMVNHLSYTFITHTKHLVINLLQNQPKLKVLVM 408

Query: 380 PNF 382
NF
Sbjct: 409 SNF 411


7     BN424_303  BN424_310  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BN424_303  0       18       3.604368   hypothetical protein
BN424_304  -1      15       3.014441   hypothetical protein
BN424_305  5       17       4.453030   hypothetical protein
BN424_306  4       15       4.119997   hypothetical protein
BN424_307  4       16       4.106024   putative uncharacterized protein
BN424_308  4       14       3.826466   cell surface protein with WxL domain
BN424_309  4       11       3.507192   conserved hypothetical protein
BN424_310  4       12       3.666682   LPXTG-motif cell wall anchor domain protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_308  V8PROTEASE    32     0.002    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 32.3 bits (73), Expect = 0.002
Identities = 15/41 (36%), Positives = 22/41 (53%)

Query: 37 NSISDVTFTTNTDPTNPVNPTDPTKPVLPVDPLDPADPHEP 77
+I D+ F + P NP NP +P P P +P +P +P P
Sbjct: 276 QNIEDIHFANDDQPNNPDNPDNPNNPDNPNNPDEPNNPDNP 316



Score = 29.6 bits (66), Expect = 0.012
Identities = 13/42 (30%), Positives = 21/42 (50%)

Query: 36 MNSISDVTFTTNTDPTNPVNPTDPTKPVLPVDPLDPADPHEP 77
++ +D +P NP NP +P P P +P +P +P P
Sbjct: 281 IHFANDDQPNNPDNPDNPNNPDNPNNPDEPNNPDNPNNPDNP 322


8     BN424_370  BN424_375  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BN424_370  2       14       1.311126   conserved hypothetical protein
BN424_371  2       14       1.291954   EDD, DegV family domain protein
BN424_372  3       18       -0.934281  hypothetical protein
BN424_373  4       20       -1.836047  conserved hypothetical protein
BN424_374  3       21       -1.129449  MIP channel s family protein
BN424_375  3       19       -1.107135  nlpC/P60 family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_375  GPOSANCHOR    40     1e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 40.0 bits (93), Expect = 1e-05
Identities = 34/243 (13%), Positives = 81/243 (33%), Gaps = 6/243 (2%)

Query: 27 ADDYSDKINSQNEKIKEIETQEKDVTTKLEGVTKEIVVAEEKARVLVEQSQATHAEMEKL 86
++ + KI+E+E ++ D+ LEG K + L + A A L
Sbjct: 101 LRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADL 160

Query: 87 TKEVDSLNAKIEKRTAQLEKQARAVQVSASSEGYVDFIL------SADSLSDVVGRVDVV 140
K ++ +A+++ + + ++ L S + +
Sbjct: 161 EKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEK 220

Query: 141 AQMVSANRELVKAQAEDKATVESNKKKTETKLTEQHEVAGQLEKLKGELEGKKLEQESVV 200
A + + +L KA ++ K +T E+ + + +L+ LEG +
Sbjct: 221 AALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADS 280

Query: 201 ATLAASKASAEGERDGFIAQKEEADRKAADLKAAEEAAKKAPVLQTSTEDKKAPVTTTNQ 260
A + +A + ++ A+ ++ + + E + + N+
Sbjct: 281 AKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNK 340

Query: 261 ESN 263
S
Sbjct: 341 ISE 343



Score = 39.7 bits (92), Expect = 2e-05
Identities = 50/253 (19%), Positives = 90/253 (35%), Gaps = 13/253 (5%)

Query: 23 LTALADDYSDKINSQNEKIKEIETQEKDVTTKLEGVTKEIVVAEEKARVLVEQSQATHAE 82
L A + + ++ + K++ + E E + L QSQ +A
Sbjct: 251 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNAN 310

Query: 83 MEKLTKEVDSLNAKIEKRTAQLEKQARAVQVS-ASSEGYVDFILSADSLSDVVGRVDVVA 141
+ L +++D+ ++ A+ +K ++S AS + L D + + + A
Sbjct: 311 RQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQS-----LRRDLDASREAKKQLEA 365

Query: 142 QMVSANRELVKAQAE------DKATVESNKKKTETKLTEQHEVAGQLEKLKGELEGKKLE 195
+ + ++A D KK+ E L E + LEKL ELE K
Sbjct: 366 EHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKL 425

Query: 196 QESVVATLAAS-KASAEGERDGFIAQKEEADRKAADLKAAEEAAKKAPVLQTSTEDKKAP 254
E A L A +A A+ ++ Q EE + A + + P + +AP
Sbjct: 426 TEKEKAELQAKLEAEAKALKEKLAKQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAP 485

Query: 255 VTTTNQESNPPAP 267
T N
Sbjct: 486 QAGTKPNQNKAPM 498


9     BN424_424  BN424_441  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BN424_424  3       14       -0.140522  cobalt transport family protein
BN424_425  0       14       1.409901   ABC transporter family protein
BN424_426  0       13       1.937213   ABC transporter family protein
BN424_427  1       15       2.148393   putative membrane protein
BN424_428  2       17       1.324878   hypothetical protein
BN424_429  2       16       1.153325   ethanolamine utilization protein, EutP
BN424_430  1       13       -0.483548  cob(I)yrinic acid a,c-diamide
BN424_431  -1      14       -0.259878  iron-containing alcohol dehydrogenase family
BN424_432  -1      13       0.014273   alcohol dehydrogenase, iron-dependent domain
BN424_433  -1      13       0.095253   propanediol utilization protein PduU
BN424_434  -1      13       0.502191   putative transcriptional regulatory protein
BN424_435  0       14       0.779688   histidine kinase-, DNA gyrase B-, and HSP90-like
BN424_436  1       17       2.473964   ethanolamine utilisation EutA family protein
BN424_437  2       20       3.067116   ethanolamine ammonia-lyase heavy chain
BN424_438  4       18       2.596212   ethanolamine ammonia-lyase light chain family
BN424_439  4       17       3.166528   ethanolamine utilization protein EutL
BN424_440  3       15       2.683088   BMC domain protein
BN424_441  2       15       2.901545   acetaldehyde dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_426  PF05272       30     0.007    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.007
Identities = 9/19 (47%), Positives = 11/19 (57%)

Query: 16 IVGKNGTGKSTFLMVLAGL 34
+ G G GKST + L GL
Sbjct: 601 LEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_434  HTHFIS        75     2e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.3 bits (185), Expect = 2e-18
Identities = 34/177 (19%), Positives = 64/177 (36%), Gaps = 15/177 (8%)

Query: 3 GRIVIVDDEPITRMDIRDILEAGGYDVVGEASDGFEAIELCKSQHPDLVIMDIQMPLLDG 62
I++ DD+ R + L GYDV S+ + DLV+ D+ MP +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 63 LKAGKKIASENLAGGIILLSAFSDPTNTERAKNFGALGYLVKPLDEKSLIPTVEMSIAKG 122
+I ++++SA + +A GA YL KP D
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDL-------------- 108

Query: 123 KETQKLEEQLNKLTKKLEERKIIERAKGILMIENKITEEDAYQMIRTLSMDKRSPMI 179
E + + K+ + + G+ ++ ++ Y+++ L + MI
Sbjct: 109 TELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMI 165


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_435  PF06580       45     4e-07    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 44.9 bits (106), Expect = 4e-07
Identities = 39/216 (18%), Positives = 79/216 (36%), Gaps = 28/216 (12%)

Query: 265 KTEIKNKEAEIISKSVAIREIHHRVK-----NNLQSVVSLLRIQARRCESQEAKTALNES 319
+ EI + +++ + + ++ N L ++ +L+ +A+ L S
Sbjct: 146 QAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDP-----TKAREML-TS 199

Query: 320 VSRILAISATHELLSKQVEDGIQLKTVLESV-VY-NIQRC-FLDRNHITVVSDVSPDIVI 376
+S ++ S L + L L V Y + F DR + + ++P I
Sbjct: 200 LSELMRYS-----LRYSNARQVSLADELTVVDSYLQLASIQFEDR--LQFENQINPAI-- 250

Query: 377 DSDRTVAIALIVNELLQNSYDHAFGN-EQVGLIKLTAQAEEKVITISVIDDGTGFDVKKV 435
++V L++N H Q G I L + +T+ V + G+
Sbjct: 251 --MDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK 308

Query: 436 STTSLGLQIVNSYVKD--KLRGKIKIKSKEETGTST 469
+T GLQ V ++ +IK+ K+ +
Sbjct: 309 ESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM 344


10    BN424_467  BN424_479  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BN424_467  2       12       1.088654   S4 domain protein
BN424_468  1       12       1.605750   septum formation initiator family protein
BN424_469  0       13       2.589903   S1 RNA binding domain protein
BN424_470  -1      12       2.734315   tRNA(Ile)-lysidine synthetase
BN424_471  -1      17       3.083265   hypoxanthine phosphoribosyltransferase
BN424_472  -1      16       3.914632   ATP-dependent metallopeptidase HflB family
BN424_473  -1      15       4.713767   33 kDa chaperonin
BN424_474  -1      16       6.286375   cysteine synthase A
BN424_475  1       17       6.174773   TIM-barrel, nifR3 family protein
BN424_476  2       23       5.514777   lysyl-tRNA synthetase
BN424_478  5       41       8.614271   putative uncharacterized protein
BN424_479  4       31       3.804880   putative uncharacterized protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_472  IGASERPTASE   40     4e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.0 bits (93), Expect = 4e-05
Identities = 17/76 (22%), Positives = 33/76 (43%)

Query: 636 PRGKEAKEEYPREDGASFEEAKKALEAKEAEKMKEEKAELEARKKAEEEATVEEEKGAEE 695
+ +E +E A+ + + A E ++ + + + A + EE+A VE EK E
Sbjct: 1063 AQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEV 1122

Query: 696 ASKEIKLEKKEEDSHE 711
++ K+E S
Sbjct: 1123 PKVTSQVSPKQEQSET 1138



Score = 30.4 bits (68), Expect = 0.029
Identities = 13/60 (21%), Positives = 27/60 (45%)

Query: 648 EDGASFEEAKKALEAKEAEKMKEEKAELEARKKAEEEATVEEEKGAEEASKEIKLEKKEE 707
+ G+ +E + + A KEEKA++E K E + +E S+ ++ + +
Sbjct: 1087 QSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA 1146


11    BN424_558  BN424_572  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BN424_558  2       16       1.955409   uracil-DNA glycosylase
BN424_559  2       17       1.695618   phosphate acetyltransferase
BN424_560  3       16       1.126743   putative uncharacterized protein
BN424_561  4       15       -0.497464  glyoxalase/Bleomycin resistance /Dioxygenase
BN424_562  4       14       -0.974175  transposase DDE domain protein
BN424_563  1       12       -3.288825  putative uncharacterized protein
BN424_564  1       13       -3.144285  ribose 5-phosphate isomerase A
BN424_565  0       13       -2.859747  activator of Hsp90 ATPase homolog 1-like family
BN424_566  0       13       -2.683009  hypothetical protein
BN424_567  0       15       -1.032709  putative uncharacterized protein
BN424_568  0       14       -1.042313  yibE/F-like family protein
BN424_569  0       14       -0.991647  yibE/F-like family protein
BN424_570  0       13       -1.888874  putative zinc transport system zinc-binding
BN424_571  2       17       -2.309846  ribosomal protein L33
BN424_572  3       15       -2.445460  alternate 30S ribosomal protein S14
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_570  adhesinb      158    3e-47    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 158 bits (401), Expect = 3e-47
Identities = 47/193 (24%), Positives = 93/193 (48%), Gaps = 3/193 (1%)

Query: 241 DHHDADEGSDTDEHDHEGHSHAFDPHIWLDPVIAQQQVQTIKDGLVKADDVNKDSYEKNA 300
D++ EG D + + DPH WL+ Q I L + D NK++YEKN
Sbjct: 115 DYYAVSEGVDVIYLEGQSEKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNL 174

Query: 301 ASYIEKLKKLDKDFENELKD--TKNRTFVTQHTAFAYLANRYNLEQVAISGLSPDLEPSP 358
+Y+EKL LDK+ + + + + + VT F Y + YN+ I ++ + E +P
Sbjct: 175 KAYVEKLSALDKEAKEKFNNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTP 234

Query: 359 AKLAELSDFVKENNISVIYFENSASPKISKTLASGTGAVLEVLSPIEGVSQSDQDKGIDY 418
++ L + +++ + ++ E+S + KT++ T + + V++ ++G Y
Sbjct: 235 DQIKTLVEKLRKTKVPSLFVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAE-KGEEGDSY 293

Query: 419 IKVMEANLKALKK 431
+M+ NL+ + +
Sbjct: 294 YSMMKYNLEKIAE 306



Score = 100 bits (250), Expect = 8e-26
Identities = 36/154 (23%), Positives = 71/154 (46%), Gaps = 17/154 (11%)

Query: 5 RKLKLLLPLLVVILVVVGCTQPGKTTAPQEKTKLQVVTTFFPMYDFTRNVTKEHADVTML 64
+K + L+ LL+ + + C+ K++ +KL VV T + D T+N+ + ++ +
Sbjct: 2 KKCRFLVLLLLAFVGLAACSS-QKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSI 60

Query: 65 MKAGVEPHDYEPSAKDIAKIADADVFIYNSEYMET----WVPSVLKNIDSKKTT-VIDAS 119
+ G +PH+YEP +D+ K + AD+ YN +ET W +++N K+ S
Sbjct: 61 VPVGQDPHEYEPLPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVS 120

Query: 120 KNIPLLAGSDEHSEEDSEHHHDTDEHEEEPHFHL 153
+ + ++ + + D PH L
Sbjct: 121 EGVDVI----YLEGQSEKGKED-------PHAWL 143


12  BN424_589  BN424_601  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_589   2       12       2.533321   DEAD/DEAH box helicase family protein
BN424_590   3       15       1.237014   holo-[acyl-carrier-protein] synthase
BN424_591   2       14       0.845608   alanine racemase
BN424_592   3       16       0.064497   transposase DDE domain protein
BN424_593   -2      13       -2.239751  hypothetical protein
BN424_594   0       15       -0.378626  putative uncharacterized protein
BN424_595   0       18       1.834160   mRNA interferase EndoA
BN424_596   2       22       4.306128   queT transporter family protein
BN424_597   3       24       4.892079   CBS domain pair family protein
BN424_598   1       21       3.443409   O-methyltransferase family protein
BN424_601   0       21       3.099059   *putative uncharacterized protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_589   cloacin       45     5e-07    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 45.1 bits (106), Expect = 5e-07
Identities = 28/82 (34%), Positives = 35/82 (42%), Gaps = 4/82 (4%)

Query: 445 SGGGG-GNRGGASRGKGGYRGGSGGGERRGGAQRG---GKDNRRSGGGAGSGGSWNKDAK 500
SGG G G+ GA G GG G GGA G +N GGG+GSG W +
Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61

Query: 501 RGSNAGGSSSSEARKGGRGNSS 522
G+ G +S G S+
Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSA 83



Score = 37.8 bits (87), Expect = 1e-04
Identities = 24/60 (40%), Positives = 27/60 (45%), Gaps = 9/60 (15%)

Query: 443 GGSGGGGGNRGGASRGKG--------GYRGGSGGGERRGGAQRGGKDNRRSGGGAGSGGS 494
GG G G GGAS G G G GSG G G N SGGG+G+GG+
Sbjct: 22 GGPTGLGVG-GGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80



Score = 36.2 bits (83), Expect = 3e-04
Identities = 21/72 (29%), Positives = 24/72 (33%), Gaps = 5/72 (6%)

Query: 464 GGSGGGERRGGAQRGGKDNRRSGGGAGSGGS-----WNKDAKRGSNAGGSSSSEARKGGR 518
GG G G G G N G GG+ W+ + GS G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62

Query: 519 GNSSDNRRSGGG 530
GN N SGGG
Sbjct: 63 GNGGGNGNSGGG 74


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_591   ALARACEMASE   356    e-124    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 356 bits (916), Expect = e-124
Identities = 113/373 (30%), Positives = 181/373 (48%), Gaps = 25/373 (6%)

Query: 7 RNTYALIDRNAIFNNIKNQMNLLEHDTEVYAVVKADGYGHGALEVATIAREAGVQGFCVA 66
R A +D A+ N+ H V++VVKA+ YGHG + + GF +
Sbjct: 3 RPIQASLDLQALKQNLSIVRQAATH-ARVWSVVKANAYGHGIERIWSAIGATD--GFALL 59

Query: 67 LIDEALELRQAGFKEPILIM-GLVEAKYAKLLLDQRISVAVGYLKWLEEAEFYLKREKSF 125
++EA+ LR+ G+K PIL++ G A+ ++ R++ V L+ +
Sbjct: 60 NLEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARL----- 114

Query: 126 SENRKLDIHLAIDTGMGRVGFRTSAELAEVESYLTHSTSFNCQGVFTHFATADSKDTKQF 185
LDI+L +++GM R+GF+ + V L + + +HFA A+ D
Sbjct: 115 --KAPLDIYLKVNSGMNRLGFQPD-RVLTVWQQLRAMANVGEMTLMSHFAEAEHPDG--I 169

Query: 186 HQQVEKFQKLVEGMTVKPTYIHSANSATSLWHQKYQKRIVRLGIAMYGLNPSGRELDL-P 244
+ + ++ EG +NSA +LWH + VR GI +YG +PSG+ D+
Sbjct: 170 SGAMARIEQAAEG---LECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIAN 226

Query: 245 IKLKPAMSVETTLVQVKQMSAGETVSYGATYKAKEGEWIGTLPIGYADGWRRSLQGQT-V 303
L+P M++ + ++ V+ + AGE V YG Y A++ + IG + GYADG+ R T V
Sbjct: 227 TGLRPVMTLSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPV 286

Query: 304 LVEGERCEIVGRICMDQCMIRLNK--EVPIGTKVVLIGKDQNDEISAQEIAEYLDTINYE 361
LV+G R VG + MD + L + IGT V L GK EI ++A T+ YE
Sbjct: 287 LVDGVRTMTVGTVSMDMLAVDLTPCPQAGIGTPVELWGK----EIKIDDVAAAAGTVGYE 342

Query: 362 IVCGFTQRLPRVY 374
++C R+P V
Sbjct: 343 LMCALALRVPVVT 355


13  BN424_672  BN424_689  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_672   2       15       0.172058   universal stress family protein
BN424_673   2       16       1.344565   major Facilitator Superfamily protein
BN424_674   2       15       0.803696   transcriptional regulator PadR-like family
BN424_675   0       15       1.049161   BH2160 protein
BN424_676   0       15       0.556114   bacterial transferase hexapeptide family
BN424_677   -1      17       0.078926   hypothetical protein
BN424_678   0       14       -0.496701  SPFH domain / Band 7 family protein
BN424_679   1       12       -1.765073  hypothetical protein
BN424_680   3       13       -2.207599  aspartate kinase domain protein
BN424_681   4       14       -1.980638  putative membrane protein
BN424_682   1       14       -1.300638  putative membrane protein
BN424_683   0       14       -0.794241  helix-turn-helix family protein
BN424_684   0       14       0.523708   putative uncharacterized domain protein
BN424_685   0       14       0.855148   helix-turn-helix family protein
BN424_686   1       11       0.866928   cadmium-translocating P-type ATPase
BN424_687   1       12       0.906533   putative uncharacterized protein
BN424_688   3       11       0.530605   N-acetylmuramic acid 6-phosphate etherase
BN424_689   2       14       2.371465   phosphotransferase system, EIIC family protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_673   TCRTETA       78     8e-18    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 77.6 bits (191), Expect = 8e-18
Identities = 69/390 (17%), Positives = 137/390 (35%), Gaps = 27/390 (6%)

Query: 27 LKQPKPAWAVAFACVIAFMGIGLVDPILKSISEQLHAT---PAETSLLFTSYMLVTGIVM 83
+K +P + + +GIGL+ P+L + L + A +L Y L+
Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60

Query: 84 LFSGYISSRIGAKKTILYGLVIIIIFAVLGGFSNSVGELVGFRAGWGLGNALFISTALSA 143
G +S R G + +L L + + + + L R G+ A + A +
Sbjct: 61 PVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATG-AVAGAY 119

Query: 144 IVGVSVGGTEQSIIMY-EAAMGLGMSVGPLLGGLLGSISWRAPFFGVATLMAVAFISVTI 202
I ++ G + A G GM GP+LGGL+G S APFF A L + F++
Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCF 179

Query: 203 LL------------EKIPKPTKKVAFWDGLRALKHKGLLIMGITALFYNFGFFTLLAYSP 250
LL + P + G+ + L+ + L
Sbjct: 180 LLPESHKGERRPLRREALNPLASFRWARGMTVV--AALMAVFFIMQLVGQVPAALWVIFG 237

Query: 251 FRMESYSAMQVGFVFFGWGLCLAISSVFLAPKLQEKFGTKNMMYAALLLFALDLMIMGFG 310
+ A +G +G+ +++ + + + G + + ++ +++ F
Sbjct: 238 EDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297

Query: 311 ANNSTIISICIIIA--GLFQGVNNTLITTAVMEVSPVDRSIASSAYSFIRFTGGALAPWL 368
I +++A G+ +++ +V + + + + + P L
Sbjct: 298 TRGWMAFPIMVLLASGGIGMPALQAMLSR---QVDEERQGQLQGSLAALTSLTSIVGPLL 354

Query: 369 AGKLADWYNPHVTFWFAAVAVACGALVLFI 398
+ Y +T W +A AL L
Sbjct: 355 FTAI---YAASITTWNGWAWIAGAALYLLC 381


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_675   BACSURFANTGN  27     0.017    Yersinia/Haemophilus virulence surface antigen sign...
		>BACSURFANTGN#Yersinia/Haemophilus virulence surface antigen signature.

Length = 322

Score = 27.4 bits (60), Expect = 0.017
Identities = 16/53 (30%), Positives = 26/53 (49%), Gaps = 4/53 (7%)

Query: 3 HHIDIYVKDLEKQSNFWSWFLGELGY---QEFQKWETGISWKKADFYYVLSIG 52
H I YV + + F F GE + ++F+KW T W + ++Y L +G
Sbjct: 258 HAIAAYVNEKSGVTFFDPNF-GEFHFSDKEKFRKWFTNSFWGNSMYHYPLGVG 309


14  BN424_738  BN424_743  Y  N  Y  Genomic Island
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_738   -1      16       -3.232579  nitroreductase
BN424_739   2       19       -4.545376  transposase, Mutator family protein
BN424_740   2       18       -6.335115  putative membrane protein
BN424_741   0       16       -5.066791  hypothetical protein
BN424_742   -1      15       -4.646667  hypothetical protein
BN424_743   -1      13       -3.320246  putative membrane protein
15  BN424_956  BN424_970  Y  N  N  Genomic Island
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_956   0       11       -3.320171  putative membrane protein
BN424_957   -1      10       -1.760669  CAAX amino terminal protease family protein
BN424_958   0       11       -1.115837  putative uncharacterized protein
BN424_959   -1      13       0.019339   helix-turn-helix family protein
BN424_960   -1      20       3.819042   putative uncharacterized protein
BN424_961   0       24       4.626278   hypothetical protein
BN424_962   0       26       5.085902   lin1909 protein
BN424_963   0       28       4.654405   feS assembly ATPase SufC
BN424_964   0       25       4.342403   feS assembly protein SufD
BN424_965   0       22       2.773288   cysteine desulfurase, SufS subfamily protein
BN424_966   2       21       0.065342   SUF system FeS assembly protein, NifU family
BN424_967   2       18       0.024901   feS assembly protein SufB
BN424_968   3       18       -1.811294  hypothetical protein
BN424_969   3       19       -1.914594  protein
BN424_970   2       20       -1.888874  helix-turn-helix family protein
16  BN424_1027  BN424_1064  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_1027  -1      16       3.359218   putative uncharacterized protein
BN424_1028  -2      20       4.517630   phosphoglucose isomerase family protein
BN424_1029  1       28       3.382629   acetyltransferase family protein
BN424_1031  1       21       3.766169   putative uncharacterized protein
BN424_1032  0       13       1.594402   putative uncharacterized protein
BN424_1033  1       12       0.482668   cell wall-associated hydrolase
BN424_1035  2       12       0.067484   hypothetical protein
BN424_1056  2       10       -0.303883  ******************putative membrane protein
BN424_1057  3       12       0.736188   methionine adenosyltransferase
BN424_1058  2       10       -0.189595  drug resistance MFS transporter, drug:H+
BN424_1059  1       12       -0.103876  rRNA methylase family protein
BN424_1060  1       12       -0.243304  radical SAM superfamily protein
BN424_1061  1       12       -1.270674  major Facilitator Superfamily protein
BN424_1062  1       13       -0.316010  PAP2 superfamily protein
BN424_1063  0       14       0.416792   hypothetical protein
BN424_1064  2       16       1.113671   conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_1058  TCRTETB       123    1e-32    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 123 bits (309), Expect = 1e-32
Identities = 88/399 (22%), Positives = 174/399 (43%), Gaps = 15/399 (3%)

Query: 21 TFMTAVEGTIVSTAMPTIVGSLEGM-AIMNWVFSIYLLTNAMMTPVYGKLSDMIGRKPIF 79
+F + + +++ ++P I A NWV + ++LT ++ T VYGKLSD +G K +
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 80 IIGAIIFVIGSSLCGLAQTMDQLILF-RAIQGIGAGAIMPVSFTIIADIYPYEKRAKVMG 138
+ G II GS + + + L++ R IQG GA A + ++A P E R K G
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142

Query: 139 MNGAAWGIAGIFGPLLGGFIVDQLSWHWIFYINVPVGIITIILIALFLHEDFSFEKKPID 198
+ G+ + GP +GG I + HW + + +P+ I + + L + K D
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYI--HWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFD 200

Query: 199 FLGCFSLMAALLFLLYGFQIVGDTGEFSASMAGVFALAVGMFALFIFAEKRAIDPIIPLS 258
G + ++F + ++V F +F+ ++ DP +
Sbjct: 201 IKGIILMSVGIVFFMLFTTS---------YSISFLIVSVLSFLIFVKHIRKVTDPFVDPG 251

Query: 259 LFNNRTFVIQNIVAALVSGFLIGIDVYIPMWMQGLLGMK-AAMGGFAITPMSLTWIIGSF 317
L N F+I + ++ G + G +P M+ + + A +G I P +++ II +
Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311

Query: 318 IAGRVILKHPVKSILSSSLVIVGISGLMMVLAPMTTPFAFFLLVTAIIGIGMGITITTTT 377
I G ++ + +L+ + + +S L TT + +++ ++G +T
Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVIST 371

Query: 378 VTAQSVVPQDQIGVATSSNTLFRILGQTVMVSVYGIVLN 416
+ + S+ Q + G S L + +++ G +L+
Sbjct: 372 IVSSSLKQQ-EAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_1061  TCRTETA       44     9e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 43.7 bits (103), Expect = 9e-07
Identities = 53/364 (14%), Positives = 109/364 (29%), Gaps = 13/364 (3%)

Query: 3 RDLWIVAIGMVLLYTGLSFIWPFNMLYMTENLGMSDTAAGTALLVN--SGIGIIGSVIGG 60
R L ++ + L G+ I P + + + +D A +L+ + + + + G
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64

Query: 61 IIFDRVSGYVSLAIGTGILVLTTGSLFLFHGHPAF--IYNIWAVSVAMGMVFAGLYTAAG 118
+ DR L + L + P +Y V+ G A
Sbjct: 65 ALSDRFGRRPVLLVS---LAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA 121

Query: 119 LTHPSGGRTG-FNTIYVAQNIGVAVGPFLAGFLAKDGLGNVYTGSFAFALIYALFFFVYF 177
R F + G+ GP L G + + + A + L
Sbjct: 122 DITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181

Query: 178 RKIDWHSNKVTSETKHKQKGTVRGKATKIGLISFGLLLLTYLFCQLPHVQWQSNLSTYMT 237
+ + + + G+ L+ + QL + +
Sbjct: 182 PE---SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGE 238

Query: 238 SQKGVTTAQYGNLWSINGTLILIGQVLIIPLVARFKEKLSLQIYIGIGLFFCSFLFAMQA 297
+ G + G L + Q +I VA + + +G+ ++ A
Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRA-LMLGMIADGTGYILLAFA 297

Query: 298 ESYGGFLLGMILLTLGEMFAWPAIPAIAYKLAPVGQAGLYQGLVNGTATAARMIAPIFGA 357
M+LL G + PA+ A+ + + G QG + + ++ P+
Sbjct: 298 TRGWMAFPIMVLLASGGIGM-PALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFT 356

Query: 358 VVVA 361
+ A
Sbjct: 357 AIYA 360


17  BN424_1147  BN424_1158  Y  N  Y  Genomic Island
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_1147  2       16       -0.266042  primosome component
BN424_1148  1       17       0.076938   threonyl-tRNA synthetase
BN424_1149  1       15       -0.466744  polysaccharide deacetylase family protein
BN424_1150  4       24       -0.706441  translation initiation factor IF-3
BN424_1151  2       22       -0.913264  ribosomal protein L35
BN424_1152  2       19       -0.644706  ribosomal protein L20
BN424_1153  4       17       -5.139429  PIN family toxin-antitoxin system
BN424_1154  5       16       -5.456321  hypothetical protein
BN424_1155  4       16       -5.222207  PIN family toxin-antitoxin system
BN424_1156  4       15       -4.572108  transposase, Mutator family protein
BN424_1157  1       12       -3.581418  truncated transposase
BN424_1158  0       12       -3.397901  putative uncharacterized protein
18  BN424_1231  BN424_1247  Y  N  N  Genomic Island
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_1231  3       21       -2.843984  hypothetical protein
BN424_1232  4       23       -3.135925  putative uncharacterized domain protein
BN424_1233  6       26       -8.283893  putative uncharacterized protein
BN424_1234  6       25       -7.856324  putative uncharacterized protein
BN424_1235  7       23       -7.384948  group-specific protein
BN424_1236  9       24       -7.841255  putative uncharacterized protein
BN424_1237  10      25       -8.168216  hypothetical protein
BN424_1238  9       25       -7.906665  hypothetical protein
BN424_1239  10      23       -6.412299  hypothetical protein
BN424_1240  10      24       -6.095476  hypothetical protein
BN424_1241  13      27       -5.325968  conserved hypothetical protein
BN424_1242  14      32       -4.280503  hypothetical protein
BN424_1243  14      30       -4.296699  putative uncharacterized protein
BN424_1244  14      29       -5.507524  putative uncharacterized domain protein
BN424_1245  11      27       -4.557812  hypothetical protein
BN424_1246  8       26       -4.962536  hypothetical protein
BN424_1247  6       24       -5.161431  putative uncharacterized protein
19  BN424_1270  BN424_1275  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_1270  2       15       -0.730682  uncharacterized ABC transporter solute-binding
BN424_1271  5       20       0.754804   conserved hypothetical protein
BN424_1272  5       22       0.528775   putative membrane protein
BN424_1273  6       25       0.840350   hypothetical protein
BN424_1274  5       19       0.277035   ribosomal protein S4
BN424_1275  2       12       0.634908   transposase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_1270  FERRIBNDNGPP  49     7e-09    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 49.2 bits (117), Expect = 7e-09
Identities = 39/185 (21%), Positives = 72/185 (38%), Gaps = 25/185 (13%)

Query: 57 NPERVVVFDMGMLDTIDALGESDAVVGVA----------KDSLPKYLSKFDSDKVESAGG 106
+P R+V + ++ + ALG GVA + LP D V G
Sbjct: 34 DPNRIVALEWLPVELLLALGIVP--YGVADTINYRLWVSEPPLP--------DSVIDVGL 83

Query: 107 IKEPDFEKINALKPDLIIISGRQSDSLDELKKIAPTLSLE--IDSKDLWESINKNVSTIG 164
EP+ E + +KP ++ S S + L +IAP + L + K+++ +
Sbjct: 84 RTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMA-RKSLTEMA 142

Query: 165 TIFDKSDEAKKKLDALSEKIDVLNKKNTGSDMK--TLTVLLNEGSLSAYGKGSRFAILND 222
+ + A+ L + I + + + LT L++ + +G S F + D
Sbjct: 143 DLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILD 202

Query: 223 VFGFP 227
+G P
Sbjct: 203 EYGIP 207


20  BN424_1463  BN424_1491  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_1463  2       16       1.119722   ribulose-phosphate 3-epimerase
BN424_1464  2       15       2.442192   thiamine pyrophosphokinase
BN424_1465  0       13       2.585399   ribosomal protein L28
BN424_1466  -1      12       2.343454   conserved hypothetical protein
BN424_1467  -1      12       2.252957   conserved hypothetical protein
BN424_1468  -1      14       2.419150   L-serine dehydratase, iron-sulfur-dependent,beta
BN424_1469  1       13       1.873806   L-serine dehydratase,
BN424_1470  1       13       1.142506   ATP-dependent DNA helicase RecG
BN424_1471  1       13       0.628382   fatty acid/phospholipid synthesis protein PlsX
BN424_1472  2       13       0.382156   acyl carrier protein
BN424_1473  1       12       0.896256   ribonuclease III
BN424_1474  2       12       1.066398   chromosome segregation protein SMC
BN424_1475  0       14       1.363159   phosphatase YidA
BN424_1476  2       15       1.195626   signal recognition particle-docking protein
BN424_1477  2       15       1.987095   helix-turn-helix, YlxM / p13 like family
BN424_1478  3       16       3.007392   signal recognition particle protein
BN424_1479  4       18       2.338179   ribosomal protein S16
BN424_1480  4       17       0.502756   UPF0109 protein ylqC
BN424_1481  2       15       -0.257794  16S rRNA processing protein RimM
BN424_1482  1       16       -0.288200  tRNA (guanine-N1)-methyltransferase
BN424_1483  1       23       -1.589771  ribosomal protein L19
BN424_1484  1       21       -2.594519  phage integrase family protein
BN424_1485  0       21       -2.175304  putative membrane protein
BN424_1486  1       20       -1.014681  phage lysozyme family protein
BN424_1487  5       23       -1.208602  phage lysozyme family protein
BN424_1488  5       23       -1.659779  csbD-like family protein
BN424_1489  3       20       -0.646372  putative uncharacterized protein
BN424_1490  1       13       0.277793   hypothetical protein
BN424_1491  2       13       -0.262857  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_1470  SECA          31     0.023    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 30.6 bits (69), Expect = 0.023
Identities = 27/99 (27%), Positives = 41/99 (41%), Gaps = 10/99 (10%)

Query: 261 AQKRVVNEICSDLRQSLHMHRLLQGDV-----GSGKTIVAAIALFATVNAGFQGALMVPT 315
A KRV D+ Q L L + + G GKT+ A + A +NA + V T
Sbjct: 74 ASKRVFGMRHFDV-QLLGGMVLNERCIAEMRTGEGKTLTATLP--AYLNALTGKGVHVVT 130

Query: 316 --GILAEQHMESLDQLFDPLEVKVALLTGATKTKERREI 352
LA++ E+ LF+ L + V + +RE
Sbjct: 131 VNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREA 169


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_1474  IGASERPTASE   51     2e-08    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 51.2 bits (122), Expect = 2e-08
Identities = 50/318 (15%), Positives = 98/318 (30%), Gaps = 32/318 (10%)

Query: 165 AGVLKYKTRKKKAEQKLFETE-DNLNRVQDIVYELEDQIEPLREQSSIAKDYVMQKEQLS 223
G KYK R L+ E + N+ D I +Q + S
Sbjct: 964 LGAWKYKLRNVNGRYDLYNPEVEKRNQTVD-----TTNITTPNN---------IQADVPS 1009

Query: 224 EVEIALTVVEVEMLKEKWLANKNQAETLATEISEARQELQTAETTVADLREKRQKMDAQL 283
+ V+ A +ET T ++QE +T E D E +
Sbjct: 1010 VPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVA 1069

Query: 284 DESQARLVELVKTYEQTEAQKKVLSERSKNTKENREQFEQSKAKLEVKIQELDQQLADLT 343
E+++ + +T E ++ + ++ TKE ++ KAK+E E Q++ +T
Sbjct: 1070 KEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVET---EKTQEVPKVT 1126

Query: 344 KDLTEKQAHERELRGTLAAAEKEQKMFNQNSSVTVESLRDDYVDLMQKQTTLRNEQGYLE 403
++ KQ ++ A + N + Q QT +
Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDPTVN--------------IKEPQSQTNTTADTEQPA 1172

Query: 404 KTFLQASQKNMKSDATVRALESDSALAKAKVQEKQLELSTVQKNLATKLLGHQEIQADLQ 463
K ++ + TV S + + + K + +++
Sbjct: 1173 KETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPH 1232

Query: 464 KNRYDLETNEQKMYEALR 481
++ + AL
Sbjct: 1233 NVEPATTSSNDRSTVALC 1250


21  BN424_1618  BN424_1646  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_1618  1       17       -4.258904  linear amide C-N hydrolases, choloylglycine
BN424_1619  2       17       -3.488712  hypothetical protein
BN424_1620  1       17       -2.521667  transposase, Mutator family protein
BN424_1621  4       18       -2.721167  hypothetical protein
BN424_1622  3       17       -2.559115  hypothetical protein
BN424_1623  1       16       -3.711150  conserved hypothetical protein
BN424_1624  2       15       -3.178800  protein
BN424_1625  0       16       -4.113868  hypothetical protein
BN424_1626  0       16       -3.150817  putative uncharacterized protein
BN424_1627  0       14       -2.215963  LPXTG-motif cell wall anchor domain protein
BN424_1628  0       14       -2.100802  mga helix-turn-helix domain protein
BN424_1629  -1      13       -0.450965  hypothetical protein
BN424_1630  -1      14       -0.535860  V-type ATPase, D subunit
BN424_1631  -1      14       0.078280   V-type sodium ATPase subunit B
BN424_1632  0       12       -1.269477  V-type sodium ATPase catalytic subunit A
BN424_1633  2       16       -2.936554  ATP synthase (F/14-kDa) subunit
BN424_1634  1       16       -3.330983  ATP synthase (C/AC39) subunit
BN424_1635  1       16       -1.521298  hypothetical protein
BN424_1636  0       14       -0.697807  V-type sodium ATPase subunit K
BN424_1637  0       16       -1.035888  V-type ATPase 116kDa subunit
BN424_1638  2       18       -0.890035  hypothetical protein
BN424_1639  1       17       -0.492965  transcriptional regulator, TetR family protein
BN424_1640  0       15       0.525500   transposase DDE domain protein
BN424_1641  -1      11       -1.221815  antisigma-factor antagonist, STAS
BN424_1642  -2      14       -1.688974  universal stress family protein
BN424_1643  -1      14       -2.117011  helix-turn-helix family protein
BN424_1644  -1      14       -2.489258  putative uncharacterized protein
BN424_1645  0       14       -2.761943  pyridine nucleotide-disulphide oxidoreductase
BN424_1646  0       15       -3.370977  arsenical-resistance protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_1618  BLACTAMASEA   29     0.032    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 28.6 bits (64), Expect = 0.032
Identities = 10/49 (20%), Positives = 20/49 (40%), Gaps = 17/49 (34%)

Query: 106 LVSWLLGNIASIDELKKEASNINVVSAKNNLLDVVVPLHWIVADQTGSS 154
L+ W++ + + L+ V+P W +AD+TG+
Sbjct: 203 LLQWMVDDRVA-----------------GPLIRSVLPAGWFIADKTGAG 234


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_1623  PREPILNPTASE  33     0.002    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 32.9 bits (75), Expect = 0.002
Identities = 8/34 (23%), Positives = 16/34 (47%)

Query: 282 IVALLIFLLIASCCIVIIGLLIRRKKKIEREKKL 315
AL I LL++S +G+ + + + K +
Sbjct: 228 WQALPIVLLLSSLVGAFMGIGLILLRNHHQSKPI 261


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_1628  PF05043       51     4e-09    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 51.5 bits (123), Expect = 4e-09
Identities = 63/344 (18%), Positives = 141/344 (40%), Gaps = 19/344 (5%)

Query: 1 MNFFFNRRTLRKILAFYFIVENNGNVDLNNVSEYLKCTVRTTKDVLDELENDVKQWDSEV 60
M ++++ R++ + E+ + ++E L CT R KD L + K ++
Sbjct: 1 MRDLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHV----KSAFPDL 56

Query: 61 YLQQKSDGYLYVYIPSDLSIHGIYLYYLEDNINFNWMKQYFFENTVEINEYALDHYVSYS 120
++G + + D I +Y ++ + + +F+ ++ FF + + Y+S S
Sbjct: 57 IFHSSTNG-IRIINTDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSS 115

Query: 121 TMYKNMKVIDRNILSRYKLQFNTNVNATILGDEKQKRLFFFDFFWYSYSGLKWPFYNVEK 180
++Y+ + I++ I +++ + + I+G+E+ R FF +F Y L+WPF N
Sbjct: 116 SLYRIISQINKVIKRQFQFEVSLTPV-QIIGNERDIRYFFAQYFSEKYYFLEWPFENFSS 174

Query: 181 KKFDIFFKYIEKIRVRNIGLSEKEILRYYFAIIFHRIEIGETC--DESILKNQLLDDSKH 238
+ + + K + LS +L+ +RI+ G D+ +Q LD
Sbjct: 175 EPLSQLLELVYKETSFPMNLSTHRMLKLLLVTNLYRIKFGHFMEVDKDSFNDQSLDFLMQ 234

Query: 239 YTILKESLFPIYKSIFPKLMKKDIESEIAFLYLTIFGLEFHLEDNAIVSEMLVYVQTHDI 298
++ + +S + E + L+++ + + + E L
Sbjct: 235 AEGIEG----VAQSFESEYNISLDEEVVCQLFVS------YFQKMFFIDESLFMKCVKKD 284

Query: 299 NVVEYTNFWMKEFFLYFDIKLNAREYSILYSNLIHIHSRASLFE 342
+ VE + + +F +K E + + H+H+ A L+
Sbjct: 285 SYVEKSYHLLSDFIDQISVKYQI-EIENKDNLIWHLHNTAHLYR 327


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_1642  TYPE3OMBPROT  28     0.016    Type III secretion system outer membrane B protein ...
		>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature.

Length = 538

Score = 27.7 bits (61), Expect = 0.016
Identities = 22/81 (27%), Positives = 42/81 (51%), Gaps = 6/81 (7%)

Query: 53 KNYEERKNVRIDEVKQQLAGLVDTFEVSILIGDPASEIIKHVKKNNYDLLIMGSRGLNIL 112
K+ ++R ++ E+K+++ +T + S L +SE +K + ++M S + I
Sbjct: 440 KSGKDRTGMQDAEIKREIIRKHETGQFSQLNSKLSSE-----EKRLFSTILMNSGNMEI- 493

Query: 113 QEFVMGSVSHKVMKYVPIPVL 133
QE G +KVMK +P+ L
Sbjct: 494 QEMNTGVPGNKVMKKLPLSSL 514


22  BN424_1656  BN424_1675  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_1656  2       12       -0.439598  putative uncharacterized domain protein
BN424_1657  3       12       -0.773275  helix-turn-helix family protein
BN424_1658  1       12       0.005549   metal ion transporter, metal ion family protein
BN424_1659  1       14       -1.144138  aminoglycoside 6-adenylyltransferase
BN424_1660  1       13       -1.566812  putative membrane protein
BN424_1661  1       13       -0.961432  putative membrane protein
BN424_1662  -2      12       -0.715341  LD-carboxypeptidase family protein
BN424_1663  1       21       3.371897   endoribonuclease L-PSP family protein
BN424_1664  0       24       3.807482   uncharacterized isochorismatase family protein
BN424_1665  1       25       3.861686   SNARE associated Golgi family protein
BN424_1666  0       24       4.638101   intracellular protease, PfpI family protein
BN424_1667  1       23       3.791940   regulatory protein msrR
BN424_1668  0       23       3.983207   transposase DDE domain protein
BN424_1669  3       12       -3.021560  hypothetical protein
BN424_1670  2       13       -3.687637  marR family protein
BN424_1671  2       14       -4.736355  ABC transporter family protein
BN424_1672  1       13       -6.603000  branched-chain amino acid transport system /
BN424_1673  1       15       -7.593933  ABC transporter substrate binding family
BN424_1674  2       15       -7.976642  hypothetical protein
BN424_1675  -1      13       -3.897894  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_1664  ISCHRISMTASE  54     4e-11    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 53.9 bits (129), Expect = 4e-11
Identities = 48/191 (25%), Positives = 84/191 (43%), Gaps = 21/191 (10%)

Query: 3 DKSHSVLLVVDVQKAFNDVSWGERSNQTAE--SHIAELITLFRQNEIDVIHIKHQSN-NP 59
D + +VLL+ D+Q F D ++ ++ E ++I +L Q I V++ + NP
Sbjct: 27 DPNRAVLLIHDMQNYFVD-AFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQNP 85

Query: 60 E-----SLFY-------PEHITSEFKSEATPLKNELILTKTVNSAFIGTNLEEILHEKGI 107
+ + F+ P + +E P ++L+LTK SAF TNL E++ ++G
Sbjct: 86 DDRALLTDFWGPGLNSGPYE--EKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGR 143

Query: 108 TKLYIVGLTTPHCISTTTRMAANLGFKCYLVEDATASFELIGH-TGIKYSANEVQELTVV 166
+L I G+ T A K + V DA A F L H ++Y+A V
Sbjct: 144 DQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCA--FTV 201

Query: 167 TLNEEFAEILS 177
+ ++ +
Sbjct: 202 MTDSLLDQLQN 212


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_1670  TETREPRESSOR  28     0.009    Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 28.3 bits (63), Expect = 0.009
Identities = 13/38 (34%), Positives = 23/38 (60%)

Query: 26 LKQHRINEAQLKILTELKIEALSLKKLALSLTTDKSTL 63
L + + +A L++L E I+ L+ +KLA L ++ TL
Sbjct: 4 LNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTL 41


23  BN424_1690  BN424_1701  Y  N  N  Genomic Island
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_1690  2       14       -0.894250  putative uncharacterized protein
BN424_1691  1       11       -1.271208  putative uncharacterized protein
BN424_1692  1       11       -1.679625  conserved hypothetical protein
BN424_1693  1       11       -1.824149  helix-turn-helix family protein
BN424_1694  -1      12       0.017767   lin0939 protein
BN424_1695  0       12       -0.472378  chaperone/heat shock domain protein
BN424_1696  0       13       -0.304401  chaperone/heat shock protein
BN424_1697  3       14       0.952656   glycerophosphodiester phosphodiesterase domain
BN424_1698  3       13       1.196841   hypothetical protein
BN424_1699  4       13       1.490176   ZIP Zinc transporter family protein
BN424_1700  3       12       0.681766   ABC transporter family protein
BN424_1701  2       13       0.928996   fecCD transport family protein
24  BN424_1812  BN424_1823  Y  N  N  Genomic Island
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_1812  2       12       0.041225   PTS system, mannitol-specific IIC component
BN424_1813  0       16       -2.427881  hypothetical protein
BN424_1814  1       16       -2.720722  LD-carboxypeptidase family protein
BN424_1815  2       17       -5.007782  hypothetical protein
BN424_1816  1       15       -5.412482  hypothetical protein
BN424_1817  2       15       -4.957214  hypothetical protein
BN424_1818  2       16       -4.515333  hypothetical protein
BN424_1819  4       20       -4.849714  putative lipoprotein
BN424_1820  1       21       -4.119676  putative membrane protein
BN424_1821  1       24       -3.783227  hypothetical protein
BN424_1822  2       23       -3.913306  hypothetical protein
BN424_1823  2       22       -4.220317  helix-turn-helix family protein
25  BN424_1850  BN424_1864  Y  N  Y  Genomic Island
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_1850  2       16       0.038210   conserved hypothetical protein
BN424_1851  5       18       1.079803   putative uncharacterized protein
BN424_1852  6       20       0.597174   conserved hypothetical protein
BN424_1853  6       21       1.577203   lysM domain protein
BN424_1854  6       22       1.477463   hypothetical protein
BN424_1855  4       20       -0.322748  phage head morphogenesis, SPP1 gp7 family domain
BN424_1856  3       19       0.000465   transposase DDE domain protein
BN424_1857  1       18       -3.722584  hypothetical protein
BN424_1858  0       18       -3.398579  conserved hypothetical protein
BN424_1859  3       17       -7.227979  putative uncharacterized protein
BN424_1860  3       19       -8.006227  hypothetical protein
BN424_1861  3       15       -7.158264  hypothetical protein
BN424_1862  3       16       -7.026163  hypothetical protein
BN424_1863  3       20       -5.060395  putative uncharacterized protein
BN424_1864  3       19       -4.653991  hypothetical protein
26  BN424_1882  BN424_1890  Y  N  Y  Genomic Island
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_1882  4       19       -2.190260  hypothetical protein
BN424_1883  2       25       -2.543753  hypothetical protein
BN424_1884  5       23       -4.232166  hypothetical protein
BN424_1885  8       20       -0.326374  hypothetical protein
BN424_1886  6       19       -0.744927  hypothetical protein
BN424_1887  7       20       -2.190305  hypothetical protein
BN424_1888  7       20       -2.163530  hypothetical protein
BN424_1889  5       19       -2.600690  helix-turn-helix family protein
BN424_1890  2       14       -0.867343  transposase DDE domain protein
27  BN424_2145  BN424_2173  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_2145  2       22       -0.381198  terminase-like family protein
BN424_2146  5       23       -2.514852  putative uncharacterized protein
BN424_2147  4       22       -4.154735  hypothetical protein
BN424_2148  5       22       -3.298743  hypothetical protein
BN424_2149  7       21       -2.636537  sigma-70, region 4 family protein
BN424_2150  5       23       -3.263444  hypothetical protein
BN424_2151  2       21       -2.874095  hypothetical protein
BN424_2152  2       23       -1.271590  hypothetical protein
BN424_2153  1       23       -0.422207  ASCH domain protein
BN424_2154  1       24       -0.442742  hypothetical protein
BN424_2155  2       24       0.443846   hypothetical protein
BN424_2156  2       25       0.942529   hypothetical protein
BN424_2157  1       25       0.691771   yopX family protein
BN424_2158  2       23       -0.019112  ORF24 domain protein
BN424_2159  3       21       -0.545772  hypothetical protein
BN424_2160  3       21       -0.570148  hypothetical protein
BN424_2161  2       21       -0.916741  dnaD and phage-associated domain protein
BN424_2162  2       18       -1.666980  putative uncharacterized protein
BN424_2163  4       18       -1.503369  phage recombination protein Bet
BN424_2164  5       19       -1.535204  putative uncharacterized domain protein
BN424_2165  1       23       -3.202194  hypothetical protein
BN424_2166  2       24       -4.206164  hypothetical protein
BN424_2167  5       24       -4.100808  hypothetical protein
BN424_2168  2       23       -4.356218  hypothetical protein
BN424_2169  1       19       -5.764843  hypothetical protein
BN424_2170  3       21       -5.826495  hypothetical protein
BN424_2171  3       21       -5.871717  helix-turn-helix family protein
BN424_2172  3       21       -5.997184  helix-turn-helix family protein
BN424_2173  1       21       -5.032093  conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_2164  CHANLCOLICIN  32     0.011    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 31.6 bits (71), Expect = 0.011
Identities = 36/245 (14%), Positives = 80/245 (32%), Gaps = 31/245 (12%)

Query: 300 LQSGKQIVYEQAMSAQNEVLEEEKRISSLKTDIESEKSYISRLQQEKESLRNKYISIRDN 359
L ++ ++A +A+ E E+R + +IE EK+ R + E+ + ++ +
Sbjct: 132 LAKAEEKARKEAEAAEKAFQEAEQR----RKEIEREKAETERQLKLAEAEEKRLAALSE- 186

Query: 360 NFPDFDEHKTTCQFCNQDLPVEQQATIKETYQKEREAFNLNRASELEQINEQGTSLSKEE 419
E K VE Q E + + +++ + E
Sbjct: 187 ------EAKA----------VEIAQKKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEM 230

Query: 420 EVHEELLQDLKNQAADTTELDKRIKVRDDLKSQHTAIIKQIEAIQNNATPFTETEQYKKK 479
+ +L +A ELD+ +K + EA + E+ +K+
Sbjct: 231 KTLAGKRNELAQASAKYKELDELVKKLSPRANDPLQNRPFFEATRRRVGAGKIREEKQKQ 290

Query: 480 ITEIEAIKQEIITIQTGDDKELQSQKEVISNIKLSINQLNEQLYEYVLSEKQEARKQELI 539
+T E I T K + + +++ +E+ + Q +
Sbjct: 291 VTASETRINRINADITQIQKAISQVSNNRNAGIARVHE----------AEENLKKAQNNL 340

Query: 540 EEEKL 544
++
Sbjct: 341 LNSQI 345


28  BN424_2198  BN424_2209  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_2198  0       17       -4.525017  hypothetical protein
BN424_2199  0       13       -3.991481  putative uncharacterized protein
BN424_2200  -1      13       -3.674133  hypothetical protein
BN424_2201  0       12       -3.305963  hypothetical protein
BN424_2202  1       11       -3.037628  prepilin-type N-terminal cleavage/methylation
BN424_2203  0       12       -2.542468  bacterial type II secretion system F domain
BN424_2204  0       14       -2.023645  type II/IV secretion system family protein
BN424_2205  0       16       -2.198820  bacterial regulatory helix-turn-helix, lysR
BN424_2206  -1      13       -0.786466  conserved hypothetical protein
BN424_2207  -1      12       -0.948961  conserved hypothetical protein
BN424_2208  0       17       -1.022584  acetyltransferase family protein
BN424_2209  2       17       -1.680800  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_2201  BCTERIALGSPH  34     7e-05    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 34.2 bits (78), Expect = 7e-05
Identities = 20/80 (25%), Positives = 35/80 (43%), Gaps = 14/80 (17%)

Query: 5 RGFTLIETILILSII-----MVLFGLPTVIANKTYEKVQKMLFFEAFQSHLLATQNYALL 59
RGFTL+E +LIL ++ MVL P + + + + F++ L Q L
Sbjct: 4 RGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLAR------FEAQLRFVQQRGLQ 57

Query: 60 ANKKTSLTIFKSGTVRYQVL 79
+ +++ R+Q L
Sbjct: 58 TGQFFGVSVHPD---RWQFL 74


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_2202  BCTERIALGSPG  46     9e-10    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 46.4 bits (110), Expect = 9e-10
Identities = 21/69 (30%), Positives = 41/69 (59%), Gaps = 2/69 (2%)

Query: 16 KGFTLVEMILVLFVISVLLILVIPNVVQQKKKIDNQGTEALMTVIETQIELFLLE--KEP 73
+GFTL+E+++V+ +I VL LV+PN++ K+K D Q + + +E ++++ L+ P
Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHYP 67

Query: 74 GIEVSFAAL 82
+L
Sbjct: 68 TTNQGLESL 76


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_2203  BCTERIALGSPF  74     7e-17    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 74.5 bits (183), Expect = 7e-17
Identities = 57/344 (16%), Positives = 140/344 (40%), Gaps = 11/344 (3%)

Query: 16 KKIKETQQSFFLTKLAVLVAEGFSLKESLLFLKIML--PKQAVWLNQALNQLEEGQEFFQ 73
++ + + +LA LVA L+E+L + P + + +++ EG
Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122

Query: 74 VLNQLG--FSERISSQVYLAQIHGQFSQVLADSGAFLEANGKRKKKLKQLLQYPMLLVIF 131
+ F + V + G VL + E + + +++Q + YP +L +
Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182

Query: 132 MFGILFGIRLLLLPHFNDLVQQNGS----FTSLVSGIAIGLIYYFPYVFMGLLFSLLIIK 187
++ + +++P + T ++ G++ + + P++ + LL + +
Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242

Query: 188 ISLTNYFKKQTAIAKLNFIVALPFFGNLIKLYYTYYFSYEWAQLVKSGYSMLRIIEVMKA 247
+ L +++ ++ ++ LP G + + T ++ + L S +L+ + +
Sbjct: 243 VMLR---QEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGD 299

Query: 248 KETTKIMQEVANEMEKGMKNGIGLHVVMKQLPFLKTELGAIIFHGELTSQLASELNLYGQ 307
+ + + ++ G+ LH ++Q + +I GE + +L S L
Sbjct: 300 VMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAAD 359

Query: 308 ICQNEFVQKIEKFMGWIQPLVFILVAFFILCIYLALLLPMFTMM 351
EF ++ +G +PL+ + +A +L I LA+L P+ +
Sbjct: 360 NQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLN 403


29  BN424_2246  BN424_2255  Y  N  Y  Genomic Island
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_2246  3       28       3.290409   hypothetical protein
BN424_2247  3       25       3.096194   excinuclease ABC, C subunit
BN424_2248  4       35       5.339629   hypothetical protein
BN424_2249  1       18       3.786539   hypothetical protein
BN424_2250  0       18       3.583465   transposase, IS4 domain protein
BN424_2251  -1      13       2.036360   transposase, IS4
BN424_2252  -3      11       0.281563   alpha/beta hydrolase fold family protein
BN424_2253  -1      11       0.680253   thioredoxin
BN424_2254  -1      10       0.632586   mutS2 family protein
BN424_2255  3       13       -1.068361  hypothetical protein
30  BN424_2315  BN424_2355  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_2315  3       18       -4.237487  hypothetical protein
BN424_2316  4       16       -3.821241  lin0431 protein
BN424_2317  4       16       -4.084745  sortase family protein
BN424_2318  0       16       -2.763138  HAMP domain protein
BN424_2319  0       15       -2.114353  response regulator
BN424_2320  1       15       -2.537449  hypothetical protein
BN424_2321  0       16       -3.090645  multicopper oxidase mco
BN424_2322  -1      15       -3.363033  heavy-metal-associated domain protein
BN424_2323  -1      14       -3.417448  copper-translocating P-type ATPase
BN424_2324  0       15       -5.509563  conserved hypothetical protein
BN424_2325  1       16       -6.355652  glutamate-cysteine ligase family protein
BN424_2326  1       17       -5.169192  conserved hypothetical protein
BN424_2327  4       19       -3.810442  prolipoprotein diacylglyceryl transferase
BN424_2328  1       17       -1.804271  hypothetical protein
BN424_2329  1       17       -1.861003  protein
BN424_2330  0       18       -1.688824  hypothetical protein
BN424_2331  1       19       -0.586144  peptidase M23 family protein
BN424_2332  0       19       -1.102383  copper-exporting P-type ATPase B domain protein
BN424_2333  1       19       -1.308023  copper-translocating P-type ATPase
BN424_2334  4       21       -2.787219  hypothetical protein
BN424_2335  4       20       -1.869650  resolvase, N terminal domain protein
BN424_2336  3       19       -2.441142  transposase family protein
BN424_2337  3       19       -2.526954  hypothetical protein
BN424_2338  3       19       -2.852118  permease family protein
BN424_2339  0       18       -1.812363  ABC transporter family protein
BN424_2340  -1      19       -1.888874  cclF
BN424_2341  1       20       -4.420958  putative membrane protein
BN424_2342  -1      20       -4.767952  circular bacteriocin, circularin A/uberolysin
BN424_2343  1       17       -6.134974  hypothetical protein
BN424_2344  -1      17       -5.768655  transposase, Mutator family protein
BN424_2345  7       17       -6.767791  hypothetical protein
BN424_2346  3       17       -5.341642  conserved hypothetical protein
BN424_2347  1       16       -3.731184  ABC transporter family protein
BN424_2348  1       16       -2.436519  hypothetical protein
BN424_2349  1       17       -1.863716  transcriptional regulator, ArsR family
BN424_2350  2       16       -1.854030  hypothetical protein
BN424_2351  1       15       -2.399907  putative uncharacterized protein EP0023
BN424_2352  1       16       -2.614421  putative membrane protein
BN424_2353  3       16       -3.023880  EF0021
BN424_2354  3       16       -4.741438  hypothetical protein
BN424_2355  2       15       -3.350633  putative uncharacterized domain protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
BN424_2318  PF06580  29     0.035    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.1 bits (65), Expect = 0.035
Identities = 19/102 (18%), Positives = 34/102 (33%), Gaps = 25/102 (24%)

Query: 364 LLANAIK--FTQKY--GEISISLNKVGNNVEIKVKDNGIGINEEEVEHIFDRFYMADPAR 419
L+ N IK Q G+I + K V ++V++ G + E
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE------------- 309

Query: 420 SSSQGGQGIGLAIVKSIVEAHSG---SIRVESNIGTGSCFFI 458
G GL V+ ++ G I++ G + +
Sbjct: 310 -----STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVL 346


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score  E-value  Comments
BN424_2319  HTHFIS  92     9e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 9e-24
Identities = 30/117 (25%), Positives = 56/117 (47%), Gaps = 1/117 (0%)

Query: 2 KILIVDDEPKILEIVDAYLISKNYSVYKATSGKEALEKYHFISPDLVILDLMLPDISGLD 61
IL+ DD+ I +++ L Y V ++ DLV+ D+++PD + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VCETIRKE-TETPIIMLTAKSGEEDILKGLALGADDYIVKPFSPKELVARVETVLRR 117
+ I+K + P+++++A++ +K GA DY+ KPF EL+ + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits       Score  E-value  Comments
BN424_2340  RTXTOXIND  31     0.012    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.6 bits (69), Expect = 0.012
Identities = 17/157 (10%), Positives = 45/157 (28%), Gaps = 5/157 (3%)

Query: 64 GMAIPQKETKTYLDPTLGNLNELYVTEGQSIDIGTPLISYQDDKIQEQINEQARGIERVK 123
G +K + E+ V EG+S+ G L+ + + + + +
Sbjct: 88 GKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQAR 147

Query: 124 TSIANTQERNGEAQKKKETAISNITTTKAMINQLEQSSEPEDLAKVQQYTQEIAKYEGIV 183
Q + + K + + SE E L ++ + ++
Sbjct: 148 LEQTRYQILSRSIELNKLPELKLPDEPY-----FQNVSEEEVLRLTSLIKEQFSTWQNQK 202

Query: 184 EGQEAQIETLETALQDAQADLTERQAAVDQLQQKVTS 220
+E ++ A + + + ++
Sbjct: 203 YQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDD 239


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
BN424_2352  TYPE4SSCAGX  33     0.004    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 33.2 bits (75), Expect = 0.004
Identities = 27/101 (26%), Positives = 48/101 (47%), Gaps = 3/101 (2%)

Query: 466 PRRMNKMYKQAEKTRKDDKKARK--NDAKNKKKEEQEKNNKKDGGSNKNGLNEQKNRNNK 523
P+ + + K EK ++ ++A+K D + K+KEE+ KN N Q NNK
Sbjct: 138 PKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANLENLTNAMSNPQNLSNNK 197

Query: 524 DKSD-SRKPKENENKDNNHAPDLNKRNKNDTNKPNSNENKK 563
+ S+ ++ +ENE D+ ++ + + K NKK
Sbjct: 198 NLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKK 238


31  BN424_2460  BN424_2493  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_2460  0       10       -3.087914  glycosyl transferases group 1 family protein
BN424_2461  1       10       -3.408757  putative membrane protein
BN424_2462  -1      12       -0.620375  DNA helicase domain protein
BN424_2463  1       20       2.231570   DNA helicase domain protein
BN424_2464  0       22       3.035030   hypothetical protein
BN424_2465  0       16       3.154258   conserved hypothetical protein
BN424_2466  1       19       4.092309   hypothetical protein
BN424_2467  1       17       3.747951   transposase
BN424_2468  2       12       2.152697   transposase
BN424_2469  3       13       1.023407   hypothetical protein
BN424_2470  1       9        0.864391   hypothetical protein
BN424_2471  0       10       -0.543946  hypothetical protein
BN424_2472  -1      9        -1.311232  hypothetical protein
BN424_2473  -2      9        -1.238634  hypothetical protein
BN424_2474  -2      9        -1.594721  LPXTG-motif cell wall anchor domain protein
BN424_2475  -3      9        -2.007030  hypothetical protein
BN424_2476  -2      12       -4.037936  mga helix-turn-helix domain protein
BN424_2477  0       13       -2.513093  conserved hypothetical protein
BN424_2478  4       15       -0.086035  enoyl-[acyl-carrier-protein] reductase [NADH]
BN424_2479  2       14       0.112946   hypothetical protein
BN424_2480  0       14       0.666588   putative membrane protein
BN424_2481  2       12       -0.574502  bacterial regulatory, arsR family protein
BN424_2482  0       11       -0.140981  domain of unknown function family protein
BN424_2483  0       10       0.171465   putative uncharacterized protein
BN424_2484  -1      11       -0.360532  glycosyl hydrolase 1 family protein
BN424_2485  1       10       -0.448806  SPBc2 prophage-derived aminoglycoside
BN424_2486  2       11       -0.423445  hypothetical protein
BN424_2487  1       10       1.917646   helix-turn-helix family protein
BN424_2488  2       12       1.821444   putative membrane protein
BN424_2489  3       12       2.438540   hypothetical protein
BN424_2490  3       12       2.526817   tagatose 1,6-diphosphate aldolase
BN424_2491  3       13       2.389324   1-phosphofructokinase
BN424_2492  3       12       1.952272   N-acetylglucosamine-6-phosphate deacetylase
BN424_2493  3       13       0.957917   PTS system mannose/fructose/sorbose IID
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_2463  FERRIBNDNGPP  29     0.029    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 29.1 bits (65), Expect = 0.029
Identities = 17/49 (34%), Positives = 20/49 (40%), Gaps = 6/49 (12%)

Query: 256 LDSNQMLIFSPNKLFNHYISNVLPELGEKNMIQ--TTFLDFAQSRISGL 302
+D ML+F PN LF +L E G N Q T F I L
Sbjct: 183 IDPRHMLVFGPNSLF----QEILDEYGIPNAWQGETNFWGSTAVSIDRL 227


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
BN424_2476  PF05043  60     9e-12    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 60.0 bits (145), Expect = 9e-12
Identities = 50/228 (21%), Positives = 92/228 (40%), Gaps = 14/228 (6%)

Query: 1 MNVFLERISLRKMYLLSLLDSEKRGFSIKELEQKLGHNSKTITKMVQSLKIELAPWQNSI 60
M L + S R++ LL LL KR F EL + L + + + +K
Sbjct: 1 MRDLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAF-----PD 55

Query: 61 TLVTNNDRTLSLKKKASFSLETINLYYLKESFIFKACDAIFNEEFIDIATFSSANYISYS 120
+ ++ + + +E + ++ K S F + IF E + YIS S
Sbjct: 56 LIFHSSTNGIRIINTDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSS 115

Query: 121 TLYGRLNEIKPLLEH-YSIEFKANNMASFEGEEKQIRYFFYHFYWSTHWGMEWPFQKIDK 179
+LY +++I +++ + E + G E+ IRYFF ++ ++ +EWPF+
Sbjct: 116 SLYRIISQINKVIKRQFQFEVSLTPV-QIIGNERDIRYFFAQYFSEKYYFLEWPFENFSS 174

Query: 180 N---QFCEIIKRIEGLRKTTTLYISEQESIAFWLGVITTRINLGHTIE 224
Q E++ + + +S + L RI GH +E
Sbjct: 175 EPLSQLLELVYKETSFP----MNLSTHRMLKLLLVTNLYRIKFGHFME 218


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_2478  DHBDHDRGNASE  53     2e-10    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 53.1 bits (127), Expect = 2e-10
Identities = 60/251 (23%), Positives = 94/251 (37%), Gaps = 35/251 (13%)

Query: 18 IAWGCAKAMNDCGATVI---YTYQNDRVKKQLEKLVGTEANLVECDVATDEQVEEAFNQI 74
I A+ + GA + Y + K A DV ++E +I
Sbjct: 20 IGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAAIDEITARI 79

Query: 75 HKTYGTIDGLVHSIAFARKEELGGNVFDSTREGFAIAHDISSYSLLLVSRYASKIM--NP 132
+ G ID LV+ R G + + E + ++S + SR SK M
Sbjct: 80 EREMGPIDILVNVAGVLRP----GLIHSLSDEEWEATFSVNSTGVFNASRSVSKYMMDRR 135

Query: 133 GGSIITMTYIGSERA------IANYNIMGLAKASLEAAVRYLALDLAKQDIRVNAISSGA 186
GSI+T +GS A +A Y +KA+ + L L+LA+ +IR N +S G+
Sbjct: 136 SGSIVT---VGSNPAGVPRTSMAAY---ASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 187 IKTL-----------AASGIKGFNALLDEQAARTPSGKQVTTEEVGNTAAFLMSDMSRGI 235
+T A IKG L+ P K ++ + FL+S + I
Sbjct: 190 TETDMQWSLWADENGAEQVIKGS---LETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 236 VGEIIYVDKGT 246
+ VD G
Sbjct: 247 TMHNLCVDGGA 257


32  BN424_2503  BN424_2532  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_2503  0       13       -3.871940  cation diffusion facilitator transporter family
BN424_2504  1       15       -3.905772  hypothetical protein
BN424_2505  2       15       -4.159616  cyclic nucleotide-binding domain protein
BN424_2506  2       14       -3.397070  hypothetical protein
BN424_2507  2       14       -4.089917  putative membrane protein
BN424_2508  2       16       -4.038576  ABC transporter family protein
BN424_2509  4       14       -1.421896  bacterial regulatory s, gntR family protein
BN424_2510  5       14       -1.110485  putative uncharacterized protein
BN424_2511  6       15       -0.451451  LPXTG-motif cell wall anchor domain protein
BN424_2512  7       24       2.195905   entI protein
BN424_2513  6       32       5.956560   hypothetical protein
BN424_2514  7       37       7.464684   transposase DDE domain protein
BN424_2515  5       43       10.681272  putative uncharacterized protein
BN424_2516  4       25       5.989344   putative uncharacterized protein
BN424_2531  3       25       5.524969   *************hypothetical protein
BN424_2532  4       22       4.476134   cell wall-associated hydrolase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
BN424_2508  PF05272  31     0.004    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.2 bits (70), Expect = 0.004
Identities = 11/46 (23%), Positives = 17/46 (36%), Gaps = 12/46 (26%)

Query: 36 GRNGIGKSTLLRSLAQMEPIQKGEILWEGNWLTKADVSFVNLSDYY 81
G GIGKSTL+ +L ++ + + D Y
Sbjct: 603 GTGGIGKSTLINTLVGLD------------FFSDTHFDIGTGKDSY 636


33  BN424_2569  BN424_2575  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_2569  0       17       -3.847753  transcriptional regulator lytR
BN424_2570  2       19       -5.714010  hypothetical protein
BN424_2571  2       19       -5.590939  transposase DDE domain protein
BN424_2572  2       21       -6.170496  UDP-glucose 6-dehydrogenase ywqF
BN424_2573  2       19       -6.205540  polysaccharide biosynthesis family protein
BN424_2574  3       21       -4.894844  putative uncharacterized protein
BN424_2575  1       16       -3.204096  putative membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_2574  ENTEROTOXINA  29     0.024    Heat-labile enterotoxin A chain signature.
		>ENTEROTOXINA#Heat-labile enterotoxin A chain signature.

Length = 258

Score = 29.2 bits (65), Expect = 0.024
Identities = 9/39 (23%), Positives = 18/39 (46%)

Query: 183 GIPPNIMEWSEKSWLKYHIKYCVDNKKYFVYPNCSLSTN 221
G PP+ W E+ W+ + + C ++ + C+ T
Sbjct: 184 GFPPDHQAWREEPWIHHAPQGCGNSSRTITGDTCNEETQ 222


34  BN424_2626  BN424_2661  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_2626  2       28       0.640993   phage minor capsid family protein
BN424_2627  3       30       0.704487   phage terminase, large subunit, PBSX family
BN424_2628  6       31       1.221069   putative uncharacterized protein
BN424_2630  9       31       1.974558   putative uncharacterized protein
BN424_2629  9       31       2.511126   protein
BN424_2631  6       31       2.040150   phage transcriptional regulator, ArpU family
BN424_2632  3       25       1.277579   hypothetical protein
BN424_2633  0       26       0.667364   conserved hypothetical protein
BN424_2634  1       28       0.996809   single-strand binding family protein
BN424_2635  4       26       -0.110544  gp36
BN424_2636  2       25       -0.272245  hypothetical protein
BN424_2637  1       24       -0.590716  yopX family protein
BN424_2638  0       25       -0.471941  DNA methylase family protein
BN424_2639  4       29       0.211967   endodeoxyribonuclease RusA family protein
BN424_2640  4       29       0.394666   hypothetical protein
BN424_2641  2       27       0.273873   hypothetical protein
BN424_2642  3       27       0.487887   hypothetical protein
BN424_2643  4       27       0.095901   conserved hypothetical protein
BN424_2644  5       26       -0.545543  conserved phage C-terminus family protein
BN424_2645  4       25       -0.711017  conserved hypothetical protein
BN424_2646  3       21       -2.321774  conserved hypothetical protein
BN424_2647  6       23       -3.491438  hypothetical protein
BN424_2648  5       23       -3.177389  hypothetical protein
BN424_2649  7       23       -2.950821  hypothetical protein
BN424_2650  6       25       -2.881285  hypothetical protein
BN424_2651  5       24       -2.690706  hypothetical protein
BN424_2652  4       24       -3.583789  hypothetical protein
BN424_2653  3       26       -2.048173  hypothetical protein
BN424_2654  4       24       -2.829734  hypothetical protein
BN424_2655  5       25       -2.330050  hypothetical protein
BN424_2656  6       28       -1.834585  hypothetical protein
BN424_2657  6       27       -1.836545  putative uncharacterized protein
BN424_2658  9       29       -0.507412  phage antirepressor KilAC domain protein
BN424_2659  9       26       -2.087582  hypothetical protein
BN424_2660  9       25       -2.071023  hypothetical protein
BN424_2661  4       19       -1.955319  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_2631  BINARYTOXINA  28     0.010    Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 28.5 bits (63), Expect = 0.010
Identities = 25/114 (21%), Positives = 50/114 (43%), Gaps = 7/114 (6%)

Query: 6 PDVDVEATKKNARRVLRQYSRLEREAGKNYSQRLTVEISDMPRGSASIKSTPIEDMVTKK 65
+ ++ KK A RV + LE+EA + Y + + +IS+ + IE
Sbjct: 54 KENAIQWEKKEAERVEKNLDTLEKEALELYKKD-SEQISNYSQTRQYFYDYQIES----- 107

Query: 66 VTAEKKVWEILEAIYLLPRLSKEILWYSYIDKDHWSVTKIARALDYSDKAIEKY 119
EK+ + AI ++ K I Y + + ++ K R + ++ ++EK+
Sbjct: 108 NPREKEYKNLRNAI-SKNKIDKPINVYYFESPEKFAFNKEIRTENQNEISLEKF 160


35  BN424_2747  BN424_2797  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_2747  -1      11       -3.323432  bacterial regulatory s, tetR family protein
BN424_2748  0       11       -3.726995  sulfatase family protein
BN424_2749  0       15       -5.038480  glycosyl transferase 2 family protein
BN424_2750  1       16       -5.402339  gtrA-like family protein
BN424_2751  1       15       -4.659899  NH(3)-dependent NAD(+) synthetase
BN424_2752  2       15       -5.039280  nicotinate phosphoribosyltransferase family
BN424_2753  1       15       -5.210882  glycosyl transferase 2 family protein
BN424_2754  2       16       -5.392868  glycosyl transferases group 1 family protein
BN424_2755  3       17       -5.100865  dTDP-4-dehydrorhamnose reductase
BN424_2756  4       16       -5.527564  dTDP-glucose 4,6-dehydratase
BN424_2757  1       17       -6.747631  dTDP-4-dehydrorhamnose 3,5-epimerase
BN424_2758  2       17       -7.158750  glucose-1-phosphate thymidylyltransferase
BN424_2759  2       20       -8.203455  glycosyl transferases group 1 family protein
BN424_2760  3       19       -8.080510  polysaccharide biosynthesis family protein
BN424_2761  2       15       -6.924177  glycosyl transferase 2 family protein
BN424_2762  3       15       -5.823372  glycosyl transferase 2 family protein
BN424_2763  2       14       -4.689994  hypothetical protein
BN424_2764  2       14       -5.237875  chain length determinant family protein
BN424_2765  2       14       -4.864221  glycosyl transferase 2 family protein
BN424_2766  -1      14       -3.412383  sulfatase family protein
BN424_2767  -1      14       -2.868678  mannosyl-glycoendo-beta-N-acetylglucosaminidase
BN424_2768  -2      14       -1.599995  UDP-glucose 4-epimerase
BN424_2769  -2      16       -2.275614  putative membrane protein
BN424_2770  1       16       0.351452   linear amide C-N hydrolases, choloylglycine
BN424_2771  2       14       0.836377   chaperonin GroL
BN424_2772  0       14       0.035437   10 kDa chaperonin
BN424_2773  -2      13       0.990418   CAAX amino terminal protease family protein
BN424_2774  -2      14       1.381710   putative membrane protein
BN424_2775  -1      14       1.577086   DNA-binding N-terminus family protein
BN424_2776  -1      16       2.351964   ABC transporter family protein
BN424_2777  -2      17       3.047513   transcriptional regulator lytR
BN424_2778  -3      16       2.565275   metalloendopeptidase, glycoprotease family
BN424_2779  -1      17       0.742705   ribosomal-protein-alanine acetyltransferase
BN424_2780  -1      14       0.176023   ribosomal-protein-alanine acetyltransferase
BN424_2781  1       13       -3.064980  glycoprotease family protein
BN424_2782  3       13       -4.702089  csbD-like family protein
BN424_2783  3       13       -5.272504  3-demethylubiquinone-9 3-methyltransferase
BN424_2784  2       13       -4.938034  hypothetical protein
BN424_2785  0       12       -4.293648  hypothetical protein
BN424_2786  -1      12       -3.164713  mga helix-turn-helix domain protein
BN424_2787  0       15       -1.360063  hypothetical protein
BN424_2788  1       15       -1.283844  mga helix-turn-helix domain protein
BN424_2789  2       15       2.345736   sortase family protein
BN424_2790  1       11       2.565895   conserved hypothetical protein
BN424_2791  1       12       2.986310   cell surface protein with WxL domain
BN424_2792  0       11       2.649658   cell surface protein with WxL domain
BN424_2793  -1      10       0.395939   LPXTG-motif cell wall anchor domain protein
BN424_2794  -1      10       0.348263   conserved repeat domain protein
BN424_2795  0       12       -1.265017  conserved repeat domain protein
BN424_2796  0       11       -2.669048  PTS family
BN424_2797  0       11       -3.011617  helix-turn-helix family protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
BN424_2747  HTHTETR  66     1e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 66.2 bits (161), Expect = 1e-15
Identities = 30/204 (14%), Positives = 75/204 (36%), Gaps = 20/204 (9%)

Query: 6 KKTDLRILRSKKMIFEAFVKLVKLKGYEAVTIQDIATEAMINRATFYAHFKDKNDLYDEV 65
+KT +++ I + ++L +G + ++ +IA A + R Y HFKDK+DL+ E+
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 66 FSYALDTFTKI---LDSELLENGNQIQINKLEHMITEIFYVIRENRIFYLILTEGNSANS 122
+ + ++ ++ + + L H++ R+ I+
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLEST-VTEERRRLLMEIIFHKCEFVG 121

Query: 123 LKKKVHTLIEQRYAEIFNQLK-----------ITEN-DVEVPIDFIIDYMSSIFISMVHW 170
V E +++++ + + + Y+S + +
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMEN---- 177

Query: 171 WLTSKSDFPPEQMAHLLIKLVGNG 194
WL + F ++ A + ++
Sbjct: 178 WLFAPQSFDLKKEARDYVAILLEM 201


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_2755  NUCEPIMERASE  60     2e-12    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 59.8 bits (145), Expect = 2e-12
Identities = 53/315 (16%), Positives = 97/315 (30%), Gaps = 55/315 (17%)

Query: 1 MKVLITGGNGQLGTELTRLLDEANIDYITTDA------------------------KSMD 36
MK L+TG G +G +++ L EA + D +D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 37 ITDEKIVNQIIQKIKPNIIYHCAAYTAVDKAEDEGKSLNQLINVDGTRYVAKAAEKIG-A 95
+ D + + + ++ AV + + + N+ G + +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADS-NLTGFLNILEGCRHNKIQ 119

Query: 96 TIVYISTDYVFEGNKKENYTVDDSPN-PRNEYGRAKYEGELEIQKYASKYYI----IRTS 150
++Y S+ V+ N+K ++ DDS + P + Y K EL Y+ Y + +R
Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179

Query: 151 WVYGEFG----ANFVYTMQRLAKSNPVL----------TVVSD--------QLGRPTWTR 188
VYG +G A F +T L + + T + D Q P
Sbjct: 180 TVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADT 239

Query: 189 NLAEFMLFITEKKADYGIYHFSNDETCSWYEFASEILKNTDTKIMPISSEEFPQKAKRPQ 248
A Y +Y+ N ++ + + P
Sbjct: 240 QWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLETS 299

Query: 249 HSILDLRKTKELGFK 263
L + +GF
Sbjct: 300 ADTKALY--EVIGFT 312


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_2756  NUCEPIMERASE  174    2e-54    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 174 bits (444), Expect = 2e-54
Identities = 73/341 (21%), Positives = 137/341 (40%), Gaps = 38/341 (11%)

Query: 1 MNVLVTGGAGFIGSNYVHYMLENHPDYNIINLDLLTYAGNIHNLDDV---------IDNP 51
M LVTG AGFIG + +LE + ++ +D N+++ DV + P
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEA--GHQVVGID------NLNDYYDVSLKQARLELLAQP 52

Query: 52 NHVFVEGNICNRELVRNLVKTYGITHFVNFAAESHVDRSILNPEIFVETNIQGTLALLDV 111
F + ++ +RE + +L + V S+ NP + ++N+ G L +L+
Sbjct: 53 GFQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEG 112

Query: 112 AKELSIEKYLQVSTDEVYGSLGAEGYFTEETPLA-PNSPYSASKTGADLLVRAYYETYDM 170
+ I+ L S+ VYG L + F+ + + P S Y+A+K +L+ Y Y +
Sbjct: 113 CRHNKIQHLLYASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL 171

Query: 171 NVNITRCSNNYGPYHFPEKLIPLMISNGMDNKELPIYGDGLNIRDWLHVQDHCQAIDLVL 230
R YGP+ P+ + ++ K + +Y G RD+ ++ D +AI +
Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231

Query: 231 HKGRK------------------GEVYNVGGHNERTNNEIVDIVIEKLGLSRDLIKYVDD 272
VYN+G + + + + + LG+ +
Sbjct: 232 DVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK-KNMLPL 290

Query: 273 RLGHDKRYAIDPTKLETELGWKPKYTFDTGIVETIEWYQAN 313
+ G + D L +G+ P+ T G+ + WY+
Sbjct: 291 QPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_2766  ACRIFLAVINRP  32     0.008    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.7 bits (72), Expect = 0.008
Identities = 37/222 (16%), Positives = 75/222 (33%), Gaps = 31/222 (13%)

Query: 1 MIIFVLIVFLWSLIGEIWITYLLTIIIAFSFGIANMIKINLRFEP------IYPEELKMA 54
+F+ + F G I+ + +TI+ A + + + L P + P +
Sbjct: 450 SAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVL----VALILTPALCATLLKPVSAEHH 505

Query: 55 GNPGDLFSFF------SLEQYNLSVGKIVILILLSITVVVALIFISHKLVKKIFKVSVKY 108
N G F +F S+ Y SVGKI+ + + ++ L ++ +
Sbjct: 506 ENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPE 565

Query: 109 LDRKILLIRGILLVISSFLL-------LTVYNFNQPGNEVKKIVDKYAI-WSDNSQNSTY 160
D+ + L L ++ +T Y V+ + +S +QN+
Sbjct: 566 EDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAG- 624

Query: 161 HENGFVIGFIYNFPVAVISKPSNYSEESIKKIMDKYMVRADA 202
+ F+ P + N +E I + + D
Sbjct: 625 ------MAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
BN424_2767  FLGFLGJ  50     3e-08    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 50.1 bits (119), Expect = 3e-08
Identities = 42/156 (26%), Positives = 71/156 (45%), Gaps = 19/156 (12%)

Query: 129 EFIMQVSENAKLLASQNDLYASVMIAQSILESAHGTSVLGKI---PVNNLFGIK--GRYN 183
F+ Q+S A+L + Q+ + +++AQ+ LES G + + P NLFG+K G +
Sbjct: 151 AFLAQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWK 210

Query: 184 NQFFEKESLEQLPDGTWVTKKSEFRKYESWEKSQLDYVEKIKKGPNANAGDNSWNPSYYA 243
E + E +G K++FR Y S+ ++ DYV + + P A +
Sbjct: 211 GPVTEITTTE-YENGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRYAAVTTA------- 262

Query: 244 GAWRSNTSSYRDATAALVGKYASDKTYDSKLNQIIE 279
S+ + A A YA+D Y KL +I+
Sbjct: 263 ------ASAEQGAQALQDAGYATDPHYARKLTNMIQ 292


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_2768  NUCEPIMERASE  163    6e-50    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 163 bits (414), Expect = 6e-50
Identities = 81/344 (23%), Positives = 144/344 (41%), Gaps = 42/344 (12%)

Query: 1 MTVLVLGGAGYIGSHAVDQLITKGYDVAVVDNLKTGHKESLSDK---------ARFYQGD 51
M LV G AG+IG H +L+ G+ V +DNL + SL +F++ D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 52 IRDKAFMEDVFTKENIEGVIHFAASSLVGESMEIPLDYFNNNVYGTQVVLEVMEKYNVKS 111
+ D+ M D+F + E V V S+E P Y ++N+ G +LE ++
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 112 IIFSSSAATYGEPKVIPI-EETAATNPESTYGETKLMMEKMLKWCDKAYGMRFVALRYFN 170
++++SS++ YG + +P + + +P S Y TK E M YG+ LR+F
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 171 VAGAKLDGTIGEDHNPESHLLPIILQTALGQREKFTIYGEDYETPDGTCIRDYVHVVDLI 230
V G P+ L G+ +Y G RD+ ++ D+
Sbjct: 181 VYGPWGR--------PDMALFKFTKAMLEGKS--IDVYN------YGKMKRDFTYIDDIA 224

Query: 231 DAHILALEYLQAGNSSNT---------------FNLGSSTGFSVKQMLEAAREVTGKEIP 275
+A I + + ++ T +N+G+S+ + ++A + G E
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK 284

Query: 276 ATVVSRRAGDPSTLIAASDKAREVLGWKPQYTDVNKIIESAWNW 319
++ + GD A + EV+G+ P+ T V +++ NW
Sbjct: 285 KNMLPLQPGDVLETSADTKALYEVIGFTPE-TTVKDGVKNFVNW 327


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
BN424_2776  PF05272  32     0.013    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.6 bits (71), Expect = 0.013
Identities = 17/61 (27%), Positives = 24/61 (39%), Gaps = 6/61 (9%)

Query: 355 KLDAV-ALVGPNGIGKSTLLKSLVD-----DIPLIQGEKRFGANVEVGYYDQEQANLNST 408
K D L G GIGKSTL+ +LV D G + G E + + +
Sbjct: 594 KFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAF 653

Query: 409 K 409
+
Sbjct: 654 R 654


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_2779  SACTRNSFRASE  38     4e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 38.4 bits (89), Expect = 4e-06
Identities = 21/84 (25%), Positives = 37/84 (44%), Gaps = 5/84 (5%)

Query: 68 ELLGFIGYRQLFDE-VELTNIAVHPKVQGQGLSQAFL---VQWIKKLHQARVVHLEVRKS 123
+G I R ++ + +IAV + +G+ A L ++W K+ H ++ LE +
Sbjct: 75 NCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLM-LETQDI 133

Query: 124 NQVAIHVYKKVGFKLINIRQDYYD 147
N A H Y K F + + Y
Sbjct: 134 NISACHFYAKHHFIIGAVDTMLYS 157


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_2780  SACTRNSFRASE  40     1e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 40.3 bits (94), Expect = 1e-06
Identities = 26/101 (25%), Positives = 45/101 (44%), Gaps = 6/101 (5%)

Query: 82 MKNNRNALYLVLRVYDVAIGFI---GSWFVEGEAHVTNIAIIPNYRRYGLASFLMEQMRH 138
++ A +L + + IG I +W G A + +IA+ +YR+ G+ + L+ +
Sbjct: 60 VEEEGKAAFLY-YLENNCIGRIKIRSNW--NGYALIEDIAVAKDYRKKGVGTALLHKAIE 116

Query: 139 LAEADHNQLFSLEVRMSNTGAQELYRKLGFKDGKIKKAYYS 179
A+ +H LE + N A Y K F G + YS
Sbjct: 117 WAKENHFCGLMLETQDINISACHFYAKHHFIIGAVDTMLYS 157


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
BN424_2786  PF05043  51     6e-09    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 51.1 bits (122), Expect = 6e-09
Identities = 38/183 (20%), Positives = 85/183 (46%), Gaps = 6/183 (3%)

Query: 6 DKIIYRKIQLLKVLDASNDYLKTVDLANFLDLSLKTVQKELESLIEDLKNSSYAVKLEKV 65
K +R+++LL++L + +LA L+ + + V+ +L + +
Sbjct: 6 SKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHV-KSAFPDLIFHSSTNG 64

Query: 66 GNLYRFIKKSSVNMDLIYLDFKRESIYFYLMRKAVFRKTIK-EKKEIDYFYSASHLYKNK 124
+ +++++Y F + S +F ++ F + + E +++ S+S LY+
Sbjct: 65 IRIINT---DDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSSSLYRII 121

Query: 125 RIFKTYLMN-YGLELDLSTLSIEGNEINIRFLYFQFFWENYRGVEWPFDTIDRQELIREI 183
+ + E+ L+ + I GNE +IR+ + Q+F E Y +EWPF+ + L + +
Sbjct: 122 SQINKVIKRQFQFEVSLTPVQIIGNERDIRYFFAQYFSEKYYFLEWPFENFSSEPLSQLL 181

Query: 184 EQL 186
E +
Sbjct: 182 ELV 184


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_2788  PF05043       55     4e-10    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 54.9 bits (132), Expect = 4e-10
Identities = 41/222 (18%), Positives = 92/222 (41%), Gaps = 15/222 (6%)

Query: 1 MRNLLNREDGKKVSLFRFIEECTQQTASF--SAIMHELEISEFVLLRTAENLSKDIELND 58
MR+LL+++ +++ L +E + F S + L +E + +
Sbjct: 1 MRDLLSKKSHRQLEL---LELLFEHKRWFHRSELAELLNCTERAVKDDLS------HVKS 51

Query: 59 LTPHFSLTISRTHKTITLKKDSDASISMLYIIYIKNSLSYNILVDILNGKFVSMTDFGEF 118
P + S T+ + D + + + K+S ++IL I + +
Sbjct: 52 AFPDL-IFHSSTNGIRIINTDDSDIEMVYHHFF-KHSTHFSILEFIFFNEGCQAESICKE 109

Query: 119 NFVSYSAVHKKIQEVKKELAN-YQVRLS-SKYELVGDEIKIRMFFYHLYYPRFNQLNFPF 176
++S S++++ I ++ K + +Q +S + +++G+E IR FF + ++ L +PF
Sbjct: 110 FYISSSSLYRIISQINKVIKRQFQFEVSLTPVQIIGNERDIRYFFAQYFSEKYYFLEWPF 169

Query: 177 EAKYEKLAQQFIRLLENHLDHSIQETLKTKINFFLSVSLKRI 218
E + Q + L+ + + + L +L RI
Sbjct: 170 ENFSSEPLSQLLELVYKETSFPMNLSTHRMLKLLLVTNLYRI 211


36  BN424_2849  BN424_2863  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_2849  0       12       3.493699   putative DNA-directed RNA polymerase subunit
BN424_2850  -1      12       3.450822   conserved hypothetical protein
BN424_2851  -1      13       3.283862   putative sugar-binding domain protein
BN424_2852  -1      15       3.602456   hypothetical protein
BN424_2853  -1      14       3.552134   transketolase, C-terminal domain protein
BN424_2854  -1      13       2.686709   dehydrogenase E1 component family protein
BN424_2855  -2      15       1.352539   BH3822 protein
BN424_2856  -2      15       0.991498   HD domain protein
BN424_2857  -1      12       1.636085   phosphatase YidA
BN424_2858  1       13       1.718850   tautomerase enzyme family protein
BN424_2859  2       11       1.493336   peroxiredoxin, Ohr subfamily protein
BN424_2860  3       10       1.406513   NUDIX domain protein
BN424_2861  2       12       1.251497   conserved hypothetical protein
BN424_2862  2       14       2.364398   dipeptidase, family protein
BN424_2863  3       15       1.863886   selenoB, glycine/betaine/sarcosine/D-proline
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_2859  PF05043       31     0.001    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 31.5 bits (71), Expect = 0.001
Identities = 14/61 (22%), Positives = 27/61 (44%), Gaps = 5/61 (8%)

Query: 49 EQLFALGYSACFNSALEL--VMGQEKVSGKSQVTATVELLSDPSDNGFKLAVELDVAIEG 106
E++ + + F + + + V S V + LLSD D +++V+ + IE
Sbjct: 255 EEVVCQLFVSYFQKMFFIDESLFMKCVKKDSYVEKSYHLLSDFID---QISVKYQIEIEN 311

Query: 107 K 107
K
Sbjct: 312 K 312


37  BN424_2912  BN424_2927  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_2912  2       20       -0.423672  transposase
BN424_2913  1       18       -1.411319  flavodoxin-like fold family protein
BN424_2914  2       19       -1.912993  cyclic nucleotide-binding domain protein
BN424_2915  2       20       -1.583996  bacterial regulatory s, tetR family protein
BN424_2916  3       20       -1.006049  putative membrane protein
BN424_2917  6       21       -1.245870  transposase DDE domain protein
BN424_2918  2       19       -1.016146  dihydrofolate reductase
BN424_2919  1       17       -0.094854  hypothetical protein
BN424_2920  -1      16       1.771895   hypothetical protein
BN424_2921  -1      15       2.453469   glyoxalase family protein
BN424_2922  -1      15       3.268719   bacterial regulatory helix-turn-helix s, AraC
BN424_2923  0       17       4.059704   GMP synthase, N-terminal domain
BN424_2924  0       17       3.966083   pantothenate kinase
BN424_2925  0       18       4.124489   mevalonate kinase
BN424_2926  -1      16       3.491176   diphosphomevalonate decarboxylase
BN424_2927  -2      15       3.388556   phosphomevalonate kinase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_2915  HTHTETR       49     1e-09    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 49.2 bits (117), Expect = 1e-09
Identities = 30/185 (16%), Positives = 63/185 (34%), Gaps = 29/185 (15%)

Query: 6 QSLLSKKWIIDSLLYLLKTKPYSEITITEITKKAGVARLTFYRNFESKDQILI------- 58
++ +++ I+D L L + S ++ EI K AGV R Y +F+ K +
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 59 TRSNYLFQEYLDEIRQN--GKITSIQQALLQCFNNW-----------QRDSQVMELLIKN 105
+ L EY + + + I +L+ + V E+ +
Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127

Query: 106 DLIYLIEQPFYTFLEKIIEEI-------PDLDNLDNTQKIFVLGRVTRTMLDWISTKSTK 158
+ Y +E+ ++ DL I + G ++ M +W+ +
Sbjct: 128 QAQRNLCLESYDRIEQTLKHCIEAKMLPADLM--TRRAAIIMRGYISGLMENWLFAPQSF 185

Query: 159 SSTEI 163
+
Sbjct: 186 DLKKE 190


38  BN424_3034  BN424_3042  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_3034  2       9        1.641035   FAD dependent oxidoreductase family protein
BN424_3035  3       11       0.034597   conserved hypothetical protein
BN424_3036  3       12       -0.435299  putative membrane protein
BN424_3037  3       13       -0.572800  ABC transporter family protein
BN424_3038  4       16       -0.774520  hypothetical protein
BN424_3039  2       17       -1.335331  bacterial Ig-like domain family protein
BN424_3040  2       19       -1.942659  mga helix-turn-helix domain protein
BN424_3041  2       19       -1.024978  conserved hypothetical protein
BN424_3042  2       17       -1.471434  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_3039  INTIMIN       34     0.004    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 33.9 bits (77), Expect = 0.004
Identities = 31/128 (24%), Positives = 48/128 (37%), Gaps = 6/128 (4%)

Query: 557 YPDRSTVGNSIGKILVEESLSTGNKVQYEYEVNFEVAAVPESIEVSPKIATIKTGETQQL 616
Y + + GK LV +S EV F + + + T G+ +
Sbjct: 717 YAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIE-IVGTGVKGKLPTV 775

Query: 617 SAKV----LPENSVNTDLKWTSSNEELATVD-EEGIVFGKRRGEVEISVETTNGLTDRAT 671
+ L + N W S+N +A+VD G V K +G ISV +++ T T
Sbjct: 776 WLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVISSDNQTATYT 835

Query: 672 IQIVNIEI 679
I N I
Sbjct: 836 IATPNSLI 843


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_3040  PF05043       43     2e-06    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 43.0 bits (101), Expect = 2e-06
Identities = 21/108 (19%), Positives = 52/108 (48%), Gaps = 8/108 (7%)

Query: 87 QTVISYICQKTLEFKILELFLYGNLKKVHQFLLEHGIGYTTYYRVLRKISGLLQK-YGIS 145
+ V + + + F ILE + + E I ++ YR++ +I+ ++++ +
Sbjct: 76 EMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSSSLYRIISQINKVIKRQFQFE 135

Query: 146 INTNSLELVGKESEIRLFYFQFLWTLCEGFG---WPFKNSDEKKIIER 190
++ ++++G E +IR F+ Q+ E + WPF+N + + +
Sbjct: 136 VSLTPVQIIGNERDIRYFFAQY---FSEKYYFLEWPFENFS-SEPLSQ 179


39  BN424_3061  BN424_3070  Y  N  N  Genomic Island
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_3061  -1      16       -3.292761  glycosyl transferases group 1 family protein
BN424_3062  0       15       -4.169262  bacterial sugar transferase family protein
BN424_3063  0       16       -4.009909  glycosyl transferase 2 family protein
BN424_3064  1       16       -4.176455  hypothetical protein
BN424_3065  0       17       -4.571685  O-Antigen Polymerase family protein
BN424_3066  0       18       -4.289612  capsular exopolysaccharide family domain
BN424_3067  -2      13       -2.866800  chain length determinant family protein
BN424_3068  -1      11       -2.608298  glycosyl transferases group 1 family protein
BN424_3069  1       12       -1.267053  putative lipoprotein
BN424_3070  2       15       -0.579545  hypothetical protein
40  BN424_3216  BN424_3223  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_3216  2       13       2.363704   naphthoate synthase
BN424_3217  0       15       -3.055540  uncharacterized 1 domain protein
BN424_3218  1       15       -3.043856  universal stress family protein
BN424_3219  2       15       -2.540993  hypothetical protein
BN424_3220  2       15       -1.668415  hypothetical protein
BN424_3221  1       15       -1.910011  hypothetical protein
BN424_3222  1       14       -0.192799  mga helix-turn-helix domain protein
BN424_3223  2       13       1.739779   conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_3222  PF05043       60     1e-11    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 59.6 bits (144), Expect = 1e-11
Identities = 51/235 (21%), Positives = 104/235 (44%), Gaps = 12/235 (5%)

Query: 6 FLEKQDIRKIGLFKFLEASYQQRATFSDISENLNISDFILLNTVDELTRDIEANQLTDCF 65
L K+ R++ L + L +++ S+++E LN ++ + + + + + D
Sbjct: 4 LLSKKSHRQLELLELL-FEHKRWFHRSELAELLNCTERAVKDDLSHVK-----SAFPDLI 57

Query: 66 KLEKTEKHIILKKSGKASVQVLAWLYLKKSHSFKILDEIYRGTFTNIASYSESNFVSYTS 125
T I+ V + K S F IL+ I+ S + ++S +S
Sbjct: 58 FHSSTNGIRIINTDDSDIEMVYHHFF-KHSTHFSILEFIFFNEGCQAESICKEFYISSSS 116

Query: 126 VYNRIQELKKILR-SYEIELS-SRFKLIGDEMKIRMYFYQVYYERFNRIEFPFEPKKKEV 183
+Y I ++ K+++ ++ E+S + ++IG+E IR +F Q + E++ +E+PFE E
Sbjct: 117 LYRIISQINKVIKRQFQFEVSLTPVQIIGNERDIRYFFAQYFSEKYYFLEWPFENFSSEP 176

Query: 184 NMLFIQQLEEQLQHSFSEVDKAKLDFFLAVNLNRIQQGTDLHSSTVERKKINVQL 238
++ + ++ + L L NL RI+ G H V++ N Q
Sbjct: 177 LSQLLELVYKETSFPMNLSTHRMLKLLLVTNLYRIKFG---HFMEVDKDSFNDQS 228


41  BN424_3290  BN424_3351  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_3290  2       13       -1.024476  phage major capsid protein, HK97 family
BN424_3291  1       15       -0.883409  phage prohead protease, HK97 family
BN424_3292  0       16       -0.180656  phage portal protein, HK97 family
BN424_3293  1       17       -0.035276  hypothetical protein
BN424_3294  1       19       0.637851   phage Terminase family protein
BN424_3295  1       27       1.740890   phage terminase, small subunit, P27 family
BN424_3296  2       30       2.477031   hypothetical protein
BN424_3297  4       29       3.080823   hypothetical protein
BN424_3298  5       27       0.625633   hypothetical protein
BN424_3299  3       25       0.343496   hypothetical protein
BN424_3301  3       26       0.428135   putative uncharacterized protein
BN424_3300  4       30       2.398217   protein
BN424_3304  2       32       2.571558   **phage transcriptional regulator, ArpU family
BN424_3305  2       27       1.905555   hypothetical protein
BN424_3306  0       28       2.063016   conserved hypothetical protein
BN424_3307  1       31       3.059043   hypothetical protein
BN424_3308  -1      28       2.346303   single-strand binding family protein
BN424_3309  0       24       1.176260   putative uncharacterized protein
BN424_3310  2       23       0.930363   yopX family protein
BN424_3311  1       26       0.861033   conserved hypothetical protein
BN424_3312  4       22       0.808096   hypothetical protein
BN424_3313  2       18       0.146160   domain protein
BN424_3314  2       18       0.347216   hypothetical protein
BN424_3315  1       20       -0.083005  phage replication protein
BN424_3316  2       19       -0.597996  hypothetical protein
BN424_3317  3       19       -1.068361  conserved hypothetical protein
BN424_3318  0       21       -1.936493  conserved hypothetical protein
BN424_3319  0       22       -1.957461  hypothetical protein
BN424_3320  1       22       -3.200123  hypothetical protein
BN424_3321  3       25       -2.044274  hypothetical protein
BN424_3322  3       29       -2.065241  hypothetical protein
BN424_3323  4       28       -2.316224  hypothetical protein
BN424_3324  5       27       -1.137863  putative membrane protein
BN424_3325  6       24       -1.614751  hypothetical protein
BN424_3326  3       25       -1.736435  protein
BN424_3327  2       22       -3.225895  hypothetical protein
BN424_3328  3       20       -2.552745  gp36 domain protein
BN424_3329  4       21       -2.907682  phage antirepressor KilAC domain protein
BN424_3330  3       19       -2.094794  helix-turn-helix family protein
BN424_3331  3       18       -1.361296  helix-turn-helix family protein
BN424_3332  4       18       -1.037448  conserved hypothetical protein
BN424_3333  3       17       -0.689048  conserved hypothetical protein
BN424_3334  -1      12       0.426166   phage integrase family protein
BN424_3335  -1      10       0.967749   pyrroline-5-carboxylate reductase
BN424_3336  -1      12       1.066988   tRNA-specific adenosine deaminase
BN424_3337  -2      14       2.272617   bacterial regulatory, Fis family protein
BN424_3338  -1      16       3.682157   putative uncharacterized protein
BN424_3339  -2      18       4.964273   clp amino terminal domain protein
BN424_3340  2       34       7.143062   transcriptional regulator CtsR
BN424_3350  2       36       6.185392   ********hypothetical protein
BN424_3351  0       26       5.258220   cell wall-associated hydrolase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_3317  IGASERPTASE   42     3e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 42.0 bits (98), Expect = 3e-06
Identities = 24/124 (19%), Positives = 43/124 (34%), Gaps = 3/124 (2%)

Query: 197 GLERGETAAQIITEIQAVEKRKKEREEQRKKEEEARIARELEQEQLRAKREE-EARLAAE 255
G E ET E VEK +K + E K +E ++ ++ +Q +++ + +A A E
Sbjct: 1089 GSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARE 1148

Query: 256 RAELEKANEQQETYLEDDVLTEENYTEDPFVDVPEPIEEEVVVPVKEEEPVRTAIIEITG 315
E Q + E ++ +V +P+ E V
Sbjct: 1149 NDPTVNIKEPQS--QTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPAT 1206

Query: 316 TNEQ 319
T
Sbjct: 1207 TQPT 1210



Score = 32.7 bits (74), Expect = 0.002
Identities = 22/108 (20%), Positives = 40/108 (37%), Gaps = 4/108 (3%)

Query: 204 AAQIITEIQAVEKRKKEREEQRKKEEEARIARELEQEQLRAKREEEARLAAERAELEKAN 263
AQ +E + + + + +KEE+A++ E QE + + + +A
Sbjct: 1085 VAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAE 1144

Query: 264 EQQE---TYLEDDVLTEENYTEDPFVDVPEPIEEEVVVPVKEEEPVRT 308
+E T + ++ N T D + V PV E V T
Sbjct: 1145 PARENDPTVNIKEPQSQTNTTADT-EQPAKETSSNVEQPVTESTTVNT 1191


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_3333  cloacin       31     0.007    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.8 bits (69), Expect = 0.007
Identities = 20/63 (31%), Positives = 31/63 (49%), Gaps = 1/63 (1%)

Query: 102 QILYVKAGNLKEKVNILPLNADLLAKQKQRADEEQRSLAETRAKKEAEDKLAAEAKAEED 161
Q+ +KA + VN D AK+K AD S E+R KKE + + +AE ++
Sbjct: 391 QMAGLKAQRAQTDVNNKQAAFDAAAKEKSDADAALSSAMESRKKKEDKKR-SAENNLNDE 449

Query: 162 RIK 164
+ K
Sbjct: 450 KNK 452


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_3337  HTHFIS        47     5e-08    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 46.7 bits (111), Expect = 5e-08
Identities = 16/58 (27%), Positives = 29/58 (50%)

Query: 210 SALEESELLSELKKIFSDQAEWRELVKALWETQGNISMAAKSLYVHRNTLQYRIDRFN 267
++ ++ S L + E+ ++ AL T+GN AA L ++RNTL+ +I
Sbjct: 417 ASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELG 474


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_3339  HTHFIS        38     2e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 37.9 bits (88), Expect = 2e-04
Identities = 36/173 (20%), Positives = 63/173 (36%), Gaps = 18/173 (10%)

Query: 502 EQKESERLLNLEKVLHSRVVGQEDAVSAVSRAMRR-ARSGLKDPNRPIGSFMFLGPTGVG 560
E K L + +VG+ A+ + R + R ++ L + M G +G G
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDL--------TLMITGESGTG 172

Query: 561 KTELAKALAESMFGSEDALIRVDMSEYMEKYSTSRLIGSPPG-YVGYDEGGQLTEKIRQK 619
K +A+AL + + ++M+ S L G G + G + Q
Sbjct: 173 KELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRST--GRFEQA 230

Query: 620 PYSVVLLDEVEKAHPDVFNILLQVLDDGHLT---DAKGRKVDFKNTILIMTSN 669
+ LDE+ D LL+VL G T + D + ++ +N
Sbjct: 231 EGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVR---IVAATN 280


42  BN424_3599  BN424_3622  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_3599  -3      15       4.224850   linear amide C-N hydrolases, choloylglycine
BN424_3600  0       16       4.965256   threonine dehydratase
BN424_3601  0       15       5.270518   3-isopropylmalate dehydratase small subunit
BN424_3602  -1      15       5.431278   3-isopropylmalate dehydratase, large subunit
BN424_3603  0       15       5.590276   3-isopropylmalate dehydrogenase
BN424_3604  0       15       5.168322   2-isopropylmalate synthase
BN424_3605  0       15       5.063731   ketol-acid reductoisomerase
BN424_3606  0       17       4.836729   acetolactate synthase, small subunit
BN424_3607  1       17       4.147403   acetolactate synthase, large
BN424_3608  0       16       3.621606   dihydroxy-acid dehydratase
BN424_3609  -1      15       1.763808   hypothetical protein
BN424_3610  0       13       1.372220   hypothetical protein
BN424_3611  1       13       0.134248   alpha,alpha-phosphotrehalase
BN424_3612  1       12       -2.303902  PTS system, trehalose-specific IIBC component
BN424_3613  2       15       -5.134488  putative lipoprotein
BN424_3614  5       17       -8.584780  bacterial regulatory s, tetR family protein
BN424_3615  6       21       -9.308494  hypothetical protein
BN424_3616  7       22       -9.146938  hypothetical protein
BN424_3617  7       22       -8.841178  hypothetical protein
BN424_3618  8       23       -9.396381  hypothetical protein
BN424_3619  7       24       -8.415505  hypothetical protein
BN424_3620  7       24       -7.797965  ABC-2 type transporter family protein
BN424_3621  4       20       -4.457367  putative bacitracin transport ATP-binding BcrA
BN424_3622  2       19       -3.595701  putative ABC transporter, ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_3614  HTHTETR       42     3e-07    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 42.3 bits (99), Expect = 3e-07
Identities = 15/58 (25%), Positives = 25/58 (43%)

Query: 7 TKKVIAHSLKELMQLTAFQKISIRDIMGHADIRRQTFYYHFQDKYELLAWIYNQEASE 64
T++ I L S+ +I A + R Y+HF+DK +L + I+ S
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_3622  PF05272       42     3e-07    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 42.0 bits (98), Expect = 3e-07
Identities = 12/55 (21%), Positives = 21/55 (38%), Gaps = 2/55 (3%)

Query: 38 GPNGSGKSTILRLLAGVLLNTDGVISLENQDDNYVKWTKQNSIYVSSGERGLMSK 92
G G GKST++ L G+ +D + D+Y + + E +
Sbjct: 603 GTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQI--AGIVAYELSEMTAFRR 655


43  BN424_64  BN424_72  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_64    1       11       -0.291526  PTS system, glucose subfamily, IIA component
BN424_65    1       12       -0.736013  CAT RNA binding domain protein
BN424_66    1       11       -0.246486  polysaccharide biosynthesis family protein
BN424_67    0       9        1.024831   transcriptional regulatory protein yycF
BN424_68    -1      10       1.093944   sensory box protein
BN424_69    -3      10       1.030958   yycH family protein
BN424_70    -3      11       0.630612   yycH family protein
BN424_71    -2      12       0.468650   metallo-beta-lactamase superfamily protein
BN424_72    -3      13       -0.357145  trypsin family protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_64    RTXTOXIND     27     0.032    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 27.5 bits (61), Expect = 0.032
Identities = 9/18 (50%), Positives = 14/18 (77%)

Query: 106 VKIGDAVKKGDPLVKIDR 123
VK G++V+KGD L+K+
Sbjct: 112 VKEGESVRKGDVLLKLTA 129


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_65    PHPHTRNFRASE  29     0.023    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 29.0 bits (65), Expect = 0.023
Identities = 25/145 (17%), Positives = 45/145 (31%), Gaps = 25/145 (17%)

Query: 10 NAVLVVDGTTEKVAIGKGIGFNKKKNDLVFDYDIEQLFIMENEQENFQQLLSQIDESYFF 69
+ G VAI K + + N + I + E E L + E
Sbjct: 6 TGIAASSG----VAIAKAF-IHLEPNVDIEKTSITDV---STEIEKLTAALEKSKEE--- 54

Query: 70 ASERIIEHAEHALKEKLNEHIHIALADHIAFAMDRLKNGIVVRNKLRKEIEVLYAEEFLI 129
+ + + + A H+ D L I+ E +
Sbjct: 55 -----LRAIKDQTEASMGADKAEIFAAHLLVLDDPE---------LVDGIKGKIENEQMN 100

Query: 130 AEWAIEYLSTQFGSVFTLDEAAYIA 154
AE+A++ +S F S+F + Y+
Sbjct: 101 AEYALKEVSDMFVSMFESMDNEYMK 125


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_67    HTHFIS        95     1e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 94.9 bits (236), Expect = 1e-24
Identities = 29/117 (24%), Positives = 63/117 (53%), Gaps = 1/117 (0%)

Query: 3 KILVVDDEKPISDIVKFNLTKEGYEVSTAYDGEEALKMVPEVEPDLIILDLMLPKIDGLE 62
ILV DD+ I ++ L++ GY+V + + + + DL++ D+++P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 VCREVRKNY-DMPIIMVTAKDSEIDKVLGLELGADDYVTKPFSNRELVARVKANLRR 118
+ ++K D+P+++++A+++ + + E GA DY+ KPF EL+ + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_68    PF06580       30     0.026    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 30.2 bits (68), Expect = 0.026
Identities = 35/213 (16%), Positives = 64/213 (30%), Gaps = 55/213 (25%)

Query: 408 IAPRFL--------AVTQEETDRMIRMITDLLNLSRMDAGKDTFELEYVNINELFSHVLN 459
I P F+ A+ E+ + M+T L L R V++ + + V +
Sbjct: 170 INPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLRYS--NARQVSLADELTVVDS 227

Query: 460 RFDMMLQSADKPVKPFVIKRDFTKRDLSVEVDADKMIQ-------VLDNIMNNAIKY--- 509
+ F R L E + I ++ ++ N IK+
Sbjct: 228 YLQLA-------------SIQFEDR-LQFENQINPAIMDVQVPPMLVQTLVENGIKHGIA 273

Query: 510 -SPSGGTITCRLMETHNNIVISIADEGLGVPKKDIPHVFDRFFRVDKARARSMGGTGLGL 568
P GG I + + + + + + + G K TG GL
Sbjct: 274 QLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE------------------STGTGL 315

Query: 569 AISKEVVQKHGGKIWLESIENK--GSTFFISLP 599
+E +Q G + K + +P
Sbjct: 316 QNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_72    V8PROTEASE    54     5e-10    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 54.2 bits (130), Expect = 5e-10
Identities = 31/161 (19%), Positives = 57/161 (35%), Gaps = 34/161 (21%)

Query: 250 IVTNNHVIDGSDAIEVILK------------DGTKVEAKLIGADQWTDLAVLSIPADKVK 297
++TN HV+D + LK +G ++ DLA++ ++
Sbjct: 114 LLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQN 173

Query: 298 -------TVATFGNSDDIKVGEPAIAIGSPLGTNFATSVTQGIVSAKDRSVAMDIDGDGV 350
AT N+ + +V + G P AT + + + G+
Sbjct: 174 KHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATM-------WESKGKITYLKGEA- 225

Query: 351 EDWDMTAIQTDAAINPGNSGGALINLAGQVIGINSMKISQD 391
+Q D + GNSG + N +VIGI+ + +
Sbjct: 226 -------MQYDLSTTGGNSGSPVFNEKNEVIGIHWGGVPNE 259


44  BN424_154  BN424_164  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_154   3       29       -2.816056  ABC-2 type transporter family protein
BN424_155   2       25       -2.594808  ABC transporter family protein
BN424_156   0       25       -2.302952  putative membrane protein
BN424_157   -1      23       -1.014630  histidine kinase-, DNA gyrase B-, and HSP90-like
BN424_158   -1      18       0.007760   response regulator
BN424_159   -1      17       0.642167   LPXTG-motif cell wall anchor domain protein
BN424_160   -1      13       0.902044   phenazine biosynthesis PhzF family protein
BN424_161   -1      14       0.825903   acetyltransferase family protein
BN424_162   -1      13       1.078539   conserved domain protein
BN424_163   0       13       0.587427   HAMP domain protein
BN424_164   0       13       0.948006   response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_154   ABC2TRNSPORT  33     6e-04    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 33.4 bits (76), Expect = 6e-04
Identities = 23/78 (29%), Positives = 40/78 (51%), Gaps = 2/78 (2%)

Query: 176 QVIMIGGLLF-SPITYPTDRLPSLLVRFFEILPFVPSSNLIRSMFYDQGIVNI-YNIIVI 233
Q ++I +LF S +P D+LP + LP S +LIR + +V++ ++ +
Sbjct: 182 QTLVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGAL 241

Query: 234 CFWLVLNMLLSLVSLSRR 251
C ++V+ LS L RR
Sbjct: 242 CIYIVIPFFLSTALLRRR 259


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_157   PF06580       38     8e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.5 bits (87), Expect = 8e-05
Identities = 18/94 (19%), Positives = 45/94 (47%), Gaps = 4/94 (4%)

Query: 335 ILAIVIGNVIDNAIQASIRICPEDRHINIVIKQFNNDLLVEVSNNFNPEELSTRHHRKNE 394
+ +++ +++N I+ I P+ I + + N + +EV N L+ ++ +++
Sbjct: 255 VPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT---GSLALKNTKEST 311

Query: 395 GFGMKNIDGLLQQI-GGIYRHWTEESKHFVTVVI 427
G G++N+ LQ + G + E + V ++
Sbjct: 312 GTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_158   HTHFIS        63     1e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 62.9 bits (153), Expect = 1e-13
Identities = 20/120 (16%), Positives = 49/120 (40%), Gaps = 5/120 (4%)

Query: 2 KVAICDDNPSLTEVINTMLTDYDPNMFETFTFYNPHNLINQLDIEKFDFFILDIEMDEMS 61
+ + DD+ ++ V+N L+ ++ N L + D + D+ M + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAG---YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GIDLAKNIRERGILSPIVFLTSYKEYMEEV--FQVQTFDYLLKPPTKDRMKQVLDKLNQH 119
DL I++ P++ +++ +M + + +DYL KP + ++ +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_159   GPOSANCHOR    26     0.015    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 25.8 bits (56), Expect = 0.015
Identities = 9/32 (28%), Positives = 14/32 (43%)

Query: 6 SGPTTPPIVPENPATNEIDIAQVQGNLPTTGE 37
G P P N+ + + + LP+TGE
Sbjct: 479 PGKGQAPQAGTKPNQNKAPMKETKRQLPSTGE 510


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_163   PF06580       31     0.007    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.0 bits (70), Expect = 0.007
Identities = 34/179 (18%), Positives = 77/179 (43%), Gaps = 27/179 (15%)

Query: 173 AIIIQESGRLTTLSSTILHLSKVENHEI-ISEKRAIQLDEQLR--QTILLLEPKWQKKRI 229
A+I+++ + + + LS++ + + S R + L ++L + L L + R+
Sbjct: 184 ALILEDPTKAREM---LTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRL 240

Query: 230 IWELELDDSILNSD--EDLLQQMWINLLDNAIKFSPENGVVKVKLMNLTDTVIVKITDQG 287
+E +++ +I++ L+Q + N + + I P+ G + +K TV +++ + G
Sbjct: 241 QFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300

Query: 288 SGMSSETQQRLFDKFYQGDASHSKEGNGLGMSLVKNILRICDGE---IGLKSSLGNGSS 343
S ++KE G G+ V+ L++ G I L G ++
Sbjct: 301 SLA----------------LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNA 343


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_164   HTHFIS        85     2e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 85.3 bits (211), Expect = 2e-21
Identities = 32/120 (26%), Positives = 58/120 (48%), Gaps = 3/120 (2%)

Query: 3 TILIVEDDPHTRNLMEIILKNNGFQTVTATNGIEALDVLDKRMISLIILDIMIPEMDGYQ 62
TIL+ +DD R ++ L G+ +N + L++ D+++P+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 LTQNLREADFQLPILMVTAKETPSEKKKGFLVGTDDYMTKPVDEEEMIL---RILALLRR 119
L +++A LP+L+++A+ T K G DY+ KP D E+I R LA +R
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124


45  BN424_207  BN424_217  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_207   -2      7        -0.362156  mga helix-turn-helix domain protein
BN424_208   -1      9        0.856415   tat pathway signal sequence domain protein
BN424_209   -2      12       1.499589   UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--L-
BN424_210   -1      11       1.416566   response regulator
BN424_211   -1      12       1.719741   his Kinase A domain protein
BN424_212   -2      12       0.886637   D-alanyl-D-alanine carboxypeptidase dacA
BN424_213   0       11       0.249211   hypothetical protein
BN424_214   0       11       -0.043531  ATPase, P-type (transporting), HAD super, subIC
BN424_215   0       13       -1.413769  seryl-tRNA synthetase
BN424_216   3       14       -1.759256  peptide methionine sulfoxide reductase MsrA
BN424_217   5       15       -1.657392  mga helix-turn-helix domain protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_207   PF05043       60     1e-11    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 59.6 bits (144), Expect = 1e-11
Identities = 55/268 (20%), Positives = 102/268 (38%), Gaps = 30/268 (11%)

Query: 1 MKKLLDKPFHLILKLLEHFYKKTPQETINYYSDFLNVDRRTILKIITDLERDIADCQWEN 60
M+ LL K H L+LLE ++ + ++ LN R + ++ ++ D
Sbjct: 1 MRDLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPD----- 55

Query: 61 QLTLEVTETKIIATFSTNFSLENFYRYYMERSLCVELVQSIFKEAEISLDQIIENFFVSR 120
+ T I + +E Y ++ + S +++ IF + I + F++S
Sbjct: 56 LIFHSSTNGIRIINTDDS-DIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISS 114

Query: 121 TTFYRRITPLKEVL-AEFDLELDFTKKQFLIGEEKQIRYFFSVFFWEIFRSTGEYKHPDL 179
++ YR I+ + +V+ +F E+ T Q +IG E+ IRYFF+ +F E + +
Sbjct: 115 SSLYRIISQINKVIKRQFQFEVSLTPVQ-IIGNERDIRYFFAQYFSEKYY----FLEWPF 169

Query: 180 KDTEYLNRIKQDLNLSIPH---------FLYFQLYLNISLTRISQGYLVSEVIPYPIKEI 230
++ + Q L L +L L +L RI G+ +
Sbjct: 170 ENFS-SEPLSQLLELVYKETSFPMNLSTHRMLKLLLVTNLYRIKFGHFME----VDKDSF 224

Query: 231 NYSYAQFKNLTAPYFNKLMPAQQSLEIH 258
N F + QS E
Sbjct: 225 NDQSLDF----LMQAEGIEGVAQSFESE 248


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_210   HTHFIS        99     2e-26    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 99 bits (249), Expect = 2e-26
Identities = 44/137 (32%), Positives = 70/137 (51%), Gaps = 2/137 (1%)

Query: 2 KILVVDDDKEIVELLSIYIKNEGYEVEKAYNGKEAMTKIVTNPDIDLMVLDVMMPKMDGI 61
ILV DDD I +L+ + GY+V N + + D DL+V DV+MP +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLW-RWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EVVKELRKE-SQMPVLMLSAKTTDMDKIQGLITGADDYVAKPFNPLEVMARIKSLLRRSN 120
+++ ++K +PVL++SA+ T M I+ GA DY+ KPF+ E++ I L
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 121 YQVTNDEPDILEIGPLV 137
+ + E D + PLV
Sbjct: 124 RRPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_211   PF06580       33     0.002    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.5 bits (74), Expect = 0.002
Identities = 10/44 (22%), Positives = 19/44 (43%), Gaps = 4/44 (9%)

Query: 272 LISNALKYGVGGKK----ITIEAQKVGKEVIIAVNNDGPIIPEE 311
L+ N +K+G+ I ++ K V + V N G + +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN 306


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_212   BLACTAMASEA   41     6e-06    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 40.9 bits (96), Expect = 6e-06
Identities = 35/168 (20%), Positives = 56/168 (33%), Gaps = 23/168 (13%)

Query: 45 FAIEPKTGKVLLNQNGDAQLGIASMTKMITEYLVLEAIKEGKLTWDQKLSIDDYSYNVSQ 104
++ +G+ L D + + S K++ VL + G ++K+ Q
Sbjct: 43 IEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHY-------RQ 95

Query: 105 NNELSNVPLRK---DSQYTVKELFEAMAIYSANAAAITLATAVSGSEPAFVDAMREKVKS 161
+ + P+ + TV EL A S N+AA L V G +R
Sbjct: 96 QDLVDYSPVSEKHLADGMTVGELCAAAITMSDNSAANLLLATVGGPA-GLTAFLR----Q 150

Query: 162 WGAKDFYLVNATGLTNSDLHGNIYPGSADTDENTMTARDMAIVAQHLL 209
G L N L G+ +T T MA + LL
Sbjct: 151 IGDNVTRLDRWETELNEALPGDA--------RDTTTPASMAATLRKLL 190


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_217   PF05043       53     2e-09    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 52.6 bits (126), Expect = 2e-09
Identities = 32/169 (18%), Positives = 66/169 (39%), Gaps = 7/169 (4%)

Query: 1 MKRLLDPNFIPILSLLKQLNKDYPSRSITFFSEQLKLDRRTILKTIHTLQLDISRNHWEN 60
M+ LL L LL+ L + + +E L R + + ++ + +
Sbjct: 1 MRDLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIFHS 60

Query: 61 MLTIEIIDKSVYTTISPFFSIEVFFSHYMSESFAVRLFLSLFKYPSDSIDEICEYLYVSK 120
+ + IE+ + H+ S + +F + IC+ Y+S
Sbjct: 61 ------STNGIRIINTDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISS 114

Query: 121 ATFYRRIKYSKEVLDDFNLSLDFTDSKNKLVGSETQIRYFFSTLFWEVF 169
++ YR I +V+ + + + +++G+E IRYFF+ F E +
Sbjct: 115 SSLYRIISQINKVIKR-QFQFEVSLTPVQIIGNERDIRYFFAQYFSEKY 162


46  BN424_256  BN424_263  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_256   -1      13       -0.975036  PRD domain protein
BN424_257   -2      12       -2.644787  transcriptional regulator domain protein
BN424_258   -1      12       -1.888874  helix-turn-helix domain, rpiR family protein
BN424_259   -1      13       -1.371690  hypothetical protein
BN424_260   0       13       -1.134157  PPIC-type PPIASE domain protein
BN424_261   -1      15       -1.401888  hypothetical protein
BN424_262   -1      16       -0.859063  mga helix-turn-helix domain protein
BN424_263   -1      15       0.160935   legume lectin domain protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_256   PF05043       32     0.004    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 31.8 bits (72), Expect = 0.004
Identities = 21/133 (15%), Positives = 47/133 (35%), Gaps = 18/133 (13%)

Query: 10 ILEFLLMNKANVTNAVLASELDVSERTVRRDLHEVEAILDTFQLKLSKENSQLSIIGTEI 69
+LE L +K + LA L+ +ER V+ DL V++ + S N I +
Sbjct: 15 LLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDL-IFHSSTNGIRIINTDDS 73

Query: 70 NRQNFKWQLLDLA-------------HNEFTPLERQNFI----LKTLLRETEPLKLMALA 112
+ + + + + ++ +I L ++ + +
Sbjct: 74 DIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSSSLYRIISQINKVIKRQFQ 133

Query: 113 TDLSVTISTISSD 125
++S+T I +
Sbjct: 134 FEVSLTPVQIIGN 146


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BN424_261  PF05043  38     1e-05    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 38.0 bits (88), Expect = 1e-05
Identities = 33/147 (22%), Positives = 63/147 (42%), Gaps = 6/147 (4%)

Query: 6 KTNMQLKLMNGFTFYQKLIFKGGLLNDVPLQFKQRTLFIINRLPEIIQRTFKKYFPHEDE 65
K N+ L N Y++ +F +L D + I + +++ Y +
Sbjct: 312 KDNLIWHLHNTAHLYRQELFTEFILFDQKGNTIRNFQNIFPKFVSDVKKELSHYLETLEV 371

Query: 66 ----WMANYFIYIIITHWDTFIPNMLKKAPIIHVGIVVETDLEHALYLKNKLAYYYP--F 119
M N+ Y ITH + N+L+ P + V ++ D HA ++ L+YY F
Sbjct: 372 CSSSMMVNHLSYTFITHTKHLVINLLQNQPKLKVLVMSNFDQYHAKFVAETLSYYCSNNF 431

Query: 120 NLDAMLIPDITTERIDNKKLDIILTTF 146
L+ +++ E +++ DII++ F
Sbjct: 432 ELEVWTELELSKESLEDSPYDIIISNF 458


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BN424_262  PF05043  73     5e-17    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 72.7 bits (178), Expect = 5e-17
Identities = 53/199 (26%), Positives = 92/199 (46%), Gaps = 2/199 (1%)

Query: 24 IDIYTLADELFMSVRNLKKYIDDLNVLINPISIYFIDTNSVNIHYPDSLNYQHIYKSIYV 83
LA+ L + R +K + + P I+ TN + I D + + +Y +
Sbjct: 26 FHRSELAELLNCTERAVKDDLSHVKSAF-PDLIFHSSTNGIRIINTDDSDIEMVYHHFFK 84

Query: 84 NNLNYSLLELLFLEENNTLETLEEHFFLSESTLRRTISFINQRL-APFDIIIDTKNFNII 142
++ ++S+LE +F E E++ + F++S S+L R IS IN+ + F + II
Sbjct: 85 HSTHFSILEFIFFNEGCQAESICKEFYISSSSLYRIISQINKVIKRQFQFEVSLTPVQII 144

Query: 143 GDEKNIIQFFVSYFQEKYTFQDIKLGNSLVQFLDYIYSDFTKFLNFPTNFPTKNRFIFWV 202
G+E++I FF YF EKY F + N + L + K +FP N T +
Sbjct: 145 GNERDIRYFFAQYFSEKYYFLEWPFENFSSEPLSQLLELVYKETSFPMNLSTHRMLKLLL 204

Query: 203 GVGLKRIERNHSLPINNNS 221
L RI+ H + ++ +S
Sbjct: 205 VTNLYRIKFGHFMEVDKDS 223


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BN424_263  BACINVASINB  30     0.047    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 30.1 bits (67), Expect = 0.047
Identities = 28/113 (24%), Positives = 46/113 (40%), Gaps = 21/113 (18%)

Query: 575 DVSKAGEHPIELTVADKAGNK---SEMIHSTLKIL-------EANKQLESKKQLELKSKD 624
D KA + + VA KAG+ ++ S + + A ++L S+ QL L
Sbjct: 32 DFLKAADKAFKDVVATKAGDLKAGTKSGESAINTVGLKPPTDAAREKLSSEGQLTLLLGK 91

Query: 625 LIQKTTIEKNEFLLKQLAA--TAWEINAEAEKIDLTEQIIIQNSDEITENPGE 675
L+ + L QL + W+ E++K ++ IQ S E GE
Sbjct: 92 LMTLL----GDVSLSQLESRLAVWQAMIESQK-----EMGIQVSKEFQTALGE 135


47  BN424_420  BN424_426  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BN424_420  -2      13       0.039147   putative uncharacterized domain protein
BN424_421  -1      12       -0.287996  conserved protein
BN424_422  1       12       0.416541   putative membrane protein
BN424_423  1       13       0.493684   LPXTG-motif cell wall anchor domain protein
BN424_424  3       14       -0.140522  cobalt transport family protein
BN424_425  0       14       1.409901   ABC transporter family protein
BN424_426  0       13       1.937213   ABC transporter family protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
BN424_420  IGASERPTASE  41     3e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 41.2 bits (96), Expect = 3e-06
Identities = 45/216 (20%), Positives = 74/216 (34%), Gaps = 41/216 (18%)

Query: 32 ATPQQNKTQVVKKETKKDTVANSKKTA---KEKKESLESEKKNESSKESSKDSSKE--AD 86
A Q N+ ETK+ +K+TA KE+K +E+EK E K +S+ S K+ ++
Sbjct: 1078 ANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSE 1137

Query: 87 TEKKIADKNVEKATSTVIESADD----------------------VQEEVANSTTNNETK 124
T + A+ E + I+ V E +T N+ +
Sbjct: 1138 TVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVE 1197

Query: 125 NPVVEHKEEPNAPPTTPS------------QPEPTQPTPENKPTPEPKKEVTFSIKGTAT 172
NP + S + P P + + + T T
Sbjct: 1198 NPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNT 1257

Query: 173 NS--SSYFISPQKVEMKEGQSVMDVLSDYCRNNGIQ 206
N+ S Q V + G++V +S NN Q
Sbjct: 1258 NAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQ 1293



Score = 40.0 bits (93), Expect = 8e-06
Identities = 27/152 (17%), Positives = 59/152 (38%), Gaps = 9/152 (5%)

Query: 32 ATPQQNKTQVVKKETKKDTVANSKKTAKEKKESLESE----KKNESSKESSKDSSKEADT 87
+ Q++KT ++ +T A +++ AKE K ++++ + +S E+ + + E
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102

Query: 88 EKKIADKNVEKATSTVIESADDVQEEVANSTTNNETKNPVVEHKEEPNA-----PPTTPS 142
+ + K + + V +V+ +ET P E E + P + +
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQT 1162

Query: 143 QPEPTQPTPENKPTPEPKKEVTFSIKGTATNS 174
P + + ++ VT S NS
Sbjct: 1163 NTTADTEQPAKETSSNVEQPVTESTTVNTGNS 1194



Score = 31.2 bits (70), Expect = 0.005
Identities = 32/163 (19%), Positives = 55/163 (33%), Gaps = 8/163 (4%)

Query: 25 NNKYAHVATPQQNKTQVVKKETKKDTVANSKKTAKEKKESLESEKKNESSKESSKDSSKE 84
N + VA + ETK+ ++ AK + E K E K +S+ S K+
Sbjct: 1079 NTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETE-----KTQEVPKVTSQVSPKQ 1133

Query: 85 --ADTEKKIADKNVEKATSTVIESADDVQEEVANSTTNNETKNPVVEHKEEPNAPPTTPS 142
++T + A+ E + I+ A++ + + VE + T +
Sbjct: 1134 EQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGN 1193

Query: 143 QPEPTQPTPENKPTPEPKKEVTFSIKGTATNSSSYFISPQKVE 185
P T +P S K + S P VE
Sbjct: 1194 SVVEN-PENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVE 1235



Score = 30.4 bits (68), Expect = 0.009
Identities = 35/187 (18%), Positives = 61/187 (32%), Gaps = 26/187 (13%)

Query: 24 LNNKYAHVATPQQNKTQVVKKETKKDT-------VANSKKTAKEKKESLESEKKNESSKE 76
NN A V + N ++ + + ++ A+ K+ ++ +KNE
Sbjct: 1000 PNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDAT 1059

Query: 77 SSKDSSKEADTEKKIADKNVEKATSTVIESADDVQEEVANSTTNNETKNPVVEHKEEPN- 135
+ ++E E K NV+ T T E + T ETK KEE
Sbjct: 1060 ETTAQNREVAKEAK---SNVKANTQT--NEVAQSGSETKETQTT-ETKETATVEKEEKAK 1113

Query: 136 ------------APPTTPSQPEPTQPTPENKPTPEPKKEVTFSIKGTATNSSSYFISPQK 183
+P Q + P+ +P E V + TN+++ P K
Sbjct: 1114 VETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAK 1173

Query: 184 VEMKEGQ 190
+
Sbjct: 1174 ETSSNVE 1180


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits      Score  E-value  Comments
BN424_421  adhesinb  28     0.015    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 27.5 bits (61), Expect = 0.015
Identities = 15/58 (25%), Positives = 24/58 (41%), Gaps = 1/58 (1%)

Query: 1 MKKIIRLVLSIGVIALIVGCGNAAVTKEDSSSSKTKDVTVTVILKENHKEFDQKKIEV 58
MKK LVL + + C + + ++ SSK V I+ + K KI +
Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQK-SSTETGSSKLNVVATNSIIADITKNIAGDKINL 57


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BN424_423  PF03544  37     7e-05    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 37.3 bits (86), Expect = 7e-05
Identities = 26/129 (20%), Positives = 43/129 (33%), Gaps = 8/129 (6%)

Query: 306 GGASIYDFVNNPVPQLIETPVDPDP---EVTDPEVTDPETTDPEVTDPETTEPEVTNPDP 362
GA + + V Q+IE P P + P + EPE P+P
Sbjct: 25 HGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADL-EPPQAVQPPPEPVVEPE---PEP 80

Query: 363 GVTEPETTNPEKEKTPDPTPTEKPTVVEPVAVKEADKPANLIKAPISNTAENTKSDETNN 422
PE P P KP V++ + +++ ++ ENT +
Sbjct: 81 EPI-PEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTS 139

Query: 423 FPKTGESSE 431
T +S+
Sbjct: 140 STATAATSK 148


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
BN424_426  PF05272  30     0.007    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.007
Identities = 9/19 (47%), Positives = 11/19 (57%)

Query: 16 IVGKNGTGKSTFLMVLAGL 34
+ G G GKST + L GL
Sbjct: 601 LEGTGGIGKSTLINTLVGL 619


48  BN424_872  BN424_879  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
BN424_872  -2      10       1.377067   lin1944 protein
BN424_873  -3      9        -0.536484  preprotein translocase, SecG subunit
BN424_874  -2      9        -0.772515  esterase D
BN424_875  -3      10       -1.292668  ribonuclease R
BN424_876  -1      12       -2.277601  ssrA-binding protein
BN424_877  -1      12       -2.345271  6-phospho-beta-glucosidase gmuD
BN424_878  1       15       -3.457501  putative membrane protein
BN424_879  -1      15       -1.180085  response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_872  DHBDHDRGNASE  44     1e-07    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 43.9 bits (103), Expect = 1e-07
Identities = 45/181 (24%), Positives = 63/181 (34%), Gaps = 29/181 (16%)

Query: 2 KKALIIGGNGTIGSAVSNALNDSYEIITA------------------GRTHGDVKVDITS 43
K A I G IG AV+ L I A R D+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 44 VESI----TRLFEIVGEVDAVIITAGQAHFGALKDMTPQD--NLISVNSKLLGQVNTVLI 97
+I R+ +G +D ++ AG G + ++ ++ SVNS G N
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNS--TGVFNASRS 126

Query: 98 GTNYVKDH--GSFTLVTGIMMDDPILAGASAALANGGVKAFAKSAALEL-PRGIRINTVS 154
+ Y+ D GS V P + A+ A + F K LEL IR N VS
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 155 P 155
P
Sbjct: 187 P 187


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
BN424_873  SECGEXPORT  41     2e-08    Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 41.1 bits (96), Expect = 2e-08
Identities = 25/76 (32%), Positives = 42/76 (55%), Gaps = 4/76 (5%)

Query: 1 MYNALLLAMLVISVLLIIVITMQPTKTNSASSALTGGAE-QLFGKQKARGFEAVLQRVTV 59
MY ALL+ L++++ L+ +I +Q K ++ GA LFG + F + R+T
Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNF---MTRMTA 57

Query: 60 ILGIAFFVIALVLAYV 75
+L FF+I+LVL +
Sbjct: 58 LLATLFFIISLVLGNI 73


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
BN424_874  CHLAMIDIAOM6  30     0.010    Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature.

Length = 547

Score = 30.0 bits (67), Expect = 0.010
Identities = 18/57 (31%), Positives = 28/57 (49%), Gaps = 3/57 (5%)

Query: 74 TQNALAF--LREKGYQEIAVFGLSLGGIFATKALEEEGLLAAGTLCSPLFLNENNHV 128
T N + F L G +E F ++L + A A E +L++ TL P+ EN H+
Sbjct: 491 TGNTVVFDSLPRLGSKETVEFSVTLKAVSAGDA-RGEAILSSDTLTVPVSDTENTHI 546


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
BN424_879  HTHFIS  42     2e-06    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 41.7 bits (98), Expect = 2e-06
Identities = 23/124 (18%), Positives = 47/124 (37%), Gaps = 13/124 (10%)

Query: 2 HIAICEDETVQQEELYNLLQAYQSFFPTPLAIEVFSNAEDLIEVCHYSGNRFDLIFLDIA 61
I + +D+ + L L + + SNA L + DL+ D+
Sbjct: 5 TILVADDDAAIRTVLNQALS------RAGYDVRITSNAATLWR--WIAAGDGDLVVTDVV 56

Query: 62 LPKLNGIEAAKIIRQMDAEVELVFLT--SMLDYSLEGYHVKALRYLLKPIKAHQLDELLK 119
+P N + I++ ++ ++ ++ + +++ A YL KP L EL+
Sbjct: 57 MPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF---DLTELIG 113

Query: 120 TIKN 123
I
Sbjct: 114 IIGR 117


49  BN424_1537  BN424_1543  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias   Product
BN424_1537  1       12       0.939291  lysM domain protein
BN424_1538  0       10       0.692438  cytidylate kinase
BN424_1539  0       11       0.115346  30S ribosomal protein S1 homolog
BN424_1540  2       11       0.146433  ribosome-associated GTPase EngA
BN424_1541  2       10       0.417206  DNA-binding protein HU
BN424_1542  2       10       0.701800  3-dehydroquinate synthase family protein
BN424_1543  3       10       1.167605  tetratricopeptide repeat family protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
BN424_1537  TONBPROTEIN  28     0.023    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 28.0 bits (62), Expect = 0.023
Identities = 13/42 (30%), Positives = 16/42 (38%)

Query: 106 SKPAESSSAVSSSSAPVEETPAPVEPTPPVEETPPPVEEPPV 147
PA+ S + A +E A P PV E P E P
Sbjct: 39 PAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPE 80


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits       Score  E-value  Comments
BN424_1540  TCRTETOQM  30     0.017    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 30.2 bits (68), Expect = 0.017
Identities = 20/76 (26%), Positives = 33/76 (43%), Gaps = 10/76 (13%)

Query: 48 WLGKEFNIIDT-GGIDIGDEPFLEQIKQQAEIAMDEADVIIFITSGRESVTDADENVAKM 106
W + NIIDT G +D FL ++ ++ D I + S ++ V +
Sbjct: 65 WENTKVNIIDTPGHMD-----FLAEV----YRSLSVLDGAILLISAKDGVQAQTRILFHA 115

Query: 107 LYRTKKPVLLAVNKID 122
L + P + +NKID
Sbjct: 116 LRKMGIPTIFFINKID 131


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_1541  DNABINDINGHU  133    9e-45    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 133 bits (337), Expect = 9e-45
Identities = 71/91 (78%), Positives = 78/91 (85%)

Query: 1 MANKAELIESVATSTGLTKKDATAAVDAVFETIQTTLSSGEKVQLIGFGNFEVRERAARK 60
MANK +LI VA +T LTKKD+ AAVDAVF + + L+ GEKVQLIGFGNFEVRERAARK
Sbjct: 1 MANKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARK 60

Query: 61 GRNPQTGEEIQIAASKVPAFKPGKALKDAVK 91
GRNPQTGEEI+I ASKVPAFK GKALKDAVK
Sbjct: 61 GRNPQTGEEIKIKASKVPAFKAGKALKDAVK 91


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_1543  SYCDCHAPRONE  39     1e-05    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 38.8 bits (90), Expect = 1e-05
Identities = 32/132 (24%), Positives = 51/132 (38%), Gaps = 7/132 (5%)

Query: 202 ETTDLLFELGFTYLQNKEYRRASETLFKLKELDPSYTSLYPYLAKSLEEENQLDRATEVI 261
+T + L+ L F Q+ +Y A + L LD + + L + Q D A
Sbjct: 34 DTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSY 93

Query: 262 REGLRADQYNPELFYYAADLFLKLGDEEQGEYYYQESLELNPDN-------ETVQLALIN 314
G D P ++AA+ L+ G+ + E + EL D V L
Sbjct: 94 SYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEA 153

Query: 315 LYLKQERFNEAV 326
+ LK+E +E V
Sbjct: 154 IKLKKEMEHECV 165


50  BN424_1611  BN424_1618  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_1611  -1      10       -1.723550  bacterial regulatory helix-turn-helix, lysR
BN424_1612  -1      10       -0.892659  transcription regulator
BN424_1613  -1      9        -0.986021  manganese-dependent inorganic pyrophosphatase
BN424_1614  -1      10       -1.556579  conserved hypothetical protein
BN424_1615  -2      14       -1.369516  response regulator
BN424_1616  -1      17       -2.264196  lrgA family protein
BN424_1617  1       17       -2.949665  antiholin-like protein LrgB
BN424_1618  1       17       -4.258904  linear amide C-N hydrolases, choloylglycine
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score  E-value  Comments
BN424_1611  HTHFIS  29     0.008    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.008
Identities = 9/23 (39%), Positives = 16/23 (69%)

Query: 25 NYTQAAQLLGITQPALTQQIKKL 47
N +AA LLG+ + L ++I++L
Sbjct: 451 NQIKAADLLGLNRNTLRKKIREL 473


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
BN424_1614  PF06580  212    1e-65    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 212 bits (541), Expect = 1e-65
Identities = 66/216 (30%), Positives = 116/216 (53%), Gaps = 13/216 (6%)

Query: 362 QLELGEAEIQSKLLKDAEIKSLQAQVNPHFFFNAMNTISALMRQNAEQARTLLLQLSTYF 421
Q E+ + ++ + ++A++ +L+AQ+NPHF FNA+N I AL+ ++ +AR +L LS
Sbjct: 146 QAEIDQWKMA-SMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELM 204

Query: 422 RANLQGARQVLIPLTAELKHVEAYLSLEQARFPQRFQVTFNIYPNLETLLLPPFLLQVLV 481
R +L+ + + L EL V++YL L +F R Q I P + + +PP L+Q LV
Sbjct: 205 RYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLV 264

Query: 482 ENAIRHAFGNRKTDNQIVVQLEQKENFVLVQVSDNGIGIPLDRVEKVGKEVIESEKGTGT 541
EN I+H +I+++ + V ++V + G + +++ TGT
Sbjct: 265 ENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG-----------SLALKNTKESTGT 313

Query: 542 ALENLNKRLVGLFGLEASLQFSQNKTGGTTVLLKIP 577
L+N+ +RL L+G EA ++ S K G ++ IP
Sbjct: 314 GLQNVRERLQMLYGTEAQIKLS-EKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score  E-value  Comments
BN424_1615  HTHFIS  75     9e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.3 bits (185), Expect = 9e-18
Identities = 33/162 (20%), Positives = 61/162 (37%), Gaps = 13/162 (8%)

Query: 2 HILIVDDEPLARDELAYLVETHPNVISVDKAESIEEALEKMVNQKPDLVFLDIHLTDESG 61
IL+ DD+ R L + V + + DLV D+ + DE+
Sbjct: 5 TILVADDDAAIRTVLNQALS--RAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 FDLADKFKKMNHPPKIVFATAYD--EYALKAFEVDAIDYILKPFEEERVRQAVEKSHSAI 119
FDL + KK ++ +A + A+KA E A DY+ KPF+ + + + A+
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---AL 119

Query: 120 SHLSTGEETTNLAREINGKI-----AIQAD-ERIFVIALSDI 155
+ + + A+Q + + +D+
Sbjct: 120 AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDL 161


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
BN424_1618  BLACTAMASEA  29     0.032    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 28.6 bits (64), Expect = 0.032
Identities = 10/49 (20%), Positives = 20/49 (40%), Gaps = 17/49 (34%)

Query: 106 LVSWLLGNIASIDELKKEASNINVVSAKNNLLDVVVPLHWIVADQTGSS 154
L+ W++ + + L+ V+P W +AD+TG+
Sbjct: 203 LLQWMVDDRVA-----------------GPLIRSVLPAGWFIADKTGAG 234


51  BN424_1895  BN424_1899  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias   Product
BN424_1895  1       17       1.657226  GTP-binding protein LepA
BN424_1896  1       17       1.360487  chaperone protein DnaJ
BN424_1897  1       15       0.350985  chaperone protein DnaK
BN424_1898  -2      10       0.357201  grpE family protein
BN424_1899  -1      11       0.610119  heat-inducible transcription repressor HrcA
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits       Score  E-value  Comments
BN424_1895  TCRTETOQM  175    2e-49    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 175 bits (446), Expect = 2e-49
Identities = 99/436 (22%), Positives = 173/436 (39%), Gaps = 83/436 (19%)

Query: 12 RIRNFSIIAHIDHGKSTLADRILEK---TDTVANRDMQAQLLDSMDLERERGITIKLNAV 68
+I N ++AH+D GK+TL + +L + + D D+ LER+RGITI+
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 69 ELTYTAKDGIDYTFHLIDTPGHVDFTYEVSRSLAACEGAILVVDAAQGIEAQTLANVYLA 128
+ + ++IDTPGH+DF EV RSL+ +GAIL++ A G++AQT +
Sbjct: 62 SFQWE-----NTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116

Query: 129 LDNDLEIIPVINKIDLPAADPERVRGEIEDVIG--------------------------- 161
+ I INKID D V +I++ +
Sbjct: 117 RKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176

Query: 162 --IDASDAVLA--------------------------------SAKAGIGIEDILEQIVE 187
I+ +D +L SAK IGI++++E I
Sbjct: 177 TVIEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITN 236

Query: 188 KVPAPTGDLDAPLQALIFDSVYDSYRGVILNVRIMSGVVKSGDKIQMMSNGATFEVADVG 247
K + T + L +F Y R + +R+ SGV+ D +++
Sbjct: 237 KFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKIKITEMYT 296

Query: 248 IFSPKPIKRDFLMVGDVGYITASIKTVQDTRVGDTITLANNPATEALPGYRKMNPMVYCG 307
+ + K D G++ + + +GDT L E P++
Sbjct: 297 SINGELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQRERIENPL------PLLQTT 349

Query: 308 LYPIDSSRYNELREALERLQLNDAALQFE--AETSQALGFGFRCGFLGLLHMDVIQERLE 365
+ P + L +AL + +D L++ + T + + FLG + M+V L+
Sbjct: 350 VEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEII-----LSFLGKVQMEVTCALLQ 404

Query: 366 REFDLDLITTAPSVIY 381
++ +++ P+VIY
Sbjct: 405 EKYHVEIEIKEPTVIY 420



Score = 39.5 bits (92), Expect = 4e-05
Identities = 18/80 (22%), Positives = 30/80 (37%), Gaps = 2/80 (2%)

Query: 410 EPYVKASIMVPNDYVGAVMEIAQRKRGEFITMDYLDEFRVNVIYEIPLSEIVYDFFDKLK 469
EPY+ I P +Y+ A + + + V + EIP I ++ L
Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNN-EVILSGEIPARCI-QEYRSDLT 594

Query: 470 SSTKGYASLDYDLIGYRPSK 489
T G + +L GY +
Sbjct: 595 FFTNGRSVCLTELKGYHVTT 614


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
BN424_1896  cloacin  30     0.014    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.5 bits (68), Expect = 0.014
Identities = 13/32 (40%), Positives = 14/32 (43%)

Query: 69 GHASTDPNFGGGGFGGGGFGGFGGSSAGFGGG 100
G S GG G G GG G G +G GG
Sbjct: 49 GSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_1897  SHAPEPROTEIN  146    5e-41    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 146 bits (369), Expect = 5e-41
Identities = 81/362 (22%), Positives = 141/362 (38%), Gaps = 55/362 (15%)

Query: 2 SKIIGIDLGTTNSAVAVLEGGEAKIIANPEGNRTTPSVV-------SFKNGEIQVGEVAK 54
S + IDLGT N+ + V G I+ N PSVV VG AK
Sbjct: 10 SNDLSIDLGTANTLIYVKGQG---IVLN------EPSVVAIRQDRAGSPKSVAAVGHDAK 60

Query: 55 RQAVTNPNTISSVKRHIGEAGFSIEMEGKKYTAQEISAMILQY-LKGFAEEYLGEKVEKA 113
+ P I++++ M+ ++ +LQ+ +K +
Sbjct: 61 QMLGRTPGNIAAIR----------PMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRV 110

Query: 114 VITVPAYFNDAQRQATKDAGKIAGLEVERIVNEPTAAALAYGLDKTDKDEKVLVFDLGGG 173
++ VP +R+A +++ + AG ++ EP AAA+ GL + +V D+GGG
Sbjct: 111 LVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGL-PVSEATGSMVVDIGGG 169

Query: 174 TFDVSILELGDGVFDVLATAGDNKLGGDDFDNKIIDYMVAEFKKENAIDLSKDKMAVQRL 233
T +V+++ L V + ++GGD FD II+Y+ +
Sbjct: 170 TTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYGSLIG------------- 211

Query: 234 KDAAEKAKKDLS----GVTSTQISLPFITAGEAGPLHLEMNLTRAKFDELTHDLVDRTKV 289
+ AE+ K ++ G +I + E P +N + + L + +
Sbjct: 212 EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLN-SNEILEAL-QEPLTGIVS 269

Query: 290 PVRQALKD-AGLTASDIDE--VILVGGSTRIPAVVEAVKKETNQEPNKSVNPDEVVAMGA 346
V AL+ ASDI E ++L GG + + + +ET + +P VA G
Sbjct: 270 AVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVARGG 329

Query: 347 AI 348

Sbjct: 330 GK 331


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
BN424_1899  TYPE4SSCAGA  29     0.026    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 29.3 bits (65), Expect = 0.026
Identities = 15/45 (33%), Positives = 23/45 (51%)

Query: 232 FVRGRMNILDFNEAMDVEKFKSIYNLMDGNTSLTSLINQSHEGIE 276
F G M +LD D++ L+ N +L+S++ SH GIE
Sbjct: 267 FTLGDMEMLDVEGVADIDPNYKFNQLLIHNNALSSVLMGSHNGIE 311


52  BN424_2380  BN424_2389  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_2380  -1      10       -2.320653  cyclopentanol dehydrogenase
BN424_2381  -2      11       -2.870090  response regulator
BN424_2382  -2      13       -2.116017  HAMP domain protein
BN424_2383  0       13       -1.489672  conserved hypothetical protein
BN424_2384  -1      13       -0.902680  bacterial extracellular solute-binding family
BN424_2385  -1      14       -0.303459  binding--dependent transport system inner
BN424_2386  -2      13       0.631172   binding--dependent transport system inner
BN424_2387  -2      13       0.985333   HAMP domain protein
BN424_2388  -2      12       1.497499   response regulator
BN424_2389  -2      13       1.129499   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_2380  DHBDHDRGNASE  134    2e-40    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 134 bits (337), Expect = 2e-40
Identities = 75/251 (29%), Positives = 117/251 (46%), Gaps = 14/251 (5%)

Query: 4 LKNKVALVTGGTSGIGEKITDCFIAEGATVVVCDINQEALNKAKGKENVVTK-----KLD 58
++ K+A +TG GIGE + ++GA + D N E L K + D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 59 ISSEAEWTQVVQEVIEQFGKIDILANNAGISSDKGLETTTVEEWELQHKINALGPFLGMK 118
+ A ++ + + G IDIL N AG+ + + + EEWE +N+ G F +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 119 AVVPYMKKAGQGSIVNTASYTALVG-AGINGYTGSKGSIRAVSKAAAADLGYFNIRVNSV 177
+V YM GSIV S A V + Y SK + +K +L +NIR N V
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 178 YPGVIETPMSAAVSEYKDAMAQLIQAT--------PLGRIGKPEEVANAILFLASDEASF 229
PG ET M ++ ++ Q+I+ + PL ++ KP ++A+A+LFL S +A
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 230 INGAELVIDGG 240
I L +DGG
Sbjct: 246 ITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score  E-value  Comments
BN424_2381  HTHFIS  80     5e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.9 bits (197), Expect = 5e-18
Identities = 30/139 (21%), Positives = 57/139 (41%), Gaps = 3/139 (2%)

Query: 3 RMLLVDDEYMILAGLQKLIPWQELGIEIVGTAKNGQEALDFVRNNVVDIVISDVTMPLLS 62
+L+ DD+ I L + + G ++ N ++ D+V++DV MP +
Sbjct: 5 TILVADDDAAIRTVLNQAL--SRAGYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GIEFIRQAQSEDIYFHFLILSGYQEFDYVKEGLRMGADNYLIKPVDKVELIETLEKIIKE 122
+ + + + L++S F + GA +YL KP D ELI + + + E
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 123 LNSEAEQLQTQSVLFDYLL 141
+L+ S L+
Sbjct: 122 PKRRPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
BN424_2382  PF06580  188    5e-57    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 188 bits (480), Expect = 5e-57
Identities = 51/209 (24%), Positives = 99/209 (47%), Gaps = 16/209 (7%)

Query: 354 DIYTLEIKQKDAHMRALQSQINPHFLYNTLEYIRMYAVSEGAEELADVVYTFATLLRNN- 412
D + + ++A + AL++QINPHF++N L IR + E + +++ + + L+R +
Sbjct: 150 DQWKMASMAQEAQLMALKAQINPHFMFNALNNIRA-LILEDPTKAREMLTSLSELMRYSL 208

Query: 413 -TSQEKTTTLKKELEFCEKYVYLYQMRYPGNIAYSFAIDSAIENLVIPKFSIQPLIENYF 471
S + +L EL + Y+ L +++ + + I+ AI ++ +P +Q L+EN
Sbjct: 209 RYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGI 268

Query: 472 VHGIDYMRIDNVISVKANIEEDKITILIRDNGKGMSSEKIKDLNQSLMESHSKFGGSIGI 531
HGI + I +K + +T+ + + G SL ++K G+
Sbjct: 269 KHGIAQLPQGGKILLKGTKDNGTVTLEVENTG-------------SLALKNTKESTGTGL 315

Query: 532 LNVNERLRSYFGESYQMCIQETQAHGVTI 560
NV ERL+ +G Q+ + E Q +
Sbjct: 316 QNVRERLQMLYGTEAQIKLSEKQGKVNAM 344


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
BN424_2387  PF06580  32     0.006    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.8 bits (72), Expect = 0.006
Identities = 18/105 (17%), Positives = 37/105 (35%), Gaps = 22/105 (20%)

Query: 359 IILNLLSNALKFTESGGKIQVKVKIKDQFAVLMVEDSGCGIDSTDLEHIFDRFYMADDSR 418
++ N + + + GGKI +K + L VE++G +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG------------------SLAL 304

Query: 419 KQNNEGQGIGLAIVKSIVKA---HDGTVSVSSSVNLGTRFIIQLP 460
K E G GL V+ ++ + + +S ++ +P
Sbjct: 305 KNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score  E-value  Comments
BN424_2388  HTHFIS  93     3e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 93.4 bits (232), Expect = 3e-24
Identities = 33/163 (20%), Positives = 68/163 (41%), Gaps = 10/163 (6%)

Query: 2 KILIVDDEPKILDVIEAYLVVNGHLVYRAETGSQALEKYRVVGPDLIILDWMLPDSSGME 61
IL+ DD+ I V+ L G+ V + DL++ D ++PD + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 VCQEIR-LESSVPIIMLTAKAGEKNIISGLKMGADDYVVKPFSPKELVMRVETVLRRTGY 120
+ I+ +P+++++A+ I + GA DY+ KPF EL+ + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 QAKTQKIKFNDGKLVIDV---------LGKQVFQSNQLVCLTG 154
+ + DG ++ + ++ Q++ + +TG
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITG 167


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
BN424_2389  MECHCHANNEL  28     0.001    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 27.9 bits (62), Expect = 0.001
Identities = 9/32 (28%), Positives = 19/32 (59%)

Query: 28 GMGMFSSSIIDFLLLILLIFVVVKVVQGIRSK 59
G+F ++ DFL++ IF+ +K++ + K
Sbjct: 74 HYGVFIQNVFDFLIVAFAIFMAIKLINKLNRK 105


53  BN424_2727  BN424_2734  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_2727  0       11       -1.123846  sensory box protein
BN424_2728  0       13       -0.574019  alkaline phosphatase synthesis transcriptional
BN424_2729  1       15       -0.057964  putative membrane protein
BN424_2730  2       17       0.558421   nlpC/P60 family protein
BN424_2731  2       14       -0.197709  cell division protein ftsX
BN424_2732  1       12       -0.132934  cell division ATP-binding protein FtsE
BN424_2733  -1      11       -0.049000  peptide chain release factor 2
BN424_2734  0       10       -0.150400  preprotein translocase, SecA subunit
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
BN424_2727  PF06580  34     0.001    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.1 bits (78), Expect = 0.001
Identities = 28/170 (16%), Positives = 63/170 (37%), Gaps = 38/170 (22%)

Query: 427 LSKLEQKQVPLEQELIEVQ-----EAVRSSFRL-VKHKADEKNMNLLLNDADPIYLLGDS 480
L +QV L EL V +++ RL +++ + M++ + P L+
Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQV----PPMLV--- 260

Query: 481 GRLKQIITNLLTNAVSYTEAGGKVEVFVEQSETEATIKISDNGMGIPEAELDRIFERFYR 540
+ ++ N + + ++ GGK+ + + T+++ + G +
Sbjct: 261 ---QTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK------------ 305

Query: 541 VDKARSRNSGGTGLGLSIVKYLVENFNG---TIQVESKLGLGTTFTIILP 587
TG GL V+ ++ G I++ K G +++P
Sbjct: 306 ------NTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score  E-value  Comments
BN424_2728  HTHFIS  93     7e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.6 bits (230), Expect = 7e-24
Identities = 34/151 (22%), Positives = 78/151 (51%), Gaps = 4/151 (2%)

Query: 3 KVLIVDDEESILTLLAFNLEKAGYEVQTAMDGLIGYQLALENQYDFIILDLMMPSMDGME 62
+L+ DD+ +I T+L L +AGY+V+ + ++ D ++ D++MP + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 VCKKLRQEKIETPIMILTAKDDELEKIIGLELGADDYMTKPFSPREVLARMKAIMRRIKP 122
+ ++++ + + P+++++A++ + I E GA DY+ KPF E++ + R
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI----IGRALA 120

Query: 123 ESKEKINESHEASPEEEVVIGELQIFPELYE 153
E K + ++ + S + ++G E+Y
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYR 151


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
BN424_2730  IGASERPTASE  38     6e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 38.1 bits (88), Expect = 6e-05
Identities = 40/193 (20%), Positives = 64/193 (33%), Gaps = 16/193 (8%)

Query: 81 TLKTLTTEVTALNEKIAQREDKLKEQARTIQVNGDTQNYIDFVLSAKSFGDVVGRVDVVS 140
L+ + N ++ +R + I + Q V S S + + RVD
Sbjct: 970 KLRNVNGRYDLYNPEVEKRNQTVD--TTNITTPNNIQAD---VPSVPSNNEEIARVD--E 1022

Query: 141 QMVSANQDLVKEQKSDKEEVASKQKETETKSQEQALLAAKLEATKADLEQQKLEKEAIVA 200
V + ++ SKQ+ + EQ + + ++ K +A
Sbjct: 1023 APVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVA-KEAKSNVKANTQ 1081

Query: 201 TLASEQSGAESEKASFLAKKE--DAEKAAKAIATANAAPVVAVQTSTTAP------AATP 252
T QSG+E+++ KE EK KA V TS +P P
Sbjct: 1082 TNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQP 1141

Query: 253 VAENNAPAAPPVN 265
AE P VN
Sbjct: 1142 QAEPARENDPTVN 1154



Score = 34.7 bits (79), Expect = 7e-04
Identities = 41/228 (17%), Positives = 74/228 (32%), Gaps = 25/228 (10%)

Query: 46 KKDAAQTEIGTITDTIAKNEENSVKLVAEMKETQATLKTLTTEVTALNEKIAQREDKLKE 105
A + + + IA+ +E V A ++ TTE A N K + + E
Sbjct: 1002 NIQADVPSVPSNNEEIARVDEAPVPPPAPATPSE------TTETVAENSKQESKTVEKNE 1055

Query: 106 Q------ARTIQVNGDTQNYIDFVLSAKSFGDVVGRVDVVSQMVSANQDLVKEQKSDKEE 159
Q A+ +V + ++ + + V++++ K E
Sbjct: 1056 QDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVE 1115

Query: 160 VASKQKETETKSQEQALLAAKLEATKADLEQQKLEKEAIVATLASEQSGAESEKASFLAK 219
Q+ + SQ ++ K E ++ Q + +E E + A
Sbjct: 1116 TEKTQEVPKVTSQ----VSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTA----- 1166

Query: 220 KEDAEKAAKAIATANAAPVVAVQTSTT--APAATPVAENNAPAAPPVN 265
D E+ AK ++ PV T T + P A P VN
Sbjct: 1167 --DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVN 1212


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits  Score  E-value  Comments
BN424_2734  SECA  1116   0.0      SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 1116 bits (2887), Expect = 0.0
Identities = 424/904 (46%), Positives = 583/904 (64%), Gaps = 73/904 (8%)

Query: 1 MANFLKQLIEN-DKKEIKSLEKVADKIDAYGDRMAALSDEELQAKTPEFKKRYQAGETLD 59
+ L ++ + + + ++ + KV + I+A M LSDEEL+ KT EF+ R + GE L+
Sbjct: 2 LIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLE 61

Query: 60 DLLPEAFAVVREAAKRVLGLYPYRVQLMGGMTLHKGNIPEMKTGEGKTLTATMPVYLNAL 119
+L+PEAFAVVREA+KRV G+ + VQL+GGM L++ I EM+TGEGKTLTAT+P YLNAL
Sbjct: 62 NLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNAL 121

Query: 120 AGEGVHVVTVNEYLASRDATEMGELYTFLGLTVGLNLNSKSSEEKREAYNADITYSTNNE 179
G+GVHVVTVN+YLA RDA L+ FLGLTVG+NL + KREAY ADITY TNNE
Sbjct: 122 TGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNNE 181

Query: 180 LGFDYLRDNMVVYREQMVQRPLNYAIVDEVDSILIDEARTPLIISGQAEKSTALYTRADF 239
GFDYLRDNM E+ VQR L+YA+VDEVDSILIDEARTPLIISG AE S+ +Y R +
Sbjct: 182 YGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVNK 241

Query: 240 FVKSLT-----------AEADYTIDVQSKTIALTEEGMVKAEKTF-------KVENLYDI 281
+ L E +++D +S+ + LTE G+V E+ + E+LY
Sbjct: 242 IIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYSP 301

Query: 282 DNTALIHHIDQALRANYIMLRDIDYVVQEGEVLIVDQFTGRIMDGRRYSDGLHQAIEAKE 341
N L+HH+ ALRA+ + RD+DY+V++GEV+IVD+ TGR M GRR+SDGLHQA+EAKE
Sbjct: 302 ANIMLMHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAKE 361

Query: 342 GVEIENESKTMANVTFQNFFRMYKKLSGMTGTAKTEQEEFREIYNIQVVEIPTNKPIIRD 401
GV+I+NE++T+A++TFQN+FR+Y+KL+GMTGTA TE EF IY + V +PTN+P+IR
Sbjct: 362 GVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIRK 421

Query: 402 DRPDLLYPTLESKFNAVVEDIKTRHANGQPILVGTVAVETSELLSDLLTKAKIHHEVLNA 461
D PDL+Y T K A++EDIK R A GQP+LVGT+++E SEL+S+ LTKA I H VLNA
Sbjct: 422 DLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLNA 481

Query: 462 KNHFKEAEIIMSAGQKGAVTIATNMAGRGTDIKLGA------------------------ 497
K H EA I+ AG AVTIATNMAGRGTDI LG
Sbjct: 482 KFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKADW 541

Query: 498 -----GVIEAGGLCVIGTERHESRRIDNQLRGRAGRQGDPGVTQFYLSLEDELMKRFGSE 552
V+EAGGL +IGTERHESRRIDNQLRGR+GRQGD G ++FYLS+ED LM+ F S+
Sbjct: 542 QVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFASD 601

Query: 553 RIQAILERLRVQEEDAVIQSKMISRQVESAQKRVEGNNYDTRKNVLEYDDVMREQREIMY 612
R+ ++ +L ++ +A I+ +++ + +AQ++VE N+D RK +LEYDDV +QR +Y
Sbjct: 602 RVSGMMRKLGMKPGEA-IEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIY 660

Query: 613 GQRLEVIMATESLKKITMAMIQRTVNRMVS--VNTQGNKEEWNLQGIHDFATSAIVHEDS 670
QR E ++ + + ++ + + + Q +E W++ G+ + + +
Sbjct: 661 SQRNE-LLDVSDVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLD-- 717

Query: 671 LTVQDLENKTPEEIEALLMKRV----EDIYTTKEQQF-NEQMLEFEKVVILRVVDSKWTD 725
L + + +K PE E L +R+ ++Y KE+ E M FEK V+L+ +DS W +
Sbjct: 718 LPIAEWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKE 777

Query: 726 HIDTMDQLRQGIGLRAYAQTNPLVEYQAEGFKLFEEMIAAIEYDVTRLLMKSEIR----- 780
H+ MD LRQGI LR YAQ +P EY+ E F +F M+ +++Y+V L K ++R
Sbjct: 778 HLAAMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEEV 837

Query: 781 ------QNLQREQVAQGSPARSTGDGDVVEAAKHKPVKNDD-KIGRNDPCPCGSGKKYKN 833
+ ++ E++AQ D AA + + K+GRNDPCPCGSGKKYK
Sbjct: 838 EELEQQRRMEAERLAQMQQLSHQDDDS--AAAAALAAQTGERKVGRNDPCPCGSGKKYKQ 895

Query: 834 CHGK 837
CHG+
Sbjct: 896 CHGR 899


54  BN424_3053  BN424_3058  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_3053  1       13       -0.137500  heme ABC transporter, heme-binding protein isdE
BN424_3054  1       12       -0.184514  sortase B cell surface sorting signal domain
BN424_3055  1       15       -0.526153  heme uptake protein IsdC
BN424_3056  0       15       -0.559562  hypothetical protein
BN424_3057  0       15       -0.668067  hypothetical protein
BN424_3058  0       18       -1.264774  short chain dehydrogenase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_3053  FERRIBNDNGPP  63     1e-13    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 63.4 bits (154), Expect = 1e-13
Identities = 40/193 (20%), Positives = 74/193 (38%), Gaps = 9/193 (4%)

Query: 39 TQEKKEQRVIATTVASAEIMAKLDYPLVGIPTTSK-----ELPKQYKDVVEVGSPMGPDL 93
R++A E++ L G+ T P V++VG P+L
Sbjct: 30 AAAIDPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVIDVGLRTEPNL 89

Query: 94 EIMRTLKPDLVLSTSTLQTDLEDGLTAAKLKT-NFLDFRS-IASMEKEIKTLGEQLNRQS 151
E++ +KP ++ ++ E A + NF D + +A K + + + LN QS
Sbjct: 90 ELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNLQS 149

Query: 152 EAEQLTQSIDQKVGKAQTK-VKQDKKPKVLILMGIPGSYLVVTEKAYIGDLVRLAGGENV 210
AE + + + + VK+ +P +L + P LV + +++ G N
Sbjct: 150 AAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGIPNA 209

Query: 211 ITGQEQEYLASNT 223
G E + S
Sbjct: 210 WQG-ETNFWGSTA 221


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_3054  PRTACTNFAMLY  33     0.004    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 33.1 bits (75), Expect = 0.004
Identities = 17/75 (22%), Positives = 30/75 (40%)

Query: 424 LTTDLNAHVAYQVDMGGGKLYNGQANFRLLFDPTQAVAIPSNEFPSAPEPEKPVTPVDPE 483
+ T L + + + GK+ G +RL + ++ + P AP+P P P+
Sbjct: 528 VQTPLGSAATFTLANKDGKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQ 587

Query: 484 KPTDPEKPINPVKPA 498
P + P PA
Sbjct: 588 PPQPQPEAPAPQPPA 602


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits       Score  E-value  Comments
BN424_3055  AEROLYSIN  28     0.035    Aerolysin signature.
		>AEROLYSIN#Aerolysin signature.

Length = 493

Score = 27.7 bits (61), Expect = 0.035
Identities = 14/33 (42%), Positives = 23/33 (69%), Gaps = 2/33 (6%)

Query: 116 SAPIAAKIKVDIE--NSDLSYHHEYKINLSFDL 146
+ P +KI V IE +D+SY +E+K ++S+DL
Sbjct: 307 TVPARSKIPVKIELYKADISYPYEFKADVSYDL 339


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
BN424_3057  INTIMIN  32     0.024    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 31.6 bits (71), Expect = 0.024
Identities = 58/313 (18%), Positives = 95/313 (30%), Gaps = 41/313 (13%)

Query: 603 STGVKVTTIVKANNQTSEASTIVKNGEIAKTTISKIDNATTF----------VSGTGEPN 652
S V +T V +N Q + + A T +K D V+ P
Sbjct: 539 SNNVLLTITVLSNGQVVDQVGVTDFT--ADKTSAKADGTEAITYTATVKKNGVAQANVPV 596

Query: 653 GAITLSANGTI-LASGKVDSAGKYSFTIAKQAVGVTVLAKVTLNGKESEVSTIVTKASEK 711
+S + S + +GK + T+ G V++ T S + A
Sbjct: 597 SFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEM----TSALNANA--- 649

Query: 712 VVAPVIHDYYITDINAKGTIGGSAKQVAI-YVNGVKKRTAAVTNGSFTIYT--GDLGLTV 768
V+ IT+I A T + Q AI Y V K V+N T T G L +
Sbjct: 650 VIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNST 709

Query: 769 A---GQSFQIAGLFDGVEGPKTT--KIVEARNQLIAPTINDYY-----TTDANVSGTITG 818
+ L G ++ + + AP + + + + GT
Sbjct: 710 EKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVK 769

Query: 819 SAKQVAIFIDGVQKRTAAVNNGKYVIYTGDLGLTTLGKKFQVAGIDGIMVGPKTEATVK- 877
G A+ NGKY + + + ++ + K T+
Sbjct: 770 GKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKE-----KGTTTISV 824

Query: 878 --SKQQLVAPTIN 888
S Q TI
Sbjct: 825 ISSDNQTATYTIA 837



Score = 31.6 bits (71), Expect = 0.025
Identities = 50/345 (14%), Positives = 103/345 (29%), Gaps = 52/345 (15%)

Query: 506 VFTVEDGKNIDTSKPGNYTIQYTVKNSNGNEAQAVTELIVEEKKIVQTTISDLDTTSTTV 565
+ + +G+ +D ++T T ++G EA T + + +++ + V
Sbjct: 546 ITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGV----AQANVPVSFNIV 601

Query: 566 SGLGEPNGLIEVKANQQVIATGTVGSDGKYTIQMPKQSTGVKVTTIVKANNQTS---EAS 622
SG + + GK T+ + G V + A ++ A
Sbjct: 602 SGTAVLS-----------ANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAV 650

Query: 623 TIVKNGEIAKTTISKIDNATTFVSGTGEPNGAITLSANGTILASGKV------------- 669
V + + T I K D T +G + + +++ +V
Sbjct: 651 IFVDQTKASITEI-KADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNST 709

Query: 670 ---DSAGKYSFTIAKQAVGVTVLAKVTLNGKESEVSTIVTKASEKVVAPVIHDYYITDIN 726
D+ G T+ G K ++ + S+V+ V + + D +I
Sbjct: 710 EKTDTNGYAKVTLTSTTPG-----KSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIV 764

Query: 727 AKGTIGGSAKQVAIYVNGVKKRTAAVTNGSFTIYTGDLG----------LTVAGQSFQIA 776
G G Y G A+ NG +T + + +T+ +
Sbjct: 765 GTGVKGKLPTVWLQY--GQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTI 822

Query: 777 GLFDGVEGPKTTKIVEARNQLIAPTINDYYTTDANVSGTITGSAK 821
+ T I + ++ DA + G
Sbjct: 823 SVISSDNQTATYTIATPNSLIVPNMSKRVTYNDAVNTCKNFGGKL 867


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_3058  NUCEPIMERASE  208    2e-67    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 208 bits (530), Expect = 2e-67
Identities = 86/345 (24%), Positives = 143/345 (41%), Gaps = 52/345 (15%)

Query: 3 TILITGGAGFIGSHLV------NHYGTYAKVVVVDNLSMGH-------RENILPSENVVF 49
L+TG AGFIG H+ H +VV +DNL+ + R +L F
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGH-----QVVGIDNLNDYYDVSLKQARLELLAQPGFQF 56

Query: 50 IKEDIGNKKLLNQLFKEFTFDYVFHLAAVANVAESIEFPWSTHLINQDATLLLLEQVKKQ 109
K D+ +++ + LF F+ VF V S+E P + N L +LE +
Sbjct: 57 HKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN 116

Query: 110 KQLLKRFVFASSASVYGDSPHAVQTEGDTV-QPLSPYALDKYASEQFTLMYHRLYGVKTT 168
K ++ ++ASS+SVYG + + D+V P+S YA K A+E Y LYG+ T
Sbjct: 117 K--IQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPAT 174

Query: 169 AVRFFNVYGENQNPNSPYSGVLSLLNQGLKTNENSEYFEFNKYGDGEQTRDFVYVQDVIQ 228
+RFF VYG P+ + +G + Y G+ RDF Y+ D+ +
Sbjct: 175 GLRFFTVYGPWGRPDMALFKFTKAMLEGK---------SIDVYNYGKMKRDFTYIDDIAE 225

Query: 229 ALLLVSE-----------------KEKAIGEVYNIGTGAKTSLNQLLVLSQSLSQKKLCI 271
A++ + + A VYNIG + L + + +
Sbjct: 226 AIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKK 285

Query: 272 QTKLARKGDIMNSLANITKIKE-LGYQPLYSIDKGIVCYWKRTIE 315
+ GD++ + A+ + E +G+ P ++ G+ K +
Sbjct: 286 NMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGV----KNFVN 326


55  BN424_3073  BN424_3078  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias   Product
BN424_3073  -3      10       1.042896  large conductance mechanosensitive channel
BN424_3074  -3      10       1.116959  yhgE/Pip C-terminal domain protein
BN424_3075  -2      9        1.167153  PEP phosphonomutase
BN424_3076  -3      7        1.599200  methylated-DNA-[]-cysteine S-methyltransferase
BN424_3077  -2      8        1.720752  LPXTG-motif cell wall anchor domain protein
BN424_3078  -2      9        1.398002  sugar (and other) transporter family protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
BN424_3073  MECHCHANNEL  122    4e-39    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 122 bits (308), Expect = 4e-39
Identities = 60/136 (44%), Positives = 87/136 (63%), Gaps = 11/136 (8%)

Query: 4 NMLAEFKEFALRGSVLDLAVGVVIGGAFSAIVTSLVTNIITPIIVALTGGSNISDLSIKI 63
+++ EF+EFA+RG+V+DLAVGV+IG AF IV+SLV +II P + L GG + ++ +
Sbjct: 2 SIIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVTL 61

Query: 64 LNAK-------LMYGAFLQSIIDFLIIAFSIFMFIKVINTFVAKMKKPVEEVEEEVEINA 116
+A+ + YG F+Q++ DFLI+AF+IFM IK+IN K+ + EE
Sbjct: 62 RDAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLIN----KLNRKKEEPAAAPAPTK 117

Query: 117 TEEYLKEIRDLLAQQN 132
E L EIRDLL +QN
Sbjct: 118 EEVLLTEIRDLLKEQN 133


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score  E-value  Comments
BN424_3074  GPOSANCHOR  34     0.002    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 34.3 bits (78), Expect = 0.002
Identities = 39/226 (17%), Positives = 77/226 (34%), Gaps = 8/226 (3%)

Query: 203 DLEKNLPQIDTYTQEILDLQTKMPDIKAKLTKANEFVEYLPQ--VNQMTAKISEVNSLMP 260
DLEK L ++ + KA L +E + +N TA +++ +L
Sbjct: 124 DLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEA 183

Query: 261 QLDQTGSLILDLQKNIPQIQNAGRQIAQIDQDFDGIAATLNQGIDEANQALTVIQEVQTI 320
+ + +L+K + N + + + A L + +AL T
Sbjct: 184 EKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTA 243

Query: 321 MPDVIELGQDASQTVESTKEVIAKIQSALEPVKQAIDTGLTILKSVASSIGKLADNLSST 380
I+ + + A+++ ALE +K++ + L +
Sbjct: 244 DSAKIK---TLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADL 300

Query: 381 VVTEDNKAAIIASLTSLSTHLDTANTMLSGLIDNLEKLQEASGSNE 426
E + A+ SL LD + L +KL+E + +E
Sbjct: 301 ---EHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISE 343


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score  E-value  Comments
BN424_3076  FLGHOOKAP1  30     0.019    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 29.5 bits (66), Expect = 0.019
Identities = 13/76 (17%), Positives = 25/76 (32%)

Query: 141 HIRNGESLIDTQLASGFESSNGFRDAFSKTMGDVPQRSKKIQVLSSAWIETKLGSMLAIS 200
+ NG SL+ A + D T+ V + I++ LG +L
Sbjct: 227 TMANGYSLVQGSTARQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFR 286

Query: 201 DEHHLFLLEFVDRVGL 216
+ + ++ L
Sbjct: 287 SQDLDQTRNTLGQLAL 302


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score  E-value  Comments
BN424_3077  SUBTILISIN  131    4e-35    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 131 bits (330), Expect = 4e-35
Identities = 61/239 (25%), Positives = 96/239 (40%), Gaps = 56/239 (23%)

Query: 196 QLANINQVWEEYQLKGEGMVVSIIDTGIDPSHKDLRLSDTTKEKISLEEIQINIAEMGHG 255
++ VW + +G G+ V+++DTG D H D
Sbjct: 27 EMIQAPAVWNQT--RGRGVKVAVLDTGCDADHPD-------------------------- 58

Query: 256 KAFTRKIPYGYNYADNNTTIIDENPTTNMHGMHVAGIAAANGIGADSTTAVLGVAPEAQL 315
+I G N+ D++ + N HG HVAG AA V+GVAPEA L
Sbjct: 59 --LKARIIGGRNFTDDDEGDPEIFKDYNGHGTHVAGTIAATENE----NGVVGVAPEADL 112

Query: 316 LAMKVFS-NSSGAVALNDDVIAAIEDSVKLGADILNMSLGSVAGNRDSNDPVQISIREAA 374
L +KV + SG D +I I +++ DI++MSLG + + ++++A
Sbjct: 113 LIIKVLNKQGSGQY---DWIIQGIYYAIEQKVDIISMSLGGP----EDVPELHEAVKKAV 165

Query: 375 EVGVLSVIAAGNSGLSTSNDTNVAPQNKFGTIDTGTLGSPGVTDEGLTVASLESSVQIS 433
+L + AAGN G T LG PG +E ++V ++ S
Sbjct: 166 ASQILVMCAAGNEGDGDDR--------------TDELGYPGCYNEVISVGAINFDRHAS 210



Score = 75.7 bits (186), Expect = 2e-16
Identities = 43/133 (32%), Positives = 58/133 (43%), Gaps = 20/133 (15%)

Query: 587 GKISDFSSWGPTPSLEFKPEITAPGGQIYSTANQNSYQTNSGTSMAAPFVAGTEALIYQA 646
S+FS+ + ++ APG I ST Y T SGTSMA P VAG ALI Q
Sbjct: 207 RHASEFSNSNN------EVDLVAPGEDILSTVPGGKYATFSGTSMATPHVAGALALIKQL 260

Query: 647 LKAEQSP-LTGLNLIEFAKASLLNTAIPVMDQKHSDVIISPRRQGAGLLQADQAIK-NKV 704
A LT L A L+ IP+ + SP+ +G GLL + +++
Sbjct: 261 ANASFERDLTEPEL----YAQLIKRTIPLGN--------SPKMEGNGLLYLTAVEELSRI 308

Query: 705 YLTDAKTGKASIA 717
+ T G S A
Sbjct: 309 FDTQRVAGILSTA 321


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
BN424_3078  TCRTETB  60     6e-12    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 59.9 bits (145), Expect = 6e-12
Identities = 35/159 (22%), Positives = 71/159 (44%), Gaps = 1/159 (0%)

Query: 21 MDIMFLTFALTSIIADLNVSGAAAGLISSITNVGMLLGGVTFGILADRFGRIKIFTYTIL 80
++ M L +L I D N A+ +++ + +G +G L+D+ G ++ + I+
Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87

Query: 81 IFAFATGAMYFASNIYLVYLF-RFLSGIGAGGEYGIGMAIVAEAFPKEKLGKMTSIVAIT 139
I F + + + + + + RF+ G GA + M +VA PKE GK ++
Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSI 147

Query: 140 GQVGSIIAAIIAAIIIPRFGWNALFLFGLLPVVLTFFIR 178
+G + I +I W+ L L ++ ++ F+
Sbjct: 148 VAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLM 186


56  BN424_3116  BN424_3123  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_3116  -2      11       0.349729   sigma-54 interaction domain protein
BN424_3117  -3      12       1.087935   glycerophosphoryl diester phosphodiesterase
BN424_3118  0       14       0.468029   glycerophosphoryl diester phosphodiesterase
BN424_3119  0       12       0.093917   bacterial extracellular solute-binding family
BN424_3120  0       12       -0.088462  binding--dependent transport system inner
BN424_3121  0       13       0.130080   binding--dependent transport system inner
BN424_3122  0       12       0.527867   ABC transporter family protein
BN424_3123  1       12       -0.767206  major Facilitator Superfamily protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score  E-value  Comments
BN424_3116  HTHFIS  141    1e-37    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 141 bits (357), Expect = 1e-37
Identities = 61/240 (25%), Positives = 117/240 (48%), Gaps = 22/240 (9%)

Query: 74 DNPSLQMMIEQA-RAAILYPPKGLNMLFYGETGVGKSMFANLIYEYACSVKKTTKHYPFI 132
+ ++Q + R L ++ GE+G GK + A +++Y ++ PF+
Sbjct: 142 RSAAMQEIYRVLARLM----QTDLTLMITGESGTGKELVARALHDYG-----KRRNGPFV 192

Query: 133 HFNCSDYANNPQLLMGQLFGVAKNAYTGATEEKKGLIEEANGGMLFLDEIHRLPPEGQEM 192
N + A L+ +LFG K A+TGA G E+A GG LFLDEI +P + Q
Sbjct: 193 AINMA--AIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTR 250

Query: 193 FFTFIDRGLYRRLGESNSESSAEVQILAATTEAPESALLKTFQR-----RI-PMVIKIPN 246
+ +G Y +G + ++V+I+AAT + + ++ + R R+ + +++P
Sbjct: 251 LLRVLQQGEYTTVG-GRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPP 309

Query: 247 LLERTIEERASLITLFFNQEADRLGIPIK-VSQNSMRALLSYNCPNNIGQLKNDIQLACA 305
L +R E + F Q+A++ G+ +K Q ++ + ++ P N+ +L+N ++ A
Sbjct: 310 LRDRA--EDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTA 367


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_3118  ISCHRISMTASE  30     0.008    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 30.0 bits (67), Expect = 0.008
Identities = 16/42 (38%), Positives = 24/42 (57%)

Query: 67 DYTLDELRALTLAERYQQFPLYDRSWELEKIPTLEEVLKLLQ 108
D LD +R +TL E++++ EL + PT+EE KLL
Sbjct: 258 DRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits       Score  E-value  Comments
BN424_3119  MALTOSEBP  43     1e-06    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 43.2 bits (101), Expect = 1e-06
Identities = 86/375 (22%), Positives = 140/375 (37%), Gaps = 41/375 (10%)

Query: 75 VEVKAEFQGTYEESLPKFQSVGGTKDAPTIVQVQEIGTKMMIDSGFIEPMQKFIDADNYD 134
++V E EE KF V T D P I+ SG + I D
Sbjct: 59 IKVTVEHPDKLEE---KFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAE----ITPDKAF 111

Query: 135 TSDLEENIANYYKVDGKFYSMPFNSSTPVMYYNKEAFKKAGLDPENPPQTFEEIEKAGLA 194
L + + +GK + P + YNK+ NPP+T+EEI
Sbjct: 112 QDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLP-------NPPKTWEEIPALDKE 164

Query: 195 IKKSNPAMKGFALQA--YGWLYEELLANQGSLLMNNDNGRSKTPTKVAYDNAAGRSIFEW 252
+K + F LQ + W L+A G +NG+ V DNA ++ +
Sbjct: 165 LKAKGKSALMFNLQEPYFTW---PLIAADGGYAFKYENGKYDI-KDVGVDNAGAKAGLTF 220

Query: 253 AEQMIKDETFANYGTNADNMVAGFINGDVAMFLQSSASAGQVIDGAKFEVGEAYLP-YPE 311
+IK++ N T+ A F G+ AM + + A ID +K G LP +
Sbjct: 221 LVDLIKNKHM-NADTDYSIAEAAFNKGETAMTI-NGPWAWSNIDTSKVNYGVTVLPTFKG 278

Query: 312 KAEREGVVIGGASLWMSKGKETAEQEAAWDFLK-YLATPEVQAEWHVATGYFAINSKAYD 370
+ + V + A + + +E A +FL+ YL T E G A+N
Sbjct: 279 QPSKPFVGVLSAGI----NAASPNKELAKEFLENYLLTDE---------GLEAVNKDKPL 325

Query: 371 EAIVAEAYKKKPQLKVAVEQLQATKTSAATQGALMNMLPEERKIMETALEQVYNGAEIEP 430
A+ ++Y++ ++A + A A +G +M +P+ V N A
Sbjct: 326 GAVALKSYEE----ELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQ 381

Query: 431 TFKAAVEQVNQAIEQ 445
T A++ I +
Sbjct: 382 TVDEALKDAQTRITK 396


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
BN424_3122  PF05272  36     3e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 35.8 bits (82), Expect = 3e-04
Identities = 15/55 (27%), Positives = 20/55 (36%), Gaps = 9/55 (16%)

Query: 34 VLVGPSGCGKSTMLRMIAGLEDISDGTLKIDGEVVNHLPPKERDLAMVFQNYALY 88
VL G G GKST++ + GL+ SD I +D Y
Sbjct: 600 VLEGTGGIGKSTLINTLVGLDFFSDTHFDI---------GTGKDSYEQIAGIVAY 645


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
BN424_3123  TCRTETA  33     0.002    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 33.3 bits (76), Expect = 0.002
Identities = 63/364 (17%), Positives = 132/364 (36%), Gaps = 29/364 (7%)

Query: 40 LVSSGYEIQLVSVIFSLYGLFVAIFSWLTSFFVNIFSVRKVMIAGLIIYLVSAVILIAGI 99
LV S ++ +LY L + + + F R V++ L V I+
Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAP 94

Query: 100 YFELLPVIAVAYTLRGASYPLFAYAFLIWITLRSEFKNLGKATSWFWFSFNLGLTIISPL 159
+ +L + + + GA+ + ++ G + F G+ + P+
Sbjct: 95 FLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFG----FMSACFGFGM-VAGPV 149

Query: 160 LASLLLKFSNSINIL---AVGMVMALLGSFL---SLKVNRDHLPTFNNNKSILYEMQEGI 213
L L+ FS A+ + L G FL S K R L N + G+
Sbjct: 150 LGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGM 209

Query: 214 MILFEYPRLAIGLVVKAINNIGQFGFVIMMPIFLVNHGYSLSQWGIIWATTYVVNSFA-G 272
++ +A+ +++ + + +VI + + GI A +++S A
Sbjct: 210 TVV--AALMAVFFIMQLVGQVPAALWVIFGEDRF---HWDATTIGISLAAFGILHSLAQA 264

Query: 273 ILFGNLGDYYGWRKIVCYFSGTLTALSCFLIGSVVFYFPGNFFLLMLAFIVFSFGIAAFG 332
++ G + G R+ + + G ++ F ++ ++ + G
Sbjct: 265 MITGPVAARLGERRALML------GMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMP 318

Query: 333 PLSALIPAMALEKKTTALSVLNLG-SGLSNFLGPVLVTVLF----QKFDGFFVLAVFAVL 387
L A++ E++ L + L++ +GP+L T ++ ++G + A L
Sbjct: 319 ALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNG-WAWIAGAAL 377

Query: 388 YLLA 391
YLL
Sbjct: 378 YLLC 381


57  BN424_3524  BN424_3531  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
BN424_3524  0       13       1.363977   major Facilitator Superfamily protein
BN424_3525  1       15       1.283080   bacterial regulatory s, gntR family protein
BN424_3526  -1      17       -0.610510  GNAT family acetyltransferase
BN424_3527  -2      17       -0.218445  peptidase propeptide and YPEB domain protein
BN424_3528  -2      14       0.030423   hypothetical protein
BN424_3529  -2      14       0.279775   LPXTG-motif cell wall anchor domain protein
BN424_3530  0       14       -0.618456  DNA-binding domain protein
BN424_3531  0       15       -0.377843  mga helix-turn-helix domain protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
BN424_3524  TCRTETA  37     1e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 36.7 bits (85), Expect = 1e-04
Identities = 34/139 (24%), Positives = 57/139 (41%), Gaps = 7/139 (5%)

Query: 242 GYFLTLYTVIQMLCSFIIPTLMDQFGKMKQWMFFSSGLVFIGAGMIALAPSVVFFVLGII 301
G L LY ++Q C+ ++ L D+FG+ + + S + ++A AP + +G I
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGR-RPVLLVSLAGAAVDYAIMATAPFLWVLYIGRI 104

Query: 302 LAAI-GLGGLFPIAVLLPIRNTKTAEETSLWTSMIQSFGYILGGFMPVLMGAVKDTTGST 360
+A I G G A + I + + S FG + G + LMG S
Sbjct: 105 VAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGF-----SP 159

Query: 361 IAPFLIMVLLSSVLLILSF 379
APF L+ + +
Sbjct: 160 HAPFFAAAALNGLNFLTGC 178


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
BN424_3526  TYPE3OMBPROT  26     0.025    Type III secretion system outer membrane B protein ...
		>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature.

Length = 538

Score = 26.2 bits (57), Expect = 0.025
Identities = 13/38 (34%), Positives = 22/38 (57%)

Query: 55 INSEIVGHGLLSPVIIEQKESSIIEDAVMALAPLAVKK 92
++ +IV LL+P + E S+++D V AL L K+
Sbjct: 272 VDLKIVSTSLLTPTSLTGGEESMLKDQVNALKGLNSKR 309


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
BN424_3529  TONBPROTEIN  32     0.008    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 32.3 bits (73), Expect = 0.008
Identities = 15/58 (25%), Positives = 20/58 (34%)

Query: 55 APEEEAITVPPAETPITPPIPETEQEPLKQDVATPTITDAPTIADPQPEAAPEKKTIP 112
A E V P P+ P PE E P A I P+P+ + + P
Sbjct: 53 ADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQP 110


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
BN424_3531  PF05043  64     6e-13    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 63.8 bits (155), Expect = 6e-13
Identities = 42/180 (23%), Positives = 85/180 (47%), Gaps = 7/180 (3%)

Query: 3 LDSLLSTSTLREITLIKLLNQRYPNWISKEEISSQLYISNRTLKSTVIVINSFFKEKNYE 62
+ LLS + R++ L++LL + W + E++ L + R +K + + S F +
Sbjct: 1 MRDLLSKKSHRQLELLELLFEHKR-WFHRSELAELLNCTERAVKDDLSHVKSAFPD---- 55

Query: 63 QYIETDAALGYKLSASTNAISNLIYLERLKSSTSFNLLLSIYNKKFISSKHFTDYYFISL 122
I + G ++ + ++ ++Y K ST F++L I+ + ++ ++IS
Sbjct: 56 -LIFHSSTNGIRIINTDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISS 114

Query: 123 TSFYKDVRTINLILDK-FGIQFNSREGTLSGESSQIRYFFCKFFWFSYGMSEWPFQQVDE 181
+S Y+ + IN ++ + F + + + G IRYFF ++F Y EWPF+
Sbjct: 115 SSLYRIISQINKVIKRQFQFEVSLTPVQIIGNERDIRYFFAQYFSEKYYFLEWPFENFSS 174



 
Contact Sachin Pundhir for Bugs/Comments.