PredictBias

identification of genomic and pathogenicity islands in prokaryotic genome
 
A) Input parameters
Genome: NC_007530.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
 
C) Potential GIs and PAIs in NC_007530
S.No  Start      End        Bias  Virulence  Insertion elements  Prediction
1     GBAA_0263  GBAA_0283  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_0263  3       32       3.181353   redox-sensing transcriptional repressor Rex
GBAA_0264  2       31       3.806972   lipoprotein
GBAA_0265  0       27       3.588224   CAAX amino terminal protease
GBAA_0266  1       22       2.862462   co-chaperonin GroES
GBAA_0267  0       20       2.396620   molecular chaperone GroEL
GBAA_0268  -1      14       1.257242   GMP synthase
GBAA_0270  0       13       -0.026839  xanthine/uracil permease
GBAA_0271  3       18       -1.367098  DNA-binding response regulator
GBAA_0272  2       18       -1.758410  sensor histidine kinase
GBAA_0273  3       19       -1.388787  hypothetical protein
GBAA_0274  4       19       -1.646478  hypothetical protein
GBAA_0275  4       19       -1.974597  hypothetical protein
GBAA_0276  4       21       -2.079771  hypothetical protein
GBAA_0278  4       21       -2.269433  hypothetical protein
GBAA_0283  3       21       -1.428174  UDP pyrophosphate phosphatase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0265  SSPAMPROTEIN  29     0.007    Salmonella surface presentation of antigen gene typ...
>SSPAMPROTEIN#Salmonella surface presentation of antigen gene type M signature.

Length = 147

Score = 29.3 bits (65), Expect = 0.007
Identities = 14/30 (46%), Positives = 19/30 (63%)

Query: 3 LSSIAGLPLLLKTGLYDNRGFTREEKFQLI 32
+ IAGL LLL T +NR +REE + L+
Sbjct: 43 VEQIAGLKLLLDTLRAENRQLSREEIYALL 72


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0271  HTHFIS        90     8e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.5 bits (222), Expect = 8e-23
Identities = 38/122 (31%), Positives = 60/122 (49%), Gaps = 1/122 (0%)

Query: 1 MAHETILVVDDEKEIRNLITIYLKNEGYKVLQAGDGEEGLRLLEENEVHLVVLDIMMPKV 60
M TILV DD+ IR ++ L GY V + R + + LVV D++MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGIHMCMKIREE-KEMPIIMLSAKTQDMDKILGLTTGADDYVTKPFNPLELIARIKSQLR 119
+ + +I++ ++P++++SA+ M I GA DY+ KPF+ ELI I L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RY 121

Sbjct: 121 EP 122
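
The Score, Expect, and Identities lines in the hit reports follow the usual BLAST text layout, so they can be pulled out with a couple of regular expressions. The following is a minimal Python sketch, not part of PredictBias: the names `HitStats` and `parse_hit_stats` are hypothetical, the sample text is copied from two hits in this report, and the stricter 1e-5 cutoff is only an illustrative assumption (the report itself lists 0.05 as the RPSBlast E-value).

```python
import re
from typing import NamedTuple


class HitStats(NamedTuple):
    bits: float      # bit score from the "Score = ... bits" line
    evalue: float    # value after "Expect ="
    identical: int   # numerator of "Identities = a/b"
    aligned: int     # denominator of "Identities = a/b"


# Patterns for the two summary lines that precede each alignment.
_SCORE = re.compile(r"Score\s*=\s*([\d.]+)\s*bits.*?Expect\s*=\s*([\d.eE+-]+)")
_IDENT = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def parse_hit_stats(report: str) -> list[HitStats]:
    """Collect one HitStats per Score/Identities pair found in the text."""
    hits = []
    pending = None  # (bits, evalue) waiting for its Identities line
    for line in report.splitlines():
        m = _SCORE.search(line)
        if m:
            pending = (float(m.group(1)), float(m.group(2)))
            continue
        m = _IDENT.search(line)
        if m and pending is not None:
            hits.append(HitStats(pending[0], pending[1],
                                 int(m.group(1)), int(m.group(2))))
            pending = None
    return hits


# Sample lines copied from the GBAA_0265 and GBAA_0271 hits in this report.
sample = """\
Score = 29.3 bits (65), Expect = 0.007
Identities = 14/30 (46%), Positives = 19/30 (63%)

Score = 89.5 bits (222), Expect = 8e-23
Identities = 38/122 (31%), Positives = 60/122 (49%), Gaps = 1/122 (0%)
"""

EVALUE_CUTOFF = 0.05   # E-value (RPSBlast) from the input parameters
STRICT_CUTOFF = 1e-5   # assumption: an arbitrary stricter cutoff for comparison

hits = parse_hit_stats(sample)
kept = [h for h in hits if h.evalue <= EVALUE_CUTOFF]
strict = [h for h in hits if h.evalue <= STRICT_CUTOFF]

for h in kept:
    pct = 100 * h.identical / h.aligned
    print(f"{h.bits} bits, E = {h.evalue:g}, identity = {pct:.1f}%")
```

Both sample hits pass the 0.05 cutoff, while only the HTHFIS hit survives the stricter one. Short, low-identity matches such as the 30-residue SSPAMPROTEIN alignment are the ones worth a second look before treating them as virulence evidence.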


2     GBAA_0442  GBAA_0457  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_0442  2       20       0.382682   hypothetical protein
GBAA_0443  1       18       0.450655   hypothetical protein
GBAA_0444  2       17       0.119691   hypothetical protein
GBAA_0445  4       17       0.278365   prophage lambdaba04 transactivating regulatory
GBAA_0446  1       16       -0.063410  hypothetical protein
GBAA_0447  2       22       -0.923205  hypothetical protein
GBAA_0448  1       22       -0.963425  hypothetical protein
GBAA_0450  2       22       -1.555199  hypothetical protein
GBAA_0452  0       22       -1.203799  hypothetical protein
GBAA_0453  2       27       -1.034502  hypothetical protein
GBAA_0454  4       28       -0.692233  hypothetical protein
GBAA_0455  2       25       0.016135   hypothetical protein
GBAA_0456  2       17       -0.229371  hypothetical protein
GBAA_0457  2       17       -0.109514  ArpU family phage transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0442  HTHFIS        27     0.028    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 27.1 bits (60), Expect = 0.028
Identities = 4/20 (20%), Positives = 9/20 (45%)

Query: 18 GISRRVLYMRMYRYGWELQE 37
G++R L ++ G +
Sbjct: 460 GLNRNTLRKKIRELGVSVYR 479


3     GBAA_0791  GBAA_0816  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_0791  3       19       1.417324   PTS system cellobiose-specific transporter
GBAA_0792  1       21       1.552377   PTS system cellobiose-specific transporter
GBAA_0793  1       24       1.939930   PTS system cellobiose-specific transporter
GBAA_0794  3       25       2.224609   hypothetical protein
GBAA_0795  2       22       1.975731   hypothetical protein
GBAA_0796  1       21       1.259934   hypothetical protein
GBAA_0797  -1      15       0.413442   ABC transporter permease
GBAA_0798  1       13       -0.100746  ABC transporter ATP-binding protein
GBAA_0799  0       13       -0.325092  hypothetical protein
GBAA_0800  2       15       -0.361320  ABC transporter permease
GBAA_0801  1       15       -0.275415  ABC transporter permease
GBAA_5742  0       14       0.353256   ABC transporter permease
GBAA_0802  -1      13       -0.384702  branched chain amino acid ABC transporter
GBAA_0803  0       16       -1.378709  spore coat-associated protein JC
GBAA_0804  3       14       -2.551722  spore coat-associated protein JB
GBAA_0805  2       14       -2.411914  spore coat-associated protein JA
GBAA_0806  -1      14       -1.868317  hypothetical protein
GBAA_0807  0       13       -1.551086  hypothetical protein
GBAA_0808  3       14       -1.485327  hypothetical protein
GBAA_0809  5       16       0.367270   hypothetical protein
GBAA_0810  0       12       0.908065   DedA family protein
GBAA_0811  0       13       0.604462   hypothetical protein
GBAA_0812  -1      12       0.199394   hypothetical protein
GBAA_0813  0       10       1.408531   hypothetical protein
GBAA_0814  1       13       1.287969   hypothetical protein
GBAA_0815  0       13       1.099682   purple acid phosphatase/fibronectin
GBAA_0816  2       17       1.441391   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0796  IGASERPTASE   52     2e-09    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 52.4 bits (125), Expect = 2e-09
Identities = 32/143 (22%), Positives = 61/143 (42%), Gaps = 4/143 (2%)

Query: 163 QDTAKAVATTKAREVAETQAKAKAEEATKAREVAEAQAAAKAR-EAAKAQEAAKAQAEAK 221
+ + A T+ EVA++ ++ K + T+ +E A + KA+ E K QE K ++
Sbjct: 1071 EAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVS 1130

Query: 222 AQEAAEAQAAAKAQEAAKAREAAKAQAEAKAQEAAEAREAAKAQK--PATQQPVAKETET 279
++ +++ E A+ + E ++Q A A++ +QPV + T
Sbjct: 1131 PKQ-EQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTV 1189

Query: 280 SAPSSSRELRVVATAYTADPLEN 302
+ +S E T T P N
Sbjct: 1190 NTGNSVVENPENTTPATTQPTVN 1212



Score = 47.4 bits (112), Expect = 8e-08
Identities = 27/139 (19%), Positives = 53/139 (38%), Gaps = 9/139 (6%)

Query: 152 PVVKAEKTTTVQDTAKAVATTKAR-----EVAETQAKAKAEEATKAREVAEAQAAAKARE 206
P +E T TV + +K + T + Q + A+EA K+ A Q A+
Sbjct: 1030 PATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEA-KSNVKANTQTNEVAQS 1088

Query: 207 AAKAQEAAKAQAEAKAQEAAEAQAAAKAQEAAKAREAAKAQAEAKAQEAAEAREAAKAQK 266
++ +E + + A E + AK E K +E K ++ ++ +A+
Sbjct: 1089 GSETKETQTTETKETATV--EKEEKAKV-ETEKTQEVPKVTSQVSPKQEQSETVQPQAEP 1145

Query: 267 PATQQPVAKETETSAPSSS 285
P E + +++
Sbjct: 1146 ARENDPTVNIKEPQSQTNT 1164



Score = 42.7 bits (100), Expect = 2e-06
Identities = 22/119 (18%), Positives = 37/119 (31%), Gaps = 10/119 (8%)

Query: 166 AKAVATTKAREVAETQAKAKAEEATKAREV--------AEAQAAAKAREAAKAQEAAKAQ 217
+ T + + EE + E ++ E +K + +
Sbjct: 994 TTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEK 1053

Query: 218 AEAKAQEAAEAQAAAKAQEAAKAREAAKAQAEAKAQEAAEAREAAKAQKPATQQPVAKE 276
E A E AQ A+EA +A E AQ +E +E + T +E
Sbjct: 1054 NEQDATET-TAQNREVAKEAKSNVKANTQTNE-VAQSGSETKETQTTETKETATVEKEE 1110


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0799  RTXTOXIND     35     5e-04    Gram-negative bacterial RTX secretion protein D signat...
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 34.8 bits (80), Expect = 5e-04
Identities = 22/125 (17%), Positives = 42/125 (33%), Gaps = 18/125 (14%)

Query: 75 ELAVKKGDEVKKGQLLFKYNDPAAKQGVTEAEMQKKIAQKEVTLFQKQIDAAK------- 127
E+ VK+G+ V+KG +L K A+ + + A+ E T +Q + +
Sbjct: 109 EIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPEL 168

Query: 128 -----------QKLQKDKNAGLPAEALKASEIEVQQLESQLEMKKFEVEKSDEMIKAAKE 176
+ + + L E + + Q E L+ K+ E I +
Sbjct: 169 KLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYEN 228

Query: 177 RVNTL 181

Sbjct: 229 LSRVE 233



Score = 31.3 bits (71), Expect = 0.006
Identities = 24/215 (11%), Positives = 73/215 (33%), Gaps = 43/215 (20%)

Query: 92 KYNDPAAKQGVTEAEMQKKIAQKEVTLFQKQIDAAKQKLQKDKNAGLPAEALKASEIEVQ 151
KY + + V ++++++ + E+ +++ Q + + + L+ + +
Sbjct: 260 KYVEAVNELRVYKSQLEQ--IESEILSAKEEYQLVTQLFKNEI-----LDKLRQTTDNIG 312

Query: 152 QLESQLEMKKFEVEKSDEMIKAAKERVNTLSVTSPADGVIDDIVKIADEKTGMSGITLRH 211
L +L +ER + +P + + K G +
Sbjct: 313 LLTLELAK--------------NEERQQASVIRAPVSVKVQQL------KVHTEGGVVTT 352

Query: 212 A----------GPFKVKGQLSEYELASMKVGQEVTVSSKTVAGKTW---TGKVTEIGSTP 258
A +V + ++ + VGQ + + + GKV I
Sbjct: 353 AETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN--- 409

Query: 259 LKSMDENKTVSNYQFTVTLDNSEELQNGFHVYVTS 293
L ++++ + + ++++ + ++ ++S
Sbjct: 410 LDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSS 444


4     GBAA_0880  GBAA_0897  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_0880  2       15       -0.827203  hypothetical protein
GBAA_0881  3       21       -0.507351  hypothetical protein
GBAA_0882  3       21       -0.480578  preprotein translocase subunit SecA
GBAA_0883  2       22       -0.599525  polysaccharide biosynthesis protein CsaA
GBAA_0884  3       22       -0.684122  pyruvyl-transferase
GBAA_0885  2       22       -0.827217  s-layer protein sap
GBAA_0887  0       17       -0.830424  s-layer protein ea1
GBAA_0888  1       18       -0.652283  hypothetical protein
GBAA_0889  1       15       -0.483701  alginate O-acetyltransferase
GBAA_0890  1       13       -0.672618  alginate O-acetyltransferase
GBAA_0891  2       14       0.204824   hypothetical protein
GBAA_0893  0       13       0.967633   hypothetical protein
GBAA_0894  0       14       2.260090   enoyl-CoA hydratase
GBAA_0895  0       17       2.030717   hypothetical protein
GBAA_0896  -2      14       2.556709   hypothetical protein
GBAA_0897  -2      14       3.422482   M20/M25/M40 family peptidase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0882  SECA          900    0.0      SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 900 bits (2327), Expect = 0.0
Identities = 354/829 (42%), Positives = 506/829 (61%), Gaps = 52/829 (6%)

Query: 1 MLNSVKKLLGDSQKRKLKKYEQLVQEINNLEEKLSDLSDEELRHKTITFKDMLRDGKTVD 60
++ + K+ G R L++ ++V IN +E ++ LSDEEL+ KT F+ L G+ ++
Sbjct: 2 LIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLE 61

Query: 61 DIKVEAFAVVREAAKRVLGLRHYDVQLIGGLVLLEGNIAEMPTGEGKTLVSSLPTYVRAL 120
++ EAFAVVREA+KRV G+RH+DVQL+GG+VL E IAEM TGEGKTL ++LP Y+ AL
Sbjct: 62 NLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNAL 121

Query: 121 EGKGVHVITVNDYLAKRDKELIGQVHEFLGLKVGLNIPQIDPSEKKLAYEADITYGIGTE 180
GKGVHV+TVNDYLA+RD E + EFLGL VG+N+P + K+ AY ADITYG E
Sbjct: 122 TGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNNE 181

Query: 181 FGFDYLRDNMAASKNEQVQRPYHFAIIDEIDSVLIDEAKTPLIIAGKKSSSSDLHYLCAK 240
+GFDYLRDNMA S E+VQR H+A++DE+DS+LIDEA+TPLII+G SS+++ K
Sbjct: 182 YGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVNK 241

Query: 241 VIKS-----------FQDTLHYTYDAESKSASFTEDGITKIEDLFDI-------DNLYDL 282
+I FQ H++ D +S+ + TE G+ IE+L ++LY
Sbjct: 242 IIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYSP 301

Query: 283 EHQTLYHYMIQALRAHVAFQCDVDYIVHDEKILLVDIFTGRVMDGRSLSDGLHQALEAKE 342
+ L H++ ALRAH F DVDYIV D ++++VD TGR M GR SDGLHQA+EAKE
Sbjct: 302 ANIMLMHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAKE 361

Query: 343 GLEITEENQTQASITIQNFFRMYPALSGMTGTAKTEEKEFNRVYNMEVMPIPTNRPIIRE 402
G++I ENQT ASIT QN+FR+Y L+GMTGTA TE EF+ +Y ++ + +PTNRP+IR+
Sbjct: 362 GVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIRK 421

Query: 403 DKKDVVYVTADAKYKAVREDVLKHNKQGRPILIGTMSILQSETVARYLDEANITYQLLNA 462
D D+VY+T K +A+ ED+ + +G+P+L+GT+SI +SE V+ L +A I + +LNA
Sbjct: 422 DLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLNA 481

Query: 463 KSAEQEADLIATAGQKGQITIATNMAGRGTDILLG------------------------- 497
K EA ++A AG +TIATNMAGRGTDI+LG
Sbjct: 482 KFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKADW 541

Query: 498 ----EGVHELGGLHVIGTERHESRRVDNQLKGRAGRQGDPGSSQFFLSLEDEMLKRFAQE 553
+ V E GGLH+IGTERHESRR+DNQL+GR+GRQGD GSS+F+LS+ED +++ FA +
Sbjct: 542 QVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFASD 601

Query: 554 EVEKLTKSLKTDETGLILTSKVHDFVNRTQLICEGSHFSMREYNLKLDDVINDQRNVIYK 613
V + + L I V + Q E +F +R+ L+ DDV NDQR IY
Sbjct: 602 RVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIYS 661

Query: 614 LRNNLLQEDTNMIEIIIPMIDHAVEAISKQYLVEGMLPEEWDFASLTASLNEI--LSVEN 671
RN LL + +++ E I + + +A Y+ L E WD L L L +
Sbjct: 662 QRNELL-DVSDVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPI 720

Query: 672 MPSLSANNVHSPEDLQS-VLKETLSLYKERVNELDSNTDLQQSLRYVALHFLDQNWVNHL 730
L E L+ +L +++ +Y+ + + + ++ + V L LD W HL
Sbjct: 721 AEWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEM-MRHFEKGVMLQTLDSLWKEHL 779

Query: 731 DAMTHLKEGIGLRQYQQEDPTRLYQKEALDIFLYTYGNFEKEMCRYVAR 779
AM +L++GI LR Y Q+DP + Y++E+ +F + + E+ +++
Sbjct: 780 AAMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSK 828


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0885  INTIMIN       52     1e-08    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 51.6 bits (123), Expect = 1e-08
Identities = 66/340 (19%), Positives = 116/340 (34%), Gaps = 32/340 (9%)

Query: 190 TVTKAEAAQFIAKTDKQFGTEAAKVESAKAVTTQKVEVKFSKAVEKLTKEDIKVT----- 244
T+T Q + + A SAKA T+ + + + + ++ V+
Sbjct: 545 TITVLSNGQVVDQV--GVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVS 602

Query: 245 -------NKANNDKVLVKEVTLSEDKKSATVELYSNLAAKQTYTVDVNKVGKTEVAVGSL 297
N AN + VTL DK V A+ T ++ N V + S+
Sbjct: 603 GTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAK--TAEMTSALNANAVIFVDQTKASI 660

Query: 298 EAKTIEMADQTVVADEPTALQFTVKDENGTEVVSPEGIEFVTPAAEKINAKGEITLAKGT 357
I+ T VA+ A+ +TVK G + VS + + F T K++ E T G
Sbjct: 661 --TEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTT-TLGKLSNSTEKTDTNGY 717

Query: 358 ST-TVKAVYKKDGKVVAESKEVKVSAEGAAVASISNWTVAEQNKADFTSKDFKQNNKVYE 416
+ T+ + V A +V V + V D + + +
Sbjct: 718 AKVTLTSTTPGKSLVSARVSDVAVDVKAPEV------EFFTTLTIDDGNIEIVGTGVKGK 771

Query: 417 GDNAYVQVELKDQFNAVTTG---KVEYESLNTEVAVVDKATGKVTVLSAGKAPVKVTVKD 473
++Q Q N +G K + S N +A VD ++G+VT+ G + V D
Sbjct: 772 LPTVWLQ---YGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVISSD 828

Query: 474 SKGKELVSKTVEIEAFAQKAMKEIKLEKTNVALSTKDVTD 513
++ T + + + N +
Sbjct: 829 NQTATYTIATPNSLIVPNMSKRVTYNDAVNTCKNFGGKLP 868


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0887  INTIMIN       35     0.002    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 34.7 bits (79), Expect = 0.002
Identities = 43/268 (16%), Positives = 77/268 (28%), Gaps = 34/268 (12%)

Query: 335 VKFVANNLDGSPANIFEGGEATSTTGKLAVGIK----QGDYKVEVQVTKRGGLTVSNTGI 390
+ AN+ S T L+ G V ++ K G + VS
Sbjct: 580 YTATVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTA 639

Query: 391 ITVKNLDTPA-----------SAIKNVVFALDADNDGVVNYGSKLSGKDFALNSQNLVVG 439
L+ A + IK A+ + Y K+ D +++Q +
Sbjct: 640 EMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEV--- 696

Query: 440 EKASLNKLVATIAGEDKVVDPGSISIKSSNHGIISVVNNYITAEAAGEATLTIKVGDVTK 499
+ + + K+ +G V +T+ G++ ++ +V DV
Sbjct: 697 ------------TFTTTLGKLSNSTEKTDTNGYAKVT---LTSTTPGKSLVSARVSDVAV 741

Query: 500 DVKFKVTTDSRKLVSVKANPDKLQVVQNKTLPVTFVTTDQYGDPFGANTAAIKEVLPKTG 559
DVK L N + + LP ++ Q
Sbjct: 742 DVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPA 801

Query: 560 VVAEGGLDVVTTDSGSIGTKTIGVTGND 587
+ + T GT TI V +D
Sbjct: 802 IASVDASSGQVTLKEK-GTTTISVISSD 828


5     GBAA_0925  GBAA_0935  Y     N          N                   Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_0925  2       16       -0.102498  lipoprotein
GBAA_0926  3       18       -1.003362  tellurium resistance protein
GBAA_0927  3       17       -0.694056  hypothetical protein
GBAA_0928  3       28       1.895325   hypothetical protein
GBAA_0929  4       42       4.708571   hypothetical protein
GBAA_0930  5       44       5.087558   hypothetical protein
GBAA_0931  6       42       5.193556   merR family transcriptional regulator
GBAA_0932  5       39       6.215040   hypothetical protein
GBAA_0933  4       40       5.483708   DnaD domain-containing protein
GBAA_0934  2       27       2.841451   replicative DNA helicase
GBAA_0935  3       20       0.614862   hypothetical protein
6     GBAA_0966  GBAA_1007  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_0966  4       19       -6.450858  hypothetical protein
GBAA_0967  6       19       -6.296790  hypothetical protein
GBAA_0971  5       19       -7.349567  CAAX amino terminal protease
GBAA_0972  -1      14       -3.343321  hypothetical protein
GBAA_0973  -1      14       -3.275649  hypothetical protein
GBAA_0975  -1      15       -2.818299  HD domain-containing protein
GBAA_0977  0       13       -1.213837  hypothetical protein
GBAA_0978  0       13       -1.822553  hypothetical protein
GBAA_0981  -2      15       -1.278410  s-layer protein
GBAA_0982  2       19       -2.907062  hypothetical protein
GBAA_0983  4       20       -2.168215  hypothetical protein
GBAA_0984  3       21       -2.256280  hypothetical protein
GBAA_0985  4       20       -2.663166  lipoprotein
GBAA_0986  3       21       -2.127621  hypothetical protein
GBAA_0987  2       20       -1.570142  hypothetical protein
GBAA_0988  1       20       -1.645889  hypothetical protein
GBAA_0989  1       17       -2.594124  hypothetical protein
GBAA_0990  0       15       -2.895659  anti sigma b factor antagonist RsbV
GBAA_0991  -1      12       -3.209928  serine-protein kinase RsbW
GBAA_0992  0       12       -3.451685  RNA polymerase sigma factor SigB
GBAA_0993  -1      12       -4.143337  hypothetical protein
GBAA_0994  -1      13       -3.855033  response regulator
GBAA_0995  -2      13       -3.987677  chemotaxis protein CheR
GBAA_0996  -2      13       -3.434622  sensor histidine kinase/response regulator
GBAA_0997  1       19       -2.258923  hypothetical protein
GBAA_0998  2       17       -1.152537  hypothetical protein
GBAA_0999  1       11       1.154266   hypothetical protein
GBAA_1000  2       15       0.858181   hypothetical protein
GBAA_1001  0       15       1.758171   hypothetical protein
GBAA_1002  -2      16       1.609467   hypothetical protein
GBAA_1003  -1      19       2.948726   hypothetical protein
GBAA_1004  1       20       3.549118   alcohol dehydrogenase
GBAA_1005  3       20       2.306061   hypothetical protein
GBAA_1006  2       20       2.242807   hypothetical protein
GBAA_1007  2       15       1.031675   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0983  ACRIFLAVINRP  28     0.027    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 28.3 bits (63), Expect = 0.027
Identities = 5/35 (14%), Positives = 13/35 (37%)

Query: 155 QGVSFLWSFLFETPFALMRGLAWLFIPAAIVMYLV 189
G+ + W+ + L + +V++L
Sbjct: 852 AGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLC 886


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0994  HTHFIS        83     2e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.0 bits (205), Expect = 2e-19
Identities = 34/125 (27%), Positives = 67/125 (53%), Gaps = 10/125 (8%)

Query: 2 SILIVDDNPVNIFVIKKILKQAGYQDLVSLNSAQELFEYIHFGKDSSRHNEIDLILLDIM 61
+IL+ DD+ V+ + L +AGY D+ ++A L+ +I + DL++ D++
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGY-DVRITSNAATLWRWI-------AAGDGDLVVTDVV 56

Query: 62 MPEIDGLEVCRRLQKEEKFKDIPIIFVTALEDANKLAEALDIGAMDYITKPINKVELLAR 121
MP+ + ++ R++K D+P++ ++A +A + GA DY+ KP + EL+
Sbjct: 57 MPDENAFDLLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114

Query: 122 MRVAL 126
+ AL
Sbjct: 115 IGRAL 119


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0996  HTHFIS        68     6e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 68.3 bits (167), Expect = 6e-14
Identities = 25/107 (23%), Positives = 50/107 (46%), Gaps = 3/107 (2%)

Query: 777 TIMIVDDDHRNIFALQNALKKQHANIITAQNGLECLEILKNNTNIDLILMDIMMPNMDGY 836
TI++ DDD L AL + ++ N + DL++ D++MP+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDG-DLVVTDVVMPDENAF 63

Query: 837 ETMEHIRMNLGLHEIPIIALTAKAMPNDKEKCLSAGASDYISKPLNL 883
+ + I+ ++P++ ++A+ K GA DY+ KP +L
Sbjct: 64 DLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDL 108


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0999  PF07132       29     0.010    Harpin protein (HrpN)
		>PF07132#Harpin protein (HrpN)

Length = 356

Score = 28.9 bits (64), Expect = 0.010
Identities = 22/79 (27%), Positives = 36/79 (45%), Gaps = 6/79 (7%)

Query: 51 SGEKVNSETAHKADIFSATGLVAGGVAGGLGGLLTGLGVLAVSGMGPIVAAGPIAAAIGG 110
G + ++ +DI + + + GGLGG L GLG G ++ G G
Sbjct: 40 FGGQRSNIAEQLSDIMTTMMFMGSMMGGGLGGGLGGLGSSLGGLGGGLLGGG------LG 93

Query: 111 AGIGGGAGSLIGAFIGLGI 129
G+G GS +G+ +G G+
Sbjct: 94 GGLGSSLGSGLGSALGGGL 112


7     GBAA_1038  GBAA_1068  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_1038  2       22       1.671539   EmrB/QacA family drug resistance transporter
GBAA_1039  5       23       1.044263   hypothetical protein
GBAA_1040  6       24       1.316916   UvrD/REP helicase
GBAA_1041  6       26       -0.954223  peptidyl-prolyl isomerase
GBAA_1043  3       17       -0.225183  hypothetical protein
GBAA_1044  3       18       0.298432   hypothetical protein
GBAA_1045  2       18       1.034226   transcriptional regulator Hpr
GBAA_1046  3       17       0.863271   hypothetical protein
GBAA_1047  2       18       0.923286   HIT family protein
GBAA_1048  2       20       1.475361   ABC transporter ATP-binding protein
GBAA_1049  1       19       0.611925   ABC transporter permease
GBAA_1050  0       18       0.611001   hypothetical protein
GBAA_1052  0       16       -0.680240  TetR family transcriptional regulator
GBAA_1053  1       17       -1.353304  hypothetical protein
GBAA_1054  -1      15       -2.803190  hypothetical protein
GBAA_1056  1       18       -3.838410  hypothetical protein
GBAA_1057  2       18       -2.640251  hypothetical protein
GBAA_1059  1       18       -1.245754  hypothetical protein
GBAA_1061  2       16       -0.760648  lipoprotein
GBAA_1063  0       15       1.303715   merR family transcriptional regulator
GBAA_1064  0       15       2.738321   hypothetical protein
GBAA_1065  1       14       2.873542   hypothetical protein
GBAA_1066  -1      14       3.489243   hypothetical protein
GBAA_1068  -2      12       3.068071   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1038  TCRTETB       138    5e-38    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 138 bits (348), Expect = 5e-38
Identities = 91/412 (22%), Positives = 191/412 (46%), Gaps = 13/412 (3%)

Query: 17 NVKRLPILISMIIGAFFTILNETLLNVAFPQLMIELNVTPSTLQWLSTGYMLVVAVLIPA 76
N++ ILI + I +FF++LNE +LNV+ P + + N P++ W++T +ML ++
Sbjct: 9 NLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAV 68

Query: 77 SALLVQWFTTRQVFIGAMVVFTFGTLVSAIA-PGFSILLMGRLLQAAGTGLMMPVLMNTI 135
L +++ + +++ FG+++ + FS+L+M R +Q AG ++M +
Sbjct: 69 YGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVV 128

Query: 136 LLLYPPEKRGAAMGSIGLVIMFAPAIGPTLSGIILETLNWRWLFYIVLPFAIFSIVFAFI 195
P E RG A G IG ++ +GP + G+I ++W +L ++P V +
Sbjct: 129 ARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLM 186

Query: 196 YLKNVSEPTKPKVDVLSILLSTIGFGGIVYGFSSSGEGWDSFQVYGIILIGLVALLFFVL 255
L K D+ I+L ++G + +S +++ +++ L FV
Sbjct: 187 KLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSY--------SISFLIVSVLSFLIFVK 238

Query: 256 RQLKLKEPLLDLSAFKYPMFTLTTILLTIMMMTMFSTMTLLPFLFQGALGLTVYATG-LI 314
K+ +P +D K F + + I+ T+ ++++P++ + L+ G +I
Sbjct: 239 HIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVI 298

Query: 315 MLPGSLLNGLLSPVSGKLFDKFGPRALIIPGTLLLASVMWFFTQVTADTSKITFILLHVT 374
+ PG++ + + G L D+ GP ++ G L SV + +T+ ++ V
Sbjct: 299 IFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFL-SVSFLTASFLLETTSWFMTIIIVF 357

Query: 375 MMVSISMIMMPAQTNGLNQLPKRFYPHGTAILNTLSQVAGAVGVAFFISVMT 426
++ +S T + L ++ G ++LN S ++ G+A +++
Sbjct: 358 VLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1052  HTHTETR       76     2e-19    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 76.2 bits (187), Expect = 2e-19
Identities = 34/170 (20%), Positives = 67/170 (39%), Gaps = 3/170 (1%)

Query: 6 QTSQNIVEASFKLMAEHGIEKMSLSMIAKEVGISKPAIYYHFSSKEALVDFLFEEIFS-- 63
+T Q+I++ + +L ++ G+ SL IAK G+++ AIY+HF K L ++E S
Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70

Query: 64 GYHFVSYFDKEQYTKENFVEKLIADGLHMLSEYEGQEGILRVINEFIVTAARNEKYQKRL 123
G + Y K + + +++ L E + ++ +I Q+
Sbjct: 71 GELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130

Query: 124 FEIQEEFLNGFHELLKKGARLG-VVSQHATEENAHTLALVIDNMSNYMLM 172
+ E + + LK + + T A + I + L
Sbjct: 131 RNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180


8     GBAA_1093  GBAA_1112  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_1093  2       11       -0.516589  S-layer protein
GBAA_1094  3       11       -0.441739  wall-associated protein
GBAA_1095  5       22       -4.723220  hypothetical protein
GBAA_1096  6       23       -2.454283  hypothetical protein
GBAA_1098  3       18       -1.010226  wall-associated domain-containing protein
GBAA_1099  3       13       -1.019240  hypothetical protein
GBAA_1100  4       14       -0.400202  hypothetical protein
GBAA_1103  2       12       0.303966   hypothetical protein
GBAA_1106  1       12       0.145235   hypothetical protein
GBAA_1111  2       13       0.568959   HD domain-containing protein
GBAA_1112  2       13       -0.104250  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1094  PF03544       34     0.005    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 34.2 bits (78), Expect = 0.005
Identities = 16/62 (25%), Positives = 20/62 (32%)

Query: 11 IQLIVVALIVTSVPLNGLAETAPPFTPSPNSEQSPETEKKEEKELPAPHPDQSKKDKAKA 70
I + +VA P P P P E PE K+ + P P K K
Sbjct: 50 ISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK 109

Query: 71 KA 72
K
Sbjct: 110 KV 111


9     GBAA_1131  GBAA_1146  Y     N          N                   Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_1131  2       12       2.253268   malate synthase
GBAA_1132  3       15       1.535041   isocitrate lyase
GBAA_1133  3       16       -1.182682  trifolitoxin immunity domain-containing protein
GBAA_1135  5       19       -1.038572  cold shock protein CspA
GBAA_1136  3       14       1.664645   hypothetical protein
GBAA_1137  3       15       2.682009   hypothetical protein
GBAA_1138  3       15       2.945624   competence transcription factor
GBAA_1139  2       16       3.326639   hypothetical protein
GBAA_1140  2       15       3.299315   signal peptidase I
GBAA_1141  3       16       3.544299   ATP-dependent nuclease subunit B
GBAA_1142  3       15       2.811451   ATP-dependent nuclease subunit A
GBAA_1143  3       24       0.425321   hypothetical protein
GBAA_1144  4       25       0.159852   spore germination protein gerpf
GBAA_1145  5       18       0.197377   spore germination protein GerPE
GBAA_1146  4       19       0.239807   spore germination protein GerPD
10    GBAA_1161  GBAA_1170  Y     N          N                   Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_1161  0       14       -4.510442  hypothetical protein
GBAA_1162  1       16       -3.994395  alpha-amylase
GBAA_1163  3       19       -4.159453  DNA-binding protein
GBAA_1166  2       16       -0.716949  nucleotide-binding protein
GBAA_1167  1       17       -0.279532  hypothetical protein
GBAA_1168  3       22       -0.367898  hypothetical protein
GBAA_1169  3       17       0.357374   peptidyl-prolyl isomerase
GBAA_1170  3       19       1.978044   hypothetical protein
11    GBAA_1207  GBAA_1234  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_1207  2       15       -0.702778  hypothetical protein
GBAA_1208  -1      15       0.444392   hypothetical protein
GBAA_1209  0       13       1.643628   hypothetical protein
GBAA_1210  0       13       1.940298   hypothetical protein
GBAA_1211  -1      13       1.326242   hypothetical protein
GBAA_1212  0       13       1.122729   hypothetical protein
GBAA_1213  0       13       0.880675   inorganic polyphosphate/ATP-NAD kinase
GBAA_1214  4       22       3.339992   ribosomal large subunit pseudouridine synthase
GBAA_1217  4       21       2.054476   bis(5'-nucleosyl)-tetraphosphatase
GBAA_1219  4       20       1.099576   group 2 family glycosyl transferase
GBAA_1220  5       20       1.968405   hypothetical protein
GBAA_1221  5       23       2.125662   bacteriocin O-metyltransferase
GBAA_1222  5       24       2.447711   hypothetical protein
GBAA_1224  1       18       -0.818882  group 2 family glycosyl transferase
GBAA_1225  2       19       -0.382507  hypothetical protein
GBAA_1226  2       21       0.132635   hypothetical protein
GBAA_1227  0       17       0.583519   streptomycin biosynthesis strf domain-containing
GBAA_1228  -1      18       0.660957   glucose-1-phosphate thymidylyltransferase
GBAA_1229  -2      15       0.843585   dTDP-4-dehydrorhamnose 3,5-epimerase
GBAA_1230  -2      16       0.609533   dTDP-glucose 4,6-dehydratase
GBAA_1231  0       17       1.033288   dTDP-4-dehydrorhamnose reductase
GBAA_1232  4       18       1.682127   enoyl-ACP reductase
GBAA_1233  5       18       1.538260   hypothetical protein
GBAA_1234  2       17       0.613480   spore coat protein z
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1230  NUCEPIMERASE  188    1e-59    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 188 bits (478), Expect = 1e-59
Identities = 75/332 (22%), Positives = 141/332 (42%), Gaps = 26/332 (7%)

Query: 1 MNILVTGGAGFIGSNFVHYMLQSYETYKIINFDALT--YSGNLNNVK-SIQDHPNYYFVK 57
M LVTG AGFIG + +L+ ++++ D L Y +L + + P + F K
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLE--AGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHK 58

Query: 58 GEIQNGELLEHVIKERDVQVIVNFAAESHVDRSIENPIPFYDTNVIGTVTLLELVKKYPH 117
++ + E + + + + V S+ENP + D+N+ G + +LE +
Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 118 IKLVQVSTDEVYGSLGKTGRFTEETPLA-PNSPYSSSKASADMIALAYYKTYQLPVIVTR 176
L+ S+ VYG L + F+ + + P S Y+++K + +++A Y Y LP R
Sbjct: 119 QHLLYASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLR 177

Query: 177 CSNNYGPYQYPEKLIPLMVTNALEGKKLPLYGDGLNVRDWLHVTDHCSAIDVVLHKGRV- 235
YGP+ P+ + LEGK + +Y G RD+ ++ D AI +
Sbjct: 178 FFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHA 237

Query: 236 -----------------GEVYNIGGNNEKTNVEVVEQIITLLGKTKKDIEYVTDRLGHDR 278
VYNIG ++ ++ ++ + LG + + + G
Sbjct: 238 DTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGI-EAKKNMLPLQPGDVL 296

Query: 279 RYAINAEKMKNEFDWEPKYTFEQGLQETVQWY 310
+ + + + + P+ T + G++ V WY
Sbjct: 297 ETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1231  NUCEPIMERASE  44     4e-07    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 43.6 bits (103), Expect = 4e-07
Identities = 36/200 (18%), Positives = 70/200 (35%), Gaps = 38/200 (19%)

Query: 4 RVIITGANGQLGKQLQEEL--NPEE----------YDIYPFDKKL------------LDI 39
+ ++TGA G +G + + L + YD+ +L +D+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 40 TNISQVQQVVQEIRPHIIIHCAAYTKVDQAEKERDLAYV-INAIGARNVAVASQLVGAK- 97
+ + + + V + E AY N G N+ + +
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYS-LENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 98 LVYISTDYVFQGDRPEGYDEFHNPA-PINIYGASKYAGEQFVKELHNKYFIVRTSW---- 152
L+Y S+ V+ +R + + P+++Y A+K A E + Y + T
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 153 LYGKYGN------NFVKTMI 166
+YG +G F K M+
Sbjct: 181 VYGPWGRPDMALFKFTKAML 200


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1232  DHBDHDRGNASE  57     7e-12    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 57.0 bits (137), Expect = 7e-12
Identities = 60/259 (23%), Positives = 105/259 (40%), Gaps = 19/259 (7%)

Query: 4 LQGKTFVVMGVANQRSIAWGIARSLHNAGAKLI-FTYAGERLERNVRELADTLEGQESLV 62
++GK + G A + I +AR+L + GA + Y E+LE+ V L E + +
Sbjct: 6 IEGKIAFITGAA--QGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSL--KAEARHAEA 61

Query: 63 LPCDVTNDEELTACFETIKQEVGTIHGVAHCIAFANRDDLKGEFVDTSRDGFLLAQNISA 122
P DV + + I++E+G I + + G S + + ++++
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLR----PGLIHSLSDEEWEATFSVNS 117

Query: 123 FSLTAVAREAKKVMT--EGGNILTLTYLGGERVVKNYNVMGVAKASLEASVKYLANDLGQ 180
+ +R K M G+I+T+ + +KA+ K L +L +
Sbjct: 118 TGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAE 177

Query: 181 HGIRVNAISAGPIRT-----LSAKGVGDFNSILREIEE---RAPLRRTTTQEEVGDTAVF 232
+ IR N +S G T L A G I +E PL++ ++ D +F
Sbjct: 178 YNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLF 237

Query: 233 LFSDLARGVTGENIHVDSG 251
L S A +T N+ VD G
Sbjct: 238 LVSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1233  IGASERPTASE   30     0.009    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.6 bits (66), Expect = 0.009
Identities = 10/53 (18%), Positives = 28/53 (52%)

Query: 48 DRKEESNRNENVVSSAVEEVIEQEEQQQEQEQEQEEQVEEKTEEEEQVQEQQE 100
++ +SN N ++ V + + ++ Q E ++ VE++ + + + ++ QE
Sbjct: 1069 AKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQE 1121



Score = 29.3 bits (65), Expect = 0.009
Identities = 23/80 (28%), Positives = 34/80 (42%), Gaps = 6/80 (7%)

Query: 29 LELAAPKIKRIILTNFENEDRKEESNRNENVVSSAVEEVIEQEEQQQEQEQEQEE----- 83
+ A +K TN E E+ + + V ++E+ + E E+ QE
Sbjct: 1069 AKEAKSNVKANTQTN-EVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTS 1127

Query: 84 QVEEKTEEEEQVQEQQEPVR 103
QV K E+ E VQ Q EP R
Sbjct: 1128 QVSPKQEQSETVQPQAEPAR 1147


12    GBAA_1265  GBAA_1287  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_1265  0       19       3.209358   hypothetical protein
GBAA_1266  -1      19       3.473762   hypothetical protein
GBAA_1267  -1      20       3.470235   hypothetical protein
GBAA_1268  0       22       3.195335   hypothetical protein
GBAA_1269  2       26       3.216052   dihydrolipoamide succinyltransferase
GBAA_1270  0       22       2.467706   2-oxoglutarate dehydrogenase E1
GBAA_1271  -1      24       -3.284564  DNA-binding protein
GBAA_1272  2       19       -2.689337  hypothetical protein
GBAA_1273  3       18       -2.861349  hypothetical protein
GBAA_1274  5       19       -2.771744  hypothetical protein
GBAA_1275  4       19       -2.348612  hypothetical protein
GBAA_1276  4       17       -2.536006  hypothetical protein
GBAA_1277  3       18       -1.436792  hypothetical protein
GBAA_1278  0       22       -1.350979  hypothetical protein
GBAA_1279  -3      20       -0.488172  hypothetical protein
GBAA_1280  -1      19       -0.375430  hypothetical protein
GBAA_1281  -1      15       -1.129708  *hypothetical protein
GBAA_1282  2       15       -1.412659  hypothetical protein
GBAA_1284  1       15       -1.308450  hypothetical protein
GBAA_1286  2       12       -1.342480  D-alanyl-D-alanine carboxypeptidase
GBAA_1287  5       11       -1.629271  signal peptidase I
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1274  FIMBRIALPAPF  30     0.002    Escherichia coli: P pili tip fibrillum papF protein...
		>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature.

Length = 167

Score = 29.7 bits (66), Expect = 0.002
Identities = 33/127 (25%), Positives = 50/127 (39%), Gaps = 21/127 (16%)

Query: 6 IFFLLTCLLLVASTTYIICNKREQV--PPMLVWEGQEYYVTNEPAKAEEVGQRLGEVTKK 63
I LLT + ++A + N R V PP + GQ V E V GEVTK
Sbjct: 8 ISLLLTSVAVLAD---VQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGEVTKN 64

Query: 64 IETSEKPIKN--------------SESNIVQEKTEVFTM-IEEEKGPHSPLIIKEPDGEE 108
I S P K+ ++N++ F + + + KG +PL + G
Sbjct: 65 ISIS-CPYKSGSLWIKVTGNTMGVGQNNVLATNITHFGIALYQGKGMSTPLTLGNGSGNG 123

Query: 109 YRIVRAM 115
YR+ +
Sbjct: 124 YRVTAGL 130


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1276  HTHFIS        28     0.009    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 28.3 bits (63), Expect = 0.009
Identities = 10/36 (27%), Positives = 17/36 (47%), Gaps = 1/36 (2%)

Query: 69 EDRLKHLPEGSHQTVVIDVRGPDETG-EILKQIREE 103
+ + G VV DV PDE ++L +I++
Sbjct: 37 ATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKA 72


13  GBAA_1573  GBAA_1580  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_1573  2       20       -4.178543  Holliday junction-specific endonuclease
GBAA_1574  3       24       -3.921169  hypothetical protein
GBAA_1575  13      21       -1.353304  hypothetical protein
GBAA_1577  7       21       -1.505348  hypothetical protein
GBAA_1578  6       19       -1.700621  hypothetical protein
GBAA_1579  3       19       0.456125   hypothetical protein
GBAA_1580  2       19       0.561590   hypothetical protein
14  GBAA_1603  GBAA_1618  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_1603  -2      17       -3.363022  hypothetical protein
GBAA_1605  -1      16       -3.606817  cation transporter
GBAA_1606  0       17       -3.807188  5'-3' exonuclease
GBAA_1607  0       18       -5.862991  hypothetical protein
GBAA_1608  0       19       -5.259306  acyltransferase
GBAA_1609  0       18       -4.568057  chain length determinant protein
GBAA_1612  4       15       -0.043856  capsular polysaccharide biosynthesis
GBAA_1613  4       14       -0.134885  polysaccharide biosynthesis protein
GBAA_1614  4       15       0.050073   hypothetical protein
GBAA_1615  3       16       0.709203   group 2 family glycosyl transferase
GBAA_1617  3       17       0.824405   hypothetical protein
GBAA_1618  3       19       1.137105   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1618  CHLAMIDIAOM6  39     4e-04    Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature.

Length = 547

Score = 39.3 bits (91), Expect = 4e-04
Identities = 24/63 (38%), Positives = 37/63 (58%), Gaps = 7/63 (11%)

Query: 1130 SNTVSTQINLANVVIVKQVDLTIAD---VGQPITYTIALANPGNTPANNVVVTDILPPGT 1186
+ +V+T IN V QV + AD V +P+ Y I+++NPG+ +VVV D L PG
Sbjct: 305 TASVTTVINEPCV----QVSIAGADWSYVCKPVEYVISVSNPGDLVLRDVVVEDTLSPGV 360

Query: 1187 TLV 1189
T++
Sbjct: 361 TVL 363



Score = 37.4 bits (86), Expect = 0.002
Identities = 73/382 (19%), Positives = 135/382 (35%), Gaps = 101/382 (26%)

Query: 1956 YTVLLENIGNTTATNIIFTDPIPNHTVFIEDSVRVGGILLPGVNPANGIPIGDIIAGDFI 2015
Y + + N G TA N++ +P+P+ G +GD+ G+
Sbjct: 229 YKINIVNQGTATARNVVVENPVPD------------GYAHSSGQRVLTFTLGDMQPGE-- 274

Query: 2016 NITFRVQVVSIPNPIFTIGPGGPNSPVVNGASINYQFMTGPNLPLASRSTTSNPVSTQIN 2075
+ T V+ + N A+++Y G + AS +T N Q++
Sbjct: 275 HRTITVEFCPLKR-----------GRATNIATVSY---CGGHKNTASVTTVINEPCVQVS 320

Query: 2076 SGEIALVKSVDKTFVTIGDTLSYSISLSNPGNVTSQNIIFTDVLPEGITFISGTLTNDSG 2135
+ D ++V + Y IS+SNPG++ ++++ D L G+T +
Sbjct: 321 ------IAGADWSYVC--KPVEYVISVSNPGDLVLRDVVVEDTLSPGVTVLEA------- 365

Query: 2136 TQQIGNPATGIQIGNINPGSTATIVINALVTNIPSINPISNFSSVQFAHVVDPSQPSVSQ 2195
G A I N +V + +NP S+Q+ +V P
Sbjct: 366 -----------------AG--AQISCNKVVWTVKELNP---GESLQYKVLVRAQTPG--- 400

Query: 2196 TNLSNTVSTTIKSAILTTTKSADKSV------------------ISVGDTITYTTTITNT 2237
+N V S T T A+ + + VG+ Y +TN
Sbjct: 401 -QFTNNVVVKSCSDCGTCTSCAEATTYWKGVAATHMCVVDTCDPVCVGENTVYRICVTNR 459

Query: 2238 GNTAAANI----KFT-------SAIPANTTFIPNSVTINGVQQSGVQPAL--GVNIPNIA 2284
G+ N+ KF+ + P T N+V + + + G + + V + ++
Sbjct: 460 GSAEDTNVSLMLKFSKELQPVSFSGPTKGTITGNTVVFDSLPRLGSKETVEFSVTLKAVS 519

Query: 2285 PGETV-TVTFQVNVLSVPSSSS 2305
G+ + L+VP S +
Sbjct: 520 AGDARGEAILSSDTLTVPVSDT 541



Score = 35.8 bits (82), Expect = 0.005
Identities = 36/160 (22%), Positives = 62/160 (38%), Gaps = 30/160 (18%)

Query: 2884 IVYSVTITNSGNVNATNVIFTDVIPDGTSFEPNSFTLNGTIIENANIITGVPIGDIAPNE 2943
+VY + I N G A NV+ + +PDG + L T +GD+ P E
Sbjct: 227 VVYKINIVNQGTATARNVVVENPVPDGYAHSSGQRVLTFT------------LGDMQPGE 274

Query: 2944 SAI--VEFHITSNEIPAINPITNQASVSFQHIVNPANPPVSKNITSNSVTTTIESAILTT 3001
VEF TN A+VS+ + + SVTT I +
Sbjct: 275 HRTITVEFCPLKR-----GRATNIATVSY----------CGGHKNTASVTTVINEPCVQV 319

Query: 3002 TKIGDKAFATIGDTITYTTTITNIGNIPANNVIFSDPIPS 3041
+ I ++ + + Y +++N G++ +V+ D +
Sbjct: 320 S-IAGADWSYVCKPVEYVISVSNPGDLVLRDVVVEDTLSP 358



Score = 33.9 bits (77), Expect = 0.018
Identities = 35/180 (19%), Positives = 69/180 (38%), Gaps = 27/180 (15%)

Query: 4316 IAVKTTPIQYADLQTIIPYTISITNNGNIQVENIIVTDIIPANTNFIENSVIVNGNTRPN 4375
I VK + A L+ + Y I+I N G N++V + +P +G +
Sbjct: 211 ICVKQEGPENACLRCPVVYKINIVNQGTATARNVVVENPVP------------DGYAHSS 258

Query: 4376 DNPLSGIPIDNILPNTTATVLFQVRVTSIPQT-NPISNTSTIEYEYTVGDQPPITKTIIS 4434
+ + ++ P T+ V P +N +T+ Y + +T I
Sbjct: 259 GQRVLTFTLGDMQPGEHRTIT----VEFCPLKRGRATNIATVSYCGGHKNTASVTTVINE 314

Query: 4435 SAALTEINHANLNSNKAVDLAFAMVGDTLTYTITLNQTGNVAANDVIIQDMIPQGTTFIE 4494
I A+ ++ V + Y I+++ G++ DV+++D + G T +E
Sbjct: 315 PCVQVSIAGAD----------WSYVCKPVEYVISVSNPGDLVLRDVVVEDTLSPGVTVLE 364


15  GBAA_1651  GBAA_1673  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_1651  2       19       -3.024269  hypothetical protein
GBAA_1652  1       21       -3.387128  permease
GBAA_1653  0       17       -2.933360  glyoxalase
GBAA_1654  -2      13       -2.511036  hypothetical protein
GBAA_1655  -2      14       -2.913136  hypothetical protein
GBAA_1656  -2      10       -1.595550  host factor-I protein
GBAA_1657  1       7        -1.274240  hypothetical protein
GBAA_1658  1       10       -1.250831  flagellar motor protein MotP
GBAA_1659  2       9        -1.598670  flagellar motor protein MotS
GBAA_1660  2       7        -1.950082  chemotaxis response regulator
GBAA_1662  3       10       -2.056279  flagellar motor switch protein
GBAA_1663  5       13       -3.381643  hypothetical protein
GBAA_1664  3       14       -3.056847  hypothetical protein
GBAA_1665  2       12       -3.514778  chemotaxis protein CheR
GBAA_1666  2       14       -3.539065  hypothetical protein
GBAA_1667  2       15       -3.080386  hypothetical protein
GBAA_1668  2       17       -2.964283  hypothetical protein
GBAA_1669  1       18       -1.994290  flagellar hook-associated protein FlgK
GBAA_1671  2       19       -2.611333  flagellar capping protein
GBAA_1672  3       21       -1.868891  flagellar protein FliS
GBAA_1673  2       17       -1.346412  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1652  TCRTETA       47     9e-08    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 46.7 bits (111), Expect = 9e-08
Identities = 56/292 (19%), Positives = 109/292 (37%), Gaps = 17/292 (5%)

Query: 59 LPQLLLSPFIGGVVDRFSKKNIMIFTDITRGILVLTYILASYK-IEIIFIANICLSVLSC 117
L Q +P +G + DRF ++ +++ + G V I+A+ + +++I I ++ ++
Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLA--GAAVDYAIMATAPFLWVLYIGRI-VAGITG 110

Query: 118 LFEPAKQATLKNIVHENHFVTANSLSSTMNGFMSIMGASLGGIIAQ-SLHIEFAF--LVN 174
A + +I + S GF + G LGG++ S H F +N
Sbjct: 111 ATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALN 170

Query: 175 SLSYFISAYFIYSMCIPSHNTCNKKKAFLTDIKDGYTYILQTKIILTLILVGISWGLIGG 234
L++ + + ++ + + + ++ L+ V L+G
Sbjct: 171 GLNFLTGCFLLPESHKGERRPLRREA---LNPLASFRWARGMTVVAALMAVFFIMQLVGQ 227

Query: 235 AYQLLLTIYAEKIFH---TNIGILYTVQGA-GLMIGSLLVNLYISHNKEKIKKAFGWACF 290
L I+ E FH T IGI G + +++ + E+ G
Sbjct: 228 VPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIAD 287

Query: 291 LQGVFFLGFILSSQLIFGLTTLLCMRIAGGIIVPLDTTLLQTYTRENMIGKV 342
G L F + F + LL +GGI +P +L E G++
Sbjct: 288 GTGYILLAFATRGWMAFPIMVLL---ASGGIGMPALQAMLSRQVDEERQGQL 336


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1659  OMPADOMAIN    63     6e-14    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 63.4 bits (154), Expect = 6e-14
Identities = 30/127 (23%), Positives = 56/127 (44%), Gaps = 17/127 (13%)

Query: 110 SVVIVDNLIFDTGDANVKPEAKEIISQLVGFFQSVPNP---IVVEGHTDSRPIHNDKFPS 166
+ +++F+ A +KPE + + QL ++ +VV G+TD +D +
Sbjct: 214 HFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRI--GSDAY-- 269

Query: 167 NWELSSARAANMIHHLIEVYNVDDKRLAAVGYADTKPVVPN---------DSPQNWEKNR 217
N LS RA +++ +LI + +++A G ++ PV N +R
Sbjct: 270 NQGLSERRAQSVVDYLIS-KGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDR 328

Query: 218 RVVIYIK 224
RV I +K
Sbjct: 329 RVEIEVK 335


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1660  HTHFIS        83     9e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.6 bits (204), Expect = 9e-22
Identities = 28/112 (25%), Positives = 46/112 (41%), Gaps = 2/112 (1%)

Query: 4 KILVVDDAMFMRTMIKNLLKSNSEFEVIGEAENGVEAIQKYKELQPDIVTLDITMPEMDG 63
ILV DD +RT++ L + + N + D+V D+ MP+ +
Sbjct: 5 TILVADDDAAIRTVLNQAL--SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 64 LEALKEIIKIDASAKVVICSAMGQQGMVLDAIKGGAKDFIVKPFQADRVIEA 115
+ L I K V++ SA + A + GA D++ KPF +I
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1662  FLGMOTORFLIN  56     1e-11    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 55.7 bits (134), Expect = 1e-11
Identities = 23/71 (32%), Positives = 40/71 (56%)

Query: 473 DTSILQNVEMNVKFVFGSTVKTIQDILSLQENEAVVLDEDIDEPIRIYVNDVLVAYGELV 532
D ++ ++ + + G T TI+++L L + V LD EP+ I +N L+A GE+V
Sbjct: 53 DIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVV 112

Query: 533 NVDGFFGVKVT 543
V +GV++T
Sbjct: 113 VVADKYGVRIT 123


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1663  IGASERPTASE   34     0.002    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.9 bits (77), Expect = 0.002
Identities = 18/126 (14%), Positives = 51/126 (40%), Gaps = 1/126 (0%)

Query: 301 EQKTEEDKKIEEPENEDKLENKLEDKKVTEKQEDSKVEISLPEEKTPVVQIPKKEEKVND 360
+ EE K+E + ++ + + E+ E + + E P V I + + + N
Sbjct: 1105 TVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNT 1164

Query: 361 LIKEPLKEKEKITYVIKEPLTDNKEVNKTKAQKDKDNNNQVISKKKEKKEEPEEKKEAKS 420
+ ++ + +++P+T++ VN + + N + + E K + +
Sbjct: 1165 TADTE-QPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRH 1223

Query: 421 EQGIQA 426
+ +++
Sbjct: 1224 RRSVRS 1229



Score = 29.3 bits (65), Expect = 0.045
Identities = 26/109 (23%), Positives = 44/109 (40%), Gaps = 7/109 (6%)

Query: 22 LQSKAEEQNVP-EQNINEV-NVQEENKEVQEQLEQVEMKQDKEEQQEAKNEQETEKKIET 79
+K + NV NEV E KE Q + + E++++AK ETEK E
Sbjct: 1067 EVAKEAKSNVKANTQTNEVAQSGSETKETQTT--ETKETATVEKEEKAK--VETEKTQEV 1122

Query: 80 DQGVITVNKPELKVGEEVLVTIEPKEKNVQSIKGILRLPKNGDQYEQER 128
+ V + P+ + E V EP +N ++ + + E+
Sbjct: 1123 PK-VTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQ 1170


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1669  FLGHOOKAP1    104    3e-26    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 104 bits (260), Expect = 3e-26
Identities = 72/249 (28%), Positives = 112/249 (44%), Gaps = 14/249 (5%)

Query: 4 SDYNTPLSGLLAAQMGLQTTKQNLSNIHTPGYVRQMVNYGSAGASQGYSPEQKIGYGVQT 63
S N +SGL AAQ L T N+S+ + GY RQ A ++ G +G GV
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLG--AGGWVGNGVYV 59

Query: 64 LGVDRITDEVKTKQFNDQLSQLSYYNYMNSTLSRVESMVGTTGKNSLSSLMDGFFNAFRE 123
GV R D T Q +Q S +S++++M+ T+ SL++ M FF + +
Sbjct: 60 SGVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTS-SLATQMQDFFTSLQT 118

Query: 124 VAKNPEQPNYYDTLISETGKFTSQVNRLAKSLDTAEAQTTEDIEAHVNEFNRLAGSLAEA 183
+ N E P LI ++ +Q + L + Q I A V++ N A +A
Sbjct: 119 LVSNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASL 178

Query: 184 NKKI----GQAGTQVPNQLLDERDRIITEMSKYANIEVS---YESMNPNIASVRMNGVLT 236
N +I G PN LLD+RD++++E+++ +EVS + N +A NG
Sbjct: 179 NDQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMA----NGYSL 234

Query: 237 VNGQDTYPL 245
V G L
Sbjct: 235 VQGSTARQL 243



Score = 54.2 bits (130), Expect = 6e-10
Identities = 19/51 (37%), Positives = 35/51 (68%)

Query: 380 LLEGIQQEKMGIEGVNMEEEMVNLMAFQKYFVANSKAITTMNEVFDSLFSI 430
++ + ++ I GVN++EE NL FQ+Y++AN++ + T N +FD+L +I
Sbjct: 495 VVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


16  GBAA_1687  GBAA_1725  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_1687  1       16       -5.353721  hypothetical protein
GBAA_1692  0       17       -4.979721  hypothetical protein
GBAA_1693  4       21       -5.117530  group 2 family glycosyl transferase
GBAA_1696  3       22       -4.752876  hypothetical protein
GBAA_1697  0       16       -3.116754  hypothetical protein
GBAA_1698  -1      13       -2.021914  TPR/glycosyl transferase domain-containing
GBAA_1701  3       14       0.597573   hypothetical protein
GBAA_1703  3       18       1.302330   hypothetical protein
GBAA_1704  3       16       1.326116   hypothetical protein
GBAA_1706  2       18       1.090407   flagellin
GBAA_1707  4       25       0.812815   Slt family transglycosylase
GBAA_1710  3       24       -0.388765  flagellar motor switch protein
GBAA_1711  4       20       -0.248808  hypothetical protein
GBAA_1712  3       17       -0.361893  flagellar biosynthesis protein FliP
GBAA_1713  2       14       -0.394593  flagellar biosynthesis protein FliQ
GBAA_1714  1       12       -0.287071  flagellar biosynthesis protein FliR
GBAA_1715  0       9        0.068243   flagellar biosynthetic protein FlhB
GBAA_1716  1       9        0.416021   flagellar biosynthesis protein FlhA
GBAA_1719  0       10       0.201918   flagellar basal body rod protein FlgG
GBAA_1720  -1      12       -0.288201  alanyl-tRNA synthetase
GBAA_1721  0       13       -0.861335  hypothetical protein
GBAA_1722  3       14       -1.702225  AzlC family protein
GBAA_1723  1       14       -2.854559  hypothetical protein
GBAA_1724  -1      14       -2.508932  hypothetical protein
GBAA_1725  0       14       -3.343344  TetR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1698  SYCDCHAPRONE  41     2e-06    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 41.5 bits (97), Expect = 2e-06
Identities = 23/101 (22%), Positives = 34/101 (33%), Gaps = 11/101 (10%)

Query: 444 DNEQIQLALIREDIRQLINQGMISQAKYLISEYEKTFPITSEIYQMKGIVAFSENNYLDA 503
D ++ QLA+ + G IS T E + Y DA
Sbjct: 7 DTQEYQLAME-----SFLKGGGTIAMLNEISSD------TLEQLYSLAFNQYQSGKYEDA 55

Query: 504 ENFFKLALKLYHFDVDALFNLGYLYEVQEQYDRAVQNYNLA 544
F+ L H+D LG + QYD A+ +Y+
Sbjct: 56 HKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYG 96


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1706  FLAGELLIN     125    9e-35    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 125 bits (314), Expect = 9e-35
Identities = 76/282 (26%), Positives = 130/282 (46%), Gaps = 18/282 (6%)

Query: 1 MRINTNINSMRTQEYMRQNQDKMNVSMNRLSSGKRINSAADDAAGLAIATRMRARQSGLE 60
INTN S+ TQ + ++Q ++ ++ RLSSG RINSA DDAAG AIA R + GL
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 61 KASQNTQDGMSLIRTAESAMNSVSNILTRMRDIAVQSSNGTNTAENQSALQKEFAELQEQ 120
+AS+N DG+S+ +T E A+N ++N L R+R+++VQ++NGTN+ + ++Q E + E+
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121

Query: 121 IDYIAKNTEFNDKNLLAGTGAVTIGSTSISGAEISIETLDSSATNQQITIKLANTTAEKL 180
ID ++ T+FN +L+ + I + G + ITI L + L
Sbjct: 122 IDRVSNQTQFNGVKVLSQDNQMKIQVGANDG--------------ETITIDLQKIDVKSL 167

Query: 181 GIDATTSN----ISISGAASALAAISALNTALNTVAGNRATLGATLNRLDRNVENLNNQA 236
G+D N ++ S+ ++ +T R + + D + ++
Sbjct: 168 GLDGFNVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKV 227

Query: 237 TNMASAASQIEDADMAKEMSEMTKFKILNEAGISMLSQANQT 278
A+ D ++ K + A
Sbjct: 228 YVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 86.3 bits (213), Expect = 4e-21
Identities = 62/259 (23%), Positives = 107/259 (41%), Gaps = 7/259 (2%)

Query: 36 INSAADDAAGLAIATRMRARQSGLEKASQNTQDGMSLIRTAESAMNSVSNILTRMRDIAV 95
+ AG A A + G ++ G++ ++ + + T + V
Sbjct: 249 LFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKV 308

Query: 96 QSSNGTNTAENQSALQKEFAELQEQID-YIAKNTEFNDKN------LLAGTGAVTIGSTS 148
+ TA + + + F+DK L + S
Sbjct: 309 TLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGES 368

Query: 149 ISGAEISIETLDSSATNQQITIKLANTTAEKLGIDATTSNISISGAASALAAISALNTAL 208
+ T +++ + K G+ + + + S ++++++AL
Sbjct: 369 KITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSAL 428

Query: 209 NTVAGNRATLGATLNRLDRNVENLNNQATNMASAASQIEDADMAKEMSEMTKFKILNEAG 268
+ V R++LGA NR D + NL N TN+ SA S+IEDAD A E+S M+K +IL +AG
Sbjct: 429 SKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAG 488

Query: 269 ISMLSQANQTPQMVSKLLQ 287
S+L+QANQ PQ V LL+
Sbjct: 489 TSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1707  PF06580       29     0.021    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.7 bits (64), Expect = 0.021
Identities = 8/42 (19%), Positives = 20/42 (47%), Gaps = 1/42 (2%)

Query: 122 LTKKY-NIQKIRSSNEGKYEDIIDRVSHTYGIPKTLIQKMIE 162
+ Y + I+ + ++E+ I+ +P L+Q ++E
Sbjct: 224 VVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVE 265


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1710  FLGMOTORFLIN  59     2e-14    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 58.8 bits (142), Expect = 2e-14
Identities = 22/94 (23%), Positives = 51/94 (54%)

Query: 13 LEDFAGKRNEASKAHIDTVSDISIELGVKLGKASITLGDVKQLKVGDVLEVEKNLGHKVD 72
+ G + ID + DI ++L V+LG+ +T+ ++ +L G V+ ++ G +D
Sbjct: 39 FQQLGGGDVSGAMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLD 98

Query: 73 VYLSNMKVGIGEAIVMDEKFGIIISEIEADKKQA 106
+ ++ + GE +V+ +K+G+ I++I ++
Sbjct: 99 ILINGYLIAQGEVVVVADKYGVRITDIITPSERM 132


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1712  FLGBIOSNFLIP  164    2e-52    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 164 bits (417), Expect = 2e-52
Identities = 75/239 (31%), Positives = 136/239 (56%), Gaps = 2/239 (0%)

Query: 14 FVFSIVFSIIFVNPAYAAQNGFINFENGKEFTSN--SSVQLFALVTLLSLSSSIVLLFTH 71
+ + + P AQ I + + VQ +T L+ +I+L+ T
Sbjct: 4 LLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMMTS 63

Query: 72 FTYFMIVLGITRQGLGVMNLPPNQVLVGLALFLSLFTMQPVLGQLKSDVWDPMTKEKITV 131
FT +IV G+ R LG + PPNQVL+GLALFL+ F M PV+ ++ D + P ++EKI++
Sbjct: 64 FTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKISM 123

Query: 132 SQAAETTAPIMKEYMSKHTYKHDLKMMLKVRGEELPKDLKDLSLFTLVPSFTLTQIQKGL 191
+A E A ++E+M + T + DL + ++ + + + + L+P++ ++++
Sbjct: 124 QEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKTAF 183

Query: 192 LTGMFIYLAFVFIDLIISTLLMYLGMMMVPPMILSLPFKILIFVYLGGYTKIVDIMFKT 250
G I++ F+ IDL+I+++LM LGMMMVPP ++LPFK+++FV + G+ +V + ++
Sbjct: 184 QIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQS 242


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1713  TYPE3IMQPROT  42     1e-08    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 41.7 bits (98), Expect = 1e-08
Identities = 15/81 (18%), Positives = 35/81 (43%)

Query: 4 SPIIDIFQTFFYKGVMILMPVAGVSMIVVIIIAVIMAMMQIQEQTLTFLPKMASIVLVII 63
++ Y +++ V+ I+ +++ + + Q+QEQTL F K+ + L +
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 ILGPWMFQELTTLILDLFDKI 84
+L W + L + +
Sbjct: 62 LLSGWYGEVLLSYGRQVIFLA 82


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1714  TYPE3IMRPROT  96     7e-26    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 96.0 bits (239), Expect = 7e-26
Identities = 51/233 (21%), Positives = 113/233 (48%), Gaps = 1/233 (0%)

Query: 10 FFAFCRITSFLYFLPFFSGRSIPAMAKVTFGLALSITVADQVDVSHIKTVWDVAA-YAGT 68
F+ R+ + + P S RS+P K+ + ++ +A + + + A A
Sbjct: 17 FWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFFALWLAVQ 76

Query: 69 QIVIGLSLSKIVEMLWNIPKMAGHILDFDIGLSQASLFDVNAGSQSTLLSTIFDIFFLII 128
QI+IG++L ++ + + AG I+ +GLS A+ D + +L+ I D+ L++
Sbjct: 77 QILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLALLL 136

Query: 129 FISLGGINYFVATILKSFQYTEAISKLLTTSFLDSLLATLLFAITSAVEIALPLMGSLFI 188
F++ G + ++ ++ +F + L ++ +L + + +ALPL+ L
Sbjct: 137 FLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALPLITLLLT 196

Query: 189 INFVLILIAKNAPQLNVFMNAYVIKITCGILFIAMSVPMLGYVFKNMTDVLLE 241
+N L L+ + APQL++F+ + + +T GI +A +P++ +++ +
Sbjct: 197 LNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFN 249


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1715  TYPE3IMSPROT  289    2e-98    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 289 bits (742), Expect = 2e-98
Identities = 92/343 (26%), Positives = 186/343 (54%), Gaps = 2/343 (0%)

Query: 4 DNKTEKATPQKRKKSREEGNIARSKDLNNLFSILVLAVVVYFFGDWLGFEIANSVSVLFD 63
KTE+ TP+K + +R++G +A+SK++ + I+ L+ ++ D+ + + + +
Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62

Query: 64 QIGKNTDS--TEYFYMMGILLLKVSAPILILVYAFHLFNYMIQVGFLFSSKVIKPKASRI 121
Q + + + + P+L + + ++++Q GFL S + IKP +I
Sbjct: 63 QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKI 122

Query: 122 NPKNYFTRLFSRKSLVDILKSLFYMGLIGYVAYVLFKKNLEKIVSMIGFNWTASLTEIIR 181
NP R+FS KSLV+ LKS+ + L+ + +++ K NL ++ + + +
Sbjct: 123 NPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQ 182

Query: 182 QIKFIFLAILIILIVLSIIDFIYQKWEYEQDIKMKKEEVKQEHKDNEGDPQVKGKRKNFM 241
++ + + + +V+SI D+ ++ ++Y +++KM K+E+K+E+K+ EG P++K KR+ F
Sbjct: 183 ILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQFH 242

Query: 242 HAILQGTIAKKMDGATFIVNNPTHISVVLRYNKHVDAAPIVVAKGEDELALYIRTLAREQ 301
I + + + ++ +V NPTHI++ + Y + P+V K D +R +A E+
Sbjct: 243 QEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEE 302

Query: 302 EIPMVENRPLARSLYYQVEEDETIPEDLYVAVIEVMRYLIQTN 344
+P+++ PLAR+LY+ D IP + A EV+R+L + N
Sbjct: 303 GVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQN 345


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1719  FLGHOOKAP1    28     0.033    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 28.4 bits (63), Expect = 0.033
Identities = 11/47 (23%), Positives = 24/47 (51%)

Query: 203 NGVGTVKNYMLENSNVDMTKEMADLMTDQRMISASQRVMTSFDKIYE 249
N V + N S V++ +E +L Q+ A+ +V+ + + I++
Sbjct: 494 NVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 540


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1720  DPTHRIATOXIN  28     0.039    Diphtheria toxin signature.
		>DPTHRIATOXIN#Diphtheria toxin signature.

Length = 567

Score = 27.8 bits (61), Expect = 0.039
Identities = 26/113 (23%), Positives = 49/113 (43%), Gaps = 16/113 (14%)

Query: 63 EQGEIVHYIKDGAQVKLGPVKLEINWERRHNLMRHHSLLHLIGAVVYEKYGALCTGNQIY 122
E V YI + Q K V+LEIN+E R + +YE C GN++
Sbjct: 174 EGSSSVEYINNWEQAKALSVELEINFETRGKRGQD---------AMYEYMAQACAGNRVR 224

Query: 123 PDKA------RIDFNELQELSSVEVEGIVKEVNKLIEQNKEISTRYMSREEAE 169
+D++ +++ + ++E + KE + + E + +S E+A+
Sbjct: 225 RSVGSSLSCINLDWDVIRDKTKTKIESL-KEHGPIKNKMSESPNKTVSEEKAK 276


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1725  HTHTETR       74     1e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 73.9 bits (181), Expect = 1e-18
Identities = 35/191 (18%), Positives = 70/191 (36%), Gaps = 33/191 (17%)

Query: 1 MAKPN----VVNKEKLLQAAKEIIAEHGMEKLTLKAVAESAQVTQGTVYYHFKTKDQLLL 56
MA+ ++ +L A + ++ G+ +L +A++A VT+G +Y+HFK K L
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 57 EVTEAFCKASWEQIGKDVQLEKALQSAESRCVKDSMYHHLFFQLVASGLQNDAMKDKIGG 116
E+ + S IG+ +A + V + H+ S + + + +
Sbjct: 61 EI----WELSESNIGELELEYQAKFPGDPLSVLREILIHVL----ESTVTEERRRLLMEI 112

Query: 117 LLHYENQQ--------------------LTRVLNKNI-GGTMTSQISTETWSVLCNALID 155
+ H + + L I + + + T +++ I
Sbjct: 113 IFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYIS 172

Query: 156 GLALQALFNPS 166
GL LF P
Sbjct: 173 GLMENWLFAPQ 183


17  GBAA_1835  GBAA_1847  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_1835  2       17       0.866601   sodium-dependent transporter
GBAA_1836  2       21       1.455505   polysaccharide deacetylase
GBAA_1837  2       21       2.493054   hypothetical protein
GBAA_1838  2       20       2.769456   hypothetical protein
GBAA_1840  0       19       4.035570   fibronectin-binding protein
GBAA_1842  -1      19       5.305885   dehydrogenase
GBAA_1843  -2      13       2.565089   hypothetical protein
GBAA_1844  -2      12       3.089921   hypothetical protein
GBAA_1846  -2      11       3.797821   peptide methionine sulfoxide reductase
GBAA_1847  -2      11       3.309984   short chain dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1840  PF07299       334    e-120    Fibronectin-binding protein (FBP)
		>PF07299#Fibronectin-binding protein (FBP)

Length = 219

Score = 334 bits (859), Expect = e-120
Identities = 209/213 (98%), Positives = 212/213 (99%)

Query: 1 MEAFIRSDQYNFIKSQAYILANGHATANDRGVIQALKSLAIEKIIHVFENLTDEQKELID 60
MEAFIRSDQYNFIKSQAYILANGHATANDRGVIQALKSLAIEKIIHVFENLTDEQKELID
Sbjct: 7 MEAFIRSDQYNFIKSQAYILANGHATANDRGVIQALKSLAIEKIIHVFENLTDEQKELID 66

Query: 61 TVLTVQNREDAESFLTKINPYVIPFQEVTAQTLKKLFPKAKKLKLPDMEEMDMKEISYLS 120
TVLTVQNREDAESFL KINPYVIPFQEVTAQTLKKLFPKAKKLKLPDMEE+DMKE+SYLS
Sbjct: 67 TVLTVQNREDAESFLLKINPYVIPFQEVTAQTLKKLFPKAKKLKLPDMEELDMKELSYLS 126

Query: 121 WVDKGSSRKFIIAKNDKNKFVGLQGTFQSLNKKSICSLCHGHEEVGMFLVEIKGDIPGTF 180
W+DKGSSRKFIIAKNDKNKFVGLQGTFQSLNKKSICSLCHGHEEVGMFLVEIKGDIPGTF
Sbjct: 127 WIDKGSSRKFIIAKNDKNKFVGLQGTFQSLNKKSICSLCHGHEEVGMFLVEIKGDIPGTF 186

Query: 181 VKKGNYICKDGVACNQNMKSLDKLQDFIERLKK 213
VKKGNYICKDGVACNQNMKSLDKLQDFIERLKK
Sbjct: 187 VKKGNYICKDGVACNQNMKSLDKLQDFIERLKK 219


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1847  DHBDHDRGNASE  88     5e-23    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 88.2 bits (218), Expect = 5e-23
Identities = 68/263 (25%), Positives = 121/263 (46%), Gaps = 21/263 (7%)

Query: 2 LKGKVALVTGASRGIGRAIAKRLANDGALV-AIHYGNRKEEAEETVYEIQSNGGSAFSIG 60
++GK+A +TGA++GIG A+A+ LA+ GA + A+ Y K E + + ++ AF
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP-- 63

Query: 61 ANLESLHGVEALYSSLDNELQNRTGSTKFDILINNAGIGPGAFIEETTEQFFDRMVSVNA 120
A++ ++ + + ++ E+ DIL+N AG+ I +++ ++ SVN+
Sbjct: 64 ADVRDSAAIDEITARIEREMG------PIDILVNVAGVLRPGLIHSLSDEEWEATFSVNS 117

Query: 121 KAPFFIIQQALSRLRD--NSRIINISSAATRISLPDFIAYSMTKGAINTMTFTLAKQLGA 178
F + + D + I+ + S + AY+ +K A T L +L
Sbjct: 118 TGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAE 177

Query: 179 RGITVNAILPGFVKTDMNAELLSDP---------MMKQYATTISAFNRLGEVEDIADTAA 229
I N + PG +TDM L +D ++ + T I +L + DIAD
Sbjct: 178 YNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIP-LKKLAKPSDIADAVL 236

Query: 230 FLASPDSRWVTGQLIDVSGGSCL 252
FL S + +T + V GG+ L
Sbjct: 237 FLVSGQAGHITMHNLCVDGGATL 259


18  GBAA_2022  GBAA_2039  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_2022  2       24       -1.425590  spermidine acetyltransferase
GBAA_2024  5       25       2.115474   hypothetical protein
GBAA_2025  6       22       0.579871   hypothetical protein
GBAA_2026  5       19       0.452385   hypothetical protein
GBAA_2027  6       20       -0.474510  hypothetical protein
GBAA_2028  3       20       -1.253382  hypothetical protein
GBAA_2029  3       20       -0.901557  hypothetical protein
GBAA_2031  0       17       -3.084244  hypothetical protein
GBAA_2032  -2      17       -1.480180  hypothetical protein
GBAA_2033  -1      16       -0.988479  hypothetical protein
GBAA_2035  0       13       -0.021391  adhesion lipoprotein
GBAA_2036  0       13       -0.133274  hypothetical protein
GBAA_2037  0       13       -0.368998  hypothetical protein
GBAA_2038  1       14       -0.144293  NADPH dehydrogenase NamA
GBAA_2039  3       18       -0.568064  methylated-DNA--protein-cysteine
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2035  adhesinb      214    4e-70    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 214 bits (547), Expect = 4e-70
Identities = 75/319 (23%), Positives = 137/319 (42%), Gaps = 20/319 (6%)

Query: 3 KRLTIFSFLLIFTLIFTGCSNTKEGNAKKDGKLTVYTTIFPLADFAKKIGGDYVTVEAIY 62
K+ LL+ + CS+ K KL V T +AD K I GD + + +I
Sbjct: 2 KKCRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSIV 61

Query: 63 PPGADSHTFEPSQKQTVKVAKADLFVYNGAELE-----PFAEKMEKSLQKENVKIVNASK 117
P G D H +EP + K ++ADL YNG LE F + +E + +KEN S+
Sbjct: 62 PVGQDPHEYEPLPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVSE 121

Query: 118 GIELRTSTEEEHHDHGDGHKEDEHHHDKDPHIWLDPTLAMKQAEKIKNALVALQPDHKQE 177
G+++ + DPH WL+ + A+ I L P +K+
Sbjct: 122 GVDVIYLEGQSEKGKE------------DPHAWLNLENGIIYAQNIAKRLSEKDPANKET 169

Query: 178 FEKNFAALQTKFTDLDDQFKAVVAN--AKTKDILVSHAAYGYWEQRYGLKQIAIAGISAS 235
+EKN A K + LD + K N + K I+ S + Y+ + Y + I I+
Sbjct: 170 YEKNLKAYVEKLSALDKEAKEKFNNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTE 229

Query: 236 DEPSQKQLADITKTVKEHNLKYILFETFSTPKVASVIQKETGTKVLRLNHLATISEDDAK 295
+E + Q+ + + +++ + + E+ + + K+T + +++E +
Sbjct: 230 EEGTPDQIKTLVEKLRKTKVPSLFVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEE 289

Query: 296 NNKDYFTLMEENVNTLKEA 314
+ Y+++M+ N+ + E
Sbjct: 290 GD-SYYSMMKYNLEKIAEG 307


19  GBAA_2083  GBAA_2103  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_2083  3       14       -1.302339  glycosyl transferase
GBAA_2084  2       15       -1.311202  hypothetical protein
GBAA_2085  3       15       -1.126308  GntR family transcriptional regulator
GBAA_2086  2       16       -2.186264  acetyltransferase
GBAA_2088  2       16       -2.384647  hypothetical protein
GBAA_2089  2       19       -1.958736  acetyltransferase
GBAA_2091  2       24       -2.646665  acetyltransferase
GBAA_2092  4       26       -3.384770  hypothetical protein
GBAA_2093  5       21       -4.855077  hypothetical protein
GBAA_2094  6       23       -5.666892  hypothetical protein
GBAA_2095  4       20       -3.167001  hypothetical protein
GBAA_2096  4       17       -2.675682  hypothetical protein
GBAA_2097  2       17       -3.404083  hypothetical protein
GBAA_2098  1       15       -2.727001  hypothetical protein
GBAA_2099  1       15       -1.928929  hypothetical protein
GBAA_2101  0       17       -0.156642  hypothetical protein
GBAA_2102  -1      17       0.471092   hypothetical protein
GBAA_2103  2       18       1.196510   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2086  SACTRNSFRASE  37     6e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.2 bits (86), Expect = 6e-06
Identities = 25/103 (24%), Positives = 42/103 (40%), Gaps = 5/103 (4%)

Query: 27 SREEASSLFQKMKEENYKLFSLRNEENEVVSLAGVAICTNFYNEKHVFVYDLVTAEAHRS 86
E+ ++EE F E N + + I +N+ + + D+ A+ +R
Sbjct: 49 QYEDDDMDVSYVEEEGKAAFLYYLENNCI---GRIKIRSNW--NGYALIEDIAVAKDYRK 103

Query: 87 KGYGNVLLSYVEKWGKEKGCSSIVLTSAFPRIDAHRFYEREGF 129
KG G LL +W KE ++L + I A FY + F
Sbjct: 104 KGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146

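The per-hit statistics in these alignment blocks (bit score, E-value, identities, positives, gaps) follow a fixed textual pattern, so they can be pulled out mechanically. The following is a minimal Python sketch, assuming the two summary lines appear exactly as printed in this report; the function name `parse_hsp_summary` and the returned dictionary keys are hypothetical and are not part of PredictBias or BLAST.

```python
import re

# Patterns for the two summary lines that precede each alignment in this report.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_hsp_summary(score_line, identities_line):
    """Return bit score, raw score, E-value and match counts for one alignment."""
    match = SCORE_RE.search(score_line)
    if match is None:
        raise ValueError("not a 'Score = ... bits' line: %r" % score_line)
    stats = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "evalue": float(match.group(3)),
    }
    # Each entry is stored as (count, alignment_length).
    for name, count, length in FRACTION_RE.findall(identities_line):
        stats[name.lower()] = (int(count), int(length))
    # The report omits "Gaps" when an alignment has none; record that as zero.
    if "gaps" not in stats and "identities" in stats:
        stats["gaps"] = (0, stats["identities"][1])
    return stats


# Summary lines copied from the GBAA_2086 / SACTRNSFRASE hit above.
example = parse_hsp_summary(
    "Score = 37.2 bits (86), Expect = 6e-06",
    "Identities = 25/103 (24%), Positives = 42/103 (40%), Gaps = 5/103 (4%)",
)
```

Storing counts as (count, alignment length) pairs rather than the printed percentages avoids depending on how the report rounds them.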

ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2089  PF05616  29  0.017  Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 28.6 bits (63), Expect = 0.017
Identities = 16/42 (38%), Positives = 23/42 (54%)

Query: 43 YFSFSMQEYSVYKEKMQTRLKEEPLSNLIIENNGQVIGTVGF 84
+ SFS+Q S YKE+M + EE LS + N + I G+
Sbjct: 220 FISFSLQGNSKYKEEMDAKKLEEILSLKVDANPDKYIKATGY 261


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2098  PF06580  24  0.027  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 24.4 bits (53), Expect = 0.027
Identities = 5/30 (16%), Positives = 12/30 (40%)

Query: 15 IRLQKEFDTLEAFLNVEVSTFGDWITETVD 44
+ L E ++++L + F D +
Sbjct: 216 VSLADELTVVDSYLQLASIQFEDRLQFENQ 245


20  GBAA_2191  GBAA_2213  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
GBAA_2191       4       15  -1.209244  hypothetical protein
GBAA_2192       2       15  -0.927697  hypothetical protein
GBAA_2193       1       17  -1.526930  hypothetical protein
GBAA_2197       1       19  -0.769388  hypothetical protein
GBAA_2198       2       19  -0.839731  hypothetical protein
GBAA_2199       1       21  -2.920553  hypothetical protein
GBAA_2201       2       18  -2.493502  resolvase site-specific recombinase
GBAA_2202       6       20  -4.021353  IS110 family transposase orfa
GBAA_2203       3       19  -1.913493  IS110 family transposase orfb
GBAA_2204       2       18  -2.990180  hypothetical protein
GBAA_2205       1       18  -2.591021  hypothetical protein
GBAA_2206      -1       18  -2.842666  hypothetical protein
GBAA_2207      -2       16  -3.154503  hypothetical protein
GBAA_2210      -2       15  -2.871694  hypothetical protein
GBAA_2211      -2       15  -3.551091  sodium/solute symporter family protein
GBAA_2212       0       17  -3.339120  DNA-binding response regulator
GBAA_2213      -1       16  -3.277890  sensor histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2193  SYCDCHAPRONE  29  0.019  Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 29.1 bits (65), Expect = 0.019
Identities = 20/109 (18%), Positives = 39/109 (35%), Gaps = 6/109 (5%)

Query: 131 REIDKENNEAAYLLASANFRIGKYQEAVQNFEQALANNAKGIEPYKKDAMRDLAVSHMKM 190
EI + E Y LA ++ GKY++A + F+ ++ Y L M
Sbjct: 29 NEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCV-----LDHYDSRFFLGLGACRQAM 83

Query: 191 KEFEKAEDVIVKMSTKTNEDKAIVSYLKGQLSTATVQLEKAESFFKEAI 239
+++ A + ++ + + +L +AES A
Sbjct: 84 GQYDLAIHSYSYGAIMDIKE-PRFPFHAAECLLQKGELAEAESGLFLAQ 131


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2198  RTXTOXIND  39  2e-05  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 39.4 bits (92), Expect = 2e-05
Identities = 27/175 (15%), Positives = 66/175 (37%), Gaps = 30/175 (17%)

Query: 5 LEIKVKPEQLEQIAKNISEMQTHSQNIQQNLN--QSMFSIQMQWQGATSQHFY----GEY 58
L + K + + I+ + S+ + L+ S+ + A ++H +Y
Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLH-----KQAIAKHAVLEQENKY 261

Query: 59 MRSMRLMESYIRNLQVTEKELRRIAQKFRQADEEYQKKQNEKLKEAHKK--EKKNEKSWW 116
+ ++ + Y L+ E E+ ++++ + ++ + +KL++ E +
Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELA-- 319

Query: 117 EKGIEGAAEFIGVNDAIRAVTGKDPITG--KELS--TKERLIAAGWTLLNFVPVG 167
E IRA P++ ++L T+ ++ TL+ VP
Sbjct: 320 ------KNEERQQASVIRA-----PVSVKVQQLKVHTEGGVVTTAETLMVIVPED 363


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2212  HTHFIS  103  6e-28  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 103 bits (259), Expect = 6e-28
Identities = 31/115 (26%), Positives = 62/115 (53%)

Query: 4 RILIVEDEEKIARVVQLELEFEGYESEIAKTGTEAMEKFGNGNWDLILLDVMLPNISGLE 63
IL+ +D+ I V+ L GY+ I G+ DL++ DV++P+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 VLRRIRLKNAVIPIILLTARDSVVDKVSGLDQGASDYITKPFQIEELLARIRACL 118
+L RI+ +P+++++A+++ + + ++GA DY+ KPF + EL+ I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119


21  GBAA_2300  GBAA_2338  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
GBAA_2300       3       16  -1.523796  L-lysine 2,3-aminomutase
GBAA_2301       1       15  -3.917577  hypothetical protein
GBAA_2302       0       17  -4.472837  hypothetical protein
GBAA_2303       0       16  -3.568556  hypothetical protein
GBAA_2304      -1       17  -3.373549  hypothetical protein
GBAA_2305       0       15  -1.992017  hypothetical protein
GBAA_2306      -1       13  -1.644529  hypothetical protein
GBAA_2307      -1       13  -1.374064  protein kinase domain-containing protein
GBAA_2308      -3       14  -1.313593  sporulation-control protein Spo0M
GBAA_2309      -1       17  -0.888359  PAP2 family protein
GBAA_2310       1       16  -1.016158  cation efflux family protein
GBAA_2311       3       17  -3.247008  thioredoxin
GBAA_2312       4       17  -3.273941  hypothetical protein
GBAA_2313       4       17  -3.529812  hypothetical protein
GBAA_2314       3       15  -2.861410  hypothetical protein
GBAA_2315       2       15  -3.382564  S-layer protein
GBAA_2316       2       15  -1.650087  hypothetical protein
GBAA_2317       2       18   1.020880  Mrr restriction system protein
GBAA_2318       1       16  -0.120982  DNA-binding protein
GBAA_2319       2       16  -0.532938  hypothetical protein
GBAA_2320       2       16  -0.445450  hypothetical protein
GBAA_2321       2       17  -0.820394  DNA translocase FtsK
GBAA_2322       3       19  -2.559429  hypothetical protein
GBAA_2324       4       19  -2.812845  hypothetical protein
GBAA_2326       7       22  -1.332247  hypothetical protein
GBAA_2327       5       25  -0.709394  hypothetical protein
GBAA_2328       3       24  -1.096259  hypothetical protein
GBAA_2330       1       21  -1.775066  hypothetical protein
GBAA_2331       1       17  -2.400915  hypothetical protein
GBAA_2332       2       17  -2.931233  hypothetical protein
GBAA_2333       2       16  -3.685530  hypothetical protein
GBAA_2334       2       17  -4.160633  hypothetical protein
GBAA_2335       3       17  -3.991787  hypothetical protein
GBAA_2336       1       17  -4.454831  peptidyl-prolyl isomerase
GBAA_2337       0       21  -4.426995  hypothetical protein
GBAA_2338       1       18  -4.208525  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2335  TCRTETB  26  0.007  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 26.4 bits (58), Expect = 0.007
Identities = 11/30 (36%), Positives = 16/30 (53%), Gaps = 3/30 (10%)

Query: 18 VTFFGPYNEVITNVS---IINQLSTPKCQT 44
++FF NE++ NVS I N + P T
Sbjct: 22 LSFFSVLNEMVLNVSLPDIANDFNKPPAST 51


22  GBAA_2392  GBAA_2450  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
GBAA_2392       2       13  -2.651407  alpha/beta hydrolase
GBAA_2393       1       14  -2.668031  ABC transporter permease
GBAA_2395       2       15  -1.504232  zinc transporter family protein
GBAA_2396       3       16  -2.336164  hypothetical protein
GBAA_2397       1       15  -2.303648  hypothetical protein
GBAA_2398       1       15  -1.926524  lipoprotein
GBAA_2399       2       15  -2.116006  metallo-beta-lactamase
GBAA_2400       3       16  -1.553293  inosine-uridine preferring nucleoside hydrolase
GBAA_2401       4       16  -3.568429  hypothetical protein
GBAA_2402       0       12  -1.890686  hypothetical protein
GBAA_2403       0       14  -1.808482  hypothetical protein
GBAA_2404       0       15  -2.309365  acetyltransferase
GBAA_2405      -1       15  -2.689692  hydrolase
GBAA_2406      -1       17  -3.568169  TetR family transcriptional regulator
GBAA_2407      -1       18  -3.069895  MmpL family membrane protein
GBAA_2408       0       19  -4.210397  hypothetical protein
GBAA_2409       1       19  -4.125756  chloramphenicol acetyltransferase
GBAA_2410      -1       17  -4.216035  acetyltransferase
GBAA_2411       0       18  -4.376961  acetyltransferase
GBAA_2412       0       17  -3.009273  acetyltransferase
GBAA_2413      -1       14  -3.370418  hypothetical protein
GBAA_2414       0       14  -4.199986  DNA-binding protein
GBAA_2415       1       14  -4.387625  hypothetical protein
GBAA_2416       2       15  -3.556717  hypothetical protein
GBAA_2417       1       14  -2.688894  alpha/beta hydrolase
GBAA_2418       0       15  -3.564047  protoporphyrinogen oxidase
GBAA_2419       1       20  -5.000825  hypothetical protein
GBAA_2420       1       18  -4.586452  acetyltransferase
GBAA_2421      -1       15  -3.804608  hypothetical protein
GBAA_2422      -1       16  -4.093746  cold shock protein CspA
GBAA_2424       0       17  -3.907410  hypothetical protein
GBAA_2425       2       18  -4.203237  hypothetical protein
GBAA_2426       4       21  -2.684369  HAD superfamily hydrolase
GBAA_2427       5       24  -1.786522  hypothetical protein
GBAA_2428       4       25  -2.723988  hypothetical protein
GBAA_2431       1       20  -1.290262  hypothetical protein
GBAA_2432       0       17  -2.168069  hypothetical protein
GBAA_2433       0       16  -2.076185  hypothetical protein
GBAA_2434       0       14  -2.949403  hypothetical protein
GBAA_2435       0       17  -2.751243  LysR family transcriptional regulator
GBAA_2436       2       18  -2.812643  aspartate-semialdehyde dehydrogenase
GBAA_2437       2       18  -3.649529  hypothetical protein
GBAA_2438      -1       15  -2.968866  hypothetical protein
GBAA_2439      -2       13  -2.545294  LysR family transcriptional regulator
GBAA_2440      -2       13  -2.036204  hypothetical protein
GBAA_2441      -3       12  -1.677761  hypothetical protein
GBAA_2442      -3       11  -1.882157  hypothetical protein
GBAA_2443      -3       13  -1.999000  ABC transporter permease/ATP-binding protein
GBAA_2444      -1       13  -2.170943  ABC transporter permease/ATP-binding protein
GBAA_2445       4       17   2.362538  hypothetical protein
GBAA_2446       4       18   2.969883  N-acetylmuramoyl-L-alanine amidase
GBAA_2447       6       20   2.933962  amino acid transporter LysE
GBAA_2448       4       20   3.737132  DNA-binding protein
GBAA_2449       5       18   3.638513  hypothetical protein
GBAA_2450       3       18   4.025839  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2392  PF06057  31  0.005  Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 31.3 bits (71), Expect = 0.005
Identities = 16/46 (34%), Positives = 24/46 (52%)

Query: 121 EDLLAMTDYISKRLGKEKAILIGHSYGTYIGMQAANKAPEKYEAYV 166
+D LA+ D G +K ILIG+S+G + N+ P +Y V
Sbjct: 101 QDTLAIIDKYQAEFGTQKVILIGYSFGAEVIPFVLNEMPARYRKNV 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2393  TCRTETA  33  0.002  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 33.3 bits (76), Expect = 0.002
Identities = 24/119 (20%), Positives = 45/119 (37%), Gaps = 8/119 (6%)

Query: 49 ATMTQIMIALPAL--IFF--LLVGTVVDRFDRQRICTVSNICCSLCNIGILISLYYGMII 104
AT I +A + ++ G V R +R + I I + + M
Sbjct: 245 ATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAF 304

Query: 105 LVFLFLFLENACIQFFSPSEQSMIQGVVESDQYGAAAGINQMVNSLYALFGVGIATMVY 163
+ + L A P+ Q+M+ V+ ++ G G + SL ++ G + T +Y
Sbjct: 305 PIMVLL----ASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIY 359



Score = 31.7 bits (72), Expect = 0.005
Identities = 28/198 (14%), Positives = 67/198 (33%), Gaps = 28/198 (14%)

Query: 173 LVNTLTFIMSGILIQTISIPEKVRLPNGRTKWKEVNLKMLITEFKEGIRYIYQNETLKKL 232
+N L F+ L+ K + L+ R+ + L
Sbjct: 168 ALNGLNFLTGCFLLPE------------SHKGERRPLRREALNPLASFRWARGMTVVAAL 215

Query: 233 LLGFIVFGLLNGILSVSTTYIL----KYKLAPATYESLAMVGGVVGGISLLIGSIVATSI 288
+ F + L+ + + +++ ++ T G++ ++ + + +
Sbjct: 216 MAVFFIMQLVGQVPA--ALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAM---ITGPV 270

Query: 289 GKKYAPKPIIVFGMAGSGIFFGMCYFVNYVWSFY---VCIAFATFFLPFINVAIMGWMYE 345
+ + ++ GM G + + F W + V +A +P A+ +
Sbjct: 271 AARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMP----ALQAMLSR 326

Query: 346 IVEESFMGRVQSLLSPLT 363
V+E G++Q L+ LT
Sbjct: 327 QVDEERQGQLQGSLAALT 344


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2406  HTHTETR  83  6e-22  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 83.1 bits (205), Expect = 6e-22
Identities = 35/164 (21%), Positives = 66/164 (40%), Gaps = 10/164 (6%)

Query: 20 KSTKETILEVATRLFLTQNYQVVSMDEVAKVCGVTKATVYYYFSTKADLFTATMIQMMIR 79
+ T++ IL+VA RLF Q S+ E+AK GVT+ +Y++F K+DLF+
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 80 IRENMSQILS-TNNTLEERLLNFAKVYLHATMDIDMKNFMKDAKLSLSEEQLKELKK--- 135
I E + + L L +T+ + + + + + E + E+
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEI-IFHKCEFVGEMAVVQQ 128

Query: 136 ----AEDSMYEVLEKALDKAMQLGEIQKG-NPKFAAHAFVSLLS 174
Y+ +E+ L ++ + + AA +S
Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYIS 172


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2407  ACRIFLAVINRP  52  8e-09  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 51.8 bits (124), Expect = 8e-09
Identities = 37/232 (15%), Positives = 85/232 (36%), Gaps = 25/232 (10%)

Query: 203 LLVATVLLVLVLLILLYRSPILAILPLLVVGFAYGIISPTLGFLADHGWIKVDAQAISIM 262
L A +L+ LV+ + L ++ ++P + V + T LA G+ +I+ +
Sbjct: 344 LFEAIMLVFLVMYLFL-QNMRATLIPTIAVPVV---LLGTFAILAAFGY------SINTL 393

Query: 263 T----VLLFGAGTDYCLFLISRYREYLLEEESKYK-ALQLAIKASGGAIIMSALTVVLGL 317
T VL G D + ++ ++E++ K A + ++ GA++ A+ +
Sbjct: 394 TMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVF 453

Query: 318 GTLLL--AHYGAFHR-FAVPFSVAVFIMGIAALTILPAFLLIFGRTAFFPFIPRTTSMNE 374
+ GA +R F++ A+ + + AL + PA + P + +E
Sbjct: 454 IPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLK-------PVSAEHHE 506

Query: 375 ELARRKKKVVKVKKSKGAFSKKLGDVVVRRPWTIIMLTVFVLGGLASFVPRI 426
++ +++ ++ G+ R+
Sbjct: 507 NKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRL 558



Score = 38.3 bits (89), Expect = 1e-04
Identities = 28/161 (17%), Positives = 68/161 (42%), Gaps = 9/161 (5%)

Query: 203 LLVATVLLVLVLLILLYRSPILAILPLLVVGFAYGIISPTLGFLADHGWIKVDAQAISIM 262
L+ + ++V + L LY S + + +LVV I+ L + V +
Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLG--IVGVLLAATLFNQKNDVYFM---VG 929

Query: 263 TVLLFGAGTDYCLFLISRYREYLLEE-ESKYKALQLAIKASGGAIIMSALTVVLGLGTLL 321
+ G + ++ ++ + +E + +A +A++ I+M++L +LG+ L
Sbjct: 930 LLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLA 989

Query: 322 LAH---YGAFHRFAVPFSVAVFIMGIAALTILPAFLLIFGR 359
+++ GA + + + + A+ +P F ++ R
Sbjct: 990 ISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030



Score = 31.7 bits (72), Expect = 0.012
Identities = 32/202 (15%), Positives = 69/202 (34%), Gaps = 21/202 (10%)

Query: 533 AGISNAEDQL--WIGGETASLYDTKQITERDEAVIIPVMISIIALLLLVYLRSIVAMIYL 590
A + N +L IG + + ++++ ++ + ++ L L S + +
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 591 IVTVVLSFFSALGAGWLLLHYGMGAPAIQGAIPLYAFVFLVALGEDYNIFMVSEIWKNRK 650
++ V L L A L + + G + + L I +V +
Sbjct: 901 MLVVPLGIVGVLLAAT-LFNQKNDVYFMVG------LLTTIGLSAKNAILIVEFAKDLME 953

Query: 651 TQNHLDAVKNGVIQTGSVITSAGLILAGTFAVLGTLPIQV------LVQFGIVTAI--GV 702
+ V + + L+ + F +LG LP+ + Q + + G+
Sbjct: 954 KEGK--GVVEATLMAVRMRLRPILMTSLAF-ILGVLPLAISNGAGSGAQNAVGIGVMGGM 1010

Query: 703 LLDTFIVRPLLVPAITVVLGRF 724
+ T + VP VV+ R
Sbjct: 1011 VSATLLAI-FFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2411  SACTRNSFRASE  41  1e-06  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 41.1 bits (96), Expect = 1e-06
Identities = 26/98 (26%), Positives = 42/98 (42%), Gaps = 6/98 (6%)

Query: 49 YSSVEMMRYSIEELDS--YKVIMDEKIIGGIIVTISGKSYGRIDRIFVEPVYQGKGIGSN 106
Y +M +EE + ++ IG I + + Y I+ I V Y+ KG+G+
Sbjct: 50 YEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTA 109

Query: 107 VIKL-IE--AEYPSIRIWDLETSSRQINNHHFYKKMGY 141
++ IE E + LET I+ HFY K +
Sbjct: 110 LLHKAIEWAKENHFCGLM-LETQDINISACHFYAKHHF 146


23  GBAA_2462  GBAA_2499  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
GBAA_2462       2       18  -0.589602  PTS system cellobiose-specific transporter
GBAA_2463       1       18  -1.385022  PTS system cellobiose-specific transporter
GBAA_2464       1       16  -1.276485  hypothetical protein
GBAA_2465       0       16  -1.772421  anhydro-N-acetylmuramic acid kinase
GBAA_2466       1       14  -3.168354  hypothetical protein
GBAA_2467       0       15  -3.913436  glycerol-3-phosphate acyltransferase PlsY
GBAA_2468      -2       13  -2.648678  acetyltransferase
GBAA_2469      -2       12  -2.588468  threonine dehydratase
GBAA_2472      -2       13  -2.771744  metallo-beta-lactamase
GBAA_2473      -2       12  -2.221429  hypothetical protein
GBAA_2474      -2       12  -1.507699  hypothetical protein
GBAA_2475      -2       12  -3.134405  DEAD/DEAH box helicase
GBAA_2476       0       15  -4.247438  hypothetical protein
GBAA_2477       0       15  -4.221019  hypothetical protein
GBAA_2479      -1       16  -4.521825  TetR family transcriptional regulator
GBAA_2480       0       15  -4.257129  ABC transporter permease
GBAA_2481      -1       13  -3.742038  ABC transporter permease
GBAA_2482      -1       11  -1.986944  ABC transporter ATP-binding protein
GBAA_2483      -1       12  -1.607260  hypothetical protein
GBAA_2484      -1       11  -1.698472  hypothetical protein
GBAA_2485       0       12  -2.115576  hypothetical protein
GBAA_2486      -2       12  -2.348858  indolepyruvate decarboxylase
GBAA_2487       0       15  -3.421591  marR family transcriptional regulator
GBAA_2488       2       17  -3.868790  phosphoglyceromutase
GBAA_2490       5       18  -5.606389  hypothetical protein
GBAA_2491       2       17  -4.636279  hypothetical protein
GBAA_2492       1       18  -4.500417  hypothetical protein
GBAA_2493       1       19  -4.253991  hypothetical protein
GBAA_2494       1       17  -3.363091  hypothetical protein
GBAA_2496       1       17  -3.654935  hypothetical protein
GBAA_2498       0       17  -3.903355  aminoacyl-histidine dipeptidase
GBAA_2499       1       18  -4.170185  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2466  PRPHPHLPASEC  28  0.048  Prokaryotic zinc-dependent phospholipase C signature.
		>PRPHPHLPASEC#Prokaryotic zinc-dependent phospholipase C signature.

Length = 398

Score = 28.1 bits (62), Expect = 0.048
Identities = 13/72 (18%), Positives = 26/72 (36%), Gaps = 11/72 (15%)

Query: 112 GILTIGGTGAICLGRKGEVYEYSGGW-GHILGDEGSGYWIALQGLKRMANQFDQGVTLCP 170
++ ++ G +VY W G I G G+ I QG+ + N +
Sbjct: 8 ALICATLATSLWAGASTKVY----AWDGKIDG-TGTHAMIVTQGVSILENDLSKNEP--- 59

Query: 171 LSLRIQDEFQLL 182
++ ++L
Sbjct: 60 --ESVRKNLEIL 69


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2467  ACRIFLAVINRP  28  0.025  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 28.3 bits (63), Expect = 0.025
Identities = 15/62 (24%), Positives = 28/62 (45%), Gaps = 6/62 (9%)

Query: 91 VMLTLLAVIMGHIYPMLFKGKGGKGIS-----TFIGGLIAFDYLIALTLVAVFIIFYLIF 145
+++T LA I+G + P+ G G +GG+++ L + F++ F
Sbjct: 974 ILMTSLAFILGVL-PLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRCF 1032

Query: 146 KG 147
KG
Sbjct: 1033 KG 1034


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2468  AUTOINDCRSYN  29  0.044  Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 28.7 bits (64), Expect = 0.044
Identities = 10/52 (19%), Positives = 23/52 (44%), Gaps = 1/52 (1%)

Query: 15 ESIHKLNYKTFVEEIPQHEETKDRVRIDRFHEENT-YLICLDDDKLVGMVAL 65
+ L +TF + + + D + D++ NT YL + D+ ++ +
Sbjct: 18 GELFTLRKETFKDRLNWAVQCTDGMEFDQYDNNNTTYLFGIKDNTVICSLRF 69


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2475  TONBPROTEIN  30  0.013  Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 30.3 bits (68), Expect = 0.013
Identities = 20/113 (17%), Positives = 39/113 (34%), Gaps = 6/113 (5%)

Query: 338 AGGSGLAITFVAAKDEKH------LEEIEKTLGAPIQREIIEQPKIKRVDENGKPLPKPA 391
A +++T V D + E + + V E KP PKP
Sbjct: 40 APAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPK 99

Query: 392 PKKSGEYRQRDSREGSRSGSKGRTRNDSRNSSRNENNRSFNKPSNKKGSTKQG 444
PK + +++ R+ S+ + ++ +R ++ + S S G
Sbjct: 100 PKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASG 152


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2476  BACTRLTOXIN  28  0.005  Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 27.6 bits (61), Expect = 0.005
Identities = 8/23 (34%), Positives = 13/23 (56%)

Query: 31 KINWYNDMKTSFANKELADLVKG 53
K+ Y+ +KT N++LA K
Sbjct: 84 KLKNYDKVKTELLNEDLAKKYKD 106


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2479  HTHTETR  72  8e-18  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 72.4 bits (177), Expect = 8e-18
Identities = 30/174 (17%), Positives = 72/174 (41%), Gaps = 13/174 (7%)

Query: 8 EERRKEILETAERLFLTKGYTKTTVNDILKEIGIAKGTFYHYFKSKEEVMDEIIMRIIKE 67
+E R+ IL+ A RLF +G + T++ +I K G+ +G Y +FK K ++ E I + +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE-IWELSES 68

Query: 68 DVAKAKVIVSNPNIPVLEKLFRVLME---QSPKSGDIKDKMIE-QFHQPNNA---EMYQK 120
++ + ++ + R ++ +S + + + ++E FH+ + Q+
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128

Query: 121 SLVQSIIHLSPVLTEILEQGIEEGIFSTSY-PQETIELLLSSAQVIFDEGLFQW 173
+ + + + L+ IE + + ++ + W
Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGY----ISGLMENW 178


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2480  TCRTETA  39  2e-05  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.0 bits (91), Expect = 2e-05
Identities = 58/342 (16%), Positives = 125/342 (36%), Gaps = 36/342 (10%)

Query: 47 IFAGLYAITSIPFLLAPLGGAIADRFNRRNLMVIFDFINTAIVLSFIVLLFTGSVSILLI 106
I LYA+ F AP+ GA++DRF RR +++ A+ + ++ + +L I
Sbjct: 47 ILLALYALMQ--FACAPVLGALSDRFGRR-PVLLVSLAGAAV--DYAIMATAPFLWVLYI 101

Query: 107 GTIMFLLAIVNAMYAPVVMASIPQLVPEKKLEQANGIVNGVQALSNIVAPVLGGILYGII 166
G I +A + V A I + + + G ++ + PVLGG++ G
Sbjct: 102 GRI---VAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM-GGF 157

Query: 167 GLKMLVIISCLAFFLSAILEMFITIPFIKRVQESHIIPTIVKDMKGGFIYVLKQPFILKS 226
+ L+ + F+ +P + + + + +
Sbjct: 158 SPHAPFFAAAALNGLNFLTGCFL-LPESHKGERRPLRREALNPLASFRWARGMTVVAA-L 215

Query: 227 MLLAALLNLILTPLFVVGAPIIIRVTMESSH-TLYGIGMGLIDFATIIGALSMVFFAKKL 285
M + ++ L+ V A + + + H IG+ L F I+ +L+ +
Sbjct: 216 MAVFFIMQLVGQ----VPAALWVIFGEDRFHWDATTIGISLAAFG-ILHSLAQAMITGPV 270

Query: 286 QMQTLYYWMILIALLVIPMALSVTPFILNLGY------YPPFILFILSSILIAMIMTVVS 339
+ L++ M T +IL +P +L I + + ++S
Sbjct: 271 AA-----RLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLS 325

Query: 340 IYVITVVQKKTPNENLGKVMAIITAVSQCMAPIGQVIYGFMF 381
V E G++ + A++ + +G +++ ++
Sbjct: 326 RQV--------DEERQGQLQGSLAALTSLTSIVGPLLFTAIY 359



Score = 29.0 bits (65), Expect = 0.032
Identities = 16/79 (20%), Positives = 34/79 (43%), Gaps = 3/79 (3%)

Query: 86 TAIVLSFIVLLFTGSVSILLIGTIMFLLAIVNAMYAPVVMASIPQLVPEKKLEQANGIVN 145
A +I+L F + ++ + P + A + + V E++ Q G +
Sbjct: 285 IADGTGYILLAFATRGWMAFPIMVLLASG---GIGMPALQAMLSRQVDEERQGQLQGSLA 341

Query: 146 GVQALSNIVAPVLGGILYG 164
+ +L++IV P+L +Y
Sbjct: 342 ALTSLTSIVGPLLFTAIYA 360


24  GBAA_2510  GBAA_2540  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
GBAA_2510      -1       11   3.028813  alpha-ketoglutarate permease
GBAA_2511       0       12   3.260542  NAD-binding oxidoreductase
GBAA_2512       0       14   3.427199  myo-inositol catabolism protein IolC
GBAA_2513       1       14   2.891777  methylmalonic acid semialdehyde dehydrogenase
GBAA_2514       1       15   0.224288  acetolactate synthase IolD
GBAA_2516       1       17  -1.380334  fructose-bisphosphate aldolase
GBAA_2517       2       19  -3.472334  myo-inositol catabolism protein IolB
GBAA_2518       1       21  -4.770235  hypothetical protein
GBAA_2521       1       20  -4.877007  lipoprotein
GBAA_2522       2       20  -3.510785  hypothetical protein
GBAA_2523       2       19  -2.363461  DNA-binding protein
GBAA_2524       3       18  -2.176885  hypothetical protein
GBAA_2526       3       18  -2.128243  D-alanyl-D-alanine carboxypeptidase
GBAA_2527       2       19  -3.161607  hypothetical protein
GBAA_2528       3       20  -3.214810  N-acetylmuramoyl-L-alanine amidase
GBAA_2530       4       21  -3.736656  TetR family transcriptional regulator
GBAA_2531       4       20  -3.481738  ABC transporter ATP-binding protein
GBAA_2532       3       20  -3.481372  hypothetical protein
GBAA_2533       4       21  -2.756269  sensory box/GGDEF family protein
GBAA_2534       6       23  -1.503352  acetyltransferase
GBAA_2535       5       21  -1.795340  hypothetical protein
GBAA_2536       3       19  -1.484190  spore coat protein
GBAA_2537       2       17  -1.816396  hypothetical protein
GBAA_2538       0       19  -2.186126  metallo-beta-lactamase/rhodanese-like
GBAA_2539      -1       22  -3.060761  hypothetical protein
GBAA_2540      -1       21  -3.289342  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2510  TCRTETB  40  2e-05  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 39.9 bits (93), Expect = 2e-05
Identities = 72/361 (19%), Positives = 140/361 (38%), Gaps = 58/361 (16%)

Query: 60 TIYGITVSASSWFSGVFVQMWGPRKVMTFGLVSFILGS-IGFIGIGIQHMNYPVILICYA 118
T + +T S + G G ++++ FG++ GS IGF+G H + ++++
Sbjct: 56 TAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVG----HSFFSLLIMARF 111

Query: 119 LRGFGYPLFAYSFLVWVSYSTPQQ-------------------------MLSRAVGWFWF 153
++G G F +V V+ P++ M++ + W +
Sbjct: 112 IQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYL 171

Query: 154 VFQLGLSVIGAFYSSYMVPKIGEI--------ATLWSALIFVVVGGLFSIVVNKDKFKAQ 205
+ +++I + ++ K I L S I + LF+ +
Sbjct: 172 LLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFM--LFTTSYSISFLIVS 229

Query: 206 TVSANKSSELLKGITIAFENPKVG------IGGIVKIINSAAQFGFVVFLPTYMMKYNFT 259
+S + ++ +T F +P +G IG + I GFV +P YMMK
Sbjct: 230 VLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVP-YMMKDVHQ 288

Query: 260 MTEWLQIWGTLFFVNMVFNIIFGIVG----DKFGWINTIKWFGGVGCGIVTLALYYVPQM 315
++ +I + F + IIFG +G D+ G + + +G ++++ +
Sbjct: 289 LSTA-EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLN----IGVTFLSVSFLTASFL 343

Query: 316 VGHNYWAILF-VACCYGATLAGYVPLTALVP-SLSPENKGAAMSVLNLGSGLSAFVGPLV 373
+ W + + G ++ +V SL + GA MS+LN S LS G +
Sbjct: 344 LETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403

Query: 374 V 374
V
Sbjct: 404 V 404


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2513  HTHTETR  30  0.012  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 30.4 bits (68), Expect = 0.012
Identities = 37/201 (18%), Positives = 67/201 (33%), Gaps = 24/201 (11%)

Query: 187 AARLAELAEEAGLPKGVLNIVNGAHDVVNGLLEHKLVKAISFVGSQPVAEYVYKKGTENL 246
+ L E+A+ AG+ +G + L + S +G EY K + L
Sbjct: 31 STSLGEIAKAAGVTRGAIYWHFKDKS---DLFSEIWELSESNIGELE-LEYQAKFPGDPL 86

Query: 247 KRVQALAGAKNHSIVLNDANLELATKQIISAAFGSAGERCMAASVVTVEEEIADQLVERL 306
++ + S V + +II GE A V + + + +R+
Sbjct: 87 SVLREILIHVLESTVTEER--RRLLMEIIFHKCEFVGEM---AVVQQAQRNLCLESYDRI 141

Query: 307 VAEANKIVIGNGLDEDVFLGPVIRDNHKERTI--GYIDSGVEQGA------TLVRDGRED 358
+ L D+ + I GYI +E L ++ R+
Sbjct: 142 EQTLKHCIEAKMLPADL-------MTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDY 194

Query: 359 TAVKGAGYFVGPTIFDHVTKE 379
A+ Y + PT+ + T E
Sbjct: 195 VAILLEMYLLCPTLRNPATNE 215


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2530  HTHTETR  65  3e-15  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.0 bits (158), Expect = 3e-15
Identities = 30/151 (19%), Positives = 66/151 (43%), Gaps = 8/151 (5%)

Query: 1 MEKSREQTMENILKAAKKKFGERGYEGTSIQEIAKEAKVNVAMASYYFNGKENLYYEVFK 60
++ ++T ++IL A + F ++G TS+ EIAK A V ++F K +L+ E+++
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 K-YGLANELPNFLEKNQF-NPINALREYLTVFTTHIKENPE-----IGTLAYEEIIKESA 113
EL + +P++ LRE L E + E A
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMA 124

Query: 114 RLEK-IKPYFIGSFEQLKEILQEGEKQGVFH 143
+++ + + S++++++ L+ + +
Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLP 155


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2531  PF05272  34  0.002  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 33.9 bits (77), Expect = 0.002
Identities = 11/34 (32%), Positives = 19/34 (55%)

Query: 338 IVLDGKNGSGKSSILKLILGQSIQYTGLVTLGTG 371
+VL+G G GKS+++ ++G +GTG
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTG 632


25  GBAA_2554  GBAA_2618  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias    %GCBias  Product
GBAA_2554       2       20  -4.148017  hypothetical protein
GBAA_2555       0       17  -4.273649  acetyltransferase
GBAA_2556      -1       16  -4.412504  hypothetical protein
GBAA_2557      -1       14  -4.139265  hypothetical protein
GBAA_2558      -1       14  -4.750542  hypothetical protein
GBAA_2559      -1       16  -4.350574  D-alanyl-D-alanine carboxypeptidase
GBAA_2560      -1       17  -4.448129  sensor histidine kinase
GBAA_2561       0       19  -4.978205  DNA-binding response regulator
GBAA_2562       0       20  -5.640361  hypothetical protein
GBAA_2563      -1       20  -5.040505  hypothetical protein
GBAA_2564      -1       20  -4.761408  S-adenosylhomocysteine nucleosidase
GBAA_2565       1       20  -5.767072  hypothetical protein
GBAA_2566       2       17  -5.040420  acetyltransferase
GBAA_2567       2       17  -5.031631  hypothetical protein
GBAA_2569       2       16  -5.005421  hypothetical protein
GBAA_2570       1       16  -5.501050  hypothetical protein
GBAA_2571       1       15  -5.324496  hypothetical protein
GBAA_2572       0       14  -4.566002  excinuclease ABC subunit A
GBAA_2573       1       18  -4.758767  hypothetical protein
GBAA_2574       1       17  -4.275922  hypothetical protein
GBAA_2575       2       16  -3.668873  penicillin-binding protein
GBAA_2576       0       14  -3.878428  merR family transcriptional regulator
GBAA_2577       2       15  -4.395729  permease
GBAA_2578       2       14  -5.562176  mutT/nudix family protein
GBAA_2579       2       14  -5.533757  hypothetical protein
GBAA_2580       3       14  -5.914454  hypothetical protein
GBAA_2581       3       15  -5.913820  hypothetical protein
GBAA_2582       3       16  -5.681523  araC family transcriptional regulator
GBAA_2583       2       16  -5.579155  hypothetical protein
GBAA_2584       2       19  -5.597381  lipoprotein
GBAA_2585       0       18  -4.443186  hypothetical protein
GBAA_2588       0       16  -2.692189  alcohol dehydrogenase
GBAA_2589      -1       14  -2.039762  hypothetical protein
GBAA_2590      -1       12  -1.849280  hypothetical protein
GBAA_2592       0       14  -1.559622  hypothetical protein
GBAA_2594       2       15  -1.823127  cell wall hydrolase
GBAA_2596       1       13  -2.699227  acetamidase/formamidase
GBAA_2597       2       14  -4.365164  DNA-binding response regulator
GBAA_2599       1       15  -4.823026  hypothetical protein
GBAA_2601      -1       16  -4.808922  acetyltransferase
GBAA_2602      -1       14  -4.251835  hypothetical protein
GBAA_2603      -1       12  -3.274159  ABC transporter ATP-binding protein
GBAA_2605      -2       12  -3.046577  hypothetical protein
GBAA_2606      -2       12  -1.973708  hypothetical protein
GBAA_2608      -2       13  -1.798199  homoserine dehydrogenase
GBAA_2609      -1       13  -1.616120  GntR family transcriptional regulator
GBAA_2610      -3       12  -1.362809  D-alanine--D-alanine ligase
GBAA_2611      -3       18  -4.163484  transcriptional activator TenA
GBAA_2612      -1       19  -4.961525  hypothetical protein
GBAA_2613       0       21  -5.835087  hypothetical protein
GBAA_2614      -1       21  -6.361487  hypothetical protein
GBAA_2617      -2       20  -5.691607  hypothetical protein
GBAA_2618      -2       20  -5.081761  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2555  SACTRNSFRASE  28  0.011  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.4 bits (63), Expect = 0.011
Identities = 25/105 (23%), Positives = 38/105 (36%), Gaps = 6/105 (5%)

Query: 41 EQQLEKYIESENTLAFKVIDEETKEVIGHISLGQIDHINKSARIGKVLVGDTRMRGRSIG 100
+ Y+E E AF E IG I + + N A I + V R + +G
Sbjct: 53 DDMDVSYVEEEGKAAFLYYLEN--NCIGRIKIRS--NWNGYALIEDIAV-AKDYRKKGVG 107

Query: 101 KHMMKAVLHIAFDELKLHRVTLGVYDFNTSAISCYEKIGFVKEGL 145
++ + A E + L D N SA Y K F+ +
Sbjct: 108 TALLHKAIEWA-KENHFCGLMLETQDINISACHFYAKHHFIIGAV 151


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2559  BLACTAMASEA  36  1e-04  Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 35.9 bits (83), Expect = 1e-04
Identities = 23/112 (20%), Positives = 46/112 (41%), Gaps = 7/112 (6%)

Query: 39 ILIDANSGEVV--YKKNEENSIQSATLSKLMTEYIVLEQLDKGNIQLDEVVKISNEVFRA 96
I +D SG + ++ +E + S K++ VL ++D G+ QL+ + +
Sbjct: 43 IEMDLASGRTLTAWRADERFPMMST--FKVVLCGAVLARVDAGDEQLERKIHYRQQDLV- 99

Query: 97 ETSPIQVTSKDKT-TVRDLLHALLLTGNNRSTLALAEHIAGNEDNFTQLMNE 147
+ SP+ TV +L A + +N + L + G T + +
Sbjct: 100 DYSPVSEKHLADGMTVGELCAAAITMSDNSAANLLLATVGGPA-GLTAFLRQ 150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2561  HTHFIS  92  9e-24  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 9e-24
Identities = 40/150 (26%), Positives = 74/150 (49%), Gaps = 3/150 (2%)

Query: 5 ILIIDDDKEIVELLAVYLRNEGYNIYKAYDGDEALQMISTYEVDLMILDIMMPKRNGLEV 64
IL+ DDD I +L L GY++ + + I+ + DL++ D++MP N ++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 65 CQEVRE-NNTVPILMLSAKAEDMDKILGLMTGADDYMIKPFNPLELVARV-KALLRRSSF 122
+++ +P+L++SA+ M I GA DY+ KPF+ EL+ + +AL
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 123 QNASSPKNEDGM-IRIRSAEIHKHNHTVKV 151
+ ++DGM + RSA + + +
Sbjct: 126 PSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2566  SACTRNSFRASE  42  7e-07  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 41.9 bits (98), Expect = 7e-07
Identities = 24/91 (26%), Positives = 43/91 (47%), Gaps = 8/91 (8%)

Query: 186 DMDYIEKTNHTFYGAYVDNDLKGSICI----NEQGKISFIFIDKEYRNRGIGSKLLQVAR 241
D+ Y+E+ + Y++N+ G I I N I I + K+YR +G+G+ LL A
Sbjct: 56 DVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAI 115

Query: 242 D---ELNLESLLISFPNNSLLE-GFVKKTGF 268
+ E + L++ + ++ F K F
Sbjct: 116 EWAKENHFCGLMLETQDINISACHFYAKHHF 146



Score = 39.2 bits (91), Expect = 5e-06
Identities = 18/52 (34%), Positives = 22/52 (42%)

Query: 83 LAVHPNYRGVGVSQKLFELHKEEALQNECKQLFLEVIVGNDRAIRFYNKLGY 134
+AV +YR GV L E A +N L LE N A FY K +
Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2577  TCRTETA  41  8e-06  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 40.6 bits (95), Expect = 8e-06
Identities = 66/351 (18%), Positives = 119/351 (33%), Gaps = 27/351 (7%)

Query: 40 LPWIAYQLTGSAVIMSS---LFAINVLPIVLFGPLVGVIIDRYDRKKLLLVADITNIILV 96
LP + L S + + L A+ L P++G + DR+ R+ +LLV+ +
Sbjct: 28 LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDY 87

Query: 97 SFVPILHSLHLLEIWHLYIITFMLAVMSMLFDVTTVTVIPKIAGASLTKANSFYQMVNQL 156
+ + L W LYI + + V + G + F
Sbjct: 88 AIMATAPFL-----WVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGF 142

Query: 157 ASLFGPMIAGVFISFIGGFQLLWINVLSFIATLVAVMLLPSMKTTNKKCEDKNTLQNVLS 216
+ GP++ G+ F L+ + L LLP + L+
Sbjct: 143 GMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGE-----RRPLRREAL 197

Query: 217 DLVNGFTWLKNDRLNLALSFQAMIGNFGASAVLGVFMYYLLSTLQLTPEQSGVNYSLIGI 276
+ + F W + + AL I +++ + G++ + GI
Sbjct: 198 NPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGI 257

Query: 277 -GGLLGSLIAIPLEKRLQRSILIPLLLFVGAIGLTFALWNT-YWFA-PGI----AFGVAM 329
L ++I P+ RL + L + G + T W A P + + G+ M
Sbjct: 258 LHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGM 317

Query: 330 TCNIAWNTIVATVRQETVPSNMQGRVLGFSRVLTRLAMPLGALVGGIISAY 380
A +++ V QG++ G LT L +G L+ I A
Sbjct: 318 P---ALQAMLSR----QVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2597  HTHFIS  67  6e-15  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.8 bits (163), Expect = 6e-15
Identities = 27/112 (24%), Positives = 54/112 (48%), Gaps = 1/112 (0%)

Query: 3 KIMIVEDDMKIAELLSTHVAKYGYEGIIVSDFQNVLNIFLEEQPELVLLDINLPSFDGYY 62
I++ +DD I +L+ +++ GY+ I S+ + +LV+ D+ +P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 WCRQIRGV-STCPILFISAREGTMDQVMALENGGDDFISKPFHYEVVMAKIR 113
+I+ P+L +SA+ M + A E G D++ KPF ++ I
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2602  PF06057  29  0.012  Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 28.7 bits (64), Expect = 0.012
Identities = 10/50 (20%), Positives = 17/50 (34%), Gaps = 12/50 (24%)

Query: 6 WFRHLP-QISMDLSEWTPFIQNNWHRKHYMKFVYVLQIIIFLIPYYFGAD 54
W + P ++ D Q + + + LI Y FGA+
Sbjct: 91 WKQKDPKDVTQDTLAIIDKYQAEFGTQK-----------VILIGYSFGAE 129


26  GBAA_2631  GBAA_2718  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
GBAA_2631  -1  17  -3.515238  hypothetical protein
GBAA_2632  0  15  -3.219897  cytochrome P450 family protein
GBAA_2633  0  15  -3.628441  hypothetical protein
GBAA_2634  0  14  -3.645709  HAD superfamily hydrolase
GBAA_2635  1  13  -3.479253  hypothetical protein
GBAA_2636  1  13  -3.432194  sensor histidine kinase
GBAA_2637  1  13  -3.623532  penicillin-binding protein
GBAA_2638  2  15  -3.994623  glycosyl transferase
GBAA_2639  2  15  -4.858700  aspartate racemase
GBAA_2640  2  15  -4.687666  hypothetical protein
GBAA_2641  1  15  -5.037661  ABC transporter ATP-binding protein
GBAA_2642  3  16  -4.370361  cobalt transport protein
GBAA_2643  3  13  -2.839016  hypothetical protein
GBAA_2644  2  15  -3.084599  sporulation kinase B
GBAA_2645  3  14  -2.388602  hypothetical protein
GBAA_2646  3  15  -2.433178  hypothetical protein
GBAA_2647  3  16  -2.371251  alcohol dehydrogenase
GBAA_2648  1  14  -2.724193  penicillin-binding protein
GBAA_2649  0  16  -3.087999  permease
GBAA_2650  2  17  -3.107540  penicillin-binding protein
GBAA_2651  2  19  -2.894383  acetyltransferase
GBAA_2652  3  18  -3.196072  hypothetical protein
GBAA_2653  4  20  -3.464641  degV family protein
GBAA_2654  5  22  -1.771971  thiJ/pfpI family protein
GBAA_2656  6  20  -3.424950  hypothetical protein
GBAA_2657  6  17  -3.772856  thiJ/pfpI family protein
GBAA_2658  4  15  -3.925590  hypothetical protein
GBAA_2659  3  15  -3.662329  hypothetical protein
GBAA_2661  2  15  -3.761843  alkaline d-peptidase
GBAA_2663  2  16  -5.463484  hypothetical protein
GBAA_2664  4  18  -4.296670  permease
GBAA_2665  3  19  -3.835162  hypothetical protein
GBAA_2667  4  20  -3.856920  acetyltransferase
GBAA_2668  4  19  -3.093583  glycerophosphoryl diester phosphodiesterase
GBAA_2670  2  17  -2.638499  hypothetical protein
GBAA_2672  1  15  -2.173404  cysteine transporter
GBAA_2673  0  13  -2.177264  chitosanase
GBAA_2676  -2  14  -2.302889  mutT/nudix family protein
GBAA_2677  -1  14  -2.553451  hypothetical protein
GBAA_2678  -1  14  -3.799728  hypothetical protein
GBAA_2680  0  14  -3.876280  oxalate/formate antiporter
GBAA_2683  1  16  -4.139903  mutT/nudix family protein
GBAA_2684  -1  16  -4.470139  DNA polymerase III subunit beta
GBAA_2685  0  14  -5.199750  mutT/nudix family protein
GBAA_2686  -2  13  -5.004230  hypothetical protein
GBAA_2687  -2  14  -4.900661  alpha/beta hydrolase
GBAA_2688  0  14  -5.009747  hypothetical protein
GBAA_2689  0  14  -5.544349  intein homing endonuclease-like protein
GBAA_2690  1  13  -5.330379  hypothetical protein
GBAA_2691  2  16  -4.791946  endoribonuclease L-PSP
GBAA_2692  2  15  -4.599420  hypothetical protein
GBAA_2693  2  18  -4.993966  hypothetical protein
GBAA_2694  1  18  -4.573545  esterase
GBAA_2695  0  18  -5.158195  hypothetical protein
GBAA_2696  -1  16  -4.375119  hypothetical protein
GBAA_2698  -2  17  -4.112070  hypothetical protein
GBAA_2699  -1  17  -4.256460  acetyltransferase
GBAA_2700  0  16  -3.479157  metal-dependent hydrolase
GBAA_2701  0  17  -3.266905  acetyltransferase
GBAA_2702  1  17  -2.867598  hypothetical protein
GBAA_2704  4  18  -3.516080  hypothetical protein
GBAA_2705  4  17  -3.646708  endo/excinuclease amino terminal
GBAA_2708  5  19  -3.410633  hypothetical protein
GBAA_2709  0  16  -1.661437  hypothetical protein
GBAA_2710  -2  13  -2.058731  hypothetical protein
GBAA_2711  -3  12  -2.178348  hypothetical protein
GBAA_2712  -2  13  -2.243025  hypothetical protein
GBAA_2713  -3  14  -2.358307  mutT/nudix family protein
GBAA_2715  -2  13  -2.868713  DadA family oxidoreductase
GBAA_2716  -2  14  -4.656413  hypothetical protein
GBAA_2717  -1  16  -5.009098  N-acetyltransferase
GBAA_2718  0  16  -3.699879  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2636  PF06580  38  6e-05  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.3 bits (89), Expect = 6e-05
Identities = 17/102 (16%), Positives = 37/102 (36%), Gaps = 18/102 (17%)

Query: 378 QVFI-NILQNSIEAMPDGGRISIHIKEIGKDGIIISVIDKGIGIPAERIKRLGEPFYSTK 436
Q + N +++ I +P GG+I + + + + V + G
Sbjct: 261 QTLVENGIKHGIAQLPQGGKILLKGTKDNGT-VTLEVENT------------GSLALKNT 307

Query: 437 EKGTGIGLMLSYKIIESHQGN---ISIMSEVGVGTTVTIYLP 475
++ TG GL + ++ G I + + G + +P
Sbjct: 308 KESTGTGLQNVRERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2644  PF06580  36  2e-04  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.0 bits (83), Expect = 2e-04
Identities = 18/94 (19%), Positives = 39/94 (41%), Gaps = 12/94 (12%)

Query: 320 NLIKNGIEAMPNGGTLNISSSISNNKVIIRIEDSGIGMSQEQVNRFGEPYFNTKTKGTGL 379
N IK+GI +P GG + + + N V + +E++G + ++ GTGL
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNT----------KESTGTGL 315

Query: 380 G-TMVAVKIIETMQGSLRIRSVVNKGTTLTITFP 412
++++ + +++ K + P
Sbjct: 316 QNVRERLQMLYGTEAQIKLSEKQGKVNA-MVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2649  TCRTETA  45  4e-07  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 44.8 bits (106), Expect = 4e-07
Identities = 64/318 (20%), Positives = 119/318 (37%), Gaps = 24/318 (7%)

Query: 13 LLLSGVGIANLGAWIYLIALNVLVYHMGGSALAVATLYVIKPLAAL---FTNAWSGSMID 69
++LS V + +G + + L L+ + S A ++ L AL G++ D
Sbjct: 9 VILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSD 68

Query: 70 RLNKRKLMIHLDIYRAVCIAILPLLPSLWMVYVFVFFISMANAIYEPTAMTYMTKLIPVE 129
R +R +++ AV AI+ P LW++Y+ + A A Y+ + +
Sbjct: 69 RFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATG-AVAGAYIADITDGD 127

Query: 130 QRQRFNSLRSLIGSGASVIGPAIAGALLIASTPE---FAIYMNAIAFLLSGVITLLLPNL 186
+R R S V GP + G + S A +N + FL LLP
Sbjct: 128 ERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTG---CFLLPES 184

Query: 187 DKKFDSHTSNDTLSLAVLKKDWNIVLNFSKKSLYIVFVYFLFQGMMVLAAANDSLELSFA 246
K L L + + + +F M ++ +L + F
Sbjct: 185 HK-----GERRPLRREALNPLASFRWARGMTVVA--ALMAVFFIMQLVGQVPAALWVIFG 237

Query: 247 KEVLLLTDSEYGFLVSIAGAGFILGAITNAI----LSKKLTPSLLIGIGSLFIAIGYIIY 302
++ + G S+A G IL ++ A+ ++ +L + +G + GYI+
Sbjct: 238 EDRFHWDATTIGI--SLAAFG-ILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILL 294

Query: 303 AFSNEFLIAAIGFFILSF 320
AF+ +A +L+
Sbjct: 295 AFATRGWMAFPIMVLLAS 312



Score = 29.0 bits (65), Expect = 0.032
Identities = 21/115 (18%), Positives = 48/115 (41%), Gaps = 1/115 (0%)

Query: 48 TLYVIKPLAALFTNAWSGSMIDRLNKRKLMIHLDIYRAVCIAILPLLPSLWMVYVFVFFI 107
+L L +L +G + RL +R+ ++ I +L WM + + +
Sbjct: 251 SLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLL 310

Query: 108 SMANAIYEPTAMTYMTKLIPVEQRQRFNSLRSLIGSGASVIGPAIAGALLIASTP 162
+ + I P +++ + E++ + + + S S++GP + A+ AS
Sbjct: 311 A-SGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASIT 364


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2651  SACTRNSFRASE  31  0.001  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 31.5 bits (71), Expect = 0.001
Identities = 22/65 (33%), Positives = 35/65 (53%), Gaps = 3/65 (4%)

Query: 81 IWHIAVHPDFRRMKIGNQLLNEGEKLAKERKLNRLEAWTRD-NLWVHGWYEKNGFV--KV 137
I IAV D+R+ +G LL++ + AKE L T+D N+ +Y K+ F+ V
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAV 151

Query: 138 DSYLH 142
D+ L+
Sbjct: 152 DTMLY 156


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2661  BLACTAMASEA  34  9e-04  Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 34.0 bits (78), Expect = 9e-04
Identities = 12/50 (24%), Positives = 18/50 (36%)

Query: 81 YAAGIADLRTKKQMKTDFRFRIGSTTKTFIATVLLQLAGENRLNLDDSIE 130
+A RT + D RF + ST K + +L L+ I
Sbjct: 43 IEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIH 92


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2667  SACTRNSFRASE  29  0.011  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 29.1 bits (65), Expect = 0.011
Identities = 18/104 (17%), Positives = 40/104 (38%), Gaps = 4/104 (3%)

Query: 151 EGQYVQAFYNQTASAHLWNSENMKLYLGFYKDEVVSVGSLVCTLDSIG-IYDIATKEEMR 209
Y + + + E +L + ++ + + + I DIA ++ R
Sbjct: 43 SKPYFKQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYR 102

Query: 210 GKGFGSTMFNYLLQEAKELNVAQCVLQASPDGV---NIYKKAGF 250
KG G+ + + ++ AKE + +L+ + + Y K F
Sbjct: 103 KKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2680  TCRTETA  47  6e-08  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 47.1 bits (112), Expect = 6e-08
Identities = 38/186 (20%), Positives = 79/186 (42%), Gaps = 8/186 (4%)

Query: 206 MMGTKQVYLLFFMLFTSCMGGLYLIGMVKDIGVQLVGLSTATAANAVAMIAIFNTVGRI- 264
M + + ++ + +G + LI V ++ + S A+ ++A++ +
Sbjct: 1 MKPNRPLIVILSTVALDAVG-IGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFAC 59

Query: 265 --VLGTLSDKIGRMKIVSATFIIIGLSVFTLSFIPLNYGIYFACVASFAFCFGGNITIFP 322
VLG LSD+ GR ++ + + ++ P + +Y + A G +
Sbjct: 60 APVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRI--VAGITGATGAVAG 117

Query: 323 AIVGDFFGLKNHSTNYGIVYQGFGFGALAGSFIGAILGGFQP--TFIIIGVLSVISFIIS 380
A + D + ++G + FGFG +AG +G ++GGF P F L+ ++F+
Sbjct: 118 AYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTG 177

Query: 381 ILIRPP 386
+ P
Sbjct: 178 CFLLPE 183



Score = 37.9 bits (88), Expect = 6e-05
Identities = 27/146 (18%), Positives = 58/146 (39%), Gaps = 13/146 (8%)

Query: 8 PLLIVLGTIIVQIGLGTIYTWSLFNQPLVSKFGWNLNSVAITFS-ITSFSLSFSTLFAGK 66
L+ + I+ +G W +F + +F W+ ++ I+ + + G
Sbjct: 213 AALMAVFFIMQLVGQVPAALWVIFGE---DRFHWDATTIGISLAAFGILHSLAQAMITGP 269

Query: 67 LQQKLGLRKLIATAGIVLGLGLILSSQVSS----LPLLYLLAGVVVGYADGTAYITSLSN 122
+ +LG R+ + I G G IL + + P++ LLA +G A ++ +
Sbjct: 270 VAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVD 329

Query: 123 LIKWFPNRKGLISGISVSAYGMGSLI 148
R+G + G + + S++
Sbjct: 330 -----EERQGQLQGSLAALTSLTSIV 350


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2701  BLACTAMASEA  34  2e-04  Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 34.0 bits (78), Expect = 2e-04
Identities = 13/49 (26%), Positives = 25/49 (51%), Gaps = 4/49 (8%)

Query: 64 LIRNEEKEIVGRINLVDIDTETRISSLGYRVGEKF----TKKGVATAAV 108
I+ E ++ GR+ ++++D + + +R E+F T K V AV
Sbjct: 28 QIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAV 76


27  GBAA_2731  GBAA_2742  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
GBAA_2731  -2  14  -3.249025  hypothetical protein
GBAA_2732  -3  12  -3.000054  RNA polymerase sigma factor SigX
GBAA_2733  -1  15  -2.456496  hypothetical protein
GBAA_2734  -1  14  -2.298691  preprotein translocase subunit SecY
GBAA_2735  -1  16  -2.564275  hypothetical protein
GBAA_2736  1  15  -1.900157  cysteine transporter
GBAA_2737  1  16  -2.207126  GntR family transcriptional regulator
GBAA_2738  1  17  -2.771744  alpha/beta hydrolase
GBAA_2739  1  18  -3.784089  hypothetical protein
GBAA_2740  -1  18  -4.185434  hypothetical protein
GBAA_2741  1  20  -4.708720  hypothetical protein
GBAA_2742  0  16  -3.659116  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2734  SECYTRNLCASE  443  e-156  Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 443 bits (1141), Expect = e-156
Identities = 180/445 (40%), Positives = 270/445 (60%), Gaps = 22/445 (4%)

Query: 1 MFRTISNFMRVAEIRRKILFTLAMLIVFRIGTFIPVPHTNAEVLK-----IQDQANVLGM 55
M + R ++R+K+LFTLA+++V+R+GT IP+P + + ++ + G+
Sbjct: 1 MLTAFARAFRTPDLRKKLLFTLAIIVVYRVGTHIPIPGVDYKNVQQCVREASGNQGLFGL 60

Query: 56 LNVFGGGALQHFSIFAVGITPYITASIIVQLLQMDVIPKFSEWAKQGEMGRKKSAQFTRY 115
+N+F GGAL +IFA+GI PYITASII+QLL + VIP+ K+G+ G K Q+TRY
Sbjct: 61 VNMFSGGALLQITIFALGIMPYITASIILQLLTV-VIPRLEALKKEGQAGTAKITQYTRY 119

Query: 116 FTIILAFIQAIGMSYGFNNI-------AGGQLITDQSWTTYLFIATVLTAGTAFLLWLGE 168
T+ LA +Q G+ + GGQ++ DQS T + + +TAGT ++WLGE
Sbjct: 120 LTVALAILQGTGLVATARSAPLFGRCSVGGQIVPDQSIFTTITMVICMTAGTCVVMWLGE 179

Query: 169 QITANGVGNGISMLIFAGLVAAIPNVANQIYLQQFQNAGDQLFMHIIKMLLIGLVILAIV 228
IT G+GNG+S+L+F + A P+ I Q G F +I V L +V
Sbjct: 180 LITDRGIGNGMSILMFISIAATFPSALWAIKKQGTLAGGWIEFGTVI------AVGLIMV 233

Query: 229 VGVIYIQQAVRKIPIQYAKAVSGNNQYQGAKNTHLPLKVNSAGVIPVIFASAFLMTPRTI 288
V++++QA R+IP+QYAK + G Y G +T++PLKVN AGVIPVIFAS+ L P +
Sbjct: 234 ALVVFVEQAQRRIPVQYAKRMIGRRSY-GGTSTYIPLKVNQAGVIPVIFASSLLYIPALV 292

Query: 289 AQLFPDSSVSKWLVAN--LDFAHPIGMTLYVGLIVAFTYFYAFIQVNPEQMAENLKKQNG 346
AQ +S K V HPI + Y LIV F +FY I NPE++A+N+KK G
Sbjct: 293 AQFAGGNSGWKSWVEQNLTKGDHPIYIVTYFLLIVFFAFFYVAISFNPEEVADNMKKYGG 352

Query: 347 YVPCIRPGKSTEQYVTKILYRLTFIGAIFLGAISILPLVFTKIATLPPSAQIGGTSLLII 406
++P IR G+ T +Y++ +L R+T+ G+++LG I+++P + + GGTS+LII
Sbjct: 353 FIPGIRAGRPTAEYLSYVLNRITWPGSLYLGLIALVPTMALVGFGASQNFPFGGTSILII 412

Query: 407 VGVALETMKTLESQLVKRHYKGFIK 431
VGV LET+K +ESQL +R+Y+GF++
Sbjct: 413 VGVGLETVKQIESQLQQRNYEGFLR 437


28  GBAA_2754  GBAA_2766  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
GBAA_2754  1  16  -3.894193  TetR family transcriptional regulator
GBAA_2755  -1  16  -2.987106  mutT/nudix family protein
GBAA_2756  -1  16  -3.556951  hypothetical protein
GBAA_2757  -2  17  -3.259829  hypothetical protein
GBAA_2759  -1  20  -3.248335  hypothetical protein
GBAA_2760  0  18  -3.023696  hypothetical protein
GBAA_2761  0  17  -3.033088  acetoin operon transcriptional activator
GBAA_2762  3  22  -4.561679  hypothetical protein
GBAA_2763  3  20  -3.591416  hypothetical protein
GBAA_2764  2  18  -3.389525  acetyltransferase
GBAA_2765  1  17  -3.322948  DeoR family transcriptional regulator
GBAA_2766  1  18  -3.077274  lipoprotein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2754  HTHTETR  59  4e-13  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 59.3 bits (143), Expect = 4e-13
Identities = 34/196 (17%), Positives = 74/196 (37%), Gaps = 11/196 (5%)

Query: 12 RSLETKKKLLHSGYTIFIRNGFQKTTITQIIKHAETGYGTAYVYFKNKDDLLIVLMEDVM 71
+ ET++ +L +F + G T++ +I K A G Y +FK+K DL + ++
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW-ELS 66

Query: 72 NQFYNIAERSFSPQTTEEARTMIQNQVKAFLQLAEKE------RAILQVVEEAIGLSKEI 125
E + + + ++++ + L+ E I+ E +G +
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVV 126

Query: 126 RQKWDEIRERFINSITQDITYSQESGLAHSKLNKEIVARAWFAMNEMFLWTIVQNDKKLE 185
+Q + + I Q + + E+ + + L A + + + +
Sbjct: 127 QQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFD 186

Query: 186 LEEI----VHTLTEMY 197
L++ V L EMY
Sbjct: 187 LKKEARDYVAILLEMY 202


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2761  HTHFIS  386  e-131  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 386 bits (994), Expect = e-131
Identities = 119/343 (34%), Positives = 188/343 (54%), Gaps = 32/343 (9%)

Query: 304 TFPGVIGTSDAFQHTLEEIKLVSPTDASVYVCGETGVGKEYVARAIHENSPRKNGPFIAV 363
++G S A Q + + TD ++ + GE+G GKE VARA+H+ R+NGPF+A+
Sbjct: 135 DGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194

Query: 364 NCGALPKELMESELFGYAEGAFTGARRQGYKGKFEQADGGTIFLDEIGEVPPEMQVALLR 423
N A+P++L+ESELFG+ +GAFTGA+ + G+FEQA+GGT+FLDEIG++P + Q LLR
Sbjct: 195 NMAAIPRDLIESELFGHEKGAFTGAQTRS-TGRFEQAEGGTLFLDEIGDMPMDAQTRLLR 253

Query: 424 VLQERTVTPIGSSKEVPVNIRIITATHKDLLRLVEEGKFRQDLYYRLHVYPLYVPSLIER 483
VLQ+ T +G + ++RI+ AT+KDL + + +G FR+DLYYRL+V PL +P L +R
Sbjct: 254 VLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDR 313

Query: 484 KEDIPYFIKHFCKRKNWNVVFPKSI----CNQFSQHTWPGNIRELLNALERIYILSQGRE 539
EDIP ++HF ++ + K H WPGN+REL N + R+ L
Sbjct: 314 AEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDV 373

Query: 540 ICEKQISFLLQTMMRNQHQLELQTENKTEDTLN--------------------------F 573
I + I L++ + +E +++
Sbjct: 374 ITREIIENELRSEI-PDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRV 432

Query: 574 REKIQRDSMIEALEKTNGNVSSAAKLLDVPRSTFYKRMQKYKL 616
+++ ++ AL T GN AA LL + R+T K++++ +
Sbjct: 433 LAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2764  SACTRNSFRASE  44  4e-08  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 44.2 bits (104), Expect = 4e-08
Identities = 25/94 (26%), Positives = 42/94 (44%), Gaps = 4/94 (4%)

Query: 65 EESKNLFLVAEVHDRIVGFSRCEGSNLKRLSHKIEFGVCILKEFWGYGIGKSLLGQSIHW 124
EE K FL + + +G + SN + + V K++ G+G +LL ++I W
Sbjct: 62 EEGKAAFL-YYLENNCIGRIKIR-SNWNGYALIEDIAVA--KDYRKKGVGTALLHKAIEW 117

Query: 125 ADENEIKKISLQVLETNEKAIQLYKKLGFEVEGI 158
A EN + L+ + N A Y K F + +
Sbjct: 118 AKENHFCGLMLETQDINISACHFYAKHHFIIGAV 151


29  GBAA_2777  GBAA_2782  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
GBAA_2777  4  19  -4.458653  hypothetical protein
GBAA_2778  3  19  -4.524539  short chain dehydrogenase
GBAA_2779  6  22  -4.786187  hypothetical protein
GBAA_2780  5  18  -3.414070  hypothetical protein
GBAA_2781  2  15  -1.529343  mutT/nudix family protein
GBAA_2782  2  13  -1.368986  cpsh domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2778  DHBDHDRGNASE  93  2e-24  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 92.8 bits (230), Expect = 2e-24
Identities = 53/188 (28%), Positives = 86/188 (45%), Gaps = 3/188 (1%)

Query: 4 KVAIITGASSGFGLLTTLELAKKDYLIIATMRNLEKQANLISQATQLNLQQNITVQQLDV 63
K+A ITGA+ G G LA + I A N EK ++S DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE--AFPADV 66

Query: 64 TDQNSIHNF-QLYIKEINRVDLLINNAGYANGGFVEEIPVEEYRKQFETNLFGAISITQL 122
D +I +E+ +D+L+N AG G + + EE+ F N G + ++
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 123 VLPYMREQKSGKIINISSISGQVGFPGLSPYVSSKYALEGWSESLRLEVKSFGIDVALIE 182
V YM +++SG I+ + S V ++ Y SSK A +++ L LE+ + I ++
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 183 PGSYNTNI 190
PGS T++
Sbjct: 187 PGSTETDM 194


30  GBAA_2853  GBAA_2867  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
GBAA_2853  -4  19  -3.588070  cell division protein DivIC
GBAA_2854  0  21  -2.818758  hypothetical protein
GBAA_2855  0  16  -0.726591  hypothetical protein
GBAA_2856  1  16  -2.177264  hypothetical protein
GBAA_2857  1  14  -2.199848  hypothetical protein
GBAA_2858  2  14  -2.555166  hypothetical protein
GBAA_2859  1  15  -3.225393  hypothetical protein
GBAA_2860  0  15  -3.047226  x-prolyl-dipeptidyl aminopeptidase
GBAA_2861  0  14  -4.408648  sensor histidine kinase SrrB
GBAA_2863  1  17  -3.445144  hypothetical protein
GBAA_2864  0  16  -2.714007  GNAT family acetyltransferase
GBAA_2865  0  16  -2.540128  hypothetical protein
GBAA_2866  0  16  -2.271635  bifunctional
GBAA_2867  2  14  -2.348015  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2861  PF06580  36  3e-04  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.6 bits (82), Expect = 3e-04
Identities = 33/173 (19%), Positives = 60/173 (34%), Gaps = 38/173 (21%)

Query: 282 IIKQSDHISNLIEEL---LRFS---KLERDVLQKEEFSIKSLVQSILDKHKIELESKEIN 335
I++ ++ L +R+S R V +E + +V S L I+ E +
Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELT---VVDSYLQLASIQFEDR--- 239

Query: 336 LQVNYNVGDAIVYADVNKMRMVFQNLISNAIKY-----TSNQNIKITLEDRNESVYFQIQ 390
LQ + AI+ V M + Q L+ N IK+ I + N +V +++
Sbjct: 240 LQFENQINPAIMDVQVPPM--LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVE 297

Query: 391 NGMNAEHMKDIDKIWEPFYVLESSRSKDRSGTGLGLAIVKS-IVERHGFDYGV 442
N + TG GL V+ + +G + +
Sbjct: 298 N--TGSLALK----------------NTKESTGTGLQNVRERLQMLYGTEAQI 332


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2864  SACTRNSFRASE  26  0.037  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 26.4 bits (58), Expect = 0.037
Identities = 20/101 (19%), Positives = 38/101 (37%), Gaps = 27/101 (26%)

Query: 26 DEGFYFLIKLISEYENKINTF-----------------------NKTGECLYGIFQGEKL 62
+E F ++I +EN + T+ + G+ + +
Sbjct: 17 NEPFVVFGRMIPAFENGVWTYTEERFSKPYFKQYEDDDMDVSYVEEEGKAAFLYYLENNC 76

Query: 63 IGIGGLNADPYTENNKIGRLRRFYIAKDYRRIGLGKLLLNK 103
IG + ++ N + +AKDYR+ G+G LL+K
Sbjct: 77 IGRIKIRSNW----NGYALIEDIAVAKDYRKKGVGTALLHK 113


31  GBAA_2880  GBAA_2913  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
GBAA_2880  -2  14  -3.172035  hypothetical protein
GBAA_2881  -2  13  -2.439388  solute-binding family 5 protein
GBAA_2882  1  12  -2.265298  major facilitator family transporter protein
GBAA_2883  2  15  -2.893177  lipoprotein
GBAA_2884  1  15  -3.122006  hypothetical protein
GBAA_2886  1  16  -2.975202  hypothetical protein
GBAA_2887  1  19  -2.997223  hypothetical protein
GBAA_2888  1  18  -3.889843  inosine-uridine preferring nucleoside hydrolase
GBAA_2889  -1  12  -3.610724  hypothetical protein
GBAA_2890  -1  11  -3.675261  UbiE/COQ5 family methlytransferase
GBAA_2891  0  16  -4.298746  hypothetical protein
GBAA_2892  0  19  -2.316681  hypothetical protein
GBAA_2894  0  18  -2.192491  hypothetical protein
GBAA_2896  0  19  -2.293846  transporter
GBAA_2897  1  19  -3.629994  hypothetical protein
GBAA_2898  1  17  -4.066005  lipoprotein
GBAA_2899  1  18  -3.668490  aspartate aminotransferase
GBAA_2900  3  20  -5.662536  (3R)-hydroxymyristoyl-ACP dehydratase
GBAA_2901  3  21  -6.545329  pantothenate kinase
GBAA_2902  1  22  -7.725971  CAAX amino terminal protease
GBAA_2904  1  23  -6.710475  hypothetical protein
GBAA_2909  6  33  -6.093215  hypothetical protein
GBAA_2910  8  37  -5.723773  hypothetical protein
GBAA_2911  9  36  -5.929638  hypothetical protein
GBAA_2912  4  24  -3.453175  hypothetical protein
GBAA_2913  1  16  -3.222748  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2880  PYOCINKILLER  31  0.004  Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 30.9 bits (69), Expect = 0.004
Identities = 14/53 (26%), Positives = 20/53 (37%), Gaps = 5/53 (9%)

Query: 12 LELTGISYGQLYRWKRKNLIPEDWFVRKSTFTGQETFFPKEKILERINKIQTM 64
L+ + G KNL P D R T G +K+L KI ++
Sbjct: 97 LDKADAALGPA-----KNLAPLDVINRSLTIVGNALQQKNQKLLLNQKKITSL 144


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2882  TCRTETA  80  1e-18  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 80.3 bits (198), Expect = 1e-18
Identities = 53/318 (16%), Positives = 113/318 (35%), Gaps = 9/318 (2%)

Query: 50 LIFGLQPFSDIVFTLIAGGITDKYGRKKIMLLGLLLQGVAIGSFVFAQSVFIFALLYVIN 109
++ L + G ++D++GR+ ++L+ L V A +++ + ++
Sbjct: 47 ILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVA 106

Query: 110 GIGRSLYIPAQRAQIADLIKQGQQAEIFALLQTMGAIGTVIGPLIGAVFYNTHPEYLFIM 169
GI + A IAD+ ++A F + G V GP++G + P F
Sbjct: 107 GITGAT-GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFA 165

Query: 170 QSITLMVYAVVVWTQLPETAPAITMPKQKLEVSSPKQF--VRNHSAVIGLMVSTLPISFF 227
+ + + LPE+ P ++ ++ F R + V LM +
Sbjct: 166 AAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLV 225

Query: 228 YAQTETNYRIFAEDVFPNFIFILAFISTCRAIMEIILQIFLV-KWSERFSMAKIIIISYT 286
+ IF ED F + I+ + Q + + R + +++
Sbjct: 226 GQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLG-- 283

Query: 287 CYIVAAIGYGFSATIVS--LFFTLLFLVIGESIALNHLLRFVSEIAPSDKRGLYFSIYGL 344
I GY A + F ++ L+ I + L +S +++G
Sbjct: 284 -MIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAA 342

Query: 345 HWDVSRTCGPVIGAILLS 362
++ GP++ + +
Sbjct: 343 LTSLTSIVGPLLFTAIYA 360



Score = 47.5 bits (113), Expect = 5e-08
Identities = 20/121 (16%), Positives = 53/121 (43%), Gaps = 1/121 (0%)

Query: 45 IMITMLIFGLQPFSDIVFTLIAGGITDKYGRKKIMLLGLLLQGVAIGSFVFAQSVFIFAL 104
I + + + +I G + + G ++ ++LG++ G FA ++
Sbjct: 246 TTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFP 305

Query: 105 LYVINGIGRSLYIPAQRAQIADLIKQGQQAEIFALLQTMGAIGTVIGPLIGAVFYNTHPE 164
+ V+ G + +PA +A ++ + + +Q ++ L + ++ +++GPL+ Y
Sbjct: 306 IMVLLASG-GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASIT 364

Query: 165 Y 165

Sbjct: 365 T 365


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2883  TYPE4SSCAGA  29  0.014  Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 29.3 bits (65), Expect = 0.014
Identities = 35/130 (26%), Positives = 52/130 (40%), Gaps = 20/130 (15%)

Query: 20 LAACKGTDEKKETNP----TSENSKNEQNTSSEGK-----KEPEVKSNTDSNSKDIVINQ 70
L A KG+ + NP EN N GK K + KS+ +++ KD++INQ
Sbjct: 719 LKALKGSVKDLGINPEWISKVENLNAALNEFKNGKNKDFSKVTQAKSDLENSVKDVIINQ 778

Query: 71 KSINHVKNLFELAKEGKVPNVPFAAHTGDIEEIEKAWGKADKTEQAGNGMYATFTNKNVS 130
K + V NL + K TGD +E+A + A KN S
Sbjct: 779 KVTDKVDNLNQAVSVAKA--------TGDFSRVEQALADLKNFSKE---QLAQQAQKNES 827

Query: 131 FGFNKGSQVF 140
K S+++
Sbjct: 828 LNARKKSEIY 837


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2887  CHANLCOLICIN  35  9e-06  Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 35.4 bits (81), Expect = 9e-06
Identities = 15/49 (30%), Positives = 24/49 (48%), Gaps = 3/49 (6%)

Query: 6 IVGGILGWLASLITGRDVPGGVIG-NIIAGIIGSWIGGKLLGSFGPVIG 53
V ++ L SL+ G G+ G I+ GI+ S+I L + V+G
Sbjct: 475 GVSYVVALLFSLLAG--TTLGIWGIAIVTGILCSYIDKNKLNTINEVLG 521


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_2896  TCRTETA  45  4e-07  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 44.8 bits (106), Expect = 4e-07
Identities = 65/342 (19%), Positives = 118/342 (34%), Gaps = 11/342 (3%)

Query: 1 MWRNKNVWIVLIGEFIAGLGLWLGILGNLEFMQKYVPSDFMKS---VILFIGLLAGVLVG 57
M N+ + ++L + +G+ L + ++ V S+ + + ++L + L
Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60

Query: 58 PMAGRIIDQYEKKKVHLYAGFGRVISVIFMFFAIQFESIAFMIAFMVALQISAAFYFPAL 117
P+ G + D++ ++ V L + G + M A + I +VA A
Sbjct: 61 PVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVL--YIGRIVAGITGATGAVAG- 117

Query: 118 QSVIPLIVREHELLQMNGVHMNVGTIARIAGTSLGGILLVVMSLQYMYAFSMAAYALLFL 177
+ I I E + G +AG LGG L+ S + + A L FL
Sbjct: 118 -AYIADITDGDERARHFGFMSACFGFGMVAGPVLGG-LMGGFSPHAPFFAAAALNGLNFL 175

Query: 178 STFFLQFEDKKSTTPSKQAAKDNSFMEVFRILRGIPIAFTALILSIIPLLFIAGFNLMVI 237
+ FL E K + N +A + I+ L+ L VI
Sbjct: 176 TGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVI 235

Query: 238 -NISEMQHDPTIKGFIYTIEGIAFMLG-AFVIKRLSDHFKPEKLLYFFAVCTAFAHLSLF 295
D T G GI L A + ++ + L + ++ L
Sbjct: 236 FGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLA 295

Query: 296 FSDIKWMSLTSFGLFGFSVGCFFPIMSTIFQTKVEKSYHGRL 337
F+ WM+ L G P + + +V++ G+L
Sbjct: 296 FATRGWMAFPIMVLLAS-GGIGMPALQAMLSRQVDEERQGQL 336


32  GBAA_3002  GBAA_3007  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
GBAA_3002  2  17  -4.183924  hypothetical protein
GBAA_3003  1  21  -4.196011  DNA-binding protein
GBAA_3004  0  22  -4.190713  hypothetical protein
GBAA_3005  0  20  -3.972309  lipoprotein
GBAA_3006  0  20  -3.596243  CAAX amino terminal protease
GBAA_3007  0  20  -3.717370  histidine kinase domain-containing protein
33  GBAA_3043  GBAA_3067  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
GBAA_3043  1  13  -4.292425  hypothetical protein
GBAA_3044  1  13  -2.308023  mutT/nudix family protein
GBAA_3046  1  13  -2.590865  hypothetical protein
GBAA_3047  0  13  -3.825680  hypothetical protein
GBAA_3048  0  14  -4.147285  ABC transporter ATP-binding protein
GBAA_3049  0  16  -3.521275  GntR family transcriptional regulator
GBAA_3051  0  16  -2.902316  hypothetical protein
GBAA_3052  0  17  -3.823570  ABC transporter ATP-binding protein
GBAA_3053  0  16  -3.507623  ABC transporter permease
GBAA_3054  2  17  -2.795707  ABC transporter permease
GBAA_3055  2  17  -1.309015  hypothetical protein
GBAA_3056  1  17  -0.965985  hypothetical protein
GBAA_3057  2  13  0.036878  araC family transcriptional regulator
GBAA_3058  -1  12  0.703999  acetyltransferase
GBAA_3059  -1  13  0.288201  hypothetical protein
GBAA_3060  0  13  -0.503100  mutT/nudix family protein
GBAA_3061  1  13  -2.170059  cysteine transporter
GBAA_3062  2  14  -3.846333  GntR family transcriptional regulator
GBAA_3063  4  19  -6.356474  hypothetical protein
GBAA_3064  4  19  -5.998239  hypothetical protein
GBAA_3065  2  19  -5.412204  hypothetical protein
GBAA_3066  0  17  -4.116487  sensor histidine kinase
GBAA_3067  0  15  -3.020500  DNA-binding response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3051  NUCEPIMERASE  36  2e-04  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 35.5 bits (82), Expect = 2e-04
Identities = 28/135 (20%), Positives = 45/135 (33%), Gaps = 42/135 (31%)

Query: 1 MKILILGGTRFLGRAFVEEALQRGHEV-----------TLFNRGTNQEI------FLE-- 41
MK L+ G F+G + L+ GH+V + + + F +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 42 ------VEQLIGDRNGDV-----------SSLENRKWDVVINTCGFSPHHIRNVGEVLKD 84
+ L + + SLEN N GF N+ E +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFL-----NILEGCRH 115

Query: 85 -NIEHYIFISSLSVY 98
I+H ++ SS SVY
Sbjct: 116 NKIQHLLYASSSSVY 130


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3058  SACTRNSFRASE  36  2e-05  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.5 bits (84), Expect = 2e-05
Identities = 16/71 (22%), Positives = 29/71 (40%)

Query: 60 LSQTHKEEAYVHFIGVNPKYRRRGIASTLYSYFFDVARANKRKVVKAITSPVNKKSIQFH 119
+ A + I V YR++G+ + L + A+ N + T +N + F+
Sbjct: 82 IRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFY 141

Query: 120 REIGFRIEAGD 130
+ F I A D
Sbjct: 142 AKHHFIIGAVD 152


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3066  PF06580  29  0.017  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.4 bits (66), Expect = 0.017
Identities = 16/78 (20%), Positives = 33/78 (42%), Gaps = 7/78 (8%)

Query: 250 IQRLFDNIFQNVLKHSKAK---KLKIIIDEDIVYF--RDNGIGFDINSK-GTGLGLKNI- 302
+Q L +N ++ + LK D V + G N+K TG GL+N+
Sbjct: 260 VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVR 319

Query: 303 EDISKMFDIKYTLQSNSE 320
E + ++ + ++ + +
Sbjct: 320 ERLQMLYGTEAQIKLSEK 337


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3067  HTHFIS  75  8e-18  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.9 bits (184), Expect = 8e-18
Identities = 36/117 (30%), Positives = 61/117 (52%), Gaps = 3/117 (2%)

Query: 6 ILIVEDDLIIGDLLQKILQREKYNVYWEKEGRKVLDII--HEIDLVVMDVMLPGEDGYQI 63
IL+ +DD I +L + L R Y+V + I + DLVV DV++P E+ + +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 TKKIKNLGLNIPIIFLSARNDMDSKLKGLTIGE-EYMIKPFDPRELLLRIQKMLGNQ 119
+IK ++P++ +SA+N + +K G +Y+ KPFD EL+ I + L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122


34  GBAA_3079  GBAA_3097  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
GBAA_3079  2  18  -2.807780  hypothetical protein
GBAA_3081  -1  17  -3.609440  hypothetical protein
GBAA_3082  -1  15  -4.214897  hypothetical protein
GBAA_3083  -2  14  -2.711027  lipoprotein
GBAA_3085  0  12  -1.352263  hypothetical protein
GBAA_3086  0  12  -0.655342  signal peptidase I
GBAA_3088  1  13  0.547947  hypothetical protein
GBAA_3089  1  13  2.040979  NAD-dependent deacetylase
GBAA_3090  2  15  2.591032  pyrrolidone-carboxylate peptidase
GBAA_3093  2  15  2.372289  hypothetical protein
GBAA_3094  1  16  1.948418  hypothetical protein
GBAA_3095  0  16  1.705868  LamB/YcsF family protein
GBAA_3096  2  15  0.004295  urea amidolyase
GBAA_3097  2  13  -0.940727  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3088  ANTHRAXTOXNA  27  0.049  Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 27.4 bits (60), Expect = 0.049
Identities = 16/50 (32%), Positives = 25/50 (50%), Gaps = 5/50 (10%)

Query: 11 APHCLNFMIYIQNIYLNQKE-----KKGNLRFPYIAKQFNFSTNFEANFK 55
AP N+ Y++ NQ + +K N+ F + KQ NF+ N NF+
Sbjct: 742 APEYKNYFQYLKERITNQVQLLLTHQKSNIEFKLLYKQLNFTENETDNFE 791


35  GBAA_3118  GBAA_3153  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
GBAA_3118  3  20  -0.575839  metallo-beta-lactamase
GBAA_3119  3  20  -1.700315  hypothetical protein
GBAA_3120  1  20  -1.676453  hypothetical protein
GBAA_3121  2  19  -2.181773  spore coat protein CotF
GBAA_3123  2  19  -1.388920  hypothetical protein
GBAA_3124  2  19  -1.363933  hypothetical protein
GBAA_3126  2  18  0.292118  hypothetical protein
GBAA_3127  1  17  -0.981620  small acid-soluble spore protein alpha/beta
GBAA_3128  3  15  -1.045829  hypothetical protein
GBAA_3129  2  12  0.497447  hypothetical protein
GBAA_3130  2  12  0.027956  small, acid-soluble spore protein
GBAA_3131  2  12  -0.274933  alcohol dehydrogenase
GBAA_3133  1  12  -1.630603  hypothetical protein
GBAA_3134  2  11  -0.658945  catalase
GBAA_3136  2  13  -1.245639  aspartate ammonia-lyase
GBAA_3137  4  18  -2.540633  L-asparaginase
GBAA_3138  5  18  -3.053377  transcriptional regulator AnsR
GBAA_3140  6  16  -2.463419  hypothetical protein
GBAA_3141  4  16  -1.082036  amino acid permease
GBAA_3142  2  18  -2.043146  branched chain amino acid ABC transporter
GBAA_3143  2  17  -1.430646  pyrroline-5-carboxylate reductase
GBAA_3144  3  18  -1.970140  hypothetical protein
GBAA_3145  2  17  -2.553369  malate dehydrogenase
GBAA_3150  1  17  -2.859555  spore germination protein GerAA
GBAA_3153  2  17  -4.071705  response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3141  BACINVASINB  30  0.030  Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 29.7 bits (66), Expect = 0.030
Identities = 48/195 (24%), Positives = 87/195 (44%), Gaps = 35/195 (17%)

Query: 174 TRESKRINNIMVLIK--IGMILLFITVGIFYVKPMNWIPIAPYGLSGVFTGGAAILFAFT 231
TR+++ N IM I +G +L ++V ++ VFTGGA++ A
Sbjct: 304 TRKAEETNRIMGCIGKVLGALLTIVSV-----------------VAAVFTGGASLALAAV 346

Query: 232 GFDILATSAEEVKDPKRNLPIGIIASLIICTIIYVMVCLVMTGMVSYKE-LNVPEAMAYV 290
G ++ E VK I + I+ ++ ++ L+ + E L V + A
Sbjct: 347 GLAVMVAD-EIVKAATGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDKKTA-- 403

Query: 291 MEVVGQ--GKVAGAIAVGAVIGLMAVIFSNMYAATRVFFAMSRDGLLPKSFAKVNKKTGA 348
E+ G G + AIA+ AVI ++AV+ AA ++ A+S+ ++ ++ K+
Sbjct: 404 -EMAGSIVGAIVAAIAMVAVIVVVAVVGKG--AAAKLGNALSK--MMGETIKKL-----V 453

Query: 349 PTFITGLAGIGSSII 363
P + LA GS +
Sbjct: 454 PNVLKQLAQNGSKLF 468


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3153  HTHFIS  50  7e-09  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.8 bits (119), Expect = 7e-09
Identities = 20/123 (16%), Positives = 54/123 (43%), Gaps = 8/123 (6%)

Query: 5 IVDDEKAVRSMLAQIIEDEDLGEVIGEAENGLSLEQQMLILKN--IDILFIDLLMPIQDG 62
+ DD+ A+R++L Q + +V + + + D++ D++MP ++
Sbjct: 8 VADDDAAIRTVLNQALSRAGY-DVRITS----NAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 63 IKTIRQIKPSFKG-KIIMVSQVESKELIAEAYSLGVEYYIIKPINRIEVLTVVRKVIERI 121
+ +IK + ++++S + +A G Y+ KP + E++ ++ + +
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 122 RLE 124
+
Sbjct: 123 KRR 125


36  GBAA_3251  GBAA_3313  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
GBAA_3251  2  11  -0.790065  3-oxoacyl-ACP synthase
GBAA_3252  4  12  -1.146217  hypothetical protein
GBAA_3253  2  14  -0.460582  hypothetical protein
GBAA_3254  2  14  -0.352257  cell wall anchor domain-containing protein
GBAA_3255  1  14  0.346650  ABC transporter ATP-binding protein
GBAA_3256  -1  13  -0.478975  ABC transporter permease
GBAA_3257  -1  13  -0.755136  arsR family transcriptional regulator
GBAA_3258  0  13  -1.012155  permease
GBAA_3260  1  12  -1.456522  sensor histidine kinase
GBAA_3261  1  11  -0.539407  DNA-binding response regulator
GBAA_3262  1  12  -0.006568  hypothetical protein
GBAA_3263  2  14  1.618137  marR family transcriptional regulator
GBAA_3265  1  15  2.146869  protease synthase and sporulation negative
GBAA_3266  1  14  2.193516  hypothetical protein
GBAA_3267  2  14  2.102884  major facilitator family transporter protein
GBAA_3268  2  13  1.743307  hypothetical protein
GBAA_3269  3  14  1.002226  (Fe-S)-binding protein
GBAA_3270  4  17  -1.642161  hypothetical protein
GBAA_3271  5  18  -2.432625  LysR family transcriptional regulator
GBAA_3272  5  18  -3.271644  ATP synthase F0F1 subunit alpha
GBAA_3275  6  20  -4.435606  hypothetical protein
GBAA_3278  6  21  -5.366995  hypothetical protein
GBAA_3279  6  20  -4.927076  hypothetical protein
GBAA_3280  4  20  -4.061930  Gfo/Idh/MocA family oxidoreductase
GBAA_3281  4  20  -4.205435  hypothetical protein
GBAA_3283  2  19  -3.398048  LacI family transcriptional regulator
GBAA_3284  2  19  -2.471443  hypothetical protein
GBAA_3285  3  22  -1.950388  hypothetical protein
GBAA_3286  2  17  -2.368603  hypothetical protein
GBAA_3287  1  14  -2.083445  hypothetical protein
GBAA_3288  1  14  -2.507792  impB/mucB/samB family protein
GBAA_3289  2  15  -2.312860  hypothetical protein
GBAA_3290  1  12  -2.446296  hypothetical protein
GBAA_3291  0  12  -2.525789  methyl-accepting chemotaxis protein
GBAA_3294  1  12  -1.821009  CAAX amino terminal protease
GBAA_3295  0  14  -1.604546  spermine/spermidine acetyltransferase
GBAA_3296  -1  13  -1.190348  MATE efflux family protein
GBAA_3299  0  14  -0.869228  collagenase
GBAA_3300  1  19  1.394923  hypothetical protein
GBAA_3302  2  18  1.223414  transporter
GBAA_3303  1  18  0.211856  TetR family transcriptional regulator
GBAA_3305  3  18  -1.510007  arsR family transcriptional regulator
GBAA_3306  3  19  -1.605408  serine/threonine transporter family protein
GBAA_3307  4  18  -1.919680  L-serine dehydratase, iron-sulfur-dependent
GBAA_3308  5  19  -2.740282  l-serine dehydratase, iron-sulfur-dependent
GBAA_3312  6  19  -2.915857  diaminobutyrate--2-oxoglutarate
GBAA_3313  5  18  -3.144274  hydrogenase maturation protein HypF
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3254  GPOSANCHOR  36  2e-04  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 36.2 bits (83), Expect = 2e-04
Identities = 40/258 (15%), Positives = 96/258 (37%), Gaps = 3/258 (1%)

Query: 40 LAEIKQHKQGLDAKLQQHKENVDQTLNELNKVKENVDTKVNELHERKQVADEKINEIKQH 99
A Q + A L++ E + + ++ + L RK ++ +
Sbjct: 111 KASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNF 170

Query: 100 KQELDAKLQQ---DKQIAEDKIAEIKEHKKQVEDKVAEVKEHKQNIDNKVNEIKEHKQTV 156
AK++ +K E + AE+++ + + + ++ + + K +
Sbjct: 171 STADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADL 230

Query: 157 DEKVNEMKQHKENIDQKVNELKEVKKQVDEKLAELKKAKQTAEDKLAELKENKPNTGNTL 216
++ + K+ L+ K ++ + AEL+KA + A +
Sbjct: 231 EKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEK 290

Query: 217 EELKKIKSNLDSLSANLELAKQDVKNKLAALQEARQDLINKINEIKQSKQTVSDDLSKKK 276
L+ K++L+ S L +Q ++ L A +EA++ L + ++++ + +
Sbjct: 291 AALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLR 350

Query: 277 QDLDIKINDFKHTEKKID 294
+DLD K E +
Sbjct: 351 RDLDASREAKKQLEAEHQ 368



Score = 34.3 bits (78), Expect = 7e-04
Identities = 33/227 (14%), Positives = 74/227 (32%), Gaps = 4/227 (1%)

Query: 41 AEIKQHKQGLDAKLQQHKENVDQTLNELNKVKENVDTKVNELHERKQVADEKINEIKQHK 100
A Q E L + + N T + + + + K
Sbjct: 101 LRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADL 160

Query: 101 QELDAKLQQDKQIAEDKIAEIKEHKKQVEDKVAEVKEHKQNIDNKVNEIKEHKQTVDEKV 160
++ KI ++ K +E + AE+++ + N +T++ +
Sbjct: 161 EKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEK 220

Query: 161 NEMKQHKENIDQKVNELKEVKKQVDEKLAELKKAKQTAEDKLAELKE----NKPNTGNTL 216
+ K ++++ + K+ L+ K E + AEL++ +
Sbjct: 221 AALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADS 280

Query: 217 EELKKIKSNLDSLSANLELAKQDVKNKLAALQEARQDLINKINEIKQ 263
++K +++ +L A + + A Q R+DL KQ
Sbjct: 281 AKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQ 327


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3260  PF06580  38  5e-05  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 38.3 bits (89), Expect = 5e-05
Identities = 17/102 (16%), Positives = 34/102 (33%), Gaps = 22/102 (21%)

Query: 359 NIFTNSIKFSNEGGTIEFFVEELESSVIISISDNGIGMEKEEMDRIFDRFYKVDTARARN 418
N + I +GG I + +V + + + G K
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK----------------- 308

Query: 419 VEGSGLGLSIVQKIVELHNGN---VSVYSTKGEGTTVRVELP 457
E +G GL V++ +++ G + + +G V +P
Sbjct: 309 -ESTGTGLQNVRERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3261  HTHFIS  82  2e-20  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.2 bits (203), Expect = 2e-20
Identities = 34/123 (27%), Positives = 61/123 (49%), Gaps = 1/123 (0%)

Query: 1 MKMIHILLADDDKHIRELLHYHLQKEGFKVFEAEDGKVAQEVLEKENIHLAIVDIMMPFV 60
M IL+ADDD IR +L+ L + G+ V + + + L + D++MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGYTLCEEIRK-YHDIPVILLTAKDQLVDKEKGFISGTDDYIVKPFEPAEVIFRMKALLR 119
+ + L I+K D+PV++++A++ + K G DY+ KPF+ E+I + L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RYQ 122
+
Sbjct: 121 EPK 123


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3267  TCRTETA  34  0.001  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.0 bits (78), Expect = 0.001
Identities = 61/379 (16%), Positives = 127/379 (33%), Gaps = 42/379 (11%)

Query: 45 WGAILGYFGYGYMIGSLLGGIFSDKKGPKFVWIVAATAWSIFEIATAFAGEIGIAVFGGS 104
+G +L + + + G SD+ G + V +V+ ++ A A + + G
Sbjct: 45 YGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIG-- 102

Query: 105 ALIGFAIFRVLFGLTEGPSFAVSNKTAANWAAPKERAFLTSLGFVGVPLGAVLTA-PVAV 163
R++ G+T G + AV+ A+ ERA GF+ G + A PV
Sbjct: 103 --------RIVAGIT-GATGAVAGAYIADITDGDERA--RHFGFMSACFGFGMVAGPVLG 151

Query: 164 LLLSFTSWKIMFFILGTIGIVWAIIWYFTFTNMPEDHPRVTKEELAEIRSTEGVLQSAKV 223
L+ S FF + + + F +PE H + E + +
Sbjct: 152 GLMGGFSPHAPFFAAAALNGLNFLTGCFL---LPESHKGERRPLRREALNPLASFR---- 204

Query: 224 EKEIPKEPWYSFFKVPTFVMVTIAYFCFQYINFLILTWTPKYLQDVFHFQLSSLWYLGMI 283
W V +M +F Q + + + +D FH+ ++ +G+
Sbjct: 205 --------WARGMTVVAALMAV--FFIMQLVGQVPAALWVIFGEDRFHWDATT---IGIS 251

Query: 284 PWLGACITLPLGAKLSDRILRKTGNLRLARTGLPIIALLLTAICFSFIPAMNNYVAVLAL 343
+ A ++ + + G R G ++ + + +
Sbjct: 252 LAAFGILHSLAQAMITGPVAARLGERRALMLG-----MIADGTGYILLAFATRGWMAFPI 306

Query: 344 MSLGNAFAFLPSSLFWAIIVDTAPAYSGTYSGIMHFIANIATILAPTLTGYL---VVSYG 400
M L + +L + G G + + ++ +I+ P L + ++
Sbjct: 307 MVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTW 366

Query: 401 YPSMFIVAAILAAIAMGAM 419
+I A L + + A+
Sbjct: 367 NGWAWIAGAALYLLCLPAL 385



Score = 29.0 bits (65), Expect = 0.037
Identities = 28/161 (17%), Positives = 45/161 (27%), Gaps = 12/161 (7%)

Query: 290 ITLPLGAKLSDRILRKTGNLRLARTGLPIIALLLTAICFSFIPAMNNYVAVLALMSLGNA 349
P+ LSDR R+ ++ L A I A ++ VL + +
Sbjct: 58 ACAPVLGALSDRFGRR----------PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAG 107

Query: 350 FAFLPSSLFWAIIVDTAPAYSGT-YSGIMHFIANIATILAPTLTGYLVVSYGYPSMFIVA 408
++ A I D + G M + P L G + + + F A
Sbjct: 108 ITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMG-GFSPHAPFFAA 166

Query: 409 AILAAIAMGAMLFVKPGQQTKTESLFNWRGKKRLEEPRANF 449
A L + F+ P L R
Sbjct: 167 AALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWAR 207


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3299  MICOLLPTASE  749  0.0  Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 749 bits (1935), Expect = 0.0
Identities = 417/885 (47%), Positives = 578/885 (65%), Gaps = 18/885 (2%)

Query: 93 YTLAELNKMPNSELIDTLSKISWNQITDLFQFNQDTKAFYQNKERMNVIINELGQRGRTF 152
YT ELN+M S+L++ + IS+ + DLF FN + F+ N++R+ II L GRT+
Sbjct: 93 YTFDELNRMNYSDLVELIKTISYENVPDLFNFNDGSYTFFSNRDRVQAIIYGLEDSGRTY 152

Query: 153 TKENSKGIETFVEVLRSAFYVGYYNNELSYLKERSFHEKCLPALKAIAKNPNFTLGTAEQ 212
T ++ KGI T VE LR+ +Y+G+YN +LSYL +CLPA+KAI N NF LGT Q
Sbjct: 153 TADDDKGIPTLVEFLRAGYYLGFYNKQLSYLNTPQLKNECLPAMKAIQYNSNFRLGTKAQ 212

Query: 213 DRVVAAYGKLIGNASSDTETVQYAVNVLKQYNDNLTTYVSDYAKGQAVYEIVKGIDYDIQ 272
D VV A G+LIGNAS+D E + + VL + DN+ Y S+Y+KG AV+ ++KGIDY
Sbjct: 213 DGVVEALGRLIGNASADPEVINNCIYVLSDFKDNIDKYGSNYSKGNAVFNLMKGIDYYTN 272

Query: 273 SYLQDT-NKQPNETMWYGKIDNFINEVNRIALVGN-ITNENSWLINNGIYYAGRLGKFHS 330
S + +T T +Y +ID ++ + + +G+ + N+N+WL+NN +YY GR+GKF
Sbjct: 273 SVIYNTKGYDAKNTEFYNRIDPYMERLESLCTIGDKLNNDNAWLVNNALYYTGRMGKFRE 332

Query: 331 NPYKGLEVITQAMSLYPRLSGPYFVAVEQIKTNYGGKDYSGKAVDLQKIREEGKRQYLPK 390
+P + +AM YP LS Y A + N+GGK+ SG +D KI+ + + +YLPK
Sbjct: 333 DPSISQRALERAMKEYPYLSYQYIEAANDLDLNFGGKNSSGNDIDFNKIKADAREKYLPK 392

Query: 391 TYTFDDGSIVFKTGDKVTEEKIKRLYWAAKEVKAQYHRVIGNDKALEPGNADDVLTIVIY 450
TYTFDDG V K GDKVTEEKIKRLYWA+KEVKAQ+ RV+ NDKALE GN DD+LT+VIY
Sbjct: 393 TYTFDDGKFVVKAGDKVTEEKIKRLYWASKEVKAQFMRVVQNDKALEEGNPDDILTVVIY 452

Query: 451 NNPDEYQLNRQLYGYETNNGGIYIEEKRTFFTYERTPKQSIYSLEELFRHEFTHYLQGRY 510
N+P+EY+LNR + G+ T+NGGIYIE TFFTYERTP++SIY+LEELFRHEFTHYLQGRY
Sbjct: 453 NSPEEYKLNRIINGFSTDNGGIYIENIGTFFTYERTPEESIYTLEELFRHEFTHYLQGRY 512

Query: 511 EVPGLFGSGEMYQNERLTWFQEGNAEFFAGSTRTNNVVPRKSMISGLSSDPASRYTAKQT 570
VPG++G GE YQ LTW++EG AEFFAGSTRT+ + PRKS+ GL+ D +R +
Sbjct: 513 VVPGMWGQGEFYQEGVLTWYEEGTAEFFAGSTRTDGIKPRKSVTQGLAYDRNNRMSLYGV 572

Query: 571 LFSKYGSWDFYKYSFALQSYLYNHQFDTFDKLQDLIRVNDVKNYDSYRESLSNNTQLNAE 630
L +KYGSWDFY Y FAL +Y+YN+ F+K+ + I+ NDV Y Y S+S++ LN +
Sbjct: 573 LHAKYGSWDFYNYGFALSNYMYNNNMGMFNKMTNYIKNNDVSGYKDYIASMSSDYGLNDK 632

Query: 631 YQAYMQQLIDNQDKYNVPQVTNDYLIQHAPKPLAEVKNEIVDVANIKDAKITKYESQFFN 690
YQ YM L++N D +VP V+++Y+ H K + E+ N+I +V+NIKD +SQFF
Sbjct: 633 YQDYMDSLLNNIDNLDVPLVSDEYVNGHEAKDINEITNDIKEVSNIKDLSSNVEKSQFFT 692

Query: 691 TFTVEGKYTGGTSKGESEDWKTMSKQVNRTLEQLSQKGWSGYKTVTAYFVNYRVNAANQF 750
T+ + G Y GG S+GE DWK M+ ++N L++LS+K W+GYKTVTAYFVN++V+ +
Sbjct: 693 TYDMRGTYVGGRSQGEENDWKDMNSKLNDILKELSKKSWNGYKTVTAYFVNHKVDGNGNY 752

Query: 751 EYDIVFHGVATE--EKEKTNTIVN--MNGPYSGIVNEEIQFHSDGTKSENGKVTSYLWNF 806
YD+VFHG+ T+ N + S IV EEI F +K E+G++ +Y W+F
Sbjct: 753 VYDVVFHGMNTDTNTDVHVNKEPKAVIKSDSSVIVEEEINFDGTESKDEDGEIKAYEWDF 812

Query: 807 GDGTTSTEANPTHVYEEKGTYTVELTVKDRRGKESKEQTKVTVKQD----------PQTG 856
GDG S EA TH Y + G Y V+LTV D G + E K+ V +D P
Sbjct: 813 GDGEKSNEAKATHKYNKTGEYEVKLTVTDNNGGINTESKKIKVVEDKPVEVINESEPNND 872

Query: 857 EFHEEEKVLLFNTLVKGNLVTPDQTDVYTFDVTDTKEVDISVVNEQNIGMTWVLYHESDM 916
F + ++ N LVKG L D +D Y FDV V I++ N ++G+TW LY E D+
Sbjct: 873 -FEKANQIAKSNMLVKGTLSEEDYSDKYYFDVAKKGNVKITLNNLNSVGITWTLYKEGDL 931

Query: 917 QNYVA-CGEDEGNVIKGKFEAKPGKYYLNVYKFDDKNGEYSLLVK 960
NYV ++G V+KG+ +PG+YYL+VY +D+++G Y++ VK
Sbjct: 932 NNYVLYATGNDGTVLKGEKTLEPGRYYLSVYTYDNQSGTYTVNVK 976



Score = 89.8 bits (222), Expect = 2e-20
Identities = 102/514 (19%), Positives = 173/514 (33%), Gaps = 90/514 (17%)

Query: 488 KQSIYSLEELF--RHEFTHYLQGRYEVPGLFGSGEMYQNERLTWFQEGNAEFFAGSTRTN 545
K I S+ + ++ Y+ + A+
Sbjct: 617 KDYIASMSSDYGLNDKYQDYMDSLLNNIDNLD----VPLVSDEYVNGHEAKDIN---EIT 669

Query: 546 NVVPRKSMISGLSSDPASRYTAKQTLFSKYGSWDFYKYSFALQSYLYNH-QFDTFDKLQD 604
N + S I LSS K F+ Y D +S + D KL D
Sbjct: 670 NDIKEVSNIKDLSS-----NVEKSQFFTTY---DMRGTYVGGRSQGEENDWKDMNSKLND 721

Query: 605 LIRVNDVKNYDSYRESL----------SNNTQLNAEYQAYMQQLIDNQDKYNVPQ--VTN 652
+++ K+++ Y+ + N + + + P+ + +
Sbjct: 722 ILKELSKKSWNGYKTVTAYFVNHKVDGNGNYVYDVVFHGMNTDTNTDVHVNKEPKAVIKS 781

Query: 653 DYLIQHAPKPLAEVKNEI-VDVANIKDA--KITKYESQFFNTFTVEGKYTGGTSKGESED 709
D + V+ EI D KD +I YE F + G E++
Sbjct: 782 DSSVI--------VEEEINFDGTESKDEDGEIKAYEWDFGD----------GEKSNEAKA 823

Query: 710 WKTMSKQVNRTLEQLSQKGWSGYKTVTAYFVNYRVNAANQFEYDIVFHGVATEEKEKTNT 769
+K ++ G T + ++ +++ + EK N
Sbjct: 824 THKYNKTGEYEVKLTVTDNNGGINTESK-----KIKVVEDKPVEVINESEPNNDFEKANQ 878

Query: 770 IVNMNGPYSGIVNEE---IQFHSDGTKSENGKVTS----------YLWNFGDGTT-STEA 815
I N G ++EE +++ D K N K+T L+ GD A
Sbjct: 879 IAKSNMLVKGTLSEEDYSDKYYFDVAKKGNVKITLNNLNSVGITWTLYKEGDLNNYVLYA 938

Query: 816 NPTHVYEEKGTYTVE-------------------LTVKDRRGKESKEQTKVTVKQDPQTG 856
KG T+E + VK E KE K +K+
Sbjct: 939 TGNDGTVLKGEKTLEPGRYYLSVYTYDNQSGTYTVNVKGNLKNEVKETAKDAIKEVENNN 998

Query: 857 EFHEEEKVLLFNTLVKGNLVTPDQTDVYTFDVTDTKEVDISVVNEQNIGMTWVLYHESDM 916
+F + KV N+ + G L D D+Y+ D+ + +++I V N NI M W+LY D+
Sbjct: 999 DFDKAMKVDS-NSKIVGTLSNDDLKDIYSIDIQNPSDLNIVVENLDNIKMNWLLYSADDL 1057

Query: 917 QNYVACGEDEGNVIKGKFEAKPGKYYLNVYKFDD 950
NYV +GN + + PGKYYL VY+F++
Sbjct: 1058 SNYVDYANADGNKLSNTCKLNPGKYYLCVYQFEN 1091


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3302  TCRTETB  34  8e-04  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 34.1 bits (78), Expect = 8e-04
Identities = 36/152 (23%), Positives = 69/152 (45%), Gaps = 3/152 (1%)

Query: 42 ISNEIGLSNSSAGLIVTLTQIGYVVGLLFLVPLGDIVENKKLILILLFLSAFA-LISMVF 100
I+N+ +S + T + + +G L D + K+L+L + ++ F +I V
Sbjct: 40 IANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVG 99

Query: 101 VKSATLLLIASFFIGLGSVAAQVLVP-LVSYLSSENARGRVVGNVMSGLLLGIMLARPIS 159
+LL++A F G G+ A LV +V+ + RG+ G + S + +G + I
Sbjct: 100 HSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIG 159

Query: 160 SLVADMWGWNAIFALSATVIIVLAFVLSKVLP 191
++A W+ + L + I+ L K+L
Sbjct: 160 GMIAHYIHWSYLL-LIPMITIITVPFLMKLLK 190


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3303  HTHTETR  84  2e-22  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 83.9 bits (207), Expect = 2e-22
Identities = 26/170 (15%), Positives = 56/170 (32%), Gaps = 13/170 (7%)

Query: 4 KRGRPRNIETQKAILSASYELLLESGFKAVTVDKIADRAKVSKATIYKWWPNKAAVVM-- 61
++ + ET++ IL + L + G + ++ +IA A V++ IY + +K+ +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 62 -----DGFLSAAARLPVPDTGS---ALNDILTHATSLANFLISREGTIINELVGEGQFDS 113
G L +IL H R +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 114 --KLAEEYRARYFQPRRLQAKQLLEKGMKRGELKENLDVELSIDLIYGPI 161
+ + R + +Q L+ ++ L +L + ++ G I
Sbjct: 123 MAVVQQAQRNLCLESYDR-IEQTLKHCIEAKMLPADLMTRRAAIIMRGYI 171


37  GBAA_3471  GBAA_3519  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
GBAA_3471  2  16  0.408918  short chain dehydrogenase/reductase
GBAA_3472  1  15  0.108691  acetyltransferase
GBAA_3473  0  16  -0.294025  AMP-binding protein
GBAA_3478  2  17  -0.701613  ankyrin repeat-containing protein
GBAA_3479  4  22  -1.255404  arsR family transcriptional regulator
GBAA_3482  4  22  -0.910240  hypothetical protein
GBAA_3483  3  22  -2.869210  RNA polymerase sigma factor SigI
GBAA_3486  4  20  -1.064819  CAAX amino terminal protease
GBAA_3487  1  19  1.585555  TetR family transcriptional regulator
GBAA_3488  -1  21  2.288985  hypothetical protein
GBAA_3489  -2  22  3.273846  hypothetical protein
GBAA_3491  -2  19  3.227213  hypothetical protein
GBAA_3492  -2  16  2.056917  ABC transporter permease
GBAA_3493  -2  12  0.794148  ABC transporter ATP-binding protein
GBAA_3494  -1  14  -0.745469  hypothetical protein
GBAA_3495  0  13  -1.237101  hypothetical protein
GBAA_3497  -1  12  -1.431139  hydroxylamine reductase
GBAA_3498  2  17  -2.683404  hypothetical protein
GBAA_3500  -2  20  -2.380966  beta-lactamase II
GBAA_3501  -2  18  -1.922487  lysozyme
GBAA_3502  -1  20  -2.386635  hypothetical protein
GBAA_3503  1  20  -1.932763  hypothetical protein
GBAA_3504  2  18  -2.392237  hypothetical protein
GBAA_3505  0  17  -2.911682  hypothetical protein
GBAA_3506  1  16  -2.903288  penicillin-binding protein
GBAA_3507  0  17  -3.825027  hypothetical protein
GBAA_3508  1  15  -4.159428  hypothetical protein
GBAA_3509  0  13  -3.497754  hypothetical protein
GBAA_3510  -1  15  -1.440312  cyclic nucleotide-binding protein
GBAA_3511  0  12  -1.053093  hypothetical protein
GBAA_3512  0  9  -1.742302  hypothetical protein
GBAA_3513  -1  9  -1.623812  hypothetical protein
GBAA_3514  0  9  -1.163352  metallo-beta-lactamase
GBAA_3516  0  10  -2.221429  amino acid permease
GBAA_3518  0  18  -4.768179  Bcr/CflA subfamily drug resistance transporter
GBAA_3519  -1  18  -3.477211  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3471  DHBDHDRGNASE  59  2e-12  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 58.9 bits (142), Expect = 2e-12
Identities = 46/197 (23%), Positives = 82/197 (41%), Gaps = 19/197 (9%)

Query: 3 VLITGGNRGLGLQLVKVFHENGHII----YPLVRTEVAVTQLK-QMFSCRCFPILADLAA 57
ITG +G+G + + G I Y + E V+ LK + FP AD+
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP--ADVRD 68

Query: 58 DESTEQIKKQLEEYTEYMDLVINNAGITGKETEVLHTNS-EELTDLFNIHCLGVIRAVKG 116
+ ++I ++E +D+++N AG+ ++H+ S EE F+++ GV A +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVL--RPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 117 TYMALAKSDHPRIINVSSRLGSLHKMANKEFPQGHFSYSYRIAKAAQNMLTLCLQQEFEN 176
+ I+ V S + + + +Y +KAA M T CL E
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMA---------AYASSKAAAVMFTKCLGLELAE 177

Query: 177 KGISVTAIHPGKLKTEI 193
I + PG +T++
Sbjct: 178 YNIRCNIVSPGSTETDM 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3482  cloacin  33  0.003  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.8 bits (74), Expect = 0.003
Identities = 23/88 (26%), Positives = 29/88 (32%), Gaps = 6/88 (6%)

Query: 326 GNNGRGSQGNNGHQQENNGRGSQGNNGNQQGNNGRGSQGNNGHQQENNGRGSQGNNGNQQ 385
G +GRG N G G ++G +G ENN G +G
Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDG------SGWSSENNPWGGGSGSGIHW 56

Query: 386 GDNGRGSQQGNNGNQQGDNGRGSQKENV 413
G G NGN G +G G V
Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAV 84



Score = 29.7 bits (66), Expect = 0.027
Identities = 21/73 (28%), Positives = 30/73 (41%), Gaps = 2/73 (2%)

Query: 295 NNGRESQQGN--NGNQQGNNGRESQQGNNGNQQGNNGRGSQGNNGHQQENNGRGSQGNNG 352
N G S GN G G + G+ + + N G G+ H +G G+ G NG
Sbjct: 10 NTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69

Query: 353 NQQGNNGRGSQGN 365
N G +G G +
Sbjct: 70 NSGGGSGTGGNLS 82


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3487  HTHTETR  71  2e-17  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 71.2 bits (174), Expect = 2e-17
Identities = 28/172 (16%), Positives = 68/172 (39%), Gaps = 3/172 (1%)

Query: 8 KEKIIETSLYLFNTNGITRTSIQDIMTATELPKGSIYRRFKNKEEIVLAAYDKSGEIMWS 67
++ I++ +L LF+ G++ TS+ +I A + +G+IY FK+K ++ ++ S +
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72

Query: 68 HFHKAMENK-KTAIDKILAIFLVYQDAANNPPI-AGGCPLLNSAIESTGVFPELQKAAAK 125
+ + + I + ++ ++ E G +Q+A
Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRN 132

Query: 126 GYDDTVMLMASLIKEGIEKQELKEEINIISLASFLASSMEGAIMASRVSNDN 177
++ + +K IE + L ++ A + + G M + +
Sbjct: 133 LCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL-MENWLFAPQ 183


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3518  TCRTETB  79  5e-18  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 78.8 bits (194), Expect = 5e-18
Identities = 44/180 (24%), Positives = 85/180 (47%), Gaps = 1/180 (0%)

Query: 9 LLLMIILVAFPQISETIYTPSLPDISKALHVSNNEVQLTLSVYFAGFALGVFFIGWLSDI 68
L+ + IL F ++E + SLPDI+ + + + F++G G LSD
Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 69 IGRRPAMLFGIVVYGIGSFLCFIANS-IEVLLVSRFIQAFGASAGSVVTQTILRESVEGH 127
+G + +LFGI++ GS + F+ +S +L+++RFIQ GA+A + ++ +
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 128 KRHVMFAQISAVIAFTPAIGPLIGGFLDQMFGFKIVFLSLVVMSVGIFLYTFVSLPETKT 187
R F I +++A +GP IGG + + + L ++ + + + E +
Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRI 195


38  GBAA_3589  GBAA_3599  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
GBAA_3589  3  15  0.578010  hypothetical protein
GBAA_3590  2  13  0.442443  hypothetical protein
GBAA_3591  1  17  -0.441450  hypothetical protein
GBAA_3592  -3  13  -0.441828  hypothetical protein
GBAA_3593  -1  13  0.258559  exonuclease
GBAA_3594  -2  12  -0.388324  cold shock protein CspB
GBAA_3595  -1  12  -0.127217  BNR repeat-containing protein
GBAA_3596  4  15  2.780280  flavodoxin
GBAA_3597  3  13  2.595662  hypothetical protein
GBAA_3598  4  14  2.370367  mutT/nudix family protein
GBAA_3599  2  13  1.898536  hypothetical protein
39  GBAA_3755  GBAA_3827  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
GBAA_3755  2  17  -3.593661  hypothetical protein
GBAA_3756  3  17  -4.278543  hypothetical protein
GBAA_3757  4  19  -4.436330  hypothetical protein
GBAA_3760  5  16  -2.971344  prophage LambdaBa01 TPR domain-containing
GBAA_3761  4  15  -0.475825  hypothetical protein
GBAA_3763  5  17  -0.021744  hypothetical protein
GBAA_3764  5  15  1.130373  hypothetical protein
GBAA_3766  3  13  1.611373  hypothetical protein
GBAA_3767  3  14  1.625264  prophage lambdaba01, n-acetylmuramoyl-l-alanine
GBAA_3768  4  14  1.151021  prophage lambdaba01, holin
GBAA_3769  3  13  1.119766  prophage lambdaba01, AbrB family transcriptional
GBAA_3774  3  12  0.923317  prophage LambdaBa01, membrane protein
GBAA_3775  2  15  0.623620  hypothetical protein
GBAA_3776  2  16  -0.691365  hypothetical protein
GBAA_3777  1  18  -0.755615  hypothetical protein
GBAA_3778  2  19  -0.307773  prophage lambdaba01, major tail protein
GBAA_3779  4  19  -1.081189  hypothetical protein
GBAA_3780  4  18  -0.419593  hypothetical protein
GBAA_3781  5  17  -0.448590  hypothetical protein
GBAA_3782  2  16  -0.626150  hypothetical protein
GBAA_3783  3  15  -0.607242  hypothetical protein
GBAA_3784  2  15  -0.572670  phage major capsid protein
GBAA_3785  3  16  -0.723578  prophage lambdaba01, prohead protease
GBAA_3786  3  18  -1.604610  hypothetical protein
GBAA_3787  3  18  -2.413642  prophage lambdaba01, terminase, large subunit
GBAA_3788  5  21  -4.062066  hypothetical protein
GBAA_3791  6  21  -4.153436  hypothetical protein
GBAA_3792  7  24  -4.620055  hypothetical protein
GBAA_3793  9  26  -5.101140  hypothetical protein
GBAA_3795  9  23  -4.429202  hypothetical protein
GBAA_3796  8  23  -4.107019  hypothetical protein
GBAA_3798  8  24  -2.997732  hypothetical protein
GBAA_3799  8  23  -4.117574  hypothetical protein
GBAA_3800  8  22  -4.992818  hypothetical protein
GBAA_3801  8  23  -4.763358  hypothetical protein
GBAA_3802  8  23  -4.061329  hypothetical protein
GBAA_3803  9  25  -4.532172  positive control sigma-like factor
GBAA_3804  9  25  -5.782241  hypothetical protein
GBAA_3805  9  26  -4.448893  prophage LambdaBa01, acyltransferase
GBAA_3806  8  27  -2.467924  hypothetical protein
GBAA_3807  6  23  -0.461864  hypothetical protein
GBAA_3809  5  24  1.531856  hypothetical protein
GBAA_3810  3  20  1.797311  hypothetical protein
GBAA_3811  3  20  2.303444  hypothetical protein
GBAA_3812  4  21  2.004085  hypothetical protein
GBAA_3813  4  20  1.768855  prophage LambdaBa01, thymidylate
GBAA_3814  5  20  0.513513  prophage LambdaBa01, C-5 cytosine-specific DNA
GBAA_3815  6  17  -1.103532  hypothetical protein
GBAA_3816  5  16  -0.883254  hypothetical protein
GBAA_3817  5  17  -0.576698  hypothetical protein
GBAA_3818  4  18  -0.913121  hypothetical protein
GBAA_3819  2  21  -1.151873  hypothetical protein
GBAA_3820  0  20  -1.320497  hypothetical protein
GBAA_3821  1  22  -1.271062  hypothetical protein
GBAA_3822  3  21  -2.013154  hypothetical protein
GBAA_3823  6  23  -2.720697  hypothetical protein
GBAA_3824  7  20  -2.676912  hypothetical protein
GBAA_3825  4  18  -2.997886  hypothetical protein
GBAA_3826  5  18  -2.999742  hypothetical protein
GBAA_3827  2  19  -0.901152  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3775  GPOSANCHOR  37  1e-04  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 37.0 bits (85), Expect = 1e-04
Identities = 32/234 (13%), Positives = 77/234 (32%), Gaps = 9/234 (3%)

Query: 14 DGETTGLQNALKDVNKRSNDLTKELKDVERLLKFDPGNIEALAQKQQLLTQQIENTTQKL 73
E + + L+ +K ++ +++++E +E + +I+ +
Sbjct: 91 TEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEK 150

Query: 74 DKLKAAEQQVQAQFQNGKISEEQYRAFRREIEFTEGSLNGLKNKLGNMKAEQDSVASSTR 133
L A + ++ + A + +E + +L + +L + +++
Sbjct: 151 AALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADS 210

Query: 134 QLETLFSATGKSVDDFAGALGNRLVNAIRSGTATSKQLDQAIGIIGREALGTEADIEKLQ 193
A + A L A+ S I + E EA +L+
Sbjct: 211 AKIKTLEAEKAA----LAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELE 266

Query: 194 RALRSV-----DAGNTIQQVQNELRDLQQEAGKTEKKFEGLKIGLENVIGGLAA 242
+AL I+ ++ E L+ E E + + L +++ L A
Sbjct: 267 KALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDA 320


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
GBAA_3783  PF07675  26  0.024  Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 26.2 bits (57), Expect = 0.024
Identities = 17/58 (29%), Positives = 23/58 (39%), Gaps = 3/58 (5%)

Query: 23 HYSVADSYESNDAERVMYLQDEGFLNKERIIEKQEGSKGPVHVGGGYYE---LPNGEK 77
HY+V S NDA E L + ++ E +G G Y + LP G K
Sbjct: 1172 HYAVYASSTGNDASNFANALLEEVLTAKTVVTAPEAIRGTRAQGTWYQKTVQLPAGTK 1229


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_3786  PF05043  30     0.020    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 29.9 bits (67), Expect = 0.020
Identities = 18/99 (18%), Positives = 34/99 (34%), Gaps = 11/99 (11%)

Query: 196 AIKNSAVVKWILKFKSVLKQEDIDS------QVKNFVNNYLNISNDGGAASSDPRYDLEQ 249
I+N + W L + L ++++ + Q N + N+ NI SD + +L
Sbjct: 308 EIENKDNLIWHLHNTAHLYRQELFTEFILFDQKGNTIRNFQNIFPK---FVSDVKKELSH 364

Query: 250 VKPEAFVPDSKQMQETVQRIYNFFNTNEKIIQSKYNEDE 288
V S M Y F + ++ +
Sbjct: 365 YLETLEVCSSSMM--VNHLSYTFITHTKHLVINLLQNQP 401


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
GBAA_3795  UREASE  29     0.003    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 28.6 bits (64), Expect = 0.003
Identities = 12/30 (40%), Positives = 18/30 (60%), Gaps = 2/30 (6%)

Query: 27 DEVLTTPEVMDVLGISKARISKMIKDGKLV 56
D V+T ++D GI KA I +KDG++
Sbjct: 69 DTVITNALILDHWGIVKADIG--LKDGRIA 96


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
GBAA_3803  HELNAPAPROT  32     5e-04    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 32.2 bits (73), Expect = 5e-04
Identities = 9/50 (18%), Positives = 18/50 (36%), Gaps = 3/50 (6%)

Query: 1 MQDLIKQYNTTLRQLREAQKDAKEEDVKVLTDMISDITYSLE---WMKKA 47
+Q L+ Y + + A+E D+ + +E WM +
Sbjct: 101 VQALVNDYKQISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWMLSS 150


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_3809  TCRTETB  28     0.003    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 27.9 bits (62), Expect = 0.003
Identities = 7/21 (33%), Positives = 12/21 (57%)

Query: 1 MITFVGVLLTIKFTREESRRE 21
MIT + V +K ++E R +
Sbjct: 176 MITIITVPFLMKLLKKEVRIK 196


40  GBAA_3872  GBAA_3879  Y  N  N  Genomic Island
LocusTag    DNBias  CDNBias  %GCBias    Product
GBAA_3872   2       13       -1.016038  peptidase T
GBAA_3873   2       11       -3.409266  hypothetical protein
GBAA_3874   1       12       -4.001388  hypothetical protein
GBAA_3876   1       13       -4.092037  phosphoglycerate mutase
GBAA_3877   1       13       -3.688290  alpha/beta hydrolase
GBAA_3878   0       13       -3.865494  glyoxylase
GBAA_3879   -2      12       -3.036878  sensory box/GGDEF family protein
41  GBAA_4018  GBAA_4039  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
GBAA_4018   -2      14       3.269524   hypothetical protein
GBAA_4019   1       27       2.890944   hypothetical protein
GBAA_4021   0       29       2.921628   orotate phosphoribosyltransferase
GBAA_4022   0       28       3.105482   orotidine 5'-phosphate decarboxylase
GBAA_4023   0       27       2.903308   dihydroorotate dehydrogenase 1B
GBAA_4024   0       28       2.925565   dihydroorotate dehydrogenase electron transfer
GBAA_4025   1       27       2.620061   carbamoyl phosphate synthase large subunit
GBAA_4026   2       22       2.662319   carbamoyl phosphate synthase small subunit
GBAA_4027   1       20       2.286622   dihydroorotase
GBAA_4028   2       19       1.409047   aspartate carbamoyltransferase
GBAA_4029   2       20       1.641896   uracil permease
GBAA_4030   2       19       0.792541   bifunctional pyrimidine regulatory protein
GBAA_4031   1       19       0.634965   ribosomal large subunit pseudouridine synthase
GBAA_4032   0       20       0.516305   lipoprotein signal peptidase
GBAA_4033   1       19       1.038903   hypothetical protein
GBAA_4034   1       19       0.915653   isoleucyl-tRNA synthetase
GBAA_4035   2       12       -0.146173  cell-division initiation protein DivIVA
GBAA_4036   2       14       -0.212329  s4 domain-containing protein
GBAA_4037   2       15       -0.343086  hypothetical protein
GBAA_4038   2       16       -1.091744  hypothetical protein
GBAA_4039   2       17       -1.559190  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
GBAA_4027  UREASE  33     0.003    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 32.8 bits (75), Expect = 0.003
Identities = 25/83 (30%), Positives = 36/83 (43%), Gaps = 20/83 (24%)

Query: 17 IVATDLLVQDGKIAKV--AEN---------ITADNAEVIDVNGKLIAPGLVDVHVHLREP 65
IV D+ ++DG+IA + A N I EVI GK++ G +D H+H P
Sbjct: 83 IVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHFICP 142

Query: 66 GGEHKETIETGTLAAAKGGFTTI 88
+ IE A G T +
Sbjct: 143 -----QQIEE----ALMSGLTCM 156


42  GBAA_4063  GBAA_4110  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
GBAA_4063   2       13       -2.034281  hypothetical protein
GBAA_4065   1       15       -2.610245  prophage LambdaBa02, lipoprotein
GBAA_4066   2       19       -2.063317  hypothetical protein
GBAA_4067   3       16       -2.468713  DNA translocase FtsK
GBAA_4068   5       16       -0.876228  hypothetical protein
GBAA_4069   4       17       -0.361104  hypothetical protein
GBAA_4070   0       13       -1.213808  prophage LambdaBa02, repressor protein
GBAA_4071   0       13       -1.152696  hypothetical protein
GBAA_4072   1       12       -0.508855  hypothetical protein
GBAA_4073   0       14       -0.272901  prophage lambdaba02, n-acetylmuramoyl-l-alanine
GBAA_4074   0       14       -0.333727  prophage lambdaba02, holin
GBAA_4075   1       14       -0.609509  prophage lambdaba02, site-specific recombinase
GBAA_4076   2       14       0.219220   prophage lambdaba02, AbrB family transcriptional
GBAA_4077   1       12       0.793881   hypothetical protein
GBAA_4078   1       12       0.871056   phage minor structural protein
GBAA_4079   2       11       1.644617   hypothetical protein
GBAA_4080   3       12       1.461061   hypothetical protein
GBAA_4081   5       13       1.895913   hypothetical protein
GBAA_4082   3       12       1.601144   prophage LambdaBa02, tape measure protein
GBAA_4084   2       17       0.318491   hypothetical protein
GBAA_4085   2       20       0.701418   prophage LambdaBa02, major tail protein
GBAA_4086   1       16       -0.466779  hypothetical protein
GBAA_4087   1       16       -0.315276  hypothetical protein
GBAA_4088   2       15       -0.821389  hypothetical protein
GBAA_4089   2       14       -1.142956  hypothetical protein
GBAA_4090   2       14       -0.729032  hypothetical protein
GBAA_4091   2       14       -0.770662  prophage LambdaBa02, major capsid protein
GBAA_4092   3       15       -0.848246  prophage LambdaBa02, Clp protease
GBAA_4093   4       17       -1.392731  hypothetical protein
GBAA_4094   3       18       -2.193999  prophage LambdaBa02, terminase, large subunit
GBAA_4095   5       21       -2.590781  hypothetical protein
GBAA_4096   4       20       -3.085101  prophage LambdaBa02, HNH endonuclease
GBAA_4097   3       21       -3.049521  hypothetical protein
GBAA_4098   3       21       -2.959643  hypothetical protein
GBAA_4099   2       21       -2.580173  prophage lambdaba02, site-specific recombinase
GBAA_4100   0       21       -2.179390  hypothetical protein
GBAA_4101   -1      24       -1.390702  hypothetical protein
GBAA_4103   2       21       -1.579416  hypothetical protein
GBAA_4104   1       20       -2.771744  hypothetical protein
GBAA_4105   3       18       -2.243406  hypothetical protein
GBAA_4106   4       19       -1.994290  hypothetical protein
GBAA_4107   3       21       0.462758   hypothetical protein
GBAA_4108   5       19       0.557091   hypothetical protein
GBAA_4109   3       20       0.133455   fosfomycin resistance protein FosB
GBAA_4110   2       20       1.415052   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4080  SHAPEPROTEIN  29     0.013    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 29.0 bits (65), Expect = 0.013
Identities = 10/41 (24%), Positives = 23/41 (56%)

Query: 29 KMQFTGVQMANGIAEGIKTQYSVVRDALQETVSGAVNSIRS 69
+++ G +A G+ G + + +ALQE ++G V+++
Sbjct: 233 EIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMV 273


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4096  TYPE3IMPPROT  29     0.004    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 29.0 bits (65), Expect = 0.004
Identities = 12/58 (20%), Positives = 21/58 (36%), Gaps = 1/58 (1%)

Query: 10 RKFYDKYNRDKEAKKFYDSTAWRRCRELALIRDNYRCQECMKHDPLIPVPADMVHHIK 67
R + K D+E +F+++ +R E K +PA + IK
Sbjct: 100 RDYLIK-YSDRELVQFFENAQLKRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIK 156


43  GBAA_4120  GBAA_4151  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
GBAA_4120   2       20       -1.288323  hypothetical protein
GBAA_4121   1       20       -2.287091  prophage LambdaBa02, DNA replication protein
GBAA_4122   -1      20       -4.693102  hypothetical protein
GBAA_4123   0       16       -3.977064  hypothetical protein
GBAA_4124   1       16       -3.265571  prophage LambdaBa02, DNA-binding protein
GBAA_4125   2       16       -2.812148  prophage LambdaBa02, DNA-binding protein
GBAA_4126   0       15       -1.735628  prophage LambdaBa02, repressor protein
GBAA_4129   -3      14       -1.244662  hypothetical protein
GBAA_4130   -1      15       0.848276   prophage lambdaba02, repressor protein
GBAA_4132   0       12       0.910045   hypothetical protein
GBAA_4133   0       13       0.458092   hypothetical protein
GBAA_4134   -1      12       0.185521   prophage lambdaba02, site-specific recombinase
GBAA_4135   0       13       0.103777   hypothetical protein
GBAA_4136   2       15       0.225282   pdz domain-containing protein
GBAA_4137   0       14       0.280085   phospholipase
GBAA_4138   0       15       0.461255   hypothetical protein
GBAA_4139   -2      17       0.530868   phosphopantetheine adenylyltransferase
GBAA_4140   -2      17       0.914531   methyltransferase
GBAA_4142   -2      18       0.574843   hypothetical protein
GBAA_4143   -3      17       0.468551   ComK regulator
GBAA_4144   1       18       0.203959   phosphoglycerate mutase
GBAA_4145   2       19       -0.390791  hypothetical protein
GBAA_4146   3       18       -0.613294  hypothetical protein
GBAA_4147   4       19       -0.871673  hypothetical protein
GBAA_4148   4       13       0.486550   hypothetical protein
GBAA_4149   3       12       0.620741   formamidase
GBAA_4150   3       13       0.242534   hypothetical protein
GBAA_4151   3       10       0.437094   cytochrome c oxidase subunit IVB
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4139  LPSBIOSNTHSS  228    5e-80    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 228 bits (583), Expect = 5e-80
Identities = 88/155 (56%), Positives = 115/155 (74%)

Query: 4 IAISSGSFDPITLGHLDIIKRGAKVFDEVYVVVLNNSSKKPFFSVEERLDLIREATKDIP 63
AI GSFDPIT GHLDII+RG ++FD+VYV VL N +K+P FSV+ERL+ I +A +P
Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLP 61

Query: 64 NVKVDSHSGLLVEYAKMRNANAILRGLRAVSDFEYEMQITSMNRKLDENIETFFIMTNNQ 123
N +VDS GL V YA+ R A AILRGLR +SDFE E+Q+ + N+ L ++ET F+ T+ +
Sbjct: 62 NAQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTSTE 121

Query: 124 YSFLSSSIVKEVARYGGSVVDLVPPVVERALKEKF 158
YSFLSSS+VKEVAR+GG+V VP V AL ++F
Sbjct: 122 YSFLSSSLVKEVARFGGNVEHFVPSHVAAALYDQF 156


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4142  56KDTSANTIGN  26     0.016    Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 26.5 bits (58), Expect = 0.016
Identities = 9/34 (26%), Positives = 19/34 (55%), Gaps = 2/34 (5%)

Query: 43 MEQIEHMMQKLNKLPFVKKIEQSYRPYLKTEFEN 76
+EQI+ +Q+L ++++ S+ Y+ F N
Sbjct: 297 IEQIQSKIQELGDT--LEELRDSFDGYINNAFVN 328


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4146  ANTHRAXTOXNA  27     0.030    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 27.0 bits (59), Expect = 0.030
Identities = 10/27 (37%), Positives = 17/27 (62%)

Query: 65 SSVKENKKEKDNRTEEEKTADVMGQML 91
S +K N K + N+TE+EK D + ++
Sbjct: 41 SDIKRNHKTEKNKTEKEKFKDSINNLV 67


44  GBAA_4161  GBAA_4182  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
GBAA_4161   2       14       0.117904   PhoH family protein
GBAA_4162   6       21       0.220097   hypothetical protein
GBAA_4163   3       20       0.877090   hypothetical protein
GBAA_4164   3       19       0.834092   hypothetical protein
GBAA_4165   2       18       0.943116   hypothetical protein
GBAA_4166   2       19       0.872903   GTP-binding protein TypA
GBAA_4167   -1      11       0.684221   hypothetical protein
GBAA_4168   -1      11       0.950518   inositol monophosphatase
GBAA_4169   0       14       0.572928   hypothetical protein
GBAA_4170   -1      14       0.592689   hypothetical protein
GBAA_4171   -1      16       0.979490   hypothetical protein
GBAA_4172   0       16       1.109535   lysine decarboxylase
GBAA_4173   1       22       -0.900221  transglutaminase
GBAA_4174   5       25       -3.426766  hypothetical protein
GBAA_4175   1       23       0.689505   hypothetical protein
GBAA_4176   3       33       2.147611   hypothetical protein
GBAA_4177   3       38       2.809346   hypothetical protein
GBAA_4178   3       43       3.278739   hypothetical protein
GBAA_4179   4       46       4.093289   hypothetical protein
GBAA_4181   3       46       4.377644   dihydrolipoamide dehydrogenase
GBAA_4182   1       35       3.315213   branched-chain alpha-keto acid dehydrogenase E2
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
GBAA_4166  TCRTETOQM  181    2e-51    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 181 bits (461), Expect = 2e-51
Identities = 101/476 (21%), Positives = 195/476 (40%), Gaps = 96/476 (20%)

Query: 8 LRNIAIIAHVDHGKTTLVDQLLRQAGTFRANEHVEE--RAMDSNDLERERGITILAKNTA 65
+ NI ++AHVD GKTTL + LL +G V++ D+ LER+RGITI T+
Sbjct: 3 IINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS 62

Query: 66 IHYEDKRINILDTPGHADFGGEVERIMKMVDGVLLVVDAYEGCMPQTRFVLKKALEQNLT 125
+E+ ++NI+DTPGH DF EV R + ++DG +L++ A +G QTR + + +
Sbjct: 63 FQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122

Query: 126 PIVVVNKIDRDFARPDEVVDEVIDLF---------IELG-------------------AN 157
I +NKID++ V ++ + +EL N
Sbjct: 123 TIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGN 182

Query: 158 EDQLE--------------------------FPVVFASAMNGTASLDSNPANQEENMKSL 191
+D LE FPV SA N + +L
Sbjct: 183 DDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNN------------IGIDNL 230

Query: 192 FDTIIEHIPAPIDNSEEPLQFQVALLDYNDYVGRIGVGRVFRGTMKVGQQVALMKVDGSV 251
+ I + + L +V ++Y++ R+ R++ G + + V + + +
Sbjct: 231 IEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKE--- 287

Query: 252 KQFRVTKLFGYMGLKRQEIEEAKAGDLVAVSGMEDINVGETVCPVEHQDALPLLRIDEPT 311
+ ++T+++ + + +I++A +G++V + E + + + + + P
Sbjct: 288 -KIKITEMYTSINGELCKIDKAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPL 345

Query: 312 LQMTFLVNNSPFAGREGKYITSRKIEER------LRSQLETDVSLRVDNTESPDAWIVSG 365
LQ T + K ++R L ++D LR + I+S
Sbjct: 346 LQTT---------------VEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEIILSF 390

Query: 366 RGELHLSILIENMRRE-GYELQVSKPEVIIKEVDGVRCEPVERVQIDVPEEYTGSI 420
G++ + + ++ + E+++ +P VI E + E +++ P + SI
Sbjct: 391 LGKVQMEVTCALLQEKYHVEIEIKEPTVIYMERPLKKAEYTIHIEVP-PNPFWASI 445



Score = 39.8 bits (93), Expect = 3e-05
Identities = 17/77 (22%), Positives = 28/77 (36%), Gaps = 1/77 (1%)

Query: 403 EPVERVQIDVPEEYTGSIMESMGARKGEMLDMVNNGNGQVRLTFMVPARGLIGYTTEFLT 462
EP +I P+EY ++D N +V L+ +PAR + Y ++
Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNN-EVILSGEIPARCIQEYRSDLTF 595

Query: 463 LTRGYGILNHTFDCYQP 479
T G + Y
Sbjct: 596 FTNGRSVCLTELKGYHV 612


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
GBAA_4182  RTXTOXIND  29     0.031    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.4 bits (66), Expect = 0.031
Identities = 14/38 (36%), Positives = 18/38 (47%)

Query: 45 VVEIPSPVKGKVLEVLVEEGTVAVVGDTLIKFDAPGYE 82
EI V E++V+EG GD L+K A G E
Sbjct: 96 SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAE 133


45  GBAA_4256  GBAA_4266  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
GBAA_4256   2       14       -2.879444  2-hydroxy-3-keto-5-methylthiopentenyl-1-
GBAA_4257   1       14       -1.627953  methylthioribulose-1-phosphate dehydratase
GBAA_4258   2       15       -2.187488  5-methylthio-3-oxo-1-penten-1,2-diol
GBAA_4259   2       15       -1.869100  hypothetical protein
GBAA_4260   3       16       -0.656740  hypothetical protein
GBAA_4263   4       18       -0.243779  sensory box/GGDEF family protein
GBAA_4264   3       31       1.559491   nitroreductase
GBAA_4265   2       26       0.609104   hypothetical protein
GBAA_4266   2       23       0.479081   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4259  CHANLCOLICIN  30     6e-04    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 30.4 bits (68), Expect = 6e-04
Identities = 14/47 (29%), Positives = 19/47 (40%), Gaps = 9/47 (19%)

Query: 21 GAIMEELEVGVLGFVASCVSALFF--------GLFG-AIPISILCAF 58
+ LE S V AL F G++G AI ILC++
Sbjct: 461 KPLFLTLEKKAADAGVSYVVALLFSLLAGTTLGIWGIAIVTGILCSY 507


46  GBAA_4277  GBAA_4294  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
GBAA_4277   2       22       -3.053038  hypothetical protein
GBAA_4278   0       20       -1.596261  segregation and condensation protein A
GBAA_4279   2       22       -0.855576  hypothetical protein
GBAA_4280   2       16       -0.643418  hypothetical protein
GBAA_4281   0       12       -0.457933  ribT protein
GBAA_4282   0       12       0.183012   hypothetical protein
GBAA_4283   1       12       0.653374   cyclophilin type peptidyl-prolyl cis-trans
GBAA_4284   1       14       1.132339   hypothetical protein
GBAA_4285   1       17       1.475777   HAD family hydrolase
GBAA_4286   1       18       1.632098   stage V sporulation protein AF
GBAA_4287   3       18       1.990161   stage V sporulation protein AE
GBAA_4287a  2       14       2.298743   stage V sporulation protein AE
GBAA_4288   -1      13       2.055556   stage V sporulation protein AD
GBAA_4289   -1      15       1.199888   stage v sporulation protein ac
GBAA_4290   -2      15       1.473768   stage V sporulation protein AB
GBAA_4291   -2      14       1.149825   stage V sporulation protein AA
GBAA_4293   -1      14       1.450905   sodium-dependent symporter family protein
GBAA_4294   2       14       -0.101121  sporulation sigma factor SigF
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4281  SACTRNSFRASE  31     5e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 31.5 bits (71), Expect = 5e-04
Identities = 11/54 (20%), Positives = 25/54 (46%)

Query: 35 DYEAKDDWQLYLWKQNEDFVGIMGIVKKENQVLEIQHLSVNPSHRHMGIGTKMV 88
Y ++ +L+ + +G + I N I+ ++V +R G+GT ++
Sbjct: 58 SYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALL 111


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
GBAA_4284  RTXTOXINA  33     0.002    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 32.6 bits (74), Expect = 0.002
Identities = 46/168 (27%), Positives = 71/168 (42%), Gaps = 25/168 (14%)

Query: 114 TNALITAGVKDAEIQITAPFKVSGTAALTGLMKAYETTSNKA------IPEEVKKVAN-- 165
T A I + ++ A+ +G + L KA E T N IP++ K +
Sbjct: 5 TTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQGSSL 64

Query: 166 EEMVQTSQLGDKIGEEKAVQLVAKIKEEIAKEQPQTTEDLRSLIKKIADQLGITLTDEQL 225
++V+T+ D++G E VQ K I K+ T E L L ++ G+T+ QL
Sbjct: 65 NDLVRTA---DELGIE--VQYDEKNGTAITKQVFGTAEKLIGLTER-----GVTIFAPQL 114

Query: 226 DNLVALFDKMKN-LNIDWNQVGSQLNKAKEHVSAFLGSEEGQSFLDKV 272
D L+ + K N L +G L KA +S F Q+FL
Sbjct: 115 DKLLQKYQKAGNILGGGAENIGDNLGKAGGILSTF------QNFLGTA 156


47  GBAA_4405  GBAA_4414  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
GBAA_4405   2       14       1.195358   bifunctional 5,10-methylene-tetrahydrofolate
GBAA_4406   2       14       0.556508   transcription antitermination protein NusB
GBAA_4407   2       12       0.197605   hypothetical protein
GBAA_4408   0       13       0.154003   acetyl-CoA carboxylase biotin carboxylase
GBAA_4409   2       16       -0.746103  acetyl-CoA carboxylase biotin carboxyl carrier
GBAA_4410   2       16       -0.998399  stage III sporulation protein AH
GBAA_4411   2       19       -0.218674  stage III sporulation protein AG
GBAA_4412   1       18       1.468256   stage III sporulation protein AF
GBAA_4413   1       21       2.494511   stage III sporulation protein AE
GBAA_4414   2       20       2.433247   stage III sporulation protein AD
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
GBAA_4409  RTXTOXIND  27     0.025    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 27.5 bits (61), Expect = 0.025
Identities = 8/25 (32%), Positives = 12/25 (48%)

Query: 140 GEIVEILVNNGQLVEYGQPLFLVKA 164
+ EI+V G+ V G L + A
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTA 129


48  GBAA_4429  GBAA_4439  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
GBAA_4429   2       14       1.271267   spore photoproduct lyase
GBAA_4430   2       18       1.019007   hypothetical protein
GBAA_4431   3       20       1.318236   lipoate-protein ligase A
GBAA_4432   2       27       1.799283   rhodanese-like domain-containing protein
GBAA_4433   2       23       1.196510   LacI family transcriptional regulator
GBAA_4434   1       19       -1.357066  TetR family transcriptional regulator
GBAA_4435   0       20       -2.165683  sugE protein
GBAA_4436   2       21       -3.269828  sugE protein
GBAA_4437   0       21       -3.718713  hypothetical protein
GBAA_4438   -2      21       -3.471896  hypothetical protein
GBAA_4439   -2      22       -3.195601  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
GBAA_4430  IGASERPTASE  35     4e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 34.7 bits (79), Expect = 4e-04
Identities = 28/114 (24%), Positives = 40/114 (35%), Gaps = 7/114 (6%)

Query: 104 KENKETAEQEETVVEATPKKEVVVEVPKAVTPAPKPVTRVETPAIASTPKPTPAPT--PK 161
E KETA E+ +A + E EVPK + + ET + P PT K
Sbjct: 1098 TETKETATVEKEE-KAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIK 1156

Query: 162 PVSVEAAVELSTPAPVKK---AVPTPVTKQETTPVAPVKPKQSALTETNSKLQE 212
+ T P K+ V PVT+ T ++ T + Q
Sbjct: 1157 EPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGN-SVVENPENTTPATTQP 1209



Score = 32.7 bits (74), Expect = 0.002
Identities = 21/96 (21%), Positives = 31/96 (32%), Gaps = 10/96 (10%)

Query: 105 ENKETAEQEETVVEATPKKEVVVEVPKAVTPAPKPVTRV----------ETPAIASTPKP 154
E ++T E + + +PK+E V PA + V T K
Sbjct: 1115 ETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKE 1174

Query: 155 TPAPTPKPVSVEAAVELSTPAPVKKAVPTPVTKQET 190
T + +PV+ V TP T Q T
Sbjct: 1175 TSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPT 1210


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4431  DHBDHDRGNASE  30     0.008    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 30.0 bits (67), Expect = 0.008
Identities = 26/98 (26%), Positives = 41/98 (41%), Gaps = 8/98 (8%)

Query: 93 VIVSEDHPNMPKTVTEAYRVISQGLLDGFKALGLE-AYYAVPKTEADRENLKNPRSG-VC 150
V V + +P+T AY + K LGLE A Y + R N+ +P S
Sbjct: 140 VTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI------RCNIVSPGSTETD 193

Query: 151 FDAPSWYEIVVEGRKIAGSAQTRQKGVILQHGSIPLEI 188
W + + I GS +T + G+ L+ + P +I
Sbjct: 194 MQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDI 231


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_4434  HTHTETR  61     6e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 61.2 bits (148), Expect = 6e-14
Identities = 39/203 (19%), Positives = 71/203 (34%), Gaps = 25/203 (12%)

Query: 2 TANRIKAVALSHFARYGYEGTSLANIAQEVGIKKPSIYAHFKGKEELYFICLESALQKDL 61
T I VAL F++ G TSL IA+ G+ + +IY HFK K +L+ E +
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71

Query: 62 QSFTDDIENFSNSSTEELLLQLLKGYAKRFGESEESMFWLRTSYFPPDAFRE-QIIEK-- 118
+ + F +L ++L + E + + + E ++++
Sbjct: 72 ELELEYQAKFP-GDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130

Query: 119 ANAHIENVGKLLFPIFKQANEKSELH-NIEVKDALEAFLCLLDGLM-------------- 163
N +E+ ++ K E L ++ + A + GLM
Sbjct: 131 RNLCLESYDRIE-QTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKK 189

Query: 164 -----VELLFAGLNRFETRLNAS 181
V +L T N +
Sbjct: 190 EARDYVAILLEMYLLCPTLRNPA 212


49  GBAA_4624  GBAA_4651  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
GBAA_4624   2       15       3.074877   hypothetical protein
GBAA_4625   3       16       3.106233   tRNA-specific 2-thiouridylase MnmA
GBAA_4626   3       20       3.838594   class V aminotransferase
GBAA_4627   4       25       3.898169   rrf2 family protein
GBAA_4628   4       25       3.611235   recombination factor protein RarA
GBAA_4629   5       26       3.552133   prespore-specific transcriptional regulator
GBAA_4630   3       25       2.834180   hesA/moeB/thiF family protein
GBAA_4632   3       26       2.948828   aspartyl-tRNA synthetase
GBAA_4633   0       16       1.948915   histidyl-tRNA synthetase
GBAA_4634   -1      13       1.608598   hypothetical protein
GBAA_4636   0       16       1.545386   D-tyrosyl-tRNA(Tyr) deacylase
GBAA_4637   0       16       1.377243   GTP pyrophosphokinase
GBAA_4638   1       13       1.426730   adenine phosphoribosyltransferase
GBAA_4639   0       11       1.143203   single-stranded-DNA-specific exonuclease RecJ
GBAA_4640   1       16       0.755330   cation efflux family protein
GBAA_4641   2       17       0.504214   preprotein translocase subunit SecD/SecF
GBAA_4642   -1      19       1.108574   hypothetical protein
GBAA_4643   -1      19       1.588145   stage V sporulation protein B
GBAA_4644   -2      19       1.377972   hypothetical protein
GBAA_4645   0       23       2.919917   hypothetical protein
GBAA_4646   0       21       3.216560   preprotein translocase subunit YajC
GBAA_4647   0       20       2.451084   queuine tRNA-ribosyltransferase
GBAA_4648   0       15       1.734003   S-adenosylmethionine--tRNA
GBAA_4649   0       13       0.987239   hypothetical protein
GBAA_4650   1       14       0.577166   Holliday junction DNA helicase RuvB
GBAA_4651   2       15       -0.672233  holliday junction DNA helicase RuvA
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4624  SYCDCHAPRONE  33     4e-04    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 33.0 bits (75), Expect = 4e-04
Identities = 17/90 (18%), Positives = 32/90 (35%)

Query: 8 GIQYMQEGNWEEAAKNFTEAIEENPKDALGYINFANLLDVLGDSERAILFYKRALELDDK 67
Q G +E+A K F + D+ ++ +G + AI Y +D K
Sbjct: 43 AFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIK 102

Query: 68 SAAAYYGLGNVYYGQEQFAEAKAVFEQAMQ 97
+ + + AEA++ A +
Sbjct: 103 EPRFPFHAAECLLQKGELAEAESGLFLAQE 132



Score = 31.1 bits (70), Expect = 0.002
Identities = 17/96 (17%), Positives = 27/96 (28%)

Query: 109 LGITHVQLGNDRLALPFLQRATELDENDVEAVFQCGLCFARLEHIQEAKPYFEKVLEMDE 168
L Q G A Q LD D G C + A + MD
Sbjct: 42 LAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDI 101

Query: 169 EHADAYYNLGVAYVFEENNEKALALFKKATEIQPDH 204
+ ++ + + +A + A E+ D
Sbjct: 102 KEPRFPFHAAECLLQKGELAEAESGLFLAQELIADK 137


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
GBAA_4626  RTXTOXINA  30     0.028    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 29.5 bits (66), Expect = 0.028
Identities = 25/123 (20%), Positives = 46/123 (37%), Gaps = 8/123 (6%)

Query: 114 GFEVTYLPVDETGRVQVSDIQKAL-TEETILVSVMFGNNEVGTMQPIAEIGKLLKEHQAY 172
G++ + E +S K E ++L++ + +G + + G ++Y
Sbjct: 444 GYDARHAAFLEDNFKILSQYNKEYSVERSVLITQQHWDTLIGELAGVTRNGDKTLSGKSY 503

Query: 173 FHTDAVQAYGLVEINVKEFGIDLLSISAHKINGPKGVGFLYAGTNVKF-EPLLIGGEQER 231
D + +E EF + I+ T +KF PLL GE+ R
Sbjct: 504 --IDYYEEGKRLEKKXDEFQKQVFDPLKGNIDLSDSKS----STLLKFVTPLLTPGEEIR 557

Query: 232 KRR 234
+RR
Sbjct: 558 ERR 560


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_4634  PF05043  25     0.021    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 24.9 bits (54), Expect = 0.021
Identities = 9/32 (28%), Positives = 14/32 (43%)

Query: 25 FISKEQNNTSMELASEFGISLQDVKRLKKQIE 56
FI + + + EF IS + R+ QI
Sbjct: 94 FIFFNEGCQAESICKEFYISSSSLYRIISQIN 125


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
GBAA_4636  THERMOLYSIN  28     0.010    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 28.4 bits (63), Expect = 0.010
Identities = 24/118 (20%), Positives = 46/118 (38%), Gaps = 16/118 (13%)

Query: 16 DGEIVGQIPFGLTLLVGITHEDTEKDATYIAEKIANLRIFEDESGKMNHSVLDVEGQVLS 75
DG+ +PF + V + HE T + + A L ++++ESG +N ++ D+ G ++
Sbjct: 352 DGDGQTFLPFSGGIDV-VGHELTHA----VTDYTAGL-VYQNESGAINEAMSDIFGTLVE 405

Query: 76 ----------ISQFTLYGDCRKGRRPNFMDAAKPDYAEHLYDFFNEEVRKQGLHVETG 123
I + + D AK +H + G+H +G
Sbjct: 406 FYANRNPDWEIGEDIYTPGVAGDALRSMSDPAKYGDPDHYSKRYTGTQDNGGVHTNSG 463


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4641  SECFTRNLCASE  270    2e-86    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 270 bits (691), Expect = 2e-86
Identities = 100/318 (31%), Positives = 165/318 (51%), Gaps = 21/318 (6%)

Query: 443 PTKFDRINFVNVGHKFLIFSIVVVIAGAIILPIFKLNLGIDFASGTRIDLQSKQSVTVSD 502
P K + +F +IV++IA I+ + LN GIDF GT I +S ++ V
Sbjct: 9 PEKTN-FDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGV 67

Query: 503 VHKDFKELNID---VKEENIVPTGDDNKGFAVR-----------TLGVLSKDEIAKTKTF 548
+ L + + E +D +R G ++ + K +T
Sbjct: 68 YRAALEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETA 127

Query: 549 FH--DKYGTDPNVSTVSPTIGKEIARNAFIAVLIASAVIILYVSIRFRFTYALSAVLALL 606
D + +V P + E+ A ++L A+ VI+ Y+ +RF + +AL AV+AL+
Sbjct: 128 LTAVDPALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALV 187

Query: 607 HDAFVMIVIFSIFQLEVDLTFIAAVLTIIGYSINDSIVTFDRNRELYKQKKRVRDIKDLE 666
HD + + +F++ QL+ DLT +AA+LTI GYSIND++V FDR RE + K L
Sbjct: 188 HDVLLTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKT----MPLR 243

Query: 667 EIVNASIRQTLGRSINTVLTVLFPVIALLIFGSESLRNFSFALLVGLVVGTYSSVFVASQ 726
+++N S+ +TL R++ T +T L ++ +LI+G + +R F FA++ G+ GTYSSV+VA
Sbjct: 244 DVMNLSVNETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKN 303

Query: 727 IWLMLENRRLKKGKNKKK 744
I L + R K+ K+
Sbjct: 304 IVLFIGLDRNKEKKDPSD 321



Score = 66.0 bits (161), Expect = 1e-13
Identities = 38/180 (21%), Positives = 84/180 (46%), Gaps = 11/180 (6%)

Query: 249 SVGAKFGQQALEQTIFASAIGIALIFLFMLV-FYRLPGLVAVIMLGLYIFVTLLVFNWMH 307
SVG K + + +++ +I ++ V F L AV+ L + +T+ +F +
Sbjct: 142 SVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTVGLFAVLQ 201

Query: 308 AVLTLPGIAALVLGVGIAVDANIITYERLKEELKIGKSMM------SAFRAGNHRSLATI 361
L +AAL+ G +++ ++ ++RL+E L K+M + R++ T
Sbjct: 202 LKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNETLSRTVMTG 261

Query: 362 LDANITTLAAAGVLFVYGNSSVKGFATSLIVSILVGFITNVFGTRFLLSLLVKSRYFDKK 421
+ TTL A + ++G ++GF +++ + G ++V+ + ++ + R +KK
Sbjct: 262 M----TTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLDRNKEKK 317


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_4646  PF06580  28     0.006    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 27.5 bits (61), Expect = 0.006
Identities = 8/39 (20%), Positives = 19/39 (48%)

Query: 7 NIVMIVAMFAIFYFLLIRPQQKRQKAVAQMQSELKKGDA 45
N+V++ M+++ YF + +Q + Q + +A
Sbjct: 123 NVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEA 161


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4649  ACRIFLAVINRP  26     0.011    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 26.0 bits (57), Expect = 0.011
Identities = 13/59 (22%), Positives = 30/59 (50%), Gaps = 5/59 (8%)

Query: 1 MTEMPKLLITAGILLIVVGLAWKFIGRLPGDIFVKKGNVTFYFPIITCIVLSIVLSFIM 59
M+++ L+ ++L V + F G G I+ + F I++ + LS++++ I+
Sbjct: 435 MSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQ-----FSITIVSAMALSVLVALIL 488


50  GBAA_4684  GBAA_4706  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
GBAA_4684   0       17       3.101604   rod shape-determining protein MreB
GBAA_4685   -1      14       2.735140   DNA repair protein RadC
GBAA_4686   0       13       2.376982   Maf-like protein
GBAA_4688   0       15       2.830497   stage II sporulation protein B
GBAA_4689   -1      17       2.884969   folylpolyglutamate synthase
GBAA_4690   0       17       3.119790   valyl-tRNA synthetase
GBAA_4691   -1      13       2.686150   hypothetical protein
GBAA_4692   -1      12       1.967921   stage VI sporulation protein D
GBAA_4693   1       13       1.957545   glutamate-1-semialdehyde aminotransferase
GBAA_4694   1       12       0.854319   delta-aminolevulinic acid dehydratase
GBAA_4695   2       14       0.849603   uroporphyrinogen-III synthase
GBAA_4696   1       14       0.360100   porphobilinogen deaminase
GBAA_4697   0       14       0.903986   hemX protein
GBAA_4698   -1      15       2.082625   glutamyl-tRNA reductase
GBAA_4699   -1      17       2.226025   marR family transcriptional regulator
GBAA_4700   1       19       2.779396   organic hydroperoxide resistance protein
GBAA_4701   2       16       2.086805   ribosome biogenesis GTP-binding protein YsxC
GBAA_4702   1       15       2.188421   ATP-dependent protease La 1
GBAA_4703   2       16       1.539676   ATP-dependent protease LA
GBAA_4704   4       20       -0.026904  ATP-dependent protease ATP-binding subunit ClpX
GBAA_4705   5       20       -0.482759  trigger factor
GBAA_4706   0       17       -3.109296  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4684  SHAPEPROTEIN  497    e-180    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 497 bits (1281), Expect = e-180
Identities = 194/336 (57%), Positives = 252/336 (75%), Gaps = 5/336 (1%)

Query: 4 FGGFTRDLGIDLGTANTLVYVKGKGVVLREPSVVALQTD----TKQIVAVGSDAKQMIGR 59
G F+ DL IDLGTANTL+YVKG+G+VL EPSVVA++ D K + AVG DAKQM+GR
Sbjct: 6 RGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAKQMLGR 65

Query: 60 TPGNVVALRPMKDGVIADYETTATMMKYYIQQAQKSNGFFSRKPYVMVCVPSGITAVERR 119
TPGN+ A+RPMKDGVIAD+ T M++++I+Q SN F P V+VCVP G T VERR
Sbjct: 66 TPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQV-HSNSFMRPSPRVLVCVPVGATQVERR 124

Query: 120 AVIDATRQAGARDAYPIEEPFAAAIGANLPVWEPTGSMVVDIGGGTTEVAIISLGGIVTS 179
A+ ++ + AGAR+ + IEEP AAAIGA LPV E TGSMVVDIGGGTTEVA+ISL G+V S
Sbjct: 125 AIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYS 184

Query: 180 QSVRVAGDDMDDSIIQYIKKSYNLMIGERTAEALKLEIGSAGEPEGIEPMEIRGRDLVSG 239
SVR+ GD D++II Y++++Y +IGE TAE +K EIGSA + + +E+RGR+L G
Sbjct: 185 SSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEG 244

Query: 240 LPKTVLIQPEEIADALKDTVDAIVESVKNTLEKTPPELAADIMDRGIVLTGGGALLRNLD 299
+P+ + EI +AL++ + IV +V LE+ PPELA+DI +RG+VLTGGGALLRNLD
Sbjct: 245 VPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLD 304

Query: 300 KVISEETNMPVLVAEDPLDCVAIGTGKALDNIDLFK 335
+++ EET +PV+VAEDPL CVA G GKAL+ ID+
Sbjct: 305 RLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHG 340


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4694  ENTEROVIROMP  31     0.004    Enterobacterial virulence outer membrane protein si...
		>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature.

Length = 171

Score = 31.0 bits (70), Expect = 0.004
Identities = 32/157 (20%), Positives = 54/157 (34%), Gaps = 25/157 (15%)

Query: 146 AVLAKTAVSQAKAGADIIAPSNMMDGFVTAIRHALDENGFGHVPVMSYAVKYSSAFYGPF 205
+V A + V+ A +D N M GF R+ D + G + +Y K +A G +
Sbjct: 21 SVAATSTVTGGYAQSDAQGQMNKMGGFNLKYRYEEDNSPLGVIGSFTYTEKSRTASSGDY 80

Query: 206 RDAAHGAPQFGDRKTYQMDPANRME-----------AFREAESDVMEGADFLIVKPALSY 254
+ G PA R+ + + ++ SY
Sbjct: 81 NKNQYYGITAG--------PAYRINDWASIYGVVGVGYGKFQTTEYPTYKHDTSDYGFSY 132

Query: 255 LDIVRDVKNNFN-LPVVAYNVSGEYSMIKAAAQNGWI 290
++ FN + VA + S E S I++ WI
Sbjct: 133 GAGLQ-----FNPMENVALDFSYEQSRIRSVDVGTWI 164


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4701  TCRTETOQM     28     0.027    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 27.9 bits (62), Expect = 0.027
Identities = 18/90 (20%), Positives = 37/90 (41%), Gaps = 13/90 (14%)

Query: 58 KTQTLNFFLINEMMHFVDVPGYGYAKVSKTERAAWGKMIETYFTTREQLDAAVLVVDLRH 117
+T +F N ++ +D PG+ +++ R+ LD A+L++ +
Sbjct: 57 QTGITSFQWENTKVNIIDTPGH-MDFLAEVYRSL------------SVLDGAILLISAKD 103

Query: 118 KPTNDDVMMYDFLKHYDIPTIIIATKADKI 147
+++ L+ IPTI K D+
Sbjct: 104 GVQAQTRILFHALRKMGIPTIFFINKIDQN 133


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4702  HTHFIS        37     2e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 37.1 bits (86), Expect = 2e-04
Identities = 29/101 (28%), Positives = 43/101 (42%), Gaps = 14/101 (13%)

Query: 349 LCLVGPPGVGKTSLARSI-ATSLNRN--FVRVSLGGVRD---ESEIRGHRRTYVGAMPGR 402
L + G G GK +AR++ RN FV +++ + ESE+ GH + GA G
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEK---GAFTGA 219

Query: 403 IIQGMKKAKSVNP-VFLLDEIDKMSNDFRGDPSAALLEVLD 442
+ + + LDEI M D + LL VL
Sbjct: 220 QTRSTGRFEQAEGGTLFLDEIGDMPMDAQ----TRLLRVLQ 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4703  HTHFIS        58     4e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 58.3 bits (141), Expect = 4e-11
Identities = 43/214 (20%), Positives = 76/214 (35%), Gaps = 41/214 (19%)

Query: 44 ELEQLRKMREISLTEPLAEKVR----PTSFLDIVGQEDGIKSLK--AALCGPNPQHVIIY 97
+L +L + +L EP + + +VG+ ++ + A ++I
Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMIT 166

Query: 98 GPPGVGKTAAARLVLEEAKRNPKSPFRTNATFIELDATTARFDERGIADPLIGSVHDPIY 157
G G GK AR + + KR F+ ++ A I L G
Sbjct: 167 GESGTGKELVARALHDYGKRRNGP-------FVAINM--AAIPRDLIESELFGHE----- 212

Query: 158 QGAGAMGQAGIPQPKKGAVTDAHGGILFIDEIGELHPIQMNKMLKVLEDRKVFLESAYYS 217
GA G G A GG LF+DEIG++ ++L+VL+ +
Sbjct: 213 --KGAF--TGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGG--- 265

Query: 218 EENTMIPTYIHDIFQKGLPADFRLVGATTRSPEE 251
+ + +D R+V AT + ++
Sbjct: 266 --------------RTPIRSDVRIVAATNKDLKQ 285


51  GBAA_4756  GBAA_4774  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_4756  2       13       2.307996   hypothetical protein
GBAA_4757  1       11       2.441981   excinuclease ABC subunit C
GBAA_4758  0       13       3.395313   thioredoxin
GBAA_4759  -2      13       4.064018   electron transfer flavoprotein subunit alpha
GBAA_4760  -1      10       2.825809   electron transfer flavoprotein subunit beta
GBAA_4761  -1      12       3.299528   enoyl-CoA hydratase
GBAA_4762  -1      13       3.109711   TetR family transcriptional regulator
GBAA_4763  -1      13       2.759968   long-chain-fatty-acid--CoA ligase
GBAA_4764  0       17       2.176321   hypothetical protein
GBAA_4766  1       16       -0.167866  iron ABC transporter substrate-binding protein
GBAA_4767  1       17       0.007654   iron-hydroxamate transporter permease subunit
GBAA_4768  1       19       -2.497771  hypothetical protein
GBAA_4769  1       16       -4.148262  spore coat protein C
GBAA_4770  -1      17       -4.350202  hypothetical protein
GBAA_4771  -1      14       -4.281616  hypothetical protein
GBAA_4772  -2      15       -3.409039  hypothetical protein
GBAA_4773  -1      14       -3.417113  hypothetical protein
GBAA_4774  -1      13       -3.256945  bacitracin ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4762  HTHTETR       113    2e-33    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 113 bits (283), Expect = 2e-33
Identities = 36/192 (18%), Positives = 75/192 (39%), Gaps = 10/192 (5%)

Query: 5 RPKYNQIIDAAVIVIAENGYHQAQVSKIAKQAGVADGTIYLYFKNKEDILISLFQEKMGE 64
+ I+D A+ + ++ G + +IAK AGV G IY +FK+K D+ +++
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 65 FVETIRQKTAGIESAVSKLFMLVETHFLLLSQNDPL--AIVTQLELRQSNQDLRLKINEV 122
E + A + + H L + + ++ + + + +
Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129

Query: 123 LKGY----LQVIDEILETGIKQGEFQADLNVRVARQMIFGTVDEVVTNWVMSDHKYDLVA 178
+ I++ L+ I+ ADL R A ++ G + ++ NW+ + +DL
Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDL-- 187

Query: 179 LSKTVHGLLIAA 190
K +A
Sbjct: 188 --KKEARDYVAI 197


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4766  FERRIBNDNGPP  183    5e-58    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 183 bits (465), Expect = 5e-58
Identities = 62/258 (24%), Positives = 115/258 (44%), Gaps = 11/258 (4%)

Query: 52 AKKVVVLEWVYSEDLLALGVQPVGMADIKNYNKWVNTKTKPSKDVVDVGTRQQPNLEEIS 111
++V LEW+ E LLALG+ P G+AD NY WV+ P V+DVG R +PNLE ++
Sbjct: 35 PNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLP-DSVIDVGLRTEPNLELLT 93

Query: 112 RLKPDLIITASFRGKAIKNELEQIAPTVMFDPSTSNNDHFAEMTETFKQIAKAVGKEEEG 171
+KP ++ S L +IAP F+ S A ++ ++A + +
Sbjct: 94 EMKPSFMVW-SAGYGPSPEMLARIAPGRGFNFSDGKQP-LAMARKSLTEMADLLNLQSAA 151

Query: 172 KKVLADMDKAFADAKAKIEKADLKDKNIAMAQAFTAKNVPTFRILTDNSLALQVTKKLGL 231
+ LA + K + K + + + +++ + NSL ++ + G+
Sbjct: 152 ETHLAQYEDFIRSMKPRFVKRGARP--LLLTTLIDPRHM---LVFGPNSLFQEILDEYGI 206

Query: 232 TNTFEAGKSEPDGFKQTTVESLQSVQDSNFIYIVADEDNIFDTQLKGNPAWEELKFKKEN 291
N ++ G++ G +++ L + +D + + D D L P W+ + F +
Sbjct: 207 PNAWQ-GETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMD-ALMATPLWQAMPFVRAG 264

Query: 292 KMYKLKGDTWIFGGPESA 309
+ ++ W +G SA
Sbjct: 265 RFQRVP-AVWFYGATLSA 281


52  GBAA_4788  GBAA_4819  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_4788  3       13       0.364886   hypothetical protein
GBAA_4789  2       14       1.342410   cell wall anchor domain-containing protein
GBAA_4790  1       15       1.270462   branched chain amino acid ABC transporter
GBAA_4792  2       16       1.617468   RNA pseudouridine synthase
GBAA_4793  3       18       1.857417   hypothetical protein
GBAA_4794  2       18       1.849207   recombination and DNA strand exchange inhibitor
GBAA_4795  0       19       1.004691   hypothetical protein
GBAA_4796  0       19       0.075828   colicin V production protein CvpA
GBAA_4797  2       31       2.691219   cell division protein ZapA
GBAA_4798  4       35       3.442317   ribonuclease HIII
GBAA_4799  4       37       3.567054   hypothetical protein
GBAA_4800  4       36       3.611590   hypothetical protein
GBAA_4801  4       37       3.623020   hypothetical protein
GBAA_4802  4       35       3.590895   asparaginyl-tRNA synthetase
GBAA_4803  3       29       2.579134   phenylalanyl-tRNA synthetase subunit beta
GBAA_4804  0       20       0.197318   phenylalanyl-tRNA synthetase subunit alpha
GBAA_4805  0       17       -1.217185  RNA methyltransferase
GBAA_4806  1       21       -2.085873  small acid-soluble spore protein SspI
GBAA_4807  1       14       -0.397132  HD domain-containing protein
GBAA_4808  1       13       -0.577046  caax amino terminal protease
GBAA_4809  1       13       -0.176933  caax amino terminal protease
GBAA_4810  2       15       1.523365   hypothetical protein
GBAA_4811  2       17       1.159263   hypothetical protein
GBAA_4812  3       20       1.382440   EmrB/QacA family drug resistance transporter
GBAA_4813  5       23       1.277552   hypothetical protein
GBAA_4814  5       23       1.602314   TetR family transcriptional regulator
GBAA_4815  3       32       2.125775   M42 family peptidase
GBAA_4816  3       27       0.387814   hypothetical protein
GBAA_4817  2       23       1.041294   50S ribosomal protein L20
GBAA_4818  2       17       1.361185   50S ribosomal protein L35
GBAA_4819  2       15       1.235366   translation initiation factor IF-3
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4794  GPOSANCHOR    37     2e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 37.4 bits (86), Expect = 2e-04
Identities = 35/118 (29%), Positives = 60/118 (50%), Gaps = 11/118 (9%)

Query: 518 KIENMIAKLEE-------SQKNAERDWNEAEALRKQSEKLHREL--QRQIIEFNEERDER 568
++E KLEE S+++ RD + + +KQ E H++L Q +I E + + R
Sbjct: 327 QLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRR 386

Query: 569 LLKAQKEGEEKVEAAKKEAEGIIQELRQLRKAQLANVK--DHELIEAKSRLEGAAPEL 624
L A +E +++VE A +EA + L +L K + K + E E +++LE A L
Sbjct: 387 DLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKAL 444


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4806  DNABINDINGHU  24     0.031    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 24.3 bits (53), Expect = 0.031
Identities = 10/33 (30%), Positives = 15/33 (45%), Gaps = 1/33 (3%)

Query: 19 DQLQETIVDAIQSGEEKMLPGLGVLFEVIWKNA 51
D + + + GE+ L G G FEV + A
Sbjct: 27 DAVFSAVSSYLAKGEKVQLIGFGN-FEVRERAA 58


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4812  TCRTETB       146    4e-40    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 146 bits (369), Expect = 4e-40
Identities = 86/400 (21%), Positives = 174/400 (43%), Gaps = 14/400 (3%)

Query: 108 FVSILNQTIINVALPPLMNEFNVSTSTAQWLITGFMLVNGILVPISAFLVSRFTYRKLFV 167
F S+LN+ ++NV+LP + N+FN ++ W+ T FML I + L + ++L +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 168 AAMLFFTVGSIICATSGN-FTMMMTGRVIQAVGAGILMPVGMNIFMTLFPPHKRGAAMGL 226
++ GS+I + F++++ R IQ GA + M + P RG A GL
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 227 LGVAMILAPAIGPTVTGWVIENYSWNLMFYAMFIIGLIITFLSLKFFTLAQPVSNTKLDI 286
+G + + +GP + G + W+ + I IIT L + DI
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMI--TIITVPFLMKLLKKEVRIKGHFDI 201

Query: 287 FGVVSSSIGLGSLLYGFSEAGNNSWTSAEVIISLVIGVIGLALFIWRELTTDNKMLDLQV 346
G++ S+G+ + + S + L++ V+ +F+ + +D +
Sbjct: 202 KGIILMSVGIVFFMLF---TTSYSIS------FLIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 347 FKYPVFTFTLVINAIVTMALFGGMLLLPVYLQNIRGFTPIESG-LLLLPGSLIMGIMGPV 405
K F ++ I+ + G + ++P ++++ + E G +++ PG++ + I G +
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 406 AGKLFDKYGIRPLAIIGLAITTYATYEFTKLSMDTPYSVIMTDYIIRSIGMSFIMMPIMT 465
G L D+ G + IG+ + + + L T + + + G+SF I T
Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSW-FMTIIIVFVLGGLSFTKTVIST 371

Query: 466 AGMNALPMKLISHGTATQNTSRQVAGSIGTAILITLMTQQ 505
++L + G + N + ++ G AI+ L++
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIP 411


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4813  RTXTOXIND     79     3e-19    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 79.1 bits (195), Expect = 3e-19
Identities = 29/135 (21%), Positives = 49/135 (36%), Gaps = 12/135 (8%)

Query: 87 QTVDVTIPQNATVVQSNATT-NAFVGAGSPI-AYAFDMNNLWVTANIEETDVDDVQKGQD 144
Q + P + V Q T V + + + L VTA ++ D+ + GQ+
Sbjct: 326 QASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQN 385

Query: 145 VDVYVDAYPDTT---LTGKVEQVGLTTANTFSMLPSSNATANYTKVTQVVPVKISLDHSK 201
+ V+A+P T L GKV+ + L V + +K
Sbjct: 386 AIIKVEAFPYTRYGYLVGKVKNINLDAI-------EDQRLGLVFNVIISIEENCLSTGNK 438

Query: 202 SVNIVPGMNVTVRIH 216
++ + GM VT I
Sbjct: 439 NIPLSSGMAVTAEIK 453


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4814  HTHTETR       60     2e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 60.0 bits (145), Expect = 2e-13
Identities = 24/100 (24%), Positives = 39/100 (39%), Gaps = 6/100 (6%)

Query: 9 PRVKRTRQLIQDAFVALVGEKGFENVTVQHIAERAPVNRATFYSHYHDKYDLLDKSIEEM 68
+ TRQ I D + L ++G + ++ IA+ A V R Y H+ DK DL + E
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 69 LEKLTEVIKPKNRNKEDFQLAFDSPHPNFLALFEHIAENA 108
+ E+ E P + H+ E+
Sbjct: 67 ESNIGELE------LEYQAKFPGDPLSVLREILIHVLEST 100


53  GBAA_4855  GBAA_4875  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_4855  2       15       0.051359   hypothetical protein
GBAA_4856  1       13       -0.254581  hypothetical protein
GBAA_4858  0       13       0.224511   thioesterase
GBAA_4859  -1      14       -0.896211  hypothetical protein
GBAA_4860  -1      13       -0.972187  metal-dependent hydrolase
GBAA_4861  -1      14       -1.678363  proline dipeptidase
GBAA_4862  2       17       -3.379877  lipoprotein
GBAA_4863  0       16       -2.607702  lipoprotein
GBAA_4864  0       16       -2.938078  hypothetical protein
GBAA_4865  0       22       -2.963535  hypothetical protein
GBAA_4866  2       22       -2.966448  acetyltransferase
GBAA_4867  3       21       -2.692941  hypothetical protein
GBAA_4868  -2      16       -0.818032  acetyltransferase
GBAA_4869  -3      14       -0.069041  hypothetical protein
GBAA_4870  -1      16       0.266451   hypothetical protein
GBAA_4871  -1      16       -0.058878  DNA-binding protein
GBAA_4872  -1      15       -0.032389  hypothetical protein
GBAA_4873  1       16       0.281268   alanine dehydrogenase
GBAA_4874  2       13       -0.024206  3-ketoacyl-ACP reductase
GBAA_4875  2       15       0.224511   universal stress protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4866  SACTRNSFRASE  40     1e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 39.5 bits (92), Expect = 1e-06
Identities = 21/74 (28%), Positives = 31/74 (41%), Gaps = 3/74 (4%)

Query: 70 DNILGCYIAYSKSISGK--IEVLFVDEKHRGNGFGLKLMNSAVEWFKAKKIDEIELTVVY 127
+N I + +G IE + V + +R G G L++ A+EW K + L
Sbjct: 73 ENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQD 132

Query: 128 GN-EAISFYEKLGF 140
N A FY K F
Sbjct: 133 INISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4870  ADHESNFAMILY  28     0.020    Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 27.5 bits (61), Expect = 0.020
Identities = 21/69 (30%), Positives = 32/69 (46%), Gaps = 2/69 (2%)

Query: 7 KQKQAKRQKYIKKKQEQNIPLSKKVVLMIEKTFRYICMALYVILCMYSFGVFYSLEITPN 66
K K K K K IP KK+++ E F+Y A Y + Y + + E TP
Sbjct: 177 TDKLDKLDKESKDKF-NKIPAEKKLIVTSEGAFKYFSKA-YGVPSAYIWEINTEEEGTPE 234

Query: 67 IIESILEFL 75
I++++E L
Sbjct: 235 QIKTLVEKL 243


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4874  DHBDHDRGNASE  82     1e-20    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 82.0 bits (202), Expect = 1e-20
Identities = 65/264 (24%), Positives = 107/264 (40%), Gaps = 19/264 (7%)

Query: 1 MNYSNLKGGRFVRHALITAGTKGLGKQVTEKLLAKGYSVTVTYHSDITAMKKMKETYKNM 60
MN ++G + A IT +G+G+ V L ++G + + ++K+ + K
Sbjct: 1 MNAKGIEG----KIAFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAE 55

Query: 61 EERLQFVQADVTKKEDLHKIVEEAISRFGKIDFLINNAGPYVFERKKLVDYEEDEWNEMI 120
+ ADV + +I G ID L+N AG V + ++EW
Sbjct: 56 ARHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAG--VLRPGLIHSLSDEEWEATF 113

Query: 121 QGNLTAVFHLLKLVVPIMRKQNFGRIINYGFQGADSAPGWIYRSAFAAAKVGLVSLTKTV 180
N T VF+ + V M + G I+ G A +A+A++K V TK +
Sbjct: 114 SVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPR--TSMAAYASSKAAAVMFTKCL 171

Query: 181 AYEEAEYGITANMVCPGDIIGEMK----------EATIQEARQLKERNTPIGRSGTGEDI 230
E AEY I N+V PG +M+ E I+ + + + P+ + DI
Sbjct: 172 GLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDI 231

Query: 231 ARTISFLCEEDSDMITGTIIEVTG 254
A + FL + IT + V G
Sbjct: 232 ADAVLFLVSGQAGHITMHNLCVDG 255


54  GBAA_4894  GBAA_4915  Y  N  N  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_4894  -1      13       3.516024   hypothetical protein
GBAA_4895  0       15       2.701369   hypothetical protein
GBAA_4896  0       14       2.574168   acetyl-CoA synthetase
GBAA_4898  0       13       2.046935   small, acid-soluble spore protein B
GBAA_4899  0       13       2.100903   thiamine biosynthesis protein ThiI
GBAA_4900  1       13       2.447031   class V aminotransferase
GBAA_4901  1       14       2.711285   septation ring formation regulator EzrA
GBAA_4902  0       18       3.620950   LysR family transcriptional regulator
GBAA_4903  1       22       3.248693   cysteine transporter
GBAA_4905  2       24       3.408601   hypothetical protein
GBAA_4906  2       24       3.151789   methionine gamma-lyase
GBAA_4908  2       27       2.610876   30S ribosomal protein S4
GBAA_4909  0       19       0.564995   hypothetical protein
GBAA_4910  0       17       2.441422   hypothetical protein
GBAA_4911  -1      17       2.869599   tyrosyl-tRNA synthetase
GBAA_4912  0       15       2.717067   hypothetical protein
GBAA_4913  -1      17       3.596820   ECF subfamily RNA polymerase sigma factor
GBAA_4914  0       17       3.158727   lipoprotein
GBAA_4915  -1      17       3.024509   acetyl-CoA synthetase
55  GBAA_4927  GBAA_4940  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_4927  3       13       -1.099310  hypothetical protein
GBAA_4928  3       20       0.563890   hypothetical protein
GBAA_4929  3       19       0.336947   catabolite control protein A
GBAA_4930  3       18       -0.460410  lipoprotein
GBAA_4931  2       20       0.820066   hypothetical protein
GBAA_4932  1       21       1.188344   hypothetical protein
GBAA_4933  -1      20       1.348107   aminopeptidase
GBAA_4934  0       14       0.771171   lipoprotein
GBAA_4935  2       15       2.768275   hypothetical protein
GBAA_4936  2       15       2.638242   hypothetical protein
GBAA_4937  3       16       2.785927   ribosomal-protein-serine acetyltransferase
GBAA_4938  3       17       2.671185   UDP-N-acetylmuramate--L-alanine ligase
GBAA_4939  3       16       2.632148   nicotinate phosphoribosyltransferase
GBAA_4940  3       15       2.967467   DNA translocase FtsK
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4932  TYPE4SSCAGA   29     0.009    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 29.3 bits (65), Expect = 0.009
Identities = 20/71 (28%), Positives = 38/71 (53%), Gaps = 4/71 (5%)

Query: 104 VTDEIENNADKVAQVVQWSSAAIEVY---NHYRATRQEKKVEKEERKLERLEKKAEKK-E 159
+ D + +N + V + + ++ A + N+ + +K +EK RK E LEK+ EKK E
Sbjct: 574 IKDFLSSNKELVGKTLNFNKAVADAKNTGNYDEVKKAQKDLEKSLRKREHLEKEVEKKLE 633

Query: 160 KRSRLRMRGES 170
+S + + E+
Sbjct: 634 SKSGNKNKMEA 644


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4940  IGASERPTASE   64     5e-12    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 63.5 bits (154), Expect = 5e-12
Identities = 56/328 (17%), Positives = 96/328 (29%), Gaps = 35/328 (10%)

Query: 553 PVVEGQSVVEEAPIAEEQPVAEETSVVEEQPVAEETSIVEEQPVAEEAPVVE-EQPVVQK 611
P VE ++ + P + V EE + V+E PV AP E
Sbjct: 983 PEVEKRNQTVDTTNIT-TPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVA 1041

Query: 612 EEPKREKKRHVPFNVVMLKQDRARLMERHASRTNGMQSSMSERVENKPVHQVEEQPQVEE 671
E K+E K E+ A+ T +++ E + V+
Sbjct: 1042 ENSKQESKT-------------VEKNEQDATETTAQNREVAK----------EAKSNVKA 1078

Query: 672 KPMQQVV--VEPQVEEKQMQQVVEPQVEEKPMQQVVVEPQVEEKPMQQVVVEPQVEEKPM 729
V + +E Q + E EK + V + +E P V P+ E+
Sbjct: 1079 NTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSET 1138

Query: 730 QQVVVEPQVEEKPM------QQVVVEPQVEEKPMQQVVVEPQVEEKPMQQVVVEPQVEEK 783
Q EP E P Q E+P ++ + V V E
Sbjct: 1139 VQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVEN 1198

Query: 784 PVQ-QVVEPQVEEVQPVQQVVAEQVQKPISSTEVEEKAYVVNQRENDVRNVLQTPPTYTI 842
P Q + ++ + S + + + + T T
Sbjct: 1199 PENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTN 1258

Query: 843 PSLT-LLSIPQQAALDNTEWLEEQKELL 869
L+ + Q AL+ + + + L
Sbjct: 1259 AVLSDARAKAQFVALNVGKAVSQHISQL 1286



Score = 63.2 bits (153), Expect = 7e-12
Identities = 61/283 (21%), Positives = 95/283 (33%), Gaps = 40/283 (14%)

Query: 311 EEIKRSTEIEQPTIEVEKQAPEESVIVKAEEKLE-ETIVVEIPEEVEVIAEAEEPEEVEV 369
E+ ++ + T QA SV EE + V P A A E E
Sbjct: 986 EKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPP------APATPSETTET 1039

Query: 370 IAETEESEEVEVIAETEESEEV-------------EVIAETEELEEVEVTAETEELEEVE 416
+AE + E V +++ E V A T+ E + +ET+E + E
Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099

Query: 417 VVAETEELEEVEVIAETEKLEELEEV--EVIAETEESEEVEVIAETEAPEEVEPVALEEM 474
+E + ETEK +E+ +V +V + E+SE V+ AE A E V ++E
Sbjct: 1100 TKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEP-ARENDPTVNIKEP 1158

Query: 475 QQEMVLNEAIEQKNEFIHVAVADEQTKKDVQSFADVLIAEEQSVVEETPIVEEQPVAEEA 534
Q + EQ + V T E + V V E P
Sbjct: 1159 QSQTNTTADTEQPAKETSSNVEQPVT--------------ESTTVNTGNSVVENPENTTP 1204

Query: 535 PVVEEQSVVEETPIVEEAPVVEGQSV---VEEAPIAEEQPVAE 574
+ E + + +SV VE A +
Sbjct: 1205 ATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTV 1247



Score = 48.1 bits (114), Expect = 3e-07
Identities = 45/223 (20%), Positives = 75/223 (33%), Gaps = 21/223 (9%)

Query: 214 EQGERQYEESKKEEKSVVDQWLEKNGYEIERQEPIVEEKEVVQEMSAPQEVPAAELLHET 273
E E E SK+E K+V E++ E Q V ++ + Q A+ ET
Sbjct: 1035 ETTETVAENSKQESKTVEKN--EQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSET 1092

Query: 274 IAERMEGAKQESDVVDKNILQEELVDSKVEHEDTILSEEIKRSTEIEQPTIEVEKQAPEE 333
+ K+ + E+ +KVE E T +E+ + T P KQ E
Sbjct: 1093 KETQTTETKETAT-------VEKEEKAKVETEKT---QEVPKVTSQVSP-----KQEQSE 1137

Query: 334 SVIVKAEEKLEETIVVEIPEEVEVIAEAEEPEEVEVIAETEESEEVEVIAETEESEEVEV 393
+V +AE E V I E ++ + E A+ S + + E+
Sbjct: 1138 TVQPQAEPARENDPTVNIKEPQ---SQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNS 1194

Query: 394 IAETEELEEVEVTAETEELEEVEVVAETEELEEVEVIAETEKL 436
+ E E T + E + V + +
Sbjct: 1195 VVENPE-NTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEP 1236



Score = 46.6 bits (110), Expect = 7e-07
Identities = 54/266 (20%), Positives = 83/266 (31%), Gaps = 35/266 (13%)

Query: 423 ELEEVEVIAETEKLEELEEVEVIAETEESEEVEVIAETEAPEEVEPVALEEMQQEMVLNE 482
E+E+ +T + ++ + S E+ EAP A E V E
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETV-AE 1042

Query: 483 AIEQKNEFIHVAVADEQTKKDVQS------FADVLIAEEQSVV----EETPIVEEQPVAE 532
+Q+++ + D ++V + + V ET + E
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102

Query: 533 EAPVVEEQSVVEETPIVEEAPVVEGQSVVEEAPIAEEQPVAE------------------ 574
A V +E+ ET +E P V Q ++ QP AE
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQT 1162

Query: 575 ETSVVEEQPVAEETSIVEEQPVAEEAPV-----VEEQPVVQKEEPKREKKRHVPFNVVML 629
T+ EQP A+ETS EQPV E V V E P + N
Sbjct: 1163 NTTADTEQP-AKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKN 1221

Query: 630 KQDRARLMERHASRTNGMQSSMSERV 655
+ R+ H S+ V
Sbjct: 1222 RHRRSVRSVPHNVEPATTSSNDRSTV 1247


56  GBAA_4971  GBAA_5020  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_4971  -1      17       3.070953   molybdopterin converting factor subunit 1
GBAA_4972  4       21       6.280563   molybdopterin converting factor subunit 2
GBAA_4973  5       22       6.335399   molybdopterin-guanine dinucleotide biosynthesis
GBAA_4974  4       21       6.809221   molybdopterin biosynthesis protein moea
GBAA_4975  6       23       6.683768   molybdenum cofactor biosynthesis protein MoaC
GBAA_4976  7       25       6.644779   thiamine/molybdopterin biosynthesis MoeB-like
GBAA_4978  6       25       6.464193   triple helix repeat-containing collagen
GBAA_4979  -1      17       1.040373   hypothetical protein
GBAA_4980  -2      16       0.202378   hypothetical protein
GBAA_4981  -2      14       -0.799259  rhodanese-like domain-containing protein
GBAA_4982  -2      15       -0.965836  hypothetical protein
GBAA_4983  -2      15       -0.403536  homoserine O-acetyltransferase
GBAA_4984  0       17       1.031852   spore germination protein GerHA
GBAA_4985  2       19       1.073369   spore germination protein GerHB
GBAA_4986  0       26       3.519449   spore germination protein GerHC
GBAA_4987  1       25       5.058758   hypothetical protein
GBAA_4988  -1      20       4.877724   hypothetical protein
GBAA_4989  -1      17       4.412722   hypothetical protein
GBAA_4990  -4      17       2.408999   hypothetical protein
GBAA_4991  -3      17       1.261979   leucyl-tRNA synthetase
GBAA_4992  0       15       0.032149   permease
GBAA_4993  0       14       -1.057555  sodium/hydrogen exchanger family protein
GBAA_4994  0       17       -1.742937  TrkA domain-containing protein
GBAA_4995  2       19       -1.409876  phage integrase site specific recombinase
GBAA_4996  2       20       -0.081965  ABC transporter permease
GBAA_4997  3       21       2.158589   ABC transporter ATP-binding protein
GBAA_4998  2       20       1.224050   hypothetical protein
GBAA_4999  1       23       1.705437   hypothetical protein
GBAA_5000  0       22       1.918500   hypothetical protein
GBAA_5001  -1      19       1.287958   hypothetical protein
GBAA_5002  -2      18       0.239607   hypothetical protein
GBAA_5003  0       14       0.097876   ABC transporter ATP-binding protein
GBAA_5004  2       15       0.534387   hypothetical protein
GBAA_5005  2       15       1.370525   aspartate racemase
GBAA_5006  1       14       1.076131   hypothetical protein
GBAA_5007  1       16       1.797962   hypothetical protein
GBAA_5008  1       15       2.247561   hypothetical protein
GBAA_5009  0       15       2.390175   hypothetical protein
GBAA_5010  -1      17       2.511059   transferase
GBAA_5011  -1      17       2.085399   PAP2 family protein
GBAA_5012  0       14       3.334126   group 1 family glycosyl transferase
GBAA_5013  3       25       3.190698   molybdopterin-guanine dinucleotide biosynthesis
GBAA_5014  3       30       2.779488   molybdenum cofactor biosynthesis protein B
GBAA_5015  3       31       2.350273   hypothetical protein
GBAA_5016  1       31       2.266708   hypothetical protein
GBAA_5017  0       25       2.009221   S-adenosylmethionine synthetase
GBAA_5019  -1      17       0.763191   phosphoenolpyruvate carboxykinase
GBAA_5020  4       16       -0.761020  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4978  THERMOLYSIN   32     0.008    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 31.5 bits (71), Expect = 0.008
Identities = 27/120 (22%), Positives = 39/120 (32%), Gaps = 13/120 (10%)

Query: 304 TGSTGPTGSTGTTG-NTGVTGDTGPTGATGVSTTATYAFANNTSGSVISVLLGGTNIPLP 362
G P T T G GV GD T S Y +NT GS I G
Sbjct: 220 PGGAQPVAGTSTVGVGRGVLGDQKYINTTYSSYYGYYYLQDNTRGSGIFTYDG------- 272

Query: 363 NNQNIGPGITVSGGNTVFTV-----ANAGNYYIAYTINLTAGLLVSSRITVNGSPLAGTI 417
N+ + PG + G+ F A +YY + + + + + T+
Sbjct: 273 RNRTVLPGSLWADGDNQFFASYDAAAVDAHYYAGVVYDYYKNVHGRLSYDGSNAAIRSTV 332


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4980  PF07675       25     0.045    Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 24.7 bits (53), Expect = 0.045
Identities = 13/41 (31%), Positives = 18/41 (43%)

Query: 33 VYAGAGGSSAAIFLNGKRQPEAVIRTSVFLPPLATSTRTLG 73
VYA + G+ A+ F N + +T V P TR G
Sbjct: 1175 VYASSTGNDASNFANALLEEVLTAKTVVTAPEAIRGTRAQG 1215


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4984  IGASERPTASE   47     3e-07    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 47.0 bits (111), Expect = 3e-07
Identities = 38/253 (15%), Positives = 86/253 (33%), Gaps = 12/253 (4%)

Query: 9 KKKLNTTEKNETDNSEQKPNNQEDDNKEQTRSTKHNKSNNSEQKKEEHKESSQDKQQNQS 68
K++ T EKNE D +E N+E KE + K N N + + +Q + ++
Sbjct: 1045 KQESKTVEKNEQDATETTAQNREVA-KEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103

Query: 69 NQNQQQSAKQDESSQGQQNHSKQDDSDQGQQQHSKQGNSDQGQQQHSKQGDSNQGQQNHS 128
+++ + E+ + Q+ Q+Q + +++ + + Q +
Sbjct: 1104 ATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTN 1163

Query: 129 KQNDSDQGQQQHSKQDESSQEQQNHSKQDDS----DQGQQQHSKQDESSQEQQNHSKQDD 184
D++Q ++ S E + +S + + Q + E N K
Sbjct: 1164 TTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRH 1223

Query: 185 SDQGQQQHSKQDESSQEQQNHSKQDDSDQDDSFQDTQQSSKQD-------DLAQDKQQHS 237
+ + ++ + S D + + S + ++ + QH
Sbjct: 1224 RRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHI 1283

Query: 238 KQDNSDQDKQQNS 250
Q + + Q N
Sbjct: 1284 SQLEMNNEGQYNV 1296


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4992  TCRTETA       55     2e-10    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 55.2 bits (133), Expect = 2e-10
Identities = 54/321 (16%), Positives = 118/321 (36%), Gaps = 10/321 (3%)

Query: 42 GMVLMINSLTGVIGNLLGGVLFDKWGGYKSTLVGIVITLVSILGLVFFHG-WPLYVVWLA 100
G++L + +L + G L D++G LV + V + W LY+
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIG--R 103

Query: 101 LIGFGSGMVFPSMYAMVGTVWPEGGR-RAFNAMYVGQNVGIAIGTACGGLVASYRFDYIF 159
++ +G A + + R R F M G+ G GGL+ + F
Sbjct: 104 IVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPF 163

Query: 160 LANFILYFVFFLIAFIGFR-GMEDKKEPGVQKEVEAKKGWSLTPGFKALLIVCVAYALCW 218
A L + FL + ++ P ++ + + G + + + +
Sbjct: 164 FAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQ 223

Query: 219 VTYVQWQGAIATHMQE-LNISLRHYSLLWTINGAMIVCAQPLVSMLIRWMKR-SLKQQIM 276
+ ++ + + G + AQ +++ R ++ +M
Sbjct: 224 LVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITG--PVAARLGERRALM 281

Query: 277 IGILIFAVSFIVLSQAQQFTMFLVAMVTLTIGELFVWPAVPTIANILAPKDKLGFYQGVV 336
+G++ +I+L+ A + M MV L G + + PA+ + + +++ G QG +
Sbjct: 282 LGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGM-PALQAMLSRQVDEERQGQLQGSL 340

Query: 337 NSAATVGKMFGPVVGGAIVDL 357
+ ++ + GP++ AI
Sbjct: 341 AALTSLTSIVGPLLFTAIYAA 361


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4994  SECA          29     0.006    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.5 bits (66), Expect = 0.006
Identities = 12/37 (32%), Positives = 23/37 (62%)

Query: 122 KNMKKFFNPGPDSIIEAGDMLVLSGARHEVKRIINEL 158
+ +K + D+++EAG + ++ RHE +RI N+L
Sbjct: 535 EKIKADWQVRHDAVLEAGGLHIIGTERHESRRIDNQL 571


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_5000  BACINVASINB   27     0.008    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 27.4 bits (60), Expect = 0.008
Identities = 18/65 (27%), Positives = 30/65 (46%), Gaps = 3/65 (4%)

Query: 8 ESYITQAEQAVEYAKEQLDQGMRQEHYNTMEYSDAQLQLEQAYNDLQTMQQHANDEQREQ 67
E+ + QA + AKE LD+ +DA+ + E+A N L Q AN + Q
Sbjct: 185 EAAVEQAGKEATEAKEALDKATDATV---KAGTDAKAKAEKADNILTKFQGTANAASQNQ 241

Query: 68 LNRAR 72
+++
Sbjct: 242 VSQGE 246


57  GBAA_5104  GBAA_5113  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_5104  0       16       3.123272   D-alanyl-D-alanine carboxypeptidase
GBAA_5105  1       17       4.188697   sensor histidine kinase
GBAA_5106  1       17       5.316824   DNA-binding response regulator
GBAA_5107  1       15       5.026484   N-acylamino acid racemase
GBAA_5108  0       14       4.677435   O-succinylbenzoic acid--CoA ligase
GBAA_5109  1       14       4.432956   naphthoate synthase
GBAA_5110  1       13       3.792359   alpha/beta hydrolase
GBAA_5111  0       14       4.015198   2-succinyl-5-enolpyruvyl-6-hydroxy-3-
GBAA_5112  -1      18       3.473035   menaquinone-specific isochorismate synthase
GBAA_5113  -1      26       3.605807   1,4-dihydroxy-2-naphthoate
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_5106  HTHFIS        102    2e-27    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 102 bits (256), Expect = 2e-27
Identities = 37/144 (25%), Positives = 70/144 (48%), Gaps = 5/144 (3%)

Query: 1 MKRISILIADDEAEIADLIEIHLEKEGYHVVKAADGEEAIHIIETQPIDLVVLDIMMPKM 60
M +IL+ADD+A I ++ L + GY V ++ I DLVV D++MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGYEVTRQIRA-KHHMPIIFLSAKTSDFDKVTGLVLGADDYMTKPFTPIELVARVNAQLR 119
+ +++ +I+ + +P++ +SA+ + + GA DY+ KPF EL+ +
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII----G 116

Query: 120 RFLTLNQPKVAENKSALQVGGVTI 143
R L + + ++ + Q G +
Sbjct: 117 RALAEPKRRPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
GBAA_5113	TYPE3IMSPROT	30	0.011	Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 30.1 bits (68), Expect = 0.011
Identities = 9/45 (20%), Positives = 16/45 (35%)

Query: 231 GRERAVGVLASMFIVSYIWTIALIIVGIVSPWMLIVFLSAPKAFK 275
VL F + + ++ I S + FL + +A K
Sbjct: 72 LSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIK 116


58	GBAA_5131	GBAA_5137	Y	N	N	Genomic Island
LocusTag	DNBias	CDNBias	%GCBias	Product
GBAA_5131	2	16	1.680811	hypothetical protein
GBAA_5132	2	18	1.467597	general stress protein 13
GBAA_5133	2	17	1.584893	hypothetical protein
GBAA_5134	4	18	1.663364	asnC family transcriptional regulator
GBAA_5135	3	18	1.957644	gluconate 2-dehydrogenase
GBAA_5136	3	18	1.533189	alpha/beta hydrolase
GBAA_5137	2	15	1.798407	hypothetical protein
59	GBAA_5192	GBAA_5232	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
GBAA_5192	2	17	2.450578	phosphatase
GBAA_5193	3	16	1.719085	DeoR family transcriptional regulator
GBAA_5194	3	16	1.797407	hypothetical protein
GBAA_5195	1	23	2.431373	fructose 1,6-bisphosphatase II
GBAA_5196	1	20	1.320167	hypothetical protein
GBAA_5198	0	19	1.240056	hypothetical protein
GBAA_5199	0	16	1.737474	lipoprotein
GBAA_5200	0	18	2.859193	transcriptional activator tipA
GBAA_5201	0	17	2.633051	hypothetical protein
GBAA_5203	1	14	2.771157	phosphoglycerate mutase
GBAA_5204	0	16	3.138422	hypothetical protein
GBAA_5205	1	15	2.691281	lipoyl synthase
GBAA_5206	2	20	2.334964	M23/37 family peptidase
GBAA_5207	2	19	0.499411	hypothetical protein
GBAA_5208	1	20	1.626882	hypothetical protein
GBAA_5209	1	20	1.478368	5'-nucleotidase
GBAA_5210	1	23	1.131352	hypothetical protein
GBAA_5211	1	26	2.633163	PadR family transcriptional regulator
GBAA_5212	2	30	2.894413	hypothetical protein
GBAA_5213	2	30	2.904524	hypothetical protein
GBAA_5214	3	25	1.787300	NifU domain-containing protein
GBAA_5215	3	23	1.882750	class V aminotransferase
GBAA_5216	3	21	1.754318	hypothetical protein
GBAA_5217	1	16	0.356773	ABC transporter ATP-binding protein
GBAA_5219	0	15	-0.325954	ABC transporter substrate-binding protein
GBAA_5220	-1	15	0.167754	ABC transporter substrate-binding protein
GBAA_5221	0	15	1.065655	ABC transporter permease
GBAA_5222	1	14	0.363391	ABC transporter ATP-binding protein
GBAA_5224	2	15	-1.080201	hypothetical protein
GBAA_5225	1	19	-0.675935	thioredoxin
GBAA_5226	4	18	-1.668626	TOPRIM domain-containing protein
GBAA_5228	6	19	-3.960374	glycine cleavage system protein H
GBAA_5229	10	22	-4.286895	hypothetical protein
GBAA_5230	9	25	-4.004409	hypothetical protein
GBAA_5232	6	20	-2.943123	hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
GBAA_5220	adhesinb	28	0.044	Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 27.9 bits (62), Expect = 0.044
Identities = 16/49 (32%), Positives = 24/49 (48%), Gaps = 4/49 (8%)

Query: 1 MKKLLLTALISTSIFGLAACGGKDNDEK----KLVVGASNVPHAEILEK 45
MKK L+ + GLAAC + + + KL V A+N A+I +
Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKN 49


60	GBAA_5331	GBAA_5348	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
GBAA_5331	1	21	3.047756	DNA-binding response regulator
GBAA_5332	1	24	3.522529	ssrA-binding protein
GBAA_5334	-1	21	3.293891	ribonuclease R
GBAA_5335	0	16	1.330820	carboxylesterase
GBAA_5336	1	18	0.450066	preprotein translocase subunit SecG
GBAA_5337	1	18	0.580387	hypothetical protein
GBAA_5741	1	20	-0.535418	LrgA family protein
GBAA_5338	2	20	-0.440938	inosine-uridine preferring nucleoside hydrolase
GBAA_5339	2	21	-1.454621	hypothetical protein
GBAA_5340	2	20	-2.053795	hypothetical protein
GBAA_5342	3	22	-1.977192	hypothetical protein
GBAA_5344	2	21	-2.433143	prophage LambdaBa03, HNH endonuclease
GBAA_5345	3	23	-2.290743	hypothetical protein
GBAA_5346	1	23	-2.594753	hypothetical protein
GBAA_5347	-1	24	-2.448746	hypothetical protein
GBAA_5348	3	23	-2.384790	hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
GBAA_5331	HTHFIS	58	6e-12	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 57.5 bits (139), Expect = 6e-12
Identities = 23/134 (17%), Positives = 54/134 (40%), Gaps = 2/134 (1%)

Query: 4 VLVIKNERSLAKKIVSGLTEEGHFILKLHNENEGLNIIYEQDWDIIILDWDSLSISGPEI 63
+LV ++ ++ + L+ G+ + N I D D+++ D + ++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 CRQIR-LVKMTPIIIVTDNISSKDCVAGLQAGADDYIRKPFAKEELVARV-QAILRRSGC 121
+I+ P+++++ + + + GA DY+ KPF EL+ + +A+
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 122 NQQHETTFFQFKDL 135
+ E L
Sbjct: 126 PSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
GBAA_5336	SECGEXPORT	39	2e-07	Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 38.8 bits (90), Expect = 2e-07
Identities = 21/77 (27%), Positives = 43/77 (55%), Gaps = 4/77 (5%)

Query: 1 MHTLLSVLLIIVSILMIVMVLMQSSNSSGLSGAISGGAE-QLFGKQKARGIEAVLNRITI 59
M+ L V+ +IV+I ++ ++++Q + + + GA LFG + G + R+T
Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFG---SSGSGNFMTRMTA 57

Query: 60 VLAVLFFALTIGVTYLN 76
+LA LFF +++ + +N
Sbjct: 58 LLATLFFIISLVLGNIN 74


61	GBAA_5362	GBAA_5376	Y	N	Y	Genomic Island
LocusTag	DNBias	CDNBias	%GCBias	Product
GBAA_5362	1	37	3.680856	hypothetical protein
GBAA_5363	3	46	4.565783	prophage lambdaba03, site-specific recombinase
GBAA_5364	4	45	5.227080	phosphopyruvate hydratase
GBAA_5365	4	37	4.583499	phosphoglyceromutase
GBAA_5366	4	26	3.920244	triosephosphate isomerase
GBAA_5367	3	23	3.332674	phosphoglycerate kinase
GBAA_5369	3	17	2.297165	glyceraldehyde-3-phosphate dehydrogenase
GBAA_5370	2	17	2.104355	gapA transcriptional regulator CggR
GBAA_5371	0	17	3.162499	glutaredoxin family protein
GBAA_5372	0	19	4.092531	RNA polymerase factor sigma-54
GBAA_5373	1	28	4.878530	*hypothetical protein
GBAA_5374	0	28	3.566285	lipoprotein
GBAA_5375	-2	29	4.277343	stage v sporulation protein ac
GBAA_5376	-2	26	4.340588	stage V sporulation protein AD
62	GBAA_5391	GBAA_5424	Y	Y	Y	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
GBAA_5391	1	22	3.259490	prolipoprotein diacylglyceryl transferase
GBAA_5392	1	19	2.348545	HPr kinase/phosphorylase
GBAA_5393	1	17	2.066744	hypothetical protein
GBAA_5394	1	17	2.311423	hypothetical protein
GBAA_5395	1	16	2.057693	excinuclease ABC subunit A
GBAA_5396	0	18	0.353841	excinuclease ABC subunit B
GBAA_5397	0	21	-2.019235	IS605 family transposase
GBAA_5398	0	25	-1.296148	lipoprotein
GBAA_5399	3	25	1.773711	hypothetical protein
GBAA_5400	1	21	2.503420	merR family transcriptional regulator
GBAA_5402	3	27	3.689201	hypothetical protein
GBAA_5403	2	27	4.274825	hypothetical protein
GBAA_5404	1	24	3.599862	DNA-binding protein
GBAA_5405	1	21	3.085862	hypothetical protein
GBAA_5406	-1	19	2.415894	LysR family transcriptional regulator
GBAA_5407	-1	16	2.083899	merR family transcriptional regulator
GBAA_5408	-1	15	1.655551	FMN reductase, NADPH-dependent
GBAA_5409	-1	15	1.904275	macrolide efflux pump
GBAA_5411	-1	15	2.071146	ABC transporter permease/ATP-binding protein
GBAA_5412	-1	17	1.990161	hypothetical protein
GBAA_5414	1	20	2.141067	carboxyl-terminal protease
GBAA_5415	2	24	1.799573	cell division ABC transporter permease FtsX
GBAA_5416	2	23	2.386175	cell division ABC transporter ATP-binding
GBAA_5417	3	18	1.848261	cytochrome c-551
GBAA_5419	2	15	2.251599	hypothetical protein
GBAA_5421	2	16	2.164243	preprotein translocase subunit SecA
GBAA_5422	1	13	1.470681	ribosomal subunit interface protein
GBAA_5424	2	16	1.698541	cold shock protein CspC
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
GBAA_5414	BINARYTOXINB	30	0.028	Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 29.7 bits (66), Expect = 0.028
Identities = 14/44 (31%), Positives = 22/44 (50%), Gaps = 1/44 (2%)

Query: 201 GKDIGYMQITSFAENTAKEFKDQLKELEKKNIKGLVIDVRGNPG 244
GKDI F + T++ K+QL EL NI ++ ++ N
Sbjct: 573 GKDITEFDFN-FDQQTSQNIKNQLAELNATNIYTVLDKIKLNAK 615


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
GBAA_5421	SECA	1171	0.0	SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 1171 bits (3030), Expect = 0.0
Identities = 446/897 (49%), Positives = 598/897 (66%), Gaps = 65/897 (7%)

Query: 1 MIGILKKVF-DVNQRQIKRMQKTVEQIDALESSIKPLTDEQLKGKTLEFKERLTKGETVD 59
+I +L KVF N R ++RM+K V I+A+E ++ L+DE+LKGKT EF+ RL KGE ++
Sbjct: 2 LIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLE 61

Query: 60 DLLPEAFAVVREAATRVLGMRPYGVQLMGGIALHEGNISEMKTGEGKTLTSTLPVYLNAL 119
+L+PEAFAVVREA+ RV GMR + VQL+GG+ L+E I+EM+TGEGKTLT+TLP YLNAL
Sbjct: 62 NLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNAL 121

Query: 120 TGKGVHVVTVNEYLAQRDANEMGQLHEFLGLTVGINLNSMSREEKQEAYAADITYSTNNE 179
TGKGVHVVTVN+YLAQRDA L EFLGLTVGINL M K+EAYAADITY TNNE
Sbjct: 122 TGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNNE 181

Query: 180 LGFDYLRDNMVLYKEQCVQRPLHFAIIDEVDSILVDEARTPLIISGQAQKSTELYMFANA 239
GFDYLRDNM E+ VQR LH+A++DEVDSIL+DEARTPLIISG A+ S+E+Y N
Sbjct: 182 YGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVNK 241

Query: 240 FVRTL-----------ENEKDYSFDVKTKNVMLTEDGITKAEKAFHI-------ENLFDL 281
+ L + E +S D K++ V LTE G+ E+ E+L+
Sbjct: 242 IIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYSP 301

Query: 282 KHVALLHHINQALRAHVVMHRDTDYVVQEGEIVIVDQFTGRLMKGRRYSEGLHQAIEAKE 341
++ L+HH+ ALRAH + RD DY+V++GE++IVD+ TGR M+GRR+S+GLHQA+EAKE
Sbjct: 302 ANIMLMHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAKE 361

Query: 342 GVEIQNESMTLATITFQNYFRMYEKLSGMTGTAKTEEEEFRNIYNMNVIVIPTNKPIIRD 401
GV+IQNE+ TLA+ITFQNYFR+YEKL+GMTGTA TE EF +IY ++ +V+PTN+P+IR
Sbjct: 362 GVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIRK 421

Query: 402 DRADLIFKSMKGKFNAVVEDIVNRHKQGQPVLVGTVAIETSELISKMLTRKGVRHNILNA 461
D DL++ + K A++EDI R +GQPVLVGT++IE SEL+S LT+ G++HN+LNA
Sbjct: 422 DLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLNA 481

Query: 462 KNHAREADIIAEAGMKGAVTIATNMAGRGTDIKLG------------------------- 496
K HA EA I+A+AG AVTIATNMAGRGTDI LG
Sbjct: 482 KFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKADW 541

Query: 497 ----DDIKNIG-LAVIGTERHESRRIDNQLRGRAGRQGDPGVTQFYLSMEDELMRRFGSD 551
D + G L +IGTERHESRRIDNQLRGR+GRQGD G ++FYLSMED LMR F SD
Sbjct: 542 QVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFASD 601

Query: 552 NMKAMMDRLGMDDSQPIESKMVSRAVESAQKRVEGNNYDARKQLLQYDDVLRQQREVIYK 611
+ MM +LGM + IE V++A+ +AQ++VE N+D RKQLL+YDDV QR IY
Sbjct: 602 RVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIYS 661

Query: 612 QRQEVMESENLRGIIEGMMKSTVERAV-ALHTQEEIEEDWNIKGLVDYLNTNLLQEGDVK 670
QR E+++ ++ I + + + + A + +EE W+I GL + L + + +
Sbjct: 662 QRNELLDVSDVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPIA 721

Query: 671 E--EELRRLAPEEMSEPIIAKLIERYNDKEKLMPEEQMREFEKVVVFRVVDTKWTEHIDA 728
E ++ L E + E I+A+ IE Y KE+++ E MR FEK V+ + +D+ W EH+ A
Sbjct: 722 EWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAA 781

Query: 729 MDHLREGIHLRAYGQIDPLREYQMEGFAMFESMIASIEEEISRYIMKAEI---------- 778
MD+LR+GIHLR Y Q DP +EY+ E F+MF +M+ S++ E+ + K ++
Sbjct: 782 MDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEEVEELE 841

Query: 779 -EQNLERQEVVQGEAVHPSSDGEEAKKKPVVKGDQ--VGRNDLCKCGSGKKYKNCCG 832
++ +E + + Q + + D A + + VGRND C CGSGKKYK C G
Sbjct: 842 QQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCGSGKKYKQCHG 898


63	GBAA_5509	GBAA_5559	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
GBAA_5509	2	15	-3.374928	udp-n-acetylglucosamine 2-epimerase
GBAA_5510	3	17	-4.251066	teichoic acids export protein ATP-binding
GBAA_5511	4	18	-4.230778	techoic acid ABC transporter efflux permease
GBAA_5512	4	17	-4.013049	UDP-N-acetyl-D-mannosamine dehydrogenase
GBAA_5513	3	16	-3.797894	hypothetical protein
GBAA_5516	1	16	-3.170503	hypothetical protein
GBAA_5517	0	16	-2.379330	hypothetical protein
GBAA_5518	0	15	-2.188522	group 1 family glycosyl transferase
GBAA_5519	0	14	-0.410120	group 1 family glycosyl transferase
GBAA_5520	3	17	0.374515	rod shape-determining protein Mbl
GBAA_5521	3	15	-0.723658	stage III sporulation protein D
GBAA_5522	2	13	0.762091	lipoprotein
GBAA_5523	1	15	2.132611	hypothetical protein
GBAA_5524	1	12	1.781990	stage II sporulation protein
GBAA_5525	-1	13	1.459744	ABC transporter permease
GBAA_5526	-2	16	2.674416	ABC transporter ATP-binding protein
GBAA_5528	-1	17	3.796322	stage II sporulation protein D
GBAA_5529	1	18	4.033282	UDP-N-acetylglucosamine
GBAA_5530	2	23	3.908053	hypothetical protein
GBAA_5531	3	24	4.287677	hypothetical protein
GBAA_5532	3	24	4.174069	NADH dehydrogenase subunit N
GBAA_5533	4	24	4.605747	NADH dehydrogenase subunit M
GBAA_5534	4	26	4.991872	NADH dehydrogenase subunit L
GBAA_5535	3	27	5.335738	NADH dehydrogenase subunit K
GBAA_5536	3	26	5.412286	NADH dehydrogenase subunit J
GBAA_5537	4	26	5.636046	NADH dehydrogenase subunit I
GBAA_5538	1	15	2.696421	NADH dehydrogenase subunit H
GBAA_5539	0	13	1.558903	NADH dehydrogenase subunit D
GBAA_5540	-1	12	-0.029969	NADH dehydrogenase subunit C
GBAA_5541	-2	9	-1.253266	NADH dehydrogenase subunit B
GBAA_5542	-2	12	-0.050334	NADH dehydrogenase subunit A
GBAA_5543	-1	14	0.411203	sensory box/GGDEF family protein
GBAA_5544	1	25	3.162869	hypothetical protein
GBAA_5545	2	27	3.608535	hypothetical protein
GBAA_5546	3	31	4.242782	ATP synthase F0F1 subunit epsilon
GBAA_5547	4	33	4.259352	ATP synthase F0F1 subunit beta
GBAA_5548	3	28	3.468220	ATP synthase F0F1 subunit gamma
GBAA_5549	1	28	3.396337	ATP synthase F0F1 subunit alpha
GBAA_5550	-2	20	2.120179	ATP synthase F0F1 subunit delta
GBAA_5551	-3	22	2.481880	ATP synthase F0F1 subunit B
GBAA_5552	0	24	2.848162	ATP synthase F0F1 subunit C
GBAA_5553	1	24	3.161470	ATP synthase F0F1 subunit A
GBAA_5554	1	20	3.246342	ATP synthase I
GBAA_5555	2	21	3.152136	hypothetical protein
GBAA_5556	3	21	3.122565	hypothetical protein
GBAA_5557	2	21	3.001506	uracil phosphoribosyltransferase
GBAA_5558	2	23	3.002315	serine hydroxymethyltransferase
GBAA_5559	0	19	3.015058	hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
GBAA_5511	ABC2TRNSPORT	30	0.007	ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 30.3 bits (68), Expect = 0.007
Identities = 53/243 (21%), Positives = 99/243 (40%), Gaps = 19/243 (7%)

Query: 27 KQAYAGNLLGLLWVFLNPLSQIGVYWLVFGLGIRGGAPVHGVPYFVWLVCGLVTWFFVGT 86
K+A +LLG L PL I ++ L GLG+ G V GV Y +L G+V +
Sbjct: 28 KKAALASLLGHL---AEPL--IYLFGLGAGLGVMVGR-VGGVSYTAFLAAGMVATSAMTA 81

Query: 87 TITQSANSIYSRLN---TVSKMNFPLSIIPTYVVISQLY--THLILIIFALVIVIFNLGF 141
++ + + R+ T M + + V+ + T L + +V LG+
Sbjct: 82 ATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGY 141

Query: 142 STINILELMYGLVASTLFLIALSFLTSTLSTMLRDIQLLI--QS--VTRMLFFLTPIFWE 197
+ L L+Y L L +A + L ++ + I Q+ +T +LF +F
Sbjct: 142 TQ--WLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVF-- 197

Query: 198 PKENMSNLLLFIIKINPLYYIVEVYRGALIYNDTSIVLSWYTLYFWGAVIILFIAGSMLH 257
P + + + + PL + +++ R ++ + V VI F++ ++L
Sbjct: 198 PVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLR 257

Query: 258 IRF 260
R
Sbjct: 258 RRL 260


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
GBAA_5520	SHAPEPROTEIN	478	e-173	Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 478 bits (1233), Expect = e-173
Identities = 179/330 (54%), Positives = 244/330 (73%), Gaps = 5/330 (1%)

Query: 1 MFARDIGIDLGTANVLIHVKGKGIVLNEPSVVAIDRNTG----KVLAVGEEARSMVGRTP 56
MF+ D+ IDLGTAN LI+VKG+GIVLNEPSVVAI ++ V AVG +A+ M+GRTP
Sbjct: 8 MFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAKQMLGRTP 67

Query: 57 GNIVAIRPLKDGVIADFEITEAMLKYFINKLDVKSFFS-KPRILICCPTNITSVEQKAIR 115
GNI AIRP+KDGVIADF +TE ML++FI ++ SF PR+L+C P T VE++AIR
Sbjct: 68 GNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIR 127

Query: 116 EAAERSGGKTVFLEEEPKVAAVGAGMEIFQPSGNMVVDIGGGTTDIAVLSMGDIVTSSSI 175
E+A+ +G + VFL EEP AA+GAG+ + + +G+MVVDIGGGTT++AV+S+ +V SSS+
Sbjct: 128 ESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSV 187

Query: 176 KMAGDKFDMEILNYIKRKYKLLIGERTSEDIKIKVGTVFPGARSEELEIRGRDMVTGLPR 235
++ GD+FD I+NY++R Y LIGE T+E IK ++G+ +PG E+E+RGR++ G+PR
Sbjct: 188 RIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPR 247

Query: 236 TITVCSEEITEALKENAAVIVQAAKGVLERTPPELSADIIDRGVILTGGGALLHGIDMLL 295
T+ S EI EAL+E IV A LE+ PPEL++DI +RG++LTGGGALL +D LL
Sbjct: 248 GFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLL 307

Query: 296 AEELKVPVLIAENPMHCVAVGTGIMLENID 325
EE +PV++AE+P+ CVA G G LE ID
Sbjct: 308 MEETGIPVVVAEDPLTCVARGGGKALEMID 337


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
GBAA_5540	IGASERPTASE	38	6e-05	IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 37.7 bits (87), Expect = 6e-05
Identities = 22/121 (18%), Positives = 42/121 (34%), Gaps = 6/121 (4%)

Query: 51 KNDDMTIEEAKRRAAAAAKA--KAAALAKQKREGIEEVTEEEKVKAKAAAAAKAKAAALA 108
+ + E +K+ + K A Q RE +E K + A++ +
Sbjct: 1035 ETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKE 1094

Query: 109 KQK--ASQGNGDSGDEKAKAIAAAKAKAAAAARAKTKGAEGKKEEELKQEEPSV-NEPYL 165
Q + +EKAK + ++ + + E Q EP+ N+P +
Sbjct: 1095 TQTTETKETATVEKEEKAKVETEKTQEVPKVT-SQVSPKQEQSETVQPQAEPARENDPTV 1153

Query: 166 N 166
N
Sbjct: 1154 N 1154



Score = 36.2 bits (83), Expect = 2e-04
Identities = 27/156 (17%), Positives = 47/156 (30%), Gaps = 11/156 (7%)

Query: 7 DLEDLKREAARRAKEEARKRLVAKHGVEISKLEEENREKEKA--LPKNDDMTIEEAKRRA 64
DL + + E + + ++ + N E + P ++
Sbjct: 979 DLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE 1038

Query: 65 AAAAKAKAAALAKQKREGIEEVTEEEKVKAKAAAAAKAKAAALAKQKASQGNGDSGDEKA 124
A +K + +K E ++ TE + A AK+ A + +G E
Sbjct: 1039 TVAENSKQESKTVEKNE--QDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQ 1096

Query: 125 KAIAAAKAKAAAAARAKTKGAEGKKEEELKQEEPSV 160
A +AK E E QE P V
Sbjct: 1097 TTETKETATVEKEEKAKV-------ETEKTQEVPKV 1125



Score = 34.7 bits (79), Expect = 6e-04
Identities = 26/154 (16%), Positives = 48/154 (31%), Gaps = 17/154 (11%)

Query: 14 EAARRAKEEARKRLVAKHGVEISKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAKAA 73
+A + E A+ K E EKE+ + T E K + + K + +
Sbjct: 1077 KANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQS 1136

Query: 74 ALAKQKREGIEEVTEEEKVKAKAAAAAKAKAAALAKQKASQGNGDSGDEKAKAIAAAKAK 133
E V+ +A A + K+ SQ N + E+ ++ +
Sbjct: 1137 ----------------ETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE 1180

Query: 134 AAAAARAKTKGAEGKKEEELKQEEPSVNEPYLNQ 167
T E + P+ +P +N
Sbjct: 1181 QPVTEST-TVNTGNSVVENPENTTPATTQPTVNS 1213



Score = 33.1 bits (75), Expect = 0.002
Identities = 19/152 (12%), Positives = 46/152 (30%), Gaps = 3/152 (1%)

Query: 13 REAARR-AKEEARKRLVAKHGVEISKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAK 71
+E KE A K VE K +E + + PK + E + +A A +
Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQS--ETVQPQAEPAREND 1150

Query: 72 AAALAKQKREGIEEVTEEEKVKAKAAAAAKAKAAALAKQKASQGNGDSGDEKAKAIAAAK 131
K+ + + E+ + ++ + ++ + A
Sbjct: 1151 PTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPT 1210

Query: 132 AKAAAAARAKTKGAEGKKEEELKQEEPSVNEP 163
+ ++ + K + + E + +
Sbjct: 1211 VNSESSNKPKNRHRRSVRSVPHNVEPATTSSN 1242


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
GBAA_5551	IGASERPTASE	30	0.005	IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.0 bits (67), Expect = 0.005
Identities = 23/116 (19%), Positives = 49/116 (42%), Gaps = 6/116 (5%)

Query: 36 PLMGIMKEREEHVANEIDAAERNNAEAKKLVEEQREMLKQSRVEAQELIERAKKQAVDQK 95
P E E VA + +++ + +K ++ E Q+R A+E ++ +A Q
Sbjct: 1028 PAPATPSETTETVA---ENSKQESKTVEKNEQDATETTAQNREVAKE--AKSNVKANTQT 1082

Query: 96 DVIVAAAKEEAESIKASAVQEIQREKEQAIAALQEQVASLSVQIASKVIEKELKEE 151
+ + + E E+ + EKE+ E+ + ++ S+V K+ + E
Sbjct: 1083 NEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVP-KVTSQVSPKQEQSE 1137


64	GBAA_5569	GBAA_5587	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
GBAA_5569	2	15	1.692323	hypothetical protein
GBAA_5570	1	16	2.340731	stage II sporulation protein R
GBAA_5571	-1	18	3.782676	HemK family modification methylase
GBAA_5572	-1	19	4.138151	peptide chain release factor 1
GBAA_5573	-1	23	4.416295	thymidine kinase
GBAA_5574	-1	23	4.046893	50S ribosomal protein L31
GBAA_5575	-1	21	3.554014	transcription termination factor Rho
GBAA_5576	0	27	4.101476	fructose 1,6-bisphosphatase II
GBAA_5578	2	26	3.524123	UDP-N-acetylglucosamine
GBAA_5580	3	28	2.371877	fructose-bisphosphate aldolase
GBAA_5581	-1	16	3.031734	stage 0 sporulation protein F
GBAA_5582	-1	17	3.858789	hypothetical protein
GBAA_5583	-1	18	4.205397	CTP synthetase
GBAA_5584	0	16	5.175276	DNA-directed RNA polymerase subunit delta
GBAA_5585	-1	16	5.290934	TetR family transcriptional regulator
GBAA_5586	-1	14	4.848806	acyl-CoA dehydrogenase
GBAA_5587	-1	13	3.537828	acyl-CoA dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
GBAA_5570	IGASERPTASE	40	1e-05	IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.0 bits (93), Expect = 1e-05
Identities = 27/99 (27%), Positives = 42/99 (42%), Gaps = 5/99 (5%)

Query: 177 AESPEEEQVKQIDDEEVVDTEEKKEDEVKEKKVVKQEVATKVTASEKKVVKNETKVEEQP 236
A E + + ++ T EK E + E +EVA + K VK T+ E
Sbjct: 1031 ATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKE----AKSNVKANTQTNEVA 1086

Query: 237 VSKEETKTVEKVEKPVEQKQEKQNEY-VKVEEEEEEPEV 274
S ETK + E EK+ + V+ E+ +E P+V
Sbjct: 1087 QSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKV 1125



Score = 33.1 bits (75), Expect = 0.001
Identities = 24/119 (20%), Positives = 44/119 (36%), Gaps = 6/119 (5%)

Query: 166 TAVRKEEHVVKAESPEEEQVKQIDDEEVVDTEEKKEDEVKEKKVVKQEVATKVTASEKKV 225
+ ++ + V K E E Q V E K + + + ++ ++
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQ---NREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099

Query: 226 VKNETKVEEQPVSKEETKTVEKVEKPVEQ---KQEKQNEYVKVEEEEEEPEVKLFIVEA 281
K VE++ +K ET+ ++V K Q KQE+ E E + + I E
Sbjct: 1100 TKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEP 1158



Score = 29.3 bits (65), Expect = 0.020
Identities = 20/103 (19%), Positives = 39/103 (37%), Gaps = 5/103 (4%)

Query: 176 KAESPEEEQVKQIDDEEVVDTEEKKEDEVKEKKVV----KQEVATKVTASEKKVVKNETK 231
K+ Q ++ +T+E + E KE V K +V T+ T KV +
Sbjct: 1073 KSNVKANTQTNEVAQSGS-ETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSP 1131

Query: 232 VEEQPVSKEETKTVEKVEKPVEQKQEKQNEYVKVEEEEEEPEV 274
+EQ + + + P +E Q++ + E+ +
Sbjct: 1132 KQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKE 1174


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
GBAA_5581	HTHFIS	112	2e-32	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 112 bits (281), Expect = 2e-32
Identities = 31/117 (26%), Positives = 56/117 (47%)

Query: 3 GKILIVDDQYGIRVLLHEVFQKEGYQTFQAANGFQALDIVKKDNPDLVVLDMKIPGMDGI 62
IL+ DD IR +L++ + GY +N + + DLVV D+ +P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 63 EILKHVKEIDESIKVILMTAYGELDMIQEAKDLGALMHFAKPFDIDEIRQAVRNELA 119
++L +K+ + V++M+A +A + GA + KPFD+ E+ + LA
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
GBAA_5585	HTHTETR	64	5e-15	TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 64.3 bits (156), Expect = 5e-15
Identities = 27/141 (19%), Positives = 61/141 (43%), Gaps = 6/141 (4%)

Query: 10 RREQMIKGAVQLFKQKGFPRTTTREIAKAAGFSIGTLYEYIRTKDDVLYLVCDSIYEHVK 69
R+ ++ A++LF Q+G T+ EIAKAAG + G +Y + + K D+ + + ++
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71

Query: 70 ERLEEV-VCTEKGSVESLKIAITNYFKVMDELQEE---VLIMYQEVRFLPKESLPYVLEK 125
E E + L+ + + + + + I++ + F+ + ++ ++
Sbjct: 72 ELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQR 131

Query: 126 EF--QMVGMFENILEQCTENG 144
+ E L+ C E
Sbjct: 132 NLCLESYDRIEQTLKHCIEAK 152


65	GBAA_5663	GBAA_5675	Y	Y	N	Pathogenicity Island (biased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
GBAA_5663	3	10	1.028849	pyridoxal kinase
GBAA_5664	3	13	0.765127	diguanylate cyclase
GBAA_5665	2	12	0.788448	hypothetical protein
GBAA_5666	2	11	-0.159803	carbon starvation protein A
GBAA_5667	1	11	-1.615175	response regulator
GBAA_5668	-1	9	-1.627184	major facilitator family transporter protein
GBAA_5669	-2	9	-3.079815	WecB/TagA/CpsF family glycosyl transferase
GBAA_5670	-2	11	-3.362421	group 1 family glycosyl transferase
GBAA_5671	-1	12	-4.161892	hypothetical protein
GBAA_5672	0	15	-3.901124	hypothetical protein
GBAA_5673	-1	14	-3.319136	methyl-accepting chemotaxis protein
GBAA_5674	-1	10	-3.435585	hypothetical protein
GBAA_5675	-1	12	-3.210768	cytosolic long-chain acyl-CoA thioester
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
GBAA_5667	HTHFIS	53	3e-10	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 52.9 bits (127), Expect = 3e-10
Identities = 21/137 (15%), Positives = 49/137 (35%), Gaps = 12/137 (8%)

Query: 2 KILLIMEEAEERRSLAEKFIENIKNVECFEASMGTEALFIMKKHTPDFVFLNSKLMDGTG 61
IL+ ++A R L + + + S + D V + + D
Sbjct: 5 TILVADDDAAIRTVLNQAL--SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 FEYVNLLREVNCYAKFIFMGE--DIEESITAFRFQAFYYLLRPFREEDLQFLLYRMGKEQ 119
F+ + +++ + M +I A A+ YL +PF +L ++
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII------- 115

Query: 120 GEKAKSYLRKLPIEGQE 136
+A + ++ P + ++
Sbjct: 116 -GRALAEPKRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
GBAA_5668	TCRTETA	59	8e-12	Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 59.1 bits (143), Expect = 8e-12
Identities = 72/380 (18%), Positives = 142/380 (37%), Gaps = 35/380 (9%)

Query: 7 ISKRKLLGIAGLGWLFDAMDVGMLSFVMVALQKDWGLSTQEMGWIG---SINSIGMAVGA 63
+ + L + DA+ +G++ V+ L +D S G ++ ++ A
Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60

Query: 64 LVFGILSDKIGRKSVFIITLLLFSIGSGLTALTTTLAMFLVLRFLIGMGLGGELPVASTL 123
V G LSD+ GR+ V +++L ++ + A L + + R + G+ G VA
Sbjct: 61 PVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAY 119

Query: 124 VSESVEAHERGKIVVLLESFWAGGWLIAALISYF---VIPKYGWEVAMILSAIPALYALY 180
+++ + ER + + + + G + ++ P + A L+ + L +
Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCF 179

Query: 181 LRWNLPDSPRFQKVEKRPSVIENIKSVWSGEYRKATIMLWILWFSV---------VFSYY 231
L LP+S + ++ R + + S L ++F + ++ +
Sbjct: 180 L---LPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIF 236

Query: 232 GM--FLWLPSV--MVLKGFSLIKSFQYVLIMTLAQLPGYFTAAWFIERLGRKFVLVTYLI 287
G F W + + L F ++ S +I RLG + L+ +I
Sbjct: 237 GEDRFHWDATTIGISLAAFGILHSLAQAMITGP-----------VAARLGERRALMLGMI 285

Query: 288 GTACSAYLFGVAESLTVLIVAGMLLSFFNLGAWGALYAYTPEQYPTVIRGTGAGMAAAFG 347
L A + +LL+ +G AL A Q +G G AA
Sbjct: 286 ADGTGYILLAFATRGWMAFPIMVLLASGGIGM-PALQAMLSRQVDEERQGQLQGSLAALT 344

Query: 348 RIGGILGPLLVGYLVASQAS 367
+ I+GPLL + A+ +
Sbjct: 345 SLTSIVGPLLFTAIYAASIT 364



Score = 33.6 bits (77), Expect = 0.001
Identities = 29/125 (23%), Positives = 45/125 (36%), Gaps = 5/125 (4%)

Query: 274 ERLGRKFVLVTYLIGTACSAYLFGVAESLTVLIVAGMLLSFFNLGAWGALYAYTPEQYPT 333
+R GR+ VL+ L G A + A L VL + G +++ AY +
Sbjct: 68 DRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYI-GRIVAGITGATGAVAGAYIADITDG 126

Query: 334 VIRGTGAG-MAAAFGRIGGILGPLLVGYLVASQASLSLIFTIFCGSILIGVFAVIILGQE 392
R G M+A FG G + GP+L G + + L E
Sbjct: 127 DERARHFGFMSACFG-FGMVAGPVLGGLMGGFSPHAPFFAAAALN--GLNFLTGCFLLPE 183

Query: 393 TKQRE 397
+ + E
Sbjct: 184 SHKGE 188


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
GBAA_5671	GPOSANCHOR	35	5e-04	Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 35.4 bits (81), Expect = 5e-04
Identities = 9/47 (19%), Positives = 18/47 (38%)

Query: 289 EQEQSAKKEEKKKEEAKEHKPPVTQQEKEKEKEKEKVAEKKEETQAL 335
Q + K E + + E EK + + A+ + ++Q L
Sbjct: 261 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVL 307



Score = 32.3 bits (73), Expect = 0.004
Identities = 19/63 (30%), Positives = 30/63 (47%), Gaps = 9/63 (14%)

Query: 291 EQSAKKEEKKKEEAKE-----HKPPVTQQEKEKEKEKEKVAEKKEETQALIFSGRQLFEQ 345
++ K+ EK EEA K +E +K EKEK AE + + +A + L E+
Sbjct: 392 REAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEK-AELQAKLEA---EAKALKEK 447

Query: 346 MYK 348
+ K
Sbjct: 448 LAK 450


66	GBAA_0064	GBAA_0071	N	Y	N	Pathogenicity Island (unbiased-composition)
LocusTag	DNBias	CDNBias	%GCBias	Product
GBAA_0064	-2	16	3.249562	cell division protein FtsH
GBAA_0065	-2	16	2.294336	pantothenate kinase
GBAA_0066	-2	16	2.426366	heat shock protein 33
GBAA_0067	0	17	2.058884	cysteine synthase A
GBAA_0068	2	18	1.266676	para-aminobenzoate synthase component I
GBAA_0069	1	19	1.036937	para-aminobenzoate/anthranilate synthase
GBAA_0070	-1	16	1.229316	4-amino-4-deoxychorismate lyase
GBAA_0071	0	16	1.477291	dihydropteroate synthase
ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
GBAA_0064	HTHFIS	36	4e-04	FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 36.3 bits (84), Expect = 4e-04
Identities = 38/179 (21%), Positives = 57/179 (31%), Gaps = 41/179 (22%)

Query: 185 RKFAEVGARIPKGVLLVGPPGTGKTLLARAV---AGEAGVPFFS-----ISGSDFVEMFV 236
+ + +++ G GTGK L+ARA+ PF + I
Sbjct: 150 YRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELF 209

Query: 237 GV------GASRVRD-LFENAKKNAPCIIFIDEIDAVGRQRGAGLGGGHDEREQTLNQLL 289
G GA FE A+ +F+DEI + L +
Sbjct: 210 GHEKGAFTGAQTRSTGRFEQAEGGT---LFLDEIGDMPMDAQTRLLRVLQQG-------- 258

Query: 290 VEMDGFGANEGII----IIAATNRPDILDPALLRPGRFDRQITVDRPDVNGREAVLKVH 344
E G I I+AATN+ L + G F R D+ R V+ +
Sbjct: 259 -EYTTVGGRTPIRSDVRIVAATNKD--L-KQSINQGLF-------REDLYYRLNVVPLR 306


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
GBAA_0065	PF03309	379	e-136	Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 379 bits (975), Expect = e-136
Identities = 96/269 (35%), Positives = 163/269 (60%), Gaps = 12/269 (4%)

Query: 1 MIFVLDVGNTNAVLGVF----EEGELRQHWRMETDRHKTEDEYGMLVKQLLEHEGLSFED 56
M+ +DV NT+ V+G+ + ++ Q WR+ T+ T DE + + L+ G E
Sbjct: 1 MLLAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTADELALTIDGLI---GDDAER 57

Query: 57 VKGIIVSSVVPPIMFALERMCEKYFKIKP-LVVGPGIKTGLNIKYENPREVGADRIVNAV 115
+ G S VP ++ + M E+Y+ P +++ PG++TG+ + +NP+EVGADRIVN +
Sbjct: 58 LTGASGLSTVPSVLHEVRVMLEQYWPNVPHVLIEPGVRTGIPLLVDNPKEVGADRIVNCL 117

Query: 116 AGIHLYGSPLIIVDFGTATTYCYINEEKHYMGGVITPGIMISAEALYSRAAKLPRIEITK 175
A H YG+ I+VDFG++ ++ + ++GG I PG+ +S++A +R+A L R+E+T+
Sbjct: 118 AAYHKYGTAAIVVDFGSSICVDVVSAKGEFLGGAIAPGVQVSSDAAAARSAALRRVELTR 177

Query: 176 PSSVVGKNTVSAMQSGILYGYVGQVEGIVKRMKEEA----KQEPKVIATGGLAKLISEES 231
P SV+GKNTV MQ+G ++G+ G V+G+V R++++ + V+ATG A L+ +
Sbjct: 178 PRSVIGKNTVECMQAGAVFGFAGLVDGLVNRIRDDVDGFSGADVAVVATGHTAPLVLPDL 237

Query: 232 NVIDVVDPFLTLKGLYMLYERNANLQHEK 260
++ D LTL GL +++ERN Q K
Sbjct: 238 RTVEHYDRHLTLDGLRLVFERNRANQRGK 266


ORFs having significant similarity with Known Virulence factors
LocusTag	Hits	Score	E-value	Comments
GBAA_0070	RTXTOXINA	28	0.045	Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 28.4 bits (63), Expect = 0.045
Identities = 19/94 (20%), Positives = 42/94 (44%), Gaps = 21/94 (22%)

Query: 191 ILYTPSLETGILNGITRAFIIKVAEELGIKVKEGFFTKDELLSADEVFVTNSIQEIVPLN 250
IL P G + + +++ A+ELGI+V+ + K+ +VF + ++++ L
Sbjct: 50 ILLIPKDYKGQGSSLND--LVRTADELGIEVQ--YDEKNGTAITKQVF--GTAEKLIGL- 102

Query: 251 RIEERDFPGKVGMVTKRFINLYEMQREKLWSRNE 284
T+R + ++ Q +KL + +
Sbjct: 103 --------------TERGVTIFAPQLDKLLQKYQ 122


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0071  PF07201       29     0.015    Hypersensitivity response secretion protein HrpJ
		>PF07201#Hypersensitivity response secretion protein HrpJ

Length = 293

Score = 29.4 bits (66), Expect = 0.015
Identities = 10/72 (13%), Positives = 26/72 (36%), Gaps = 4/72 (5%)

Query: 146 ILMHNRDNMNYRNLMADMIADLYDSIKIAKDAGVRDENIILDPGIGFAKTPEQNLEAMRN 205
L + + +L+ + + + G R I +++ L+ +R+
Sbjct: 145 ALKGRPELAHLSHLVEQALVSMAEEQGETIVLGAR----ITPEAYRESQSGVNPLQPLRD 200

Query: 206 LEQLNVLGYPVL 217
+ V+GY +
Sbjct: 201 TYRDAVMGYQGI 212


67  GBAA_0258  GBAA_0265  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_0258  -3      11       0.474633   *************hypothetical protein
GBAA_0259  -2      12       0.539109   hypothetical protein
GBAA_0260  -2      9        0.528990   ribosomal-protein-alanine acetyltransferase
GBAA_0261  -2      11       0.782410   DNA-binding/iron metalloprotein/AP endonuclease
GBAA_0262  0       22       1.639200   ABC transporter ATP-binding protein
GBAA_0263  3       32       3.181353   redox-sensing transcriptional repressor Rex
GBAA_0264  2       31       3.806972   lipoprotein
GBAA_0265  0       27       3.588224   CAAX amino terminal protease
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0258  PF05272       30     0.005    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.005
Identities = 8/25 (32%), Positives = 12/25 (48%)

Query: 25 VRAQDVIILEGDLGAGKTTFTKGLA 49
+ ++LEG G GK+T L
Sbjct: 593 CKFDYSVVLEGTGGIGKSTLINTLV 617


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0260  SACTRNSFRASE  44     2e-08    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 44.2 bits (104), Expect = 2e-08
Identities = 21/72 (29%), Positives = 32/72 (44%)

Query: 67 ITNIAILPEYRGLKLGDALLKEVISEAKTLGVKTMTLEVRVSNEVAKQLYRKYGFQNGGI 126
I +IA+ +YR +G ALL + I AK + LE + N A Y K+ F G +
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAV 151

Query: 127 RKRYYADNQEDG 138
Y++
Sbjct: 152 DTMLYSNFPTAN 163


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0262  PF05272       30     0.024    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.024
Identities = 13/44 (29%), Positives = 18/44 (40%), Gaps = 2/44 (4%)

Query: 361 LVGPNGIGKSTLLKSIVNKLPLLHGDVSFGSNVSVGYYDQEQAN 404
L G GIGKSTL+ ++V G+ Y+Q
Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKD--SYEQIAGI 642


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0265  SSPAMPROTEIN  29     0.007    Salmonella surface presentation of antigen gene typ...
		>SSPAMPROTEIN#Salmonella surface presentation of antigen gene type M signature.

Length = 147

Score = 29.3 bits (65), Expect = 0.007
Identities = 14/30 (46%), Positives = 19/30 (63%)

Query: 3 LSSIAGLPLLLKTGLYDNRGFTREEKFQLI 32
+ IAGL LLL T +NR +REE + L+
Sbjct: 43 VEQIAGLKLLLDTLRAENRQLSREEIYALL 72


68  GBAA_0384  GBAA_0393  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_0384  -1      14       -0.709280  ABC transporter ATP-binding protein
GBAA_0385  -1      14       -0.302608  chitinase
GBAA_0386  0       19       -1.957908  hypothetical protein
GBAA_0387  1       19       -1.458612  hypothetical protein
GBAA_0388  0       14       -0.867707  hypothetical protein
GBAA_0389  0       12       -0.592180  TetR family transcriptional regulator
GBAA_0390  0       13       -0.150733  major facilitator family transporter protein
GBAA_0391  -1      10       -0.264281  DNA-binding protein
GBAA_0392  -1      15       1.528756   hypothetical protein
GBAA_0393  -1      14       1.300506   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0384  PF05272       32     0.008    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.6 bits (71), Expect = 0.008
Identities = 11/27 (40%), Positives = 14/27 (51%)

Query: 35 VGGNGIGKSTLLRILTGELIHDDGNIE 61
G GIGKSTL+ L G D + +
Sbjct: 602 EGTGGIGKSTLINTLVGLDFFSDTHFD 628


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0388  cloacin       25     0.025    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 25.4 bits (55), Expect = 0.025
Identities = 13/29 (44%), Positives = 16/29 (55%)

Query: 54 DSSHGGSHDCGGSFGGDSGGSCDGGGGGG 82
S G H GGS G+ GG+ + GGG G
Sbjct: 48 GGSGSGIHWGGGSGHGNGGGNGNSGGGSG 76


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0389  HTHTETR       84     3e-22    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 83.5 bits (206), Expect = 3e-22
Identities = 46/198 (23%), Positives = 83/198 (41%), Gaps = 8/198 (4%)

Query: 1 MRRSAEEIKKEIAYKAEILFSQKGYAATSMEEICEITERSKGSIYYHFKSKEELFLFVVK 60
++ A+E ++ I A LFSQ+G ++TS+ EI + ++G+IY+HFK K +LF + +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 QHTYDWLEKWNEK-EKLYSTSTEKLYALAEYHVEDIQQPISN----AIEEFSMSQVVSKE 115
+ E E K L + + +E I V
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMA 124

Query: 116 ILDEMLALT-RESYVMFETLIEAGIQSGEFRED-NTRDLMYIVNGLLSGL-GVLYYELDY 172
++ + ESY E ++ I++ D TR I+ G +SGL +
Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQS 184

Query: 173 KELKRIYKKAIDVLLKGM 190
+LK+ + + +LL+
Sbjct: 185 FDLKKEARDYVAILLEMY 202


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0390  TCRTETA       39     3e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.0 bits (91), Expect = 3e-05
Identities = 27/130 (20%), Positives = 50/130 (38%), Gaps = 5/130 (3%)

Query: 34 FIMERTNNDPVSVSL-LSVMEYAPIFIFSFIGGALADRWNPKRTMVAGDVLSVLSIIGIV 92
F +R + D ++ + L+ + I G +A R +R ++ G + I +
Sbjct: 236 FGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYI--L 293

Query: 93 LLLKLDYWQAIFFATLISAIVGQFSQPSSSRIFKRYVKEEQVANAIAFNQTLQSLFMIFG 152
L W + F ++ G P+ + R V EE+ L SL I G
Sbjct: 294 LAFATRGW--MAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVG 351

Query: 153 PVVGSLVYTQ 162
P++ + +Y
Sbjct: 352 PLLFTAIYAA 361



Score = 35.2 bits (81), Expect = 4e-04
Identities = 60/344 (17%), Positives = 124/344 (36%), Gaps = 26/344 (7%)

Query: 58 FIFSFIGGALADRWNPKRTMVAGDVLSVLS--IIGIVLLLKLDYWQAIFFATLISAIVGQ 115
F + + GAL+DR+ + ++ + + I+ L + ++ +++ I G
Sbjct: 57 FACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWV-----LYIGRIVAGITGA 111

Query: 116 FSQPSSSRIFKRYVKEEQVANAIAFNQTLQSLFMIFGPVVGSL---VYTQLGLFTSLYSL 172
+ + ++ A F M+ GPV+G L F +
Sbjct: 112 -TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALN 170

Query: 173 IILFLLSAIALSFLPKWVEQEQVARDSLKNDIKEGWKYVLHTKNLRMITITFTIMGLAVG 232
+ FL LP+ + E+ + +++ + + F IM L
Sbjct: 171 GLNFLTGCF---LLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQ 227

Query: 233 LTNPLEVFLVIERLGMEKEAVQYLAAADGI-GMLIGGIVAAVFASKVNPKKMFVFGMSIL 291
+ L V +R + + AA GI L ++ A+++ ++ + GM
Sbjct: 228 VPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIAD 287

Query: 292 AMSFLVEGLSTSFWITSFMRFGTGICLACVNI---VVGTLMIQLVPENMVGRVNGTILPL 348
+++ +T W M F + LA I + ++ + V E G++ G++ L
Sbjct: 288 GTGYILLAFATRGW----MAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAAL 343

Query: 349 FMGAMLIGTALAGGLKEMTSLV---IVFCIAMALILLAIGPVLR 389
++G L + + + AL LL + P LR
Sbjct: 344 TSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCL-PALR 386


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0393  TCRTETB       46     1e-07    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 46.4 bits (110), Expect = 1e-07
Identities = 30/158 (18%), Positives = 56/158 (35%), Gaps = 3/158 (1%)

Query: 264 DLGISATNLLIILFVTQIVACPFALLYGKLSTTFTGKKMLYVGIIIYIIICIYAYFLKTT 323
D + + + +YGKLS K++L GIII + + +
Sbjct: 43 DFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSF 102

Query: 324 LDFWILAMLV-ATSQGGIQALSRSYFAKLVPKESANEFFGFYNIFGKFAAIMGPVLVGVT 382
I+A + AL A+ +PKE+ + FG +GP + G+
Sbjct: 103 FSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMI 162

Query: 383 TQLTGKTNAGVLSIIVLFIIGGFLLTRVPENNTSVTPP 420
+ +L I ++ II L ++ + +
Sbjct: 163 AHYIHWSY--LLLIPMITIITVPFLMKLLKKEVRIKGH 198


69  GBAA_0552  GBAA_0559  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_0552  0       8        -1.711607  internalin
GBAA_0553  -1      9        -0.903973  acetyltransferase
GBAA_0554  -1      8        -0.745581  glycine betaine transporter
GBAA_0555  -1      9        -0.675181  collagenase
GBAA_0556  0       10       -0.435937  hypothetical protein
GBAA_0557  1       11       -0.324626  hypothetical protein
GBAA_0558  1       12       -0.983276  methyl-accepting chemotaxis protein
GBAA_0559  0       14       -0.683485  sensor histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0552  IGASERPTASE   45     1e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 45.4 bits (107), Expect = 1e-06
Identities = 43/214 (20%), Positives = 78/214 (36%), Gaps = 18/214 (8%)

Query: 833 TQNIVAKEEPKEPVEEVEGSKEEPIKEAEGSKEEPKEPAKEVEGSKEEPKEPAKEVEGSK 892
T N + + P P E ++ + EA P P++ E E K+ +K VE ++
Sbjct: 999 TPNNIQADVPSVPSNNEEIAR---VDEAPVPPPAPATPSETTETVAENSKQESKTVEKNE 1055

Query: 893 EEVKEPAKEVEGPKEEVKEPTK------EVEGPKEEVKEPTKEVEGPKEEVKEPMKEVEG 946
++ E + +E K K EV E KE V++ K
Sbjct: 1056 QDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVE 1115

Query: 947 SKEEVKGPTKEAEGSKEEVK--------EPTTEVEGSKEVKEPGKEVEGSKDAINQSAVA 998
+++ + P ++ S ++ + EP E + + +KEP + + Q A
Sbjct: 1116 TEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQ-TNTTADTEQPAKE 1174

Query: 999 QETNVNNQVGKEKVVENQNMKENKPAVTKQEESK 1032
+NV V + V N P T ++
Sbjct: 1175 TSSNVEQPVTESTTVNTGNSVVENPENTTPATTQ 1208



Score = 41.6 bits (97), Expect = 2e-05
Identities = 31/189 (16%), Positives = 59/189 (31%), Gaps = 8/189 (4%)

Query: 860 AEGSKEEPKEPAKEVEGSKEEPKEPAKEVEGSKEEVKEPAKEVEGPKEEVKEPTKEVEGP 919
+ S E E P P++ E E K+ +K VE +++ E T +
Sbjct: 1009 SVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREV 1068

Query: 920 KEEVKEPTKEVEGPKEEVKEPMKEVEGSKEEVKGPTKEAEGSKEEVKEPTTEVEGSKEVK 979
+E K K E + + E E K + K +V+ T+ +
Sbjct: 1069 AKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQ 1128

Query: 980 EPGKEVEGSKDAINQSAVAQETNVNNQVGKEKVVENQ------NMKENKPAVTKQEESKK 1033
K+ + + + A N KE + + + +Q ++
Sbjct: 1129 VSPKQEQ--SETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTES 1186

Query: 1034 SLGATGGQE 1042
+ TG
Sbjct: 1187 TTVNTGNSV 1195



Score = 39.7 bits (92), Expect = 9e-05
Identities = 38/191 (19%), Positives = 65/191 (34%), Gaps = 12/191 (6%)

Query: 859 EAEGSKEEPKEPAKEVEGSKEEPKEPAKEVEGSKEEVKEPAKEVEGPKEEVKEPTKEVEG 918
E S E KE E KE A + K +V E K E PK + K+ +
Sbjct: 1084 EVAQSGSETKETQ------TTETKETATVEKEEKAKV-ETEKTQEVPKVTSQVSPKQEQS 1136

Query: 919 PKEEVKEPTKEVEGPKEEVKEPMKEVEGSKEEVKGPTKEAEGSKEEVKEPTTEVEGSKEV 978
+ + P +KEP + + + + + + ++ V E TT G+ V
Sbjct: 1137 ETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVV 1196

Query: 979 KEPGKEVEGSKDAINQSAVAQETNVNNQVGKEKVVENQNMKENKPAVTKQEESKKSLGAT 1038
+ P + S + + ++ V N +PA T +
Sbjct: 1197 ENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNV-----EPATTSSNDRSTVALCD 1251

Query: 1039 GGQENTSTLLS 1049
NT+ +LS
Sbjct: 1252 LTSTNTNAVLS 1262



Score = 38.9 bits (90), Expect = 1e-04
Identities = 31/171 (18%), Positives = 51/171 (29%), Gaps = 8/171 (4%)

Query: 833 TQNIVAKEEPKEPVEEVEGSKEEPIKEAEGSKEEPKEPAKEVEGSKEEPKEPAKEVEGSK 892
TQ KE VE+ E +K E K E K + K+ + +P+
Sbjct: 1095 TQTTETKETAT--VEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPT 1152

Query: 893 EEVKEPAKEVEGPKEEVKEPTKEVEGPKEEVKEPTKEVEGPKEEVKEPMKEVEGSKEEVK 952
+KEP + + + + ++ V E T G E +
Sbjct: 1153 VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENP-----ENTTPATT 1207

Query: 953 GPTKEAEGSKEEVKEPTTEVEGSKEVKEPGKEVEGSKDAINQS-AVAQETN 1002
PT +E S + V EP + + + TN
Sbjct: 1208 QPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTN 1258



Score = 35.8 bits (82), Expect = 0.001
Identities = 37/195 (18%), Positives = 69/195 (35%), Gaps = 16/195 (8%)

Query: 856 PIKEAEGSKEEPKEPAKEVEGSKEEPKEPAKEVEGSKEE--------VKEPAKEVEGPKE 907
P E + + P P+ E ++ + P++ E E
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAE 1042

Query: 908 EVKEPTKEVEGPKEEVKEPTKEVEGPKEEVKEPMKEVEGSKEEVKGPTKEAEGSKEEVKE 967
K+ +K VE +++ E T + +E K +K + E + ++ E E KE
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102

Query: 968 PTTEVEGSKEVKEPGKEVEGSKDAINQSAVAQETNVNNQVGKEKVVENQNMKENKPAVT- 1026
T + K E K E K V + + + + + + +EN P V
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPK-------VTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155

Query: 1027 KQEESKKSLGATGGQ 1041
K+ +S+ + A Q
Sbjct: 1156 KEPQSQTNTTADTEQ 1170


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0553  SACTRNSFRASE  38     1e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 38.4 bits (89), Expect = 1e-05
Identities = 25/101 (24%), Positives = 41/101 (40%), Gaps = 5/101 (4%)

Query: 178 TYYEGNEIIGRLSDTNK-LFVSMKNEKLEGYVYVEVNPEFQE-ANIEFIATAENSRRKGV 235
Y + + + + + K F+ G + ++ + A IE IA A++ R+KGV
Sbjct: 49 QYEDDDMDVSYVEEEGKAAFLYYLENNCIGRI--KIRSNWNGYALIEDIAVAKDYRKKGV 106

Query: 236 GERLLQAAIQYIFSFQGMREIELCLNTNNDRAVKLYKKVGF 276
G LL AI++ + L N A Y K F
Sbjct: 107 GTALLHKAIEWAKE-NHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0555  MICOLLPTASE   755    0.0      Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 755 bits (1950), Expect = 0.0
Identities = 412/886 (46%), Positives = 569/886 (64%), Gaps = 16/886 (1%)

Query: 94 YSMADLNKMNNQELVETLGSIKWHQITDLFQFNEDAKAFYKDKGKMQVVIDELAHRGSTF 153
Y+ +LN+MN +LVE + +I + + DLF FN+ + F+ ++ ++Q +I L G T+
Sbjct: 93 YTFDELNRMNYSDLVELIKTISYENVPDLFNFNDGSYTFFSNRDRVQAIIYGLEDSGRTY 152

Query: 154 TKDDSKGIQTFTEVLRSAFYLAFYNNELSELNERSFQDKCLPALKAIAKNPNFKLGTTEQ 213
T DD KGI T E LR+ +YL FYN +LS LN +++CLPA+KAI N NF+LGT Q
Sbjct: 153 TADDDKGIPTLVEFLRAGYYLGFYNKQLSYLNTPQLKNECLPAMKAIQYNSNFRLGTKAQ 212

Query: 214 DTVVSAYGKLISNASSDVETVQYASNILKQYNDNFTTYVNDRMKGQAIYDIMQGIDYDIQ 273
D VV A G+LI NAS+D E + +L + DN Y ++ KG A++++M+GIDY
Sbjct: 213 DGVVEALGRLIGNASADPEVINNCIYVLSDFKDNIDKYGSNYSKGNAVFNLMKGIDYYTN 272

Query: 274 SYLIEARKE-ANETMWYGKVDGFINEINRIALL-NEVTQENKWLVNNGIYFASRLGKFHS 331
S + + A T +Y ++D ++ + + + +++ +N WLVNN +Y+ R+GKF
Sbjct: 273 SVIYNTKGYDAKNTEFYNRIDPYMERLESLCTIGDKLNNDNAWLVNNALYYTGRMGKFRE 332

Query: 332 NPNKGLEVVTQAMHMYPRLSEPYFVAVEQITTNYNGKDYSGNTVDLEKIRKEGKEQYLPK 391
+P+ + +AM YP LS Y A + N+ GK+ SGN +D KI+ + +E+YLPK
Sbjct: 333 DPSISQRALERAMKEYPYLSYQYIEAANDLDLNFGGKNSSGNDIDFNKIKADAREKYLPK 392

Query: 392 TYTFDDGSIVFKTGDKVSEEKIKRLYWAAKEVKAQYHRVIGNDKALEPGNADDILTIVIY 451
TYTFDDG V K GDKV+EEKIKRLYWA+KEVKAQ+ RV+ NDKALE GN DDILT+VIY
Sbjct: 393 TYTFDDGKFVVKAGDKVTEEKIKRLYWASKEVKAQFMRVVQNDKALEEGNPDDILTVVIY 452

Query: 452 NSPEEYQLNRQLYGYETNNGGIYIEETGTFFTYERTPEQSIYSLEELFRHEFTHYLQGRY 511
NSPEEY+LNR + G+ T+NGGIYIE GTFFTYERTPE+SIY+LEELFRHEFTHYLQGRY
Sbjct: 453 NSPEEYKLNRIINGFSTDNGGIYIENIGTFFTYERTPEESIYTLEELFRHEFTHYLQGRY 512

Query: 512 EVPGLFGRGDMYQNERLTWFQEGNAEFFAGSTRTNNVVPRKSIISGLSSDPASRYTAERT 571
VPG++G+G+ YQ LTW++EG AEFFAGSTRT+ + PRKS+ GL+ D +R +
Sbjct: 513 VVPGMWGQGEFYQEGVLTWYEEGTAEFFAGSTRTDGIKPRKSVTQGLAYDRNNRMSLYGV 572

Query: 572 LFAKYGSWDFYNYSFALQSYLYTHQFETFDKIQDLIRANDVKNYDAYRENLSKDLKLNEE 631
L AKYGSWDFYNY FAL +Y+Y + F+K+ + I+ NDV Y Y ++S D LN++
Sbjct: 573 LHAKYGSWDFYNYGFALSNYMYNNNMGMFNKMTNYIKNNDVSGYKDYIASMSSDYGLNDK 632

Query: 632 YQEYMQHLIDNQDKYNVPEVADDYLAEHTPKSLTAVEKEITETLPMKDAKMTKHSSQFFN 691
YQ+YM L++N D +VP V+D+Y+ H K + + +I E +KD SQFF
Sbjct: 633 YQDYMDSLLNNIDNLDVPLVSDEYVNGHEAKDINEITNDIKEVSNIKDLSSNVEKSQFFT 692

Query: 692 TFTLEGTYTGSVTKGDSEDWNAMSKKVNEALEQLAQKEWSGYKTVTAYFVNYGVNSSNQF 751
T+ + GTY G ++G+ DW M+ K+N+ L++L++K W+GYKTVTAYFVN+ V+ + +
Sbjct: 693 TYDMRGTYVGGRSQGEENDWKDMNSKLNDILKELSKKSWNGYKTVTAYFVNHKVDGNGNY 752

Query: 752 EYDVVFHG----IAKDDEENKAPTVNINGPYNGLVKEGIQFKSDGSKDEDGKIVSYLWDF 807
YDVVFHG D NK P I + +V+E I F SKDEDG+I +Y WDF
Sbjct: 753 VYDVVFHGMNTDTNTDVHVNKEPKAVIKSDSSVIVEEEINFDGTESKDEDGEIKAYEWDF 812

Query: 808 GDGSTSAEVNPVHVYESEGSYKVALIVKDDKGKESKSEITVTV----KGGSLTESEPNNR 863
GDG S E H Y G Y+V L V D+ G + + V + ESEPNN
Sbjct: 813 GDGEKSNEAKATHKYNKTGEYEVKLTVTDNNGGINTESKKIKVVEDKPVEVINESEPNND 872

Query: 864 PEEANRIG-LNTTIKGSLIGGDHTDVYTFNVASAKNIDISVLNEYGIGMTWVLHHESDMQ 922
E+AN+I N +KG+L D++D Y F+VA N+ I++ N +G+TW L+ E D+
Sbjct: 873 FEKANQIAKSNMLVKGTLSEEDYSDKYYFDVAKKGNVKITLNNLNSVGITWTLYKEGDLN 932

Query: 923 NYAAYGQANGNHI---EANFNAKPGKYYLYVYKYDNGDGTYELSVK 965
NY Y A GN + +PG+YYL VY YDN GTY ++VK
Sbjct: 933 NYVLY--ATGNDGTVLKGEKTLEPGRYYLSVYTYDNQSGTYTVNVK 976



Score = 97.9 bits (243), Expect = 8e-23
Identities = 60/251 (23%), Positives = 99/251 (39%), Gaps = 49/251 (19%)

Query: 762 KDDEENKAPTVNINGPYNGLVKEGIQFKSD----GSKDEDGKIVSYLWDF---------- 807
K E+ +N + P N K KS+ G+ E+ Y +D
Sbjct: 854 KVVEDKPVEVINESEPNNDFEKANQIAKSNMLVKGTLSEEDYSDKYYFDVAKKGNVKITL 913

Query: 808 ---------------GDGST-SAEVNPVHVYESEGSYKVA-----LIVKDDKGKESKSEI 846
GD + +G + L V +
Sbjct: 914 NNLNSVGITWTLYKEGDLNNYVLYATGNDGTVLKGEKTLEPGRYYLSVYTYDNQ--SGTY 971

Query: 847 TVTVKGG-----------SLTESEPNNRPEEANRIGLNTTIKGSLIGGDHTDVYTFNVAS 895
TV VKG ++ E E NN ++A ++ N+ I G+L D D+Y+ ++ +
Sbjct: 972 TVNVKGNLKNEVKETAKDAIKEVENNNDFDKAMKVDSNSKIVGTLSNDDLKDIYSIDIQN 1031

Query: 896 AKNIDISVLNEYGIGMTWVLHHESDMQNYAAYGQANGNHIEANFNAKPGKYYLYVYKYDN 955
+++I V N I M W+L+ D+ NY Y A+GN + PGKYYL VY+++N
Sbjct: 1032 PSDLNIVVENLDNIKMNWLLYSADDLSNYVDYANADGNKLSNTCKLNPGKYYLCVYQFEN 1091

Query: 956 -GDGTYELSVK 965
G G Y ++++
Sbjct: 1092 SGTGNYIVNLQ 1102


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0557  IGASERPTASE   41     2e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.8 bits (95), Expect = 2e-05
Identities = 30/194 (15%), Positives = 67/194 (34%), Gaps = 12/194 (6%)

Query: 201 QPQIATVKRDATIANAEREKEARIEKARAEKEAKEAEYQRDAQIAEAEKHKELKVQSYKR 260
P + T AE K+ + E++A E Q EA+ + + Q+ +
Sbjct: 1026 PPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEV 1085

Query: 261 EQEQARADADLSYELQQAKAQQGVTEEQMRVKIIEREKQIELEEKEIARREKQYDAEVKK 320
Q + E Q + ++ T E+ E + ++E E+ + + + ++
Sbjct: 1086 AQSGSETK-----ETQTTETKETATVEK------EEKAKVETEKTQEVPKVTSQVSPKQE 1134

Query: 321 KADADRYAVEQSAEAEKVKQIKKADADQYKIEAEARARAEEVRVEGLAKAEIEKAQGQAK 380
+++ + E + E + IK+ + A+ A+E
Sbjct: 1135 QSETVQPQAEPARENDPTVNIKEPQSQTNT-TADTEQPAKETSSNVEQPVTESTTVNTGN 1193

Query: 381 AEVQKAQGTAEADV 394
+ V+ + T A
Sbjct: 1194 SVVENPENTTPATT 1207


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0559  PF06580       31     0.008    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.4 bits (71), Expect = 0.008
Identities = 25/132 (18%), Positives = 51/132 (38%), Gaps = 27/132 (20%)

Query: 403 LKIEFMLDRESSLDKLSPPIESNYVVSILGNLITNAFE-AIERNEEHDKKVRMFVTDIGE 461
L+ E ++ + +D PP+ ++ L+ N + I + + K+ + T
Sbjct: 240 LQFENQIN-PAIMDVQVPPM-------LVQTLVENGIKHGIAQLPQ-GGKILLKGTKDNG 290

Query: 462 EIVIEVEDSGQGIHDEVITSIFYKGFSTKEGEKRGYGLAKVKELVEDLNG---SIAIEKG 518
+ +EVE++G E G GL V+E ++ L G I + +
Sbjct: 291 TVTLEVENTGSLALKNT-------------KESTGTGLQNVRERLQMLYGTEAQIKLSEK 337

Query: 519 DLGGALFIIALP 530
G ++ +P
Sbjct: 338 Q-GKVNAMVLIP 348


70  GBAA_0566  GBAA_0577  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_0566  -3      13       -0.750467  glycerol-3-phosphate ABC transporter ATP-binding
GBAA_0567  -2      14       -1.122438  glycerol-3-phosphate ABC transporter permease
GBAA_0568  -1      15       -0.891311  glycerol-3-phosphate ABC transporter permease
GBAA_0569  1       17       -0.758080  glycerol-3-phosphate ABC transporter
GBAA_0570  2       16       -1.101834  serine/threonine phosphatase
GBAA_0571  2       16       -1.413509  DNA-binding response regulator
GBAA_0572  2       17       -1.753500  sensor histidine kinase
GBAA_0573  0       14       -1.418318  hypothetical protein
GBAA_0574  -1      13       -0.680967  hypothetical protein
GBAA_0575  -2      13       -0.672221  methyl-accepting chemotaxis protein
GBAA_0576  -1      12       -0.494311  sensory histidine kinase DcuS
GBAA_0577  0       13       -0.006312  response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0566  PF05272       33     0.002    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.7 bits (74), Expect = 0.002
Identities = 13/32 (40%), Positives = 16/32 (50%)

Query: 44 VLVGPSGCGKSTLLRMIAGLEEISSGDLIINE 75
VL G G GKSTL+ + GL+ S I
Sbjct: 600 VLEGTGGIGKSTLINTLVGLDFFSDTHFDIGT 631


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0569  MALTOSEBP     41     9e-06    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 40.9 bits (95), Expect = 9e-06
Identities = 72/327 (22%), Positives = 119/327 (36%), Gaps = 43/327 (13%)

Query: 131 IKKDKYDTSKLEKAITNYYSVDGKMYSMPFNSSTPVLIYNKDAFAKAGLDPEKAPKTYAE 190
I DK KL + +GK+ + P LIYNKD PKT+ E
Sbjct: 105 ITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLP-------NPPKTWEE 157

Query: 191 LQEAAKKLTIKEGGNVKQYGFSMLNYGWFFEELLATQGALYVDNENGRKDAAKKAVFNGK 250
+ K+L K G + + + W L+A G ENG+ D V N
Sbjct: 158 IPALDKELKAK-GKSALMFNLQEPYFTW---PLIAADGGYAFKYENGKYDIKDVGVDNAG 213

Query: 251 EGQKVFGMLDELNKAGALGKYGASWDDIRAAFQSGQVAMYLDSSAGVRDLIDASKFNVGV 310
+ ++D + + AAF G+ AM ++ + ID SK N GV
Sbjct: 214 AKAGLTFLVDLIKNKHM--NADTDYSIAEAAFNKGETAMTINGPWAWSN-IDTSKVNYGV 270

Query: 311 SYIPYPEDSKQN---GVVIGGASLWMTNMVSEETQQGAWDFMKYLTKPDVQAKWHTATGY 367
+ +P + GV+ G + N K L K ++ T G
Sbjct: 271 TVLPTFKGQPSKPFVGVLSAGINAASPN--------------KELAKEFLENYLLTDEGL 316

Query: 368 FSINPD----AYNEPLVKEQYEKYPQLKVTVDQLQATKQSPATQGALISVFPESRDAVVK 423
++N D A +E+ K P++ T++ Q +G ++ P+
Sbjct: 317 EAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQ--------KGEIMPNIPQMSAFWYA 368

Query: 424 ALEAMYDGENSKEALDEAAKATDRAIS 450
A+ + + ++ +DEA K I+
Sbjct: 369 VRTAVINAASGRQTVDEALKDAQTRIT 395


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0571  HTHFIS        92     6e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 6e-24
Identities = 35/140 (25%), Positives = 67/140 (47%), Gaps = 2/140 (1%)

Query: 2 RLLVVEDNASLLESIVQILCDE-FEVDTALNGEDGLFLALQNIYDAILLDVMMPEMDGFE 60
+LV +D+A++ + Q L ++V N D ++ DV+MP+ + F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 61 VIQKIRDEKIETPVLFLTARDSLEDRVKGLDFGGDDYIVKPFQAPELKARI-RALLRRSG 119
++ +I+ + + PVL ++A+++ +K + G DY+ KPF EL I RAL
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 120 SLTTKQTIRYKGIELFGKDK 139
+ + G+ L G+
Sbjct: 125 RPSKLEDDSQDGMPLVGRSA 144


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0572  PF06580       39     2e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.5 bits (92), Expect = 2e-05
Identities = 34/198 (17%), Positives = 70/198 (35%), Gaps = 53/198 (26%)

Query: 234 TISKECRRLSKLVANLLL---------LARSDSNQIEMDKKIFELDKLLEEIVEPYKEIA 284
I +L L S++ Q+ + ++ +V+ Y ++A
Sbjct: 181 NIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADEL--------TVVDSYLQLA 232

Query: 285 SYQEKEMILKVEYDISFMGDRERIHQMMV------ILLDNAMKY----TNEGGHIQIDCT 334
S Q ++ L+ E I+ I + V L++N +K+ +GG I + T
Sbjct: 233 SIQFEDR-LQFENQIN-----PAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGT 286

Query: 335 QTNSSIRIRVKDDGIGVKGEDIPKLFDRFYQGDKARSASEGAGLGLSIANWIVEKHYGK- 393
+ N ++ + V++ G ++ E G GL ++ YG
Sbjct: 287 KDNGTVTLEVENTG-----------------SLALKNTKESTGTGLQNVRERLQMLYGTE 329

Query: 394 --ISVESQWGEGTCFEVI 409
I + + G+ +I
Sbjct: 330 AQIKLSEKQGKVNAMVLI 347


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0576  PF06580       38     7e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.9 bits (88), Expect = 7e-05
Identities = 20/99 (20%), Positives = 42/99 (42%), Gaps = 19/99 (19%)

Query: 434 LIDNALE-AVTNCEKK-RVEVKIQHED-ILTITVQDTGKGIQEKEIEELFTKGYSTKGDN 490
L++N ++ + + ++ +K ++ +T+ V++TG + E +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE------------S 310

Query: 491 RGYGLYLVKESIQRINGE---IHMHSLVGKGTTITIEIP 526
G GL V+E +Q + G I + GK + IP
Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNA-MVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0577  HTHFIS        68     2e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.5 bits (165), Expect = 2e-15
Identities = 29/113 (25%), Positives = 51/113 (45%), Gaps = 5/113 (4%)

Query: 7 THYLEQVGGFELVQAVNSIKSAIEVLEESRIDLVLLDIFMPEETGFELLMYIRNQEKEID 66
L + G V+ ++ + + DLV+ D+ MP+E F+LL I+ ++
Sbjct: 20 NQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDLP 77

Query: 67 IMMISAVHDMGSIKKALQYGVVDYLIKPFTFERFKEALTIYREKLTFMKEQQK 119
++++SA + + KA + G DYL KPF E + I L K +
Sbjct: 78 VLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT---ELIGIIGRALAEPKRRPS 127


71  GBAA_0583  GBAA_0587  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_0583  -2      15       -2.159898  acetyltransferase
GBAA_0584  -1      10       0.269731   sensor histidine kinase
GBAA_0585  -1      11       0.018143   DNA-binding response regulator
GBAA_0586  -1      12       0.686862   hypothetical protein
GBAA_0587  0       13       0.771153   acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0583  SACTRNSFRASE  38     5e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 38.0 bits (88), Expect = 5e-06
Identities = 20/87 (22%), Positives = 31/87 (35%), Gaps = 4/87 (4%)

Query: 59 GAFKDGKLIGVATLETKPYVKQEHKAKIGSVYVSPKARGLGAGKALIKECLELAKSLEVE 118
+ + IG + + A I + V+ R G G AL+ + +E AK
Sbjct: 69 LYYLENNCIGRIKIRSN----WNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFC 124

Query: 119 QVMLDVVVGNDGAKKLYESLGFKTFGV 145
+ML+ N A Y F V
Sbjct: 125 GLMLETQDINISACHFYAKHHFIIGAV 151


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0584  60KDINNERMP   31     0.013    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 30.7 bits (69), Expect = 0.013
Identities = 23/87 (26%), Positives = 35/87 (40%), Gaps = 9/87 (10%)

Query: 152 KKSKFITTVSP-IHTTEFQGKLYMLLKTSFLENMLLKLMKQFLIISVLTIILTTISVFIF 210
+ + V+P + T G L+ + + F LLK + F+ +II+ T V
Sbjct: 312 EIQDKMAAVAPHLDLTVDYGWLWFISQPLF---KLLKWIHSFVGNWGFSIIIITFIV--- 365

Query: 211 SRVITEPL-IKMKRATEKMSKLNKPIQ 236
R I PL + KM L IQ
Sbjct: 366 -RGIMYPLTKAQYTSMAKMRMLQPKIQ 391


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0585  HTHFIS        94     1e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 94.1 bits (234), Expect = 1e-24
Identities = 34/121 (28%), Positives = 64/121 (52%), Gaps = 1/121 (0%)

Query: 3 KILLVDDEERMLRLLDLFLSPRGYFCMKATSGLEALKLIEQKDFDIILLDVMMPNMDGWD 62
IL+ DD+ + +L+ LS GY ++ + I D D+++ DV+MP+ + +D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 TCYQIRQI-SNVPIIMLTARNQNYDMVKGLTMGADDYITKPFDEHVLVARIEAILRRTKK 121
+I++ ++P+++++A+N +K GA DY+ KPFD L+ I L K+
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 D 122

Sbjct: 125 R 125


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0587  SACTRNSFRASE  43     1e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 42.6 bits (100), Expect = 1e-07
Identities = 29/123 (23%), Positives = 42/123 (34%), Gaps = 7/123 (5%)

Query: 23 TKNPEAFSSSYEDVLKHEDPVAAMAKRLSNPDKYTLGVFKDKDLIGIATLETKPFIKQEH 82
T E FS Y K + + K + + + IG + +
Sbjct: 36 TYTEERFSKPY---FKQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSN----WNG 88

Query: 83 KAKIGSVFVSPKARGLGAGRALIKAIIENADKLHVEQLMLDVVVGNDAAKKLYESLGFQT 142
A I + V+ R G G AL+ IE A + H LML+ N +A Y F
Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148

Query: 143 YGV 145
V
Sbjct: 149 GAV 151


72  GBAA_0615  GBAA_0621  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_0615  0       13       0.518300   iron ABC transporter substrate-binding protein
GBAA_0616  1       14       1.340698   iron ABC transporter permease
GBAA_0617  0       13       1.435938   iron ABC transporter permease
GBAA_0618  0       14       0.493290   iron ABC transporter ATP-binding protein
GBAA_0619  0       17       0.174758   hypothetical protein
GBAA_0620  -1      17       0.210942   2-amino-3-ketobutyrate CoA ligase
GBAA_0621  -2      16       0.169919   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0615  FERRIBNDNGPP  97     3e-25    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 96.6 bits (240), Expect = 3e-25
Identities = 65/294 (22%), Positives = 110/294 (37%), Gaps = 46/294 (15%)

Query: 16 LAFSLLLSACGKSNTKEESKEDTKKEMIPVEHAMGKTEVPANPKRVVILTNEGTEALLEL 75
L L + NT + D P R+V L E LL L
Sbjct: 13 LTAMALSPLLWQMNTAHAAAID--------------------PNRIVALEWLPVELLLAL 52

Query: 76 GVKPVGAV-----KSWTGDPWYPHIKDKMKDVKVVGDEGQVNVETIASLKPDLIIGNKMR 130
G+ P G + W +P P V VG + N+E + +KP ++ +
Sbjct: 53 GIVPYGVADTINYRLWVSEPPLP------DSVIDVGLRTEPNLELLTEMKPSFMVWS-AG 105

Query: 131 HEKVYEQLKAIAPTV---FSETLR--GEWKDNFKFYAKALNKEKDGQKVLAAYDKRMKDL 185
+ E L IAP FS+ + + + A LN + + LA Y+ ++ +
Sbjct: 106 YGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSM 165

Query: 186 KAKLGDKVNQEISMVRFM-PGDVRIYHGDTFSGVILKELGFKRPGDQNKNDFAERNVSKE 244
K + + + + + + P + ++ ++ IL E G N + VS +
Sbjct: 166 KPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSID 225

Query: 245 RISAM-DGDVLFYFTFDKGNEKKGSELEKEYINDPLFKNLNAVKNGKAYKVDDV 297
R++A D DVL FD N K L PL++ + V+ G+ +V V
Sbjct: 226 RLAAYKDVDVLC---FDHDNSKDMDALMA----TPLWQAMPFVRAGRFQRVPAV 272


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0616  TYPE3IMSPROT  32     0.002    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 32.4 bits (74), Expect = 0.002
Identities = 24/173 (13%), Positives = 67/173 (38%), Gaps = 16/173 (9%)

Query: 100 GAAFFIVVAIVIFSVTSLSAFTWIAFL-------GAAIAAVLVFASSSLGKEGTTPLKLT 152
A + ++ ++ ++ + + + L + ++ E
Sbjct: 31 STALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFPL 90

Query: 153 LAGVAISALFSSLTQGLLVLNEKALE------EVLFWLAGSVQGRKL-EILQSVFPYLLI 205
L A+ A+ S + Q +++ +A++ + + L E L+S+ +L+
Sbjct: 91 LTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVLL 150

Query: 206 GWIASIMMTGKVNTLMMGEDVAKGLGQRTILMKSFVLLIIVLLSGGSVAVAGP 258
+ I++ G + TL+ + G+ T L+ + ++V+ + G V ++
Sbjct: 151 SILIWIIIKGNLVTLL--QLPTCGIECITPLLGQILRQLMVICTVGFVVISIA 201


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0617  BORPETOXINA   29     0.020    Bordetella pertussis toxin A subunit signature.
		>BORPETOXINA#Bordetella pertussis toxin A subunit signature.

Length = 269

Score = 29.4 bits (65), Expect = 0.020
Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 286 PHISRRLVGSLYGALLPVAAIVGAILVLAAD 316
P+ SRR V S+ G L+ +A ++GA + A+
Sbjct: 211 PYTSRRSVASIVGTLVRMAPVIGACMARQAE 241


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0621  NUCEPIMERASE  88     5e-22    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 87.9 bits (218), Expect = 5e-22
Identities = 57/241 (23%), Positives = 101/241 (41%), Gaps = 17/241 (7%)

Query: 3 KILVTGSLGQIGSELVMKLRD----VYGASNVIA---TDIRETDSEVVTSGPFE--TLDV 53
K LVTG+ G IG + +L + V G N+ +++ E++ F+ +D+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 54 TDGQKLHDIAKRNEVDTIIHLAALLSAT-AEKNPLFAWNLNMGGLVNALEAARELNCKFF 112
D + + D+ + + L+ + +NP + N+ G +N LE R +
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 113 T-PSSIGAFGPSTPKDNTPQDTIQRPTTMYGVNKVAGELLCDYYHQKFGVDTRGVRFPGL 171
SS +G + + D++ P ++Y K A EL+ Y +G+ G+RF
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRF--- 178

Query: 172 ISYVAPPGGGTTDYAVEIYYEAIKKGTYTSYIAEGTYM-DMMYMPDALQAIISLMEADPS 230
V P G D A+ + +A+ +G G D Y+ D +AII L + P
Sbjct: 179 -FTVYGP-WGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPH 236

Query: 231 K 231

Sbjct: 237 A 237


73  GBAA_0719  GBAA_0726  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_0719  -2      19       -1.271823  AcrB/AcrD/AcrF family transporter
GBAA_0720  3       15       -2.210233  *******************hypothetical protein
GBAA_0721  3       16       -1.994908  hypothetical protein
GBAA_0722  2       13       -0.485436  hypothetical protein
GBAA_0723  1       11       -0.237132  hypothetical protein
GBAA_0724  -1      11       0.639856   M23/37 family peptidase
GBAA_0725  0       14       1.364509   transcriptional activator TenA
GBAA_0726  -1      14       1.485013   ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_0719  ACRIFLAVINRP  565    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 565 bits (1458), Expect = 0.0
Identities = 222/1039 (21%), Positives = 447/1039 (43%), Gaps = 55/1039 (5%)

Query: 4 LTKFSLKNRAAVIIMVFLISILGVYSGSKLPMEFLPSIDNPAVTVTTLSPGLDAEAMTKE 63
+ F ++ ++ ++ + G + +LP+ P+I PAV+V+ PG DA+ +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 64 VTDPLEKQFRNLEHIDNITS-STHEGLSRIDIAYTSKANMKDATREVEKAINTIK--LPK 120
VT +E+ ++++ ++S S G I + + S + A +V+ + LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 DATKPIVSQLNTTMIPLAQIAIQKQNGFSKADE--KQIEKEIVPQLESIDGVANVMFFGK 178
+ + +S ++ L N + D+ + + L ++GV +V FG
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG- 179

Query: 179 STSELSIILDPNQLKDKNVTTEQILKVLQGKETSTPAG------AVTVNKEEYNLRVIGD 232
+ + I LD + L +T ++ L+ + AG A+ + ++
Sbjct: 180 AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 233 IKNVDDIKNITVAP-----HVKLQDVAQIEL-KQHYDTISHINGEEGTGLIIMKEPSKNA 286
KN ++ +T+ V+L+DVA++EL ++Y+ I+ ING+ GL I NA
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 287 VAIGKEIDKKIKDISKQYKDQFSIKLLASTHEQVENAVTSMGKEVILGAIAATLIILIFL 346
+ K I K+ ++ + + T V+ ++ + K + + L++ +FL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 347 RNFRTTLIAVVSIPLSILLTLFLLHQSNITLNTLTLGGLAVAVGRLVDDSIVVIENIFRR 406
+N R TLI +++P+ +L T +L ++NTLT+ G+ +A+G LVDD+IVV+EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 407 LQKEYFS-KDIILDATKEVAVAITSSTLTTVAVFLPIGLVSGVIGKLMLPMVLAVVYSIL 465
+ ++ K+ + ++ A+ + AVF+P+ G G + + +V ++
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 466 SSLIVALTVVPLMAFLLLKKIK---HKKPS------------SSPRYVATLKWALSHKFI 510
S++VAL + P + LLK + H+ S Y ++ L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 511 ILLTSFLLFAGSIAAYVLLPKANIKSEDDTMLSINMTFPADYALETQKQKAFDFEKKLLS 570
LL L+ AG + ++ LP + + ED + + PA E ++ L
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 571 NSDVTD-VILRMGSSAEDAQWGQTTKNNLASIFVVFK-----------KGSDIDQYIKEL 618
N + + + GQ N FV K + I + EL
Sbjct: 600 NEKANVESVFTVNGFSFS---GQA--QNAGMAFVSLKPWEERNGDENSAEAVIHRAKMEL 654

Query: 619 KKEHNAF-EPAELDYIKTSYSSSGGGNNLQFNVTATNETNLKKAATIVETKLKNMDDLSK 677
K + F P + I +++G L ++ + ++ ++ L
Sbjct: 655 GKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVS 714

Query: 678 VKTNLEDSKKEWQIHVDQTKAEQLGLTPELAAQQVAFLMKKSPIGEVSINNEKTTIMIEH 737
V+ N + ++++ VDQ KA+ LG++ Q ++ + + + + + ++
Sbjct: 715 VRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQA 774

Query: 738 KKESITKQEDILNTNILSPINGPIPLKDIATISEKQLQTEVFHKDGKETIQITAEASNED 797
+ ED+ + S +P T + +G +++I EA+
Sbjct: 775 DAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGT 834

Query: 798 LSKVSAEVNKAITDLDLPSGAKVNIAGATESMQENFTDLFKIMGIAIGIVYLIMVITFGQ 857
S + + + + LP+G + G + + + ++ I+ +V+L + +
Sbjct: 835 SSGDAMALMENLAS-KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYES 893

Query: 858 ARAPFAILFSLPLAAVGGILGLIISGTPVDVNSLIGALMLIGIVVTNAIVLIERVQQNRE 917
P +++ +PL VG +L + DV ++G L IG+ NAI+++E + E
Sbjct: 894 WSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLME 953

Query: 918 H-GMETREALLEAGSTRLRPIIMTAITTIVAMLPLLFGQSQAGSMVSKSLAVVVIGGLAV 976
G EA L A RLRPI+MT++ I+ +LPL + AGS ++ + V+GG+
Sbjct: 954 KEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAIS-NGAGSGAQNAVGIGVMGGMVS 1012

Query: 977 STVLTLVVVPVMYELLDKI 995
+T+L + VPV + ++ +
Sbjct: 1013 ATLLAIFFVPVFFVVIRRC 1031



Score = 127 bits (321), Expect = 5e-32
Identities = 96/518 (18%), Positives = 198/518 (38%), Gaps = 42/518 (8%)

Query: 509 FIILLTSFLLFAGSIAAYVLLPKANIKSEDDTMLSINMTFPADYALETQKQKAFDFEKKL 568
F +L L+ AG++A + LP A + +S++ +P A Q
Sbjct: 11 FAWVLAIILMMAGALA-ILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT--------- 60

Query: 569 LSNSDVTDVILR--MGSSAEDAQWGQTTKNNLASIFVVFKKGSDIDQYIKELKKEHNAFE 626
VT VI + G + +I + F+ G+D D +++ +
Sbjct: 61 -----VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLAT 115

Query: 627 P-----AELDYIKTSYSSSGGGNNLQFNVTATNETNLKK---AATIVETKLKNMDDLSKV 678
P + I SSS F T A+ V+ L ++ + V
Sbjct: 116 PLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDV 175

Query: 679 KTNLEDSKKEWQIHVDQTKAEQLGLTPE-----LAAQQVAFLMKKSPIGEVSINNEKTTI 733
L ++ +I +D + LTP L Q + G ++ ++
Sbjct: 176 --QLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQ-IAAGQLGGTPALPGQQLNA 232

Query: 734 MIEHKKESITKQEDILNTNILSPING-PIPLKDIATISE-KQLQTEVFHKDGKETIQIT- 790
I + E+ + +G + LKD+A + + + +GK +
Sbjct: 233 SIIAQTRFKNP-EEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGI 291

Query: 791 AEASNEDLSKVSAEVNKAITDL--DLPSGAKVNIA-GATESMQENFTDLFKIMGIAIGIV 847
A+ + + + + +L P G KV T +Q + ++ K + AI +V
Sbjct: 292 KLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLV 351

Query: 848 YLIMVITFGQARAPFAILFSLPLAAVGGILGLIISGTPVDVNSLIGALMLIGIVVTNAIV 907
+L+M + RA ++P+ +G L G ++ ++ G ++ IG++V +AIV
Sbjct: 352 FLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIV 411

Query: 908 LIERVQQ-NREHGMETREALLEAGSTRLRPIIMTAITTIVAMLPLLFGQSQAGSMVSKSL 966
++E V++ E + +EA ++ S ++ A+ +P+ F G++ +
Sbjct: 412 VVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIY-RQF 470

Query: 967 AVVVIGGLAVSTVLTLVVVPVMYELLDKIGRKRRSRRK 1004
++ ++ +A+S ++ L++ P + L K K
Sbjct: 471 SITIVSAMALSVLVALILTPALCATLLKPVSAEHHENK 508



Score = 94.1 bits (234), Expect = 1e-21
Identities = 74/516 (14%), Positives = 174/516 (33%), Gaps = 41/516 (7%)

Query: 3 RLTKFSLKNRAAVIIMVFLISILGVYSGSKLPMEFLPSIDNPAVTVT-TLSPGLDAE--- 58
L + +++ LI V +LP FLP D L G E
Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQ 587

Query: 59 -AMTKEVTDPLEKQFRNLEHIDNITSSTHEGLSRID-IAYTSKANMKDATR---EVEKAI 113
+ + L+ + N+E + + + G ++ +A+ S ++ E I
Sbjct: 588 KVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVI 647

Query: 114 NTIKLPKDATK---------PIVSQLNTTMIPLAQIAIQKQNGFSKADEKQIEKEIVPQL 164
+ K+ + P + +L T + Q G Q +++
Sbjct: 648 HRAKMELGKIRDGFVIPFNMPAIVELGTATG--FDFELIDQAGLGHDALTQARNQLLGMA 705

Query: 165 -ESIDGVANVMFFGKS-TSELSIILDPNQLKDKNVTTEQILKVLQGKETSTPAGAVTVNK 222
+ + +V G T++ + +D + + V+ I + + T
Sbjct: 706 AQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRG 765

Query: 223 EEYNLRVIGD---IKNVDDIKNITVAPH----VKLQDVAQIELKQHYDTISHINGEEGTG 275
L V D +D+ + V V + NG
Sbjct: 766 RVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSME 825

Query: 276 LIIMKEPSKNAVAIGKEIDKKIKDISKQYKDQFSIKLLASTHEQVENAVTSMGKEVILGA 335
+ P ++ + +++++ + ++++ S + L A
Sbjct: 826 IQGEAAPGTSS----GDAMALMENLASKLPAGIGYDWTGMSYQERL----SGNQAPALVA 877

Query: 336 IAATLIILIF---LRNFRTTLIAVVSIPLSILLTLFLLHQSNITLNTLTLGGLAVAVGRL 392
I+ ++ L ++ + ++ +PL I+ L N + + GL +G
Sbjct: 878 ISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLS 937

Query: 393 VDDSIVVIENIFRRLQKEYFS-KDIILDATKEVAVAITSSTLTTVAVFLPIGLVSGVIGK 451
++I+++E ++KE + L A + I ++L + LP+ + +G
Sbjct: 938 AKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSG 997

Query: 452 LMLPMVLAVVYSILSSLIVALTVVPLMAFLLLKKIK 487
+ + V+ ++S+ ++A+ VP+ ++ + K
Sbjct: 998 AQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRCFK 1033


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
GBAA_0721  MICOLLPTASE  32     0.004    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 32.0 bits (72), Expect = 0.004
Identities = 14/68 (20%), Positives = 28/68 (41%), Gaps = 1/68 (1%)

Query: 50 KEFYKEENLAAFIVYGM-NKAKNLPQFHKDEIPTLVRILRLCQEIGWYEEANTFMVNQGL 108
F+ + I+YG+ + + IPTLV LR +G+Y + +++ L
Sbjct: 129 YTFFSNRDRVQAIIYGLEDSGRTYTADDDKGIPTLVEFLRAGYYLGFYNKQLSYLNTPQL 188

Query: 109 AEFVHTSL 116
++
Sbjct: 189 KNECLPAM 196


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
GBAA_0724  RTXTOXIND  32     0.004    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.7 bits (72), Expect = 0.004
Identities = 17/71 (23%), Positives = 27/71 (38%), Gaps = 14/71 (19%)

Query: 284 AAQGNVSIQAAAAGKVVKSYYSASYGNVVFIAHQINGKLYTTVYAHMKDRTVQAGDQVQA 343
+ G V I A A GK+ S G I N + K+ V+ G+ V+
Sbjct: 75 SVLGQVEIVATANGKLTHS------GRSKEIKPIENSIV--------KEIIVKEGESVRK 120

Query: 344 GQLVGHMGNTG 354
G ++ + G
Sbjct: 121 GDVLLKLTALG 131


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_0726  PF05272  30     0.014    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.014
Identities = 11/32 (34%), Positives = 15/32 (46%)

Query: 39 GPSGCGKSTLFRLITGLEEASTGQIELTETKS 70
G G GKSTL + GL+ S ++ K
Sbjct: 603 GTGGIGKSTLINTLVGLDFFSDTHFDIGTGKD 634


74  GBAA_1230  GBAA_1233  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_1230  -2      16       0.609533   dTDP-glucose 4,6-dehydratase
GBAA_1231  0       17       1.033288   dTDP-4-dehydrorhamnose reductase
GBAA_1232  4       18       1.682127   enoyl-ACP reductase
GBAA_1233  5       18       1.538260   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1230  NUCEPIMERASE  188    1e-59    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 188 bits (478), Expect = 1e-59
Identities = 75/332 (22%), Positives = 141/332 (42%), Gaps = 26/332 (7%)

Query: 1 MNILVTGGAGFIGSNFVHYMLQSYETYKIINFDALT--YSGNLNNVK-SIQDHPNYYFVK 57
M LVTG AGFIG + +L+ ++++ D L Y +L + + P + F K
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLE--AGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHK 58

Query: 58 GEIQNGELLEHVIKERDVQVIVNFAAESHVDRSIENPIPFYDTNVIGTVTLLELVKKYPH 117
++ + E + + + + V S+ENP + D+N+ G + +LE +
Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 118 IKLVQVSTDEVYGSLGKTGRFTEETPLA-PNSPYSSSKASADMIALAYYKTYQLPVIVTR 176
L+ S+ VYG L + F+ + + P S Y+++K + +++A Y Y LP R
Sbjct: 119 QHLLYASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLR 177

Query: 177 CSNNYGPYQYPEKLIPLMVTNALEGKKLPLYGDGLNVRDWLHVTDHCSAIDVVLHKGRV- 235
YGP+ P+ + LEGK + +Y G RD+ ++ D AI +
Sbjct: 178 FFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHA 237

Query: 236 -----------------GEVYNIGGNNEKTNVEVVEQIITLLGKTKKDIEYVTDRLGHDR 278
VYNIG ++ ++ ++ + LG + + + G
Sbjct: 238 DTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGI-EAKKNMLPLQPGDVL 296

Query: 279 RYAINAEKMKNEFDWEPKYTFEQGLQETVQWY 310
+ + + + + P+ T + G++ V WY
Sbjct: 297 ETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1231  NUCEPIMERASE  44     4e-07    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 43.6 bits (103), Expect = 4e-07
Identities = 36/200 (18%), Positives = 70/200 (35%), Gaps = 38/200 (19%)

Query: 4 RVIITGANGQLGKQLQEEL--NPEE----------YDIYPFDKKL------------LDI 39
+ ++TGA G +G + + L + YD+ +L +D+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 40 TNISQVQQVVQEIRPHIIIHCAAYTKVDQAEKERDLAYV-INAIGARNVAVASQLVGAK- 97
+ + + + V + E AY N G N+ + +
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYS-LENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 98 LVYISTDYVFQGDRPEGYDEFHNPA-PINIYGASKYAGEQFVKELHNKYFIVRTSW---- 152
L+Y S+ V+ +R + + P+++Y A+K A E + Y + T
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 153 LYGKYGN------NFVKTMI 166
+YG +G F K M+
Sbjct: 181 VYGPWGRPDMALFKFTKAML 200


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1232  DHBDHDRGNASE  57     7e-12    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 57.0 bits (137), Expect = 7e-12
Identities = 60/259 (23%), Positives = 105/259 (40%), Gaps = 19/259 (7%)

Query: 4 LQGKTFVVMGVANQRSIAWGIARSLHNAGAKLI-FTYAGERLERNVRELADTLEGQESLV 62
++GK + G A + I +AR+L + GA + Y E+LE+ V L E + +
Sbjct: 6 IEGKIAFITGAA--QGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSL--KAEARHAEA 61

Query: 63 LPCDVTNDEELTACFETIKQEVGTIHGVAHCIAFANRDDLKGEFVDTSRDGFLLAQNISA 122
P DV + + I++E+G I + + G S + + ++++
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLR----PGLIHSLSDEEWEATFSVNS 117

Query: 123 FSLTAVAREAKKVMT--EGGNILTLTYLGGERVVKNYNVMGVAKASLEASVKYLANDLGQ 180
+ +R K M G+I+T+ + +KA+ K L +L +
Sbjct: 118 TGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAE 177

Query: 181 HGIRVNAISAGPIRT-----LSAKGVGDFNSILREIEE---RAPLRRTTTQEEVGDTAVF 232
+ IR N +S G T L A G I +E PL++ ++ D +F
Sbjct: 178 YNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLF 237

Query: 233 LFSDLARGVTGENIHVDSG 251
L S A +T N+ VD G
Sbjct: 238 LVSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
GBAA_1233  IGASERPTASE  30     0.009    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.6 bits (66), Expect = 0.009
Identities = 10/53 (18%), Positives = 28/53 (52%)

Query: 48 DRKEESNRNENVVSSAVEEVIEQEEQQQEQEQEQEEQVEEKTEEEEQVQEQQE 100
++ +SN N ++ V + + ++ Q E ++ VE++ + + + ++ QE
Sbjct: 1069 AKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQE 1121



Score = 29.3 bits (65), Expect = 0.009
Identities = 23/80 (28%), Positives = 34/80 (42%), Gaps = 6/80 (7%)

Query: 29 LELAAPKIKRIILTNFENEDRKEESNRNENVVSSAVEEVIEQEEQQQEQEQEQEE----- 83
+ A +K TN E E+ + + V ++E+ + E E+ QE
Sbjct: 1069 AKEAKSNVKANTQTN-EVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTS 1127

Query: 84 QVEEKTEEEEQVQEQQEPVR 103
QV K E+ E VQ Q EP R
Sbjct: 1128 QVSPKQEQSETVQPQAEPAR 1147


75  GBAA_1312  GBAA_1318  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_1312  -2      12       0.810560   DNA-binding response regulator
GBAA_1313  -1      11       0.334398   sensor histidine kinase
GBAA_1314  0       10       0.669751   GntR family transcriptional regulator
GBAA_1315  0       13       0.789428   hypothetical protein
GBAA_1316  0       13       0.182060   (Fe-S)-binding protein
GBAA_1317  0       15       -0.548068  hypothetical protein
GBAA_1318  -1      17       -0.456929  late competence protein comC
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
GBAA_1312  HTHFIS  112    6e-31    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 112 bits (281), Expect = 6e-31
Identities = 33/130 (25%), Positives = 61/130 (46%), Gaps = 1/130 (0%)

Query: 1 MSKYRVLVVDDESDMRQLVGMYLDNFGYEWGEAENGKEALKKLETDHYDFVVLDIMMPEM 60
M+ +LV DD++ +R ++ L GY+ N + + D VV D++MP+
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGLSVCKEIRKT-SDVPIIFLTAKGEEWNRVNGLRMGADDYIVKPFSPGELIARMEAVLR 119
+ + I+K D+P++ ++A+ + GA DY+ KPF ELI + L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RYTKQEQQEE 129
++ + E
Sbjct: 121 EPKRRPSKLE 130


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_1313  PF06580  39     2e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.1 bits (91), Expect = 2e-05
Identities = 30/188 (15%), Positives = 73/188 (38%), Gaps = 32/188 (17%)

Query: 275 EKVTQLIHKEADRMQRLVHDLLDL--AQLEGEHFPLQKQPIVFSQ---LIEDVLDTYEIK 329
+ LI ++ + + ++ L +L L + + + +++ L I+
Sbjct: 180 NNIRALILEDPTKAREMLTSLSELMRYSLRYS----NARQVSLADELTVVDSYLQLASIQ 235

Query: 330 FIEKKIRISTNLNPEII-VMIDEDRMQQVLHNVLDNAIRYTNQNGDIMITLRQIDDYCEL 388
F E +++ +NP I+ V + +Q ++ N + + I Q G I++ + + L
Sbjct: 236 F-EDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTL 294

Query: 389 SIKDTGIGIDTEHLENLGERFYRVDKARSRQHGGTGLGLAIVRQ-IVHIHDGQW--QIES 445
+++TG A TG GL VR+ + ++ + ++
Sbjct: 295 EVENTG------------------SLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSE 336

Query: 446 EKGNGTTV 453
++G +
Sbjct: 337 KQGKVNAM 344


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1316  ANTHRAXTOXNA  32     0.007    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 31.6 bits (71), Expect = 0.007
Identities = 22/87 (25%), Positives = 35/87 (40%), Gaps = 7/87 (8%)

Query: 88 KTKEEAAKYIQDVAKKKQAKKVVKSKSMVTEEISMNHALEEIGCEVLE--SDLGEYILQV 145
KT++E K + K + K T+++ L++I +VLE S+LG I
Sbjct: 53 KTEKEKFKDSINNLVKTEFTNETLDKIQQTQDL-----LKKIPKDVLEIYSELGGEIYFT 107

Query: 146 DNDPPSHIIAPALHKNRTQIRDVFKEK 172
D D H L + + EK
Sbjct: 108 DIDLVEHKELQDLSEEEKNSMNSRGEK 134


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1318  PREPILNPTASE  133    7e-40    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 133 bits (335), Expect = 7e-40
Identities = 64/264 (24%), Positives = 122/264 (46%), Gaps = 35/264 (13%)

Query: 4 YVYALLVGMVFGSFFMLIAMRIPL------------------------GESIIIPRSHCH 39
+ L ++ GSF ++ R+P+ ++++PRS C
Sbjct: 16 FSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEPPYNLMVPRSCCP 75

Query: 40 YCKYVLKPKELIPIISFCIQRGRCTNCKRKISILYVIFELVTGIICLLTVYMIGVERELI 99
+C + + E IP++S+ RGRC C+ IS Y + EL+T ++ + + +
Sbjct: 76 HCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVAVAMTLAPGWGTL 135

Query: 100 IILSLFSLLLIISVTDYIYMLIPNRI---LAWFSCLLILECVFVPLVTWTESIVGSGVIF 156
L L +L+ ++ D ML+P+++ L W L L FV L ++++G+ +
Sbjct: 136 AALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLG---DAVIGAMAGY 192

Query: 157 ILLYCMQKIY-----PEGLGGGDIKLLSLLGFIAGLKGVFMILFLSSFFSLCFFGAGLVL 211
++L+ + + EG+G GD KLL+ LG G + + ++L LSS ++L
Sbjct: 193 LVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILL 252

Query: 212 KRMKMRTQIPFGPFISLGAICYML 235
+ IPFGP++++ +L
Sbjct: 253 RNHHQSKPIPFGPYLAIAGWIALL 276


76  GBAA_1659  GBAA_1663  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_1659  2       9        -1.598670  flagellar motor protein MotS
GBAA_1660  2       7        -1.950082  chemotaxis response regulator
GBAA_1662  3       10       -2.056279  flagellar motor switch protein
GBAA_1663  5       13       -3.381643  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
GBAA_1659  OMPADOMAIN  63     6e-14    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 63.4 bits (154), Expect = 6e-14
Identities = 30/127 (23%), Positives = 56/127 (44%), Gaps = 17/127 (13%)

Query: 110 SVVIVDNLIFDTGDANVKPEAKEIISQLVGFFQSVPNP---IVVEGHTDSRPIHNDKFPS 166
+ +++F+ A +KPE + + QL ++ +VV G+TD +D +
Sbjct: 214 HFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRI--GSDAY-- 269

Query: 167 NWELSSARAANMIHHLIEVYNVDDKRLAAVGYADTKPVVPN---------DSPQNWEKNR 217
N LS RA +++ +LI + +++A G ++ PV N +R
Sbjct: 270 NQGLSERRAQSVVDYLIS-KGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDR 328

Query: 218 RVVIYIK 224
RV I +K
Sbjct: 329 RVEIEVK 335


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
GBAA_1660  HTHFIS  83     9e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.6 bits (204), Expect = 9e-22
Identities = 28/112 (25%), Positives = 46/112 (41%), Gaps = 2/112 (1%)

Query: 4 KILVVDDAMFMRTMIKNLLKSNSEFEVIGEAENGVEAIQKYKELQPDIVTLDITMPEMDG 63
ILV DD +RT++ L + + N + D+V D+ MP+ +
Sbjct: 5 TILVADDDAAIRTVLNQAL--SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 64 LEALKEIIKIDASAKVVICSAMGQQGMVLDAIKGGAKDFIVKPFQADRVIEA 115
+ L I K V++ SA + A + GA D++ KPF +I
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1662  FLGMOTORFLIN  56     1e-11    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 55.7 bits (134), Expect = 1e-11
Identities = 23/71 (32%), Positives = 40/71 (56%)

Query: 473 DTSILQNVEMNVKFVFGSTVKTIQDILSLQENEAVVLDEDIDEPIRIYVNDVLVAYGELV 532
D ++ ++ + + G T TI+++L L + V LD EP+ I +N L+A GE+V
Sbjct: 53 DIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVV 112

Query: 533 NVDGFFGVKVT 543
V +GV++T
Sbjct: 113 VVADKYGVRIT 123


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
GBAA_1663  IGASERPTASE  34     0.002    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.9 bits (77), Expect = 0.002
Identities = 18/126 (14%), Positives = 51/126 (40%), Gaps = 1/126 (0%)

Query: 301 EQKTEEDKKIEEPENEDKLENKLEDKKVTEKQEDSKVEISLPEEKTPVVQIPKKEEKVND 360
+ EE K+E + ++ + + E+ E + + E P V I + + + N
Sbjct: 1105 TVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNT 1164

Query: 361 LIKEPLKEKEKITYVIKEPLTDNKEVNKTKAQKDKDNNNQVISKKKEKKEEPEEKKEAKS 420
+ ++ + +++P+T++ VN + + N + + E K + +
Sbjct: 1165 TADTE-QPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRH 1223

Query: 421 EQGIQA 426
+ +++
Sbjct: 1224 RRSVRS 1229



Score = 29.3 bits (65), Expect = 0.045
Identities = 26/109 (23%), Positives = 44/109 (40%), Gaps = 7/109 (6%)

Query: 22 LQSKAEEQNVP-EQNINEV-NVQEENKEVQEQLEQVEMKQDKEEQQEAKNEQETEKKIET 79
+K + NV NEV E KE Q + + E++++AK ETEK E
Sbjct: 1067 EVAKEAKSNVKANTQTNEVAQSGSETKETQTT--ETKETATVEKEEKAK--VETEKTQEV 1122

Query: 80 DQGVITVNKPELKVGEEVLVTIEPKEKNVQSIKGILRLPKNGDQYEQER 128
+ V + P+ + E V EP +N ++ + + E+
Sbjct: 1123 PK-VTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQ 1170


77  GBAA_1669  GBAA_1686  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_1669  1       18       -1.994290  flagellar hook-associated protein FlgK
GBAA_1671  2       19       -2.611333  flagellar capping protein
GBAA_1672  3       21       -1.868891  flagellar protein FliS
GBAA_1673  2       17       -1.346412  hypothetical protein
GBAA_1674  -1      14       -1.023747  flagellar basal-body rod protein FlgB
GBAA_1675  0       12       -0.437477  flagellar basal body rod protein FlgC
GBAA_1676  -1      11       -0.128997  flagellar hook-basal body protein FliE
GBAA_1679  0       11       -0.072146  flagellar motor switch protein G
GBAA_1680  0       11       -0.429506  flagellar assembly protein H
GBAA_1681  -1      12       -0.744169  flagellum-specific ATP synthase
GBAA_1685  -2      13       -1.746891  flagellar basal body rod modification protein
GBAA_1686  -2      14       -2.628189  flagellar hook protein FlgE
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
GBAA_1669  FLGHOOKAP1  104    3e-26    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 104 bits (260), Expect = 3e-26
Identities = 72/249 (28%), Positives = 112/249 (44%), Gaps = 14/249 (5%)

Query: 4 SDYNTPLSGLLAAQMGLQTTKQNLSNIHTPGYVRQMVNYGSAGASQGYSPEQKIGYGVQT 63
S N +SGL AAQ L T N+S+ + GY RQ A ++ G +G GV
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLG--AGGWVGNGVYV 59

Query: 64 LGVDRITDEVKTKQFNDQLSQLSYYNYMNSTLSRVESMVGTTGKNSLSSLMDGFFNAFRE 123
GV R D T Q +Q S +S++++M+ T+ SL++ M FF + +
Sbjct: 60 SGVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTS-SLATQMQDFFTSLQT 118

Query: 124 VAKNPEQPNYYDTLISETGKFTSQVNRLAKSLDTAEAQTTEDIEAHVNEFNRLAGSLAEA 183
+ N E P LI ++ +Q + L + Q I A V++ N A +A
Sbjct: 119 LVSNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASL 178

Query: 184 NKKI----GQAGTQVPNQLLDERDRIITEMSKYANIEVS---YESMNPNIASVRMNGVLT 236
N +I G PN LLD+RD++++E+++ +EVS + N +A NG
Sbjct: 179 NDQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMA----NGYSL 234

Query: 237 VNGQDTYPL 245
V G L
Sbjct: 235 VQGSTARQL 243



Score = 54.2 bits (130), Expect = 6e-10
Identities = 19/51 (37%), Positives = 35/51 (68%)

Query: 380 LLEGIQQEKMGIEGVNMEEEMVNLMAFQKYFVANSKAITTMNEVFDSLFSI 430
++ + ++ I GVN++EE NL FQ+Y++AN++ + T N +FD+L +I
Sbjct: 495 VVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
GBAA_1674  FLGHOOKAP1  31     0.001    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 31.1 bits (70), Expect = 0.001
Identities = 10/28 (35%), Positives = 15/28 (53%)

Query: 20 NTVSSNIANANTPGYKAQDVTFAEQMNK 47
NT S+NI++ N GY Q A+ +
Sbjct: 19 NTASNNISSYNVAGYTRQTTIMAQANST 46


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
GBAA_1675  FLGHOOKAP1  33     3e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 33.0 bits (75), Expect = 3e-04
Identities = 19/75 (25%), Positives = 32/75 (42%), Gaps = 7/75 (9%)

Query: 5 INASGSGLTTARKWMEVTSNNIVNANTTAAPGADLYERRSVVLESNNSFANMLDGSPTNG 64
IN + SGL A+ + SNNI + N Y R++ ++ NS G NG
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAG------YTRQTTIMAQANSTLGA-GGWVGNG 56

Query: 65 VKIKSIEADKTENLV 79
V + ++ + +
Sbjct: 57 VYVSGVQREYDAFIT 71



Score = 28.0 bits (62), Expect = 0.013
Identities = 10/38 (26%), Positives = 17/38 (44%)

Query: 97 NIDVTAEMTNVMVAQKMYEANTSVLNANKKMLDKDLEI 134
+++ E N+ Q+ Y AN VL + D + I
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
GBAA_1676  FLGHOOKFLIE  35     5e-06    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 35.4 bits (81), Expect = 5e-06
Identities = 18/77 (23%), Positives = 36/77 (46%), Gaps = 1/77 (1%)

Query: 24 SQTSVVEGKKFIDLLEDMNQTQNNAQTAVYDLLTKGVG-ETHDVLIQQKKAESQMKTAAL 82
Q ++ + L+ ++ TQ A+T G +DV+ +KA M+
Sbjct: 27 PQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQ 86

Query: 83 VRDNLIENYKSLINMQI 99
VR+ L+ Y+ +++MQ+
Sbjct: 87 VRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1679  FLGMOTORFLIG  200    4e-64    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 200 bits (511), Expect = 4e-64
Identities = 116/336 (34%), Positives = 197/336 (58%), Gaps = 6/336 (1%)

Query: 2 LDEISSKEKAAILIRTLEEGVAAKVIEYMTAKEKEVLLREIAKFRVYKSETLENVLGEFL 61
+ ++ K+KAAIL+ ++ +++KV +Y++ +E E L EIAK SE +NVL EF
Sbjct: 12 VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK 71

Query: 62 YELNVKELNLVTPDKEYIRRIF-KNMPEDELEKLLEDLWYN-KDNPFEFLNSLTDLEPLL 119
EL + + + +Y R + K++ + ++ +L + PFEF+ D +L
Sbjct: 72 -ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRA-DPANIL 129

Query: 120 TVLNDESPQTIAIIASYIKPQLASQLIERLPDHKRVETVMGIAKLEQVDGELINQIGDLL 179
+ E PQTIA+I SY+ PQ AS ++ LP + IA +++ E++ ++ +L
Sbjct: 130 NFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVL 189

Query: 180 KSKLNNMAFNAINKTDGLKTIVNILNNVSRGVEKTVFQKLDEMDYELSEKIKENMFVFED 239
+ KL +++ G+ +V I+N R EK + + L+E D EL+E+IK+ MFVFED
Sbjct: 190 EKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFED 249

Query: 240 LLGLEDLALRRVLEEITDNGVIAKALKIAKEEIKEKLFTCMSSNRKEMILEELDGLGPLK 299
++ L+D +++RVL EI D +AKALK ++EK+F MS M+ E+++ LGP +
Sbjct: 250 IVLLDDRSIQRVLREI-DGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTR 308

Query: 300 MTDAEKAQQTITDTVKKLEKEGRIIVQRG-EDDVLI 334
D E++QQ I ++KLE++G I++ RG E+DVL+
Sbjct: 309 RKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVLV 344


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
GBAA_1686  FLGHOOKAP1  44     1e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.8 bits (103), Expect = 1e-06
Identities = 15/36 (41%), Positives = 24/36 (66%)

Query: 5 LYTSITGMNAAQNALSVTSNNIANAQTVGYKKQKAI 40
+ +++G+NAAQ AL+ SNNI++ GY +Q I
Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI 39



Score = 37.6 bits (87), Expect = 9e-05
Identities = 10/39 (25%), Positives = 26/39 (66%)

Query: 397 SNVDLSVEFVDLMLYQRGFQGNAKVIKVSDEVLNEVVNL 435
S V+L E+ +L +Q+ + NA+V++ ++ + + ++N+
Sbjct: 507 SGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


78  GBAA_1698  GBAA_1720  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_1698  -1      13       -2.021914  TPR/glycosyl transferase domain-containing
GBAA_1701  3       14       0.597573   hypothetical protein
GBAA_1703  3       18       1.302330   hypothetical protein
GBAA_1704  3       16       1.326116   hypothetical protein
GBAA_1706  2       18       1.090407   flagellin
GBAA_1707  4       25       0.812815   Slt family transglycosylase
GBAA_1710  3       24       -0.388765  flagellar motor switch protein
GBAA_1711  4       20       -0.248808  hypothetical protein
GBAA_1712  3       17       -0.361893  flagellar biosynthesis protein FliP
GBAA_1713  2       14       -0.394593  flagellar biosynthesis protein FliQ
GBAA_1714  1       12       -0.287071  flagellar biosynthesis protein FliR
GBAA_1715  0       9        0.068243   flagellar biosynthetic protein FlhB
GBAA_1716  1       9        0.416021   flagellar biosynthesis protein FlhA
GBAA_1719  0       10       0.201918   flagellar basal body rod protein FlgG
GBAA_1720  -1      12       -0.288201  alanyl-tRNA synthetase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1698  SYCDCHAPRONE  41     2e-06    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 41.5 bits (97), Expect = 2e-06
Identities = 23/101 (22%), Positives = 34/101 (33%), Gaps = 11/101 (10%)

Query: 444 DNEQIQLALIREDIRQLINQGMISQAKYLISEYEKTFPITSEIYQMKGIVAFSENNYLDA 503
D ++ QLA+ + G IS T E + Y DA
Sbjct: 7 DTQEYQLAME-----SFLKGGGTIAMLNEISSD------TLEQLYSLAFNQYQSGKYEDA 55

Query: 504 ENFFKLALKLYHFDVDALFNLGYLYEVQEQYDRAVQNYNLA 544
F+ L H+D LG + QYD A+ +Y+
Sbjct: 56 HKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYG 96


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
GBAA_1706  FLAGELLIN  125    9e-35    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 125 bits (314), Expect = 9e-35
Identities = 76/282 (26%), Positives = 130/282 (46%), Gaps = 18/282 (6%)

Query: 1 MRINTNINSMRTQEYMRQNQDKMNVSMNRLSSGKRINSAADDAAGLAIATRMRARQSGLE 60
INTN S+ TQ + ++Q ++ ++ RLSSG RINSA DDAAG AIA R + GL
Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61

Query: 61 KASQNTQDGMSLIRTAESAMNSVSNILTRMRDIAVQSSNGTNTAENQSALQKEFAELQEQ 120
+AS+N DG+S+ +T E A+N ++N L R+R+++VQ++NGTN+ + ++Q E + E+
Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121

Query: 121 IDYIAKNTEFNDKNLLAGTGAVTIGSTSISGAEISIETLDSSATNQQITIKLANTTAEKL 180
ID ++ T+FN +L+ + I + G + ITI L + L
Sbjct: 122 IDRVSNQTQFNGVKVLSQDNQMKIQVGANDG--------------ETITIDLQKIDVKSL 167

Query: 181 GIDATTSN----ISISGAASALAAISALNTALNTVAGNRATLGATLNRLDRNVENLNNQA 236
G+D N ++ S+ ++ +T R + + D + ++
Sbjct: 168 GLDGFNVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKV 227

Query: 237 TNMASAASQIEDADMAKEMSEMTKFKILNEAGISMLSQANQT 278
A+ D ++ K + A
Sbjct: 228 YVNAANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAI 269



Score = 86.3 bits (213), Expect = 4e-21
Identities = 62/259 (23%), Positives = 107/259 (41%), Gaps = 7/259 (2%)

Query: 36 INSAADDAAGLAIATRMRARQSGLEKASQNTQDGMSLIRTAESAMNSVSNILTRMRDIAV 95
+ AG A A + G ++ G++ ++ + + T + V
Sbjct: 249 LFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKV 308

Query: 96 QSSNGTNTAENQSALQKEFAELQEQID-YIAKNTEFNDKN------LLAGTGAVTIGSTS 148
+ TA + + + F+DK L + S
Sbjct: 309 TLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGES 368

Query: 149 ISGAEISIETLDSSATNQQITIKLANTTAEKLGIDATTSNISISGAASALAAISALNTAL 208
+ T +++ + K G+ + + + S ++++++AL
Sbjct: 369 KITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSAL 428

Query: 209 NTVAGNRATLGATLNRLDRNVENLNNQATNMASAASQIEDADMAKEMSEMTKFKILNEAG 268
+ V R++LGA NR D + NL N TN+ SA S+IEDAD A E+S M+K +IL +AG
Sbjct: 429 SKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAG 488

Query: 269 ISMLSQANQTPQMVSKLLQ 287
S+L+QANQ PQ V LL+
Sbjct: 489 TSVLAQANQVPQNVLSLLR 507


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_1707  PF06580  29     0.021    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.7 bits (64), Expect = 0.021
Identities = 8/42 (19%), Positives = 20/42 (47%), Gaps = 1/42 (2%)

Query: 122 LTKKY-NIQKIRSSNEGKYEDIIDRVSHTYGIPKTLIQKMIE 162
+ Y + I+ + ++E+ I+ +P L+Q ++E
Sbjct: 224 VVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVE 265


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1710  FLGMOTORFLIN  59     2e-14    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 58.8 bits (142), Expect = 2e-14
Identities = 22/94 (23%), Positives = 51/94 (54%)

Query: 13 LEDFAGKRNEASKAHIDTVSDISIELGVKLGKASITLGDVKQLKVGDVLEVEKNLGHKVD 72
+ G + ID + DI ++L V+LG+ +T+ ++ +L G V+ ++ G +D
Sbjct: 39 FQQLGGGDVSGAMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLD 98

Query: 73 VYLSNMKVGIGEAIVMDEKFGIIISEIEADKKQA 106
+ ++ + GE +V+ +K+G+ I++I ++
Sbjct: 99 ILINGYLIAQGEVVVVADKYGVRITDIITPSERM 132


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1712  FLGBIOSNFLIP  164    2e-52    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 164 bits (417), Expect = 2e-52
Identities = 75/239 (31%), Positives = 136/239 (56%), Gaps = 2/239 (0%)

Query: 14 FVFSIVFSIIFVNPAYAAQNGFINFENGKEFTSN--SSVQLFALVTLLSLSSSIVLLFTH 71
+ + + P AQ I + + VQ +T L+ +I+L+ T
Sbjct: 4 LLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMMTS 63

Query: 72 FTYFMIVLGITRQGLGVMNLPPNQVLVGLALFLSLFTMQPVLGQLKSDVWDPMTKEKITV 131
FT +IV G+ R LG + PPNQVL+GLALFL+ F M PV+ ++ D + P ++EKI++
Sbjct: 64 FTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKISM 123

Query: 132 SQAAETTAPIMKEYMSKHTYKHDLKMMLKVRGEELPKDLKDLSLFTLVPSFTLTQIQKGL 191
+A E A ++E+M + T + DL + ++ + + + + L+P++ ++++
Sbjct: 124 QEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKTAF 183

Query: 192 LTGMFIYLAFVFIDLIISTLLMYLGMMMVPPMILSLPFKILIFVYLGGYTKIVDIMFKT 250
G I++ F+ IDL+I+++LM LGMMMVPP ++LPFK+++FV + G+ +V + ++
Sbjct: 184 QIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQS 242


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1713  TYPE3IMQPROT  42     1e-08    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 41.7 bits (98), Expect = 1e-08
Identities = 15/81 (18%), Positives = 35/81 (43%)

Query: 4 SPIIDIFQTFFYKGVMILMPVAGVSMIVVIIIAVIMAMMQIQEQTLTFLPKMASIVLVII 63
++ Y +++ V+ I+ +++ + + Q+QEQTL F K+ + L +
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 ILGPWMFQELTTLILDLFDKI 84
+L W + L + +
Sbjct: 62 LLSGWYGEVLLSYGRQVIFLA 82


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1714  TYPE3IMRPROT  96     7e-26    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 96.0 bits (239), Expect = 7e-26
Identities = 51/233 (21%), Positives = 113/233 (48%), Gaps = 1/233 (0%)

Query: 10 FFAFCRITSFLYFLPFFSGRSIPAMAKVTFGLALSITVADQVDVSHIKTVWDVAA-YAGT 68
F+ R+ + + P S RS+P K+ + ++ +A + + + A A
Sbjct: 17 FWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFFALWLAVQ 76

Query: 69 QIVIGLSLSKIVEMLWNIPKMAGHILDFDIGLSQASLFDVNAGSQSTLLSTIFDIFFLII 128
QI+IG++L ++ + + AG I+ +GLS A+ D + +L+ I D+ L++
Sbjct: 77 QILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLALLL 136

Query: 129 FISLGGINYFVATILKSFQYTEAISKLLTTSFLDSLLATLLFAITSAVEIALPLMGSLFI 188
F++ G + ++ ++ +F + L ++ +L + + +ALPL+ L
Sbjct: 137 FLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALPLITLLLT 196

Query: 189 INFVLILIAKNAPQLNVFMNAYVIKITCGILFIAMSVPMLGYVFKNMTDVLLE 241
+N L L+ + APQL++F+ + + +T GI +A +P++ +++ +
Sbjct: 197 LNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFN 249


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1715  TYPE3IMSPROT  289    2e-98    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 289 bits (742), Expect = 2e-98
Identities = 92/343 (26%), Positives = 186/343 (54%), Gaps = 2/343 (0%)

Query: 4 DNKTEKATPQKRKKSREEGNIARSKDLNNLFSILVLAVVVYFFGDWLGFEIANSVSVLFD 63
KTE+ TP+K + +R++G +A+SK++ + I+ L+ ++ D+ + + + +
Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62

Query: 64 QIGKNTDS--TEYFYMMGILLLKVSAPILILVYAFHLFNYMIQVGFLFSSKVIKPKASRI 121
Q + + + + P+L + + ++++Q GFL S + IKP +I
Sbjct: 63 QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKI 122

Query: 122 NPKNYFTRLFSRKSLVDILKSLFYMGLIGYVAYVLFKKNLEKIVSMIGFNWTASLTEIIR 181
NP R+FS KSLV+ LKS+ + L+ + +++ K NL ++ + + +
Sbjct: 123 NPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQ 182

Query: 182 QIKFIFLAILIILIVLSIIDFIYQKWEYEQDIKMKKEEVKQEHKDNEGDPQVKGKRKNFM 241
++ + + + +V+SI D+ ++ ++Y +++KM K+E+K+E+K+ EG P++K KR+ F
Sbjct: 183 ILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQFH 242

Query: 242 HAILQGTIAKKMDGATFIVNNPTHISVVLRYNKHVDAAPIVVAKGEDELALYIRTLAREQ 301
I + + + ++ +V NPTHI++ + Y + P+V K D +R +A E+
Sbjct: 243 QEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEE 302

Query: 302 EIPMVENRPLARSLYYQVEEDETIPEDLYVAVIEVMRYLIQTN 344
+P+++ PLAR+LY+ D IP + A EV+R+L + N
Sbjct: 303 GVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQN 345


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
GBAA_1719  FLGHOOKAP1  28     0.033    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 28.4 bits (63), Expect = 0.033
Identities = 11/47 (23%), Positives = 24/47 (51%)

Query: 203 NGVGTVKNYMLENSNVDMTKEMADLMTDQRMISASQRVMTSFDKIYE 249
N V + N S V++ +E +L Q+ A+ +V+ + + I++
Sbjct: 494 NVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 540


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1720  DPTHRIATOXIN  28     0.039    Diphtheria toxin signature.
		>DPTHRIATOXIN#Diphtheria toxin signature.

Length = 567

Score = 27.8 bits (61), Expect = 0.039
Identities = 26/113 (23%), Positives = 49/113 (43%), Gaps = 16/113 (14%)

Query: 63 EQGEIVHYIKDGAQVKLGPVKLEINWERRHNLMRHHSLLHLIGAVVYEKYGALCTGNQIY 122
E V YI + Q K V+LEIN+E R + +YE C GN++
Sbjct: 174 EGSSSVEYINNWEQAKALSVELEINFETRGKRGQD---------AMYEYMAQACAGNRVR 224

Query: 123 PDKA------RIDFNELQELSSVEVEGIVKEVNKLIEQNKEISTRYMSREEAE 169
+D++ +++ + ++E + KE + + E + +S E+A+
Sbjct: 225 RSVGSSLSCINLDWDVIRDKTKTKIESL-KEHGPIKNKMSESPNKTVSEEKAK 276


79  GBAA_1731  GBAA_1743  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_1731  -2      12       -0.470719  permease
GBAA_1732  -3      13       -0.499016  lipoprotein
GBAA_1733  -3      14       -0.694056  LysR family transcriptional regulator
GBAA_1734  -2      15       -0.335195  ABC transporter ATP-binding protein
GBAA_1735  -2      15       -0.624669  ABC transporter substrate-binding protein
GBAA_1736  -1      16       -1.551585  ABC transporter permease
GBAA_1737  0       17       -3.031147  metallo-beta-lactamase
GBAA_1738  -1      17       -3.862561  hypothetical protein
GBAA_1741  -1      17       -3.116571  hypothetical protein
GBAA_1742  0       14       -2.580310  hypothetical protein
GBAA_1743  0       15       -2.797444  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_1731  TCRTETA  41     6e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 41.0 bits (96), Expect = 6e-06
Identities = 40/229 (17%), Positives = 86/229 (37%), Gaps = 11/229 (4%)

Query: 55 FATTLVCGSLPRMICGPIAGAVADRVSRRWLVIGTDLLSSLTMLIMFILATIFGPSLPFI 114
+ L +L + C P+ GA++DR RR +++ + +++ IM P L +
Sbjct: 45 YGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIM-----ATAPFLWVL 99

Query: 115 YISAALLSICASFYSVALTSSIPNLVDEGRIQKASALNQTAASLSNILGPIIGGVVFGFL 174
YI + I + +VA + I ++ D + + GP++GG++ G
Sbjct: 100 YIGRIVAGITGATGAVA-GAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM-GGF 157

Query: 175 SIQSFFLLNSITFFLAVILQLFIVFDLYKKEVAESKEHFLTSIKEGFSYVKRQHEIYGLM 234
S + F + L + F++ + +K E + L + F + + + LM
Sbjct: 158 SPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREAL-NPLASFRWARGMTVVAALM 216

Query: 235 KIALWVNFFACGLTVALPYIIVHTLHLSSKQLGTVEGMLAVGMLMGAIT 283
+ + H + +G LA ++ ++
Sbjct: 217 AVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGI---SLAAFGILHSLA 262



Score = 33.6 bits (77), Expect = 0.001
Identities = 17/97 (17%), Positives = 35/97 (36%), Gaps = 2/97 (2%)

Query: 76 VADRVSRRWLVIGTDLLSSLTMLIMFILATIFGPSLPFIYISAALLSICASFYSVALTSS 135
+ V+ R +L + +IL ++ +L AL +
Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYILLAFATRG--WMAFPIMVLLASGGIGMPALQAM 323

Query: 136 IPNLVDEGRIQKASALNQTAASLSNILGPIIGGVVFG 172
+ VDE R + SL++I+GP++ ++
Sbjct: 324 LSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA 360


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1732  CHANLCOLICIN  27     0.047    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 27.0 bits (59), Expect = 0.047
Identities = 33/151 (21%), Positives = 60/151 (39%), Gaps = 16/151 (10%)

Query: 40 ATMWFEKAEKEKSGNEAKSYKEMAEKMDHGATALKDGKYLEAKDIANEVLQMKKDDALET 99
AT + A+ +K+ E + + A + A A +D KDI NE L+
Sbjct: 53 ATAKWSTAQLKKTQAEQAARAKAAAEAQAKAKANRDALTQRLKDIVNEALRHNASRTPSA 112

Query: 100 AVTSNAENM----------LQKAKDVEKKVNERVAK------RRKVEEEEGIDKLIKAVD 143
++A N L KA++ +K E K +R+ E E + + +
Sbjct: 113 TELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLK 172

Query: 144 SIDDVKEKEKKVSEALDKAEEAQAKIEAKKN 174
+ +++ +SE E AQ K+ A ++
Sbjct: 173 LAEAEEKRLAALSEEAKAVEIAQKKLSAAQS 203


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1733  VACCYTOTOXIN  30     0.016    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 29.6 bits (66), Expect = 0.016
Identities = 18/64 (28%), Positives = 25/64 (39%), Gaps = 13/64 (20%)

Query: 123 TLMVKT--APEIRTMLQNHEINLGVISAAPFDESLLKQTNVMPDTLVLAFSKEHHFSKKE 180
TL++ + A RTM+ N + KQ N TL S EH S +
Sbjct: 914 TLLIDSHDAGYARTMIDATSAN-----------EITKQLNTATTTLNNIASLEHKTSGLQ 962

Query: 181 NVSL 184
+SL
Sbjct: 963 TLSL 966


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_1734  PF05272  31     0.007    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.2 bits (70), Expect = 0.007
Identities = 14/41 (34%), Positives = 19/41 (46%), Gaps = 7/41 (17%)

Query: 32 TLLGPSGCGKTTLLRMIAGLEEPDKGEIYFGDTCMYSSTKK 72
L G G GK+TL+ + GL+ +F DT T K
Sbjct: 600 VLEGTGGIGKSTLINTLVGLD-------FFSDTHFDIGTGK 633


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
GBAA_1735  MALTOSEBP  29     0.045    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 28.5 bits (63), Expect = 0.045
Identities = 78/308 (25%), Positives = 116/308 (37%), Gaps = 69/308 (22%)

Query: 40 EKKIVVYSAGPKG---LAEKIQKDFEKKTGIKVEMFQGTTGKILARMEAEKKKPVVDV-- 94
E K+V++ G KG LAE + K FEK TGIKV + + E+K P V
Sbjct: 30 EGKLVIWINGDKGYNGLAE-VGKKFEKDTGIKVTVEHPD--------KLEEKFPQVAATG 80

Query: 95 ----VVLASLPAMEGLKKDGQTLAYKEAKQADKLRSEWSDDKGHYFG------YSASALG 144
++ + G + G K ++ D Y G + AL
Sbjct: 81 DGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALS 140

Query: 145 IVYNTKNVKTAPEDWSDI--------TKGEWKGKVNLPDP--------ALSGSALDFVTG 188
++YN + P+ W +I KG+ NL +P A G A + G
Sbjct: 141 LIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENG 200

Query: 189 -YVKKN-------GKDGWDLFEQLKKNEVTVAGANQEALDPVVT-GAKDMVIAG------ 233
Y K+ K G L KN+ A + + G M I G
Sbjct: 201 KYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSN 260

Query: 234 -----VDY-MTYSAKAKGEPVDIVYPKSGTVISPRAAGIMKDSKNVEGAKEFID-YLLSD 286
V+Y +T KG+P S + +AGI S N E AKEF++ YLL+D
Sbjct: 261 IDTSKVNYGVTVLPTFKGQP-------SKPFVGVLSAGINAASPNKELAKEFLENYLLTD 313

Query: 287 DVQKQISK 294
+ + ++K
Sbjct: 314 EGLEAVNK 321


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1741  ACRIFLAVINRP  24     0.049    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 23.6 bits (51), Expect = 0.049
Identities = 6/20 (30%), Positives = 15/20 (75%)

Query: 4 VAIVYGIILSTIIVLFFVGV 23
+ ++ G++ +T++ +FFV V
Sbjct: 1004 IGVMGGMVSATLLAIFFVPV 1023


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
GBAA_1743  BLACTAMASEA  37     8e-05    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 37.1 bits (86), Expect = 8e-05
Identities = 20/117 (17%), Positives = 46/117 (39%), Gaps = 15/117 (12%)

Query: 57 NGEVITSINENKKLPLASMAKIVIAVEFAKQVSEGKISRDEQISLHELNKYYVKDTDGGA 116
+G +T+ +++ P+ S K+V+ +V G + +I Y +
Sbjct: 49 SGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIH-------YRQQ----- 96

Query: 117 HPDWLEDARARELVKNGQITLEEVAKGMIHYSSNANTTHLLDKL-GIERVNESIKEL 172
D ++ + E +T+ E+ I S N+ LL + G + ++++
Sbjct: 97 --DLVDYSPVSEKHLADGMTVGELCAAAITMSDNSAANLLLATVGGPAGLTAFLRQI 151


80  GBAA_1795  GBAA_1803  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
GBAA_1795  -1      12       0.446373  hypothetical protein
GBAA_1796  -1      11       0.822392  cardiolipin synthetase domain-containing
GBAA_1797  -2      11       1.378081  uridylate kinase
GBAA_1799  -2      10       0.975851  proton/sodium-glutamate symport protein
GBAA_1800  -2      9        1.005729  aspartate ammonia-lyase
GBAA_1801  -1      11       0.858093  malate dehydrogenase
GBAA_1802  0       11       0.034341  sensor histidine kinase
GBAA_1803  1       11       1.209363  response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1795  ABC2TRNSPORT  45     1e-07    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 45.3 bits (107), Expect = 1e-07
Identities = 28/106 (26%), Positives = 47/106 (44%)

Query: 262 IVMIGVLMLFALIAIGISLVLVAFSKNSASANTMQNLVIVPTCLLAGCYFPYDIMPKAVQ 321
+ + V+ L L + +V+ A + + Q LVI P L+G FP D +P Q
Sbjct: 148 LYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQ 207

Query: 322 KVADFLPQRWLLDTIAKLQQGIPFSELYVNILILFAFAVAFFLIAI 367
A FLP +D I + G P ++ ++ L + V F ++
Sbjct: 208 TAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLST 253


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1797  CARBMTKINASE  29     0.024    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 28.6 bits (64), Expect = 0.024
Identities = 15/60 (25%), Positives = 24/60 (40%), Gaps = 14/60 (23%)

Query: 122 LDNGYIVIFGGGNGQPFVTT-------------DYPSVQRAIEMNSDAILVAKQGVDGVF 168
++ G IVI GG G P + D + A E+N+D ++ V+G
Sbjct: 183 VERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILT-DVNGAA 241


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_1802  PF06580  38     8e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.5 bits (87), Expect = 8e-05
Identities = 26/130 (20%), Positives = 46/130 (35%), Gaps = 20/130 (15%)

Query: 297 LGKDIRFSKHIEGEHAAYHV--YTVLSIFNNLVANAVEAIEDRGLIHIKLYKREQHVIFE 354
++F I V V ++ N + + + + G I +K K V E
Sbjct: 236 FEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLE 295

Query: 355 VIDDGPGIAQKYKKLVFKPGFTSKYDQTGTPSTGIGLSYIDEMVTEL-GGEVRLEDNENG 413
V + G + K+ STG GL + E + L G E +++ +E
Sbjct: 296 VENTGSLALKNTKE-----------------STGTGLQNVRERLQMLYGTEAQIKLSEKQ 338

Query: 414 NGCKFIVCLP 423
+V +P
Sbjct: 339 GKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
GBAA_1803  HTHFIS  54     3e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 54.1 bits (130), Expect = 3e-10
Identities = 29/177 (16%), Positives = 71/177 (40%), Gaps = 10/177 (5%)

Query: 5 IVDDDEVFRSMLSQIIEDGDLGEVIGESEDGAFIEAEQLNYKKVDILFIDLLMPMRDGIE 64
+ DDD R++L+Q + I + + + D++ D++MP + +
Sbjct: 8 VADDDAAIRTVLNQALSRAGYDVRITSNAATLW---RWIAAGDGDLVVTDVVMPDENAFD 64

Query: 65 TVRHIASSFTG-KIIMISQVESKQLIGEAYTLGVEYYITKPLNKIEVVSVVRKVIERIRL 123
+ I + ++++S + +A G Y+ KP + E++ ++ + + +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 124 ERSIYDIQKSLNNVFQWEKPQMRNETVQEGKKISDSGRFLLSELGIAGENGS-KDLL 179
S + M+ E + ++ + L+ I GE+G+ K+L+
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQ-EIYRVLARLMQTDLTLM----ITGESGTGKELV 176


81  GBAA_1975  GBAA_1982  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_1975  -1      16       -2.919610  DNA-binding response regulator
GBAA_1976  -1      14       -2.515465  sensor histidine kinase
GBAA_1977  -1      14       -1.934461  polysaccharide deacetylase
GBAA_1978  0       15       -1.882460  lipoprotein
GBAA_1979  1       16       -1.656444  sapb protein
GBAA_1980  1       14       -1.842069  hypothetical protein
GBAA_1981  0       14       -1.713088  siderophore biosynthesis protein
GBAA_1982  0       16       -1.565359  siderophore biosynthesis protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
GBAA_1975  HTHFIS  93     3e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 93.4 bits (232), Expect = 3e-24
Identities = 29/123 (23%), Positives = 58/123 (47%), Gaps = 3/123 (2%)

Query: 2 PTILVLEDEMPIRSFIVLNLKRAGFYVLEASTGEEALQILCEHTVDVALLDVMLPGMDGF 61
TILV +D+ IR+ + L RAG+ V S + + D+ + DV++P + F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 QVCKAIREENKKIGIIMLTARVQNEDKVQGLGIGADDYIAKPFSP---VELTARIQSLLR 118
+ I++ + +++++A+ ++ GA DY+ KPF + + R + +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 119 RIE 121
R
Sbjct: 124 RRP 126


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_1976  PF06580  40     1e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.8 bits (93), Expect = 1e-05
Identities = 22/101 (21%), Positives = 44/101 (43%), Gaps = 22/101 (21%)

Query: 359 IVQNAIKY----SHENGKVYIEATKNEGQAVIKVKDDGIGIAKEHLPYIEQSFYQINNHA 414
+V+N IK+ + GK+ ++ TK+ G ++V++ G K N
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK--------------NTK 308

Query: 415 TGAGLGLAIVKKMVELHGG---TINIISKEGIGTTILIKLP 452
G GL V++ +++ G I + K+G ++ +P
Sbjct: 309 ESTGTGLQNVRERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_1980  TYPE3OMBPROT  27     0.050    Type III secretion system outer membrane B protein ...
		>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature.

Length = 538

Score = 27.0 bits (59), Expect = 0.050
Identities = 18/85 (21%), Positives = 35/85 (41%), Gaps = 11/85 (12%)

Query: 79 GVERTGTYNCEGELYAIYLLQEYQGMKIG-------QKLFQAFLSDCKNNDMQSLLVWVV 131
G +RTG + E + I + Q ++ ++LF L + N ++Q +
Sbjct: 442 GKDRTGMQDAEIKREIIRKHETGQFSQLNSKLSSEEKRLFSTILMNSGNMEIQEM----N 497

Query: 132 TNNPSKKFYEKFNPEKMDTKFLERV 156
T P K +K ++ + ER+
Sbjct: 498 TGVPGNKVMKKLPLSSLELSYSERI 522


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_1981  PF04183  287    2e-91    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 287 bits (736), Expect = 2e-91
Identities = 101/543 (18%), Positives = 192/543 (35%), Gaps = 55/543 (10%)

Query: 82 QFYYQMGDSNSVMKADYVTVITFLIKEMSINYG-EGTNPAELMLRVIRSCQNIEEFTKER 140
+ + D+ ++ AD + L+ ++ AE M + + + K R
Sbjct: 54 IWGWLWIDAQTLRCADEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLKAR 113

Query: 141 KEDTSALYGFHTSFIEAEQSLLFGHLTHPTPKSRQGILEWKSAMYSPELKGECQLHYFRA 200
+ +++ + Q LL GH K R+G + Y+PE +LH+
Sbjct: 114 RGLSASDL--INLNADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAV 171

Query: 201 HKSIVNEKSLLLDSTTVILKEELRNDEM-VSKEFISKYCNEDEYSLLPIHPLQAEWLLHQ 259
+ + + +L + E + + + + LP+HP Q + +
Sbjct: 172 KREHMIWRCDNEMDIHQLLTAAMDPQEFARFSQVWQENGLDHNWLPLPVHPWQWQQKIAT 231

Query: 260 PYVQDWIEQGVLEYIGPTGKCYMATSSLRTLYHPDAKYMLKFSFPVKV--TNSMRINKLK 317
++ D +G + +G G ++A SLRTL + + L P+ + T+ R +
Sbjct: 232 DFIAD-FAEGRMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGR 290

Query: 318 ELESGLEGKAMLNTAI-GEVLEKFPGFDFICDPAFITL-----------NYGTQESGFEV 365
+ +G L + G + +PA + Y QE V
Sbjct: 291 YIAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEM-LGV 349

Query: 366 IIRENPFYSEHADDATLIAGLVQDAIPGERTRLSNIIHRLADLESRSCEEVSLEWFRRYM 425
I RENP D++ ++ + + + I R E W +
Sbjct: 350 IWRENPCRWLKPDESPVLMATLMECDENNQPLAGAYIDRS----GLDAET----WLTQLF 401

Query: 426 NISLKPMVWMYLQYGVALEAHQQNSVVQLKDGYPVKYYFRDNQG-FYFCNSMKEMLNNEL 484
+ + P+ + +YGVAL AH QN + +K+G P + +D QG +++
Sbjct: 402 RVVVVPLYHLLCRYGVALIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMDSLP 461

Query: 485 AGIGERTGNLYDDYIVDERFRYYL--IFNHMFGLINGFGTAGLIREEILLTELRTVLES- 541
+ + T L DY++ + + + + L+ G + E L VL
Sbjct: 462 QEVRDVTSRLSADYLIHDLQTGHFVTVLRFISPLMVRLG----VPERRFYQLLAAVLSDY 517

Query: 542 ----------FLPYNREPSTFLRELLEEDKLACKANLLTRFFDVDELSNPLEQAIYVQVQ 591
F ++ +R +L KL + D+D S L +Q
Sbjct: 518 MKKHPQMSERFALFSLFRPQIIRVVLNPVKLT--------WPDLDGGSRMLPN-YLEDLQ 568

Query: 592 NPL 594
NPL
Sbjct: 569 NPL 571


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_1982  PF04183  584    0.0      IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 584 bits (1507), Expect = 0.0
Identities = 146/602 (24%), Positives = 267/602 (44%), Gaps = 45/602 (7%)

Query: 14 IESEDYISVRRRVLRQLVESLIYEGIITPARIEKEEQILFLIQGLDEDNKSVTYECYGRE 73
+ +D+ V RR++ +++ L YE + + +++ + G + + E
Sbjct: 1 MNHKDWDLVNRRLVAKMLSELEYEQVFHA-ESQGDDRYCINLPG-------AQWR-FIAE 51

Query: 74 RITFGRISIDSLIVRVQDGKQEIQSVAQFLEEVFRVVNVEQTKLDSFIHELEQTIFKDTI 133
R +G + ID+ +R D Q L ++ +V+++ + + +L T+ D
Sbjct: 52 RGIWGWLWIDAQTLRCADEPVLAQ---TLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQ 108

Query: 134 AQYER--CNKLKYTQKSYDELENHLIDGHPYHPSYKARIGFQYRDNFRYGYEFMRPIKLI 191
R + + D L+ L+ GHP K R G+ RY E+ +L
Sbjct: 109 LLKARRGLSASDLINLNADRLQ-CLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLH 167

Query: 192 WIAAHKKNATVGYENEVIYDKILKSEVGERKLEAYKERIHSMGCDPKQYLFIPVHPWQWE 251
W+A +++ +NE+ ++L + + ++ + + G D +L +PVHPWQW+
Sbjct: 168 WLAVKREHMIWRCDNEMDIHQLLTAAMDPQEFARFSQVWQENGLD-HNWLPLPVHPWQWQ 226

Query: 252 NFIISNYAEDIQDKGIIYLGESADDYCAQQSMRTLRNVTNPKRPYVKVSLNILNTSTLRT 311
I +++ D + ++ LGE D + AQQS+RTL N + +K+ L I NTS R
Sbjct: 227 QKIATDFIADFAEGRMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRG 286

Query: 312 LKPYSVASAPAISNWLSNVVSQDSYLRDESRVILLKEFSSVM----YDTNKKATYG---S 364
+ +A+ P S WL V + D+ L VIL + + + Y +A Y
Sbjct: 287 IPGRYIAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEM 346

Query: 365 LGCIWRESVHHYLGEQEDAVPFNGLYAKEKDGTPIIDAWLNKYGI--ENWLRLLIQKAII 422
LG IWRE+ +L E V L +++ P+ A++++ G+ E WL L + ++
Sbjct: 347 LGVIWRENPCRWLKPDESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVV 406

Query: 423 PVIHLVVEHGIALESHGQNMILVHKEGLPVRIALKDFHEGLEFYRPFLKEMNKCPDFTKM 482
P+ HL+ +G+AL +HGQN+ L KEG+P R+ LKDF + + EM+ P +
Sbjct: 407 PLYHLLCRYGVALIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMDSLPQEVRD 466

Query: 483 HKTYANGKMNDFFEMDRIECLQEMVLDALFLFNVGELAFVLADKYEWKEESFWMIVVEEI 542
+ + D+ D L V L + E F+ ++ +
Sbjct: 467 VTSRLS---ADYLIHD---------LQTGHFVTVLRFISPLMVRLGVPERRFYQLLAAVL 514

Query: 543 ENHFRKYPHLKDRFESIQLYTPTFYAEQLTKRRL-YIDVESLVHEVP-------NPLYRA 594
++ +K+P + +RF L+ P L +L + D++ +P NPL+
Sbjct: 515 SDYMKKHPQMSERFALFSLFRPQIIRVVLNPVKLTWPDLDGGSRMLPNYLEDLQNPLWLV 574

Query: 595 RQ 596
Q
Sbjct: 575 TQ 576


82  GBAA_2162  GBAA_2169  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_2162  0       12       0.693710   hypothetical protein
GBAA_2163  0       12       -0.081764  HD domain-containing protein
GBAA_2164  1       15       0.547422   ABC transporter ATP-binding protein
GBAA_2165  -1      13       0.057069   hypothetical protein
GBAA_2166  1       14       0.101820   hypothetical protein
GBAA_2168  1       11       0.621798   single-stranded DNA-binding protein
GBAA_2169  0       12       0.105954   TetR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
GBAA_2162  IGASERPTASE  30     0.015    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.6 bits (66), Expect = 0.015
Identities = 29/130 (22%), Positives = 48/130 (36%), Gaps = 8/130 (6%)

Query: 40 ATPQNQELQYPQNPYTAPQTQEQQFQQNSYDTRPSYEYP----QNPYAAPQNQELQYPQN 95
ATP +N +T E+ Q + T + E N A Q E+ +
Sbjct: 1031 ATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGS 1090

Query: 96 PYVTPQTQEQQFQQNPYPTQPQQTQYQQQMYQPNYDARVSPPKPPTFDITQPQILP---P 152
QT E + + + + ++ P ++V PK + QPQ P
Sbjct: 1091 ETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQV-SPKQEQSETVQPQAEPAREN 1149

Query: 153 GPTLDITQPQ 162
PT++I +PQ
Sbjct: 1150 DPTVNIKEPQ 1159


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2163  ARGDEIMINASE  28     0.027    Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 28.3 bits (63), Expect = 0.027
Identities = 8/38 (21%), Positives = 19/38 (50%)

Query: 57 LHDVADEKLNESEEAGMKKVSDWLEELHVEEEESKHVL 94
+ D+ E L S K +S ++ E ++ + + ++L
Sbjct: 72 IEDLISEVLVSSVALENKFISQFILEAEIKTDFTINLL 109


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
GBAA_2164  TYPE4SSCAGX  33     0.004    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 33.2 bits (75), Expect = 0.004
Identities = 34/125 (27%), Positives = 60/125 (48%), Gaps = 23/125 (18%)

Query: 513 EGEVREFLGSYTDYLEMEKTRELI-----------EKAEVQKEKKVVEEAPKQQRKRKLS 561
E E F DY E KT++LI +K ++KEK+ E+A K Q+ ++
Sbjct: 109 EKEAVNFALMTRDYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREK 168

Query: 562 YNEQREWETIEDTIAELEEKLESIGEELANVGSDFTKAQELSE-AQQKTEEELEKTMERW 620
E+R A+ LE++ ++N + + + LSE +Q+ E EL++ MER
Sbjct: 169 RKEER---------AKNRANLENLTNAMSN-PQNLSNNKNLSELIKQQRENELDQ-MERL 217

Query: 621 SELSD 625
++ +
Sbjct: 218 EDMQE 222


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_2169  HTHTETR  72     5e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 72.4 bits (177), Expect = 5e-18
Identities = 33/201 (16%), Positives = 73/201 (36%), Gaps = 13/201 (6%)

Query: 3 RETRKKELKELIFLKAVQLFQERGYENVTVQDITTACGIAKGTFFNYFPKKENILLFLGD 62
+ +E ++ I A++LF ++G + ++ +I A G+ +G + +F K ++ + +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 63 SQIELWNESLKTYENVEH--PKERIKLVLGDLLDRFTGHGDLMKHAIFEIIKSNYLVENE 120
E Y+ P ++ +L +L+ T + + + EII E
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLE-STVTEERRR-LLMEIIFHKCEFVGE 122

Query: 121 LKSIQQLQEG--------LSSIITEAKETGKLNSQWDINIITSTVMSTYFYTLMSHSLLH 172
+ +QQ Q + + E L + + Y LM + L
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRG-YISGLMENWLFA 181

Query: 173 GNEINAKNILNQQLDVVWEGI 193
+ K + ++ E
Sbjct: 182 PQSFDLKKEARDYVAILLEMY 202


83  GBAA_2225  GBAA_2230  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_2225  0       18       -1.939211  acetyltransferase
GBAA_2227  0       18       -2.218493  acetyltransferase
GBAA_2228  -1      15       -2.906243  ABC transporter ATP-binding protein
GBAA_2230  -1      12       -2.157304  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2225  SACTRNSFRASE  52     2e-11    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 52.3 bits (125), Expect = 2e-11
Identities = 24/92 (26%), Positives = 44/92 (47%), Gaps = 5/92 (5%)

Query: 42 MERKESVIFVAVEDGEYIGFTQLYPSFSSISMKELWILNDLFVQAAKRGAGTGKKLLEAA 101
+E + F+ + IG ++ +++ ++ D+ V R G G LL A
Sbjct: 60 VEEEGKAAFLYYLENNCIGRIKIRSNWN-----GYALIEDIAVAKDYRKKGVGTALLHKA 114

Query: 102 KEFALENGAKGVKLQTEIDNLSAQRLYAENGY 133
E+A EN G+ L+T+ N+SA YA++ +
Sbjct: 115 IEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2227  SACTRNSFRASE  50     1e-10    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 50.3 bits (120), Expect = 1e-10
Identities = 27/109 (24%), Positives = 46/109 (42%), Gaps = 6/109 (5%)

Query: 29 SYEDMNNRLQFVQMSPFDFLYVYEEEKTIFGLLGFRIRENLEDITRYGEISIISVDSTIR 88
YED + + +V+ ++Y E G + R N Y I I+V R
Sbjct: 49 QYEDDDMDVSYVEEEG-KAAFLYYLENNCIGRIKIRSNWN-----GYALIEDIAVAKDYR 102

Query: 89 RKGIGHILMDYAEQLAKKHNCIGTWLVSGTKRVEAHPFYKKLGYEVNGY 137
+KG+G L+ A + AK+++ G L + + A FY K + +
Sbjct: 103 KKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAV 151


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_2228  PF05272  32     0.002    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.0 bits (72), Expect = 0.002
Identities = 24/92 (26%), Positives = 37/92 (40%), Gaps = 19/92 (20%)

Query: 33 LIGANGAGKSTTIKTMLGLLVNVNGEISFGAKKNPYAYVPEHPTYYDYLTLWEHIELLMA 92
L G G GKST I T++GL + G K+ Y + + EL
Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQI---------AGIV-AYEL--- 647

Query: 93 ARGNEVGSWERKAEELLHLF---RMDKYKHEY 121
+E+ ++ R E + F R D+Y+ Y
Sbjct: 648 ---SEMTAFRRADAEAVKAFFSSRKDRYRGAY 676


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2230  TRNSINTIMINR  29     0.019    Translocated intimin receptor (Tir) signature.
		>TRNSINTIMINR#Translocated intimin receptor (Tir) signature.

Length = 549

Score = 28.5 bits (63), Expect = 0.019
Identities = 23/91 (25%), Positives = 41/91 (45%), Gaps = 9/91 (9%)

Query: 35 DKPTSTAGQQNLESTSYTYEETNDRLTTDTFITYAMQEAEKQSMQKFGTKIGPVIEDEFK 94
D PT+T Q + + T D+LT + F E +K ++ G I E K
Sbjct: 264 DDPTTTDPDQ---AANAAESATKDQLTQEAFKN---PENQKVNIDANGNAIP---SGELK 314

Query: 95 DVILPKIEEAIAELANDVPEESLQSLAISQK 125
D I+ +I + E +++++S A +Q+
Sbjct: 315 DDIVEQIAQQAKEAGEVARQQAVESNAQAQQ 345


84  GBAA_2367  GBAA_2380  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_2367  -2      10       1.098069   oxalate/formate antiporter
GBAA_2368  -2      11       1.138918   2,3-dihydroxybenzoate-2,3-dehydrogenase
GBAA_2369  -3      10       1.058757   isochorismate synthase DhbC
GBAA_2370  -3      12       0.903449   2,3-dihydroxybenzoate-AMP ligase
GBAA_2371  -2      12       0.417136   isochorismatase
GBAA_2372  -3      11       0.509627   nonribosomal peptide synthetase DhbF
GBAA_2373  -1      16       -1.947156  balhimycin biosynthetic protein MbtH
GBAA_2374  -2      16       -2.072443  EmrB/QacA family drug resistance transporter
GBAA_2375  0       17       -2.054100  4'-phosphopantetheinyl transferase
GBAA_2376  -1      15       -1.140492  hypothetical protein
GBAA_2377  -1      16       -1.066714  DNA-binding protein HU
GBAA_2378  -1      14       -1.444310  hypothetical protein
GBAA_2379  1       15       -0.967004  DinB family DNA polymerase
GBAA_2380  0       15       -0.835454  alkaline serine protease
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2367  TCRTETA       46     1e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 46.4 bits (110), Expect = 1e-07
Identities = 40/195 (20%), Positives = 82/195 (42%), Gaps = 8/195 (4%)

Query: 206 MLGTKQVYLLFIMLFTSCMSGLYLIGMVKDIGVELVGLSAATAANAVAMVAIFNTLGRI- 264
M + + ++ + + G+ LI V + + S A+ ++A++ +
Sbjct: 1 MKPNRPLIVILSTVALDAV-GIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFAC 59

Query: 265 --ILGPLSDKIGRLKIVTGTFVVMASSVLVLSFVDLNYGIYFVCVASVAFCFGGNITIFP 322
+LG LSD+ GR ++ + A +++ + +Y + VA G +
Sbjct: 60 APVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRI--VAGITGATGAVAG 117

Query: 323 AIVGDFFGMKNHSKNYGIVYQGFGFGALAGSFIGALLGGFKP--TFMVIGLLCVVSFIIA 380
A + D ++++G + FGFG +AG +G L+GGF P F L ++F+
Sbjct: 118 AYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTG 177

Query: 381 MLIQAPNQKKEQEEE 395
+ + K E+
Sbjct: 178 CFLLPESHKGERRPL 192



Score = 36.0 bits (83), Expect = 2e-04
Identities = 24/146 (16%), Positives = 59/146 (40%), Gaps = 13/146 (8%)

Query: 8 PWLVVLGTVIVQMGLGTIYTWSLFNQPLVSKYGWSLNAVAITFSITSLSLA-FSTLFASK 66
L+ + ++ +G W +F + ++ W + I+ + + + +
Sbjct: 213 AALMAVFFIMQLVGQVPAALWVIFGE---DRFHWDATTIGISLAAFGILHSLAQAMITGP 269

Query: 67 LQEKWGLRKLIMIAGLALGLGLILSSQASS----LILLYVLAGVVVGYADGTAYITSLSN 122
+ + G R+ +M+ +A G G IL + A+ ++ +LA +G A ++ +
Sbjct: 270 VAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVD 329

Query: 123 LIKWFPERKGLIAGISVSAYGSGSLI 148
ER+G + G + S++
Sbjct: 330 -----EERQGQLQGSLAALTSLTSIV 350



Score = 32.1 bits (73), Expect = 0.004
Identities = 52/317 (16%), Positives = 108/317 (34%), Gaps = 38/317 (11%)

Query: 63 FASKLQEKWGLRKLIMIAGLALGLGLILSSQASSLILLYVLAGVVVGYADGT-------- 114
L +++G R +++++ + + + A L +LY + +V G T
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLY-IGRIVAGITGATGAVAGAYI 120

Query: 115 AYITSLSNLIKWFPERKGLIAGISVSAYGSGSLIFKYVNAQLIESVGVSQAFIYWGLIVT 174
A IT + F G + +G G ++ V L+ F +
Sbjct: 121 ADITDGDERARHF--------GFMSACFGFG-MVAGPVLGGLMGGFSPHAPFFAAAALNG 171

Query: 175 AMIVLGACLI---HQAADQSAVQETKTHEYTTKEMLGTKQVYLLFIMLFTSCMSGLYLIG 231
+ G L+ H+ + +E + + G V L + F + G
Sbjct: 172 LNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAA 231

Query: 232 MVKDIGVELVGLSAATAANAVAMVAIFNTLGRIIL-GPLSDKIGRLKIVTGTFVVMASSV 290
+ G + A T ++A I ++L + ++ GP++ ++G + + + +
Sbjct: 232 LWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGY 291

Query: 291 LVLSFVDLNYGIYFVCVASVAFCFGGNITIFPAIVGDFFGMKNHSKNYGIVYQGFGFGAL 350
++L+F + + PA+ S+ QG G+L
Sbjct: 292 ILLAFATRGW-----MAFPIMVLLASGGIGMPALQAML------SRQVDEERQGQLQGSL 340

Query: 351 A-----GSFIGALLGGF 362
A S +G LL
Sbjct: 341 AALTSLTSIVGPLLFTA 357


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2368  DHBDHDRGNASE  322    e-114    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 322 bits (827), Expect = e-114
Identities = 163/261 (62%), Positives = 196/261 (75%), Gaps = 3/261 (1%)

Query: 1 MNVGEFDGKTVLVTGAAQGIGSVVAKMFLERGATVIAVDQNGEGLNVLLNQNETRMKI-- 58
MN +GK +TGAAQGIG VA+ +GA + AVD N E L +++ + +
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 59 -FHLDVSDSNAVEDTVKRIENDIAPIDILVNVAGVLRMGAIHSLSDEDWNKTFSVNSTGV 117
F DV DS A+++ RIE ++ PIDILVNVAGVLR G IHSLSDE+W TFSVNSTGV
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 118 FYMSRAVSKHMMQRKSGAIVTVGSNAANTPRVEMAAYAASKAATTMFMKCLGLELAAYNI 177
F SR+VSK+MM R+SG+IVTVGSN A PR MAAYA+SKAA MF KCLGLELA YNI
Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180

Query: 178 RCNLVSPGSTETEMQRLLWADENGAKNIIAGSQNTYRLGIPLQKIAQPSEITEAVLFLAS 237
RCN+VSPGSTET+MQ LWADENGA+ +I GS T++ GIPL+K+A+PS+I +AVLFL S
Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240

Query: 238 DKASHITMHNLCVDGGATLGV 258
+A HITMHNLCVDGGATLGV
Sbjct: 241 GQAGHITMHNLCVDGGATLGV 261


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2371  ISCHRISMTASE  389    e-139    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 389 bits (1000), Expect = e-139
Identities = 176/306 (57%), Positives = 232/306 (75%), Gaps = 11/306 (3%)

Query: 1 MAIPSISVYKMPIESELPKNKVNWTPDPKRAVLLIHDMQEYFLDAYSDKESPKVELISNI 60
MAIP+I Y+MP S++P+NKV+W PDP RAVLLIHDMQ YF+DA++ SP EL +NI
Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60

Query: 61 KVIREKCKELGIPVVYTAQPGGQTLEQRGLLQDFWGDGIPAGPDKKKIVDELTPDEDDIF 120
+ ++ +C +LGIPVVYTAQPG Q + R LL DFWG G+ +GP ++KI+ EL P++DD+
Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120

Query: 121 LTKWRYSAFKKTNLLEILNEQGRDQLIICGIYAHIGCLLTACEAFMDGIQPFFVADAVAD 180
LTKWRYSAFK+TNLLE++ ++GRDQLII GIYAHIGCL+TACEAFM+ I+ FFV DAVAD
Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180

Query: 181 FSLEHHKQALEYASNRCAVTTSTNSLLTELQGLKDD-----------DEITLQKVHELVA 229
FSLE H+ ALEYA+ RCA T T+SLL +LQ D + T + + + +A
Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240

Query: 230 QLLREPVESVGTDEDLLNRGLDSVRIMSLVEKWRREGKEITFADLAENPTVVDWYRLLSP 289
+LL+E E + EDLL+RGLDSVRIM+LVE+WRREG E+TF +LAE PT+ +W +LL+
Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLTT 300

Query: 290 QTEHVL 295
+++ VL
Sbjct: 301 RSQQVL 306


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2374  TCRTETB       121    5e-32    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 121 bits (304), Expect = 5e-32
Identities = 90/398 (22%), Positives = 172/398 (43%), Gaps = 14/398 (3%)

Query: 20 FMAAMDATIVNVALQTISKELQVPPSAMGTVNVGYLVSLAVFLPISGWLGDRFGTKRIFL 79
F + ++ ++NV+L I+ + PP++ VN ++++ ++ + G L D+ G KR+ L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 80 TALFVFTTASALCGIANDITSLNIF-RIIQGAGGGLLTPVGMAMLFRTFSPEERPKISRF 138
+ + S + + + SL I R IQGAG + M ++ R E R K
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 139 IVLPIAVAPAVGPIIGGFFVDQMSWRWAFYINLPFGIMALLFGLLFLKEHIEKSAGRFDS 198
I +A+ VGP IGG + W++ + +P + + L+ L + + G FD
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIH--WSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 199 LGFILSAPGFAMIIYALSQGPSRGWISTEIISTGIAGTVFITLFILVELKVKQPMLDLRL 258
G IL + G + ++ IS I + +F+ KV P +D L
Sbjct: 202 KGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 259 LKEPVFRKMSLISLFSSAGLLGMLFVFPLMYQNVIGVSALESG-LTTFPEAIGLMISSQI 317
K F L + G + + P M ++V +S E G + FP + ++I I
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 318 VPWSYKKLGARKVISIGLICTVIIFVLLSFVNHDTNPWQIRALLFGIGIFLGQSVGAVQF 377
+ G V++IG+ + F+ SF+ T+ + ++F +G L + +
Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTVIST 371

Query: 378 SAFNNITPPSMGRATTIFNVQNRLGSAIGVAVLASILA 415
+++ G ++ N + L G+A++ +L+
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2375  ENTSNTHTASED  39     1e-05    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 38.9 bits (90), Expect = 1e-05
Identities = 22/129 (17%), Positives = 48/129 (37%), Gaps = 23/129 (17%)

Query: 53 RARFIIGCVISRLVLGKILSMSPVQVPIDRMCPVCKLQHGRPQLPEGMPQLSVSHSGEWV 112
+A + G + + L + + + V D+ +P P+G+ S+SH
Sbjct: 47 KAEHLAGRIAAVHALRE-VGVRTVPGMGDK---------RQPLWPDGLFG-SISHCATTA 95

Query: 113 VVAFTKFAPVGVDVEQMNPNVDVMKMAEGVLTDIEKAQVMKLPNEQKIEGFLTYWTR--- 169
+ ++ +G+D+E++ ++A ++ E+ Q
Sbjct: 96 LAVISR-QRIGIDIEKIMSQHTATELAPSIIDSDER------QILQASLLPFPLALTLAF 148

Query: 170 --KEAVLKA 176
KE+V KA
Sbjct: 149 SAKESVYKA 157


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2377  DNABINDINGHU  124    3e-41    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 124 bits (313), Expect = 3e-41
Identities = 57/89 (64%), Positives = 74/89 (83%)

Query: 2 NKTELIKNVAQSADISQKDASAAVQSVFDTIATALQSGDKVQLIGFGTFEVRERSARTGR 61
NK +LI VA++ ++++KD++AAV +VF +++ L G+KVQLIGFG FEVRER+AR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGEEIQIAAGKVPAFKAGKELKEAVK 90
NPQTGEEI+I A KVPAFKAGK LK+AVK
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2380  SUBTILISIN    264    2e-88    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 264 bits (677), Expect = 2e-88
Identities = 101/304 (33%), Positives = 150/304 (49%), Gaps = 19/304 (6%)

Query: 110 TPNDPYYKN-QYGLQKIQAPLAWDSQRSDSSVKVAIIDTGVQGSHPDLSSKVIYGHDYVD 168
+ G++ IQAP W+ R VKVA++DTG HPDL +++I G ++ D
Sbjct: 13 IKQEQQVNEIPRGVEMIQAPAVWNQTRG-RGVKVAVLDTGCDADHPDLKARIIGGRNFTD 71

Query: 169 NDN----VSDDGNGHGTHCAGITGALTNNSVGIAGVAPHTSIYAVRVLDNQGSGTLDAVA 224
+D + D NGHGTH AG A T N G+ GVAP + ++VL+ QGSG D +
Sbjct: 72 DDEGDPEIFKDYNGHGTHVAGTIAA-TENENGVVGVAPEADLLIIKVLNKQGSGQYDWII 130

Query: 225 QGIREAADSGAKVISLSLGAPNGGTALQQAVQYAWNKGSVIVAAAGNAGNTKAN-----Y 279
QGI A + +IS+SLG P L +AV+ A +++ AAGN G+ Y
Sbjct: 131 QGIYYAIEQKVDIISMSLGGPEDVPELHEAVKKAVASQILVMCAAGNEGDGDDRTDELGY 190

Query: 280 PAYYSEVIAVASTDQSDRKSSFSTYGSWVDVAAPGSNIYSTYKGSTYQSLSGTSMATPHV 339
P Y+EVI+V + + S FS + VD+ APG +I ST G Y + SGTSMATPHV
Sbjct: 191 PGCYNEVISVGAINFDRHASEFSNSNNEVDLVAPGEDILSTVPGGKYATFSGTSMATPHV 250

Query: 340 AGVAAL-------LANQGYSNTQIRQIIESTSDKISGTGTYWKNGRVNAYKAVQYAKQLQ 392
AG AL + + ++ + + + + NG + + ++
Sbjct: 251 AGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNSPKMEGNGLLYLTAVEELSRIFD 310

Query: 393 ENKA 396
+
Sbjct: 311 TQRV 314


85  GBAA_2466  GBAA_2480  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_2466  1       14       -3.168354  hypothetical protein
GBAA_2467  0       15       -3.913436  glycerol-3-phosphate acyltransferase PlsY
GBAA_2468  -2      13       -2.648678  acetyltransferase
GBAA_2469  -2      12       -2.588468  threonine dehydratase
GBAA_2472  -2      13       -2.771744  metallo-beta-lactamase
GBAA_2473  -2      12       -2.221429  hypothetical protein
GBAA_2474  -2      12       -1.507699  hypothetical protein
GBAA_2475  -2      12       -3.134405  DEAD/DEAH box helicase
GBAA_2476  0       15       -4.247438  hypothetical protein
GBAA_2477  0       15       -4.221019  hypothetical protein
GBAA_2479  -1      16       -4.521825  TetR family transcriptional regulator
GBAA_2480  0       15       -4.257129  ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2466  PRPHPHLPASEC  28     0.048    Prokaryotic zinc-dependent phospholipase C signature.
		>PRPHPHLPASEC#Prokaryotic zinc-dependent phospholipase C signature.

Length = 398

Score = 28.1 bits (62), Expect = 0.048
Identities = 13/72 (18%), Positives = 26/72 (36%), Gaps = 11/72 (15%)

Query: 112 GILTIGGTGAICLGRKGEVYEYSGGW-GHILGDEGSGYWIALQGLKRMANQFDQGVTLCP 170
++ ++ G +VY W G I G G+ I QG+ + N +
Sbjct: 8 ALICATLATSLWAGASTKVY----AWDGKIDG-TGTHAMIVTQGVSILENDLSKNEP--- 59

Query: 171 LSLRIQDEFQLL 182
++ ++L
Sbjct: 60 --ESVRKNLEIL 69


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2467  ACRIFLAVINRP  28     0.025    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 28.3 bits (63), Expect = 0.025
Identities = 15/62 (24%), Positives = 28/62 (45%), Gaps = 6/62 (9%)

Query: 91 VMLTLLAVIMGHIYPMLFKGKGGKGIS-----TFIGGLIAFDYLIALTLVAVFIIFYLIF 145
+++T LA I+G + P+ G G +GG+++ L + F++ F
Sbjct: 974 ILMTSLAFILGVL-PLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRCF 1032

Query: 146 KG 147
KG
Sbjct: 1033 KG 1034


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2468  AUTOINDCRSYN  29     0.044    Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 28.7 bits (64), Expect = 0.044
Identities = 10/52 (19%), Positives = 23/52 (44%), Gaps = 1/52 (1%)

Query: 15 ESIHKLNYKTFVEEIPQHEETKDRVRIDRFHEENT-YLICLDDDKLVGMVAL 65
+ L +TF + + + D + D++ NT YL + D+ ++ +
Sbjct: 18 GELFTLRKETFKDRLNWAVQCTDGMEFDQYDNNNTTYLFGIKDNTVICSLRF 69


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2475  TONBPROTEIN   30     0.013    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 30.3 bits (68), Expect = 0.013
Identities = 20/113 (17%), Positives = 39/113 (34%), Gaps = 6/113 (5%)

Query: 338 AGGSGLAITFVAAKDEKH------LEEIEKTLGAPIQREIIEQPKIKRVDENGKPLPKPA 391
A +++T V D + E + + V E KP PKP
Sbjct: 40 APAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPK 99

Query: 392 PKKSGEYRQRDSREGSRSGSKGRTRNDSRNSSRNENNRSFNKPSNKKGSTKQG 444
PK + +++ R+ S+ + ++ +R ++ + S S G
Sbjct: 100 PKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASG 152


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2476  BACTRLTOXIN   28     0.005    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 27.6 bits (61), Expect = 0.005
Identities = 8/23 (34%), Positives = 13/23 (56%)

Query: 31 KINWYNDMKTSFANKELADLVKG 53
K+ Y+ +KT N++LA K
Sbjct: 84 KLKNYDKVKTELLNEDLAKKYKD 106


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2479  HTHTETR       72     8e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 72.4 bits (177), Expect = 8e-18
Identities = 30/174 (17%), Positives = 72/174 (41%), Gaps = 13/174 (7%)

Query: 8 EERRKEILETAERLFLTKGYTKTTVNDILKEIGIAKGTFYHYFKSKEEVMDEIIMRIIKE 67
+E R+ IL+ A RLF +G + T++ +I K G+ +G Y +FK K ++ E I + +
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE-IWELSES 68

Query: 68 DVAKAKVIVSNPNIPVLEKLFRVLME---QSPKSGDIKDKMIE-QFHQPNNA---EMYQK 120
++ + ++ + R ++ +S + + + ++E FH+ + Q+
Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128

Query: 121 SLVQSIIHLSPVLTEILEQGIEEGIFSTSY-PQETIELLLSSAQVIFDEGLFQW 173
+ + + + L+ IE + + ++ + W
Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGY----ISGLMENW 178


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2480  TCRTETA       39     2e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.0 bits (91), Expect = 2e-05
Identities = 58/342 (16%), Positives = 125/342 (36%), Gaps = 36/342 (10%)

Query: 47 IFAGLYAITSIPFLLAPLGGAIADRFNRRNLMVIFDFINTAIVLSFIVLLFTGSVSILLI 106
I LYA+ F AP+ GA++DRF RR +++ A+ + ++ + +L I
Sbjct: 47 ILLALYALMQ--FACAPVLGALSDRFGRR-PVLLVSLAGAAV--DYAIMATAPFLWVLYI 101

Query: 107 GTIMFLLAIVNAMYAPVVMASIPQLVPEKKLEQANGIVNGVQALSNIVAPVLGGILYGII 166
G I +A + V A I + + + G ++ + PVLGG++ G
Sbjct: 102 GRI---VAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM-GGF 157

Query: 167 GLKMLVIISCLAFFLSAILEMFITIPFIKRVQESHIIPTIVKDMKGGFIYVLKQPFILKS 226
+ L+ + F+ +P + + + + +
Sbjct: 158 SPHAPFFAAAALNGLNFLTGCFL-LPESHKGERRPLRREALNPLASFRWARGMTVVAA-L 215

Query: 227 MLLAALLNLILTPLFVVGAPIIIRVTMESSH-TLYGIGMGLIDFATIIGALSMVFFAKKL 285
M + ++ L+ V A + + + H IG+ L F I+ +L+ +
Sbjct: 216 MAVFFIMQLVGQ----VPAALWVIFGEDRFHWDATTIGISLAAFG-ILHSLAQAMITGPV 270

Query: 286 QMQTLYYWMILIALLVIPMALSVTPFILNLGY------YPPFILFILSSILIAMIMTVVS 339
+ L++ M T +IL +P +L I + + ++S
Sbjct: 271 AA-----RLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLS 325

Query: 340 IYVITVVQKKTPNENLGKVMAIITAVSQCMAPIGQVIYGFMF 381
V E G++ + A++ + +G +++ ++
Sbjct: 326 RQV--------DEERQGQLQGSLAALTSLTSIVGPLLFTAIY 359



Score = 29.0 bits (65), Expect = 0.032
Identities = 16/79 (20%), Positives = 34/79 (43%), Gaps = 3/79 (3%)

Query: 86 TAIVLSFIVLLFTGSVSILLIGTIMFLLAIVNAMYAPVVMASIPQLVPEKKLEQANGIVN 145
A +I+L F + ++ + P + A + + V E++ Q G +
Sbjct: 285 IADGTGYILLAFATRGWMAFPIMVLLASG---GIGMPALQAMLSRQVDEERQGQLQGSLA 341

Query: 146 GVQALSNIVAPVLGGILYG 164
+ +L++IV P+L +Y
Sbjct: 342 ALTSLTSIVGPLLFTAIYA 360


86  GBAA_2541  GBAA_2549  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_2541  -2      17       -1.720882  ABC transporter ATP-binding protein
GBAA_2542  -2      14       -1.113369  ABC transporter permease
GBAA_2543  -2      14       -0.650532  TetR family transcriptional regulator
GBAA_2545  -2      14       -0.024716  hypothetical protein
GBAA_2546  -1      13       0.488923   hypothetical protein
GBAA_2547  -1      14       1.605361   acyl-CoA dehydrogenase
GBAA_2548  0       13       1.860773   acetyl-CoA carboxylase biotin carboxylase
GBAA_2549  0       13       1.715772   acetyl-CoA carboxylase biotin carboxyl carrier
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2541  PF05272       32     0.003    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.6 bits (71), Expect = 0.003
Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 10/65 (15%)

Query: 17 LVGPSGSGKTTLIKLIAGINEATEGEVLVYNTNMPNLNEMKRIGYMAQADALYE--ELSA 74
L G G GK+TLI + G++ ++ ++ K YE E++A
Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHF--------DIGTGKDSYEQIAGIVAYELSEMTA 652

Query: 75 YENAD 79
+ AD
Sbjct: 653 FRRAD 657


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2542  ABC2TRNSPORT  49     9e-09    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 48.8 bits (116), Expect = 9e-09
Identities = 37/163 (22%), Positives = 72/163 (44%), Gaps = 9/163 (5%)

Query: 166 SFVRERLSGALERLLSTPIKRWEIVVGYIIGFGIFAFIQSIIIVSFSVYILDLYVAGSIW 225
+F R E +L T ++ +IV+G + A + I + + Y
Sbjct: 90 AFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALG--YTQW--L 145

Query: 226 LTLLITCMLSLTAL---TLGTFLSAYANNEFQMIQFIPLVIVPQIFFSG-LFPIESMNKW 281
L +++LT L +LG ++A A + I + LVI P +F SG +FP++ +
Sbjct: 146 SLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIV 205

Query: 282 LQMLGKLFPLTYGADAMRQVMIRNQGFTEIALDLTVLLFFSVL 324
Q + PL++ D +R +M+ + ++ + L + V+
Sbjct: 206 FQTAARFLPLSHSIDLIRPIMLGHPV-VDVCQHVGALCIYIVI 247


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2543  HTHTETR       85     2e-22    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 84.7 bits (209), Expect = 2e-22
Identities = 37/206 (17%), Positives = 82/206 (39%), Gaps = 12/206 (5%)

Query: 16 DKRNERQMRILEAAVDMFGEKGYASTSTSEIAKRAGVAEGTIFRYYKTKKDLLLAVVMPT 75
+ E + IL+ A+ +F ++G +STS EIAK AGV G I+ ++K K DL
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE----- 61

Query: 76 LMKFAAPFFVQAFAKEIFKSEYESYEGLLRVVIHNRFDFA---KKHFPMIKILIQEVPFH 132
+ + + + E +LR ++ + + ++ +++I+ + F
Sbjct: 62 IWELSESNIGELEL-EYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 133 PELK--NEIQQLVETELLLHFKKLIEKFQEKGKIIEMPPATVLRLTLSAVFGLLLTRFLL 190
E+ + Q+ + E ++ ++ E + + + L+ +L
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 191 LPEEKWDDETEIENTIQFILYGLTPR 216
P+ D + E + + +L
Sbjct: 181 APQSF-DLKKEARDYVAILLEMYLLC 205


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2548  PHPHTRNFRASE  34     0.001    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 34.4 bits (79), Expect = 0.001
Identities = 22/80 (27%), Positives = 33/80 (41%), Gaps = 6/80 (7%)

Query: 97 EEGIVFIGPSEEIITKMGSKIESRIAMQA--ADVPVVPGITTNIETAEEAIEIAKQIGYP 154
EGIV + P+EE + K + + A + P T + +E+A IG P
Sbjct: 224 IEGIVIVNPTEEEVKAYEEKRAAFEKQKQEWAKLVGEPSTTKD----GAHVELAANIGTP 279

Query: 155 LMLKASAGGGGIGMQLMETE 174
+ GG G+ L TE
Sbjct: 280 KDVDGVLANGGEGIGLYRTE 299


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2549  RTXTOXIND     32     1e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.7 bits (72), Expect = 1e-04
Identities = 13/30 (43%), Positives = 19/30 (63%)

Query: 41 IVSEEAGTVMKINVQEGDFVNEGDVLLEIE 70
I E V +I V+EG+ V +GDVLL++
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLT 128


87  GBAA_2877  GBAA_2887  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_2877  -1      13       -1.520790  EmrB/QacA family drug resistance transporter
GBAA_2878  -1      12       -2.400981  hypothetical protein
GBAA_2879  -2      13       -2.926863  hypothetical protein
GBAA_2880  -2      14       -3.172035  hypothetical protein
GBAA_2881  -2      13       -2.439388  solute-binding family 5 protein
GBAA_2882  1       12       -2.265298  major facilitator family transporter protein
GBAA_2883  2       15       -2.893177  lipoprotein
GBAA_2884  1       15       -3.122006  hypothetical protein
GBAA_2886  1       16       -2.975202  hypothetical protein
GBAA_2887  1       19       -2.997223  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2877  TCRTETB       145    2e-40    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 145 bits (368), Expect = 2e-40
Identities = 91/406 (22%), Positives = 166/406 (40%), Gaps = 16/406 (3%)

Query: 19 ILMASMDNTIVVTAMGTIVGDLGGLENFV-WVVSAYMVAEMAGMPIFGKLSDMYGRKRFF 77
+ ++ ++ ++ I D WV +A+M+ G ++GKLSD G KR
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 78 IFGLIVFMVGSALCGTAENITQLGIY-RAIQGIGGGALVPIAFTIVFDIFPPEKRGKMGG 136
+FG+I+ GS + + L I R IQG G A + +V P E RGK G
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142

Query: 137 LFGAVFGLSSIFGPLLGAYITDYISWHWVFYINLPLGVLALIFITFFYKESRVHRKQKID 196
L G++ + GP +G I YI HW + + +P+ + + + V K D
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYI--HWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFD 200

Query: 197 WSGAITLVGAVICLMFALELGGQKYDWDSTFILSLFVGFAILIISFIFIERKVEEPIISF 256
G I + ++ M L Y S + + + F+ RKV +P +
Sbjct: 201 IKGIILMSVGIVFFM----LFTTSYSI------SFLIVSVLSFLIFVKHIRKVTDPFVDP 250

Query: 257 EMFKQRLFGMSTIIALCYGAAFMSATVYIPLFIQGVYGGSATNSG-LLLLPMMLGSVVTA 315
+ K F + + +P ++ V+ S G +++ P + ++
Sbjct: 251 GLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFG 310

Query: 316 QLGGFLTTKLSYRNIMIISAVIMLIGLFLLSALTPETSRALLTVYMIIIGFGVGFSFSVL 375
+GG L + ++ I + + FL ++ ET+ +T+ ++ + G+ F+ +V+
Sbjct: 311 YIGGILVDRRGPLYVLNIGVTFLSVS-FLTASFLLETTSWFMTIIIVFVLGGLSFTKTVI 369

Query: 376 SMAAIHNFGMEQRGSATSTSNFIRSLGMTLGITIFGMIQRTGFQDQ 421
S + ++ G+ S NF L GI I G + DQ
Sbjct: 370 STIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQ 415


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2880  PYOCINKILLER  31     0.004    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 30.9 bits (69), Expect = 0.004
Identities = 14/53 (26%), Positives = 20/53 (37%), Gaps = 5/53 (9%)

Query: 12 LELTGISYGQLYRWKRKNLIPEDWFVRKSTFTGQETFFPKEKILERINKIQTM 64
L+ + G KNL P D R T G +K+L KI ++
Sbjct: 97 LDKADAALGPA-----KNLAPLDVINRSLTIVGNALQQKNQKLLLNQKKITSL 144


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2882  TCRTETA       80     1e-18    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 80.3 bits (198), Expect = 1e-18
Identities = 53/318 (16%), Positives = 113/318 (35%), Gaps = 9/318 (2%)

Query: 50 LIFGLQPFSDIVFTLIAGGITDKYGRKKIMLLGLLLQGVAIGSFVFAQSVFIFALLYVIN 109
++ L + G ++D++GR+ ++L+ L V A +++ + ++
Sbjct: 47 ILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVA 106

Query: 110 GIGRSLYIPAQRAQIADLIKQGQQAEIFALLQTMGAIGTVIGPLIGAVFYNTHPEYLFIM 169
GI + A IAD+ ++A F + G V GP++G + P F
Sbjct: 107 GITGAT-GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFA 165

Query: 170 QSITLMVYAVVVWTQLPETAPAITMPKQKLEVSSPKQF--VRNHSAVIGLMVSTLPISFF 227
+ + + LPE+ P ++ ++ F R + V LM +
Sbjct: 166 AAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLV 225

Query: 228 YAQTETNYRIFAEDVFPNFIFILAFISTCRAIMEIILQIFLV-KWSERFSMAKIIIISYT 286
+ IF ED F + I+ + Q + + R + +++
Sbjct: 226 GQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLG-- 283

Query: 287 CYIVAAIGYGFSATIVS--LFFTLLFLVIGESIALNHLLRFVSEIAPSDKRGLYFSIYGL 344
I GY A + F ++ L+ I + L +S +++G
Sbjct: 284 -MIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAA 342

Query: 345 HWDVSRTCGPVIGAILLS 362
++ GP++ + +
Sbjct: 343 LTSLTSIVGPLLFTAIYA 360



Score = 47.5 bits (113), Expect = 5e-08
Identities = 20/121 (16%), Positives = 53/121 (43%), Gaps = 1/121 (0%)

Query: 45 IMITMLIFGLQPFSDIVFTLIAGGITDKYGRKKIMLLGLLLQGVAIGSFVFAQSVFIFAL 104
I + + + +I G + + G ++ ++LG++ G FA ++
Sbjct: 246 TTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFP 305

Query: 105 LYVINGIGRSLYIPAQRAQIADLIKQGQQAEIFALLQTMGAIGTVIGPLIGAVFYNTHPE 164
+ V+ G + +PA +A ++ + + +Q ++ L + ++ +++GPL+ Y
Sbjct: 306 IMVLLASG-GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASIT 364

Query: 165 Y 165

Sbjct: 365 T 365


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2883  TYPE4SSCAGA   29     0.014    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 29.3 bits (65), Expect = 0.014
Identities = 35/130 (26%), Positives = 52/130 (40%), Gaps = 20/130 (15%)

Query: 20 LAACKGTDEKKETNP----TSENSKNEQNTSSEGK-----KEPEVKSNTDSNSKDIVINQ 70
L A KG+ + NP EN N GK K + KS+ +++ KD++INQ
Sbjct: 719 LKALKGSVKDLGINPEWISKVENLNAALNEFKNGKNKDFSKVTQAKSDLENSVKDVIINQ 778

Query: 71 KSINHVKNLFELAKEGKVPNVPFAAHTGDIEEIEKAWGKADKTEQAGNGMYATFTNKNVS 130
K + V NL + K TGD +E+A + A KN S
Sbjct: 779 KVTDKVDNLNQAVSVAKA--------TGDFSRVEQALADLKNFSKE---QLAQQAQKNES 827

Query: 131 FGFNKGSQVF 140
K S+++
Sbjct: 828 LNARKKSEIY 837


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2887  CHANLCOLICIN  35     9e-06    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 35.4 bits (81), Expect = 9e-06
Identities = 15/49 (30%), Positives = 24/49 (48%), Gaps = 3/49 (6%)

Query: 6 IVGGILGWLASLITGRDVPGGVIG-NIIAGIIGSWIGGKLLGSFGPVIG 53
V ++ L SL+ G G+ G I+ GI+ S+I L + V+G
Sbjct: 475 GVSYVVALLFSLLAG--TTLGIWGIAIVTGILCSYIDKNKLNTINEVLG 521


88  GBAA_2935  GBAA_2942  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_2935  -2      19       -0.726415  acetyltransferase
GBAA_2936  -2      17       -0.922745  hypothetical protein
GBAA_2937  -1      19       -1.838908  hypothetical protein
GBAA_2938  0       14       -0.954594  acetyltransferase
GBAA_2939  1       14       -1.550245  acetyltransferase
GBAA_2940  1       14       -0.750746  hypothetical protein
GBAA_2941  -3      14       -1.236915  hypothetical protein
GBAA_2942  -3      20       -0.480907  lipoprotein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2935  SACTRNSFRASE  36     1e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.5 bits (84), Expect = 1e-05
Identities = 25/78 (32%), Positives = 33/78 (42%), Gaps = 9/78 (11%)

Query: 46 YEEQACIGIEIIGAN---KAKIRHIAVIPQYRHKGIALQMI---KEVVRIHQLTYLEAET 99
Y E CIG I +N A I IAV YR KG+ ++ E + + L ET
Sbjct: 71 YLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLET 130

Query: 100 DD---EAVEFYKRIGFQV 114
D A FY + F +
Sbjct: 131 QDINISACHFYAKHHFII 148


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2936  60KDINNERMP   28     0.039    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 28.4 bits (63), Expect = 0.039
Identities = 10/45 (22%), Positives = 23/45 (51%), Gaps = 4/45 (8%)

Query: 12 FFFAFTFVLNRAMDLEGGSWI-WSASLRY---YFMVPMLLLIVMY 52
F A ++L +++L + W L Y+++P+L+ + M+
Sbjct: 432 IFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMF 476


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2938  SACTRNSFRASE  43     9e-08    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 43.0 bits (101), Expect = 9e-08
Identities = 19/90 (21%), Positives = 35/90 (38%), Gaps = 5/90 (5%)

Query: 57 FGAFNEDHQLVGVVTLLTEEKEAYKHKGHIVAMYVDASNQRSGLARELICKAIERAKEMN 116
F + ++ +G + + + + I + V ++ G+ L+ KAIE AKE +
Sbjct: 68 FLYY-LENNCIGRIKI----RSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENH 122

Query: 117 LEQLTLGVVSTNEPAKRLYESMGFKTYGIE 146
L L N A Y F ++
Sbjct: 123 FCGLMLETQDINISACHFYAKHHFIIGAVD 152


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2942  IGASERPTASE   25     0.046    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 25.0 bits (54), Expect = 0.046
Identities = 17/61 (27%), Positives = 28/61 (45%), Gaps = 2/61 (3%)

Query: 28 KDEKEPDPTEEPSEQRQEEKNEKQD-PAKEQNNELNK-KDEQEPDPTEEPSEEQKKKKEN 85
++ E D TE ++ R+ K K + A Q NE+ + E + T E E +KE
Sbjct: 1051 VEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEE 1110

Query: 86 E 86
+
Sbjct: 1111 K 1111


89  GBAA_2958  GBAA_2971  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_2958  1       16       0.143451   bifunctional 3-deoxy-7-phosphoheptulonate
GBAA_2961  -3      15       0.550607   hypothetical protein
GBAA_2963  0       16       -0.511453  isochorismatase
GBAA_2964  1       17       -1.266216  acetyltransferase
GBAA_2965  1       16       -1.667037  hypothetical protein
GBAA_2966  2       16       -1.893301  hypothetical protein
GBAA_2967  2       16       -1.675391  hypothetical protein
GBAA_2969  0       14       -1.695164  hypothetical protein
GBAA_2970  0       13       -0.577702  RNA polymerase sigma factor
GBAA_2971  -1      15       0.016879   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2958  PF06776       29     0.022    Invasion associated locus B
		>PF06776#Invasion associated locus B

Length = 214

Score = 29.1 bits (65), Expect = 0.022
Identities = 15/61 (24%), Positives = 23/61 (37%)

Query: 268 RATRNTLDISAVPILKKETHLPVVVDVTHSTGRRDLLLPTAKAALAIGADAVMAEVHPDP 327
R +R + AVP LK P + ++ RR A+ LA ++ D
Sbjct: 10 RISRRPVTNHAVPALKAIQMGPAELSPMLASCRRLARRNGARLMLAGAMAIALSFGWSDR 69

Query: 328 A 328
A
Sbjct: 70 A 70


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2963  ISCHRISMTASE  53     8e-11    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 52.7 bits (126), Expect = 8e-11
Identities = 43/158 (27%), Positives = 71/158 (44%), Gaps = 19/158 (12%)

Query: 2 KKALLVIDVQ---AGMYTAGMPVHNGEKFLETLQELIGECRSNDIPVIYVQHNGPKDHPL 58
+ LL+ D+Q +TAG + +++L +C IPV+Y G + +P
Sbjct: 30 RAVLLIHDMQNYFVDAFTAGASPV--TELSANIRKLKNQCVQLGIPVVYTAQPGSQ-NPD 86

Query: 59 EKG--TDGW-----------KIHAAIAPLEGECVVEKTTPDSFHKTNLKEVLQDKGIDHV 105
++ TD W KI +AP + + V+ K +F +TNL E+++ +G D +
Sbjct: 87 DRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQL 146

Query: 106 IISGMQTQYCVDTTTRRACSEGYKITLVSDAHSTFDTE 143
II+G+ T A E K V DA + F E
Sbjct: 147 IITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLE 184


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2964  SACTRNSFRASE  36     4e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 35.7 bits (82), Expect = 4e-05
Identities = 19/89 (21%), Positives = 30/89 (33%), Gaps = 14/89 (15%)

Query: 78 VDSESKTLYGYEESQNVWG-------------MDQFIGEPTYWGKGIGTKFVKAAITYIL 124
V+ E K + Y N G ++ Y KG+GT + AI +
Sbjct: 60 VEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAK 119

Query: 125 SEMGAEAIAMDPKVNNERAIKCYEKCGFK 153
E + ++ + N A Y K F
Sbjct: 120 -ENHFCGLMLETQDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_2967  IGASERPTASE   54     1e-09    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 53.5 bits (128), Expect = 1e-09
Identities = 45/216 (20%), Positives = 73/216 (33%), Gaps = 4/216 (1%)

Query: 146 PVEKKADEKTKQVAKVQKSVKAKEEAKTQKITKAKETIKPKEEVKVQEVVKPKEEVKVQE 205
V+ + SV + E + P + E V E K +
Sbjct: 991 TVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVA--ENSKQES 1048

Query: 206 VVKPKEEVKVQEVAKPKEEVKVQEVAKPKEEVKVQEVAKPKEEVKVQEVVKPKEEVKVQE 265
K E E EV + + K + EVA+ E K + + KE V++
Sbjct: 1049 KTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEK 1108

Query: 266 VAKAKEEA-KAQEIAKAKEEAKAQEIAKAKEEAKAQEIAKAKEEAKAQEIAKAKEEAKAR 324
KAK E K QE+ K + ++ +++ E A+ + + +++ A
Sbjct: 1109 EEKAKVETEKTQEVPKVTSQVSPKQ-EQSETVQPQAEPARENDPTVNIKEPQSQTNTTAD 1167

Query: 325 EALKAKEESKNNAQSAKRELTVVATAYTADPSENGT 360
AKE S N Q TV + EN T
Sbjct: 1168 TEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTT 1203



Score = 44.7 bits (105), Expect = 7e-07
Identities = 36/223 (16%), Positives = 79/223 (35%), Gaps = 11/223 (4%)

Query: 132 KTAYVNVSFLSSKAPVEKKADEKTKQVAKVQKSVKAKEEAKTQKITKAKETIKPKEEVKV 191
+T ++ +K ++ + + V + ++ + T+ E + E K
Sbjct: 1035 ETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKE 1094

Query: 192 QEVVKPKEEVKVQEVVKPKEEVKVQEVAKPKEEVKVQEVAKPKEEVKVQEVAKPKEEVKV 251
+ + KE V++ K K E K +E KV PK+E + + V E +
Sbjct: 1095 TQTTETKETATVEKEEKAKV-----ETEKTQEVPKVTSQVSPKQE-QSETVQPQAEPARE 1148

Query: 252 QEVVKPKEEVKVQEVAKAKEEAKAQEIAKAKEEA----KAQEIAKAKEEAKAQEIAKAKE 307
+ +E + Q A E A+E + E+ + E +
Sbjct: 1149 NDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQ 1208

Query: 308 EAKAQEIAKAKEEAKAREALKAKEESKNNAQSAKRELTVVATA 350
E + K + + R ++++ + A ++ + + VA
Sbjct: 1209 PTVNSESSN-KPKNRHRRSVRSVPHNVEPATTSSNDRSTVALC 1250



Score = 42.0 bits (98), Expect = 5e-06
Identities = 47/286 (16%), Positives = 88/286 (30%), Gaps = 30/286 (10%)

Query: 128 EYKGKTAYVNVSFLSSKAPVEKKADEKTKQVAKVQKSVK------AKEEAKTQKITKAKE 181
E ++ +A KA+ +T +VA+ K KE A +K KAK
Sbjct: 1055 EQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKV 1114

Query: 182 TIKPKEEV-KVQEVVKPK----EEVKVQEVVKPKEEVKVQEVAKPKEEVKVQEVAKPKEE 236
+ +EV KV V PK E V+ Q + + V + + +P +E
Sbjct: 1115 ETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKE 1174

Query: 237 VKVQEVAKPKEEVKVQEVVKPKEEVKVQEVAKAKEEAKAQEIAKAKEEAKAQEIAK--AK 294
V +P E E + E + + + +
Sbjct: 1175 TS-SNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHN 1233

Query: 295 EEAKAQEIAKAKEEAKAQEIAKAKEEAKAREALKAKEESKNNAQSAKRELTVVATA---- 350
E A + + KA+ + N ++ + ++ +
Sbjct: 1234 VEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQ 1293

Query: 351 ---YTADPSENGTYGG---------RVLTAMGHDLTANPNMRIIAV 384
+ ++ S N Y T +G D T + N+++ V
Sbjct: 1294 YNVWVSNTSMNKNYSSSQYRRFSSKSTQTQLGWDQTISNNVQLGGV 1339


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
GBAA_2971  RTXTOXINA  27     0.038    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 26.9 bits (59), Expect = 0.038
Identities = 8/25 (32%), Positives = 13/25 (52%)

Query: 17 ISSGTIKIHFTNFHDSVDYDRQLYI 41
+S+G+ I+ HD V YD+
Sbjct: 625 LSAGSANIYAGKGHDVVYYDKTDTG 649


90  GBAA_3033  GBAA_3042  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_3033  -2      15       -0.587384  alkaline d-peptidase
GBAA_3034  -1      15       -1.495702  hypothetical protein
GBAA_3037  0       13       -0.384023  hypothetical protein
GBAA_3038  1       13       -0.485792  ATPase AAA
GBAA_3039  1       11       -0.655342  hypothetical protein
GBAA_3040  1       11       -0.494311  nitroreductase
GBAA_3041  -2      13       -1.612126  marR family transcriptional regulator
GBAA_3042  -3      13       -2.090207  tetracycline resistance protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
GBAA_3033  BLACTAMASEA  32     0.004    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 31.7 bits (72), Expect = 0.004
Identities = 14/55 (25%), Positives = 21/55 (38%)

Query: 74 GKISSYTAGVADLSTKKPVKSDYRFRIGSVTKTFTATTVLQLVGENRVQLDDSIE 128
G++ +A T ++D RF + S K VL V QL+ I
Sbjct: 38 GRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIH 92


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_3037  YERSSTKINASE  29     0.029    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 29.3 bits (65), Expect = 0.029
Identities = 15/37 (40%), Positives = 23/37 (62%), Gaps = 3/37 (8%)

Query: 310 KHLQNVLKILASISDQDTPVSSSYFYTAGFRRKELDA 346
KHL+ +L++L ++S Q PVSS T GF + +A
Sbjct: 567 KHLETLLEVLVTLSQQGQPVSSE---TYGFLNRLTEA 600


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
GBAA_3038  HTHFIS  43     1e-06    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 42.9 bits (101), Expect = 1e-06
Identities = 33/156 (21%), Positives = 62/156 (39%), Gaps = 20/156 (12%)

Query: 15 IIGKDESI----ELAAIALIAKGHILLEDVPGTGKTTLAKSL---AKSVDAKFQRIQFTA 67
++G+ ++ + A + +++ GTGK +A++L K + F I A
Sbjct: 139 LVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAA 198

Query: 68 DTLPGDVIGLEYFNVKESDF----KTRLGPI-FAN--IVLVDEINRAVPRTQSSLLEVME 120
+P D+I E F ++ F G A + +DEI Q+ LL V++
Sbjct: 199 --IPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQ 256

Query: 121 ERTVTIAKQTHSLPEPFLVIATQN-PLESA---GTF 152
+ T + ++A N L+ + G F
Sbjct: 257 QGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLF 292


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
GBAA_3042  TCRTETOQM  635    0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 635 bits (1640), Expect = 0.0
Identities = 223/647 (34%), Positives = 343/647 (53%), Gaps = 13/647 (2%)

Query: 1 MTTINIEIVAHVDAGKTSLTERILYETNVIKEVGRVDSGSTQTDSMELERQRGITIKASV 60
M INI ++AHVDAGKT+LTE +LY + I E+G VD G+T+TD+ LERQRGITI+ +
Sbjct: 1 MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI 60

Query: 61 VSFFIDDIKVNVIDTPGHADFIAEVERSFRVLDGAILVISAVEGVQAQTKILMQTLQKLN 120
SF ++ KVN+IDTPGH DF+AEV RS VLDGAIL+ISA +GVQAQT+IL L+K+
Sbjct: 61 TSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMG 120

Query: 121 IPTILFVNKIDRTGANTEKVVKQIKTILSNETFPFYSVQNEGTKEARIIEYKSYDDCIER 180
IPTI F+NKID+ G + V + IK LS E V E + + + + +
Sbjct: 121 IPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKV--ELYPNMCVTNF-TESEQWDT 177

Query: 181 LAPYNESLLESFVNNEIVTDTLLREELEKQIQQANLYPIFFGSALTGIGVTELLEDIPAL 240
+ N+ LLE +++ + + L +E + +L+P++ GSA IG+ L+E I
Sbjct: 178 VIEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNK 237

Query: 241 LPANNPSQDEELSGIVFKIEREPSGEKIAYVRVFSGTLHVRKYVHIQRDGSLPHKEKIKK 300
++ EL G VFKIE +++AY+R++SG LH+R V I K KI +
Sbjct: 238 FYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKE----KIKITE 293

Query: 301 MCIFHNGNAVQTSTVPSGDFCKVWGLNNIKIGDIIGERT--DYIKDIHFAEPQMEAAINA 358
M NG + SG+ + +K+ ++G+ + I P ++ +
Sbjct: 294 MYTSINGELCKIDKAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPLLQTTVEP 352

Query: 359 VPKERIHDLYAALMELCEADPLIKVWKDDIHNELYIRLFGEVQKEVIETTLYEKYNLQVT 418
++ L AL+E+ ++DPL++ + D +E+ + G+VQ EV L EKY++++
Sbjct: 353 SKPQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIE 412

Query: 419 FSSTRVVCMEKPIGIGNSVEVMGEKANPFYATIGFKVERGELNSGITYKLGVELGSLPLA 478
V+ ME+P+ + NPF+A+IG V L SG+ Y+ V LG L +
Sbjct: 413 IKEPTVIYMERPLKKAEYTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLNQS 472

Query: 479 FHKASEDTVFQTLKQGLYGWEVTDISVTLTHTGYASPVTTASDFRNLTPLVLMDALKQAE 538
F A + + +QGLYGW VTD + + Y SPV+T +DFR L P+VL LK+A
Sbjct: 473 FQNAVMEGIRYGCEQGLYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAG 532

Query: 539 TYVYEPVNEFELTVPEHAISTAMYKLAAILATFAEPIFNNDSYQLTGSLPVAKTESFKRM 598
T + EP F++ P+ +S A A + N+ L+G +P + ++
Sbjct: 533 TELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQEYRSD 592

Query: 599 LHSFTEGEGVFTTKPAGFTKLMAPLPTRKRVDYNPLNRKDYLLHVLK 645
L FT G V T+ G+ + R P +R D + ++
Sbjct: 593 LTFFTNGRSVCLTELKGYHVTTGEPVCQPR---RPNSRIDKVRYMFN 636


91  GBAA_3214  GBAA_3223  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
GBAA_3214  1       17       3.054690  penicillin-binding protein
GBAA_3218  0       14       2.089016  hypothetical protein
GBAA_3219  1       15       2.789169  hypothetical protein
GBAA_3220  0       14       2.937194  marR family transcriptional regulator
GBAA_3221  1       14       2.596427  bifunctional P-450:NADPH-P450 reductase 1
GBAA_3222  2       14       1.631824  hypothetical protein
GBAA_3223  0       15       1.162763  EmrB/QacA family drug resistance transporter
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
GBAA_3214  BLACTAMASEA  34     0.001    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 34.0 bits (78), Expect = 0.001
Identities = 30/151 (19%), Positives = 52/151 (34%), Gaps = 46/151 (30%)

Query: 94 DTLYGIGSTSKVYTAAAVMKLVDEGKVDLDASVTRYIPEFKMKDERYKRITPRMLLNHSS 153
D + + ST KV AV+ VD G L+ + + L+++S
Sbjct: 59 DERFPMMSTFKVVLCGAVLARVDAGDEQLERKIH---------------YRQQDLVDYSP 103

Query: 154 GLQGSTLNNAFLFKDNDVYAHDILLQQLSNQNLKADPGAFSVYCNDGFTLAEILVERVSG 213
+ A + + +L A ++ +D + A +L+ V G
Sbjct: 104 VSE-------------KHLADGMTVGELC---------AAAITMSDN-SAANLLLATVGG 140

Query: 214 M-SFTEFLHQKFTEPLKLNHTITSQDKWEDE 243
T FL Q + +T D+WE E
Sbjct: 141 PAGLTAFLRQ-------IGDNVTRLDRWETE 164


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_3220  ARGREPRESSOR  27     0.021    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 26.8 bits (59), Expect = 0.021
Identities = 14/46 (30%), Positives = 22/46 (47%), Gaps = 8/46 (17%)

Query: 38 IISVLCSQRATTQKELAEAIDKD-----QTTVVRMIQSMERKGIVK 78
I ++ + TQ EL + + KD Q TV R I+ + +VK
Sbjct: 10 IREIITANEIETQDELVDILKKDGYNVTQATVSRDIKEL---HLVK 52


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
GBAA_3221  MECHCHANNEL  33     0.002    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 32.9 bits (75), Expect = 0.002
Identities = 19/64 (29%), Positives = 30/64 (46%), Gaps = 13/64 (20%)

Query: 262 IITFLIAGHETTSGLLSFAIYFLLKNPDKLKKAYEEVDRVLTDSTPTYQQVMKLKYIRMI 321
+ FLI ++FAI+ +K +KL + EE PT ++V+ L IR +
Sbjct: 82 VFDFLI---------VAFAIFMAIKLINKLNRKKEEPA---AAPAPTKEEVL-LTEIRDL 128

Query: 322 LNES 325
L E
Sbjct: 129 LKEQ 132


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_3223  TCRTETB  128    2e-34    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 128 bits (323), Expect = 2e-34
Identities = 86/362 (23%), Positives = 165/362 (45%), Gaps = 19/362 (5%)

Query: 14 MLVILFIGAFVSFLNNSLLNVALPSIMKDLDIKDYSTIQWLSTGYMLVSGILIPASAFLI 73
+L+ L I +F S LN +LNV+LP I D + ST W++T +ML I L
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPAST-NWVNTAFMLTFSIGTAVYGKLS 73

Query: 74 TRFSNRSLFITSMMIFTLGTALAAVAPN-FGLLLTGRMVQAAGSSVMGPLLMNIMLVSFP 132
+ + L + ++I G+ + V + F LL+ R +Q AG++ L+M ++ P
Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIP 133

Query: 133 REKRGTAMGIFGLVMITAPAIGPTLSGYIVEYYDWRLLFEMILPLAIISLLLGIWKSENV 192
+E RG A G+ G ++ +GP + G I Y W L L + ++ + +
Sbjct: 134 KENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL-----LIPMITIITVPFLMKL 188

Query: 193 MRQNKNAK--LDYLSLLLSSIGFGGLLYGFSSASSDGWTNKVVVTTLILGAIALIAFIIR 250
+++ K D ++L S+G + +S S ++ LI+ ++ + F+
Sbjct: 189 LKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYS---------ISFLIVSVLSFLIFVKH 239

Query: 251 QLKMNEPLLDLRVYKYPMFALASVIAIVNAVAMFSGMILTPAYVQNVRGISPLSSG-LMM 309
K+ +P +D + K F + + + + + + P +++V +S G +++
Sbjct: 240 IRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVII 299

Query: 310 LPGAVIMGIMSPITGKLFDKYGPRILGIVGLSITAVSTYMLANLQLDSSHTHTILIYTLR 369
PG + + I I G L D+ GP + +G++ +VS + L +S TI+I +
Sbjct: 300 FPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVL 359

Query: 370 MF 371

Sbjct: 360 GG 361


92  GBAA_3412  GBAA_3419  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_3412  2       15       -2.143761  acetyltransferase
GBAA_3413  2       14       -2.133174  hypothetical protein
GBAA_3414  2       14       -2.223252  hypothetical protein
GBAA_3415  1       15       -2.561439  hypothetical protein
GBAA_3416  1       15       -2.417424  hypothetical protein
GBAA_3417  0       14       -1.454221  hypothetical protein
GBAA_3418  0       17       -0.977340  lipoprotein
GBAA_3419  -1      17       -0.733089  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_3412  SACTRNSFRASE  38     4e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 38.0 bits (88), Expect = 4e-06
Identities = 25/122 (20%), Positives = 48/122 (39%), Gaps = 13/122 (10%)

Query: 21 IPAYEIEAKYINSTAIPRLY--------DTIADIQSCDEIFYGYFYEDTLAGFISFKID- 71
IPA+E + Y ++ ++ + + Y+ E+ G I + +
Sbjct: 27 IPAFENGVWTYTEERFSKPYFKQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNW 86

Query: 72 KEEVDIHRLVVSPDHFHKGIATKLLLYIFDMFSSSKTY---IVQTGKENTPALSLYKKHG 128
I + V+ D+ KG+ T LL + ++ + +++T N A Y KH
Sbjct: 87 NGYALIEDIAVAKDYRKKGVGTALLHKAIE-WAKENHFCGLMLETQDINISACHFYAKHH 145

Query: 129 FI 130
FI
Sbjct: 146 FI 147


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_3416  TCRTETA  58     1e-11    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 58.3 bits (141), Expect = 1e-11
Identities = 41/314 (13%), Positives = 96/314 (30%), Gaps = 11/314 (3%)

Query: 7 PIRFMLISSFFMSFGYFAVYAFLAIYLLTFLHFSAVQ--VGTVLTVMTITSRIIPLFSGL 64
P+ +L + + G + L L +H + V G +L + + G
Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65

Query: 65 IADKIGYIIMMIAGLFLRGIGFIALGICSDFYTISISSALIGFGTAFYEPAARAIFGSQP 124
++D+ G +++ L + + + + + I + G A A I
Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITD 125

Query: 125 AHTRKNLFTYLNLSFNCGAIMGPIAGGFLLLLDPIYAFSLTGSLMLIFAFIFYLLKDHFQ 184
R F +++ F G + GP+ GG + P F +L + L
Sbjct: 126 GDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESH 185

Query: 185 VTTENTSITLGIQAILQNKSFLLFSFIMIFFYIMFT-QLTVALPLHMKNISNSNQLA--- 240
+ + + + + + F QL +P + I ++
Sbjct: 186 KGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDA 245

Query: 241 ---TLVITINAITGVIFMVLFRKLFLKY-NTLSFIKYGVLLMSISFLLIPLFQHPYWLFI 296
+ + I + + + G++ ++L+ F W+
Sbjct: 246 TTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILL-AFATRGWMAF 304

Query: 297 CVIFFTIGETLVLP 310
++ + +P
Sbjct: 305 PIMVLLASGGIGMP 318


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_3417  SYCDCHAPRONE  36     4e-05    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 36.4 bits (84), Expect = 4e-05
Identities = 24/133 (18%), Positives = 49/133 (36%), Gaps = 21/133 (15%)

Query: 69 YMKQKKWEEAKEALQKSISIQPSDEAYHNV-AVAHYNLGELEEASEFFLRVA----GDSD 123
+ K+E+A + Q + D + +G+ + A + A +
Sbjct: 46 QYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPR 105

Query: 124 YIMYSYVKCLIDLGRTKEAKEKLDAFNRESDNFLGEMMVAD------LYVELNCYKEAIE 177
+ ++ CL+ G EA+ L L + ++AD L ++ EAI+
Sbjct: 106 FPFHAAE-CLLQKGELAEAESGLF---------LAQELIADKTEFKELSTRVSSMLEAIK 155

Query: 178 WFEKGYKECWKSP 190
++ EC +P
Sbjct: 156 LKKEMEHECVDNP 168



Score = 30.7 bits (69), Expect = 0.004
Identities = 16/96 (16%), Positives = 33/96 (34%), Gaps = 2/96 (2%)

Query: 21 SRDVQSLNNLAWMYFYEEENDEKALELIGEVVKLNPSSYFPYNILRDIYMKQKKWEEAKE 80
S ++ L +LA Y+ E A ++ + L+ + L +++ A
Sbjct: 33 SDTLEQLYSLA-FNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIH 91

Query: 81 ALQKSISIQPSDEAYH-NVAVAHYNLGELEEASEFF 115
+ + + + + A GEL EA
Sbjct: 92 SYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGL 127


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_3419  TYPE3IMSPROT  27     0.005    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 27.4 bits (61), Expect = 0.005
Identities = 13/69 (18%), Positives = 29/69 (42%), Gaps = 5/69 (7%)

Query: 10 IGNIFWIIVFGIWAAIIWL--RDVDGAGVIQTPEIKSISLIVI---LIAFIIPVFFQVIW 64
+ + WII+ G ++ L ++ + ++ + +I ++ I F+
Sbjct: 150 LSILIWIIIKGNLVTLLQLPTCGIECITPLLGQILRQLMVICTVGFVVISIADYAFEYYQ 209

Query: 65 LIINLRMSK 73
I L+MSK
Sbjct: 210 YIKELKMSK 218


93  GBAA_3654  GBAA_3661  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
GBAA_3654  -1      14       0.604609  TetR family transcriptional regulator
GBAA_3655  0       13       0.929095  Gfo/Idh/MocA family oxidoreductase
GBAA_3656  -1      12       1.155995  DNA topoisomerase IV subunit A
GBAA_3657  -3      12       1.137630  DNA topoisomerase IV subunit B
GBAA_3659  -3      14       0.299727  CoA-binding domain-containing protein
GBAA_3660  -3      14       0.414097  serine protease
GBAA_3661  -2      14       0.713459  DNA-binding response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_3654  HTHTETR  66     1e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 66.2 bits (161), Expect = 1e-15
Identities = 27/168 (16%), Positives = 57/168 (33%), Gaps = 22/168 (13%)

Query: 8 KEKKKRAIKEAAFLLFSERGFNEVKIEHIAKEANVSQVTIYNHFGSKDALFRELIQEFII 67
++ ++ I + A LFS++G + + IAK A V++ IY HF K LF E+ +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL--- 65

Query: 68 CEFQYYKELAEEKLP-------------FHDMMQKMIVRKMNTGGLFQPDMLLQMMQRDE 114
EL E +++ + + + + +
Sbjct: 66 -SESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMA 124

Query: 115 ILREFIYSYQNEKILPWYLEILERAQRNNEI----NPHLTKEMMLLYI 158
++++ + E + L+ + +M YI
Sbjct: 125 VVQQAQRNLCLE-SYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYI 171


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_3657  ACRIFLAVINRP  31     0.015    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.3 bits (71), Expect = 0.015
Identities = 12/49 (24%), Positives = 22/49 (44%), Gaps = 1/49 (2%)

Query: 455 INTEKAKLADIFKNEEINTIIYAIGGGVGNEFDVEDINYDKVVIMTDAD 503
++ EKA+ + ++ TI A+GG N+F + + DA
Sbjct: 730 VDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKK-LYVQADAK 777


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
GBAA_3660  V8PROTEASE  66     4e-14    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 66.2 bits (161), Expect = 4e-14
Identities = 33/166 (19%), Positives = 62/166 (37%), Gaps = 38/166 (22%)

Query: 134 NKAYIVTNNHVVDGANKLAVKLS------------DGKKVDAKLVGKDPWLDLAVVEI-- 179
K ++TN HVVD + L +G ++ DLA+V+
Sbjct: 110 GKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSP 169

Query: 180 --DGANVN---KVATLGDSSKIRAGEKAIAIGNPLGFDG---SVTEGIISSKEREIPVDI 231
++ K AT+ ++++ + + G P ++G I+ + E
Sbjct: 170 NEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMWESKGKITYLKGEA---- 225

Query: 232 DGDKRADWNAQVIQTDAAINPGNSGGALFNQNGEIIGINSSKIAQQ 277
+Q D + GNSG +FN+ E+IGI+ + +
Sbjct: 226 ------------MQYDLSTTGGNSGSPVFNEKNEVIGIHWGGVPNE 259


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
GBAA_3661  HTHFIS  88     2e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.3 bits (219), Expect = 2e-22
Identities = 34/164 (20%), Positives = 76/164 (46%), Gaps = 16/164 (9%)

Query: 4 TVLLVEDERRLREIVSDYFRNEGFEVIEAEDGKKALELFAEHEIDLIMLDIMLPEIDGWS 63
T+L+ +D+ +R +++ G++V + A + DL++ D+++P+ + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 VCRRIRKESA-VPIIMLTARSDEDDTLLGFELGADEYVTKPFSPKVLVA---RAKTLLKR 119
+ RI+K +P+++++A++ + E GA +Y+ KPF L+ RA KR
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 120 ADGVVGVAEENAMSLAGIE------------VNRLSRTVLVDGE 151
+ ++ M L G + + T+++ GE
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGE 168


94  GBAA_4203  GBAA_4211  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_4203  -2      12       -1.593518  diguanylate phosphodiesterase
GBAA_4204  -2      11       -0.760683  short chain dehydrogenase
GBAA_4205  -2      10       -2.328436  Ser/thr protein phosphatase
GBAA_4207  -1      14       -2.888566  hypothetical protein
GBAA_4208  -1      17       -3.550729  polyphosphate kinase
GBAA_4209  2       19       -4.856792  ppx/GppA phosphatase
GBAA_4210  0       21       -5.763688  hypothetical protein
GBAA_4211  -1      18       -2.717543  lipoprotein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4203  FbpA_PF05833  36     3e-04    Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 35.6 bits (82), Expect = 3e-04
Identities = 18/151 (11%), Positives = 44/151 (29%), Gaps = 1/151 (0%)

Query: 126 EQFNHLLMYYRTYGIQISINKVGTGTSN-LERISVLAPDILKVDLTNLRQTALLQSYQDI 184
+ + +K+ TG S L +DL+ +++ +D+
Sbjct: 179 DMIENFTKENSLQLNDNIFSKIFTGVSKTLSSEICFRLKNNSIDLSLSNLKEIVEVCKDL 238

Query: 185 LYSLSLLARRIGATLLYEEIDAFYQLQYAWKNGGRYYQGNYLKECLPDFIETNVLKERLG 244
+ FY L K + Q + + L +F +RL
Sbjct: 239 FKEIQSNKFEFNCYTKNNSFVGFYCLNLMSKEDYKKIQYDSSSKLLENFYYAKDKSDRLK 298

Query: 245 NECHQFIQHEKKKLQKIYNLTEMLRDRIGDV 275
++ + + + ++L + +
Sbjct: 299 SKSSDLQKIVMNNINRCTKKDKILNNTLKKC 329


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4204  DHBDHDRGNASE  101    5e-28    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 101 bits (253), Expect = 5e-28
Identities = 73/261 (27%), Positives = 122/261 (46%), Gaps = 25/261 (9%)

Query: 4 KVVIITGGSSGMGKGMATRFAKEGARVVITGRTKEKLEEAKLEI-------EQFPGQILT 56
K+ ITG + G+G+ +A A +GA + EKLE+ + E FP
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP----- 63

Query: 57 VQMDVRNTDDIQKMIEQIDEKFGRIDILINNAAGNFICPAEDLSVNGWNSVINIVLNGTF 116
DVR++ I ++ +I+ + G IDIL+N A LS W + ++ G F
Sbjct: 64 --ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVF 121

Query: 117 YCSQAIGKYWIEKGIKGNIINMVATYAWDAGPGVIHSAAAKAGVLAMTKTLAVEWGRKYG 176
S+++ KY +++ G+I+ + + A + A++KA + TK L +E +Y
Sbjct: 122 NASRSVSKYMMDRR-SGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA-EYN 179

Query: 177 IRVNAIAPGPIERTGGADKLWISEEMAKRTIQ--------SVPLGRLGTPEEIAGLAYYL 228
IR N ++PG T LW E A++ I+ +PL +L P +IA +L
Sbjct: 180 IRCNIVSPGST-ETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFL 238

Query: 229 CSDEAAYINGTCMTMDGGQHL 249
S +A +I + +DGG L
Sbjct: 239 VSGQAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4207  TYPE3IMRPROT  26     0.017    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 25.9 bits (57), Expect = 0.017
Identities = 6/39 (15%), Positives = 8/39 (20%)

Query: 23 FFPFFGVPFLAGIAGGLLGGALAFGPRPYYPPYPPPFPP 61
P + L + F P P P
Sbjct: 29 TAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFS 67


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_4211  cloacin  27     0.047    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 27.4 bits (60), Expect = 0.047
Identities = 16/109 (14%), Positives = 33/109 (30%)

Query: 33 AFENAAKQEKTMFEDAKKLETLEKEGQELYNQIVQEGKDNNQTVKEKLNQAVKNTDEREK 92
+ A K + K+ + +++ Q Q + +N D K
Sbjct: 357 ELDAANKTLADAIAEIKQFNRFAHDPMAGGHRMWQMAGLKAQRAQTDVNNKQAAFDAAAK 416

Query: 93 VLKKEKESLNKAQEEVKSADKYVKKIEDKKLKDQADKVKSTYEKRHDSF 141
+L+ A E K + + E+ ++ K + HD
Sbjct: 417 EKSDADAALSSAMESRKKKEDKKRSAENNLNDEKNKPRKGFKDYGHDYH 465


95  GBAA_4463  GBAA_4471  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_4463  -1      12       -1.551908  competence protein ComG
GBAA_4464  -1      10       -0.903378  competence protein ComG
GBAA_4465  -2      10       -1.090548  competence protein ComG
GBAA_4466  -3      12       -0.413838  competence protein ComG
GBAA_4467  -1      15       -1.710173  hypothetical protein
GBAA_4468  -2      15       -1.574333  hypothetical protein
GBAA_4469  1       16       -1.902718  sodium:dicarboxylate symporter family protein
GBAA_4471  3       22       -3.333541  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4463  BCTERIALGSPH  42     2e-07    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 41.9 bits (98), Expect = 2e-07
Identities = 19/75 (25%), Positives = 37/75 (49%), Gaps = 1/75 (1%)

Query: 1 MKQKGFTLLEMLLVLFAISVLSMVTYFNVHSLYEKQKIEQFLRQFSNDILYMQQLAINRQ 60
M+Q+GFTLLEM+L+L + V + + + + + R F + ++QQ +
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLAR-FEAQLRFVQQRGLQTG 59

Query: 61 KHYTLRWHKDRHMYY 75
+ + + H DR +
Sbjct: 60 QFFGVSVHPDRWQFL 74


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4464  BCTERIALGSPG  50     2e-11    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 50.3 bits (120), Expect = 2e-11
Identities = 18/65 (27%), Positives = 41/65 (63%)

Query: 1 MQNEEGFTLLEMLLVMVVITVLLLLIIPDVVTQRSSVEGKGCKAYVKSIEAQVQVYQLQH 60
+ GFTLLE+++V+V+I VL L++P+++ + + + + + ++E + +Y+L +
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN 63

Query: 61 NKIPT 65
+ PT
Sbjct: 64 HHYPT 68


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4465  BCTERIALGSPF  91     9e-23    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 91.4 bits (227), Expect = 9e-23
Identities = 61/350 (17%), Positives = 150/350 (42%), Gaps = 22/350 (6%)

Query: 7 SLSDQVILLKRLGELLEKGYSLLQALEFLRFQLPLEKKVQLQRMIDGLKD----GKSLHD 62
S SD +L ++L L+ L +AL+ + Q +K L +++ ++ G SL D
Sbjct: 66 STSDLALLTRQLATLVAASMPLEEALDAVAKQS---EKPHLSQLMAAVRSKVMEGHSLAD 122

Query: 63 SFHQLKFHQEMLSYLFYA-----EQHGDISFALQQGSALLYKKDKYRKDMIKIMQYPMFL 117
+ +K L+ A E G + L + + ++ + R + + M YP L
Sbjct: 123 A---MKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVL 179

Query: 118 AIFLIIMILIFNRILLPQVDMVYSSFGSTAPLFTEQILSTIKLL----PYLIISTLFIIM 173
+ I ++ I +++P+V + PL T ++ + P+++++ ++
Sbjct: 180 TVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLA---LLA 236

Query: 174 IVFGVYIVYFRKLPHMKQVKIILRIPLVKTFLILKHSHYFATQLSGLLHGGLSVLEALTI 233
++ ++ + + +L +PL+ ++ +A LS L + +L+A+ I
Sbjct: 237 GFMAFRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRI 296

Query: 234 MMEQKYHPFFQYEAGRIERQLIAGEPLQSIIAKSEYYEEELSYIITHGQANGNLAIELGD 293
+ + + ++ + G L + ++ + + ++I G+ +G L L
Sbjct: 297 SGDVMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLER 356

Query: 294 YSDLIMEKMERKIKRMLVIIQPILFTCIGGIVVLMYLAMIMPMFQMMNSI 343
+D + ++ L + +P+L + +V+ + LA++ P+ Q+ +
Sbjct: 357 AADNQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_4471  LIPPROTEIN48  27     0.033    Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 26.9 bits (59), Expect = 0.033
Identities = 16/68 (23%), Positives = 31/68 (45%), Gaps = 2/68 (2%)

Query: 39 IEKNMELFIELIRD-KENPFETGYSSSISIAVLDEEGKMIEFYTVPIWECCSYFL-GVPL 96
IE + F L + KE+ F TGY+ + ++ DE +++ + + + F G
Sbjct: 157 IETEYKWFYSLQFNIKESAFTTGYAIASWLSEQDESKRVVASFGGGAFPGVTTFNEGFAK 216

Query: 97 QIRFWGSK 104
I ++ K
Sbjct: 217 GILYYNQK 224


96  GBAA_4920  GBAA_4924  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_4920  0       12       -0.787891  DNA-binding response regulator
GBAA_4921  -1      12       -1.122834  sensor histidine kinase
GBAA_4922  -1      19       -0.789365  ankyrin repeat-containing protein
GBAA_4923  0       19       -1.974296  Gfo/Idh/MocA family oxidoreductase
GBAA_4924  1       17       -1.763992  large conductance mechanosensitive channel
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
GBAA_4920  HTHFIS  91     3e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 90.7 bits (225), Expect = 3e-23
Identities = 35/152 (23%), Positives = 74/152 (48%), Gaps = 7/152 (4%)

Query: 3 RILLIEDEVSIAELQRDYLEINDFQVDVEHSGETGLQMALQEDYDLIILDIMLPKMNGFE 62
IL+ +D+ +I + L + V + + T + D DL++ D+++P N F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 ICKQIRAI-KDIPILLVSAKKEDIDKIRGLGLGADDYITKPFSPSELVARVKAHISRYER 121
+ +I+ D+P+L++SA+ + I+ GA DY+ KPF +EL+ + ++ +R
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 LLGNVSKQ-RDTLYIHGIS-----IDQRARKV 147
+ +D + + G S I + ++
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARL 156


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_4921  PF06580  34     0.001    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 33.7 bits (77), Expect = 0.001
Identities = 18/101 (17%), Positives = 37/101 (36%), Gaps = 24/101 (23%)

Query: 379 LIHNSVKY---MDKEEKKITVTVSSDNNKVIVKVMDNGSGIESDTLPYIFERFYRAEQSR 435
L+ N +K+ + KI + + DN V ++V + GS +T
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNT--------------- 307

Query: 436 NSSTGGSGLGLAIAKQIIEEHGGN---IWAESELGEGTSIF 473
+G GL ++ ++ G I + G+ ++
Sbjct: 308 ---KESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
GBAA_4922  HTHFIS  29     0.017    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 28.6 bits (64), Expect = 0.017
Identities = 16/96 (16%), Positives = 33/96 (34%), Gaps = 14/96 (14%)

Query: 100 GGTALIPASEHGYVDVIKELLTRTNIDVNHVNNLGWTALMEAIVLSNGNETQQQVIRLLI 159
G T L+ + V+ + L+R DV +N L I L++
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNA--ATLWRWI--------AAGDGDLVV 52

Query: 160 EHGADINIPDNDGVTPLEHARAHHFEEIEKILLEGH 195
D+ +PD + L + ++ +++
Sbjct: 53 ---TDVVMPDENAFDLLPRIKKAR-PDLPVLVMSAQ 84


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
GBAA_4924  MECHCHANNEL  145    2e-48    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 145 bits (368), Expect = 2e-48
Identities = 76/134 (56%), Positives = 96/134 (71%), Gaps = 9/134 (6%)

Query: 1 MWNEFKKFAFKGNVIDLAVGVVIGAAFGKIVSSLVKDIITPLLGMVLGGVDFTDLKITFG 60
+ EF++FA +GNV+DLAVGV+IGAAFGKIVSSLV DII P LG+++GG+DF +T
Sbjct: 3 IIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVTLR 62

Query: 61 KS-------SIMYGNFIQTIFDFLIIAAAIFMFVKVFNKLTSKREEEKEEEIPEPTKEEE 113
+ + YG FIQ +FDFLI+A AIFM +K+ NKL K+EE P PTKEE
Sbjct: 63 DAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRKKEEPAAA--PAPTKEEV 120

Query: 114 LLGEIRDLLKQQNS 127
LL EIRDLLK+QN+
Sbjct: 121 LLTEIRDLLKEQNN 134


97  GBAA_5302  GBAA_5314  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias   Product
GBAA_5302  1       21       1.118310  hypothetical protein
GBAA_5307  2       22       2.028587  hypothetical protein
GBAA_5308  1       21       2.398711  aldo/keto reductase
GBAA_5309  1       18       2.472751  major facilitator family transporter protein
GBAA_5311  1       22       2.295452  hypothetical protein
GBAA_5312  1       15       1.528765  hypothetical protein
GBAA_5313  -1      12       1.282096  pyridine nucleotide-disulfide oxidoreductase
GBAA_5314  -1      14       0.273301  tyrosyl-tRNA synthetase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_5302  NUCEPIMERASE  32     0.002    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 31.7 bits (72), Expect = 0.002
Identities = 19/71 (26%), Positives = 32/71 (45%), Gaps = 4/71 (5%)

Query: 1 MKIGIIGAAGKAGSRILKEALDRGHEVTAI-VRNT---AKITEENVKVLEKDVFALTSND 56
MK + GAAG G + K L+ GH+V I N + + +++L + F D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 57 LQAFDVVVNAF 67
L + + + F
Sbjct: 61 LADREGMTDLF 71


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_5309  TCRTETA  63     5e-13    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 62.9 bits (153), Expect = 5e-13
Identities = 77/342 (22%), Positives = 136/342 (39%), Gaps = 32/342 (9%)

Query: 11 VQTNRRSMFALLALAISAFGIGTTEFISVGLLPSISKDLNVSVTTA---GLTVSLYALGA 67
++ NR + L +A+ A GIG + + +LP + +DL S G+ ++LYAL
Sbjct: 1 MKPNRPLIVILSTVALDAVGIG----LIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQ 56

Query: 68 AVGAPVLTALTASMSRKTLLMWIMVIFIIGNGIAAVATSFTILIIARIVSAFAHGVFMSI 127
APVL AL+ R+ +L+ + + I A A +L I RIV+
Sbjct: 57 FACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA 116

Query: 128 GSTIAAAIVPENKRASAIAIMFTGLTVATITGVPIGTFIGQQFGWRASFMAIVVIGIIAF 187
G+ I A I ++RA M + G +G +G F A F A + + F
Sbjct: 117 GAYI-ADITDGDERARHFGFMSACFGFGMVAGPVLGGLMG-GFSPHAPFFAAAALNGLNF 174

Query: 188 IANSILVPSNLK------NGVPVSFRDQFKLIKNGR-----LLLVFIITALGYGGT--FV 234
+ L+P + K ++ F+ + + + FI+ +G +V
Sbjct: 175 LTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWV 234

Query: 235 TFTYLSPLLQEVTGFEASTVTIILLVYGIAIAIGN-MVGGKLSNH-NPIRALFYMFLIQA 292
F ++ ++A+T+ I L +GI ++ M+ G ++ RAL +
Sbjct: 235 IFG------EDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADG 288

Query: 293 IILFVLTFTAPFKVAGLITIIFMGLFAFMNVPGLQVYVVILA 334
+L F +A I ++ M P LQ +
Sbjct: 289 TGYILLAFATRGWMAFPIMVLLASGGIGM--PALQAMLSRQV 328


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_5312  PF07472  28     0.017    Fucose-binding lectin II
		>PF07472#Fucose-binding lectin II

Length = 245

Score = 27.7 bits (61), Expect = 0.017
Identities = 21/66 (31%), Positives = 31/66 (46%), Gaps = 8/66 (12%)

Query: 15 VQISASQGQLDVLDQLLKPEVQESLTTLVEQLPKLTELVNILTKSYDFAQTVATDEVLKS 74
VQ + Q LD + Q + T LVE+LP+ V+I T Y F ++V K+
Sbjct: 24 VQANGDQAVLDRMRQFMT-------TQLVEKLPQYDVFVDIATIPYSFDVGSWQNKV-KA 75

Query: 75 DTVGAI 80
D G +
Sbjct: 76 DAAGQV 81


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
GBAA_5314  TACYTOLYSIN  30     0.028    Bacterial thiol-activated pore-forming cytolysin sig...
		>TACYTOLYSIN#Bacterial thiol-activated pore-forming cytolysin signature.

Length = 574

Score = 29.6 bits (66), Expect = 0.028
Identities = 23/92 (25%), Positives = 36/92 (39%), Gaps = 18/92 (19%)

Query: 333 DEIEQGFKEMPTFQSSKETKNIVEWLVDLGIEPSRRQAREDINNGAISMN---------- 382
D I+ KEMP + KE K + + S E+IN+ S+N
Sbjct: 77 DMIKLAPKEMPLESAEKEEKKSED------NKKSEEDHTEEINDKIYSLNYNELEVLAKN 130

Query: 383 GEKVTDVGTDVTVENSFDGRFIIIRKGKKNYS 414
GE + + +FI+I + KKN +
Sbjct: 131 GETIENF--VPKEGVKKADKFIVIERKKKNIN 160


98  GBAA_5685  GBAA_5697  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_5685  -2      14       -0.818399  TetR family transcriptional regulator
GBAA_5686  -2      14       -0.738525  AcrB/AcrD/AcrF family transporter
GBAA_5687  -2      16       -0.944531  bifunctional methionine sulfoxide reductase A/B
GBAA_5688  -1      15       -0.829671  hypothetical protein
GBAA_5689  1       14       0.470918   antiholin-like protein LrgB
GBAA_5690  0       11       0.554328   murein hydrolase regulator LrgA
GBAA_5691  1       10       0.858200   response regulator LytR
GBAA_5692  -1      9        1.204427   sensor histidine kinase LytS
GBAA_5693  1       9        0.920921   major facilitator family transporter protein
GBAA_5694  2       12       0.575211   BCCT family osmoprotectant transporter
GBAA_5695  2       12       -0.395463  nitric-oxide synthase, oxygenase subunit
GBAA_5696  4       13       -1.186729  superoxide dismutase
GBAA_5697  2       13       -2.381309  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_5685  HTHTETR  63     5e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 63.1 bits (153), Expect = 5e-14
Identities = 21/62 (33%), Positives = 38/62 (61%)

Query: 2 KEKERLIIEMAMKLFATKGVNATSVQEIVTACGISKGAFYLYFKSKEELLLATLRYYYDK 61
+E + I+++A++LF+ +GV++TS+ EI A G+++GA Y +FK K +L
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 62 IQ 63
I
Sbjct: 70 IG 71


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_5686  ACRIFLAVINRP  669    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 669 bits (1728), Expect = 0.0
Identities = 240/1066 (22%), Positives = 459/1066 (43%), Gaps = 68/1066 (6%)

Query: 4 IINFSLKNKFAVWLLTIIVTIAGIYSGLNMKLETIPDITTPVVTVTTVYPGATPEEVADK 63
+ NF ++ W+L II+ +AG + L + + P I P V+V+ YPGA + V D
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 64 VSKPMEEQLQNLSGVNVVSSSSFQNASS-IQVEYDFDKNMEKAETEIKDALANVK--LPE 120
V++ +E+ + + + +SS+S S I + + + + A+ ++++ L LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 GVKDPKVSRVNF--NAFPVISLSVASKNESLATLTENVEKNVVPGLKGLDGVASVQISGQ 178
V+ +S + V + + +++ V NV L L+GV VQ+ G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 179 QVDEVQLVFKKDKMKELGLSEDTVKNVIKGSDVSLPLGLYTFKDT------EKSVVVDGN 232
Q +++ D + + L+ V N +K + + G S++
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 233 ITTMKALKELKIPAVPSSASSQGSQTAGAGAQMPQMNPAAMNGIPTVTLSEIADIKEVGK 292
+ ++ + +G V L ++A ++ G+
Sbjct: 240 FKNPEEFGKVTLRVNS-------------------------DGSV-VRLKDVARVELGGE 273

Query: 293 A-ESISRTNGKEAIGIQIVKAADANTVDVVNAVKDKVKELEKKY-KDLEIISTFDQGAPI 350
I+R NGK A G+ I A AN +D A+K K+ EL+ + + ++++ +D +
Sbjct: 274 NYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFV 333

Query: 351 EKSVETMLSKAIFGAIFAIVIIMLFLRNIRTTLISVVSIPLSLLIAVLVIKQMDITLNIM 410
+ S+ ++ + +++ LFL+N+R TLI +++P+ LL ++ ++N +
Sbjct: 334 QLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTL 393

Query: 411 TLGAMTVAIGRVVDDSIVVIENIYRRMSLSEEKLRGKDLIREATKEMFIPIMSSTIVTIA 470
T+ M +AIG +VDD+IVV+EN+ R M E+KL K+ ++ ++ ++ +V A
Sbjct: 394 TMFGMVLAIGLLVDDAIVVVENVERVM--MEDKLPPKEATEKSMSQIQGALVGIAMVLSA 451

Query: 471 VFLPLGLVKGMIGEMFLPFALTIVFALLASLLVAVTIVPMLAHSLFKKESMREKEVHH-- 528
VF+P+ G G ++ F++TIV A+ S+LVA+ + P L +L K S E
Sbjct: 452 VFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGF 511

Query: 529 ----EEKPSKLANIYKRILAWALNHKIITSSIAVLLLVGSLALVPIIGVSFLPSEEEKMI 584
N Y + L I L++ G + L + SFLP E++ +
Sbjct: 512 FGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVF 571

Query: 585 IATYNPEPGQTLEDVEKIATKAEKHFQDNKDVKTIQ--FSLGGENPMSPGQSNQAMFFVQ 642
+ G T E +K+ + ++ + ++ F++ G + Q N M FV
Sbjct: 572 LTMIQLPAGATQERTQKVLDQVTDYYL-KNEKANVESVFTVNGFSFSGQAQ-NAGMAFVS 629

Query: 643 YD--NDTKNFEKEKEQVVKDLQKMSGKGEWKN---------QDFGASGGSNEIKLYVYGD 691
+ E E V+ + GK + G + G + + G
Sbjct: 630 LKPWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGL 689

Query: 692 SSEDIKPVVKDIQNIMKKN-KDLKDIDSSIAKTYAEYTLVADQEKLSKMGLTAAQIGMGL 750
+ + + + ++ L + + + A++ L DQEK +G++ + I +
Sbjct: 690 GHDALTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTI 749

Query: 751 SNQHDRPVLTTIKKDGKDVNVYVEAEKQTYETIDDLTNRKITTPLGNEVAVKDVMTVKEG 810
S + G+ +YV+A+ + +D+ + + G V T
Sbjct: 750 STALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWV 809

Query: 811 ETSNTVKHRDGRVYAEVSAKLTSDDVSK-ASAAVQKEVDKMDLPSGVDVSMGGVTKDIEE 869
S ++ +G E+ + S A A ++ K LP+G+ G++
Sbjct: 810 YGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASK--LPAGIGYDWTGMSYQERL 867

Query: 870 SFKQLGLAMLAAIAIVYFVLVVTFGGALAPFAILFSLPFTIIGALVALLISGETLSVSAM 929
S Q + + +V+ L + P +++ +P I+G L+A + + V M
Sbjct: 868 SGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFM 927

Query: 930 IGALMLIGIVVTNAIVLIDRVIH-KENEGLSTREALLEAGATRLRPILMTAIATIGALIP 988
+G L IG+ NAI++++ E EG EA L A RLRPILMT++A I ++P
Sbjct: 928 VGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLP 987

Query: 989 LALGFEGSGLISKGLGVTVIGGLTSSTLLTLLIVPIVYEVLSKFKK 1034
LA+ +G+ V+GG+ S+TLL + VP+ + V+ + K
Sbjct: 988 LAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRCFK 1033



Score = 93.4 bits (232), Expect = 2e-21
Identities = 93/518 (17%), Positives = 198/518 (38%), Gaps = 46/518 (8%)

Query: 546 ALNHKIITSSIAVLLLVGSLALVPIIGVSFLPSEEEKM--IIATYNPEPGQTLEDVE-KI 602
+ I +A++L++ + + V+ P+ + A Y PG + V+ +
Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANY---PGADAQTVQDTV 61

Query: 603 ATKAEKHFQDNKDVKTIQFSLGGENPMSPGQSNQAMFFVQYDNDTKNFEKEKEQVVKDLQ 662
E++ ++ + S S ++ + + + QV LQ
Sbjct: 62 TQVIEQNMNGIDNLMYMS---------STSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQ 112

Query: 663 KMSGK--GEWKNQDFGASGGSNEIKLYVYGDSSEDIKPVVKDIQNIMKKN--KDLKDID- 717
+ E + Q S+ L V G S++ DI + + N L ++
Sbjct: 113 LATPLLPQEVQQQGISVEKSSSSY-LMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNG 171

Query: 718 ----SSIAKTYAEYTLVADQEKLSKMGLTAAQIGMGLSNQHDR----PVLTTIKKDGKDV 769
YA + D + L+K LT + L Q+D+ + T G+ +
Sbjct: 172 VGDVQLFGAQYA-MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQL 230

Query: 770 NVYVEAEKQTYETIDDLTNRKI-TTPLGNEVAVKDVMTVKEG--ETSNTVKHRDGRVYAE 826
N + A+ + ++ + G+ V +KDV V+ G + +
Sbjct: 231 NASIIAQTRFK-NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGL 289

Query: 827 VSAKLTSDDVSKASAAVQKEVDKM--DLPSGVDVSMGGVTKD----IEESFKQLGLAMLA 880
T + + A++ ++ ++ P G+ V D ++ S ++ +
Sbjct: 290 GIKLATGANALDTAKAIKAKLAELQPFFPQGMKVL---YPYDTTPFVQLSIHEVVKTLFE 346

Query: 881 AIAIVYFVLVVTFGGALAPFAILFSLPFTIIGALVALLISGETLSVSAMIGALMLIGIVV 940
AI +V+ V+ + A ++P ++G L G +++ M G ++ IG++V
Sbjct: 347 AIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLV 406

Query: 941 TNAIVLIDRVI-HKENEGLSTREALLEAGATRLRPILMTAIATIGALIPLALGFEGS-GL 998
+AIV+++ V + L +EA ++ + ++ A+ IP+A F GS G
Sbjct: 407 DDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAF-FGGSTGA 465

Query: 999 ISKGLGVTVIGGLTSSTLLTLLIVPIVYEVLSKFKKKK 1036
I + +T++ + S L+ L++ P + L K +
Sbjct: 466 IYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAE 503


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
GBAA_5691  HTHFIS  65     3e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 65.2 bits (159), Expect = 3e-14
Identities = 30/126 (23%), Positives = 57/126 (45%), Gaps = 6/126 (4%)

Query: 3 KVLVVDDEMLARDELKYLLERTK-EVEIIGEADCVEDALEELMKNKPDIVFLDIQLSDDN 61
+LV DD+ R L L R +V I A + D+V D+ + D+N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNA---ATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GFEIANILKKMKNPPAIVFATAYDQY--ALQAFEVDALDYILKPFDEERIVQTLKKYKKQ 119
F++ +KK + ++ +A + + A++A E A DY+ KPFD ++ + + +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 120 KQSQIE 125
+ +
Sbjct: 122 PKRRPS 127


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_5692  PF06580  229    3e-72    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 229 bits (586), Expect = 3e-72
Identities = 65/216 (30%), Positives = 111/216 (51%), Gaps = 13/216 (6%)

Query: 359 QLELGEAELQSKLLQDAEIKALQAQINPHFLFNAINTVSALCRTDVEKARKLLLQLSVYF 418
Q E+ + ++ + Q+A++ AL+AQINPHF+FNA+N + AL D KAR++L LS
Sbjct: 146 QAEIDQWKMA-SMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELM 204

Query: 419 RCNLQGARQLLIPLEQELNHVQAYLSLEQARFPNKYEVKMYIEDELKTTLVPPFVLQLLV 478
R +L+ + + L EL V +YL L +F ++ + + I + VPP ++Q LV
Sbjct: 205 RYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLV 264

Query: 479 ENALRHAFPKKQPVCEVEVHVFEKEGMVHFEVKDNGQGIEEERLEQLGKMVVSSKKGTGT 538
EN ++H + ++ + + G V EV++ G + +K+ TGT
Sbjct: 265 ENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN-----------TKESTGT 313

Query: 539 ALYNINERLIGLFGKETMLHIESEVNEGTEITFVIP 574
L N+ ERL L+G E + + + + +IP
Sbjct: 314 GLQNVRERLQMLYGTEAQIKLSEKQGKVNA-MVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_5693  TCRTETB  54     6e-10    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 54.1 bits (130), Expect = 6e-10
Identities = 81/411 (19%), Positives = 148/411 (36%), Gaps = 28/411 (6%)

Query: 35 LDMLLLSFVLVYILKEFHLSPVEGGNLTLATTIGMLIGSYLFGFIADLFGRIRTMAFTIL 94
L+ ++L+ L I +F+ P + A + IG+ ++G ++D G R + F I+
Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87

Query: 95 LFSLATALIYFATDYWQLLIL-RFLVGMGVGGEFGIGMAIVTETWSKEMRAKATSVVALG 153
+ + + + ++ LLI+ RF+ G G + M +V KE R KA ++
Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSI 147

Query: 154 WQFGVLIASLLPAFIVPHFGWRAVFLFGLIPALLAVYVRKSLSEPKIWEQKQRYKKELLQ 213
G + + I + W + L +I + ++ K L + + K +L
Sbjct: 148 VAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILM 207

Query: 214 KEAEGN--LTTTEAA-----------------QLKQMKKFPLRKLFANKKVTITTIGLII 254
L TT + K F L N I ++
Sbjct: 208 SVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMI----GVL 263

Query: 255 MSFIQNFGYYGIFTWMPTILANKYNYTLAKA-SGWMFISTIGMLIGIATFGILADKIGRR 313
I G + +P ++ + + + A+ S +F T+ ++I GIL D+ G
Sbjct: 264 CGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPL 323

Query: 314 KTFTIYYVGGTIYCLIY-FFLFTDSTLLLWG-SALLGFFANGMMGGFGAVLAENYPAEAR 371
I ++ L F L T S + +LG + V + EA
Sbjct: 324 YVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAG 383

Query: 372 STAENFIFGTGRGLAGFGPVIIGLLAAGGNLMGALSLIFIIYPIGLVTMLL 422
+ F + G G I+G L + L L + + L + LL
Sbjct: 384 AGMSLLNFTSFLS-EGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYLYSNLL 433


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits          Score  E-value  Comments
GBAA_5697  NUCEPIMERASE  36     1e-04    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 36.3 bits (84), Expect = 1e-04
Identities = 12/26 (46%), Positives = 15/26 (57%)

Query: 5 KVLVLGGTRFFGKHLVEALLKDGHDV 30
K LV G F G H+ + LL+ GH V
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQV 27


99  GBAA_5710  GBAA_5716  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag   DNBias  CDNBias  %GCBias    Product
GBAA_5710  -3      11       -0.077536  serine protease
GBAA_5711  -2      10       0.024430   metallo-beta-lactamase
GBAA_5712  -1      11       0.759905   hypothetical protein
GBAA_5713  0       12       1.147292   hypothetical protein
GBAA_5714  -1      10       1.334251   sensory box histidine kinase YycG
GBAA_5715  -1      11       1.231233   DNA-binding response regulator YycF
GBAA_5716  -1      12       1.118746   ****adenylosuccinate synthetase
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits        Score  E-value  Comments
GBAA_5710  V8PROTEASE  58     2e-11    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 58.1 bits (140), Expect = 2e-11
Identities = 34/179 (18%), Positives = 64/179 (35%), Gaps = 40/179 (22%)

Query: 95 SEADSEAGTGSG-VIYKKTNDQAYIVTNNHVVAGANRIEVSLS------------DGKKV 141
EA + SG V+ K T ++TN HVV + +L +G
Sbjct: 95 VEAPTGTFIASGVVVGKDT-----LLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFT 149

Query: 142 PGKVLGTDVVTDLAVLEIDA----KHVKKVIE---IGDSNAVRRGEPVIAIGNPLGLQFS 194
++ DLA+++ KH+ +V++ + ++ + + + G P +
Sbjct: 150 AEQITKYSGEGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVA 209

Query: 195 GTVTQGIISANERIVPVDLDQDGHYDWQVEVLQTDAAINPGNSGGALVNAAGQLIGINS 253
T + E +Q D + GNSG + N ++IGI+
Sbjct: 210 ---TMWESKGKITYLKG------------EAMQYDLSTTGGNSGSPVFNEKNEVIGIHW 253


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
GBAA_5714  PF06580  38     1e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.5 bits (87), Expect = 1e-04
Identities = 17/104 (16%), Positives = 35/104 (33%), Gaps = 25/104 (24%)

Query: 502 VLYNIISNALKY----SPEGGTVTYRLRDRGELLEISVSDQGMGIPKENVDKIFERFYRV 557
++ ++ N +K+ P+GG + + + + V + G K
Sbjct: 259 LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN------------ 306

Query: 558 DKARSRQMGGTGLGLAIAKEMIEAHGG---SIWAKSEEGKGTTI 598
TG GL +E ++ G I ++GK +
Sbjct: 307 ------TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM 344


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits    Score  E-value  Comments
GBAA_5715  HTHFIS  95     1e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 94.9 bits (236), Expect = 1e-24
Identities = 31/140 (22%), Positives = 69/140 (49%), Gaps = 4/140 (2%)

Query: 1 MMGKKILVVDDEKPIADILKFNLEKEGFEIVMAHDGDEAIEKATEEQPDMVLLDIMLPGK 60
M G ILV DD+ I +L L + G+++ + + D+V+ D+++P +
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGLEVCREIRK-SSEMPIIMLTAKDSEIDKVLGLELGADDYVTKPFS---TRELLARVKA 116
+ ++ I+K ++P+++++A+++ + + E GA DY+ KPF ++ R A
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 117 NLRRHQQGGAAEKEENTEMV 136
+R + ++ +V
Sbjct: 121 EPKRRPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits         Score  E-value  Comments
GBAA_5716  HELNAPAPROT  28     0.040    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 28.3 bits (63), Expect = 0.040
Identities = 9/48 (18%), Positives = 20/48 (41%), Gaps = 7/48 (14%)

Query: 152 DREAFKEKLEQNLAQKNRLFEK-------MYDTEGFSVDEIFEEYFEY 192
++ + L L+ L+ K + F++ E FEE +++
Sbjct: 9 NQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDH 56
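
The statistics lines printed above each alignment ("Score = ... bits (...), Expect = ..." and "Identities = a/b (p%)") follow a regular pattern, so they can be collected programmatically when screening many hits. The following is a minimal Python sketch, not part of PredictBias itself: the function name parse_hit_stats, the dictionary keys, and the strict_cutoff value are hypothetical choices made for illustration, and the sample text reuses two statistics blocks from the report above.

```python
import re

# Patterns for the two statistics lines shown above each alignment.
SCORE_RE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = (\S+)")
IDENT_RE = re.compile(r"Identities = (\d+)/(\d+) \((\d+)%\)")


def parse_hit_stats(text):
    """Return one dict per 'Score = ...' line found in the report text.

    Hypothetical helper: the key names are illustrative, not a PredictBias API.
    """
    hits = []
    for line in text.splitlines():
        m = SCORE_RE.search(line)
        if m:
            hits.append({
                "bits": float(m.group(1)),
                "raw_score": int(m.group(2)),
                "evalue": float(m.group(3)),
            })
            continue
        m = IDENT_RE.search(line)
        if m and hits:
            # Attach identity counts to the most recent score line.
            hits[-1]["identical"] = int(m.group(1))
            hits[-1]["aligned"] = int(m.group(2))
    return hits


sample = """Score = 145 bits (368), Expect = 2e-48
Identities = 76/134 (56%), Positives = 96/134 (71%), Gaps = 9/134 (6%)
Score = 28.3 bits (63), Expect = 0.040
Identities = 9/48 (18%), Positives = 20/48 (41%), Gaps = 7/48 (14%)
"""

hits = parse_hit_stats(sample)

# Assumed, stricter-than-default cutoff used only to illustrate filtering;
# the run itself reports hits under the E-value (RPSBlast) input parameter.
strict_cutoff = 1e-5
strong_hits = [h for h in hits if h["evalue"] < strict_cutoff]
```

Filtering on a stricter E-value than the one used for the run is one way to separate well-supported matches from short, low-scoring ones when reviewing a predicted island by hand.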



 
Contact Sachin Pundhir for Bugs/Comments.