PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: Salinicoccus.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
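
As a rough illustration of the E-value (RPSBlast) parameter, the sketch below keeps only hits at or below the 0.05 cutoff. The helper name `passes_cutoff` and the inclusive comparison are assumptions made for illustration; the server's actual filtering logic is not shown on this page. The three hits are copied from section C of this report.

```python
# Hypothetical sketch: apply the report's E-value cutoff to a list of hits.
E_VALUE_CUTOFF = 0.05  # "E-value (RPSBlast)" from the input parameters


def passes_cutoff(e_value, cutoff=E_VALUE_CUTOFF):
    """Return True when a hit is at or below the cutoff (inclusive by assumption)."""
    return e_value <= cutoff


# (locus tag, hit, E-value) triples taken from section C of this report.
hits = [
    ("AAT16_00050", "PF07520", 0.037),
    ("AAT16_00365", "ACRIFLAVINRP", 0.049),
    ("AAT16_00375", "DHBDHDRGNASE", 2e-35),
]

kept = [hit for hit in hits if passes_cutoff(hit[2])]
for locus_tag, name, e_value in kept:
    print(locus_tag, name, e_value)
```

Lowering the cutoff would drop the weakest matches first; the two hits near 0.04 to 0.05 sit just under the limit used here.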
 
C) Potential GIs and PAIs in CP011366
S.No  Start        End          Bias  Virulence  Insertion elements  Prediction
1     AAT16_00020  AAT16_00075  Y     Y          Y                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias  %GCBias    Product
AAT16_00020  4       41       -7.650148  hypothetical protein
AAT16_00025  5       39       -7.419593  hypothetical protein
AAT16_00030  5       38       -6.944674  hypothetical protein
AAT16_00050  5       36       -6.826010  transposase
AAT16_00055  5       42       -8.950132  mercuric reductase
AAT16_00060  -1      24       -5.405597  MerR family transcriptional regulator
AAT16_00065  -2      19       -3.056610  hypothetical protein
AAT16_00070  -2      17       -3.081304  hypothetical protein
AAT16_00075  -2      15       -3.561591  multicopper oxidase

ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_00050  PF07520       29     0.037    Virulence protein SrfB
		>PF07520#Virulence protein SrfB

Length = 1041

Score = 29.2 bits (65), Expect = 0.037
Identities = 12/74 (16%), Positives = 26/74 (35%)

Query: 310 HLRYSQWRHRCMSSNSKDAYKDLVRAVDNWHVEIFNYFDKRLTNAYTESINSIIRQVERM 369
W + + ++ DL V +W E+F F + + S ++ E
Sbjct: 184 DPGAMSWFLQRLEADEDGNAVDLQLWVSDWLKEMFLDFKRAERPGRSISEENLPHMFEHW 243

Query: 370 GRGYSFDALRAKIL 383
R S+ + + +
Sbjct: 244 ARYLSYLQVIQRAV 257
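
The percentages on the Identities/Positives lines are the matched count divided by the alignment length. The sketch below is a minimal illustration, assuming the summary-line layout used in this report and assuming the percentage is truncated to a whole number (this agrees with the values printed here). `parse_alignment_stats` is a hypothetical helper, not part of PredictBias or BLAST.

```python
import re


def parse_alignment_stats(line):
    """Parse 'Name = a/b (p%)' fields from a BLAST alignment summary line.

    Returns {name: (count, alignment_length, percent)}, where percent is
    recomputed from the counts using integer (truncating) division.
    """
    stats = {}
    for name, count, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        count, length = int(count), int(length)
        stats[name] = (count, length, 100 * count // length)
    return stats


# Summary line copied from the PF07520 alignment for AAT16_00050.
summary = "Identities = 12/74 (16%), Positives = 26/74 (35%)"
print(parse_alignment_stats(summary))
```

At 16% identity over 74 residues with an E-value of 0.037, this match is weak, so the "Virulence" flag for this island rests on limited evidence.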


2     AAT16_00195  AAT16_00230  Y     N          N                   Genomic Island

LocusTag     DNBias  CDNBias  %GCBias    Product
AAT16_00195  0       20       3.295947   hypothetical protein
AAT16_00200  0       20       2.904132   recombinase RecF
AAT16_00205  0       20       3.007973   DNA gyrase subunit B
AAT16_00210  -1      18       3.636375   DNA gyrase subunit A
AAT16_00215  0       16       3.850658   anti-terminator HutP
AAT16_00220  0       16       3.806338   histidine ammonia-lyase
AAT16_00225  -2      14       2.512509   hypothetical protein
AAT16_00230  -1      15       3.006050   histidine transporter

3     AAT16_00340  AAT16_00415  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias  %GCBias    Product
AAT16_00340  0       16       -5.461746  hypothetical protein
AAT16_00345  -2      26       -7.120409  hypothetical protein
AAT16_00350  2       32       -7.701469  metallohydrolase
AAT16_00355  4       38       -8.160200  hypothetical protein
AAT16_00360  5       38       -7.484211  50S rRNA methyltransferase
AAT16_00365  3       34       -5.029604  hypothetical protein
AAT16_00370  3       31       -3.759601  monooxygenase
AAT16_00375  3       28       -3.273606  3-ketoacyl-ACP reductase
AAT16_00385  3       26       -2.461064  GntR family transcriptional regulator
AAT16_00390  4       28       -1.742047  transporter
AAT16_00395  4       23       -0.616162  membrane protein
AAT16_00400  6       24       -1.845117  NAD-dependent epimerase
AAT16_00410  4       21       -0.890638  universal stress protein
AAT16_00415  3       20       -0.201581  hypothetical protein

ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_00365  ACRIFLAVINRP  27     0.049    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 26.7 bits (59), Expect = 0.049
Identities = 10/84 (11%), Positives = 25/84 (29%), Gaps = 12/84 (14%)

Query: 62 LLIGIAFVLLEVYFVKNKRMNKWILPSIILI--------ASIALSI---PFSVTFDASLI 110
+A + V+ W +P +++ +A ++ V F L+
Sbjct: 872 APALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLL 931

Query: 111 -LFQVVVLMGLYIEDLINHNQNRE 133
+ + I + +E
Sbjct: 932 TTIGLSAKNAILIVEFAKDLMEKE 955


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_00375  DHBDHDRGNASE  121    2e-35    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 121 bits (304), Expect = 2e-35
Identities = 70/257 (27%), Positives = 125/257 (48%), Gaps = 17/257 (6%)

Query: 3 RTVLVTGSGRGLGSYIVKALSEKGFNVI-INYNNSKEES-EKLKKEIGSQAIAIQADITD 60
+ +TG+ +G+G + + L+ +G ++ ++YN K E K A A AD+ D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 61 REAVEQLVKKGTEHFGQIDVVVNNALVNFKFDPTTQKAFKDLTYKDYEQQLDGTLKAAFN 120
A++++ + G ID++VN A V + L+ +++E FN
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGV-LRPGLIHS-----LSDEEWEATFSVNSTGVFN 122

Query: 121 VSQSVIPQFLERKDGAIISIGTNLYQNPVVPYHEYTTAKAALIGFTRNVAAELGQHGIRA 180
S+SV ++R+ G+I+++G+N P Y ++KAA + FT+ + EL ++ IR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 181 NVVSGGLLKTT---------DASAVTTPEVFDLIAQSTPLRKVTTPQDVANMVVYLCSEA 231
N+VS G +T + + + PL+K+ P D+A+ V++L S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 232 ADGITGQNITVDGGLTM 248
A IT N+ VDGG T+
Sbjct: 243 AGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_00400  NUCEPIMERASE  28     0.028    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 27.8 bits (62), Expect = 0.028
Identities = 13/30 (43%), Positives = 15/30 (50%)

Query: 1 MKVFVFGGNEGAGEHVLKKLAAKGHEAVTI 30
MK V G G HV K+L GH+ V I
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGI 30


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_00415  IGASERPTASE   31     0.004    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.2 bits (70), Expect = 0.004
Identities = 16/76 (21%), Positives = 26/76 (34%)

Query: 84 SNETPQNEVTETAQQEDAPQVTEESNEQTQQVAPNTEQQESAPQVTEETQQQPEQNTQQS 143
E P E + +Q T + + NT + P V E+ +P+ ++S
Sbjct: 1167 DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRS 1226

Query: 144 EDVQAAAPEQNTESSE 159
E T SS
Sbjct: 1227 VRSVPHNVEPATTSSN 1242



Score = 29.6 bits (66), Expect = 0.014
Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 7/87 (8%)

Query: 86 ETPQNEVTETAQQ----EDAPQVTEESNEQTQQVAPNTEQQESAPQVTEETQQQPEQNTQ 141
P T + E++ Q ++ + Q T Q +V +E + + NTQ
Sbjct: 1025 VPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNR---EVAKEAKSNVKANTQ 1081

Query: 142 QSEDVQAAAPEQNTESSEATGGSTKEQ 168
+E Q+ + + T+++E +T E+
Sbjct: 1082 TNEVAQSGSETKETQTTETKETATVEK 1108


4     AAT16_00865  AAT16_00940  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias  %GCBias    Product
AAT16_00865  -1      11       -3.208476  hypothetical protein
AAT16_00870  -2      12       -2.511009  malate:quinone oxidoreductase
AAT16_00875  -1      16       -3.743015  hemin ABC transporter ATP-binding protein
AAT16_00880  -2      16       -3.584615  hemin ABC transporter permease
AAT16_00890  -1      17       -3.341213  PadR family transcriptional regulator
AAT16_00895  -1      15       -2.786787  hypothetical protein
AAT16_00900  -1      17       -2.453462  hypothetical protein
AAT16_00905  1       20       -3.428737  ferritin
AAT16_00910  1       19       -3.141622  hypothetical protein
AAT16_00915  0       19       -3.381417  amino acid ABC transporter ATP-binding protein
AAT16_00920  0       19       -3.304653  ABC transporter permease
AAT16_00925  0       21       -3.863840  monooxygenase
AAT16_00930  2       22       -3.738432  FMN reductase
AAT16_00935  1       21       -3.768056  cystathionine gamma-synthase
AAT16_00940  0       21       -3.152786  cystathionine gamma-synthase

ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_00875  PF05272       30     0.008    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.008
Identities = 18/93 (19%), Positives = 33/93 (35%), Gaps = 18/93 (19%)

Query: 35 VILKGASGSGKTTLLSIIGGLLGRSGGEVSL--NGENYLDIKEKA------LTSMRLKEI 86
V+L+G G GK+TL++ + GL S + ++Y I +T+ R +
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRADA 658

Query: 87 ----GFIFQSSHLI--PYMKVID----QLTFIG 109
F Y + + Q+
Sbjct: 659 EAVKAFFSSRKDRYRGAYGRYVQDHPRQVVIWC 691


5     AAT16_01005  AAT16_01115  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias  %GCBias    Product
AAT16_01005  3       25       -4.401833  hypothetical protein
AAT16_01010  3       25       -4.834381  membrane protein
AAT16_01015  2       25       -4.789104  hypothetical protein
AAT16_01020  3       22       -3.474624  multidrug resistance protein SMR
AAT16_01025  2       18       -2.103084  transporter
AAT16_01035  2       18       -2.462507  membrane protein
AAT16_01040  1       16       -2.195626  general stress protein
AAT16_01045  1       16       -2.104341  sodium:proton antiporter
AAT16_01050  0       17       -2.180718  multidrug transporter MatE
AAT16_01055  1       18       -2.762721  oxidoreductase
AAT16_01060  -2      18       -4.231621  aldehyde dehydrogenase
AAT16_01065  -1      17       -1.835333  hypothetical protein
AAT16_01070  -2      12       -0.694209  hypothetical protein
AAT16_01075  -1      13       -0.803036  hypothetical protein
AAT16_01080  -1      13       -0.803962  autolysin
AAT16_01085  -1      13       -0.642474  formate dehydrogenase
AAT16_01090  0       13       -1.370107  MFS transporter
AAT16_01095  -1      14       -2.651521  formate dehydrogenase
AAT16_01100  0       18       -4.068931  transporter
AAT16_01105  -1      16       -3.908774  hypothetical protein
AAT16_01110  -2      14       -3.324169  glutamine synthetase
AAT16_01115  -2      15       -3.665532  hypothetical protein

ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_01005  adhesinb      30     0.009    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 29.8 bits (67), Expect = 0.009
Identities = 29/172 (16%), Positives = 62/172 (36%), Gaps = 17/172 (9%)

Query: 62 ESGPDEWWNNVVESYEMLKDKGYEKISIGGVSLGGILSLKAAYSLEDINSVVAMSVPQG- 120
E+G + W+ +VE+ + ++K Y +S G ++ L+ + +++ G
Sbjct: 94 ETGGNAWFTKLVENAKKKENKDYYAVSEG----VDVIYLEGQSEKGKEDPHAWLNLENGI 149

Query: 121 KDIEDLNKRVVSYIENFMEFVGRSDEEIDEKLKELDEKPMASLPDFEALIDEIHSRLGDI 180
+++ KR+ E ++ + EKL LD++ + I +
Sbjct: 150 IYAQNIAKRLSEKDPANKETYEKNLKAYVEKLSALDKEAKEKFNNIPGEKKMI------V 203

Query: 181 SVPLAVKYGGKDAALYEESADHIYEEVASEAKDMKVYPNTGHLMTKGKDKKL 232
+ KY K + I E +K L+ K + K+
Sbjct: 204 TSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIK------TLVEKLRKTKV 249


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_01055  DHBDHDRGNASE  107    7e-30    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 107 bits (269), Expect = 7e-30
Identities = 76/261 (29%), Positives = 118/261 (45%), Gaps = 15/261 (5%)

Query: 42 VGSEKLKNRKALVTGGDSGIGRAAAIAYAKEGADVAISYLPDEGSDAQEVKAVIEKA-GQ 100
+ ++ ++ + A +TG GIG A A A +GA +A D + E KA +
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAV---DYNPEKLEKVVSSLKAEAR 57

Query: 101 KAVLLPGDLRDERFARELVHEAAEKLGGLDILVLNAAIQQFEKDIKNLSTEQLTDTFTVN 160
A P D+RD E+ ++G +DILV A + + I +LS E+ TF+VN
Sbjct: 58 HAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGL-IHSLSDEEWEATFSVN 116

Query: 161 IFSNVWMLQEALDHLP--EGGSVVVTTSVQAFQPSGHLSDYAMTKSSQVAFVLAMTQQLA 218
+ ++ GS+V S A P ++ YA +K++ V F + +LA
Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176

Query: 219 EKGIRINAVSPGPVWTVLQVA-----GGQPQE---SIPEFGQKEPLKRAGQPVELADTYV 270
E IR N VSPG T +Q + G Q S+ F PLK+ +P ++AD +
Sbjct: 177 EYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVL 236

Query: 271 LLASDSASYITGQVYGITGGT 291
L S A +IT + GG
Sbjct: 237 FLVSGQAGHITMHNLCVDGGA 257


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_01090  TCRTETA       48     4e-08    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 47.9 bits (114), Expect = 4e-08
Identities = 67/347 (19%), Positives = 125/347 (36%), Gaps = 24/347 (6%)

Query: 60 AAFMGHFVEAKGPRISGLVSTLFFASGMAVAGLAVQLESLILLYFGYGVLGGIGLGIGY- 118
A +G + G R LVS A A+ A L L + G+ G G G
Sbjct: 60 APVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAY 119

Query: 119 ---ITPVSTLVKWFPDRRGMATGLAIMGFGFAAMLASPAMEWLIVNVSIAGTFYILAVIY 175
IT + F G + A GFG M+A P + L+ S F+ A +
Sbjct: 120 IADITDGDERARHF----GFMS--ACFGFG---MVAGPVLGGLMGGFSPHAPFFAAAALN 170

Query: 176 FVVMIASSLYLERPPEGYEPEGMNLDEKVTAKKDIVQLTANEAVRTRRFYFLWSMLFLNV 235
+ + L PE ++ E L + A + + L ++ F+
Sbjct: 171 GLNFLTGCFLL---PESHKGERRPLRRE--ALNPLASFRWARGMTV--VAALMAVFFIMQ 223

Query: 236 TCGIAILAVASPMAQEIAGLSAGAAAVMVGIMGVFNGGGRLVWAS-ISDYIGRPNLYSLF 294
G A+ ++ A + + G+ + + + ++ +G L
Sbjct: 224 LVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLG 283

Query: 295 FIIQIALFLLLPSVSHALVFQAMLFVIISCYGGGFSAIPAYIGDIFGTKQLGAIHGYILT 354
I ++LL + + + V+++ G G A+ A + ++ G + G +
Sbjct: 284 MIADGTGYILLAFATRGWMA-FPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAA 342

Query: 355 AWAAAGLVGPFISSTVYEAT-QSYTLTLYIFGALFIAALAISILIRG 400
+ +VGP + + +Y A+ ++ +I GA L + L RG
Sbjct: 343 LTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALY-LLCLPALRRG 388


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_01095  NUCEPIMERASE  30     0.041    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.8 bits (67), Expect = 0.041
Identities = 11/33 (33%), Positives = 17/33 (51%)

Query: 698 IQEGEPIVIYNVNGVFQGFARIADIKAGNIGIQ 730
+ EG+ I +YN + + F I DI I +Q
Sbjct: 199 MLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_01100  FIMREGULATRY  31     0.002    Escherichia coli: P pili regulatory PapB protein si...
		>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein signature.

Length = 104

Score = 31.4 bits (71), Expect = 0.002
Identities = 16/39 (41%), Positives = 23/39 (58%)

Query: 181 LFLIVIIRSVTLPGAMEGIKFFLTPDFSLISSEGILYAL 219
FL+ I SV LPG+M + FFL S I S+ ++ A+
Sbjct: 13 AFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAM 51


6     AAT16_02355  AAT16_02410  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias  %GCBias    Product
AAT16_02355  2       16       -1.267798  hypothetical protein
AAT16_02360  0       17       0.875490   hypothetical protein
AAT16_02365  -1      17       1.384544   hypothetical protein
AAT16_02370  -1      16       2.544728   lysine decarboxylase
AAT16_02375  -1      17       2.879732   hypothetical protein
AAT16_02380  -1      17       3.998620   hypothetical protein
AAT16_02385  -2      18       5.392183   histidine kinase
AAT16_02390  -2      19       5.119393   chemotaxis protein CheY
AAT16_02395  0       21       5.423251   hypothetical protein
AAT16_02400  -2      20       4.836887   molybdenum cofactor biosynthesis protein B
AAT16_02405  -2      19       4.713143   molybdenum cofactor biosynthesis protein C
AAT16_02410  -2      19       4.065129   molybdopterin molybdenumtransferase

ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_02385  PF06580       36     4e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.0 bits (83), Expect = 4e-04
Identities = 18/103 (17%), Positives = 39/103 (37%), Gaps = 21/103 (20%)

Query: 486 IILNLIANGINYTHEGGTIEVSLRENIYEIRLIVTDDGIGIPEESLGRIFERFYRVDKAR 545
++ N I +GI +GG I + ++ + L V + G + +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNT--------------- 307

Query: 546 SRHSGGTGLGLAIVKHLIESHKG---RIEIESAEDEGTTITVI 585
TG GL V+ ++ G +I++ + + + +I
Sbjct: 308 ---KESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_02390  HTHFIS        102    2e-27    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 102 bits (256), Expect = 2e-27
Identities = 35/138 (25%), Positives = 70/138 (50%), Gaps = 2/138 (1%)

Query: 3 SILVVDDEPSIVTLLKFNLEQSGYSVLTAEDGNTGLDLALTEQPDLIVLDLMLPGMDGMD 62
+ILV DD+ +I T+L L ++GY V + T DL+V D+++P + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 VCKTLRQEKMNTPILMLTAKDEEFDKILGLELGADDYMTKPFSPREVVARVKAIL--RRS 120
+ +++ + + P+L+++A++ I E GA DY+ KPF E++ + L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 121 QVEALAEKAAEEVFSIGD 138
+ L + + + + +G
Sbjct: 125 RPSKLEDDSQDGMPLVGR 142


7     AAT16_02590  AAT16_02785  Y     Y          Y                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias  %GCBias    Product
AAT16_02590  2       25       2.798961   hypothetical protein
AAT16_02595  2       25       3.368881   bacillithiol system protein YtxJ
AAT16_02600  3       27       4.435544   hypothetical protein
AAT16_02605  3       26       5.095872   LytR family transcriptional regulator
AAT16_02610  1       28       4.423052   UDP-phosphate N-acetylglucosaminyl 1-phosphate
AAT16_02615  0       25       4.077247   ABC transporter
AAT16_02620  1       24       3.805830   DegV domain-containing protein
AAT16_02625  0       24       3.969408   hypothetical protein
AAT16_02630  0       20       3.615119   hypothetical protein
AAT16_02635  1       19       3.619393   sigma-54 modulation protein
AAT16_02640  1       18       3.981103   preprotein translocase subunit SecA
AAT16_02645  1       17       4.098116   peptide chain release factor 2
AAT16_02650  1       19       3.678818   hypothetical protein
AAT16_02655  0       19       3.835442   hypothetical protein
AAT16_02660  1       23       4.861557   hypothetical protein
AAT16_02665  2       27       4.653529   hypothetical protein
AAT16_02670  2       27       4.881060   hydrolase
AAT16_02675  2       28       5.405634   protein CsbA
AAT16_02680  2       28       5.447840   excinuclease ABC subunit B
AAT16_02685  1       26       5.240555   excinuclease ABC subunit A
AAT16_02690  0       24       4.183159   hypothetical protein
AAT16_02695  1       24       4.481779   serine kinase
AAT16_02700  2       24       3.746593   diacylglyceryl transferase
AAT16_02705  1       18       2.715199   acetyltransferase
AAT16_02715  1       19       2.750882   thioredoxin reductase
AAT16_02720  -1      17       2.808702   nucleotide-binding protein
AAT16_02725  -1      15       2.548575   hypothetical protein
AAT16_02730  -2      12       1.493697   sporulation regulator WhiA
AAT16_02735  -1      12       2.032771   hypothetical protein
AAT16_02740  2       16       1.953336   Clp protease
AAT16_02750  1       15       2.513255   *hypothetical protein
AAT16_02755  2       16       2.648327   membrane protein
AAT16_02760  2       17       3.228130   hypothetical protein
AAT16_02765  3       22       3.275909   hypothetical protein
AAT16_02770  3       26       3.079785   glyceraldehyde-3-phosphate dehydrogenase
AAT16_02775  2       21       3.408885   phosphoglycerate kinase
AAT16_02780  2       19       3.159497   triosephosphate isomerase
AAT16_02785  2       19       2.836886   phosphoglyceromutase

ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_02600  MICOLLPTASE   32     0.003    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 32.4 bits (73), Expect = 0.003
Identities = 17/92 (18%), Positives = 42/92 (45%), Gaps = 15/92 (16%)

Query: 170 MYE----DINTILKLLKRYEEDEYKDYLTQLGNIKSFDEE----VNILIDESESLSLLLI 221
MY N + +K + YKDY+ + + +++ ++ L++ ++L + L+
Sbjct: 593 MYNNNMGMFNKMTNYIKNNDVSGYKDYIASMSSDYGLNDKYQDYMDSLLNNIDNLDVPLV 652

Query: 222 DIDNFKVVNDEHSYKSGDAL---IKQMANLLD 250
+ + H K + + IK+++N+ D
Sbjct: 653 SDEYV----NGHEAKDINEITNDIKEVSNIKD 680


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_02640  SECA          1082   0.0      SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 1082 bits (2799), Expect = 0.0
Identities = 412/904 (45%), Positives = 568/904 (62%), Gaps = 71/904 (7%)

Query: 1 MGILDKVF-DGNKRELRSLRKIAEKVEDYKETMAGLDDASLQGKTDEFKEMLAGAEDDKA 59
+ +L KVF N R LR +RK+ + + M L D L+GKT EF+ L E
Sbjct: 3 IKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEV--- 59

Query: 60 EEKMLDQILPEAFAVVREASKRTLGLEPYPVQIMGGAALHKGDISEMKTGEGKTLTATMP 119
L+ ++PEAFAVVREASKR G+ + VQ++GG L++ I+EM+TGEGKTLTAT+P
Sbjct: 60 ----LENLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLP 115

Query: 120 VYLNALTGKGVHVITVNEYLSATQMEEMSVLYNFLKLTVGLNLNAKNSEEKREAYAADIT 179
YLNALTGKGVHV+TVN+YL+ E L+ FL LTVG+NL + KREAYAADIT
Sbjct: 116 AYLNALTGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADIT 175

Query: 180 YTTNNELGFDYLRDNMVTYKKDRVLRGLNYAIIDEVDSILIDEARTPLIISGRANQTNTQ 239
Y TNNE GFDYLRDNM ++RV R L+YA++DEVDSILIDEARTPLIISG A ++
Sbjct: 176 YGTNNEYGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEM 235

Query: 240 YIQANQFVKMLKE-----------DEDFTYDIKTKNIQLNDDGMEKAEKWF-------KV 281
Y + N+ + L + F+ D K++ + L + G+ E+ +
Sbjct: 236 YKRVNKIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEG 295

Query: 282 DNLYDVKHVNLLHHINQALKAHFSMQRDTDYVVEEDKIVIVDQFTGRKMKGRRFSDGLHQ 341
++LY ++ L+HH+ AL+AH RD DY+V++ +++IVD+ TGR M+GRR+SDGLHQ
Sbjct: 296 ESLYSPANIMLMHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQ 355

Query: 342 AIEAKEGVEIQNESRTMASITFQNFFRQYNKLSGMTGTAKTEEEEFINIYNMKVTVIPTN 401
A+EAKEGV+IQNE++T+ASITFQN+FR Y KL+GMTGTA TE EF +IY + V+PTN
Sbjct: 356 AVEAKEGVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTN 415

Query: 402 LPIAREDRTDKIYSTKDIKFKNVVDEVVERHRNGQPVLIGTVAVETSEYIANLLSKKGIR 461
P+ R+D D +Y T+ K + +++++ ER GQPVL+GT+++E SE ++N L+K GI+
Sbjct: 416 RPMIRKDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIK 475

Query: 462 HNVLNAKNHEREADIIMSAGKKGAVTIATNMAGRGTDIKLG------------------- 502
HNVLNAK H EA I+ AG AVTIATNMAGRGTDI LG
Sbjct: 476 HNVLNAKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIE 535

Query: 503 ----------EGVKEAGGLAVIGTERHESRRIDDQLRGRAGRQGDVGVSTFYLSLEDDLM 552
+ V EAGGL +IGTERHESRRID+QLRGR+GRQGD G S FYLS+ED LM
Sbjct: 536 KIKADWQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALM 595

Query: 553 RRFGSERMQGMMGRLGMQEEE-ITSKMISKAVESSQKRVEGNNFDSRKKLLEYDDVLRRQ 611
R F S+R+ GMM +LGM+ E I ++KA+ ++Q++VE NFD RK+LLEYDDV Q
Sbjct: 596 RIFASDRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQ 655

Query: 612 REIIYDERNDIIDQDDVRDQLMGMIEASVERTVNYYILDDD--ELIDYDQFIKTIEDMYL 669
R IY +RN+++D DV + + + E + T++ YI E+ D + +++ +
Sbjct: 656 RRAIYSQRNELLDVSDVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFD 715

Query: 670 SDESIE--VPDVRGRENDEIIALILEKVNAELERKEEKLTSEKMRLFERMMMLRTIDQKW 727
D I + + + IL + +RKEE + +E MR FE+ +ML+T+D W
Sbjct: 716 LDLPIAEWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLW 775

Query: 728 VEHIDSMDQLRTGIHLRSYGQINPLREYQNEGLQMFEDMLVAIEDDTAKYVLKTELKSDE 787
EH+ +MD LR GIHLR Y Q +P +EY+ E MF ML +++ + + K +++ E
Sbjct: 776 KEHLAAMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPE 835

Query: 788 EI---------KREQVIKQNEMQTGDGKEKVKKGPVKK--EIKVGRNDPCPCGSGKKYKN 836
E+ + E++ + ++ D + E KVGRNDPCPCGSGKKYK
Sbjct: 836 EVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCGSGKKYKQ 895

Query: 837 CHGQ 840
CHG+
Sbjct: 896 CHGR 899


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_02660  IGASERPTASE   41     8e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.8 bits (95), Expect = 8e-06
Identities = 25/116 (21%), Positives = 41/116 (35%), Gaps = 19/116 (16%)

Query: 141 VEEAPQTEEAPAAE--EAPQAEEETEDQNTAQAAEVQE------APAVVEEDNSADEQAA 192
VE+ QT + QA+ + N + A V E APA E + +
Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENS 1044

Query: 193 EQQAAEQQAAEQRAAEQERIEQREAEQA-----------EAEQEKQEAQAAAPQQT 237
+Q++ + EQ A E + A++A E Q E + +T
Sbjct: 1045 KQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTET 1100



Score = 39.7 bits (92), Expect = 2e-05
Identities = 30/128 (23%), Positives = 45/128 (35%), Gaps = 10/128 (7%)

Query: 137 AAASVEEAPQTEEAPA--AEEAPQAEEETEDQNTAQAAEVQEAPAVVEEDNSADEQAAEQ 194
A V+EAP APA +E E ++ ++ Q+A ++ ++A
Sbjct: 1016 EIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSN 1075

Query: 195 QAAEQQAAEQRAAEQERIE-QREAEQAEAEQEKQEAQAA-------APQQTSNVSGGNAV 246
A Q E + E E Q + A EK+E P+ TS VS
Sbjct: 1076 VKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQ 1135

Query: 247 SVAQSVAA 254
S A
Sbjct: 1136 SETVQPQA 1143



Score = 37.7 bits (87), Expect = 8e-05
Identities = 24/136 (17%), Positives = 48/136 (35%), Gaps = 11/136 (8%)

Query: 124 AGKTLIVSADAAPAAASVEEAPQTEEAPAAEEAPQAEEETEDQNTAQAAEVQEAPAVVEE 183
A + + A S E +T+ E A +EE + + + QE P V +
Sbjct: 1072 AKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEE---KAKVETEKTQEVPKVTSQ 1128

Query: 184 DNSADEQAAEQQAAEQQAAEQRAAEQERIEQREAEQAEAEQEKQEAQAAAPQQTSNVSGG 243
+ EQ+ Q + A E ++ +++ P + ++ +
Sbjct: 1129 VSPKQEQSETVQPQAEPARENDPTVN-------IKEPQSQTNTTADT-EQPAKETSSNVE 1180

Query: 244 NAVSVAQSVAAGKSYV 259
V+ + +V G S V
Sbjct: 1181 QPVTESTTVNTGNSVV 1196



Score = 30.4 bits (68), Expect = 0.015
Identities = 16/125 (12%), Positives = 33/125 (26%), Gaps = 4/125 (3%)

Query: 130 VSADAAPAAASVEEAPQTEEAPAAEEAPQAEEETEDQNTAQAAEVQEA----PAVVEEDN 185
V++ +P E E + +E + Q A Q A V +
Sbjct: 1125 VTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVT 1184

Query: 186 SADEQAAEQQAAEQQAAEQRAAEQERIEQREAEQAEAEQEKQEAQAAAPQQTSNVSGGNA 245
+ E A Q + + + + + + + S +
Sbjct: 1185 ESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDR 1244

Query: 246 VSVAQ 250
+VA
Sbjct: 1245 STVAL 1249


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_02695  NUCEPIMERASE  28     0.042    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 28.2 bits (63), Expect = 0.042
Identities = 10/31 (32%), Positives = 18/31 (58%)

Query: 148 VLITGESGVGKSETALELVKNGHRLVADDNV 178
L+TG +G + L++ GH++V DN+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNL 33


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_02735  NUCEPIMERASE  37     5e-05    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 37.5 bits (87), Expect = 5e-05
Identities = 11/27 (40%), Positives = 15/27 (55%)

Query: 1 MNVLITGGTGFIGGKLAEILKEEHDHV 27
M L+TG GFIG +++ L E V
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQV 27


8     AAT16_02865  AAT16_02935  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias  %GCBias    Product
AAT16_02865  -1      18       -4.498752  methionine ABC transporter substrate-binding
AAT16_02870  -1      21       -5.847612  DNA repair protein
AAT16_02875  -1      22       -6.567768  hypothetical protein
AAT16_02880  1       24       -6.312744  pilus assembly protein
AAT16_02885  1       27       -7.227213  pilus assembly protein TadB
AAT16_02890  2       29       -7.266178  hypothetical protein
AAT16_02895  3       31       -6.940988  hypothetical protein
AAT16_02900  2       29       -6.392615  hypothetical protein
AAT16_02905  2       31       -6.947703  hypothetical protein
AAT16_02910  2       28       -6.631804  hypothetical protein
AAT16_02915  2       24       -5.730042  hypothetical protein
AAT16_02920  0       22       -6.149559  hypothetical protein
AAT16_02925  0       19       -5.229117  hypothetical protein
AAT16_02930  -1      18       -5.424880  hypothetical protein
AAT16_02935  -1      15       -4.413686  hypothetical protein

ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_02905  GPOSANCHOR    30     0.027    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.4 bits (68), Expect = 0.027
Identities = 15/145 (10%), Positives = 44/145 (30%), Gaps = 1/145 (0%)

Query: 135 ETMKYQAPIQMAGVFADLLKKADGTVDPSEVEDMEETQEFLEKFEEVMELVKKRNAELKK 194
+ +A L+KA D + + + + L+
Sbjct: 177 KIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEG 236

Query: 195 ADEEVKSLKEDITSRYNSKIVGKESSSEKIPESIKSNMADLSKYYPRYLELKNKHEEEDS 254
A + I ++ E+ ++ ++++ M + + L+ + ++
Sbjct: 237 AMNFSTADSAKI-KTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEA 295

Query: 255 EDEESEEELSDEEKEDKEKKRDDEE 279
E + E + + +RD +
Sbjct: 296 EKADLEHQSQVLNANRQSLRRDLDA 320


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_02930  PREPILNPTASE  45     3e-08    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 45.2 bits (107), Expect = 3e-08
Identities = 26/119 (21%), Positives = 49/119 (41%), Gaps = 8/119 (6%)

Query: 5 FIILGIFLCIVFYYDAIKQIIPNWLNVSGAVVGVGYHSLSAGVDGFIQSFGGGLVCGIIL 64
++L L + + D K ++P+ L + G+ ++ L G + G + ++L
Sbjct: 137 ALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFN-LLGGFVSLGDAVIGAMAGYLVL 195

Query: 65 LVLY-VFK------AIGAGDVKLFFAIGTITGILFGLYSIMYSIICAGIIGLLYLLFTR 116
LY FK +G GD KL A+G G ++ S + +G+ +L
Sbjct: 196 WSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILLRN 254


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_02935  SECYTRNLCASE  35     6e-04    Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 35.1 bits (81), Expect = 6e-04
Identities = 21/77 (27%), Positives = 33/77 (42%), Gaps = 9/77 (11%)

Query: 190 IKNSGVSIKGDMQYGDTDKYKIKKIKTMPPLNRKEKLYSSLIAIV--LLAVIWSQMPLSS 247
+K G I G T +Y + + + + LY LIA+V + V + S
Sbjct: 347 MKKYGGFIPGIRAGRPTAEY-LSYV--LNRITWPGSLYLGLIALVPTMALVGF---GASQ 400

Query: 248 NILLG-TAFLLSVGVAV 263
N G T+ L+ VGV +
Sbjct: 401 NFPFGGTSILIIVGVGL 417


9     AAT16_02985  AAT16_04220  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias  %GCBias    Product
AAT16_02985  0       21       4.157587   protoheme IX farnesyltransferase
AAT16_02990  1       22       4.558883   hypothetical protein
AAT16_02995  1       21       5.393283   serine ammonia-lyase
AAT16_03000  0       19       5.536819   L-alanoyl-D-glutamate peptidase
AAT16_03005  1       21       5.386547   alanine racemase
AAT16_03010  0       18       5.173321   pyridoxal-5'-phosphate-dependent protein subunit
AAT16_03015  0       20       4.313942   hypothetical protein
AAT16_03020  1       20       4.590467   1,4-dihydroxy-2-naphthoate prenyltransferase
AAT16_03025  -1      22       3.330706   hypothetical protein
AAT16_03030  -1      20       2.957153   thioredoxin
AAT16_03035  0       23       3.834692   hypothetical protein
AAT16_03040  1       23       4.502428   membrane protein
AAT16_03045  0       22       4.097773   amino acid ABC transporter ATP-binding protein
AAT16_03050  -1      19       3.576387   hypothetical protein
AAT16_03055  0       21       3.933013   membrane protein
AAT16_03060  -1      19       3.431531   5'-nucleotidase
AAT16_03065  -1      18       3.586786   sodium:proton antiporter
AAT16_03070  -1      15       2.526800   lipoyl synthase
AAT16_03075  -1      15       3.267117   hypothetical protein
AAT16_03080  1       18       3.577122   cytosolic protein
AAT16_03085  0       17       3.509904   hypothetical protein
AAT16_03090  2       19       4.325110   HAD family hydrolase
AAT16_03095  3       19       3.327800   2-ketogluconate reductase
AAT16_03100  2       22       2.904850   hypothetical protein
AAT16_03105  2       23       2.394755   disulfide oxidoreductase
AAT16_03110  2       22       2.080041   nitrogen fixation protein NifU
AAT16_03115  1       18       2.910501   NADH dehydrogenase
AAT16_03120  -1      15       2.636806   hypothetical protein
AAT16_03125  -1      15       3.185390   hypothetical protein
AAT16_03130  0       16       3.276968   NADH dehydrogenase
AAT16_03135  0       17       3.141202   membrane protein
AAT16_03140  1       20       3.391225   hypothetical protein
AAT16_03145  3       23       2.714485   thioesterase
AAT16_03150  2       23       2.230504   cation:proton antiporter
AAT16_03155  2       20       2.097752   monovalent cation/H+ antiporter subunit F
AAT16_03160  2       21       2.197672   monovalent cation/H+ antiporter subunit E
AAT16_03165  1       21       2.066589   monovalent cation/H+ antiporter subunit D
AAT16_03170  0       18       1.538973   monovalent cation/H+ antiporter subunit C
AAT16_03175  -2      15       0.906872   monovalent cation/H+ antiporter subunit B
AAT16_03180  -2      19       1.944860   cation:proton antiporter
AAT16_03185  0       24       2.515854   peptidylprolyl isomerase
AAT16_03190  -1      23       2.933817   hypothetical protein
AAT16_03195  -1      24       3.925515   hypothetical protein
AAT16_03200  0       24       4.377786   general stress protein
AAT16_03205  0       26       5.473134   1-pyrroline-5-carboxylate dehydrogenase
AAT16_03210  0       25       5.845007   ornithine--oxo-acid aminotransferase
AAT16_03215  1       26       6.815122   glutamate dehydrogenase
AAT16_03220  0       28       7.845521   glutamate dehydrogenase
AAT16_03225  1       30       7.541063   anthranilate phosphoribosyltransferase
AAT16_03230  0       30       7.307446   indole-3-glycerol phosphate synthase
AAT16_03235  0       29       6.895594   N-(5'-phosphoribosyl)anthranilate isomerase
AAT16_03240  1       34       6.907214   tryptophan synthase subunit beta
AAT16_03245  2       37       6.668412   tryptophan synthase subunit alpha
AAT16_03250  2       36       5.941926   glucose-6-phosphate isomerase
AAT16_03255  3       37       6.033656   hypothetical protein
AAT16_03260  3       36       6.186626   signal peptidase IB
AAT16_03265  3       35       6.150944   hypothetical protein
AAT16_03270  2       27       6.898004   hypothetical protein
AAT16_03275  3       24       7.246070   hypothetical protein
AAT16_03280  1       25       7.789619   hypothetical protein
AAT16_03285  1       25       7.409861   hypothetical protein
AAT16_03290  1       25       7.639539   hypothetical protein
AAT16_03295  0       28       7.494239   hypothetical protein
AAT16_03300  1       30       6.297964   hypothetical protein
AAT16_03305  0       27       5.664310   hypothetical protein
AAT16_03310  1       26       5.772637   phosphatase
AAT16_03315  1       26       5.871327   DNA methyltransferase
AAT16_03320  1       26       5.876291   UDP pyrophosphate phosphatase
AAT16_03325  1       22       5.936147   chaperone protein ClpB
AAT16_03330  1       22       6.204025   transporter
AAT16_03335  0       25       6.920344   arginine ABC transporter ATP-binding protein
AAT16_03340  -1      24       7.043146   amino acid ABC transporter permease
AAT16_03345  -1      25       7.270923   amino acid ABC transporter permease
AAT16_03350  0       26       8.390159   amino acid ABC transporter substrate-binding
AAT16_03355  0       31       8.666498   glycerol-3-phosphate dehydrogenase
AAT16_03360  -1      27       8.260322   glycerol kinase
AAT16_03365  0       24       7.498853   glycerol transporter
AAT16_03370  2       26       7.511798   glycerol-3-phosphate responsive antiterminator
AAT16_03375  2       27       7.660125   hypothetical protein
AAT16_03380  2       24       5.476267   hypothetical protein
AAT16_03385  2       22       5.275119   peptide ABC transporter ATPase
AAT16_03390  2       23       5.417638   peptide ABC transporter substrate-binding
AAT16_03395  1       24       5.487608   peptide ABC transporter permease
AAT16_03400  1       21       4.649732   peptide ABC transporter permease
AAT16_03405  0       20       3.398288   ABC transporter substrate-binding protein
AAT16_03410  0       17       3.207515   3-oxoacyl-ACP synthase
AAT16_03415  0       19       3.314112   3-oxoacyl-ACP synthase
AAT16_03420  1       20       2.527592   tryptophanyl-tRNA synthetase
AAT16_03425  1       19       1.922606   ArsR family transcriptional regulator
AAT16_03430  1       23       2.774485   hypothetical protein
AAT16_03435  1       25       3.314264   hypothetical protein
AAT16_03440  3       29       3.548846   oligopeptidase PepB
AAT16_03445  4       33       4.778413   hypothetical protein
AAT16_03450  6       35       5.567103   globin
AAT16_03455  6       40       6.641459   hypothetical protein
AAT16_03460  7       39       6.970586   hypothetical protein
AAT16_03465  8       40       7.622041   GTP pyrophosphokinase
AAT16_03470  5       30       5.096227   inorganic polyphosphate kinase
AAT16_03475  4       26       3.955339   magnesium transporter MgtE
AAT16_03480  2       25       3.182257   sodium:proton antiporter
AAT16_03485  1       18       2.616315   enoyl-ACP reductase
AAT16_03490  1       21       2.841025   hypothetical protein
AAT16_03495  -1      18       3.278800   hypothetical protein
AAT16_03500  0       25       5.197123   hypothetical protein
AAT16_03505  1       25       5.076283   hypothetical protein
AAT16_03510  0       24       5.551710   sodium:alanine symporter
AAT16_03515  1       24       4.499048   lipoprotein
AAT16_03520  -1      21       4.462559   ABC transporter
AAT16_03525  2       28       5.501544   ABC transporter ATP-binding protein
AAT16_03530  1       30       5.938254   hypothetical protein
AAT16_03535  2       34       7.285328   hypothetical protein
AAT16_03540  1       30       6.088447   hypothetical protein
AAT16_03545  1       32       6.578994   dehydrogenase
AAT16_03550  1       33       6.805249   GMC family oxidoreductase
AAT16_03555  0       29       6.226125   preprotein translocase subunit TatC
AAT16_03560  0       30       6.212231   UTP--glucose-1-phosphate uridylyltransferase
AAT16_03565  1       33       6.974981   hypothetical protein
AAT16_03570  4       43       8.796901   preprotein translocase subunit TatC
AAT16_03575  5       43       9.138911   GMC family oxidoreductase
AAT16_03580  6       50       9.479455   hypothetical protein
AAT16_03585  6       50       9.353673   hypothetical protein
AAT16_03590  7       50       9.310124   hypothetical protein
AAT16_03595  6       49       8.829592   UDP-N-acetylmuramoylalanyl-D-glutamate--L-lysine
AAT16_03600  4       40       6.867571   hypothetical protein
AAT16_03605  3       36       6.071011   peptide chain release factor 3
AAT16_03610  2       29       4.441939   membrane protein
AAT16_03620  2       29       4.419326   ATP synthase
AAT16_03625  1       25       3.769666   hypothetical protein
AAT16_03630  2       24       4.464292   hypothetical protein
AAT16_03635  0       28       5.161426   hypothetical protein
AAT16_03640  1       29       5.517574   hypothetical protein
AAT16_036451296.163662hypothetical protein
AAT16_036500275.223593hypothetical protein
AAT16_03655-1234.124810hypothetical protein
AAT16_036601201.097390hypothetical protein
AAT16_036650241.723681hypothetical protein
AAT16_036700262.142008hypothetical protein
AAT16_036750282.709712hypothetical protein
AAT16_036801334.079681RelE/StbE family addiction module toxin
AAT16_036901364.489356teichoic acid ABC transporter permease
AAT16_036951353.866001hypothetical protein
AAT16_037002333.595583hypothetical protein
AAT16_037052343.619393hypothetical protein
AAT16_037153343.914513teichoic acid ABC transporter ATP-binding
AAT16_037203313.269598hypothetical protein
AAT16_037251160.430568hypothetical protein
AAT16_03730015-1.544394hypothetical protein
AAT16_03735017-2.901600hypothetical protein
AAT16_03740120-4.334706alpha/beta hydrolase
AAT16_03745126-7.458009hypothetical protein
AAT16_03750231-9.129643hypothetical protein
AAT16_03755131-10.182733teichoic acid ABC transporter permease
AAT16_03760129-9.969689teichoic acid ABC transporter ATP-binding
AAT16_03765026-9.098799group 1 family glycosyl transferase
AAT16_03770125-8.620885hypothetical protein
AAT16_03775121-7.727568hypothetical protein
AAT16_03780020-7.440765hypothetical protein
AAT16_03785024-8.734665hypothetical protein
AAT16_03790129-10.783098UDP-N-acetyl-D-mannosamine dehydrogenase
AAT16_03795234-12.495076hypothetical protein
AAT16_03800336-12.974679hypothetical protein
AAT16_03805542-14.847113hypothetical protein
AAT16_03810648-15.948243hypothetical protein
AAT16_03815647-15.871681hypothetical protein
AAT16_03820646-14.511597hypothetical protein
AAT16_03825645-14.184533hypothetical protein
AAT16_03830647-14.225435hypothetical protein
AAT16_03835545-13.324507hypothetical protein
AAT16_03845545-13.276288hypothetical protein
AAT16_03850645-12.571151hypothetical protein
AAT16_03855436-10.668687hypothetical protein
AAT16_03860429-8.547808hypothetical protein
AAT16_03865221-5.755265hypothetical protein
AAT16_03870216-3.195318hypothetical protein
AAT16_03875013-0.643693hypothetical protein
AAT16_03880-1160.787628hypothetical protein
AAT16_03890-2192.244568hypothetical protein
AAT16_03895-1183.250225biotin synthase
AAT16_039150193.006283hypothetical protein
AAT16_03920-1182.022934hypothetical protein
AAT16_03925-1141.596480hypothetical protein
AAT16_039300143.449382hypothetical protein
AAT16_03940-1152.526281MFS transporter
AAT16_03945-1161.943432hypothetical protein
AAT16_039500162.784658hypothetical protein
AAT16_039550193.239975sodium:proton antiporter
AAT16_039600233.148627isoaspartyl dipeptidase
AAT16_039650252.623945hypothetical protein
AAT16_039702313.671476acetylglucosaminyldiphospho-UDP
AAT16_039752293.114386hypothetical protein
AAT16_039800261.104994teichoic acid ABC transporter permease
AAT16_039850252.369393CDP-glycerol:glycerophosphate
AAT16_039901241.867315glycerol-3-phosphate cytidylyltransferase
AAT16_039952212.368028hypothetical protein
AAT16_040051181.535757hypothetical protein
AAT16_040100170.652683hypothetical protein
AAT16_04015016-0.587924membrane protein
AAT16_04020020-5.218836hypothetical protein
AAT16_04025130-7.595852capsule biosynthesis protein
AAT16_04030437-9.793501capsular biosynthesis protein
AAT16_04035544-11.383423hypothetical protein
AAT16_04040750-13.523513polysaccharide biosynthesis protein EpsC
AAT16_040451058-16.314977hypothetical protein
AAT16_04050956-16.463724UDP-glucose 4-epimerase
AAT16_040551058-17.422710capsular biosynthesis protein
AAT16_040601059-17.511764UDP-N-acetylglucosamine 2-epimerase
AAT16_040651057-17.266766hypothetical protein
AAT16_040701055-17.091246hypothetical protein
AAT16_04075650-15.492064hypothetical protein
AAT16_04080445-13.049595hypothetical protein
AAT16_04085027-7.756089hypothetical protein
AAT16_04090-120-5.311257glycosyl transferase
AAT16_04100-116-3.571407hypothetical protein
AAT16_04105-1150.563073capsule biosynthesis protein
AAT16_04110-1161.953776capsular biosynthesis protein
AAT16_04115-1181.843232polysaccharide biosynthesis protein EpsC
AAT16_041200242.713964pyridoxal phosphate-dependent aminotransferase
AAT16_041251303.882384hypothetical protein
AAT16_041303314.283004protein CapI
AAT16_041353324.198919GDP-mannose dehydrogenase
AAT16_041401324.210146hypothetical protein
AAT16_041451314.756353hypothetical protein
AAT16_041501324.280550hypothetical protein
AAT16_041552303.518776hypothetical protein
AAT16_04160019-2.855099hypothetical protein
AAT16_04165124-5.840493sugar transferase
AAT16_04170126-7.962796hypothetical protein
AAT16_04175330-8.049480hypothetical protein
AAT16_04180539-10.122920hypothetical protein
AAT16_04185644-11.527488hypothetical protein
AAT16_04190547-10.992161hypothetical protein
AAT16_04195549-12.240501PTS galactitol transporter subunit IIB
AAT16_04200650-12.930501PTS galactitol transporter subunit IIC
AAT16_04205139-12.854966hypothetical protein
AAT16_04215022-6.792539hypothetical protein
AAT16_04220018-4.372164hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
AAT16_03075  IGASERPTASE  31     0.004    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.8 bits (69), Expect = 0.004
Identities = 23/115 (20%), Positives = 44/115 (38%), Gaps = 2/115 (1%)

Query: 90 PKSEIREMIENGELKEQSDETAEPSKPNPDETVSGEGEPDETAGGSGSEAISGTGVESSE 149
P + E +Q +T E ++ + ET + E + A + V S
Sbjct: 1030 PATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG 1089

Query: 150 SESTSVEPGRPDESSPVHKPEP--TEQKQTQKQPQHKRRNQSRNRKKPNQQKQRQ 202
SE+ + E++ V K E E ++TQ+ P+ + + + Q Q +
Sbjct: 1090 SETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAE 1144


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
AAT16_03200  adhesinmafb  27     0.019    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 27.3 bits (60), Expect = 0.019
Identities = 14/71 (19%), Positives = 26/71 (36%), Gaps = 4/71 (5%)

Query: 48 VNQFLSKGQIVKAKILSVDKHGKLNLTLKENEYFKSEEKKRDRRSVLEQIRETEKYGFES 107
+N F+S G+ + + ++ N E K L + EK E+
Sbjct: 236 LNPFISAGEALGIGDILYGTRYAIDKAAMRNIAPLPAEGKFAVIGGLGSVAGFEKNTREA 295

Query: 108 IRQKMPEWIEE 118
+ + WI+E
Sbjct: 296 VDR----WIQE 302


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
AAT16_03325  GPOSANCHOR  43     6e-06    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 42.7 bits (100), Expect = 6e-06
Identities = 40/127 (31%), Positives = 62/127 (48%), Gaps = 15/127 (11%)

Query: 409 TELDQINRRVMQLEIEEQALKSEDDSVSRNRLEELQKELSEAREAQQALTQRVEKEKAQI 468
+LD QLE E Q L+ E + +S + L+++L +REA++ L +K + Q
Sbjct: 316 RDLDASREAKKQLEAEHQKLE-EQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQN 374

Query: 469 Q-----------KVTGKREELDRVRKELEEAENNYE-LEKA-AELRHGRLPSLEKELAEL 515
+ + RE +V K LEEA + LEK EL + + EKE AEL
Sbjct: 375 KISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLT-EKEKAEL 433

Query: 516 EAQLQEE 522
+A+L+ E
Sbjct: 434 QAKLEAE 440



Score = 37.7 bits (87), Expect = 2e-04
Identities = 18/113 (15%), Positives = 41/113 (36%), Gaps = 2/113 (1%)

Query: 409 TELDQINRRVMQLEIEEQALKSEDDSVSRNRLEELQKELSEAREAQQALTQRVEKEKAQI 468
+ ++ + S L ++ +A + + A+I
Sbjct: 189 EARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKI 248

Query: 469 QKVTGKREELDRVRKELEEAENNYELEKAAELRHGRLPSLEKELAELEAQLQE 521
+ + ++ L+ + ELE+A A+ ++ +LE E A LEA+ +
Sbjct: 249 KTLEAEKAALEARQAELEKALEGAMNFSTADSA--KIKTLEAEKAALEAEKAD 299



Score = 33.9 bits (77), Expect = 0.003
Identities = 23/118 (19%), Positives = 48/118 (40%), Gaps = 17/118 (14%)

Query: 403 EMGSNPTELDQINRRVMQLEIEEQALKSEDDSVSRNRLEELQKELSEAREAQQALTQRVE 462
++ ++ + LE E+ L+ + V + L+++L +REA++ L
Sbjct: 275 FSTADSAKIKTLEAEKAALEAEKADLEHQSQ-VLNANRQSLRRDLDASREAKKQL----- 328

Query: 463 KEKAQIQKVTGKREELDRVRKELEEAENNYELEKAAELRHGRLPSLEKELAELEAQLQ 520
+A+ QK+ + + + R+ L + K LE E +LE Q +
Sbjct: 329 --EAEHQKLEEQNKISEASRQSLRRDLDASREAKKQ---------LEAEHQKLEEQNK 375



Score = 32.3 bits (73), Expect = 0.009
Identities = 31/106 (29%), Positives = 59/106 (55%), Gaps = 3/106 (2%)

Query: 420 QLEIEEQALKSEDDSVSRNRLEELQKELSEAREAQQALTQRVEKEKAQIQKVTGKREELD 479
QLE E Q L+ E + +S + L+++L +REA++ + + +E+ +++ + +EL+
Sbjct: 362 QLEAEHQKLE-EQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELE 420

Query: 480 RVRK--ELEEAENNYELEKAAELRHGRLPSLEKELAELEAQLQEES 523
+K E E+AE +LE A+ +L +ELA+L A +S
Sbjct: 421 ESKKLTEKEKAELQAKLEAEAKALKEKLAKQAEELAKLRAGKASDS 466


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_03430  BCTERIALGSPD  30     0.007    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 29.9 bits (67), Expect = 0.007
Identities = 8/34 (23%), Positives = 18/34 (52%)

Query: 85 DPFSMEQTPMNINNKDIESFLDEAAKDNGETVPV 118
P + E+ + DI+ F++ +K+ +TV +
Sbjct: 23 RPAAAEEFSASFKGTDIQEFINTVSKNLNKTVII 56


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_03480  PF06580  30     0.024    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 30.2 bits (68), Expect = 0.024
Identities = 21/136 (15%), Positives = 41/136 (30%), Gaps = 12/136 (8%)

Query: 103 NTLALTLSIFGGILLLSLLFAFMMSWAGIFDDVFLLVIIISTISLGVVVP---TLKETNL 159
N G + F F + + I IS + L + +K
Sbjct: 9 NKYYWYCQGIGWGVYTLTGFGFASLYGSPKLHSMIFNIAISLMGLVLTHAYRSFIKRQGW 68

Query: 160 ITTQMGQIILLVAVIADLVTMIMLALYSQLYADSS-------QPIWLMGILVVFAVLFYF 212
+ MGQIIL V ++ M+ + ++ + + + ++F V+
Sbjct: 69 LKLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLALSIIFNVVVVT 128

Query: 213 LG--RVMHHAQFLKQL 226
+ F K
Sbjct: 129 FMWSLLYFGWHFFKNY 144


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_03485  DHBDHDRGNASE  70     2e-16    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 70.5 bits (172), Expect = 2e-16
Identities = 65/258 (25%), Positives = 100/258 (38%), Gaps = 17/258 (6%)

Query: 3 LEGKTYVIMGVANKRSIAWGAARALDQMGAKLVFTILNERFRRELEKLLGELEGDHDIVV 62
+EGK I G A + I AR L GA + N ++ L + E H
Sbjct: 6 IEGKIAFITGAA--QGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSL-KAEARHAEAF 62

Query: 63 ECDVQDDAQIESAFREIGEKTGGIDGLLHAIAFAGKDELKGGYSETTREGFKNALDISTY 122
DV+D A I+ I + G ID L++ AG G + E ++ +++
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNV---AGVLRP-GLIHSLSDEEWEATFSVNST 118

Query: 123 SLTVVAKHAKKIM--NEGGSIVTMTYLGGERAMPNYNVMGVAKAALDSSVRYLAYDLGED 180
+ ++ K M GSIVT+ + +KAA + L +L E
Sbjct: 119 GVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEY 178

Query: 181 GFRVNAVSAGPIRT-----LSSSAVGEFKSILKEIEE---KAPLRRNVDQLEVGNTVAFL 232
R N VS G T L + G + I +E PL++ ++ + V FL
Sbjct: 179 NIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFL 238

Query: 233 LSDLASGITGEVVHVDSG 250
+S A IT + VD G
Sbjct: 239 VSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_03530  HTHTETR  57     3e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 56.6 bits (136), Expect = 3e-12
Identities = 15/59 (25%), Positives = 30/59 (50%)

Query: 12 RQYEIFAAAMAEFGEHGFKKASTNRIVKRAGMSKGMLYYYFDNKQSIFDDALDFALDHI 70
+ I A+ F + G S I K AG+++G +Y++F +K +F + + + +I
Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_03540  SECYTRNLCASE  27     0.014    Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 27.4 bits (61), Expect = 0.014
Identities = 10/45 (22%), Positives = 21/45 (46%)

Query: 61 GNRLVMLIFYMVFVFLPAILISVFQNNILLLGSIFVFTIFVYFIV 105
GN + +L+F + P+ L ++ + L G I T+ ++
Sbjct: 187 GNGMSILMFISIAATFPSALWAIKKQGTLAGGWIEFGTVIAVGLI 231


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_03590  TCRTETA  49     2e-08    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 49.1 bits (117), Expect = 2e-08
Identities = 40/173 (23%), Positives = 68/173 (39%), Gaps = 6/173 (3%)

Query: 251 IMLQGLGVGMLLPVLPTYITSELSLNYFQYTFFILIVFGLVGFSMTVLSRALDTNSVRL- 309
+ L +G+G+++PVLP + + N + IL+ L + L S R
Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILL--ALYALMQFACAPVLGALSDRFG 71

Query: 310 TFAVICGGFLIYAVGIMWFSTLETIWLIFAIASFIGLSYGIMLPAWNKYLAGTIMQDKSA 369
V+ AV +T +W+++ I + G Y+A D+ A
Sbjct: 72 RRPVLLVSLAGAAVDYAIMATAPFLWVLY-IGRIVAGITGATGAVAGAYIADITDGDERA 130

Query: 370 ESWGVISSVQGIGAMIGPALGGLTADLFGTVDATLLASGLIFVLLFVYYAVLF 422
+G +S+ G G + GP LGGL + A A+ + L F+ L
Sbjct: 131 RHFGFMSACFGFGMVAGPVLGGLMGGF--SPHAPFFAAAALNGLNFLTGCFLL 181



Score = 34.4 bits (79), Expect = 7e-04
Identities = 20/103 (19%), Positives = 44/103 (42%), Gaps = 2/103 (1%)

Query: 313 VICGGFLIYAVGIMWFSTLETIWLIFAIASFIGLSYGIMLPAWNKYLAGTIMQDKSAESW 372
+ G + G + + W+ F I + S GI +PA L+ + +++ +
Sbjct: 279 ALMLGMIADGTGYILLAFATRGWMAFPIMVLLA-SGGIGMPALQAMLSRQVDEERQGQLQ 337

Query: 373 GVISSVQGIGAMIGPALGG-LTADLFGTVDATLLASGLIFVLL 414
G ++++ + +++GP L + A T + +G LL
Sbjct: 338 GSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLL 380


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
AAT16_03605  TCRTETOQM  218    1e-65    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 218 bits (556), Expect = 1e-65
Identities = 117/455 (25%), Positives = 207/455 (45%), Gaps = 64/455 (14%)

Query: 15 IISHPDAGKTTLTEKLLLFGGAIREAGTV-KGKKSNKFATSDWMKVEQERGISVTSSVMQ 73
+++H DAGKTTLTE LL GAI E G+V KG +D +E++RGI++ + +
Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGT-----TRTDNTLLERQRGITIQTGITS 62

Query: 74 FDFDGYKINILDTPGHEDFSEDTYRTLMAVDSAVMVIDAAKGIEPQTLKLFKVCKMRGIP 133
F ++ K+NI+DTPGH DF + YR+L +D A+++I A G++ QT LF + GIP
Sbjct: 63 FQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122

Query: 134 IFTFINKLDRMGKEPFELLEEIESTLEIETYPMTWPVGMGQSFFGIINRKDRTINPYREE 193
FINK+D+ G + + ++I+ L E V + N E
Sbjct: 123 TIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ-KVELYP--------NMCVTNFTESE 173

Query: 194 EKLQLTDDYGLKENHPIEADEAFQTAVEEFMLVEEAGDDFDKEKIST--------GDLTP 245
+ + IE ++ +E++M +G + ++ L P
Sbjct: 174 QWDTV-----------IEGNDDL---LEKYM----SGKSLEALELEQEESIRFHNCSLFP 215

Query: 246 VFFGSALSTFGIEEFLGTYVDFAPMPTSRQTKEDTEIEPLDDAFTGFIFKIQANMDPRHR 305
V+ GSA + GI+ + + T R E G +FKI+ + R
Sbjct: 216 VYHGSAKNNIGIDNLIEVITNKFYSSTHRGQSE----------LCGKVFKIE--YSEK-R 262

Query: 306 DRLAFMRIVSGKFTRGMDATLARTGRKSKVSRATMFMADDTETVNEAYAGDIIGLYDTG- 364
RLA++R+ SG D+ K K++ + + +++AY+G+I+ L +
Sbjct: 263 QRLAYIRLYSGVLHLR-DSVRISEKEKIKITEMYTSINGELCKIDKAYSGEIVILQNEFL 321

Query: 365 --TYQIGDTLYGPGAKKVEFEALPQFTPELFMKVSAKNVMKQKHFYKGIEQLVQEG-TIQ 421
+GDT P +++E P P L V +++ + ++ ++
Sbjct: 322 KLNSVLGDTKLLPQRERIEN---PL--PLLQTTVEPSKPQQREMLLDALLEISDSDPLLR 376

Query: 422 YYKTMHTNQPILGAVGQLQFEVFEHRMKNEYNTDV 456
YY T++ IL +G++Q EV ++ +Y+ ++
Sbjct: 377 YYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEI 411


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_03665  ARGREPRESSOR  26     0.034    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 26.4 bits (58), Expect = 0.034
Identities = 15/42 (35%), Positives = 19/42 (45%), Gaps = 5/42 (11%)

Query: 37 QLYIIEMIAEEPGITQKTLVERFKKK-----QTSVSRAITRL 73
+ I E+I TQ LV+ KK Q +VSR I L
Sbjct: 7 HIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKEL 48


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_03725  BCTERIALGSPD  32     0.010    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 31.8 bits (72), Expect = 0.010
Identities = 18/94 (19%), Positives = 38/94 (40%), Gaps = 8/94 (8%)

Query: 242 RHMVDNSLSRTKSNYEFS----ITDRVKVLEKLQDILKVEDDEEVRTLIIDRMR-AQAGF 296
R + DN+ + +YE S +T R V+++L I++ D+ R+++ + A A
Sbjct: 147 RQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSVVTVPLSWASAAD 206

Query: 297 MNRYLKE-HMEDYGQVASEVAHFTDEYFPYARMN 329
+ + + E + + R N
Sbjct: 207 VVKLVTELNKDTSKSALPGSM--VANVVADERTN 238


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_03730  SACTRNSFRASE  42     1e-07    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 42.2 bits (99), Expect = 1e-07
Identities = 26/117 (22%), Positives = 45/117 (38%), Gaps = 9/117 (7%)

Query: 8 TKELYEQCLDIRKRVFVEEQNVPLDREIDEHEDFATHILLRDDTPLGTVRYRPLSKETVK 67
T+E + K F + ++ +D E E A + ++ +G ++ R
Sbjct: 38 TEERFS------KPYFKQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYAL 91

Query: 68 VERMAVMPEARGLKLGRKLMDFVHEHAKHYGYEKARLGAQTH---AASFYEKLGYKI 121
+E +AV + R +G L+ E AK + L Q A FY K + I
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_03760  TYPE3IMPPROT  28     0.038    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 28.2 bits (63), Expect = 0.038
Identities = 15/76 (19%), Positives = 35/76 (46%), Gaps = 9/76 (11%)

Query: 218 IQYGEMKEFGDAL-PIVERYNEFINQYKLLTKEEQLQYKEKMMEK--QKEKSRRAKPKKD 274
+ + ++ + ++ Y +++ +Y + E +Q+ E K E++ K KD
Sbjct: 80 VTFNDISSLSKHVDEGLDGYRDYLIKY---SDRELVQFFENAQLKRQYGEETETVKRDKD 136

Query: 275 ---KAPLSSLIPAFIL 287
K + +L+PA+ L
Sbjct: 137 EIEKPSIFALLPAYAL 152


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
AAT16_03790  ALARACEMASE  29     0.042    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 28.6 bits (64), Expect = 0.042
Identities = 15/50 (30%), Positives = 22/50 (44%), Gaps = 3/50 (6%)

Query: 284 NVSMPQYVVDYTKEILEKLEGDKVTVFGLTYKGDVDDIRESPAFDIYELL 333
VSM VD T + G V ++G K +DD+ + YEL+
Sbjct: 298 TVSMDMLAVDLTP-CPQAGIGTPVELWGKEIK--IDDVAAAAGTVGYELM 344


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_03795  PF05272  30     0.022    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.7 bits (66), Expect = 0.022
Identities = 28/156 (17%), Positives = 55/156 (35%), Gaps = 19/156 (12%)

Query: 12 VRDFDVSHYVWIYEKGNK----PILQSVQSMRKTGELKDFQKQILRVLSHRKVG-NKDFY 66
+ + H + E G K +L+ + K+ + +H +G KD Y
Sbjct: 577 GKYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSY 636

Query: 67 --YLEHEGHELGWAEL----KTSI----VVYSKPREHVRLDLDKFMQEQEKQ-IFVVSKN 115
+EL E+ + +S ++ R +++Q+ +Q + + N
Sbjct: 637 EQIAGIVAYELS--EMTAFRRADAEAVKAFFSSRKDRYRGAYGRYVQDHPRQVVIWCTTN 694

Query: 116 NLRLLKDQMLDSRFIMVK-DGVEYEALFKKHRLQGW 150
+ L D + RF V G +K R Q +
Sbjct: 695 KRQYLFDITGNRRFWPVLVPGRANLVWLQKFRGQLF 730


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
AAT16_03890  UREASE  34     4e-04    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 33.6 bits (77), Expect = 4e-04
Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 8/66 (12%)

Query: 134 KMTLDAARLNGTEGEEGSVEAGKYADFVVLNDNPLGYDVELTDDLVEMTIVNGKIVYGSR 193
K T++ A +G E GS+E GK AD V+ NP + V+ +M ++ G I
Sbjct: 408 KYTINPAIAHGLSHEIGSLEVGKRADLVLW--NPAFFGVK-----PDMVLLGGTIAAAP- 459

Query: 194 SGDQGA 199
GD A
Sbjct: 460 MGDPNA 465


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_03940  TCRTETB  62     1e-12    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 62.2 bits (151), Expect = 1e-12
Identities = 58/184 (31%), Positives = 76/184 (41%), Gaps = 8/184 (4%)

Query: 25 LGLLAIMGPLNIDMYLPSFPGIARDLGTSPSLVQVSLTACLLGLAFGQVVIGPLSDAQGR 84
L +L+ LN + S P IA D P+ TA +L + G V G LSD G
Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78

Query: 85 KRPLLIATSLFVVSSLLCAVAPNIY-VLIAARFLQGFTASAGVVLSRAVVRDVFSGRELS 143
KR LL + S++ V + + +LI ARF+QG A+A L VV
Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138

Query: 144 KFFSLLMVINAVAPMAAPIAGGAILLLPFASWHTIFLFLAVLGIMIVIIVAVSLRETLPP 203
K F L+ I A+ P GG I H I +L MI II L + L
Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIA-------HYIHWSYLLLIPMITIITVPFLMKLLKK 191

Query: 204 AQRI 207
RI
Sbjct: 192 EVRI 195


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
AAT16_03960  UREASE  44     5e-07    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 44.3 bits (105), Expect = 5e-07
Identities = 32/124 (25%), Positives = 49/124 (39%), Gaps = 34/124 (27%)

Query: 5 IKNGDIYAPEHVGKKSVLLNGRIIIKIGDIDEEQLGRLFDVEVIDAEGMIVSPGIIDPHV 64
+K+G I A G + II+ G EVI EG IV+ G +D H+
Sbjct: 90 LKDGRIAAIGKAGNPDMQPGVTIIVGPG------------TEVIAGEGKIVTAGGMDSHI 137

Query: 65 HLIGGGGEGGFATRTPELQLSNIIKAGVTTVVG-----CLGTDGTT-----RHMTSLLAK 114
H I P+ Q+ + +G+T ++G GT TT H+ ++
Sbjct: 138 HFI-----------CPQ-QIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMIEA 185

Query: 115 ARAL 118
A A
Sbjct: 186 ADAF 189


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
AAT16_03965  HTHFIS  151    4e-42    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 151 bits (383), Expect = 4e-42
Identities = 79/388 (20%), Positives = 146/388 (37%), Gaps = 40/388 (10%)

Query: 90 MQKTSKFMVVNDNPA--ATLETI---EDLENVLPDH--DFLPYMAHEPMPENFDFIIT-- 140
M +V +D+ A L + + + ++A D ++T
Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD----GDLVVTDV 55

Query: 141 --PGEANLVPTKAYQTFDIGARVVSIE---TVMELKEIFELEMKDSLLMQYYIKTMVHLT 195
P E + V+ + T M + E D L + + ++ +
Sbjct: 56 VMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115

Query: 196 AKRSENTPVSIADQNKN-RTFSGISTESPQMQSTIRIASQMAKTSNIIHITGETGTGKQM 254
+ + + + + S MQ R+ +++ +T + ITGE+GTGK++
Sbjct: 116 GRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKEL 175

Query: 255 LAEMIHNDSAYHDMPFYIYSGADKDPQSIDNELFG-------GEGEKHQGILREVNRGTV 307
+A +H+ + PF + A I++ELFG G + G + GT+
Sbjct: 176 VARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTL 235

Query: 308 YIKNIDSIPYQLQNKLANYFDANA----GSS-----DVRIVTSSIDDLWELYKGDIISQK 358
++ I +P Q +L G DVRIV ++ DL + + +
Sbjct: 236 FLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFRED 295

Query: 359 LYSYLSSYILKVPSISERKEDIPVLIDDFKNHFNRTEMQ---FSERVMNAFVRYDWPGNV 415
LY L+ L++P + +R EDIP L+ F + + F + + + WPGNV
Sbjct: 296 LYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNV 355

Query: 416 RELYNLISYCVCLNQ-KYVEIDSLPIFF 442
REL NL+ L + + +
Sbjct: 356 RELENLVRRLTALYPQDVITREIIENEL 383


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_03980  PF06580  29     0.018    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.1 bits (65), Expect = 0.018
Identities = 15/71 (21%), Positives = 22/71 (30%)

Query: 37 IVWEVLTPVISIMIYWFVFGTLRQRAPIEMGGTEVPFFYWLAIGFIVWTFFFQGSIEASK 96
I+ VL + I + WFV T R + V F LA+ I
Sbjct: 76 IILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLALSIIFNVVVVTFMWSLLY 135

Query: 97 SIYRRLKMLSK 107
+ K +
Sbjct: 136 FGWHFFKNYKQ 146


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_03990  LPSBIOSNTHSS  37     6e-06    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 37.1 bits (86), Expect = 6e-06
Identities = 27/121 (22%), Positives = 50/121 (41%), Gaps = 16/121 (13%)

Query: 14 KVITYGTFDLLHMGHINILRRAKERGDYLVVAVSSDEFNKLKHKEAYYSYEDR-KAILEA 72
I G+FD + GH++I+ R D + VAV + +K+ +S ++R + I +A
Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRN-----PNKQPMFSVQERLEQIAKA 56

Query: 73 IKYVDEVIPEHNWGQKVKDVQKHDIDVFVMG----DDWKGEFDF------LKEYCEVVYL 122
I ++ + G V ++ + G D++ E L E V+L
Sbjct: 57 IAHLPNAQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFL 116

Query: 123 A 123

Sbjct: 117 T 117


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_04040  NUCEPIMERASE  87     8e-21    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 87.1 bits (216), Expect = 8e-21
Identities = 56/299 (18%), Positives = 103/299 (34%), Gaps = 45/299 (15%)

Query: 282 TILVTGAGGSIGSELVRQISKFQPRQVVLLGHGENSIYTILEEMS--GIKGNIEYIPIIA 339
LVTGA G IG + +++ + QVV + N Y + + + + +
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLE-AGHQVVGI-DNLNDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 340 DVQDRKRIFKIFDKYKPNIVYHAAAHKHVPLMEYNPKEAVKNNIIGTKNTAEAAIEYKAE 399
D+ DR+ + +F V+ + V NP +N+ G N E K +
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 400 KFVLIST---------------DKAVNPPNVMGATKRMAEMVVQVLNGESEQTTLVAVRF 444
+ S+ D +P ++ ATK+ E++ + +RF
Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYS-HLYGLPATGLRF 178

Query: 445 GNVLGSRGS---VIPKFRKQIEAGGPITVTDE-RMTRYFMTI------------------ 482
V G G + KF K + G I V + +M R F I
Sbjct: 179 FTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHAD 238

Query: 483 PEASRLVIQAGTLANGGEVFVLDMGQPVKIVDLARNMIRLSGYSETEIQIQFSGIRPGE 541
+ + V+ + PV+++D + + G E + ++PG+
Sbjct: 239 TQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG---IEAKKNMLPLQPGD 294


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_04050  NUCEPIMERASE  69     3e-15    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 69.0 bits (169), Expect = 3e-15
Identities = 49/265 (18%), Positives = 97/265 (36%), Gaps = 32/265 (12%)

Query: 8 LITGGTGSFGNAVLDRFLETDIKEIRIFSRDEKKQDDMRKKYRNEKI-----KFHLGDVR 62
L+TG G G V R LE + + I + ++ D K+ R E + +FH D+
Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYY-DVSLKQARLELLAQPGFQFHKIDLA 62

Query: 63 DKDSVKN--SMHGVDYIFHAAALKQVPSCEFFPMEAVKTNVVGTENVIDAAIEKNVEKVI 120
D++ + + + + +F + V P +N+ G N+++ ++ ++
Sbjct: 63 DREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHLL 122

Query: 121 CLST---------------DKAAYPINAMGISKAMMEKVLVAKSKTVSSEDTLICGTRYG 165
S+ D +P++ +K E + S T G R+
Sbjct: 123 YASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPAT---GLRFF 179

Query: 166 NVMASRGS---VIPLFIQQIKEGKDITV-TDPNMTRFLMSLEEAVELVVFAFENAKSGDI 221
V G + F + + EGK I V M R +++ E ++ + D
Sbjct: 180 TVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHAD- 238

Query: 222 MVQKSPSSTIKDLAQALKELFNADN 246
Q + + + A ++N N
Sbjct: 239 -TQWTVETGTPAASIAPYRVYNIGN 262


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_04055  NUCEPIMERASE  57     3e-11    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 57.1 bits (138), Expect = 3e-11
Identities = 47/229 (20%), Positives = 79/229 (34%), Gaps = 57/229 (24%)

Query: 1 MNILITGANGFVGKNLSAELEQNTNYIV----------------------------YKI- 31
M L+TGA GF+G ++S L + + +V +KI
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 32 --TRETTEETFEEYCKKADFVFHL---AGVNR-PKNEKEFMTGNLDFTVKLVNELKKHDN 85
RE + F + VF V +N + NL + ++ E +H+
Sbjct: 61 LADREGMTDLFASG--HFERVFISPHRLAVRYSLENPHAYADSNLTGFLNIL-EGCRHNK 117

Query: 86 FAPVLITSSIQ----------AELD------NPYGKSKKAGEDIVFEYGGNNKVKTFVYR 129
+L SS + D + Y +KKA E + Y + R
Sbjct: 118 IQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLR 177

Query: 130 LPNLFGKWCRPNYNSVVATFSHNIANGLPIRI-DNPDAKIKLLYIDDLI 177
++G W RP+ + F+ + G I + + K YIDD+
Sbjct: 178 FFTVYGPWGRPDM--ALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIA 224


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_04115  NUCEPIMERASE  82     6e-19    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 81.8 bits (202), Expect = 6e-19
Identities = 56/313 (17%), Positives = 100/313 (31%), Gaps = 72/313 (23%)

Query: 282 TILVTGAGGSIGSEIVRQIAKFQPRKILLLGHGENSIYTILEEVLDNKTDSIS------- 334
LVTGA G IG + ++ LL G + +DN D
Sbjct: 2 KYLVTGAAGFIGFHVSKR----------LLEAGHQVV------GIDNLNDYYDVSLKQAR 45

Query: 335 --------YVPIIADVQNRKRMFKVFEKYRPDIVYHAAAHKHVPMMEYNPQEAVKNNVIG 386
+ D+ +R+ M +F + V+ + V NP +N+ G
Sbjct: 46 LELLAQPGFQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTG 105

Query: 387 TKNTAEAACHFKAKKFVMIST---------------DKAVNPPNVMGATKRMAEMIVQAL 431
N E H K + + S+ D +P ++ ATK+ E++
Sbjct: 106 FLNILEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTY 165

Query: 432 DKGCEHTTLVAVRFGNVLGSRGS---VVPKFKKQIQLGGPVTV-TDPRMTRYFMTI---- 483
+RF V G G + KF K + G + V +M R F I
Sbjct: 166 SH-LYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIA 224

Query: 484 --------------PEASRLVIQASTLAEGGEVFVLDMGEPVKIVDLAKNMIRLCGFAEE 529
+ + + V+ + PV+++D + + G
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGI--- 281

Query: 530 DIGIEFVGIRPGE 542
+ + ++PG+
Sbjct: 282 EAKKNMLPLQPGD 294


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
AAT16_04120  STREPTOPAIN  30     0.017    Streptopain (C10) cysteine protease family signature.
		>STREPTOPAIN#Streptopain (C10) cysteine protease family signature.

Length = 398

Score = 30.0 bits (67), Expect = 0.017
Identities = 29/148 (19%), Positives = 48/148 (32%), Gaps = 24/148 (16%)

Query: 162 DAAESLGAVYKGRMSGTFGKFGVYSFNGNKIITTSGGGMIISDEEIM----------IKK 211
S Y F +N N I+ T G S+ + M I
Sbjct: 215 TYTLSSNNPYFNHPKNLFAAISTRQYNWNNILPTYSGR--ESNVQKMAISELMADVGISV 272

Query: 212 ALKKATQSKETAAHYQH----ENVGYNYRLSNICAGIGRGQ--MEVLEERIRQKRAIFEQ 265
+ S + EN GYN + I G Q +++ + Q + ++ Q
Sbjct: 273 DMDYGPSSGSAGSSRVQRALKENFGYNQSVHQINRGDFSKQDWEAQIDKELSQNQPVYYQ 332

Query: 266 YVYGLGDVDGLGFM---PEANDSFHTRW 290
G+G V G F+ + + +H W
Sbjct: 333 ---GVGKVGGHAFVIDGADGRNFYHVNW 357


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_04130 | NUCEPIMERASE | 533 | 0.0 | Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 533 bits (1376), Expect = 0.0
Identities = 200/331 (60%), Positives = 250/331 (75%)

Query: 3 KILVTGSAGFIGSHLSARLLQEGYTVAGIDNLNDYYDVGLKKDRLELLLQNRVKSYEADI 62
K LVTG+AGFIG H+S RLL+ G+ V GIDNLNDYYDV LK+ RLELL Q + ++ D+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 63 SDTGSVMEIFESEKPDIVINLAAQAGVRYSLENPHAYITSNINGFTNILEACRHQKVEQL 122
+D + ++F S + V + VRYSLENPHAY SN+ GF NILE CRH K++ L
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 123 IYASSSSVYGANTSKPFSTSDNIDHPLSLYAATKKANELMAHTYSHLYRLPTTGLRFFTV 182
+YASSSSVYG N PFST D++DHP+SLYAATKKANELMAHTYSHLY LP TGLRFFTV
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFTV 181

Query: 183 YGPWGRPDMALFKFTKAILEDRPIDVYNNGDMLRDFTYVDDIVESIHRLVKLTPKPDPEW 242
YGPWGRPDMALFKFTKA+LE + IDVYN G M RDFTY+DDI E+I RL + P D +W
Sbjct: 182 YGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQW 241

Query: 243 SGDNPNPSSSNAPYRIYNIGNNAPVRLMAFIEAIENRLGKKGEKNFMPLQPGDVPETYAD 302
+ + P++S APYR+YNIGN++PV LM +I+A+E+ LG + +KN +PLQPGDV ET AD
Sbjct: 242 TVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLETSAD 301

Query: 303 VEDLFRTTGFRPSTDIQDGVNHFIDWYLGYY 333
+ L+ GF P T ++DGV +F++WY +Y
Sbjct: 302 TKALYEVIGFTPETTVKDGVKNFVNWYRDFY 332


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_04205 | TCRTETB | 27 | 0.006 | Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 27.2 bits (60), Expect = 0.006
Identities = 6/41 (14%), Positives = 17/41 (41%), Gaps = 4/41 (9%)

Query: 13 FMVLLGAALMIIGF----FTKDIKMWFIAFAIALLVRYYAA 49
+++ +G + + F F + WF+ I ++ +
Sbjct: 324 YVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSF 364


10 | AAT16_04295 | AAT16_04670 | Y | Y | Y | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
AAT16_04295 | -2 | 11 | 3.243887 | sugar phosphate isomerase
AAT16_04300 | -2 | 12 | 3.632691 | fructose 1,6-bisphosphatase
AAT16_04305 | -2 | 15 | 3.996265 | transketolase
AAT16_04310 | 1 | 23 | 4.806757 | hypothetical protein
AAT16_04315 | 1 | 25 | 5.171355 | aconitate hydratase
AAT16_04320 | 0 | 24 | 4.112243 | helicase SNF2
AAT16_04325 | -1 | 17 | -2.563205 | hypothetical protein
AAT16_04330 | 0 | 17 | -3.073640 | methionine sulfoxide reductase A
AAT16_04340 | -1 | 18 | -5.236393 | *hypothetical protein
AAT16_04345 | 0 | 28 | -7.993441 | hypothetical protein
AAT16_04350 | 0 | 27 | -6.157030 | hypothetical protein
AAT16_04355 | 4 | 30 | -7.878276 | hypothetical protein
AAT16_04360 | 3 | 30 | -5.613830 | hypothetical protein
AAT16_04365 | 2 | 30 | -3.869145 | hypothetical protein
AAT16_04370 | 0 | 24 | 0.870783 | hypothetical protein
AAT16_04375 | 0 | 19 | 4.002915 | hypothetical protein
AAT16_04380 | 1 | 20 | 5.024570 | prevent-host-death protein
AAT16_04385 | 2 | 22 | 5.899861 | addiction module protein
AAT16_04390 | 2 | 25 | 7.104448 | hypothetical protein
AAT16_04395 | 3 | 28 | 6.958860 | hypothetical protein
AAT16_04400 | 5 | 32 | 7.089445 | sodium:proline symporter
AAT16_04405 | 3 | 28 | 6.350682 | X-Pro dipeptidase
AAT16_04410 | 2 | 23 | 4.567669 | hypothetical protein
AAT16_04415 | 0 | 20 | 3.897405 | N-acetylmannosamine-6-phosphate 2-epimerase
AAT16_04420 | 0 | 20 | 3.789309 | RpiR family transcriptional regulator
AAT16_04425 | 1 | 20 | 4.109569 | PTS glucose transporter subunit IIB
AAT16_04430 | 0 | 19 | 3.469513 | sodium transporter
AAT16_04435 | 0 | 21 | 3.296947 | hypothetical protein
AAT16_04440 | 1 | 25 | 4.843555 | hypothetical protein
AAT16_04445 | 3 | 27 | 5.831850 | hypothetical protein
AAT16_04450 | 2 | 24 | 4.833729 | ribokinase
AAT16_04455 | 1 | 21 | 4.061406 | ribose ABC transporter
AAT16_04460 | 1 | 20 | 3.377714 | hypothetical protein
AAT16_04465 | 0 | 19 | 3.134983 | ribose ABC transporter permease
AAT16_04470 | -1 | 20 | 3.296799 | D-ribose transporter subunit RbsB
AAT16_04475 | -1 | 19 | 2.708030 | hypothetical protein
AAT16_04480 | 0 | 20 | 3.101900 | phosphate:nucleotide phosphotransferase
AAT16_04485 | -3 | 13 | 0.448277 | hypothetical protein
AAT16_04490 | 1 | 18 | -1.432048 | hypothetical protein
AAT16_04495 | 2 | 25 | -3.809574 | hypothetical protein
AAT16_04500 | 6 | 39 | -7.295908 | hypothetical protein
AAT16_04505 | 6 | 44 | -8.550001 | hypothetical protein
AAT16_04515 | 7 | 48 | -9.309447 | hypothetical protein
AAT16_04520 | 6 | 47 | -9.404417 | hypothetical protein
AAT16_04525 | 6 | 46 | -8.329856 | carbon monoxide dehydrogenase
AAT16_04530 | 6 | 48 | -8.379132 | carbon monoxide dehydrogenase
AAT16_04535 | 6 | 48 | -9.298624 | molybdopterin dehydrogenase
AAT16_04540 | 6 | 49 | -10.046174 | (2Fe-2S)-binding protein
AAT16_04545 | 5 | 43 | -9.034314 | 4-chlorobenzoate--CoA ligase
AAT16_04550 | 5 | 37 | -7.339313 | crotonase
AAT16_04555 | 3 | 27 | -5.367364 | benzoate transporter
AAT16_04560 | 0 | 19 | -3.155441 | hypothetical protein
AAT16_04565 | -2 | 18 | 0.039802 | hypothetical protein
AAT16_04570 | -2 | 19 | 2.911419 | hypothetical protein
AAT16_04575 | 0 | 21 | 4.394989 | threonine transporter RhtB
AAT16_04580 | 3 | 26 | 5.680796 | hypothetical protein
AAT16_04585 | 3 | 29 | 6.186214 | sodium:proton antiporter
AAT16_04595 | 5 | 35 | 6.520853 | hypothetical protein
AAT16_04600 | 5 | 36 | 6.699270 | hypothetical protein
AAT16_04605 | 4 | 35 | 6.998514 | carbon-phosphorus lyase
AAT16_04610 | 3 | 31 | 6.043092 | carbon-phosphorus lyase complex subunit PhnJ
AAT16_04615 | 3 | 30 | 5.424325 | phosphonate C-P lyase system protein PhnK
AAT16_04620 | 3 | 29 | 5.051017 | phosphonate metabolism protein PhnM
AAT16_04625 | 2 | 27 | 5.300876 | phosphonate ABC transporter ATP-binding protein
AAT16_04630 | 3 | 25 | 6.092821 | phosphoesterase
AAT16_04635 | 3 | 25 | 6.089903 | phosphonate ABC transporter substrate-binding
AAT16_04640 | 3 | 23 | 5.486318 | phosphonate ABC transporter ATP-binding protein
AAT16_04645 | 2 | 20 | 5.052529 | phosphonate ABC transporter permease
AAT16_04650 | 1 | 20 | 5.105283 | phosphonate ABC transporter permease
AAT16_04655 | 1 | 19 | 4.961961 | membrane protein
AAT16_04660 | 0 | 18 | 3.957063 | hypothetical protein
AAT16_04665 | -1 | 15 | 3.834797 | hypothetical protein
AAT16_04670 | -1 | 15 | 3.963489 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_04390 | IGASERPTASE | 30 | 0.005 | IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.4 bits (68), Expect = 0.005
Identities = 16/53 (30%), Positives = 27/53 (50%), Gaps = 1/53 (1%)

Query: 122 ADGSKKESEETAEEQTEEEQPEEQTEEEQPE-EQTDEVPAEESEGAVEDAAVE 173
S E++ET +T+E E+ E+ + E E+T EVP S+ + + E
Sbjct: 1085 VAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSE 1137


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_04490 | HTHTETR | 55 | 1e-11 | TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 55.0 bits (132), Expect = 1e-11
Identities = 36/198 (18%), Positives = 73/198 (36%), Gaps = 14/198 (7%)

Query: 1 MEKQDLRKIKTRKAIDQAFTALIAEKGFEAMTIKDIAEEAIINRGTFYMHYEDKYALLES 60
K +TR+ I L +++G + ++ +IA+ A + RG Y H++DK L
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 61 YENTLLDGLYEILSRNIEEEHHKLSIGMPRKIATDTFNY-ISENADKIIALF-----NNQ 114
+ E+ + + + R+I ++E +++
Sbjct: 62 IWELSESNIGELELEYQAKF-PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 115 GENQFEHKVRAHMLNYYRIHSDQL----IDKNRLRVDID-YLLAYITNAHI-GLIRNW-L 167
GE + + ++ +Q I+ L D+ A I +I GL+ NW
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 168 EHGRRETSEELADILEML 185
+ +E D + +L
Sbjct: 181 APQSFDLKKEARDYVAIL 198


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_04565 | HTHFIS | 55 | 8e-11 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 54.8 bits (132), Expect = 8e-11
Identities = 22/122 (18%), Positives = 47/122 (38%), Gaps = 3/122 (2%)

Query: 2 SILIIDDDLESSVRITNILKQSIHSDIKILEARSATEGLKMVKEDRPFIVVTELSLSDST 61
+IL+ DDD + L ++ + +A + + +VVT++ + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRA---GYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GLEVGKKILSEFNDIFVIAISQLKMFELVQESINSGFSGFHLKPVIKSEFLSTIERLILS 121
++ +I D+ V+ +S F ++ G + KP +E + I R +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 122 RT 123

Sbjct: 122 PK 123


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_04585 | TYPE3IMSPROT | 29 | 0.035 | Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.
Length = 354

Score = 29.3 bits (66), Expect = 0.035
Identities = 23/155 (14%), Positives = 56/155 (36%), Gaps = 14/155 (9%)

Query: 55 DKMEKGIITRLKTAMPAIFILFAVGIII--GTWIYSGTVPLLIYYGLQIISPTYFLVTAF 112
D +KG + + K + I+ +++ + + L++ Q P ++
Sbjct: 16 DARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYV 75

Query: 113 VIVAVVSVATGTAWGSTATAGVALMGIAAELDVSLAMAAGAVISGGVFGDKLSPLSDTTN 172
V ++ ALM IA+ + + G +ISG + ++
Sbjct: 76 VDNVLLEFFYLC---FPLLTVAALMAIASHV-----VQYGFLISGEAIKPDIKKINPIEG 127

Query: 173 LAPLVVEVNLYEHIKHMLWTTVPASIVGLIIWFFV 207
+ +L E +K + + ++ ++IW +
Sbjct: 128 AKRIFSIKSLVEFLK----SILKVVLLSILIWIII 158


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_04620 | UREASE | 36 | 2e-04 | Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 36.3 bits (84), Expect = 2e-04
Identities = 14/26 (53%), Positives = 19/26 (73%)

Query: 339 ITLNPAEAVNMDHEIGSIREGKKADI 364
T+NPA A + HEIGS+ GK+AD+
Sbjct: 409 YTINPAIAHGLSHEIGSLEVGKRADL 434


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_04625 | PF05272 | 32 | 0.002 | Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 32.4 bits (73), Expect = 0.002
Identities = 13/69 (18%), Positives = 28/69 (40%), Gaps = 7/69 (10%)

Query: 36 LGIVGRSGSGKSTILKSIYGTYMPEEGAIMYHSKENGPVNIL-----EINDYELIRLRKT 90
+ + G G GKST++ ++ G + + ++ I E++ E+ R+
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELS--EMTAFRRA 656

Query: 91 EIGYVSQFL 99
+ V F
Sbjct: 657 DAEAVKAFF 665


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_04660 | DHBDHDRGNASE | 30 | 0.001 | 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 29.6 bits (66), Expect = 0.001
Identities = 12/46 (26%), Positives = 25/46 (54%)

Query: 42 GKEFVSVNPMKRFGEPEEVGNLVTFLLSNEATFSNAAVIPIDGGQS 87
+ F + P+K+ +P ++ + V FL+S +A + +DGG +
Sbjct: 213 LETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_04665 | adhesinb | 168 | 4e-50 | Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 168 bits (426), Expect = 4e-50
Identities = 68/316 (21%), Positives = 124/316 (39%), Gaps = 12/316 (3%)

Query: 1 MFRRSLWFLSAMSVIILTACGAASPEESEGSGKIEVYTTVFALQSLTEQIAGDNAEVHSI 60
M + L ++ + L AC + GS K+ V T + +T+ IAGD +HSI
Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSI 60

Query: 61 YPNGTDIHSYEPTQKDMLSYAESDLFITTNKELDAVSGKIADVLNEDIEILEAVGDTGHL 120
P G D H YEP +D+ +++DL L+ + +E + + +
Sbjct: 61 VPVGQDPHEYEPLPEDVKKTSQADLIFYNGINLE---TGGNAWFTKLVENAKKKENKDYY 117

Query: 121 LEDTHSHDHGEGDDHDHSHGEIDPHVWLDPVLSIDMAEAIKDKLSTLDPDNAEAYEENFE 180
+ E DPH WL+ I A+ I +LS DP N E YE+N +
Sbjct: 118 AVSEGVDVI-YLEGQSEKGKE-DPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLK 175

Query: 181 TVKADLEELD----ASLESVTEDSKVKNVYISHESIGYLANRYGFTQHGVSGMNNE-EPT 235
L LD ++ + K+ + S Y + Y + +N E E T
Sbjct: 176 AYVEKLSALDKEAKEKFNNIPGEKKM--IVTSEGCFKYFSKAYNVPSAYIWEINTEEEGT 233

Query: 236 QKEVIDMVEGLKADGSKYILTEQNISNKVTDIIKDAGGVEQLGFHNLSVLMDEDNPDTDY 295
++ +VE L+ + E ++ ++ + + + ++ Y
Sbjct: 234 PDQIKTLVEKLRKTKVPSLFVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSY 293

Query: 296 QTLMRHNIEVLDRALN 311
++M++N+E + L+
Sbjct: 294 YSMMKYNLEKIAEGLS 309


11 | AAT16_05290 | AAT16_05320 | Y | N | N | Genomic Island
LocusTag | DNBias | CDNBias | %GCBias | Product
AAT16_05290 | 2 | 20 | 1.027132 | heme A synthase
AAT16_05295 | 2 | 20 | 0.868588 | protoheme IX farnesyltransferase
AAT16_05300 | 3 | 24 | 0.435443 | cytochrome B
AAT16_05305 | 3 | 25 | 0.538748 | quinol oxidase subunit 1
AAT16_05310 | 5 | 23 | -0.725115 | cytochrome B oxidoreductase
AAT16_05315 | 2 | 18 | -0.449235 | cytochrome B6
AAT16_05320 | 2 | 18 | -0.876474 | cytochrome C oxidase assembly protein
12 | AAT16_05635 | AAT16_05720 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
AAT16_05635 | -2 | 17 | 3.054440 | membrane protein
AAT16_05640 | -2 | 17 | 2.934383 | 50S ribosomal protein L28
AAT16_05645 | -1 | 16 | 3.384765 | hypothetical protein
AAT16_05650 | -1 | 16 | 3.396884 | hypothetical protein
AAT16_05655 | -2 | 18 | 4.304617 | serine dehydratase
AAT16_05660 | -1 | 17 | 4.661791 | serine dehydratase
AAT16_05665 | -1 | 18 | 3.896314 | ATP-dependent DNA helicase
AAT16_05670 | -1 | 18 | 3.662549 | hypothetical protein
AAT16_05675 | -2 | 16 | 2.783675 | fatty acid biosynthesis transcriptional
AAT16_05680 | 1 | 17 | 3.176041 | phosphate acyltransferase
AAT16_05685 | 2 | 16 | 2.821695 | hypothetical protein
AAT16_05690 | 2 | 16 | 1.954956 | 3-ketoacyl-ACP reductase
AAT16_05695 | 3 | 17 | 2.313178 | acyl carrier protein
AAT16_05700 | 2 | 17 | 2.493935 | ribonuclease III
AAT16_05705 | 3 | 18 | 2.526281 | hypothetical protein
AAT16_05710 | -1 | 18 | 1.815595 | cell division protein FtsY
AAT16_05715 | -1 | 19 | 2.587611 | DNA-binding protein
AAT16_05720 | -1 | 21 | 3.337650 | signal recognition particle
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_05660 | PF06438 | 28 | 0.029 | Heme acquisition protein HasAp
		>PF06438#Heme acquisition protein HasAp

Length = 205

Score = 28.4 bits (63), Expect = 0.029
Identities = 22/107 (20%), Positives = 39/107 (36%), Gaps = 12/107 (11%)

Query: 80 SGKGLSGDLTLYAIAHAVSTNEVNAAMGKICATPTAGSAGVVPGVLFAMKEKHDVSREDM 139
+G SG L + + S +++ + G G V V++ + + +
Sbjct: 99 TGGASSGGYALDSQEVSFSNLGLDSPIA-------QGRDGTVHKVVYGLMSGDSSALQGQ 151

Query: 140 IKFLFTSGAFGFVVANNASISGAAGGCQAEVGSASAMAAAALVEMAG 186
I L + + + AAG V A+ AAAA V + G
Sbjct: 152 IDALLKAVDPSLSINSTFDQLAAAG-----VAHATPAAAAAEVGVVG 193


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_05690 | DHBDHDRGNASE | 142 | 9e-44 | 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 142 bits (359), Expect = 9e-44
Identities = 82/252 (32%), Positives = 127/252 (50%), Gaps = 10/252 (3%)

Query: 3 KIALVTGASRGIGKSIALSLGKEYTVIVNYSGSREKAEGVADEINSEGGTAEAYQCHVQN 62
KIA +TGA++GIG+++A +L + I + EK E V + +E AEA+ V++
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 63 YDDVKAMIKYITDTYGSIDLVVNNAGVTKDNLLMRMKEDEWNQVIDVNLKGAFNVIQSVS 122
+ + I G ID++VN AGV + L+ + ++EW VN G FN +SVS
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 123 RPMIRQKGGRIINISSIVGSLGNPGQTNYVASKAGIDGITKSVARELAPKGITVNAVAPG 182
+ M+ ++ G I+ + S + Y +SKA TK + ELA I N V+PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 183 FIESDMTDVL--SDDIKEQMLG--------QIPLNHFGTVDDISETVKFLASGSAKYITG 232
E+DM L ++ EQ++ IPL DI++ V FL SG A +IT
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248

Query: 233 QTIHVNGGMYMG 244
+ V+GG +G
Sbjct: 249 HNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_05695 | ACRIFLAVINRP | 25 | 0.038 | Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 24.8 bits (54), Expect = 0.038
Identities = 9/42 (21%), Positives = 18/42 (42%), Gaps = 2/42 (4%)

Query: 33 GADSLDIAELVMELEDEFEMEIPDEEAEKINTVGDALNYIDK 74
GA++LD A+ + E + P + K+ D ++
Sbjct: 296 GANALDTAKAIKAKLAELQPFFP--QGMKVLYPYDTTPFVQL 335


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_05705 | GPOSANCHOR | 54 | 2e-09 | Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 53.9 bits (129), Expect = 2e-09
Identities = 56/381 (14%), Positives = 121/381 (31%), Gaps = 32/381 (8%)

Query: 148 DEVLKARPEQRRNLIEETAGVMKYKLRKKESEKRLEDTAQNLSRVNDIIQELESRVNKLE 207
+++ + + E K L + L ND + E S +
Sbjct: 42 AVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKL 101

Query: 208 RESANAKEYLALKEEISRSDIEVTAYDINALMTILRTEEEAYEEIEKKAEDCRAKLQQME 267
R++ + A K + + + M + + +E + A+ +E
Sbjct: 102 RKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLE 161

Query: 268 QKMSELGSARDRHDSKNRELNSRLV-------ELSRRLENTGGRIELYKERKNNKGQLVE 320
+ + + +K + L + EL + LE +
Sbjct: 162 KALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKA 221

Query: 321 ELKVRLSEQQARKETLAAKADEVDRTAASLNETALMLKKSLSDTDEQKKYLTKDRGDEIE 380
L R ++ + E + +L L+ ++ ++ + +
Sbjct: 222 ALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSA 281

Query: 381 KLKDSYYDLMVEKTTLENDQRREESEKSRLDGSLRQKEERLAALRNDYDTEKSEH----- 435
K+K L EK LE ++ E + L+ + + L A R ++EH
Sbjct: 282 KIKT----LEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEE 337

Query: 436 ----------------DALVDKKEKTKSELAHAREKYLDEKRNLAELNQKYDAEREKLHK 479
DA + K++ ++E E+ + + L + DA RE +
Sbjct: 338 QNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQ 397

Query: 480 ANRFIEQQSSKLEMLKNMQNE 500
+ +E+ +SKL L+ + E
Sbjct: 398 VEKALEEANSKLAALEKLNKE 418



Score = 44.3 bits (104), Expect = 3e-06
Identities = 38/210 (18%), Positives = 72/210 (34%), Gaps = 10/210 (4%)

Query: 674 ESQRDMAETEEKLAEYKNKLEHMKDTVKKLGGEVAGQMEQLSKLESTGETLAEKHEQTES 733
+ K+ + + + L + G M + + +TL + E+
Sbjct: 131 GAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 190

Query: 734 AADRLGYQLEAKAETMAVLEEELKSLGHVNEE-----RDFEGLIKEAEDKLQKLDEHIRM 788
L LE ++K+L D E ++ A + I+
Sbjct: 191 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 250

Query: 789 MSASDKDKKQKLALLTDEAHEIEREYTAVRERISHNSAEKERLGSELHDVQEAIEETEAQ 848
+ A + + A L TA +I AEK L +E D++ + A
Sbjct: 251 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNAN 310

Query: 849 QRLVAEDLGGMDLDALEKEEQELTAEVEKL 878
++ + DLDA + +++L AE +KL
Sbjct: 311 RQSLRR-----DLDASREAKKQLEAEHQKL 335



Score = 39.7 bits (92), Expect = 7e-05
Identities = 50/248 (20%), Positives = 94/248 (37%), Gaps = 13/248 (5%)

Query: 669 KNSIIESQRDMAETEEKLAEYKNKLEHMKDTVKKLGGEVAGQMEQLSKLESTGETLAEKH 728
+ A+ E+ L N +K L E A + ++LE E
Sbjct: 217 EAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFS 276

Query: 729 EQTESAADRLGYQLEAKAETMAVLEEELKSL-----GHVNEERDFEGLIKEAEDKLQKLD 783
+ L + A A LE + + L + K+ E + QKL+
Sbjct: 277 TADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLE 336

Query: 784 EHIRMMSASDKDKKQKLALLTDEAHEIEREYTAVRERISHNSAEKERLGSELHDVQEAIE 843
E ++ AS + ++ L + ++E E+ + E+ + A ++ L +L +EA +
Sbjct: 337 EQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKK 396

Query: 844 ETEAQQRLVAEDLGGMDLDALEKEEQELTAEVEKLYAETDEVSGLQHEIRKEYNSIVEER 903
+ E + L ALEK +EL E E + LQ ++ E ++ E+
Sbjct: 397 QVEKAL-----EEANSKLAALEKLNKELE---ESKKLTEKEKAELQAKLEAEAKALKEKL 448

Query: 904 DRTSKELE 911
+ ++EL
Sbjct: 449 AKQAEELA 456



Score = 32.7 bits (74), Expect = 0.009
Identities = 28/248 (11%), Positives = 75/248 (30%), Gaps = 5/248 (2%)

Query: 764 EERDFEGLIKEAEDKLQKLDEHIRMMSASDKDKKQKLALLTDEAHEIEREYTAVRERISH 823
+ D K +D +L E + + + L+ + E+E + + +
Sbjct: 72 KNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEG 131

Query: 824 NSAEKERLGSELHDVQEAIEETEAQQRLVAEDLGGMDLDALEKEEQELTAEVEKLYAETD 883
+++ ++ A++ + + L + +A+++ L AE
Sbjct: 132 AMNFSTADSAKIKTLEAEKAALAARKADLEKAL-----EGAMNFSTADSAKIKTLEAEKA 186

Query: 884 EVSGLQHEIRKEYNSIVEERDRTSKELEECQETLRNHTGKKEKLDVKIEQKIEYLSENYK 943
+ Q E+ K + S +++ + +K L+ +E + + + +
Sbjct: 187 ALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSA 246

Query: 944 MTYEKAREEYDDFSDIDQKRMKISLNKKSIEELGPVNLGAIEEFDRVNERYQFLKSQEAD 1003
E+ + + + E + L+ Q
Sbjct: 247 KIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQV 306

Query: 1004 LLEARSTL 1011
L R +L
Sbjct: 307 LNANRQSL 314


13 | AAT16_05795 | AAT16_05820 | Y | N | N | Genomic Island
LocusTag | DNBias | CDNBias | %GCBias | Product
AAT16_05795 | 2 | 13 | 0.941073 | DNA topoisomerase I
AAT16_05800 | 4 | 18 | 0.773571 | recombinase XerC
AAT16_05805 | 3 | 21 | 1.107516 | ATP-dependent protease subunit HslV
AAT16_05810 | 4 | 22 | 0.646020 | ATP-dependent protease
AAT16_05815 | 3 | 21 | -0.309179 | transcriptional repressor CodY
AAT16_05820 | 2 | 17 | 0.112681 | 30S ribosomal protein S2
14 | AAT16_05985 | AAT16_06010 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
AAT16_05985 | 0 | 14 | 3.214070 | 2-oxoglutarate ferredoxin oxidoreductase subunit
AAT16_05990 | -1 | 16 | 3.370295 | 2-oxoacid ferredoxin oxidoreductase
AAT16_05995 | -1 | 18 | 3.728088 | ABC transporter ATP-binding protein
AAT16_06000 | -2 | 20 | 4.669411 | multidrug ABC transporter ATP-binding protein
AAT16_06005 | -2 | 19 | 3.746239 | thiamine ABC transporter permease
AAT16_06010 | -2 | 20 | 3.362880 | ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_06000 | TCRTETB | 31 | 0.017 | Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 30.6 bits (69), Expect = 0.017
Identities = 42/173 (24%), Positives = 75/173 (43%), Gaps = 23/173 (13%)

Query: 58 LVTLTSLLSITAPFLVGYIVDNYFVQQRFDGLFRILMILLATYVLLSATQYIAAFLMV-- 115
L+ + + IT PFL+ + ++ FD ILM + + +L T Y +FL+V
Sbjct: 171 LLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSV 230

Query: 116 ---GLSQRTVYKLRD-----RLFSHMQKLPIRFFDKRQHGEL---MSRMTNDIETISQTL 164
+ + + K+ D L ++ + G + +S + ++ + Q L
Sbjct: 231 LSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQ-L 289

Query: 165 NTSFIQFTTSVVTLIGTVSVMIY------LSPLLTLLTVTIIPVLILAVGFIT 211
+T+ I SV+ GT+SV+I+ L L V I V L+V F+T
Sbjct: 290 STAEI---GSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLT 339


15 | AAT16_06335 | AAT16_06375 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
AAT16_06335 | 2 | 15 | 1.787182 | hypothetical protein
AAT16_06340 | 2 | 15 | 1.621966 | hypothetical protein
AAT16_06345 | 3 | 16 | 3.374913 | 50S ribosomal protein L21
AAT16_06350 | 3 | 16 | 3.834624 | hypothetical protein
AAT16_06355 | 3 | 16 | 3.883165 | 50S ribosomal protein L27
AAT16_06360 | 2 | 15 | 3.834053 | GTPase ObgE
AAT16_06365 | 0 | 18 | 3.945273 | ATP-dependent DNA helicase RuvA
AAT16_06370 | 0 | 16 | 3.761945 | ATP-dependent DNA helicase RuvB
AAT16_06375 | 0 | 15 | 3.132528 | S-adenosylmethionine tRNA ribosyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_06365 | MICOLLPTASE | 27 | 0.048 | Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 27.4 bits (60), Expect = 0.048
Identities = 13/45 (28%), Positives = 21/45 (46%), Gaps = 1/45 (2%)

Query: 17 ITLDTGGIGHLINVPNPFRFEAALDSEVTIFTELIVREDSHTLYG 61
+ D GGI ++ N+ F +E + + EL E +H L G
Sbjct: 467 FSTDNGGI-YIENIGTFFTYERTPEESIYTLEELFRHEFTHYLQG 510


16 | AAT16_06420 | AAT16_06575 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
AAT16_06420 | -3 | 15 | -3.994995 | histidyl-tRNA synthetase
AAT16_06425 | -3 | 20 | -5.567914 | aspartate--tRNA ligase
AAT16_06430 | 1 | 29 | -9.147274 | hypothetical protein
AAT16_06435 | 0 | 27 | -9.276957 | phosphodiesterase
AAT16_06440 | 2 | 27 | -9.668009 | membrane protein
AAT16_06445 | 3 | 29 | -9.386560 | hypothetical protein
AAT16_06450 | 3 | 29 | -8.758808 | hypothetical protein
AAT16_06455 | 3 | 29 | -8.423003 | hypothetical protein
AAT16_06460 | 1 | 29 | -7.938013 | hypothetical protein
AAT16_06465 | 2 | 30 | -7.411241 | peptidase S66
AAT16_06470 | 4 | 33 | -7.835831 | hypothetical protein
AAT16_06475 | 2 | 31 | -8.004399 | aminoglycoside adenylyltransferase
AAT16_06480 | 0 | 31 | -6.979999 | hypothetical protein
AAT16_06485 | 0 | 31 | -7.204145 | acetyltransferase
AAT16_06490 | -1 | 21 | -4.902113 | SAM-dependent methyltransferase
AAT16_06495 | -2 | 19 | -3.238037 | hypothetical protein
AAT16_06500 | -1 | 19 | -3.181077 | membrane protein
AAT16_06505 | -1 | 18 | -2.406551 | alcohol dehydrogenase
AAT16_06510 | -1 | 18 | -2.715976 | hypothetical protein
AAT16_06515 | -1 | 14 | -0.500687 | glycine/betaine ABC transporter permease
AAT16_06520 | -1 | 15 | -0.702285 | glycine/betaine ABC transporter ATP-binding
AAT16_06525 | 0 | 14 | 0.718329 | 6-phospho 3-hexuloisomerase
AAT16_06530 | 1 | 15 | 1.459029 | 3-hexulose-6-phosphate synthase
AAT16_06535 | 1 | 17 | 2.010607 | transcriptional regulator
AAT16_06540 | 1 | 17 | 3.061471 | recombinase RarA
AAT16_06545 | 1 | 18 | 3.505073 | Rrf2 family transcriptional regulator
AAT16_06550 | 1 | 19 | 3.982463 | ABC transporter
AAT16_06555 | 1 | 17 | 4.025760 | aspartate kinase
AAT16_06560 | 0 | 16 | 4.234861 | homoserine dehydrogenase
AAT16_06565 | 0 | 18 | 4.324455 | threonine synthase
AAT16_06570 | 0 | 20 | 4.081422 | hypothetical protein
AAT16_06575 | 0 | 18 | 3.143001 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_06430 | UREASE | 31 | 0.008 | Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 31.2 bits (71), Expect = 0.008
Identities = 17/42 (40%), Positives = 24/42 (57%), Gaps = 5/42 (11%)

Query: 302 NNPVQRYSAEDIFKMATINGARAYNLQETMGKIKEGYKADLV 343
N V+RY A+ TIN A A+ L +G ++ G +ADLV
Sbjct: 399 NFRVKRYIAK-----YTINPAIAHGLSHEIGSLEVGKRADLV 435


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_06455 | HTHFIS | 82 | 3e-20 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.8 bits (202), Expect = 3e-20
Identities = 28/121 (23%), Positives = 58/121 (47%), Gaps = 4/121 (3%)

Query: 1 MNGYNILIVEDEVSVSKGLKKVLEGEGANVSVNETGEGVVEQLADAH--LILMDIMLPFD 58
M G IL+ +D+ ++ L + L G +V + + +A L++ D+++P +
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 59 DGLSISKEIL-HRVDIPIIFLTAMNDIDSKLDGLKSGE-DYITKPFHPLELISRLNNVIS 116
+ + I R D+P++ ++A N + + + G DY+ KPF ELI + ++
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 117 R 117

Sbjct: 121 E 121


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_06485 | SACTRNSFRASE | 30 | 0.002 | Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 30.3 bits (68), Expect = 0.002
Identities = 14/58 (24%), Positives = 23/58 (39%)

Query: 72 YGRFVWVCDLVTDTNKRSKGYGEKLLGFVHDWAAEKGYESVALSSGLQRTEAHRFYEN 129
+ + + D+ + R KG G LL +WA E + + L + A FY
Sbjct: 86 WNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAK 143


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_06510 | SACTRNSFRASE | 33 | 2e-04 | Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 33.4 bits (76), Expect = 2e-04
Identities = 15/84 (17%), Positives = 36/84 (42%), Gaps = 2/84 (2%)

Query: 54 INLQNKEVFGIYNQEELIGFLDLLFHYPDDSTCMIGYLVIDQRYRKQGLGQKIYNEVVTY 113
+ + K F Y + IG + + ++ + I + + + YRK+G+G + ++ + +
Sbjct: 60 VEEEGKAAFLYYLENNCIGRIKIRSNWNGYAL--IEDIAVAKDYRKKGVGTALLHKAIEW 117

Query: 114 LSKRDISKVRLGVIKDNIPAVKMW 137
+ + L NI A +
Sbjct: 118 AKENHFCGLMLETQDINISACHFY 141


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_06540 | PF05272 | 35 | 7e-04 | Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 34.7 bits (79), Expect = 7e-04
Identities = 14/66 (21%), Positives = 29/66 (43%), Gaps = 7/66 (10%)

Query: 43 MILYGPPGIGKTSIASAIAGSTSYKFRTLNAVTNTKKDMQIVADEGKMSGSVILLLDEIH 102
++L G GIGK+++ + + G + T + K + +++G V L E+
Sbjct: 599 VVLEGTGGIGKSTLINTLVG-LDFFSDTHFDIGTGKDSYE------QIAGIVAYELSEMT 651

Query: 103 RLDKAK 108
+A
Sbjct: 652 AFRRAD 657


17 | AAT16_06655 | AAT16_06680 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
AAT16_06655 | 2 | 16 | -3.925652 | hypothetical protein
AAT16_06660 | 1 | 18 | -4.194617 | RNA-binding protein
AAT16_06665 | 2 | 17 | -3.628545 | hypothetical protein
AAT16_06670 | 1 | 17 | -5.006188 | HAD family hydrolase
AAT16_06675 | 0 | 16 | -4.698517 | ribosomal silencing factor RsfS
AAT16_06680 | -1 | 15 | -4.607932 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_06655 | PF03309 | 29 | 0.016 | Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 29.0 bits (65), Expect = 0.016
Identities = 15/64 (23%), Positives = 24/64 (37%), Gaps = 3/64 (4%)

Query: 74 EIDEGARMIGAVNTV-AVKDGVFKGYNTDISGYMNAFTARF-GEQKRKVLIIGAGGAAKA 131
E+ +IG NTV ++ G G+ + G +N G V ++ G A
Sbjct: 174 ELTRPRSVIGK-NTVECMQAGAVFGFAGLVDGLVNRIRDDVDGFSGADVAVVATGHTAPL 232

Query: 132 VQRA 135
V
Sbjct: 233 VLPD 236


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_06665 | LPSBIOSNTHSS | 42 | 2e-07 | Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.
Length = 166

Score = 42.1 bits (99), Expect = 2e-07
Identities = 17/72 (23%), Positives = 33/72 (45%), Gaps = 4/72 (5%)

Query: 3 IGVFGGTFDPVHIGHIHAVAEAKIALNLDKVIIIPARQSPLKSSSPTKDKHRLNMLHHAV 62
++ G+FDP+ GH+ + D+V + R +P K + + RL + A+
Sbjct: 2 NAIYPGSFDPITFGHLDIIERG--CRLFDQVYVAVLR-NPNKQPMFSVQE-RLEQIAKAI 57

Query: 63 EGYGFIEIDTFE 74
++D+FE
Sbjct: 58 AHLPNAQVDSFE 69


18 | AAT16_06800 | AAT16_06830 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
AAT16_06800 | 2 | 15 | -2.897701 | carboxypeptidase
AAT16_06805 | 2 | 16 | -4.884380 | hypothetical protein
AAT16_06810 | 2 | 16 | -3.667660 | acetyltransferase
AAT16_06815 | 2 | 16 | -3.479833 | DNA topoisomerase III
AAT16_06825 | 3 | 23 | -4.439027 | hypothetical protein
AAT16_06830 | 2 | 18 | -2.535356 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_06810 | SACTRNSFRASE | 29 | 0.002 | Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.8 bits (64), Expect = 0.002
Identities = 9/30 (30%), Positives = 17/30 (56%)

Query: 26 VIEHTEVQDSLKGQGAGSQLVDTMVEFAKQ 55
+IE V + +G G+ L+ +E+AK+
Sbjct: 91 LIEDIAVAKDYRKKGVGTALLHKAIEWAKE 120


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_06815 | PRPHPHLPASEC | 38 | 2e-04 | Prokaryotic zinc-dependent phospholipase C signature.
		>PRPHPHLPASEC#Prokaryotic zinc-dependent phospholipase C signature.

Length = 398

Score = 37.7 bits (87), Expect = 2e-04
Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 6/74 (8%)

Query: 214 YYGIETVTDSVKFTWQDDKGSNRSFDKDKIDSIISKIGNEDLKITDIQ----KKAKKTFA 269
Y+GI+T D W+ D N F D+ K+ +E+LKI DIQ +K K T
Sbjct: 305 YFGIKT-KDGKTQEWEMDNPGN-DFMTGSKDTYTFKLKDENLKIDDIQNMWIRKRKYTAF 362

Query: 270 PALYDLTELQRDAN 283
P Y ++ AN
Sbjct: 363 PDAYKPENIKIIAN 376


19 | AAT16_07670 | AAT16_07720 | Y | Y | N | Pathogenicity Island (biased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
AAT16_07670 | 0 | 16 | -3.073226 | phosphatidic acid phosphatase
AAT16_07675 | 1 | 15 | -2.863145 | response regulator ArlR
AAT16_07680 | 1 | 15 | -4.113109 | histidine kinase
AAT16_07685 | 0 | 19 | -4.080558 | 2-oxoglutarate dehydrogenase
AAT16_07690 | 1 | 19 | -5.202814 | dihydrolipoamide succinyltransferase
AAT16_07695 | 1 | 18 | -5.974368 | hypothetical protein
AAT16_07700 | 1 | 18 | -5.778297 | hypothetical protein
AAT16_07705 | 1 | 18 | -6.377015 | hypothetical protein
AAT16_07710 | 2 | 21 | -5.303675 | branched-chain amino acid ABC transporter
AAT16_07715 | 3 | 17 | -4.384132 | tellurite resistance protein TelA
AAT16_07720 | 0 | 18 | -3.750805 | hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_07675 | HTHFIS | 88 | 2e-22 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.3 bits (219), Expect = 2e-22
Identities = 34/139 (24%), Positives = 69/139 (49%), Gaps = 6/139 (4%)

Query: 3 HILVVEDEINLARFIELELVHEGYTVTLSDNGTDGLEKALDNEYECILLDLMLPELNGLE 62
ILV +D+ + + L GY V ++ N + + ++ D+++P+ N +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 63 VCRRIRKV-KDVPIVIITAKGETYDKVVGLDYGADDYIVKPFEIEELLARIRVIM----- 116
+ RI+K D+P+++++A+ + + GA DY+ KPF++ EL+ I +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 117 RRSANSEEKQEILELYGIS 135
R S ++ Q+ + L G S
Sbjct: 125 RPSKLEDDSQDGMPLVGRS 143


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
AAT16_07690 | IGASERPTASE | 40 | 2e-05 | IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.0 bits (93), Expect = 2e-05
Identities = 25/105 (23%), Positives = 42/105 (40%), Gaps = 2/105 (1%)

Query: 94 SEESKQEENKAEESKSEKKSAEDKKEEPASSEESESGDNDERIVATPSARRLAREKGIDL 153
+ S+ E AE SK E K+ E K E+ A+ +++ + + + A E
Sbjct: 1031 ATPSETTETVAENSKQESKTVE-KNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG 1089

Query: 154 SEINASDPRGLVRSQDVDNHSKQPAKAETPKQEAPKSKSSDKPEK 198
SE + + V+ K + E QE PK S P++
Sbjct: 1090 SETKETQTTETKETATVEKEEKAKVETEK-TQEVPKVTSQVSPKQ 1133



Score = 32.7 bits (74), Expect = 0.004
Identities = 42/227 (18%), Positives = 77/227 (33%), Gaps = 28/227 (12%)

Query: 11 ESITEGTIASWLKQKGDSVEKGENILELETDKVNVEVISEEA-GVITELKAEEGDTVEVG 69
S T T+A KQ+ +VEK E ET N EV E V + E
Sbjct: 1033 PSETTETVAENSKQESKTVEKNEQDAT-ETTAQNREVAKEAKSNVKANTQTNE------- 1084

Query: 70 QVIAIVDENGEGGGSSDSSSGENKSEESKQEENKAEESKSE---KKSAEDKKEEPASSEE 126
V ++G + ++ + + K+E+ K E K++ K +++ ++ S
Sbjct: 1085 -----VAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETV 1139

Query: 127 SESGDNDERIVATPSARRLAREKG-----IDLSEINASDPRGLVRSQDVDNHSKQPAKAE 181
+ T + + + ++ +S+ V N + E
Sbjct: 1140 QPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVN--TGNSVVE 1197

Query: 182 TPKQEAPKSK----SSDKPEKPVVREKMSRRRKTIAKKLLEVSQNTA 224
P+ P + +S+ KP R + S R + S N
Sbjct: 1198 NPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDR 1244


20  AAT16_07965  AAT16_08115  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
AAT16_07965  2  15  -0.830263  porphobilinogen deaminase
AAT16_07970  1  15  -1.464339  hypothetical protein
AAT16_07975  2  15  -0.756638  glutamyl-tRNA reductase
AAT16_07980  1  14  -1.225701  GTP-binding protein
AAT16_07985  1  15  -1.403130  ATP-dependent protease
AAT16_07990  0  17  -3.787900  trigger factor
AAT16_07995  0  24  -7.285625  hypothetical protein
AAT16_08000  2  32  -10.309867  DNA mismatch repair protein MutT
AAT16_08005  4  40  -13.292446  hypothetical protein
AAT16_08015  5  51  -16.372790  hypothetical protein
AAT16_08025  4  35  -11.227178  hypothetical protein
AAT16_08030  3  35  -10.525838  hypothetical protein
AAT16_08035  3  30  -7.941273  hypothetical protein
AAT16_08040  0  22  1.599009  hypothetical protein
AAT16_08045  0  28  4.944034  hypothetical protein
AAT16_08050  2  35  6.575868  transposase
AAT16_08055  4  38  7.203775  pilus assembly protein HicB
AAT16_08060  2  37  6.746923  hypothetical protein
AAT16_08065  -1  26  3.433742  carbon starvation protein CstA
AAT16_08070  0  23  -0.755155  hypothetical protein
AAT16_08075  0  23  -0.543145  arsenic ABC transporter ATPase
AAT16_08080  1  24  -2.965217  hypothetical protein
AAT16_08085  2  27  -5.600084  hypothetical protein
AAT16_08090  2  24  -4.999845  DEAD/DEAH box helicase
AAT16_08100  2  24  -5.497632  type I restriction-modification protein subunit
AAT16_08105  2  22  -4.658160  hypothetical protein
AAT16_08110  1  21  -6.034562  hypothetical protein
AAT16_08115  0  19  -5.446603  hypothetical protein
21  AAT16_08475  AAT16_08500  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AAT16_08475  0  20  -3.207900  30S ribosomal protein S4
AAT16_08480  -1  19  -3.733375  potassium transporter Trk
AAT16_08485  -1  19  -3.546680  hypothetical protein
AAT16_08490  -1  18  -3.786268  cation transporter
AAT16_08495  -2  17  -3.043362  hypothetical protein
AAT16_08500  -2  17  -3.173350  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_08495  PF00577  28  0.010  Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 28.3 bits (63), Expect = 0.010
Identities = 9/22 (40%), Positives = 13/22 (59%)

Query: 102 SYTADASELEPGSYEVVIHVNG 123
S + EL PG+Y V I++N
Sbjct: 65 SRFENGQELPPGTYRVDIYLNN 86


22  AAT16_08910  AAT16_09430  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AAT16_08910  0  21  -3.143321  1,4-dihydroxy-2-naphthoate
AAT16_08915  1  27  -4.105421  hypothetical protein
AAT16_08920  2  29  -4.332753  hypothetical protein
AAT16_08925  3  31  -5.568408  hypothetical protein
AAT16_08930  3  33  -5.567227  acetyl-CoA acetyltransferase
AAT16_08935  3  32  -6.459642  3-hydroxyacyl-CoA dehydrogenase
AAT16_08940  3  31  -6.635709  glutaryl-CoA dehydrogenase
AAT16_08945  0  20  -4.765894  long-chain fatty acid--CoA ligase
AAT16_08950  0  25  -5.958495  CoA-transferase
AAT16_08965  1  24  -5.444404  hypothetical protein
AAT16_08970  0  24  -5.745028  hypothetical protein
AAT16_08975  1  26  -5.444566  hypothetical protein
AAT16_08980  1  20  -2.418268  hypothetical protein
AAT16_08985  2  20  -2.854454  hypothetical protein
AAT16_08990  1  19  -1.204512  hypothetical protein
AAT16_08995  1  19  -0.337645  hypothetical protein
AAT16_09000  1  19  0.233695  hypothetical protein
AAT16_09005  1  20  0.597960  hypothetical protein
AAT16_09010  0  19  -1.747619  hypothetical protein
AAT16_09015  2  20  -1.194091  hypothetical protein
AAT16_09020  1  19  -1.135898  phage protein
AAT16_09025  1  21  -0.446823  hypothetical protein
AAT16_09030  4  23  0.504923  hypothetical protein
AAT16_09035  5  23  0.404263  hypothetical protein
AAT16_09040  3  20  -0.376613  hypothetical protein
AAT16_09045  1  18  -2.070611  hypothetical protein
AAT16_09050  1  18  -2.234120  hypothetical protein
AAT16_09055  0  19  -1.786037  hypothetical protein
AAT16_09060  -2  20  -3.195820  peptidase
AAT16_09065  -3  21  -3.767690  portal protein
AAT16_09070  -3  19  -4.438021  terminase
AAT16_09080  3  27  -3.631674  hypothetical protein
AAT16_09085  2  26  -2.898428  phage protein
AAT16_09090  2  23  -3.072784  hypothetical protein
AAT16_09095  0  19  -2.198437  hypothetical protein
AAT16_09100  1  17  -1.801242  ArpR
AAT16_09105  0  18  -1.232029  hypothetical protein
AAT16_09110  0  19  -1.797274  hypothetical protein
AAT16_09115  0  19  -1.200725  hypothetical protein
AAT16_09120  1  20  -0.911917  hypothetical protein
AAT16_09125  2  26  -2.375753  hypothetical protein
AAT16_09135  2  29  -2.793651  hypothetical protein
AAT16_09140  4  30  -3.653335  hypothetical protein
AAT16_09145  2  27  -1.392181  hypothetical protein
AAT16_09150  3  24  -1.707142  hypothetical protein
AAT16_09155  1  23  -2.378550  hypothetical protein
AAT16_09160  1  22  -2.291841  hypothetical protein
AAT16_09165  1  23  -2.927647  hypothetical protein
AAT16_09170  1  24  -3.268895  beta-lactamase
AAT16_09175  3  25  -4.028536  recombinase
AAT16_09180  3  32  -6.410369  hypothetical protein
AAT16_09185  4  32  -6.227546  hypothetical protein
AAT16_09195  4  32  -4.852830  hypothetical protein
AAT16_09200  4  30  -2.902200  hypothetical protein
AAT16_09205  2  29  -3.518288  hypothetical protein
AAT16_09210  0  28  -5.044919  hypothetical protein
AAT16_09215  0  24  -3.936287  hypothetical protein
AAT16_09220  0  21  -3.813403  hypothetical protein
AAT16_09225  0  19  -3.827041  hypothetical protein
AAT16_09235  0  21  -5.279536  XRE family transcriptional regulator
AAT16_09240  -1  18  -4.820552  hypothetical protein
AAT16_09245  -1  19  -4.045046  hypothetical protein
AAT16_09290  0  22  -4.289225  ********hypothetical protein
AAT16_09295  -1  21  -3.417319  hypothetical protein
AAT16_09300  -1  25  -4.690065  MerR family transcriptional regulator
AAT16_09305  0  23  -4.248156  hypothetical protein
AAT16_09310  0  24  -4.906026  deacylase
AAT16_09315  0  28  -6.221431  MarR family transcriptional regulator
AAT16_09320  0  29  -7.068822  glyoxalase
AAT16_09325  2  34  -9.024640  spermidine acetyltransferase
AAT16_09330  4  40  -11.782720  hypothetical protein
AAT16_09340  4  42  -11.971810  hypothetical protein
AAT16_09345  5  46  -13.767871  hypothetical protein
AAT16_09350  3  41  -12.104561  hypothetical protein
AAT16_09355  2  33  -10.262340  hypothetical protein
AAT16_09365  2  34  -11.042223  hypothetical protein
AAT16_09370  -1  33  -10.301372  hypothetical protein
AAT16_09380  0  36  -11.248876  hypothetical protein
AAT16_09395  -1  31  -9.061694  hypothetical protein
AAT16_09400  0  28  -8.058153  terminase
AAT16_09410  1  33  -9.673986  hypothetical protein
AAT16_09415  1  30  -9.406612  hypothetical protein
AAT16_09420  0  21  -7.304800  hypothetical protein
AAT16_09430  -1  14  -3.110299  transposase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_09005  56KDTSANTIGN  31  0.026  Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 31.5 bits (71), Expect = 0.026
Identities = 30/122 (24%), Positives = 46/122 (37%), Gaps = 32/122 (26%)

Query: 255 PDTLKIEGLADGLQETLATGKAIGPFAELLERLGVDMDKFNGGLSDAIANGTEQNFVMQT 314
P++ IE + +QE + LE L D F+G +++A N NFVM
Sbjct: 292 PNSASIEQIQSKIQE----------LGDTLEEL---RDSFDGYINNAFVNQIHLNFVMPP 338

Query: 315 LADNGLANVNQKFRENNKELVESRQSQQSFQQAMADLGTTLAPIATRITQGITGIVEKFN 374
A + +Q Q QQA A +A A R+ G I + +
Sbjct: 339 QA-------------------QQQQGQGQQQQAQATAQEAVAAAAVRLLNGSDQIAQLYK 379

Query: 375 NL 376
+L
Sbjct: 380 DL 381


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_09145  PF01540  26  0.019  Adhesin lipoprotein
		>PF01540#Adhesin lipoprotein

Length = 475

Score = 25.9 bits (56), Expect = 0.019
Identities = 11/46 (23%), Positives = 25/46 (54%)

Query: 2 QTLEEKDARIEAQAKKIRELRDEITQLQGEKRDLTDALNLTSREME 47
Q +++ + +I + KI+E E+ +L + + D + LT ++E
Sbjct: 107 QKVDQANKKIADENLKIKEGAKELLKLSEKIQSFADTIALTITKLE 152


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_09325  SACTRNSFRASE  29  0.007  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.8 bits (64), Expect = 0.007
Identities = 12/58 (20%), Positives = 21/58 (36%), Gaps = 1/58 (1%)

Query: 82 LIVKPEFSGNGFAKFAFKEAVKYAFEVLNMHKVYLYVDTENEKAVRIYEKQGFKNEGV 139
+ V ++ G +A+++A E + + L N A Y K F V
Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKEN-HFCGLMLETQDINISACHFYAKHHFIIGAV 151


23  AAT16_09490  AAT16_09560  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AAT16_09490  2  11  -3.926818  multidrug ABC transporter ATP-binding protein
AAT16_09495  3  11  -4.529396  diadenosine tetraphosphate hydrolase
AAT16_09500  3  11  -4.609046  hypothetical protein
AAT16_09505  3  10  -4.764197  hypothetical protein
AAT16_09510  3  10  -4.984336  3'-5' exonuclease
AAT16_09515  1  12  -4.580146  hypothetical protein
AAT16_09520  0  15  -4.082897  DNA repair exonuclease
AAT16_09525  -1  15  -3.159453  hypothetical protein
AAT16_09530  -2  15  -3.196418  hypothetical protein
AAT16_09535  -1  14  -2.539418  Cro/Cl family transcriptional regulator
AAT16_09540  -2  15  -2.373460  sodium:dicarboxylate symporter
AAT16_09545  -1  16  -2.996677  LuxR family transcriptional regulator
AAT16_09550  -1  12  -2.382819  histidine kinase
AAT16_09555  -1  12  -3.174676  disulfide bond formation protein DsbB
AAT16_09560  0  12  -3.088304  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_09495  MICOLLPTASE  29  0.010  Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 28.5 bits (63), Expect = 0.010
Identities = 13/52 (25%), Positives = 23/52 (44%)

Query: 36 KGHTLVIPRKPVENIYDLDEKTGAHIMKVITEVANAIKTAFNPAGLNVVQNN 87
KG + P + ++Y D ++G + + V + N +K A V NN
Sbjct: 947 KGEKTLEPGRYYLSVYTYDNQSGTYTVNVKGNLKNEVKETAKDAIKEVENNN 998


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_09500  FLAGELLIN  27  0.039  Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 26.5 bits (58), Expect = 0.039
Identities = 7/44 (15%), Positives = 18/44 (40%)

Query: 77 DEVKTMISNYKADISPNIERIQKDVENLQNRGEDIQESVGKIQD 120
D + + ++ + R + NL N ++ + +I+D
Sbjct: 425 DSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIED 468


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_09545  HTHFIS  65  2e-14  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.9 bits (158), Expect = 2e-14
Identities = 29/120 (24%), Positives = 49/120 (40%), Gaps = 4/120 (3%)

Query: 3 KIIMTDDHHIVREGMKFLLSTTEDIRVIEDFGTGAETLEFLSENHRDTDLVLLDLVMPEM 62
I++ DD +R + L + V A +++ DLV+ D+VMP+
Sbjct: 5 TILVADDDAAIRTVLNQAL-SRAGYDV-RITSNAATLWRWIAAGD--GDLVVTDVVMPDE 60

Query: 63 DGIEVTRRIKAEYPGIKVLVLSSYTSEEYIRPVFAAQADGYIIKEMAAEELIESIKNVIE 122
+ ++ RIK P + VLV+S+ + A Y+ K ELI I +
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_09550  PF06580  43  1e-06  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 42.9 bits (101), Expect = 1e-06
Identities = 37/225 (16%), Positives = 94/225 (41%), Gaps = 26/225 (11%)

Query: 155 AFQIGSTLKRIELTAQEQENMIIRERQRLARDLHDSVN-QMLFSIGITSHAAKTLKDKEK 213
+ K+ E+ + +M +E Q +A L +N +F+ + + A L+D K
Sbjct: 137 GWHFFKNYKQAEIDQWKMASMA-QEAQLMA--LKAQINPHFMFNA-LNNIRALILEDPTK 192

Query: 214 LSDAFDSIENTSKHAMREMKALIWQLKPIGLEKGIIDAIEKYADLLGLELEVKVTGFYDV 273
+ S+ ++++R A + + E + ++ Y L ++ E ++ +
Sbjct: 193 AREMLTSLSELMRYSLRYSNA---RQVSLADE---LTVVDSYLQLASIQFEDRLQFENQI 246

Query: 274 PDHIEVGLYRV----MQEGLNNVRKHSGSTKAE-----IAILSKSDELNIQIKDDGIGFE 324
+ +V +Q + N KH + + + + + +++++ G
Sbjct: 247 NP--AIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLAL 304

Query: 325 QKEKSGYSYGLGNMKDRVRKLGG---ILEIKSKKGEGTSIKVSVP 366
+ K GL N+++R++ L G +++ K+G+ ++ V +P
Sbjct: 305 KNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348


24  AAT16_09615  AAT16_09820  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AAT16_09615  0  15  -3.269070  iron-sulfur cluster-binding protein
AAT16_09620  -1  16  -4.269633  hypothetical protein
AAT16_09625  1  21  -3.007803  fructokinase
AAT16_09630  2  25  -3.405994  hypothetical protein
AAT16_09635  3  28  -3.834168  hypothetical protein
AAT16_09640  2  27  -4.350412  hypothetical protein
AAT16_09645  3  26  -3.931482  hypothetical protein
AAT16_09650  2  24  -3.839045  peptidase S8
AAT16_09655  0  21  -5.158924  thiosulfate sulfurtransferase
AAT16_09660  1  21  -6.172871  hypothetical protein
AAT16_09665  -1  15  -4.150765  alcohol acetyltransferase
AAT16_09815  -1  13  -3.309240  **************************hypothetical protein
AAT16_09820  -1  12  -3.040574  Fur family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_09615  PF06917  29  0.039  Periplasmic pectate lyase
		>PF06917#Periplasmic pectate lyase

Length = 555

Score = 28.7 bits (64), Expect = 0.039
Identities = 21/82 (25%), Positives = 40/82 (48%), Gaps = 6/82 (7%)

Query: 108 LLNDRLDKLEQFIK-SLDPDIET-KSMVDTGV-LSDREVARRSGLGYIGKNGFMINPNLG 164
+L +D L+ + + + D + T + + + G +S + R GY G G +I+P
Sbjct: 336 VLQWVIDGLKNYYRFAYDVESNTLRPLWNDGQDMSGYVLPRD---GYYGVKGTVISPFPL 392

Query: 165 TYSYLGEMITSYPFPPDEELID 186
YL ++ ++ DEEL+D
Sbjct: 393 DVDYLLPLVRAWRLSEDEELLD 414


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_09620  TYPE4SSCAGA  32  0.008  Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 31.6 bits (71), Expect = 0.008
Identities = 26/114 (22%), Positives = 52/114 (45%), Gaps = 19/114 (16%)

Query: 152 GNYDKYREIKEHEIKRQQDEYKQYTAKRKHLEKAITHKV-NRSSNINRPKNKQDSDFRQT 210
GNYD E+K+ Q + ++ KR+HLEK + K+ ++S N N+ + K ++ ++
Sbjct: 602 GNYD--------EVKKAQKDLEKSLRKREHLEKEVEKKLESKSGNKNKMEAKAQANSQKD 653

Query: 211 GAKPYFNKKKKK----------MEQVASSMKTRLEQLEVKEKPFEEKSIHFNTG 254
NK+ + ++ + + +LE + K F++ F G
Sbjct: 654 EIFALINKEANRDARAIAYAQNLKGIKRELSDKLENVNKNLKDFDKSFDEFKNG 707


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_09650  SUBTILISIN  274  3e-92  Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 274 bits (701), Expect = 3e-92
Identities = 115/295 (38%), Positives = 169/295 (57%), Gaps = 10/295 (3%)

Query: 86 KDIPVYAYQQDVPYGIDKVQAPLAHQNGDKGAGVKLAVIDTGIDADHEDLD---VHGGYS 142
+ I ++P G++ +QAP +G GVK+AV+DTG DADH DL + G
Sbjct: 11 QVIKQEQQVNEIPRGVEMIQAPAVWNQT-RGRGVKVAVLDTGCDADHPDLKARIIGGRNF 69

Query: 143 VFTSGVDADPYYDGSGHGTHVAGTAAALDNNVGVVGVAPEADLYAVKVLNSSGSGSSSGV 202
D + + D +GHGTHVAGT AA +N GVVGVAPEADL +KVLN GSG +
Sbjct: 70 TDDDEGDPEIFKDYNGHGTHVAGTIAATENENGVVGVAPEADLLIIKVLNKQGSGQYDWI 129

Query: 203 VQGVEWAVQNGMDVINMSLGSSAHSQAIQDVVDAAYYEHDILVVAAAGNEGNASGTGDTV 262
+QG+ +A++ +D+I+MSLG + + V A ILV+ AAGNEG+ D +
Sbjct: 130 IQGIYYAIEQKVDIISMSLGGPEDVPELHEAVKKA-VASQILVMCAAGNEGDGDDRTDEL 188

Query: 263 GYPAQYDSAFAVAATDENNQRASFSSTGPAVDISAPGVNILSTVPGNGYSSLNGTSMASP 322
GYP Y+ +V A + + + FS++ VD+ APG +ILSTVPG Y++ +GTSMA+P
Sbjct: 189 GYPGCYNEVISVGAINFDRHASEFSNSNNEVDLVAPGEDILSTVPGGKYATFSGTSMATP 248

Query: 323 HVAGAGAVIRSSFPGTG-----AAEVRSLMQGASKYIGSDTNWYGSGLLQINSAV 372
HVAGA A+I+ + E+ + + + +G+ G+GLL + +
Sbjct: 249 HVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNSPKMEGNGLLYLTAVE 303


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_09815  ACRIFLAVINRP  27  0.030  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 26.7 bits (59), Expect = 0.030
Identities = 12/68 (17%), Positives = 28/68 (41%), Gaps = 6/68 (8%)

Query: 1 MKKKSNKIQRIRGFALALVFIGMGIMYLGVFFREYQIIFSL-FLIVGLLPIL----LSFV 55
+ N+ + + +VF+ + +Y + ++ + IVG+L
Sbjct: 865 ERLSGNQAPALVAISFVVVFLCLAALYES-WSIPVSVMLVVPLGIVGVLLAATLFNQKND 923

Query: 56 IYFWVGMI 63
+YF VG++
Sbjct: 924 VYFMVGLL 931


25  AAT16_09930  AAT16_09980  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AAT16_09930  -1  16  -3.051795  hypothetical protein
AAT16_09935  0  20  -2.423412  DNA polymerase IV
AAT16_09940  0  16  -1.927572  hypothetical protein
AAT16_09945  0  16  -1.988078  hypothetical protein
AAT16_09950  0  17  -1.579994  transcriptional regulator
AAT16_09955  1  14  -2.057691  hypothetical protein
AAT16_09960  2  15  -2.545719  hypothetical protein
AAT16_09965  2  14  -3.039629  oxidoreductase
AAT16_09970  2  17  -4.016723  hypothetical protein
AAT16_09975  1  14  -3.275820  hypothetical protein
AAT16_09980  1  15  -3.296600  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_09930  BONTOXILYSIN  30  0.020  Bontoxilysin signature.
		>BONTOXILYSIN#Bontoxilysin signature.

Length = 1196

Score = 29.9 bits (67), Expect = 0.020
Identities = 22/157 (14%), Positives = 52/157 (33%), Gaps = 32/157 (20%)

Query: 186 KRSEKEYGSIQKQITESEKLLKYQQDELKYHRSNREEMRLMNRLEREIQFKKLYFIHVGN 245
+ SI Q + +++++ + +L + ++L+
Sbjct: 682 ELICMAKQSILAQESLVKQIVQNKFTDLSKASIPPDTLKLIRETT--------------- 726

Query: 246 LIYLPENIAMDFSDVEKEAMNEIARYLNTDFSSLKMTNPSIRSLRRQIKGMDEEKEAFKI 305
E +D S+ + +MN + +LN + + + + I M++ I
Sbjct: 727 -----EKTFIDLSNESQISMNRVDNFLNKASICVFVED----IYPKFISYMEKYINNINI 777

Query: 306 HVLYEII----II----YQMIVQHEKTTGEVKKVDIK 334
I I +I + T + K +DI+
Sbjct: 778 KTREFIQRCTNINDNEKSILINSYTFKTIDFKFLDIQ 814


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_09950  HTHFIS  86  9e-22  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.4 bits (214), Expect = 9e-22
Identities = 34/130 (26%), Positives = 65/130 (50%), Gaps = 1/130 (0%)

Query: 2 TRVLIIEDNASIAEIERDYLEVNDIGSDIVLNGRDGLRMVYTGGYDLIVLDIMLPDIDGF 61
+L+ +D+A+I + L I N R + G DL+V D+++PD + F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EILKKIRDE-VDVPILMVTAKVSDIDIVRGLNLGADDYITKPFSPNELVARVKSHVTRYQ 120
++L +I+ D+P+L+++A+ + + ++ GA DY+ KPF EL+ + + +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 121 RIMEKNSSDG 130
R K D
Sbjct: 124 RRPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_09955  PF06580  41  5e-06  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 40.6 bits (95), Expect = 5e-06
Identities = 31/199 (15%), Positives = 76/199 (38%), Gaps = 44/199 (22%)

Query: 175 LRTIAAKTRE----LDKLIDELSLFSNLNMEESPLEKECIELDQFLSHIIDEAKLELE-- 228
L I A E +++ LS ++ S + + L L+ + ++ L+L
Sbjct: 179 LNNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQ--VSLADELTVV--DSYLQLASI 234

Query: 229 --DEKIEWSYEHPTEI-DIVIPADRMKLSRVFTNLLNNSVKY---RCRENHVIDIRLSRT 282
++++++ + I D+ +P + L+ N +K+ + + I ++ ++
Sbjct: 235 QFEDRLQFENQINPAIMDVQVP------PMLVQTLVENGIKHGIAQLPQGGKILLKGTKD 288

Query: 283 GNDAVVDIKDNGRGIDREVLPKIFEPFYREESSRNKKTGGSGLGLSIV-ENIVRSHGGQ- 340
+++++ G + +G GL V E + +G +
Sbjct: 289 NGTVTLEVENTGSLALKNT------------------KESTGTGLQNVRERLQMLYGTEA 330

Query: 341 -IDIKSEQGEWTMATVKLP 358
I + +QG+ A V +P
Sbjct: 331 QIKLSEKQGKVN-AMVLIP 348


26  AAT16_10140  AAT16_10170  Y  N  N  Genomic Island
LocusTag  DNBias  CDNBias  %GCBias  Product
AAT16_10140  3  22  -4.126362  molybdenum ABC transporter ATP-binding protein
AAT16_10145  6  30  -5.961273  cysteine synthase
AAT16_10150  7  37  -9.284349  hypothetical protein
AAT16_10155  8  39  -10.071257  tRNA-dihydrouridine synthase
AAT16_10160  5  36  -10.789820  DNA mismatch repair protein Vsr
AAT16_10165  2  23  -7.295908  hypothetical protein
AAT16_10170  0  14  -3.926145  DNA methyltransferase
27  AAT16_10380  AAT16_10515  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AAT16_10380  0  12  3.628254  PTS glucose transporter subunit IIBC
AAT16_10385  0  15  3.602100  hypothetical protein
AAT16_10390  -2  18  3.312084  hypothetical protein
AAT16_10395  -1  18  3.634221  choline transporter
AAT16_10400  -1  22  4.051442  hypothetical protein
AAT16_10405  -1  21  4.246965  aldehyde dehydrogenase
AAT16_10410  -1  21  4.037065  UDP-glucose 4-epimerase
AAT16_10415  0  21  4.174562  hypothetical protein
AAT16_10420  -1  22  4.168700  hypothetical protein
AAT16_10425  -1  20  3.415642  hypothetical protein
AAT16_10430  -1  20  3.282157  hypothetical protein
AAT16_10435  0  22  3.028383  hypothetical protein
AAT16_10440  1  21  2.635651  hypothetical protein
AAT16_10445  2  21  1.878685  hypothetical protein
AAT16_10450  1  19  1.384269  hypothetical protein
AAT16_10455  1  20  2.005562  multidrug ABC transporter permease
AAT16_10460  1  16  1.357763  sodium ABC transporter ATP-binding protein
AAT16_10465  1  16  1.407191  hypothetical protein
AAT16_10470  -1  14  2.370626  hypothetical protein
AAT16_10475  -1  14  2.869659  glucosamine-6-phosphate deaminase
AAT16_10480  -1  14  3.988429  peptidase C15
AAT16_10485  0  15  4.214278  ABC transporter substrate-binding protein
AAT16_10490  0  16  5.142632  hypothetical protein
AAT16_10495  0  16  5.378143  membrane protein
AAT16_10500  0  18  4.774787  LamB/YcsF family protein
AAT16_10505  1  17  5.082863  acetyl-CoA carboxylase
AAT16_10510  0  16  4.411318  hypothetical protein
AAT16_10515  0  14  4.151067  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_10400  ACRIFLAVINRP  25  0.038  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 24.8 bits (54), Expect = 0.038
Identities = 15/55 (27%), Positives = 29/55 (52%), Gaps = 11/55 (20%)

Query: 14 EYIEPSTLTLIYD-YIAIIGVALMVFLF--------IPIMHMPVMASLLISLALL 59
+++ S ++ + AI+ V L+++LF IP + +PV LL + A+L
Sbjct: 331 PFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPV--VLLGTFAIL 383


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_10455  ACRIFLAVINRP  31  0.012  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.3 bits (71), Expect = 0.012
Identities = 17/60 (28%), Positives = 28/60 (46%), Gaps = 1/60 (1%)

Query: 120 SGESVSRVVNDTAIIKDLITSHFPQLIGGIMSVVGSVVILFVLDWRMSLIMFISVPISIV 179
G V + T ++ I L IM V V+ LF+ + R +LI I+VP+ ++
Sbjct: 319 QGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVF-LVMYLFLQNMRATLIPTIAVPVVLL 377


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_10500  RTXTOXINA  31  0.005  Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 30.7 bits (69), Expect = 0.005
Identities = 15/58 (25%), Positives = 30/58 (51%), Gaps = 8/58 (13%)

Query: 116 LYNATFEDKELAKTIADAVKDYNPKLKLM--------GLSNQNLVKAGEEAGLEVRHE 165
L++A K+ K A+ ++ +L L+ G S +LV+ +E G+EV+++
Sbjct: 24 LHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQGSSLNDLVRTADELGIEVQYD 81


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_10510  RTXTOXIND  29  0.007  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.0 bits (65), Expect = 0.007
Identities = 6/23 (26%), Positives = 14/23 (60%)

Query: 116 GLVEEILAENGDTVEYDQPLITI 138
+V+EI+ + G++V L+ +
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKL 127


28  AAT16_10570  AAT16_10640  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AAT16_10570  2  13  1.213749  hypothetical protein
AAT16_10575  1  13  1.835919  sucrose-6-phosphate hydrolase
AAT16_10580  -1  14  2.373089  PTS sugar transporter subunit IIA
AAT16_10585  -2  14  2.833555  DNA-binding protein
AAT16_10590  -1  16  3.721343  methylase
AAT16_10595  -1  18  4.824097  hypothetical protein
AAT16_10600  1  19  5.080032  fructosamine kinase
AAT16_10605  2  22  4.314299  thioesterase
AAT16_10610  0  17  1.771896  3-hydroxybutyryl-CoA dehydrogenase
AAT16_10615  1  20  -1.198316  NADPH:quinone reductase
AAT16_10620  -1  15  -3.280607  TetR family transcriptional regulator
AAT16_10625  -1  15  -3.911043  universal stress protein
AAT16_10630  -1  17  -3.733115  hypothetical protein
AAT16_10640  -2  17  -3.038409  endonuclease
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_10610  NUCEPIMERASE  28  0.048  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 28.2 bits (63), Expect = 0.048
Identities = 13/31 (41%), Positives = 16/31 (51%), Gaps = 1/31 (3%)

Query: 1 MKFAIVGT-GVIGSGWITRILAHGHDVVATD 30
MK+ + G G IG R+L GH VV D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGID 31


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_10620  HTHTETR  57  1e-12  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 57.3 bits (138), Expect = 1e-12
Identities = 17/84 (20%), Positives = 36/84 (42%)

Query: 7 KMQLLEAAADIVNEHGSDYLTLDAVAERAGVSKGGLIYHFKNKDALIRGLVEHANQLYRD 66
+ +L+ A + ++ G +L +A+ AGV++G + +HFK+K L + E + +
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72

Query: 67 NVDRHIEPEDDSNGRWLRAFIEAT 90
+ LR +
Sbjct: 73 LELEYQAKFPGDPLSVLREILIHV 96


29  AAT16_10745  AAT16_11015  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
AAT16_10745  1  15  -4.682314  hypothetical protein
AAT16_10750  0  15  -4.639497  hypothetical protein
AAT16_10755  0  21  -6.065073  hypothetical protein
AAT16_10760  1  28  -7.616240  hypothetical protein
AAT16_10765  2  37  -9.154217  phosphohydrolase
AAT16_10770  3  43  -10.246033  hypothetical protein
AAT16_10775  3  34  -6.691258  hypothetical protein
AAT16_10780  3  33  -6.394210  dehydrogenase
AAT16_10785  3  30  -5.231761  GNAT family acetyltransferase
AAT16_10790  1  29  -3.634565  hypothetical protein
AAT16_10795  0  29  -3.254591  resolvase
AAT16_10800  0  28  -3.190372  hypothetical protein
AAT16_10805  0  29  -4.608258  universal stress protein
AAT16_10810  -1  30  -4.944534  arsenic resistance operon repressor
AAT16_10815  0  30  -4.845345  arsenic ABC transporter ATPase
AAT16_10820  1  32  -5.873007  dehydrogenase
AAT16_10825  2  38  -10.967106  ArsR family transcriptional regulator
AAT16_10830  2  34  -9.507847  arsenical pump membrane protein
AAT16_10835  0  26  -7.684571  alcohol dehydrogenase
AAT16_10840  0  22  -6.231421  permease
AAT16_10845  -2  21  -6.106691  ArsR family transcriptional regulator
AAT16_10850  -1  21  -5.629174  hypothetical protein
AAT16_10860  0  20  -0.020578  membrane protein
AAT16_10865  -1  17  -0.674682  Zn-dependent hydrolase
AAT16_10870  0  17  -2.880607  hypothetical protein
AAT16_10875  -1  19  -2.829854  cytoplasmic protein
AAT16_10880  -1  17  -2.070091  hypothetical protein
AAT16_10885  -1  15  -1.664541  hypothetical protein
AAT16_10895  -2  14  -0.935455  5'-nucleotidase
AAT16_10900  0  20  -4.566882  hypothetical protein
AAT16_10905  1  23  -5.075229  cytochrome C biogenesis protein CcdA
AAT16_10910  3  30  -7.469268  alkyl hydroperoxide reductase
AAT16_10915  3  33  -8.153851  transcriptional regulator
AAT16_10920  3  37  -9.232388  histidine kinase
AAT16_10925  2  32  -8.406683  hypothetical protein
AAT16_10930  0  26  -6.178799  hypothetical protein
AAT16_10935  -1  22  -3.660286  hypothetical protein
AAT16_10940  -2  20  -2.476113  dihydrofolate reductase
AAT16_10945  -2  22  -3.350742  general stress protein
AAT16_10955  -2  17  -2.187502  hypothetical protein
AAT16_10960  -2  18  -2.703583  hypothetical protein
AAT16_10965  -2  16  -2.238766  hypothetical protein
AAT16_10970  -2  16  -3.097132  hypothetical protein
AAT16_10975  0  12  -1.694216  hypothetical protein
AAT16_10980  1  11  -0.731648  ABC transporter ATPase
AAT16_10985  1  12  -0.638658  nitrilase
AAT16_10990  1  12  -0.100484  hypothetical protein
AAT16_10995  2  11  -0.223692  hypothetical protein
AAT16_11000  -1  12  0.934522  hypothetical protein
AAT16_11005  -2  13  1.653463  hypothetical protein
AAT16_11010  -2  15  3.016829  membrane protein
AAT16_11015  -2  16  3.046217  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_10785  SACTRNSFRASE  29  0.006  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 29.1 bits (65), Expect = 0.006
Identities = 30/112 (26%), Positives = 49/112 (43%), Gaps = 12/112 (10%)

Query: 29 KDYSKEYIEDDVKQMDKNFFIERAKFTNCYVFVNENIGEIIGVGSIGSYWGSETESSLFT 88
K Y K+Y +DD MD ++ E K +++ EN IG I S W + +
Sbjct: 44 KPYFKQYEDDD---MDVSYVEEEGKA--AFLYYLEN--NCIGRIKIRSNWNGY--ALIED 94

Query: 89 IFVSPDYQGMGIGKKIME--TLESDEYFLRSKRVEIPA-SITALTFYQKMGY 137
I V+ DY+ G+G ++ + E +E +I+A FY K +
Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_10795  HTHTETR  28  0.014  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.4 bits (63), Expect = 0.014
Identities = 6/22 (27%), Positives = 12/22 (54%)

Query: 159 TLDDIKAATNISRATLYRHLES 180
+L +I A ++R +Y H +
Sbjct: 33 SLGEIAKAAGVTRGAIYWHFKD 54


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_10800  PREPILNPTASE  30  0.023  Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 29.8 bits (67), Expect = 0.023
Identities = 26/126 (20%), Positives = 49/126 (38%), Gaps = 11/126 (8%)

Query: 11 WLTAPGANIMAGIVVALALIPEAIAFSIIAGVDPMVGLYASFLIAVIISIVGGRPAMISG 70
LT P + G++ L ++ ++I M G + + ++ G+ M
Sbjct: 160 QLTLPL--LWGGLLFNLLGGFVSLGDAVIGA---MAGYLVLWSLYWAFKLLTGKEGM--- 211

Query: 71 ATGAIALLVVPLVSEHGVEYLLAATILMGIIQIIFGVLKVGKLMKFIPNSVMIGFVNALA 130
G LL L + G + L +L ++ G+ + L++ S I F LA
Sbjct: 212 GYGDFKLLAA-LGAWLGWQALPIVLLLSSLVGAFMGIGLI--LLRNHHQSKPIPFGPYLA 268

Query: 131 IMIFMA 136
I ++A
Sbjct: 269 IAGWIA 274


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_10870  PF01206  60  8e-14  SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 59.8 bits (145), Expect = 8e-14
Identities = 19/68 (27%), Positives = 41/68 (60%)

Query: 126 VEASGLQCPGPLLKVNEVMGELEPGQQMEITVTDFGFCTDVEAWARKTGHSILKNEKSED 185
++A+GL CP P+LK + + + G+ + + TD G D E+++++TGH +L+ ++ +
Sbjct: 8 LDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKEEDG 67

Query: 186 KVMVVLQK 193
L++
Sbjct: 68 TYHFRLKR 75


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_10915  HTHFIS  92  9e-24  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 9e-24
Identities = 30/121 (24%), Positives = 57/121 (47%), Gaps = 1/121 (0%)

Query: 2 KQKILVVEDDHMIRNLIKINLENNNYDVVEAADGAEAKNVFLDAHPCLVILDLMLPKVSG 61
ILV +DD IR ++ L YDV ++ A LV+ D+++P +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 EEFFEWVREQERNEVSFIMLSAKSRVSDKVKGLKMGADDYITKPFEPDELVAHVEAVLRR 121
+ +++ R ++ +++SA++ +K + GA DY+ KPF+ EL+ + L
Sbjct: 63 FDLLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 122 T 122

Sbjct: 122 P 122


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_10965  NUCEPIMERASE  33  5e-04  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 33.2 bits (76), Expect = 5e-04
Identities = 23/127 (18%), Positives = 35/127 (27%), Gaps = 18/127 (14%)

Query: 1 MKVGIIGANGNIGLRLGKILSSRGVDTLGF---------VRKEEQAEKLKSIGVNPKTAD 51
MK + GA G IG + K L G +G K+ + E L G D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 52 IIETSTEDYTTLLEGTDVLVFTAGAGGAGV-------ETTRKIDGEGVSKMIEAAEDAGV 104
+ E T L V + G ++E +
Sbjct: 61 L--ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 105 KRFILVS 111
+ + S
Sbjct: 119 QHLLYAS 125


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_10995  SACTRNSFRASE  29  0.006  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.8 bits (64), Expect = 0.006
Identities = 16/65 (24%), Positives = 29/65 (44%)

Query: 47 AWDEARMVGIIRSSGDQNFTQYISDLIVHPEYKTKGLASKLMNTYINEVSEVDEIFLMMD 106
+ E +G I+ + N I D+ V +Y+ KG+ + L++ I E LM++
Sbjct: 70 YYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLE 129

Query: 107 AAPGN 111
N
Sbjct: 130 TQDIN 134


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_11000  TACYTOLYSIN  37  1e-04  Bacterial thiol-activated pore-forming cytolysin sig...
		>TACYTOLYSIN#Bacterial thiol-activated pore-forming cytolysin

signature.
Length = 574

Score = 36.9 bits (85), Expect = 1e-04
Identities = 38/171 (22%), Positives = 65/171 (38%), Gaps = 11/171 (6%)

Query: 4 LLSATLVSALFLAACSDGEENTEESTEEATAEEAATEESSEESTEEESTEEESGEGAESD 63
LL+A L+ + A +D + +TE T E ESSE +TE+ + + +
Sbjct: 19 LLTAALIVGNLVTANADSNKQNTANTETTTTNEQPKPESSELTTEKAGQKMDDMLNSNDM 78

Query: 64 ADAASDEGSSEDLAEAEIDEEDMKSAYDLGEDKADMIDSATETDQSVEDVLQAPSEVTSY 123
A E E + E ED K + + D E + + + EV +
Sbjct: 79 IKLAPKEMPLESAEKEEKKSEDNKKSEE---------DHTEEINDKIYSLNYNELEVLAK 129

Query: 124 QQETAIMIEVTEGEQVLDEAFTGNRAQIDETEGTLEVASDYIDESFNVTYP 174
ET EG + D+ R + + ++++ ID + TYP
Sbjct: 130 NGETIENFVPKEGVKKADKFIVIERKKKNINTTPVDISI--IDSVTDRTYP 178


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_11005  IGASERPTASE  35  4e-04  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 35.0 bits (80), Expect = 4e-04
Identities = 27/118 (22%), Positives = 45/118 (38%), Gaps = 5/118 (4%)

Query: 24 EESTEEETMEESSEEETVEEESAGTTDEESEESSEEGSGNATTVEAEEDSLEESDEAESA 83
T+E E+ E TVE+E E+E++ E + +E S +AE A
Sbjct: 1089 GSETKETQTTETKETATVEKE--EKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA 1146

Query: 84 SSDDRTEDLTEGEMDEVNPDDAYDIDEDKVRMVENATETD---NTVDDVLKAPSEITS 138
+D T ++ E + D ++ VE NT + V++ P T
Sbjct: 1147 RENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTP 1204


30  AAT16_11060  AAT16_11125  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
AAT16_11060216-1.550940hypothetical protein
AAT16_110650130.684220hypothetical protein
AAT16_110701152.325847FMN-dependent NADH-azoreductase
AAT16_110751152.944561hypothetical protein
AAT16_110801174.229128NonF
AAT16_110851164.233015hypothetical protein
AAT16_110900185.335966MFS transporter
AAT16_11095-1175.159998multidrug MFS transporter
AAT16_11100-2154.601624hypothetical protein
AAT16_11105-2154.496517multidrug MFS transporter
AAT16_11110-1143.818970peptidase T
AAT16_11115-1183.900850luciferase
AAT16_11120-2133.315732GNAT family acetyltransferase
AAT16_11125-2143.052969DNA mismatch repair protein MutT
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_11065  AUTOINDCRSYN  29  0.004  Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 29.4 bits (66), Expect = 0.004
Identities = 11/66 (16%), Positives = 22/66 (33%), Gaps = 10/66 (15%)

Query: 5 ENFESLDLHILEEIYRLRVSVFI--------VEQECAYQEIDGKDPVSTHIYKTDDSGIS 56
N L E++ LR F + + D + T+++ D+ +
Sbjct: 7 VNHTLLSETKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNNNT--TYLFGIKDNTVI 64

Query: 57 AYLRIV 62
LR +
Sbjct: 65 CSLRFI 70


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_11090  TCRTETB  136  2e-37  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 136 bits (345), Expect = 2e-37
Identities = 89/402 (22%), Positives = 184/402 (45%), Gaps = 14/402 (3%)

Query: 12 RLIFILLSGAFVALLSNTFLNVALPSIKDDFGITTSTVQWVSTAYMLVSGIVIPTTAFLM 71
+++ L +F ++L+ LNV+LP I +DF ++ WV+TA+ML I L
Sbjct: 14 QILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73

Query: 72 QRFSAKKLFIAAMLLFLTGTLVAGLSPT-FMVLILGRMIQASGSAILMPLLMNVMITSFP 130
+ K+L + +++ G+++ + + F +LI+ R IQ +G+A L+M V+ P
Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIP 133

Query: 131 PAQRGTAMGLFSLVMFFAPAIGPTLSGFIVQHYSWHMLFFMMVPILLIVLSIGWLKLPQT 190
RG A GL ++ +GP + G I + W L ++P++ I+ +KL +
Sbjct: 134 KENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKK 191

Query: 191 DTHYTTKIDIPSVILSTLGFGGILYGFSAAGNSGWMRPDVILTLFIGFTAVFFYIRKQIH 250
+ DI +IL ++G + ++ I L + + +++
Sbjct: 192 EVRIKGHFDIKGIILMSVGIVFFMLFTTSY---------SISFLIVSVLSFLIFVKHIRK 242

Query: 251 MNDPMLNFKVYKFPMFTLASLLIGTMNMALFSGMILMPIYLQDIQGISPLDTG-ILLLPG 309
+ DP ++ + K F + L G + + + ++P ++D+ +S + G +++ PG
Sbjct: 243 VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302

Query: 310 ALIMGFMSPISGKLFDMFGPKVLAITGLTLTVSTTFFFSQLEVDTSYTFLIMLYSIRAFG 369
+ + I G L D GP + G+T +S +F + ++T+ F+ ++ G
Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTF-LSVSFLTASFLLETTSWFMTIIIVFVLGG 361

Query: 370 MTLVMTPVMTNGMNQLTPELTPHGSSINSMLNQVSGAIGTAL 411
++ T + T + L + G S+ + + +S G A+
Sbjct: 362 LSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_11095  TCRTETB  130  3e-35  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 130 bits (328), Expect = 3e-35
Identities = 90/403 (22%), Positives = 183/403 (45%), Gaps = 14/403 (3%)

Query: 27 AFAAILNQTLLATAIPHIMADLELEADVAQWLQSVFMLVNGIMIPVTAFLISKFSTRALF 86
+F ++LN+ +L ++P I D W+ + FML I V L + + L
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 87 FTALSLFGLGTLVCGISPN-FPILMAGRVLQAAGAGIIMPLMQTILFLVYPKSERGKAMG 145
+ + G+++ + + F +L+ R +Q AGA L+ ++ PK RGKA G
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142

Query: 146 MFGLVISFAPAIGPTLSGWFIDIYPWRGLFYMLLPIVIIDLIVAYFILRNVTEQTNPKLD 205
+ G +++ +GP + G W L L+P++ I + L + D
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVRIKGHFD 200

Query: 206 MLSIILSTLGFGGLLYGFSVAGNSGWLSSAVIISLAVGAVALFIFIRRQNSLEQPILEFG 265
+ IIL ++G + +S I L V ++ IF++ + P ++ G
Sbjct: 201 IKGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPG 251

Query: 266 VFKDKIFTLTTALGMIVFMAMIGGAVILPILMQNMLGFSALASG-MMLFPGAVIMGVMSP 324
+ K+ F + G I+F + G ++P +M+++ S G +++FPG + + +
Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311

Query: 325 ITGRLFDRYGARWLAIIGLGIVAVTSLMFTNLDTETTFTYLAVVNAFRMLGVSMVMMPVT 384
I G L DR G ++ IG+ ++V+ L + L ETT ++ ++ F + G+S ++
Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLL-ETTSWFMTIIIVFVLGGLSFTKTVIS 370

Query: 385 TAGLNQMSKKLVPHGTAMNNTMRQIAGAVGTALLVSIMTNTML 427
T + + ++ G ++ N ++ G A++ +++ +L
Sbjct: 371 TIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLL 413


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_11105  TCRTETB  119  2e-31  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 119 bits (299), Expect = 2e-31
Identities = 81/419 (19%), Positives = 189/419 (45%), Gaps = 14/419 (3%)

Query: 11 QTEIKKLPLMLVLLSGAFAAILNQTLLATAIPHIMADLNLEADVAQWLQSVFMLVNGIMI 70
Q+ ++ +++ L +F ++LN+ +L ++P I D N W+ + FML I
Sbjct: 7 QSNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGT 66

Query: 71 PVTAFLIGKFSTRSLFFTALILFGIGTLVCGLAPN-FTILLLGRILQASGAGIIMPLMQT 129
V L + + L +I+ G+++ + + F++L++ R +Q +GA L+
Sbjct: 67 AVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMV 126

Query: 130 ILFLIYPREKRGTAMGFFGLVISFAPAIGPTLSGWFVEIYPWRGLFYIILPIVIIDLIIA 189
++ P+E RG A G G +++ +GP + G W + +++P++ II
Sbjct: 127 VVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMI---TIIT 181

Query: 190 YFVLKNVTEQTNPKVDVFSIILSTLGFGGLLYGFSIAGSSGWLSPTVLISLGVGAITLTL 249
L + ++ F I L G+++ S V + ++ +
Sbjct: 182 VPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSV------LSFLI 235

Query: 250 FIKRQFRLEQPILEFRVFRDPIFTLATIIGMVAFMTMIGGAIILPIFMQNMLGFTAFESG 309
F+K ++ P ++ + ++ F + + G + F T+ G ++P M+++ + E G
Sbjct: 236 FVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIG 295

Query: 310 -LMMLPGALLMGIMSPVTGRMFDKFGARWLVIPGLGIVTVTTFMFAVLDTETTFTYLAVV 368
+++ PG + + I + G + D+ G +++ G+ ++V+ + L ETT ++ ++
Sbjct: 296 SVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLL-ETTSWFMTII 354

Query: 369 NAVRMLGISMVMMPSTTAGLNQLTNKLVPHGTAMNNTMRQVAGAVGTALFVSVMTITMI 427
+ G+S +T + L + G ++ N ++ G A+ +++I ++
Sbjct: 355 IVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLL 413


31  AAT16_11345  AAT16_11450  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
AAT16_11345434-3.5868684Fe-4S ferredoxin
AAT16_11350534-3.509771peptidase
AAT16_11355536-3.733285hypothetical protein
AAT16_11360536-2.866802xylose isomerase
AAT16_11365433-2.683838ABC transporter permease
AAT16_11370227-1.361539sugar ABC transporter ATP-binding protein
AAT16_11375226-0.543277dehydrogenase
AAT16_11380023-0.268782sugar ABC transporter substrate-binding protein
AAT16_11385-120-0.840620dehydrogenase
AAT16_11390-218-0.750894sugar phosphate isomerase
AAT16_11400016-0.560824molecular chaperone GroEL
AAT16_11405213-1.789230molecular chaperone GroES
AAT16_11410111-1.537593hypothetical protein
AAT16_11415110-1.805668phosphate:AMP phosphotransferase
AAT16_11420111-1.732264oligoendopeptidase F
AAT16_11425015-2.268331hypothetical protein
AAT16_11430-211-2.885391hypothetical protein
AAT16_11435-211-2.228844hypothetical protein
AAT16_11440-112-2.770841preprotein translocase subunit TatC
AAT16_11445-116-3.143646translocase
AAT16_11450-217-3.620037redox-sensing transcriptional repressor rex
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_11375  FLGHOOKAP1  30  0.018  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 29.9 bits (67), Expect = 0.018
Identities = 20/85 (23%), Positives = 31/85 (36%), Gaps = 6/85 (7%)

Query: 123 GVKHQVAFNYRKTPAVALAKKYIEDGEIGRILSFRGTYLQDWSANPDSPLSWRFQES--- 179
+ VA+ + + +K + G +G IL+FR L L+ F E+
Sbjct: 252 PSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQ-LALAFAEAFNT 310

Query: 180 --SAGSGALGDIGTHVIDLAHYLVA 202
AG A GD G + V
Sbjct: 311 QHKAGFDANGDAGEDFFAIGKPAVL 335


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_11425  IGASERPTASE  63  1e-12  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 62.8 bits (152), Expect = 1e-12
Identities = 50/265 (18%), Positives = 89/265 (33%), Gaps = 12/265 (4%)

Query: 26 TTISADETDDIEEQSAQTQQESEETESQLNEPAESESTEEATGEQSQEASPELETDLTES 85
T I+ + S + E + P + +T T E E S + + ++
Sbjct: 995 TNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKN 1054

Query: 86 QLEPVEPEKNDAEAAKEEAINDRQNHDDEGEAEGTSPVNDFLPGNETENQE--ESSEEND 143
+ + E + E AKE N + N T G+ET+ + E+ E
Sbjct: 1055 EQDATETTAQNREVAKEAKSNVKAN---------TQTNEVAQSGSETKETQTTETKETAT 1105

Query: 144 VEASSEEQTSEESTEETAGEESTEQPALEGPSSEEQQEEPSLETPTDDISEEEPNEEQTG 203
VE + + E T+E S P E + + Q EP+ E ++ +EP +
Sbjct: 1106 VEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARE-NDPTVNIKEPQSQTNT 1164

Query: 204 EPAVDEPENEEPKEENTGETSGEESTEQTPTVENPKGESSSEPSTEELSKENTSEQTKEE 263
++P E T VENP+ + + S+ + + +
Sbjct: 1165 TADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHR 1224

Query: 264 TVEMPQEESTEEAAGSAADDGKEAS 288
+ E A S+ D A
Sbjct: 1225 RSVRSVPHNVEPATTSSNDRSTVAL 1249



Score = 37.4 bits (86), Expect = 1e-04
Identities = 42/245 (17%), Positives = 78/245 (31%), Gaps = 7/245 (2%)

Query: 134 NQEESSEENDVEASSEEQTSEESTEETAGEESTEQPAL--EGPSSEEQQEEPSLETPTDD 191
N E V+ ++ + + + + E+ A E P PS T T
Sbjct: 982 NPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTET-- 1039

Query: 192 ISEEEPNEEQTGEPAVDEPENEEPKEENTGETSGEESTEQTPTVENPKGESSSEP-STEE 250
++E E +T E +E + E +N +S + T N +S SE T+
Sbjct: 1040 VAENSKQESKTVE--KNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097

Query: 251 LSKENTSEQTKEETVEMPQEESTEEAAGSAADDGKEASVGEKQGKREIYRYDYDVLEGIN 310
+ T+ KEE ++ E++ E ++ K+ Q + E R + +
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157

Query: 311 LTPGGSEEQLKLLDKRVNRLMTSKIVDVEDMSEEEIMEIEEEVKKEEGITQESDREELPN 370
+ + + V +E TQ + E N
Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSN 1217

Query: 371 TGENN 375
+N
Sbjct: 1218 KPKNR 1222


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_11445  TATBPROTEIN  36  4e-06  Bacterial sec-independent translocation TatB protein...
		>TATBPROTEIN#Bacterial sec-independent translocation TatB protein

signature.
Length = 171

Score = 35.8 bits (82), Expect = 4e-06
Identities = 12/65 (18%), Positives = 35/65 (53%), Gaps = 3/65 (4%)

Query: 13 VGPTSMVVIAVVALIIFGPKKLPQFGRAMGSTLREFKDATKGLATDDDEE---EEKEKNQ 69
+G + ++++ ++ L++ GP++LP + + +R + + + +E +E + +
Sbjct: 4 IGFSELLLVFIIGLVVLGPQRLPVAVKTVAGWIRALRSLATTVQNELTQELKLQEFQDSL 63

Query: 70 KKIES 74
KK+E
Sbjct: 64 KKVEK 68


32  AAT16_11605  AAT16_11680  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
AAT16_11605221-1.331453thiamine-phosphate pyrophosphorylase
AAT16_11610222-2.290459hypothetical protein
AAT16_11615222-2.742174phosphomethylpyrimidine kinase
AAT16_11620224-3.872884thiaminase
AAT16_11625318-1.276035hypothetical protein
AAT16_11630115-2.238956hypothetical protein
AAT16_11635116-1.113941hypothetical protein
AAT16_116402240.0760023-hydroxyacyl-ACP dehydratase
AAT16_116452240.353994hypothetical protein
AAT16_116503271.175181UDP-N-acetylglucosamine
AAT16_116552300.148090membrane protein
AAT16_11660330-0.122065ATP synthase F0F1 subunit epsilon
AAT16_11665232-0.248634ATP F0F1 synthase subunit beta
AAT16_11670227-0.860130ATP synthase F0F1 subunit gamma
AAT16_11675126-1.679471ATP F0F1 synthase subunit alpha
AAT16_11680220-3.609024hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_11625  IGASERPTASE  42  2e-06  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 42.4 bits (99), Expect = 2e-06
Identities = 21/78 (26%), Positives = 33/78 (42%), Gaps = 2/78 (2%)

Query: 134 APAVEAPQVEEPAQPAQQEQSNNEAAEQQAAQQQEQA--AAEQAAQKEAAAKQEREAAQQ 191
APA + E A+ ++QE E EQ A + Q A++A A Q E AQ
Sbjct: 1029 APATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQS 1088

Query: 192 QKQEQQAQQQQQTQTENQ 209
+ ++ Q + +T
Sbjct: 1089 GSETKETQTTETKETATV 1106



Score = 38.9 bits (90), Expect = 2e-05
Identities = 20/96 (20%), Positives = 40/96 (41%), Gaps = 2/96 (2%)

Query: 122 KTLTVSGENAEQAPAVEAPQVEEPAQPAQQ--EQSNNEAAEQQAAQQQEQAAAEQAAQKE 179
+T EN++Q ++ + Q E + + +A Q + A + KE
Sbjct: 1035 ETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKE 1094

Query: 180 AAAKQEREAAQQQKQEQQAQQQQQTQTENQSTSSSS 215
+ +E A +K+E+ + ++TQ + TS S
Sbjct: 1095 TQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVS 1130



Score = 35.0 bits (80), Expect = 3e-04
Identities = 22/109 (20%), Positives = 42/109 (38%), Gaps = 5/109 (4%)

Query: 115 SDMIFAGKTLTVSGENAEQAPAVEAPQVEEPAQPAQQEQSNNEAAEQQAAQQQEQAAAEQ 174
S++ +T V+ +E + +E A ++E++ E + Q + + +
Sbjct: 1074 SNVKANTQTNEVAQSGSETKETQTT-ETKETATVEKEEKAKVETEKTQEVPKVTSQVSPK 1132

Query: 175 AAQKEAAAKQEREAAQQQK----QEQQAQQQQQTQTENQSTSSSSNSGQ 219
Q E Q A + +E Q+Q TE + +SSN Q
Sbjct: 1133 QEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQ 1181



Score = 30.8 bits (69), Expect = 0.008
Identities = 18/90 (20%), Positives = 37/90 (41%), Gaps = 5/90 (5%)

Query: 131 AEQAPAVEAPQVEEPAQPAQQEQSNNEAAEQQAAQQQEQAAAEQAAQKEAAAKQEREAAQ 190
E E P+V P Q++ E + QA +E KE ++ A
Sbjct: 1114 VETEKTQEVPKVTSQVSPKQEQS---ETVQPQAEPARENDPTVNI--KEPQSQTNTTADT 1168

Query: 191 QQKQEQQAQQQQQTQTENQSTSSSSNSGQN 220
+Q ++ + +Q TE+ + ++ ++ +N
Sbjct: 1169 EQPAKETSSNVEQPVTESTTVNTGNSVVEN 1198



Score = 30.4 bits (68), Expect = 0.011
Identities = 18/92 (19%), Positives = 32/92 (34%), Gaps = 7/92 (7%)

Query: 131 AEQAPAVEAPQVEEPAQPAQQEQSNNEAAEQQAAQQQEQAAAEQAAQKEAAA---KQERE 187
Q + P+ P +N E A A A A + E A KQE +
Sbjct: 994 TTNITTPNNIQADVPSVP----SNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESK 1049

Query: 188 AAQQQKQEQQAQQQQQTQTENQSTSSSSNSGQ 219
++ +Q+ Q + ++ S+ + Q
Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKANTQ 1081


33  AAT16_12305  AAT16_12685  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
AAT16_12305-1153.256797hypothetical protein
AAT16_12310-1142.102835PTS sugar transporter subunit IIC
AAT16_123151131.932832N-acetylmuramic acid-6-phosphate etherase
AAT16_123201132.260659hypothetical protein
AAT16_123252161.743773membrane protein
AAT16_123301162.015503hypothetical protein
AAT16_123353192.127145hypothetical protein
AAT16_123403214.288036hypothetical protein
AAT16_123451203.800358hypothetical protein
AAT16_123502214.326015ABC transporter ATP-binding protein
AAT16_123551194.288792hypothetical protein
AAT16_123603214.400205hypothetical protein
AAT16_123652194.324274transporter
AAT16_123702184.034502hypothetical protein
AAT16_123753183.729721sodium:proton antiporter
AAT16_123801121.6939485-carboxymethyl-2-hydroxymuconate isomerase
AAT16_123850121.533186peptidase M20
AAT16_12390-1110.944651aminobenzoyl-glutamate transporter
AAT16_12400-312-1.021216hypothetical protein
AAT16_12405-2150.030284hypothetical protein
AAT16_124100151.449005hypothetical protein
AAT16_124150192.450646membrane protein
AAT16_124201203.032696carboxylesterase
AAT16_124251173.031374hypothetical protein
AAT16_124301172.953154ring-cleaving dioxygenase
AAT16_124350141.177132glyoxalase
AAT16_12440013-0.572915MarR family transcriptional regulator
AAT16_12445015-1.512421glyoxalase
AAT16_12450018-2.078656glutamine synthetase
AAT16_12455227-4.687942GCN5 family acetyltransferase
AAT16_12460030-8.776690hypothetical protein
AAT16_12465236-9.156741PTS sorbitol transporter subunit IIA
AAT16_12470026-6.911661PTS sorbitol transporter subunit IIB
AAT16_12475-117-5.267486PTS system glucitol/sorbitol-specific
AAT16_12480-120-4.755916glucitol operon activator
AAT16_12485-220-4.177146hypothetical protein
AAT16_12490-218-1.567899sorbitol-6-phosphate 2-dehydrogenase
AAT16_12495-118-1.278499glutamine synthetase
AAT16_12500124-2.119210hypothetical protein
AAT16_12505334-3.739654transketolase
AAT16_12510435-4.948388iditol 2-dehydrogenase
AAT16_12515335-6.975576hypothetical protein
AAT16_12520231-6.936982sorbitol dehydrogenase
AAT16_12525127-6.525422PTS system galactitol-specific transporter
AAT16_12530022-5.922107PTS galactitol transporter subunit IIB
AAT16_12535-117-4.193760PTS sugar transporter subunit IIA
AAT16_12540-114-2.929769transcription antiterminator BglG
AAT16_12545-2191.999455hypothetical protein
AAT16_12555-2172.517363hypothetical protein
AAT16_12560-1223.236314hypothetical protein
AAT16_12565-3152.319264hypothetical protein
AAT16_12570-3140.977136amino acid ABC transporter ATP-binding protein
AAT16_12575-3131.085252acyl-CoA dehydrogenase
AAT16_12580-2130.704275hypothetical protein
AAT16_12585-3181.879918dihydrodipicolinate synthase
AAT16_12590-2151.322017antiporter
AAT16_12595-1222.383409hypothetical protein
AAT16_12600-1264.471474proline racemase
AAT16_126050182.818377glycine oxidase
AAT16_126101170.962837hypothetical protein
AAT16_12615017-0.529606hypothetical protein
AAT16_12620118-1.089221sarcosine oxidase subunit alpha
AAT16_12625019-1.987031hypothetical protein
AAT16_12630126-3.461061hypothetical protein
AAT16_12635019-2.241732hypothetical protein
AAT16_12645-113-0.640704peptide ABC transporter permease
AAT16_12650-117-1.478987peptide ABC transporter permease
AAT16_12655-117-1.931402hypothetical protein
AAT16_12660019-2.151375proline racemase
AAT16_12665017-2.509060metallopeptidase
AAT16_12670022-1.916701dihydrodipicolinate synthase
AAT16_12675431-6.450209C4-dicarboxylate ABC transporter permease
AAT16_12680119-4.275984hypothetical protein
AAT16_12685116-3.216830hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_12310  ACRIFLAVINRP  31  0.013  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 30.6 bits (69), Expect = 0.013
Identities = 24/127 (18%), Positives = 56/127 (44%), Gaps = 11/127 (8%)

Query: 168 ALGAISGALIFNPALDEMELFGEMLVSGRGGLFAVMMSAFLMAMLEQQIRKVVPNSLDLI 227
+G + A +FN D + G + G A+++ F ++E++ + VV +L +
Sbjct: 908 IVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAV 967

Query: 228 VTSTITVFVVGLITVIGLQPV------GAVLSEGIIVSINWVLEVGGIFAGAVLAAVFLP 281
+ + L ++G+ P+ G+ + + + +GG+ + +LA F+P
Sbjct: 968 RMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGV-----MGGMVSATLLAIFFVP 1022

Query: 282 LVLVGLH 288
+ V +
Sbjct: 1023 VFFVVIR 1029


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_12330  IGASERPTASE  48  7e-08  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 48.1 bits (114), Expect = 7e-08
Identities = 35/157 (22%), Positives = 62/157 (39%), Gaps = 9/157 (5%)

Query: 298 TTTTKAENDAPEVDTGELESVIAEAEAVSEADRIPSLQSA-----LKNAKAVVEDDETTQ 352
TT + D P V + E IA + P+ S +N+K + E +
Sbjct: 998 TTPNNIQADVPSVPSNNEE--IARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNE 1055

Query: 353 QQADEVEAALASALEENEDELKAAEEESSEEGSSEESTEEKTNEASTEEETTEEPAAEET 412
Q A E A +E + +KA + + S E+ E +T E +E A+
Sbjct: 1056 QDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVE 1115

Query: 413 GESTEEESPEEEMTEESAEESEAESETASAQAEASND 449
E T+E + ++ S ++ ++E+ A+ ND
Sbjct: 1116 TEKTQEVP--KVTSQVSPKQEQSETVQPQAEPAREND 1150



Score = 32.7 bits (74), Expect = 0.003
Identities = 28/161 (17%), Positives = 55/161 (34%), Gaps = 9/161 (5%)

Query: 289 EPLLTSYFRTTTTKAENDAPEVDTG-ELESVIAEAEAVSEADRIPSLQSALKNAKAVVED 347
P +TS ++E P+ + E + + E S+ + + K + VE
Sbjct: 1122 VPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQ 1181

Query: 348 DETTQQQADEVEAALASALEENEDELKAAEEESSEEGSSEESTEEKTNEASTEEETTEEP 407
T + +++ EN + A + + S + + + EP
Sbjct: 1182 PVTESTTVNT-----GNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEP 1236

Query: 408 AAEETGESTEEESPEEEMTEESAEESEAESETASAQAEASN 448
A + + + + T +A S+A A AQ A N
Sbjct: 1237 ATTSSNDRSTVALCDLTSTNTNAVLSDA---RAKAQFVALN 1274


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_12365  FIMREGULATRY  30  0.004  Escherichia coli: P pili regulatory PapB protein si...
		>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein

signature.
Length = 104

Score = 30.3 bits (68), Expect = 0.004
Identities = 14/38 (36%), Positives = 24/38 (63%)

Query: 182 FILLIARSLTLPGAMEGVEFLLMPDFSAITSEAILFAL 219
F+L I S+ LPG+M + F L+ S+I S+ ++ A+
Sbjct: 14 FLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAM 51


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_12455  SACTRNSFRASE  28  0.014  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.0 bits (62), Expect = 0.014
Identities = 13/53 (24%), Positives = 28/53 (52%), Gaps = 6/53 (11%)

Query: 87 ISVAPSYQNKGIGSQMIVTALKRAEEMGYESVIVLGHD------KYYPRFGFR 133
I+VA Y+ KG+G+ ++ A++ A+E + +++ D +Y + F
Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_12485  PF09025  30  0.010  YopR Core
		>PF09025#YopR Core

Length = 143

Score = 30.4 bits (68), Expect = 0.010
Identities = 14/83 (16%), Positives = 26/83 (31%), Gaps = 1/83 (1%)

Query: 527 IMKEKNKPTINNNLGFPHAVHTLEKIKIKIA-ILDESLKDYEDLKLIILMAIPENNVNEA 585
P L E++ + A L D +LK ++ +P +
Sbjct: 34 QALGGEPPAAGRRLAGLENGALGERLLQRFAQPLQGLEADRLELKAMLRAELPLGRQQQT 93

Query: 586 VLIRLYEEILSLATNDYLMDRIR 608
L++L + +YL R
Sbjct: 94 FLLQLLGAVEHAPGGEYLAQLAR 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_12490  DHBDHDRGNASE  116  1e-33  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 116 bits (292), Expect = 1e-33
Identities = 65/271 (23%), Positives = 119/271 (43%), Gaps = 21/271 (7%)

Query: 3 NWLNIDKKVVVITGGSSGIGRRILESLLENGAIVYNADMKDNPIDHNNYHY--------- 53
N I+ K+ ITG + GIG + +L GA + D ++
Sbjct: 2 NAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA 61

Query: 54 LKTDVTQEENVKNTVEQIVNEQKQIDVLINNAGINLPRVLVDVRGEKPEYEINMKDLDFM 113
DV + +I E ID+L+N AG+ P + ++ ++ +
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRP---------GLIHSLSDEEWEAT 112

Query: 114 FAVNLKGPVLFSREVSRQFVEQQHGVIINVSSEAGQEGSQGQSIYSATKAALIGFTRSWA 173
F+VN G SR VS+ ++++ G I+ V S + Y+++KAA + FT+
Sbjct: 113 FSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLG 172

Query: 174 KELGEHNIRVVAIAPGILEETGLRTAAYEEALAYSRNTTVEGLNSDYSKSIPIGRVGELT 233
EL E+NIR ++PG E + +E ++G + IP+ ++ + +
Sbjct: 173 LELAEYNIRCNIVSPGSTETDMQWSLWADE---NGAEQVIKGSLETFKTGIPLKKLAKPS 229

Query: 234 EVADLVCYLASEKSSYITGTTINISGGKSRG 264
++AD V +L S ++ +IT + + GG + G
Sbjct: 230 DIADAVLFLVSGQAGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_12540  PF08280  32  0.011  M protein trans-acting positive regulator
		>PF08280#M protein trans-acting positive regulator

Length = 530

Score = 31.7 bits (72), Expect = 0.011
Identities = 30/153 (19%), Positives = 56/153 (36%), Gaps = 15/153 (9%)

Query: 10 LFDELLKNPSVTSKELEEKYKLTRRQFGYSFNKINDFLVSKNLPKIERGRQGNFIIEQTV 69
L K S+ E+ EK LT Q + ++N F I++ I Q
Sbjct: 49 LVVLFFKTSSLPITEVAEKTGLTFLQLNHYCEELNAFFPDSLSMTIQKRM----ISCQ-- 102

Query: 70 ISNLSDEDEFQIKESNVYSEKQRFFMILLMLIGSKEELSLNHFAIELDVSRNTILNDLKH 129
++ S E +Y+ ++ ++ L FA +S ++ +
Sbjct: 103 FTHPSKETYLYQ----LYASSNVLQLLAFLIKNGSHSRPLTDFARSHFLSNSSAYRMREA 158

Query: 130 VRKIASEFYLSIKYSRIKGYVIEGEEFYIRKLL 162
+ + F L + ++I GEE+ IR L+
Sbjct: 159 LIPLLRNFELKLSKNKIV-----GEEYRIRYLI 186


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_12555  SECFTRNLCASE  31  0.003  Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 31.0 bits (70), Expect = 0.003
Identities = 25/132 (18%), Positives = 56/132 (42%), Gaps = 14/132 (10%)

Query: 79 LVVTSFLSLFGVGFDPASLNSAAVVIVTFSICYSVFQSEIIRGALHSLDKDQIEAAQSLG 138
L+ ++ + FD ++ +A + I +SI +V + +R L + +
Sbjct: 191 LLTVGLFAVLQLKFDLTTV-AALLTITGYSINDTVVVFDRLRENLIKYKTMPL--RDVMN 247

Query: 139 YSTSQTLRKVIIPQVMTEALPDTMNAFLIIIKALSLAFLVTVVDIFAQARLVGAQTFSYL 198
S ++TL + ++ + T L+ + + L + V+ F A + G T +Y
Sbjct: 248 LSVNETLSRTVMTGMTT----------LLALVPM-LIWGGDVIRGFVFAMVWGVFTGTYS 296

Query: 199 EAFVAAALVYWV 210
+VA +V ++
Sbjct: 297 SVYVAKNIVLFI 308


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_12580  FLGFLGJ  29  0.032  Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 28.9 bits (64), Expect = 0.032
Identities = 12/51 (23%), Positives = 24/51 (47%), Gaps = 4/51 (7%)

Query: 346 GASVSYAMHLIKSLSDFETMPDSKISKKLCEFGVTLSRRTVNKYKNEILSQ 396
G + A ++K ++ + +P+ +F TV +Y+N+ LSQ
Sbjct: 85 GKGLGLAEMMVKQMTPEQPLPEESTPAAPMKF----PLETVVRYQNQALSQ 131


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_12610  HTHFIS  361  e-123  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 361 bits (928), Expect = e-123
Identities = 121/342 (35%), Positives = 187/342 (54%), Gaps = 28/342 (8%)

Query: 139 FRNIIYKSSVMEHVKNQIEKAAKTNANILITGETGVGKELFAKSIHDTSA-VKGEFIPIN 197
++ +S+ M+ + + + +T+ ++ITGE+G GKEL A+++HD G F+ IN
Sbjct: 136 GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAIN 195

Query: 198 CGAIPSHLFESELFGYEKGAFTGANREGNKGKIELAEGGTLFLDEMGDMPLDMQVKFLRV 257
AIP L ESELFG+EKGAFTGA G+ E AEGGTLFLDE+GDMP+D Q + LRV
Sbjct: 196 MAAIPRDLIESELFGHEKGAFTGAQTRS-TGRFEQAEGGTLFLDEIGDMPMDAQTRLLRV 254

Query: 258 LQEKQYFKLGGNKEKSADFRLVSATNRKIADLLASDDFRSDLLYRINVVNIHIPPLRERP 317
LQ+ +Y +GG +D R+V+ATN+ + + FR DL YR+NVV + +PPLR+R
Sbjct: 255 LQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRA 314

Query: 318 DDIESLFFYYLYSLSEKYGTSVKYANQQLINHLKAYHWPGNVRELINVIERLVIFSNEEA 377
+DI L +++ +EK G VK +Q+ + +KA+ WPGNVREL N++ RL ++
Sbjct: 315 EDIPDLVRHFV-QQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDV 373

Query: 378 LNNEIFDQY---------LTEVGEDAKQATLPSVTEELE----------------LKDYV 412
+ EI + + + + ++ EE +
Sbjct: 374 ITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVL 433

Query: 413 EKIEADYIRHVLEENGQNVERASKALGISRPTLYAKVKRFGL 454
++E I L N +A+ LG++R TL K++ G+
Sbjct: 434 AEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
AAT16_12655  TYPE4SSCAGA  30  0.030  Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 30.1 bits (67), Expect = 0.030
Identities = 28/123 (22%), Positives = 58/123 (47%), Gaps = 9/123 (7%)

Query: 381 NMTDIEEAQKLWEQGLE--ELGTDSITLELLSYDDDQRKAMAEYMKNQWENNLPGLTVAI 438
N ++++AQK E+ L E + +L S ++ K A+ N ++ + L I
Sbjct: 603 NYDEVKKAQKDLEKSLRKREHLEKEVEKKLESKSGNKNKMEAKAQANSQKDEIFAL---I 659

Query: 439 NQQPNKQKLDLEGKQDYDMSFSGWRNDISDPVEFLNVHLSDGPYNWQDFANEEYDELVKK 498
N++ N+ + Y + G + ++SD +E +N +L D ++ +F N + + K
Sbjct: 660 NKEANRDARAIA----YAQNLKGIKRELSDKLENVNKNLKDFDKSFDEFKNGKNKDFSKA 715

Query: 499 AQT 501
+T
Sbjct: 716 EET 718


34  AAT16_12765  AAT16_13155  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
AAT16_12765  -1      14       3.241325    mannonate dehydratase
AAT16_12770  0       18       3.883250    D-mannonate oxidoreductase
AAT16_12775  0       23       5.401235    hypothetical protein
AAT16_12780  1       25       5.444304    hypothetical protein
AAT16_12785  1       25       5.862732    hypothetical protein
AAT16_12790  0       24       5.441028    N-acetyl-L,L-diaminopimelate aminotransferase
AAT16_12795  0       22       5.232176    membrane protein
AAT16_12800  0       21       5.481533    peptidase M20
AAT16_12805  -1      20       4.647347    transporter
AAT16_12810  -2      16       3.643009    membrane protein
AAT16_12815  -1      23       4.583801    Zn-dependent hydrolase
AAT16_12820  0       23       4.785037    hypothetical protein
AAT16_12825  0       25       4.937704    cytoplasmic protein
AAT16_12830  2       27       5.179594    hypothetical protein
AAT16_12835  2       27       5.080341    PTS mannose transporter subunit IIB
AAT16_12840  1       26       4.974441    oligo-beta-mannoside permease IIC protein
AAT16_12845  0       20       3.590521    PTS system cellobiose-specific transporter
AAT16_12850  0       20       4.415104    6-phospho-beta-glucosidase
AAT16_12855  -1      19       4.734746    hypothetical protein
AAT16_12865  -1      16       4.550400    *membrane protein
AAT16_12870  0       16       4.164568    mannose-6-phosphate isomerase
AAT16_12875  0       20       4.939656    hypothetical protein
AAT16_12880  1       24       6.358114    PTS mannose transporter subunit IIABC
AAT16_12885  2       25       6.639175    sulfurase
AAT16_12890  3       27       6.899964    hypothetical protein
AAT16_12895  2       29       6.973675    hypothetical protein
AAT16_12900  2       17       4.280905    hypothetical protein
AAT16_12905  4       16       1.924252    gluconokinase
AAT16_12910  3       20       -0.932649   gluconate transporter
AAT16_12915  7       33       -5.424789   galactonate dehydratase
AAT16_12920  11      48       -9.034859   hypothetical protein
AAT16_12930  10      40       -7.281639   C4-dicarboxylate ABC transporter permease
AAT16_12935  7       33       -6.070940   hypothetical protein
AAT16_12940  3       24       -3.722321   hypothetical protein
AAT16_12945  3       20       -3.437403   hypothetical protein
AAT16_12950  -1      15       -0.218636   hypothetical protein
AAT16_12955  -2      13       2.275223    sodium:pantothenate symporter
AAT16_12960  -1      15       1.115154    hypothetical protein
AAT16_12965  -1      14       1.265214    5-carboxymethyl-2-hydroxymuconate isomerase
AAT16_12970  0       14       0.997681    XRE family transcriptional regulator
AAT16_12975  0       11       -0.678319   hypothetical protein
AAT16_12980  1       14       -3.727233   gluconate permease
AAT16_12985  2       20       -6.444710   hypothetical protein
AAT16_12990  2       31       -9.693895   tartronate semialdehyde reductase
AAT16_12995  3       34       -10.215620  GntR family transcriptional regulator
AAT16_13000  1       30       -9.537452   hypothetical protein
AAT16_13005  1       25       -7.630324   hypothetical protein
AAT16_13010  0       21       -6.183728   integrase
AAT16_13015  -1      17       -5.092729   hypothetical protein
AAT16_13020  -2      14       -2.964038   restriction endonuclease subunit M
AAT16_13025  -2      13       -1.575643   restriction endonuclease subunit R
AAT16_13030  -1      19       2.320594    multidrug MFS transporter
AAT16_13035  0       27       2.943064    hypothetical protein
AAT16_13040  0       27       3.256554    SAM-dependent methyltransferase
AAT16_13045  0       28       3.745655    hypothetical protein
AAT16_13050  1       29       3.660043    2-dehydro-3-deoxyphosphogluconate aldolase
AAT16_13055  0       26       2.941121    hypothetical protein
AAT16_13065  -2      19       2.429387    hypothetical protein
AAT16_13075  -1      16       2.053668    hypothetical protein
AAT16_13080  -2      17       1.643311    MFS transporter
AAT16_13085  -1      16       1.090548    hypothetical protein
AAT16_13090  -2      14       1.241557    hypothetical protein
AAT16_13095  -2      13       2.487814    glycerate kinase
AAT16_13100  0       13       3.403515    anion:sodium symporter
AAT16_13105  0       14       3.795762    hypothetical protein
AAT16_13110  -1      13       4.026250    sodium:proton antiporter
AAT16_13115  -1      14       4.182131    NAD-dependent dehydratase
AAT16_13120  0       15       4.432301    dihydroorotate dehydrogenase
AAT16_13125  -2      14       3.136283    transporter
AAT16_13130  -1      16       3.593619    5,10-methylene tetrahydromethanopterin
AAT16_13135  0       16       4.021119    hypothetical protein
AAT16_13140  -1      15       3.926410    hypothetical protein
AAT16_13145  -1      16       4.106160    galactoside O-acetyltransferase
AAT16_13150  0       17       3.964989    hypothetical protein
AAT16_13155  -1      20       3.851576    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_12770  DHBDHDRGNASE  94     6e-25    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 94.3 bits (234), Expect = 6e-25
Identities = 69/275 (25%), Positives = 122/275 (44%), Gaps = 24/275 (8%)

Query: 4 NFEGLSGKTAVITGGSGVLCQEMAKELARQGMKVAILNRNKENGQKIADEIANNEGTAIA 63
N +G+ GK A ITG + + + +A+ LA QG +A ++ N E +K+ + A A
Sbjct: 2 NAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA 61

Query: 64 VTCDVLDEESVQKAYVTVKEQLGECDLLINGAGGNHPDAITDKETFEKGDIENDSLKSFF 123
DV D ++ + ++ ++G D+L+N AG P I
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHS------------------ 103

Query: 124 DLELKGFDHVFRLNLVGSLIPTQVFGKEMTN-RGGTVINISSMSAPSPMTKVPAYSAAKA 182
L + ++ F +N G ++ K M + R G+++ + S A P T + AY+++KA
Sbjct: 104 -LSDEEWEATFSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKA 162

Query: 183 GIDNLTQWLAVHFADAGIRVNAIAPGFFLTKQNRNLLLKEDGS---FSERAEKIISHTPQ 239
T+ L + A+ IR N ++PG T +L E+G+ E + P
Sbjct: 163 AAVMFTKCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPL 222

Query: 240 RRFGDPEDLLGTLLWLADDNTSKFVTGITVPVDGG 274
++ P D+ +L+L +T + VDGG
Sbjct: 223 KKLAKPSDIADAVLFLVSGQAGH-ITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_12785  TYPE3IMSPROT  32     0.008    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 31.7 bits (72), Expect = 0.008
Identities = 14/90 (15%), Positives = 31/90 (34%)

Query: 141 AQFVAFGVLFEVVLGVSFSIGVIVGGIITIIYTMLGGFFAVALTDFIQGLLMAFALFILP 200
L +++G+S ++ I F+ AL+ + +L+ F P
Sbjct: 30 VSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFP 89

Query: 201 ILAIIEIGGFNRMGTLLGESMGTEFLQPFF 230
+L + + G + E ++P
Sbjct: 90 LLTVAALMAIASHVVQYGFLISGEAIKPDI 119


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_12820  PF01206  61     4e-14    SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 60.5 bits (147), Expect = 4e-14
Identities = 19/68 (27%), Positives = 44/68 (64%)

Query: 126 IEASGLQCPGPLLRVNETMGQLDPGQQMEITVTDFGFCTDVEAWAKKTGNTVLKNEKKED 185
++A+GL CP P+L+ +T+ ++ G+ + + TD G D E+++K+TG+ +L+ ++++
Sbjct: 8 LDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKEEDG 67

Query: 186 KVVVVLEK 193
L++
Sbjct: 68 TYHFRLKR 75


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_12855  PF05043  37     2e-04    Transcriptional activator
		>PF05043#Transcriptional activator

Length = 493

Score = 37.2 bits (86), Expect = 2e-04
Identities = 22/132 (16%), Positives = 52/132 (39%), Gaps = 8/132 (6%)

Query: 100 ELLLTRGHVKSEDLADALFISRSTLQSDLKAVKGIL-AQYDLEIESKPNYGMRATGTEMN 158
E + ++E + +IS S+L + + ++ Q+ E+ P + G E +
Sbjct: 93 EFIFFNEGCQAESICKEFYISSSSLYRIISQINKVIKRQFQFEVSLTPV---QIIGNERD 149

Query: 159 LRFCLSQYVFDRRVYKSEPQSVYFDSDELGAVHNTVAEALDDNQLVMTDIAINNLVIHIA 218
+R+ +QY ++ + P F++ + + + M L + +
Sbjct: 150 IRYFFAQYFSEKYYFLEWP----FENFSSEPLSQLLELVYKETSFPMNLSTHRMLKLLLV 205

Query: 219 IALRRIRDGYSV 230
L RI+ G+ +
Sbjct: 206 TNLYRIKFGHFM 217


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_12875  SACTRNSFRASE  27     0.034    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 26.8 bits (59), Expect = 0.034
Identities = 20/112 (17%), Positives = 43/112 (38%), Gaps = 20/112 (17%)

Query: 55 KRFYKEFGKISDLGQYM------IFIENEDSELVGTV---TAWHG--TVKDRLHGRLHWF 103
K ++K++ Y+ F+ ++ +G + + W+G ++D
Sbjct: 44 KPYFKQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIED--------I 95

Query: 104 NVVPDFQGRGLGVPLLSKGMAMLQENHEEAF-LKVDVNNKMMVRLFISMGWK 154
V D++ +G+G LL K + +ENH L+ N + +
Sbjct: 96 AVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
AAT16_12940  PERTACTIN  29     0.028    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 29.3 bits (65), Expect = 0.028
Identities = 36/121 (29%), Positives = 51/121 (42%), Gaps = 13/121 (10%)

Query: 86 GSGSVGMTQAANANPDGYTATMVIAELAMYEHLGT-SPLTPEDFKPVALINYDPAALTVP 144
G+G V + + AN +T+V L H+GT PL PED P ++ D + VP
Sbjct: 159 GAGGVRVERGANVTVQ--RSTIVDGGL----HIGTLQPLQPEDLPPSRVVLGDTSVTAVP 212

Query: 145 ADAPYDTVGEFIEYAKE---HPGEVSVGNAGPGSIWHVAAANLENAADIELNHVPHEGAA 201
A V F+ A E G ++ G A + A +L+ A I P GA
Sbjct: 213 ASGAPAAV--FVFGANELTVDGGHITGGRAAGVAAMDGAIVHLQRAT-IRRGDAPAGGAV 269

Query: 202 P 202
P
Sbjct: 270 P 270


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_13030  TCRTETB  52     2e-09    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 51.8 bits (124), Expect = 2e-09
Identities = 66/400 (16%), Positives = 138/400 (34%), Gaps = 66/400 (16%)

Query: 18 IIFFYHFIVMFS------MYVSIVTIGNFAIENFNASASTAGLVASIFIVGVLAGRAISG 71
I+ + + FS + VS+ I N +FN ++ V + F++ G A+ G
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIAN----DFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 72 YQVNRLGARKIMYIGTVLFFLTYGLYFIDGGLV-LLIAARFLNGFATGLISTALNTLATI 130
++LG ++++ G ++ + F+ LLI ARF+ G + +
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130

Query: 131 SVPENRRGEGISYFSLSFVLGSAVGPFLGFLLLEIMS----FNTMLILVLIAVFIVALMT 186
+P+ RG+ +G VGP +G ++ + +I ++ F++ L+
Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLK 190

Query: 187 PMVRLNNITRDYK----------------------------------------------- 199
VR+ D K
Sbjct: 191 KEVRIKG-HFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249

Query: 200 PEAGKFRMIDRDALPMGFSVLFMGLAYASILSFLNLYAIEVNLVTAASFFFLVYSAVVML 259
P GK L G + + S++ ++ +++ S + V++
Sbjct: 250 PGLGKNIPFMIGVL-CGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVII 308

Query: 260 TRPLTGKMMDQKGANIVLYPTFIFMAIGFYVLG--NSTTGFIMLLAGALIGLGFGNFQSI 317
+ G ++D++G VL F+++ F TT + M + + G +++
Sbjct: 309 FGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 318 AQTVCVNLADRDNVGLATSTYFIMLEVGLGFGPFFLGFLV 357
T+ + + G S + G G +G L+
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408



Score = 34.1 bits (78), Expect = 0.001
Identities = 23/112 (20%), Positives = 46/112 (41%), Gaps = 4/112 (3%)

Query: 69 ISGYQVNRLGARKIMYIGTVLFFLT-YGLYFIDGG--LVLLIAARFLNGFATGLISTALN 125
I G V+R G ++ IG ++ F+ + I F+ G + T ++
Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLS-FTKTVIS 370

Query: 126 TLATISVPENRRGEGISYFSLSFVLGSAVGPFLGFLLLEIMSFNTMLILVLI 177
T+ + S+ + G G+S + + L G + LL I + L+ + +
Sbjct: 371 TIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEV 422



Score = 28.7 bits (64), Expect = 0.050
Identities = 23/131 (17%), Positives = 50/131 (38%), Gaps = 1/131 (0%)

Query: 253 YSAVVMLTRPLTGKMMDQKG-ANIVLYPTFIFMAIGFYVLGNSTTGFIMLLAGALIGLGF 311
+ + + GK+ DQ G ++L+ I + ++++A + G G
Sbjct: 58 FMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGA 117

Query: 312 GNFQSIAQTVCVNLADRDNVGLATSTYFIMLEVGLGFGPFFLGFLVPSLGYGGLYQSLVI 371
F ++ V ++N G A ++ +G G GP G + + + L +I
Sbjct: 118 AAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMI 177

Query: 372 SILVGLVIFYF 382
+I+ +
Sbjct: 178 TIITVPFLMKL 188


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_13080  TCRTETB  60     8e-12    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 59.5 bits (144), Expect = 8e-12
Identities = 77/372 (20%), Positives = 138/372 (37%), Gaps = 51/372 (13%)

Query: 36 MPIFTEEFGVSATLSSLSMTITTLTLALSMLVFGSISESLGRKNIMVVSMFAASLLCILT 95
+P +F ++ T LT ++ V+G +S+ LG K +++ + ++
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 96 ALSPNFY-VLIALRALQGVVLAGVPSIAMAYISEEIHPRSLAGAMGLYISGNALGAVFGR 154
+ +F+ +LI R +QG A P++ M ++ I + A GL S A+G G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 155 VFSGVAADYIGWHGAMLGIGIISIIATVIFWKSLRPPRNFVAQNFN-------------F 201
G+ A YI W +L I +I+II TV F L + +F+ F
Sbjct: 157 AIGGMIAHYIHW-SYLLLIPMITII-TVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFF 214

Query: 202 MQLTRS---------------LLHHM-------------KNPVLVCFFFVGFLLLGANLS 233
M T S + H+ KN + G ++ G
Sbjct: 215 MLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAG 274

Query: 234 IYSYVTFVFLDVPYSLSQSIVSWIFLI--FIIGIFSSMITGRFVAKFGKVKFIYIALGIT 291
S V ++ DV + LS + + + + + I I G V + G + + I +
Sbjct: 275 FVSMVPYMMKDV-HQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFL 333

Query: 292 LLGVFLL-FIP--NLLIMVLGLSLFTYGFFASHSVVSGLVGENAVSNKAQAS-SLYLFFY 347
+ F+ M + + G + +V+S +V + +A A SL F
Sbjct: 334 SVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTS 393

Query: 348 YTGSSIGGTAAG 359
+ G G
Sbjct: 394 FLSEGTGIAIVG 405


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_13090  DHBDHDRGNASE  57     2e-11    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 56.6 bits (136), Expect = 2e-11
Identities = 49/208 (23%), Positives = 80/208 (38%), Gaps = 14/208 (6%)

Query: 4 KTLAITGATSGIGRATVQELADDFDEIILLARNEVKAEILKKELKEMNRALKVKVIECNL 63
K ITGA GIG A + LA I + N K E + LK R + ++
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARH--AEAFPADV 66

Query: 64 ASLISVEKAALHTQENYEKIDCLINNAGV--VSLSRQETADGHELMMGTNYLGHYLLTHY 121
++++ + ID L+N AGV L + + E N G + +
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 122 LMPILMKSEAPQIVIVSSNAYGFTTLKSDYFKGKGNVMNLYGRSKLAVLYFMQELHEQFS 181
+ +M + IV V SN G M Y SK A + F + L + +
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTS----------MAAYASSKAAAVMFTKCLGLELA 176

Query: 182 DQGVRVTAVHPGAVSTNLGRTKQNEKFG 209
+ +R V PG+ T++ + ++ G
Sbjct: 177 EYNIRCNIVSPGSTETDMQWSLWADENG 204


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_13115  NUCEPIMERASE  46     5e-08    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 45.5 bits (108), Expect = 5e-08
Identities = 23/126 (18%), Positives = 46/126 (36%), Gaps = 17/126 (13%)

Query: 1 MRVLVVGANGQIGHQVAEKLKNKGHDPVA---------MVRKEEQVSQFKDKGIETVLGD 51
M+ LV GA G IG V+++L GH V + K+ ++ G + D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 52 LE--KDFSHAFEN--VDSVVFAAGSGGSTGA----DKTIIIDQEGAIETVDNAKRAGVKH 103
L + + F + + V + + + G + ++ + ++H
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 104 FVIISS 109
+ SS
Sbjct: 121 LLYASS 126


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_13125  ACRIFLAVINRP  29     0.024    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.0 bits (65), Expect = 0.024
Identities = 10/61 (16%), Positives = 27/61 (44%), Gaps = 1/61 (1%)

Query: 218 LATGLAYYLFASGLKNVKSSTAVTLSLAEPLTASLLGVFLVGEILDMWSWAGLIMLLMGI 277
++ + + A+ ++ +V L + + LL L + D++ GL+ +G+
Sbjct: 878 ISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLT-TIGL 936

Query: 278 A 278
+
Sbjct: 937 S 937


35  AAT16_13430  AAT16_13495  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
AAT16_13430  2       14       0.661180    protein-L-isoaspartate(D-aspartate)
AAT16_13435  1       13       0.354365    Clp protease ClpX
AAT16_13440  0       11       -0.579954   hypothetical protein
AAT16_13445  -2      12       -0.600535   hypothetical protein
AAT16_13450  -2      14       -0.497649   CtsR family transcriptional regulator
AAT16_13455  -2      14       -0.024536   nucleoside permease
AAT16_13460  0       15       0.671213    acetyltransferase
AAT16_13465  1       19       0.985305    phosphohydrolase
AAT16_13470  2       20       1.612567    hypothetical protein
AAT16_13475  1       19       1.802622    hypothetical protein
AAT16_13480  2       20       2.253064    ATP phosphoribosyltransferase
AAT16_13485  2       20       2.550886    histidinol dehydrogenase
AAT16_13490  3       20       1.661322    hypothetical protein
AAT16_13495  2       19       1.457598    imidazoleglycerol-phosphate dehydratase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
AAT16_13435  HTHFIS  38     1e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 37.9 bits (88), Expect = 1e-04
Identities = 34/185 (18%), Positives = 59/185 (31%), Gaps = 13/185 (7%)

Query: 494 EESEKLLNLESLLHNRVIGQKDAIGSI---SKAVRRARAGLKNPKRPIGSFIFLGPTGVG 550
+ L +++ + S A++ L + + + G +G G
Sbjct: 113 GIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTG 172

Query: 551 KTELARALSEAMFGEEDAMIRVDMSEYMEKHSVSRLVGSPPG-YVGYDDGGQLTEKVRRK 609
K +ARAL + + ++M+ S L G G + G + +
Sbjct: 173 KELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRST--GRFEQA 230

Query: 610 PYSLILFDEIEKAHPDVFNMLLQVLDDG---RLTDSNGRTVDFRNTIIVMTSNIG-AQEL 665
+ DEI D LL+VL G + D R IV +N Q +
Sbjct: 231 EGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVR---IVAATNKDLKQSI 287

Query: 666 KDQKF 670
F
Sbjct: 288 NQGLF 292



Score = 36.3 bits (84), Expect = 5e-04
Identities = 17/82 (20%), Positives = 31/82 (37%), Gaps = 3/82 (3%)

Query: 145 PDSLQQKDNMKKDHNTPTLDSLARDLTQIARDDMLDPVIGRSSEITRVIEVLSRRTKNN- 203
P + + + D P++GRS+ + + VL+R + +
Sbjct: 103 PKPFDLTELIGIIGRALAEPKRRPSKLEDDSQD-GMPLVGRSAAMQEIYRVLARLMQTDL 161

Query: 204 PVLI-GEPGVGKTAIAEGLAQQ 224
++I GE G GK +A L
Sbjct: 162 TLMITGESGTGKELVARALHDY 183


36  AAT16_00375  AAT16_00435  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
AAT16_00375  3       28       -3.273606   3-ketoacyl-ACP reductase
AAT16_00385  3       26       -2.461064   GntR family transcriptional regulator
AAT16_00390  4       28       -1.742047   transporter
AAT16_00395  4       23       -0.616162   membrane protein
AAT16_00400  6       24       -1.845117   NAD-dependent epimerase
AAT16_00410  4       21       -0.890638   universal stress protein
AAT16_00415  3       20       -0.201581   hypothetical protein
AAT16_00420  0       17       0.129081    hypothetical protein
AAT16_00425  -1      12       0.500489    hypothetical protein
AAT16_00430  -1      12       0.339287    choloylglycine hydrolase
AAT16_00435  -1      12       0.709621    epidermal surface antigen
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_00375  DHBDHDRGNASE  121    2e-35    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 121 bits (304), Expect = 2e-35
Identities = 70/257 (27%), Positives = 125/257 (48%), Gaps = 17/257 (6%)

Query: 3 RTVLVTGSGRGLGSYIVKALSEKGFNVI-INYNNSKEES-EKLKKEIGSQAIAIQADITD 60
+ +TG+ +G+G + + L+ +G ++ ++YN K E K A A AD+ D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 61 REAVEQLVKKGTEHFGQIDVVVNNALVNFKFDPTTQKAFKDLTYKDYEQQLDGTLKAAFN 120
A++++ + G ID++VN A V + L+ +++E FN
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGV-LRPGLIHS-----LSDEEWEATFSVNSTGVFN 122

Query: 121 VSQSVIPQFLERKDGAIISIGTNLYQNPVVPYHEYTTAKAALIGFTRNVAAELGQHGIRA 180
S+SV ++R+ G+I+++G+N P Y ++KAA + FT+ + EL ++ IR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 181 NVVSGGLLKTT---------DASAVTTPEVFDLIAQSTPLRKVTTPQDVANMVVYLCSEA 231
N+VS G +T + + + PL+K+ P D+A+ V++L S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 232 ADGITGQNITVDGGLTM 248
A IT N+ VDGG T+
Sbjct: 243 AGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_00400  NUCEPIMERASE  28     0.028    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 27.8 bits (62), Expect = 0.028
Identities = 13/30 (43%), Positives = 15/30 (50%)

Query: 1 MKVFVFGGNEGAGEHVLKKLAAKGHEAVTI 30
MK V G G HV K+L GH+ V I
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGI 30


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
AAT16_00415  IGASERPTASE  31     0.004    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.2 bits (70), Expect = 0.004
Identities = 16/76 (21%), Positives = 26/76 (34%)

Query: 84 SNETPQNEVTETAQQEDAPQVTEESNEQTQQVAPNTEQQESAPQVTEETQQQPEQNTQQS 143
E P E + +Q T + + NT + P V E+ +P+ ++S
Sbjct: 1167 DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRS 1226

Query: 144 EDVQAAAPEQNTESSE 159
E T SS
Sbjct: 1227 VRSVPHNVEPATTSSN 1242



Score = 29.6 bits (66), Expect = 0.014
Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 7/87 (8%)

Query: 86 ETPQNEVTETAQQ----EDAPQVTEESNEQTQQVAPNTEQQESAPQVTEETQQQPEQNTQ 141
P T + E++ Q ++ + Q T Q +V +E + + NTQ
Sbjct: 1025 VPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNR---EVAKEAKSNVKANTQ 1081

Query: 142 QSEDVQAAAPEQNTESSEATGGSTKEQ 168
+E Q+ + + T+++E +T E+
Sbjct: 1082 TNEVAQSGSETKETQTTETKETATVEK 1108


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
AAT16_00420  IGASERPTASE  31     0.005    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.8 bits (69), Expect = 0.005
Identities = 23/93 (24%), Positives = 39/93 (41%), Gaps = 5/93 (5%)

Query: 91 EAVETEEAPQVTEEAQPQQQVQEAPQV-----TEEAPQVTEEQPAQNTQQSEDVQAAAPE 145
E +T+E P+VT + P+Q+ E Q E P V ++P T + D + A E
Sbjct: 1115 ETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKE 1174

Query: 146 QSTQSTGGSTKAQFLAAGGTEAMWQNIVMPEST 178
S+ T++ + G + P +T
Sbjct: 1175 TSSNVEQPVTESTTVNTGNSVVENPENTTPATT 1207


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
AAT16_00435  IGASERPTASE  45     6e-07    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 45.1 bits (106), Expect = 6e-07
Identities = 22/177 (12%), Positives = 57/177 (32%), Gaps = 4/177 (2%)

Query: 214 ARVKKEAEIAESENRRETEIQQAKDNEDISNEQYKREMNIAESRKEKDIKDAKILAETEK 273
++ + S N + +A + +AE+ K++ K E +
Sbjct: 1001 NNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEK--NEQDA 1058

Query: 274 ENAAARAAGQLEEEERRLEVERQRLEIREQEKQNELKLRQMERENDVQ--LEKQQVEVRR 331
A+ +E + ++ Q E+ + + + +E EK +VE +
Sbjct: 1059 TETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEK 1118

Query: 332 QQAEADYYAQTKDAEARAESRMAEGKAEAEVIREKSMAEAEAIERRAKAMAEHKDVI 388
Q +Q + ++E+ + + E ++ E ++ +
Sbjct: 1119 TQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKET 1175



Score = 32.0 bits (72), Expect = 0.007
Identities = 36/274 (13%), Positives = 84/274 (30%), Gaps = 21/274 (7%)

Query: 210 RPQIARVKKEAEIAESENRRETEIQQAKDNEDISNEQYKREMNIAESRKEKD----IKDA 265
P A + E +++E++ + + + RE+ K + A
Sbjct: 1027 PPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVA 1086

Query: 266 KILAETEKENAAARAAGQLEEEERRLEVERQRLEIREQEKQNELKLRQMEREN-----DV 320
+ +ET++ E+E + +VE ++ + + +++ +Q + E +
Sbjct: 1087 QSGSETKETQTTETKETATVEKEEKAKVETEKTQ-EVPKVTSQVSPKQEQSETVQPQAEP 1145

Query: 321 QLEKQQVEVRRQQAEADYYAQTKDAEARAESRMAEGKAEAEVIREKSMAEAEAIERRAKA 380
E ++ + A+ S E + E E A
Sbjct: 1146 ARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPA 1205

Query: 381 MAEHKDVIILEKLIEIMPEFAKAVSDSMSNVESIRVLDSGSGDQLQSLPNTV-TGTMAKL 439
+ + ++V NVE S + +L + T T A L
Sbjct: 1206 TTQPTVNSESSNKPKN--RHRRSVRSVPHNVEPATT--SSNDRSTVALCDLTSTNTNAVL 1261

Query: 440 QESMGQM------TGFDLENFLGNLSSSEEADFS 467
++ + G + + L + E ++
Sbjct: 1262 SDARAKAQFVALNVGKAVSQHISQLEMNNEGQYN 1295


37  AAT16_01090  AAT16_01125  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
AAT16_01090  0       13       -1.370107   MFS transporter
AAT16_01095  -1      14       -2.651521   formate dehydrogenase
AAT16_01100  0       18       -4.068931   transporter
AAT16_01105  -1      16       -3.908774   hypothetical protein
AAT16_01110  -2      14       -3.324169   glutamine synthetase
AAT16_01115  -2      15       -3.665532   hypothetical protein
AAT16_01120  -3      13       -2.303330   hypothetical protein
AAT16_01125  -2      14       -1.489870   magnesium chelatase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_01090  TCRTETA  48     4e-08    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 47.9 bits (114), Expect = 4e-08
Identities = 67/347 (19%), Positives = 125/347 (36%), Gaps = 24/347 (6%)

Query: 60 AAFMGHFVEAKGPRISGLVSTLFFASGMAVAGLAVQLESLILLYFGYGVLGGIGLGIGY- 118
A +G + G R LVS A A+ A L L + G+ G G G
Sbjct: 60 APVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAY 119

Query: 119 ---ITPVSTLVKWFPDRRGMATGLAIMGFGFAAMLASPAMEWLIVNVSIAGTFYILAVIY 175
IT + F G + A GFG M+A P + L+ S F+ A +
Sbjct: 120 IADITDGDERARHF----GFMS--ACFGFG---MVAGPVLGGLMGGFSPHAPFFAAAALN 170

Query: 176 FVVMIASSLYLERPPEGYEPEGMNLDEKVTAKKDIVQLTANEAVRTRRFYFLWSMLFLNV 235
+ + L PE ++ E L + A + + L ++ F+
Sbjct: 171 GLNFLTGCFLL---PESHKGERRPLRRE--ALNPLASFRWARGMTV--VAALMAVFFIMQ 223

Query: 236 TCGIAILAVASPMAQEIAGLSAGAAAVMVGIMGVFNGGGRLVWAS-ISDYIGRPNLYSLF 294
G A+ ++ A + + G+ + + + ++ +G L
Sbjct: 224 LVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLG 283

Query: 295 FIIQIALFLLLPSVSHALVFQAMLFVIISCYGGGFSAIPAYIGDIFGTKQLGAIHGYILT 354
I ++LL + + + V+++ G G A+ A + ++ G + G +
Sbjct: 284 MIADGTGYILLAFATRGWMA-FPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAA 342

Query: 355 AWAAAGLVGPFISSTVYEAT-QSYTLTLYIFGALFIAALAISILIRG 400
+ +VGP + + +Y A+ ++ +I GA L + L RG
Sbjct: 343 LTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALY-LLCLPALRRG 388


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_01095  NUCEPIMERASE  30     0.041    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.8 bits (67), Expect = 0.041
Identities = 11/33 (33%), Positives = 17/33 (51%)

Query: 698 IQEGEPIVIYNVNGVFQGFARIADIKAGNIGIQ 730
+ EG+ I +YN + + F I DI I +Q
Sbjct: 199 MLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_01100  FIMREGULATRY  31     0.002    Escherichia coli: P pili regulatory PapB protein si...
		>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein signature.

Length = 104

Score = 31.4 bits (71), Expect = 0.002
Identities = 16/39 (41%), Positives = 23/39 (58%)

Query: 181 LFLIVIIRSVTLPGAMEGIKFFLTPDFSLISSEGILYAL 219
FL+ I SV LPG+M + FFL S I S+ ++ A+
Sbjct: 13 AFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAM 51


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
AAT16_01125  HTHFIS  35     3e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 35.2 bits (81), Expect = 3e-04
Identities = 40/165 (24%), Positives = 66/165 (40%), Gaps = 25/165 (15%)

Query: 15 GKVIIGH----EKVIELVFVSMLQKGHILFESVPGTGKTMLSKAV---AKAIGGSFKRIQ 67
G ++G +++ ++ M ++ GTGK ++++A+ K G F I
Sbjct: 136 GMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAIN 195

Query: 68 ---FTPDVLPSDITG------LNIYNPKTQEFELRRGPVDTDILLADEINRATPRTQSAL 118
D++ S++ G T FE G L DEI Q+ L
Sbjct: 196 MAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGT----LFLDEIGDMPMDAQTRL 251

Query: 119 LEVMEEKQVTIDGERIPVSEPF-IVLATQNPIES--KQGTF--DL 158
L V+++ + T G R P+ IV AT ++ QG F DL
Sbjct: 252 LRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDL 296


38  AAT16_01565  AAT16_01600  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
AAT16_01565  0       14       0.137645    hypothetical protein
AAT16_01570  1       12       0.673498    LuxR family transcriptional regulator
AAT16_01575  1       13       1.016033    hypothetical protein
AAT16_01580  -1      13       0.788219    nitrate/nitrite transporter
AAT16_01585  -2      14       0.040961    peptide ABC transporter ATP-binding protein
AAT16_01590  -2      15       0.154940    peptide ABC transporter ATP-binding protein
AAT16_01595  -2      13       -0.021259   diguanylate cyclase
AAT16_01600  -2      14       -0.427493   peptide ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_01565  PF06580  42     2e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.8 bits (98), Expect = 2e-06
Identities = 20/108 (18%), Positives = 45/108 (41%), Gaps = 15/108 (13%)

Query: 244 RFDTEIETALYR------IIQESVFNAMKYA-----NVDAVDVTLMTREDYLEVIVEDEG 292
+F+ +I A+ ++Q V N +K+ + + + + VE+ G
Sbjct: 241 QFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300

Query: 293 EGFDMHSSPQGSGLGLFGMRERAEAIGG---TLSIKSIVGRGTKITLI 337
+ ++ + +G GL +RER + + G + + G+ + LI
Sbjct: 301 SLA-LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
AAT16_01570  HTHFIS  63     6e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.3 bits (154), Expect = 6e-14
Identities = 22/115 (19%), Positives = 53/115 (46%), Gaps = 3/115 (2%)

Query: 2 KIVIADDHSVVRSGFSMIINYQKDMEVVATAGDGLEAYRMVQKYEPDIILMDISMPPGES 61
I++ADD + +R+ + ++ + +V + +R + + D+++ D+ MP E+
Sbjct: 5 TILVADDDAAIRTVLNQALS-RAGYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMP-DEN 61

Query: 62 GLIATGKISQDFPDTKIIILTMYDDEEYLFHSLKNGAKGYVLKSAPDAELLDAIR 116
+I + PD +++++ + + + GA Y+ K EL+ I
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
AAT16_01575  FLAGELLIN  32     0.001    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 31.6 bits (71), Expect = 0.001
Identities = 14/44 (31%), Positives = 23/44 (52%), Gaps = 3/44 (6%)

Query: 22 MHEITKWFERMRELEAGGGAGTQ---ELGALMDHVAQSKEIINR 62
++EI +R+REL GT +L ++ D + Q E I+R
Sbjct: 81 LNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDR 124


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_01580  TCRTETB  42     3e-06    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 42.2 bits (99), Expect = 3e-06
Identities = 70/384 (18%), Positives = 136/384 (35%), Gaps = 71/384 (18%)

Query: 24 IPFISEDVNIPAEQVAIITAVPVILGSVLRIPLGYYANVIGARKVFIASFILLLFPIYYI 83
+P I+ D N P + ++ S+ G ++ +G +++ + I+ F
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 84 SNTSSHIDLLIGG-FFLGIGGAMF-SVGVTSLPKYYPKEKHGLINGIYG-VGNIGTAIAS 140
S LLI F G G A F ++ + + +Y PKE G G+ G + +G +
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 141 FSAPVLAVQIGWQNTIRLLLVVLIVFIVINIFFGDRQEKLVKQPLFGQIKGII------- 193
++A I W + L+ ++ I+ + + ++E +K IKGII
Sbjct: 157 AIGGMIAHYIHWSYLL-LIPMITIITVPFLMKL-LKKEVRIKGHF--DIKGIILMSVGIV 212

Query: 194 -------NNEK----LWVISLWYF------------------------------ITFGSF 212
+ + V+S F I FG+
Sbjct: 213 FFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTV 272

Query: 213 VAFTVFLPNFLITNYGIDNVDAGIRTAGFIALAT----FIRPLGGFIGDKFDPL----IA 264
F +P + + + + G + I T +GG + D+ PL I
Sbjct: 273 AGFVSMVPYMMKDVHQLSTAEIG---SVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIG 329

Query: 265 LIFTFLGITIGGIILAFSPTFMLFSIGCLL--VAATAGIGNGLVFKLVPQYFNKQAGIAN 322
+ F + +L + FM I +L ++ T + + +V + Q ++AG
Sbjct: 330 VTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQ---QEAGAGM 386

Query: 323 GFVSMMGGLGGFFPPLILTLIHAI 346
++ L I+ + +I
Sbjct: 387 SLLNFTSFLSEGTGIAIVGGLLSI 410


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
AAT16_01600  RTXTOXINA  29     0.026    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 29.2 bits (65), Expect = 0.026
Identities = 32/110 (29%), Positives = 52/110 (47%), Gaps = 14/110 (12%)

Query: 148 AVLQYYLA--LQAGWFPIAGWNGFIYSILPAIALASTPMAFIA---KLTRSSMLEETNSE 202
+ QY +A G A G I S A+ LA +P++F++ K R++ +EE +
Sbjct: 286 GISQYIIAQRAAQGLSTSAAAAGLIAS---AVTLAISPLSFLSIADKFKRANKIEEYSQR 342

Query: 203 YVKMAKAKGISRWAVVFKH--ALRNALLPVVTYLAPLTAGI---ITGSFV 247
+ K+ G S A K A+ +L + T LA +++GI T S V
Sbjct: 343 FKKLG-YDGDSLLAAFHKETGAIDASLTTISTVLASVSSGISAAATTSLV 391


39  AAT16_01620  AAT16_01655  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
AAT16_01620  0       15       -1.075926   MFS transporter
AAT16_01625  1       17       -1.770966   MFS transporter
AAT16_01630  1       17       -2.412739   membrane protein
AAT16_01635  -1      17       -1.734565   membrane protein
AAT16_01640  -1      18       -1.286060   hypothetical protein
AAT16_01645  0       16       -1.045776   luciferase
AAT16_01650  -1      14       -0.240453   MFS transporter
AAT16_01655  0       13       0.411522    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_01620  TCRTETA  56     1e-10    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 55.6 bits (134), Expect = 1e-10
Identities = 69/371 (18%), Positives = 131/371 (35%), Gaps = 34/371 (9%)

Query: 4 PSRNIIIAVFMVGTFAIGMTEY--VVTGLLTQFAADLDVAIATTGLLLSVYAISVTIFGP 61
P+R +I+ + V A+G+ V+ GLL DV A G+LL++YA+ P
Sbjct: 3 PNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVT-AHYGILLALYALMQFACAP 61

Query: 62 IVRLATLKFSPKLLLIILVSIFLISNIVAATAPNFEVLLFSRLLSASMHAPFFGLTMSLA 121
++ + +F + +L++ ++ + + ATAP VL R+++ A +A
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA 121

Query: 122 MAISPPHKKTASIAAVNGGLTIAIMLGVPFGSFVGAALDWRLVFWIIAVLGLTTLIGIIL 181
I+ ++ ++ ++ G G +G F+ A L +
Sbjct: 122 -DITDGDERARHFGFMSACFGFGMVAGPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCF 179

Query: 182 TTP-------NYRPKDIPKISKELSVIKNKNVLMTIFVIVFGFSGV----------FTAY 224
P ++ + V+ + + F V F
Sbjct: 180 LLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGED 239

Query: 225 TFMEPMLRQITGFGTAGITISMFLFG-LGAVAGNFTAGTVQPSLLTSRIIM-TMGALGIV 282
F + I IS+ FG L ++A G V L R +M M A G
Sbjct: 240 RF---------HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTG 290

Query: 283 LFIFTFMLQMPVLAYAASLLFGMGTFGTTPILNSKIIFAAKEAPALSGTLAASVFNLANS 342
+ F + + A+ +L G G + +E A++ +L +
Sbjct: 291 YILLAFATRGWM-AFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSI 349

Query: 343 IGATLGSALLN 353
+G L +A+
Sbjct: 350 VGPLLFTAIYA 360


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_01625  TCRTETA  34     0.001    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 33.6 bits (77), Expect = 0.001
Identities = 40/287 (13%), Positives = 95/287 (33%), Gaps = 18/287 (6%)

Query: 37 DTDAAAISMLISAIGIGKLFGLSFAGKLSDSLGRKPMVITAGILYVIFLIAVPFSPTYGI 96
+ A +L++ + + G LSD GR+P+++ + + + +P +
Sbjct: 39 NDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWV 98

Query: 97 AFAFALLAGMGNSILDTSTYPALIEGFPKRASSATVLVKAFMSIGATILPLMITFFIAKE 156
+ ++AG+ T A+ + + + F + A M+ +
Sbjct: 99 LYIGRIVAGI------TGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGG 152

Query: 157 LFYGYT----FFIMAFVFLINAVHLTTVKFPKANMVVVEPGTDNNGENKKKAEPHFAVKP 212
L G++ FF A + +N + P+++ + ++ P + +
Sbjct: 153 LMGGFSPHAPFFAAAALNGLNFL-TGCFLLPESH------KGERRPLRREALNPLASFRW 205

Query: 213 RFWREGVTVIFIGFTSVSLF-MIIQTWMATFAEEIIGMDESTAINLLSYYSFGGFITVIL 271
V + F + L + F E+ D +T L+ + + +
Sbjct: 206 ARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAM 265

Query: 272 LATMLDRVFRPVTILILYPLIAIAALSALLFVTNYYVLVVIAFILGL 318
+ + L+L + L F T ++ I +L
Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLAS 312


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_01650  TCRTETA  33     0.002    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.9 bits (75), Expect = 0.002
Identities = 48/285 (16%), Positives = 99/285 (34%), Gaps = 14/285 (4%)

Query: 41 DMPSLLGIVLIVITVPRLIMMTYGGILADNYKKSTIMFGTNSAQAV--LLLCITLLVWND 98
D+ + GI+L + + + G L+D + + ++ + + AV ++ +W
Sbjct: 40 DVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLW-- 97

Query: 99 AMTLMALLSFAGLFGMLDAFFGPASTSLLPKIVDRPQLQKANAYFQGVDQVSFILGPVLA 158
L AG+ G G + + + I D + + + + GPVL
Sbjct: 98 --VLYIGRIVAGITGAT----GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG 151

Query: 159 GMIMEVFDVSISFFVAFILVSLSALIILPPFIKEAAVENKVKQSQVENLKEGFNYVRQSN 218
G++ FF A L L+ L + E + + + N F + R
Sbjct: 152 GLMGGF-SPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMT 210

Query: 219 FLLIGMLILITLNFFVFGTLHIAIPLLVDVYGGTPINLSYMEMSLSIGMVLGTLILGRYI 278
+ M + + + + D + + + I L ++ +
Sbjct: 211 VVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPV 270

Query: 279 IAKKG--RMSLYGLLATVIFYIIFSFM-DNLTLLPIMLLFIGFAM 320
A+ G R + G++A YI+ +F PIM+L +
Sbjct: 271 AARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGI 315


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_01655  HTHTETR  79     1e-20    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 79.3 bits (195), Expect = 1e-20
Identities = 29/166 (17%), Positives = 62/166 (37%), Gaps = 6/166 (3%)

Query: 1 MDTKEKILDVGRQLFASYGYEGTTMTMIAGGVEIKKPSLYAHYTSKEQIFKDVLDKEVAD 60
+T++ ILDV +LF+ G T++ IA + + ++Y H+ K +F ++ + ++
Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69

Query: 61 YITFLHEAAAADDSSIKEKLYRLLVEHALDDEASMNFYYRFIKY-----QPAGLEEYIVG 115
E A L R ++ H L+ + ++ + G +
Sbjct: 70 IGELELEYQAKFPGDPLSVL-REILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128

Query: 116 SFAEMESETEKIFEMILNQGKEQGEIDKSLSNTQIYRMYFLLVDGL 161
+ + E+ E L E + L + + + GL
Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL 174


40  AAT16_02090  AAT16_02105  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
AAT16_02090  -2      13       2.430167   histidine kinase
AAT16_02095  -2      12       3.428271   hypothetical protein
AAT16_02100  -3      13       2.368203   ABC transporter ATP-binding protein
AAT16_02105  -2      12       2.103328   ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_02090  PF06580  54     3e-10    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 54.1 bits (130), Expect = 3e-10
Identities = 60/360 (16%), Positives = 133/360 (36%), Gaps = 63/360 (17%)

Query: 27 FYFIFRSISLWEIVVGIVITILFFAVYW--LTFNSRGALIYIGLSLEFIINIAMTVLFGY 84
F ++ S L ++ I I+++ + +F R + + + + + V+ G
Sbjct: 30 FASLYGSPKLHSMIFNIAISLMGLVLTHAYRSFIKRQGWLKLNMGQIILRVLPACVVIGM 89

Query: 85 VYFALFIAFYVGNIRSKAGFISMYVIHLVLTVGAIIFAFFINYVLFLSHLPFLIMTILGV 144
V+F + + L+ + AF + L + ++ + +
Sbjct: 90 VWFVANTSIW----------------RLLAFINTKPVAFTLPLALSIIFNVVVVTFMWSL 133

Query: 145 ILIPLNKYNRLKQEALEVKLEDANQRIAELAIVEER---HRIARDLHDTLGQKLSMIGLK 201
+ + + KQ ++ + + A+L ++ + H + + L
Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHF----MFNAL---------- 179

Query: 202 GELSRKLMDTDTEKAKKELQDIQNTARHALKEVREMISDMKNVNLKEELAHVKMILETAG 261
R L+ D KA++ L + R++L+ S+ + V+L +EL V L+ A
Sbjct: 180 -NNIRALILEDPTKAREMLTSLSELMRYSLRY-----SNARQVSLADELTVVDSYLQLAS 233

Query: 262 IRH------DIQVETEFKDIPMLTESVLSMSLKEAVTNVVKH---SKAKQCSVMLT--ET 310
I+ + Q+ D+ V M ++ V N +KH + ++L +
Sbjct: 234 IQFEDRLQFENQINPAIMDVQ-----VPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKD 288

Query: 311 NKDIFLKISDDGR---NPQALEFGNGLQGMRERLTFVNGE---FEVFHSENGFEINITVP 364
N + L++ + G G GLQ +RERL + G ++ + + +P
Sbjct: 289 NGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
AAT16_02095  HTHFIS  52     2e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 52.1 bits (125), Expect = 2e-10
Identities = 27/161 (16%), Positives = 57/161 (35%), Gaps = 8/161 (4%)

Query: 2 IRIVIAEDQNLLLGALG-ALLDLEEDITVVGKAANGEEVLELVRETRPDICLMDIEMPVM 60
I++A+D + L AL D+ + N + + D+ + D+ MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITS---NAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 TGLDAAEQLKAED--CKVIILTTFARPGYFERAKKANVRGYLLKDSPSETLANSIRQIMK 118
D ++K V++++ +A + YL K L I + +
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 119 GKRIYSPELIDIAFESENPL--TPREMEIIQLLGEGKKTKA 157
+ +L D + + + + EI ++L +T
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDL 161


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_02100  PF05272  31     0.004    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.2 bits (70), Expect = 0.004
Identities = 11/20 (55%), Positives = 13/20 (65%)

Query: 38 LLGPSGAGKTTLVRELAGLD 57
L G G GK+TL+ L GLD
Sbjct: 601 LEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_02105  ABC2TRNSPORT  61     1e-12    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 60.7 bits (147), Expect = 1e-12
Identities = 45/180 (25%), Positives = 86/180 (47%), Gaps = 9/180 (5%)

Query: 163 FLTAGVSFIRERTTGTLERLLSTPIRKWEIVMGYLIGFALFTVLQSAIIAWYAIYILDML 222
F T +F R T E +L T +R +IV+G + A L A I A L
Sbjct: 84 FETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAA----L 139

Query: 223 MVGVFIDVLWIILALALTAL---TLGILVSSFANNEFQMIQFIPIVVVPQIFFSG-LFNL 278
++ +L+ + +ALT L +LG++V++ A + I + +V+ P +F SG +F +
Sbjct: 140 GYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPV 199

Query: 279 DTISDLLSWIGPLTPLYYAAESLRDVMIRGYGWSDIYMNLLILLLFSLIFIVLNILVLRK 338
D + + PL ++ + +R +M+ D+ ++ L ++ +I L+ +LR+
Sbjct: 200 DQLPIVFQTAARFLPLSHSIDLIRPIMLGHPV-VDVCQHVGALCIYIVIPFFLSTALLRR 258


41  AAT16_03960  AAT16_03990  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
AAT16_03960  0       23       3.148627   isoaspartyl dipeptidase
AAT16_03965  0       25       2.623945   hypothetical protein
AAT16_03970  2       31       3.671476   acetylglucosaminyldiphospho-UDP
AAT16_03975  2       29       3.114386   hypothetical protein
AAT16_03980  0       26       1.104994   teichoic acid ABC transporter permease
AAT16_03985  0       25       2.369393   CDP-glycerol:glycerophosphate
AAT16_03990  1       24       1.867315   glycerol-3-phosphate cytidylyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
AAT16_03960  UREASE  44     5e-07    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 44.3 bits (105), Expect = 5e-07
Identities = 32/124 (25%), Positives = 49/124 (39%), Gaps = 34/124 (27%)

Query: 5 IKNGDIYAPEHVGKKSVLLNGRIIIKIGDIDEEQLGRLFDVEVIDAEGMIVSPGIIDPHV 64
+K+G I A G + II+ G EVI EG IV+ G +D H+
Sbjct: 90 LKDGRIAAIGKAGNPDMQPGVTIIVGPG------------TEVIAGEGKIVTAGGMDSHI 137

Query: 65 HLIGGGGEGGFATRTPELQLSNIIKAGVTTVVG-----CLGTDGTT-----RHMTSLLAK 114
H I P+ Q+ + +G+T ++G GT TT H+ ++
Sbjct: 138 HFI-----------CPQ-QIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMIEA 185

Query: 115 ARAL 118
A A
Sbjct: 186 ADAF 189


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
AAT16_03965  HTHFIS  151    4e-42    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 151 bits (383), Expect = 4e-42
Identities = 79/388 (20%), Positives = 146/388 (37%), Gaps = 40/388 (10%)

Query: 90 MQKTSKFMVVNDNPA--ATLETI---EDLENVLPDH--DFLPYMAHEPMPENFDFIIT-- 140
M +V +D+ A L + + + ++A D ++T
Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD----GDLVVTDV 55

Query: 141 --PGEANLVPTKAYQTFDIGARVVSIE---TVMELKEIFELEMKDSLLMQYYIKTMVHLT 195
P E + V+ + T M + E D L + + ++ +
Sbjct: 56 VMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115

Query: 196 AKRSENTPVSIADQNKN-RTFSGISTESPQMQSTIRIASQMAKTSNIIHITGETGTGKQM 254
+ + + + + S MQ R+ +++ +T + ITGE+GTGK++
Sbjct: 116 GRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKEL 175

Query: 255 LAEMIHNDSAYHDMPFYIYSGADKDPQSIDNELFG-------GEGEKHQGILREVNRGTV 307
+A +H+ + PF + A I++ELFG G + G + GT+
Sbjct: 176 VARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTL 235

Query: 308 YIKNIDSIPYQLQNKLANYFDANA----GSS-----DVRIVTSSIDDLWELYKGDIISQK 358
++ I +P Q +L G DVRIV ++ DL + + +
Sbjct: 236 FLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFRED 295

Query: 359 LYSYLSSYILKVPSISERKEDIPVLIDDFKNHFNRTEMQ---FSERVMNAFVRYDWPGNV 415
LY L+ L++P + +R EDIP L+ F + + F + + + WPGNV
Sbjct: 296 LYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNV 355

Query: 416 RELYNLISYCVCLNQ-KYVEIDSLPIFF 442
REL NL+ L + + +
Sbjct: 356 RELENLVRRLTALYPQDVITREIIENEL 383


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_03980  PF06580  29     0.018    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.1 bits (65), Expect = 0.018
Identities = 15/71 (21%), Positives = 22/71 (30%)

Query: 37 IVWEVLTPVISIMIYWFVFGTLRQRAPIEMGGTEVPFFYWLAIGFIVWTFFFQGSIEASK 96
I+ VL + I + WFV T R + V F LA+ I
Sbjct: 76 IILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLALSIIFNVVVVTFMWSLLY 135

Query: 97 SIYRRLKMLSK 107
+ K +
Sbjct: 136 FGWHFFKNYKQ 146


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_03990  LPSBIOSNTHSS  37     6e-06    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 37.1 bits (86), Expect = 6e-06
Identities = 27/121 (22%), Positives = 50/121 (41%), Gaps = 16/121 (13%)

Query: 14 KVITYGTFDLLHMGHINILRRAKERGDYLVVAVSSDEFNKLKHKEAYYSYEDR-KAILEA 72
I G+FD + GH++I+ R D + VAV + +K+ +S ++R + I +A
Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRN-----PNKQPMFSVQERLEQIAKA 56

Query: 73 IKYVDEVIPEHNWGQKVKDVQKHDIDVFVMG----DDWKGEFDF------LKEYCEVVYL 122
I ++ + G V ++ + G D++ E L E V+L
Sbjct: 57 IAHLPNAQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFL 116

Query: 123 A 123

Sbjct: 117 T 117


42  AAT16_04660  AAT16_04695  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
AAT16_04660  0       18       3.957063   hypothetical protein
AAT16_04665  -1      15       3.834797   hypothetical protein
AAT16_04670  -1      15       3.963489   hypothetical protein
AAT16_04675  -2      11       1.910353   major facilitator transporter
AAT16_04680  -2      11       0.724237   hypothetical protein
AAT16_04685  -2      13       -0.468815  PadR family transcriptional regulator
AAT16_04690  -2      10       0.946326   ChrA protein
AAT16_04695  0       11       0.122661   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_04660  DHBDHDRGNASE  30     0.001    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 29.6 bits (66), Expect = 0.001
Identities = 12/46 (26%), Positives = 25/46 (54%)

Query: 42 GKEFVSVNPMKRFGEPEEVGNLVTFLLSNEATFSNAAVIPIDGGQS 87
+ F + P+K+ +P ++ + V FL+S +A + +DGG +
Sbjct: 213 LETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITMHNLCVDGGAT 258


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits      Score  E-value  Comments
AAT16_04665  adhesinb  168    4e-50    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 168 bits (426), Expect = 4e-50
Identities = 68/316 (21%), Positives = 124/316 (39%), Gaps = 12/316 (3%)

Query: 1 MFRRSLWFLSAMSVIILTACGAASPEESEGSGKIEVYTTVFALQSLTEQIAGDNAEVHSI 60
M + L ++ + L AC + GS K+ V T + +T+ IAGD +HSI
Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSI 60

Query: 61 YPNGTDIHSYEPTQKDMLSYAESDLFITTNKELDAVSGKIADVLNEDIEILEAVGDTGHL 120
P G D H YEP +D+ +++DL L+ + +E + + +
Sbjct: 61 VPVGQDPHEYEPLPEDVKKTSQADLIFYNGINLE---TGGNAWFTKLVENAKKKENKDYY 117

Query: 121 LEDTHSHDHGEGDDHDHSHGEIDPHVWLDPVLSIDMAEAIKDKLSTLDPDNAEAYEENFE 180
+ E DPH WL+ I A+ I +LS DP N E YE+N +
Sbjct: 118 AVSEGVDVI-YLEGQSEKGKE-DPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLK 175

Query: 181 TVKADLEELD----ASLESVTEDSKVKNVYISHESIGYLANRYGFTQHGVSGMNNE-EPT 235
L LD ++ + K+ + S Y + Y + +N E E T
Sbjct: 176 AYVEKLSALDKEAKEKFNNIPGEKKM--IVTSEGCFKYFSKAYNVPSAYIWEINTEEEGT 233

Query: 236 QKEVIDMVEGLKADGSKYILTEQNISNKVTDIIKDAGGVEQLGFHNLSVLMDEDNPDTDY 295
++ +VE L+ + E ++ ++ + + + ++ Y
Sbjct: 234 PDQIKTLVEKLRKTKVPSLFVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSY 293

Query: 296 QTLMRHNIEVLDRALN 311
++M++N+E + L+
Sbjct: 294 YSMMKYNLEKIAEGLS 309


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_04675  TCRTETB  35     5e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 34.9 bits (80), Expect = 5e-04
Identities = 24/98 (24%), Positives = 42/98 (42%), Gaps = 11/98 (11%)

Query: 66 GGVIFGHIGDRVGRKKTLIITLSLMGIATACIGFLPTYAQIGIAAPILLMLLRLIQGLGI 125
G ++G + D++G K+ L+ + + + ++ + I A R IQG G
Sbjct: 65 GTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMA-------RFIQGAGA 117

Query: 126 GGEWGGALLLATEYAPKEQR----GFFGSVPQMGITIG 159
+++ Y PKE R G GS+ MG +G
Sbjct: 118 AAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVG 155


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_04695  SACTRNSFRASE  34     7e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 34.1 bits (78), Expect = 7e-05
Identities = 16/75 (21%), Positives = 29/75 (38%), Gaps = 2/75 (2%)

Query: 45 LIGGLDAGMTVDKMLYLSTIFVKEKYRGHGVGRRLMYEMEKQAAEIGADLIRLD--SFSW 102
IG + + + I V + YR GVG L+++ + A E + L+ +
Sbjct: 76 CIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINI 135

Query: 103 EGVGFYEKLGYEVIG 117
FY K + +
Sbjct: 136 SACHFYAKHHFIIGA 150


43  AAT16_05140  AAT16_05175  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
AAT16_05140  4       24       1.161409   hypothetical protein
AAT16_05145  4       25       1.265378   pyruvate dehydrogenase
AAT16_05150  3       22       0.308012   2-oxoisovalerate dehydrogenase
AAT16_05155  1       20       0.055675   branched-chain alpha-keto acid dehydrogenase
AAT16_05160  1       18       1.503818   dihydrolipoamide dehydrogenase
AAT16_05165  0       14       2.972975   hypothetical protein
AAT16_05170  0       12       3.424661   hypothetical protein
AAT16_05175  1       14       3.697211   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
AAT16_05140  RTXTOXIND  28     0.034    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 27.9 bits (62), Expect = 0.034
Identities = 36/206 (17%), Positives = 72/206 (34%), Gaps = 13/206 (6%)

Query: 6 AGLVLAMMVTAGCSSDEDELLDFYNAFQKTVEVEKEIETVSEEFDSLESEKGELQESLEN 65
G VL + G +D + + + +I + S E + L K + +N
Sbjct: 120 KGDVLLKLTALGAEADTLKTQSSL-LQARLEQTRYQILSRSIELNKLPELKLPDEPYFQN 178

Query: 66 ASREELPEISAQLVENTDARIEQLDAEVAVMGDSRSRMETSRQYIEEISNGSNREKAESL 125
S EE+ +++ + E Q ++ R + NR + S
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQ-------KELNLDKKRAERLTVLARINRYENLSR 231

Query: 126 VEAM---DVRYKAHGDMIGSYKAVLESEREIFEYLGEEDVSQDEVDERLNSLSEEYQQVE 182
VE D H + AVLE E + E + E V + ++++ + + ++ +
Sbjct: 232 VEKSRLDDFSSLLH-KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQ 290

Query: 183 ENAAAFGEET-EKVNEIKKEIEDVIQ 207
F E +K+ + I +
Sbjct: 291 LVTQLFKNEILDKLRQTTDNIGLLTL 316


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
AAT16_05155  RTXTOXIND  33     0.002    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 33.3 bits (76), Expect = 0.002
Identities = 15/42 (35%), Positives = 23/42 (54%), Gaps = 1/42 (2%)

Query: 36 LAEVQNDKAVVEIPSPVDGTVKKLHV-EEGTVTTVGETIVTI 76
LA+ + + I +PV V++L V EG V T ET++ I
Sbjct: 318 LAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVI 359


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_05170  TCRTETB  31     0.005    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 31.4 bits (71), Expect = 0.005
Identities = 31/152 (20%), Positives = 59/152 (38%), Gaps = 15/152 (9%)

Query: 16 IAEKIFGWLAWLALLAVTGFILFFALVMVNDPAFIESFRQQMQNSL-SQMDTGGVSTEQM 74
+ F L + G F ALVMV +I + L + G
Sbjct: 98 VGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPA 157

Query: 75 TDQMLSMLNSSWMIALYLAVPLILGIFGLLTMRR---RI-----LAGFLLLIAGILTAPM 126
M++ W L + + I+ + L+ + + RI + G +L+ GI+
Sbjct: 158 IGGMIAH-YIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVF--F 214

Query: 127 VIFVITGLIPLFFVIAAI--LLFVRKDRVITH 156
++F + I F +++ + L+FV+ R +T
Sbjct: 215 MLFTTSYSI-SFLIVSVLSFLIFVKHIRKVTD 245


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_05175  HTHTETR  62     5e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.0 bits (150), Expect = 5e-14
Identities = 25/162 (15%), Positives = 46/162 (28%), Gaps = 5/162 (3%)

Query: 1 MGLRETNKERRRSSIIKTAKAFFVEKGFNAVHMQEIADAEGIGIATLFRYFPKKEQLILA 60
+ + R I+ A F ++G ++ + EIA A G+ ++ +F K L
Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 61 AAISIMESEADAFKNILNH----PDKTAYEKIEDCFDYMKGIHISPSANTAKFNDAFQVY 116
+ + P E + + F+ V
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 117 IDNTTEPVENLLPYFEARRKIVDHFLQIIEQGKLDGTLHPER 158
+ + L E+ +I IE L L R
Sbjct: 122 EMAVVQQAQRNLC-LESYDRIEQTLKHCIEAKMLPADLMTRR 162


44  AAT16_05420  AAT16_05445  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
AAT16_05420  0       9        1.207290   hypothetical protein
AAT16_05425  -1      11       0.920448   cell division protein FtsA
AAT16_05430  -1      11       1.429908   cell division protein FtsZ
AAT16_05435  -1      11       1.091844   hypothetical protein
AAT16_05440  -1      11       1.308746   hypothetical protein
AAT16_05445  -1      12       1.505068   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
AAT16_05420  OUTRSURFACE  30     0.013    Outer surface protein signature.
		>OUTRSURFACE#Outer surface protein signature.

Length = 273

Score = 30.3 bits (68), Expect = 0.013
Identities = 15/49 (30%), Positives = 19/49 (38%)

Query: 93 DDEDTGEIETFTRDGKPISVDTFNSSRKTSGKEKRNGAGKDDSSVKKRA 141
DD E F DGK + +S KTS E N G+ + R
Sbjct: 92 DDLSKTTFELFKEDGKTLVSRKVSSKDKTSTDEMFNEKGELSAKTMTRE 140


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_05425  SHAPEPROTEIN  49     1e-08    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 49.4 bits (118), Expect = 1e-08
Identities = 34/154 (22%), Positives = 64/154 (41%), Gaps = 20/154 (12%)

Query: 202 GGVVIDIGADLTQFGYYERGALKYAGSLPVGG----NHITNDLSEAFN--TPFEVAEKVK 255
G +V+DIG T+ + Y+ S+ +GG I N + + AE++K
Sbjct: 160 GSMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIK 219

Query: 256 HQYGHAFFDLASDEDIVKLPQRD---GEP-DIEVTPKDLADIIELRLEEMLLDVFTELQ- 310
H+ G A+ E +++ R+ G P + ++ + ++ L ++ V L+
Sbjct: 220 HEIGSAYPGDEVRE--IEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQ 277

Query: 311 -----EAGITRVSGGFVVTGGTVNLLGVKELLQD 339
+ I G V+TGG L + LL +
Sbjct: 278 CPPELASDI--SERGMVLTGGGALLRNLDRLLME 309


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
AAT16_05440  ALARACEMASE  39     7e-06    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 39.0 bits (91), Expect = 7e-06
Identities = 31/182 (17%), Positives = 61/182 (33%), Gaps = 24/182 (13%)

Query: 2 IKDNYTNIRQEIGDEATIIAVTK---Y-HTVEETLEAYEAGVRDFGENRPEGFLEKRKAL 57
+K N + +RQ A + +V K Y H +E A A F E + R+
Sbjct: 14 LKQNLSIVRQ-AATHARVWSVVKANAYGHGIERIWSAIGA-TDGFALLNLEEAITLRERG 71

Query: 58 PADANVHFIGTLQSRKVKQIAD--------DLYYLHSLDRESIAKKIEQYSNHTVKCFIQ 109
+ G ++ ++ + L +L + ++ I
Sbjct: 72 WKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLD----------IY 121

Query: 110 VNVSGEDSKHGLIPEEVPEFLETLSEYEKIEVIGLMTMAPHTEDRSLISEVFKRLSELKQ 169
+ V+ ++ G P+ V + L + + LM+ E IS R+ + +
Sbjct: 122 LKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGISGAMARIEQAAE 181

Query: 170 KL 171
L
Sbjct: 182 GL 183


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
AAT16_05445  IGASERPTASE  28     0.038    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 27.7 bits (61), Expect = 0.038
Identities = 29/132 (21%), Positives = 48/132 (36%), Gaps = 16/132 (12%)

Query: 13 VEYEEDVPAETSEPQ---KKQSEQNPKVTSFEQSARRRPEPVKADKPTDKKQPKDQNQNV 69
VE EE ET + Q K S+ +PK E + + + EP + + PT + N
Sbjct: 1106 VEKEEKAKVETEKTQEVPKVTSQVSPKQEQSE-TVQPQAEPARENDPTVNIKEPQSQTNT 1164

Query: 70 RGKKMAEGIRGPERRKSAAKRQ--EKDRSTNRNEKETKLMTAETSNTKVCLFEPRVFSET 127
P + S+ Q + + N + T T +P V SE+
Sbjct: 1165 TADTEQ-----PAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATT----QPTVNSES 1215

Query: 128 QDIADELKHERA 139
+ +H R+
Sbjct: 1216 SNKPKN-RHRRS 1226


45  AAT16_07205  AAT16_07230  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
AAT16_07205  -1      16       -2.153468  arginine repressor
AAT16_07210  -1      16       -1.440159  hypothetical protein
AAT16_07215  0       16       -0.315382  membrane protein
AAT16_07220  0       16       -0.153250  phosphate butyryltransferase
AAT16_07225  0       13       0.744136   leucine dehydrogenase
AAT16_07230  0       14       0.567892   butyrate kinase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_07205  ARGREPRESSOR  183    1e-62    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 183 bits (466), Expect = 1e-62
Identities = 87/148 (58%), Positives = 117/148 (79%)

Query: 3 NKSIRQIKIREIISNGKVETQEDLVEKLNVYNFNVTQATVSRDIKELQLIKVPTPSGSYI 62
NK R IKIREII+ ++ETQ++LV+ L +NVTQATVSRDIKEL L+KVPT +GSY
Sbjct: 2 NKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKELHLVKVPTNNGSYK 61

Query: 63 YSMPKDRKFHPLEKLGRYLMDSFVKLDYTGNLLVLKTLPGNAQSIGAIIDQLEWEEVIGT 122
YS+P D++F+PL KL R LMD+FVK+D +L+VLKT+PGNAQ+IGA++D L+WEE++GT
Sbjct: 62 YSLPADQRFNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEEIMGT 121

Query: 123 ICGDDTCLLICRDEEAQLEIKDRIFNLI 150
ICGDDT L+ICR + ++ +I L+
Sbjct: 122 ICGDDTILIICRTHDDTKVVQKKILELL 149


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
AAT16_07210  GPOSANCHOR  30     0.029    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.0 bits (67), Expect = 0.029
Identities = 30/244 (12%), Positives = 75/244 (30%), Gaps = 9/244 (3%)

Query: 155 EEKYKLYKESYKKFKALEEKIQDLEYKDRNRMQQLELYRHQYDELSSMGLVHGEEEQLEE 214
EK +E + LE+ ++ +++ + L++ E+ LE
Sbjct: 109 SEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAA--RKADLEKALEG 166

Query: 215 EISYFNNYEKIHDTLSIMRTQLDSEYSPQVMLYEIHKSIETMSKFDETYTAFTETILESY 274
+++ TL + L++ + + ++ +
Sbjct: 167 AMNFSTADSAKIKTLEAEKAALEARQ--AELEKALEGAMNFSTADSAKIKTLEAEKAALA 224

Query: 275 HLLNELDSKVSGDLSNVDYDEGTYNEKQLRLASINNLKRKYNKTTEELIDLRESLNEDIM 334
+L+ + G ++ D + A++ + + K E ++ + + I
Sbjct: 225 ARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIK 284

Query: 335 QL----ENIAQSFEKLESEKSAALDEMEKLASFLQNYRVERKHFLENRIKKELHDLDMPD 390
L + LE + + L L R +K LE +K + +
Sbjct: 285 TLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQ-LEAEHQKLEEQNKISE 343

Query: 391 ADFE 394
A +
Sbjct: 344 ASRQ 347



Score = 29.6 bits (66), Expect = 0.037
Identities = 22/152 (14%), Positives = 43/152 (28%), Gaps = 12/152 (7%)

Query: 247 YEIHKSIETMSKFDETYTAFTETILESYHLLNELDSKVSGDLSNVDYDEGTYNE------ 300
+ + TE + + L + D +S S + E +
Sbjct: 71 LKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALE 130

Query: 301 -----KQLRLASINNLKRKYNKTTEELIDLRESLNEDIMQLENIAQSFEKLESEKSAALD 355
A I L+ + DL ++L + + + LE+EK+A
Sbjct: 131 GAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 190

Query: 356 EMEKLASFLQNYRVERKHFLENRIKKELHDLD 387
+L L+ +IK +
Sbjct: 191 RQAELEKALEGAM-NFSTADSAKIKTLEAEKA 221


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_07215  PF06580  26     0.033    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 25.6 bits (56), Expect = 0.033
Identities = 12/82 (14%), Positives = 31/82 (37%), Gaps = 3/82 (3%)

Query: 5 IALVILIVPVFLAGLGIKYMRDSMFGVVNDPFTLTVVQFVVGLGLTIFGVWFIGGYILHR 64
+ ++I V+ + + FTL + ++ + + +W + + H
Sbjct: 81 LPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLALSIIFNVVVVTFMWSLLYFGWHF 140

Query: 65 ERKNKRVSERFIEQSRQSRKAQ 86
+ K + I+Q + + AQ
Sbjct: 141 FKNYK---QAEIDQWKMASMAQ 159


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_07230  ACETATEKNASE  173    1e-52    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 173 bits (439), Expect = 1e-52
Identities = 75/341 (21%), Positives = 136/341 (39%), Gaps = 36/341 (10%)

Query: 4 NILVLNLGSTSTKVAIYNNLNS------LAEE--------TLRHPSSETV--KPMPEQIE 47
ILV+N GS+S K + + + LAE T + K M + +
Sbjct: 2 KILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKD 61

Query: 48 YRLKAILGFLTEQNFDPSTIDIVSARGGTLKPIEGGTYNINDQMVTD--------LLESR 99
+ + + + A G + + GG Y + ++TD +E
Sbjct: 62 AIKLVLDALVNSDYGVIKDMSEIDAVGH--RVVHGGEYFTSSVLITDDVLKAITDCIELA 119

Query: 100 YGRHASNMSGLIADRFRNKYDCKAVITDPVVVDELVDEVRMTGL------KGIERKSIFH 153
+ +N+ G+ A + D + D + + K RK FH
Sbjct: 120 PLHNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFH 179

Query: 154 ALNQKAVARKYAESVHKDYEDINVIICHMGGGITAGAHRRGRVIDVNDGLSG-EGPMSPN 212
+ K V+++ AE ++K E + +I CH+G G + A + G+ ID + G + EG
Sbjct: 180 GTSHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGT 239

Query: 213 RTGSLPNGAFAKYVIDHQLDYDSAYELITKKGGFMSLAG-TQDALELEKQAL-SGDGSAI 270
R+GS+ + + + + ++ KK G ++G + D +LE A +GD A
Sbjct: 240 RSGSIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQ 299

Query: 271 AIYEAMAVQIAKEIAARAAILKGETEQIIFTGGLAYSEYLI 311
A ++ K I + AA + G + I+FT G+ + I
Sbjct: 300 LALNVFAYRVKKTIGSYAAAMGG-VDVIVFTAGIGENGPEI 339


46  AAT16_08575  AAT16_08605  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
AAT16_08575  0       15       -3.266791  hypothetical protein
AAT16_08580  1       14       -2.340907  PhoB family transcriptional regulator
AAT16_08585  1       14       -1.656280  hypothetical protein
AAT16_08590  0       12       -0.738531  hypothetical protein
AAT16_08595  -1      11       -0.500027  chloramphenicol resistance protein DHA1
AAT16_08600  -1      12       -1.210375  hypothetical protein
AAT16_08605  0       12       -0.954602  permease
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_08575  PF06580  36     2e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.6 bits (82), Expect = 2e-04
Identities = 21/148 (14%), Positives = 49/148 (33%), Gaps = 25/148 (16%)

Query: 178 ETVDLEQIVNSIIRKFRIICMQKGIGFDVTLHAAEVQTDLKWCTFVLEQVISNSVKY--- 234
V L + + ++ +Q D++ +++ ++ N +K+
Sbjct: 214 RQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIA 273

Query: 235 --TEEADIAITSDLIEGWITLEISDEGRGIRKEDLPRIFEAGFTSTSDHGDAQSTGMGLY 292
+ I + G +TLE+ + G K +STG GL
Sbjct: 274 QLPQGGKILLKGTKDNGTVTLEVENTGSLALKN-----------------TKESTGTGLQ 316

Query: 293 LAHEAAEAMH---IQMRIESEYGRGTTT 317
E + ++ Q+++ + G+
Sbjct: 317 NVRERLQMLYGTEAQIKLSEKQGKVNAM 344


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
AAT16_08580  HTHFIS  81     6e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.0 bits (200), Expect = 6e-20
Identities = 31/137 (22%), Positives = 60/137 (43%), Gaps = 2/137 (1%)

Query: 2 SKILIIEDDETLFSELKMRLEDWDYTVFGVTDFSNVLNDFITVGPDLVIIDITLPKYDGF 61
+ IL+ +DD + + L L Y V ++ + + DLV+ D+ +P + F
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 YWCQRIRNI-SSLPIIFLSSRDHPTDMVMSMQMGSDDYIQKPFNFDVLVAKI-QALLRRT 119
RI+ LP++ +S+++ + + + G+ DY+ KPF+ L+ I +AL
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 120 YQYQNQNIDVVKFRDAV 136
+ D V
Sbjct: 124 RRPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_08595  TCRTETA  64     2e-13    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 64.5 bits (157), Expect = 2e-13
Identities = 74/384 (19%), Positives = 140/384 (36%), Gaps = 28/384 (7%)

Query: 2 NKKLLITFTVGVFLLGMMELIISGILELMSDDLGISH---AMTGQLITVYAVSFAVFGPL 58
N+ L++ + V L + +I +L + DL S+ A G L+ +YA+ P+
Sbjct: 4 NRPLIVILST-VALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPV 62

Query: 59 LVKATEKIRPKPVIIASLILFVIGNVIFGLSSTFLMLSLGRIVTAVAAAVFIVK---IMD 115
L +++ +PV++ SL + I + +L +GRIV + A V I D
Sbjct: 63 LGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIAD 122

Query: 116 MTVLLSRPEIRGKMIALVYMGFSAANVFGIPIGTLIGQQFGWRIIFWLVIVIAILVGI-G 174
+T R G M A G A G +G L+G F F+ + L + G
Sbjct: 123 ITDGDERARHFGFMSACFGFGMVA----GPVLGGLMG-GFSPHAPFFAAAALNGLNFLTG 177

Query: 175 ILSLVPDKKGEDLGEPLPDKILDRRNVFLYIGVTMAVLIGNYIVIGYISP-------LMT 227
L KGE + +A L+ + ++ + +
Sbjct: 178 CFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG 237

Query: 228 SNGFTLKSVSIALLIAGAGGM---TGTYIGGLLVDRIGSKRTIIYMLILFMISMAILPLL 284
+ F + +I + +A G + I G + R+G +R ++ +I +L
Sbjct: 238 EDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297

Query: 285 YGSPALFYTNLFFWSVFQWSTSPSVQSGLVENVQGSAAMVFSWNMSGL-NLGIGIGAVIG 343
F + P++Q+ L V +++ L +L +G ++
Sbjct: 298 TRGWMAFPIMVLL--ASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLF 355

Query: 344 GVYISNFDISYAPWLSVFIVGLGL 367
+ ++ W +I G L
Sbjct: 356 TAIYAASITTWNGW--AWIAGAAL 377


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_08605  PF05775  30     0.009    Enterobacteria AfaD invasin protein
		>PF05775#Enterobacteria AfaD invasin protein

Length = 142

Score = 30.3 bits (68), Expect = 0.009
Identities = 12/37 (32%), Positives = 15/37 (40%), Gaps = 3/37 (8%)

Query: 247 VAIGFIPQIFCYGVMGGLGIWFGVRLAEPNPGVYIVQ 283
+A G +I C G +W R G YIVQ
Sbjct: 44 LATG---RIICQDTHSGFRVWINARQEGGGAGKYIVQ 77


47  AAT16_10060  AAT16_10100  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
AAT16_10060  -2      10       -0.515717  hypothetical protein
AAT16_10065  -2      10       -0.525052  cation transporter
AAT16_10075  0       15       -0.924369  diacetyl reductase
AAT16_10085  -2      14       -1.851511  *membrane protein
AAT16_10090  -1      14       -0.902429  arsenic resistance protein
AAT16_10095  -1      15       -1.734236  hypothetical protein
AAT16_10100  -1      16       -1.864086  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
AAT16_10060  HTHTETR  42     3e-07    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 42.3 bits (99), Expect = 3e-07
Identities = 13/45 (28%), Positives = 25/45 (55%)

Query: 6 RKEETKNNLLDAFWELYKEKPLTKITVKEITDKAGYNRGTFYTYF 50
+ET+ ++LD L+ ++ ++ ++ EI AG RG Y +F
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHF 52


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_10065  ACRIFLAVINRP  611    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 611 bits (1577), Expect = 0.0
Identities = 243/1037 (23%), Positives = 458/1037 (44%), Gaps = 34/1037 (3%)

Query: 3 LSDFSIRRPNFTIVVMIILLLLGAVSLTRLPLQLMPNIEPPIAAVATTYQGAGPEEVMED 62
+++F IRRP F V+ IIL++ GA+++ +LP+ P I PP +V+ Y GA + V +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 63 VTVPIESELSSLSGLTNISSQSQES-SSVVILEFGYDTKIDDVENDIMRAVESA--DLPD 119
VT IE ++ + L +SS S + S + L F T D + + ++ A LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 120 EAGDPSFLKFDISMMPSIQMAVTSSGD--SVAEYQDQVDDLIT-ELENIEGVASITENGS 176
E S + S + + D V + L + GV + G+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 177 VTEEIQVNLDTEALEQYNMSQSDIAGIIEANNISIP----NATVTDTEDRTSISTRTVSE 232
+++ LD + L +Y ++ D+ ++ N I T + + S +
Sbjct: 181 -QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 233 IDGVESLQELVLAELPDDGGTITLDDVAEVSIEEQSSNTLTRMNQEEALSIDVMLASDAN 292
E ++ L D G + L DVA V + ++ N + R+N + A + + LA+ AN
Sbjct: 240 FKNPEEFGKVTLRVNSD-GSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298

Query: 293 ASNVNKEFNAVLDEKLDEEEFSNLTVETLYDEGEYIDIAINSVYTSLISGAVLAMIVLFA 352
A + K A L +L + V YD ++ ++I+ V +L +L +V++
Sbjct: 299 ALDTAKAIKAKL-AELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYL 357

Query: 353 FLRNLKAPLIIGISIPFSVITTFALLFFTDISINMMTLGGLALGIGMLVDNAVVVIENIY 412
FL+N++A LI I++P ++ TFA+L SIN +T+ G+ L IG+LVD+A+VV+EN+
Sbjct: 358 FLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVE 417

Query: 413 RHLSMGK-KPKQAASEGTKEVASAIIASTLTTAAVFLPVVFVSGLVGQLFTPFAITVAFS 471
R + K PK+A + ++ A++ + +AVF+P+ F G G ++ F+IT+ +
Sbjct: 418 RVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSA 477

Query: 472 LLGSLFIALTVVPMLASRILTAPDENMEKIRS------ERSYMRMLRKFTR---WSLNHR 522
+ S+ +AL + P L + +L + + ++ + +T L
Sbjct: 478 MALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGST 537

Query: 523 VLVLILTTLLLIVSALGIYNQGINLMPESDEGALTIEIEKEQGTIFEDTFDTVENIENEL 582
L++ L++ + + +PE D+G I+ G E T ++ + +
Sbjct: 538 GRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYY 597

Query: 583 KDYPEVDTYLSNIGSSSPMMSMSEEPNKASITATLVDPADRSVTTNEF---INDIEDEIE 639
+ + + + + + N +L +R+ N I+ + E+
Sbjct: 598 LKNEKANVES--VFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELG 655

Query: 640 KIDDSAEINIVPMSQSGMG---GEPNTLMLNVSDDSADRLAESEQTIIQALEDDEKIESV 696
KI D I + +G G L+ Q + A + + SV
Sbjct: 656 KIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSV 715

Query: 697 ESSREEMVQELQVQVDRAAARENGLQPAQVGSALYEASNGVQATTVENNNEFLSIVVKYP 756
+ E + +++VD+ A+ G+ + + + A G + + V+
Sbjct: 716 RPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQAD 775

Query: 757 DDVLSSMENFRDIQIANSEGEYVALSEVAELEEVDMLPMITRDSMEETSELTVTYASDMS 816
E+ + + ++ GE V S V P + R + + E+ A S
Sbjct: 776 AKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTS 835

Query: 817 LNEAGTYVENIIEDADFSDDTHYSIGGDLEMLTDAMPQMLLALILGVLFIYLVMVAQFES 876
+A +EN+ Y G + Q + + + ++L + A +ES
Sbjct: 836 SGDAMALMENLASKLP--AGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYES 893

Query: 877 FKHPFIVIMAVPLSIIGVMLALVITNNPLSIVSFVGIIMLLGIVVNNSILLVDYTNQQKE 936
+ P V++ VPL I+GV+LA + N + VG++ +G+ N+IL+V++ E
Sbjct: 894 WSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLME 953

Query: 937 K-GYPTLEALELSVQHRFRPIVITALTTALGMLPLALGIGEGGEMVASMGIVVIGGLTSS 995
K G +EA ++V+ R RPI++T+L LG+LPLA+ G G ++GI V+GG+ S+
Sbjct: 954 KEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSA 1013

Query: 996 TIFTLFIIPIFYSYVDK 1012
T+ +F +P+F+ + +
Sbjct: 1014 TLLAIFFVPVFFVVIRR 1030



Score = 106 bits (265), Expect = 4e-25
Identities = 80/531 (15%), Positives = 195/531 (36%), Gaps = 53/531 (9%)

Query: 516 RWSLNHRVLVLILTTLLLIVSALGIYNQGINLMPESDEGALTI----------EIEKEQG 565
+ + + +L +L++ AL I + P A+++ ++
Sbjct: 3 NFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVT 62

Query: 566 TIFEDTFDTVENIENELKDYPEVDTYLSNIGSSSPMMSMSEEPNKASITATLVDPADRSV 625
+ E + ++N+ M S S+ +IT T D +
Sbjct: 63 QVIEQNMNGIDNLMY--------------------MSSTSDSAGSVTITLTFQSGTDPDI 102

Query: 626 TTNEFINDIEDEIEKIDDSAEINIVPMSQSGMGGEPNTLMLNVSDDSADRLAE----SEQ 681
+ N ++ + + + + +S + VSD+ +
Sbjct: 103 AQVQVQNKLQLATPLLPQEVQQQGISVEKSSSS--YLMVAGFVSDNPGTTQDDISDYVAS 160

Query: 682 TIIQALEDDEKIESVESSREEMVQELQVQVDRAAARENGLQPAQVGSALYEASNGVQA-- 739
+ L + V+ + +++ +D + L P V + L ++ + A
Sbjct: 161 NVKDTLSRLNGVGDVQLFGAQ--YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQ 218

Query: 740 ----TTVENNNEFLSIVVKYPDDVLSSMENFRDIQI-ANSEGEYVALSEVAELEE-VDML 793
+ SI+ + + E F + + NS+G V L +VA +E +
Sbjct: 219 LGGTPALPGQQLNASIIAQ---TRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENY 275

Query: 794 PMITRDSMEETSELTVTYASDMSLNEAGTYVENIIED--ADFSDDTHYSIGGDL-EMLTD 850
+I R + + + L + A+ + + ++ + + F D +
Sbjct: 276 NVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQL 335

Query: 851 AMPQMLLALILGVLFIYLVMVAQFESFKHPFIVIMAVPLSIIGVMLALVITNNPLSIVSF 910
++ +++ L ++ ++LVM ++ + I +AVP+ ++G L ++ ++
Sbjct: 336 SIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTM 395

Query: 911 VGIIMLLGIVVNNSILLVD-YTNQQKEKGYPTLEALELSVQHRFRPIVITALTTALGMLP 969
G+++ +G++V+++I++V+ E P EA E S+ +V A+ + +P
Sbjct: 396 FGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIP 455

Query: 970 LALGIGEGGEMVASMGIVVIGGLTSSTIFTLFIIPIFYSYVDKETRKMHKK 1020
+A G G + I ++ + S + L + P + + K H +
Sbjct: 456 MAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHE 506


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
AAT16_10075  DHBDHDRGNASE 125    5e-37    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 125 bits (315), Expect = 5e-37
Identities = 71/257 (27%), Positives = 111/257 (43%), Gaps = 15/257 (5%)

Query: 3 KVAVITGSGGGLGKGIAERLAKDGFKVVVNDINAEAVNSTVEEIKAGGYEVIGVQGDVSK 62
K+A ITG+ G+G+ +A LA G + D N E + V +KA DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 63 KEHQFLLVQRAVEVFGRLDVFVNNAGIDVVTPFLDVDEAQLNKAFSINVNGVVFGTQAAA 122
+ R G +D+ VN AG+ + + + FS+N GV +++ +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 123 EQFKKQESKGKIINACSIAGHESYEMLSTYSATKHAVKSFTHSSAKELAPYNIRVNAYCP 182
+ + S G I+ S ++ Y+++K A FT ELA YNIR N P
Sbjct: 129 KYMMDRRS-GSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 183 GVAGTAM----W--DRIDEEMVKYYDHMEPGDAFKEFSGNILLGRPQEPEDVANLVSFLA 236
G T M W + E+++K G + F I L + +P D+A+ V FL
Sbjct: 188 GSTETDMQWSLWADENGAEQVIK-------GSL-ETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 237 SDDSDYITGQAIVTDGG 253
S + +IT + DGG
Sbjct: 240 SGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
AAT16_10100  FIMREGULATRY  26     0.034    Escherichia coli: P pili regulatory PapB protein si...
		>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein signature.

Length = 104

Score = 26.4 bits (58), Expect = 0.034
Identities = 14/43 (32%), Positives = 19/43 (44%), Gaps = 5/43 (11%)

Query: 1 MKEYQFQATIEPDDHRSVKIINDMER--VVGHIEKDVLRKCEE 41
M E F I S ++I M+ V GH K+V CE+
Sbjct: 28 MSEMHFFLLIGISSIHSDRVILAMKDYLVGGHSRKEV---CEK 67



 
Contact Sachin Pundhir for Bugs/Comments.