[Biodevelopers] getting all db sequences from a PSI-BLAST run

Martin Heusel mheusel at gmail.com
Wed Feb 28 13:15:16 EST 2007


Hi Noel,

yepp i meant '-e' sorry. For the number of sequences returned i never
get all sequences of a database. Even for a small database with 500
sequences only around 300 are given back. The e-value was set to
1000000. What i recently learned is that blastpgp only makes
approximations for computing the e-values for speed up reasons. It
computes only the first taylor term of a taylor approx. of exp() of

E = K m n exp(-lambda score)

 which only makes sense for not too small scores. So my assumption is
that for to small scores and lambdas the approx. gives way to high
e-values exceeding the -e threshold. By definition an e-value is a
propability and should not go beyond 1.

Regards

  Martin

On 2/25/07, Noel Faux <Noel.Faux at med.monash.edu.au> wrote:
> Hi Martin,
>
> This was probably a typo, but, I think you need '-e' not '-E' to set the
> e-value cutoff for the returned results.  When I wanted all results I
> set -b to the size of the subject database and -e 100000.  The e-value
> never reached that, so PSI-BLAST returned all results.
>
> Cheers
> Noel
>
> Martin Heusel wrote:
> > Hi,
> >
> > i'm wondering if it's possible to get all sequences of a large
> > database ranked by E-value or score from PSI-BLAST with a query.
> > Normally PSI-BLAST stops outputting after a couple of sequences even
> > if one sets the output parameters -b or -E to very high values. Is it
> > possible in general or are there computational limits (time etc.) in
> > figuring out the right scores or E-values when it comes to the many
> > sequences with very low identity? Thanks for any advice.
> >
> > Martin
> >
>
> _______________________________________________
> Biodevelopers mailing list
> Biodevelopers at bioinformatics.org
> https://bioinformatics.org/mailman/listinfo/biodevelopers
>


-- 
+ Neural Processing Group Technical University Berlin
+ http://ni.cs.tu-berlin.de
+ Institute of Bioinformatics Johannes Kepler University Linz
+ http://www.bioinf.jku.at/

+ In the beginning was the WORD, and the WORD was UNSIGNED,
+ and the main(){} was without form and void


More information about the Biodevelopers mailing list