[Biococoa-dev] Optimizations

Koen van der Drift kvddrift at earthlink.net
Tue Mar 15 20:20:47 EST 2005


On Mar 15, 2005, at 7:29 PM, John Timmer wrote:

> Sorry for my lack of clarity.  Shark says that over 30% of the 
> execution
> time in the "findSequence" method is spent checking whether one symbol
> represents another.  Currently, that's done by checking whether the
> submitted symbol occurs in the array of represented symbols.  
> According to
> the docs, making the represented symbols a set instead of an array will
> speed this up significantly.
>
> Returning an array from the method doesn't enter into this issue, and
> definitely should not be changed.
>
> I may be obsessing about this, but my tests earlier today showed that 
> the
> non-strict version of the code to take 4-5X the time to execute 
> compared to
> the strict one.  In a 1.2Kb sequence, it's the difference between 
> barely
> perceptible and wondering whether something's broken.
>

Ah, I see what you mean now, and yes, a BCSymbolSet could be much 
faster. I think the symbolsets are ready for use (is that right, 
Charles?). What's missing so far is that they have not been implemented 
to the BCSequence code yet.

- Koen.





More information about the Biococoa-dev mailing list