[Biococoa-dev] BCSequence

Koen van der Drift kvddrift at earthlink.net
Wed Aug 25 17:25:52 EDT 2004


On Aug 25, 2004, at 2:00 AM, Alexander Griekspoor wrote:

> Let's start a little new discussion then ;-)
> In principle these modifications can be seen as features right? So now 
> we have three names/kinds around in two pairs:
> - Modifications (example: methylgroup, phosphate etc)
> - Features (example: alpha-helix, nuclear localization signal etc)
> Or
> - Features (example: methylgroup, phosphate etc)
> - Annotations (example: alpha-helix, nuclear localization signal etc)


I think we should treat the modifications as an array of BCSymbols. We 
could even make a BCModificationsArray if we add an intermediate class 
called BCSymbolArray as follows:

BCSymbolArray
|
|
------BCSequence
|
|
------BCModificationsArray


The BCSymbolArray can actually take care of a some of the code that is 
currently in BCSequence.

If we calculate the mass of a molecule, we can just iterate over the 
BCModificationsArray to add the masses of the modifications. For me 
features and annotations relate more to secondary structures and 
author's comments (as found in a swissprot or ncbi file). But that's 
just a name game.


> Modifications and features are very alike, and a modification could be 
> seen as a special feature and thus a subclass (inheriting the 
> add/removing/editing/syncing etc methods, but add weight, pi etc ). 
> Also the question raises wether we should keep them in two arrays 
> (features and modifications) or in one (features). If you display all 
> features of a sequence, it perhaps would be nice to see the 
> modifications as well.

I think we should keep them separate. Modifications are per BCSymbol, 
features can span a whole range of BCSymbols. Also maybe we should move 
the mass calculations into a separate class that accepts a sequence to 
calculate the mass? The same for features, pI, etc.


For example we have the class MassCalculator with the following methods

-(id) MassCalculator initWithSequence:(BCSequence *)seq
-(id) MassCalculator initWithSubSequence:(BCSequence *)seq 
inRange:(NSRange)aRange
-(id) MassCalculator initWithString:(NSString *)seq
-(id) MassCalculator initWithSubString:(NSString *)seq 
inRange:(NSRange)aRange

-(float)getMass useMassType:(BCMassType)type addModifications:(BOOL)mods

The getMass method iterates over all symbols and adds the mass, just as 
we do now in the molecularWeight method.


Then we use it as follows:

MassCalculator calculator = [[MassCalculator alloc] 
initwithSequence:mySequence];

float totalMass = [calculator getMass useMassType:BCAverage 
addModifications:YES];

[calculator release];


<nitpicking mode on>
I prefer to use the word 'mass' instead of 'weight'. See eg the 
description in <http://en.wikipedia.org/wiki/Weight>. If we want to 
keep a method molecularWeight around that's fine with me, we could just 
have it return the result of getMass using the averageMass type, which 
is the same value.
<nitpicking mode off>


> 1 The molecular weight method should take modifications into account. 
> A methylgroup adds weight. Thus modifications should have an 
> addedWeight: mode: method  that can accept negative values as well (if 
> the modification removes more weight than it adds).

Just put a negative value in the plist, and it will substract it when 
summing all modifications.



> In both cases it would be nice to add a (p)list of predefined 
> modifications and features.
>

definitely yes.



> I think that's a pretty easy choice, as we don't know of anyone using 
> BioCocoa at the moment, I think we should just start from scratch.
>

I agree, but it think it would be fair to Peter to let him have his say 
as well. He started BioCocoa and the IO classes are all his code and I 
don't want to throw that away :)


- Koen.




More information about the Biococoa-dev mailing list