[Biococoa-dev] more ramblings
Koen van der Drift
kvddrift at earthlink.net
Fri Dec 3 15:30:37 EST 2004
On Dec 3, 2004, at 12:03 PM, Alexander Griekspoor wrote:
>> I like the BCAnnotatedSequence name. Although it kinda leaves out the
>> features.
> True, but in some way also in that case I like to think as the
> sequence being annotated...
>>
Yes, that is true.
>> BTW, I'm still confused about what is the difference between a
>> feature and an annotation. I think we should define very well what we
>> want to store in either one. And if the BCFeature and BCAnnotation
>> aren't objects, what do you intend them to be?
> Oh sorry, in the previous email you wrote:
>
>>
>> BCSequenceWrapper
>> |
>> -> has-a BCSequence, BCFeatures, etc
>>
>>
>> BCFeatures
>> |
>> -> has-a dictionary of features (d'oh)
>
> ..so I thought you wanted to create a BCFeatureS class, which would
> have a dictionary of features (BCFeature?).
I don;t know - I thought that was similar to what John proposed last
week. I really have no idea what the structure should be. It was just
an example to get the discussion going.
> But I guess you meant the same thing as I do: A BCSequenceWrapper
> (object) has a dictionary containing multiple BCFeature objects.
>
Yes. Although we probably will use BCAnnotatedSequence which is a
BCSequence, instead of has-a (see my other email).
> As for the distinction between features and annotations: it's pretty
> simple, features are linked to certain symbol ranges (phoshorylation
> at 12, helix from 15-18), annotations are like metadata, author =
> koen, link to article = ... , creation date.. etc
Yes, that makes sense.
>
> I think that both should be tagged by string identifiers (like
> BCFeatureTypeAlphaHelix and BCAnnotationTypeAuthor), and we can
> predefine a complete set of them to be sure that things like author,
> date etc are standard items. The main advantage is that a developer
> can add it's own tags to for instance save a marked region in a
> sequence: (AGFeatureTypeMyMarkedRegion). We will make sure that these
> get written in and out of files for him, so he doesn't have to invent
> his own filetype.
> Note that it should be no problem to add multiple instances of
> BCFeatures with same tag (like authors).
Sounds like a good plan. Not that this will not work with a dictionary,
because it can have only one of each key. BTW, I'm pretty sure that
there is a xml scheme out there that tries to unify these things for
various dataformats. We could consider adapting such a thing (if it
exists).
>
>>
>> Should we also make a special class BCAnnotatedSequenceReader that
>> takes care of extracting as much as info as possible from a file? Or
>> should we do everything with BCSequenceReader.?
>
> Good question, I don't know, there should definitely be the option to
> either read in the features and annotations or not. It can go into one
> class, but it's perhaps nicer to have to classes. Then again, it would
> require a lot of code duplication which we don't want. We could make
> the one a subclass of the other, the annotated one takes off where the
> super class stops. No idea...
Food for thought.
cheers,
- Koen.
More information about the Biococoa-dev
mailing list