[Biococoa-dev] Even more on sequence formats

Peter Schols peter.schols at bio.kuleuven.be
Tue Apr 11 09:08:40 EDT 2006


ReadSeq could indeed be a good example.
I think we already support most of those formats, except for IG,
NBRF, Fitch, Zuker, Olsen and ASN.1.
We do, however, have Clustal, Nona, TNT, Hennig and PDB ;-)
(PAUP == Nexus).

Peter




> Finally, to come back to the question of formats, perhaps we
> can learn from a classic sequence reader package called ReadSeq,
> by D. G. Gilbert.
> It reads the following formats, which are outlined in the Formats
> text file inside the src folder:
>          1. IG/Stanford           10. Olsen (in-only)
>          2. GenBank/GB            11. Phylip3.2
>          3. NBRF                  12. Phylip
>          4. EMBL                  13. Plain/Raw
>          5. GCG                   14. PIR/CODATA
>          6. DNAStrider            15. MSF
>          7. Fitch                 16. ASN.1
>          8. Pearson/Fasta         17. PAUP
>          9. Zuker (in-only)       18. Pretty (out-only)
>
> We support some of them but not all, so we could add a few
> formats, and the source code nicely shows how to tell them apart.
> The latest version switched from C to Java and added a few more
> formats, so there's plenty to add ;-) The source also contains
> many sample files for testing purposes.
> I'm not sure where it can be found nowadays, so I have put it
> temporarily on our server for you guys to download:
> http://www.mekentosj.com/temporary/readseq.zip
> Have a look at it and tell me what you think.
> Cheers,
> Alex
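
As a rough illustration of what telling formats apart can look like,
here is a minimal Python sketch that guesses the format from the first
non-blank line of a file. It is not taken from ReadSeq or BioCocoa: the
function name sniff_format and the returned labels are hypothetical,
and it only covers a handful of formats whose opening lines are
well known (FASTA ">", GenBank "LOCUS", EMBL "ID", Nexus "#NEXUS",
Clustal "CLUSTAL").

```python
def sniff_format(text):
    """Guess a sequence file format from its first non-blank line.

    Hypothetical helper for illustration only; a real reader would
    need more checks than a single prefix test.
    """
    for line in text.splitlines():
        stripped = line.strip()
        if not stripped:
            continue  # skip leading blank lines
        if stripped.upper().startswith("#NEXUS"):
            return "nexus"
        if stripped.startswith("CLUSTAL"):
            return "clustal"
        if stripped.startswith("LOCUS"):
            return "genbank"
        if stripped.startswith("ID   "):
            # EMBL line codes are two letters followed by three spaces.
            return "embl"
        if stripped.startswith(">"):
            return "fasta"
        # Only the first non-blank line is inspected in this sketch.
        return "unknown"
    return "unknown"
```

A call such as sniff_format(open(path).read()) would then pick the
parser to use. Formats that share an opening character (NBRF/PIR
headers also begin with ">") would need an extra check before the
FASTA test.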
>
>
>> For those who feel like helping out, the way to implement the
>> code is:
>>
>> - remove blank lines (optional)
>> - get each line
>> - extract annotations into a BCAnnotationsArray
>> - extract the sequence(s) into an NSString
>> - once done with all the sequences, create a BCSequence from each  
>> sequenceString
>> - add the annotations to each BCSequence
>> - add the new BCSequence(s) to the BCSequenceArray
>> - return the BCSequenceArray
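
The steps above can be sketched for a FASTA-style file as follows.
This is a Python sketch under stated assumptions, not BioCocoa code:
the Sequence class is a hypothetical stand-in for BCSequence, a plain
dict stands in for the annotations array, and a Python list stands in
for BCSequenceArray.

```python
from dataclasses import dataclass, field


@dataclass
class Sequence:
    """Hypothetical stand-in for BCSequence."""
    residues: str
    annotations: dict = field(default_factory=dict)


def parse_fasta(text):
    """Apply the listed steps to FASTA-style input."""
    # Remove blank lines, then walk the remaining lines one by one.
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]

    records = []  # pairs of (annotations, list of sequence chunks)
    for line in lines:
        if line.startswith(">"):
            # Extract the annotations (here just the header text).
            records.append(({"header": line[1:].strip()}, []))
        elif records:
            # Extract the sequence text for the current record.
            records[-1][1].append(line)
        # Lines before the first header are ignored in this sketch.

    # Once all lines are read: build one Sequence per sequence string,
    # attach its annotations, collect the results and return them.
    return [
        Sequence("".join(chunks), annotations)
        for annotations, chunks in records
    ]
```

Building the sequence objects only after all lines have been read, as
the list suggests, keeps the line-by-line loop free of object creation
and lets multi-line sequences be joined once.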
>>
>>
>> cheers,
>>
>> - Koen.
>> _______________________________________________
>> Biococoa-dev mailing list
>> Biococoa-dev at bioinformatics.org
>> https://bioinformatics.org/mailman/listinfo/biococoa-dev
>>
>
> **************************************************************
>                         ** Alexander Griekspoor **
> **************************************************************
>                  The Netherlands Cancer Institute
>                  Department of Tumorbiology (H4)
>             Plesmanlaan 121, 1066 CX, Amsterdam
>                        Tel:  + 31 20 - 512 2023
>                        Fax:  + 31 20 - 512 2029
>                       AIM: mekentosj at mac.com
>                       E-mail: a.griekspoor at nki.nl
>                    Web: http://www.mekentosj.com
>
> MacOS X: The power of UNIX with the simplicity of the Mac
>
> ***************************************************************
>
>





