[BiO BB] genbank orthography

Michael Ashburner ma11 at gen.cam.ac.uk
Wed Oct 24 12:58:38 EDT 2007


I agree it is a terrible mess. Not the new gaz.obo project.  This is  
an attempt to build an artefact in OBO
format for geographical locations. The current version has about  
20,000 locations. We have a parse of about
45,000 from Genbank but it will take some time to check them and get  
them in to this file.

http://obo.cvs.sourceforge.net/obo/obo/ontology/environmental/gaz.obo? 
view=log


Michael Ashburner


The file is available from the OBO CVS site
On 24 Oct 2007, at 10:41, Dan Bolser wrote:

> On 24/10/2007, Sterten at aol.com <Sterten at aol.com> wrote:
>>
>> names are not spelled uniformly, e.g. Viet Nam and Vietnam,
>> also many typos, this makes it very difficult to sort and analyse  
>> the  entries
>> by computer.
>> I'm looking for a complete list of different spellings
>> (thousands of entries...) and the suggested standard so we can
>> correct/uniformify them automatically.
>
> Great idea. The PDB needs something similar also!
>
>
>>
>>
>>
>>
>> _______________________________________________
>> General Forum at Bioinformatics.Org -  
>> BiO_Bulletin_Board at bioinformatics.org
>> https://bioinformatics.org/mailman/listinfo/bio_bulletin_board
>>
>
>
> -- 
> hello
> _______________________________________________
> General Forum at Bioinformatics.Org -  
> BiO_Bulletin_Board at bioinformatics.org
> https://bioinformatics.org/mailman/listinfo/bio_bulletin_board




More information about the BBB mailing list