Bioinformatics.org
[University of Birmingham]
[Patsnap]
Not logged in
  • Log in
  • Bioinformatics.org
    Membership (44430+) Group hosting [?] Wiki
    Franklin Award
    Sponsorships

    Careers
    About bioinformatics
    Bioinformatics jobs

    Research
    All information groups
    Online databases Online analysis tools Online education tools More tools

    Development
    All software groups
    FTP repository
    SVN & CVS repositories [?]
    Mailing lists

    Forums
    News & Commentary
  • Submit
  • Archives
  • Subscribe

  • Jobs Forum
    (Career Center)
  • Submit
  • Archives
  • Subscribe
  • News & Commentary - Message forums

    Gene name errors introduced by Excel
    Submitted by Marcos Oliveira de Carvalho; posted on Thursday, July 29, 2004

    Submitter

    A research article at BMC Bioinformatics:

    BACKGROUND

    When processing microarray data sets, we recently noticed that some gene names were being changed inadvertently to non-gene names.

    RESULTS

    A little detective work traced the problem to default date format conversions and floating-point format conversions in the very useful Excel program package. The date conversions affect at least 30 gene names; the floating-point conversions affect at least 2,000 if Riken identifiers are included. These conversions are irreversible; the original gene names cannot be recovered.

    CONCLUSIONS

    Users of Excel for analyses involving gene names should be aware of this problem, which can cause genes, including medically important ones, to be lost from view and which has contaminated even carefully curated public databases. We provide work-arounds and scripts for circumventing the problem.

    The full article is open-access and gratis to download:
    http://www.biomedcentral.com/1471-2105/5/80

    Expanded view | Monitor forum | Save place

    Start a new thread:
    You have to be logged in to post a reply.

     

    Copyright © 2024 Scilico, LLC · Privacy Policy