[Biodevelopers] (#=#) XML for huge DB? - XML::Parser....

Patrick McConnell MCCon012 at mc.duke.edu
Fri Aug 1 14:49:46 EDT 2003





You will have to post your SAX handler code for us to be any help, I think.

Perhaps your parser is returning character data in separate chunks so that
you are getting events like this:
      (1) Element start: hsp_num
      (2) Characters: 1
      (3) Characters: 2
      (4) Element end: hsp_num
In this case, your hsp num is supposed to be 12, but gets split because the
character buffer was filled at this random time.  I think such an occurance
is legal for a SAX parser, but I am no expert.

I don't know about getting a Q instead of the proper accession.

Good luck!

-Patrick





Dan Bolser <dmb at mrc-dunn.cam.ac.uk>@bioinformatics.org on 08/01/2003
01:33:37 PM

Please respond to biodevelopers at bioinformatics.org

Sent by:    biodevelopers-admin at bioinformatics.org


To:    biodevelopers at bioinformatics.org
cc:

Subject:    [Biodevelopers] (#=#) XML for huge DB? - XML::Parser....


I am now deeply frustrated.

My XML::Parser is working almost perfectly, apart from
dumping the occasional Q instead of the correct
Hit_accession, and then somtimes 'loosing' the last
digit of the Hsp_num...

I only notice these when they violate my primary
keys, so I don't know how many 'silent' mutations
are being introduced.

If I put some new lines in the file, after the <!DOCYTPE,
and before <BlastOutput, other characters go missing,
but the previous ones are returned. There is a specific
number of lines which gives no detected mutations,

but puting the file back the way it was makes the original
problem come right back - in exactly the same predetermined
way....

Who should I kill?

Please don't sugest using another parser...

Which Bioperl should I use?

I found the great looking

BLAST::Report

http://ccgb.umn.edu/~crow/projects/xmlblast/

uses dom, which is a shame.

Thanks anyone who can return my missing characters,
and my sanity....

Why would XML::Parser develope this problem is such
a derministic random way?

?Thanks again, Dan.



_______________________________________________
Biodevelopers mailing list
Biodevelopers at bioinformatics.org
https://bioinformatics.org/mailman/listinfo/biodevelopers







More information about the Biodevelopers mailing list