I was going to recommend (or at least ask permission) to omit that very code while I'm working on the file-based vs memory-based sequence classes.  Unfortunately, things have been very crazy at work for the last 2 months, so I haven't had the time I'd like to work on this.  I only have skeletons of the new BCSequence class cluster together, but I was not finding an easy way to filter the file-based sequences like the memory-based ones are at init time.  Omitting that code from the library is a breaking change...  So it's probably not going to be well-received by any end-users.  <br>

<br>I have not heard back from Peter Schols regarding repository access.  I would really like to branch the source and check-in on a regular basis. Is there anyone else that can grant this access?  If not to the trunk, at least to a branched folder?<br>

<br><div class="gmail_quote">On Wed, Apr 1, 2009 at 11:21 AM, Scott Christley <span dir="ltr"><<a href="mailto:schristley@mac.com">schristley@mac.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">

Hello Stephan,<br>

<br>

Yes this is an issue.  BioCocoa attempts to determine the sequence type by looking at the symbols, once it makes a decision it strips any unknown symbols, of course it can make a wrong decision or more likely in your case, it considers the gap symbol as unknown.<br>


<br>

Unfortunately there isn't a direct workaround unless you are willing to make a modification to the BioCocoa source code, there are just a few lines you can comment out in BCSequence.m that will skip.<br>

<br>

I think BioCocoa probably needs to be changed so that it doesn't modify the sequence data at all, and the user is responsible for initiating a sequence type check and/or filtering.<br>

<br>

cheers<br><font color="#888888">

Scott</font><div><div></div><div class="h5"><br>

<br>

On Apr 1, 2009, at 3:05 AM, Stephan wrote:<br>

<br>

<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">

Hi,<br>

<br>

I am new to BioCocoa and was wondering whether there is a way of parsing FASTA files that contain alignment-information, i.e. they include sequences with the gap-symbol "-".<br>

Right now, if I parse the file, the gaps are filtered out.<br>

<br>

Any sugguestions?<br>

<br>

Best,<br>

Stephan<br>

<br>

<br>

<br>

_______________________________________________<br>

Biococoa-dev mailing list<br>

<a href="mailto:Biococoa-dev@bioinformatics.org" target="_blank">Biococoa-dev@bioinformatics.org</a><br>

<a href="http://www.bioinformatics.org/mailman/listinfo/biococoa-dev" target="_blank">http://www.bioinformatics.org/mailman/listinfo/biococoa-dev</a><br>

</blockquote>

<br>

<br>

_______________________________________________<br>

Biococoa-dev mailing list<br>

<a href="mailto:Biococoa-dev@bioinformatics.org" target="_blank">Biococoa-dev@bioinformatics.org</a><br>

<a href="http://www.bioinformatics.org/mailman/listinfo/biococoa-dev" target="_blank">http://www.bioinformatics.org/mailman/listinfo/biococoa-dev</a><br>

<br>

<br>

</div></div></blockquote></div><br>