Yes, I believe you've got it. A local alignment between "disestabishmentarianism" and<br>

"reestablishmentSomeNonMatchingPart" would match establishment, but a

fully global alignment would force the leading d to match the leading r

and the trailing m to match the trailing t. <br>

<br>

Marty<br>

<br>

<br><br><div><span class="gmail_quote">On 3/11/06, <b class="gmail_sendername">Theodore H. Smith</b> <<a href="mailto:delete@elfdata.com">delete@elfdata.com</a>> wrote:</span><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">

<br>Hi people,<br><br>I've successfully designed, written and compiled a program that uses<br>the smith-waterman algorithm.<br><br>Nothing new there, but it's for an interesting project, and before<br>the project is complete, perhaps some questions asked to

<br>bioinformaticians can help bring me up to your level.<br><br>The next stage after compiling, is testing my algorithm. I now must<br>write some tests for my code.<br><br>This is where I am seeing that I'm unsure if I even understand Smith-

<br>Waterman properly! I understand Levenshtein OK (similar to Needleman-<br>Wunsch), but Smith-Waterman I'm a bit unclear on.<br><br>Mostly I'm wondering exactly how does local matching help us, over<br>global matching. I got a lay person's description of why it helps,

<br>but I'm more interested in getting an exact feel for it.<br><br>Does it make sense to use English words as an example here, instead<br>of protein sequences? That would help me understand this a bit<br>better, as I have a better feel for English than proteins (unlike

<br>many of you).<br><br>Would then the main advantage be, for searching for short sequences<br>within long ones, without being unfairly penalised by the non-<br>matching ends of the long sequence?<br><br>For example: "extrapolate" could match "extra", far better in Smith-

<br>Waterman than it could using Levenshtein, because we aren't being<br>penalised so badly by the "polate" part.<br><br>Or perhaps: "specialisation" would match "lisation" far better using<br>

local than global, because we aren't being penalised by the "specia"<br>part so much.<br><br>Or even: "disestablishmentarianism" would match "establishment" far<br>better using local than global, because we aren't being penalised by

<br>"dis" or "arianism".<br><br>Is that how local searches like Smith-Waterman benefit us?<br><br><br>What about when we are searching for two long sequences of which only<br>a small part will match?<br>

Let's say "disestabishmentarianism" against "reestablishmentSomeNonMatchingPart". A local alignment should be able to figure out that "establishment" aligns well in this case.

<br><br>Is that basically how Smith-Waterman helps us?<br><br>--<br><a href="http://elfdata.com/plugin/">http://elfdata.com/plugin/</a><br><br><br><br>_______________________________________________<br>Bioinformatics.Org general forum  -  

<a href="mailto:BiO_Bulletin_Board@bioinformatics.org">BiO_Bulletin_Board@bioinformatics.org</a><br><a href="https://bioinformatics.org/mailman/listinfo/bio_bulletin_board">https://bioinformatics.org/mailman/listinfo/bio_bulletin_board

</a><br></blockquote></div><br><br clear="all"><br>-- <br>-- <br>Martin Gollery<br>Associate Director<br>Center For Bioinformatics<br>University of Nevada at Reno<br>Dept. of Biochemistry / MS330<br>775-784-7042<br>-----------