> >The clustal parser output has a whole lot of dashes in the sequences. > >Can you have a look at those? (reg expressions are not my strong > point as > >you might have noticed) > > Dashes are "gaps"... they are meant to say that that particular > position in Sequence A does not match letters in the corresponding > position in Sequence B, in any meaningful way. When measuring Sure, I simply did not think you would want the gaps in the sequences parsed outof a clustalw file. But then again, maybe you do! I guess I would have never even though about parsing out sequences from clustalw result files. If you mail me the specific headers you want, I'll add them to cvs. Is Wincvs really such a pain? Best, Nico