On 12 Jun 2005, at 11:16 pm, Luobin Yang wrote:

> When I use idsfasta program from EMBOSS to index NCBI nt fasta format
> database, the program complains that "duplicate ID found" and aborts.
> Has anyone used idsfasta program to index NCBI nt database and has the
> same problem? I tried to change the fields option from acum (accession
> number) to seqvn (sequence version and GI) but it doesn't solve the
> problem.

Maybe there really is a duplicate entry in the data. It isn't unknown
for them to slip through occasionally. It's simple enough to check
whether the ID is duplicated yourself, and remove one of the entries
(presumably the older one) if it is.

Tim

-- 
Dr Tim Cutts
Informatics Systems Group, Wellcome Trust Sanger Institute
GPG: 1024D/E3134233 FE3D 6C73 BBD6 726A A3F5 860B 3CDD 3F56 E313 4233
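
A minimal sketch of the duplicate check suggested in the reply, not taken from the original post. It assumes the ID is the first whitespace-delimited token on each ">" header line; the indexing program may derive its ID differently (for example from a single field of an NCBI-style "gi|...|gb|...|" header), so treat the output as a starting point. The function names are hypothetical.

```python
import sys
from collections import Counter


def header_id(line):
    """Return the first whitespace-delimited token after '>' (the assumed ID)."""
    fields = line[1:].split()
    return fields[0] if fields else ""


def find_duplicate_ids(lines):
    """Map each ID that appears on more than one header line to its count."""
    counts = Counter(header_id(line) for line in lines if line.startswith(">"))
    return {seq_id: n for seq_id, n in counts.items() if n > 1}


# Usage (hypothetical script name): python find_dups.py nt.fasta
if __name__ == "__main__" and len(sys.argv) == 2:
    with open(sys.argv[1]) as handle:
        for seq_id, n in sorted(find_duplicate_ids(handle).items()):
            print(f"{seq_id}\t{n}")
```

The script holds every ID in memory, which may be a lot for a database the size of nt. It only reports duplicates; deciding which copy is the older one and removing it is left as a manual step.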