Date:
08/13/04 11:02
|
Submitted by:
dmb
|
Assigned to:
liwz
|
Category:
Clustering
|
Priority:
5
|
Ticket group:
Serious
|
Resolution:
Unset
|
Summary:
Suggested parameters cause an error!
|
Original submission:
From the docs...
55% cd-hit -n 4 -c 0.55 or
mcd-hit -n 3 -c 0.55
When I try
cd-hit
-i ./Scratch/somefile.fa
-o ./Scratch/somefile.fa.55
-c 0.55
-b 20
-M 1400
-n 4
-l 40
-L 0
-d 40
-t 2
I get the following error...
Fatal Error
Too short word length, increase it or the tolerance
Fixed by using
mcd-hit
-i ./Scratch/somefile.fa
-o ./Scratch/somefile.fa.55
-c 0.55
-b 20
-M 1400
-n 3
-l 40
-L 0
-d 40
-t 2
The optimal wordsize choice is emperically determined, it would be nice if we could analytically determine their values.
Maybe even a hybrid word size approach would be optimal.
|
Please log in to add comments and receive followups via email.
|
No followups have been posted
|
No results for "Dependent on ticket" |
No results for "Dependent on Task" |
No other tickets are dependent on this ticket
|
Ticket change history
Field
|
Old value
|
Date
|
By
|
status_id
|
Pending |
07/14/11 01:21
|
liwz
|
close_date
|
12/31/69 19:00 |
07/14/11 01:21
|
liwz
|
assigned_to
|
unset |
08/13/04 11:06
|
dmb
|
|