Open
Description
Hi,
I ran MeShClust on 700k sequences and got the following error message:
Using 16 bit histograms
Counting 4-mers [======================================================] 100 %
Splitting data
Point pairs: 38
Sorting data [=========================================================] 100 %
Warning: Alignment may be too large for sampling
Before Pair: >158256496-stool1_revised_C820061_1_gene84242 strand:+, >158256496-stool1_revised_C820061_1_gene84242 strand:+
Before Pair: >158256496-stool1_revised_C820061_1_gene84242 strand:+, >158256496-stool1_revised_C844273_1_gene26404 strand:+
Before Pair: >158256496-stool1_revised_C820061_1_gene84242 strand:+, >158256496-stool1_revised_C850045_1_gene50883 strand:-
Before Pair: >158256496-stool1_revised_C820061_1_gene84242 strand:+, >158256496-stool1_revised_C928413_1_gene23126 strand:-
Alignment [============================================================] 100 %
positive=56 negative=1008
resizing positive
Vector size: 56 min size: 56
resizing negative
Vector size: 1008 min size: 56
index size: 952
positive=56 negative=56
Adding combo 18
new single feature 2
new single feature 16
Adding combo 6
new single feature 4
Adding combo 32
new single feature 32
bounds[0]: 0 to 16290
bounds[1]: 0.0944969 to 1
bounds[2]: 0 to 16290
bounds[3]: -0.188998 to 15.5225
Accuracy: 96.4286% Sensitivity: 100% Specificity: 92.8571%
Accuracy: 94.6429% Sensitivity: 100% Specificity: 89.2857%
Adding combo 1026
new single feature 1024
bounds[0]: 0 to 16290
bounds[1]: 0.0944969 to 1
bounds[2]: 0 to 16290
bounds[3]: -0.188998 to 15.5225
bounds[4]: 34393 to 65536
Accuracy: 98.2143% Sensitivity: 100% Specificity: 96.4286%
Accuracy: 100% Sensitivity: 100% Specificity: 100%
breaking from acc cutoff
Final: feat size is 4
Using 4 features Mar 9 2018
error: list not sorted===============> ] 40 %
terminate called after throwing an instance of 'int'
Can this be overcome?
Many thanks,
Matthieu
Metadata
Metadata
Assignees
Labels
No labels