# Table 1. Results for Consensus with H = 250 and p = 80% on the Benchmark 1 datasets

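| Algorithm | Precision (columns: CNS Rat, Leukemia, NCI60, Lymph., Yeast, PBM) | Timing, CNS Rat (ms) | Timing, NCI60 (ms) | Timing, Yeast (ms) | Timing, PBM (ms) |
| --- | --- | --- | --- | --- | --- |
| Hier-A | - | $8.9 \times 10^{5}$ | $1.4 \times 10^{6}$ | $5.0 \times 10^{7}$ | - |
| Hier-C | $5$ - | $8.1 \times 10^{5}$ | $1.3 \times 10^{6}$ | $4.8 \times 10^{7}$ | - |
| Hier-S | 2 $10$ 10 - | $4.3 \times 10^{5}$ | $1.0 \times 10^{5}$ | $4.8 \times 10^{7}$ | - |
| K-means-R | - | $5.6 \times 10^{5}$ | $1.2 \times 10^{6}$ | $2.7 \times 10^{7}$ | - |
| K-means-A | - | $1.0 \times 10^{6}$ | $1.8 \times 10^{6}$ | $5.6 \times 10^{7}$ | - |
| K-means-C | - | $9.8 \times 10^{5}$ | $1.7 \times 10^{6}$ | $5.3 \times 10^{7}$ | - |
| K-means-S | $5$ - | $1.2 \times 10^{6}$ | $1.2 \times 10^{6}$ | $5.7 \times 10^{7}$ | - |
| NMF-R | - - | $1.1 \times 10^{8}$ | $6.4 \times 10^{7}$ | - | - |
| NMF-A | 2 - - | $3.0 \times 10^{7}$ | $1.3 \times 10^{7}$ | - | - |
| NMF-C | 5 - - | $3.0 \times 10^{7}$ | $1.3 \times 10^{7}$ | - | - |
| NMF-S | 2 8 - - | $3.6 \times 10^{7}$ | $1.3 \times 10^{7}$ | - | - |
| Gold solution | 6 3 8 3 5 18 | - | - | - | - |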
A summary of the results for Consensus with H = 250 and p = 80%, for all algorithms, on the Benchmark 1 datasets. Each cell in the table displays either a precision or a timing result: that is, either the number of clusters that a measure predicts for a dataset or the execution time needed to obtain that prediction.

For cells displaying precision, a number in a circle with a black background indicates a prediction that agrees with the number of classes in the dataset; a number in a circle with a white background indicates a prediction that differs from it, in absolute value, by 1; a number in a square indicates a prediction that differs from it, in absolute value, by 2; and a number in neither a circle nor a square indicates any other prediction. When two very close predictions for k* are obtained, both are reported, separated by a dash. An entry consisting of a dash alone indicates either that the experiment was stopped because of its high computational demand or that the method gave no useful indication.

For cells displaying timing, numeric values report the time in milliseconds, while a dash indicates that the timing is not available for at least one of the following reasons: (a) the experiment was performed on a computer other than the AMD Athlon; (b) the experiment was stopped because of its high computational demand; (c) a smaller range of clustering solutions was produced for that dataset because of its size, i.e., Leukemia with p = 66%. For this particular set of experiments, we do not report the timing results for Leukemia and Lymphoma because they are redundant.
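The precision legend amounts to a rule on the absolute difference between a predicted number of clusters and the gold-solution number of classes. The following is a minimal sketch of that rule, not code from the original study: the function name `precision_marker` and the returned label strings are hypothetical, and the class counts are copied from the "Gold solution" row of the table.

```python
from typing import Optional

# Number of classes per dataset, copied from the "Gold solution" row of Table 1.
GOLD = {
    "CNS Rat": 6,
    "Leukemia": 3,
    "NCI60": 8,
    "Lymphoma": 3,
    "Yeast": 5,
    "PBM": 18,
}


def precision_marker(predicted: Optional[int], gold: int) -> str:
    """Map one precision cell to its legend category.

    `predicted` is None when the cell holds only a dash (experiment stopped
    or no useful indication). The returned labels are illustrative names
    for the markers described in the caption.
    """
    if predicted is None:
        return "dash"
    diff = abs(predicted - gold)
    if diff == 0:
        return "black circle"  # agrees with the number of classes
    if diff == 1:
        return "white circle"  # off by 1
    if diff == 2:
        return "square"        # off by 2
    return "plain"             # any other prediction


# Illustrative calls on made-up predictions (not cells from the table):
print(precision_marker(7, GOLD["NCI60"]))     # off by 1 from 8
print(precision_marker(3, GOLD["Lymphoma"]))  # exact agreement
print(precision_marker(None, GOLD["PBM"]))    # dash-only cell
```

A cell that reports two close predictions separated by a dash would need one call per value; the sketch leaves that case to the caller.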