Skip to main content

Table 1 Medium-gap-length model conditions: parameter values and summary statistics

From: Non-parametric and semi-parametric support estimation using SEquential RESampling random walks on biomolecular sequences

Model conditionNumber of taxaTree heightInsertion/deletion probabilityNHDGappinessTrue align length
10.A100.40.130.2970.4741965.3
10.B100.70.10.3940.5122165.1
10.C1010.060.5140.5262162.8
10.D101.60.0310.5990.4851874.4
10.E104.30.0130.6930.4651849.3
50.A500.450.060.2810.5162043.5
50.B500.70.030.3980.4751935.5
50.C5010.020.5140.4982047.6
50.D501.80.0120.5940.4711945.0
50.E504.30.0040.6880.4591890.2
  1. The main simulations in our study utilized the medium gap length distribution from the study of Liu et al. [15]. The model condition parameters consist of the number of taxa, model tree height, and insertion/deletion probability. Each model condition corresponds to a distinct set of model parameter values. The 10-taxon model conditions are named 10.A through 10.E in order of generally increasing sequence divergence; the 50-taxon model conditions are named 50.A through 50.E similarly. The following table columns list average summary statistics for each model condition (\(n=20\)). “NHD” is the average normalized Hamming distance of a pair of aligned sequences in the true alignment. “Gappiness” is the percentage of true alignment cells which consists of indels. “True align length” is the length of the true alignment