Minimum evolution

Minimum evolution is a distance method employed in phylogenetics modeling. It shares with maximum parsimony the aspect of searching for the phylogeny that has the shortest total sum of branch lengths.^[1]^[2]

This article has multiple issues. Please help improve it or discuss these issues on the talk page. (Learn how and when to remove these messages)

This article's lead section may need to be rewritten. The reason given is: Context is a slow and non standard. (June 2020)

A major contributor to this article appears to have a close connection with its subject. (June 2020)

The theoretical foundations of the minimum evolution (ME) criterion lay in the seminal works of both Kidd and Sgaramella-Zonta (1971)^[3] and Rzhetsky and Nei (1993).^[4] In these frameworks, the molecular sequences from taxa are replaced by a set of measures of their dissimilarity (i.e., the so-called "evolutionary distances") and a fundamental result states that if such distances were unbiased estimates of the true evolutionary distances from taxa (i.e., the distances that one would obtain if all the molecular data from taxa were available), then the true phylogeny of taxa would have an expected length shorter than any other possible phylogeny T compatible with those distances.

Remove ads

Relationships with and comparison with other methods

Summarize

Perspective

More information Principle, Usage Case ...

Comparison Table for Phylogenetic Tree-Building Methods
	Principle	Usage Case	Time Complexity
Minimum Evolution (ME)	Searches for the shortest total sum of branch lengths	Large datasets where character-based methods would be too demanding	NP-hard without utilizing heuristics
Maximum Parsimony	Searches for the minimal number of evolutionary steps (i.e. mutations)	Similar sequence comparisons	NP-hard for large datasets
Neighbor Joining	Iteratively joins the shortest branch lengths to construct a single tree	Large datasets / Datasets with few informative sites	O(N³), can be improved using heuristics
Maximum Likelihood	Looks for the most probable function possible given data parameters	Large datasets where low bias is important	O(N), this improves with larger datasets
Unweighted Pair Group Method with Arithmetic Mean(UPGMA)	Creates a cluster of clusters and finds the arithmetic mean of the clusters	Initial examination of data	O(N²)

Maximum parsimony

It is worth noting here a subtle difference between the maximum-parsimony criterion and the ME criterion: while maximum-parsimony is based on an abductive heuristic, i.e., the plausibility of the simplest evolutionary hypothesis of taxa with respect to the more complex ones, the ME criterion is based on Kidd and Sgaramella-Zonta's conjectures that were proven true 22 years later by Rzhetsky and Nei.^[4] These mathematical results set the ME criterion free from the Occam's razor principle and confer it a solid theoretical and quantitative basis.

Similarly to ME, maximum parsimony becomes an NP-hard problem when trying to find the optimal tree^[5] (that is, the one with the least total character-state changes). This is why heuristics are often utilized in order to select a tree, though this does not guarantee the tree will be an optimal selection for the input dataset. This method is often used when very similar sequences are analyzed, as part of the process is locating informative sites in the sequences where a notable number of substitutions can be found. ^[6]

Maximum-parsimony criterion, which uses Hamming distance branch lengths, was shown to be statistically inconsistent in 1978. This led to an interest in statistically consistent alternatives such as ME.^[7]

Neighbor joining

Neighbor joining may be viewed as a greedy heuristic for the balanced minimum evolution (BME) criterion. Saito and Nei's 1987 NJ algorithm far predates the BME criterion of 2000. For two decades, researchers used NJ without a firm theoretical basis for why it works.^[8]

While neighbor joining shares the same underlying principle of prioritizing minimal evolutionary steps, it differs in that it is a distance method as opposed to maximum parsimony, which is a character-based method. Distance methods like neighbor joining are often simpler to implement and more efficient, which has led to its popularity for analyzing especially large datasets where computational speed is critical. Neighbor joining is a relatively fast phylogenetic tree-building method, though its worst-case time complexity can still be O(N³) without utilizing heuristic implementations to improve on this.^[9] It also considers varying rates of evolution across branches, which many other methods do not account for.

Neighbor joining also is a rather consistent method in that an input distance matrix with little to no errors will often provide an output tree with minimal inaccuracy. However, using simple distance values rather than full sequence information like in maximum parsimony does lend itself to a loss of information due to the simplification of the problem.^[10]

Maximum Likelihood

Maximum likelihood contrasts itself with Minimum Evolution in the sense of Maximum likelihood is a combination of the testing of the most likely tree to result from the data. However, due to the nature of the mathematics involved it is less accurate with smaller datasets but becomes far less biased as the sample size increases, this is due to the error rate being 1/log(n). Minimal evolution is similar but it is less accurate with very large datasets. It is similarly powerful but overall much more complicated compared to UPGMA and other options.^[11]

UPGMA

UPGMA is a clustering method. It builds a collection of clusters that are then further clustered until the maximum potential cluster is obtained. This is then worked backwards to determine the relation of the groups. It specifically uses an arithmetic mean enabling a more stable clustering. Overall while it is less powerful compared to any of the other listed comparisons it is far simpler and less complex to create. Minimal Evolution is overall more powerful but also more complicated to set up, and is also NP hard. ^[12]

Remove ads

Statistical consistency

Summarize

Perspective

The ME criterion is known to be statistically consistent whenever the branch lengths are estimated via the Ordinary Least-Squares (OLS) or via linear programming.^[4]^[13]^[14] However, as observed in Rzhetsky & Nei's article, the phylogeny having the minimum length under the OLS branch length estimation model may be characterized, in some circumstance, by negative branch lengths, which unfortunately are empty of biological meaning.^[4] To solve this drawback, Pauplin^[15] proposed to replace OLS with a new particular branch length estimation model, known as balanced basic evolution (BME). Richard Desper and Olivier Gascuel^[16] showed that the BME branch length estimation model ensures the general statistical consistency of the minimum length phylogeny as well as the non-negativity of its branch lengths, whenever the estimated evolutionary distances from taxa satisfy the triangle inequality.

Le Sy Vinh and Arndt von Haeseler^[17] have shown, by means of massive and systematic simulation experiments, that the accuracy of the ME criterion under the BME branch length estimation model is by far the highest in distance methods and not inferior to those of alternative criteria based e.g., on Maximum Likelihood or Bayesian Inference. Moreover, as shown by Daniele Catanzaro, Martin Frohn and Raffaele Pesenti,^[18] the minimum length phylogeny under the BME branch length estimation model can be interpreted as the (Pareto optimal) consensus tree between concurrent minimum entropy processes encoded by a forest of n phylogenies rooted on the n analyzed taxa. This particular information theory-based interpretation is conjectured to be shared by all distance methods in phylogenetics.

Francois Denis and Olivier Gascuel^[19] proved that the Minimum Evolution principle is not consistent in weighted least squares and generalized least squares. They showed that there was an algorithm that could be used in OLS models where all weights are equal called EDGE_LENGTHS. In this algorithm the lengths of two edges, 1u and 2u can be computed without using distances δij(i,j≠1,2). This property does not hold in WLS models or in the GLS models. Without this property the ME principle is not consistent in the WLS and GLS models.

Remove ads

Algorithmic aspects

Summarize

Perspective

The "minimum evolution problem" (MEP), in which a minimum-summed-length phylogeny is derived from a set of sequences under the ME criterion, is said to be NP-hard.^[20]^[21] The "balanced minimum evolution problem" (BMEP), which uses the newer BME criterion, is APX-hard.^[20]

A number of exact algorithms solving BMEP have been described.^[22]^[23]^[24]^[25] The best known exact algorithm^[26] remains impractical for more than a dozen taxa, even with multiprocessing.^[20] There is only one approximation algorithm with proven error bounds, published in 2012.

In practical use, BMEP is overwhelmingly implemented by heuristic search. The basic, aforementioned neighbor-joining algorithm implements a greedy version of BME.^[27]

FastME, the "state-of-the-art",^[20] starts with a rough tree then improves it using a set of topological moves such as Nearest Neighbor Interchanges (NNI). Compared to NJ, it is about as fast and more accurate.^[28]

FastME operates on the Balanced Minimum Evolution principle, which calculates tree length using a weighted linear function of all pairwise distances. The BME score for a given topology is expressed as:

L=\sum _{i<j}w_{ij}\cdot d_{ij}

where $d_{ij}$ represents the evolutionary distance between taxa $i$ and $j$ , and $w_{ij}$ is a topology-dependent weight that balances each pair’s contribution. This approach enables more accurate reconstructions than greedy algorithms like NJ.

The algorithm improves tree topology through local rearrangements, primarily Subtree Prune and Regraft (SPR) and NNI operations. At each step, it checks if a rearranged tree has a lower BME score. If so, the change is retained. This iterative refinement enables FastME to converge toward near-optimal solutions efficiently, even for large datasets.

Simplified pseudocode of FastME:

Input: Distance matrix D for n taxa
Output: Tree T minimizing BME score

1. Construct initial tree T using Neighbor Joining
2. Repeat until no improvement:
    a. For each possible SPR or NNI move:
        i. Apply the move to produce new tree T'
        ii. Compute BME score of T'
        iii. If score(T') < score(T), set T = T'
3. Return final tree T

Simulations reported by Desper and Gascuel demonstrate that FastME consistently outperforms NJ in terms of topological accuracy, particularly when evolutionary rates vary or distances deviate from strict additivity. It has also been successfully used on datasets with over 1,000 taxa.^[29]

Like most distance-based methods, BME assumes that the input distances are additive. When this assumption does not hold—due to noise, unequal rates, or other violations—the resulting trees may still be close to optimal, but accuracy can be affected. In addition to FastME, metaheuristic methods such as genetic algorithms and simulated annealing have also been used to explore tree topologies under the minimum evolution criterion, particularly for very large datasets where traditional heuristics may struggle.^[30]

Remove ads

References

Loading content...

Minimum evolution

Relationships with and comparison with other methods

Maximum parsimony

Neighbor joining

Maximum Likelihood

UPGMA

Statistical consistency

Algorithmic aspects

See also

References

Further reading

Wikiwand - on