Information about Evolutionary Tree

Enlarge picture
Fig. 1: A speculatively rooted tree for rRNA genes
A phylogenetic tree, also called an evolutionary tree, is a tree showing the evolutionary relationships among various biological species or other entities that are believed to have a common ancestor. In a phylogenetic tree, each node with descendants represents the most recent common ancestor of the descendants, and the edge lengths in some trees correspond to time estimates. Each node is called a taxonomic unit. Internal nodes are generally called hypothetical taxonomic units (HTUs) as they cannot be directly observed.

Although the idea of a "tree of life" arose from ancient notions of a ladder-like progression from lower to higher forms of life (such as in the Great Chain of Being), Charles Darwin (1859) first illustrated and popularized the notion of an evolutionary "tree" in his seminal book The Origin of Species. Over a century later, evolutionary biologists still use tree diagrams to depict evolution because the floral analogy effectively conveys the concept that speciation occurs through the adaptive and random splitting of lineages.

Types

Enlarge picture
Fig. 1: Unrooted tree of the myosin supergene family[1]
Enlarge picture
Fig. 2: A highly resolved, automatically generated Tree Of Life, based on completely sequenced genomes [2][3].
Enlarge picture
A phylogenetic tree, showing how Eukaryota and Archaea are more closely related to each other than to Bacteria, based on Cavalier-Smith's theory of bacterial evolution.
A rooted phylogenetic tree is a directed tree (data structure) with a unique node corresponding to the (usually imputed) most recent common ancestor of all the entities at the leaves of the tree. The most common method for rooting trees is the use of an uncontroversial outgroup — close enough to allow inference from sequence or trait data, but far enough to be a clear outgroup.

Unrooted trees illustrate the relatedness of the leaf nodes without making assumptions about common ancestry. While unrooted trees can always be generated from rooted ones by simply omitting the root, a root cannot be inferred from an unrooted tree without some means of identifying ancestry; this is normally done by including an outgroup in the input data or introducing additional assumptions about the relative rates of evolution on each branch, such as an application of the molecular clock hypothesis. Figure 1 depicts an unrooted phylogenetic tree for myosin, a superfamily of proteins.[4]

Both rooted and unrooted phylogenetic trees can be either bifurcating or multifurcating, and either labeled or unlabeled. A bifurcating tree has a maximum of two descendants arising from each interior node, while a multifurcating tree may have more than two. A labeled tree has specific values assigned to its leaves, while an unlabeled tree, sometimes called a tree shape, only defines a topology. The number of possible trees for a given number of leaf nodes depends on the specific type of tree, but there are always more multifurcating than bifurcating trees, more labeled than unlabeled trees, and more rooted than unrooted trees. The last distinction is the most biologically relevant; it arises because there are many places on an unrooted tree to put the root. For labeled bifurcating trees, there are
total rooted trees and
total unrooted trees, where n represents the number of leaf nodes. The number of unrooted trees for n input sequences or species is equal to the number of rooted trees for n-1 sequences.[5]

A dendrogram is a broad term for the diagrammatic representation of a phylogenetic tree.

A cladogram is a tree formed using cladistic methods. This type of tree only represents a branching pattern, i.e., its branch lengths do not represent time.

A phylogram is a phylogenetic tree that explicitly represents number of character changes through its branch lengths.

An ultrametric tree or chronogram is a phylogenetic tree that explicitly represents evolutionary time through its branch lengths.

Construction

Phylogenetic trees among a nontrivial number of input sequences are constructed using computational phylogenetics methods. Distance-matrix methods such as neighbor-joining or UPGMA, which calculate genetic distance from multiple sequence alignments, are simplest to implement, but do not invoke an evolutionary model. Many sequence alignment methods such as ClustalW produce both sequence alignments and phylogenetic trees. Methods including maximum parsimony, maximum likelihood and Bayesian inference apply an explicit model of evolution to phylogenetics.<ref name="Felsenstein" /> Identifying the optimal tree using many of these techniques is NP-hard<ref name="Felsenstein" />, so heuristic search and optimization methods are used in combination with tree-scoring functions to identify a reasonably good tree that fits the data.

Tree-building methods can be assessed on the basis of several criteria:[6]
  • efficiency (how long does it take to compute the answer, how much memory does it need?)
  • power (does it make good use of the data, or is information being wasted?)
  • consistency (will it converge on the same answer repeatedly, if each time given different data for the same model problem?)
  • robustness (does it cope well with violations of the assumptions of the underlying model?)
  • falsifiability (does it alert us when it is not good to use, i.e. when assumptions are violated?)
Tree-building techniques have also gained the attention of mathematicians. Trees can also be built using T-theory. [7]

Limitations

Although phylogenetic trees produced on the basis of sequenced genes or genomic data in different species can provide evolutionary insight, they have important limitations. They do not necessarily (and likely do not) represent actual evolutionary history. The data on which they are based is noisy; the analysis can be confounded by horizontal gene transfer[8], hybridisation between species that were not nearest neighbors on the tree before hybridisation takes place, convergent evolution, and conserved sequences. To avoid these limitations, one method of analysis, implemented in the program PhyloCode, does not assume a tree structure.

Also, there are problems in basing the analysis on a single type of character, such as a single gene or protein or only on morphological analysis, because such trees constructed from another unrelated data source often differ from the first, and therefore great care is needed in inferring phylogenetic relationships among species. This is most true of genetic material that is subject to lateral gene transfer and recombination, where different haplotype blocks can have different histories. In general, the output tree of a phylogenetic analysis is an estimate of the character's phylogeny and not the phylogeny of the taxa from which these characters were sampled, though ideally, both should be very close.

When extinct species are included in a tree, they should always be terminal nodes, as it is unlikely that they are direct ancestors of any extant species. Scepticism must apply when extinct species are included in trees that are wholly or partly based on DNA sequence data, due to little useful "ancient DNA" is preserved for longer than 100,000 years, and except in the most unusual circumstances no DNA sequences long enough for use in phylogenetic analyses have yet been recovered from material over 1 million years old.

See also

References

1. ^ Hodge T, Cope M (2000). "A myosin family tree". J Cell Sci 113 Pt 19: 3353-4. PMID 10984423. 
2. ^ Letunic, I (2007). "Interactive Tree Of Life (iTOL): an online tool for phylogenetic tree display and annotation." (Pubmed). Bioinformatics 23(1): 127-8. 
3. ^ Ciccarelli, FD (2006). "Toward automatic reconstruction of a highly resolved tree of life." (Pubmed). Science 311(5765): 1283-7. 
4. ^ Maher BA (2002). "Uprooting the Tree of Life". The Scientist 16: 18. 
5. ^ Felsenstein J. (2004). Inferring Phylogenies Sinauer Associates: Sunderland, MA.
6. ^ Penny, D., Hendy, M. D. & M. A. Steel. 1992. Progress with methods for constructing evolutionary trees. Trends in Ecology and Evolution 7: 73-79.
7. ^ A. Dress, K. T. Huber, and V. Moulton. 2001. Metric Spaces in Pure and Applied Mathematics. Documenta Mathematica LSU 2001: 121-139
8. ^ Woese C (2002). "On the evolution of cells". Proc Natl Acad Sci U S A 99 (13): 8742-7. PMID 12077305. 

External links

Images

General

In graph theory, a tree is a graph in which any two vertices are connected by exactly one path. Alternatively, any connected graph with no cycles is a tree. A forest is a disjoint union of trees.
..... Click the link for more information.
Editing of this page by unregistered or newly registered users is currently disabled due to vandalism.
If you are prevented from editing this page, and you wish to make a change, please discuss changes on the talk page, request unprotection, log in, or .
..... Click the link for more information.
species is one of the basic units of biological classification. A species is often defined as a group of organisms capable of interbreeding and producing fertile offspring.
..... Click the link for more information.
A group of organisms is said to have common descent if they have a common ancestor. In modern biology, it is generally accepted that all living organisms on Earth are descended from a common ancestor or ancestral gene pool.
..... Click the link for more information.
The most recent common ancestor (MRCA) of any set of organisms is the most recent individual from which all organisms in the group are directly descended. The term is most frequently used of humans.
..... Click the link for more information.
time.

One view is that time is part of the fundamental structure of the universe, a dimension in which events occur in sequence, and time itself is something that can be measured.
..... Click the link for more information.
tree of life is a mystical concept, a metaphor for common descent, and a motif in various world theologies and philosophies.

Conceptual and mythological "trees of life"

Various forms of trees of life
..... Click the link for more information.
Life (Biota)

Domains and Kingdoms
  • Life on Earth (Gaeabionta)
  • Nanobes

..... Click the link for more information.
great chain of being or scala naturæ is a classical and western medieval conception of the order of the universe, whose chief characteristic is a strict hierarchical system.
..... Click the link for more information.
Charles Robert Darwin

At the age of 51, Charles Darwin had just published On the Origin of Species.
..... Click the link for more information.


Natural selection is the process by which favorable traits that are heritable become more common in successive generations of a population of reproducing organisms, and unfavorable traits that are heritable become less
..... Click the link for more information.
On the Origin of Species
by Means of Natural Selection


The title page of the 1859 edition
of On the Origin of Species
Author Charles Darwin
Country United Kingdom
Language English
Subject(s)
..... Click the link for more information.
Evolutionary biology is a sub-field of biology concerned with the origin and descent of species, as well as their change, multiplication, and diversity over time.
..... Click the link for more information.
tree structure is a way of representing the hierarchical nature of a structure in a graphical form. It is named a "tree structure" because the graph looks a bit like a tree, even though the tree is generally shown upside down compared with a real tree; that is to say with the root
..... Click the link for more information.
Editing of this page by unregistered or newly registered users is currently disabled due to vandalism.
If you are prevented from editing this page, and you wish to make a change, please discuss changes on the talk page, request unprotection, log in, or .
..... Click the link for more information.
flora (plural: floras or florae) has two meanings. The first meaning, or flora of an area or of time period, refers to all plant life occurring in an area or time period, especially the naturally occurring or indigenous plant life.
..... Click the link for more information.
Analogy is both the cognitive process of transferring information from a particular subject (the analogue or source) to another particular subject (the target), and a linguistic expression corresponding to such a process.
..... Click the link for more information.


Speciation is the evolutionary process by which new biological species arise. There are four modes of natural speciation, based on the extent to which speciating populations are geographically isolated from one another:
..... Click the link for more information.
An adaptation is a positive characteristic of an organism that has been favored by natural selection.[1] The concept is central to biology, particularly in evolutionary biology.
..... Click the link for more information.
random is used to express lack of order, purpose, cause, or predictability in non-scientific parlance. A random process is a repeating process whose outcomes follow no describable deterministic pattern, but follow a probability distribution.
..... Click the link for more information.
tree is a widely-used data structure that emulates a tree structure with a set of linked nodes.

Nodes

A node may contain a value or a condition or represents a separate data structure or a tree of its own.
..... Click the link for more information.
imputation is the substitution of some value for a missing data point or a missing component of a data point. Once all missing values have been imputed, the dataset can then be analysed using standard techniques for complete data.
..... Click the link for more information.
In computer science, a leaf node is a node of a tree data structure that has zero child nodes. Often, leaf nodes are the nodes farthest from the root node. In the graph theory tree, a leaf node is a vertex of degree 1 other than the root (except when the tree has only one vertex;
..... Click the link for more information.
outgroup. The evolutionary conclusion from this is that the outgroup branched from the parent group before the other two groups branched from each other.

Some examples, with outgroup on the right:
  • Humans, chimpanzees — gorillas

..... Click the link for more information.
The molecular clock (based on the molecular clock hypothesis (MCH)) is a technique in genetics to date when two species diverged.
..... Click the link for more information.
Myosins are a large family of motor proteins found in eukaryotic tissues. They are responsible for actin-based motility.

Structure and Function

Domains

Most myosin molecules are composed of both a head and a tail domain.
..... Click the link for more information.
A gene family is a set of genes defined by presumed homology, i.e. evidence that the genes evolved from a common ancestral gene. They generally share some biochemical activity.
..... Click the link for more information.
Proteins are large organic compounds made of amino acids arranged in a linear chain and joined together by peptide bonds between the carboxyl and amino groups of adjacent amino acid residues.
..... Click the link for more information.
Cladistics is a philosophy of classification that arranges organisms only by their order of branching in an evolutionary tree and not by their morphological similarity, in the words of Luria et al. (1981).
..... Click the link for more information.
Computational phylogenetics is the application of computational algorithms, methods and programs to phylogenetic analyses. The goal is to assemble a phylogenetic tree representing a hypothesis about the evolutionary ancestry of a set of genes, species, or other taxa.
..... Click the link for more information.


This article is copied from an article on Wikipedia.org - the free encyclopedia created and edited by online user community. The text was not checked or edited by anyone on our staff. Although the vast majority of the wikipedia encyclopedia articles provide accurate and timely information please do not assume the accuracy of any particular article. This article is distributed under the terms of GNU Free Documentation License.
Herod_Archelaus


page counter