Phylogenetics - Distance-Based Methods

1732 days ago, 609 views
PowerPoint PPT Presentation
Phylogenetics. Endeavors to surmise the developmental history of a gathering of life forms or successions of nucleic acids or proteinsPhylogenetic strategies can be utilized for the investigation of transformative connections between types of living beings and also genesAttempt to remake transformative ancestorsEstimate time of dissimilarity from precursor.

Presentation Transcript

Slide 1

Phylogenetics - Distance-Based Methods CIS 667 March 11, 2204

Slide 2

Phylogenetics Attempts to surmise the transformative history of a gathering of life forms or successions of nucleic acids or proteins Phylogenetic strategies can be utilized for the investigation of developmental connections between types of living beings and in addition qualities Attempt to remake developmental predecessors Estimate time of disparity from progenitor

Slide 3

Phylogenetic Trees We can utilize phylogenetic trees to represent the transformative connections among gatherings of species or qualities Leaf hubs of the tree are the species or qualities we are looking at, inside hubs are induced regular precursors

Slide 4

Phylogenetic Trees

Slide 5

History Taxonomists utilized life systems and physiology to amass and arrange life forms Morphological elements like nearness of plumes or number of legs When protein sequencing, and later DNA sequencing got to be distinctly normal, amino corrosive and DNA groupings turned into the basic approach to contruct trees

Slide 6

Phylogenetic Tree built from aa arrangements of Cytochrome C protein

Slide 7

The Big Picture Determine the species or qualities to be considered Acquire homologous grouping information Use various succession arrangement programming like ClustalW to adjust Clean up information by hand Use phylogenetic examination programming like Phylip in view of procedures we will think about Verify tentatively

Slide 8

Phylogenetics Can be utilized to take care of various intriguing issues Forensics HIV infection changes quickly Predicting advancement of flu infections Predicting elements of uncharacterized qualities - ortholog location Drug revelation Vaccine improvement Target deduced basic precursor

Slide 9

Types of Data Two classes Numerical information Distance between items E.g.evolutionary remove between two species Usually got from arrangement information Character information Each character has a limited number of states E.g. number or legs = 1, 2, 4 DNA = {A, C, T, G}

Slide 10

Phylogenetic Trees are made out of hubs and branches Terminal or leaf hubs compare to a quality or life form for which information has been gathered Internal hubs generally speak to an induced normal precursor that offered ascend to two free heredities at some point previously

Slide 11

Rooted and Unrooted Trees Some trees make an induction about a typical predecessor and the bearing of advancement and some don't First sort is known as an established tree and has a solitary hub assigned as root which is the regular progenitor Second sort is called an unrooted tree Specifies just relationship amongst hubs and says nothing in regards to heading of development

Slide 12

R B C Time E An A B C D E D Rooted and Unrooted Trees

Slide 13

Rooted and Unrooted Trees Roots can more often than not be allocated to unrooted trees utilizing an outgroup Species unambiguously isolated the soonest from others being examined E.g. primates if there should arise an occurrence of people and gorillas For three species there are 3 conceivable established trees, yet just a single conceivable unrooted tree

Slide 14

Rooted and Unrooted Trees In actuality the quantities of established ( N R ) and unrooted trees ( N U ) for n species is N R = (2n - 3)!/2 n-2 (n - 2)! N U = (2n - 5)!/2 n-3 (n - 3)!

Slide 15

Rooting Trees can be established by utilizing the outgroup technique beforehand specified, or by putting the root halfway between the two most far off species as controlled by branch length Branch length measures the measure of contrast that happened along a branch Assumes the animal types are developing in a clock-like way

Slide 16

Rooting a Tree

Slide 17

More Tree Terminology Structure of a phylogenetic tree can be spoken to in Newick organize utilizing settled enclosures (((B, C), (D, E)), An) If we need information to tell in which arrange at least two free heredities happened previously, the tree might be multifurcating (more than two predecessors) else, it is bifurcating (precisely two progenitors for every inside hub)

Slide 18

Character and Distance Data Character-based strategies utilize adjusted DNA or protein groupings straightforwardly for tree derivation Species A ATCGAATCGTTCCGGA Species B ATCCAATAGTTCCGGA Species C AACGAATCCTACCGGT Species D ATCGTTTCCAACCGCT Species E ATAGATTCGTTCGGGA

Slide 19

Character and Distance Data Distance-based strategies must change the succession information into a pairwise likeness grid for use amid tree surmising

Slide 20

Distance-Based Methods Given such an info lattice we need to discover an edge-weighted tree where the leafs of the tree compare to the species and the separations measured between two leaves relates to the relating network an incentive for the leaves

Slide 21

UPGMA (Unweighted Pair Group Method with Arithmetic mean) is the most seasoned separation framework strategy Uses a separation framework speaking to quantify of hereditary separation between sets of species being viewed as Clusters the two nearest species Compute new separation network utilizing number juggling intend to first bunch Repeat until all species gathered

Slide 22


Slide 23

Estimation of Branch Length Scaled trees, where the length of the branches compare to how much successions have wandered are called cladograms If rates of advancement are thought to be steady in all genealogies then inner hubs are set at equivalent separations from each of the species they offer ascent to on a bifurcating tree (UPGMA ex.)

Slide 24

UPGMA So UPGMA is exceptionally basic and creates established trees, in any case… Major shortcoming is that the calculation expect that rates of advancement are the same among various ancestries This does not fit existing natural information, so presumably shouldn't utilize UPGMA to construct phylogenetic trees

Slide 25

Transformed Distance Method Several separation framework based contrasting options to UPGMA permit distinctive rates of development inside various genealogies Oldest and easiest is the changed separation technique which exploits an outgroup Other ancestries just advance independently from each other after they veered and since the outgroup separated first we can utilize it as an edge of reference to look at how much alternate ancestries developed by observing when they separated

Slide 26

Neighbor's Relation Method One variation of UPGMA tries to match species so as to limit the aggregate of the branch lengths On an established tree, sets of species isolated from each other by just a single hub are called neighbors We have vital connections between neighbors of a phylogenetic tree with four hubs

Slide 27

Neighbor's Relation Method The accompanying hold for this tree d AC + d BD = d AD + d BC = a + b + c + d + 2 e = d AB + d CD + 2 e d AB + d CD < d AC + d BD d AB + d CD < d AD + d BC A C a c e b d B D

Slide 28

Neighbor's Relation Method Consider all conceivable pairwise courses of action of four species, and figure out which fulfills the four point condition (set of 2 imbalances) This procedure can be iterated to produce a total tree, however the procedure is unfeasible for extensive arrangements of species

Slide 29

b c an a c d e d e b Neighbor-Joining Methods Other neighborliness methodologies are accessible also Neighbor-joining techniques begin with all species masterminded in a star tree

Slide 30

Neighbor-Joining Methods The combine of hubs hauled out (assembled) at every cycle are picked so that the aggregate length of the branches on the tree is limited After a couple of hubs is hauled out, it shapes a bunch in the tree and is incorporated into further adjusts of emphasis (and another separation lattice is created) The tree's aggregate branch length is ascertained as: Q 12 = ( N - 2) d 12 -  ( d 1i )-  ( d 2i )