Graph-Theoretic Algorithms to Improve Phylogenomic Analyses

Funding: U.S. National Science Foundation grant CCF-1535977 (Algorithms in the Field), collaborative with UC Berkeley.

This grant lasted from 2014 to 2020. The project outcomes report is available at this NSF page.

Participants at UIUC:

Participants at Berkeley (collaborating institution):

Project Overview:

Understanding the history of life on earth--how species evolved from their common ancestor--is a major goal of biological research. These evolutionary trees are very hard to construct with high accuracy, because nearly all of the most accurate approaches require the solution to computationally hard optimization problems. Furthermore, research has shown that the evolutionary tree for a single gene can be different from the evolutionary tree for the species, and current methods do not provide adequate accuracy on genome-scale data. As a result, large evolutionary trees, covering big portions of "The Tree of Life", are very difficult to compute with high accuracy. This project will develop methods that can enable highly accurate species tree estimation. The key approach is the development of novel divide-and-conquer strategies, whereby a dataset is divided into overlapping subsets, species trees are constructed on the subsets, and then the subset species trees are merged together into a tree on the full dataset. These approaches will be combined with powerful statistical estimation methods, to potentially transform the capability of evolutionary biologists to analyze their data. This project will also provide open source software for the new methods that are developed, and provide training in the use of the software to biologists at national meetings. The project will also contribute to interdisciplinary training for two doctoral students, one at Illinois and one at Berkeley, and course materials for computational biology will be made available online.

Publications supported by the grant to UIUC

2016

2017

2018

2019

2020

Project Software:

Summer Symposia and Software Schools:

The grant provided summer symposia and software schools to train researchers (from students through faculty) in phylogenomic estimation, which is a main focus of this grant.

Presentations:

See http://tandy.cs.illinois.edu/talks.html for the full list of talks.

2015

2016

2017

2018

2019

Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.