Wednesday, January 31, 2007

When phylogenetic names would be useful

To avoid being charged with being consistent (unlikely, I know), despite being underwhelmed by phylogenetic names in the context of TreeBASE (see conversation with David Marjanović in the previous post), I think they could be very useful in annotating phylogenetic trees. One of the things that impresses me most about Google Earth is the community that has developed around this tool (take a look at the latest issue of Sightseer). Lots of people are contributing content to Google Earth, and I think this is driven in part by the ease of creating the KML files that Google Earth uses, and the ease of referring to a geographic location (i.e., latitude and longitude).

Imagine having a similar tool for phylogenetics, where people could add annotations (such as "wings developed here", "oldest fossil found at this site", "pollination mode changed here", "change in base composition took place here", etc.). A bit like MacClade on steroids, or a community driven version of the TaxonTree tool developed by the University of Maryland's Human-Computer Interaction Lab (see their page on Biodiversity Informatics Visualization).

Leaving aside the issue of file format, we'd need a way of unambiguously locating a point on a tree. An obvious solution is to use least common ancestors (same concept as most recent common ancestors), as used in phylogenetic nomenclature (although not in apomorphy-based definitions, which strike me as a disaster waiting to happen).

It's in this sort of context that I think phylogenetic names show great promise. For some nice examples of using LCA's to identify nodes, take a look at the mor server (seemingly down at present, but described in a paper by David Hibbett and colleagues doi:10.1080/10635150590947104), and Michael Sanderson's program r8s. The mor package uses phylogenetic definitions of names to locate the corresponding groups in a large phylogenetic tree, and r8s has the MRCA command to attach age constraints to nodes in a tree.