[Taxacom] cladistics (was: clique analysis in textbooks)
Bob Morris
morris.bob at gmail.com
Fri Aug 19 22:52:33 CDT 2011
On Fri, Aug 19, 2011 at 2:32 PM, Sergio Vargas <sevragorgia at gmail.com> wrote:
"...because clustering can be done (computationally) efficiently
whereas searching for an optimal tree using phylogenetic methods
cannot."
It's fair enough that some or even all biologists might have a usage
of "clustering" that meet all of your explanation, and perhaps even
that this should be agreed to by all of the readership of taxacom. I
wouldn't know. But in statistical pattern recognition and datamining,
not everything called clustering can be done computationally
efficiently. Many techniques those disciplines call clustering are
intractable in the sense that they are NP-hard. Informally, this means
that (with presently understood computational complexity theory),
they fundamentally scale at least exponentially with size of the data
and no algorithm can circumvent that, just as for optimal tree
induction problems. So I can only understand your text as meaning
"...because clustering as meant by all practicing phylogeneticists can
be done (computationally) efficiently...", and that is why you are
prepared to subsequently say that the rest of your explanation "[...]
is so basic I cannot believe I am explaining it".
I do wonder a little whether in fact all practicing phylogeneticist
readers of taxacom understand by "clustering" only tractable
algorithms.
Bob Morris
Robert A. Morris
Emeritus Professor of Computer Science
UMASS-Boston
100 Morrissey Blvd
Boston, MA 02125-3390
IT Staff
Filtered Push Project
Harvard University Herbaria
email: morris.bob at gmail.com
web: http://efg.cs.umb.edu/
web: http://etaxonomy.org/mw/FilteredPush
http://www.cs.umb.edu/~ram
On Fri, Aug 19, 2011 at 2:32 PM, Sergio Vargas <sevragorgia at gmail.com> wrote:
> Hi,
>
> >Clustering is clustering is clustering. Group some things together and
> you are clustering - however it is done.
>
> no you are not. Grouping is not clustering, there are many ways to group
> things together not involving clustering. Maximum parsimony, maximum
> likelihood and bayesian analysis are not clustering. It is simply
> incorrect to call to these methods clustering. When you run either of
> the above analyses you are not clustering, despite the result being
> something similar to a cluster. If you could reduce phylogenetic
> inference to clustering everything would be so easy (computationally
> speaking) because clustering can be done (computationally) efficiently
> whereas searching for an optimal tree using phylogenetic methods cannot.
> Taxa are only "clustered" (randomly or sequentially) together to build
> the first tree, afterwards entire topologies are evaluated, taxa are not
> clustered. This is so basic I cannot believe I am explaining it.
>
> sergio
>
> --
> Sergio Vargas R., M.Sc.
> Dept. of Earth& Environmental Sciences
> Palaeontology& Geobiology
> Ludwig-Maximilians-Universität München
> Richard-Wagner-Str. 10
> 80333 München
> Germany
> tel. +49 89 2180 17929
> s.vargas at lrz.uni-muenchen.de
> sevra at marinemolecularevolution.org
>
> check my webpage:
> http://www.marinemolecularevolution.org
>
> check my research ID:
> http://www.researcherid.com/rid/A-5678-2011
>
>
> _______________________________________________
>
> Taxacom Mailing List
> Taxacom at mailman.nhm.ku.edu
> http://mailman.nhm.ku.edu/mailman/listinfo/taxacom
>
> The Taxacom archive going back to 1992 may be searched with either of these methods:
>
> (1) by visiting http://taxacom.markmail.org
>
> (2) a Google search specified as: site:mailman.nhm.ku.edu/pipermail/taxacom your search terms here
>
--
Robert A. Morris
Emeritus Professor of Computer Science
UMASS-Boston
100 Morrissey Blvd
Boston, MA 02125-3390
IT Staff
Filtered Push Project
Department of Organismal and Evolutionary Biology
Harvard University
email: morris.bob at gmail.com
web: http://efg.cs.umb.edu/
web: http://etaxonomy.org/mw/FilteredPush
http://www.cs.umb.edu/~ram
phone (+1) 857 222 7992 (mobile)
More information about the Taxacom
mailing list