[Taxacom] globalnames?
Tony.Rees at csiro.au
Tue Sep 15 15:05:20 CDT 2009
Dear Paul,
I will anticipate any response from the "real" GNA/GNI developers by pointing out that, in their concept, this content (the Global Names Index) is the "raw" material from which "cleaned" lists such as you describe can be assembled. That will certainly happen, and the result will be much more pleasant to look at (cue David Remsen and Rich Pyle...). In addition, the application of algorithms, followed later by expert review, to reconcile spelling errors is happening as we speak: the first large-scale trials (18.5 million names) were run yesterday, with preliminary, if still slightly imperfect, results appearing on the results page when you click on any name instance...
Onwards and upwards (or perhaps: forwards in all directions...)
Regards - Tony
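
As a rough illustration of the kind of algorithmic spelling reconciliation mentioned above, here is a minimal Python sketch. It is not the GNA/GNI method, which is not described in this message. The reference list, the 0.85 similarity cutoff and the function name best_match are all assumptions made for the example. It uses only difflib from the standard library, and anything it flags would still need the expert review mentioned above.

```python
import difflib

# Hypothetical reference list of accepted spellings (illustrative only).
REFERENCE_NAMES = ["Quercus robur", "Quercus alba"]


def best_match(name, reference=REFERENCE_NAMES, cutoff=0.85):
    """Return the closest reference spelling, or None if nothing is close enough.

    The cutoff is an assumed value; a real trial would tune it and pass
    the candidates to a human reviewer rather than accept them blindly.
    """
    matches = difflib.get_close_matches(name, reference, n=1, cutoff=cutoff)
    return matches[0] if matches else None


# A one-letter misspelling is paired with its likely correct form.
suggestion = best_match("Quercus robor")

# A name with no near neighbour in the reference list is left unmatched.
no_suggestion = best_match("Pinus sylvestris")
```

Plain string similarity of this sort knows nothing about nomenclature, so it can only propose candidates; it cannot tell a misspelling from a distinct, validly published name that happens to look similar.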
________________________________________
From: taxacom-bounces at mailman.nhm.ku.edu [taxacom-bounces at mailman.nhm.ku.edu] On Behalf Of dipteryx at freeler.nl [dipteryx at freeler.nl]
Sent: Tuesday, 15 September 2009 11:05 PM
To: taxacom at mailman.nhm.ku.edu
Subject: Re: [Taxacom] globalnames?
***
I see that many of the names appear multiple times (five records
for one name is nothing special, and an instance with over a dozen
does not take much searching to find). Apparently this is because
they are filed by text string (name plus authorship). A lot of
duplication could be avoided by the straightforward policy of filing
by scientific name (leading to a reduction by two thirds or so), with
a further reduction of the chaos possible by correcting spelling errors.
Paul
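
To make the difference between filing by text string and filing by scientific name concrete, here is a minimal Python sketch. It is an illustration only, not how the Global Names Index works. The function names, the sample records and the rule used to strip authorship (keep the genus plus any following all-lowercase words) are assumptions. That rule is naive: it would, for instance, mistake a lowercase author particle such as "de" or "ex" for an epithet.

```python
import re
from collections import defaultdict

# Assumed heuristic: epithets are all-lowercase words (hyphens allowed).
EPITHET = re.compile(r"^[a-z-]+$")


def canonical_name(name_string):
    """Strip authorship by keeping the genus and the lowercase epithets after it."""
    tokens = name_string.split()
    if not tokens:
        return ""
    parts = [tokens[0]]  # genus
    for token in tokens[1:]:
        if not EPITHET.match(token):
            break  # first token that looks like authorship, a year or a rank marker
        parts.append(token)
    return " ".join(parts)


def group_by_canonical(name_strings):
    """File each name string under its scientific name without authorship."""
    groups = defaultdict(list)
    for name_string in name_strings:
        groups[canonical_name(name_string)].append(name_string)
    return dict(groups)


# Made-up sample: four distinct text strings, but only two scientific names.
records = [
    "Quercus robur L.",
    "Quercus robur Linnaeus",
    "Quercus robur L., 1753",
    "Quercus alba L.",
]
groups = group_by_canonical(records)
```

Here the four text strings collapse to two entries, with the three authorship variants of Quercus robur filed together. Spelling variants of the name itself would still end up in separate groups, which is where the further correction step comes in.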