written by: Patrick Leary
Last modified: April 18, 2009, 11:04 pm
This script reads a set of text files that are part of a NameBank data dump containing distinct genus and species names. It provides a summary analysis of the frequency of one, two, and three character suffixes.
These values can assist in determining the probability that an unknown string may represent a scientific name.