after I now have such a very interesting data base of 41245
cities/towns on earth, it is very easy and most informative to play a few games with it.
As a start, I used my dayly "professional math tool", Maple9, and with only 2 commands
Code: Select all
people:=evalf(map(log10,readdata("log.txt",integer))):
histogram(people,area=count,axes=boxed,labels=["log10(population)","number of cities"],labeldirections=[horizontal,vertical],OPTS);
I generated this nice histogram from the file 'log.txt', containing the populations in each of the 41245 cities/towns. This file I just wrote out with Perl, of course.
What does it tell us?
On the horizontal axis, I plotted the decadic logarithm of the
populations (binned within certain ranges) and vertically, you can read off the corresponding numbers of cities/towns. So on the horizontal axis,
2 = 100, 4 = 10000 6 = 1000000 people!
You see from the plot that most cities (14000+) on earth (or better in that database!) host between 10000 and 16000 people, while only a "few" (412) host around one million of people.
AHA!
Why did I do that plot besides mere curiosity??
Of course, to optimize the calculation of an importance parameter for
the location labels from this distribution....This is in progress!
Bye Fridger