[Pkg-isocodes-devel] ISO 3166 db
Ron Savage
ron at savage.net.au
Thu Sep 29 04:21:52 UTC 2016
Hi
I'm concerned about the discrepancy between the Debian db of
countries/subcountries (https://pkg-isocodes.alioth.debian.org/) and the
Wikipedia equivalent (https://en.wikipedia.org/wiki/ISO_3166-1).
I'm concentrating on 'has_subcounties' and 'subcountries_in_db' here.
My code to report on the Debian db says:
countries_in_db => 249.
has_subcounties => 198.
subcountries_in_db => 4847.
subcountry_types_in_db => 92.
However, I've written a Perl module
WWW::Scraper::Wikipedia::ISO3166 which downloads and scrapes the
corresponding Wikipedia pages. It reports:
countries_in_db => 249
has_subcounties => 200
subcountries_in_db => 5297
subcountry_categories_in_db => 77
subcountry_files_downloaded => 249
subcountry_info_in_db => 352
Does anyone know why there would be such a discrepancy?
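To put numbers on the gap, here is a minimal sketch that subtracts the shared counters of the two reports. The figures are copied from the reports above; the dictionary names and the `report_gaps` helper are made up for illustration and are not part of either code base. The type/category counters (92 vs 77) are left out because the two reports name them differently and they may not measure the same thing.

```python
# Figures copied from the two reports in this message.
# The names `debian`, `wikipedia` and `report_gaps` are illustrative only.
debian = {
    "countries_in_db": 249,
    "has_subcounties": 198,
    "subcountries_in_db": 4847,
}
wikipedia = {
    "countries_in_db": 249,
    "has_subcounties": 200,
    "subcountries_in_db": 5297,
}


def report_gaps(a, b):
    """Return {key: b[key] - a[key]} for every key present in both reports."""
    return {key: b[key] - a[key] for key in a if key in b}


gaps = report_gaps(debian, wikipedia)
for key, gap in gaps.items():
    print(f"{key}: {gap:+d}")
```

On these figures the Wikipedia scrape has 2 more countries with subcountries and 450 more subcountries than the Debian db, while the country counts agree.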
--
Ron Savage - savage.net.au