Skip to content

Geonames DB does not always has hierarchy information #109

@benoit74

Description

@benoit74

#107 showed that Geonames DB which we use to fetch places and their hierarchy relationships sometimes miss hierarchy information

#108 applied on patch on that to at least create a fully functional ZIM with only few direct search entry missing.

For india for instance:

[maps2zim::MainThread::2026-04-28 13:21:16,405] INFO:  Processing geonames allCountries entries
[maps2zim::MainThread::2026-04-28 13:21:44,292] INFO:  Loaded 26166 unique place names for a total of 29256 places
[maps2zim::MainThread::2026-04-28 13:21:44,299] INFO:  Parsing hierarchy file
[maps2zim::MainThread::2026-04-28 13:21:44,541] INFO:  Parsing country info file
[maps2zim::MainThread::2026-04-28 13:21:44,568] INFO:  Progress 10 / 26176
[maps2zim::MainThread::2026-04-28 13:21:46,743] WARNING:Not adding duplicate place Gongri in title search: 8578975,8746090
[maps2zim::MainThread::2026-04-28 13:21:46,754] WARNING:Not adding duplicate place Najin in title search: 8741156,8746080
[maps2zim::MainThread::2026-04-28 13:21:46,757] WARNING:Not adding duplicate place Xiongba in title search: 8745930,8745990
[maps2zim::MainThread::2026-04-28 13:21:46,758] WARNING:Not adding duplicate place Yare in title search: 8745931,8745943
[maps2zim::MainThread::2026-04-28 13:21:46,761] WARNING:Not adding duplicate place Pianji in title search: 8745940,8745988
[maps2zim::MainThread::2026-04-28 13:21:46,767] WARNING:Not adding duplicate place Quluo in title search: 8745959,8745992
[maps2zim::MainThread::2026-04-28 13:21:46,778] WARNING:Not adding duplicate place Gangga in title search: 8745979,8746134
[maps2zim::MainThread::2026-04-28 13:21:46,789] WARNING:Not adding duplicate place Xiongmei in title search: 8746013,8746050
[maps2zim::MainThread::2026-04-28 13:21:46,790] WARNING:Not adding duplicate place Zhaxigang in title search: 8746015,8746020,8746097
[maps2zim::MainThread::2026-04-28 13:21:46,817] WARNING:Not adding duplicate place Cuoduo in title search: 8746107,8746136
[maps2zim::MainThread::2026-04-28 13:21:54,568] INFO:  Progress 23904 / 26176
[maps2zim::MainThread::2026-04-28 13:21:55,204] WARNING:Not adding duplicate place Siraha (district) in title search: 12095581,12097001
[maps2zim::MainThread::2026-04-28 13:21:55,213] WARNING:Not adding duplicate place Meringden in title search: 12095563,12095997
[maps2zim::MainThread::2026-04-28 13:21:55,234] WARNING:Not adding duplicate place Tatopani in title search: 12095614,12095645
[maps2zim::MainThread::2026-04-28 13:21:55,340] WARNING:Not adding duplicate place Sammarimai in title search: 12095951,12096023
[maps2zim::MainThread::2026-04-28 13:21:55,356] WARNING:Not adding duplicate place Makalu in title search: 12096006,12096223
[maps2zim::MainThread::2026-04-28 13:21:55,394] WARNING:Not adding duplicate place Bansagadhi in title search: 12096139,12096142
[maps2zim::MainThread::2026-04-28 13:21:55,512] INFO:  Added 29223 redirects and 2027 disambiguation pages

E.g. for Siraha search these are the results:

Image

And this is the disambiguation page:

Image

I have no suggestion ATM besides:

  • living with a moderately severe issue
  • switch to another place DB (but it is not like there are many on the free market, I don't have one to suggest ATM)

@kelson42 WDYT? Is this a showstopper for the release in your perspective?

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workingquestionFurther information is requested

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions