GeoNames Home | Postal Codes | Download / Webservice | About 

GeoNames Forum
  [Search] Search   [Recent Topics] Recent Topics   [Groups] Back to home page 
[Register] Register / 
[Login] Login 
superfluous records of cities - 55 instances of Krajan  XML
Forum Index -> General
Author Message
shukuboy



Joined: 15/11/2011 22:27:32
Messages: 2
Offline

Hi,

I've been analysing my data and have found some extreme cases of superfluous repetition, for example in case of Krajan in Indonesia, there are 55 records in the cities1000, which all are almost identical except for the geo-coding coordinates which vary slightly. All records are marked as PPLA4.

In total there are around 2000 cities with duplicated ascii_name, admin1 code and country code combination, which range from 55 repetitions to 2 repetitions.

What's the best way of taking care of such instances. Are there any procedures for reporting them or do we have to remove them manually ?

Cheers,
Shuku
marc



Joined: 08/12/2005 07:39:47
Messages: 4501
Offline

Hi Shuku

Just because the name is identical does not mean they are duplicates. As you mention the coordinates are different and they refer therefore to different locations.

Best

Marc

[WWW]
Backslider



Joined: 26/10/2012 02:22:56
Messages: 5
Offline

The only solution I have found is to use 'GROUP BY asciiname' in your query.

Lots of nice data, but lots of duplicates and poor thought given to it. For example, if I try to look up cities/towns for Australian Capital Territory, I am faced with all the SUBURBS of Canberra - this is next to useless for practical application.

I should be able to get just: Canberra, Hall.
 
Forum Index -> General
Go to:   
Powered by JForum 2.1.5 © JForum Team