<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0">
	<channel>
		<title><![CDATA[Latest posts for the topic "superfluous records of cities - 55 instances of Krajan"]]></title>
		<link>http://forum.geonames.org/gforum/posts/list/4.page</link>
		<description><![CDATA[Latest messages posted in the topic "superfluous records of cities - 55 instances of Krajan"]]></description>
		<generator>JForum - http://www.jforum.net</generator>
			<item>
				<title>superfluous records of cities - 55 instances of Krajan</title>
				<description><![CDATA[ Hi,

I've been analysing my data and have found some extreme cases of superfluous repetition. For example, for Krajan in Indonesia there are 55 records in cities1000, all almost identical except for the coordinates, which vary slightly. All of them are marked as PPLA4.

In total, around 2000 cities share a duplicated combination of ascii_name, admin1 code and country code, with anywhere from 2 to 55 repetitions each.
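
A count like this can be reproduced roughly as follows. This is only a minimal sketch: it assumes the tab-separated layout of the GeoNames dump files, with asciiname, country code and admin1 code at zero-based columns 2, 8 and 10, and the function name `duplicate_groups` is made up for illustration.

```python
import csv
from collections import Counter

# Zero-based column positions, assumed from the GeoNames dump layout.
ASCIINAME, COUNTRY, ADMIN1 = 2, 8, 10


def duplicate_groups(lines):
    """Count records per (asciiname, admin1 code, country code).

    Returns only the combinations that occur two or more times.
    """
    # The dump is plain tab-separated text without quoting.
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    counts = Counter(
        (row[ASCIINAME], row[ADMIN1], row[COUNTRY])
        for row in reader
        if len(row) > ADMIN1  # skip blank or truncated lines
    )
    return {key: n for key, n in counts.items() if n > 1}


# Possible usage against a local copy of the dump (path is an assumption):
# with open("cities1000.txt", encoding="utf-8", newline="") as f:
#     groups = duplicate_groups(f)
#     for key, n in sorted(groups.items(), key=lambda kv: -kv[1]):
#         print(n, *key)
```

The result only says that the names and codes coincide; it does not by itself show that the records describe the same place.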

What's the best way of dealing with such instances? Is there a procedure for reporting them, or do we have to remove them manually?

Cheers,
Shuku]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/3159.page#10604</guid>
				<link>http://forum.geonames.org/gforum/posts/list/3159.page#10604</link>
				<pubDate><![CDATA[Fri, 18 Nov 2011 22:08:05]]> GMT</pubDate>
				<author><![CDATA[ shukuboy]]></author>
			</item>
			<item>
				<title>Re:superfluous records of cities - 55 instances of Krajan</title>
				<description><![CDATA[ Hi Shuku

Just because the names are identical does not mean the records are duplicates. As you mention, the coordinates are different, so they refer to different locations.

Best

Marc]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/3159.page#11005</guid>
				<link>http://forum.geonames.org/gforum/posts/list/3159.page#11005</link>
				<pubDate><![CDATA[Sun, 18 Dec 2011 06:13:52]]> GMT</pubDate>
				<author><![CDATA[ marc]]></author>
			</item>
			<item>
				<title>Re:superfluous records of cities - 55 instances of Krajan</title>
				<description><![CDATA[ The only solution I have found is to use 'GROUP BY asciiname' in your query.
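
A rough sketch of such a query, assuming the dump has been loaded into a database. The in-memory SQLite table `geoname`, its reduced set of columns and the sample rows are all made up for illustration. Grouping on asciiname alone would also merge places in different countries or regions that happen to share a name, so the sketch groups on country and admin1 code as well.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE geoname (geonameid INTEGER, asciiname TEXT, country TEXT, "
    "admin1 TEXT, latitude REAL, longitude REAL)"
)

# Made-up sample rows, for illustration only.
conn.executemany(
    "INSERT INTO geoname VALUES (?, ?, ?, ?, ?, ?)",
    [
        (1, "Krajan", "ID", "08", -7.10, 111.10),
        (2, "Krajan", "ID", "08", -7.20, 111.30),
        (3, "Krajan", "ID", "07", -7.50, 110.20),
        (4, "Hall", "AU", "01", -35.17, 149.07),
    ],
)

# One row per (asciiname, country, admin1). MIN(geonameid) picks one
# representative record; COUNT(*) shows how many records were collapsed.
rows = conn.execute(
    "SELECT asciiname, country, admin1, MIN(geonameid), COUNT(*) "
    "FROM geoname GROUP BY asciiname, country, admin1 "
    "ORDER BY asciiname, country, admin1"
).fetchall()

for row in rows:
    print(row)
```

This returns one row per group, but records with different coordinates may be different places, so collapsing them this way can discard real locations.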

Lots of nice data, but lots of duplicates and little thought given to them. For example, if I try to look up cities/towns for the Australian Capital Territory, I am faced with all the SUBURBS of Canberra, which is next to useless for practical applications.

I should be able to get just: Canberra, Hall.]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/3159.page#11984</guid>
				<link>http://forum.geonames.org/gforum/posts/list/3159.page#11984</link>
				<pubDate><![CDATA[Tue, 30 Oct 2012 07:59:34]]> GMT</pubDate>
				<author><![CDATA[ Backslider]]></author>
			</item>
	</channel>
</rss>