<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0">
	<channel>
		<title><![CDATA[Latest posts for the topic "San Francisco, Cuba"]]></title>
		<link>http://forum.geonames.org/gforum/posts/list/4.page</link>
		<description><![CDATA[Latest messages posted in the topic "San Francisco, Cuba"]]></description>
		<generator>JForum - http://www.jforum.net</generator>
			<item>
				<title>San Francisco, Cuba</title>
				<description><![CDATA[ Try searching for San Francisco, Cuba and tell me what in the world went on there.  :)  Did some input script mess up or was the original data that dirty?]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/1592.page#6611</guid>
				<link>http://forum.geonames.org/gforum/posts/list/1592.page#6611</link>
				<pubDate><![CDATA[Thu, 5 Nov 2009 00:01:53]]> GMT</pubDate>
				<author><![CDATA[ rlevering]]></author>
			</item>
			<item>
				<title>Re:San Francisco, Cuba</title>
				<description><![CDATA[ On this note, I just ran a duplicate detector on the DB...where I defined a duplicate to be anything with the same name, feature code, and all the same hierarchical breakdown (country,adm1,adm2).  There are a very large number of duplicates in the database.  I'm sure some of these may actually be different places, but the large majority that I saw were definitely import errors.  Is there a strategy to handle these?]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/1592.page#6612</guid>
				<link>http://forum.geonames.org/gforum/posts/list/1592.page#6612</link>
				<pubDate><![CDATA[Thu, 5 Nov 2009 00:30:24]]> GMT</pubDate>
				<author><![CDATA[ rlevering]]></author>
			</item>
			<item>
				<title>Re:San Francisco, Cuba</title>
				<description><![CDATA[ This for a change really looks like the same toponym. The toponym is referring to an area and there are a lot of markers covering the entire area. I could imagine that one of the input sources was aggregating small subsets (like maps) and the toponym was on each of the map once and ended up n-times. Please feel free to clean it up, be careful with automated scripts. There are a couple of threads of users complaining about duplicates, but usually it is absolutely not clear whether they really are duplicates, often they are clearly not duplicates.  There is just no law that makes place names unique even though it would make life easier for application developers.

Best

Marc]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/1592.page#6620</guid>
				<link>http://forum.geonames.org/gforum/posts/list/1592.page#6620</link>
				<pubDate><![CDATA[Thu, 5 Nov 2009 20:35:35]]> GMT</pubDate>
				<author><![CDATA[ marc]]></author>
			</item>
	</channel>
</rss>