GeoNames Forum
Duplicate hotel toponyms - cleanup process?
Forum Index -> Discussion of GeoNames Toponyms
hendersonmj



Joined: 27/08/2018 16:14:28
Messages: 2

I am building a proof-of-concept application for hotel property market area analysis and have been using geonames.org US.txt data as one of the foundation elements.

The hotel data seems decent, but there are some low-hanging opportunities for improvement. I already suggested the Hyatt Place (rebranded from Amerisuites) switch and appreciate the quick response.

The biggest concern at the moment is the significant number of duplicate hotel toponyms. By my rough estimate, about 5% of the 56,000 US hotel toponyms (on the order of 2,800 records) are duplicates.

Suggestion: I can fairly easily pull the toponyms that are less than 0.05 miles apart and have similar names, then pick one record of each pair to keep and ask you to delete the other. Because hotels change names frequently, I would first verify the current name for any duplicate site that carries an outdated name.
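
To make the matching step concrete, here is a minimal sketch of how such candidate pairs could be pulled. It assumes the hotel rows have already been loaded into (geonameid, name, latitude, longitude) tuples. The function names, the sample records, and the 0.8 name-similarity cutoff are illustrative assumptions, not anything GeoNames prescribes; only the 0.05 mile distance comes from the suggestion above.

```python
import math
from difflib import SequenceMatcher
from itertools import combinations

EARTH_RADIUS_MILES = 3958.8  # mean Earth radius


def haversine_miles(lat1, lon1, lat2, lon2):
    """Great-circle distance in miles between two lat/lon points (degrees)."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_MILES * math.asin(math.sqrt(a))


def name_similarity(a, b):
    """Case-insensitive similarity ratio in [0, 1] from difflib."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def candidate_duplicates(hotels, max_miles=0.05, min_similarity=0.8):
    """Return (id_a, id_b) pairs that are close together and similarly named.

    The 0.8 similarity cutoff is an assumed starting point to be tuned by hand.
    """
    pairs = []
    for a, b in combinations(hotels, 2):
        id_a, name_a, lat_a, lon_a = a
        id_b, name_b, lat_b, lon_b = b
        if haversine_miles(lat_a, lon_a, lat_b, lon_b) >= max_miles:
            continue
        if name_similarity(name_a, name_b) >= min_similarity:
            pairs.append((id_a, id_b))
    return pairs


# Hypothetical sample records (made-up ids and coordinates).
hotels = [
    (101, "Hampton Inn Austin North", 30.4000, -97.7000),
    (102, "Hampton Inn Austin-North", 30.4001, -97.7001),  # near 101, similar name
    (103, "Hampton Inn Austin North", 30.5000, -97.7000),  # same name, miles away
    (104, "La Quinta Inn", 30.4000, -97.7000),             # same spot, different name
]
print(candidate_duplicates(hotels))
```

Comparing every pair is fine for a sample but far too slow for all 56,000 hotels; a real run would first bucket records by a coarse latitude/longitude grid and compare only within neighboring cells. Name similarity alone will also miss rebrands such as Amerisuites to Hyatt Place, so pairs that are close but differently named still need manual review.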

What's the best format for providing bulk data for cleanup?

You can contact me directly at michael dot henderson at anaplan dot com.
 