GeoNames Forum
Duplicate cities problem
Forum Index -> Discussion of GeoNames Toponyms
mvanwyk



Joined: 24/05/2010 14:32:25
Messages: 15
Offline

I've been going through the database and have found what appear to be a handful of duplicate cities. I tried to delete them through the GeoNames web interface, but it says that I don't have sufficient privileges.

So basically, how do I clean these entries up?
marc



Joined: 08/12/2005 07:39:47
Messages: 3993
Offline

are you sure they are duplicates?


Marc

mvanwyk



Joined: 24/05/2010 14:32:25
Messages: 15
Offline

In most cases I'm pretty confident.

Here is a specific example. There are 2 cities called Mehmand Chak in the Punjab region of Pakistan. Both have populations of 5,000. Mehmand Chak (6940901) sits precisely on the Google Maps label for Mehmand Chak, and satellite imagery clearly shows a small city there. According to Google Maps, the other one (6940900) is in an empty area about 10km NW of the first one (6940901). It seems unlikely to me that there would be 2 cities called Mehmand Chak in the same province, within 10km of each other, with the same population, especially when one of them is in an area where the satellite imagery is too old to show a city. Most of my suspected duplicates are like this.
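
For anyone who wants to automate this kind of check, here is a minimal Python sketch of the heuristic described above: group places by name and administrative region, then flag pairs that lie within some distance of each other and note whether their populations match. The record layout (`id`, `name`, `country`, `admin1`, `lat`, `lon`, `population`), the 50km threshold, and the sample coordinates are all assumptions made up for illustration; they are not real GeoNames data.

```python
import math
from collections import defaultdict
from itertools import combinations


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km on a sphere of radius 6371 km."""
    r = 6371.0
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    # Clamp to 1.0 to guard against floating-point overshoot.
    return 2 * r * math.asin(math.sqrt(min(1.0, a)))


def candidate_duplicates(places, max_km=50.0):
    """Return (id_a, id_b, distance_km, same_population) for same-name pairs.

    Only places sharing name, country and admin1 code are compared.
    The max_km threshold is an arbitrary assumption, not a GeoNames rule.
    """
    groups = defaultdict(list)
    for p in places:
        groups[(p["name"], p["country"], p["admin1"])].append(p)

    pairs = []
    for group in groups.values():
        for a, b in combinations(group, 2):
            d = haversine_km(a["lat"], a["lon"], b["lat"], b["lon"])
            if d <= max_km:
                pairs.append((a["id"], b["id"], d, a["population"] == b["population"]))
    return pairs


# Invented sample records (hypothetical ids and coordinates).
sample_places = [
    {"id": 1, "name": "Sampletown", "country": "XX", "admin1": "01",
     "lat": 32.00, "lon": 74.00, "population": 5000},
    {"id": 2, "name": "Sampletown", "country": "XX", "admin1": "01",
     "lat": 32.05, "lon": 73.95, "population": 5000},
    {"id": 3, "name": "Otherville", "country": "XX", "admin1": "01",
     "lat": 32.01, "lon": 74.01, "population": 1200},
]

for id_a, id_b, dist, same_pop in candidate_duplicates(sample_places):
    print(f"{id_a} / {id_b}: {dist:.1f} km apart, same population: {same_pop}")
```

A flagged pair is only a candidate. As the examples in this thread show, each one still needs a look at the map before anything is deleted.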

In a slightly different category are a few such as Pattani, Changwat Pattani, Thailand. Pattani (1607978) was close to the Google Maps label for Pattani, so I moved it. Pattani (1597001), however, is in the middle of a field about 40km from the first. Both are listed as a "seat of a first-order administrative division", and it seems odd to me that such a seat would be in the middle of a field. It is also odd that there would be two cities called Pattani in the same province, both of them seats of a first-order administrative division.

On a side note, I'm trying to update the populations of a few major cities that have a population of 0 in cities1000.txt, but I'm getting similar user authority errors with some of them, e.g. Rayong, Thailand (1607017).
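
As a rough sketch of how such entries can be listed, the Python below scans lines in the tab-separated GeoNames dump layout and yields the rows whose population field is 0 or empty. It assumes the column order from the GeoNames dump readme (geonameid in column 0, name in column 1, country code in column 8, population in column 14). The function name `zero_population_rows` and the sample row are invented for illustration.

```python
import csv

ID_COL, NAME_COL, COUNTRY_COL, POP_COL = 0, 1, 8, 14  # assumed dump layout


def zero_population_rows(lines):
    """Yield (geonameid, name, country_code) for rows with population 0 or empty.

    `lines` is any iterable of tab-separated text lines, e.g. an open file.
    """
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) > POP_COL and row[POP_COL] in ("", "0"):
            yield int(row[ID_COL]), row[NAME_COL], row[COUNTRY_COL]


# Invented 19-column sample row (not real GeoNames data).
fields = ["1", "Sampletown", "Sampletown", "", "12.5", "101.2", "P", "PPLA",
          "XX", "", "01", "", "", "", "0", "", "10", "Etc/UTC", "2010-05-24"]
sample_lines = ["\t".join(fields)]

for geonameid, name, country in zero_population_rows(sample_lines):
    print(geonameid, name, country)

# Against the real dump, something like this should work:
# with open("cities1000.txt", encoding="utf-8", newline="") as f:
#     for geonameid, name, country in zero_population_rows(f):
#         print(geonameid, name, country)
```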
marc



Joined: 08/12/2005 07:39:47
Messages: 3993
Offline

You are right, they are duplicates. The first was entered twice by the same user, who didn't realize that the first try had also resulted in an entry in the db.

I have deleted both duplicates and increased your userlevel so that you should be able to add population numbers without problems.

Marc

mvanwyk



Joined: 24/05/2010 14:32:25
Messages: 15
Offline

Thanks for the help Marc!

Here are some more possible duplicates I've found:

There are 2 cities called Avon, Lorain County, Ohio
5146282 -> this one is in a field near the other Avon
5146277 -> this one is on a city called Avon

There are 2 cities called Union City, Alameda County, California
5404555 -> this one is near the google maps label Union City
5404554 -> this one is on the city about 6km from the first Union City

Africo, Provincia di Reggio di Calabria, Calabria, Italy
6534810 -> this one is on a label for Africo
2525768 -> this one appears to be in the middle of nowhere

Bella Vista, Benton County, Arkansas
4101115 -> this one is on a label for Bella Vista
4101114 -> this one is about 15km from the first city

Berceto, Provincia di Parma, Emilia-Romagna, Italy
3182172 -> this one is in an area that has a few towns but none called Berceto
6535025 -> this one is on a label for Berceto

I'm uncertain about this one, but it seems possible.
Arkhangel’skoye, RU.47., Moskovskaya Oblast', Russia
580993 -> this one is near a label that says arkhangel-skoye, but it is in a field and the population says 3600, which seems unlikely
580994 -> this one is also near a label that says arkhangel-skoye, but geonames has the city listed as Mikhaylovka. This geonameid also has a population of 3600, but the city seems too large for 3600.

Calasetta, Provincia di Cagliari, Sardinia, Italy
2525460 -> this one is on a label Calasetta
2567545 -> this one is about 20km south of the first city in the middle of nowhere

I'm not sure which is more correct
Camden Town, Camden, Greater London, England, United Kingdom
3345437 -> slightly closer to the Camden Town metro stop
6545177 -> within 200m of the previous one

Campagna, Campagna, Provincia di Salerno, Campania, Italy
3181066 -> this one is on the label Campagna
3181065 -> this is about 4km to the east

This one I'm not quite sure about
Casalincontrada, Casalincontrada, Provincia di Chieti, Abruzzo, Italy
6535649 -> this one is on a city called Casalincontrada, Provincia di Chieti, Abruzzo, Italy
3180182 -> this one seems to be in the middle of nowhere, but has an almost identical population to the previous city

Castel di Iudica, Castel di Iudica, Catania, Sicily, Italy
6535669 -> this one is on the label for the city
2525101 -> this one is about 5km north west of the first city

Cubará, CO.36., Boyacá, Colombia
3685571 -> this one is near Cubara label
6196086 -> this one is about 10km south west of the 1st and has the same population as the 1st

Esposende, Braga, Portugal
2739849 -> this one is on the city label
2739848 -> this one is about 2km west of the 1st city

Klundert, Gemeente Moerdijk, North Brabant, Netherlands
6251994 -> this one is on the city label
2752600 -> this one is about 500m north of the 1st

Lakewood Park, Saint Lucie County, Florida, United States
4161510 -> this one is on the city label
4161511 -> this one is about 15km south east of the 1st

Magisano, Magisano, Provincia di Catanzaro, Calabria, Italy
6535318 -> this one is on the city label
2524335 -> this one appears to be in the middle of nowhere

Marlow, Stephens County, Oklahoma, United States
4542179 -> this one is on the city label
4542180 -> this one is about 10km south of the first

Oakdale, Allen Parish, Louisiana, United States
4335796 -> this one is on the city label
4335797 -> this one is about 5km east of the first

Parre, Parre, Provincia di Bergamo, Lombardy, Italy
6535782 -> this one is on the city label
3171445 -> this one is on a city labelled Cividate al Piano

Roghudi, Roghudi, Provincia di Reggio di Calabria, Calabria, Italy
2523590 -> this one is on the city label
6534812 -> this one appears to be on a city called San Leonardo

Sabbioneta, Sabbioneta, Province of Mantua, Lombardy, Italy
6535244 -> this one is on the city label
3168752 -> this one is in a field

Not quite sure about this one
San Bartolomé, San Bartolomé, Province of Las Palmas, Canary Islands, Spain
2511447 -> I think this one is ok
2511440 -> Should this city be called San Bartolomé de Tirajana?

San Vittore del Lazio, San Vittore del Lazio, Provincia di Frosinone, Latium, Italy
6535052 -> this one is on the city label
3167174 -> this one appears to be in the middle of nowhere


mvanwyk



Joined: 24/05/2010 14:32:25
Messages: 15
Offline

Baabda, LB.05., Mont-Liban, Lebanon
277226 -> this says it's the seat of the region, which I believe is correct
6273715 -> this one is about 100m from the first and is just listed as a populated place
 