<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0">
	<channel>
		<title><![CDATA[Latest posts for the topic "Encoding issues in admin1Codes"]]></title>
		<link>http://forum.geonames.org/gforum/posts/list/4.page</link>
		<description><![CDATA[Latest messages posted in the topic "Encoding issues in admin1Codes"]]></description>
		<generator>JForum - http://www.jforum.net</generator>
			<item>
				<title>Encoding issues in admin1Codes</title>
				<description><![CDATA[ Follow-up of a remark I made on my region of Provence Alpes Côte d'Azur, which shows in the map interface as "Provence-Alpes-Côte dʼAzur" with a weird ʼ instead of ' in Internet Explorer. In Firefox I have a correct display.
Same through the web service 
http://ws.geonames.org/countrySubdivision?lat=44.5&lng=6.5

So I downloaded the admin1Codes.txt file, and found out that it was not only a browser issue, since I found indeed :
FR.B8	Provence-Alpes-Côte dʼAzur in this file.
among many other occurrences of bad encoded characters in various languages. I checked in various text editors, and have the same issue, although the file seems to be recognized as UTF-8 encoded.

Do others have the same issue? Any clue on how to fix that? Is it something wrong in the files, or in my machine (could be, it's a new one and maybe some settings are to be fixed).]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/176.page#942</guid>
				<link>http://forum.geonames.org/gforum/posts/list/176.page#942</link>
				<pubDate><![CDATA[Fri, 6 Oct 2006 16:37:56]]> GMT</pubDate>
				<author><![CDATA[ bernard]]></author>
			</item>
			<item>
				<title>Re:Encoding issues in admin1Codes</title>
				<description><![CDATA[ Thanks Bernard

It is an arabic character in the original dataset. I will correct it.

The character is also used for the Valle d'Aosta.

The reverse geocoding webservice, the map interface and the admin1Codes.txt file are all the same thing.

Cheers

Marc]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/176.page#956</guid>
				<link>http://forum.geonames.org/gforum/posts/list/176.page#956</link>
				<pubDate><![CDATA[Sun, 8 Oct 2006 16:16:12]]> GMT</pubDate>
				<author><![CDATA[ marc]]></author>
			</item>
			<item>
				<title>Re:Encoding issues in admin1Codes</title>
				<description><![CDATA[ Côte d'Azur is OK now but there is still a lot of other names to clean up, e.g at http://www.geonames.org/maps/showOnMap?q=Abū. There again everything OK in Firefox and a lot of things like Abū Z̧aby in IE (well, if you look at this thread in Firefox, you don't see the problem at all ... )]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/176.page#959</guid>
				<link>http://forum.geonames.org/gforum/posts/list/176.page#959</link>
				<pubDate><![CDATA[Mon, 9 Oct 2006 10:45:47]]> GMT</pubDate>
				<author><![CDATA[ bernard]]></author>
			</item>
			<item>
				<title>Re:Encoding issues in admin1Codes</title>
				<description><![CDATA[ Do you speak these languages in order to know it is not a IE bug or shortcoming? I am reluctant to change it just because IE is having problems and I would not know with which character to replace it. Any ideas?

Marc]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/176.page#964</guid>
				<link>http://forum.geonames.org/gforum/posts/list/176.page#964</link>
				<pubDate><![CDATA[Mon, 9 Oct 2006 18:48:24]]> GMT</pubDate>
				<author><![CDATA[ marc]]></author>
			</item>
			<item>
				<title>Re:Encoding issues in admin1Codes</title>
				<description><![CDATA[ Unfortunately I don't speak those languages  :cry: 

And were it only for IE, I would gladly forget it  :P 

But, as written above, I find the issue also when dowloading admin1Codes.txt, and opening it with any text or XML editor at hand (UltraEdit, XML Spy ...) even when taking UTF-8 options. So... I don't know. We would need native speakers around for a variety of languages I'm afraid. What is the source of all those names, BTW? ]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/176.page#966</guid>
				<link>http://forum.geonames.org/gforum/posts/list/176.page#966</link>
				<pubDate><![CDATA[Mon, 9 Oct 2006 19:56:58]]> GMT</pubDate>
				<author><![CDATA[ bernard]]></author>
			</item>
			<item>
				<title>Re:Encoding issues in admin1Codes</title>
				<description><![CDATA[ The source for most of these codes is the National Geospatial-Intelligence Agency. 

<blockquote>It is a Department of Defense (DoD) combat support agency that has been assigned an important, additional statutory mission of supporting national-level policymakers and Government Agencies. NGA is a member of the Intelligence Community and the single entity upon which the U.S. Government now relies to coherently manage the previously separate disciplines of imagery and mapping. By providing customers with ready access to the world's best imagery and geospatial intelligence, NGA provides critical support for the national decision making process and contributes to the high state of operational readiness of America's military forces.&nbsp;
		</blockquote>

Marc]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/176.page#985</guid>
				<link>http://forum.geonames.org/gforum/posts/list/176.page#985</link>
				<pubDate><![CDATA[Fri, 13 Oct 2006 19:19:08]]> GMT</pubDate>
				<author><![CDATA[ marc]]></author>
			</item>
			<item>
				<title>Re:Encoding issues in admin1Codes</title>
				<description><![CDATA[ I think my observation might be related:

http://forum.geonames.org/gforum/posts/list/928.page]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/176.page#4138</guid>
				<link>http://forum.geonames.org/gforum/posts/list/176.page#4138</link>
				<pubDate><![CDATA[Mon, 9 Jun 2008 16:21:34]]> GMT</pubDate>
				<author><![CDATA[ giorgio79]]></author>
			</item>
			<item>
				<title>Re:Encoding issues in admin1Codes</title>
				<description><![CDATA[ We are still seeing lots of 'square' characters in the source files (AllCountries.txt) mainly in the alternate names column. Even thru UltraEdit or EditPad Pro and the web interface e.g. for

Hup’o Bank
Nishi Kaitoku Seamount
Usan Trough
...

So we can conclude that this is because of the National Geospatial-Intelligence Agency having bugs and can't really expect this to get fixed?

Thanks]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/176.page#6688</guid>
				<link>http://forum.geonames.org/gforum/posts/list/176.page#6688</link>
				<pubDate><![CDATA[Wed, 18 Nov 2009 22:47:41]]> GMT</pubDate>
				<author><![CDATA[ zukanta]]></author>
			</item>
			<item>
				<title>Re:Encoding issues in admin1Codes</title>
				<description><![CDATA[ I don't see any 'squares' for the places you list below. I would rather say you haven't installed the right fonts on your machine to have them rendered properly.

Best

Marc]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/176.page#6689</guid>
				<link>http://forum.geonames.org/gforum/posts/list/176.page#6689</link>
				<pubDate><![CDATA[Wed, 18 Nov 2009 22:56:30]]> GMT</pubDate>
				<author><![CDATA[ marc]]></author>
			</item>
			<item>
				<title>Re:Encoding issues in admin1Codes</title>
				<description><![CDATA[ Hi Marc,

Thanks for your quick reply.

Well, I'm using a unicode font and I can see without problems lots of double byte text (arabic for instance) and most of the file is ok except for some of the last entries in the alternateNames column on several records. I suspect these entries to be in Chinese or Japanese.

Which editor did you use to look at AllCountries.txt that shows data without 'squares' for "Hup’o Bank"? I'm curious,

Thanks!
]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/176.page#6690</guid>
				<link>http://forum.geonames.org/gforum/posts/list/176.page#6690</link>
				<pubDate><![CDATA[Wed, 18 Nov 2009 23:15:59]]> GMT</pubDate>
				<author><![CDATA[ zukanta]]></author>
			</item>
			<item>
				<title>Re:Encoding issues in admin1Codes</title>
				<description><![CDATA[ I was looking at the webpage with firefox.

Marc]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/176.page#6693</guid>
				<link>http://forum.geonames.org/gforum/posts/list/176.page#6693</link>
				<pubDate><![CDATA[Thu, 19 Nov 2009 08:12:54]]> GMT</pubDate>
				<author><![CDATA[ marc]]></author>
			</item>
			<item>
				<title>Re:Encoding issues in admin1Codes</title>
				<description><![CDATA[ Marc,

I tried all available encodings and unicode fonts in EditPad Pro and UltraEdit. To no avail. SQL Server 2005 also has issues with those specific entries/characters. Sometimes it's a single character amongst several arabic characters.

I tried with FireFox and it displays fine, even in the FireFox editor when I look at the source code. Can't see anything special in the FF setup that makes it work (Font or encoding: simple Times New Roman/helvetica/Arial/Verdana and UTF8).

Do you know of any file editor in a Windows environment that can display for instance the Alternate Names for 

Hup’o Bank 
Nishi Kaitoku Seamount 
Usan Trough 
... 

and if so which one and using which font and encoding?

Thanks!

]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/176.page#6698</guid>
				<link>http://forum.geonames.org/gforum/posts/list/176.page#6698</link>
				<pubDate><![CDATA[Thu, 19 Nov 2009 19:44:54]]> GMT</pubDate>
				<author><![CDATA[ zukanta]]></author>
			</item>
			<item>
				<title>Re:Encoding issues in admin1Codes</title>
				<description><![CDATA[ If I open it with notepad then it looks ok.

Marc]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/176.page#6706</guid>
				<link>http://forum.geonames.org/gforum/posts/list/176.page#6706</link>
				<pubDate><![CDATA[Fri, 20 Nov 2009 07:54:06]]> GMT</pubDate>
				<author><![CDATA[ marc]]></author>
			</item>
			<item>
				<title>Re:Encoding issues in admin1Codes</title>
				<description><![CDATA[ Thanks Marc,

I was using fonts that had 'unicode' in their name but happen not to be FULLY unicode compatible. <b>Arial Unicode MS </b>seems to be the most Unicode compatible so far. Eyeballing the file, problems seem to reduced to a few arabic alternate name entries in AllCountries.txt:

e.g.
Rūd-e Takhtarī
Kūh-e Takhtarī
Dasht-e Takhtarī
Sulni Khwaṟ

But since they show fine in FireFox, I guess these issues are due to bugs with the Arial Unicode MS font. So don't worry about this,

Cheers and thanks for your help!
]]></description>
				<guid isPermaLink="true">http://forum.geonames.org/gforum/posts/list/176.page#6712</guid>
				<link>http://forum.geonames.org/gforum/posts/list/176.page#6712</link>
				<pubDate><![CDATA[Fri, 20 Nov 2009 17:33:57]]> GMT</pubDate>
				<author><![CDATA[ zukanta]]></author>
			</item>
	</channel>
</rss>