A few corrections in adminASCII file: (you told me that the ADMs cannot be changed from the UI, right ?)
GB.A4 Bath and North East Somerset Bath and North East Somerset 6457408
And some additional non ASCII names :
LV.00 Jurmala Jurmala 459202
MA.00 El Aayoune El Aayoune 2543878
MQ.00 Département de Martinique Department of Martinique 3570311
MR.00 Disctrict de Nouakchott Nouakchott District de 2377449
MV.00 Maale Maale 1337624
And I don't know what to do about this one, since there is no equivalent in ADM1.txt :
TJ. Region of Republican Subordination Region of Republican Subordination 6452615
There is another issue not yet fixed. The ascii names for Chinese or other exotic scipts. The ascii field is automatically generated with a romanization algorithm. But this does not work for Chinese. Should we use null or empty values there?
- Concerning the KR region : I think the ISO code is fine. (will you also add it to the adm1 file ? or just add it to the adm1ascii file)
- Concerning the ASCII equivalent : a null value is fine. I don't think people should use this field anyways. (lucene) text analyzers & stemmers are a better way to prepare the data for full text search.