Fidhleir
Joined: 26/11/2019 12:17:05
Messages: 1
Offline
|
Running a php job to convert from utf8 to 8859-1, I found that at least some of the apostrophes used in contractions (e.g. d'Orsay) are the 3-byte right apostrophe (hex E28099). They break php's utf8_decode() function, which cannot handle more than 2-byte chars.
Since flavored apostrophes are a typographic conceit that provides no extra information value, they should be globally replaced with the generic apostrophe (hex 27).
|