I’ve been reading about character encoding recently, in particular to the various unicode standards. I’ve been rather pissed off with setting up the wrong collation in MySQL, I just realized that at my other blog, I have posts that are in utf8_unicode_ci, latin1_general_ci and utf_general_ci. This is what you get when you migrate database blindly without knowing what is character set. I regret not reading enough. Now I set everything to utf8_general_ci.
Anyway, something about another encoding set – GB2312 – caught my attention.
Here’s a trivia, the older Chinese encoding GB2312 cannot write the former Chinese Premier Zhu Rongji’s name. His name has often appeared as 朱熔基. Zhu disapproves of this and prefers the correct version, 朱镕基.
Later, GBK is established and his name finally can be written correctly. GBK is an extension of the GB2312 character set for simplified Chinese characters, used in the People’s Republic of China.
Well, another trivia about him:
Zhu is known for his technical intellect. In 1997 at a state banquet in Australia, Zhu left for the bathroom, and was gone quite a while. Concerned security staff finally went off to find him. He was in the bathroom, studying the water-saving dual-flush system which he had just disassembled. As a hydrologist, Zhu grasped the impact such water savings, multiplied by China’s huge population, could have on China’s infrastructure. (Source: Wikipedia)