|
From: | PePa |
Subject: | [bug-gawk] Thai UTF-8 length bug |
Date: | Tue, 21 Jun 2016 13:25:47 +0700 |
User-agent: | Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Thunderbird/38.8.0 |
Dear folk,Couldn't find any report about this. I read that gawk as of 3.1.5 is supposed to report length in characters now. That is not true for Thai characters (Ubuntu 16.04 gawk 4.1.3):
LC_ALL=th_TH.UTF-8 gawk 'BEGIN {print length("ค้ม")}' 3 (should be 2) Are you aware of this problem, or is something wrong on my side?? Cheers, Peter
[Prev in Thread] | Current Thread | [Next in Thread] |