[Debian-zh-dev] Bug#850903: GB mojibake in shijing source passes down to UTF-8 shijing
Mingye Wang
arthur2e5 at aosc.io
Wed Jan 11 01:26:59 UTC 2017
Package: fortune-zh
Version: 2.1
Version: 2.0
fortune-zh's shijing.gb seems to use a white square ("\xa1\xf5") for
characters missing from the GB encoding used at that time (GB2312?).
These "white square" blocks pass down into the UTF-8 version (shijing)
as U+25A1 characters, that is, actual white squares in Unicode.
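
A rough sketch of how the propagation could be reproduced and the affected
positions located, assuming the source really is GB2312 and the conversion is a
plain decode/re-encode. The helper names (gb_to_utf8, find_placeholders) and
the sample string are made up for illustration; they are not taken from
fortune-zh or from shijing.gb.

```python
WHITE_SQUARE = "\u25a1"  # U+25A1 WHITE SQUARE, the placeholder in question


def gb_to_utf8(raw):
    """Decode GB2312 bytes and re-encode as UTF-8 (a plain conversion)."""
    return raw.decode("gb2312").encode("utf-8")


def find_placeholders(text):
    """Return 1-based (line, column) pairs for every U+25A1 in text."""
    hits = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        for col, ch in enumerate(line, start=1):
            if ch == WHITE_SQUARE:
                hits.append((lineno, col))
    return hits


# Made-up sample: a placeholder square sitting between ordinary characters.
sample_gb = "诗经□样例".encode("gb2312")

# The square is an ordinary GB2312 character, so the conversion keeps it
# instead of failing, which is why it ends up in the UTF-8 file.
converted = gb_to_utf8(sample_gb).decode("utf-8")
print(find_placeholders(converted))
```

Running find_placeholders over the decoded contents of the real shijing file
would give a list of spots that need to be filled in from a better source.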
Since it's now 2017[citation needed] and more legitimate sources for
Shi-Jing like [1] are available, fortune-zh should really consider
re-crawling this ancient text in a modern encoding.
[1]: https://zh.wikisource.org/wiki/%E8%A9%A9%E7%B6%93