[Popcon-developers] invalid UTF-8 in by_inst

Paul Wise pabs at debian.org
Wed Feb 4 15:13:07 UTC 2015


On Wed, Feb 4, 2015 at 7:45 PM, Bill Allombert wrote:

> In this instance, the issue was due to a corrupted report (probably corrupted
> during transit, a better checksum than the TCP checksum should have caught it)
> and I removed it.

I see, thanks.

> However, a lot of the reports I receive are not in correct UTF-8, so I cannot
> simply discard all such reports. This is due to filenames that appear in the
> report: they are not always encoded in UTF-8. For example, aspell-es includes
> the file /usr/lib/aspell/español.alias. In older versions of the package, the
> name is encoded in latin1 instead of UTF-8.

Interesting, hopefully those will go away eventually.
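
For illustration only, here is a minimal sketch of how a report reader could
tolerate such filenames instead of rejecting the whole report. It is not how
popcon itself handles them; the function name decode_report_line is
hypothetical, and the latin1 fallback is an assumption based on the aspell-es
example quoted above.

```python
def decode_report_line(raw: bytes) -> str:
    """Decode one report line, preferring UTF-8 and falling back to Latin-1.

    Hypothetical helper, not part of popcon.
    """
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        # Latin-1 maps every byte to a code point, so this fallback cannot
        # fail, but it may guess wrong for other legacy encodings.
        return raw.decode("latin-1")


# The same filename as shipped by newer and older package versions.
utf8_name = "/usr/lib/aspell/español.alias".encode("utf-8")
latin1_name = "/usr/lib/aspell/español.alias".encode("latin-1")

assert decode_report_line(utf8_name) == decode_report_line(latin1_name)
```

The fallback only helps with filenames that really are latin1; a report
corrupted in transit would still decode to something, so it does not replace
a proper integrity check.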

> In any case, thanks for letting us know about the broken report!

No problem.

-- 
bye,
pabs

https://wiki.debian.org/PaulWise


