[gopher] Hello Gopher Project
James Mills
prologic at shortcircuit.net.au
Tue Dec 16 23:17:07 UTC 2014
On Wed, Dec 17, 2014 at 9:01 AM, Kevin Veroneau <kevin at veroneau.net> wrote:
> It's actually amazing how much of WWW uses characters >128, and even
> for some basic characters which are actually in the <128. I notice
> many blog posts using a different version of "`" and "'" characters for
> some weird reason. This is more noticeable when using Python to scrap
> RSS feeds and needing to re-encode them. If you look at some of the
> titles and content of the RSS feeds, you'll notice lots of "dont"
> rather than "don't" as these blogs are encoding that character using a
> non-ACSII byte for whatever reason. My blog, Python Diary in Planet
> Python is one of the blogs that only uses only ASCII characters.
>
Yeah in my attempt to provide a Gopher version of the PyPi Feed(s)
I encountered a Unicode issue last night. So I had to disable it fo rnow.
I'm using gopherfeed (a library I found on Bitbucket)
but I may have to fork it and improve it's Unicode support
(or lack thereof) and improve it's ability to deal with broken
encodings :)
cheers
James
James Mills / prologic
E: prologic at shortcircuit.net.au
W: prologic.shortcircuit.net.au
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.alioth.debian.org/pipermail/gopher-project/attachments/20141217/3e7083fb/attachment.html>
More information about the Gopher-Project
mailing list