[gopher] GopherMole - a gopher media crawler

Mateusz Viste mateusz at viste.fr
Sat Jan 3 10:38:54 UTC 2015


On 01/03/2015 11:27 AM, James Mills wrote:
> Mis-rendered correct (which is what I meant)
> but the client "won't break".

That's correct.

> What's what I meant by "degrade".

Sure, but that's hardly 'graceful'. And doesn't have anything to do with 
ISO-8859-1. Which doesn't mean I am opposed to UTF-8 usage in the 
gopherspace, on the contrary, I'm 100% for it. But it's important to 
keep in mind the exact impact it will have on legacy clients.

> *I think* a Gopher server that splits out UTF_8 encoded data to a Client
> that doesn't support UTF-8 encoding will still display the content (just
> not any codepoint higher than 255)?

Only low-ascii will be rendered correctly, that is anything above code 
point 127 will be scrambled.

Here's an example:

gopher://gopher.viste.fr/0/docs/other/Little%2520Big%2520Adventure%2520-%2520Soluce%2520du%2520jeu%2520%2528french%2529.txt

Same thing here (but on a polish document):

gopher://gopher.viste.fr/0/docs/opowiadania%2520%2528polish%2529/sendbajt.txt

When I open these documents with Overbite, all french or polish 
diacritics are broken (until I set my browser manually to UTF-8).

Of course there are thousands of such examples across the gopherspace.

Mateusz



More information about the Gopher-Project mailing list