<div dir="ltr"><div class="gmail_extra"><br><div class="gmail_quote">On Wed, Dec 17, 2014 at 9:01 AM, Kevin Veroneau <span dir="ltr"><<a href="mailto:kevin@veroneau.net" target="_blank">kevin@veroneau.net</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div id=":7y9" class="a3s" style="overflow:hidden">It's actually amazing how much of the WWW uses characters >127, and even<br>
for some basic characters which are actually in the <128 range. I notice<br>
many blog posts using a different version of "`" and "'" characters for<br>
some weird reason. This is more noticeable when using Python to scrape<br>
RSS feeds and needing to re-encode them. If you look at some of the<br>
titles and content of the RSS feeds, you'll notice lots of "dont"<br>
rather than "don't" as these blogs are encoding that character using a<br>
non-ASCII byte for whatever reason. My blog, Python Diary in Planet<br>
Python, is one of the blogs that uses only ASCII characters.</div></blockquote></div><br>Yeah, in my attempt to provide a Gopher version of the PyPI Feed(s)</div><div class="gmail_extra">I encountered a Unicode issue last night. So I had to disable it for now.</div><div class="gmail_extra"><br></div><div class="gmail_extra">I'm using gopherfeed (a library I found on Bitbucket)</div><div class="gmail_extra">but I may have to fork it and improve its Unicode support</div><div class="gmail_extra">(or lack thereof) and improve its ability to deal with broken</div><div class="gmail_extra">encodings :)</div><div class="gmail_extra"><br></div><div class="gmail_extra">cheers</div><div class="gmail_extra">James<br><br clear="all"><div><div class="gmail_signature"><span style="border-collapse:collapse;color:rgb(136,136,136);font-size:13px"><br><font face="arial, sans-serif">James Mills / prologic</font><br><br><font face="arial, sans-serif"></font><font face="'courier new', monospace">E: <a href="mailto:prologic@shortcircuit.net.au" style="color:rgb(0,0,204)" target="_blank">prologic@shortcircuit.net.au</a></font></span><div><span style="font-family:'courier new',monospace;color:rgb(136,136,136);font-size:13px">W: </span><a href="http://prologic.shortcircuit.net.au" style="font-family:'courier new',monospace;font-size:13px;color:rgb(0,0,204)" target="_blank">prologic.shortcircuit.net.au</a><br></div></div></div>
</div></div>