<?xml version="1.0" encoding="ISO-8859-1"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>
<channel>
	<title>Comments on: Them Character Encoding Blues</title>
	<atom:link href="http://blog.isecore.net/2007/10/06/them-character-encoding-blues/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.isecore.net/2007/10/06/them-character-encoding-blues/</link>
	<description>A Kevlar-Burrito Full Of Meat</description>
	<pubDate>Fri, 09 Jan 2009 07:50:46 +0000</pubDate>
	<generator>http://wordpress.org/?v=abc</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: isecore</title>
		<link>http://blog.isecore.net/2007/10/06/them-character-encoding-blues/comment-page-1/#comment-83118</link>
		<dc:creator>isecore</dc:creator>
		<pubDate>Sat, 06 Oct 2007 11:26:43 +0000</pubDate>
		<guid isPermaLink="false">http://blog.isecore.net/2007/10/06/them-character-encoding-blues/#comment-83118</guid>
		<description>Well, that would make sense. I've read about several Windows-users who discover that their OS uses some pretty weird encodings, but UTF-16 would make sense in some ass-backwards way.

Anyhoo, everything here is going to be converted to UTF-8 in the future. I'm tired of seeing weird characters in my filenames :)</description>
		<content:encoded><![CDATA[<p>Well, that would make sense. I&#8217;ve read about several Windows-users who discover that their OS uses some pretty weird encodings, but UTF-16 would make sense in some ass-backwards way.</p>
<p>Anyhoo, everything here is going to be converted to UTF-8 in the future. I&#8217;m tired of seeing weird characters in my filenames <img src='http://blog.isecore.net/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mind</title>
		<link>http://blog.isecore.net/2007/10/06/them-character-encoding-blues/comment-page-1/#comment-83103</link>
		<dc:creator>Mind</dc:creator>
		<pubDate>Sat, 06 Oct 2007 07:33:55 +0000</pubDate>
		<guid isPermaLink="false">http://blog.isecore.net/2007/10/06/them-character-encoding-blues/#comment-83103</guid>
		<description>Ah, I found some more info. Apparently, in windows the standard is UTF-16, that is, the cheap version of UTF-8. The pros is that characters are always 16 bits wide instead of varying length as it is in UTF-8. That makes for extremely easy encoding as you can beforehand reserve space for said amount of characters. But, UTF-16 is not UTF-8 and therefor does not support all the characters UTF-8 supports.

Now, windowsdevelopers refer to UTF-16 as unicode and Linuxdevelopers refer to UTF-8 as unicode. Hence the confusion on my part. A more correct term would be to not refer to UTF-16 as unicode at all but rather keep using the term "wide" and nothing else. The standard libs to handle unicode under Linux handles UTF-8 and on windows UTF-16. Pretty annoying, this probably means I have to get some external lib to be able to support UTF-8 on windows.</description>
		<content:encoded><![CDATA[<p>Ah, I found some more info. Apparently, in windows the standard is UTF-16, that is, the cheap version of UTF-8. The pros is that characters are always 16 bits wide instead of varying length as it is in UTF-8. That makes for extremely easy encoding as you can beforehand reserve space for said amount of characters. But, UTF-16 is not UTF-8 and therefor does not support all the characters UTF-8 supports.</p>
<p>Now, windowsdevelopers refer to UTF-16 as unicode and Linuxdevelopers refer to UTF-8 as unicode. Hence the confusion on my part. A more correct term would be to not refer to UTF-16 as unicode at all but rather keep using the term &#8220;wide&#8221; and nothing else. The standard libs to handle unicode under Linux handles UTF-8 and on windows UTF-16. Pretty annoying, this probably means I have to get some external lib to be able to support UTF-8 on windows.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mind</title>
		<link>http://blog.isecore.net/2007/10/06/them-character-encoding-blues/comment-page-1/#comment-83090</link>
		<dc:creator>Mind</dc:creator>
		<pubDate>Sat, 06 Oct 2007 05:57:45 +0000</pubDate>
		<guid isPermaLink="false">http://blog.isecore.net/2007/10/06/them-character-encoding-blues/#comment-83090</guid>
		<description>Itīs funny, I was just looking up much of this stuff with character encoding while programming on a chat-bot. Sort of my long term project to do a megaHAL replacement that is sligthly more modern and portable. Anyway, I canīt seem to find exactly what sort of character encoding unicode is in the various libraries of C++. The encoding is simply referred to as "wide". I know that UTF-8 is "teh shit", Itīs not really a lack of knowledge on my part about how the character encodings work or what they are good for, itīs more of a lack in documentation on the libraries part or a lack of my ability to find the correct information about it. Really annoying. I have found a standardlib that I use now that I suppose is the same everyone that makes "unicode" supporting programs use. But I am not certain, and I don`t really know what will happen when people start entering text into it with various keyboardsettings and languages or from different media such as the input-box on a webform kontra a well formatted textfile. Itīs a djungle really... at least until I find more reliable information about it.</description>
		<content:encoded><![CDATA[<p>Itīs funny, I was just looking up much of this stuff with character encoding while programming on a chat-bot. Sort of my long term project to do a megaHAL replacement that is sligthly more modern and portable. Anyway, I canīt seem to find exactly what sort of character encoding unicode is in the various libraries of C++. The encoding is simply referred to as &#8220;wide&#8221;. I know that UTF-8 is &#8220;teh shit&#8221;, Itīs not really a lack of knowledge on my part about how the character encodings work or what they are good for, itīs more of a lack in documentation on the libraries part or a lack of my ability to find the correct information about it. Really annoying. I have found a standardlib that I use now that I suppose is the same everyone that makes &#8220;unicode&#8221; supporting programs use. But I am not certain, and I don`t really know what will happen when people start entering text into it with various keyboardsettings and languages or from different media such as the input-box on a webform kontra a well formatted textfile. Itīs a djungle really&#8230; at least until I find more reliable information about it.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
