i am such a loser.  i totally enjoy this " bullshit drama that mushroom
clouds into some crap-tangent."  it gives me the will to move on through my
bullshit mushroom cloud of a day.
nah, just kidding, i love my life.  this list has a huge amount of comic
relief for me.

On Sun, Mar 6, 2011 at 12:01 AM, Robert Nesius <nesius at gmail.com> wrote:

>
>
> On Sat, Mar 5, 2011 at 11:46 PM, Mike Miller <mbmiller+l at gmail.com> wrote:
>
>> On Sat, 5 Mar 2011, Adam Morris wrote:
>>
>>  Try \x{8a0} instead.  I think that \x normally accepts only two following
>>> characters, so you have to use \x{} for long hexadecimal numbers.
>>>
>>
>> You top posted, so I have to ignore you.
>>
>> Just kidding.  I did try that and that didn't work either.  Then I did
>> this...
>>
>> perl -pe 's/[[:ascii:]]//g ; s/(.)/$1\n/g' file.txt | sort | uniq -c >|
>> bad_chars.txt
>>
>> ...and when I looked at the resulting bad_chars.txt file in emacs again,
>> the characters looked different.  Before they were appearing as purple
>> rectangles, but now they appeared as a pair of characters that looked like
>> this: \302\240
>>
>> I could represent them exactly that way in perl and delete them.  I don't
>> really get what was happening there.
>>
>
> I'm guessing you were looking at (possibly variable-length) unicode
> characters, and your perl filter split them into fixed-length octets or
> something.
>
> -Rob
>
>
> _______________________________________________
> TCLUG Mailing List - Minneapolis/St. Paul, Minnesota
> tclug-list at mn-linux.org
> http://mailman.mn-linux.org/mailman/listinfo/tclug-list
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.mn-linux.org/pipermail/tclug-list/attachments/20110306/7a1d494d/attachment.html>