Message-ID: <20070514164608.GS7984@stusta.de>
Date:	Mon, 14 May 2007 18:46:08 +0200
From:	Adrian Bunk <bunk@...sta.de>
To:	Stefan Richter <stefanr@...6.in-berlin.de>
Cc:	"John Anthony Kazos Jr." <jakj@...-k-j.com>, mgarski@...t.pl,
	linux-kernel@...r.kernel.org
Subject: Re: [PATCH] Convert files to UTF-8 and some cleanups

On Mon, May 14, 2007 at 06:25:14PM +0200, Stefan Richter wrote:
> John Anthony Kazos Jr. wrote:
> > You can't post a patch to UTF-8 in plain text, because it isn't plain 
> > text.
> 
> Of course it is, you said so...
> 
> > MTAs and MUAs mangle the hell out of them. Look back in the archives 
> > for my postings. Once you've got something in UTF-8, -*THEN*- you can post 
> > in plain inline text.
> 
> ...yourself.  Just send as
> 
> 	Content-type: text/plain; charset="utf-8"
> 
> Of course a patch like that will usually contain several character sets,
> but that's no problem as long as the byte codes are preserved and we can
> easily see what the end result looks like.  The important thing is that
> the submitter's and committer's MUAs do it right and reviewers' MUAs do
> it sort of right.  Also, nobody cares if there is still some pre-WWI
> 7bit-only MTA operated in some lone desert town.  And if some web
> archive of LKML isn't compatible with internet mail, then that's tough luck.

The problem is that the unconverted characters are always invalid UTF-8 [1],
and it's 100% correct for an MUA to convert these invalid byte sequences
to replacement characters. If an MUA claims to send valid
charset="utf-8", it is simply not allowed to preserve the byte codes.
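
A minimal sketch of this point, using only Python's built-in codecs. The
sample strings and the helper names (is_valid_utf8, as_conforming_mua) are
illustrative assumptions, not taken from any patch in this thread. It shows
a lone ISO-8859-1 byte being rejected as UTF-8 and turned into U+FFFD, and
the exception from [1], where two such bytes in a row happen to form a valid
UTF-8 sequence.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if the bytes decode cleanly as UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


def as_conforming_mua(data: bytes) -> str:
    """Decode as a strict UTF-8 consumer may: invalid bytes become U+FFFD."""
    return data.decode("utf-8", errors="replace")


# One non-ASCII ISO-8859-1 character surrounded by ASCII.
# 0xE4 looks like the start of a multi-byte UTF-8 sequence, but the
# following ASCII byte is not a continuation byte, so it is invalid.
latin1_line = "M\u00e4rz".encode("iso-8859-1")      # b'M\xe4rz'
damaged = as_conforming_mua(latin1_line)            # 'M\ufffdrz'

# Once replaced, the original byte cannot be recovered.
lost = damaged.encode("utf-8") != latin1_line

# The exception from [1]: two non-ASCII ISO-8859-1 characters in a row
# (0xC3 0xA4) happen to be the valid UTF-8 encoding of U+00E4.
pair = "\u00c3\u00a4".encode("iso-8859-1")          # b'\xc3\xa4'
misread = pair.decode("utf-8")                      # a different character

if __name__ == "__main__":
    print(is_valid_utf8(latin1_line), repr(damaged), lost)
    print(is_valid_utf8(pair), repr(misread))
```

In the second case nothing is flagged as invalid, but the text silently
changes meaning, so neither case preserves what the submitter wrote.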

> Stefan Richter

cu
Adrian

[1] except for some cases with two or more non-7-bit-ASCII
    characters in a row

-- 

       "Is there not promise of rain?" Ling Tan asked suddenly out
        of the darkness. There had been need of rain for many days.
       "Only a promise," Lao Er said.
                                       Pearl S. Buck - Dragon Seed

