[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <20180412005029.GQ30522@ZenIV.linux.org.uk>
Date: Thu, 12 Apr 2018 01:50:29 +0100
From: Al Viro <viro@...IV.linux.org.uk>
To: Linus Torvalds <torvalds@...ux-foundation.org>
Cc: Alexey Dobriyan <adobriyan@...il.com>,
Linux Kernel Mailing List <linux-kernel@...r.kernel.org>,
Andrew Morton <akpm@...ux-foundation.org>
Subject: Re: [PATCH] proc: fixup (c) sign
On Wed, Apr 11, 2018 at 03:41:26PM -0700, Linus Torvalds wrote:
> On Wed, Apr 11, 2018 at 1:24 PM, Alexey Dobriyan <adobriyan@...il.com> wrote:
> >
> > It got sent from here as iso8859-1.
>
> It's probably a good idea to just move away from Latin1 entirely, and
> encourage people to just use utf-8.
>
> But yes, Andrew's scripts clearly do a horrible job at looking at
> email encoding, and that really should be fixed.
>
> I worry less about silly copyright signs than about peoples actual
> _names_, but we've certainly seen problems there too.
FWIW, excluding the stuff obviously not in UTF8 (logo.gif and
tools/power/cpupower/po/{de,fr}.po) we have
lib/fonts/font_7x14.c
lib/fonts/font_8x16.c
lib/fonts/font_8x8.c
lib/fonts/font_pearl_8x8.c
arch/s390/kernel/ebcdic.c
crap in comments
arch/m68k/hp300/hp300map.map
drivers/tty/vt/defkeymap.map
non-UTF8 - literals for compose statements
arch/arm/crypto/sha256_glue.c
arch/arm/crypto/sha256_neon_glue.c
kernel/events/callchain.c
Latin1 copyright symbol (0xa9)
drivers/staging/rtl8188eu/include/odm.h
Latin1 inverted ! (0xa1)
Documentation/devicetree/bindings/net/nfc/pn544.txt
Latin1 superscript 2 (0xb2), in I²C
arch/arm/boot/dts/sun4i-a10-inet97fv2.dts
Latin1 o-umlaut (0xf6) in David Lanzendörfer
drivers/crypto/vmx/ghashp8-ppc.pl
a bunch of Latin1 middle dots (0xb7), in comments inside string constant
used to produce asm output, no less
drivers/gpu/drm/amd/include/atombios.h
several U+FFFD in there - buggered conversion to UTF8?
drivers/gpu/drm/r128/r128_drv.h
ditto, but here it's possible to figure out what got buggered -
Michel D�zer <daenzerm@...dent.ethz.ch> that presumably should've been "Dänzer"
drivers/iio/dac/ltc2632.c
Latin1 e-acute in Maxime Roussin-Bélanger
drivers/power/reset/ltc2952-poweroff.c
ditto in René Moll
drivers/spi/spi-omap-100k.c
a couple of U+FFFD in Juha Yrjölä <juha.yrjola@...ia.com> - o- and a-umlaut
In this case the actual spelling is trivially googled.
drivers/spi/spi-omap2-mcspi.c
ditto
drivers/usb/misc/iowarrior.c
U+FFFD in Stéphane Doyon <s.doyon@...eotron.ca> - e-acute
drivers/visorbus/visorbus_main.c
U+FFFD in place of copyright symbol
Powered by blists - more mailing lists