lists.openwall.net | lists / announce owl-users owl-dev john-users john-dev passwdqc-users yescrypt popa3d-users / oss-security kernel-hardening musl sabotage tlsify passwords / crypt-dev xvendor / Bugtraq Full-Disclosure linux-kernel linux-netdev linux-ext4 linux-hardening linux-cve-announce PHC | |
Open Source and information security mailing list archives
| ||
|
Date: Mon, 10 May 2021 13:19:50 +0200 From: Mauro Carvalho Chehab <mchehab+huawei@...nel.org> To: Thorsten Leemhuis <linux@...mhuis.info> Cc: Linux Doc Mailing List <linux-doc@...r.kernel.org>, linux-kernel@...r.kernel.org, Jonathan Corbet <corbet@....net>, alsa-devel@...a-project.org, coresight@...ts.linaro.org, dri-devel@...ts.freedesktop.org, intel-gfx@...ts.freedesktop.org, intel-wired-lan@...ts.osuosl.org, keyrings@...r.kernel.org, kvm@...r.kernel.org, linux-acpi@...r.kernel.org, linux-arm-kernel@...ts.infradead.org, linux-edac@...r.kernel.org, linux-ext4@...r.kernel.org, linux-f2fs-devel@...ts.sourceforge.net, linux-fpga@...r.kernel.org, linux-hwmon@...r.kernel.org, linux-iio@...r.kernel.org, linux-input@...r.kernel.org, linux-integrity@...r.kernel.org, linux-media@...r.kernel.org, linux-pci@...r.kernel.org, linux-pm@...r.kernel.org, linux-rdma@...r.kernel.org, linux-riscv@...ts.infradead.org, linux-sgx@...r.kernel.org, linux-usb@...r.kernel.org, mjpeg-users@...ts.sourceforge.net, netdev@...r.kernel.org, rcu@...r.kernel.org, x86@...nel.org Subject: Re: [PATCH 00/53] Get rid of UTF-8 chars that can be mapped as ASCII Em Mon, 10 May 2021 12:52:44 +0200 Thorsten Leemhuis <linux@...mhuis.info> escreveu: > On 10.05.21 12:26, Mauro Carvalho Chehab wrote: > > > > As Linux developers are all around the globe, and not everybody has UTF-8 > > as their default charset, better to use UTF-8 only on cases where it is really > > needed. > > […] > > The remaining patches on series address such cases on *.rst files and > > inside the Documentation/ABI, using this perl map table in order to do the > > charset conversion: > > > > my %char_map = ( > > […] > > 0x2013 => '-', # EN DASH > > 0x2014 => '-', # EM DASH > I might be performing bike shedding here, but wouldn't it be better to > replace those two with "--", as explained in > https://en.wikipedia.org/wiki/Dash#Approximating_the_em_dash_with_two_or_three_hyphens > > For EM DASH there seems to be even "---", but I'd say that is a bit too > much. Yeah, we can do, instead: 0x2013 => '--', # EN DASH 0x2014 => '---', # EM DASH I was actually in doubt about those ;-) Btw, when producing HTML documentation, Sphinx should convert: -- into EN DASH and: --- into EM DASH So, the resulting html will be identical. > Or do you fear the extra work as some lines then might break the > 80-character limit then? No, I suspect that the line size won't be an issue. Some care should taken when EN DASH and EM DASH are used inside tables. Thanks, Mauro
Powered by blists - more mailing lists