lists.openwall.net | lists / announce owl-users owl-dev john-users john-dev passwdqc-users yescrypt popa3d-users / oss-security kernel-hardening musl sabotage tlsify passwords / crypt-dev xvendor / Bugtraq Full-Disclosure linux-kernel linux-netdev linux-ext4 linux-hardening linux-cve-announce PHC | |
Open Source and information security mailing list archives
| ||
|
Date: Tue, 6 Nov 2007 03:26:12 -0500 From: Kyle Moffett <mrmacman_g4@....com> To: Adrian Bunk <bunk@...nel.org> Cc: "Ahmed S. Darwish" <darwish.07@...il.com>, Casey Schaufler <casey@...aufler-ca.com>, akpm@...l.org, torvalds@...l.org, linux-security-module@...r.kernel.org, linux-kernel@...r.kernel.org Subject: Re: [PATCH] Smackv10: Smack rules grammar + their stateful parser On Nov 06, 2007, at 01:33:05, Adrian Bunk wrote: > Can you limit this to 7bit ASCII and use isascii() somewhere? > > Otherwise I'd expect funny things to happen when you e.g. use > isspace() on the UTF-8 encoded character à. Actually, you don't need to. You tell them it expects UTF-8 encoded strings and be done with it. All US-ASCII characters from 0 through 127 (IE: high bit clear) are exactly the same in UTF-8, and UTF-8 special characters have the high bit set in all bytes. Therefore you just assume that anything with the high bit set is part of a word and you can handle basic UTF-8. (It doesn't work on special UTF-8 space characters like nonbreaking space and similar, but handling those is significantly more complicated). Cheers, Kyle Moffett - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@...r.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists