lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <CALCETrW_vS9Z-VzN-_-sw_aryS=C3Nj2QSqtuuCF5OPT8Jq_Vg@mail.gmail.com>
Date:	Tue, 21 Apr 2015 11:11:35 -0700
From:	Andy Lutomirski <luto@...capital.net>
To:	Michal Hocko <mhocko@...e.cz>
Cc:	David Herrmann <dh.herrmann@...il.com>,
	One Thousand Gnomes <gnomes@...rguk.ukuu.org.uk>,
	Greg Kroah-Hartman <gregkh@...uxfoundation.org>,
	Richard Weinberger <richard@....at>,
	Linus Torvalds <torvalds@...ux-foundation.org>,
	Steven Rostedt <rostedt@...dmis.org>,
	Jiri Kosina <jkosina@...e.cz>,
	Al Viro <viro@...iv.linux.org.uk>,
	Borislav Petkov <bp@...en8.de>,
	Andrew Morton <akpm@...ux-foundation.org>,
	Arnd Bergmann <arnd@...db.de>,
	"Eric W. Biederman" <ebiederm@...ssion.com>,
	Tom Gundersen <teg@...m.no>,
	"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>,
	Daniel Mack <daniel@...que.org>,
	Djalal Harouni <tixxdz@...ndz.org>
Subject: Re: [GIT PULL] kdbus for 4.1-rc1

On Tue, Apr 21, 2015 at 7:27 AM, Michal Hocko <mhocko@...e.cz> wrote:
> On Tue 21-04-15 16:01:01, David Herrmann wrote:
>> Hi
>>
>> On Tue, Apr 21, 2015 at 2:20 PM, Michal Hocko <mhocko@...e.cz> wrote:
>> > On Tue 21-04-15 12:17:49, David Herrmann wrote:
>> >> Hi
>> >>
>> >> On Tue, Apr 21, 2015 at 11:35 AM, One Thousand Gnomes
>> >> <gnomes@...rguk.ukuu.org.uk> wrote:
>> >> >> On top of that, I think that someone into resource management needs to
>> >> >> seriously consider whether having a broadcast send do get_user_pages
>> >> >> or the equivalent on pages supplied by untrusted recipients (plural!)
>> >> >> is a good idea.
>> >> >
>> >> > Oh but its so much fun if you pass pages belonging to a device driver, or
>> >> > pass bits of a GEM object thereby keeping entire graphics textures
>> >> > referenced 8)
>> >>
>> >> We do not use GUP, nor do we pass around pinned pages. All we use is
>> >> __vfs_read() / __vfs_write() on shmem. Whether generic_file_write() /
>> >> copy_from_user() internally relies on GUP or not, is an orthogonal
>> >> issue that does not belong here.
>> >
>> > It kind of does AFAIU.
>>
>> No, it is not. The issue with GUP is that you elevate the page
>> ref-count and thus prevent lru isolation, sealing, whatsoever.
>
> The point was that such a memory might be not present yet and need a
> page fault with all the side effects - memory reclaim, memcg charge...
>
>> I cannot see how it is related to kdbus. However, ...
>>
>> > If for nothing else then the memcg reasons mentioned in
>> > other email (http://marc.info/?l=linux-kernel&m=142953380508188). If an
>> > untrusted user is allowed to hand over a shmem backed buffer which hasn't
>> > been charged yet (read faulted in) and then kdbus forced to fault it in
>> > a different user's context then you basically allow to hide memory
>> > allocations from the memcg. That is a clear show stopper.
>> >
>> > Or have I misunderstood the way how shmem buffers are used here?
>>
>> ..as you mentioned memcg, lets figure that out here. shmem buffers are
>> used as receive-buffers by kdbus peers. They are read-only to
>> user-space. All allocations are done by the kernel on message passing.
>
> OK, so the shmem buffer is allocated on the kernels behalf and under
> its control and no userspace can hand over one to kdbus. Do I get
> it right? If yes then the memcg escape I was describing above is
> not possible of course. This wasn't clear to me from the previous
> discussion. Thanks for the clarification!

I'm still missing something here, I think.  At the time of pool
creation, the kernel calls shmem_file_setup in the context of the
untrusted user.  Then, when a privileged daemon broadcasts, the kernel
calls vfs_iter_write or similar, thus allocating the page, right?  I
don't see why the page would be allocated early or why vfs_iter_write
and the associated shmem code would care what memcg created the shmem
file -- all of that code seems to use current's memcg on brief
inspection.

Bear in mind that the bad guy gets to use madvise, etc to mess around
with the page cache state.

--Andy
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ