[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <20090213152836.0fbbfa7d.akpm@linux-foundation.org>
Date: Fri, 13 Feb 2009 15:28:36 -0800
From: Andrew Morton <akpm@...ux-foundation.org>
To: Dave Hansen <dave@...ux.vnet.ibm.com>
Cc: mingo@...e.hu, orenl@...columbia.edu, linux-api@...r.kernel.org,
containers@...ts.linux-foundation.org,
linux-kernel@...r.kernel.org, linux-mm@...ck.org,
torvalds@...ux-foundation.org, viro@...iv.linux.org.uk,
hpa@...or.com, tglx@...utronix.de
Subject: Re: [RFC v13][PATCH 00/14] Kernel based checkpoint/restart
On Thu, 12 Feb 2009 10:11:22 -0800
Dave Hansen <dave@...ux.vnet.ibm.com> wrote:
>
> ...
>
> > - In bullet-point form, what features are missing, and should be added?
>
> * support for more architectures than i386
> * file descriptors:
> * sockets (network, AF_UNIX, etc...)
> * devices files
> * shmfs, hugetlbfs
> * epoll
> * unlinked files
> * Filesystem state
> * contents of files
> * mount tree for individual processes
> * flock
> * threads and sessions
> * CPU and NUMA affinity
> * sys_remap_file_pages()
>
> This is a very minimal list that is surely incomplete and sure to grow.
That's a worry.
>
> > For extra marks:
> >
> > - Will any of this involve non-trivial serialisation of kernel
> > objects? If so, that's getting into the
> > unacceptably-expensive-to-maintain space, I suspect.
>
> We have some structures that are certainly tied to the kernel-internal
> ones. However, we are certainly *not* simply writing kernel structures
> to userspace. We could do that with /dev/mem. We are carefully pulling
> out the minimal bits of information from the kernel structures that we
> *need* to recreate the function of the structure at restart. There is a
> maintenance burden here but, so far, that burden is almost entirely in
> checkpoint/*.c. We intend to test this functionality thoroughly to
> ensure that we don't regress once we have integrated it.
I guess my question can be approximately simplified to: "will it end up
looking like openvz"? (I don't believe that we know of any other way
of implementing this?)
Because if it does then that's a concern, because my assessment when I
looked at that code (a number of years ago) was that having code of
that nature in mainline would be pretty costly to us, and rather
unwelcome.
The broadest form of the question is "will we end up regretting having
done this".
If we can arrange for the implementation to sit quietly over in a
corner with a team of people maintaining it and not screwing up other
people's work then I guess we'd be OK - if it breaks then the breakage
is localised.
And it's not just a matter of "does the diffstat only affect a single
subdirectory". We also should watch out for the imposition of new
rules which kernel code must follow. "you can't do that, because we
can't serialise it", or something.
Similar to the way in which perfectly correct and normal kernel
sometimes has to be changed because it unexpectedly upsets the -rt
patch.
Do you expect that any restrictions of this type will be imposed?
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists