[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20080820191659.GA22591@x200.localdomain>
Date: Wed, 20 Aug 2008 23:16:59 +0400
From: adobriyan@...il.com
To: Andrew Morton <akpm@...ux-foundation.org>
Cc: peter_e@....net, bugme-daemon@...zilla.kernel.org,
linux-kernel@...r.kernel.org, Nadia Derbey <Nadia.Derbey@...l.net>,
Solofo Ramangalahy <Solofo.Ramangalahy@...l.net>,
Manfred Spraul <manfred@...orfullife.com>,
Pierre Peiffer <peifferp@...il.com>, containers@...ts.osdl.org
Subject: Re: [Bugme-new] [Bug 11381] New: default shmmax
On Wed, Aug 20, 2008 at 11:12:57PM +0400, wrote:
> On Wed, Aug 20, 2008 at 12:00:43PM -0700, Andrew Morton wrote:
> >
> > (switched to email. Please respond via emailed reply-to-all, not via the
> > bugzilla web interface).
> >
> > On Wed, 20 Aug 2008 05:58:57 -0700 (PDT)
> > bugme-daemon@...zilla.kernel.org wrote:
> >
> > > http://bugzilla.kernel.org/show_bug.cgi?id=11381
> > >
> > > Summary: default shmmax
> > > Product: Other
> > > Version: 2.5
> > > KernelVersion: 2.6.26.2
> > > Platform: All
> > > OS/Version: Linux
> > > Tree: Mainline
> > > Status: NEW
> > > Severity: enhancement
> > > Priority: P1
> > > Component: Other
> > > AssignedTo: other_other@...nel-bugs.osdl.org
> > > ReportedBy: peter_e@....net
> > >
> > >
> > > I would like to request that the default shmmax setting be increased or the
> > > downsides of that be documented. Allow me to explain.
> > >
> > > I am with the PostgreSQL development team. PostgreSQL is probably one of the
> > > few users of large amounts of SysV shared memory. Users would usually want to
> > > configure anywhere between 10% and 50% of their physical RAM to be used as
> > > shared memory, which would translate to something on the order of gigabytes
> > > nowadays. One of the uniformly annoying things about setting this up is that
> > > you need to reconfigure the Linux kernel to allow that. sysctl is nice and
> > > all, but it still requires users to learn about operating system and kernel
> > > details, requires root access, and distros don't handle sysctl uniformly
> > > either. Maybe there is even a good reason for that, but I couldn't find it,
> > > and at least I would like to learn it, so that we can pass that information on
> > > to our users.
> > >
> > > I did some kernel version archeology and found out that up until kernels 2.2
> > > the shmmax setting appears to have been restricted by CPU-specific constraints,
> > > as indicated by the default setting being different across CPUs and being
> > > defined in an asm header. The default setting on i386 was increased from 16 MB
> > > to 32 MB somewhere around 1998 in the kernel 2.0 line, and it remains at 32 MB
> > > in the latest kernel on all architectures.
> > >
> > > Now one question is whether there is a space or time overhead involved with
> > > setting a high shmmax limit that isn't actually used. If so, it would be
> > > interesting to know what that overhead is. The feeling I get from browsing the
> > > kernel source code over time is that there was some management overhead and/or
> > > some restrictions about this in old kernels, but that nowadays it doesn't
> > > really seem to matter much anymore. I suspect instead that this whole thing
> > > was just forgotten, because few applications use large amounts of shared
> > > memory.
> > >
> > > So, if you want to do us a favor, could you please see about increasing the
> > > default shmmax setting to whatever the theoretical maximum is?
> > >
> >
> > I don't think anybody has even thought about the shmmax default in
> > years. Sure, it might be time to reexamine that.
> >
> > It would be useful to get distro input on this. Do they override the
> > kernel default at boot time? If so, what do they do?
> >
> >
> > Also, from a quick read it looks to me that shmmax is busted in the
> > non-init namespace.
> >
> > clone_ipc_ns() calls shm_init_ns() which does
> >
> > ns->shm_ctlmax = SHMMAX;
> >
> > which a) fails to inherit the parent's setting and
>
> This is debatable if such behaviour should be default, this makes one ipc_ns
> more ipc_ns than others.
Oh, I forgot for a moment, that mainline has hierarchical ipc_ns.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists