[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-Id: <20080131015453.d5c98955.akpm@linux-foundation.org>
Date: Thu, 31 Jan 2008 01:54:53 -0800
From: Andrew Morton <akpm@...ux-foundation.org>
To: Miklos Szeredi <miklos@...redi.hu>
Cc: a.p.zijlstra@...llo.nl, linux-kernel@...r.kernel.org,
linux-fsdevel@...r.kernel.org, linux-mm@...ck.org,
kay.sievers@...y.org, greg@...ah.com, trond.myklebust@....uio.no
Subject: Re: [patch 2/6] mm: bdi: export BDI attributes in sysfs
On Thu, 31 Jan 2008 10:39:02 +0100 Miklos Szeredi <miklos@...redi.hu> wrote:
> > On Tue, 29 Jan 2008 16:49:02 +0100
> > Miklos Szeredi <miklos@...redi.hu> wrote:
> >
> > > From: Peter Zijlstra <a.p.zijlstra@...llo.nl>
> > >
> > > Provide a place in sysfs (/sys/class/bdi) for the backing_dev_info
> > > object. This allows us to see and set the various BDI specific
> > > variables.
> > >
> > > In particular this properly exposes the read-ahead window for all
> > > relevant users and /sys/block/<block>/queue/read_ahead_kb should be
> > > deprecated.
> >
> > This description is not complete. It implies that the readahead window is
> > not "properly" exposed for some "relevant" users. The reader is left
> > wondering what on earth this is referring to. I certainly don't know.
> > Perhaps when this information is revealed, we can work out what was
> > wrong with per-queue readahead tuning.
>
> I think Peter meant, that the readahead window was only exposed for
> block devices, and not things like NFS or FUSE.
OK.
>
> > > +blk-NAME
> > > +
> > > + Block devices, NAME is 'sda', 'loop0', etc...
> >
> > But if I've done `mknod /dev/pizza-party 8 0', I'm looking for
> > blk-pizza-party, not blk-sda.
> >
> > But I might still have /dev/sda, too.
>
> An alternative would be to uniformly use MAJOR:MINOR in there. It
> would work for block devices and anonymous devices (NFS/FUSE) as well.
>
> Would that be any better?
I suppose so. sysfs likes to use symlinks to point over at related
things in different directories...
> >
> > > +FSTYPE-MAJOR:MINOR
> > > +
> > > + Non-block device backed filesystems which provide their own
> > > + BDI, such as NFS and FUSE. MAJOR:MINOR is the value of st_dev
> > > + for files on this filesystem.
> > > +
> > > +default
> > > +
> > > + The default backing dev, used for non-block device backed
> > > + filesystems which do not provide their own BDI.
> > > +
> > > +Files under /sys/class/bdi/<bdi>/
> > > +---------------------------------
> > > +
> > > +read_ahead_kb (read-write)
> > > +
> > > + Size of the read-ahead window in kilobytes
> > > +
> > > +reclaimable_kb (read-only)
> > > +
> > > + Reclaimable (dirty or unstable) memory destined for writeback
> > > + to this device
> > > +
> > > +writeback_kb (read-only)
> > > +
> > > + Memory currently under writeback to this device
> > > +
> > > +dirty_kb (read-only)
> > > +
> > > + Global threshold for reclaimable + writeback memory
> > > +
> > > +bdi_dirty_kb (read-only)
> > > +
> > > + Current threshold on this BDI for reclaimable + writeback
> > > + memory
> > > +
> >
> > I dunno. A number of the things which you're exposing are closely tied to
> > present-day kernel implementation and may be irrelevant or even
> > unimplementable in a few years' time.
>
> Which ones?
I don't know - I misplaced my copy of linux-2.6.44 :)
The whole concept of a BDI might go away, who knows? Progress in
non-volatile semiconductor storage might make the whole
rotating-platter-with-a-seek-head thing obsolete.
read_ahead_kb is likely to be stable. writeback_kb is a stable concept
too, although we might lose the ability to keep track of it some time in
the future.
Suppose that /dev/sda and /dev/sdb share the same queue - we lose the ability
to track some of these things?
> They could possibly be moved to debugfs, or something.
>
> I agree, that sysfs should be relatively stable.
This does look more like a debugging feature than a permanently-offered,
support-it-forever part of the kernel ABI.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists