linux-kernel - Re: [PATCH] mm: add replace_page_cache

lists.openwall.net		lists / announce owl-users owl-dev john-users john-dev passwdqc-users yescrypt popa3d-users / oss-security kernel-hardening musl sabotage tlsify passwords / crypt-dev xvendor / Bugtraq Full-Disclosure linux-kernel linux-netdev linux-ext4 linux-hardening linux-cve-announce PHC
Open Source and information security mailing list archives

Hash Suite: Windows password security audit tool. GUI, reports in PDF.

[<prev] [next>] [<thread-prev] [day] [month] [year] [list]

Message-Id: <E1PcagC-0007Ge-40@pomaz-ex.szeredi.hu>
Date:	Tue, 11 Jan 2011 10:38:16 +0100
From:	Miklos Szeredi <miklos@...redi.hu>
To:	Daisuke Nishimura <nishimura@....nes.nec.co.jp>
CC:	kamezawa.hiroyu@...fujitsu.com, miklos@...redi.hu,
	akpm@...ux-foundation.org, minchan.kim@...il.com,
	linux-fsdevel@...r.kernel.org, linux-mm@...ck.org,
	linux-kernel@...r.kernel.org, nishimura@....nes.nec.co.jp
Subject: Re: [PATCH] mm: add replace_page_cache_page() function

On Tue, 11 Jan 2011, Daisuke Nishimura wrote:
> > What I recommend is below. (Please see the newest -mm because of a bug fix for
> > mem cgroup) Considering page management on radix-tree, it can be considerd as
> > a kind of page-migration, which replaces pages on radix-tree.
> > 
> > ==
> > 
> > > +int replace_page_cache_page(struct page *old, struct page *new, gfp_t gfp_mask)
> > > +{
> > > +	int error;
> > > +
> > > +	VM_BUG_ON(!PageLocked(old));
> > > +	VM_BUG_ON(!PageLocked(new));
> > > +	VM_BUG_ON(new->mapping);
> > > +
> > 	struct mem_cgroup *memcg;
> > 
> I think it should be initialized to NULL.
> 
> > 	error = mem_cgroup_prepare_migration(old, new, &memcg);
> 
> I want some comments like:
> 
> 	/*
> 	 * This is not page migration, but prepare_migration and end_migration
> 	 * does enough work for charge replacement.
> 	 */
> 
> > 	#
> > 	# This function will charge against "newpage". But this expects
> > 	# the caller allows GFP_KERNEL gfp_mask. 
> > 	# After this, the newpage is in "charged" state.
> > 	if (error)
> > 		return -ENOMEM;
> > 
> > > +	error = radix_tree_preload(gfp_mask & ~__GFP_HIGHMEM);
> > > +	if (!error) {
> > > +		struct address_space *mapping = old->mapping;
> > > +		pgoff_t offset = old->index;
> > > +
> > > +		page_cache_get(new);
> > > +		new->mapping = mapping;
> > > +		new->index = offset;
> > > +
> > > +		spin_lock_irq(&mapping->tree_lock);
> > > +		__remove_from_page_cache(old);
> > > +		error = radix_tree_insert(&mapping->page_tree, offset, new);
> > > +		BUG_ON(error);
> > > +		mapping->nrpages++;
> > > +		__inc_zone_page_state(new, NR_FILE_PAGES);
> > > +		if (PageSwapBacked(new))
> > > +			__inc_zone_page_state(new, NR_SHMEM);
> > > +		spin_unlock_irq(&mapping->tree_lock);
> > > +		radix_tree_preload_end();
> > 
> > > +		mem_cgroup_replace_cache_page(old, new); <== remove this.
> > 
> > 		mem_cgroup_end_migraton(memcg, old, new, true);
> > 
> > > +		page_cache_release(old);
> > > +	} 
> > 	else 
> > 		mem_cgroup_end_migration(memcg, old, new, false);
> > 
> > 	# Here, if the 4th argument is true, old page is uncharged.
> > 	# if the 4th argument is false, the new page is uncharged.
> > 	# Then, "charge" of the old page will be migrated onto the new page
> > 	# if replacement is done.
> > 
> > 
> > 
> > > +
> > > +	return error;
> > > +}
> > > +EXPORT_SYMBOL_GPL(replace_page_cache_page);
> > > +
> > 
> > ==
> > 
> > I think this is enough simple and this covers all memory cgroup's racy
> > problems.
> > 
> I agree.

Thanks for the comments.

Yeah, using existing infrastructure is undoubtedly simpler and less
prone to bugs.  So going with this for a first implementation might
do.

However, replace_page_cache_page() is meant to be very efficient,
otherwise any performance won by not copying the page contents are
lost to the cost of page replacement.  My guess is,
mem_cgroup_prepare_migration()/end_migration() are to heavyweight for
this.

Thanks,
Miklos
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/