err.no Git - linux-2.6/commit

author	Hugh Dickins <hugh@veritas.com>
	Thu, 7 Feb 2008 08:14:22 +0000 (00:14 -0800)
committer	Linus Torvalds <torvalds@woody.linux-foundation.org>
	Thu, 7 Feb 2008 16:42:20 +0000 (08:42 -0800)
commit	82369553d6d3bc67c54129a02e0bc0b5b88f3045
tree	1d80a6cc9f5840550ad025b32ac8ef8fd915fd98	tree \| snapshot
parent	3be91277e754c7db04eae145ba622b3a3e3ad96d	commit \| diff

memcgroup: fix hang with shmem/tmpfs

The memcgroup regime relies upon a cgroup reclaiming pages from itself within
add_to_page_cache: which may involve some waiting.  Whereas shmem and tmpfs
rely upon using add_to_page_cache while holding a spinlock: when it cannot
wait.  The consequence is that when a cgroup reaches its limit, shmem_getpage
just hangs - unless there is outside memory pressure too, neither kswapd nor
radix_tree_preload get it out of the retry loop.

In most cases we can mem_cgroup_cache_charge the page waitably first, to
attach the page_cgroup in advance, so add_to_page_cache will do no more than
increment a count; then mem_cgroup_uncharge_page after (in both success and
failure cases) to balance the books again.

And where there used to be a congestion_wait for kswapd (recently made
redundant by radix_tree_preload), use mem_cgroup_cache_charge with NULL page
to go through a cycle of allocation and freeing, without accounting to any
particular page, and without updating the statistics vector.  This brings the
cgroup below its limit so the next try usually succeeds.

Signed-off-by: Hugh Dickins <hugh@veritas.com>
Cc: Balbir Singh <balbir@linux.vnet.ibm.com>
Cc: KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
Cc: Mel Gorman <mel@csn.ul.ie>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>

mm/memcontrol.c		diff \| blob \| history
mm/shmem.c		diff \| blob \| history