Message-ID: <20160827133642.4znpf3aprhgbdtwv@thunk.org>
Date: Sat, 27 Aug 2016 09:36:42 -0400
From: Theodore Ts'o <tytso@....edu>
To: Roy Yang <roy@...esity.com>
Cc: linux-ext4@...r.kernel.org
Subject: Re: Ext4 stuck at wait_transaction_locked
On Thu, Aug 25, 2016 at 11:52:07PM -0700, Roy Yang wrote:
> I need your help to debug one ext4 issue. We consistently see Ext4
> stuck at wait_transaction_locked after another process is killed by
> cgroup because of oom. We have two processes keeping writing data to
> the same disk, and one was killed because of oom; the other process
> will stall at all I/O operations pretty soon.
You're using an ancient, 3.10-based RHEL 7 kernel:
> Linux sedhaswell04-node-1 3.10.0-327.22.2.el7.cohesity.x86_64 #1 SMP
> Tue Jul 5 12:41:09 PDT 2016 x86_64 x86_64 x86_64 GNU/Linux
As far as I know this bug does not exist in the upstream kernel ---
but the 3.10 kernel was released in June 2013, and since then changes
are the responsibility of Red Hat / CentOS. So you would need to get
support from Red Hat, since they have made a huge number of changes to
the kernel.
If you had given us the stack trace from the task that got OOM-killed,
we might be able to take a quick look, but if you use a distribution
kernel, it is the responsibility of the distribution to support you
--- this is, after all, why they get paid the big bucks. :-)
If you want to use the latest upstream kernel, we would be much
more likely to help, although of course unlike Red Hat we don't have
any kind of guaranteed response time. For that, you would need to go
find a distribution and pay the aforementioned big bucks. :-)
Cheers,
- Ted