[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <87k0chhmjw.ffs@tglx>
Date: Fri, 25 Mar 2022 20:14:27 +0100
From: Thomas Gleixner <tglx@...utronix.de>
To: kernel test robot <oliver.sang@...el.com>,
Artem Savkov <asavkov@...hat.com>
Cc: 0day robot <lkp@...el.com>, Josh Poimboeuf <jpoimboe@...hat.com>,
LKML <linux-kernel@...r.kernel.org>, lkp@...ts.01.org,
netdev@...r.kernel.org, davem@...emloft.net,
yoshfuji@...ux-ipv6.org, dsahern@...nel.org,
Artem Savkov <asavkov@...hat.com>
Subject: Re: [timer] d41e0719d5:
UBSAN:shift-out-of-bounds_in_lib/flex_proportions.c
On Fri, Mar 25 2022 at 15:38, kernel test robot wrote:
> [ 42.401895][ C0] UBSAN: shift-out-of-bounds in lib/flex_proportions.c:80:20
> [ 42.410963][ C0] shift exponent -1007885658 is negative
Cute.
> [ 42.416462][ C0] CPU: 0 PID: 330 Comm: sed Tainted: G I 5.17.0-rc6-00027-gd41e0719d576 #1
> [ 42.426240][ C0] Hardware name: Dell Inc. OptiPlex 7040/0Y7WYT, BIOS 1.1.1 10/07/2015
> [ 42.434363][ C0] Call Trace:
> [ 42.437516][ C0] <TASK>
> [ 42.440319][ C0] dump_stack_lvl (lib/dump_stack.c:107)
> [ 42.444699][ C0] ubsan_epilogue (lib/ubsan.c:152)
> [ 42.448985][ C0] __ubsan_handle_shift_out_of_bounds.cold (lib/ubsan.c:330)
> [ 42.455618][ C0] ? cpumask_next (lib/cpumask.c:23)
> [ 42.459996][ C0] ? __percpu_counter_sum (lib/percpu_counter.c:138)
> [ 42.465248][ C0] fprop_new_period.cold (lib/flex_proportions.c:80 (discriminator 1))
> [ 42.470224][ C0] writeout_period (mm/page-writeback.c:623)
So it seems a timer fired early. Which then makes writeout_period() go south:
int miss_periods = (jiffies - dom->period_time) / VM_COMPLETIONS_PERIOD_LEN;
If jiffies < dom->period_time the result is a very large negative
number.
This happens because of:
> @@ -67,7 +67,8 @@ struct timer_list {
> #define TIMER_DEFERRABLE 0x00080000
> #define TIMER_PINNED 0x00100000
> #define TIMER_IRQSAFE 0x00200000
> -#define TIMER_INIT_FLAGS (TIMER_DEFERRABLE | TIMER_PINNED | TIMER_IRQSAFE)
> +#define TIMER_UPPER_BOUND 0x00400000
> +#define TIMER_INIT_FLAGS (TIMER_DEFERRABLE | TIMER_PINNED | TIMER_IRQSAFE | TIMER_UPPER_BOUND)
> #define TIMER_ARRAYSHIFT 22
> #define TIMER_ARRAYMASK 0xFFC00000
TIMER_UPPER_BOUND steals a bit from the ARRAYMASK. So if the timer is
armed and the stored arraymask happens to have bit 22 set, then on the
next arming of the timer it will be treated as upper bound timer,
expires early and all hell breaks lose. The same can happen the other
way round. So I really have to ask how this ever "worked".
Thanks,
tglx
Powered by blists - more mailing lists