linux-kernel - Re: [PATCH 2/2] printk: make sure to print log on console.

lists.openwall.net		lists / announce owl-users owl-dev john-users john-dev passwdqc-users yescrypt popa3d-users / oss-security kernel-hardening musl sabotage tlsify passwords / crypt-dev xvendor / Bugtraq Full-Disclosure linux-kernel linux-netdev linux-ext4 linux-hardening linux-cve-announce PHC
Open Source and information security mailing list archives

Hash Suite: Windows password security audit tool. GUI, reports in PDF.

[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]

Message-ID: <20180601090942.ek3j4bpbhschljrw@pathway.suse.cz>
Date:   Fri, 1 Jun 2018 11:09:42 +0200
From:   Petr Mladek <pmladek@...e.com>
To:     Sergey Senozhatsky <sergey.senozhatsky.work@...il.com>
Cc:     Maninder Singh <maninder1.s@...sung.com>,
        sergey.senozhatsky@...il.com, rostedt@...dmis.org,
        linux-kernel@...r.kernel.org, a.sahrawat@...sung.com,
        pankaj.m@...sung.com, v.narang@...sung.com
Subject: Re: [PATCH 2/2] printk: make sure to print log on console.

On Fri 2018-06-01 10:53:56, Petr Mladek wrote:
> On Fri 2018-06-01 13:40:50, Sergey Senozhatsky wrote:
> > On (05/31/18 14:21), Petr Mladek wrote:
> > > > 
> > > > Upstream printk has no printing kthread. And we also run
> > > > printk()->console_unlock() with disabled preemption.
> > > 
> > > Yes, the comment was wrong
> > 
> > Yes, that was the only thing I meant.
> > I really didn't have any time to look at the patch yesterday, just
> > commented on the most obvious thing.
> 
> Fair enough.
> 
> > > but the problem is real.
> > 
> > Yep, could be. But not exactly the way it is described in the commit
> > messages and the patch does not fully address the problem.
> > 
> > The patch assumes that all those events happen sequentially. While
> > in reality they can happen in parallel on different CPUs.
> > 
> > Example:
> > 
> > 	CPU0					CPU1
> > 
> > 	set console verbose
> > 
> > 	dump_backtrace()
> > 	{
> > 		// for (;;)  print frames
> > 		printk("%pS\n", frame0);
> > 		printk("%pS\n", frame1);
> > 		printk("%pS\n", frame2);
> > 		printk("%pS\n", frame3);
> > 		...				console_loglevel = CONSOLE_LOGLEVEL_SILENT;
> > 		printk("%pS\n", frame12);
> > 		printk("%pS\n", frame13);
> > 	}
> > 
> > Part of backtrace or the entire backtrace will be missed, because
> > we read the global console_loglevel. The problem is still there.
> 
> [...]
> 
> > So I'd say that most likely the following scenarios can suffer:
> > 
> > - NMI comes in, sets loglevel to X, printk-s some data, restores the
> >   loglevel back to Y
> > - IRQ comes in [like sysrq, etc] comes in and does the same thing
> > - software exception comes in and does the same thing [e.g. bust_spinlocks()
> >   at arch/s390/mm/fault.c]

I forgot to say that it was a great point and analyze.

> My view is:
> 
> The race with another printk() (console_lock owner) is much more
> likely than a race between two CPUs manipulating console_loglevel.
> 
> The proposed patch seems to be in the right direction. It is supposed
> to fix the most likely scenario. We could block it and request full
> solution but I wonder if it is worth it.
> 
> I am personally fine with this partial solution for now. We could
> always make it better if people meet the other scenarios.

I am still fine with the partial solution. Well, I will think
more about it before approving any patch.

Best Regards,
Petr