lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <47FBEBA9.30005@sgi.com>
Date:	Tue, 08 Apr 2008 15:03:21 -0700
From:	Mike Travis <travis@....com>
To:	Yinghai Lu <yhlu.kernel@...il.com>
CC:	Ingo Molnar <mingo@...e.hu>, Thomas Gleixner <tglx@...utronix.de>,
	"H. Peter Anvin" <hpa@...or.com>,
	Andrew Morton <akpm@...ux-foundation.org>,
	linux-kernel@...r.kernel.org
Subject: Re: [PATCH 0/2] NR_CPUS: increase maximum NR_CPUS to 4096

Yinghai Lu wrote:
> On Fri, Apr 4, 2008 at 6:30 PM, Mike Travis <travis@....com> wrote:
>>   * Increases the limit of NR_CPUS to 4096 and introduces a
>>     boolean called "MAXSMP" which when set (e.g. "allyesconfig")
>>     will set NR_CPUS = 4096 and NODES_SHIFT = 9 (512).
>>
>>     I've been running this config (4k NR_CPUS, 512 Max Nodes)
>>     on an AMD box with 2 dual-cores and 4gb memory as well as an
>>     Intel box with 4 single-core cpus and 8Mb.  I've also
>>     successfully booted it in a simulated 2cpus/1Gb environment.
>>
>>  Based on:
>>         git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux-2.6.git
>>     +   x86/latest          .../x86/linux-2.6-x86.git
>>     +   sched-devel/latest  .../mingo/linux-2.6-sched-devel.git
>>
>>  Signed-off-by: Mike Travis <travis@....com>
> 
> got

Hi Yinghai,

Thanks for the feedback!  Would you send me your config file and
other details (like cpu type/mem size/etc.) and I'll attempt
to reproduce the failure.

(My problem is that only the AMD box is a real "workstation", the
Intel box is a dual quad-cpu server so it's really deficient in I/O.)

Thanks,
Mike
> 
> ------------[ cut here ]------------
> WARNING: at kernel/sched_fair.c:815 hrtick_start_fair+0x69/0x156()
> Modules linked in:
> Pid: 1, comm: swapper Not tainted
> 2.6.25-rc8-x86-latest.git-smp-01033-ga39ae31-dirty #77
> 
> Call Trace:
>  [<ffffffff802596ce>] warn_on_slowpath+0x67/0x8e
>  [<ffffffff8024b266>] hrtick_start_fair+0x69/0x156
>  [<ffffffff8024a619>] ? dequeue_entity+0x2a/0xf8
>  [<ffffffff8025547d>] dequeue_task_fair+0x5f/0x7e
>  [<ffffffff80248ea3>] dequeue_task+0x22/0x44
>  [<ffffffff80248efe>] deactivate_task+0x39/0x69
>  [<ffffffff80a57c31>] schedule+0x1b9/0x5c5
>  [<ffffffff80270dff>] ? autoremove_wake_function+0x20/0x5e
>  [<ffffffff80a58338>] schedule_timeout+0x31/0xd7
>  [<ffffffff8024db1a>] ? __wake_up+0x52/0x75
>  [<ffffffff80a578a3>] wait_for_common+0x103/0x189
>  [<ffffffff8024f29b>] ? default_wake_function+0x0/0x36
>  [<ffffffff80a57a62>] wait_for_completion+0x2b/0x41
>  [<ffffffff8026c0d2>] call_usermodehelper_exec+0x87/0xe5
>  [<ffffffff80561073>] kobject_uevent_env+0x3d0/0x424
>  [<ffffffff805610e5>] kobject_uevent+0x1e/0x34
>  [<ffffffff805f2dbd>] device_add+0x2f9/0x494
>  [<ffffffff805f2f80>] device_register+0x28/0x43
>  [<ffffffff80577abf>] pcie_port_device_register+0x3f1/0x43e
>  [<ffffffff80972165>] ? pcibios_set_master+0x8d/0xa8
>  [<ffffffff80a24a1c>] pcie_portdrv_probe+0x79/0xbb
>  [<ffffffff80574173>] pci_call_probe+0xe5/0x146
>  [<ffffffff80574331>] pci_device_probe+0x64/0xa2
>  [<ffffffff805f5946>] driver_probe_device+0xcf/0x16d
>  [<ffffffff8031d708>] ? sysfs_addrm_finish+0x2f/0x22b
>  [<ffffffff805f5b0b>] ? __driver_attach+0x0/0xbe
>  [<ffffffff805f5b79>] __driver_attach+0x6e/0xbe
>  [<ffffffff805f4999>] bus_for_each_dev+0x5e/0xa2
>  [<ffffffff805f571f>] driver_attach+0x2f/0x45
>  [<ffffffff805f5430>] bus_add_driver+0xc6/0x226
>  [<ffffffff805f45a4>] ? bus_put+0x29/0x3f
>  [<ffffffff805f5e66>] driver_register+0x6d/0xfc
>  [<ffffffff805745fa>] __pci_register_driver+0x62/0xb0
>  [<ffffffff818b25db>] pcie_portdrv_init+0x4a/0x72
>  [<ffffffff81890bcf>] kernel_init+0x1b4/0x340
>  [<ffffffff80225308>] child_rip+0xa/0x12
>  [<ffffffff81890a1b>] ? kernel_init+0x0/0x340
>  [<ffffffff802252fe>] ? child_rip+0x0/0x12
> 
> ---[ end trace e26645195698f5cf ]---
> BUG: unable to handle kernel NULL pointer dereference at 0000000000000148
> IP: [<ffffffff8024b3cf>] pick_next_task_fair+0x7c/0xbb
> PGD 0
> Oops: 0000 [1] SMP
> CPU 28
> Modules linked in:
> Pid: 1, comm: swapper Not tainted
> 2.6.25-rc8-x86-latest.git-smp-01033-ga39ae31-dirty #77
> RIP: 0010:[<ffffffff8024b3cf>]  [<ffffffff8024b3cf>]
> pick_next_task_fair+0x7c/0xbb
> RSP: 0018:ffff81081cc5cd70  EFLAGS: 00010046
> RAX: 0000000000000000 RBX: ffff81383c21a280 RCX: 0000000000000000
> RDX: ffff81383c224080 RSI: ffff81383c224080 RDI: 0000000063e15417
> RBP: ffff81081cc5cda0 R08: 0000000000000000 R09: ffff81383c224108
> R10: 0000000000000001 R11: 0000000000000000 R12: 0000000000000000
> R13: ffff81383c224080 R14: ffff81383c224080 R15: 000000000000001c
> FS:  0000000000000000(0000) GS:ffff81401cc3c600(0000) knlGS:0000000000000000
> CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
> CR2: 0000000000000148 CR3: 0000000000201000 CR4: 00000000000006e0
> DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
> Process swapper (pid: 1, threadinfo ffff81081cc5c000, task ffff81401cc52000)
> Stack:  ffff81081cc5cda0 0000000063e15417 ffffffff80a81840 0000000000000000
>  00000000fffeecfd ffff81383c224080 ffff81081cc5ce70 ffffffff80a57d28
>  ffff81081cc5ce00 ffff81081cc5ce20 ffffffff81963080 ffffffff81963080
> Call Trace:
>  [<ffffffff80a57d28>] schedule+0x2b0/0x5c5
>  [<ffffffff80270dff>] ? autoremove_wake_function+0x20/0x5e
>  [<ffffffff80a58338>] schedule_timeout+0x31/0xd7
>  [<ffffffff8024db1a>] ? __wake_up+0x52/0x75
>  [<ffffffff80a578a3>] wait_for_common+0x103/0x189
>  [<ffffffff8024f29b>] ? default_wake_function+0x0/0x36
>  [<ffffffff80a57a62>] wait_for_completion+0x2b/0x41
>  [<ffffffff8026c0d2>] call_usermodehelper_exec+0x87/0xe5
>  [<ffffffff80561073>] kobject_uevent_env+0x3d0/0x424
>  [<ffffffff805610e5>] kobject_uevent+0x1e/0x34
>  [<ffffffff805f2dbd>] device_add+0x2f9/0x494
>  [<ffffffff805f2f80>] device_register+0x28/0x43
>  [<ffffffff80577abf>] pcie_port_device_register+0x3f1/0x43e
>  [<ffffffff80972165>] ? pcibios_set_master+0x8d/0xa8
>  [<ffffffff80a24a1c>] pcie_portdrv_probe+0x79/0xbb
>  [<ffffffff80574173>] pci_call_probe+0xe5/0x146
>  [<ffffffff80574331>] pci_device_probe+0x64/0xa2
>  [<ffffffff805f5946>] driver_probe_device+0xcf/0x16d
>  [<ffffffff8031d708>] ? sysfs_addrm_finish+0x2f/0x22b
>  [<ffffffff805f5b0b>] ? __driver_attach+0x0/0xbe
>  [<ffffffff805f5b79>] __driver_attach+0x6e/0xbe
>  [<ffffffff805f4999>] bus_for_each_dev+0x5e/0xa2
>  [<ffffffff805f571f>] driver_attach+0x2f/0x45
>  [<ffffffff805f5430>] bus_add_driver+0xc6/0x226
>  [<ffffffff805f45a4>] ? bus_put+0x29/0x3f
>  [<ffffffff805f5e66>] driver_register+0x6d/0xfc
>  [<ffffffff805745fa>] __pci_register_driver+0x62/0xb0
>  [<ffffffff818b25db>] pcie_portdrv_init+0x4a/0x72
>  [<ffffffff81890bcf>] kernel_init+0x1b4/0x340
>  [<ffffffff80225308>] child_rip+0xa/0x12
>  [<ffffffff81890a1b>] ? kernel_init+0x0/0x340
>  [<ffffffff802252fe>] ? child_rip+0x0/0x12
> 
> 
> Code: 24 40 78 1c 8b 3d 36 05 b3 00 48 89 da be 00 04 00 00 e8 7a eb
> ff ff 49 39 c6 7f 04 4c 8b 63 48 4c 89 e6 4
> 8 89 df e8 29 f1 ff ff <49> 8b 9c 24 48 01 00 00 48 85 db 75 a5 49 8d
> 5c 24 c8 4c 89 ef
> RIP  [<ffffffff8024b3cf>] pick_next_task_fair+0x7c/0xbb
>  RSP <ffff81081cc5cd70>
> CR2: 0000000000000148
> ---[ end trace e26645195698f5cf ]---

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ