lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <86802c440808161156rf48f23ai9d77ce3cab36f02a@mail.gmail.com>
Date:	Sat, 16 Aug 2008 11:56:54 -0700
From:	"Yinghai Lu" <yhlu.kernel@...il.com>
To:	"James Bottomley" <James.Bottomley@...senpartnership.com>
Cc:	"Alan Cox" <alan@...rguk.ukuu.org.uk>,
	"H. Peter Anvin" <hpa@...or.com>,
	"Jesse Barnes" <jbarnes@...tuousgeek.org>,
	"Ingo Molnar" <mingo@...e.hu>,
	"Thomas Gleixner" <tglx@...utronix.de>,
	"Eric W. Biederman" <ebiederm@...ssion.com>,
	"Andrew Morton" <akpm@...ux-foundation.org>,
	linux-kernel@...r.kernel.org,
	"Andrew Vasquez" <andrew.vasquez@...gic.com>
Subject: Re: [PATCH] pci: change msi-x vector to 32bit

On Sat, Aug 16, 2008 at 9:13 AM, James Bottomley
<James.Bottomley@...senpartnership.com> wrote:
> On Sat, 2008-08-16 at 16:39 +0100, Alan Cox wrote:
>> > Where exactly is this code in the kernel?  Most arches assume the irq is
>> > an index to a compact table bounded by NR_IRQS, so something like this
>> > would violate that assumption.
>>
>> Yes, which is no bad thing for some platforms. There are some driver
>> assumptions like that but those have also been stomped.
>
> I'm not saying we couldn't do this, or even that we shouldn't; I'm just
> asking why would we want to?
>
> All arches currently seem to have show_interrupts() which loop over
> 0..NR_IRQS where the interrupt is printed as %d.  In this encoded scheme
> they would show up with rather nastily large numbers that have no
> visible meaning unless we switch to hex for displaying them.
>
> What I'm really saying is that irq as the interrupt number is really the
> *user's* handle for the interrupt not the machine's, so it needs to be
> something the user is comfortable with.  We could overcome this
> objection by encoding the number to something meaningful for the
> user ... I'm just asking if there's any benefit to doing this?
>
the code is tip/irq/sparseirq or tip/master

story:
1. for x86_64: first we have NR_IRQS = NR_CPUS * NR_VECTORS, because
it already supports per_cpu vector
2. SGI want MAX_SMP support: NR_CPUS=4096, so everything is broken.
3. Mike spent some time to make every array [NR_CPUS]  to per_cpu
define as possible.
4. Mike or someone else reduce NR_IRQS to 224, because NR=256*4096,
will make kstat_irqs[NR_CPUS][NR_VECTORS*NR_VECTORS] too big, and it
could be complied.
5. IBM guys report their one server is broken, that system GSI > 256,
so some irq can not work.
6. Yinghai tried one patch change NR_IRQS=32*NR_CPUS., but sgi said it
still broke their system.  --- for 2.6.27
7. Eric provide one patch NR_IRQS = min(32*NR_CPUS, NR_VECTORS *
MAX_IO_APICS) --- for 2.6.27
8. For 2.6.28 later, Yinghai add code dyn_array, and probe nr_irqs, so
NR_IRQS related will be dynamically allocated after nr_irqs is probed.
9. Eric said using dyn_array still waste ram, because a lot of
irq_desc is not used. when MSI-X is involved, some card could use 256
vectors or 4096 in theory.
10. Eric said he had one dyn irq_desc, with 90% done. but didn't have
time to work it out left 10%
11. Yinghai add sparese_irq support. those array will be increased by
32, and be claimed one by one.
12. according to Eric, we could have irq spread out [0, -1U), irq =
bus/dev/fn + entry_of_msix
13. with sparseirq, /proc/interrupts will have irq_number in hex.

but msix current cached irq number, and it only use 16bit to store
unsigned int irq., and later cards will call request_irq with
truncated irq_number...card will fallback to MSI or INTa

only two places need to be changed about that.

BTW, any reason qlogic card need to cache that irq number second times?

YH


system with qlogic and lpfc

LBSuse:~ # cat /proc/interrupts
           CPU0       CPU1       CPU2       CPU3       CPU4       CPU5
      CPU6       CPU7       CPU8       CPU9       CPU10      CPU11
 CPU12      CPU13      CPU14      CPU15
0x0:        111          0          0          0          0          0
         0          0          0          0          0          0
    0          0          0          0   IO-APIC-edge      timer
0x4:        450          0          0          0          0          0
         0          0          0          0          0          0
    0          0          0          0   IO-APIC-edge      serial
0x7:          1          0          0          0          0          0
         0          0          0          0          0          0
    0          0          0          0   IO-APIC-edge
0x8:          1          0          0          0          0          0
         0          0          0          0          0          0
    0          0          0          0   IO-APIC-edge      rtc0
0x9:          0          0          0          0          0          0
         0          0          0          0          0          0
    0          0          0          0   IO-APIC-fasteoi   acpi
0x17:          0          0          0          0          0
0          0          0          0          0          0          0
      0          0          0          0   IO-APIC-fasteoi   sata_nv
0x16:        140          0          0          0          0
0          0          0          0          0          0          0
      0          0          0          0   IO-APIC-fasteoi
ohci_hcd:usb2, sata_nv
0x15:        384          0          0          0          0
0          0          0          0          0          0          0
      0          0          0          0   IO-APIC-fasteoi
ehci_hcd:usb1
0x14:          0          0          0          0          0
0          0          0          0          0          0          0
      0          0          0          0   IO-APIC-fasteoi   sata_nv
0x10:       1083          0          0          0          0
0          0          0          0          0          0          0
      0          0          0          0   IO-APIC-fasteoi   aacraid
0x2e:          0          0          0          0          0
0          0          0          0          0          0          0
      0          0          0          0   IO-APIC-fasteoi   sata_nv
0x2d:          0          0          0          0          0
0          0          0          0          0          0          0
      0          0          0          0   IO-APIC-fasteoi   sata_nv
0x2c:          0          0          0          0          0
0          0          0          0          0          0          0
      0          0          0          0   IO-APIC-fasteoi   sata_nv
0x50100:          0          0          0          0          0
  0          0          0          0          0          0          0
        0          0          0          0   PCI-MSI-edge      aerdrv
0x70100:          0          0          0          0          0
  0          0          0          0          0          0          0
        0          0          0          0   PCI-MSI-edge      aerdrv
0x78100:          0          0          0          0          0
  0          0          0          0          0          0          0
        0          0          0          0   PCI-MSI-edge      aerdrv
0x8058100:          0          0          0          0          0
    0          0          0          0          0          0
0          0          0          0          0   PCI-MSI-edge
aerdrv
0x8070100:          0          0          0          0          0
    0          0          0          0          0          0
0          0          0          0          0   PCI-MSI-edge
aerdrv
0x8078100:          0          0          0          0          0
    0          0          0          0          0          0
0          0          0          0          0   PCI-MSI-edge
aerdrv
0x8300100:         41          0          0          0          0
    0          0          0          0          0          0
0          0          0          0          0   PCI-MSI-edge
qla2xxx (default)
0x83000ff:          0          0          0          0          0
    0          0          0          0          0          0
0          0          0          0          0   PCI-MSI-edge
qla2xxx (rsp_q)
0x8301100:         41          0          0          0          0
    0          0          0          0          0          0
0          0          0          0          0   PCI-MSI-edge
qla2xxx (default)
0x83010ff:          0          0          0          0          0
    0          0          0          0          0          0
0          0          0          0          0   PCI-MSI-edge
qla2xxx (rsp_q)
0x300100:          2          0          0          0          0
   0          0          0          0          0          0          0
         0          0          0          0   PCI-MSI-edge      lpfc
0x301100:          2          0          0          0          0
   0          0          0          0          0          0          0
         0          0          0          0   PCI-MSI-edge      lpfc
0x40100:        326          0          0          0          0
  0          0          0          0          0          0          0
        0          0          0          0      none-edge
0x48100:        328          0          0          0          0
  0          0          0          0          0          0          0
        0          0          0          0      none-edge
0x8040100:       2222          0          0          0          0
    0          0          0          0          0          0
0          0          0          0          0   PCI-MSI-edge      eth2
0x8048100:        326          0          0          0          0
    0          0          0          0          0          0
0          0          0          0          0      none-edge
NMI:          0          0          0          0          0          0
         0          0          0          0          0          0
    0          0          0          0   Non-maskable interrupts
LOC:       8782       5209       3029       3222       4556       3328
      2862       2782       2730       3218       2742       2655
 3664       3099       3146       3356   Local timer interrupts
RES:        904       2930         98         65       1083       3723
       158         84         46       1899        157         60
 2476        971        114         97   Rescheduling interrupts
CAL:         12         89         71         65         65        142
        77         66         65        118         77         67
   66        106         72         67   function call interrupts
TLB:          7         90         18          5          3        115
        16         10          3        123         19          5
    2        157         18          3   TLB shootdowns
TRM:          0          0          0          0          0          0
         0          0          0          0          0          0
    0          0          0          0   Thermal event interrupts
THR:          0          0          0          0          0          0
         0          0          0          0          0          0
    0          0          0          0   Threshold APIC interrupts
SPU:          0          0          0          0          0          0
         0          0          0          0          0          0
    0          0          0          0   Spurious interrupts
ERR:          1

system with neptune:
LBSuse:~ # cat /proc/interrupts
           CPU0       CPU1       CPU2       CPU3       CPU4       CPU5
      CPU6       CPU7
0x0:         92          0          0          0          0          0
         0          1   IO-APIC-edge      timer
0x4:          0          0          0          0          0          0
         1        532   IO-APIC-edge      serial
0x7:          1          0          0          0          0          0
         0          0   IO-APIC-edge
0x8:          0          0          0          0          0          0
         0          1   IO-APIC-edge      rtc0
0x9:          0          0          0          0          0          0
         0          0   IO-APIC-fasteoi   acpi
0x17:          0          0          0          0          0
0          0          0   IO-APIC-fasteoi   sata_nv
0x16:          0          0          0          0          0
0          2        105   IO-APIC-fasteoi   ohci_hcd:usb2
0x15:          0          0          0          0          0
0          0       1014   IO-APIC-fasteoi   ehci_hcd:usb1
0x14:          0          0          0          0          0
0          0          1   IO-APIC-fasteoi   sata_nv, sata_nv
0x2e:          0          0          0          0          0
0          0          0   IO-APIC-fasteoi   sata_nv
0x2d:          0          0          0          0          0
0          0          0   IO-APIC-fasteoi   sata_nv
0x2c:          0          0          0          0          0
0          0          0   IO-APIC-fasteoi   sata_nv
0x50100:          0          0          0          0          0
  0          0          0   PCI-MSI-edge      aerdrv
0x70100:          0          0          0          0          0
  0          0          0   PCI-MSI-edge      aerdrv
0x78100:          0          0          0          0          0
  0          0          0   PCI-MSI-edge      aerdrv
0x8058100:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      aerdrv
0x8070100:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      aerdrv
0x8078100:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      aerdrv
0x8301100:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010ff:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010fe:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010fd:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010fc:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010fb:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010fa:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010f9:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010f8:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010f7:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010f6:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010f5:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010f4:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010f3:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010f2:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010f1:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010f0:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010ef:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010ee:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010ed:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x83010ec:          0          0          0          0          0
    0          0          0   PCI-MSI-edge      eth5
0x40100:          0          0          0          0          0
  0          9       5352   PCI-MSI-edge      eth0
0x48100:          0          0          0          0          0
  0          4        148      none-edge
0x8040100:          0          0          0        154          0
    0          0          0      none-edge
0x8048100:          0          0          0        154          0
    0          0          0      none-edge
NMI:          0          0          0          0          0          0
         0          0   Non-maskable interrupts
LOC:       4780       4021       2441       2831       3978       3672
      2576       4601   Local timer interrupts
RES:        647       4295        485        282       1324       3561
       620       1902   Rescheduling interrupts
CAL:         18         92         53         44         33         53
        47         39   function call interrupts
TLB:         23        176         65         41         48        274
        95         62   TLB shootdowns
TRM:          0          0          0          0          0          0
         0          0   Thermal event interrupts
THR:          0          0          0          0          0          0
         0          0   Threshold APIC interrupts
SPU:          0          0          0          0          0          0
         0          0   Spurious interrupts
ERR:          1
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ