[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20080711060714.GT1678@linux-os.sc.intel.com>
Date: Thu, 10 Jul 2008 23:07:15 -0700
From: Suresh Siddha <suresh.b.siddha@...el.com>
To: "Eric W. Biederman" <ebiederm@...ssion.com>
Cc: "Siddha, Suresh B" <suresh.b.siddha@...el.com>,
"mingo@...e.hu" <mingo@...e.hu>, "hpa@...or.com" <hpa@...or.com>,
"tglx@...utronix.de" <tglx@...utronix.de>,
"akpm@...ux-foundation.org" <akpm@...ux-foundation.org>,
"arjan@...ux.intel.com" <arjan@...ux.intel.com>,
"andi@...stfloor.org" <andi@...stfloor.org>,
"jbarnes@...tuousgeek.org" <jbarnes@...tuousgeek.org>,
"steiner@....com" <steiner@....com>,
"linux-kernel@...r.kernel.org" <linux-kernel@...r.kernel.org>
Subject: Re: [patch 23/26] x64, x2apic/intr-remap: MSI and MSI-X support for interrupt remapping infrastructure
On Thu, Jul 10, 2008 at 06:22:57PM -0700, Eric W. Biederman wrote:
> > As MSI is edge triggered, irq migration is a simple atomic update(of vector
> > and cpu destination) of IRTE and flushing the hardware cache.
>
> A couple of things. I believe MSI edge triggered irqs can always be migrated
> safely outside of irq context.
>
> Flushing the hardware cache sounds like it will flush the irqs towards the cpu.
> How do we flush the inflight irqs flushed to the apic. Does a register read work?
As I mentioned in other thread, we are waiting for the queued invalidation
descriptor to complete (in qi_submit_sync()). That will register the inflight
ones at the cpu's apic before setting the descriptor completion.
> For MSI irqs going directly to the cpu it should, as long as the cpu and local
> apic count as the same device from the perspective of pci ordering rules.
>
> > +++ tree-x86/arch/x86/kernel/io_apic_64.c 2008-07-10 09:52:34.000000000 -0700
> > @@ -2289,6 +2289,9 @@
> >
> > dynamic_irq_cleanup(irq);
> >
> > +#ifdef CONFIG_INTR_REMAP
> > + free_irte(irq);
> > +#endif
> > spin_lock_irqsave(&vector_lock, flags);
> > __clear_irq_vector(irq);
> > spin_unlock_irqrestore(&vector_lock, flags);
> > @@ -2307,11 +2310,42 @@
> >
> > tmp = TARGET_CPUS;
> > err = assign_irq_vector(irq, tmp);
> > - if (!err) {
> > - cpus_and(tmp, cfg->domain, tmp);
> > - dest = cpu_mask_to_apicid(tmp);
> > + if (err)
> > + return err;
> > +
> > + cpus_and(tmp, cfg->domain, tmp);
> > + dest = cpu_mask_to_apicid(tmp);
>
> Can we simplify this a little. In particular have a function
>
> struct IOAPIC_ROUTE_entry x86_map_irq(irq, mask);
>
> Where x86_map_irq would ultimately figure out the path to the cpu.
> In the simple case it would just call assign_irq_vector();
> When irqs are remapped it would perform the additional
But we already know that the irq's are remapped, as we are using different
irq_chip's when irq's are remapped.
> map_irq_to_irte_handle();
> modify_irte(irq, &irte);
>
> And then have the generic msi code and the ioapic code.
> Map from the struct IOAPIC_ROUTE_entry or whatever to the appropriate bits for the hardware
> they control.
>
> That should allows us a lot more flexibility going forward with less code then is in your
> patches.
Are you talking about the setup code or the migration code? Because in migration
code, we don't even touch MSI/IO-apic devices (for edge atleast) and we
already use different irq_chip's for that.
For initial setup, I agree that it can use some simplifications. It's getting
late here and I will look at all your suggestions tomorrow.
thanks,
suresh
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
Powered by blists - more mailing lists