linux-kernel - Re: [PATCH v10 2/2] irqchip/loongarch-avec: Add AVEC irqchip support

lists.openwall.net		lists / announce owl-users owl-dev john-users john-dev passwdqc-users yescrypt popa3d-users / oss-security kernel-hardening musl sabotage tlsify passwords / crypt-dev xvendor / Bugtraq Full-Disclosure linux-kernel linux-netdev linux-ext4 linux-hardening linux-cve-announce PHC
Open Source and information security mailing list archives
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <40b3b302-14a1-c158-99ed-feb81d3fefc4@loongson.cn>
Date: Wed, 21 Aug 2024 09:47:18 +0800
From: Tianyang Zhang <zhangtianyang@...ngson.cn>
To: Thomas Gleixner <tglx@...utronix.de>, corbet@....net, alexs@...nel.org,
 chenhuacai@...nel.org, kernel@...0n.name, jiaxun.yang@...goat.com,
 gaoliang@...ngson.cn, wangliupu@...ngson.cn, lvjianmin@...ngson.cn,
 yijun@...ngson.cn, mhocko@...e.com, akpm@...ux-foundation.org,
 dianders@...omium.org, maobibo@...ngson.cn, xry111@...111.site,
 zhaotianrui@...ngson.cn, nathan@...nel.org, yangtiezhu@...ngson.cn,
 zhoubinbin@...ngson.cn
Cc: loongarch@...ts.linux.dev, linux-doc@...r.kernel.org,
 linux-kernel@...r.kernel.org, Huacai Chen <chenhuacai@...ngson.cn>
Subject: Re: [PATCH v10 2/2] irqchip/loongarch-avec: Add AVEC irqchip support

Hi, Thomas

在 2024/8/21 上午12:29, Thomas Gleixner 写道:
> On Thu, Aug 15 2024 at 19:26, Tianyang Zhang wrote:
>>   .../arch/loongarch/irq-chip-model.rst         |  32 ++
>>   .../zh_CN/arch/loongarch/irq-chip-model.rst   |  32 ++
>>   arch/loongarch/Kconfig                        |   1 +
>>   arch/loongarch/include/asm/cpu-features.h     |   1 +
>>   arch/loongarch/include/asm/cpu.h              |   2 +
>>   arch/loongarch/include/asm/hardirq.h          |   3 +-
>>   arch/loongarch/include/asm/hw_irq.h           |   2 +
>>   arch/loongarch/include/asm/irq.h              |  25 +-
>>   arch/loongarch/include/asm/loongarch.h        |  18 +-
>>   arch/loongarch/include/asm/smp.h              |   2 +
>>   arch/loongarch/kernel/cpu-probe.c             |   3 +-
>>   arch/loongarch/kernel/irq.c                   |  15 +-
>>   arch/loongarch/kernel/paravirt.c              |   5 +
>>   arch/loongarch/kernel/smp.c                   |   6 +
>>   drivers/irqchip/Makefile                      |   2 +-
>>   drivers/irqchip/irq-loongarch-avec.c          | 426 ++++++++++++++++++
>>   drivers/irqchip/irq-loongarch-cpu.c           |   5 +-
>>   drivers/irqchip/irq-loongson-eiointc.c        |   7 +-
>>   drivers/irqchip/irq-loongson-pch-msi.c        |  24 +-
>>   include/linux/cpuhotplug.h                    |   3 +-
> This patch is doing too many things at once and is absolutely not
> reviewable.
>
> Please split it up into the obvious bits and pieces:
>
>     1) The IRQ_NOPROBE change
>
>     2) See below
>
>     3) Documentation
>
>     4) Add the arch/loongson parts, i.e. all the defines and
>        basic required function prototypes with a little twist.
>        Add a Kconfig symbol:
>
> 	Kconfig IRQ_LOONGARCH_AVEC
>          	bool
>
>        in drivers/irqchip/Kconfig. This allows you to add all
>        arch/loongarch/ changes with the simple tweak:
>
>        #ifdef CONFIG_IRQ_LOONGARCH_AVEC
>        # define cpu_has_avecint		cpu_opt(LOONGARCH_CPU_AVECINT)
>        #else
>        # define cpu_has_avecint		false
>        #endif
>        
>        and
>        
>        #ifdef CONFIG_IRQ_LOONGARCH_AVEC
>        # define SMP_CLEAR_VECTOR		BIT(ACTION_CLEAR_VECTOR)
>        #else
>        # define SMP_CLEAR_VECTOR		(0)
>        #endif
>
>        That way the compiler will optimize out stuff like the
>        SMP_CLEAR_VECTOR handling and you only need the prototype of
>        complete_irq_moving(), but no implementation.
>
>     5) Change the CPU hotplug callback for EOINTC and do
>        the acpi_cascade_irqdomain_init() change.
>
>     6) Prepare get_pch_msi_handle() in the pch MSI driver
>
>     7) Implement the driver and select IRQ_LOONGARCH_AVEC
>        from IRQ_LOONGARCH_CPU
>
>     8) Remove the IRQ_LOONGARCH_AVEC helpers
Thanks for your guiding, we will complete the split as required
>
>> diff --git a/arch/loongarch/Kconfig b/arch/loongarch/Kconfig
>> index 70f169210b52..0e3abf7b0bd3 100644
>> --- a/arch/loongarch/Kconfig
>> +++ b/arch/loongarch/Kconfig
>> @@ -85,6 +85,7 @@ config LOONGARCH
>>   	select GENERIC_ENTRY
>>   	select GENERIC_GETTIMEOFDAY
>>   	select GENERIC_IOREMAP if !ARCH_IOREMAP
>> +	select GENERIC_IRQ_MATRIX_ALLOCATOR
> Please move this to IRQ_LOONGARCH_CPU in patch #7
OK, thanks
>
>> @@ -92,15 +103,21 @@ int liointc_acpi_init(struct irq_domain *parent,
>>   					struct acpi_madt_lio_pic *acpi_liointc);
>>   int eiointc_acpi_init(struct irq_domain *parent,
>>   					struct acpi_madt_eio_pic *acpi_eiointc);
>> +int avecintc_acpi_init(struct irq_domain *parent);
>> +
>> +void complete_irq_moving(void);
>>   
>>   int htvec_acpi_init(struct irq_domain *parent,
>>   					struct acpi_madt_ht_pic *acpi_htvec);
>>   int pch_lpc_acpi_init(struct irq_domain *parent,
>>   					struct acpi_madt_lpc_pic *acpi_pchlpc);
>> -int pch_msi_acpi_init(struct irq_domain *parent,
>> -					struct acpi_madt_msi_pic *acpi_pchmsi);
>>   int pch_pic_acpi_init(struct irq_domain *parent,
>>   					struct acpi_madt_bio_pic *acpi_pchpic);
>> +int pch_msi_acpi_init(struct irq_domain *parent,
>> +					struct acpi_madt_msi_pic *acpi_pchmsi);
>> +int pch_msi_acpi_init_v2(struct irq_domain *parent,
>> +					struct acpi_madt_msi_pic *acpi_pchmsi);
> This is really the wrong place for all these prototypes. They are only
> used in drivers/irqchip/... except for complete_irq_moving().
>
> So the proper place for them is drivers/irqchip/irq-loongarch.h
>
> Move them there. This is patch #2 which I referred to above.
Ok ,thanks
>
>> +static phys_addr_t msi_base_addr;
>>
> So you have everything related to the avec chip in loongarch_avec, so
> why don't you move that into that data structure?
Ok, thanks
>
>> It just looks more compatible with the previous one. It really doesn't make sense. We will delete it later+struct avecintc_chip {
>> +	struct fwnode_handle	*fwnode;
>> +	struct irq_domain	*domain;
>> +	struct irq_matrix	*vector_matrix;
>> +	raw_spinlock_t		lock;
>> +};
> The lock should be the first member as spinlocks have alignment
> requirements....
Ok ,thanks
> It just looks more compatible with the previous one. It really doesn't make sense. We will delete it later
>
>> +static int avecintc_domain_alloc(struct irq_domain *domain,
>> +				 unsigned int virq, unsigned int nr_irqs, void *arg)
>> +{
>> +	guard(raw_spinlock_irqsave)(&loongarch_avec.lock);
>> +
>> +	for (unsigned int i = 0; i < nr_irqs; i++) {
>> +		struct irq_data *irqd = irq_domain_get_irq_data(domain, virq + i);
>> +		struct avecintc_data *adata = kzalloc(sizeof(*adata), GFP_KERNEL);
> That was never tested with any debug. You _cannot_ do a GFP_KERNEL
> allocation with the raw spinlock held. And no, don't use
> GFP_ATOMIC. There is absolutely zero reason to hold the lock accross all
> of that. As you got your ideas from x86_vector_alloc_irqs(), you could
> have looked at how that's done correctly.
I got it , thanks
>
>> +		unsigned int cpu, ret;
>> +
>> +		if (!adata)
>> +			return -ENOMEM;
>> +
>> +		ret = irq_matrix_alloc(loongarch_avec.vector_matrix, cpu_online_mask, false, &cpu);
>> +		if (ret < 0) {
>> +			kfree(adata);
>> +			return ret;
>> +		}
>> +
>> +		adata->moving = 0;
> Redundant. The struct is allocated with kzalloc()...
sorry, this is my stupid mistake ..... ,thanks
>
>> +		adata->prev_cpu = adata->cpu = cpu;
>> +		adata->prev_vec = adata->vec = ret;
>> +		adata->managed = irqd_affinity_is_managed(irqd);
> If you want to support managed interrupts, then you cannot allocate
> from the CPU online mask. See x86...
Ok, we will reconsider these functions, thanks
>
>> +		irq_domain_set_info(domain, virq + i, virq + i, &avec_irq_controller,
>> +				    adata, handle_edge_irq, NULL, NULL);
>> +		irqd_set_single_target(irqd);
>> +		irqd_set_affinity_on_activate(irqd);
>> +
>> +		per_cpu_ptr(irq_map, adata->cpu)[adata->vec] = irq_data_to_desc(irqd);
> static int avecintc_alloc_vector(struct avecintc_adata *adata)
> {
>          int ret, cpu;
>
> 	guard(raw_spinlock_irqsave)(&loongarch_avec.lock);
> 	ret = irq_matrix_alloc(loongarch_avec.vector_matrix, cpu_online_mask, false, &cpu);
>          if (ret < 0)
>                return ret;
>
> 	adata->prev_cpu = adata->cpu = cpu;
> 	adata->prev_vec = adata->vec = ret;
>          per_cpu_ptr(irq_map, adata->cpu)[adata->vec] = irq_data_to_desc(irqd);
>          return 0;
> }
>
> static int avecintc_domain_alloc(struct irq_domain *domain, ...)
> {
> 	for (unsigned int i = 0; i < nr_irqs; i++) {
> 		struct irq_data *irqd = irq_domain_get_irq_data(domain, virq + i);
> 		struct avecintc_data *adata = kzalloc(sizeof(*adata), GFP_KERNEL);
>                  int ret;
>
> 		if (!adata)
> 			return -ENOMEM;
>
> 		irq_domain_set_info(domain, virq + i, virq + i, &avec_irq_controller,
> 				    adata, handle_edge_irq, NULL, NULL);
> 		irqd_set_single_target(irqd);
> 		irqd_set_affinity_on_activate(irqd);
>
> 		ret = avecintc_alloc_vector(adata);
>                  if (ret < 0) {
> 			kfree(adata);
>                          return ret;
>                  }
>           }
> No?
OK , It really seems more appropriate, thanks
> It just looks more compatible with the previous one. It really doesn't make sense. We will delete it later
>
>> +static void clear_free_vector(struct irq_data *irqd)
>> +{
>> +	struct avecintc_data *adata = irq_data_get_irq_chip_data(irqd);
>> +	bool managed = irqd_affinity_is_managed(irqd);
> Don't even try. Your managed support is broken at the allocation side
> and at several other places.
OK, thanks
>> +	per_cpu(irq_map, adata->cpu)[adata->vec] = NULL;
>> +	irq_matrix_free(loongarch_avec.vector_matrix, adata->cpu, adata->vec, managed);
>> +	adata->cpu = UINT_MAX;
>> +	adata->vec = UINT_MAX;
>> +
>> +#ifdef CONFIG_SMP
>> +	if (!adata->moving)
>> +		return;
>> +
>> +	per_cpu(irq_map, adata->prev_cpu)[adata->prev_vec] = NULL;
>> +	irq_matrix_free(loongarch_avec.vector_matrix,
>> +			adata->prev_cpu, adata->prev_vec, adata->managed);
>> +	adata->moving = 0;
>> +	adata->prev_vec = UINT_MAX;
>> +	adata->prev_cpu = UINT_MAX;
> What's all the clearing for when you kfree() it two lines further down?
OK, we will reconsider there ,thanks
>
>> +	list_del_init(&adata->entry);
>> +#endif
>> +	kfree(adata);
> And no, not under the lock .... Move the locking into this function and
> kfree() at the call site. There is zero reason to hold the lock over the
> full loop.
Ok ,thanks
>> +static int __init pch_msi_parse_madt(union acpi_subtable_headers *header,
>> +				     const unsigned long end)
>> +{
>> +	struct acpi_madt_msi_pic *pchmsi_entry = (struct acpi_madt_msi_pic *)header;
>> +
>> +	msi_base_addr = pchmsi_entry->msg_address - AVEC_MSG_OFFSET;
>> +
>> +	return pch_msi_acpi_init_v2(loongarch_avec.domain, pchmsi_entry);
>> +}
> ...
>
>> +int __init pch_msi_acpi_init_v2(struct irq_domain *parent, struct acpi_madt_msi_pic *acpi_pchmsi)
> The second argument is required because?
It just looks more compatible with the previous one. It really doesn't 
make sense. We will delete it later, Thanks
>
> Thanks,
>
>          tglx

Thanks again

Tianyang