lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <CAE9FiQX8XXkwbhhuNhCF-pHdc_3or_dMEH-VN5NBSEV_eXE-Pg@mail.gmail.com>
Date:	Thu, 12 Dec 2013 22:26:58 -0800
From:	Yinghai Lu <yinghai@...nel.org>
To:	Bjorn Helgaas <bhelgaas@...gle.com>,
	Rajat Jain <rajatjain.linux@...il.com>
Cc:	"linux-pci@...r.kernel.org" <linux-pci@...r.kernel.org>,
	Linux Kernel Mailing List <linux-kernel@...r.kernel.org>,
	Kenji Kaneshige <kaneshige.kenji@...fujitsu.com>,
	Alex Williamson <alex.williamson@...hat.com>,
	Yijing Wang <wangyijing@...wei.com>,
	Paul Bolle <pebolle@...cali.nl>,
	Rajat Jain <rajatjain@...iper.net>,
	Rajat Jain <rajatxjain@...il.com>,
	Guenter Roeck <groeck@...iper.net>,
	Guenter Roeck <linux@...ck-us.net>
Subject: Re: [PATCH v2 2/4] pciehp: Use link change notifications for hot-plug
 and removal

On Thu, Dec 12, 2013 at 2:44 PM, Bjorn Helgaas <bhelgaas@...gle.com> wrote:
> [+cc Yinghai]
>
> On Tue, Dec 03, 2013 at 02:32:46PM -0800, Rajat Jain wrote:
>> A lot of systems do not have the fancy buttons and LEDs, and instead
>> want to rely only on the Link state change events to drive the hotplug
>> and removal state machinery.
>> (http://www.spinics.net/lists/hotplug/msg05802.html)

"A lot of systems", are you kidding? You count it? Rajat.

>>
>> This patch adds support for that functionality. Here are the details
>> about the patch itself:
>>
>> * Define and use interrupt events for linkup / linkdown.
>
> This seems like a reasonable idea.
>
> In the ExpressCard Standard (Rel 2.0, Feb 2009), Section 6.3.1 and Figure
> 6-2 talk about a "Link Detect" or "Link Train Detected" interrupt being
> used to trigger the ACPI hotplug flow for a non-PCIe-aware OS.  I'm not
> sure what interrupt this refers to.  The Slot Control Data Link Layer State
> Changed interrupt (which you're using) sounds similar, but my guess is that
> "Link Detect" is a generic term for chipset-specific functionality to
> generate an SMI for hotplug events.
>
> But then Section 6.3.1.1 suggests that a PCIe-aware OS would use "Presence
> Detect Changed" instead.  I don't know why a different mechanism would be
> suggested, although DLLSC was added after PCIe 1.0, so older hardware
> wouldn't have a PCIe-standard link detection mechanism.
>
> In any event, I think having the Link Status Data Link Layer Link Active
> bit set would certainly imply that a device is present, so it seems
> reasonable to use DLLLA in addition to Presence Detect State.

Yes, if Attention attention is not there and Surprise removal is supported.

>
>> * Introduce the functions to handle link-up and link-down events and
>>   queue the work in the slot->wq to be processed by pciehp_power_thread
>>
>> * The current code bails out from device removal, if an adapter is not
>>   detected. That is not right, because if an adapter is not present at
>>   all, it provides all the more reason to REMOVE the device. This is
>>   specially a problem for link state hot-plug, because some ports use
>>   in band mechanism to detect the presence detection. Thus when link
>>   goes down, presence detect also goes down. We _want_ that the devices
>>   should be removed in this case.
>>
>> * The current pciehp driver disabled the link in case of a hot-removal.
>>   Since for link change based hot-plug to work, we need that enabled,
>>   hence make sure to not disable the link permanently if link state
>>   based hot-plug is to be used. Also have to remove
>>   pciehp_link_disable() and pcie_wait_link_not_active() static functions
>>   since they are not being used anywhere else.
>>
>> * pciehp_reset_slot - reset of secondary bus may cause undesirable
>>   spurious link notifications. Thus disable these around the secondary
>>   bus reset.
>>
>> Signed-off-by: Rajat Jain <rajatjain@...iper.net>
>> Signed-off-by: Guenter Roeck <groeck@...iper.net>
>> ---
>>  v2: - drop the use_link_state_events module parameter as discussed here:
>>        http://marc.info/?t=138513966800006&r=1&w=2
>>      - removed the static functions left unused after this patch.
>>      - make the pciehp_handle_linkstate_change() return void.
>>      - dropped the "RFC" from subject, and added Guenter's signature
>>
>>  drivers/pci/hotplug/pciehp.h      |    3 +
>>  drivers/pci/hotplug/pciehp_ctrl.c |  130 ++++++++++++++++++++++++++++++++++---
>>  drivers/pci/hotplug/pciehp_hpc.c  |   56 ++++++++--------
>>  3 files changed, 150 insertions(+), 39 deletions(-)
>>
>> diff --git a/drivers/pci/hotplug/pciehp.h b/drivers/pci/hotplug/pciehp.h
>> index fc322ed..353edda 100644
>> --- a/drivers/pci/hotplug/pciehp.h
>> +++ b/drivers/pci/hotplug/pciehp.h
>> @@ -110,6 +110,8 @@ struct controller {
>>  #define INT_BUTTON_PRESS             7
>>  #define INT_BUTTON_RELEASE           8
>>  #define INT_BUTTON_CANCEL            9
>> +#define INT_LINK_UP                  10
>> +#define INT_LINK_DOWN                        11
>>
>>  #define STATIC_STATE                 0
>>  #define BLINKINGON_STATE             1
>> @@ -133,6 +135,7 @@ u8 pciehp_handle_attention_button(struct slot *p_slot);
>>  u8 pciehp_handle_switch_change(struct slot *p_slot);
>>  u8 pciehp_handle_presence_change(struct slot *p_slot);
>>  u8 pciehp_handle_power_fault(struct slot *p_slot);
>> +void pciehp_handle_linkstate_change(struct slot *p_slot);
>>  int pciehp_configure_device(struct slot *p_slot);
>>  int pciehp_unconfigure_device(struct slot *p_slot);
>>  void pciehp_queue_pushbutton_work(struct work_struct *work);
>> diff --git a/drivers/pci/hotplug/pciehp_ctrl.c b/drivers/pci/hotplug/pciehp_ctrl.c
>> index 38f0186..4c2544c 100644
>> --- a/drivers/pci/hotplug/pciehp_ctrl.c
>> +++ b/drivers/pci/hotplug/pciehp_ctrl.c
>> @@ -150,6 +150,27 @@ u8 pciehp_handle_power_fault(struct slot *p_slot)
>>       return 1;
>>  }
>>
>> +void pciehp_handle_linkstate_change(struct slot *p_slot)
>> +{
>> +     u32 event_type;
>> +     struct controller *ctrl = p_slot->ctrl;
>> +
>> +     /* Link Status Change */
>> +     ctrl_dbg(ctrl, "Data Link Layer State change\n");
>> +
>> +     if (pciehp_check_link_active(ctrl)) {
>> +             ctrl_info(ctrl, "slot(%s): Link Up event\n",
>> +                       slot_name(p_slot));
>> +             event_type = INT_LINK_UP;
>> +     } else {
>> +             ctrl_info(ctrl, "slot(%s): Link Down event\n",
>> +                       slot_name(p_slot));
>> +             event_type = INT_LINK_DOWN;
>> +     }
>> +
>> +     queue_interrupt_event(p_slot, event_type);
>> +}
>> +
>>  /* The following routines constitute the bulk of the
>>     hotplug controller logic
>>   */
>> @@ -442,6 +463,100 @@ static void handle_surprise_event(struct slot *p_slot)
>>       queue_work(p_slot->wq, &info->work);
>>  }
>>
>> +/*
>> + * Note: This function must be called with slot->lock held
>> + */
>> +static void handle_link_up_event(struct slot *p_slot)
>> +{
>> +     struct controller *ctrl = p_slot->ctrl;
>> +     struct power_work_info *info;
>> +
>> +     info = kmalloc(sizeof(*info), GFP_KERNEL);
>> +     if (!info) {
>> +             ctrl_err(p_slot->ctrl, "%s: Cannot allocate memory\n",
>> +                      __func__);
>> +             return;
>> +     }
>> +     info->p_slot = p_slot;
>> +     INIT_WORK(&info->work, pciehp_power_thread);
>> +
>> +     switch (p_slot->state) {
>> +     case BLINKINGON_STATE:
>> +     case BLINKINGOFF_STATE:
>> +             cancel_delayed_work(&p_slot->work);
>> +             /* Fall through */
>> +     case STATIC_STATE:
>> +             p_slot->state = POWERON_STATE;
>> +             queue_work(p_slot->wq, &info->work);
>> +             break;
>> +     case POWERON_STATE:
>> +             ctrl_info(ctrl,
>> +                       "Link Up event ignored on slot(%s): already powering on\n",
>> +                       slot_name(p_slot));
>> +             kfree(info);
>> +             break;
>> +     case POWEROFF_STATE:
>> +             ctrl_info(ctrl,
>> +                       "Link Up event queued on slot(%s): currently getting powered off\n",
>> +                       slot_name(p_slot));
>> +             p_slot->state = POWERON_STATE;
>> +             queue_work(p_slot->wq, &info->work);
>> +             break;
>> +     default:
>> +             ctrl_err(ctrl, "Not a valid state on slot(%s)\n",
>> +                      slot_name(p_slot));
>> +             kfree(info);
>> +             break;
>> +     }
>> +}
>> +
>> +/*
>> + * Note: This function must be called with slot->lock held
>> + */
>> +static void handle_link_down_event(struct slot *p_slot)
>> +{
>> +     struct controller *ctrl = p_slot->ctrl;
>> +     struct power_work_info *info;
>> +
>> +     info = kmalloc(sizeof(*info), GFP_KERNEL);
>> +     if (!info) {
>> +             ctrl_err(p_slot->ctrl, "%s: Cannot allocate memory\n",
>> +                      __func__);
>> +             return;
>> +     }
>> +     info->p_slot = p_slot;
>> +     INIT_WORK(&info->work, pciehp_power_thread);
>> +
>> +     switch (p_slot->state) {
>> +     case BLINKINGON_STATE:
>> +     case BLINKINGOFF_STATE:
>> +             cancel_delayed_work(&p_slot->work);
>> +             /* Fall through */
>> +     case STATIC_STATE:
>> +             p_slot->state = POWEROFF_STATE;
>> +             queue_work(p_slot->wq, &info->work);
>> +             break;
>> +     case POWEROFF_STATE:
>> +             ctrl_info(ctrl,
>> +                       "Link Down event ignored on slot(%s): already powering off\n",
>> +                       slot_name(p_slot));
>> +             kfree(info);
>> +             break;
>> +     case POWERON_STATE:
>> +             ctrl_info(ctrl,
>> +                       "Link Down event queued on slot(%s): currently getting powered on\n",
>> +                       slot_name(p_slot));
>> +             p_slot->state = POWEROFF_STATE;
>> +             queue_work(p_slot->wq, &info->work);
>> +             break;
>> +     default:
>> +             ctrl_err(ctrl, "Not a valid state on slot %s\n",
>> +                      slot_name(p_slot));
>> +             kfree(info);
>> +             break;
>> +     }
>> +}
>> +
>>  static void interrupt_event_handler(struct work_struct *work)
>>  {
>>       struct event_info *info = container_of(work, struct event_info, work);
>> @@ -468,6 +583,12 @@ static void interrupt_event_handler(struct work_struct *work)
>>               ctrl_dbg(ctrl, "Surprise Removal\n");
>>               handle_surprise_event(p_slot);
>>               break;
>> +     case INT_LINK_UP:
>> +             handle_link_up_event(p_slot);
>> +             break;
>> +     case INT_LINK_DOWN:
>> +             handle_link_down_event(p_slot);
>> +             break;
>>       default:
>>               break;
>>       }
>> @@ -524,15 +645,6 @@ int pciehp_disable_slot(struct slot *p_slot)
>>       if (!p_slot->ctrl)
>>               return 1;
>>
>> -     if (!HP_SUPR_RM(p_slot->ctrl)) {
>> -             ret = pciehp_get_adapter_status(p_slot, &getstatus);
>> -             if (ret || !getstatus) {
>> -                     ctrl_info(ctrl, "No adapter on slot(%s)\n",
>> -                               slot_name(p_slot));
>> -                     return -ENODEV;
>> -             }
>> -     }
>> -
>
> This seems like a candidate for splitting into a separate patch.
>
>>       if (MRL_SENS(p_slot->ctrl)) {
>>               ret = pciehp_get_latch_status(p_slot, &getstatus);
>>               if (ret || getstatus) {
>> diff --git a/drivers/pci/hotplug/pciehp_hpc.c b/drivers/pci/hotplug/pciehp_hpc.c
>> index 3a5eee7..1f152f3 100644
>> --- a/drivers/pci/hotplug/pciehp_hpc.c
>> +++ b/drivers/pci/hotplug/pciehp_hpc.c
>> @@ -278,11 +278,6 @@ static void pcie_wait_link_active(struct controller *ctrl)
>>       __pcie_wait_link_active(ctrl, true);
>>  }
>>
>> -static void pcie_wait_link_not_active(struct controller *ctrl)
>> -{
>> -     __pcie_wait_link_active(ctrl, false);
>> -}
>> -
>>  static bool pci_bus_check_dev(struct pci_bus *bus, int devfn)
>>  {
>>       u32 l;
>> @@ -383,11 +378,6 @@ static int pciehp_link_enable(struct controller *ctrl)
>>       return __pciehp_link_set(ctrl, true);
>>  }
>>
>> -static int pciehp_link_disable(struct controller *ctrl)
>> -{
>> -     return __pciehp_link_set(ctrl, false);
>> -}
>> -
>>  int pciehp_get_attention_status(struct slot *slot, u8 *status)
>>  {
>>       struct controller *ctrl = slot->ctrl;
>> @@ -620,13 +610,10 @@ int pciehp_power_off_slot(struct slot * slot)
>>       u16 cmd_mask;
>>       int retval;
>>
>> -     /* Disable the link at first */
>> -     pciehp_link_disable(ctrl);
>> -     /* wait the link is down */
>> -     if (ctrl->link_active_reporting)
>> -             pcie_wait_link_not_active(ctrl);
>> -     else
>> -             msleep(1000);
>
> 2debd9289997 ("PCI: pciehp: Disable/enable link during slot power off/on")
> added this code that disables the link to fix problems.  So we need to make
> sure we don't reintroduce those problems by leaving the link enabled.
>
> One problem was spurious "card present/not present" messages.  I suspect
> that topology has an attention button, and I'm not sure it makes sense to
> enable the presence detect interrupt in that case.  As far as I can tell,
> if there's an attention button, the OS should not do anything on card
> insertion or removal; it should only do something when the attention button
> is pressed.  So maybe we can deal with the message issue that way.

Agreed.

>
> The changelog also hints that disabling the link might be needed to allow a
> repeater to be reset, but I don't know the details.  I don't remember any
> spec language that requires the link to be disabled on hotplug; maybe this
> is a platform-specific quirk.

yes, that is workaround to to reset repeater in between.

Also we can get rid of annoying aer during power off slot.

Also at least other os will turn on link after turn on the power,
as hotplug is working if BIOS have link disabled when boot system
without card inserted.

>
> This patch is pretty large and if it could be split up a bit, especially
> this particular part, it might help Yinghai verify that his system keeps
> working.

I suggest that link event handling will be enabled automatically when
attention button is not there and surprise removal is supported.
And when that enabled, should disable Present event handling.

Thanks

Yinghai
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@...r.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ