[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <d3f10fc1-635b-3a5d-a967-2816eac005fc@linux.intel.com>
Date: Fri, 13 Jul 2018 10:21:37 -0700
From: Jae Hyun Yoo <jae.hyun.yoo@...ux.intel.com>
To: Brendan Higgins <brendanhiggins@...gle.com>
Cc: Benjamin Herrenschmidt <benh@...nel.crashing.org>,
Joel Stanley <joel@....id.au>,
Andrew Jeffery <andrew@...id.au>, linux-i2c@...r.kernel.org,
OpenBMC Maillist <openbmc@...ts.ozlabs.org>,
Linux ARM <linux-arm-kernel@...ts.infradead.org>,
linux-aspeed@...ts.ozlabs.org,
Linux Kernel Mailing List <linux-kernel@...r.kernel.org>,
james.feist@...ux.intel.com, vernon.mauery@...ux.intel.com,
Benjamin Fair <benjaminfair@...gle.com>,
Patrick Venture <venture@...gle.com>,
Gary Hsu <gary_hsu@...eedtech.com>,
Ryan Chen <ryan_chen@...eedtech.com>
Subject: Re: [PATCH] i2c: aspeed: Improve driver to support multi-master use
cases stably
On 7/12/2018 11:21 AM, Jae Hyun Yoo wrote:
> On 7/12/2018 2:33 AM, Brendan Higgins wrote:
>> On Wed, Jun 27, 2018 at 10:55 AM Jae Hyun Yoo
>> <jae.hyun.yoo@...ux.intel.com> wrote:
>>>
>> <snip>
>>>>> +/* Timeout for bus busy checking */
>>>>> +#define BUS_BUSY_CHECK_TIMEOUT 250000 /*
>>>>> 250ms */
>>>>> +#define BUS_BUSY_CHECK_INTERVAL
>>>>> 10000 /* 10ms */
>>>>
>>>> Could you add a comment on where you got these values from?
>>>>
>>>
>>> These are coming from ASPEED SDK code. Actually, they use 100ms for
>>> timeout and 10ms for interval but I increased the timeout value to
>>> 250ms so that it covers a various range of bus speed. I think, it
>>> should be computed at run time based on the current bus speed, or
>>> we could add these as device tree settings. How do you think about it?
>>>
>>
>> This should definitely be a device tree setting. If one of the busses
>> is being
>> used as a regular I2C bus, it could hold the bus for an unlimited
>> amount of
>> time before sending a STOP. As for a default, 100ms is probably fine
>> given
>> that, a) the limit will only apply to multi-master mode, and b)
>> multi-master
>> mode will probably almost always be used with IPMB, or MCTP (MCTP
>> actually
>> recommends a 100ms timeout for this purpose, see
>> https://www.dmtf.org/sites/default/files/standards/documents/DSP0237_1.1.0.pdf,
>>
>> symbol PT2a). That being said, if you actually want to implement IPMB,
>> or MCTP
>> arbitration logic, it is much more complicated.
>>
>
> Okay then, I think, we can fix the timeout value to 100ms and enable the
> bus busy checking logic only when 'multi-master' is set in device tree.
> My thought is, no additional arbitration logic is needed because
> arbitration is performed in H/W level and H/W reports
> ASPEED_I2CD_INTR_ARBIT_LOSS when it fails acquiring a bus. The
> ARBIT_LOSS event is already being handled well by this driver code you
> implemented.
>
>>> >
>> <snip>
>>>>> #if IS_ENABLED(CONFIG_I2C_SLAVE)
>>>>> - if (aspeed_i2c_slave_irq(bus)) {
>>>>> - dev_dbg(bus->dev, "irq handled by slave.\n");
>>>>> - return IRQ_HANDLED;
>>>>> + if (bus->master_state != ASPEED_I2C_MASTER_INACTIVE) {
>>>>> + if (!aspeed_i2c_master_irq(bus))
>>>>
>>>> Why do you check the slave if master fails (or vice versa)? I
>>>> understand that there are some status bits that have not been handled,
>>>> but it doesn't seem reasonable to assume that there is state that the
>>>> other should do something with; the only way this would happen is if
>>>> the state that you think you are in does not match the status bits you
>>>> have been given, but if this is the case, you are already hosed; I
>>>> don't think trying the other handler is likely to make things better,
>>>> unless there is something that I am missing.
>>>>
>>>
>>> In most of cases, interrupt bits are set one by one but there are also a
>>> lot of other cases that ASPEED I2C H/W sends multiple interrupt bits
>>> with combining master and slave events using a single interrupt call. It
>>> happens much in multi-master environment than single-master. For
>>> example, when master is waiting for a NORMAL_STOP interrupt in its
>>> MASTER_STOP state, SLAVE_MATCH and RX_DONE interrupts could come along
>>> with the NORMAL_STOP in case of an another master immediately sends data
>>> just after acquiring the bus - it happens a lot in BMC-ME connection
>>> practically. In this case, the NORMAL_STOP interrupt should be handled
>>> by master_irq and the SLAVE_MATCH and RX_DONE interrupts should be
>>> handled by slave_irq so it's the reason why this code is added.
>>
>> That sucks. Well, it sounds like there are only a handful of cases in
>> which
>> this can happen. Maybe enumerate these cases and error out or at least
>> warn if
>> it is not one of them?
>>
>
> Yes, that sucks but that is Aspeed's I2C IP behavior and that's the
> reason why they implemented some combination bits handling code in
> their SDK. Actually, the cases are happening somewhat frequently
> but that would not be a problem if we handle the cases properly instead
> of making error out or warn.
>
>>>
>> <snip>
>>>>> + for (;;) {
>>>>> + if (!(readl(bus->base + ASPEED_I2C_CMD_REG) &
>>>>> + (ASPEED_I2CD_BUS_BUSY_STS |
>>>>> + ASPEED_I2CD_XFER_MODE_STS_MASK)))
>>>>
>>>> Is using the Transfer Mode State Machine bits necessary? The
>>>> documentation marks it as "for debugging purpose only," so relying on
>>>> it makes me nervous.
>>>>
>>>
>>> As you said, the documentation marks it as "for debugging purpose only."
>>> but ASPEED also uses this way in their SDK code because it's the best
>>> way for checking bus busy status which can cover both single and
>>> multi-master use cases.
>>>
>>
>> Well, it would also be really nice to have access to this bit if
>> someone wants
>> to implement MCTP. Could we maybe check with Aspeed what them meant by
>> "for
>> debugging purposes only" and document it here? It makes me nervous to
>> rely on
>> debugging functionality for normal usage.
>>
>
> Okay, I'll check it with Aspeed. Will let you know their response.
>
I've checked it with Gary Hsu <gary_hsu@...eedtech.com> and he confirmed
that the bits reflect real information and good to be used in practical
code.
I'll add a comment like below:
/*
* This is marked as 'for debugging purpose only' in datasheet but
* ASPEED confirmed that this reflects real information and good
* to be used in practical code.
*/
Is it acceptable then?
>>>>> + return 0;
>>>>> + if (ktime_compare(ktime_get(), timeout) > 0)
>>>>> + break;
>>>>> + usleep_range((BUS_BUSY_CHECK_INTERVAL >> 2) + 1,
>>>>
>>>> Where did you get this minimum value?
>>>>
>>>
>>> No source for the minimum value. ASPEED uses mdelay(10) in their SDK
>>> but I changed that code using usleep_range and the range value was set
>>> with considering time stretching of usleep_range.
>>> regmap_read_poll_timeout was a reference for this code.
>>
>> What protocol are you trying to implement on top of this? You
>> mentioned BMC-ME
>> above; that's IPMB, right? For most use cases, this should work, but
>> if you
>> need arbitration, you will need to do quite a bit more work.
>>
>
> Yes, I'm implementing IPMB for a BMC-ME channel. As I said above,
> arbitration will be performed in H/W level and it's already been handled
> well by your code. This bus busy checking logic is for checking whether
> any slave operation is currently ongoing or not at the timing of
> master_xfer is called. It's not for arbitration but for preventing state
> conflicts between master and slave operations.
>
> FYI, I broke down this patch into smaller patches you reviewed
> Today. Thanks for sharing your time for reviewing the patches.
> I'll send remaining patches after completing review on those
> patches because the remaining patches have dependency on them.
>
> Thanks!
>
>>>
>>> Thanks,
>>>
>>> Jae
>> <snip>
>>
>> Cheers
>>
>
Powered by blists - more mailing lists