[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <CAOYpmdFYs8=FzpiA-mMNZ=L8B9GRXLfQuEnMeDsvWeeKg2PWtA@mail.gmail.com>
Date: Tue, 11 May 2021 01:06:03 -0700
From: Denis Nikitin <denik@...gle.com>
To: Leo Yan <leo.yan@...aro.org>
Cc: James Clark <james.clark@....com>, coresight@...ts.linaro.org,
Mathieu Poirier <mathieu.poirier@...aro.org>,
Al Grant <al.grant@....com>,
Branislav Rankov <branislav.rankov@....com>,
Denis Nikitin <denik@...omium.org>,
Suzuki Poulose <suzuki.poulose@....com>,
anshuman.khandual@....com, Mike Leach <mike.leach@...aro.org>,
Mark Rutland <mark.rutland@....com>,
Alexander Shishkin <alexander.shishkin@...ux.intel.com>,
Jiri Olsa <jolsa@...hat.com>,
Namhyung Kim <namhyung@...nel.org>,
John Garry <john.garry@...wei.com>,
Will Deacon <will@...nel.org>,
linux-arm-kernel@...ts.infradead.org,
linux-perf-users@...r.kernel.org, linux-kernel@...r.kernel.org
Subject: Re: [RFC PATCH] perf cs-etm: Handle valid-but-zero timestamps
Hi Leo,
> Just remind, as Mike has mentioned that if the timestamp is zero, it
> means the hardware setting for timestamp is not enabled properly. So
> for system wide or per CPU mode tracing, it's better to double check
> what's the reason the timestamp is not enabled properly.
The bug is confirmed by HW verification.
>
> IIUC, this patch breaks the existed rational in the code. Let's think
> about there have 4 CPUs, every CPU has its own AUX trace buffer, and
> when decode the trace data, it will use 4 queues to track the packets
> and every queue has its timestamp.
>
> CPU0: cs_etm_queue -> ... -> packet_queue->timestamp
> CPU1: cs_etm_queue -> ... -> packet_queue->timestamp
> CPU2: cs_etm_queue -> ... -> packet_queue->timestamp
> CPU3: cs_etm_queue -> ... -> packet_queue->timestamp
>
> The issue is if all CPUs' timestamp are zero, it's impossible to find
> a way to synthesize samples in the right time order.
Is it really impossible or it just can lead to incorrect decoding?
I verified the profiles generated with zero timestamps and this patch
on Trogdor (8 CPU cores) and I don't see any significant difference
in the quality of the AutoFDO profiles.
If mixed packets don't cause errors in reconstructing the branches
but instead mess up with their timeline then it probably won't matter
for AutoFDO which just collects statistics of the branches.
What do you think?
>
> [...]
>
> Thanks,
> Leo
Thanks,
Denis
Powered by blists - more mailing lists