[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-Id: <20210625141143.577998-1-benjamin.gaignard@collabora.com>
Date: Fri, 25 Jun 2021 16:11:34 +0200
From: Benjamin Gaignard <benjamin.gaignard@...labora.com>
To: hverkuil@...all.nl, ezequiel@...labora.com, p.zabel@...gutronix.de,
mchehab@...nel.org, shawnguo@...nel.org, s.hauer@...gutronix.de,
festevam@...il.com, gregkh@...uxfoundation.org, mripard@...nel.org,
paul.kocialkowski@...tlin.com, wens@...e.org,
jernej.skrabec@...l.net, emil.l.velikov@...il.com,
andrzej.p@...labora.com, jc@...esim.co.uk,
jernej.skrabec@...il.com, nicolas@...fresne.ca, cphealy@...il.com
Cc: kernel@...gutronix.de, linux-imx@....com,
linux-media@...r.kernel.org, linux-rockchip@...ts.infradead.org,
linux-arm-kernel@...ts.infradead.org, linux-kernel@...r.kernel.org,
Benjamin Gaignard <benjamin.gaignard@...labora.com>
Subject: [PATCH v4 0/9] Additional features for Hantro HEVC
version 4:
- Log file descriptor in trace point.
- Add documentation about how use the trace points in Hantro driver.
- Fix typos.
- Make sure that 10 bits output format (i.e. P010) is only enumerated
when encoded input stream is 10 bits.
- Create ops structure for variant to store their specific functions.
- Rename scaling ops to enumare_framesizes
Run fluster on this version: the results are the same 70 over 147 streams OK.
Test streams with embedded scaling list are now decoded and the output
is correct for our eyes but the values of the pixels (y, u and V) are
different from the values of the reference (decoded for FFMPEG) so the
stream of still KO. When decoded with GStreamer or vendor stack the
pixels values are the same but not matching with FFMPEG.
Compressed help to increase the performance of the whole stack, it could
be up to 50 fps (against 45 fps) on IMX8MQ.
The impact of compressed frames is confirmed when using perf to monitor
the number of memory accesses with or without compression feature.
The following command
perf stat -a -e imx8_ddr0/cycles/,imx8_ddr0/read-cycles/,imx8_ddr0/write-cycles/ gst-launch-1.0 filesrc location=Jockey_3840x2160_120fps_420_8bit_HEVC_RAW.hevc ! queue ! h265parse ! v4l2slh265dec ! fakesink
give us these results
without compression feature:
Performance counter stats for 'system wide':
14965816743 imx8_ddr0/cycles/
889197312 imx8_ddr0/read-cycles/
1819348862 imx8_ddr0/write-cycles/
18.707373001 seconds time elapsed
with compression feature:
Performance counter stats for 'system wide':
13750218243 imx8_ddr0/cycles/
402428744 imx8_ddr0/read-cycles/
1255676693 imx8_ddr0/write-cycles/
17.188320061 seconds time elapsed
As expected the number of read/write cycles are really lower when compression
is used.
Thanks to Chris for perf command line :-)
version 3:
- Change trace file name to hantro_trace.h
version 2:
- Fix structure name in ext-ctrls-codec.rst
- Define the value for compression storage size
- Add comments about registers usage
- Add documentation about P010 padding
Basic HEVC support has been added to Hantro driver in this pull request:
https://www.spinics.net/lists/linux-media/msg193744.html
Thanks to that it is now possible to support more features for this driver.
The first patch allow to log the hardware performance per macroblock.
The second patch makes the driver use compressed reference frames to
reduce memory bandwidth consumption.
Patches 3 to 5 allow to decode and produce 10-bits P010 frames.
Patch 6 make usage of G2 post processor to scale down the frames.
Patches 7 and 8 add the support of HEVC scaling matrix by adding a new
control.
All these patches enhance the HEVC support for Hantro (G2) hardware.
Unluckily they almost all touch the same pieces of code, where buffer
size, offset and addresses are set, so they have to be in this order.
They depend of the series pushed in this pull request:
https://www.spinics.net/lists/linux-media/msg193744.html
Benjamin
Benjamin Gaignard (9):
media: hantro: Trace hevc hw cycles performance register
media: hantro: Add support of compressed reference buffers
media: hantro: hevc: Allow 10-bits encoded streams
media: Add P010 video format
media: hantro: hevc: Allow to produce 10-bit frames
media: hantro: create ops for variants
media: hantro: enumerate scaled output formats
media: hevc: Add scaling matrix control
media: hantro: Add scaling lists feature
Documentation/admin-guide/media/hantro.rst | 14 ++
.../admin-guide/media/v4l-drivers.rst | 1 +
.../media/v4l/ext-ctrls-codec.rst | 45 +++++
.../media/v4l/pixfmt-yuv-planar.rst | 78 +++++++-
.../media/v4l/vidioc-queryctrl.rst | 6 +
drivers/media/v4l2-core/v4l2-common.c | 1 +
drivers/media/v4l2-core/v4l2-ctrls-core.c | 6 +
drivers/media/v4l2-core/v4l2-ctrls-defs.c | 4 +
drivers/media/v4l2-core/v4l2-ioctl.c | 1 +
drivers/staging/media/hantro/hantro.h | 21 +-
drivers/staging/media/hantro/hantro_drv.c | 22 ++-
.../staging/media/hantro/hantro_g2_hevc_dec.c | 186 ++++++++++++++++--
drivers/staging/media/hantro/hantro_g2_regs.h | 12 ++
drivers/staging/media/hantro/hantro_hevc.c | 69 ++++++-
drivers/staging/media/hantro/hantro_hw.h | 8 +
drivers/staging/media/hantro/hantro_trace.h | 41 ++++
drivers/staging/media/hantro/hantro_v4l2.c | 41 +++-
drivers/staging/media/hantro/imx8m_vpu_hw.c | 22 ++-
.../staging/media/hantro/rockchip_vpu_hw.c | 30 ++-
.../staging/media/hantro/sama5d4_vdec_hw.c | 6 +-
include/media/hevc-ctrls.h | 11 ++
include/uapi/linux/videodev2.h | 1 +
22 files changed, 580 insertions(+), 46 deletions(-)
create mode 100644 Documentation/admin-guide/media/hantro.rst
create mode 100644 drivers/staging/media/hantro/hantro_trace.h
--
2.25.1
Powered by blists - more mailing lists