linux-kernel - Re: [PATCH v3 15/22] staging: media: tegra-video: tegra20: simplify format align calculations

lists.openwall.net		lists / announce owl-users owl-dev john-users john-dev passwdqc-users yescrypt popa3d-users / oss-security kernel-hardening musl sabotage tlsify passwords / crypt-dev xvendor / Bugtraq Full-Disclosure linux-kernel linux-netdev linux-ext4 linux-hardening linux-cve-announce PHC
Open Source and information security mailing list archives
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <2368735.QZUTf85G27@senjougahara>
Date: Wed, 01 Oct 2025 16:51:39 +0900
From: Mikko Perttunen <mperttunen@...dia.com>
To: Svyatoslav Ryhel <clamor95@...il.com>
Cc: David Airlie <airlied@...il.com>, Simona Vetter <simona@...ll.ch>,
 Maarten Lankhorst <maarten.lankhorst@...ux.intel.com>,
 Maxime Ripard <mripard@...nel.org>, Thomas Zimmermann <tzimmermann@...e.de>,
 Rob Herring <robh@...nel.org>, Krzysztof Kozlowski <krzk+dt@...nel.org>,
 Conor Dooley <conor+dt@...nel.org>,
 Thierry Reding <thierry.reding@...il.com>,
 Jonathan Hunter <jonathanh@...dia.com>,
 Sowjanya Komatineni <skomatineni@...dia.com>,
 Luca Ceresoli <luca.ceresoli@...tlin.com>,
 Prashant Gaikwad <pgaikwad@...dia.com>,
 Michael Turquette <mturquette@...libre.com>, Stephen Boyd <sboyd@...nel.org>,
 Linus Walleij <linus.walleij@...aro.org>,
 Mauro Carvalho Chehab <mchehab@...nel.org>,
 Greg Kroah-Hartman <gregkh@...uxfoundation.org>,
 Jonas Schwöbel <jonasschwoebel@...oo.de>,
 Dmitry Osipenko <digetx@...il.com>,
 Charan Pedumuru <charan.pedumuru@...il.com>,
 Diogo Ivo <diogo.ivo@...nico.ulisboa.pt>,
 Aaron Kling <webgeek1234@...il.com>, Arnd Bergmann <arnd@...db.de>,
 dri-devel@...ts.freedesktop.org, devicetree@...r.kernel.org,
 linux-tegra@...r.kernel.org, linux-kernel@...r.kernel.org,
 linux-media@...r.kernel.org, linux-clk@...r.kernel.org,
 linux-gpio@...r.kernel.org, linux-staging@...ts.linux.dev
Subject:
 Re: [PATCH v3 15/22] staging: media: tegra-video: tegra20: simplify format
 align calculations

On Wednesday, October 1, 2025 2:35 PM Svyatoslav Ryhel wrote:
> ср, 1 жовт. 2025 р. о 08:07 Svyatoslav Ryhel <clamor95@...il.com> пише:
> >
> > ср, 1 жовт. 2025 р. о 07:38 Mikko Perttunen <mperttunen@...dia.com> пише:
> > >
> > > On Friday, September 26, 2025 12:16 AM Svyatoslav Ryhel wrote:
> > > > Simplify format align calculations by slightly modifying supported formats
> > > > structure. Adjusted U and V offset calculations for planar formats since
> > > > YUV420P bits per pixel is 12 (1 full plane for Y + 2 * 1/4 planes for U
> > > > and V) so stride is width * 3/2, but offset must be calculated with plain
> > > > width since each plain has stride width * 1. This aligns with downstream
> > >
> > > plane
> > >
> > > > behavior which uses same approach for offset calculations.
> > > >
> > > > Signed-off-by: Svyatoslav Ryhel <clamor95@...il.com>
> > > > ---
> > > >  drivers/staging/media/tegra-video/tegra20.c | 58 +++++++++------------
> > > >  drivers/staging/media/tegra-video/vi.h      |  3 +-
> > > >  2 files changed, 27 insertions(+), 34 deletions(-)
> > > >
> > > > diff --git a/drivers/staging/media/tegra-video/tegra20.c b/drivers/staging/media/tegra-video/tegra20.c
> > > > index 7c3ff843235d..b7a39723dfc2 100644
> > > > --- a/drivers/staging/media/tegra-video/tegra20.c
> > > > +++ b/drivers/staging/media/tegra-video/tegra20.c
> > > > @@ -280,20 +280,8 @@ static void tegra20_fmt_align(struct v4l2_pix_format *pix, unsigned int bpp)
> > > >       pix->width  = clamp(pix->width,  TEGRA20_MIN_WIDTH,  TEGRA20_MAX_WIDTH);
> > > >       pix->height = clamp(pix->height, TEGRA20_MIN_HEIGHT, TEGRA20_MAX_HEIGHT);
> > > >
> > > > -     switch (pix->pixelformat) {
> > > > -     case V4L2_PIX_FMT_UYVY:
> > > > -     case V4L2_PIX_FMT_VYUY:
> > > > -     case V4L2_PIX_FMT_YUYV:
> > > > -     case V4L2_PIX_FMT_YVYU:
> > > > -             pix->bytesperline = roundup(pix->width, 2) * 2;
> > > > -             pix->sizeimage = roundup(pix->width, 2) * 2 * pix->height;
> > > > -             break;
> > > > -     case V4L2_PIX_FMT_YUV420:
> > > > -     case V4L2_PIX_FMT_YVU420:
> > > > -             pix->bytesperline = roundup(pix->width, 8);
> > > > -             pix->sizeimage = roundup(pix->width, 8) * pix->height * 3 / 2;
> > > > -             break;
> > > > -     }
> > > > +     pix->bytesperline = DIV_ROUND_UP(pix->width * bpp, 8);
> > > > +     pix->sizeimage = pix->bytesperline * pix->height;
> > > >  }
> > > >
> > > >  /*
> > > > @@ -305,6 +293,7 @@ static void tegra20_channel_queue_setup(struct tegra_vi_channel *chan)
> > > >  {
> > > >       unsigned int stride = chan->format.bytesperline;
> > > >       unsigned int height = chan->format.height;
> > > > +     unsigned int width = chan->format.width;
> > > >
> > > >       chan->start_offset = 0;
> > > >
> > > > @@ -321,8 +310,8 @@ static void tegra20_channel_queue_setup(struct tegra_vi_channel *chan)
> > > >
> > > >       case V4L2_PIX_FMT_YUV420:
> > > >       case V4L2_PIX_FMT_YVU420:
> > > > -             chan->addr_offset_u = stride * height;
> > > > -             chan->addr_offset_v = chan->addr_offset_u + stride * height / 4;
> > > > +             chan->addr_offset_u = width * height;
> > > > +             chan->addr_offset_v = chan->addr_offset_u + width * height / 4;
> > > >
> > > >               /* For YVU420, we swap the locations of the U and V planes. */
> > > >               if (chan->format.pixelformat == V4L2_PIX_FMT_YVU420)
> > > > @@ -332,14 +321,14 @@ static void tegra20_channel_queue_setup(struct tegra_vi_channel *chan)
> > > >               chan->start_offset_v = chan->addr_offset_v;
> > > >
> > > >               if (chan->vflip) {
> > > > -                     chan->start_offset   += stride * (height - 1);
> > > > -                     chan->start_offset_u += (stride / 2) * ((height / 2) - 1);
> > > > -                     chan->start_offset_v += (stride / 2) * ((height / 2) - 1);
> > > > +                     chan->start_offset   += width * (height - 1);
> > > > +                     chan->start_offset_u += (width / 2) * ((height / 2) - 1);
> > > > +                     chan->start_offset_v += (width / 2) * ((height / 2) - 1);
> > > >               }
> > > >               if (chan->hflip) {
> > > > -                     chan->start_offset   += stride - 1;
> > > > -                     chan->start_offset_u += (stride / 2) - 1;
> > > > -                     chan->start_offset_v += (stride / 2) - 1;
> > > > +                     chan->start_offset   += width - 1;
> > > > +                     chan->start_offset_u += (width / 2) - 1;
> > > > +                     chan->start_offset_v += (width / 2) - 1;
> > > >               }
> > > >               break;
> > > >       }
> > > > @@ -576,20 +565,23 @@ static const struct tegra_vi_ops tegra20_vi_ops = {
> > > >       .vi_stop_streaming = tegra20_vi_stop_streaming,
> > > >  };
> > > >
> > > > -#define TEGRA20_VIDEO_FMT(MBUS_CODE, BPP, FOURCC)    \
> > > > -{                                                    \
> > > > -     .code    = MEDIA_BUS_FMT_##MBUS_CODE,           \
> > > > -     .bpp     = BPP,                                 \
> > > > -     .fourcc  = V4L2_PIX_FMT_##FOURCC,               \
> > > > +#define TEGRA20_VIDEO_FMT(DATA_TYPE, BIT_WIDTH, MBUS_CODE, BPP, FOURCC)      \
> > > > +{                                                                    \
> > > > +     .img_dt         = TEGRA_IMAGE_DT_##DATA_TYPE,                   \
> > > > +     .bit_width      = BIT_WIDTH,                                    \
> > > > +     .code           = MEDIA_BUS_FMT_##MBUS_CODE,                    \
> > > > +     .bpp            = BPP,                                          \
> > > > +     .fourcc         = V4L2_PIX_FMT_##FOURCC,                        \
> > > >  }
> > > >
> > > >  static const struct tegra_video_format tegra20_video_formats[] = {
> > > > -     TEGRA20_VIDEO_FMT(UYVY8_2X8, 2, UYVY),
> > > > -     TEGRA20_VIDEO_FMT(VYUY8_2X8, 2, VYUY),
> > > > -     TEGRA20_VIDEO_FMT(YUYV8_2X8, 2, YUYV),
> > > > -     TEGRA20_VIDEO_FMT(YVYU8_2X8, 2, YVYU),
> > > > -     TEGRA20_VIDEO_FMT(UYVY8_2X8, 1, YUV420),
> > > > -     TEGRA20_VIDEO_FMT(UYVY8_2X8, 1, YVU420),
> > > > +     /* YUV422 */
> > > > +     TEGRA20_VIDEO_FMT(YUV422_8, 16, UYVY8_2X8, 16, UYVY),
> > > > +     TEGRA20_VIDEO_FMT(YUV422_8, 16, VYUY8_2X8, 16, VYUY),
> > > > +     TEGRA20_VIDEO_FMT(YUV422_8, 16, YUYV8_2X8, 16, YUYV),
> > > > +     TEGRA20_VIDEO_FMT(YUV422_8, 16, YVYU8_2X8, 16, YVYU),
> > > > +     TEGRA20_VIDEO_FMT(YUV422_8, 16, UYVY8_2X8, 12, YUV420),
> > > > +     TEGRA20_VIDEO_FMT(YUV422_8, 16, UYVY8_2X8, 12, YVU420),
> > > >  };
> > >
> > > Looking at the code, BPP seems to only be used for the line stride (i.e. bytes per line) calculation. I think we should just make it 8 for the planar formats (possibly with an explaining comment). With the current code, we end up with 'bytesperline' variables in places not being the actual bytes per line, which is confusing.
> > >
> > > Actually, we can then just make the 'bpp' field be bytes per pixel as it was before to avoid the discrepancy with Tegra210.
> > >
> >
> > No, this code is actually cleaner and in sync with what downstream
> > does, Tegra210 bytes per pixel is confusing since it totally neglects
> > formats with fractional bytes per pixel, it is impossible to set there
> > 3/2 for example, which is used by YUV420.
> >
> > According to downstream code bytes_per_line =
> > soc_mbus_bytes_per_line..., downstream directly name is bytes_per_line
> > and soc_mbus_bytes_per_line returns width * 3 / 2 which is correct
> > calculation (12 bits). Meanwhile for planar formats Tegra has 3
> > different buffers so with offset calculation plain width must be used
> > (which matches downstream).
> >
> 
> If you mean use of BPP by VI, I can propose removing bytesperline and
> sizeimage configuration from VI entirely and leave this to per-SoC
> fmt_align function which does this already anyway and guards every
> time those values are referred. This way there will be no instances
> where "places not being the actual bytes per line" comes true.

Without trying myself, I'm not sure what approach is the cleanest. In any case, the downstream code is just wrong (or incorrectly named), so we shouldn't defer to it in this matter. I don't see a reason to keep the value '12' either if it doesn't serve any purpose (admittedly if we changed it to 8 or 1, 'bpp' would be a confusing name for it, but explainable with a comment and improve-able later) I don't mind having an if/switch statement for the planar formats to use a '8' as multiplier instead of '12' if we need to keep the '12'. But the main thing I want to avoid is a bytesperline/stride variable that isn't the line stride in bytes.

> 
> > > >
> > > >  const struct tegra_vi_soc tegra20_vi_soc = {
> > > > diff --git a/drivers/staging/media/tegra-video/vi.h b/drivers/staging/media/tegra-video/vi.h
> > > > index bfadde8858d4..5cbc0606ed6c 100644
> > > > --- a/drivers/staging/media/tegra-video/vi.h
> > > > +++ b/drivers/staging/media/tegra-video/vi.h
> > > > @@ -281,7 +281,8 @@ enum tegra_image_dt {
> > > >   * @img_dt: MIPI CSI-2 data type (for CSI-2 only)
> > > >   * @bit_width: format width in bits per component (for CSI/Tegra210 only)
> > > >   * @code: media bus format code
> > > > - * @bpp: bytes per pixel (when stored in memory)
> > > > + * @bpp: bytes per pixel (when stored in memory) for Tegra210,
> > > > + *    bits per pixel for Tegra20/Tegra30
> > > >   * @img_fmt: image format (for CSI/Tegra210 only)
> > > >   * @fourcc: V4L2 pixel format FCC identifier
> > > >   */
> > > >