lists.openwall.net   lists  /  announce  owl-users  owl-dev  john-users  john-dev  passwdqc-users  yescrypt  popa3d-users  /  oss-security  kernel-hardening  musl  sabotage  tlsify  passwords  /  crypt-dev  xvendor  /  Bugtraq  Full-Disclosure  linux-kernel  linux-netdev  linux-ext4  linux-hardening  linux-cve-announce  PHC 
Open Source and information security mailing list archives
 
Hash Suite: Windows password security audit tool. GUI, reports in PDF.
[<prev] [next>] [thread-next>] [day] [month] [year] [list]
Message-Id: <20201113093720.21106-1-will@kernel.org>
Date:   Fri, 13 Nov 2020 09:37:05 +0000
From:   Will Deacon <will@...nel.org>
To:     linux-arm-kernel@...ts.infradead.org
Cc:     linux-arch@...r.kernel.org, linux-kernel@...r.kernel.org,
        Will Deacon <will@...nel.org>,
        Catalin Marinas <catalin.marinas@....com>,
        Marc Zyngier <maz@...nel.org>,
        Greg Kroah-Hartman <gregkh@...uxfoundation.org>,
        Peter Zijlstra <peterz@...radead.org>,
        Morten Rasmussen <morten.rasmussen@....com>,
        Qais Yousef <qais.yousef@....com>,
        Suren Baghdasaryan <surenb@...gle.com>,
        Quentin Perret <qperret@...gle.com>, Tejun Heo <tj@...nel.org>,
        Li Zefan <lizefan@...wei.com>,
        Johannes Weiner <hannes@...xchg.org>,
        Ingo Molnar <mingo@...hat.com>,
        Juri Lelli <juri.lelli@...hat.com>,
        Vincent Guittot <vincent.guittot@...aro.org>,
        kernel-team@...roid.com
Subject: [PATCH v3 00/14] An alternative series for asymmetric AArch32 systems

Hi again everyone,

	I'm not a scheduler hacker,
	I'm a scheduler hacker's mate.
	I'm only hacking the scheduler,
	'cos trying to run 32-bit applications on systems where not all of the CPUs support it is GREAT.

It's Friday 13th, and I'm back with version three of the increasingly
popular patches I previously posted here:

  v1: https://lore.kernel.org/r/20201027215118.27003-1-will@kernel.org
  v2: https://lore.kernel.org/r/20201109213023.15092-1-will@kernel.org

and which started life as a reimplementation of some patches from Qais:

  https://lore.kernel.org/r/20201021104611.2744565-1-qais.yousef@arm.com

The aim of this series is to allow 32-bit ARM applications to run on
arm64 SoCs where not all of the CPUs support the 32-bit instruction set.

There are some major changes in v3:

  * Add some scheduler hooks for restricting a task's affinity mask
  * Implement these hooks for arm64 so that we can avoid 32-bit tasks
    running on 64-bit-only cores
  * Restrict affinity mask of 32-bit tasks on execve()
  * Prevent hot-unplug of all 32-bit CPUs if we have a mismatched system
  * Ensure 32-bit EL0 cpumask is zero-initialised (oops)

It's worth mentioning that this approach goes directly against my
initial proposal for punting the affinity management to userspace,
because it turns out that doesn't really work. There are cases where the
kernel has to muck with the affinity mask explicitly, such as execve(),
CPU hotplug and cpuset balancing. Ensuring that these don't lead to
random SIGKILLs as far as userspace is concerned means avoiding any
64-bit-only CPUs appearing in the affinity mask for a 32-bit task, at
which point it's easier just to handle everything in the kernel anyway.

- Patches 1-6 hack the arm64 CPU feature code to allow 32-bit tasks to
  run on a mismatched system, but forcing SIGKILL if a task ends up on
  the wrong CPU. This is gated on a command-line option; without it, a
  mismatched system will be treated as 64-bit-only.

- Patches 7-11 add scheduler functionality necessary to constrain the
  CPU affinity mask on a per-task basis and hook this up for execve() on
  arm64.

- Patches 12-14 finish off the arm64 plumbing and remove the logic for
  killing misplaced tasks, as it adds overhead to the context-switch and
  ret-to-user paths.

This seems to do the right thing in my contrived QEMU environment, but
as I say, I'm not a scheduler hacker so I'm open to alternative ideas.

Cheers,

Will

Cc: Catalin Marinas <catalin.marinas@....com>
Cc: Marc Zyngier <maz@...nel.org>
Cc: Greg Kroah-Hartman <gregkh@...uxfoundation.org>
Cc: Peter Zijlstra <peterz@...radead.org>
Cc: Morten Rasmussen <morten.rasmussen@....com>
Cc: Qais Yousef <qais.yousef@....com>
Cc: Suren Baghdasaryan <surenb@...gle.com>
Cc: Quentin Perret <qperret@...gle.com>
Cc: Tejun Heo <tj@...nel.org>
Cc: Li Zefan <lizefan@...wei.com>
Cc: Johannes Weiner <hannes@...xchg.org>
Cc: Ingo Molnar <mingo@...hat.com>
Cc: Juri Lelli <juri.lelli@...hat.com>
Cc: Vincent Guittot <vincent.guittot@...aro.org>
Cc: kernel-team@...roid.com

--->8

Will Deacon (14):
  arm64: cpuinfo: Split AArch32 registers out into a separate struct
  arm64: Allow mismatched 32-bit EL0 support
  KVM: arm64: Kill 32-bit vCPUs on systems with mismatched EL0 support
  arm64: Kill 32-bit applications scheduled on 64-bit-only CPUs
  arm64: Advertise CPUs capable of running 32-bit applications in sysfs
  arm64: Hook up cmdline parameter to allow mismatched 32-bit EL0
  sched: Introduce restrict_cpus_allowed_ptr() to limit task CPU
    affinity
  arm64: exec: Adjust affinity for compat tasks with mismatched 32-bit
    EL0
  cpuset: Don't use the cpu_possible_mask as a last resort for cgroup v1
  sched: Introduce arch_cpu_allowed_mask() to limit fallback rq
    selection
  sched: Reject CPU affinity changes based on arch_cpu_allowed_mask()
  arm64: Prevent offlining first CPU with 32-bit EL0 on mismatched
    system
  arm64: Implement arch_cpu_allowed_mask()
  arm64: Remove logic to kill 32-bit tasks on 64-bit-only cores

 .../ABI/testing/sysfs-devices-system-cpu      |   9 +
 .../admin-guide/kernel-parameters.txt         |   7 +
 arch/arm64/include/asm/cpu.h                  |  44 ++--
 arch/arm64/include/asm/cpucaps.h              |   2 +-
 arch/arm64/include/asm/cpufeature.h           |   8 +-
 arch/arm64/include/asm/mmu_context.h          |  12 +
 arch/arm64/kernel/cpufeature.c                | 219 ++++++++++++++----
 arch/arm64/kernel/cpuinfo.c                   |  53 +++--
 arch/arm64/kernel/process.c                   |  17 +-
 arch/arm64/kvm/arm.c                          |  11 +-
 include/linux/sched.h                         |   1 +
 kernel/cgroup/cpuset.c                        |   6 +-
 kernel/sched/core.c                           |  90 +++++--
 13 files changed, 370 insertions(+), 109 deletions(-)

-- 
2.29.2.299.gdc1121823c-goog

Powered by blists - more mailing lists

Powered by Openwall GNU/*/Linux Powered by OpenVZ