[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <27865aa8-71b9-4ddc-b3be-312e4ac3814a@linux.ibm.com>
Date: Mon, 2 Jun 2025 09:55:27 +0530
From: Shrikanth Hegde <sshegde@...ux.ibm.com>
To: Yury Norov <yury.norov@...il.com>, Peter Zijlstra <peterz@...radead.org>
Cc: mingo@...hat.com, juri.lelli@...hat.com, vincent.guittot@...aro.org,
tglx@...utronix.de, maddy@...ux.ibm.com, vschneid@...hat.com,
dietmar.eggemann@....com, rostedt@...dmis.org, jstultz@...gle.com,
kprateek.nayak@....com, huschle@...ux.ibm.com, srikar@...ux.ibm.com,
linux-kernel@...r.kernel.org, linux@...musvillemoes.dk
Subject: Re: [RFC PATCH 0/5] sched: cpu parked and push current task mechanism
Hi.
>
> ----------------------------
>
> vCPU - Virtual CPUs - CPU in VM world.
> pCPU - Physical CPUs - CPU in baremetal world.
>
> A hypervisor is managing these vCPUs from different VMs. When a vCPU
> requests for CPU, hypervisor does the job
> of scheduling them on a pCPU.
>
> So this issue occurs when there are more vCPUs(combined across all VMs)
> than the pCPU. So when *all* vCPUs are
> requesting for CPUs, hypervisor can only run a few of them and remaining
> will be preempted(waiting for pCPU).
>
>
> If we take two VM's, When hypervisor preempts vCPU from VM1 to run vCPU
> from VM2, it has to do
> save/restore VM context.Instead if VM's can co-ordinate among each other
> and request for *limited* vCPUs,
> it avoids the above overhead and there is context switching within
> vCPU(less expensive). Even if hypervisor
> is preempting one vCPU to run another withing the same VM, it is still
> more expensive than the task preemption within
> the vCPU. So *basic* aim to avoid vCPU preemption.
>
>
> So to achieve this, use this parking(we need better name for sure)
> concept, where it is better
> if workloads avoid some vCPUs at this moment. (vCPUs stays online, we
> don't want the overhead of sched domain rebuild).
>
>
> contention is dynamic in nature. When there is contention for pCPU is to
> be detected and determined
> by architecture. Archs needs to update the mask regularly.
>
> When there is contention, use limited vCPUs as indicated by arch.
> When there is no contention, use all vCPUs.
>
I hope this helped to set the problem context. I am trying to get feedback if the approach makes sense.
I will go through other push mechanism we have (example in rt/dl).
Powered by blists - more mailing lists