sched/topology: Remove the EM_MAX_COMPLEXITY limit

author Pierre Gondois <Pierre.Gondois@arm.com>

Mon, 9 Oct 2023 06:00:36 +0000 (11:30 +0530)

committer Ingo Molnar <mingo@kernel.org>

Mon, 9 Oct 2023 11:07:27 +0000 (13:07 +0200)
author Pierre Gondois <Pierre.Gondois@arm.com>
Mon, 9 Oct 2023 06:00:36 +0000 (11:30 +0530)
committer Ingo Molnar <mingo@kernel.org>
Mon, 9 Oct 2023 11:07:27 +0000 (13:07 +0200)
diff --git a/Documentation/scheduler/sched-energy.rst b/Documentation/scheduler/sched-energy.rst

index fc853c8cc346df35b90ee90dbd0501b6ac80cfd8..70e2921ef725dc89919df78d269aa246cd18f114 100644 (file)
--- a/Documentation/scheduler/sched-energy.rst
+++ b/Documentation/scheduler/sched-energy.rst
@@ -359,32 +359,9 @@ in milli-Watts or in an 'abstract scale'.
  6.3 - Energy Model complexity
  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  
-The task wake-up path is very latency-sensitive. When the EM of a platform is
-too complex (too many CPUs, too many performance domains, too many performance
-states, ...), the cost of using it in the wake-up path can become prohibitive.
-The energy-aware wake-up algorithm has a complexity of:
-
-       C = Nd * (Nc + Ns)
-
-with: Nd the number of performance domains; Nc the number of CPUs; and Ns the
-total number of OPPs (ex: for two perf. domains with 4 OPPs each, Ns = 8).
-
-A complexity check is performed at the root domain level, when scheduling
-domains are built. EAS will not start on a root domain if its C happens to be
-higher than the completely arbitrary EM_MAX_COMPLEXITY threshold (2048 at the
-time of writing).
-
-If you really want to use EAS but the complexity of your platform's Energy
-Model is too high to be used with a single root domain, you're left with only
-two possible options:
-
-    1. split your system into separate, smaller, root domains using exclusive
-       cpusets and enable EAS locally on each of them. This option has the
-       benefit to work out of the box but the drawback of preventing load
-       balance between root domains, which can result in an unbalanced system
-       overall;
-    2. submit patches to reduce the complexity of the EAS wake-up algorithm,
-       hence enabling it to cope with larger EMs in reasonable time.
+EAS does not impose any complexity limit on the number of PDs/OPPs/CPUs but
+restricts the number of CPUs to EM_MAX_NUM_CPUS to prevent overflows during
+the energy estimation.
  
  
  6.4 - Schedutil governor
diff --git a/kernel/sched/topology.c b/kernel/sched/topology.c

index 1cc595907363bd0966e6270ab86a6fca3a0b20d3..fcda3f066eec5c42143574a6f27cb54b5713a316 100644 (file)
--- a/kernel/sched/topology.c
+++ b/kernel/sched/topology.c
@@ -348,32 +348,13 @@ static void sched_energy_set(bool has_eas)
   *    1. an Energy Model (EM) is available;
   *    2. the SD_ASYM_CPUCAPACITY flag is set in the sched_domain hierarchy.
   *    3. no SMT is detected.
- *    4. the EM complexity is low enough to keep scheduling overheads low;
- *    5. schedutil is driving the frequency of all CPUs of the rd;
- *    6. frequency invariance support is present;
- *
- * The complexity of the Energy Model is defined as:
- *
- *              C = nr_pd * (nr_cpus + nr_ps)
- *
- * with parameters defined as:
- *  - nr_pd:    the number of performance domains
- *  - nr_cpus:  the number of CPUs
- *  - nr_ps:    the sum of the number of performance states of all performance
- *              domains (for example, on a system with 2 performance domains,
- *              with 10 performance states each, nr_ps = 2 * 10 = 20).
- *
- * It is generally not a good idea to use such a model in the wake-up path on
- * very complex platforms because of the associated scheduling overheads. The
- * arbitrary constraint below prevents that. It makes EAS usable up to 16 CPUs
- * with per-CPU DVFS and less than 8 performance states each, for example.
+ *    4. schedutil is driving the frequency of all CPUs of the rd;
+ *    5. frequency invariance support is present;
   */
-#define EM_MAX_COMPLEXITY 2048
-
  extern struct cpufreq_governor schedutil_gov;
  static bool build_perf_domains(const struct cpumask *cpu_map)
  {
-       int i, nr_pd = 0, nr_ps = 0, nr_cpus = cpumask_weight(cpu_map);
+       int i;
         struct perf_domain *pd = NULL, *tmp;
         int cpu = cpumask_first(cpu_map);
         struct root_domain *rd = cpu_rq(cpu)->rd;
@@ -431,20 +412,6 @@ static bool build_perf_domains(const struct cpumask *cpu_map)
                         goto free;
                 tmp->next = pd;
                 pd = tmp;
-
-               /*
-                * Count performance domains and performance states for the
-                * complexity check.
-                */
-               nr_pd++;
-               nr_ps += em_pd_nr_perf_states(pd->em_pd);
-       }
-
-       /* Bail out if the Energy Model complexity is too high. */
-       if (nr_pd * (nr_ps + nr_cpus) > EM_MAX_COMPLEXITY) {
-               WARN(1, "rd %*pbl: Failed to start EAS, EM complexity is too high\n",
-                                               cpumask_pr_args(cpu_map));
-               goto free;
         }
  
         perf_domain_debug(cpu_map, pd);
author	Pierre Gondois <Pierre.Gondois@arm.com>
	Mon, 9 Oct 2023 06:00:36 +0000 (11:30 +0530)
committer	Ingo Molnar <mingo@kernel.org>
	Mon, 9 Oct 2023 11:07:27 +0000 (13:07 +0200)
Documentation/scheduler/sched-energy.rst		patch \| blob \| history
kernel/sched/topology.c		patch \| blob \| history