changeset 20667:f9392f6eda79

tools: improve NUMA guest placement when ballooning

The "guest to a single NUMA node" constraint algorithm does not work
well when ballooning is in use. Ballooning and NUMA do not play well
together anyway, because Dom0, and thus ballooning, is not NUMA aware.
I am working on this, but it will not be ready for the Xen 4.0 release
window. The usual ballooning situation results in an empty candidate
list, as no node has enough free memory to host the guest. In this
case the code simply picks the first node, again and again, because
every node without enough memory ends up penalized with the same
maxint value, regardless of its actual load. This patch changes that
to a relative penalty for nodes without enough memory, so that
low-load, low-memory nodes are eventually used. A penalty equal to
the load of a half-loaded node has proven to be a good value, as an
unbalanced system is much worse than non-local memory access for
guests. Regardless of that, you should restrict Dom0 on a NUMA system
to a reasonable memory size, so that ballooning is not necessary most
of the time. In that case the guest's memory will be NUMA local.

Signed-off-by: Andre Przywara <>
author Keir Fraser <>
date Fri Dec 11 08:59:54 2009 +0000 (2009-12-11)
parents a50c1cbf08ec
children f61303bcc3ac
files tools/python/xen/xend/
--- a/tools/python/xen/xend/	Fri Dec 11 08:58:06 2009 +0000
+++ b/tools/python/xen/xend/	Fri Dec 11 08:59:54 2009 +0000
@@ -2670,10 +2670,9 @@ class XendDomainInfo:
                                     nodeload[i] += 1
                                     break
                 for i in range(0, nr_nodes):
-                    if len(info['node_to_cpu'][i]) > 0 and i in node_list:
-                        nodeload[i] = int(nodeload[i] * 16 / len(info['node_to_cpu'][i]))
-                    else:
-                        nodeload[i] = sys.maxint
+                    nodeload[i] = int(nodeload[i] * 16 / len(info['node_to_cpu'][i]))
+                    if len(info['node_to_cpu'][i]) == 0 or i not in node_list:
+                        nodelist[i] += 8
                 return map(lambda x: x[0], sorted(enumerate(nodeload), key=lambda x:x[1]))
 
             info = xc.physinfo()
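
The effect described in the commit message can be illustrated with a small standalone sketch. This is not the xend code: `rank_nodes`, its parameters, and the `relative` switch are hypothetical names introduced here, it is written for Python 3 (so `sys.maxsize` stands in for `sys.maxint`), and it assumes the penalty of 8 is meant to be added to the node's load score. The guard for nodes without CPUs is also an assumption of the sketch and is not taken from the patch.

```python
import sys

PENALTY = 8  # half of the 16-per-CPU load scale used in the diff


def rank_nodes(nodeload, node_to_cpu, node_list, relative=True):
    """Return node indices ordered from most to least preferred.

    nodeload    -- number of VCPUs already placed on each node
    node_to_cpu -- list of CPU lists, one per node
    node_list   -- candidate nodes with enough free memory for the guest
    relative    -- True: add a fixed penalty to non-candidates (new behaviour)
                   False: give non-candidates the maximum score (old behaviour)
    """
    scores = []
    for node, vcpus in enumerate(nodeload):
        ncpus = len(node_to_cpu[node])
        if ncpus == 0:
            # Assumption of this sketch: a node without CPUs is never preferred,
            # and we avoid dividing by zero.
            scores.append(sys.maxsize)
            continue
        score = vcpus * 16 // ncpus  # load normalised to 16 units per CPU
        if node not in node_list:
            score = score + PENALTY if relative else sys.maxsize
        scores.append(score)
    # sorted() is stable, so equal scores keep their node order
    return [node for node, _ in sorted(enumerate(scores), key=lambda x: x[1])]


# Ballooning case: no node has enough free memory, node 0 is already busy.
cpus = [[0, 1, 2, 3], [4, 5, 6, 7]]
print(rank_nodes([8, 0], cpus, node_list=[], relative=False))  # [0, 1]
print(rank_nodes([8, 0], cpus, node_list=[], relative=True))   # [1, 0]
```

With the absolute penalty every node ties and the first one always wins. With the relative penalty the idle node 1 is preferred, and a lightly loaded node without enough memory can even outrank a heavily loaded candidate.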