The key Python objects supported by the vegas module are:
- vegas.Integrator — an object describing a multidimensional integration operator. These contain information about the integration volume, but also about optimal remappings of the integration variables based upon the last integral evaluated using the object.
- vegas.AdaptiveMap — an object describing the remappings used by vegas.
- vegas.RWAvg — an object describing the result of a vegas integration. vegas returns the weighted average of the integral estimates from each vegas iteration as an object of class vegas.RWAvg. These are Gaussian random variables — that is, they have a mean and a standard deviation — but also contain information about the iterations vegas used in generating the result.
- vegas.RWAvgArray — an array version of vegas.RWAvg used when the integrand is array-valued.
These are described in detail below.
The central component of the vegas package is the integrator class:
Adaptive multidimensional Monte Carlo integration.
vegas.Integrator objects make Monte Carlo estimates of integrals of multidimensional functions f(x), where x[d] is a point in the integration volume:
integ = vegas.Integrator(integration_region)
result = integ(f, nitn=10, neval=10000)
The integrator makes nitn estimates of the integral, each using at most neval samples of the integrand, as it adapts to the specific features of the integrand. Successive estimates typically improve in accuracy until the integrator has fully adapted. The integrator returns the weighted average of all nitn estimates, together with an estimate of the statistical (Monte Carlo) uncertainty in that estimate of the integral. The result is an object of type RWAvg (which is derived from gvar.GVar).
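To make "weighted average" concrete, here is a minimal sketch that combines per-iteration estimates using inverse-variance weights, the usual choice for independent Gaussian estimates. The function name weighted_average and the sample numbers are illustrative assumptions, not part of vegas, and the bookkeeping inside vegas may differ in detail.

```python
import math

def weighted_average(estimates):
    """Combine (mean, sdev) pairs using inverse-variance weights."""
    weights = [1.0 / sdev ** 2 for _, sdev in estimates]
    total = sum(weights)
    mean = sum(w * m for w, (m, _) in zip(weights, estimates)) / total
    sdev = math.sqrt(1.0 / total)       # smaller than any single sdev
    return mean, sdev

# made-up results from three iterations: (mean, standard deviation)
itn_results = [(1.10, 0.20), (0.98, 0.10), (1.01, 0.05)]
mean, sdev = weighted_average(itn_results)
print('%.4f +- %.4f' % (mean, sdev))
```

Precise iterations dominate the average, which is why early, poorly adapted iterations do little harm to the final result.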
Integrands can be array-valued, in which case f(x) returns an array of values corresponding to different integrands. Also vegas can generate integration points in batches (vectors) for integrands built from classes derived from vegas.VecIntegrand. Vectorized integrands are typically much faster, especially if they are coded in Cython.
vegas.Integrators have a large number of parameters but the only ones that most people will care about are: the number nitn of iterations of the vegas algorithm; the maximum number neval of integrand evaluations per iteration; and the damping parameter alpha, which is used to slow down the adaptive algorithms when they would otherwise be unstable (e.g., very peaky integrands). Setting parameter analyzer=vegas.reporter() is sometimes useful, as well, since it causes vegas to print (on sys.stdout) intermediate results from each iteration, as they are produced. This helps when each iteration takes a long time to complete (e.g., an hour) because it allows you to monitor progress as it is being made (or not).
Parameters: |
|
---|
vegas.Integrator objects have attributes for each of these parameters. In addition they have the following methods:
Integrate integrand fcn.
A typical integrand has the form, for example:
def f(x):
    return x[0] ** 2 + x[1] ** 4
The argument x[d] is an integration point, where index d=0... represents direction.
Integrands can be array-valued, representing multiple integrands: e.g.,
def f(x):
    return [x[0] ** 2, x[0] / x[1]]
The return arrays can have any shape. This is useful for integrands that are closely related, and can lead to substantial reductions in the errors for ratios or differences of the results.
It is usually much faster to use vegas in vector mode, where integration points are presented to the integrand in batches or vectors. Integrands for use in vector mode are objects of classes derived from vegas.VecIntegrand: e.g.,
class vecf(vegas.VecIntegrand):
    def __call__(self, x):
        return x[:, 0] ** 2 + x[:, 1] ** 4
vecf() would be the integrand. Here x[i, d] represents a collection of different integration points labeled by i=0.... (The number of points is controlled by the vegas.Integrator parameter nhcube_vec.) The vector index is always first.
An array-valued integrand for vector mode would be an object of a type like: e.g.,
class vecf(vegas.VecIntegrand):
    def __call__(self, x):
        f = numpy.empty((x.shape[0], 2), float)
        f[:, 0] = x[:, 0] ** 2
        f[:, 1] = x[:, 0] / x[:, 1]
        return f
Vector mode is particularly useful when the class derived from vegas.VecIntegrand is coded in Cython. Then loops over the integration points can be coded explicitly, avoiding the need to use numpy's vector operators if they are not well suited to the integrand.
Any vegas parameter can also be reset: e.g., self(fcn, nitn=20, neval=1e6).
Reset default parameters in integrator.
Usage is analogous to the constructor for vegas.Integrator: for example,
old_defaults = integ.set(neval=1e6, nitn=20)
resets the default values for neval and nitn in vegas.Integrator integ. A dictionary, here old_defaults, is returned. It can be used to restore the old defaults using, for example:
integ.set(old_defaults)
Assemble summary of integrator settings into string.
Parameters: | ngrid (int) – Number of grid nodes in each direction to include in summary. The default is 0. |
---|---|
Returns: | String containing the settings. |
Iterator over integration points and weights.
This method creates an iterator that returns integration points from vegas, and their corresponding weights in an integral. Each point x[d] is accompanied by the weight assigned to that point by vegas when estimating an integral.
Given a vegas.Integrator integ, presumably trained on some integrand, the following code would create a Monte Carlo estimate of the integral of a possibly different integrand f(x):
integral = 0.0
for x, wgt in integ.random():
    integral += wgt * f(x)
Here f(x) returns the integrand value for point x[d].
integ.random(yield_hcube=True) will yield the integration point x, the weight wgt, and the index hcube of the y-space hypercube containing this point (hypercubes are indexed by consecutive integers, starting at 0).
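The point-and-weight contract can be illustrated without vegas. The sketch below uses a hypothetical generator, uniform_points (not part of vegas), that yields unadapted points from a rectangular region, each with the same weight volume/n. A trained integrator would instead concentrate points where the integrand matters and adjust the weights to compensate, but the accumulation loop is the same.

```python
import random

def uniform_points(region, n, seed=1):
    """Yield (x, wgt) pairs for n points uniform in a rectangular region."""
    rng = random.Random(seed)
    volume = 1.0
    for a, b in region:
        volume *= b - a
    wgt = volume / n                    # every point carries the same weight
    for _ in range(n):
        yield [rng.uniform(a, b) for a, b in region], wgt

def f(x):
    return x[0] ** 2 + x[1] ** 4

integral = 0.0
for x, wgt in uniform_points([[0, 1], [0, 1]], n=20000):
    integral += wgt * f(x)
print(integral)                         # exact value is 1/3 + 1/5
```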
Iterator over integration points and weights.
This method creates an iterator that returns integration points from vegas, and their corresponding weights in an integral. The points are provided in arrays x[i, d] where i=0... labels the integration points in a batch (or vector) and d=0... labels direction. The corresponding weights assigned by vegas to each point are provided in an array wgt[i].
Given a vegas.Integrator integ, presumably trained on some integrand, the following code would create a Monte Carlo estimate of the integral of a possibly different (vector) integrand f(x):
integral = 0.0
for x, wgt in integ.random_vec():
    f_array = f(x)
    integral += wgt.dot(f_array)
Here f(x) returns an array f_array[i] corresponding to the integrand values for points x[i, d]. The points and weights yielded by the iterator are numpy arrays.
integ.random_vec(yield_hcube=True) will yield the integration points x[i, d], the corresponding weights wgt[i], and the corresponding indices hcube[i] of the y-space hypercubes containing the points (hypercubes are indexed by consecutive integers, starting at 0). This information makes it possible to estimate the variance of an integral estimate:
integral = 0.0
variance = 0.0
for x, wgt, hcube in integ.random_vec(yield_hcube=True):
    wgt_fx = wgt * f(x)
    # iterate over hypercubes: compute variance for each,
    # and accumulate for final result
    for i in range(hcube[0], hcube[-1] + 1):
        idx = (hcube == i)      # select array items for h-cube i
        nwf = numpy.sum(idx)    # number of points in h-cube i
        wf = wgt_fx[idx]
        sum_wf = numpy.sum(wf)
        sum_wf2 = numpy.sum(wf ** 2)
        integral += sum_wf
        variance += (sum_wf2 * nwf - sum_wf ** 2) / (nwf - 1.)
# answer = integral; standard deviation = variance ** 0.5
result = gvar.gvar(integral, variance ** 0.5)
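The per-hypercube variance formula above can be tried on a one-dimensional problem without vegas. The following sketch splits [0, 1] into equal hypercubes and samples each one a fixed number of times. The function stratified_estimate and its parameters are hypothetical, chosen only for illustration. Each hypercube contributes the estimated variance of a sum of nper independent samples, which is where the factor (nwf * sum_wf2 - sum_wf ** 2) / (nwf - 1) comes from.

```python
import random

def stratified_estimate(f, ncube=10, nper=4, seed=1):
    """Estimate the integral of f over [0, 1] and its standard deviation.

    The interval is split into ncube equal hypercubes, each sampled nper times.
    """
    rng = random.Random(seed)
    width = 1.0 / ncube
    wgt = width / nper                  # weight of every point
    integral = 0.0
    variance = 0.0
    for i in range(ncube):
        wf = [wgt * f((i + rng.random()) * width) for _ in range(nper)]
        sum_wf = sum(wf)
        sum_wf2 = sum(v ** 2 for v in wf)
        integral += sum_wf
        # estimated variance of the sum of nper samples in this hypercube
        variance += (sum_wf2 * nper - sum_wf ** 2) / (nper - 1.)
    return integral, variance ** 0.5

mean, sdev = stratified_estimate(lambda x: x ** 2)
print('%.4f +- %.4f' % (mean, sdev))    # exact integral is 1/3
```

At least two points per hypercube are needed, since the formula divides by nwf - 1.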
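vegas’s remapping of the integration variables is handled by a vegas.AdaptiveMap object, which maps the original integration variables x into new variables y in a unit hypercube. Each direction has its own map specified by a grid in x space:

$$x_0 = a, \quad x_1 = x_0 + \Delta x_0, \quad x_2 = x_1 + \Delta x_1, \quad \ldots, \quad x_N = x_{N-1} + \Delta x_{N-1} = b$$

where $a$ and $b$ are the limits of integration. The grid specifies the transformation function at the points $y = i/N$ for $i = 0, 1, \ldots, N$:

$$x(y = i/N) = x_i$$

Linear interpolation is used between those points. The Jacobian for this transformation is:

$$J(y) = J_i = N \, \Delta x_i$$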
vegas adjusts the increment sizes to optimize its Monte Carlo estimates of the integral. This involves training the grid. To illustrate how this is done with vegas.AdaptiveMaps, consider a simple two-dimensional integral over a unit hypercube with integrand:
def f(x):
    return x[0] * x[1] ** 2
We want to create a grid that optimizes uniform Monte Carlo estimates of the integral in y space. We do this by sampling the integrand at a large number ny of random points y[j, d], where j=0...ny-1 and d=0,1, uniformly distributed throughout the integration volume in y space. These samples can be used to train the grid using the following code:
import vegas
import numpy as np
def f(x):
    return x[0] * x[1] ** 2
m = vegas.AdaptiveMap([[0, 1], [0, 1]], ninc=5)
ny = 1000
y = np.random.uniform(0., 1., (ny, 2))  # 1000 random y's
x = np.empty(y.shape, float)            # work space
jac = np.empty(y.shape[0], float)
f2 = np.empty(y.shape[0], float)
print('initial grid:')
print(m.settings())
for itn in range(5):                    # 5 iterations to adapt
    m.map(y, x, jac)                    # compute x's and jac
    for j in range(ny):                 # compute training data
        f2[j] = (jac[j] * f(x[j])) ** 2
    m.add_training_data(y, f2)          # adapt
    m.adapt(alpha=1.5)
    print('iteration %d:' % itn)
    print(m.settings())
In each of the 5 iterations, the vegas.AdaptiveMap adjusts the map, making increments smaller where f2 is larger and larger where f2 is smaller. The map converges after only 2 or 3 iterations, as is clear from the output:
initial grid:
grid[ 0] = [ 0. 0.2 0.4 0.6 0.8 1. ]
grid[ 1] = [ 0. 0.2 0.4 0.6 0.8 1. ]
iteration 0:
grid[ 0] = [ 0. 0.395 0.601 0.747 0.878 1. ]
grid[ 1] = [ 0. 0.504 0.683 0.812 0.906 1. ]
iteration 1:
grid[ 0] = [ 0. 0.408 0.614 0.761 0.888 1. ]
grid[ 1] = [ 0. 0.535 0.716 0.833 0.921 1. ]
iteration 2:
grid[ 0] = [ 0. 0.409 0.616 0.762 0.89 1. ]
grid[ 1] = [ 0. 0.535 0.716 0.833 0.922 1. ]
iteration 3:
grid[ 0] = [ 0. 0.41 0.616 0.762 0.891 1. ]
grid[ 1] = [ 0. 0.535 0.716 0.833 0.923 1. ]
iteration 4:
grid[ 0] = [ 0. 0.41 0.616 0.763 0.891 1. ]
grid[ 1] = [ 0. 0.536 0.716 0.833 0.923 1. ]
The grid increments along direction 0 shrink at larger values of x[0], varying as 1/x[0]. Along direction 1 the increments shrink more quickly, varying as 1/x[1]**2.
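This scaling can be checked directly against the iteration 4 output above. If the increments vary as 1/x[0], then each increment width times its midpoint should be roughly constant, and similarly with the midpoint squared for direction 1. The helper scaled_increments is a hypothetical name used only for this check.

```python
# node positions copied from the "iteration 4" output above
grid0 = [0., 0.41, 0.616, 0.763, 0.891, 1.]
grid1 = [0., 0.536, 0.716, 0.833, 0.923, 1.]

def scaled_increments(grid, power):
    """Return inc * midpoint**power for each grid increment."""
    out = []
    for lo, hi in zip(grid[:-1], grid[1:]):
        mid = 0.5 * (lo + hi)
        out.append((hi - lo) * mid ** power)
    return out

s0 = scaled_increments(grid0, 1)    # roughly constant if inc ~ 1/x[0]
s1 = scaled_increments(grid1, 2)    # roughly constant if inc ~ 1/x[1]**2
print(['%.3f' % v for v in s0])
print(['%.3f' % v for v in s1])
```

The first increment in each direction does not follow the pattern, since it starts at x = 0 where a 1/x scaling cannot hold. The remaining products agree with each other to within a few percent.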
vegas samples the integrand in order to estimate the integral. It uses those same samples to train its vegas.AdaptiveMap in this fashion, for use in subsequent iterations of the algorithm.
Adaptive map y->x(y) for multidimensional y and x.
An AdaptiveMap defines a multidimensional map y -> x(y) from the unit hypercube, with 0 <= y[d] <= 1, to an arbitrary hypercube in x space. Each direction is mapped independently with a Jacobian that is tunable (i.e., “adaptive”).
The map is specified by a grid in x-space that, by definition, maps into a uniformly spaced grid in y-space. The nodes of the grid are specified by grid[d, i] where d is the direction (d=0,1...dim-1) and i labels the grid point (i=0,1...N). The mapping for a specific point y into x space is:
y[d] -> x[d] = grid[d, i(y[d])] + inc[d, i(y[d])] * delta(y[d])
where i(y)=floor(y*N), delta(y)=y*N - i(y), and inc[d, i] = grid[d, i+1] - grid[d, i]. The Jacobian for this map,
dx[d]/dy[d] = inc[d, i(y[d])] * N,
is piece-wise constant and proportional to the x-space grid spacing. Each increment in the x-space grid maps into an increment of size 1/N in the corresponding y space. So regions in x space where inc[d, i] is small are stretched out in y space, while larger increments are compressed.
The x grid for an AdaptiveMap can be specified explicitly when the map is created: for example,
m = AdaptiveMap([[0, 0.1, 1], [-1, 0, 1]])
creates a two-dimensional map where the x[0] intervals (0,0.1) and (0.1,1) map into the y[0] intervals (0,0.5) and (0.5,1) respectively, while x[1] intervals (-1,0) and (0,1) map into y[1] intervals (0,0.5) and (0.5,1).
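The mapping formula can be applied by hand to the first direction of this example. The sketch below is a plain-Python transcription of the formula for one direction; map_1d is a hypothetical helper, not a vegas function, and clamping the index at y = 1 is an assumption made here so that the endpoint falls in the last increment.

```python
import math

def map_1d(grid, y):
    """Return (x, jac) for one direction of a piecewise-linear map."""
    N = len(grid) - 1
    i = min(int(math.floor(y * N)), N - 1)  # clamp so y = 1 uses the last increment
    delta = y * N - i
    inc = grid[i + 1] - grid[i]
    return grid[i] + inc * delta, inc * N

grid0 = [0, 0.1, 1]
for y in [0.0, 0.25, 0.5, 0.75, 1.0]:
    x, jac = map_1d(grid0, y)
    print('y = %.2f  x = %.3f  jac = %.1f' % (y, x, jac))
```

Here y = 0.25 lands at x = 0.05, the middle of the narrow increment, with Jacobian 0.2, while y = 0.75 lands at x = 0.55 with Jacobian 1.8. Half of all uniformly distributed y values therefore fall in the x interval (0,0.1).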
More typically an initially uniform map is trained with data f[j] corresponding to ny points y[j, d], with j=0...ny-1, uniformly distributed in y space: for example,
m.add_training_data(y, f)
m.adapt(alpha=1.5)
m.adapt(alpha=1.5) shrinks grid increments where f[j] is large, and expands them where f[j] is small. Typically one has to iterate over several sets of ys and fs before the grid has fully adapted.
The speed with which the grid adapts is determined by parameter alpha. Large (positive) values imply rapid adaptation, while small values (much less than one) imply slow adaptation. As in any iterative process, it is usually a good idea to slow adaptation down in order to avoid instabilities.
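The idea behind adaptation can be sketched in one dimension: move the grid nodes so that every new increment holds an equal share of the (damped) training weight. The code below is a simplified illustration, not the algorithm vegas uses. The function adapt_grid is hypothetical, and raising the training values to the power alpha is an assumed stand-in for the damping in vegas, chosen so that alpha=0 leaves a uniform grid unchanged and larger alpha adapts faster.

```python
def adapt_grid(grid, avg_f, alpha=1.0):
    """Return a new grid whose increments carry equal weight.

    avg_f[i] is the average training value in old increment i.
    """
    n = len(grid) - 1
    w = [v ** alpha for v in avg_f]     # assumed damping rule
    total = sum(w)
    if total <= 0:
        return list(grid)               # no information: keep the grid
    target = total / n                  # weight per new increment
    new_grid = [grid[0]]
    acc = 0.0                           # weight of old increments fully passed
    i = 0
    for k in range(1, n):
        goal = k * target
        while acc + w[i] < goal:
            acc += w[i]
            i += 1
        frac = (goal - acc) / w[i]      # position of new node inside increment i
        new_grid.append(grid[i] + frac * (grid[i + 1] - grid[i]))
    new_grid.append(grid[-1])
    return new_grid

grid = [0., 0.2, 0.4, 0.6, 0.8, 1.]
avg_f = [1., 1., 1., 1., 16.]           # training data peaked in the last increment
fast = adapt_grid(grid, avg_f, alpha=1.0)
slow = adapt_grid(grid, avg_f, alpha=0.5)
print(fast)
print(slow)
```

With alpha=1.0 the last increment shrinks to a width of about 0.05, while alpha=0.5 leaves it at about 0.08, showing how a smaller alpha slows the adaptation.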
Parameters: |
|
---|
Number of dimensions.
Number of increments along each grid axis.
The nodes of the grid defining the maps are self.grid[d, i] where d=0... specifies the direction and i=0...self.ninc the node.
The increment widths of the grid:
self.inc[d, i] = self.grid[d, i + 1] - self.grid[d, i]
Adapt grid to accumulated training data.
self.adapt(...) projects the training data onto each axis independently and maps it into x space. It shrinks x-grid increments in regions where the projected training data is large, and grows increments where the projected data is small. The grid along any direction is unchanged if the training data is constant along that direction.
The number of increments along a direction can be changed by setting parameter ninc.
The grid does not change if no training data has been accumulated, unless ninc is specified, in which case the number of increments is adjusted while preserving the relative density of increments at different values of x.
Parameters: |
|
---|
Add training data f for y-space points y.
Accumulates training data for later use by self.adapt(). Grid increments will be made smaller in regions where f is larger than average, and larger where f is smaller than average. The grid is unchanged (converged?) when f is constant across the grid.
Parameters: |
|
---|
Return x values corresponding to y.
y can be a single dim-dimensional point, or it can be an array y[i,j, ..., d] of such points (d=0..dim-1).
Return the map’s Jacobian at y.
y can be a single dim-dimensional point, or it can be an array y[i,j, ..., d] of such points (d=0..dim-1).
Replace the grid with a uniform grid.
The new grid has ninc increments along each direction if ninc is specified. Otherwise it has the same number of increments as the old grid.
Map y to x, where jac is the Jacobian.
y[j, d] is an array of ny y-values for direction d. x[j, d] is filled with the corresponding x values, and jac[j] is filled with the corresponding Jacobian values. x and jac must be preallocated: for example,
x = numpy.empty(y.shape, float)
jac = numpy.empty(y.shape[0], float)
Parameters: |
|
---|
Display plots showing the current grid.
Parameters: |
|
---|---|
axes: | List of pairs of directions to use in different views of the grid. Using None in place of a direction plots the grid for only one direction. Omitting axes causes a default set of pairings to be used. |
Create string with information about grid nodes.
Creates a string containing the locations of the nodes in the map grid for each direction. Parameter ngrid specifies the maximum number of nodes to print (spread evenly over the grid).
Running weighted average of Monte Carlo estimates.
This class accumulates independent Monte Carlo estimates (e.g., of an integral) and combines them into a single weighted average. It is derived from gvar.GVar (from the lsqfit module if it is present) and represents a Gaussian random variable.
The mean value of the weighted average.
The standard deviation of the weighted average.
chi**2 of weighted average.
Number of degrees of freedom in weighted average.
Q or p-value of weighted average’s chi**2.
A list of the results from each iteration.
Add estimate g to the running average.
Assemble summary of independent results into a string.
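The chi**2 and degrees-of-freedom attributes can be illustrated with a short calculation. The sketch below assumes that chi**2 measures the scatter of the individual estimates about their inverse-variance weighted average, with one degree of freedom lost to the average itself. The function chi2_of_average and the sample numbers are hypothetical. The Q value is omitted because it needs an incomplete gamma function, which the Python standard library does not provide.

```python
def chi2_of_average(estimates):
    """Return (chi2, dof) for the weighted average of (mean, sdev) pairs."""
    weights = [1.0 / sdev ** 2 for _, sdev in estimates]
    mean = sum(w * m for w, (m, _) in zip(weights, estimates)) / sum(weights)
    chi2 = sum(w * (m - mean) ** 2 for w, (m, _) in zip(weights, estimates))
    dof = len(estimates) - 1            # one constraint: the average
    return chi2, dof

# made-up results from three iterations: (mean, standard deviation)
itn_results = [(1.10, 0.20), (0.98, 0.10), (1.01, 0.05)]
chi2, dof = chi2_of_average(itn_results)
print('chi2/dof = %.2f [%d]' % (chi2 / dof, dof))
```

A chi**2 per degree of freedom much larger than one suggests that the estimates disagree by more than their quoted errors, so the weighted average should not be trusted.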
Running weighted average of array-valued Monte Carlo estimates.
This class accumulates independent arrays of Monte Carlo estimates (e.g., of an integral) and combines them into an array of weighted averages. It is derived from numpy.ndarray. The array elements are gvar.GVars (from lsqfit) and represent Gaussian random variables.
chi**2 of weighted average.
Number of degrees of freedom in weighted average.
Q or p-value of weighted average’s chi**2.
A list of the results from each iteration.
Add estimate g to the running average.
Assemble summary of independent results into a string.
Base class for classes providing vectorized integrands.
A class derived from vegas.VecIntegrand will normally provide a __call__(self, x) method that returns an array f where:
x[i, d] is a contiguous array where i=0... labels different integration points and d=0... labels different directions in the integration space.
f[i] is a contiguous array containing the integrand values corresponding to the integration points x[i, :]. f[i] is either a number, for a single integrand, or an array (of any shape) for multiple integrands (i.e., an array-valued integrand).
x is a numpy array.
Deriving from vegas.VecIntegrand is the easiest way to construct integrands in Cython, and gives the fastest results.