Next: REFERENCES Up: Table of Contents

Short Note
Preconditioning

Jon Claerbout and Dave Nichols

Author has no known email address

In our C++ project we plan to supply reusable least-squares solvers. This led to a question that we are only beginning to answer, ``are there any helpful general purpose preconditioners?'' The one described in this note applies to operators that are representable in partitioned form, such as the regression

$\begin{displaymath} \left[ \begin{array} {cc} \bold A_1 & \bold A_2 \end{arra... ...\ \bold x_2 \end{array} \right] \quad \approx \quad \bold y\end{displaymath}$ (1)

We came up with two possibilities for rescaling $\tilde {\bf x} = \bold M \bold x$ before going to a conjugate-gradient (CG) solver. These two possibilities are:

$\begin{displaymath} \bold M \quad = \quad \left[ \begin{array} {cc} \Vert\bo... ...t & 0 \\ 0 & \Vert\bold A_2' \bold y\Vert \end{array} \right]\end{displaymath}$ (2)

$\begin{displaymath} \bold M \quad = \quad \left[ \begin{array} {cc} \Vert\bo... ... & 0 \\ 0 & \Vert\bold A_2 \bold r_2\Vert \end{array} \right]\end{displaymath}$ (3)

where $\bold r_1$ and $\bold r_2$ contain random numbers and where $\Vert\bold v\Vert$ is some norm such as the square root of the sum of the squares. The basic idea of preconditioning is that you solve $\bold A\bold x=\bold A\bold M^{-1}\bold M\bold x\approx \bold y$ in the form $(\bold A\bold M^{-1})\tilde\bold x\approx \bold y$ by the usual CG procedure and then solve $\bold M \bold x \approx \tilde {\bf x}$ .The new system should converge faster if $\bold A\bold M^{-1}\approx \bold I$ .(In general, everyone is left to his or her own ingenuity for finding $\bold M$ .We believe that gain control and deconvolution operators are good prospects for use in guessing a good $\bold M$ ).

We dislike (3) because we don't know how to specify the random numbers. We know the numbers should not be all ``1'' because d/dx and $\nabla^2$ are important operators and their norm is clearly non zero whereas it might appear to be zero if the random numbers were taken to be all ``1''. On the other hand should the random numbers be all positive or both positive and negative? We are not sure we can make a prior specification that would be generally satisfactory. While positive random numbers might be safe enough, it is worth considering (2).

Can it really be that (2) contains an idea not already inside the CG solver? After all, the first step of CG computes the sum of these two terms. Apparently yes, there is an important idea missing from the CG solver which has no prior knowledge that the operator can be partitioned into the $\bold A_1$ and $\bold A_2$ parts.

Partitioned problems of the kind discussed here frequently arise from nonlinear optimization where there are two partial derivative matrices with different physical dimensions. Since the problems are nonlinear and the operators themselves change as the iteration proceeds, it is natural to think of changing the preconditioning scalars as the iteration proceeds. In the linear case, however, rescaling with iteration would mess up the CG solver. We asked Michael Saunders about the existence of nonlinear solvers for this type of problem, but he did not have any immediate leads.

An interesting close-to-home example of a partitioned regression problem is found in PVI page 182. This example is a one-dimensional missing data problem. There the $\bold x_1$ vector contains the 3-5 components of a prediction-error filter (dimensionless transfer function) whereas the $\bold x_2$ vector would have the typical 1-2 thousand time samples of a seismogram (units of pressure, for example). We have not tested the two alternative preconditioners suggested in this paper because we have not yet done the programming. Also, a more meaningful evaluation will arise in a multidimensional context where preconditioning is more important, and where more programming, and perhaps some more research, are required.

Two apparently related references are Kennett et al. (1988) and Golub and Varah (1974).

Next: REFERENCES Up: Table of Contents

Stanford Exploration Project
11/16/1997

Short Note Preconditioning

Short Note
Preconditioning