next up previous print clean
Next: ACKNOWLEDGEMENTS Up: Preconditioning Previous: UNDERDETERMINED LEAST-SQUARES

SCALING THE ADJOINT

Given the usual linearized fitting goal between data space and model space, $ \bold d \approx \bold F \bold m$,the simplest image of the model space results from application of the adjoint operator $ \hat \bold m = \bold F' \bold d$.Unless $\bold F$ has no physical units, however, the physical units of $\hat \bold m$ do not match those of $\bold m$,so we need a scaling factor. The theoretical solution $\bold m_{\rm theor} = (\bold F'\bold F)^{-1}\bold F'\bold d$suggests that the scaling units should be those of $(\bold F'\bold F)^{-1}$.We could probe the operator $\bold F$or its adjoint with white noise or a zero-frequency input. Bill Symes suggests we probe with the data $\bold d$ because it has the spectrum of interest. He proposes we make our image with $\hat \bold m = \bold W^2 \bold F'\bold d$where we choose the weighting function to be  
 \begin{displaymath}
\bold W^2 \quad =\quad
 {
 {\bf diag\ } {\bold F' \bold d}
 \over
 {\bf diag\ } {\bold F'\bold F\bold F'\bold d}
 }\end{displaymath} (38)
which obviously has the correct physical units. (The mathematical function ${\bf diag}$ takes a vector and lies it along the diagonal of a square matrix.) The weight $\bold W^2$can be thought of as a diagonal matrix containing the ratio of two images. A problem with the choice (38) is that the denominator might vanish or might even be negative. The way to stabilize any ratio is suggested at the beginning of Chapter [*]; that is, we revise the ratio a/b to  
 \begin{displaymath}
\bold W^2 \quad =\quad
 {
 {\bf diag} < ab \gt
 \over
 {\bf diag} < b^2 + \epsilon^2 \gt
 }\end{displaymath} (39)
where $\epsilon$ is a parameter to be chosen, and the angle braces indicate the possible need for local smoothing.

To go beyond the scaled adjoint we can use $\bold W$ as a preconditioner. To use $\bold W$ as a preconditioner we define implicitly a new set of variables $\bold p$by the substitution $\bold m=\bold W\bold p$.Then $\bold d \approx \bold F\bold m=\bold F\bold W\bold p$.To find $\bold p$ instead of $\bold m$,we do CD iteration with the operator $\bold F\bold W$ instead of with $\bold F$.As usual, the first step of the iteration is to use the adjoint of $\bold d\approx \bold F\bold W\bold p$ to form the image $\hat\bold p=(\bold F\bold W)'\bold d$.At the end of the iterations, we convert from $\bold p$ back to $\bold m$with $\bold m=\bold W\bold p$.The result after the first iteration $\hat\bold m=\bold W\hat\bold p=\bold W(\bold F\bold W)'\bold d=\bold W^2\bold F'\bold d$turns out to be the same as Symes scaling.

By (38), $\bold W$ has physical units inverse to $\bold F$.Thus the transformation $\bold F\bold W$ has no units so the $\bold p$ variables have physical units of data space. Experimentalists might enjoy seeing the solution $\bold p$with its data units more than viewing the solution $\bold m$with its more theoretical model units.

The theoretical solution for underdetermined systems $\bold m =\bold F' (\bold F \bold F')^{-1} \bold d$suggests an alternate approach using instead $\hat{\bold m} =\bold F' \bold W_d^2 \bold d$.A possibility for $\bold W_d^2$ is  
 \begin{displaymath}
\bold W_d^2 \quad =\quad
 {
 {\bf diag\ } {\bold d}
 \over
 {\bf diag\ } {\bold F\bold F'\bold d}
 }\end{displaymath} (40)

Experience tells me that a broader methodology is needed. Appropriate scaling is required in both data space and model space. We need something that includes a weight for each space, $\bold W_m$ and $\bold W_d$ where $\hat \bold m = \bold W_m \bold F'\bold W_d \bold d$.

I have a useful practical example (stacking in v(z) media) in another of my electronic books (BEI), where I found both $\bold W_m$ and $\bold W_d$ by iterative guessing. But I don't know how to give you a general strategy. I feel this is a major unsolved(?) opportunity.


next up previous print clean
Next: ACKNOWLEDGEMENTS Up: Preconditioning Previous: UNDERDETERMINED LEAST-SQUARES
Stanford Exploration Project
12/26/2000