E04 Chapter Contents
E04 Chapter Introduction
NAG Library Manual

# NAG Library Routine DocumentE04YCF

Note:  before using this routine, please read the Users' Note for your implementation to check the interpretation of bold italicised terms and other implementation-dependent details.

## 1  Purpose

E04YCF returns estimates of elements of the variance-covariance matrix of the estimated regression coefficients for a nonlinear least squares problem. The estimates are derived from the Jacobian of the function $f\left(x\right)$ at the solution.
This routine may be used following any one of the nonlinear least squares routines E04FCF, E04FYF, E04GBF, E04GDF, E04GYF, E04GZF, E04HEF or E04HYF.

## 2  Specification

 SUBROUTINE E04YCF ( JOB, M, N, FSUMSQ, S, V, LDV, CJ, WORK, IFAIL)
 INTEGER JOB, M, N, LDV, IFAIL REAL (KIND=nag_wp) FSUMSQ, S(N), V(LDV,N), CJ(N), WORK(N)

## 3  Description

E04YCF is intended for use when the nonlinear least squares function, $F\left(x\right)={f}^{\mathrm{T}}\left(x\right)f\left(x\right)$, represents the goodness-of-fit of a nonlinear model to observed data. The routine assumes that the Hessian of $F\left(x\right)$, at the solution, can be adequately approximated by $2{J}^{\mathrm{T}}J$, where $J$ is the Jacobian of $f\left(x\right)$ at the solution. The estimated variance-covariance matrix $C$ is then given by
 $C=σ2JTJ-1, JTJ nonsingular,$
where ${\sigma }^{2}$ is the estimated variance of the residual at the solution, $\stackrel{-}{x}$, given by
 $σ2=Fx- m-n ,$
$m$ being the number of observations and $n$ the number of variables.
The diagonal elements of $C$ are estimates of the variances of the estimated regression coefficients. See the E04 Chapter Introduction, Bard (1974) and Wolberg (1967) for further information on the use of $C$.
When ${J}^{\mathrm{T}}J$ is singular then $C$ is taken to be
 $C=σ2 JTJ †,$
where ${\left({J}^{\mathrm{T}}J\right)}^{†}$ is the pseudo-inverse of ${J}^{\mathrm{T}}J$, and
 $σ2 = F x- m-k , k=rank ​J$
but in this case the parameter IFAIL is returned as nonzero as a warning to you that $J$ has linear dependencies in its columns. The assumed rank of $J$ can be obtained from IFAIL.
The routine can be used to find either the diagonal elements of $C$, or the elements of the $j$th column of $C$, or the whole of $C$.
E04YCF must be preceded by one of the nonlinear least squares routines mentioned in Section 1, and requires the parameters FSUMSQ, S and V to be supplied by those routines (e.g., see E04FCF). FSUMSQ is the residual sum of squares $F\left(\stackrel{-}{x}\right)$ and S and V contain the singular values and right singular vectors respectively in the singular value decomposition of $J$. S and V are returned directly by the comprehensive routines E04FCF, E04GBF, E04GDF and E04HEF, but are returned as part of the workspace parameter W (from one of the easy-to-use routines). In the case of E04FYF, S starts at ${\mathbf{W}}\left(\mathit{NS}\right)$, where
 $NS=6× N+2× M+M× N+1+ max1, N × N-1 / 2$
and in the cases of the remaining easy-to-use routines, S starts at ${\mathbf{W}}\left(\mathit{NS}\right)$, where
 $NS=7× N+2× M+M× N+N× N+1 / 2+1+ max1, N × N-1 / 2 .$
The parameter V starts immediately following the elements of S, so that V starts at ${\mathbf{W}}\left(\mathit{NV}\right)$, where
 $NV=NS+N .$
For all the easy-to-use routines the parameter LDV must be supplied as N. Thus a call to E04YCF following E04FYF can be illustrated as
``` .
.
.
CALL E04FYF (M, N, LFUN1, X, FSUMSQ, W, LW, IUSER, RUSER, IFAIL)
.
.
.
NS = 6*N _ 2*M + M*N + 1 MAX((1,(N*(N-1))/2)
NV = NS + N;
CALL E04YCF (JOB, M, N, FSUMSQ, W(NS), W(NV), N, CJ, WORK, IFAIL)
```
where the parameters M, N, FSUMSQ and the $\left(n+{n}^{2}\right)$ elements ${\mathbf{W}}\left(\mathit{NS}\right),{\mathbf{W}}\left(\mathit{NS}+1\right),\dots ,\phantom{\rule{0ex}{0ex}}{\mathbf{W}}\left(\mathit{NV}+{{\mathbf{N}}}^{2}-1\right)$ must not be altered between the calls to E04FYF and E04YCF. The above illustration also holds for a call to E04YCF following a call to one of E04GYF, E04GZF or E04HYF, except that $\mathit{NS}$ must be computed as
 $NS=7× N+2× M+M× N+ N× N+1 /2+1+ max 1, N × N-1 / 2 .$
Bard Y (1974) Nonlinear Parameter Estimation Academic Press
Wolberg J R (1967) Prediction Analysis Van Nostrand

## 5  Parameters

1:     JOB – INTEGERInput
On entry: which elements of $C$ are returned as follows:
${\mathbf{JOB}}=-1$
The $n$ by $n$ symmetric matrix $C$ is returned.
${\mathbf{JOB}}=0$
The diagonal elements of $C$ are returned.
${\mathbf{JOB}}>0$
The elements of column JOB of $C$ are returned.
Constraint: $-1\le {\mathbf{JOB}}\le {\mathbf{N}}$.
2:     M – INTEGERInput
On entry: the number $m$ of observations (residuals ${f}_{i}\left(x\right)$).
Constraint: ${\mathbf{M}}\ge {\mathbf{N}}$.
3:     N – INTEGERInput
On entry: the number $n$ of variables $\left({x}_{j}\right)$.
Constraint: $1\le {\mathbf{N}}\le {\mathbf{M}}$.
4:     FSUMSQ – REAL (KIND=nag_wp)Input
On entry: the sum of squares of the residuals, $F\left(\stackrel{-}{x}\right)$, at the solution $\stackrel{-}{x}$, as returned by the nonlinear least squares routine.
Constraint: ${\mathbf{FSUMSQ}}\ge 0.0$.
5:     S(N) – REAL (KIND=nag_wp) arrayInput
On entry: the $n$ singular values of the Jacobian as returned by the nonlinear least squares routine. See Section 3 for information on supplying S following one of the easy-to-use routines.
6:     V(LDV,N) – REAL (KIND=nag_wp) arrayInput/Output
On entry: the $n$ by $n$ right-hand orthogonal matrix (the right singular vectors) of $J$ as returned by the nonlinear least squares routine. See Section 3 for information on supplying V following one of the easy-to-use routines.
On exit: if ${\mathbf{JOB}}\ge 0$, V is unchanged.
If ${\mathbf{JOB}}=-1$, the leading $n$ by $n$ part of V is overwritten by the $n$ by $n$ matrix $C$. When E04YCF is called with ${\mathbf{JOB}}=-1$ following an easy-to-use routine this means that $C$ is returned, column by column, in the ${n}^{2}$ elements of W given by ${\mathbf{W}}\left(\mathit{NV}\right),{\mathbf{W}}\left(\mathit{NV}+1\right),\dots ,{\mathbf{W}}\left(\mathit{NV}+{{\mathbf{N}}}^{2}-1\right)$. (See Section 3 for the definition of $\mathit{NV}$.)
7:     LDV – INTEGERInput
On entry: the first dimension of the array V as declared in the (sub)program from which E04YCF is called. When V is passed in the workspace parameter W (following one of the easy-to-use least square routines), LDV must be the value N.
Constraint: if ${\mathbf{JOB}}=-1$, ${\mathbf{LDV}}\ge {\mathbf{N}}$.
8:     CJ(N) – REAL (KIND=nag_wp) arrayOutput
On exit: if ${\mathbf{JOB}}=0$, CJ returns the $n$ diagonal elements of $C$.
If ${\mathbf{JOB}}=j>0$, CJ returns the $n$ elements of the $j$th column of $C$.
If ${\mathbf{JOB}}=-1$, CJ is not referenced.
9:     WORK(N) – REAL (KIND=nag_wp) arrayWorkspace
If ${\mathbf{JOB}}=-1$ or $0$, WORK is used as internal workspace.
If ${\mathbf{JOB}}>0$, WORK is not referenced.
10:   IFAIL – INTEGERInput/Output
On entry: IFAIL must be set to $0$, $-1\text{​ or ​}1$. If you are unfamiliar with this parameter you should refer to Section 3.3 in the Essential Introduction for details.
For environments where it might be inappropriate to halt program execution when an error is detected, the value $-1\text{​ or ​}1$ is recommended. If the output of error messages is undesirable, then the value $1$ is recommended. Otherwise, because for this routine the values of the output parameters may be useful even if ${\mathbf{IFAIL}}\ne {\mathbf{0}}$ on exit, the recommended value is $-1$. When the value $-\mathbf{1}\text{​ or ​}1$ is used it is essential to test the value of IFAIL on exit.
On exit: ${\mathbf{IFAIL}}={\mathbf{0}}$ unless the routine detects an error or a warning has been flagged (see Section 6).

## 6  Error Indicators and Warnings

If on entry ${\mathbf{IFAIL}}={\mathbf{0}}$ or $-{\mathbf{1}}$, explanatory error messages are output on the current error message unit (as defined by X04AAF).
Note: E04YCF may return useful information for one or more of the following detected errors or warnings.
Errors or warnings detected by the routine:
${\mathbf{IFAIL}}=1$
 On entry, ${\mathbf{JOB}}<-1$, or ${\mathbf{JOB}}>{\mathbf{N}}$, or ${\mathbf{N}}<1$, or ${\mathbf{M}}<{\mathbf{N}}$, or ${\mathbf{FSUMSQ}}<0.0$, or ${\mathbf{LDV}}<{\mathbf{N}}$.
${\mathbf{IFAIL}}=2$
The singular values are all zero, so that at the solution the Jacobian matrix $J$ has rank $0$.
${\mathbf{IFAIL}}>2$
At the solution the Jacobian matrix contains linear, or near linear, dependencies amongst its columns. In this case the required elements of $C$ have still been computed based upon $J$ having an assumed rank given by ${\mathbf{IFAIL}}-2$. The rank is computed by regarding singular values $SV\left(j\right)$ that are not larger than $10\epsilon ×SV\left(1\right)$ as zero, where $\epsilon$ is the machine precision (see X02AJF). If you expect near linear dependencies at the solution and are happy with this tolerance in determining rank you should call E04YCF with ${\mathbf{IFAIL}}={\mathbf{1}}$ in order to prevent termination (see the description of IFAIL). It is then essential to test the value of IFAIL on exit from E04YCF.
$\mathbf{\text{Overflow}}$
If overflow occurs then either an element of $C$ is very large, or the singular values or singular vectors have been incorrectly supplied.

## 7  Accuracy

The computed elements of $C$ will be the exact covariances corresponding to a closely neighbouring Jacobian matrix $J$.

When ${\mathbf{JOB}}=-1$ the time taken by E04YCF is approximately proportional to ${n}^{3}$. When ${\mathbf{JOB}}\ge 0$ the time taken by the routine is approximately proportional to ${n}^{2}$.

## 9  Example

This example estimates the variance-covariance matrix $C$ for the least squares estimates of ${x}_{1}$, ${x}_{2}$ and ${x}_{3}$ in the model
 $y=x1+t1x2t2+x3t3$
using the $15$ sets of data given in the following table:
 $y t1 t2 t3 0.14 1.0 15.0 1.0 0.18 2.0 14.0 2.0 0.22 3.0 13.0 3.0 0.25 4.0 12.0 4.0 0.29 5.0 11.0 5.0 0.32 6.0 10.0 6.0 0.35 7.0 9.0 7.0 0.39 8.0 8.0 8.0 0.37 9.0 7.0 7.0 0.58 10.0 6.0 6.0 0.73 11.0 5.0 5.0 0.96 12.0 4.0 4.0 1.34 13.0 3.0 3.0 2.10 14.0 2.0 2.0 4.39 15.0 1.0 1.0$
The program uses $\left(0.5,1.0,1.5\right)$ as the initial guess at the position of the minimum and computes the least squares solution using E04FYF. See the routine document E04FYF for further information.

### 9.1  Program Text

Program Text (e04ycfe.f90)

### 9.2  Program Data

Program Data (e04ycfe.d)

### 9.3  Program Results

Program Results (e04ycfe.r)