naginterfaces.library.contab.binary¶

naginterfaces.library.contab.binary(n, gprob, x, irl, a, c, cgetol, chisqr, iprint=1, maxit=1000, ishow=0, io_manager=None)[source]¶

binary fits a latent variable model (with a single factor) to data consisting of a set of measurements on individuals in the form of binary-valued sequences (generally referred to as score patterns). Various measures of goodness-of-fit are calculated along with the factor (theta) scores.

For full information please refer to the NAG Library document for g11sa

https://www.nag.com/numeric/nl/nagdoc_29.3/flhtml/g11/g11saf.html

Parameters

nint

$n$ , the number of individuals in the sample.

gprobbool

Must be set equal to $T r u e$ if $G (z) = Φ^{- 1} (z)$ and $F a l s e$ if $G (z) = l o g i t (z)$ .

xbool, array-like, shape $(ns, ip)$

The first $s$ rows of $x$ must contain the $s$ different score patterns. The $l$ th row of $x$ must contain the $l$ th score pattern with $x [l - 1, j - 1]$ set equal to $T r u e$ if $x_{l j} = 1$ and $F a l s e$ if $x_{l j} = 0$ . All rows of $x$ must be distinct.

irlint, array-like, shape $(ns)$

The $i$ th component of $i r l$ must be set equal to the frequency with which the $i$ th row of $x$ occurs.

afloat, array-like, shape $(ip)$

$a [j - 1]$ must be set equal to an initial estimate of $α_{j 1}$ . In order to avoid divergence problems with the E-M algorithm you are strongly advised to set all the $a [j - 1]$ to $0.5$ .

cfloat, array-like, shape $(ip)$

$c [j - 1]$ must be set equal to an initial estimate of $α_{j 0}$ . In order to avoid divergence problems with the E-M algorithm you are strongly advised to set all the $c [j - 1]$ to $0.0$ .

cgetolfloat

The accuracy to which the solution is required.

If $c g e t o l$ is set to $10^{- l}$ and on exit the function exits successfully or $e r r n o$ = 7, then all elements of the gradient vector will be smaller than $10^{- l}$ in absolute value.

For most practical purposes the value $10^{- 4}$ should suffice.

You should be wary of setting $c g e t o l$ too small since the convergence criterion may then have become too strict for the machine to handle.

If $c g e t o l$ has been set to a value which is less than the square root of the machine precision, $ϵ$ , then binary will use the value $\sqrt{ϵ}$ instead.

chisqrbool

If $c h i s q r$ is set equal to $T r u e$ , a likelihood ratio statistic will be calculated (see $c h i$ ).

If $c h i s q r$ is set equal to $F a l s e$ , no such statistic will be calculated.

iprintint, optional

The frequency with which the maximum likelihood search function is to be monitored.

$i p r i n t > 0$

The search is monitored once every $i p r i n t$ iterations, and when the number of quadrature points is increased, and again at the final solution point.

$i p r i n t = 0$

The search is monitored once at the final point.

$i p r i n t < 0$

The search is not monitored at all.

$i p r i n t$ should normally be set to a small positive number.

maxitint, optional

The maximum number of iterations to be made in the maximum likelihood search. There will be an error exit (see Exceptions) if the search function has not converged in $m a x i t$ iterations.

ishowint, optional

Indicates which of the following three quantities are to be printed before exit from the function (given a valid parameter set):

Table of maximum likelihood estimates and standard errors (as returned in the output arrays $a$ , $c$ , $a l p h a$ , $p i g a m$ and $c m$ ).
Table of observed and expected first - and second-order margins (as returned in the output arrays $e x p p$ and $o b s$ ).
Table of observed and expected frequencies of score patterns along with theta scores (as returned in the output arrays $i r l$ , $e x f$ , $y$ , $x l$ and $i o b$ ) and the likelihood ratio statistic (if required).

$i s h o w = 0$

None of the above are printed.

$i s h o w = 1$

(a) only is printed.

$i s h o w = 2$

(b) only is printed.

$i s h o w = 3$

(c) only is printed.

$i s h o w = 4$

(a) and (b) are printed.

$i s h o w = 5$

(a) and (c) are printed.

$i s h o w = 6$

(b) and (c) are printed.

$i s h o w = 7$

(a), (b) and (c) are printed.

io_managerFileObjManager, optional

Manager for I/O in this routine.

Returns

xbool, ndarray, shape $(ns, ip)$

Given a valid parameter set then the first $s$ rows of $x$ still contain the $s$ different score patterns. However, the following points should be noted:

If the estimated factor loading for the $j$ th item is negative then that item is re-coded, i.e., $0$ s and $1$ s (or $T r u e$ and $F a l s e$ ) in the $j$ th column of $x$ are interchanged.
The rows of $x$ will be reordered so that the theta scores corresponding to rows of $x$ are in increasing order of magnitude.

irlint, ndarray, shape $(ns)$

Given a valid parameter set then the first $s$ components of $i r l$ are reordered as are the rows of $x$ .

afloat, ndarray, shape $(ip)$

$a [j - 1]$ contains the latest estimate of $α_{j 1}$ , for $j = 1, 2, \dots, p$ . (Because of possible recoding all elements of $a$ will be positive.)

cfloat, ndarray, shape $(ip)$

$c [j - 1]$ contains the latest estimate of $α_{j 0}$ , for $j = 1, 2, \dots, p$ .

niterint

Given a valid parameter set then $n i t e r$ contains the number of iterations performed by the maximum likelihood search function.

alphafloat, ndarray, shape $(ip)$

Given a valid parameter set then $a l p h a [j - 1]$ contains the latest estimate of $α_{j}$ . (Because of possible recoding all elements of $a l p h a$ will be positive.)

pigamfloat, ndarray, shape $(ip)$

Given a valid parameter set then $p i g a m [j - 1]$ contains the latest estimate of either $π_{j}$ if $g p r o b = F a l s e$ (logit model) or $γ_{j}$ if $g p r o b = T r u e$ (probit model).

cmfloat, ndarray, shape $(2 \times ip, 2 \times ip)$

Given a valid parameter set then the strict lower triangle of $c m$ contains the correlation matrix of the parameter estimates held in $a l p h a$ and $p i g a m$ on exit. The diagonal elements of $c m$ contain the standard errors. Thus:

$c m [2 \times i - 2, 2 \times i - 2]$	=	standard error $(a l p h a [i - 1])$
$c m [2 \times i - 1, 2 \times i - 1]$	=	standard error $(p i g a m [i - 1])$
$c m [2 \times i - 1, 2 \times i - 2]$	=	correlation $(p i g a m [i - 1], a l p h a [i - 1])$ ,

for $i = 1, 2, \dots, p$ ;

$c m [2 \times i - 2, 2 \times j - 2]$	=	correlation $(a l p h a [i - 1], a l p h a [j - 1])$
$c m [2 \times i - 1, 2 \times j - 1]$	=	correlation $(p i g a m [i - 1], p i g a m [j - 1])$
$c m [2 \times i - 2, 2 \times j - 1]$	=	correlation $(a l p h a [i - 1], p i g a m [j - 1])$
$c m [2 \times i - 1, 2 \times j - 2]$	=	correlation $(a l p h a [j - 1], p i g a m [i - 1])$ ,

for $j = 1, 2, \dots, i - 1$ .

If the second derivative matrix cannot be computed then all the elements of $c m$ are returned as zero.

gfloat, ndarray, shape $(2 \times ip)$

Given a valid parameter set then $g$ contains the estimated gradient vector corresponding to the final point held in the arrays $a l p h a$ and $p i g a m$ . $g [2 \times j - 2]$ contains the derivative of the log-likelihood with respect to $a l p h a [j - 1]$ , for $j = 1, 2, \dots, p$ . $g [2 \times j - 1]$ contains the derivative of the log-likelihood with respect to $p i g a m [j - 1]$ , for $j = 1, 2, \dots, p$ .

exppfloat, ndarray, shape $(ip, ip)$

Given a valid parameter set then $e x p p [i - 1, j - 1]$ contains the expected percentage of individuals in the sample who respond positively to items $i$ and $j$ ( $j \leq i$ ), corresponding to the final point held in the arrays $a l p h a$ and $p i g a m$ .

obsfloat, ndarray, shape $(ip, ip)$

Given a valid parameter set then $o b s [i - 1, j - 1]$ contains the observed percentage of individuals in the sample who responded positively to items $i$ and $j$ ( $j \leq i$ ).

exffloat, ndarray, shape $(ns)$

Given a valid parameter set then $e x f [l - 1]$ contains the expected frequency of the $l$ th score pattern ( $l$ th row of $x$ ), corresponding to the final point held in the arrays $a l p h a$ and $p i g a m$ .

yfloat, ndarray, shape $(ns)$

Given a valid parameter set then $y [l - 1]$ contains the estimated theta score corresponding to the $l$ th row of $x$ , for the final point held in the arrays $a l p h a$ and $p i g a m$ .

xlfloat, ndarray, shape $(:)$

If $g p r o b$ has been set equal to $F a l s e$ (logit model) then, given a valid parameter set, $x l [l - 1]$ contains the estimated component score corresponding to the $l$ th row of $x$ for the final point held in the arrays $a l p h a$ and $p i g a m$ .

If $g p r o b$ is set equal to $T r u e$ (probit model), this array is not used.

iobint, ndarray, shape $(ns)$

Given a valid parameter set then $i o b [l - 1]$ contains the number of items in the $l$ th row of $x$ for which the response was positive ( $T r u e$ ).

rloglfloat

Given a valid parameter set then $r l o g l$ contains the value of the log-likelihood kernel corresponding to the final point held in the arrays $a l p h a$ and $p i g a m$ , namely

s - 1 \sum l = 0 i r l [l] \times log (e x f [l] / n) .

chifloat

If $c h i s q r$ was set equal to $T r u e$ on entry, then given a valid parameter set, $c h i$ will contain the value of the likelihood ratio statistic corresponding to the final parameter estimates held in the arrays $a l p h a$ and $p i g a m$ , namely

2 \times s - 1 \sum l = 0 i r l [l] \times log (e x f [l] / i r l [l]) .

The summation is over those elements of $i r l$ which are positive. If $e x f [l - 1]$ is less than $5.0$ , then adjacent score patterns are pooled (the score patterns in $x$ being first put in order of increasing theta score).

If $c h i s q r$ has been set equal to $F a l s e$ , then $c h i$ is not used.

idfint

If $c h i s q r$ was set equal to $T r u e$ on entry, then given a valid parameter set, $i d f$ will contain the degrees of freedom associated with the likelihood ratio statistic, $c h i$ .

$i d f = s_{0} - 2 \times p$	if $s_{0} < 2^{p}$ ;
$i d f = s_{0} - 2 \times p - 1$	if $s_{0} = 2^{p}$ ,

where $s_{0}$ denotes the number of terms summed to calculate $c h i$ ( $s_{0} = s$ only if there is no pooling).

If $c h i s q r$ has been set equal to $F a l s e$ , $i d f$ is not used.

siglevfloat

If $c h i s q r$ was set equal to $T r u e$ on entry, then given a valid parameter set, $s i g l e v$ will contain the significance level of $c h i$ based on $i d f$ degrees of freedom. If $i d f$ is zero or negative then $s i g l e v$ is set to zero.

If $c h i s q r$ was set equal to $F a l s e$ , $s i g l e v$ is not used.

Raises

NagValueError

(errno $1$ )

On entry, $\sum_{i} (i r l [i]) = ⟨ v a l u e ⟩$ and $n = ⟨ v a l u e ⟩$ .

Constraint: $\sum_{i} (i r l [i]) = n$ .

(errno $1$ )

On entry, $i = ⟨ v a l u e ⟩$ and $i r l [i - 1] = ⟨ v a l u e ⟩$ .

Constraint: $i r l [i - 1] \geq 0$ .

(errno $1$ )

On entry, $i = ⟨ v a l u e ⟩$ and $j = ⟨ v a l u e ⟩$ .

Constraint: rows $i$ and $j$ of $x$ should not be identical.

(errno $1$ )

On entry, $ns = ⟨ v a l u e ⟩$ and $ip = ⟨ v a l u e ⟩$ .

Constraint: $ns > 2 \times ip$ .

(errno $1$ )

On entry, $ip = ⟨ v a l u e ⟩$ .

Constraint: $ip \geq 3$ .

(errno $1$ )

On entry, $n = ⟨ v a l u e ⟩$ .

Constraint: $n \geq 7$ .

(errno $1$ )

On entry, $i s h o w = ⟨ v a l u e ⟩$ .

Constraint: $i s h o w = 0$ , $1$ , $2$ , $3$ , $4$ , $5$ , $6$ or $7$ .

(errno $1$ )

On entry, $ns = ⟨ v a l u e ⟩$ and $ip = ⟨ v a l u e ⟩$ .

Constraint: $ns \leq 2^{ip}$ .

(errno $1$ )

On entry, $ns = ⟨ v a l u e ⟩$ and $n = ⟨ v a l u e ⟩$ .

Constraint: $ns \leq n$ .

(errno $1$ )

On entry, $m a x i t = ⟨ v a l u e ⟩$ .

Constraint: $m a x i t \geq 1$ .

(errno $2$ )

For at least one of the $ip$ items the responses are all at the same level.

(errno $3$ )

$m a x i t$ iterations have been performed: $m a x i t = ⟨ v a l u e ⟩$ .

(errno $4$ )

One of the elements of $a$ has exceeded $10$ in absolute.

(errno $5$ )

Failure to invert Hessian matrix and $m a x i t$ iterations made: $m a x i t = ⟨ v a l u e ⟩$ .

(errno $6$ )

Failure to invert Hessian matrix plus Heywood case encountered.

Warns

NagAlgorithmicWarning

(errno $7$ ): $χ^{2}$ statistic has less than one degree of freedom.

Notes

Given a set of $p$ dichotomous variables $~ x = {(x_{1}, x_{2}, \dots, x_{p})}_{1}^{'}$ , where $^{'}$ denotes vector or matrix transpose, the objective is to investigate whether the association between them can be adequately explained by a latent variable model of the form (see Bartholomew (1980) and Bartholomew (1987))

G {π_{i} (θ)} = α_{i 0} + α_{i 1} θ .

The $x_{i}$ are called item responses and take the value $0$ or $1$ . $θ$ denotes the latent variable assumed to have a standard Normal distribution over a population of individuals to be tested on $p$ items. Call $π_{i} (θ) = P (x_{i} = 1 | θ)$ the item response function: it represents the probability that an individual with latent ability $θ$ will produce a positive response (1) to item $i$ . $α_{i 0}$ and $α_{i 1}$ are item parameters which can assume any real values. The set of parameters, $α_{i 1}$ , for $i = 1, 2, \dots, p$ , being coefficients of the unobserved variable $θ$ , can be interpreted as ‘factor loadings’.

$G$ is a function selected by you as either $Φ^{- 1}$ or logit, mapping the interval $(0, 1)$ onto the whole real line. Data from a random sample of $n$ individuals takes the form of the matrices $X$ and $R$ defined below:

\begin{matrix} X_{s \times p} = ⎡ ⎢ ⎢ ⎢ ⎢ ⎢ ⎣ \begin{matrix} x_{11} & x_{12} & \dots & x_{1 p} x_{21} & x_{22} & \dots & x_{2 p} ⋮ & ⋮ & ⋮ x_{s 1} & x_{s 2} & \dots & x_{s p} \end{matrix} ⎤ ⎥ ⎥ ⎥ ⎥ ⎥ ⎦ = ⎡ ⎢ ⎢ ⎢ ⎢ ⎣ \begin{matrix} {~ x}_{1} {~ x}_{2} ⋮ {~ x}_{s} \end{matrix} ⎤ ⎥ ⎥ ⎥ ⎥ ⎦, R_{s \times 1} = ⎡ ⎢ ⎢ ⎢ ⎢ ⎣ \begin{matrix} r_{1} r_{2} ⋮ r_{s} \end{matrix} ⎤ ⎥ ⎥ ⎥ ⎥ ⎦ \end{matrix}

where ${~ x}_{l} = (x_{l 1}, x_{l 2}, \dots, x_{l p})$ denotes the $l$ th score pattern in the sample, $r_{l}$ the frequency with which ${~ x}_{l}$ occurs and $s$ the number of different score patterns observed. (Thus $\sum_{l = 1}^{s} r_{l} = n$ ). It can be shown that the log-likelihood function is proportional to

s \sum l = 1 r_{l} log (P_{l}),

where

P_{l} = P (~ x = {~ x}_{l}) = \int_{- \infty}^{\infty} P (~ x = {~ x}_{l} | θ) ϕ (θ) d θ

( $ϕ (θ)$ being the probability density function of a standard Normal random variable).

$P_{l}$ denotes the unconditional probability of observing score pattern ${~ x}_{l}$ . The integral in (2) is approximated using Gauss–Hermite quadrature. If we take $G (z) = l o g i t (z) = log (\frac{z}{1 - z})$ in (1) and reparameterise as follows,

\begin{matrix} \begin{matrix} α_{i} & = & α_{i 1}, π_{i} & = & {l o g i t}^{- 1} (α_{i 0}), \end{matrix} \end{matrix}

then (1) reduces to the logit model (see Bartholomew (1980))

π_{i} (θ) = \frac{π_{i}}{π_{i} + (1 - π_{i}) e x p (- α_{i} θ)} .

If we take $G (z) = Φ^{- 1} (z)$ (where $Φ$ is the cumulative distribution function of a standard Normal random variable) and reparameterise as follows,

\begin{matrix} \begin{matrix} α_{i} & = & \frac{α_{i 1}}{\sqrt{(1 + α_{i 1}^{2})}} γ_{i} & = & \frac{- α_{i 0}}{\sqrt{(1 + α_{i 1}^{2})}} \end{matrix}, \end{matrix}

then (1) reduces to the probit model (see Bock and Aitkin (1981))

π_{i} (θ) = ϕ ⎛ ⎜ ⎜ ⎝ \frac{α_{i} θ - γ_{i}}{\sqrt{(1 - α_{i}^{2})}} ⎞ ⎟ ⎟ ⎠ .

An E-M algorithm (see Bock and Aitkin (1981)) is used to maximize the log-likelihood function. The number of quadrature points used is set initially to $10$ and once convergence is attained increased to $20$ .

The theta score of an individual responding in score pattern ${~ x}_{l}$ is computed as the posterior mean, i.e., $E (θ | {~ x}_{l})$ . For the logit model the component score $X_{l} = \sum_{j = 1}^{p} α_{j} x_{l j}$ is also calculated. (Note that in calculating the theta scores and measures of goodness-of-fit binary automatically reverses the coding on item $j$ if $α_{j} < 0$ ; it is assumed in the model that a response at the one level is showing a higher measure of latent ability than a response at the zero level.)

The frequency distribution of score patterns is required as input data. If your data is in the form of individual score patterns (uncounted), then binary_service() may be used to calculate the frequency distribution.

References

Bartholomew, D J, 1980, Factor analysis for categorical data (with Discussion), J. Roy. Statist. Soc. Ser. B (42), 293–321

Bartholomew, D J, 1987, Latent Variable Models and Factor Analysis, Griffin

Bock, R D and Aitkin, M, 1981, Marginal maximum likelihood estimation of item parameters: Application of an E-M algorithm, Psychometrika (46), 443–459

NAG and Python

Return to Front

naginterfaces.library.contab.binary¶