g01 Chapter Contents
g01 Chapter Introduction
NAG C Library Manual

# NAG Library Function Documentnag_prob_hypergeom_vector (g01slc)

## 1  Purpose

nag_prob_hypergeom_vector (g01slc) returns a number of the lower tail, upper tail and point probabilities for the hypergeometric distribution.

## 2  Specification

 #include #include
 void nag_prob_hypergeom_vector (Integer ln, const Integer n[], Integer ll, const Integer l[], Integer lm, const Integer m[], Integer lk, const Integer k[], double plek[], double pgtk[], double peqk[], Integer ivalid[], NagError *fail)

## 3  Description

Let $X=\left\{{X}_{i}:i=1,2,\dots ,r\right\}$ denote a vector of random variables having a hypergeometric distribution with parameters ${n}_{i}$, ${l}_{i}$ and ${m}_{i}$. Then
 $Prob Xi = ki = mi ki ni - mi li - ki ni li ,$
where $\mathrm{max}\phantom{\rule{0.125em}{0ex}}\left(0,{l}_{i}+{m}_{i}-{n}_{i}\right)\le {k}_{i}\le \mathrm{min}\phantom{\rule{0.125em}{0ex}}\left({l}_{i},{m}_{i}\right)$, $0\le {l}_{i}\le {n}_{i}$ and $0\le {m}_{i}\le {n}_{i}$.
The hypergeometric distribution may arise if in a population of size ${n}_{i}$ a number ${m}_{i}$ are marked. From this population a sample of size ${l}_{i}$ is drawn and of these ${k}_{i}$ are observed to be marked.
The mean of the distribution $\text{}=\frac{{l}_{i}{m}_{i}}{{n}_{i}}$, and the variance $\text{}=\frac{{l}_{i}{m}_{i}\left({n}_{i}-{l}_{i}\right)\left({n}_{i}-{m}_{i}\right)}{{{n}_{i}}^{2}\left({n}_{i}-1\right)}$.
nag_prob_hypergeom_vector (g01slc) computes for given ${n}_{i}$, ${l}_{i}$, ${m}_{i}$ and ${k}_{i}$ the probabilities: $\mathrm{Prob}\left\{{X}_{i}\le {k}_{i}\right\}$, $\mathrm{Prob}\left\{{X}_{i}>{k}_{i}\right\}$ and $\mathrm{Prob}\left\{{X}_{i}={k}_{i}\right\}$ using an algorithm similar to that described in Knüsel (1986) for the Poisson distribution.
The input arrays to this function are designed to allow maximum flexibility in the supply of vector arguments by re-using elements of any arrays that are shorter than the total number of evaluations required. See Section 2.6 in the g01 Chapter Introduction for further information.

## 4  References

Knüsel L (1986) Computation of the chi-square and Poisson distribution SIAM J. Sci. Statist. Comput. 7 1022–1036

## 5  Arguments

1:     lnIntegerInput
On entry: the length of the array n
Constraint: ${\mathbf{ln}}>0$.
2:     n[ln]const IntegerInput
On entry: ${n}_{i}$, the parameter of the hypergeometric distribution with ${n}_{i}={\mathbf{n}}\left[j\right]$, , for $i=1,2,\dots ,\mathrm{max}\phantom{\rule{0.125em}{0ex}}\left({\mathbf{ln}},{\mathbf{ll}},{\mathbf{lm}},{\mathbf{lk}}\right)$.
Constraint: ${\mathbf{n}}\left[\mathit{j}-1\right]\ge 0$, for $\mathit{j}=1,2,\dots ,{\mathbf{ln}}$.
3:     llIntegerInput
On entry: the length of the array l
Constraint: ${\mathbf{ll}}>0$.
4:     l[ll]const IntegerInput
On entry: ${l}_{i}$, the parameter of the hypergeometric distribution with ${l}_{i}={\mathbf{l}}\left[j\right]$, .
Constraint: $0\le {l}_{i}\le {n}_{i}$.
5:     lmIntegerInput
On entry: the length of the array m
Constraint: ${\mathbf{lm}}>0$.
6:     m[lm]const IntegerInput
On entry: ${m}_{i}$, the parameter of the hypergeometric distribution with ${m}_{i}={\mathbf{m}}\left[j\right]$, .
Constraint: $0\le {m}_{i}\le {n}_{i}$.
7:     lkIntegerInput
On entry: the length of the array k
Constraint: ${\mathbf{lk}}>0$.
8:     k[lk]const IntegerInput
On entry: ${k}_{i}$, the integer which defines the required probabilities with ${k}_{i}={\mathbf{k}}\left[j\right]$, .
Constraint: $\mathrm{max}\phantom{\rule{0.125em}{0ex}}\left(0,{l}_{i}+{m}_{i}-{n}_{i}\right)\le {k}_{i}\le \mathrm{min}\phantom{\rule{0.125em}{0ex}}\left({l}_{i},{m}_{i}\right)$.
9:     plek[$\mathit{dim}$]doubleOutput
Note: the dimension, dim, of the array plek must be at least $\mathrm{max}\phantom{\rule{0.125em}{0ex}}\left({\mathbf{ln}},{\mathbf{ll}},{\mathbf{lm}},{\mathbf{lk}}\right)$.
On exit: $\mathrm{Prob}\left\{{X}_{i}\le {k}_{i}\right\}$, the lower tail probabilities.
10:   pgtk[$\mathit{dim}$]doubleOutput
Note: the dimension, dim, of the array pgtk must be at least $\mathrm{max}\phantom{\rule{0.125em}{0ex}}\left({\mathbf{ln}},{\mathbf{ll}},{\mathbf{lm}},{\mathbf{lk}}\right)$.
On exit: $\mathrm{Prob}\left\{{X}_{i}>{k}_{i}\right\}$, the upper tail probabilities.
11:   peqk[$\mathit{dim}$]doubleOutput
Note: the dimension, dim, of the array peqk must be at least $\mathrm{max}\phantom{\rule{0.125em}{0ex}}\left({\mathbf{ln}},{\mathbf{ll}},{\mathbf{lm}},{\mathbf{lk}}\right)$.
On exit: $\mathrm{Prob}\left\{{X}_{i}={k}_{i}\right\}$, the point probabilities.
12:   ivalid[$\mathit{dim}$]IntegerOutput
Note: the dimension, dim, of the array ivalid must be at least $\mathrm{max}\phantom{\rule{0.125em}{0ex}}\left({\mathbf{ln}},{\mathbf{ll}},{\mathbf{lm}},{\mathbf{lk}}\right)$.
On exit: ${\mathbf{ivalid}}\left[i-1\right]$ indicates any errors with the input arguments, with
${\mathbf{ivalid}}\left[i-1\right]=0$
No error.
${\mathbf{ivalid}}\left[i-1\right]=1$
 On entry, ${n}_{i}<0$.
${\mathbf{ivalid}}\left[i-1\right]=2$
 On entry, ${l}_{i}<0$, or ${l}_{i}>{n}_{i}$.
${\mathbf{ivalid}}\left[i-1\right]=3$
 On entry, ${m}_{i}<0$, or ${m}_{i}>{n}_{i}$.
${\mathbf{ivalid}}\left[i-1\right]=4$
 On entry, ${k}_{i}<0$, or ${k}_{i}>{l}_{i}$, or ${k}_{i}>{m}_{i}$, or ${k}_{i}<{l}_{i}+{m}_{i}-{n}_{i}$.
${\mathbf{ivalid}}\left[i-1\right]=5$
 On entry, ${n}_{i}$ is too large to be represented exactly as a real number.
${\mathbf{ivalid}}\left[i-1\right]=6$
 On entry, the variance (see Section 3) exceeds ${10}^{6}$.
13:   failNagError *Input/Output
The NAG error argument (see Section 3.6 in the Essential Introduction).

## 6  Error Indicators and Warnings

NE_ALLOC_FAIL
Dynamic memory allocation failed.
NE_ARRAY_SIZE
On entry, $\text{array size}=〈\mathit{\text{value}}〉$.
Constraint: ${\mathbf{lk}}>0$.
On entry, $\text{array size}=〈\mathit{\text{value}}〉$.
Constraint: ${\mathbf{ll}}>0$.
On entry, $\text{array size}=〈\mathit{\text{value}}〉$.
Constraint: ${\mathbf{lm}}>0$.
On entry, $\text{array size}=〈\mathit{\text{value}}〉$.
Constraint: ${\mathbf{ln}}>0$.
On entry, argument $〈\mathit{\text{value}}〉$ had an illegal value.
NE_INTERNAL_ERROR
An internal error has occurred in this function. Check the function call and any array sizes. If the call is correct then please contact NAG for assistance.
NW_IVALID
On entry, at least one value of n, l, m or k was invalid, or the variance was too large.

## 7  Accuracy

Results are correct to a relative accuracy of at least ${10}^{-6}$ on machines with a precision of $9$ or more decimal digits (provided that the results do not underflow to zero).

The time taken by nag_prob_hypergeom_vector (g01slc) to calculate each probability depends on the variance (see Section 3) and on ${k}_{i}$. For given variance, the time is greatest when ${k}_{i}\approx {l}_{i}{m}_{i}/{n}_{i}$ ($=$ the mean), and is then approximately proportional to the square-root of the variance.

## 9  Example

This example reads a vector of values for $n$, $l$, $m$ and $k$, and prints the corresponding probabilities.

### 9.1  Program Text

Program Text (g01slce.c)

### 9.2  Program Data

Program Data (g01slce.d)

### 9.3  Program Results

Program Results (g01slce.r)