g01 Chapter Contents
g01 Chapter Introduction
NAG Library Manual

# NAG Library Function Documentnag_summary_stats_onevar_combine (g01auc)

## 1  Purpose

nag_summary_stats_onevar_combine (g01auc) combines sets of summaries produced by nag_summary_stats_onevar (g01atc).

## 2  Specification

 #include #include
 void nag_summary_stats_onevar_combine (Integer b, const double mrcomm[], Integer *pn, double *xmean, double *xsd, double *xskew, double *xkurt, double *xmin, double *xmax, double rcomm[], NagError *fail)

## 3  Description

Assume a dataset containing $n$ observations, denoted by $x=\left\{{x}_{i}:i=1,2,\dots ,n\right\}$ and a set of weights, $w=\left\{{w}_{i}:i=1,2,\dots ,n\right\}$, has been split into $b$ blocks, and each block summarised via a call to nag_summary_stats_onevar (g01atc). Then nag_summary_stats_onevar_combine (g01auc) takes the $b$ communication arrays returned by nag_summary_stats_onevar (g01atc) and returns the mean ($\stackrel{-}{x}$), standard deviation (${s}_{2}$), coefficients of skewness (${s}_{3}$) and kurtosis (${s}_{4}$), and the maximum and minimum values for the whole dataset.
For a definition of $\stackrel{-}{x},{s}_{2},{s}_{3}$ and ${s}_{4}$ see Section 3 in nag_summary_stats_onevar (g01atc).
West D H D (1979) Updating mean and variance estimates: An improved method Comm. ACM 22 532–555

## 5  Arguments

1:     bIntegerInput
On entry: $b$, the number of blocks the full dataset was split into.
Constraint: ${\mathbf{b}}\ge 1$.
2:     mrcomm[$20×{\mathbf{b}}$]const doubleCommunication Array
Note: where ${\mathbf{MRCOMM}}\left(i,j\right)$ appears in this document, it refers to the array element ${\mathbf{mrcomm}}\left[\left(j-1\right)×20+i-1\right]$.
On entry: the $j$th column of MRCOMM must contain the information returned in rcomm from one of the runs of nag_summary_stats_onevar (g01atc).
3:     pnInteger *Output
On exit: the number of valid observations, that is the number of observations with ${w}_{i}>0$, for $\mathit{i}=1,2,\dots ,n$.
4:     xmeandouble *Output
On exit: $\stackrel{-}{x}$, the mean.
5:     xsddouble *Output
On exit: ${s}_{2}$, the standard deviation.
6:     xskewdouble *Output
On exit: ${s}_{3}$, the coefficient of skewness.
7:     xkurtdouble *Output
On exit: ${s}_{4}$, the coefficient of kurtosis.
8:     xmindouble *Output
On exit: the smallest value.
9:     xmaxdouble *Output
On exit: the largest value.
10:   rcomm[$20$]doubleCommunication Array
On exit: an amalgamation of the information held in mrcomm. This is in the same format as rcomm from nag_summary_stats_onevar (g01atc).
If rcomm is NULL, rcomm is not referenced.
11:   failNagError *Input/Output
The NAG error argument (see Section 3.6 in the Essential Introduction).

## 6  Error Indicators and Warnings

On entry, argument $⟨\mathit{\text{value}}⟩$ had an illegal value.
NE_CASES_ONE
On exit we were unable to calculate xsd, xskew or xkurt. A value of $0$ has been returned.
NE_CASES_ZERO
On entry, the number of valid observations is zero.
NE_ILLEGAL_COMM
On entry, mrcomm is not in the expected format.
NE_INT
On entry, ${\mathbf{b}}=⟨\mathit{\text{value}}⟩$.
Constraint: ${\mathbf{b}}\ge 1$.
NE_INTERNAL_ERROR
An internal error has occurred in this function. Check the function call and any array sizes. If the call is correct then please contact NAG for assistance.
NE_ZERO_VARIANCE
On exit we were unable to calculate xskew or xkurt. A value of $0$ has been returned.

Not applicable.

## 8  Parallelism and Performance

Not applicable.

The order that the $b$ communication arrays are stored in mrcomm is arbitrary. Different orders can lead to slightly different results due to numerical accuracy of floating-point calculations.
Both nag_summary_stats_onevar_combine (g01auc) and nag_summary_stats_onevar (g01atc) consolidate results from multiple summaries. Whereas the former can only be used to combine summaries calculated sequentially, the latter combines summaries calculated in an arbitrary order allowing, for example, summaries calculated on different processing units to be combined.

## 10  Example

This example summarises some simulated data. The data is supplied in three blocks, the first consisting of $21$ observations, the second $51$ observations and the last $28$ observations. Summaries are produced for each block of data separately and then an overall summary is produced.

### 10.1  Program Text

Program Text (g01auce.c)

### 10.2  Program Data

Program Data (g01auce.d)

### 10.3  Program Results

Program Results (g01auce.r)