NAG Library Routine Document
H02BUF
1 Purpose
H02BUF reads data for a linear or integer programming problem from an external file which is in standard or compatible MPSX input format.
2 Specification
SUBROUTINE H02BUF ( 
INFILE, MAXN, MAXM, OPTIM, XBLDEF, XBUDEF, NMOBJ, NMRHS, NMRNG, NMBND, MPSLST, N, M, A, BL, BU, CVEC, X, INTVAR, CRNAME, NMPROB, IWORK, IFAIL) 
INTEGER 
INFILE, MAXN, MAXM, N, M, INTVAR(MAXN), IWORK(MAXN+MAXM), IFAIL 
REAL (KIND=nag_wp) 
XBLDEF, XBUDEF, A(MAXM,MAXN), BL(MAXN+MAXM), BU(MAXN+MAXM), CVEC(MAXN), X(MAXN) 
LOGICAL 
MPSLST 
CHARACTER(3) 
OPTIM 
CHARACTER(8) 
NMOBJ, NMRHS, NMRNG, NMBND, CRNAME(MAXN+MAXM), NMPROB 

3 Description
H02BUF reads linear programming (LP) or integer programming (IP) problem data from an external file which is prepared in standard or compatible MPSX (see
IBM (1971)) input format and then initializes
$n$ (the number of variables),
$m$ (the number of general linear constraints), the vectors
$c$,
$l$ and
$u$ and the
$m$ by
$n$ matrix
$A$ for use with
E04MFF/E04MFA or
H02BBF, which are designed to solve problems of the form
H02BUF may be followed by calls to either
E04MFF/E04MFA (to solve an LP problem) or
H02BBF and
H02BZF (to solve an IP problem), possibly followed by a call to
H02BVF (to print the solution using MPSX names).
Note that H02BUF uses an ‘infinite’ bound size of
${10}^{20}$ in the definition of
$l$ and
$u$. In other words, any element of
$u$ greater than or equal to
${10}^{20}$ will be regarded as
$+\infty $ (and similarly any element of
$l$ less than or equal to
${10}^{20}$ will be regarded as
$\infty $). If this value is deemed to be ‘inappropriate’, you are recommended to reset the value of either the optional parameter
Infinite Bound Size (if an LP problem is being solved) or the parameter BIGBND (if an IP problem is being solved) and make any necessary changes to
BL and/or
BU prior to calling
E04MFF/E04MFA or
H02BBF (as appropriate).
The documents for
E04MFF/E04MFA,
H02BVF and/or
H02BBF and
H02BZF should be consulted for further details.
MPSX input format
The input file of data may only contain two types of lines.
 Indicator lines (specifying the type of data which is to follow).
 Data lines (specifying the actual data).
The input file must not contain any blank lines. Any characters beyond column 80 are ignored. Indicator lines must not contain leading blank characters (in other words they must begin in column 1). The following displays the order in which the indicator lines must appear in the file:
NAME 
usergiven name 
ROWS 
data line(s) 
COLUMNS 
data line(s) 
RHS 
data line(s) 
RANGES (optional) 
data line(s) 
BOUNDS (optional) 
data line(s) 
ENDATA 

The ‘usergiven name’ specifies a name for the problem and must occupy columns 15–22. The name can either be blank or up to a maximum of $8$ characters.
A data line follows the same fixed format made up of fields defined below. The contents of the fields may have different significance depending upon the section of data in which they appear.


Field 1 
Field 2 
Field 3 
Field 4 
Field 5 
Field 6 
Columns 
$2\u20133$ 
$5\u201312$ 
$15\u201322$ 
$25\u201336$ 
$40\u201347$ 
$50\u201361$ 
Contents 
Code 
Name 
Name 
Value 
Name 
Value 
The names and codes consist of ‘alphanumeric’ characters (i.e., a–z, A–Z,
$0$–9,
$+$,
$$, asterisk (*), blank ( ), colon (:), dollar sign ($) or fullstop (.) only) and the names must not contain leading blank characters. Values are read using Fortran format E12.0. This allows values to be entered in several equivalent forms. For example,
$1.2345678$,
$1.2345678E+0$,
$\text{123.45678E\u22122}$ and
$\text{12345678E\u221207}$ all represent the same number. It is safest to include an explicit decimal point.
Note that in order to ensure numeric values are interpreted as intended, they should be
rightjustified in the
$12$character field, with no trailing blanks. This is because in some situations trailing blanks may be interpreted as zeros and this can dramatically affect the interpretation of the value. This is relevant if the value contains an exponent, or if it contains neither an exponent nor an explicit decimal point. For example, the fields
%%%%1.23E2%
%%%%%%%123%%
may be interpreted as
$\text{1.23E\u221220}$ and
$12300$ respectively (where
% is used to denote a blank). The actual behaviour is systemdependent.
Comment lines are allowed in the data file. These must have an asterisk (*) in column 1 and any characters in columns 2–80. In any data line, a dollar sign ($) as the first character in Field 3 or 5 indicates that the information from that point through column 80 consists of comments.
Columns outside the six fields must be blank, except for columns 72–80, whose contents are ignored by the routine. These columns may be used to enter a sequence number. A nonblank character outside the predefined six fields and columns 72–80 is considered to be a major error (
${\mathbf{IFAIL}}={\mathbf{11}}$; see
Section 6), unless it is part of a comment.
ROWS data line(s)
These lines specify row (constraint) names and their inequality types (i.e.,
$=$,
$\ge $ or
$\le $).
Field 1: 
defines the constraint type. It may be in column 2 or column 3. 
N 
free row, that is no constraint. It may be used to define the objective row. 
G 
greater than or equal to (i.e., $\ge $). 
L 
less than or equal to (i.e., $\le $). 
E 
exactly equal to (i.e., $=$). 
Field 2: 
defines the row name. 
Row type
N stands for ‘Not binding’, also known as ‘Free’. It can be used to define the objective row. The objective row is a free row that specifies the vector
$c$ in the objective function. It is taken to be the first free row, unless some other free row name is specified by the parameter
NMOBJ (see
Section 5). Note that the objective function must be included in the MPSX data file. Thus the maximum number of constraints (
MAXM; see
Section 5) in the problem must be
$m+1$.
COLUMNS data line(s)
These lines specify the names to be assigned to the variables (columns) in the constraint matrix
$A$, and define, in terms of column vectors, the actual values of the corresponding matrix elements.

Field 1: blank (ignored) 

Field 2: gives the name of the column associated with the elements specified in the following fields. 

Field 3: contains the name of a row. 

Field 4: used in conjunction with Field 3 contains the value of the matrix element. 

Field 5: is optional (may be used like Field 3). 

Field 6: is optional (may be used like Field 4). 
Note that only nonzero elements of $A$ need to be specified in the COLUMNS section, as any unspecified elements are assumed to be zero.
RHS data line(s)
This section specifies the righthand side values of the constraint matrix $A$. The lines specify the name of the RHS (righthand side) vector given to the problem, the numerical values of the elements of the vector are also defined by the data lines and may appear in any order. The data lines have exactly the same format as the COLUMNS data lines, except that the column name is replaced by the RHS name. Note that any unspecified elements are assumed to be zero.
RANGES data line(s) (optional)
Ranges are used for constraints of the form $l\le Ax\le u$, where $l$ and $u$ are finite. The range of the constraint is $r=ul$. Either $l$ or $u$ must be specified in the RHS section and $r$ must be defined in this section.
The data lines have exactly the same format as the COLUMNS data lines, except that the column name is replaced by the RANGES name.
BOUNDS data line(s) (optional)
These lines specify limits on the values of the variables (
$l$ and
$u$ in
$l\le x\le u$). If the variable is not specified in the bound set then it is automatically assumed to lie between default lower and upper bounds (usually
$0$ and
$+\infty $). Like an RHS column which is given a name, the set of variables in one bound set is also given a name.
Field 1: 
specifies the type of bound or defines the variable type. 
LO 
lower bound 
UP 
upper bound 
FX 
fixed variable 
FR 
free variable ($\infty $ to $+\infty $) 
MI 
lower bound is $\infty $ 
PL 
upper bound is $+\infty $. This is the default variable type. 
Field 2: 
identifies a name for the bound set. 
Field 3: 
identifies the column name of the variable belonging to this set. 
Field 4: 
identifies the value of the bound; this has a numerical value only in association with LO, UP, FX in Field 1, otherwise it is blank. 
Field 5: 
is blank and ignored. 
Field 6: 
is blank and ignored. 
Note that if RANGES and BOUNDS sections are both present, the RANGES section must appear first.
Integer Problems
In IP problems there are two common integer variable types.
 $0$–1 integer variables which represent ‘on’ or ‘off’ situations and
 General integer variables which are forced to take an integer value, in a specified range, at the optimal integer solution.
Integer variables can be defined in the following compatible and standard MPSX forms.
In the compatible MPSX format, the type of integer variables is defined in Field 1 of the BOUNDS section, that is:
Field 1: 
specifies the type of the integer variable. 
BV 
$0\u20131$ integer variable (bound value is $1.0$). 
UI 
general integer variable (bound value is in Field 4). 
In the standard MPSX format, the integer variables are treated the same as the ‘ordinary’ bounded variables, in the BOUNDS section. Integer markers are, however, introduced in the COLUMNS section to specify the integer variables. The indicator lines for these markers are:


Field 1 
Field 2 
Field 3 
Field 4 
Field 5 
Field 6 
Columns 
$2\u20133$ 
$5\u201312$ 
$15\u201322$ 
$25\u201336$ 
$40\u201347$ 
$50\u201361$ 
Contents 

INTEGER 
‘MARKER’ 

‘INTORG’ 

to mark the beginning of the integer variables and


Field 1 
Field 2 
Field 3 
Field 4 
Field 5 
Field 6 
Columns 
$2\u20133$ 
$5\u201312$ 
$15\u201322$ 
$25\u201336$ 
$40\u201347$ 
$50\u201361$ 
Contents 

INTEGER 
‘MARKER’ 

‘INTEND’ 

to mark the end. That is, any variables between these markers are treated as integer variables. Note that if the (INTEND) indicator line is not specified in the file then all the variables between the (INTORG) indicator line and the end of the COLUMNS section are assumed to be integer variables. The routine accepts both standard and/or compatible MPSX format as a means of specifying integer variables. This is illustrated in
Section 9.2 in H02BFF.
4 References
IBM (1971) MPSX – Mathematical programming system Program Number 5734 XM4 IBM Trade Corporation, New York
5 Parameters
 1: INFILE – INTEGERInput
On entry: the unit number associated with the MPSX data file.
Constraint:
$0\le {\mathbf{INFILE}}\le 99$.
 2: MAXN – INTEGERInput
On entry: an upper limit for the number of variables in the problem.
Constraint:
${\mathbf{MAXN}}\ge 1$.
 3: MAXM – INTEGERInput
On entry: an upper limit for the number of constraints (including the objective) in the problem.
Constraint:
${\mathbf{MAXM}}\ge 1$.
 4: OPTIM – CHARACTER(3)Input
On entry: specifies the direction of the optimization.
OPTIM must be set to 'MIN' for minimization and to 'MAX' for maximization.
Constraint:
${\mathbf{OPTIM}}=\text{'MIN'}$ or $\text{'MAX'}$.
 5: XBLDEF – REAL (KIND=nag_wp)Input
On entry: the default lower bound to be used for the variables in the problem when none is specified in the BOUNDS section of the MPSX data file. For a standard LP or IP problem
XBLDEF would normally be set to zero.
 6: XBUDEF – REAL (KIND=nag_wp)Input
On entry: the default upper bound to be used for the variables in the problem when none is specified in the BOUNDS section of the MPSX data file. For a standard LP or IP problem
XBUDEF would normally be set to ‘infinity’ (i.e.,
${\mathbf{XBUDEF}}\ge {10}^{20}$).
Constraint:
${\mathbf{XBUDEF}}\ge {\mathbf{XBLDEF}}$.
 7: NMOBJ – CHARACTER(8)Input/Output
On entry: either the name of the objective function to be used for the optimization, or blank (in which case the first objective (free) row in the file is used).
On exit: the name of the objective row as defined in the MPSX data file.
 8: NMRHS – CHARACTER(8)Input/Output
On entry: either the name of the RHS set to be used for the optimization, or blank (in which case the first RHS set is used).
On exit: the name of the RHS set read in the MPSX data file.
 9: NMRNG – CHARACTER(8)Input/Output
On entry: either the name of the RANGE set to be used for the optimization, or blank (in which case the first RANGE set (if any) is used).
On exit: the name of the RANGE set read in the MPSX data file. This is blank if the MPSX data file does not have a RANGE set.
 10: NMBND – CHARACTER(8)Input/Output
On entry: either the name of the BOUNDS set to be used for the optimization, or blank (in which case the first BOUNDS set (if any) is used).
On exit: the name of the BOUNDS set read in the MPSX data file. This is blank if the MPSX data file does not have a BOUNDS set.
 11: MPSLST – LOGICALInput
On entry: if
${\mathbf{MPSLST}}=\mathrm{.TRUE.}$, then a listing of the input data is sent to the current advisory message unit (as defined by
X04ABF). This can be useful for debugging the MPSX data file.
 12: N – INTEGEROutput
On exit: $n$, the actual number of variables in the problem.
 13: M – INTEGEROutput
On exit: $m$, the actual number of general linear constraints in the problem.
 14: A(MAXM,MAXN) – REAL (KIND=nag_wp) arrayOutput
On exit: $A$, the matrix of general linear constraints.
 15: BL(${\mathbf{MAXN}}+{\mathbf{MAXM}}$) – REAL (KIND=nag_wp) arrayOutput
On exit:
$l$, the lower bounds for all the variables and constraints in the following order. The first
N elements of
BL contain the bounds on the variables and the next
M elements contain the bounds for the general linear constraints (if any). Note that an ‘infinite’ lower bound is indicated by
${\mathbf{BL}}\left(j\right)=\text{1.0E+20}$ and an equality constraint by
${\mathbf{BL}}\left(j\right)={\mathbf{BU}}\left(j\right)$.
 16: BU(${\mathbf{MAXN}}+{\mathbf{MAXM}}$) – REAL (KIND=nag_wp) arrayOutput
On exit:
$u$, the upper bounds for all the variables and constraints in the following order. The first
N elements of
BU contain the bounds on the variables and the next
M elements contain the bounds for the general linear constraints (if any). Note that an ‘infinite’ upper bound is indicated by
${\mathbf{BU}}\left(j\right)=\text{1.0E+20}$ and an equality constraint by
${\mathbf{BU}}\left(j\right)={\mathbf{BL}}\left(j\right)$.
 17: CVEC(MAXN) – REAL (KIND=nag_wp) arrayOutput
On exit:
$c$, the coefficients of the objective function. The signs of these coefficients are determined by the problem (either LP or IP) and the direction of the optimization (see
OPTIM above).
 18: X(MAXN) – REAL (KIND=nag_wp) arrayOutput
On exit: an initial estimate of the solution to the problem. More precisely,
${\mathbf{X}}\left(\mathit{j}\right)=1.0$ if $\mathit{j}$ is odd and $0.0$ otherwise, for $\mathit{j}=1,2,\dots ,{\mathbf{N}}$.
 19: INTVAR(MAXN) – INTEGER arrayOutput
On exit: indicates which are the integer variables in the problem. More precisely,
${\mathbf{INTVAR}}\left(\mathit{k}\right)=1$ if ${x}_{\mathit{k}}$ is an integer variable, and $0$ otherwise, for $\mathit{k}=1,2,\dots ,{\mathbf{N}}$.
 20: CRNAME(${\mathbf{MAXN}}+{\mathbf{MAXM}}$) – CHARACTER(8) arrayOutput
On exit: the MPSX names of all the variables and constraints in the problem in the following order. The first
N elements contain the MPSX names for the variables and the next
M elements contain the MPSX names for the general linear constraints (if any).
 21: NMPROB – CHARACTER(8)Output
On exit: the name of the problem as defined in the MPSX data file.
 22: IWORK(${\mathbf{MAXN}}+{\mathbf{MAXM}}$) – INTEGER arrayWorkspace
 23: IFAIL – INTEGERInput/Output

On entry:
IFAIL must be set to
$0$,
$1\text{ or}1$. If you are unfamiliar with this parameter you should refer to
Section 3.3 in the Essential Introduction for details.
For environments where it might be inappropriate to halt program execution when an error is detected, the value
$1\text{ or}1$ is recommended. If the output of error messages is undesirable, then the value
$1$ is recommended. Otherwise, if you are not familiar with this parameter, the recommended value is
$0$.
When the value $\mathbf{1}\text{ or}\mathbf{1}$ is used it is essential to test the value of IFAIL on exit.
On exit:
${\mathbf{IFAIL}}={\mathbf{0}}$ unless the routine detects an error or a warning has been flagged (see
Section 6).
6 Error Indicators and Warnings
If on entry
${\mathbf{IFAIL}}={\mathbf{0}}$ or
${{\mathbf{1}}}$, explanatory error messages are output on the current error message unit (as defined by
X04AAF).
Errors or warnings detected by the routine:
${\mathbf{IFAIL}}={\mathbf{3}}$ to
${\mathbf{IFAIL}}={\mathbf{15}}$ (apart from
${\mathbf{IFAIL}}={\mathbf{14}}$) are caused by having either a corrupt or a nonstandard MPSX data file. Refer to
Section 3 for a detailed description of the MPSX format which can be read by H02BUF. If
${\mathbf{MPSLST}}=\mathrm{.TRUE.}$, the last line of printed output refers to the line in the MPSX data file which contains the reported error.
 ${\mathbf{IFAIL}}=1$
There are too many rows present in the data file. Increase
MAXM by at least (
${\mathbf{M}}{\mathbf{MAXM}}$) and rerun H02BUF.
 ${\mathbf{IFAIL}}=2$
There are too many columns present in the data file. Increase
MAXN by at least (
${\mathbf{N}}{\mathbf{MAXN}}$) and rerun H02BUF.
 ${\mathbf{IFAIL}}=3$
The objective function row was not found. There must be at least one row in the ROWS section with row type
N for the objective row.
 ${\mathbf{IFAIL}}=4$
There are no rows specified in the ROWS section.
 ${\mathbf{IFAIL}}=5$
An illegal constraint type was detected in the ROWS section. The constraint type must be one of N, L, G or E.
 ${\mathbf{IFAIL}}=6$
An illegal row name was detected in the ROWS section. Names must be made up of alphanumeric characters with no leading blanks.
 ${\mathbf{IFAIL}}=7$
An illegal column name was detected in the COLUMNS section. Names must be made up of alphanumeric characters with no leading blanks.
 ${\mathbf{IFAIL}}=8$
An illegal bound type was detected in the BOUNDS section. The bound type must be one of LO, UP, FX, FR, MI, PL, BV or UI.
 ${\mathbf{IFAIL}}=9$
An unknown column name was detected in the BOUNDS section. All the column names must be specified in the COLUMNS section.
 ${\mathbf{IFAIL}}=10$
The last line in the file does not contain the ENDATA line indicator.
 ${\mathbf{IFAIL}}=11$
An illegal data line was detected in the file. This line is neither a comment line nor a valid data line.
 ${\mathbf{IFAIL}}=12$
An unknown row name was detected in COLUMNS or RHS or RANGES section. All the row names must be specified in the ROWS section.
 ${\mathbf{IFAIL}}=13$
There were no columns specified in the COLUMNS section.
 ${\mathbf{IFAIL}}=14$

An input parameter is invalid.
 ${\mathbf{IFAIL}}=15$
Incorrect integer marker. In standard MPSX data format, integer variables should be defined between INTORG and INTEND markers.
7 Accuracy
Not applicable.
None.
9 Example
This example solves the same problem as the example for
H02BFF, except that it treats it as an LP problem.
One of the applications of linear programming is to the socalled diet problem. Given the nutritional content of a selection of foods, the cost of each food, the amount available of each food and the consumer's minimum daily nutritonal requirements, the problem is to find the cheapest combination. This gives rise to the following problem:
minimize
subject to
where
and
The rows of $A$ correspond to energy, protein and calcium and the columns of $A$ correspond to oatmeal, chicken, eggs, milk, pie and bacon respectively.
The MPSX representation of the problem is given in
Section 9.2.
9.1 Program Text
Program Text (h02bufe.f90)
9.2 Program Data
Program Options (h02bufe.opt)
9.3 Program Results
Program Results (h02bufe.r)