SAS.STAT 9.1 Users Guide (Vol. 5)

The following statements are available in PROC PRINQUAL.

To use PROC PRINQUAL, you need the PROC PRINQUAL and TRANSFORM statements. You can abbreviate all options and t-options to their first three letters . This is a special feature of PROC PRINQUAL and is not generally true of other SAS/STAT procedures.

The rest of this section provides detailed syntax information for each of the preceding statements, beginning with the PROC PRINQUAL statement. The remaining statements are described in alphabetical order.

PROC PRINQUAL Statement

The PROC PRINQUAL statement starts the PRINQUAL procedure. Optionally, this statement identifies an input data set, creates an output data set, specifies the algorithm and other computational details, and controls displayed output. The following table summarizes options available in the PROC PRINQUAL statement.

Task

Option

Identify input data set

 

specifies input SAS data set

DATA=

Specify details for output data set

 

outputs approximations to transformed variables

APPROXIMATIONS

specifies prefix for approximation variables

APREFIX=

outputs correlations and component structure matrix

CORRELATIONS

specifies a multidimensional preference analysis

MDPREF

specifies output data set

OUT=

specifies prefix for principal component scores variables

PREFIX=

replaces raw data with transformed data

REPLACE

outputs principal component scores

SCORES

standardizes principal component scores

STANDARD

specifies transformation standardization

TSTANDARD=

specifies prefix for transformed variables

TPREFIX=

Control iterative algorithm

 

analyzes covariances

COVARIANCE

initializes using dummy variables

DUMMY

specifies iterative algorithm

METHOD=

specifies number of principal components

N=

suppresses numerical error checking

NOCHECK

specifies number of MGV models before refreshing

REFRESH=

restarts iterations

REITERATE

specifies singularity criterion

SINGULAR=

specifies input observation type

TYPE=

Control the number of iterations

 

specifies minimum criterion change

CCONVERGE=

specifies number of first iteration to be displayed

CHANGE=

specifies minimum data change

CONVERGE=

specifies number of MAC initialization iterations

INITITER=

specifies maximum number of iterations

MAXITER=

Specify details for handling missing values

 

includes monotone special missing values

MONOTONE=

excludes observations with missing values

NOMISS

unties special missing values

UNTIE=

Suppress displayed output

 

suppresses displayed output

NOPRINT

The following list describes these options in alphabetical order.

APREFIX= name

APR= name

APPROXIMATIONS

APPROX

APP

CCONVERGE= n

CCO= n

CHANGE= n

CHA= n

CONVERGE= n

CON= n

COVARIANCE

COV

CORRELATIONS

COR

DATA = SAS-data-set

DUMMY

DUM

INITITER= n

INI= n

MAXITER= n

MAX= n

MDPREF

MDP

METHOD=MAC MGV MTV

MET=MAC MGV MTV

MONOTONE= two-letters

MON= two-letters

N= n

NOCHECK

NOC

NOMISS

NOM

NOPRINT

NOP

OUT= SAS-data-set

PREFIX= name

PRE= name

REFRESH= n

REF= n

REITERATE

REI

REPLACE

REP

SCORES

SCO

SINGULAR= n

SIN= n

STANDARD

STD

TPREFIX= name

TPR= name

TSTANDARD=CENTER NOMISS ORIGINAL Z

TST=CEN NOM ORI Z

TYPE= text name

TYP= text name

UNTIE= two-letters

UNT= two-letters

BY Statement

You can specify a BY statement with PROC PRINQUAL to obtain separate analyses on observations in groups defined by the BY variables. When a BY statement appears, the procedure expects the input data set to be sorted in order of the BY variables.

If your input data set is not sorted in ascending order, use one of the following alternatives:

For more information on the BY statement, refer to the discussion in SAS Language Reference: Concepts . For more information on the DATASETS procedure, refer to the discussion in the SAS Procedures Guide .

FREQ Statement

If one variable in the input data set represents the frequency of occurrence for other values in the observation, list the variable s name in a FREQ statement. PROC PRINQUAL then treats the data set as if each observation appeared n times, where n is the value of the FREQ variable for the observation. Noninteger values of the FREQ variable are truncated to the largest integer less than the FREQ value. The observation is used in the analysis only if the value of the FREQ statement variable is greater than or equal to 1.

ID Statement

The ID statement includes additional character or numeric variables in the output data set. The variables must be contained in the input data set.

TRANSFORM Statement

The TRANSFORM statement lists the variables to be analyzed ( variables ) and specifies the transformation ( transform ) to apply to each variable listed. You must specify a transformation for each variable list in the TRANSFORM statement. The variables are variables in the data set. The t-options are transformation options that provide details for the transformation; these depend on the transform chosen . The t-options are listed after a slash in the parentheses that enclose the variables.

For example, the following statements find a quadratic polynomial transformation of all variables in the data set:

proc prinqual; transform spline(_all_ / degree=2); run;

Or, if N1 through N10 are nominal variables and M1 through M10 are ordinal variables, you can use the following statements.

proc prinqual; transform opscore(N1-N10) monotone(M1-M10); run;

The following sections describe the transformations available (specified with transform ) and the options available for some of the transformations (specified with t-options ).

Families of Transformations

There are three types of transformation families: nonoptimal, optimal, and other. Each family is summarized as follows .

Nonoptimal transformations

preprocess the specified variables, replacing each one with a single new nonoptimal, nonlinear transformation.

Optimal transformations

replace the specified variables with new, iteratively derived optimal transformation variables that fit the specified model better than the original variable (except for contrived cases where the transformation fits the model exactly as well as the original variable).

Other transformations

are the IDENTITY and SSPLINE transformations. These do not fit into either of the preceding categories.

The following table summarizes the transformations in each family.

Family

Members of Family

Nonoptimal transformations

 

inverse trigonometric sine

ARSIN

exponential

EXP

logarithm

LOG

logit

LOGIT

raises variables to specified power

POWER

transforms to ranks

RANK

Optimal transformations

 

linear

LINEAR

monotonic, ties preserved

MONOTONE

monotonic B-spline

MSPLINE

optimal scoring

OPSCORE

B-spline

SPLINE

monotonic, ties not preserved

UNTIE

Other transformations

 

identity, no transformation

IDENTITY

iterative smoothing spline

SSPLINE

The transform is followed by a variable (or list of variables) enclosed in parentheses. Optionally, depending on the transform , the parentheses can also contain t-options , which follow the variables and a slash. For example,

transform log(X Y);

computes the LOG transformation of X and Y . A more complex example is

transform spline(Y / nknots=2) log(X1 X2 X3);

The preceding statement uses the SPLINE transformation of the variable Y and the LOG transformation of the variables X1 , X2 , and X3 . In addition, it uses the NKNOTS= option with the SPLINE transformation and specifies two knots.

The rest of this section provides syntax details for members of the three families of transformations. The t-options are discussed in the section Transformation Options (t-options) on page 3663.

Nonoptimal Transformations

Nonoptimal transformations are computed before the iterative algorithm begins. Nonoptimal transformations create a single new transformed variable that replaces the original variable. The new variable is not transformed by the subsequent iterative algorithms (except for a possible linear transformation and missing value estimation).

The following list provides syntax and details for nonoptimal variable transformations.

ARSIN

ARS

EXP

LOG

LOGIT

POWER

POW

RANK

RAN

Optimal Transformations

Optimal transformations are iteratively derived. Missing values for these types of variables can be optimally estimated (see the Missing Values section on page 3667).

The following list provides syntax and details for optimal transformations.

LINEAR

LIN

MONOTONE

MON

MSPLINE

MSP

OPSCORE

OPS

SPLINE

SPL

UNTIE

UNT

Other Transformations

IDENTITY

IDE

SSPLINE

SSP

Transformation Options (t-options)

If you use a nonoptimal, optimal or other transformation, you can use t-options , which specify additional details of the transformation. The t-options are specified within the parentheses that enclose variables and are listed after a slash. For example,

proc prinqual; transform spline(X Y / nknots=3); run;

The preceding statements find an optimal variable transformation (SPLINE) of the variables X and Y and use a t-option to specify the number of knots (NKNOTS=). The following is a more complex example.

proc prinqual; transform spline(Y / nknots=3) spline(X1 X2 / nknots=6); run;

These statements use the SPLINE transformation for all three variables and use t-options as well; the NKNOTS= option specifies the number of knots for the spline.

The following sections discuss the t-options available for nonoptimal, optimal, and other transformations.

The following table summarizes the t-options .

Table 59.1: t-options Available in the TRANSFORM Statement

Task

Option

Nonoptimal transformation t-options

 

uses original mean and variance

ORIGINAL

Parameter t-options

 

specifies miscellaneous parameters

PARAMETER=

specifies smoothing parameter

SM=

Spline t-options

 

specifies the degree of the spline

DEGREE=

spaces the knots evenly

EVENLY

specifies the interior knots or break points

KNOTS=

creates n knots

NKNOTS=

Other t-options

 

renames variables

NAME=

reflects the variable around the mean

REFLECT

specifies transformation standardization

TSTANDARD=

Nonoptimal Transformation t-options

ORIGINAL

ORI

Parameter t-options

PARAMETER= number

PAR= number

SM= n

Spline t-options

The following t-options are available with the SPLINE and MSPLINE optimal transformations.

DEGREE= n

DEG= n

EVENLY

EVE

KNOTS= number-list n TO m BY p

KNO= number-list n TO m BY p

NKNOTS= n

NKN= n

Other t-options

The following t-options are available for all transformations.

NAME= (variable-list)

NAM= (variable-list)

REFLECT

REF

TSTANDARD=CENTER NOMISS ORIGINAL Z

TST=CEN NOM ORI Z

WEIGHT Statement

When you use a WEIGHT statement, a weighted residual sum of squares is minimized. The WEIGHT statement has no effect on degrees of freedom or number of observations, but the weights affect most other calculations. The observation is used in the analysis only if the value of the WEIGHT statement variable is greater than 0.

Категории