SAS.STAT 9.1 Users Guide (Vol. 4)

2017-07-07 02:10:07

Example 48.1. Cochran-Armitage Test with Permutation Resampling

This example, from Keith Soper at Merck, illustrates the exact permutation Cochran-Armitage test carried out on permutation resamples. In the following data set, each observation represents an animal. The binary variables S1 and S2 indicate two tumor types, with 0s indicating no tumor (failure) and 1 indicating a tumor (success); note that they have perfect negative association. The grouping variable is Dose .

data a; input S1 S2 Dose @@; datalines; 0 1 1 1 0 1 0 1 1 0 1 1 0 1 1 1 0 1 1 0 2 1 0 2 0 1 2 1 0 2 0 1 2 1 0 2 1 0 3 1 0 3 1 0 3 0 1 3 0 1 3 1 0 3 ; proc multtest data=a permutation nsample=10000 seed=36607 outperm=pmt pvals; test ca(S1 S2 / permutation=10 uppertailed); class Dose; contrast 'CA Linear Trend' 0 1 2; run; proc print data=pmt; run;

The PROC MULTTEST statement requests 10,000 permutation resamples. The OUTPERM=PMT option creates an output SAS data set for the exact permutation distribution computed for the CA test.

The TEST statement specifies an upper-tailed Cochran-Armitage linear trend test for S1 and S2 . The cutoff for exact permutation calculations is 10, as specified with the PERMUTATION= option in the TEST statement. Since S1 and S2 have ten and eight successes, respectively, PROC MULTTEST uses exact permutation distributions to compute the p -values for both variables.

The groups for the CA test are the levels of Dose from the CLASS statement. The trend coefficients applied to these groups are 0, 1, and 2, respectively, as specified in the CONTRAST statement.

Finally, PROC PRINT displays the SAS data set containing the permutation distributions.

The results from this analysis are listed in Output 48.1.1 through Output 48.1.5.

Output 48.1.1: Cochran-Armitage Test with Permutation Resampling

The Multtest Procedure Model Information Test for discrete variables Cochran-Armitage Exact permutation distribution used Everywhere Tails for discrete tests Upper-tailed Strata weights None P-value adjustment Permutation Number of resamples 10000 Seed 36607

Output 48.1.2: Contrast Coefficients

The Multtest Procedure Contrast Coefficients Dose Contrast 1 2 3 CA Linear Trend 0 1 2

Output 48.1.3: Summary Statistics

The Multtest Procedure Discrete Variable Tabulations Variable Dose Count NumObs Percent S1 1 2 6 33.33 S1 2 4 6 66.67 S1 3 4 6 66.67 S2 1 4 6 66.67 S2 2 2 6 33.33 S2 3 2 6 33.33

Output 48.1.4: Resulting p-Values

The Multtest Procedure p-Values Variable Contrast Raw Permutation S1 CA Linear Trend 0.1993 0.4058 S2 CA Linear Trend 0.9220 1.0000

Output 48.1.5: Exact Permutation Distribution

Obs _contrast_ _var_ _value_ upper_p 1 CA Linear Trend S1 0 1.00000 2 CA Linear Trend S1 1 1.00000 3 CA Linear Trend S1 2 1.00000 4 CA Linear Trend S1 3 1.00000 5 CA Linear Trend S1 4 1.00000 6 CA Linear Trend S1 5 0.99966 7 CA Linear Trend S1 6 0.99609 8 CA Linear Trend S1 7 0.97827 9 CA Linear Trend S1 8 0.92205 10 CA Linear Trend S1 9 0.80070 11 CA Linear Trend S1 10 0.61011 12 CA Linear Trend S1 11 0.38989 13 CA Linear Trend S1 12 0.19930 14 CA Linear Trend S1 13 0.07795 15 CA Linear Trend S1 14 0.02173 16 CA Linear Trend S1 15 0.00391 17 CA Linear Trend S1 16 0.00034 18 CA Linear Trend S1 17 0.00000 19 CA Linear Trend S1 18 0.00000 20 CA Linear Trend S1 19 0.00000 21 CA Linear Trend S1 20 0.00000 22 CA Linear Trend S2 0 1.00000 23 CA Linear Trend S2 1 1.00000 24 CA Linear Trend S2 2 1.00000 25 CA Linear Trend S2 3 0.99966 26 CA Linear Trend S2 4 0.99609 27 CA Linear Trend S2 5 0.97827 28 CA Linear Trend S2 6 0.92205 29 CA Linear Trend S2 7 0.80070 30 CA Linear Trend S2 8 0.61011 31 CA Linear Trend S2 9 0.38989 32 CA Linear Trend S2 10 0.19930 33 CA Linear Trend S2 11 0.07795 34 CA Linear Trend S2 12 0.02173 35 CA Linear Trend S2 13 0.00391 36 CA Linear Trend S2 14 0.00034 37 CA Linear Trend S2 15 0.00000 38 CA Linear Trend S2 16 0.00000

You should check the preceding table to verify that the analysis specifications are correct.

The preceding table lists the label and coefficients from the CONTRAST statement.

The preceding table contains summary statistics for the two test variables, S1 and S2 . The Count column lists the number of successes for each level of the class variable, Dose . The NumObs column is the sample size , and the Percent column is the percentage of successes in the sample.

The Raw column in the preceding p -Values table contains the p -values from the CA test, and the Permutation column contains the permutation-adjusted p -values.

This table shows that, for S1 , the adjusted p -value is almost twice the raw p -value. In fact, from theoretical considerations, the permutation-adjusted p -value for S1 should be 2 — 0.1993 = 0.3986; the difference is due to resampling error. For S2 , the raw p -value is 0.9220, and the adjusted p -value equals 1, as you would expect from theoretical considerations. The permutation p -values for S1 and S2 also happen to be the Bonferroni-adjusted p -values for this example.

The preceding table lists the OUTPERM= data set, which contains the exact permutation distributions for S1 and S2 in terms of cumulative probabilities.

Example 48.2. Freeman-Tukey and t-Tests with Bootstrap Resampling

The data for the following example are the same as for Example 48.1, except that a continuous variable T , which indicates the time of death of the animal, has been added.

data a; input S1 S2 T Dose @@; datalines; 0 1 104 1 1 0 80 1 0 1 104 1 0 1 104 1 0 1 100 1 1 0 104 1 1 0 85 2 1 0 60 2 0 1 89 2 1 0 96 2 0 1 96 2 1 0 99 2 1 0 60 3 1 0 50 3 1 0 80 3 0 1 98 3 0 1 99 3 1 0 50 3 ; proc multtest data=a bootstrap nsample=10000 pvals seed=37081 outsamp=res; test ft(S1 S2 / lowertailed) mean(T / lowertailed); class Dose; contrast 'Linear Trend' 0 1 2; run; proc print data=res(obs=36); run;

The BOOTSTRAP option in the PROC MULTTEST statement requests bootstrap resampling, and NSAMPLE=10000 requests 10,000 bootstrap samples. The seed for the random number generation is 37081. The OUTSAMP=RES option creates an output SAS data set containing the 10,000 bootstrap samples.

The TEST statement specifies the Freeman-Tukey test for S1 and S2 and specifies the t -test for T . Both tests are lower-tailed. The grouping variable in the CLASS statement is Dose , and the coefficients across the levels of Dose are 0, 1, and 2, as specified in the CONTRAST statement. PROC PRINT displays the first 36 observations of the Res data set containing the bootstrap samples.

The results from this analysis are listed in Output 48.2.1 through Output 48.2.5.

Output 48.2.1: FT and t-tests with Bootstrap Resampling

The Multtest Procedure Model Information Test for discrete variables Freeman-Tukey Test for continuous variables Mean t-test Tails for discrete tests Lower-tailed Tails for continuous tests Lower-tailed Strata weights None P-value adjustment Bootstrap Center continuous variables Yes Number of resamples 10000 Seed 37081

Output 48.2.2: Contrast Coefficients

The Multtest Procedure Contrast Coefficients Dose Contrast 1 2 3 Linear Trend 0 1 2

Output 48.2.3: Summary Statistics

The Multtest Procedure Discrete Variable Tabulations Variable Dose Count NumObs Percent S1 1 2 6 33.33 S1 2 4 6 66.67 S1 3 4 6 66.67 S2 1 4 6 66.67 S2 2 2 6 33.33 S2 3 2 6 33.33 Continuous Variable Tabulations Standard Variable Dose NumObs Mean Deviation T 1 6 99.3333 9.6056 T 2 6 87.5000 14.4326 T 3 6 72.8333 22.7017

Output 48.2.4: p-Values

The Multtest Procedure p-Values Variable Contrast Raw Bootstrap S1 Linear Trend 0.8547 1.0000 S2 Linear Trend 0.1453 0.4471 T Linear Trend 0.0070 0.0253

Output 48.2.5: Resampling Data Set

Obs _sample_ _class_ _obs_ S1 S2 T 1 1 1 11 0 1 8.5000 2 1 1 16 0 1 25.1667 3 1 1 16 0 1 25.1667 4 1 1 14 1 0 22.8333 5 1 1 18 1 0 22.8333 6 1 1 14 1 0 22.8333 7 1 2 4 0 1 4.6667 8 1 2 12 1 0 11.5000 9 1 2 8 1 0 27.5000 10 1 2 7 1 0 2.5000 11 1 2 3 0 1 4.6667 12 1 2 12 1 0 11.5000 13 1 3 13 1 0 12.8333 14 1 3 5 0 1 0.6667 15 1 3 8 1 0 27.5000 16 1 3 5 0 1 0.6667 17 1 3 13 1 0 12.8333 18 1 3 6 1 0 4.6667 19 2 1 8 1 0 27.5000 20 2 1 3 0 1 4.6667 21 2 1 9 0 1 1.5000 22 2 1 13 1 0 12.8333 23 2 1 14 1 0 22.8333 24 2 1 12 1 0 11.5000 25 2 2 14 1 0 22.8333 26 2 2 18 1 0 22.8333 27 2 2 15 1 0 7.1667 28 2 2 6 1 0 4.6667 29 2 2 13 1 0 12.8333 30 2 2 1 0 1 4.6667 31 2 3 7 1 0 2.5000 32 2 3 7 1 0 2.5000 33 2 3 6 1 0 4.6667 34 2 3 13 1 0 12.8333 35 2 3 4 0 1 4.6667 36 2 3 6 1 0 4.6667

The information in the preceding table corresponds to the specifications in the invocation of PROC MULTTEST.

The preceding table shows the coefficients from the CONTRAST statement, and they model a linear trend.

The summary statistics in the preceding table for S1 and S2 are the same as those from Example 48.1. The variables S1 and S2 are discrete, and T is a continuous variable. The mean, standard deviation, and sample size for each level of Dose is listed in the table for T . The p -values for S1 and S2 are from the Freeman-Tukey test, and the p -values for T are from the t -test.

The p -values are listed in the preceding table. The Raw column contains the results from the tests on the original data, and the Bootstrap column contains the bootstrap resampled adjustment to raw_p . Note that the adjusted p -values are larger than the raw p -values for all three variables. The adjusted p -values more accurately reflect the correlation of the raw p -values, the small size of the data, and the multiple testing.

The preceding table lists the first 36 observations of the SAS data set resulting from the OUTSAMP=RES option in the PROC MULTTEST statement. The entire data set has 180,000 observations, which is 10,000 times the number of observations in the data set. The _sample_ variable is the sample indicator and _class_ indicates the resampling group , that is, the level of the CLASS variable Dose assigned to the new observation. The number of the observation in the original data set is represented by _obs_ . Also listed are the values of the original test variables, S1 and S2 , and the mean-centered values of T .

Example 48.3. Peto Mortality-Prevalence Test

This example illustrates the use of the Peto mortality-prevalence test. The test is a combination of analyses about the prevalence of incidental tumors in the population and mortality due to fatal tumors .

In the data set, each observation represents an animal. The variables S1 ˆ’ S3 are three tumor types, with a value of 0 indicating no tumor, 1 indicating an incidental (nonlethal) tumor, and 2 indicating a lethal tumor. The time variable T indicates the time of death of the animal, a strata variable B is constructed from T , and the grouping variable Dose is drug dosage.

data a; input S1-S3 T Dose @@; if T<=90 then B=1; else B=2; datalines; 0 0 0 104 0 2 0 1 80 0 0 0 1 104 0 0 0 0 104 0 0 2 0 100 0 1 0 0 104 0 2 0 0 85 1 2 1 0 60 1 0 1 0 89 1 2 0 1 96 1 0 0 0 96 1 2 0 1 99 1 2 1 1 60 2 2 0 0 50 2 2 0 1 80 2 0 0 2 98 2 0 0 1 99 2 2 1 1 50 2 ; proc multtest data=a notables out=p stepsid; test peto(S1-S3 / permutation=20 time=T uppertailed); class Dose; strata B; contrast 'mort-prev' 0 1 2; run; proc print data=p; run;

The NOTABLES option in the PROC MULTTEST statement suppresses the display of the summary statistics for each variable. The OUT=P option creates an output SAS data set containing all p -values and intermediate statistics. The STEPSID option is used to adjust the p -values.

The TEST statement specifies an upper-tailed Peto test for S1 ˆ’ S3 . The mortality strata are defined by TIME= T , the death times. The CLASS statement contains the grouping variable Dose . The prevalence strata are defined by the STRATA statement as the blocking variable B . The CONTRAST statement lists the default linear trend coefficients. PROC PRINT displays the requested p -value data set.

The results from this analysis are listed in Output 48.3.1 through Output 48.3.4.

Output 48.3.1: Peto Test

The Multtest Procedure Model Information Test for discrete variables Peto Exact permutation distribution used Everywhere Tails for discrete tests Upper-tailed Strata weights Sample size P-value adjustment Stepdown Sidak

Output 48.3.2: Contrast Coefficients

The Multtest Procedure Contrast Coefficients Dose Contrast 0 1 2 mort-prev 0 1 2

Output 48.3.3: p-Values

The Multtest Procedure p-Values Stepdown Variable Contrast Raw Sidak S1 mort-prev 0.0681 0.0814 S2 mort-prev 0.5000 0.5000 S3 mort-prev 0.0363 0.0781

Output 48.3.4: OUT= Data Set

Obs _test_ _var_ _contrast_ _strat_ _tstrat_ _value_ _exp_ _se_ raw_p stpsid_p 1 PETO S1 mort-prev 1 0 0 0.00000 0.00000 . . 2 PETO S1 mort-prev 2 0 0 0.62500 0.85696 . . 3 PETO S1 mort-prev 50 1 4 2.00000 1.12022 . . 4 PETO S1 mort-prev 60 1 3 1.75000 1.06654 . . 5 PETO S1 mort-prev 80 1 2 1.57143 1.04978 . . 6 PETO S1 mort-prev 85 1 1 0.75000 0.72169 . . 7 PETO S1 mort-prev 96 1 1 0.70000 0.78102 . . 8 PETO S1 mort-prev 98 1 0 0.00000 0.00000 . . 9 PETO S1 mort-prev 99 1 1 0.42857 0.72843 . . 10 PETO S1 mort-prev 100 1 0 0.00000 0.00000 . . 11 PETO S2 mort-prev 1 0 6 5.50000 1.05221 . . 12 PETO S2 mort-prev 2 0 0 0.00000 0.00000 . . 13 PETO S2 mort-prev 50 1 0 0.00000 0.00000 . . 14 PETO S2 mort-prev 60 1 0 0.00000 0.00000 . . 15 PETO S2 mort-prev 80 1 0 0.00000 0.00000 . . 16 PETO S2 mort-prev 85 1 0 0.00000 0.00000 . . 17 PETO S2 mort-prev 96 1 0 0.00000 0.00000 . . 18 PETO S2 mort-prev 98 1 0 0.00000 0.00000 . . 19 PETO S2 mort-prev 99 1 0 0.00000 0.00000 . . 20 PETO S2 mort-prev 100 1 0 0.00000 0.00000 . . 21 PETO S3 mort-prev 1 0 6 5.50000 1.05221 . . 22 PETO S3 mort-prev 2 0 4 2.22222 1.08298 . . 23 PETO S3 mort-prev 50 1 0 0.00000 0.00000 . . 24 PETO S3 mort-prev 60 1 0 0.00000 0.00000 . . 25 PETO S3 mort-prev 80 1 0 0.00000 0.00000 . . 26 PETO S3 mort-prev 85 1 0 0.00000 0.00000 . . 27 PETO S3 mort-prev 96 1 0 0.00000 0.00000 . . 28 PETO S3 mort-prev 98 1 2 0.62500 0.85696 . . 29 PETO S3 mort-prev 99 1 0 0.00000 0.00000 . . 30 PETO S3 mort-prev 100 1 0 0.00000 0.00000 . . 31 PETO S1 mort-prev . . 12 7.82500 2.42699 0.06808 0.08140 32 PETO S2 mort-prev . . 6 5.50000 1.05221 0.50000 0.50000 33 PETO S3 mort-prev . . 12 8.34722 1.73619 0.03627 0.07811

The preceding information corresponds to the PROC MULTTEST invocation. In this case the totals for all prevalence and fatality strata are less than 20, so exact permutation tests are used everywhere, and the STEPSID adjustments are computed from these permutation distributions.

The contrast trend coefficients are listed in the preceding table. They happen to be the same as the levels of the Dose variable.

In the preceding p -Values table, the p -values for the Peto tests are listed in the Raw column, and the stepdown Sidak adjusted p -values are in the Stepdown Sidak column.

Significant p -values support the claim that higher dosage levels promote higher mortality and prevalence. The raw Peto test is significant at the 5% level for S3 , but the adjusted S3 test is no longer significant at 5%. The raw and adjusted p -values for S2 are the same because of the stepdown technique.

The preceding table lists the OUT= data set. The first 30 observations correspond to intermediate statistics used to compute the Peto p -values. The _test_ variable lists the name of the test, the _var_ variable lists the name of the TEST variables, and the _contrast_ variable lists the CONTRAST label. The _strat_ variable lists the level of the STRATA variable, and the _tstrat_ variable indicates whether or not the stratum corresponds to values of the TIME= variable. The _value_ variable is the observed contrast for a stratum and the _exp_ variable is its expected value. The variable _se_ contains the square root of the variance terms summed to form the denominator of the Peto statistics.

The final three observations correspond to the three Peto tests, with their p -values listed under the raw_p variable. The stpsid_p variable contains the stepdown Sidak adjusted p -values.

Example 48.4. Fisher Test with Permutation Resampling

These data, from Brown and Fears (1981), are the results from an 80-week carcino-genesis bioassay with female mice. Six tissue sites are examined at necropsy; 1 indicates the presence of a tumor and 0 the absence. A frequency variable Freq is included. A control and four different doses of a drug (in parts per milliliter) make up the levels of the grouping variable Dose .

data a; input Liver Lung Lymph Cardio Pitui Ovary Freq Dose$ @@; datalines; 1 0 0 0 0 0 8 CTRL 0 1 0 0 0 0 7 CTRL 0 0 1 0 0 0 6 CTRL 0 0 0 1 0 0 1 CTRL 0 0 0 0 0 1 2 CTRL 1 1 0 0 0 0 4 CTRL 1 0 1 0 0 0 1 CTRL 1 0 0 0 0 1 1 CTRL 0 1 1 0 0 0 1 CTRL 0 0 0 0 0 0 18 CTRL 1 0 0 0 0 0 9 4PPM 0 1 0 0 0 0 4 4PPM 0 0 1 0 0 0 7 4PPM 0 0 0 1 0 0 1 4PPM 0 0 0 0 1 0 2 4PPM 0 0 0 0 0 1 1 4PPM 1 1 0 0 0 0 4 4PPM 1 0 1 0 0 0 3 4PPM 1 0 0 0 1 0 1 4PPM 0 1 1 0 0 0 1 4PPM 0 1 0 1 0 0 1 4PPM 1 0 1 1 0 0 1 4PPM 0 0 0 0 0 0 15 4PPM 1 0 0 0 0 0 8 8PPM 0 1 0 0 0 0 3 8PPM 0 0 1 0 0 0 6 8PPM 0 0 0 1 0 0 3 8PPM 1 1 0 0 0 0 1 8PPM 1 0 1 0 0 0 2 8PPM 1 0 0 1 0 0 1 8PPM 1 0 0 0 1 0 1 8PPM 1 1 0 1 0 0 2 8PPM 1 1 0 0 0 1 2 8PPM 0 0 0 0 0 0 19 8PPM 1 0 0 0 0 0 4 16PPM 0 1 0 0 0 0 2 16PPM 0 0 1 0 0 0 9 16PPM 0 0 0 0 1 0 1 16PPM 0 0 0 0 0 1 1 16PPM 1 1 0 0 0 0 4 16PPM 1 0 1 0 0 0 1 16PPM 0 1 1 0 0 0 1 16PPM 0 1 0 1 0 0 1 16PPM 0 1 0 0 0 1 1 16PPM 0 0 1 1 0 0 1 16PPM 0 0 1 0 1 0 1 16PPM 1 1 1 0 0 0 2 16PPM 0 0 0 0 0 0 14 16PPM 1 0 0 0 0 0 8 50PPM 0 1 0 0 0 0 4 50PPM 0 0 1 0 0 0 8 50PPM 0 0 0 1 0 0 1 50PPM 0 0 0 0 0 1 4 50PPM 1 1 0 0 0 0 3 50PPM 1 0 1 0 0 0 1 50PPM 0 1 1 0 0 0 1 50PPM 0 1 0 0 1 1 1 50PPM 0 0 0 0 0 0 19 50PPM ; proc multtest data=a order=data notables out=p permutation nsample=1000 seed=764511; test fisher(Liver Lung Lymph Cardio Pitui Ovary / lowertailed); class Dose; freq Freq; run; proc print data=p; run;

In the PROC MULTTEST statement, the ORDER=DATA option is required to keep the levels of Dose in the order in which they appear in the data set. Without this option, the levels are sorted by their formatted value, resulting in an alphabetic ordering. The NOTABLES option suppresses the display of summary statistics, and the OUT=P option requests an output data set containing p -values. The PERMUTATION option specifies permutation resampling, NSAMPLE=1000 requests 1000 samples, and SEED=764511 provides a starting value for the random number generator. You should specify a seed if you need to duplicate resampling results.

To test for higher rates of tumor occurrence in the treatment groups compared to the control group, the LOWERTAILED option is specified in the TEST statement to produce a lower-tailed Fisher exact test for the six tissue sites. The Fisher test is appropriate for comparing a treatment and a control, but multiple testing can be a problem. Brown and Fears (1981) use a multivariate permutation to evaluate the entire collection of tests. PROC MULTTEST adjusts the p -values by simulation.

The treatments make up the levels of the grouping variable Dose , listed in the CLASS statement. Since no CONTRAST statement is specified, PROC MULTTEST uses the default pairwise contrasts with the first level of Dose . The FREQ statement is used since this is summary data containing frequency counts of occurrences.

The results from this analysis are listed in Output 48.4.1 through Output 48.4.4.

Output 48.4.1: Fisher Test with Permutation Resampling

The Multtest Procedure Model Information Test for discrete variables Fisher Tails for discrete tests Lower-tailed Strata weights None P-value adjustment Permutation Number of resamples 1000 Seed 764511

Output 48.4.2: Default Contrast Coefficients

The Multtest Procedure Contrast Coefficients Dose Contrast CTRL 4PPM 8PPM 16PPM 50PPM CTRL vs. 4PPM 1 1 0 0 0 CTRL vs. 8PPM 1 0 1 0 0 CTRL vs. 16PPM 1 0 0 1 0 CTRL vs. 50PPM 1 0 0 0 1

Output 48.4.3: p-Values

The Multtest Procedure p-Values Variable Contrast Raw Permutation Liver CTRL vs. 4PPM 0.2828 0.9690 Liver CTRL vs. 8PPM 0.3069 0.9750 Liver CTRL vs. 16PPM 0.7102 1.0000 Liver CTRL vs. 50PPM 0.7718 1.0000 Lung CTRL vs. 4PPM 0.7818 1.0000 Lung CTRL vs. 8PPM 0.8858 1.0000 Lung CTRL vs. 16PPM 0.5469 1.0000 Lung CTRL vs. 50PPM 0.8498 1.0000 Lymph CTRL vs. 4PPM 0.2423 0.9430 Lymph CTRL vs. 8PPM 0.5898 1.0000 Lymph CTRL vs. 16PPM 0.0350 0.2480 Lymph CTRL vs. 50PPM 0.4161 0.9960 Cardio CTRL vs. 4PPM 0.3163 0.9770 Cardio CTRL vs. 8PPM 0.0525 0.3570 Cardio CTRL vs. 16PPM 0.4506 1.0000 Cardio CTRL vs. 50PPM 0.7576 1.0000 Pitui CTRL vs. 4PPM 0.1250 0.7260 Pitui CTRL vs. 8PPM 0.4948 1.0000 Pitui CTRL vs. 16PPM 0.2157 0.9050 Pitui CTRL vs. 50PPM 0.5051 1.0000 Ovary CTRL vs. 4PPM 0.9437 1.0000 Ovary CTRL vs. 8PPM 0.8126 1.0000 Ovary CTRL vs. 16PPM 0.7760 1.0000 Ovary CTRL vs. 50PPM 0.3689 0.9950

Output 48.4.4: OUT= Data Set

Obs _test_ _var_ _contrast_ _xval_ _mval_ _yval_ _nval_ raw_p perm_p sim_se 1 FISHER Liver CTRL vs. 4PPM 14 49 18 50 0.28282 0.969 0.005481 2 FISHER Liver CTRL vs. 8PPM 14 49 17 48 0.30688 0.975 0.004937 3 FISHER Liver CTRL vs. 16PPM 14 49 11 43 0.71022 1.000 0.000000 4 FISHER Liver CTRL vs. 50PPM 14 49 12 50 0.77175 1.000 0.000000 5 FISHER Lung CTRL vs. 4PPM 12 49 10 50 0.78180 1.000 0.000000 6 FISHER Lung CTRL vs. 8PPM 12 49 8 48 0.88581 1.000 0.000000 7 FISHER Lung CTRL vs. 16PPM 12 49 11 43 0.54685 1.000 0.000000 8 FISHER Lung CTRL vs. 50PPM 12 49 9 50 0.84978 1.000 0.000000 9 FISHER Lymph CTRL vs. 4PPM 8 49 12 50 0.24228 0.943 0.007332 10 FISHER Lymph CTRL vs. 8PPM 8 49 8 48 0.58977 1.000 0.000000 11 FISHER Lymph CTRL vs. 16PPM 8 49 15 43 0.03498 0.248 0.013656 12 FISHER Lymph CTRL vs. 50PPM 8 49 10 50 0.41607 0.996 0.001996 13 FISHER Cardio CTRL vs. 4PPM 1 49 3 50 0.31631 0.977 0.004740 14 FISHER Cardio CTRL vs. 8PPM 1 49 6 48 0.05254 0.357 0.015151 15 FISHER Cardio CTRL vs. 16PPM 1 49 2 43 0.45061 1.000 0.000000 16 FISHER Cardio CTRL vs. 50PPM 1 49 1 50 0.75758 1.000 0.000000 17 FISHER Pitui CTRL vs. 4PPM 0 49 3 50 0.12496 0.726 0.014104 18 FISHER Pitui CTRL vs. 8PPM 0 49 1 48 0.49485 1.000 0.000000 19 FISHER Pitui CTRL vs. 16PPM 0 49 2 43 0.21572 0.905 0.009272 20 FISHER Pitui CTRL vs. 50PPM 0 49 1 50 0.50505 1.000 0.000000 21 FISHER Ovary CTRL vs. 4PPM 3 49 1 50 0.94372 1.000 0.000000 22 FISHER Ovary CTRL vs. 8PPM 3 49 2 48 0.81260 1.000 0.000000 23 FISHER Ovary CTRL vs. 16PPM 3 49 2 43 0.77596 1.000 0.000000 24 FISHER Ovary CTRL vs. 50PPM 3 49 5 50 0.36889 0.995 0.002230

The preceding table lists the PROC MULTTEST specifications.

The preceding table lists the default contrasts for the Fisher test. Note that each dose is compared with the control.

The preceding p -Values table lists p -values for the Fisher exact tests and their permutation-based adjustments. As noted by Brown and Fears, only one of the twenty-four tests is significant at the 5% level (Lymph, CTRL vs. 16PPM). Brown and Fears report a 12% chance of observing at least one significant raw p -value for 16PPM and a 9% chance of observing at least one significant raw p -value for Lymph (both at the 5% level). Adjusted p -values exhibit much lower chances of false significances. For this example, none of the adjusted p -values are close to significant.

The preceding table lists the OUT= data set. The _test_ , _var_ ,and _contrast_ variables provide the TEST name, TEST variable, and CONTRAST label, respectively. The _xval_ , _mval_ , _yval_ ,and _nval_ variables contain the components used to compute the Fisher exact tests from the hypergeometric distribution. The raw_p variable contains the p -values from the Fisher exact tests, and the perm_p variable contains their permutation-based adjustments. The variable sim_se is the simulation standard error from the permutation resampling.

Example 48.5. Inputting Raw p-Values

This example illustrates how to use PROC MULTTEST to multiplicity-adjust a collection of raw p -values obtained from some other source. This is a valuable option for those cases where PROC MULTTEST cannot compute the raw p -values directly.

data a; input Test$ Raw_P; datalines; test1 .09108 test2 .69122 test3 .00177 test4 .57181 test5 .03121 test6 .01413 ; proc multtest pdata=a holm hoc fdr; run;

Note that there are no statements other than the PROC MULTTEST statement using the p -value input mode. In this example, the raw p -values are adjusted using the Holm, Hochberg, and Benjamini and Hocherg (FDR) methods .

The output from this analysis is listed in Output 48.5.1.

Output 48.5.1: Inputting Raw p-Values

The Multtest Procedure p-Values False Stepdown Discovery Test Raw Bonferroni Hochberg Rate 1 0.0911 0.2732 0.2732 0.1366 2 0.6912 1.0000 0.6912 0.6912 3 0.0018 0.0106 0.0106 0.0106 4 0.5718 1.0000 0.6912 0.6862 5 0.0312 0.1248 0.1248 0.0624 6 0.0141 0.0707 0.0707 0.0424

Note that the adjusted p -values for the Hochberg method ( hoc_p ) are less than or equal to those for the Holm method ( stpbon_p ). In turn , the adjusted p -values for the Benjamini and Hochberg method ( fdr_p ) are less than or equal to those for the Hochberg method. These comparisons hold generally for all p -value configurations. The FDR method controls the false discovery rate and not the familywise error rate. The Hochberg method controls the familywise error rate under independence. The Holm method controls the familywise error rate without assuming independence.

Example 48.1. Cochran-Armitage Test with Permutation Resampling

Example 48.2. Freeman-Tukey and t-Tests with Bootstrap Resampling

Example 48.3. Peto Mortality-Prevalence Test

Example 48.4. Fisher Test with Permutation Resampling

Example 48.5. Inputting Raw p-Values

Категории