Accessing and Analyzing Data with Microsoft Excel (Bpg-Other)

Excel includes a set of data analysis tools, called the Analysis ToolPak, that you can use to develop complex statistical or engineering analyses. Although this book is not geared toward statisticians, engineers, mathematicians, or academic researchers, this section will describe some of the functions in the Analysis ToolPak that are very helpful for analyzing business data. Because the Analysis ToolPak is not very widely understood, this section details the components of common Analysis ToolPak dialog boxes and demonstrates how to use these dialog boxes in common data analysis scenarios. To use the Analysis ToolPak, simply click Data Analysis on the Tools menu. (See the following if the command is not on the menu.) Click a tool in the list, and then complete the information in the resulting dialog box to perform the analysis.

Installing the Analysis ToolPak

If the Data Analysis command doesn’t appear on the Tools menu, click Add-Ins on the Tools menu, select the Analysis ToolPak check box, and then click OK. If the Analysis ToolPak check box is not visible, be sure you have your original Microsoft Office or Excel installation media handy, double-click the Add/Remove Programs icon in Control Panel, and do one of the following:

For Microsoft Windows 2000, Windows Millennium Edition, and Windows XP:

For Microsoft Windows 98 and Windows NT 4.0:

Follow the instructions on the screen. The Analysis ToolPak can be found by expanding the Microsoft Excel for Windows node and then expanding the Add-Ins node.

Some of the useful Analysis ToolPak tools for business analysis include the following:

The rest of this section describes how to use these tools. In any of the dialog boxes for these individual Analysis ToolPak functions, you’ll need to provide the following information:

The Descriptive Statistics Tool

To use the Descriptive Statistics tool, on the Tools menu, click Data Analysis. Click Descriptive Statistics, and then click OK. The Descriptive Statistics dialog box is shown in Figure 3-23.

Figure 3-23: The Descriptive Statistics dialog box.

In addition to the input range and other standard options, use the following options in the dialog box depending on the information you’re analyzing:

Your Turn

In this exercise, you will run the Descriptive Statistics tool on the total nights booked for all customers for all months.

  1. Open Hotel.xls. If it is already open, close it (do not save it) and open it again.

  2. On the Tools menu, click Data Analysis, click Descriptive Statistics, and then click OK.

  3. Click the Input Range box, and then select cells C2 through C313.

  4. In the Grouped By area, click Columns.

  5. Click the New Worksheet Ply option.

  6. Select the Summary Statistics and Confidence Level For Mean check boxes.

  7. Check the Kth Largest option, and then type 5 in the adjacent box.

  8. Check the Kth Smallest option, and then type 7 in the adjacent box. Compare your results to Figure 3-24.

    Figure 3-24: Completing the Descriptive Statistics dialog box.

  9. Click OK, and then compare your results to Figure 3-25.

    Figure 3-25: Results of running the Descriptive Statistics tool.

The Histogram Tool

To use the Histogram tool, click Data Analysis on the Tools menu. Click Histogram, and then click OK. The Histogram dialog box is shown in Figure 3-26.

Figure 3-26: The Histogram dialog box.

In addition to the input range and other standard options, use the following options in the dialog box depending on the information you’re analyzing:

Your Turn

You want to generate a histogram to display the frequency of nights booked per month for all of the preferred customers.

  1. Open Hotel.xls. If the file is already open, close it (do not save it) and open it again.

  2. In cell F1, type the number 1. In cell F2, type 2, and so on through the number 20 in cell F20. The values you enter will be used as the bins in the histogram.

  3. On the Tools menu, click Data Analysis.

  4. Click Histogram, and then click OK.

  5. Click the Input Range box, and then select cells C2 through C313.

  6. Click the Bin Range box, and then select cells F1 through F20.

  7. Click the Output Range option, click the Output Range box, and then click cell H1.

  8. Select the Pareto (Sorted Histogram) check box.

  9. Select the Chart Output check box, and then compare your results to Figure 3-27.

    Figure 3-27: Completing the Histogram dialog box.

  10. Click OK and compare your results to Figure 3-28.

    Figure 3-28: The histogram produced for Hotel.xls.

The number of nights stayed per month that recur most often are 1, 11, 10, and 14.

The Moving Average Tool

To use the Moving Average tool, click Data Analysis on the Tools menu. Click Moving Average, and then click OK. The Moving Average dialog box appears, as shown in Figure 3-29.

Figure 3-29: The Moving Average dialog box.

For a moving average, the output range must be on the same worksheet as the input range. For this reason, the New Worksheet Ply and New Workbook options are not available. In addition to the input range and other standard options, the Moving Average dialog box includes two items that you set to perform your analysis:

Your Turn

You want to forecast what room service charges might be for the next year by using a three-month moving average of preferred customer Abercrombie’s room service charges during the past year.

  1. Open Hotel.xls. If it is already open, close it (do not save it) and open it again.

  2. On the Tools menu, click Data Analysis.

  3. Click Moving Average, and then click OK.

  4. Click the Input Range box, and then select cells D2 through D13 in the Total Room Service column.

  5. In the Interval box, type 3.

  6. Click the Output Range box, and then click cell F1.

  7. Select the Chart Output check box, compare your results to Figure 3-30, and then click OK. Compare the chart output to Figure 3-31.

    Figure 3-30: Completing the Moving Average dialog box.

    Figure 3-31: The chart showing actual and forecasted room service charges.

The numbers starting in cell F1 next to the chart are a result of running the moving average for each month, using the interval you entered. For example, for March, an average of the room service charges for January, February, and March is calculated. For April, an average of the room service charges for February, March, and April is calculated. You don’t see any moving average calculations for January or February because the data for the previous months (November and December of the preceding year) is not available.

It’s hard to say with certainty what preferred customer Abercrombie’s monthly room charges will be next year. However, overall the trend is downward in the last quarter, which could carry over into the next year. Hopefully, you will take some sort of action with this customer to turn things around.

The Rank and Percentile Tool

To use the Rank and Percentile tool, click Data Analysis on the Tools menu, click Rank And Percentile, and then click OK. Figure 3-32 shows the Rank And Percentile dialog box.

Figure 3-32: The Rank And Percentile dialog box.

Your Turn

You want to give all preferred customers who spent monthly room service charges in the 90th percentile or higher a special thank-you gift.

  1. Open Hotel.xls. If the file is already open, close it (do not save it) and open it again.

  2. On the Tools menu, click Data Analysis, click Rank And Percentile, and then click OK.

  3. Click the Input Range box, and then select cells D2 through D313 in the Total Room Service column.

  4. Click the Output Range option, click in the Output Range box, click cell F1, and compare your results to Figure 3-33.

    Figure 3-33: Completing the Rank And Percentile dialog box.

  5. Click OK, and compare the output to Figure 3-34. The 90th percentile cut-off point is $1,153.40 in any one month. (The numbers in the Point column refer to an item’s order in the list.)

    Figure 3-34: All the values above this point are in the 90th percentile.

The Sampling Tool

To use the Sampling tool, click Data Analysis on the Tools menu, click Sampling, and then click OK. The Sampling dialog box appears, as shown in Figure 3-35.

Figure 3-35: The Sampling dialog box.

The sampling method options include the following:

Your Turn

In this exercise, you’ll determine the average amount of monthly room charges made by each of your preferred customers. You’ll start by averaging about 10 percent of the data values to see whether they are representative of the 310 data values in the list.

  1. Open Hotel.xls. If it is already open, close it (do not save the file) and open the file again.

  2. On the Tools menu, click Data Analysis, click Sampling, and then click OK.

  3. Click the Input Range box, and then select cells D2 through D313 in the Total Room Service column.

  4. Click the Random option, and in the Number Of Samples box, type 31.

  5. Click the Output Range option, and then click cell F1. Compare your results to Figure 3-36.

  6. Click OK and compare your output to Figure 3-37. Because you’ve used random samples, your results will vary.

    Figure 3-36: Completing the Sampling dialog box for a random number of samples.

    Figure 3-37: Running the Sampling tool for a random number of samples.

  7. With the randomly picked values selected, right-click anywhere in the status bar, and click Average.

    What is the average monthly room service charge? Compare this amount to the average monthly room service charge of $473.35 for all preferred customers. Were the values picked representative? Now try a fixed number of data values.

  8. On the Tools menu, click Data Analysis, click Sampling, and then click OK.

  9. Click the Periodic option, and in the Period box, type 12.

  10. Click the Output Range box, and click cell G1.

  11. Click OK, and compare the output to Figure 3-38.

    Figure 3-38: Running the Sampling tool for a periodic number of samples.

  12. With the new sample values selected, right-click in the status bar and click Average.

    The average monthly room service charge based on these records is $522.37. Again, compare this to the average monthly room service charge of $473.35 for all of the preferred customers. Perhaps every 12th value that was sampled was not representative either. Experiment with higher values for both the number of random samples and the number of periodic values to see whether you can get closer to $473.35.

Putting It Together

You can use a number of Analysis ToolPak tools together to perform several analyses at once and spot trends. In the following exercise, you will compare the average of monthly room service charges to their rank and percentile.

  1. Open Hotel.xls. If the file is open already, close it (do not save the file) and reopen the file again.

  2. On the Tools menu, click Data Analysis.

  3. Click Descriptive Statistics, and then click OK.

  4. Click the Input Range box, and then select cells D2 through D313 in the Total Room Service column.

  5. Click the Columns and New Worksheet Ply options.

  6. Select the Summary Statistics check box, and then click OK.

  7. Click the worksheet labeled Sheet1.

  8. On the Tools menu, click Data Analysis.

  9. Click Rank And Percentile, and then click OK.

  10. Click the Input Range box, and then select cells D2 through D313 in the Total Room Service column.

  11. Click the Columns and New Worksheet Ply options, and then click OK.

  12. Compare the Descriptive Statistics worksheet’s Mean value (473.3455, or $473.35) to the values in the Rank And Percentile worksheet’s Column1 and Percent columns.

    You should notice that the mean (average) of $473.35 falls between the 60.1 and 60.4 percentiles. This is more than 10 percentage points higher than the median (midpoint), which is between $380.36 and $387.84 in the Rank And Percentile worksheet’s Column1 column, or $384.10 in the Descriptive Statistics worksheet’s Median row value.

Is this percentage difference good or bad for business? Although you have some big spenders among your customers, this could actually be bad for business in the long run. Here’s why: if the average was closer to the midpoint, this would most likely mean that most customers were making steady, predictable, evenly distributed room service orders. However, because the average is somewhat higher than the midpoint, this means that many of the preferred customers are making either large or small, less predictable room service orders, which could be difficult to plan for in the long run, especially if those big spenders stop purchasing. How do you correct this? The answer comes in knowing how the hotel chain goes about influencing its preferred customers’ purchasing habits and what it can do to keep the big spenders coming back or making the small spenders purchase more room service.

Категории