Advanced DBA Certification Guide and Reference for DB2 Universal Database v8 for Linux, UNIX, and Windows
The previous sections used the explain tool to examine DB2 access plans. The next sections will introduce the db2exfmt tool and discuss how to interpret its output. The db2exfmt tool reads the explain tables and builds an ASCII report with the explain information, as well as a graphical representation of the access plan. The output of the db2exfmt tool includes the following information:
Overview Area
As shown below, the main information in the overview area is the version and release level of DB2, as well as the date and time when the tool was run. DB2 Universal Database Version 8.1, 5622-044 (c) Copyright IBM Corp. 1991, 2002 Licensed Material - Program Property of IBM IBM DATABASE 2 Explain Table Format Tool ************************ EXPLAIN INSTANCE *************************** DB2_VERSION: 08.01.0 SOURCE_NAME: SQLC2E03 SOURCE_SCHEMA: NULLID SOURCE_VERSION: EXPLAIN_TIME: 2002-11-03-16.57.30.133000 EXPLAIN_REQUESTER: DWAINE Database Context Area
The database context area lists the configuration parameters that have the biggest impact on the performance of the database and its applications, including:
Database Context: ---------------- Parallelism: None CPU Speed: 1.094264e-006 Comm Speed: 1 Buffer Pool size: 80000 Sort Heap size: 4096 Database Heap size: 1200 Lock List size: 40 Maximum Lock List: 22 Average Applications: 1 Locks Available: 903 Package Context Area
The package context indicates whether the SQL was dynamic or static, as well as the optimization level, isolation level, and degree of intra-partition parallelism used for the statement. Package Context: ---------------- SQL Type: Dynamic Optimization Level: 5 Blocking: Block All Cursors Isolation Level: Cursor Stability --------------------- STATEMENT 1 SECTION 203 ---------------------- QUERYNO: 1 QUERYTAG: Statement Type: Select Updatable: No Deletable: No Query Degree: 1 Original Statement
The original statement contains the SQL statement as it was originally run by the application. [View full width]
[View full width] Original Statement: ------------------- select l_returnflag, l_linestatus, sum(l_quantity) as sum_qty, sum(l_extendedprice) asOptimized Statement
The optimized statement contains the SQL statement as it was rewritten by the query rewrite facility of the DB2 optimizer. The internal names (Q1, Q2, Q3) represent the table's position in the select list. The query rewrite facility can also perform the following functions if applicable :
[View full width]
[View full width] Optimized Statement: ------------------- SELECT Q3.$C7 AS "L_RETURNFLAG", Q3.$C6 AS "L_LINESTATUS", Q3.$C5 AS "SUM_QTY", Q3.$C4 AS "SUM_BASE_PRICE", Q3.$C3 AS "SUM_DISC_PRICE", Q3.$C2 AS "SUM_CHARGE",(Q3.$C5 / Q3.$C0) AS "AVG_QTY", (Q3.$C4 /Q3.$C0) AS "AVG_PRICE",Access Plan
Access Plan Example 1
Access Plan: ----------- Total Cost: 23296.9 Query Degree: 1 Rows RETURN ( 1) Cost I/O 6 GRPBY ( 2) 23296.9 12728 24 MDTQ ( 3) 23296.9 12728 6 GRPBY ( 4) 23296.7 12728 6 TBSCAN ( 5) 23296.7 12728 6 SORT ( 6) 23296.7 12728 478775 TBSCAN ( 7) 22453.8 12728 496100 TABLE: TPCD.L_SUMMARY The db2exfmt tool builds an ASCII/text graph of the access plan as above. The elements of the access plan are read from the bottom up. Starting at the bottom of the access plan, we see that the base table accessed for this query is the L_SUMMARY table, and it has a cardinality of 496100 rows. The table is accessed via a table scan (relation scan) and the data then sorted. The output of the sort is then scanned, the data grouped on the specified column, and then directed to the coordinator partition, using a merge directed table queue. On the coordinator partition, the data is again grouped before being returned to the application. The operators that can be encountered in the db2exfmt output can be broken into the following categories:
Access Plan Example 2
In the following access plan example, the path of execution is read from the bottom up, and from left to right. Each row that is found by the index scan (IXSCAN) in step 14 is passed to the nested loop join (NLJOIN) in step 13. The nested loop join (NLJOIN) then accesses the inner table, based on the join predicates and local predicates (if any) returned by the fetch (FETCH) in step 15, based on the index scan (IXSCAN) in step 16. Each joined row is returned from the nested loop join (NLJOIN) to the next operator in the access plan. Execution continues until the entire outer stream is exhausted. 3.87404 NLJOIN ( 13) 125.206 5 /-------+------\ 0.968511 4 IXSCAN FETCH ( 14) ( 15) 75.0966 100.118 3 4 /----+---\ 4.99966e+06 4 1.99987e+07 INDEX: TPCD IXSCAN TABLE: TPCD UXP_NMPK ( 16) PARTSUPP 75.1018 3 1.99987e+07 INDEX: TPCD.UXPS_PK2KSC However, the graph does not give all of the details. It is important to know why the fetch is required on the inner table and what columns are the tables being joined on. This information can be found in the access plan details. For example, for the nested loop join in step 13, the detailed information is below. The cost information in the detailed information contains:
Based on the detailed information below, the total cumulative cost is 125.206 timerons. The Re-Total Cost is the estimated cost to reexecute this sub plan. The Cumulative First Row Cost is the estimated cost to return the first row of the result set. The Estimated Bufferpool Buffers is the expected number of buffer pool pages required by this operator. 13) NLJOIN: (Nested Loop Join) Cumulative Total Cost: 125.206 Cumulative CPU Cost: 164264 Cumulative I/O Cost: 5 Cumulative Re-Total Cost: 0.062461 Cumulative Re-CPU Cost: 49744 Cumulative Re-I/O Cost: 0 Cumulative First Row Cost: 125.204 Estimated Bufferpool Buffers: 6 Arguments: --------- EARLYOUT: (Early Out flag) FALSE FETCHMAX: (Override for FETCH MAXPAGES) IGNORE ISCANMAX: (Override for ISCAN MAXPAGES) IGNORE The arguments for the nested loop join indicate the following: EARLYOUT
Indicates whether the optimizer will get the next outer row after finding the first match on the inner row. This guarantees one match on the inner. FETCHMAX
Specifies the maximum number of pages to prefetch for a fetch or index scan. ISCANMAX
A nested loop join can override the original settings if it is an ordered nested loop join. 13) NLJOIN: (Nested Loop Join) Predicates: ---------- 16) Predicate used in Join Relational Operator: Equal (=) Subquery Input Required: No Filter Factor: 5.00034e-08 Predicate Text: -------------- (Q1.PS_PARTKEY = Q2.P_PARTKEY) The predicate information includes the estimated selectivity of the predicate, based on the table and column statistics, as well as the predicate being applied by the operator. In this case, the columns being joined are the PARTKEY columns in table Q1 and Q2. The join will then have two input streams, one for the inner table and one for the outer table. 13) NLJOIN: (Nested Loop Join) Input Streams: ------------- 5) From Operator #14 Estimated number of rows: 0.968511 Partition Map ID 1 Partitioning: (MULT ) Multiple Partitions Number of columns: 3 Subquery predicate ID: Not Applicable Column Names: ------------ +$RID$+P_PARTKEY+P_NAME Partition Column Names: ---------------------- +1: PS_PARTKEY The estimated stream cardinality from operator 14 in this case is .968511, and it is returning three columns. In this case, the operation is occurring on multiple partitions in the database. The partitioning key is PS_PARTKEY. The estimated stream cardinality from operator 15, as seen below, is 4, and it is returning four columns. This operation is also occurring on multiple partitions, and the partitioning key is PS_PARTKEY. 13) NLJOIN: (Nested Loop Join) Input Streams: ------------- 9) From Operator #15 Estimated number of rows: 4 Partition Map ID: 1 Partitioning: (MULT ) Multiple Partitions Number of columns: 4 Subquery predicate ID: Not Applicable Column Names: ------------ +PS_PARTKEY(A)+PS_SUPPKEY(A)+$RID$+PS_AVAILQTY Partition Column Names: ---------------------- +1: PS_PARTKEY The detailed information for the fetch operation shows that the columns PS_PARTKEY and PS_SUPPKEY are being passed to the fetch from the index scan in operation 16, and the fetch is then retrieving the PS_AVAILQTY column from the table PARTSUPP. The PS_AVAILQTY column must be retrieved from the table because it is not contained in the index used in operator 16. 15) FETCH : (Fetch) Arguments: --------- ... Input Streams: ------------- 7) From Operator #16 Column Names: ------------ +PS_PARTKEY(A)+PS_SUPPKEY(A)+$RID$ 8) From Object TPCD.PARTSUPP Column Names: ------------ +PS_AVAILQTY In the index scan in step 16, the optimizer is applying a start and stop predicate to the index scan. The scan will read only the index leaf pages where Q1.PS_PARTKEY = Q2.P_PARTKEY; it does not need to scan the entire index. From step 16, the estimated number of rows returned by the index scan is four. 16) IXSCAN: (Index Scan) Predicates: ---------- 16) Start Key Predicate Relational Operator: Equal (=) Subquery Input Required: No Filter Factor: 5.00034e-08 Predicate Text: -------------- (Q1.PS_PARTKEY = Q2.P_PARTKEY) 16) Stop Key Predicate Relational Operator: Equal (=) Subquery Input Required: No Filter Factor: 5.00034e-08 Predicate Text: -------------- (Q1.PS_PARTKEY = Q2.P_PARTKEY) The details for the sort operation in step 16 of the explain plan below indicate that I/O occurred during the sort. Therefore, the sort must have overflowed and could not be accomplished within the sort heap. 3.65665e+07 TBSCAN ( 15) 6.87408e+06 1.45951e+06 3.65665e+07 SORT ( 16) 6.14826e+06 1.30119e+06 3.65665e+07 TBSCAN ( 17) 2.00653e+06 1.14286e+06 3.74999e+07 TABLE: TPCD ORDERS The detailed information about the table scan following the sort (i.e., step 15 above) will list the estimated number of buffer pool buffers, which gives an estimate for the size of the overflowed temporary table. Based on the following piece of the explain graph, the estimated size of the overflowed sort table will be 163976 pages. 15) TBSCAN: (Table Scan) . . . Estimated Bufferpool Buffers: 163976 Recognizing List Prefetch
The following explain graph shows an example of list prefetch. In step 12, the index scan is applying the predicates and returning the row identifiers (RIDs) to the sort operation in step 11. The RIDs are then sorted based on the page number, and passed to the RID Scan (RIDSCN) operation in step 10. The RID scan will build a list of the pages and call the prefetchers to retrieve the pages into the buffer pool. The fetch operation in step 9 can then fetch and process the pages because they should already be in the buffer pool, due to the work of the prefetchers. 455.385 FETCH ( 9) 308.619 61.2878 /----+---\ 455.385 15009 RIDSCN TABLE: TPCD ( 10) L_SUMMARY2 219.093 17.4697 455.385 SORT ( 11) 219.091 17.4697 455.385 IXSCAN ( 12) 218.559 17.4697 15009 INDEX: TPCD L_SUMMARY2_IDX Recognizing Index ORing
The following explain graph shows an example of index ORing. In steps 6, 8, and 10, the index scan is applying the predicates and returning the RIDs to the sort operations above. The RIDs are then sorted based on the page number, any duplicates are eliminated, and the results are then passed to the RID Scan (RIDSCN) operation in step 4. The RID scan will build a list of the pages and call the prefetchers to retrieve the pages into the buffer pool. The fetch operation in step 3 can then fetch and process the pages because they should already be in the buffer pool, due to the work of the prefetchers. In this case, the fetch operation must reapply the predicates due to the OR predicates. 59537.2 FETCH ( 3) 62819.1 37361.7 /----+---\ 59537.2 1.50002e+08 RIDSCN TABLE: TPCD ( 4) LINEITEM 803.781 82.2908 +-----------------+-----------------+ 59383.3 150.002 4.00006 SORT SORT SORT ( 5) ( 7) ( 9) 653.127 75.5534 75.1057 76.2908 3 3 59383.3 150.002 4.00006 IXSCAN IXSCAN IXSCAN ( 6) ( 8) ( 10) 510.773 75.4177 75.1022 76.2908 3 3 1.50002e+08 1.50002e+08 1.50002e+08 INDEX: INDEX: INDEX: TPCD.L_SD TPCD.L_SK_PK TPCD.L_OK Recognizing Index ANDing
The following explain graph shows an example of index ANDing. In steps 7 and 8, the index scan is applying the predicates and returning the RIDs from the index. The index ANDing (IXAND) operation then hashes the RIDs into the dynamic bitmap and starts returning the RIDs as it works on the last index. The RIDs are then sorted based on the page number, any duplicates are eliminated, and the results are then passed to the RIDSCN operation in step 4. The RID scan will build a list of the pages and call the prefetchers to retrieve the pages into the buffer pool. The fetch operation in step 3 can then fetch and process the pages because they should already be in the buffer pool, due to the work of the prefetchers. In this case, the fetch operation must reapply the predicates because the bitmap used is a reducing bitmap, and not all "qualified" rows are truly qualified. As shown by the access plan graph below, index ANDing is considered when there is a large number of rows to process, but the expected result set is relatively small. In this case, the indexes scanned had approximately 250000 rows and 500000 rows, respectively, but the expected number of rows returned by the fetch is only four. 4.4314 FETCH ( 3) 5475.6 1952.4 /----+---\ 886.281 1.50002e+08 RIDSCN TABLE: TPCD ( 4) LINEITEM 4027.9 1100.96 886.281 SORT ( 5) 4027.9 1100.96 886.281 IXAND ( 6) 4026.01 1100.96 /------+-----\ 248752 534445 IXSCAN IXSCAN ( 7) ( 8) 1480.95 2509.07 430.024 670.935 1.50002e+08 1.50002e+08 INDEX: TPCD INDEX: TPCD L_OK L_SD Handling Predicates
Query predicates can be handled in two different manners within DB2:
Definitions and Terminology
A predicate that can be processed using method 2, described above, is known as a sargable predicate (SARG). There are three types of sargable predicates:
All predicates for which method 2, described above, cannot be applied are referred to as residual predicates (RES). In this case, the rows must be returned one at a time and evaluated, as described in method 1. Sargable Predicates vs. Residual Predicates
Based on the definitions and descriptions given above, it is evident that residual predicates are systematically more expensive in their processing than are sargable predicates. However, it is sometimes impossible to "push down" the evaluation of a predicate to make it sargable, and there are two main reasons for this:
Analyzing Problem SQL Statements
Performance problems in a database can be a result of:
The next section will focus on analyzing SQL statements to determine what, if any, indexes should be created to help optimize problem SQL statements. There are numerous methods that can be used to analyze the executing SQL statements to determine which statements, if any, need to be investigated. These include:
The event monitor tracks each execution of an SQL statement, whereas the snapshot monitor and table function provide one entry for each unique statement, along with information such as the number of times the statement was executed, the total execution time, the number of sorts performed, etc. To obtain an SQL snapshot or get valid information from the dynamic SQL table function, the STATEMENT snapshot monitor switch must be turned on. If the switch is turned on at the session level, the get snapshot or table function must be run in the same session. If the switch is set at the DB2 instance level, the get snapshot or table function can be run from any session with a connection to the database. To set the STATEMENT monitor switch at the instance level, first run the following command, then stop and restart the DB2 instance. update dbm cfg using dft_mon_stmt on Analyzing the output of the SQL statement snapshot or table function can take a good deal of time, especially if a large number of statements have been run. A spreadsheet can be used to aid in the analysis of this information; however, the output of the SQL snapshot must be parsed and converted to a format that is readable by a spreadsheet first. Appendix B, parseSQL.pl, is a Perl program that will parse the output of the SQL snapshot and convert the information to comma-delimited format so it can be imported into a spreadsheet. The output of the SQL snapshot table function can be either exported directly to ASCII delimited format to be analyzed using the same spreadsheet or inserted into a table and analyzed using SQL statements. To make the insertion into a table easiest , export the table in IXF format, so that the table to store the data does not need to exist. For example: export to try.ixf of ixf SELECT * FROM TABLE(SNAPSHOT_DYN_SQL('SAMPLE',-1 )) as SNAPSHOT_DYN_SQL To import this into a table to be analyzed, the table need not exist because the IXF file contains the table definition. The command to create a new table and import the data is as follows : import from try.ixf of ixf create into sqlsnap To capture the SQL snapshot using the table function and create an ASCII delimited file for analysis using a spreadsheet, use the following command: export to try.del of del SELECT * FROM TABLE(SNAPSHOT_DYN_SQL('SAMPLE',-1 )) as SNAPSHOT_DYN_SQL The information reported by the SQL snapshot or table function that is important when looking for poorly performing SQL is:
A statement that is run once a day and takes 10 seconds is much less important than one that runs 1,000 times a day and takes 10 seconds to run each time. The Rows read gives an indication of the efficiency of the statement. A large number of rows read normally indicates that a table scan is being performed to resolve the query. Even if the query runs quickly, performing table scans is not efficient, and the query should be analyzed to determine whether it is indexed correctly. The rows written and statement sorts elements of the snapshot are normally examined together. Overflowed sorts cause rows to be written to temporary tables in the database; therefore, if the statement is causing sorts to occur and there are no rows written, the sort is occurring within the sort heap. If there are rows written, the sort very likely overflowed. Regardless of whether or not the sort overflowed, all statements with sorts should be examined. Another item to examine is the average number of sorts per execution. To calculate this, divide the statement sorts by the number of executions, as follows: Sorts per execution = (Statement sorts / Number of executions)
NOTE Any statement with more than one sort per execution should be examined immediately.
The total execution time is for all executions of the statement. Even though a statement may have a total execution time of 5,000 seconds, if it was run 20,000 times, this is not excessive. However, if it was run only once, this is very excessive. Therefore, it is important to examine the average execution time as follows: Avg execution time = (Total execution time / Number of executions) For example, in the following SQL snapshot entry, the total execution time is 2.25 seconds, which seems like a lot for a simple statement like this. However, the statement was executed over 15,000 times, so the average execution time is very low. There are no sorts happening when this statement is run, so in this case everything appears to be fine. Number of executions = 15616 Number of compilations = 1 Worst preparation time (ms) = 11 Best preparation time (ms) = 11 Internal rows deleted = 0 Internal rows inserted = 0 Rows read = 0 Internal rows updated = 0 Rows written = 0 Statement sorts = 0 Total execution time (sec.ms) = 2.254846 Total user cpu time (sec.ms) = 0.980000 Total system cpu time (sec.ms) = 0.100000 Statement text = SELECT SUM(B.DEDUCTION) ... For the following statement, the total execution time is 23.35 seconds for 29 executions. Although each execution is under one second, the most interesting piece of information in this snapshot is the fact that the 29 executions are causing 6604 sorts to occur. Although the statement may not be taking an excessive amount of time to run, it is using a great deal of resources to perform this many sorts and should be examined to determine whether the underlying tables are indexed correctly. Number of executions = 29 Number of compilations = 1 Worst preparation time (ms) = 35 Best preparation time (ms) = 35 Internal rows deleted = 0 Internal rows inserted = 0 Rows read = 4 Internal rows updated = 0 Rows written = 0 Statement sorts = 6604 Total execution time (sec.ms) = 23.352040 Total user cpu time (sec.ms) = 23.590000 Total system cpu time (sec.ms) = 0.000000 Statement text = SELECT PAGE_NUM ... Steps in Analyzing a Poorly Performing Query
When a poorly performing SQL statement is found, the following steps can be used to help determine the cause of the problem and find a solution:
Determining Good Indexes
An indicator that the current indexes on a table may not be defined correctly (or that more indexes are needed) is when the SQL statement is performing one or more sorts per execution. Indexing Example 1
For the piece of the SQL snapshot shown below, a sort is required each time the statement is executed. Number of executions = 378 . . . Statement sorts = 378 Statement text = SELECT custkey, custname FROM user1.customer ORDER BY custkey DESC To determine whether the table has current statistics, execute the following statement: select stats_time from syscat.tables where tabname='CUSTOMER' If the output of the above statement is as follows: STATS_TIME -------------------------- 2002-10-06-09.45.13.468000 1 record(s) selected. and the current date is October 6, unless there has been a large number of inserts or deletes since 9:45 A.M. , the statistics are current. The next step is to examine the current indexes defined on the CUSTOMER table, as follows: describe indexes for table user1.customer View the output of the above statement, as follows: [View full width]
[View full width] Index schema Index name Unique rule Number of columns -------------- ---------------- -------------- -------------- 0 record(s) selected. SQL0100W No row was found for FETCH, UPDATE or DELETE; or the result of a query is anIn this case, there are no indexes that can be used by the optimizer to retrieve the data columns or to help order the data. Therefore, a good first step is to examine the statement to determine what indexes may help. In this example, there are two columns being selected from the table; therefore, an index can be created on these two columns. The ORDER BY clause in the statement is descending (DESC); therefore, the index should be created either with the CUSTKEY column in descending order or using the ALLOW REVERSE SCANS option. The order of the key columns in the index is also important. The key column with the highest cardinality (i.e., most unique values) should be the first key in the index because it is the most selective and will narrow down the search must faster. To determine the cardinality of the key columns for the above select statement, use the following: select colname, colcard from syscat.columns where tabname='CUSTOMER' The output of the above statement looks like: COLNAME COLCARD ------- ------- CUSTKEY 1709 CUSTNAME 203 ADDRESS 609 3 record(s) selected. In this case, the customer key column (CUSTKEY) has the highest cardinality and, therefore, should be the first column in the index. The CUSTKEY column is also a unique identifier within this table. Thus, there are two options in this case:
Therefore, any of the following four indexes would help improve the performance of this statement: create index cust_ix on customer (custkey desc, custname) create index cust_ix on customer (custkey, custname) allow reverse scans create unique index cust_ix on customer (custkey desc) include (custname) create unique index cust_ix on customer (custkey) include (custname) allow reverse scans The fact that the CUSTKEY column is unique also helps the optimizer choose better access plans; therefore, if a unique index can be used, this is preferable. Because the table has current statistics, when the index is created, collect the index statistics at the same time, as follows: create unique index cust_ix on customer (custkey) include (custname) allow reverse scans collect detailed statistics After creating the index and rerunning the application, the SQL snapshot information for this statement looks like the following: Number of executions = 378 . . . Statement sorts = 0 Statement text = SELECT custkey, custname FROM user1.customer ORDER BY custkey DESC Indexing Example 2
The following statement has been found to be running slowly: select empid, empname from employees order by empid If there are no indexes on the table, the first step should be to create one or more indexes to help select the EMPID and EMPNAME columns and sort the EMPID column. Before adding the index, the access plan for the statement looks like the following: Access Plan: ----------- Total Cost: 4709.851 Query Degree: 1 Rows RETURN ( 1) Cost I/O 9600 TBSCAN ( 2) 470.851 109 9600 SORT ( 3) 470.849 109 9600 TBSCAN ( 4) 454.206 109 9600 TABLE: USER1 EMPLOYEES Because the EMPID column is unique in this table, create an index and capture index statistics at the same time, as follows: create unique index emp_ix on employees (empid) collect detailed statistics Access Plan: ----------- Total Cost: 650.384 Query Degree: 1 Rows RETURN ( 1) Cost I/O 109 FETCH ( 2) 650.384 2 /----+---\ 109 109 IXSCAN TABLE: USER1 ( 3) EMPLOYEES 0.248997 109 INDEX: USER1 EMP_IX The new index has eliminated the sort from the access plan and has greatly reduced the query's cost; however, the query can still be improved. Because the query is accessing only two columns, drop the index above and create a new index with both columns to eliminate the need to FETCH the rows from the table, as follows: create unique index emp_ix on employees (empid) include (empname) collect detailed statistics In this case, the index contains all of the columns being selected, so the entire query should be able to be handled by the index, without needing to read data from the table. This is known as index-only access and is the fastest type of data access. The new access plan is shown below: Access Plan: ----------- Total Cost: 50.2709 Query Degree: 1 Rows RETURN ( 1) Cost I/O 109 IXSCAN ( 2) 50.2709 2 109 INDEX: USER1 EMP_IX In an index-only access plan, the index is scanned (IXSCAN) and the data returned either to the application or to another database operation other than a fetch from the base table, as shown previously. Writing Better SQL Statements
When the DB2 optimizer compiles SQL statements, it can rewrite them into a form that can be optimized more easily. The optimizer then generates a number of alternative execution plans for satisfying the SQL statement. It estimates the execution cost of each alternative plan, using the statistics for tables, indexes, columns, and functions, and chooses the plan with the lowest estimated execution cost. The optimizer must choose an access plan that will produce the result set for the query that was submitted. Therefore, as noted in the following guidelines, the query should be written to retrieve only the data that is required. This helps to ensure that the optimizer can choose the best access plan. Some guidelines for writing efficient SELECT statements are:
Specify Only Needed Columns in the Select List
Specify only those columns that are needed in the select list. Although it may be simpler to specify all columns with an asterisk (*), needless processing and returning of unwanted columns can result in slower response time. Limit the Number of Rows Returned by Using Predicates
Limit the number of rows selected by using predicates to restrict the answer set to only those rows that you require. There are four types of predicates, each with its own distinct method of processing and associated cost. The type of predicate is determined by how and when that predicate is used in the evaluation process. These predicate types are listed below, ordered in terms of performance, starting with the most favorable:
Range delimiting predicates are those used to define the start key and/or stop key for an index search. Index SARGable predicates are not used to define the start/stop key for an index search but can be evaluated from the index because the columns involved in the predicate are part of the index key. For example, assume a table named STAFF and an index defined on the columns NAME, DEPT, and SVC_YEARS in the table. For the following SQL statement: SELECT name, job, salary FROM staff WHERE name = 'John' and dept = 10 and svc_years > 5 The predicates name='John' and dept=10 would be range delimiting predicates, whereas svc_years > 5 would be evaluated as an index SARGable predicate, because the start key value for the index search cannot be determined by this information only. The start key value may be 6, 10, or even higher. If the statement were written as follows: SELECT name, job, salary FROM staff WHERE name = 'John' and dept = 10 and svc_years >=5 the svc_years >=5 clause can now be evaluated using a range delimiting predicate, because the index search can start from the key value 5.
NOTE SARGable refers to something that can be used as a search argument.
DB2 will make use of the index in evaluating these predicates, rather than reading the base table. These range delimiting predicates and index SARGable predicates reduce the number of data pages that must be accessed by reducing the set of rows that need to be read from the table. Index SARGable predicates do not affect the number of index pages that are accessed. Data SARGable predicates are predicates that cannot be evaluated using the index and must be evaluated by reading the data. Typically, these predicates require the access of individual rows from a base table. DB2 will retrieve the columns needed to evaluate the predicate, as well as any others to satisfy the columns in the SELECT list that could not be obtained from the index. For example, assume that a table named PROJECT has an index defined on the PROJNUM column. For the following query: SELECT projnum, projname, repemp FROM project WHERE dept='D11' ORDER BY projnum The predicate dept='D11' will be processed as data SARGable, because there are no indexes defined on the DEPT column, and the base table must be accessed to evaluate that predicate. Residual predicates, typically, are those that require I/O beyond the simple accessing of a base table. Examples of residual predicates include those using quantified subqueries (subqueries with ANY, ALL, SOME, or IN) or those that require reading Long Varchar or LOB data. Residual predicates are the most expensive of the four types of predicates. Because residual predicates and data SARGable predicates require more resources and cost more than range delimiting predicates and index SARGable predicates, limit the number of rows qualified by range delimiting predicates and index SARGable predicates whenever possible. Specify the FOR UPDATE Clause
If an application will update fetched data, specify the FOR UPDATE clause in the SELECT statement of the cursor definition. By doing this, DB2 can choose appropriate locking levels [i.e., a U (update) lock instead of an S (shared) lock] to save the cost to perform lock conversion when the UPDATE is performed. Specify the OPTIMIZE FOR n ROWS Clause
Specify the OPTIMIZE FOR n ROWS clause in the SELECT statement when the number of rows required is less than the total number of rows that could be returned. Using the OPTIMIZE FOR clause influences query optimization based on the assumption that the first n rows should be retrieved quickly, whereas the application can wait for the remaining rows. Row blocking is a technique that reduces overhead by retrieving a number of rows in a single operation. These rows are stored in a cache, and each FETCH request in the application gets the next row from the cache. The OPTIMIZE FOR n ROWS clause will determine the number of records to be blocked. For example, if OPTIMIZE FOR 10 ROWS is specified, the block of rows returned to the client will contain ten rows. SELECT projno,projname,repemp FROM project WHERE deptno='D11' OPTIMIZE FOR 10 ROWS Specify the FETCH FIRST n ROWS ONLY Clause
Specify the FETCH FIRST n ROWS ONLY clause if the application should not retrieve more than n rows, regardless of how many rows there might be in the result set when this clause is not specified. This clause cannot be specified with the FOR UPDATE clause. For example, the following statement will retrieve the first five rows from the result set, not the entire result set: SELECT projno,projname,repemp FROM project WHERE deptno='D11' FETCH FIRST 5 ROWS ONLY The FETCH FIRST n ROWS ONLY clause also determines the number of rows that are blocked in the communication buffer. If the FETCH FIRST n ROWS ONLY and OPTIMIZE FOR n ROWS clauses are both specified, the lower of the two values is used to determine the number of rows to be blocked.
NOTE The OPTIMIZE FOR n ROWS clause does not limit the number of rows that can be fetched or affect the result in any way, other than performance. Using OPTIMIZE FOR n ROWS can improve the performance if no more than n rows are retrieved but may degrade performance if more than n rows are retrieved.
Specify the FOR FETCH ONLY Clause
When fetching rows, if the application will not update the rows retrieved, specify the FOR FETCH ONLY clause in the SELECT statement. This can improve performance by allowing the query to take advantage of record blocking. This can also improve concurrency because exclusive locks will never be obtained when this clause is specified.
NOTE The FOR READ ONLY clause is equivalent to the FOR FETCH ONLY clause.
Avoid Data Type Conversions
Data type conversions (particularly numeric data type conversions) should be avoided whenever possible. When two values are compared, it is more efficient to compare rows with the same data type. For example, TableA and TableB are being joined using col1 in Table1 and col2 in Table2. If the columns col1 and col2 are the same data type, no data type conversion is required. If they are not the same data type, a data type conversion occurs to compare values at run time, and this will affect the performance of the query. Additional Ways to Help the Optimizer
Marking Tables as Volatile
In some situations, scratch tables are used during processing that can grow from being empty to thousands (or even hundreds of thousands) of rows and shrink back to no rows in a very short period of time. It is impossible in this case to keep the statistics current, which can make it difficult for the optimizer to choose optimal access plans. To account for a table that grows and shrinks rapidly during processing, the table can be marked as VOLATILE. If the table is marked as VOLATILE, the optimizer will use an index to scan the table (if one exists), regardless of the current statistics, if the index can provide index-only access or can apply a predicate during the index scan. Use the Selectivity Clause
The DB2 optimizer uses the statistics to estimate the selectivity of each clause in an SQL statement. In most cases, when given current statistics, the optimizer is able to estimate the selectivity accurately. However, when a host variable is used in the predicate, the estimate of the clause selectivity can be wrong, and the optimizer can choose inefficient access plans. The selectivity clause can be specified in the SQL statement when:
In order for the optimizer to consider the selectivity clause, the registry variable SELECTIVITY must be set to YES. Below is an example of how the selectivity clause can be used: SELECT c1, c2 FROM t1 WHERE c1 = :hv1 SELECTIVITY 1 |