Query Executes in Seconds with the RULE Hint and Several Minutes Otherwise – What Would You Do?

10 01 2011

January 10, 2011

An interesting hypothetical question was posed on the OTN forums yesterday. Consider this situation. You are using Oracle Database 11.2.0.2 and you find a query that is executing much slower than expected, typically requiring 10 to 15 minutes to execute. You add a RULE hint to the query and find that the query completes in just a couple of seconds. How would you do to address this issue?

—-

To help your thought process, consider the following test case that I included in the OTN thread. The table definition:

CREATE TABLE T5 (
  C1 NUMBER,
  C2 NUMBER,
  C3 NUMBER,
  C4 NUMBER,
  C5 NUMBER,
  C6 NUMBER,
  C7 NUMBER,
  C8 NUMBER,
  C9 VARCHAR2(50),
  C10 VARCHAR2(50));

INSERT INTO
  T5
SELECT
  ROWNUM,
  ROWNUM,
  MOD(ROWNUM,50),
  MOD(ROWNUM,40),
  MOD(ROWNUM,100),
  ROUND(ROWNUM/1000),
  ROUND(ROWNUM/950),
  ROUND(ROWNUM/600),
  RPAD(CHR(65 + MOD(ROWNUM-1,26)),50,CHR(65 + MOD(ROWNUM-1,26))),
  RPAD(CHR(65 + MOD(ROWNUM-1,26)),50,CHR(65 + MOD(ROWNUM-1,26)))
FROM
  DUAL
CONNECT BY
  LEVEL<=1000000;

COMMIT;

CREATE INDEX IND_T5_C10_F ON T5(LOWER(C10)); 

EXEC DBMS_STATS.GATHER_TABLE_STATS(OWNNAME=>USER,TABNAME=>'T5',CASCADE=>TRUE)

Now, let’s put together a test case script that uses the above table:

ALTER SYSTEM FLUSH BUFFER_CACHE;
ALTER SYSTEM FLUSH BUFFER_CACHE;

SET TIMING ON
SET AUTOTRACE TRACEONLY STATISTICS EXPLAIN
SET ARRAYSIZE 1000

SELECT 
  * 
FROM 
  T5 
WHERE 
  LOWER(C10)=LPAD('a',50,'a'); 

SELECT /*+ RULE */
  * 
FROM 
  T5 
WHERE 
  LOWER(C10)=LPAD('a',50,'a'); 

SELECT /*+ __FAST=TRUE */
  * 
FROM 
  T5 
WHERE 
  LOWER(C10)=LPAD('a',50,'a');

SELECT /*+ INDEX(T5 IND_T5_C10_F) */
  * 
FROM 
  T5 
WHERE 
  LOWER(C10)=LPAD('a',50,'a');

In the above, we have an unhinted query, the same query with a RULE hint, the same query with the __FAST=TRUE hint, and the same query with an index hint. Which query will execute the fastest, and why? Let’s execute the test case script to find out (note that your results could be very different from my results):

SQL> SELECT
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    LOWER(C10)=LPAD('a',50,'a');

38462 rows selected.

Elapsed: 00:00:00.57

Execution Plan
----------------------------------------------------------
Plan hash value: 2002323537

--------------------------------------------------------------------------
| Id  | Operation         | Name | Rows  | Bytes | Cost (%CPU)| Time     |
--------------------------------------------------------------------------
|   0 | SELECT STATEMENT  |      | 38462 |  6911K|  5247   (1)| 00:01:03 |
|*  1 |  TABLE ACCESS FULL| T5   | 38462 |  6911K|  5247   (1)| 00:01:03 |
--------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------
   1 - filter(LOWER("C10")='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
              aaaaaa')

Statistics
----------------------------------------------------------
         99  recursive calls
          0  db block gets
      19441  consistent gets
      19295  physical reads
          0  redo size
    1178619  bytes sent via SQL*Net to client
        937  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
         10  sorts (memory)
          0  sorts (disk)
      38462  rows processed

The unhinted version completed in 0.57 seconds, and used a full table scan (your execution time may be a bit longer).

—-

SQL> SELECT /*+ RULE */
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    LOWER(C10)=LPAD('a',50,'a');

38462 rows selected.

Elapsed: 00:00:00.31

Execution Plan
----------------------------------------------------------
Plan hash value: 2002323537

----------------------------------
| Id  | Operation         | Name |
----------------------------------
|   0 | SELECT STATEMENT  |      |
|*  1 |  TABLE ACCESS FULL| T5   |
----------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------
   1 - filter(LOWER("C10")='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
              aaaaaa')

Note
-----
   - rule based optimizer used (consider using cbo)

Statistics
----------------------------------------------------------
          1  recursive calls
          0  db block gets
      19327  consistent gets
          0  physical reads
          0  redo size
    1178619  bytes sent via SQL*Net to client
        937  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

The version that was hinted to use the RULE based optimizer completed in 0.31 seconds (nearly twice as fast as the unhinted version), and also used a full table scan.

—-

SQL> SELECT /*+ __FAST=TRUE */
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    LOWER(C10)=LPAD('a',50,'a');

38462 rows selected.

Elapsed: 00:00:00.28

Execution Plan
----------------------------------------------------------
Plan hash value: 2002323537

--------------------------------------------------------------------------
| Id  | Operation         | Name | Rows  | Bytes | Cost (%CPU)| Time     |
--------------------------------------------------------------------------
|   0 | SELECT STATEMENT  |      | 38462 |  6911K|  5247   (1)| 00:01:03 |
|*  1 |  TABLE ACCESS FULL| T5   | 38462 |  6911K|  5247   (1)| 00:01:03 |
--------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------
   1 - filter(LOWER("C10")='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
              aaaaaa')

Statistics
----------------------------------------------------------
          1  recursive calls
          0  db block gets
      19326  consistent gets
          0  physical reads
          0  redo size
    1178619  bytes sent via SQL*Net to client
        937  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

The version hinted with __FAST=TRUE (a completely fake hint) completed in 0.28 seconds (0.03 seconds faster than the RULE hinted version) and also used a full table scan.

—-

SQL> SELECT /*+ INDEX(T5 IND_T5_C10_F) */
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    LOWER(C10)=LPAD('a',50,'a');

38462 rows selected.

Elapsed: 00:00:00.17

Execution Plan
----------------------------------------------------------
Plan hash value: 1769636183

--------------------------------------------------------------------------------------------
| Id  | Operation                   | Name         | Rows  | Bytes | Cost (%CPU)| Time     |
--------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT            |              | 38462 |  6911K| 20458   (1)| 00:04:06 |
|   1 |  TABLE ACCESS BY INDEX ROWID| T5           | 38462 |  6911K| 20458   (1)| 00:04:06 |
|*  2 |   INDEX RANGE SCAN          | IND_T5_C10_F | 40009 |       |   350   (0)| 00:00:05 |
--------------------------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------
   2 - access(LOWER("C10")='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa')

Statistics
----------------------------------------------------------
          1  recursive calls
          0  db block gets
      19576  consistent gets
        352  physical reads
          0  redo size
    1178628  bytes sent via SQL*Net to client
        937  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

The index hinted version of the query completed in 0.17 seconds, which is about twice as fast as the query with the RULE hint and the unhinted version.

—-

With this knowledge, certain questions might come to mind:

Why was the first query slower than the third query?
Why did the optimizer not automatically select the index when that access path was faster? What do I need to check to see why the index was not selected?
Why didn’t the RULE based optimizer select the index access path?
How might the test script results change if we were to replace the number 50 in the script with either the number 20 or the number 10?
How might the test script results change if someone had altered the OPTIMIZER_INDEX_COST_ADJ parameter?

—-

The hypothetical question was essentially very simple. How would you guide Oracle’s optimizer to find the optimal execution path?

—-

Edit January 10, 2011: Note that the phrase “How would you do to address this issue” in the initial paragraph is intentionally left undefined, and left for your interpretation. “This issue” could very well have multiple intended meanings, depending on how you read the paragraph.

Actions

Information

Date : January 10, 2011
Categories : Performance

19 responses

10 01 2011

Martin Berger (15:49:28) :

Charles,
Why on earth do you flush buffer_cache twice? I know about the habit to ‘sync’ os filesystems twice, but that’s something totally different.
1) in the first you have 19295 physical reads, in the 3rd zero (the same to the 2nd).
2) as I did not reproduce your test-case I’m not 100% sure. Based on your question 4) I assume dbms_stats does not store valid low and high value for such long strings (I know about this behaviour in histograms, but again just guess on general statistics) and therefore the optimiser has not good enough informations. Or are there histograms? have to check default_method_opt for 11.2.0.2 and what it means for FBIs and histograms.
3) RULE just does not know what a FBI is. In this case I assumed, the optimiser would just ignore the RULE hint and do the best it can (ALL_ROWS, isn’t it?). Or does it just ignore all objects it does not know until it finishes or stuck (and then ‘fall back to ALL_ROWS’)? => check 10053 to verify that.
4) next guess: as a) you used only single byte characters in C10 and b) the max hist length of a string is approx 15 byte and c) you can not use AL16UTF16 as default character set => with “10” the histograms will be useful and therefore the optimiser seeded with better data.
5) I do not do such cruel things. 😉

Do you thing the 0.03 performance increase of RULE against __FAST=TRUE is just random or any additional performance gain?

To answer “How would you do to address this issue” – it depends on the person who brings it to me! I do not need to discuss the status of RULE with you, also about caches, physical reads, whatever. With others I might have to prepare a good foundation before going into discussions.
The other question: “How would you guide Oracle’s optimizer to find the optimal execution path?” I would try to set the statistics manually so the optimiser have good informations about selectivity of C10.

Martin

10 01 2011

Charles Hooper (17:11:56) :

Why flush the buffer cache twice? I do not remember the exact reason, but I believe that a trustworthy source stated that one or more buffers could be “pinned” by a session, and therefore could not be flushed from the buffer cache on the first try. The second flush attempts to flush those blocks in memory that were previously pinned by sessions.

11 01 2011

Charles Hooper (08:28:53) :

Martin,

1) The performance improvement caused by the addition of the RULE hint could just be an illusion, for the reason that you identified.
2) Histograms – I had not even considered histograms. I am guessing that the optimizer did not collect histograms in this case, but the presence of histograms might very well apply to other cases.
3) It is true that the RULE based optimizer does not know what to do with a function based index, although Martin Preiss (in a comment below) found that the RULE based optimizer in Oracle Database 11.1.0.7 *will* use the function based index. So far I have tested Oracle Database 10.2.0.2, 10.2.0.5, 11.1.0.7, 11.2.0.1, and 11.2.0.2 – only 11.1.0.7 used the function based index with the RULE based optimizer. If I set a default parallel degree for the table the cost based (ALL_ROWS) optimizer will be used when the RULE hint is specified.
4) Another very good point. I used a single byte characterset – while it probably would not make a difference in this test case, it might make a difference in production system performance issues.
5) Nice answer. OK, then set the MBRC system statistic to a very low value, such as 1 or 2 (then be certain to change it back afterward).

If I re-executed the script, I might see different results. The result with the “__FAST=TRUE” hint, I believe, was just a lucky coincidence.

Good idea to work with the statistics manually. Any other ideas?

10 01 2011

Martin Preiss (16:08:25) :

Charles,
when I saw your example, I thought: “physical reads” – and that the rbo would not be able to use a function based index. But then I made the test in 11.1.0.7 and got a strange result:

SQL> SELECT /*+ RULE */
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    LOWER(C10)=LPAD('a',50,'a');

38462 Zeilen ausgewählt.

Abgelaufen: 00:00:00.60

Ausführungsplan
----------------------------------------------------------
Plan hash value: 1769636183

----------------------------------------------------
| Id  | Operation                   | Name         |
----------------------------------------------------
|   0 | SELECT STATEMENT            |              |
|   1 |  TABLE ACCESS BY INDEX ROWID| T5           |
|*  2 |   INDEX RANGE SCAN          | IND_T5_C10_F |
----------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------

   2 - access(LOWER("C10")='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
              aaaaaa')

Note
-----
   - rule based optimizer used (consider using cbo)


Statistiken
----------------------------------------------------------
          1  recursive calls
          0  db block gets
       9652  consistent gets
       9586  physical reads
          0  redo size
    1085822  bytes sent via SQL*Net to client
        768  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

My first idea was: perhaps autotrace is lying – what says SQL_TRACE:

Misses in library cache during parse: 1
Optimizer mode: RULE
Parsing user id: 174  

Rows     Row Source Operation
-------  ---------------------------------------------------
  38462  TABLE ACCESS BY INDEX ROWID T5 (cr=9652 pr=9586 pw=0 time=0 us)
  38462   INDEX RANGE SCAN IND_T5_C10_F (cr=206 pr=167 pw=0 time=0 us)(object id 202338)

So I guess that the rbo can use a function based index and indeed Jonathan Lewis brings an example on his website: http://www.jlcomp.demon.co.uk/no_fbi.html. He shows there that the rbo can use an index, if the first column in it is not virtual – but he also explains: “that if the first column is a virtual column the rule-based optimizer won’t use the index.” So I still have no fitting explanation for my result.

For the sake of completeness I also made the test on 11.2 and there I got the expected FTS.

Regards
Martin

10 01 2011

Charles Hooper (17:02:50) :

Martin,

That is a very interesting result that you found on 11.1.0.7! Tom Kyte’s book say that the RULE based optimizer cannot use function based (single column) indexes:
http://books.google.com/books?id=8_WChqD3nc4C&pg=PA288#v=onepage&q&f=false
http://books.google.com/books?id=TmPoYfpeJAUC&pg=PA456#v=onepage&q&f=false
http://books.google.com/books?id=mV6b43HuEAkC&pg=PA461#v=onepage&q&f=false

I might have to do some testing to see if I can find the cause for the use of the index. I wonder if it is possible that the cost-based optimizer was actually used even though the rule based optimizer note appeared (I assume that this is would be considered a bug).

10 01 2011

Charles Hooper (19:21:44) :

Martin,

Nice catch. I can confirm that in 10.2.0.5, 11.2.0.1, and 11.2.0.2 the RULE based optimizer cannot use the IND_T5_C10_F function based index, while the RULE based optimizer in 11.1.0.7 is able to use the function based index (this happens when the RULE optimizer is hinted and also when the OPTIMIZER_MODE is set to RULE at the session level.

Elapsed: 00:00:00.26
 
Execution Plan
----------------------------------------------------------
Plan hash value: 1769636183
 
----------------------------------------------------
| Id  | Operation                   | Name         |
----------------------------------------------------
|   0 | SELECT STATEMENT            |              |
|   1 |  TABLE ACCESS BY INDEX ROWID| T5           |
|*  2 |   INDEX RANGE SCAN          | IND_T5_C10_F |
----------------------------------------------------
 
Predicate Information (identified by operation id):
---------------------------------------------------
   2 - access(LOWER("C10")='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
              aaaaaa')
 
Note
-----
   - rule based optimizer used (consider using cbo)
 
Statistics
----------------------------------------------------------
          1  recursive calls
          0  db block gets
      19583  consistent gets
        345  physical reads
          0  redo size
    1177860  bytes sent via SQL*Net to client
        918  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

From a 10053 trace we see something slightly different in the predicate information section:

Considering Query Transformations on query block SEL$1 (#0)
**************************
Query transformations (QT)
**************************
CBQT bypassed for query block SEL$1 (#0): rule-based mode.
CBQT: Validity checks failed for bmqha64ws3bd9.
CSE: Considering common sub-expression elimination in query block SEL$1 (#0)
*************************
Common Subexpression elimination (CSE)
*************************
CSE:     CSE bypassed on query block SEL$1 (#0): rule-based mode.
OBYE:   Considering Order-by Elimination from view SEL$1 (#0)
***************************
Order-by elimination (OBYE)
***************************
OBYE:     OBYE bypassed: no order by to eliminate.
CVM: Considering view merge in query block SEL$1 (#0)
query block SEL$1 (#0) unchanged
Considering Query Transformations on query block SEL$1 (#0)
**************************
Query transformations (QT)
**************************
CBQT bypassed for query block SEL$1 (#0): rule-based mode.
CBQT: Validity checks failed for bmqha64ws3bd9.
CSE: Considering common sub-expression elimination in query block SEL$1 (#0)
*************************
Common Subexpression elimination (CSE)
*************************
CSE:     CSE bypassed on query block SEL$1 (#0): rule-based mode.
SU: Considering subquery unnesting in query block SEL$1 (#0)
********************
Subquery Unnest (SU)
********************
SJC: Considering set-join conversion in query block SEL$1 (#0)
*************************
Set-Join Conversion (SJC)
*************************
SJC: not performed
PM: Considering predicate move-around in query block SEL$1 (#0)
**************************
Predicate Move-Around (PM)
**************************
PM:     PM bypassed: rule-based mode.
query block SEL$1 (#0) unchanged
FPD: Considering simple filter push in query block SEL$1 (#0)
LOWER("T5"."C10")='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa'
apadrv-start sqlid=13392027648474066345
  :
    call(in-use=1008, alloc=16344), compile(in-use=55616, alloc=61080), execution(in-use=2424, alloc=4032)
 
*******************************************
Peeked values of the binds in SQL statement
*******************************************
 
Final query after transformations:******* UNPARSED QUERY IS *******
SELECT "T5"."C1" "C1","T5"."C2" "C2","T5"."C3" "C3","T5"."C4" "C4","T5"."C5" "C5","T5"."C6" "C6","T5"."C7" "C7","T5"."C8" "C8","T5"."C9" "C9","T5"."C10" "C10" FROM "TESTUSER"."T5" "T5" WHERE LOWER("T5"."C10")='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa'
apadrv-end
          :
    call(in-use=1056, alloc=16344), compile(in-use=57248, alloc=61080), execution(in-use=2504, alloc=4032)
 
Starting SQL statement dump
 
user_id=178 user_name=TESTUSER module=SQL*Plus action=
sql_id=bmqha64ws3bd9 plan_hash_value=1769636183 problem_type=0
----- Current SQL Statement for this session (sql_id=bmqha64ws3bd9) -----
SELECT /*+ RULE */
  *
FROM
  T5
WHERE
  LOWER(C10)=LPAD('a',50,'a')
sql_text_length=69
sql=SELECT /*+ RULE */
  *
FROM
  T5
WHERE
  LOWER(C10)=LPAD('a',50,'a')
----- Explain Plan Dump -----
----- Plan Table -----
 
============
Plan Table
============
---------------------------------------------------+-----------------------------------+
| Id  | Operation                    | Name        | Rows  | Bytes | Cost  | Time      |
---------------------------------------------------+-----------------------------------+
| 0   | SELECT STATEMENT             |             |       |       |       |           |
| 1   |  TABLE ACCESS BY INDEX ROWID | T5          |       |       |       |           |
| 2   |   INDEX RANGE SCAN           | IND_T5_C10_F|       |       |       |           |
---------------------------------------------------+-----------------------------------+
Predicate Information:
----------------------
2 - access("T5"."SYS_NC00011$"='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa')
 
Content of other_xml column
===========================
  db_version     : 11.1.0.7
  parse_schema   : TESTUSER
  plan_hash      : 1769636183
  plan_hash_2    : 3656449929
  Outline Data:
  /*+
    BEGIN_OUTLINE_DATA
      IGNORE_OPTIM_EMBEDDED_HINTS
      OPTIMIZER_FEATURES_ENABLE('11.1.0.7')
      DB_VERSION('11.1.0.7')
      RBO_OUTLINE
      OUTLINE_LEAF(@"SEL$1")
      INDEX_RS_ASC(@"SEL$1" "T5"@"SEL$1" "IND_T5_C10_F")
    END_OUTLINE_DATA
  */
 
...
 
Query Block Registry:
SEL$1 0x1a4e6e40 (PARSER) [FINAL]
 
:
    call(in-use=4040, alloc=32712), compile(in-use=93032, alloc=154544), execution(in-use=11064, alloc=12144)
 
End of Optimizer State Dump
Dumping Hints
=============
  atom_hint=(@=00000002566970D8 err=0 resol=0 used=0 token=455 org=1 lvl=1 txt=RULE )

Note in the above that the RULE based optimizer was able to use the hidden column used by the function based index:

access("T5"."SYS_NC00011$"='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa')

The 10053 trace file from 11.2.0.1 showed essentially the same information as 11.1.0.7 during the hard parse.

10 01 2011

Charles Hooper (22:02:24) :

Metalink (MOS) offers one more confirmation that the rule based optimizer should not have been able to use the IND_T5_C10_F function based index, as had happened in 11.1.0.7:
Doc ID 66277.1 – “Concepts and Usage of Function Based Indexes”
Doc ID 189702.1 – “Rule Based Optimizer is to be Desupported in Oracle10g” (also lists other features that are not supported by the rule based optimizer)

—-

This Metalink document appears to be just what the OP of the OTN thread wanted to know:
Doc ID 247743.1 – “Guidelines for Tuning: Query under Rule is fast Query under Cost is Slow”

10 01 2011

Martin Preiss (16:11:51) :

my code tags also don’t work the way I expected …

10 01 2011

Charles Hooper (16:43:27) :

For some reason this blog accepts code tag in square brackets [] and not in angled brackets .

10 01 2011

Jeremy Kendrick (16:53:08) :

I’m curious about the effect of running the queues in reverse order would have on them. Such as running the index one first. Would that then still be the fastest one? My first thought (And I am probably wrong) is that since you are asking for the same data 4 times fairly quickly, wouldn’t it already be in a queue with a, I want to say a high priority value, but I’m not sure of the correct terminology at this time.

10 01 2011

Charles Hooper (17:08:47) :

Jeremy,

Excellent point. The SGA_TARGET was set to 12G, so the order of the queries probably would have made a difference because the previously read blocks would likely already be in the buffer cache and would not need to be re-read from disk. In Oracle Database 11.1 and above there is also the risk that the full table scans will be performed with direct path reads (the Oracle buffer cache is not used in such a case) so there is a chance that the first and third executions could have completed in the same amount of time.

10 01 2011

Kate Johnson (17:09:51) :

Couldn’t the SELECT STATEMENT as
“…WHERE C10=LPAD(‘a’,50,’a’);” induce the index to be used by the optimizer?

Removing the LOWER function from C10.

10 01 2011

Charles Hooper (17:30:27) :

Hi Kate,

You are correct that using the LOWER function on an indexed column could (or very likely will unless you do something like I did in this blog article: https://hoopercharles.wordpress.com/2010/06/01/the-instr-function-will-never-use-an-index-i-will-give-you-a-hint/ ) prevent an index on a column from being used.

However, the statement that I used to create the index IND_T5_C10_F is this:

CREATE INDEX IND_T5_C10_F ON T5(LOWER(C10));

The above syntax tells Oracle to create a function based index on LOWER(C10) (and store the result of that function in the index). When defined that way, the index will only be used when LOWER(C10) appears in the WHERE clause (or in a JOIN clause in an ANSI formatted SQL statement).

It is my understanding that the RULE based optimizer is not supposed to be able to use function based indexes. One of the outcomes that I originally hoped would have happened is that the cost-based optimizer would have automatically selected to use the (potentially much slower) function based index access path, while the RULE based optimizer would have had to use a full table scan. The test server, with the default value for OPTIMIZER_INDEX_COST_ADJ, and its fast random access SSD disk array, probably skewed the outcome a bit.

11 01 2011

Donatello Settembrino (04:05:25) :

Hi Charles,
“adjust” the clustering_factor could be a solution (note the order by clause that I added in your test case in the INSERT statement).

SQL> CREATE TABLE T5 (
  2    C1 NUMBER,
  3    C2 NUMBER,
  4    C3 NUMBER,
  5    C4 NUMBER,
  6    C5 NUMBER,
  7    C6 NUMBER,
  8    C7 NUMBER,
  9    C8 NUMBER,
 10    C9 VARCHAR2(50),
 11    C10 VARCHAR2(50));

Tabella creata.

Passati: 00:00:00.07
SQL> INSERT INTO
  2    T5
  3  SELECT
  4    ROWNUM,
  5    ROWNUM,
  6    MOD(ROWNUM,50),
  7    MOD(ROWNUM,40),
  8    MOD(ROWNUM,100),
  9    ROUND(ROWNUM/1000),
 10    ROUND(ROWNUM/950),
 11    ROUND(ROWNUM/600),
 12    RPAD(CHR(65 + MOD(ROWNUM-1,26)),50,CHR(65 + MOD(ROWNUM-1,26))),
 13    RPAD(CHR(65 + MOD(ROWNUM-1,26)),50,CHR(65 + MOD(ROWNUM-1,26)))
 14  FROM
 15    DUAL
 16  CONNECT BY
 17    LEVEL<=1000000
 18  ORDER BY 10;

SQL> COMMIT;

Commit completato.

Passati: 00:00:00.04
SQL> CREATE INDEX IND_T5_C10_F ON T5(LOWER(C10));

Indice creato.

Passati: 00:00:07.34
SQL> EXEC DBMS_STATS.GATHER_TABLE_STATS(OWNNAME=>USER,TABNAME=>'T5',CASCADE=>TRUE)

Procedura PL/SQL completata correttamente.

Passati: 00:00:17.89

SQL> ALTER SYSTEM FLUSH BUFFER_CACHE;

Modificato sistema.

Passati: 00:00:08.67
SQL> ALTER SYSTEM FLUSH BUFFER_CACHE;

Modificato sistema.

Passati: 00:00:00.06
SQL> SET AUTOTRACE TRACEONLY STATISTICS EXPLAIN
SQL> SET ARRAYSIZE 1000
SQL> SELECT
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    LOWER(C10)=LPAD('a',50,'a');

Selezionate 38462 righe.

Passati: 00:00:02.53

Piano di esecuzione
----------------------------------------------------------
Plan hash value: 775203539

--------------------------------------------------------------------------------------------
| Id  | Operation                   | Name         | Rows  | Bytes | Cost (%CPU)| Time     |
--------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT            |              | 38462 |  6911K|   533   (1)| 00:00:08 |
|   1 |  TABLE ACCESS BY INDEX ROWID| T5           | 38462 |  6911K|   533   (1)| 00:00:08 |
|*  2 |   INDEX RANGE SCAN          | IND_T5_C10_F | 38462 |       |   168   (1)| 00:00:03 |
--------------------------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------

   2 - access(LOWER("C10")='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa')


Statistiche
----------------------------------------------------------
          1  recursive calls
          0  db block gets
        608  consistent gets
        530  physical reads
          0  redo size
     987802  bytes sent via SQL*Net to client
        556  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

SQL> SELECT /*+ RULE */
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    LOWER(C10)=LPAD('a',50,'a');

Selezionate 38462 righe.

Passati: 00:00:23.56

Piano di esecuzione
----------------------------------------------------------
Plan hash value: 1231838660

----------------------------------
| Id  | Operation         | Name |
----------------------------------
|   0 | SELECT STATEMENT  |      |
|*  1 |  TABLE ACCESS FULL| T5   |
----------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------

   1 - filter(LOWER("C10")='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
              aaaaaa')

Note
-----
   - rule based optimizer used (consider using cbo)


Statistiche
----------------------------------------------------------
          1  recursive calls
          0  db block gets
       9680  consistent gets
       9638  physical reads
          0  redo size
     987763  bytes sent via SQL*Net to client
        556  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

SQL> SELECT /*+ __FAST=TRUE */
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    LOWER(C10)=LPAD('a',50,'a');

Selezionate 38462 righe.

Passati: 00:00:01.68

Piano di esecuzione
----------------------------------------------------------
Plan hash value: 775203539

--------------------------------------------------------------------------------------------
| Id  | Operation                   | Name         | Rows  | Bytes | Cost (%CPU)| Time     |
--------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT            |              | 38462 |  6911K|   533   (1)| 00:00:08 |
|   1 |  TABLE ACCESS BY INDEX ROWID| T5           | 38462 |  6911K|   533   (1)| 00:00:08 |
|*  2 |   INDEX RANGE SCAN          | IND_T5_C10_F | 38462 |       |   168   (1)| 00:00:03 |
--------------------------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------

   2 - access(LOWER("C10")='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa')


Statistiche
----------------------------------------------------------
          1  recursive calls
          0  db block gets
        608  consistent gets
          0  physical reads
          0  redo size
     987763  bytes sent via SQL*Net to client
        556  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

SQL> SELECT /*+ INDEX(T5 IND_T5_C10_F) */
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    LOWER(C10)=LPAD('a',50,'a');

Selezionate 38462 righe.

Passati: 00:00:01.65

Piano di esecuzione
----------------------------------------------------------
Plan hash value: 775203539

--------------------------------------------------------------------------------------------
| Id  | Operation                   | Name         | Rows  | Bytes | Cost (%CPU)| Time     |
--------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT            |              | 38462 |  6911K|   533   (1)| 00:00:08 |
|   1 |  TABLE ACCESS BY INDEX ROWID| T5           | 38462 |  6911K|   533   (1)| 00:00:08 |
|*  2 |   INDEX RANGE SCAN          | IND_T5_C10_F | 38462 |       |   168   (1)| 00:00:03 |
--------------------------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------

   2 - access(LOWER("C10")='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa')


Statistiche
----------------------------------------------------------
          1  recursive calls
          0  db block gets
        608  consistent gets
          0  physical reads
          0  redo size
     987763  bytes sent via SQL*Net to client
        556  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

Therefore, in this case the FBI is used in all cases (except the one using the hint / * + RULE * /). I can reasonably state that:

1) the execution time of the first query is reduced significantly
2) the consistent gets are reduced
3) the physical reads are reduced

Instead this is what I get from the first query if re-run your test case:

SQL> SELECT
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    LOWER(C10)=LPAD('a',50,'a');

Selezionate 38462 righe.

Passati: 00:00:22.71

Piano di esecuzione
----------------------------------------------------------
Plan hash value: 1231838660

--------------------------------------------------------------------------
| Id  | Operation         | Name | Rows  | Bytes | Cost (%CPU)| Time     |
--------------------------------------------------------------------------
|   0 | SELECT STATEMENT  |      | 38462 |  6911K|  3237   (2)| 00:00:46 |
|*  1 |  TABLE ACCESS FULL| T5   | 38462 |  6911K|  3237   (2)| 00:00:46 |
--------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------

   1 - filter(LOWER("C10")='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
              aaaaaa')


Statistiche
----------------------------------------------------------
          1  recursive calls
          0  db block gets
       9679  consistent gets
       9638  physical reads
          0  redo size
    1074775  bytes sent via SQL*Net to client
        556  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

I would say they have solved the problem, but I wonder if those who had the problem can be used
this solution.
In the real world may not be an applicable solution, depends on many factors ..
as the table is used, the queries that are executed for that table
and more…

Regards

In my test case I used the version:

SQL> select * from v$version;

BANNER
--------------------------------------------------------------------------------
Oracle Database 11g Enterprise Edition Release 11.2.0.2.0 - 64bit Production
PL/SQL Release 11.2.0.2.0 - Production
CORE    11.2.0.2.0      Production
TNS for IBM/AIX RISC System/6000: Version 11.2.0.2.0 - Production
NLSRTL Version 11.2.0.2.0 - Production

11 01 2011

Charles Hooper (07:27:21) :

Donatello,

That is a good idea to compare the CLUSTERING_FACTOR, which is a statistic that is not considered by the RULE based optimizer. It could be the case that the CLUSTERING_FACTOR is just poorly calculated (due to the way the statistic is calculated, if Oracle finds that it is jumping back and forth between a small number of table blocks as the index is read, that will cause the CLUSTERING_FACTOR to increase significantly, while the actual performance impact can be quite small) – in which case a manual change to the index’s calculated CLUSTERING_FACTOR statistic might help if the index was used by the RULE based optimizer and not the cost based optimizer (as might be the case when the test case is executed on 11.1.0.7, or if a normal index were used).

I tried your test case (with the ORDER BY 10 in the INSERT INTO SQL statement) on 10.2.0.2 and found that the index was not used automatically (but there is a reason for this). Below are my results for the modified test case:

SQL> SELECT
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    LOWER(C10)=LPAD('a',50,'a');

38462 rows selected.

Elapsed: 00:00:01.68

Execution Plan
----------------------------------------------------------
Plan hash value: 2002323537

--------------------------------------------------------------------------
| Id  | Operation         | Name | Rows  | Bytes | Cost (%CPU)| Time     |
--------------------------------------------------------------------------
|   0 | SELECT STATEMENT  |      | 38462 |  4995K|  1040  (10)| 00:00:05 |
|*  1 |  TABLE ACCESS FULL| T5   | 38462 |  4995K|  1040  (10)| 00:00:05 |
--------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------
   1 - filter(LOWER("C10")='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
              aaaaaa')

Statistics
----------------------------------------------------------
          1  recursive calls
          0  db block gets
      19327  consistent gets
      19278  physical reads
          0  redo size
     993542  bytes sent via SQL*Net to client
        752  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

SQL> SELECT /*+ RULE */
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    LOWER(C10)=LPAD('a',50,'a');

38462 rows selected.

Elapsed: 00:00:00.98

Execution Plan
----------------------------------------------------------
Plan hash value: 2002323537

----------------------------------
| Id  | Operation         | Name |
----------------------------------
|   0 | SELECT STATEMENT  |      |
|*  1 |  TABLE ACCESS FULL| T5   |
----------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------
   1 - filter(LOWER("C10")='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
              aaaaaa')

Note
-----
   - rule based optimizer used (consider using cbo)

Statistics
----------------------------------------------------------
          1  recursive calls
          0  db block gets
      19327  consistent gets
      10340  physical reads
          0  redo size
     993542  bytes sent via SQL*Net to client
        752  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

SQL> SELECT /*+ __FAST=TRUE */
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    LOWER(C10)=LPAD('a',50,'a');

38462 rows selected.

Elapsed: 00:00:00.98

Execution Plan
----------------------------------------------------------
Plan hash value: 2002323537

--------------------------------------------------------------------------
| Id  | Operation         | Name | Rows  | Bytes | Cost (%CPU)| Time     |
--------------------------------------------------------------------------
|   0 | SELECT STATEMENT  |      | 38462 |  4995K|  1040  (10)| 00:00:05 |
|*  1 |  TABLE ACCESS FULL| T5   | 38462 |  4995K|  1040  (10)| 00:00:05 |
--------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------
   1 - filter(LOWER("C10")='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
              aaaaaa')

Statistics
----------------------------------------------------------
          1  recursive calls
          0  db block gets
      19327  consistent gets
      10364  physical reads
          0  redo size
     993542  bytes sent via SQL*Net to client
        752  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

SQL> SELECT /*+ INDEX(T5 IND_T5_C10_F) */
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    LOWER(C10)=LPAD('a',50,'a');

38462 rows selected.

Elapsed: 00:00:00.67

Execution Plan
----------------------------------------------------------
Plan hash value: 1769636183

--------------------------------------------------------------------------------------------
| Id  | Operation                   | Name         | Rows  | Bytes | Cost (%CPU)| Time     |
--------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT            |              | 38462 |  4995K|  1081   (1)| 00:00:06 |
|   1 |  TABLE ACCESS BY INDEX ROWID| T5           | 38462 |  4995K|  1081   (1)| 00:00:06 |
|*  2 |   INDEX RANGE SCAN          | IND_T5_C10_F | 38462 |       |   339   (1)| 00:00:02 |
--------------------------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------
   2 - access(LOWER("C10")='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa')
 
Statistics
----------------------------------------------------------
          1  recursive calls
          0  db block gets
       1154  consistent gets
        337  physical reads
          0  redo size
     993542  bytes sent via SQL*Net to client
        752  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

As you can see from the above, the calculated cost for the full table scan access path is 1040, while the calculated cost for the index access path is 1081 – so a full table scan was still used. Why did this happen? Let’s take a look at the system (CPU) statistics:

SELECT
  PNAME,
  PVAL1
FROM
  SYS.AUX_STATS$;
 
PNAME                PVAL1
--------------- ----------
STATUS
DSTART
DSTOP
FLAGS                    1
CPUSPEEDNW      1077.92208
IOSEEKTIM               10
IOTFRSPEED            4096
SREADTIM             4.754
MREADTIM             7.432
CPUSPEED              1335
MBRC                    32
MAXTHR               18432
SLAVETHR

Notice in the above that MBRC is set to a value of 32, let’s change that to 16 to see what happens:

EXEC DBMS_STATS.SET_SYSTEM_STATS('MBRC',16)

Now to rerun the queries, skipping the output from the first query:

...
SQL> SELECT /*+ RULE */
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    LOWER(C10)=LPAD('a',50,'a');

38462 rows selected.

Elapsed: 00:00:00.81

Execution Plan
----------------------------------------------------------
Plan hash value: 2002323537

----------------------------------
| Id  | Operation         | Name |
----------------------------------
|   0 | SELECT STATEMENT  |      |
|*  1 |  TABLE ACCESS FULL| T5   |
----------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------
   1 - filter(LOWER("C10")='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
              aaaaaa')

Note
-----
   - rule based optimizer used (consider using cbo)

Statistics
----------------------------------------------------------
          0  recursive calls
          0  db block gets
      19327  consistent gets
      10349  physical reads
          0  redo size
     993542  bytes sent via SQL*Net to client
        752  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

So, the RULE based optimizer continues to use a full table scan, how about the cost based optimizer?

SQL> SELECT /*+ __FAST=TRUE */
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    LOWER(C10)=LPAD('a',50,'a');

38462 rows selected.

Elapsed: 00:00:01.01

Execution Plan
----------------------------------------------------------
Plan hash value: 1769636183

--------------------------------------------------------------------------------------------
| Id  | Operation                   | Name         | Rows  | Bytes | Cost (%CPU)| Time     |
--------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT            |              | 38462 |  4995K|  1081   (1)| 00:00:06 |
|   1 |  TABLE ACCESS BY INDEX ROWID| T5           | 38462 |  4995K|  1081   (1)| 00:00:06 |
|*  2 |   INDEX RANGE SCAN          | IND_T5_C10_F | 38462 |       |   339   (1)| 00:00:02 |
--------------------------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------
   2 - access(LOWER("C10")='aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa')

Statistics
----------------------------------------------------------
          0  recursive calls
          0  db block gets
      19327  consistent gets
      10364  physical reads
          0  redo size
     993542  bytes sent via SQL*Net to client
        752  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

The above shows that the cost based optimizer is now using the index access path, because we modified the MBRC system (CPU) statistic.

Let’s try a slight variation of the original test case, this time we will not create a function based index, I have removed the LOWER function from the WHERE clauses, and have reset the MBRC system statistic back to a value of 32 (you can optionally remove the ORDER BY 10 clause in the INSERT INTO statement):

DROP TABLE T5 PURGE;
 
CREATE TABLE T5 (
  C1 NUMBER,
  C2 NUMBER,
  C3 NUMBER,
  C4 NUMBER,
  C5 NUMBER,
  C6 NUMBER,
  C7 NUMBER,
  C8 NUMBER,
  C9 VARCHAR2(50),
  C10 VARCHAR2(50));

INSERT INTO
  T5
SELECT
  ROWNUM,
  ROWNUM,
  MOD(ROWNUM,50),
  MOD(ROWNUM,40),
  MOD(ROWNUM,100),
  ROUND(ROWNUM/1000),
  ROUND(ROWNUM/950),
  ROUND(ROWNUM/600),
  RPAD(CHR(65 + MOD(ROWNUM-1,26)),50,CHR(65 + MOD(ROWNUM-1,26))),
  RPAD(CHR(65 + MOD(ROWNUM-1,26)),50,CHR(65 + MOD(ROWNUM-1,26)))
FROM
  DUAL
CONNECT BY
  LEVEL<=1000000
ORDER BY
  10;

COMMIT;

CREATE INDEX IND_T5_C10 ON T5(C10); 

EXEC DBMS_STATS.GATHER_TABLE_STATS(OWNNAME=>USER,TABNAME=>'T5',CASCADE=>TRUE)
 
ALTER SYSTEM FLUSH BUFFER_CACHE;
ALTER SYSTEM FLUSH BUFFER_CACHE;

SET TIMING ON
SET AUTOTRACE TRACEONLY STATISTICS EXPLAIN
SET ARRAYSIZE 1000
SET LINESIZE 140
SET TRIMSPOOL ON

SELECT 
  * 
FROM 
  T5 
WHERE 
  C10=LPAD('A',50,'A'); 

SELECT /*+ RULE */
  * 
FROM 
  T5 
WHERE 
  C10=LPAD('A',50,'A'); 

SELECT /*+ __FAST=TRUE */
  * 
FROM 
  T5 
WHERE 
  C10=LPAD('A',50,'A');

SELECT /*+ INDEX(T5 IND_T5_C10) */
  * 
FROM 
  T5 
WHERE 
  C10=LPAD('A',50,'A');

Now let’s take a look at the output:

SQL> SELECT
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    C10=LPAD('A',50,'A');

38462 rows selected.

Elapsed: 00:00:01.68

Execution Plan
----------------------------------------------------------
Plan hash value: 2002323537

--------------------------------------------------------------------------
| Id  | Operation         | Name | Rows  | Bytes | Cost (%CPU)| Time     |
--------------------------------------------------------------------------
|   0 | SELECT STATEMENT  |      | 38462 |  4957K|  1025   (8)| 00:00:05 |
|*  1 |  TABLE ACCESS FULL| T5   | 38462 |  4957K|  1025   (8)| 00:00:05 |
--------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------
   1 - filter("C10"='AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA')

Statistics
----------------------------------------------------------
          1  recursive calls
          0  db block gets
      19325  consistent gets
      19278  physical reads
          0  redo size
     993543  bytes sent via SQL*Net to client
        752  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

As shown above, the cost based optimizer will perform a full table scan (likely because of the MBRC system statistic value).

SQL> SELECT /*+ RULE */
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    C10=LPAD('A',50,'A');

38462 rows selected.

Elapsed: 00:00:00.23

Execution Plan
----------------------------------------------------------
Plan hash value: 531072535

--------------------------------------------------
| Id  | Operation                   | Name       |
--------------------------------------------------
|   0 | SELECT STATEMENT            |            |
|   1 |  TABLE ACCESS BY INDEX ROWID| T5         |
|*  2 |   INDEX RANGE SCAN          | IND_T5_C10 |
--------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------
   2 - access("C10"='AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA')

Note
-----
   - rule based optimizer used (consider using cbo)

Statistics
----------------------------------------------------------
          1  recursive calls
          0  db block gets
       1152  consistent gets
        337  physical reads
          0  redo size
     993543  bytes sent via SQL*Net to client
        752  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

As shown above, the RULE based optimizer will use the index, because it does not care about the CLUSTERING_FACTOR of the index, nor the value of the MBRC system statistic.

SQL> SELECT /*+ __FAST=TRUE */
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    C10=LPAD('A',50,'A');

38462 rows selected.

Elapsed: 00:00:00.76

Execution Plan
----------------------------------------------------------
Plan hash value: 2002323537

--------------------------------------------------------------------------
| Id  | Operation         | Name | Rows  | Bytes | Cost (%CPU)| Time     |
--------------------------------------------------------------------------
|   0 | SELECT STATEMENT  |      | 38462 |  4957K|  1025   (8)| 00:00:05 |
|*  1 |  TABLE ACCESS FULL| T5   | 38462 |  4957K|  1025   (8)| 00:00:05 |
--------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------
   1 - filter("C10"='AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA')

Statistics
----------------------------------------------------------
          1  recursive calls
          0  db block gets
      19325  consistent gets
      10449  physical reads
          0  redo size
     993543  bytes sent via SQL*Net to client
        752  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

SQL> SELECT /*+ INDEX(T5 IND_T5_C10) */
  2    *
  3  FROM
  4    T5
  5  WHERE
  6    C10=LPAD('A',50,'A');

38462 rows selected.

Elapsed: 00:00:00.23

Execution Plan
----------------------------------------------------------
Plan hash value: 531072535

------------------------------------------------------------------------------------------
| Id  | Operation                   | Name       | Rows  | Bytes | Cost (%CPU)| Time     |
------------------------------------------------------------------------------------------
|   0 | SELECT STATEMENT            |            | 38462 |  4957K|  1081   (1)| 00:00:06 |
|   1 |  TABLE ACCESS BY INDEX ROWID| T5         | 38462 |  4957K|  1081   (1)| 00:00:06 |
|*  2 |   INDEX RANGE SCAN          | IND_T5_C10 | 38462 |       |   339   (1)| 00:00:02 |
------------------------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------
   2 - access("C10"='AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA')

Statistics
----------------------------------------------------------
          1  recursive calls
          0  db block gets
       1152  consistent gets
          0  physical reads
          0  redo size
     993543  bytes sent via SQL*Net to client
        752  bytes received via SQL*Net from client
         40  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
      38462  rows processed

I wonder if the comments provided by readers have now covered all of the reasons why a RULE hinted query might complete in seconds while an unhinted query might require considerably longer?

11 01 2011

Martin Preiss (15:10:10) :

until 10.2 there were still some rule hints in the abysses of the server: http://jonathanlewis.wordpress.com/2010/04/13/rule-rules/. If my memory serves me well I optimized some monitoring queries on v$-views by adding rule hints some years ago (in 10.2). But the only reference I could find now for this “technique” is http://www.dba-oracle.com/t_tuning_oracle_dictionary_sql.htm – and so this remains anecdotal evidence …

11 01 2011

Chris Saxon (11:05:37) :

SQL Profiles may also cause adding a rule hint to be faster.

If a profile was created that no longer is the optimal plan (say before an index was created), then adding the rule hint will stop the profile being used. Consider (on an 11.2.0.1 database):

15:52:13  DBA_USER@11GR2> create table t1 as select rownum r from dual connect by level <= 100000;

Table created.

15:52:46  DBA_USER@11GR2> select * from t1
15:52:46   2  where  r = 1;

1 row selected.

Elapsed: 00:00:00.17

Execution Plan
----------------------------------------------------------
Plan hash value: 3617692013

--------------------------------------------------------------------------
| Id  | Operation         | Name | Rows  | Bytes | Cost (%CPU)| Time     |
--------------------------------------------------------------------------
|   0 | SELECT STATEMENT  |      |     2 |    26 |    47   (3)| 00:00:01 |
|*  1 |  TABLE ACCESS FULL| T1   |     2 |    26 |    47   (3)| 00:00:01 |
--------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------

   1 - filter("R"=1)

Note
-----
   - dynamic sampling used for this statement (level=2)


Statistics
----------------------------------------------------------
         48  recursive calls
          0  db block gets
        227  consistent gets
        153  physical reads
          0  redo size
        328  bytes sent via SQL*Net to client
        338  bytes received via SQL*Net from client
          2  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
          1  rows processed

15:52:46  DBA_USER@11GR2>
15:52:46  DBA_USER@11GR2> set autotrace off

15:52:46  DBA_USER@11GR2> set autotrace off
15:52:46  DBA_USER@11GR2> col sql_text format a50
15:53:16  DBA_USER@11GR2> select sql_id, sql_text
15:53:16   2  from   v$sql
15:53:17   3  where  sql_text like '%from t1%where  r = 1%'
15:53:17   4  and    sql_text not like '%not htis%'
15:53:17   5  and    lower(sql_text) not like '%explain%';

SQL_ID        SQL_TEXT
------------- --------------------------------------------------
d0dqg1ky8jdfp select * from t1 where  r = 1

1 row selected.

15:53:18  DBA_USER@11GR2> @create_sql_profile
old: FEEDBACK ON for 1 or more rows
new: feedback OFF
old: sqlblanklines OFF
new: sqlblanklines ON
Enter value for sql_id: d0dqg1ky8jdfp
Enter value for child_no (0):
Enter value for profile_name (PROF_sqlid_planhash):
Enter value for category (DEFAULT):
Enter value for force_matching (FALSE):
old  19: sql_id = '&&sql_id'
new  19: sql_id = 'd0dqg1ky8jdfp'
old  20: and child_number = &&child_no
new  20: and child_number = 0
old  27: decode('&&profile_name','X0X0X0X0','PROF_&&sql_id'||'_'||plan_hash_value,'&&profile_name')
new  27: decode('X0X0X0X0','X0X0X0X0','PROF_d0dqg1ky8jdfp'||'_'||plan_hash_value,'X0X0X0X0')
old  33: sql_id = '&&sql_id'
new  33: sql_id = 'd0dqg1ky8jdfp'
old  34: and child_number = &&child_no;
new  34: and child_number = 0;
old  39: category => '&&category',
new  39: category => 'DEFAULT',
old  41: force_match => &&force_matching
new  41: force_match => false
old  52:   dbms_output.put_line('ERROR: sql_id: '||'&&sql_id'||' Child: '||'&&child_no'||' not found in v$sql.');
new  52:   dbms_output.put_line('ERROR: sql_id: '||'d0dqg1ky8jdfp'||' Child: '||'0'||' not found in v$sql.');
SQL Profile PROF_d0dqg1ky8jdfp_3617692013 created.
Elapsed: 00:00:00.12
old: sqlblanklines ON
new: sqlblanklines OFF
old: feedback OFF
new: FEEDBACK ON for 1 or more rows

15:54:28  DBA_USER@11GR2> create index t_i on t1(r);

Index created.

Elapsed: 00:00:00.12
15:54:34  DBA_USER@11GR2>
15:54:34  DBA_USER@11GR2> set autotrace trace
15:54:34  DBA_USER@11GR2>
15:54:34  DBA_USER@11GR2> select /*+ rule */ * from t1
15:54:34   2  where  r = 1;

1 row selected.

Elapsed: 00:00:00.06

Execution Plan
----------------------------------------------------------
Plan hash value: 4021086813

---------------------------------
| Id  | Operation        | Name |
---------------------------------
|   0 | SELECT STATEMENT |      |
|*  1 |  INDEX RANGE SCAN| T_I  |
---------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------

   1 - access("R"=1)

Note
-----
   - rule based optimizer used (consider using cbo)


Statistics
----------------------------------------------------------
          1  recursive calls
          0  db block gets
          3  consistent gets
          1  physical reads
          0  redo size
        328  bytes sent via SQL*Net to client
        338  bytes received via SQL*Net from client
          2  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
          1  rows processed
15:54:36  DBA_USER@11GR2> select * from t1
15:54:54   2  where  r = 1;

1 row selected.

Elapsed: 00:00:00.03

Execution Plan
----------------------------------------------------------
Plan hash value: 3617692013

--------------------------------------------------------------------------
| Id  | Operation         | Name | Rows  | Bytes | Cost (%CPU)| Time     |
--------------------------------------------------------------------------
|   0 | SELECT STATEMENT  |      |   135 |  1755 |    46   (0)| 00:00:01 |
|*  1 |  TABLE ACCESS FULL| T1   |   135 |  1755 |    46   (0)| 00:00:01 |
--------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------

   1 - filter("R"=1)

Note
-----
   - SQL profile "PROF_d0dqg1ky8jdfp_3617692013" used for this statement


Statistics
----------------------------------------------------------
          7  recursive calls
          0  db block gets
        164  consistent gets
          1  physical reads
          0  redo size
        328  bytes sent via SQL*Net to client
        338  bytes received via SQL*Net from client
          2  SQL*Net roundtrips to/from client
          0  sorts (memory)
          0  sorts (disk)
          1  rows processed

NB – this uses Kerry Osbourne’s create_sql_profile.sql script to create the profile from the plan in the shared pool: http://kerryosborne.oracle-guy.com/2009/04/oracle-sql-profiles/

11 01 2011

Chris Saxon (15:52:47) :

Other hints would also “break” the profile as shown above, of course. I’m not trying to imply that this the only way to stop the profile being used, just that adding it does cause the plan to change (in this case), and therefore is a possible reason for the differences.

11 01 2011

Charles Hooper (19:17:13) :

Chris,

Thank you for posting this example. I had not considered the presence of a profile as the cause… and that might explain a question like this “why is my query slow when I list the columns in one order, but fast when I swap the order of the first two columns.”

I think that a parallel problem might be optimizing for performance when the data volumes are small (possibly with the help of hints or a profile), and then finding that as the data volumes grow (or the data changes significantly) the original optimizations actually hurt performance. Or, as Donatello demonstrated, if the data is loaded in such a way that a lower clustering factor is achieved, the execution plan is frozen, and then as the data is later inserted into the tables in a more or less random fashion, the clustering factor no longer approximates its original value, and thus causing the original optimizations to actually hinder performance.

	Charles Hooper on Favorite Free Applications to…
	Zach on Favorite Free Applications to…
	Charles Hooper on Favorite Free Applications to…
	Zach on Favorite Free Applications to…
	Charles Hooper on Huge Presentation – Work…
	Dan on Huge Presentation – Work…
	Charles Hooper on Review of Synology DS415+, How…
	Matthias Clausen on Review of Synology DS415+, How…
	Charles Hooper on _SMALL_TABLE_THRESHOLD Paramet…
	mohd atif on _SMALL_TABLE_THRESHOLD Paramet…
	Charles Hooper on Free ANSI SQL to Oracle Specif…
	Kalin Agrawal on Free ANSI SQL to Oracle Specif…
	Charles Hooper on Large Page Support or Not…
	vaurob on Large Page Support or Not…
	Locating Oracle… on Finding a New Home for a Clien…

Charles Hooper's Oracle Notes