Documente Academic
Documente Profesional
Documente Cultură
This chapter discusses considerations for using the different types of indexes in an
application. The topics include:
See Also:
In general, you should create an index on a column in any of the following situations:
Although the database creates an index for you on a column with an integrity
constraint, explicitly creating an index on such a column is recommended.
You can use the following techniques to determine which columns are best candidates
for indexing:
Use the EXPLAIN PLAN feature to show a theoretical execution plan of a given
query statement.
Use the V$SQL_PLAN view to determine the actual execution plan used for a
given query statement.
Sometimes, if an index is not being used by default and it would be most efficient to
use that index, you can use a query hint so that the index is used.
See Also:
The following sections explain how to create, alter, and drop indexes using SQL
commands, and give guidelines for managing indexes.
Typically, you insert or load data into a table (using SQL*Loader or Import) before
creating indexes. Otherwise, the overhead of updating the index slows down the insert
or load operation. The exception to this rule is that you must create an index for a
cluster before you insert any data into the cluster.
When you create an index on a table that already has data, Oracle Database must use
sort space to create the index. The database uses the sort space in memory allocated
for the creator of the index (the amount for each user is determined by the
initialization parameter SORT_AREA_SIZE), but the database must also swap sort
information to and from temporary segments allocated on behalf of the index creation.
If the index is extremely large, it can be beneficial to complete the following steps:
1 Use the TEMPORARY TABLESPACE option of the ALTER USER command to make this
your new temporary tablespace.
1 Drop this tablespace using the DROP TABLESPACE command. Then use the ALTER
USER command to reset your temporary tablespace to your original temporary
tablespace.
Under certain conditions, you can load data into a table with the SQL*Loader "direct
path load", and an index can be created as data is loaded.
See Also:
Create an index if you frequently want to retrieve less than about 15% of the
rows in a large table. This threshold percentage varies greatly, however,
according to the relative speed of a table scan and how clustered the row data is
about the index key. The faster the table scan, the lower the percentage; the
more clustered the row data, the higher the percentage.
Index columns that are used for joins to improve join performance.
Primary and unique keys automatically have indexes, but you might want to
create an index on a foreign key; see Chapter 3, "Maintaining Data Integrity Through
Constraints" for more information.
Small tables do not require indexes; if a query is taking too long, then the table
might have grown from small to large.
Some columns are strong candidates for indexing. Columns with one or more of the
following characteristics are good candidates for indexing:
The column contains many nulls, but queries often select all rows having a
value. In this case, a comparison that matches all the non-null values, such as:
is preferable to
WHERE COL_X IS NOT NULL
This is because the first uses an index on COL_X (assuming that COL_X is a
numeric column).
Columns with the following characteristics are less suitable for indexing:
There are many nulls in the column and you do not search on the non-null
values.
The size of a single index entry cannot exceed roughly one-half (minus some
overhead) of the available space in the data block. Consult with the database
administrator for assistance in determining the space required by an index.
The more indexes, the more overhead is incurred as the table is altered. When rows
are inserted or deleted, all indexes on the table must be updated. When a column is
updated, all indexes on the column must be updated.
You must weigh the performance benefit of indexes for queries against the
performance overhead of updates. For example, if a table is primarily read-only, you
might use more indexes; but, if a table is heavily updated, you might use fewer
indexes.
Although you can specify columns in any order in the CREATE INDEX command, the
order of columns in the CREATE INDEX statement can affect query performance. In
general, you should put the column expected to be used most often first in the index.
You can create a composite index (using several columns), and the same index can be
used for queries that reference all of these columns, or just some of them.
For example, assume the columns of the VENDOR_PARTS table are as shown in Figure 4-1.
Assume that there are five vendors, and each vendor has about 1000 parts.
Suppose that the VENDOR_PARTS table is commonly queried by SQL statements such as
the following:
SELECT * FROM vendor_parts
WHERE part_no = 457 AND vendor_id = 1012;
To increase the performance of such queries, you might create a composite index
putting the most selective column first; that is, the column with the most values:
CREATE INDEX ind_vendor_id
ON vendor_parts (part_no, vendor_id);
Composite indexes speed up queries that use the leading portion of the index. So in
this example, queries with WHERE clauses using only the PART_NO column also note a
performance gain. Because there are only five distinct values, placing a separate index
on VENDOR_ID would serve no purpose.
The database can use indexes more effectively when it has statistical information
about the tables involved in the queries. You can gather statistics when the indexes are
created by including the keywords COMPUTE STATISTICS in the CREATE INDEX statement.
As data is updated and the distribution of values changes, you or the DBA can
periodically refresh the statistics by calling procedures
like DBMS_STATS.GATHER_TABLE_STATISTICS and DBMS_STATS.GATHER_SCHEMA_STATISTICS.
It does not speed up queries. The table might be very small, or there might be
many rows in the table but very few index entries.
When you drop an index, all extents of the index's segment are returned to the
containing tablespace and become available for other objects in the tablespace.
Use the SQL command DROP INDEX to drop an index. For example, the following
statement drops a specific named index:
DROP INDEX Emp_ename;
To drop an index, the index must be contained in your schema or you must have
the DROP ANY INDEX system privilege.
When using indexes in an application, you might need to request that the DBA grant
privileges or make changes to initialization parameters.
To create a new index, you must own, or have the INDEX object privilege for, the
corresponding table. The schema that contains the index must also have a quota for
the tablespace intended to contain the index, or the UNLIMITED TABLESPACE system
privilege. To create an index in another user's schema, you must have
the CREATE ANYINDEX system privilege.
In this example, an index is created for a single column, to speed up queries that test
that column:
CREATE INDEX emp_ename ON emp_tab(ename);
In this example, several storage settings are explicitly specified for the index:
CREATE INDEX emp_ename ON emp_tab(ename)
TABLESPACE users
STORAGE (INITIAL 20K
NEXT 20k
PCTINCREASE 75)
PCTFREE 0
COMPUTE STATISTICS;
In this example, the index applies to two columns, to speed up queries that test either
the first column or both columns:
CREATE INDEX emp_ename ON emp_tab(ename, empno) COMPUTE STATISTICS;
In this example, the query is going to sort on the function UPPER(ENAME). An index on
the ENAME column itself would not speed up this operation, and it might be slow to call
the function for each result row. A function-based index precomputes the result of the
function for each column value, speeding up queries that use the function for
searching or sorting:
CREATE INDEX emp_upper_ename ON emp_tab(UPPER(ename)) COMPUTE STATISTICS;
Oracle Database supplies a number of specialized data cartridges to help manage these
kinds of complex data. So, if you need to create a search engine, or a geographic
information system, you can do much of the work simply by creating the right kind of
index.
Note:
The index is more effective if you gather statistics for the table or
schema, using the procedures in the DBMS_STATS package.
The index cannot contain any null values. Either make sure the
appropriate columns contain no null values, or use the NVL function in
the index expression to substitute some other value for nulls.
See Also:
Function-based indexes:
Increase the number of situations where the optimizer can perform a range
scan instead of a full table scan. For example, consider the expression in
this WHERE clause:
The optimizer can use a range scan for this query because the index is built on
(column_a + column_b). Range scans typically produce fast response times if the
predicate selects less than 15% of the rows of a large table. The optimizer can
estimate how many rows are selected by expressions more accurately if the
expressions are materialized in a function-based index. (Expressions of
function-based indexes are represented as virtual columns and ANALYZE can
build histograms on such columns.)
Create more powerful sorts. You can perform case-insensitive sorts with
the UPPER and LOWER functions, descending order sorts with the DESC keyword,
and linguistic-based sorts with the NLSSORT function.
Note:
Oracle Database sorts columns with the DESC keyword in descending order.
Such indexes are treated as function-based indexes. Descending indexes
cannot be bitmapped or reverse, and cannot be used in bitmapped
optimizations. To get the DESC functionality prior to Oracle Database version
8, remove the DESC keyword from the CREATE INDEX statement.
Another function-based index calls the object method distance_from_equator for each
city in the table. The method is applied to the object column Reg_Obj. A query could
use this index to quickly find cities that are more than 1000 miles from the equator:
CREATE INDEX Distance_index
ON Weatherdata_tab (Distance_from_equator (Reg_obj));
Another index stores the temperature delta and the maximum temperature. The result
of the delta is sorted in descending order. A query could use this index to quickly find
table rows where the temperature delta is less than 20 and the maximum temperature
is greater than 75.
CREATE INDEX compare_index
ON Weatherdata_tab ((Maxtemp - Mintemp) DESC, Maxtemp);
The SELECT command uses the function-based index on UPPER(e_name) to return all of
the employees with name like :KEYCOL.
SELECT * FROM Emp_tab WHERE UPPER(Ename) like :KEYCOL;
The following command computes a value for each row using columns A, B, and C,
and stores the results in the index.
CREATE INDEX Idx ON Fbi_tab (A + B * (C - 1), A, B);
The SELECT statement can either use index range scan (since the expression is a prefix
of index IDX) or index fast full scan (which may be preferable if the index has
specified a high parallel degree).
SELECT a FROM Fbi_tab WHERE A + B * (C - 1) < 100;
This example demonstrates how a function-based index can be used to sort based on
the collation order for a national language. The NLSSORT function returns a sort key for
each name, using the collation sequence GERMAN.
CREATE INDEX Nls_index
ON Nls_tab (NLSSORT(Name, 'NLS_SORT = German'));
The SELECT statement selects all of the contents of the table and orders it by NAME. The
rows are ordered using the German collation sequence. The Globalization Support
parameters are not needed in the SELECT statement, because in a German
session, NLS_SORT is set to German and NLS_COMP is set to ANSI.
SELECT * FROM Nls_tab WHERE Name IS NOT NULL
ORDER BY Name;
Any top-level or package-level PL/SQL functions that are used in the index
expression must be declared as DETERMINISTIC. That is, they always return the
same result given the same input, like the UPPER function. You must ensure that
the subprogram really is deterministic, because Oracle Database does not check
that the assertion is true.
You must analyze the table or index before the index is used.
The index function cannot be marked NOT NULL. To avoid a full table
scan, you must ensure that the query cannot fetch null values.