Documente Academic
Documente Profesional
Documente Cultură
Sun Microsystems, Inc. 4150 Network Circle Santa Clara, CA 95054 U.S.A.
Part No: 817463910 April 2004, Revision A
This product or document is protected by copyright and distributed under licenses restricting its use, copying, distribution, and decompilation. No part of this product or document may be reproduced in any form by any means without prior written authorization of Sun and its licensors, if any. Third-party software, including font technology, is copyrighted and licensed from Sun suppliers. Parts of the product may be derived from Berkeley BSD systems, licensed from the University of California. UNIX is a registered trademark in the U.S. and other countries, exclusively licensed through X/Open Company, Ltd. Sun, Sun Microsystems, the Sun logo, docs.sun.com, AnswerBook, AnswerBook2, Solstice DiskSuite, SunPlex, Java, and Solaris are trademarks, registered trademarks, or service marks of Sun Microsystems, Inc. in the U.S. and other countries. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. in the U.S. and other countries. Products bearing SPARC trademarks are based upon an architecture developed by Sun Microsystems, Inc. The OPEN LOOK and Sun Graphical User Interface was developed by Sun Microsystems, Inc. for its users and licensees. Sun acknowledges the pioneering efforts of Xerox in researching and developing the concept of visual or graphical user interfaces for the computer industry. Sun holds a non-exclusive license from Xerox to the Xerox Graphical User Interface, which license also covers Suns licensees who implement OPEN LOOK GUIs and otherwise comply with Suns written license agreements. U.S. Government Rights Commercial software. Government users are subject to the Sun Microsystems, Inc. standard license agreement and applicable provisions of the FAR and its supplements. DOCUMENTATION IS PROVIDED AS IS AND ALL EXPRESS OR IMPLIED CONDITIONS, REPRESENTATIONS AND WARRANTIES, INCLUDING ANY IMPLIED WARRANTY OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE OR NON-INFRINGEMENT, ARE DISCLAIMED, EXCEPT TO THE EXTENT THAT SUCH DISCLAIMERS ARE HELD TO BE LEGALLY INVALID. Copyright 2004 Sun Microsystems, Inc. 4150 Network Circle, Santa Clara, CA 95054 U.S.A. Tous droits rservs.
Ce produit ou document est protg par un copyright et distribu avec des licences qui en restreignent lutilisation, la copie, la distribution, et la dcompilation. Aucune partie de ce produit ou document ne peut tre reproduite sous aucune forme, par quelque moyen que ce soit, sans lautorisation pralable et crite de Sun et de ses bailleurs de licence, sil y en a. Le logiciel dtenu par des tiers, et qui comprend la technologie relative aux polices de caractres, est protg par un copyright et licenci par des fournisseurs de Sun. Des parties de ce produit pourront tre drives du systme Berkeley BSD licencis par lUniversit de Californie. UNIX est une marque dpose aux Etats-Unis et dans dautres pays et licencie exclusivement par X/Open Company, Ltd. Sun, Sun Microsystems, le logo Sun, docs.sun.com, AnswerBook, AnswerBook2, Solstice DiskSuite, SunPlex, Java, et Solaris sont des marques de fabrique ou des marques dposes, ou marques de service, de Sun Microsystems, Inc. aux Etats-Unis et dans dautres pays. Toutes les marques SPARC sont utilises sous licence et sont des marques de fabrique ou des marques dposes de SPARC International, Inc. aux Etats-Unis et dans dautres pays. Les produits portant les marques SPARC sont bass sur une architecture dveloppe par Sun Microsystems, Inc. Linterface dutilisation graphique OPEN LOOK et Sun a t dveloppe par Sun Microsystems, Inc. pour ses utilisateurs et licencis. Sun reconnat les efforts de pionniers de Xerox pour la recherche et le dveloppement du concept des interfaces dutilisation visuelle ou graphique pour lindustrie de linformatique. Sun dtient une licence non exclusive de Xerox sur linterface dutilisation graphique Xerox, cette licence couvrant galement les licencis de Sun qui mettent en place linterface dutilisation graphique OPEN LOOK et qui en outre se conforment aux licences crites de Sun. CETTE PUBLICATION EST FOURNIE EN LETAT ET AUCUNE GARANTIE, EXPRESSE OU IMPLICITE, NEST ACCORDEE, Y COMPRIS DES GARANTIES CONCERNANT LA VALEUR MARCHANDE, LAPTITUDE DE LA PUBLICATION A REPONDRE A UNE UTILISATION PARTICULIERE, OU LE FAIT QUELLE NE SOIT PAS CONTREFAISANTE DE PRODUIT DE TIERS. CE DENI DE GARANTIE NE SAPPLIQUERAIT PAS, DANS LA MESURE OU IL SERAIT TENU JURIDIQUEMENT NUL ET NON AVENU.
040204@7940
Contents
Preface 5
Installing and Conguring Sun Cluster HA for Oracle 9 Overview of the Installation and Conguration Process for Sun Cluster HA for Oracle 10 Planning the Sun Cluster HA for Oracle Installation and Conguration 11 Conguration Requirements 11 Conguration Planning Questions 11 Preparing the Nodes and Disks 12 M How to Prepare the Nodes 12 M How to Congure Oracle Database Access With Solstice DiskSuite 13 M How to Congure Oracle Database Access With VERITAS Volume Manager 14 Installing the Oracle Software 15 M How to Install the Oracle Software M How to Verify the Oracle Installation Creating an Oracle Database 16 16 17 17 21 M How to Create an Oracle Database Setting Up Oracle Database Permissions 15 16 16 Verifying the Oracle Installation and Conguration
M How to Set Up Oracle Database Permissions Installing the Sun Cluster HA for Oracle Packages
M How to Install the Sun Cluster HA for Oracle Packages by Using the Web Start Program 21 M How to Install the Sun Cluster HA for Oracle Packages by Using the scinstall Utility 22 Registering and Conguring Sun Cluster HA for Oracle 23
3
23 27
M How to Register and Congure Sun Cluster HA for Oracle Verifying the Sun Cluster HA for Oracle Installation Oracle Clients 33 34 34 32
M How to Verify the Sun Cluster HA for Oracle Installation Location of Sun Cluster HA for Oracle Log Files Oracle Server Fault Monitor Oracle Listener Fault Monitor 34 36
33
Customizing the Sun Cluster HA for Oracle Server Fault Monitor Dening Custom Behavior for Errors 37 Propagating a Custom Action File to All Nodes in a Cluster Upgrading Sun Cluster HA for Oracle Resource Types 47 47 49
37 46 46
Specifying the Custom Action File That a Server Fault Monitor Should Use Upgrading the SUNW.oracle_listener Resource Type Upgrading the SUNW.oracle_server Resource Type
Preset Actions for Database Management System (DBMS) Errors and Logged Alerts 51
Index
59
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
Preface
Sun Cluster Data Service for Oracle Guide for Solaris OS explains how to install and congure Sun Cluster HA for Oracle. This document is intended for system administrators with extensive knowledge of Sun software and hardware. Do not use this document as a planning or presales guide. Before reading this document, you should have already determined your system requirements and purchased the appropriate equipment and software. The instructions in this document assume knowledge of the Solaris Operating System and expertise with the volume manager software that is used with Sun Cluster.
UNIX Commands
This document contains information about commands that are specic to installing and conguring Sun Cluster data services. The document does not contain comprehensive information about basic UNIX commands and procedures, such as shutting down the system, booting the system, and conguring devices. Information about basic UNIX commands and procedures is available from the following sources:
I I I
Online documentation for the Solaris Operating System Solaris Operating System man pages Other software documentation that you received with your system
Typographic Conventions
The following table describes the typographic changes that are used in this book.
TABLE P1 Typographic Conventions Typeface or Symbol Meaning Example
AaBbCc123
The names of commands, les, and directories, and onscreen computer output
Edit your .login le. Use ls -a to list all les. machine_name% you have mail.
AaBbCc123
What you type, contrasted with onscreen computer output Command-line placeholder: replace with a real name or value Book titles, new terms, or terms to be emphasized
machine_name% su Password: To delete a le, type rm lename. Read Chapter 6 in Users Guide. These are called class options. You must be root to do this.
AaBbCc123 AaBbCc123
C shell prompt C shell superuser prompt Bourne shell and Korn shell prompt
machine_name% machine_name# $
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
Related Documentation
Information about related Sun Cluster topics is available in the documentation that is listed in the following table. All Sun Cluster documentation is available at http://docs.sun.com.
Topic
Documentation
Sun Cluster Data Services Planning and Administration Guide for Solaris OS Individual data service guides
Concepts Overview Software installation System administration Hardware administration Data service development Error messages Command and function reference
Sun Cluster Concepts Guide for Solaris OS Sun Cluster Overview for Solaris OS Sun Cluster Software Installation Guide for Solaris OS Sun Cluster System Administration Guide for Solaris OS Sun Cluster 3.x Hardware Administration Manual for Solaris OS Individual hardware administration guides Sun Cluster Data Services Developers Guide for Solaris OS Sun Cluster Error Messages Guide for Solaris OS Sun Cluster Reference Manual for Solaris OS
For a complete list of Sun Cluster documentation, see the release notes for your release of Sun Cluster at http://docs.sun.com.
Preface
Help
If you have problems installing or using Sun Cluster, contact your service provider and provide the following information:
I I I I I
Your name and email address (if available) Your company name, address, and phone number The model and serial numbers of your systems The release number of the Solaris Operating System (for example, Solaris 8) The release number of Sun Cluster (for example, Sun Cluster 3.0)
Use the following commands to gather information about each node on your system for your service provider.
Command
Function
prtconf -v
Displays the size of the system memory and reports information about peripheral devices Displays information about processors Reports which patches are installed Displays system diagnostic information Displays Sun Cluster release and package version information
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
How to Prepare the Nodes on page 12 How to Congure Oracle Database Access With Solstice DiskSuite on page 13 How to Congure Oracle Database Access With VERITAS Volume Manager on page 14 How to Install the Oracle Software on page 15 How to Verify the Oracle Installation on page 16 How to Create an Oracle Database on page 16 How to Set Up Oracle Database Permissions on page 17 How to Install the Sun Cluster HA for Oracle Packages by Using the Web Start Program on page 21 How to Install the Sun Cluster HA for Oracle Packages by Using the scinstall Utility on page 22 How to Register and Congure Sun Cluster HA for Oracle on page 27 How to Verify the Sun Cluster HA for Oracle Installation on page 33 How to Specify the Custom Action File That a Server Fault Monitor Should Use on page 46
Note You can use SunPlex Manager to install and congure this data service. See
Overview of the Installation and Conguration Process for Sun Cluster HA for Oracle
The following table summarizes the tasks for installing and conguring Sun Cluster HA for Oracle. The table also provides cross-references to detailed instructions for performing the tasks. Perform these tasks in the order that they are listed.
TABLE 11 Task
Plan the Sun Cluster HA for Oracle installation and conguration Prepare the nodes and disks Install the Oracle software Verify the Oracle installation Create an Oracle database Set up Oracle database permissions
Planning the Sun Cluster HA for Oracle Installation and Conguration on page 11 Preparing the Nodes and Disks on page 12 How to Install the Oracle Software on page 15 How to Verify the Oracle Installation on page 16 How to Create an Oracle Database on page 16 How to Set Up Oracle Database Permissions on page 17
Install the Sun Cluster HA for Oracle Installing the Sun Cluster HA for Oracle Packages packages on page 21 Register and congure Sun Cluster HA for Oracle Verify the Sun Cluster HA for Oracle installation Understand Sun Cluster HA for Oracle fault monitor (Optional) Customize the Sun Cluster HA for Oracle server fault monitor (Optional) Upgrade Sun Cluster HA for Oracle resource types How to Register and Congure Sun Cluster HA for Oracle on page 27 How to Verify the Oracle Installation on page 16 Understanding the Sun Cluster HA for Oracle Fault Monitor on page 34 Customizing the Sun Cluster HA for Oracle Server Fault Monitor on page 37 Upgrading Sun Cluster HA for Oracle Resource Types on page 47
10
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
Conguration Requirements
Caution Your data service conguration might not be supported if you do not adhere to these requirements.
Use the requirements in this section to plan the installation and conguration of Sun Cluster HA for Oracle. These requirements apply to Sun Cluster HA for Oracle only. You must meet these requirements before you proceed with your Sun Cluster HA for Oracle installation and conguration. For requirements that apply to all data services, see Conguration Guidelines for Sun Cluster Data Services in Sun Cluster Data Services Planning and Administration Guide for Solaris OS.
I
Oracle application les These les include Oracle binaries, conguration les, and parameter les. You can install these les either on the local le system, the highly available local le system, or on the cluster le system. See Conguration Guidelines for Sun Cluster Data Services in Sun Cluster Data Services Planning and Administration Guide for Solaris OS for the advantages and disadvantages of placing the Oracle binaries on the local le system, highly available local le system, and the cluster le system.
Database-related les These les include the control le, redo logs, and data les. You must install these les on the highly available local le system or the cluster le system as either raw devices or regular les.
What resource groups will you use for network addresses and application resources and the dependencies between them?
Installing and Conguring Sun Cluster HA for Oracle 11
What is the logical hostname (for failover services) or shared address (for scalable services) for clients that will access the data service? Where will the system conguration les reside? See Conguration Guidelines for Sun Cluster Data Services in Sun Cluster Data Services Planning and Administration Guide for Solaris OS for the advantages and disadvantages of placing the Oracle binaries on the local le system rather than the cluster le system.
Note Consult the Oracle documentation before you perform this procedure.
The following steps prepare your nodes and install the Oracle software. 1. Become superuser on all of the cluster members. 2. Congure the /etc/nsswitch.conf les as follows so that the data service starts and stops correctly if a switchover or failover occurs. On each node that can master the logical host that runs Sun Cluster HA for Oracle, include one of the following entries for group in the /etc/nsswitch.conf le.
group: group: group: files files [NOTFOUND=return] nis files [NOTFOUND=return] nisplus
Sun Cluster HA for Oracle uses the su user command to start and stop the database node. The network information name service might become unavailable when a cluster nodes public network fails. Adding one of the preceding entries for group
12 Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
ensures that the su(1M) command does not refer to the NIS/NIS+ name services if the network information name service is unavailable. 3. Congure the cluster le system for Sun Cluster HA for Oracle. If raw devices contain the databases, congure the global devices for raw device access. See the Sun Cluster Software Installation Guide for Solaris OS for information about how to congure global devices. If you use the Solstice DiskSuite/Solaris Volume Manager software, congure the Oracle software to use UNIX le system (UFS) logging on mirrored metadevices or raw-mirrored metadevices. See the Solstice DiskSuite/Solaris Volume Manager documentation for more information about how to congure raw-mirrored metadevices. 4. Prepare the $ORACLE_HOME directory on a local or multihost disk.
Note If you install the Oracle binaries on a local disk, use a separate disk if possible. Installing the Oracle binaries on a separate disk prevents the binaries from overwrites during operating environment reinstallation.
5. On each node, create an entry for the database administrator (DBA) group in the /etc/group le, and add potential users to the group. You typically name the DBA group dba. Verify that the root and oracle users are members of the dba group, and add entries as necessary for other DBA users. Ensure that the group IDs are the same on all of the nodes that run Sun Cluster HA for Oracle, as the following example illustrates.
dba:*:520:root,oracle
You can create group entries in a network name service (for example, NIS or NIS+). If you create group entries in this way, add your entries to the local /etc/inet/hosts le to eliminate dependency on the network name service. 6. On each node, create an entry for the Oracle user ID (oracle). You typically name the Oracle user ID oracle. The following command updates the /etc/passwd and /etc/shadow les with an entry for the Oracle user ID.
# useradd -u 120 -g dba -d /Oracle-home oracle
Ensure that the oracle user entry is the same on all of the nodes that run Sun Cluster HA for Oracle.
1. Congure the disk devices for the Solstice DiskSuite software to use. See the Sun Cluster Software Installation Guide for Solaris OS for information about how to congure the Solstice DiskSuite software. 2. If you use raw devices to contain the databases, run the following commands to change each raw-mirrored metadevices owner, group, and mode. If you do not use raw devices, do not perform this step. a. If you create raw devices, run the following commands for each device on each node that can master the Oracle resource group.
# chown oracle /dev/md/metaset/rdsk/dn # chgrp dba /dev/md/metaset/rdsk/dn # chmod 600 /dev/md/metaset/rdsk/dn
metaset /rdsk/dn
Species the name of the diskset Species the name of the raw disk device within the metaset diskset
diskgroup volume
Species the name of the disk group Species the name of the raw volume within the disk group
14
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
c. Reregister the disk device group with the cluster to keep the VxVM namespace consistent throughout the cluster.
# scconf -c -D name=diskgroup
Local disks of the cluster nodes Highly available local le system Cluster le system
Note Before you install the Oracle software on the cluster le system, start the
Sun Cluster software and become the owner of the disk device group.
See Preparing the Nodes and Disks on page 12 for more information about installation locations. 3. Install the Oracle software. Regardless of where you install the Oracle software, modify each nodes /etc/system les as you would in standard Oracle installation procedures. Then reboot. Log in as oracle to ensure ownership of the entire directory before you perform this step. See the appropriate Oracle installation and conguration guides for instructions about how to install Oracle software.
15
16
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
Place all of the database-related les (data les, redo log les, and control les) on either shared raw global devices or on the cluster le system. See Preparing the Nodes and Disks on page 12 for information about installation locations. Within the init$ORACLE_SID.ora or config$ORACLE_SID.ora le, you might need to modify the assignments for control_files and background_dump_dest to specify the locations of the control les and alert les.
Note If you use Solaris authentication for database logins, set the remote_os_authent variable in the init$ORACLE_SID.ora le to True.
2. Create the database. Start the Oracle installer and select the option to create a database. Alternatively, depending on your Oracle version, you can use the Oracle svrmgrl(1M) command to create the database. During creation, ensure that all of the database-related les are placed in the appropriate location, either on shared global devices or on the cluster le system. 3. Verify that the le names of your control les match the le names in your conguration les. 4. Create the v$sysstat view. Run the catalog scripts that create the v$sysstat view. The Sun Cluster HA for Oracle fault monitor uses this view.
To use the Oracle authentication method For all of the supported Oracle releases, type the following script at the sqlplus prompt.
# sqlplus /as sysdba grant connect, resource to user identified by passwd; alter user user default tablespace system quota 1m on system; grant select on v_$sysstat to user; grant create session to user; grant create table to user; exit;
To use the Solaris authentication method Grant permission for the database to use Solaris authentication.
Note The user for which you enable Solaris authentication is the user who owns the les under the $ORACLE_HOME directory. The following code sample shows that the user oracle owns these les.
# sqlplus /as sysdba create user ops$oracle identified by externally default tablespace system quota 1m on system; grant connect, resource to ops$oracle; grant select on v_$sysstat to ops$oracle; grant create session to ops$oracle; grant create table to ops$oracle; exit;
2. Congure NET8 for the Sun Cluster software. The listener.ora le must be accessible from all of the nodes that are in the cluster. Place these les either under the cluster le system or in the local le system of each node that can potentially run the Oracle resources.
Note If you place the listener.ora le in a location other than the
/var/opt/oracle directory or the $ORACLE_HOME/network/admin directory, you must specify the TNS_ADMIN variable or an equivalent Oracle variable in a user-environment le. For information about Oracle variables, see the Oracle documentation. You must also run the scrgadm(1M) command to set the resource extension parameter User_env, which sources the user-environment le. See Table 12 or Table 13 for format details.
18
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
Sun Cluster HA for Oracle imposes no restrictions on the listener nameit can be any valid Oracle listener name. The following code sample identies the lines in listener.ora that are updated.
LISTENER = (ADDRESS_LIST = (ADDRESS = (PROTOCOL = TCP) (HOST = logical-hostname) <- use logical hostname (PORT = 1527) ) ) . . SID_LIST_LISTENER = . . (SID_NAME = SID) <- Database name, default is ORCL
The following code sample identies the lines in tnsnames.ora that are updated on client machines.
service_name = . . (ADDRESS = (PROTOCOL = TCP) (HOST = logicalhostname) <- logical hostname (PORT = 1527) <- must match port in LISTENER.ORA ) ) (CONNECT_DATA = (SID = <SID>)) <- database name, default is ORCL
The following example shows how to update the listener.ora and tnsnames.ora les for the following Oracle instances.
Instance
Logical Host
Listener
ora8 ora9
hadbms3 hadbms4
LISTENER-ora8 LISTENER-ora9
19
) SID_LIST_LISTENER-ora9 = (SID_LIST = (SID_DESC = (SID_NAME = ora9) ) ) LISTENER-ora8 = (ADDRESS_LIST = (ADDRESS= (PROTOCOL=TCP) (HOST=hadbms3)(PORT=1806)) ) SID_LIST_LISTENER-ora8 = (SID_LIST = (SID_DESC = (SID_NAME = ora8) ) )
3. Verify that the Sun Cluster software is installed and running on all of the nodes.
# scstat
20
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
Note The Web Start program is not available in releases earlier than Sun Cluster 3.1 Data Services 10/03.
How to Install the Sun Cluster HA for Oracle Packages by Using the Web Start Program
You can run the Web Start program with a command-line interface (CLI) or with a graphical user interface (GUI). The content and sequence of instructions in the CLI and the GUI are similar. For more information about the Web Start program, see the installer(1M) man page. 1. On the cluster node where you are installing the Sun Cluster HA for Oracle packages, become superuser. 2. (Optional) If you intend to run the Web Start program with a GUI, ensure that your DISPLAY environment variable is set. 3. Load the Sun Java Enterprise System Accessory CD Volume 3 into the CD-ROM drive. If the Volume Management daemon vold(1M) is running and congured to manage CD-ROM devices, it automatically mounts the CD-ROM on the /cdrom/cdrom0 directory. 4. Change to the Sun Cluster HA for Oracle component directory of the CD-ROM.
Installing and Conguring Sun Cluster HA for Oracle 21
The Web Start program for the Sun Cluster HA for Oracle data service resides in this directory.
# cd /cdrom/cdrom0/\ components/SunCluster_HA_Oracle_3.1
To install only the C locale, select Typical. To install other locales, select Custom.
7. Follow instructions on the screen to install the Sun Cluster HA for Oracle packages on the node. After the installation is nished, the Web Start program provides an installation summary. This summary enables you to view logs that the Web Start program created during the installation. These logs are located in the /var/sadm/install/logs directory. 8. Exit the Web Start program. 9. Unload the Sun Java Enterprise System Accessory CD Volume 3 from the CD-ROM drive. a. To ensure that the CD-ROM is not being used, change to a directory that does not reside on the CD-ROM. b. Eject the CD-ROM.
# eject cdrom
How to Install the Sun Cluster HA for Oracle Packages by Using the scinstall Utility
1. Load the Sun Java Enterprise System Accessory CD Volume 3 into the CD-ROM drive. 2. Run the scinstall utility with no options. This step starts the scinstall utility in interactive mode.
22
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
3. Choose the menu option, Add Support for New Data Service to This Cluster Node. The scinstall utility prompts you for additional information. 4. Provide the path to the Sun Java Enterprise System Accessory CD Volume 3. The utility refers to the CD as the data services cd. 5. Specify the data service to install. The scinstall utility lists the data service that you selected and asks you to conrm your choice. 6. Exit the scinstall utility. 7. Unload the CD from the drive.
23
TABLE 12
Name/Data Type
LISTENER_NAME (string)
The name of the Oracle listener. Default: LISTENER Range: None Tunable: When disabled
ORACLE_HOME (string)
The path to the Oracle home directory. Default: None Range: Minimum = 1 Tunable: When disabled
Probe_timeout (integer)
The time-out value in seconds that the fault monitor uses to probe an Oracle listener. Default: 30 Range: 1 99,999 Tunable: Any time Introduced in release: 3.1 4/04
User_env (string)
A le that contains environment variables to be set before listener startup and shutdown. Those environment variables that have values that differ from Oracle defaults must be dened in this le. For example, a users listener.ora le might not reside under the /var/opt/oracle directory or the $ORACLE_HOME/network/admin. directory. In this situation, the TNS_ADMIN environment variable should be dened. The denition of each environment variable that is dened must follow the format VARIABLE_NAME=VARIABLE_VALUE. Each of these environment variables must be specied, one per line in the environment le. Default: Range: None Tunable: Any time
Table 13 describes the extension properties that you can set for the Oracle server. For the Oracle server, you are required to set only the following extension properties:
I I I
24
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
Connect_string
Sun Cluster HA for Oracle Server Extension Properties
Description
TABLE 13
Name/Data Type
Alert_log_file (string)
Oracle alert log le. Default: None Range: Minimum = 1 Tunable: Any time
Auto_End_Bkp (Boolean)
Species whether the following recovery actions are performed if an Oracle relational database management system (RDBMS) hot backup is interrupted. I Recognizing when a database fails to open because of les that remain in hot backup mode. This verication process occurs when Sun Cluster HA for Oracle starts. I Identifying and releasing all les that remain in hot backup mode. I Opening the database for use. The permitted values for this property are as follows: False Species that the recovery actions are not performed. This value is the default. I True Species that the recovery actions are performed.
I
Default: False Range: None Tunable: Any time Connect_cycle (integer) The number of probe cycles that the server fault monitor performs before disconnecting from the database. Default: 5 Range: 0 99,999 Tunable: Any time Connect_string (string) The Oracle user and password that the server fault monitor uses to connect to the database. Default: None Range: Minimum = 1 Tunable: Any time
25
TABLE 13
(Continued)
Name/Data Type
Custom_action_file The absolute path of the le that denes the custom behavior of the (string) Sun Cluster HA for Oracle server fault monitor. Default: Range: None Tunable: Any time Introduced in release: 3.1 10/03 Debug_level (integer) The level to which Sun Cluster HA for Oracle debug messages are logged. Default: 1 Range: 1 100 Tunable: Any time ORACLE_HOME (string) The path to the Oracle home directory. Default: None Range: Minimum = 1 Tunable: When disabled ORACLE_SID (string) The Oracle system identier. Default: None Range: Minimum = 1 Tunable: When disabled Parameter_file (string) The Oracle parameter le. If the Oracle parameter le is not specied, this property defaults to Oracles default. Default: Range: Minimum = 0 Tunable: Any time Probe_timeout (integer) The time-out value (in seconds) that the server fault monitor uses to probe an Oracle server instance. Default: 60 Range: 0 99,999 Tunable: Any time
26
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
TABLE 13
(Continued)
Name/Data Type
Restart_type (string)
Species the entity that the server fault monitor restarts when the response to a fault is restart. The permitted values for this property are as follows: I RESOURCE_GROUP_RESTART Species that all resources in the resource group that contains this resource are restarted I RESOURCE_RESTART Species that only this resource is restarted Default: RESOURCE_GROUP_RESTART Range: None Tunable: Any time
User_env (string)
A le that contains environment variables to be set before server startup and shutdown. Those environment variables that have values that differ from Oracle defaults must be dened in this le. For example, a users listener.ora le might not reside under the /var/opt/oracle directory or the $ORACLE_HOME/network/admin. directory. In this situation, the TNS_ADMIN environment variable should be dened. The denition of each environment variable that is dened must follow the format VARIABLE_NAME=VARIABLE_VALUE. Each of these environment variables must be specied, one per line in the environment le. Default: NULL Range: None Tunable: Any time
Wait_for_online (Boolean)
Wait in the START method until the database is online. Default: True Range: None Tunable: Any time
27
packages as part of your initial Sun Cluster installation, go to Installing the Sun Cluster HA for Oracle Packages on page 21 to install the data service packages. Otherwise, use this procedure to congure the Sun Cluster HA for Oracle. You must have the following information to perform this procedure.
I I
The names of the cluster nodes that master the data service. The network resource that clients use to access the data service. Normally, you set up this IP address when you install the cluster. See the Sun Cluster Concepts Guide for Solaris OS for details about network resources. The path to the Oracle application binaries for the resources that you plan to congure.
1. Become superuser on a cluster member. 2. Run the scrgadm command to register the resource types for the data service. For Sun Cluster HA for Oracle, you register two resource types, SUNW.oracle_server and SUNW.oracle_listener, as follows.
# scrgadm -a -t SUNW.oracle_server # scrgadm -a -t SUNW.oracle_listener
-a -t SUNW.oracle_type
Adds the data service resource type. Species the predened resource type name for your data service.
3. Create a failover resource group to hold the network and application resources. You can optionally select the set of nodes on which the data service can run with the -h option, as follows.
# scrgadm -a -g resource-group [-h nodelist]
-g resource-group
Species the name of the resource group. This name can be your choice but must be unique for resource groups within the cluster. Species an optional comma-separated list of physical node names or IDs that identify potential masters. The order here determines the order in which the nodes are considered as primary during failover.
-h nodelist
Note Use the -h option to specify the order of the node list. If all of the nodes that are in the cluster are potential masters, you do not need to use the -h option.
4. Verify that all of the network resources that you use have been added to your name service database. You should have performed this verication during the Sun Cluster installation.
28 Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
Note Ensure that all of the network resources are present in the servers and clients /etc/inet/hosts le to avoid any failures because of name service lookup.
-l logical-hostname
Species a network resource. The network resource is the logical hostname or shared address (IP address) that clients use to access Sun Cluster HA for Oracle. Species an optional, comma-separated list that identies the IP Networking Multipathing groups that are on each node. Each element in netiist must be in the form of netif@node. netif can be given as an IP Networking Multipathing group name, such as sc_ipmp0. The node can be identied by the node name or node ID, such as sc_ipmp0@1 or sc_ipmp@phys-schost-1.
Note Sun Cluster does not currently support the use of the adapter name for netif.
[-n netiist]
Note AffinityOn must be set to TRUE and the local le system must reside on global disk groups to be failover.
29
8. Run the scswitch command to complete the following tasks and bring the resource group oracle-rg online on a cluster node.
Caution Be sure to switch only at the resource group level. Switching at the device group level confuses the resource group, causing it to fail over.
I I
Move the resource group into a MANAGED state. Bring the resource group online.
This node is made the primary for device group ora-set1 and raw device /dev/global/dsk/d1. Device groups that are associated with le systems such as /global/ora-inst and /global/ora-data/logs are also made primaries on this node.
# scswitch -Z -g oracle-rg
-j resource Species the name of the resource to add. -g resource-group Species the name of the resource group into which the resources are to be placed. -t SUNW.oracle_server/listener Species the type of the resource to add. -x Alert_log_file=path-to-log Sets the path under $ORACLE_HOME for the server message log. -x Connect_string=user/passwd Species the user and password that the fault monitor uses to connect to the database. These settings must agree with the permissions that you set up in
30 Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
How to Set Up Oracle Database Permissions on page 17. If you use Solaris authorization, type a slash (/) instead of the user name and password. -x ORACLE_SID=instance Sets the Oracle system identier. -x LISTENER_NAME=listener Sets the name of the Oracle listener instance. This name must match the corresponding entry in listener.ora. -x ORACLE_HOME=Oracle-home Sets the path to the Oracle home directory. -x Restart_type=entity-to-restart Species the entity that the server fault monitor restarts when the response to a fault is restart. Set entity-to-restart as follows:
I
To specify that all resources in the resource group that contains this resource are restarted, set entity-to-restart to RESOURCE_GROUP_RESTART. By default, the resource group that contains this resource is restarted. If you set entity-to-restart to RESOURCE_GROUP_RESTART, all other resources (such as Apache or DNS) in the resource group are restarted, even if they are not faulty. Therefore, include in the resource group only the resources that you require to be restarted when the Oracle server resource is restarted.
Note Optionally, you can set additional extension properties that belong to the Oracle data service to override their default values. See Sun Cluster HA for Oracle Extension Properties on page 23 for a list of extension properties.
-Z -g resource-group
Enables the resource and monitor, moves the resource group to the MANAGED state, and brings it online. Species the name of the resource group.
Resource group: resource-group-1 (failover resource group) Oracle Resources: oracle-server-1, oracle-listener-1 Oracle Instances: ora-lsnr (listener), ora-srvr (server) (Add the failover resource group to contain all of the resources.) # scrgadm -a -g resource-group-1 (Add the logical hostname resource to the resource group.) # scrgadm -a -L -g resource-group-1 -l schost-1 (Register the Oracle resource types.) # scrgadm -a -t SUNW.oracle_server # scrgadm -a -t SUNW.oracle_listener (Add the Oracle application resources to the resource group.) # scrgadm -a -j oracle-server-1 -g resource-group-1 \ -t SUNW.oracle_server -x ORACLE_HOME=/global/oracle \ -x Alert_log_file=/global/oracle/message-log \ -x ORACLE_SID=ora-srvr -x Connect_string=scott/tiger # scrgadm -a -j oracle-listener-1 -g resource-group-1 \ -t SUNW.oracle_listener -x ORACLE_HOME=/global/oracle \ -x LISTENER_NAME=ora-lsnr (Bring the resource group online.) # scswitch -Z -g resource-group-1
32
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
5. Shut down the Oracle instance. The Sun Cluster software restarts the Oracle instance because the Oracle instance is under Sun Cluster control. 6. Switch the resource group that contains the Oracle database resource to another cluster member. The following example shows how to complete this step.
# scswitch -z -g resource-group -h node
7. Log in as oracle to the node that now contains the resource group. 8. Repeat Step 3 and Step 4 to conrm interactions with the Oracle instance.
Oracle Clients
Clients must always refer to the database by using the network resource, not the physical hostname. The network resource is an IP address that can move between physical nodes during failover. The physical hostname is a machine name. For example, in the tnsnames.ora le, you must specify the network resource as the host on which the database instance is running. The network resource is a logical hostname or a shared address. See How to Set Up Oracle Database Permissions on page 17.
Note Oracle client-server connections cannot survive a Sun Cluster HA for Oracle switchover. The client application must be prepared to handle disconnection and reconnection or recovery as appropriate. A transaction monitor might simplify the application. Further, Sun Cluster HA for Oracle node recovery time is application dependent.
33
The /var/opt/SUNWscor/oracle_server directory contains log les for the Oracle server. The /var/opt/SUNWscor/oracle_listener directory contains log les for the Oracle listener.
These les contain information about actions that the Sun Cluster HA for Oracle data service performs. Refer to these les to obtain diagnostic information for troubleshooting your conguration or to monitor the behavior of the Sun Cluster HA for Oracle data service.
A main fault monitor process, which performs error lookup and scha_control actions A database client fault probe, which performs database transactions
34
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
Ignore. The server fault monitor ignores the error. Stop monitoring. The server fault monitor is stopped without shutting down the database. Restart. The server fault monitor stops and restarts the entity that is specied by the value of the Restart_type extension property:
I
If the Restart_type extension property is set to RESOURCE_GROUP_RESTART, the server fault monitor restarts the database server resource group. By default, the server fault monitor restarts the database server resource group.
35
If the Restart_type extension property is set to RESOURCE_RESTART, the server fault monitor restarts the database server resource.
Note The number of attempts to restart might exceed the value of the Retry_count resource property within the time that the Retry_interval resource property species. If this situation occurs, the server fault monitor attempts to switch over the resource group to another node.
Switch over. The server fault monitor switches over the database server resource group to another node. If no nodes are available, the attempt to switch over the resource group fails. If the attempt to switch over the resource group fails, the database server is restarted.
When the server fault monitor is started Each time that the server fault monitor queries the health of the server
If an action is dened for a logged alert that the server fault monitor detects, the server fault monitor performs the action in response to the alert. Preset actions for logged alerts are listed in Table A2. To change the action that the server fault monitor performs, customize the server fault monitor as explained in Customizing the Sun Cluster HA for Oracle Server Fault Monitor on page 37.
36
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
If a problem occurs with the listener during a probe, the probe tries to restart the listener. The value that is set in the resource property Retry_count determines the maximum number of times that the probe attempts the restart. If, after trying for the maximum number of times, the probe is still unsuccessful, the probe stops the fault monitor and does not switch over the resource group.
Overriding the preset action for an error Specifying an action for an error for which no action is preset
Caution Before you customize the Sun Cluster HA for Oracle server fault monitor, consider the effects of your customizations, especially if you change an action from restart or switch over to ignore or stop monitoring. If errors remain uncorrected for long periods, the errors might cause problems with the database. If you encounter problems with the database after customizing the Sun Cluster HA for Oracle server fault monitor, revert to using the preset actions. Reverting to the preset actions enables you to determine if the problem is caused by your customizations.
Customizing the Sun Cluster HA for Oracle server fault monitor involves the following activities: 1. Dening custom behavior for errors 2. Propagating a custom action le to all nodes in a cluster 3. Specifying the custom action le that a server fault monitor should use
DBMS errors that occur during a probe of the database by the server fault monitor Alerts that Oracle logs in the alert log le
37
Timeouts that result from a failure to receive a response within the time that is set by the Probe_timeout extension property
To dene custom behavior for these types of errors, create a custom action le.
An entry in a custom action le consists of a sequence of keyword-value pairs that are separated by semicolons. Each entry is enclosed in braces. The format of an entry in a custom action le is as follows:
{ [ERROR_TYPE=DBMS_ERROR|SCAN_LOG|TIMEOUT_ERROR;] ERROR=error-spec; [ACTION=SWITCH|RESTART|STOP|NONE;] [CONNECTION_STATE=co|di|on|*;] [NEW_STATE=co|di|on|*;] [MESSAGE="message-string"] }
White space may be used between separated keyword-value pairs and between entries to format the le. The meaning and permitted values of the keywords in a custom action le are as follows: ERROR_TYPE Indicates the type of the error that the server fault monitor has detected. The following values are permitted for this keyword: DBMS_ERROR SCAN_LOG TIMEOUT_ERROR Species that the error is a DBMS error. Species that the error is an alert that is logged in the alert log le. Species that the error is a timeout.
38
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
The ERROR_TYPE keyword is optional. If you omit this keyword, the error is assumed to be a DBMS error. ERROR Identies the error. The data type and the meaning of error-spec are determined by the value of the ERROR_TYPE keyword as shown in the following table.
ERROR_TYPE
Data Type
Meaning
DBMS_ERROR
The error number of a DBMS error that is generated by Oracle A string in an error message that Oracle has logged to the Oracle alert log le The number of consecutive timed-out probes since the server fault monitor was last started or restarted
SCAN_LOG
TIMEOUT_ERROR
You must specify the ERROR keyword. If you omit this keyword, the entry in the custom action le is ignored. ACTION Species the action that the server fault monitor is to perform in response to the error. The following values are permitted for this keyword: NONE STOP RESTART Species that the server fault monitor ignores the error. Species that the server fault monitor is stopped. Species that the server fault monitor stops and restarts the entity that is specied by the value of the Restart_type extension property of the SUNW.oracle_server resource. Species that the server fault monitor switches over the database server resource group to another node.
SWITCH
The ACTION keyword is optional. If you omit this keyword, the server fault monitor ignores the error. CONNECTION_STATE Species the required state of the connection between the database and the server fault monitor when the error is detected. The entry applies only if the connection is in the required state when the error is detected. The following values are permitted for this keyword: * co Species that the entry always applies, regardless of the state of the connection. Species that the entry applies only if the server fault monitor is attempting to connect to the database.
39
on di
Species that the entry applies only if the server fault monitor is online. The server fault monitor is online if it is connected to the database. Species that the entry applies only if the server fault monitor is disconnecting from the database.
The CONNECTION_STATE keyword is optional. If you omit this keyword, the entry always applies, regardless of the state of the connection. NEW_STATE Species the state of the connection between the database and the server fault monitor that the server fault monitor must attain after the error is detected. The following values are permitted for this keyword: * co di Species that the state of the connection must remain unchanged. Species that the server fault monitor must disconnect from the database and reconnect immediately to the database. Species that the server fault monitor must disconnect from the database. The server fault monitor reconnects when it next probes the database.
The NEW_STATE keyword is optional. If you omit this keyword, the state of the database connection remains unchanged after the error is detected. MESSAGE Species an additional message that is printed to the resources log le when this error is detected. The message must be enclosed in double quotes. This message is additional to the standard message that is dened for the error. The MESSAGE keyword is optional. If you omit this keyword, no additional message is printed to the resources log le when this error is detected.
ERROR_TYPE is set to DBMS_ERROR. ERROR is set to the error number of the DBMS error. ACTION is set to the action that you require.
For example, no action is preset for Oracle error 4031: unable to allocate num-bytes bytes of shared memory. However, this Oracle error indicates that the shared global area (SGA) has insufficient memory, is badly fragmented, or both states apply. If this error affects only a single session, ignoring the error might be appropriate. However, if this error affects more than one session, consider specifying that the server fault monitor restart the database. The following example shows an entry in a custom action le for changing the response to a DBMS error to restart.
EXAMPLE 11
This example shows an entry in a custom action le that overrides the preset action for DBMS error 4031. This entry species the following behavior:
I
In response to DBMS error 4031, the action that the server fault monitor performs is restart. This entry applies regardless of the state of the connection between the database and the server fault monitor when the error is detected. The state of the connection between the database and the server fault monitor must remain unchanged after the error is detected. The following message is printed to the resources log le when this error is detected:
Insufficient memory in shared pool.
41
The following example shows an entry in a custom action le for ignoring a DBMS error.
EXAMPLE 12
This example shows an entry in a custom action le that overrides the preset action for DBMS error 4030. This entry species the following behavior:
I I
The server fault monitor ignores DBMS error 4030. This entry applies regardless of the state of the connection between the database and the server fault monitor when the error is detected. The state of the connection between the database and the server fault monitor must remain unchanged after the error is detected. No additional message is printed to the resources log le when this error is detected.
ERROR_TYPE is set to SCAN_LOG. ERROR is set to a quoted regular expression that identies a string in an error message that Oracle has logged to the Oracle alert log le. ACTION is set to the action that you require.
The server fault monitor processes the entries in a custom action le in the order in which the entries occur. Only the rst entry that matches a logged alert is processed. Later entries that match are ignored. If you are using regular expressions to specify actions for several logged alerts, ensure that more specic entries occur before more general entries. Specic entries that occur after general entries might be ignored.
42 Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
For example, a custom action le might dene different actions for errors that are identied by the regular expressions ORA-65 and ORA-6. To ensure that the entry that contains the regular expression ORA-65 is not ignored, ensure that this entry occurs before the entry that contains the regular expression ORA-6. The following example shows an entry in a custom action le for changing the response to a logged alert.
EXAMPLE 13
This example shows an entry in a custom action le that overrides the preset action for logged alerts about internal errors. This entry species the following behavior:
I
In response to logged alerts that contain the text ORA-00600: internal error, the action that the server fault monitor performs is restart. This entry applies regardless of the state of the connection between the database and the server fault monitor when the error is detected. The state of the connection between the database and the server fault monitor must remain unchanged after the error is detected. No additional message is printed to the resources log le when this error is detected.
To change the maximum number of consecutive timed-out probes allowed, create one entry in a custom action le for each consecutive timed-out probe that is allowed except the rst timed-out probe.
Installing and Conguring Sun Cluster HA for Oracle 43
Note You are not required to create an entry for the rst timed-out probe. The action that the server fault monitor performs in response to the rst timed-out probe is preset.
For the last allowed timed-out probe, create an entry in which the keywords are set as follows:
I I
ERROR_TYPE is set to TIMEOUT_ERROR. ERROR is set to the maximum number of consecutive timed-out probes that are allowed. ACTION is set to RESTART.
For each remaining consecutive timed-out probe except the rst timed-out probe, create an entry in which the keywords are set as follows:
I I
ERROR_TYPE is set to TIMEOUT_ERROR. ERROR is set to the sequence number of the timed-out probe. For example, for the second consecutive timed-out probe, set this keyword to 2. For the third consecutive timed-out probe, set this keyword to 3. ACTION is set to NONE.
Tip To facilitate debugging, specify a message that indicates the sequence number of the timed-out probe.
The following example shows the entries in a custom action le for increasing the maximum number of consecutive timed-out probes to ve.
EXAMPLE 14
{ ERROR_TYPE=TIMEOUT; ERROR=2; ACTION=NONE; CONNECTION_STATE=*; NEW_STATE=*; MESSAGE="Timeout #2 has occurred."; } { ERROR_TYPE=TIMEOUT; ERROR=3; ACTION=NONE; CONNECTION_STATE=*; NEW_STATE=*; MESSAGE="Timeout #3 has occurred."; } 44 Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
EXAMPLE 14
(Continued)
{ ERROR_TYPE=TIMEOUT; ERROR=4; ACTION=NONE; CONNECTION_STATE=*; NEW_STATE=*; MESSAGE="Timeout #4 has occurred."; } { ERROR_TYPE=TIMEOUT; ERROR=5; ACTION=RESTART; CONNECTION_STATE=*; NEW_STATE=*; MESSAGE="Timeout #5 has occurred. Restarting."; }
This example shows the entries in a custom action le for increasing the maximum number of consecutive timed-out probes to ve. These entries specify the following behavior:
I
The server fault monitor ignores the second consecutive timed-out probe through the fourth consecutive timed-out probe. In response to the fth consecutive timed-out probe, the action that the server fault monitor performs is restart. The entries apply regardless of the state of the connection between the database and the server fault monitor when the timeout occurs. The state of the connection between the database and the server fault monitor must remain unchanged after the timeout occurs. When the second consecutive timed-out probe through the fourth consecutive timed-out probe occurs, a message of the following form is printed to the resources log le:
Timeout #number has occurred.
When the fth consecutive timed-out probe occurs, the following message is printed to the resources log le:
Timeout #5 has occurred. Restarting.
45
Locating the le on a le system that all nodes share Locating the le on a highly available local le system Copying the le to the local le system of each cluster node by using operating system commands such as the rcp(1) command or the rdist(1) command
Specifying the Custom Action File That a Server Fault Monitor Should Use
To apply customized actions to a server fault monitor, you must specify the custom action le that the fault monitor should use. Customized actions are applied to a server fault monitor when the server fault monitor reads a custom action le. A server fault monitor reads a custom action le when the you specify the le. Specifying a custom action le also validates the le. If the le contains syntax errors, an error message is displayed. Therefore, after modifying a custom action le, specify the le again to validate the le.
Caution If syntax errors in a modied custom action le are detected, correct the errors before the fault monitor is restarted. If the syntax errors remain uncorrected when the fault monitor is restarted, the fault monitor reads the erroneous le, ignoring entries that occur after the rst syntax error.
-j server-resource Species the SUNW.oracle_server resource -x custom_action_file=lepath Species the absolute path of the custom action le
SUNW.oracle_listener, which represents an Oracle listener SUNW.oracle_server, which represents an Oracle server
Upgrade these resource types if all conditions in the following list apply:
I
You are upgrading from an earlier version of the Sun Cluster HA for Oracle data service. You need to use the new features of this data service.
For general instructions that explain how to upgrade a resource type, see Upgrading a Resource Type in Sun Cluster Data Services Planning and Administration Guide for Solaris OS.
47
1 3.1 4
To determine the version of the resource type that is registered, use one command from the following list:
I I
The resource type registration (RTR) le for this resource type is /opt/SUNWscor/oracle_listener/etc/SUNW.oracle_listener.
You can perform the migration at any time. If you need to use the features of the SUNW.oracle_listener resource type that were introduced in version 3.1 4/04, the required value of the Type_version property is 4. If you need to specify the time-out value in seconds that the fault monitor uses to probe an Oracle listener, set the Probe_timeout extension property. For more information, see Table 12.
The following example shows a command for editing an instance of the SUNW.oracle_listener resource type.
EXAMPLE 15
The SUNW.oracle_listener resource is named oracle-lrs. The Type_version property of this resource is set to 4. The time-out value in seconds that the fault monitor uses to probe an Oracle listener is set to 60 seconds.
48
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
1 3.1 4
To determine the version of the resource type that is registered, use one command from the following list:
I I
The resource type registration (RTR) le for this resource type is /opt/SUNWscor/oracle_server/etc/SUNW.oracle_server.
You can perform the migration at any time. If you need to use the features of the SUNW.oracle_server resource type that were introduced in version 3.1 10/03, the required value of the Type_version property is 4. If you customized the behavior of the server fault monitor, set the Custom_action_file extension property. For more information, see Customizing the Sun Cluster HA for Oracle Server Fault Monitor on page 37.
Installing and Conguring Sun Cluster HA for Oracle 49
The following example shows a command for editing an instance of the SUNW.oracle_server resource type.
EXAMPLE 16
The SUNW.oracle_server resource is named oracle-srs. The Type_version property of this resource is set to 4. Custom behavior for the fault monitor of this resource is specied in the le /opt/SUNWscor/oracle_server/etc/srv_mon_cust_actions.
50
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
APPENDIX
Preset Actions for Database Management System (DBMS) Errors and Logged Alerts
Preset actions for DBMS errors and logged alerts are listed as follows:
I I
DBMS errors for which an action is preset are listed in Table A1. Logged alerts for which an action is preset are listed in Table A2.
co co on *
di di di di
Max. number of DBMS sessions exceeded Max. number of DBMS processes exceeded Session killed by DBA, will reconnect O/S error occurred while obtaining an enqueue. See o/s error. timeout occurred while waiting for resource maximum number of DML locks in DBMS exceeded Need to set DML_LOCKS in init.ora file to value other than 0 failed to connect to ORACLE listener process archiver error. Connect internal only, until freed. Operating system archival error occurred. Check alert log. fatal error in background process normal completion of background process background process %s unexpectedly terminated with error %s
* * *
di * di
di di di
SWITCH
di di di
RESTART * RESTART *
51
(Continued)
Message
470 471 472 473 474 475 476 477 480 481 482 602 604 705 942 1001 1002 1003 1012 1012 1014 1017 1031 1033 1033 1034
* * * *
di di di di di di di di di di di di di di *
Oracle background process died Oracle background process died Oracle background process died Oracle background process died SMON died, warm start required Oracle background process died Oracle background process died Oracle background process died LCK* process terminated with error LMON process terminated with error LMD* process terminated with error internal programming exception Recursive error inconsistent state during start up Warning - V$SYSSTAT not accessible - check grant on V_$SYSSTAT Lost connection to database Internal error in HA-DBMS Oracle Resetting database connection Not logged on Not logged on ORACLE shutdown in progress Please correct login information in HA-DBMS Oracle database configuration Insufficient privileges to perform DBMS operations check Oracle user privileges Oracle is in the shutdown or initialization process Oracle is in the shutdown or initialization process Oracle is not available
RESTART * NONE on
on on on on
di * di di co * *
NONE
on
NONE NONE
co *
co di co
RESTART co
52
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
(Continued)
Message
1034 1034 1035 1041 1041 1045 1046 1050 1053 1054 1075 1089 1089 1090 1092 1513 1542 1552 1950 2701 2703 2704 2709 2710 2719 2721
RESTART di NONE on
co di co di co *
Oracle is not available Oracle is not available Access restricted - restarting database to reset
Fault monitor user lacks CREATE SESSION privilege logon denied. cannot acquire space to extend context area cannot acquire space to open context area user storage address cannot be read or written user storage address cannot be read or written Already logged on immediate shutdown in progresss Investigate! Could be hanging! shutdown in progress - connection is not permitted ORACLE instance terminated. Disconnection forced invalid current time returned by operating system table space is off-line - please correct! rollback segment is off-line - please correct! Insufficient privileges to perform DBMS operations check Oracle user privileges HA-DBMS Oracle error - ORACLE_HOME did not get set!
RESTART * RESTART * SWITCH SWITCH NONE NONE NONE NONE NONE SWITCH NONE NONE NONE * * co on * * * * on on on
di di * * on di * di di * * * *
STOP
* di di di di di *
Appendix A Preset Actions for Database Management System (DBMS) Errors and Logged Alerts
53
(Continued)
Message
2726 2735 2811 2839 2840 2846 2847 2849 2842 3113 3113 3113 3114 4030 4032 4100 6108 6114 7205 7206 7208 7210 7211 7212 7213 7214
STOP
Could not locate ORACLE executables - check ORACLE_HOME setting osnfpm: cannot create shared memory segment Unable to attach shared memory segment Sync of blocks to disk failed.
* * * * * * * * di di di co * * *
Client unable to fork a server - Out of memory lost connection lost connection lost connection Not connected?
communication area cannot be allocated insufficient memory Cant connect to remote database - make sure SQL*Net server is up Cant connect to remote database - check SQL*Net configuration
STOP
co
STOP
co
* * * * * * * *
di di di di di di di di
54
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
(Continued)
Message
7215 7216 7218 7219 7223 7224 7229 7232 7234 7238 7250 7251 7252 7253 7258 7259 7263 7269 7279 7280 7296 7297 7306 7310 7315 7321 7322 7324
* * *
di di di * * slspool: unable to allocate spooler argument buffer. slspool: fork error, unable to spawn spool process. Resource limit reached
RESTART * RESTART *
* * * * *
RESTART *
Appendix A Preset Actions for Database Management System (DBMS) Errors and Logged Alerts
55
(Continued)
Message
7325 7351 7361 7404 7414 7415 7417 7418 7419 7430 7455 7456 7466 7470 7475 7476 7477 7478 7479 7481 9706 9716 9718 9740 9748 9747 9749 9751
RESTART * SWITCH *
* * * * * * * * * * * * * * * * * * * * * * * * * * * *
RESTART * SWITCH *
RESTART * RESTART * SWITCH SWITCH SWITCH SWITCH SWITCH SWITCH SWITCH SWITCH SWITCH SWITCH SWITCH SWITCH SWITCH SWITCH SWITCH SWITCH * * * * * * * * * * * * * * * *
56
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
(Continued)
Message
9755 9757 9756 9758 9761 9765 9779 9829 9831 9834 9836 9838 9837 9844 9845 9846 9847 9853 9854 9856 9874 9876 9877 9878 9879 9885 9888 9894
* * * * * * * * * * * * * * * * * * * * * * * * * * * *
Appendix A Preset Actions for Database Management System (DBMS) Errors and Logged Alerts
57
(Continued)
Message
9909 9912 9913 9919 9943 9947 9948 9949 9950 12505 12541 12545 27100
* * * * * * * * * * TNS:listener could not resolve SID given in connect descriptor.Check listener configuration file. TNS:no listener. Please verify connect_string property, listener and TNSconfiguration. Please check HA-Oracle parameters. Connect failed because target host or object does not exist Shared memory realm already exists
STOP
STOP
STOP
TABLE A2
Alert String
SWITCH * NONE *
di *
Semaphore access problem Warning: Multi-threaded Oracle server process died (restarted automatically)
NONE
58
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
Index
A
action le, See custom action le ACTION keyword, 39 actions listener fault monitor, 36 preset for fault monitor, 51 server fault monitor changing, 39 denition, 35 adding, messages to log les, 40 Alert_log_file extension property, 25 alert logs changing response to errors, 42 preset actions, 58 use of by fault monitor, 36 Auto_End_Bkp extension property, 25 conguring (Continued) Sun Cluster HA for Oracle performing, 23 planning, 11 Connect_cycle extension property, 25 Connect_string extension property, 25 CONNECTION_STATE keyword, 39 custom action le format, 38 maximum number of entries in, 38 order of entries in, 42 propagating to cluster nodes, 46 specifying, 46 validating, 46 Custom_action_file extension property, 26 customizing, server fault monitor, 37
C
C locale, 22 caution notice, server fault monitor customizations, 37 changing number of allowed timeouts, 43 response to DBMS errors, 40 response to logged alerts, 42 server fault monitor actions, 39 commands, node information, 8 conguring Oracle database with Solstice DiskSuite, 13 Oracle database with VERITAS Volume Manager, 14
D
database management system (DBMS) errors changing response to, 40 preset actions, 51 database-related les, conguration requirements, 11 DBMS (database management system) errors changing response to, 40 preset actions, 51 Debug_level extension property, 26 directories, /var/sadm/install/logs, 22
59
E
editing listener resource type instances, 48 server resource type instances, 49 ERROR keyword, 39 ERROR_TYPE keyword, 38 errors in custom action le, 46 DBMS changing response to, 40 preset actions for, 51 ignoring, 41 responding to, 40 SGA, 41 timeouts, 43 types detected by fault monitor, 37 extension properties listener, 24 server, 25
les, RTR (Continued) server, 49 Sun Cluster HA for Oracle logs additional messages in, 40 location, 34 fragmentation, of memory, 41
H
heap memory, 41 hot backup mode, 25
I
ignoring, minor errors, 41 installing Oracle software, 15 Sun Cluster HA for Oracle by using scinstall utility, 22 by using Web Start program, 21 log les created, 22 planning, 11 insufficient memory errors, 41
F
fault monitor overview, 34 actions changing, 39 denition, 35, 36 alert logs, 36 caution notice, 37 customizing, 37 error types detected by, 37 preset actions, 51 les alert log changing response to errors in, 42 use of by fault monitor, 36 custom action format, 38 order of entries in, 42 propagating to cluster nodes, 46 specifying, 46 validating, 46 database, 11 installation logs, 22 Oracle application, 11 RTR listener, 48
60
K
keywords, custom action le, 38
L
listener, extension properties, 24 listener fault monitor, 36 LISTENER_NAME extension property, 24 locales, 22 log les installation, 22 Sun Cluster HA for Oracle additional messages in, 40 location, 34 logged alerts changing response to, 42 preset actions, 58 use of by fault monitor, 36
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A
M
maximum values entries in custom action le, 38 number of allowed timeouts, 43 memory shortage of, 41 MESSAGE keyword, 40 migrating listener resource type instances, 48 server resource type instances, 49
N
NEW_STATE keyword, 40
preventing unnecessary restarts for DBMS errors, 41 for timeouts, 43 Probe_timeout extension property listener, 24 server, 26 properties See also extension properties Type_version listener, 48 server, 49 prtconf -v command, 8 prtdiag -v command, 8 psrinfo -v command, 8
O
Oracle See also Sun Cluster HA for Oracle application les, 11 clients, 33 database conguring with Solstice DiskSuite, 13 conguring with VERITAS Volume Manager, 14 creating, 16 setting permissions, 17 error numbers, 51 installing, 15 preparing nodes for installation of, 12 verifying installation, 16 ORACLE_HOME extension property listener, 24 server, 26 ORACLE_SID extension property, 26 order, entries in custom action le, 42 overriding, server fault monitor presets, 37
R
registering Sun Cluster HA for Oracle generally, 23 during listener upgrade, 47 during server upgrade, 49 resource type registration (RTR) le listener, 48 server, 49 resource types migrating instances of listener, 48 server, 49 responding, to major errors, 40 Restart_type extension property, 27 restarts preventing for DBMS errors, 41 for timeouts, 43 RTR (resource type registration) le listener, 48 server, 49
P
Parameter_file extension property, 26 planning, Sun Cluster HA for Oracle conguration, 11 preset actions, fault monitor, 51
S
scinstall -pv command, 8 server, extension properties, 25
Index
61
server fault monitor actions changing, 39 denition, 35 alert logs, 36 caution notice, 37 customizing, 37 error types detected by, 37 overview, 34 preset actions, 51 sessions effect of errors on, 41 SGA (shared global area), errors, 41 shared global area (SGA), errors, 41 showrev p command, 8 Sun Cluster HA for Oracle See also Oracle conguration performing, 23 planning, 11 installing by using scinstall utility, 22 by using Web Start program, 21 overview of process, 10 planning, 11 listener extension properties, 24 log les additional messages in, 40 location, 34 registering, 23 resource type versions listener, 47 server, 49 server extension properties, 25 SUNW.HAStoragePlus resource type, 29 upgrading, 47 verifying installation, 33 SUNW.HAStoragePlus resource type, 29 syntax errors, custom action le, 46
U
upgrading, Sun Cluster HA for Oracle, 47 User_env extension property listener, 24 server, 27
V
validating, custom action le, 46 /var/sadm/install/logs directory, 22 verifying Oracle installation, 16 Sun Cluster HA for Oracle installation, 33 versions listener resource types, 47 server resource types, 49
W
Wait_for_online extension property, 27 Web Start program, 21
T
timeouts, 43 Type_version property listener, 48 server, 49
62
Sun Cluster Data Service for Oracle Guide for Solaris OS April 2004, Revision A