Documente Academic
Documente Profesional
Documente Cultură
Jan 7, 2009 Solaris, UNIX, vcs Its not entirely clear from the documentation, but Veritas Cluster heartbeat links need to be on separate VLANs. They mention the requirement of different switches, but say nothing about VLANs. Do not use one big VLAN for all your private heartbeat links you need two. Your different clusters can share these two VLANs, but if you have two heartbeat connections for your cluster, they need to be isolated from each other in hardware or in VLANs. If you do put them on the same VLAN or cross your links so they can see each other, youll get something like: Dec 11 16:39:20 server llt: [ID 525299 kern.notice] LLT WARNING V-14-1-10497 crossed links? link 0 and link 1 of node 0 on the same network
haconf -dump -makero hastop -all Then hand-edit the main.cf file in /opt/VRTSvcs/conf/config. Insert one line within the cluster definition block. Heres an example: cluster BIG-CLUSTER4 UserNames = { admin = cERpdxPmHpzS. } Administrators = { admin } ClusterAddress = 192.168.65.144 UseFence = SCSI3 ) Once you insert your line, its a good idea to check the syntax of main.cf: hacf -verify /etc/VRTSvcs/conf/config Then, copy the updated main.cf file from this node to the other nodes using your preferred method rcp, scp, ftp, whatever. Then on each node, hastart. You can verify the fencing configuration with this: #/sbin/vxfenadm -d I/O Fencing Cluster Information: ================================ Fencing Protocol Version: 201 Fencing Mode: SCSI3 Fencing SCSI3 Disk Policy: dmp Cluster Members: * 0 (server1) 1 (server2) RFSM State Information: node 0 in state 8 (running) node 1 in state 8 (running) 4. Testing your fencing setup Youll want to test this before going into production. I have used a few methods to test this, but these are the easiest. 1. If you have physical access to your server Unplug the two heartbeat links and the public network link. Fencing should kick in and the nodes will all race for those cooridator disks. The winner will take control, and the other cluster nodes will panic and reboot. Have a console connection on the nodes to verify.
2. If you have switch access, or access to someone who does An easy thing to do is to disable the network ports corresponding to the heartbeats and public network link. Almost the same as #1. 3. If you want to perform the test by yourself and have no physical access Set up scripts to change the speed/duplex on the NICs running your heartbeats and public network. Do this from the serial console, so you dont lose access obviously (Ive done similar things quite a few times). Once your switch is still auto/auto and your NIC is forced to 10-half with no auto-negotiation, communication will be impossible and effectively youve severed your links. Happy clustering!