Documente Academic
Documente Profesional
Documente Cultură
Agenda
Agenda
Install PuTTY
Install vMA
Install RVTools
Agenda
vMA: vi-fastpass
Agenda
Located in C:\ProgramData\Vmware\Vmware
Virtualcenter\Logs (in Windows 2008)
Troubleshooting
Security
Sharing
UDP 514
TCP 514
Encrypted SSL 1514
Agenda
10
Agenda
11
CPU Ready
12
CPU Ready
High CPU Ready Time
Memory troubleshooting
13
Agenda
14
dvSwitch
Ease network troubleshooting
Centralized administration
Enterprise plus required!!
Private VLANs
Network vMotion (port state follows VM)
NIC teaming based on load
Requirement for Cisco Nexus 1000V
vNICs
Port Groups
VMotion Port
VM Port Group
VMotion Port
VM Port Group
Virtual
vSwitches
Physical
NICs
Physical
Switches
Physical
ESXi Host 1
ESX Host 2
VMotion
Virtual Machines
Service Console
vCenter
Server
Service
Console
Hidden
vSwitches
(IO plane)
Virtual
Physical
ESXi Host 1
ESX Host 2
15
APP
APP
APP
APP
APP
APP
APP
APP
OS
OS
OS
OS
OS
OS
OS
OS
OS
vSwitch
vSwitch
vSwitch
vNetwork
CiscoDistributed
Nexus 1000V
Switch
VMware vSphere
16
/etc/vmware/esx.conf
17
esxcfg-nics
esxcfg-route
esxcfg-vswitch
esxcfg-info -n
18
19
vCenter Server
hostd
vpxa
ESXi host
Host Agent
vmware-hostd program
Logs to hostd.log
20
Logs to vpxa.log
21
Physical
Port-mirror
SPAN (switch port analyzer)
Promiscuous port
Virtual
22
Agenda
Storage Troubleshooting
Reviewing vSphere 5 Storage Maximums
23
Storage Troubleshooting
Storage Terms - PSA, MPP, NMP, SATP, PSP & ALUA
VMkernel has a special layer called the Pluggable
Storage Architecture (PSA)
PSA makes multipathing flexible and allows for 3rd party
multipathing plugins (MPP)
The native multipathing plugin is the VMWare NMP
(native multipathing plugin)
NMP manages sub-plugins with SATP (storage arraytype plugins) and PSP (path selection plugins) being the
defaults
PSA takes effect when VMkernel sends a SCSI
command to access data on a block device
Storage Troubleshooting
Storage Terms - PSA, MPP, NMP, SATP, PSP & ALUA
MPP = multipathing plugin - can coexist with NMP and
can be used on a LUN or per array basis
MPPs job is to discover physical storage devices and
determine claim rules to export a logical device
Claim rules are found in /etc/vmware/esx.conf
Path failover is delegated to SATP
Path load balancing is delegated to PSP
EMC PowerPath is an example of a 3rd party MPP
3rd party MPPs can offer better availability options, better
performance, improved monitoring and better load
balancing
Storage Troubleshooting
Storage Terms - PSA, MPP, NMP, SATP, PSP & ALUA
VMWare offers a SATP for every array they support
SATP options include VMWares default SATP for local
storage, default SATP for generic active/active storage
and default SATP for ALUA storage
ALUA = Asymmetric Logical Unit Access - what midrange arrays do to offer active/active paths (note that
ALUA must be configured on the array)
Once the SATP is chosen, it will monitor health of each
storage path, report changes in the path status, perform
array-specific operations such as activating passive
paths if needed.
24
Storage Troubleshooting
Storage Terms - PSA, MPP, NMP, SATP, PSP & ALUA
Storage Troubleshooting
Storage Terms - PSA, MPP, NMP, SATP, PSP & ALUA
Storage Troubleshooting
Storage Terms - PSA, MPP, NMP, SATP, PSP & ALUA
25
Storage Troubleshooting
Storage Friendly Names, Identifiers and Runtime
Names
Storage Troubleshooting
Identify Log Files Used to Troubleshoot Storage
Storage Troubleshooting
Identifying and setting PSP via Command Line
List all available SATP -> esxcli storage nmp satp list
List all device details -> esxcli storage nmp device list
Set PSP to Round Robin -> esxcli storage nmp device set -device <device> --psp VMW_PSP_RR
Set default PSP for all new LUNs -> esxcli storage nmp satp set s VMW_SATP_DEFAULT_AA -P VMW_PSP_RR
26
Storage Troubleshooting
Identifying Storage Performance Issues with esxtop
esxtop only offers storage latency and throughput for iSCSI and
FC
27
28
Agenda
Troubleshooting vMotion
It is transparent to users.
Troubleshooting vMotion
Host & VM Requirements for vMotion
29
Troubleshooting vMotion
Host & VM Requirements for vMotion
Troubleshooting vMotion
The IP address and subnet mask should match the correct network configuration of
the local LAN.
DNS and routing configuration should be correct for the local LAN.
VLAN settings should match the VLAN configuration of the local LAN.
Both ESXi VMkernel ports should be vMotion-enabled.
Both ESX/ESXi hosts must have a VMkernel port (vmk) on the same LAN.
Test by using vmkping on all hosts
30
31
APP
OS
OS
APP
APP
APP
APP
OS
OS
OS
OS
VMware vSphere
DRS Overview
Initial placement
Dynamic balancing (VMotion)
VMotion requirements
Preferential rules
Rule enforcement: 2 options
32
Hosts
APP
APP
APP
APP
APP
OS
OS
OS
OS
OS
ChassisA
ChassisA
ChassisB
ChassisB
Rules
B ChassisB
Cluster Problems
33
vSphere HA Architecture
FDM
vpxa
datastore
datastore
datastore
FDM
FDM
hostd
vpxa
hostd
vpxa
hostd
vpxd
vCenter Server
= Management network
Troubleshooting HA Failovers
34
Troubleshooting HA Failovers
Cause
vSphere HA master host has not been elected or vCenter Server
is unable to communicate with it.
Multiple master hosts exist and the one with which vCenter
Server is communicating is not responsible for the virtual
machine.
Agent is unable to access the datastore on which the
configuration file of the virtual machine is stored.
Troubleshooting VM Power-On
By default, they are stored in the same folder as the VM disk file
Swap flies are the VMs configured memory less its reserved
memory (2GB conf - 1GB res = 1GB vswp)
On each VM
On the host
On the cluster
If there isnt enough space for the swap file to be created, then
the VM cant be powered on
Troubleshooting VM Power-On
Unable
Unable
Unable
Unable
35
Troubleshooting VM Power-On
Solution
Where applicable, open and connect the VMware Infrastructure (VI) or vSphere
Client to the respective ESX host, VirtualCenter Server, or the vCenter Server
hostname or IP address.
2.
3.
4.
If the virtual machine is unable to power on, an error on the remote console
screen displays with the name of the affected file.
Troubleshooting VM Power-On
To prevent concurrent changes to critical virtual machine files and file systems,
ESX hosts establish locks on these files. In certain circumstances these locks
may not be released when the virtual machine is powered off. The files cannot be
accessed by the servers while locked, and the virtual machine is unable to power
on.
<VMNAME>.vswp
<DISKNAME>-flat.vmdk
<DISKNAME>-<ITERATION>-delta.vmdk
<VMNAME>.vmx
<VMNAME>.vmxf
vmware.log
Troubleshooting VM Power-On
If the touch * command succeeds, then the command successfully made changes to
the date/time stamp and has verified that the file can and has been locked (then
unlocked). At this point, retry the virtual machine power-on operation to see if it
succeeds.
If the touch * command fails with a device or resource busy message, it indicates that
a process is maintaining a lock on the file or directory.
36
Troubleshooting VM Power-On
Start with identifying the server whose VMkernel may be locking the file. To
identify the server:
Report the MAC address of the lock holder by running the command (except on NFS
volume):
# vmkfstools -D /vmfs/volumes/<UUID>/<VMDIR>/<LOCKEDFILE.xxx>
Note: If this process does not reveal the MAC address, or the owner identifier is all zeroes, it is
possible that it is a Service Console-based lock, an NFS lock, or a lock generated by another
system or product that can use or read VMFS file systems. In other circumstances, the file is
locked by a VMkernel child or cartel world and the offending host running the process/world must
be rebooted to clear it.
To check for Service Console-based locks on non-ESXi servers, run this command:
# lsof | grep <name of locked file>
Stop the process ID and its lock using the kill command. From the above example, the
process ID is 3631: # kill 3631
Troubleshooting VM Power-On
In ESXi 5.0, to find the owner of the locked file of a virtual machine, run this
command:
# vmkvsitools lsof | grep <Virtual Machine Name>
You see an output similar to:
11773 vmx 12 46 /vmfs/volumes/<Datastore
Name>/VirtualMachineName/VirtualMachineName-flat.vmdk
You can then run this command to get the PID of the process for the virtual
machine: ps | grep <PID>
You can kill the process with this command: kill -9 <PID>
Questions?
37