
Welcome to Brocade’s Fabric Vision Workshop
January 2019

Phillip Coates – Broadcom BSN Systems Engineer


+61 409 132 350
phillip.coates@broadcom.com
© 2017 BROCADE COMMUNICATIONS SYSTEMS, INC. COMPANY PROPRIETARY INFORMATION
Agenda (approx. 3 hours)

Welcome and introduction
Fabric Vision Overview
Deploy: Building a clean and stable infrastructure
Monitor and Prevent: Using MAPS, Dashboards and FPI
Troubleshoot & Support: Using FV tools for rapid diagnosis and resolution
Intro into the Analytics and Monitoring Platform - AMP
Wrap-up and Q&A



Guidelines for the Day

• It’s your day, so please interact:
– Share experiences
– Ask questions
• If you need to take calls or send emails, you are welcome to leave the room to do so
• It’s not intended as a Product Engineering Steering Group…
– We’ll take constructive feedback, but we’re not planning to focus on functionality requirements or roadmap



Fabric Vision: Overview

[Diagram: Fabric Vision at the intersection of Brocade Hardware/ASICs, Network Advisor, and Brocade Fabric OS]



Brocade Fabric Vision Technology
ADVANCED MONITORING, MANAGEMENT, AND DIAGNOSTICS

Maximizes infrastructure uptime
• Prevents problems from occurring
• Addresses problems before they impact operations, accelerating problem resolution and recovery

Helps dramatically reduce costs
• Speeds time to deploy additional capacity
• Reduces day-to-day network administration
• Eliminates the need for third-party tools

Optimizes application performance
• Minimizes latency and maximizes network throughput

[Diagram: Fabric Vision Technology surrounded by four goals: Avoid Problems, Accelerate Recovery, Optimize Application Performance, Accelerate Deployments]
© 2014 BROCADE COMMUNICATIONS SYSTEMS, INC. CONFIDENTIAL—FOR INTERNAL USE ONLY 6
Brocade Technology Behind Fabric Vision
Hardware, software, and management together

• Network Advisor: Fabric-wide configuration and management tool with visual reporting
• Brocade Hardware/ASICs: Core capabilities integrated into ASIC
• Brocade Fabric OS: Enables joint software/hardware features and capabilities


Extends Brocade Fabric Vision Innovation
Deep network intelligence powered by unique HW and SW integration

• Deploy 20 years of best practices in one click
• Reduce common network problems
• Eliminate 48 percent of maintenance costs

[Diagram: Fabric Vision combines Hardware, Management, and Fabric Software]

8 | Broadcom Proprietary and Confidential. © 2017 Broadcom. All Rights Reserved. The term “Broadcom” refers to Broadcom Limited and/or its subsidiaries.
Brocade Fabric Vision Features

• IO Insight (FOS 8.0): Automatically detect degraded storage IO performance with integrated device latency and IOPS monitoring
• Flow Vision (FOS 7.2): Identifies, monitors, and analyzes the performance of specific flows or frame types
• MAPS (FOS 7.2): Automation that simplifies policy-based monitoring and alerting
• Fabric Performance Impact Monitoring (FOS 7.3): Quickly detects and clearly alerts admins to high levels of latency, helping to identify slow drain devices
• Dashboards (FOS 7.2): Customizable health and performance dashboard, with all critical information on one screen
• VM Insight (FOS 8.1): Seamlessly monitors VM performance throughout a storage fabric with standards-based, end-to-end VM tagging
• COMPASS (FOS 7.4): Simplifies SAN configuration and maintains consistency in operational behavior
• ClearLink Diagnostics (FOS 7.0): Cable and optic diagnostics that simplify the deployment and support of large fabrics
• Forward Error Correction (FOS 7.0): Automatically detects and recovers from bit errors, negating the need for retransmission
• Buffer Credit Recovery (FOS 7.0): Automatically recovers flow control buffer credit loss at the VC level, improving availability
Brocade Fabric Vision Technology
Fabric Vision Features Introduced in FOS 7.2

• MAPS: Automation that simplifies policy-based monitoring and alerting
• ClearLink Diagnostics: Cable and optic diagnostics that simplify the deployment and support of large fabrics
• Dashboards: Customizable health and performance dashboard, with all critical information on one screen
• Flow Vision: Identifies, monitors, and analyzes performance of specific flows or frame types

Fabric Vision Requirements and Recommendations

• Features require Gen5 or Gen6
– ClearLink Diagnostics
– Flow Vision - Flow Learning
– Flow Vision - Flow Generator – SIM source
• Recommended BNA release v12.4.2 or greater
Brocade Fabric Vision Technology
Fabric Vision Features Introduced in FOS 7.3/7.4/8.0.1

• Features require Gen5 or Gen6
– Analytics Monitoring Platform (AMP) (BNA 12.4.4 or greater)
• Features require Gen6
– IO Insight (BNA 14.0.1 or greater)
• BNA release v12.4.2 or greater

• Fabric Performance Impact Monitoring (FOS 7.3): Quickly detects and clearly alerts high levels of latency, to help identify slow drain devices
• Slow Drain Device Quarantine (FOS 7.4): Works in conjunction with FPI to mitigate slow drain devices
• COMPASS (BNA 12.4): Simplifies SAN configuration and maintains consistency in operational behavior
• IO Insight (FOS 8.0.1): Enhanced visibility to monitor the health of storage devices
• Analytics Monitoring Platform (FOS 7.4): Visibility into application performance and slowdown, pinpoint root cause to host, storage, or network


Dashboards
Trending Analytics - Customizable health, status and performance



Dashboards for Unified Storage

Product Status & Traffic
• SAN Inventory
• Status
• Events
• Custom Events
• COMPASS Drifts
• Out of Range Violations
• Unused Ports monitor
• Port Utilization monitor

SAN Port Health
• Port Health Violations
• Bottlenecked Ports
• Top Port Encode Error Out
• Top Port Link Failures
• Top Port Sync Losses
• Top Port C3 Discards RX TO
• Top Port Link Resets

IP/FC Storage Ports Health
• Top Port Errors
• Top Port Runtime Errors
• Port Receive EOF
• Top Port Underflow Errors
• Top Port Too Long Errors
• Top Port Discards
• Top Port Overflow Errors
• Top Port Alignment
Dashboard Point-in-Time Playback
Troubleshooting and Root Cause Analysis

• Select any date and time in the past 30 days
• Dashboard data is refreshed and reflects specified point-in-time
• Playback controls allow user to play, pause, forward and rewind at different speeds


Agenda

Welcome and introduction


Fabric Vision Overview
Deploy: Building a clean and stable infrastructure
Monitor and Prevent: Using MAPS, Dashboards and FPI
Troubleshoot & Support: Using FV tools for rapid diagnosis and
resolution
Intro into the Analytics and Monitoring Platform - AMP
Wrap-up and Q&A



Fabric Vision: Use Case A
Problem: Faulty Media
Customer: Financial Services
Fabric Vision Tools Used: Clearlink Diagnostics, FEC



Fabric Vision Use Case – Financial Services
Customer Problem
ICL link failure, congestion, oversubscription, loss of buffer credits and misbehaving devices during critical technology refresh.

Customer Challenges
• Implementing new SAN infrastructure to replace older SAN hardware. Needed seamless migration.
• Pre-prod testing - ICL links failed ClearLink diagnostic (D-Port traffic link test)
• Physical layer errors detected

Recovery actions
Detection: Run pro-active ClearLink Diagnostics D-Port on ICL reporting test failures. CRCs incrementing as well as Physical Coding Sublayer (PCS) block errors.
Mitigation: Identified failing Optic (QSFP). Replacement resolved the issues.
Best Practice: Test without FEC, Enable FEC wherever possible


Fabric Vision Use Case – Financial Services
Outcomes

Operational outcomes
• Ability to avoid any future physical layer impact to production environment
• Enabled seamless integration of new SAN hardware giving increased confidence in technology
• Ability to pin-point physical layer issues easily
• Avoids project delay
• Critical BAU activities retained by reducing time spent on troubleshooting.

Business outcomes & ROI
• Costly delays to project avoided.
• Mitigated risk & reputational damage
• Avoided impacts to other projects by maximising usage of change windows.
• Minimized any impact from project dependencies.


Fabric Vision: D_port Diagnostics



Brocade ClearLink Diagnostics
Saves time and money

• Use Brocade D_Port to validate configurations prior to deployment or under test
• Performs complete optical, electrical, and link saturation testing to ensure reliable connections
– Ensures optical and signal integrity for Gen 5/6 Fibre Channel optics and cables
– Measures latency and distance across ISLs to ensure predictable application performance
– Provides link power loss information
• Avoids downstream problems
• Reduces fabric deployment time


ClearLink Diagnostics
D-Port
• Consists of five steps:
‒ Electrical loopback test
‒ Optical loopback test
‒ Link traffic test
‒ Link latency and distance measurement
‒ Power measurement



D_Port Modes
• Static – Configure the port explicitly; the D_Port remains until you remove the configuration. User initiated.
• Dynamic – The D_Port is initiated by an external request from the remote end. Default.
• On-Demand – The port becomes a D_Port due to an internal request within the local switch, such as the slotpoweroff, slotpoweron, slot insert, portcfgPersistentDisable and portcfgPersistentEnable commands. Not default.

Beginning with Fabric OS 8.0.1, the following test options are available only for optical loopback tests: -framesize, -nframes, -pattern, -payload, -time. Both ends must be running the same release (Fabric OS 8.0.1 or later) or the test reverts to legacy behavior.


Example Output



ClearLink Demo video
2.15 minutes



Fabric Vision: FEC



Forward Error Correction (FEC)
Enhances transmission reliability and thus performance

• Enables ASIC to recover bit errors in a 10/16/32 Gbps data stream
– Can correct up to 11 error bits in every 2112-bit transmission
• Enhances reliability of transmission and thus performance
• Enabled by default on back-end links of the ASIC-based blades in the Brocade DCX 8510 and X6
• Mandated in Fibre Channel standard to ensure reliability for all high-speed 32 Gbps links

[Diagram: the Tx port applies FEC encoding, bit errors occur on the link, and the Rx port applies FEC decoding and correction]
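
To put the FEC figures above in proportion, the following Python sketch works through the arithmetic. It is only an illustration: the function names are hypothetical, and it treats the slide's "11 error bits per 2112-bit transmission" as a simple per-block error budget, which ignores how errors are distributed within a block.

```python
# Figures quoted on the slide: up to 11 error bits correctable per 2112-bit block.
FEC_BLOCK_BITS = 2112
FEC_CORRECTABLE_BITS = 11


def correctable_fraction(block_bits: int = FEC_BLOCK_BITS,
                         correctable_bits: int = FEC_CORRECTABLE_BITS) -> float:
    """Return the share of a block's bits that FEC can repair."""
    if block_bits <= 0:
        raise ValueError("block_bits must be positive")
    return correctable_bits / block_bits


def block_is_recoverable(error_bits: int,
                         correctable_bits: int = FEC_CORRECTABLE_BITS) -> bool:
    """Simplified check: a block is recoverable if its error count fits the budget."""
    if error_bits < 0:
        raise ValueError("error_bits cannot be negative")
    return error_bits <= correctable_bits


if __name__ == "__main__":
    # Roughly half a percent of each block can be corrected without retransmission.
    print(f"Correctable share per block: {correctable_fraction():.2%}")
    print(block_is_recoverable(11), block_is_recoverable(12))
```

Under this simplified model, a block with 11 or fewer bit errors is repaired at the receiver, while a block with more errors would still surface as a link error.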



Customer Problem
ICL link failure, congestion, oversubscription, loss of buffer credits and misbehaving devices
during critical technology refresh.

Use Case A: Deep Dive



Fabric Vision: Use Case
– Problem description

• ICL Link Failure


• Implementing new SAN infrastructure
• Physical layer errors detected



Fabric Vision: Use Case
Detection: Test Results Switch 1 (with FEC Disabled on Test)



Fabric Vision: Use Case
Detection: Port Stats Switch 1 (with FEC Disabled on Test)



Fabric Vision: Use Case
Detection: Test Results Switch 2 (with FEC Disabled on Test)



Fabric Vision: Use Case
Detection: Port Stats Switch 2 (with FEC Disabled on Test)



Fabric Vision: Use Case
Mitigation: Test Results Switch 1 (with FEC Optionally Enabled)



Fabric Vision: Use Case
Detection: Port Stats Switch 1 (with FEC Optionally Enabled on Test)



Fabric Vision: Use Case
Detection: Test Results Switch 2 (with FEC Optionally Enabled)



Fabric Vision: Use Case
Detection: Port Stats Switch 2 (with FEC Optionally Enabled on Test)



Fabric Vision: Use Case
Summary of Forward Error Correction (FEC)

• Enabled by default on back-end links of the Condor3/4 based blades in the Brocade Gen5/6 chassis
• Enables Condor3/4 to recover bit errors in a 10/16/32 Gb data stream
• Enhances reliability of transmission and thus performance
• Can solve majority of physical problems in SAN
• Fewer aborts, timeouts, slow-drain devices due to physical problems

Condor 3 is Broadcom’s Gen5 (16G FC) ASIC
Condor 4 is Broadcom’s Gen6 (32G FC) ASIC


Fabric Vision: Use Case
Mitigation

• The D-Port Test itself identified which side the problem was on since the
test results on Switch 1 failed due to ‘remote’ port, and Switch 2 failed
due to ‘local’ port.
• Always utilize Clearlink diagnostics with FEC disabled (default)
• QSFP optic replaced on Switch 2
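
The reasoning in the first bullet can be sketched as a small Python helper. This is an illustrative sketch only: the function name and the 'local'/'remote' string inputs are hypothetical stand-ins for what an operator reads off the D-Port test results on each switch, not an actual tool output format.

```python
from typing import Optional

VALID_SIDES = ("local", "remote")


def locate_faulty_side(switch1_blames: str, switch2_blames: str) -> Optional[str]:
    """Combine the two D-Port results to decide which end of the link is suspect.

    Each argument says which port that switch's own test result blamed:
    'local' (its own port) or 'remote' (the port at the far end).
    Returns 'Switch 1', 'Switch 2', or None when the two results disagree.
    """
    for side in (switch1_blames, switch2_blames):
        if side not in VALID_SIDES:
            raise ValueError(f"expected 'local' or 'remote', got {side!r}")

    # Switch 1 blames the far end and Switch 2 blames itself: both point at Switch 2.
    if switch1_blames == "remote" and switch2_blames == "local":
        return "Switch 2"
    # The mirror image: both results point at Switch 1.
    if switch1_blames == "local" and switch2_blames == "remote":
        return "Switch 1"
    # Both blame themselves, or both blame each other: no single side is implicated.
    return None


if __name__ == "__main__":
    # The combination described in this use case.
    print(locate_faulty_side("remote", "local"))
```

In the use case above, Switch 1 failed on the 'remote' port and Switch 2 on the 'local' port, so both results agree on the Switch 2 end, which is where the QSFP was replaced.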



Fabric Vision: Flow Generator



Flow Generator

• Configure a 16/32 Gbps Fibre Channel-capable port as a simulated device that can transmit frames at a full 16/32 Gbps line rate.
• Emulate a SAN without actually having any hosts or targets or SAN-testers.
• Pre-test the entire SAN fabric at the full line rate, including optics and cables on ISLs as well as internal connections within a switch.


Pre-defined Flow Generator



Flow Monitor



Agenda

Welcome and introduction


Fabric Vision Overview
Deploy: Building a clean and stable infrastructure
Monitor and Prevent: Using MAPS, Dashboards and FPI
Troubleshoot & Support: Using FV tools for rapid diagnosis and resolution
Intro into the Analytics and Monitoring Platform - AMP
Wrap-up and Q&A



Fabric Vision: Use Case B
Problem: Fabric Bottleneck
Customer: Government
Fabric Vision Tools Used: MAPS, FPI, BNA Dashboard



Fabric Vision Use Case – Government
Customer Problem
Poor performance on critical production applications

Customer Challenges
• Customer reporting Application Performance issues
• OEM noticed BNA bottleneck icon being displayed
• Root cause unclear - no Fabric Performance Impact messages displayed in Raslog
• Lack of tooling and visibility across broader environment.

Recovery actions
Detection:
• Bottleneck icon showing on BNA SAN topology
• FPI and Port Health Out of Range violations reported on Dashboard
• MAPS/FPI generating traps into monitoring tool: IO_PERF_IMPACT, IO_FRAME_LOSS, C3TXTO, LR
Mitigation: Quickly identify misbehaving slow-drain device. Isolated from fabric to resolve issue.


Fabric Vision Use Case – Government
Outcomes

Operational outcomes
• Rapid detection & isolation of slow-drain issues.
• Reduction in impact. Propagation of performance degradation to other applications avoided.
• PS value add - limited the man-hours required to investigate issue

Business outcomes & ROI
• Avoided impact to users and potential public-facing issues.
• Reduced potential effort costs in identifying root cause and resolution actions.


Fabric Vision: MAPS



Monitoring and Alerting Policy Suite (MAPS)
Threshold-based monitoring and alerting made simple

• Pre-defined Groups
• Pre-defined Policies, Rules, and Actions
• Intuitive Reporting

Based on Brocade’s 20 years of experience and best practices in one click


Monitoring and Alerting Policy Suite (MAPS)
FABRIC-WIDE THRESHOLD-BASED MONITORING AND ALERTING MADE SIMPLE

• Pre-defined policy not quite right for your environment?
• Simply use a pre-defined policy and tweak the rules to fit your needs
• MAPS allows you to customize groups, rules, and actions to meet your specific requirements, leveraging Brocade automation to dramatically accelerate the process

[Diagram: a Default Policy (Default Groups, Default Rules & Actions) alongside a Custom Policy (Custom Groups, Custom Rules & Actions)]


Easy Migration from Fabric Watch to MAPS

• Existing Fabric Watch rules and policies can be seamlessly migrated to MAPS
• No reconfiguration required!
• Automatically creates 3 new policies:
1. All the monitoring rules based on the default thresholds configured in Fabric Watch
2. All the monitoring rules based on the custom thresholds configured in Fabric Watch
3. All of the monitoring rules based on the active thresholds in Fabric Watch at the time of migration

[Diagram: Fabric Watch migrating to MAPS]


Brocade FOS MAPS Integration with Other Functionalities

• Further simplify
– Pre-defined groups, rules, policies
– Automated dynamic discovery and monitoring
– Single interface to learn and control
• Enhance detection
– Pre-validated settings
• Increase flexibility
– Threshold-based monitoring and alerting
• Promote visibility via dashboard (CLI or Brocade Network Advisor)
– New Fabric Performance Impact category for bottleneck condition

[Diagram: MAPS integrates with Flow Monitor (thresholds and alerts for automatically discovered (learning) flows), Bottleneck Detection (test results determined by thresholds), and Fabric Performance Impact (monitoring latency bottleneck)]
Monitoring and Alerting Policy Suite (MAPS)
Fabric-wide threshold-based monitoring and alerting made simple

Pre-defined Policies
• Each policy is based on more than 300 rules with unique actions that have been vetted by Brocade experts
• Takes the guesswork out of defining threshold-based rules and appropriate actions
• Integration with Brocade Network Advisor automates applying the policies across the fabric or multiple fabrics

Aggressive Policy
• Contains rules and actions with very strict thresholds
• When a pristine network is needed

Moderate Policy
• Contains rules and actions with threshold values between the aggressive and conservative policies

Conservative Policy
• Contains rules and actions with more lenient thresholds
• When environments are resilient and can accommodate errors


Monitoring and Alerting Policy Suite (MAPS)
Fabric-wide threshold-based monitoring and alerting made simple

Pre-defined Categories
• 300+ rules grouped by type
• 10 pre-defined Categories
• Every rule is customizable
• Reduces errors and manual effort

Categories shown: Ports, Switch Status, Fabrics, FRU, Security, Resource, FCIP, Traffic / Performance, Fabric Performance Impact, Backend Ports


Monitoring and Alerting Policy Suite (MAPS)
Fabric-wide threshold-based monitoring and alerting made simple

Pre-defined Groups
• Enables a group of similar components to be monitored as a single entity
• More than 30 pre-defined groups available
• Ports get automatically assigned to the right group
• Reduces errors and manual effort

Groups shown: All Host Ports, All Target Ports, All E_Ports, All SFPs, All Fans, All Power Supplies, All FCIP Circuits, and more…
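
The idea of ports being assigned to the right group automatically can be pictured with a short Python sketch. It is a conceptual illustration, not how Fabric OS implements grouping: the group labels are taken from the slide, while the role keys ("host", "target", "e_port"), the port identifiers, and the function name are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List

# Group labels as shown on the slide; the role keys on the left are hypothetical.
GROUP_FOR_ROLE = {
    "host": "All Host Ports",
    "target": "All Target Ports",
    "e_port": "All E_Ports",
}


def assign_ports_to_groups(port_roles: Dict[str, str]) -> Dict[str, List[str]]:
    """Place each port into the pre-defined group matching its detected role.

    Ports whose role has no matching group are left unassigned.
    """
    groups: Dict[str, List[str]] = defaultdict(list)
    for port, role in sorted(port_roles.items()):
        group = GROUP_FOR_ROLE.get(role)
        if group is not None:
            groups[group].append(port)
    return dict(groups)


if __name__ == "__main__":
    example = {"1/0": "host", "1/1": "target", "1/2": "e_port", "1/3": "host"}
    for group, ports in assign_ports_to_groups(example).items():
        print(group, ports)
```

Because membership follows the port's role, a rule written once against a group covers every matching port without the administrator having to list ports by hand.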



MAPS Components

[Diagram: MAPS components: Condition, Actions, Rule, Groups, Categories, and Policy (Moderate Policy, Conservative Policy, Custom Policy)]


MAPS Categories

Port Health
Monitors port statistics and takes action based on the configured thresholds and actions

Examples (Element: Definition)
• CRC with good EOF (crc g_eof) markers: The number of times an invalid cyclic redundancy check error occurs on a port or a frame that computes to an invalid CRC. Invalid CRCs can represent noise on the network. Such frames are recoverable by retransmission. Invalid CRCs can indicate a potential hardware problem
• Class 3 timeouts (C3TXTO): The number of Class 3 discard frames because of timeouts


MAPS Categories

Switch Status Policy (SSP)
Enables you to monitor the health of the switch by defining the number of types of errors that transition the overall switch state into a state that is not healthy

Examples (Element: Definition)
• Power Supplies (BAD_PWR): Power supply thresholds detect absent or failed power supplies, and power supplies that are not in the correct slot for redundancy
• Core Blade (DOWN_CORE): Faulty core blades (applies to modular switches only)


MAPS Categories

Fabric State Change
Groups areas of potential problems arising between devices, including measures such as zone changes, fabric segmentation, E_Port down, fabric reconfiguration, domain ID changes, and fabric logins

Examples (Element: Definition)
• Zone changes (ZONE_CHG): Tracks the number of zone changes. Because zoning is a security provision, frequent zone changes may indicate a security breach or weakness. Zone change messages occur whenever there is a change in zone configurations
• Fabric reconfigurations (FAB_CFG): Tracks the number of fabric reconfigurations. These occur when the following events happen:
– Two fabrics with the same domain ID are connected
– Two fabrics are joined
– An E_Port or VE_Port goes offline
– A principal link segments from the fabric


MAPS Categories

• FRU Health: Enables you to define rules for field-replaceable units (FRUs), including SFP transceivers, power supplies, and flash memory
• Security Violations: Monitors different security violations on the switch and takes action based on the configured thresholds and their actions (e.g. invalid logins, certificate expiration)
• Switch Resources: System resource monitoring enables you to monitor your system’s RAM, flash, memory, and CPU


MAPS Categories

• FCIP Health: Enables you to define rules for field-replaceable units (FRUs), including SFP transceivers, power supplies, and flash memory
• Traffic / Flow Performance: Monitor the metrics of flows that were created in Flow Vision and for which Flow Monitor has been enabled
• Fabric Performance Impact (FPI): Monitors the current condition of the latency seen on E_Ports and F_Ports over different time windows and uses that to determine the performance impact to the fabric and network
• Backend Health: Enables monitoring the health of the back-end switch ports for CRC and Link reset error rates, invalid transmission words, BAD_OS, and frame length (either too long or truncated)


Accessing MAPS Configurations

• There are three primary methods for accessing the MAPS dialogs in Network Advisor:
• (1) Right-clicking on an object and selecting Fabric Vision > MAPS
• (2) In the main menu, Monitor > Fabric Vision > MAPS
• (3) Select an individual switch and then select the MAPS quick launch icon


MAPS Configuration

• Green dot indicates active policy
• Lists all switches which have MAPS enabled and the policies defined on each switch, including the default policies
• Launch points for violations, enabling MAPS actions at switch, and configuring email settings for MAPS Policy & Group configuration


MAPS Global Policy Actions

• Opened by clicking the Actions button in the MAPS Configuration window
• Used to enable/disable MAPS global actions
• Click next to specific action to enable
• Can be launched from SAN, fabric, or switch level
• If launched from SAN or fabric, changes will be applied to all MAPS-enabled switches in the SAN or fabric


E-mail Setup

• Opened by clicking the E-Mail Setup button on the MAPS Configuration dialog
• Used to modify email settings for MAPS
• Can be launched from SAN, fabric, or switch level
• If launched from SAN or fabric, changes will be applied to all MAPS-enabled switches in the SAN or fabric
• Up to five email addresses may be entered in a semi-colon (‘;’) separated list
• Relay Host and Domain Name are chassis level settings
• These settings will be applied to all logical switches
• Email address setting is local to the logical switch
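
The address-list rule above (at most five addresses, separated by semicolons) can be sketched as a small Python validator. This is an illustrative helper, not part of Network Advisor: the function name is hypothetical and the address check is deliberately minimal.

```python
from typing import List

MAX_ADDRESSES = 5  # limit stated on the slide


def parse_maps_email_list(raw: str) -> List[str]:
    """Split a semicolon-separated address list and enforce the five-address limit."""
    # Ignore empty segments so a trailing ';' or doubled ';;' is tolerated.
    addresses = [part.strip() for part in raw.split(";") if part.strip()]

    if len(addresses) > MAX_ADDRESSES:
        raise ValueError(
            f"{len(addresses)} addresses given; at most {MAX_ADDRESSES} are allowed"
        )

    for address in addresses:
        # Minimal sanity check only: exactly one '@' with text on both sides.
        local, sep, domain = address.partition("@")
        if not (sep and local and domain) or "@" in domain:
            raise ValueError(f"not a plausible email address: {address!r}")

    return addresses


if __name__ == "__main__":
    print(parse_maps_email_list("san-admin@example.com; noc@example.com"))
```

Checking the list before entering it avoids a rejected dialog when distributing the setting across many logical switches.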
Importing and Exporting Policies

• Done by selecting a policy and clicking the Import / Export button
• Use to share a policy with other users or to copy a policy from one instance of Network Advisor to another
• Launches window when importing on the local file system


Distributing Policies

• Performed by clicking the Distribute button
• Use to copy a policy to every switch in a fabric or all fabrics discovered by Network Advisor
– Policy can also be activated after distribution within the same Network Advisor operation


Viewing Policies

• Opened by selecting a policy and clicking the View button
• Provides read-only view of a policy
• Can be used for policy/rule comparison, at-a-glance viewing of rule configurations or action trigger settings


Policy Configuration Overview

• Add a new policy
– To make a copy of an existing policy, select the policy in the tree, and then click Add button
• This clones the existing policy
– To create a new policy, select the switch in the tree, and then click Add button (next slide)
• This creates an empty policy to which the user can configure new rules


Using the Add Policy Dialog

• Add Policy dialog can be used to make a copy of an existing policy (clone) or to create a new policy
• Tab through the Monitoring Categories and create / modify existing rules


Configuring Rules

• Network Advisor will auto-generate a rule name for new/edited rules
• If the user prefers, they can give a custom name to the rule
• For each category tab, the Measure drop-down lists the measures applicable for that category
• The threshold setting is a combination of an operand (<, <=, =, >=, >) and a value
• Each threshold measure is measured in different units, so a unit label is added after the value selection to help the user identify what is being measured
• The Actions check boxes indicate which actions are supported for the chosen Measure
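
The rule anatomy described above (a measure, an operand, a value with a unit, and a set of actions) can be sketched in Python as follows. This is a conceptual model only, not the MAPS implementation: the class, the rule name, the threshold of 10, and the unit label are hypothetical, while the operand set comes from the slide.

```python
import operator
from dataclasses import dataclass
from typing import Tuple

# The five operands listed on the slide, mapped to Python comparisons.
OPERANDS = {
    "<": operator.lt,
    "<=": operator.le,
    "=": operator.eq,
    ">=": operator.ge,
    ">": operator.gt,
}


@dataclass(frozen=True)
class Rule:
    name: str                 # auto-generated or custom rule name
    measure: str              # what is being monitored
    operand: str              # one of the keys in OPERANDS
    value: float              # threshold value
    unit: str                 # unit label shown next to the value
    actions: Tuple[str, ...]  # actions triggered when the rule is violated

    def __post_init__(self) -> None:
        if self.operand not in OPERANDS:
            raise ValueError(f"unsupported operand: {self.operand!r}")

    def is_violated(self, observed: float) -> bool:
        """Return True when the observed value satisfies the threshold condition."""
        return OPERANDS[self.operand](observed, self.value)


if __name__ == "__main__":
    # Hypothetical rule: the name, threshold, and unit are made up for illustration.
    rule = Rule("example_crc_rule", "CRC", ">", 10, "CRCs/min", ("RASLOG", "EMAIL"))
    for observed in (5, 10, 11):
        print(observed, rule.is_violated(observed))
```

Keeping the operand as data rather than code mirrors the dialog, where the same rule shape is reused across every measure and only the operand, value, and unit change.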



Viewing Rules and Groups
• (1) The Groups/Rules tree lists the groups defined on the switch and the rules associated with each group
• (2) Custom Groups contains all user-defined groups
• (3) Under Custom Groups: Create a new custom group, modify an existing custom group or delete a custom group


Monitoring and Alerting Policy Suite (MAPS)
MAPS Group Configuration

• Add Port Group dialog can be launched from the MAPS Configuration dialog by clicking the Manage button
• This dialog can be launched in fabric or switch context
• If launched in fabric context, groups are shown for all MAPS-enabled switches in an aggregated view
• Groups can be created, edited, or deleted using this dialog



Violations Dialog

• Lists details of each MAPS violation including the object affected, the rule condition, actions triggered, and recommended action to resolve the violation
• Launch points:
– Dashboard widgets
– Fabric/switch/FC port right-click will launch in context of selected object
– MAPS Configuration dialog


Monitoring and Alerting Policy Suite (MAPS)
Compare MAPS configurations

• Used to compare two policies
• To compare policies, from MAPS configuration dialog:
– (1) Select two policies
• Policy is chosen when switch row is selected
– (2) Select the Compare button


MAPS Demo video
5 Minutes



Brocade MAPS Quick Start Guide

Introduction
The Brocade Monitoring and Alerting Policy Suite (MAPS) is an easy-to-use, policy-based monitoring and alerting solution for proactively monitoring the health and performance of the storage infrastructure. This quick start guide provides step-by-step instructions for easily deploying MAPS in Brocade SAN environments.

Prerequisites (If the Fabric OS, Brocade Network Advisor, or switch do not meet the requirements below, upgrade the firmware, software, and/or install the required licenses before proceeding.)
1. Confirm that the product is a Brocade SAN switch model with minimum 8 Gbps bandwidth capability and Fabric OS v7.3 or later (If you use Brocade Network Advisor, you must have version 12.3 or later)
2. Check if you have a Brocade Fabric Vision technology (or Fabric Watch and Advanced Performance Monitoring) license:
switch:admin> licenseshow
H7L73ETXZMFfBQJrKDFNfBWBrABA3N7J7K: Fabric Vision technology license
(Refer to Ref.2 Section: MAPS License Requirement)
3. Determine which one of the pre-defined MAPS policies to use: Aggressive, Moderate, or Conservative
(Refer to Ref.1 Section: Default MAPS Policy Recommendations)

Enabling MAPS in Fabric OS on One Switch
1. Enable MAPS using the mapspolicy FOS command with a selected default policy. Example: Enabling MAPS with the moderate policy:
• switch:admin> mapspolicy --enable dflt_moderate_policy
2. Verify that the MAPS policy has been enabled as an active policy using the mapspolicy command.
• switch:admin> mapspolicy --show -summary
• (Active Policy is dflt_moderate_policy)
3. Select and enable the desired MAPS actions for violation alerts on the monitored switch using the FOS command mapsconfig.
• switch:admin> mapsconfig --actions RASLOG,SNMP,EMAIL
(Refer to Ref.1 Section: Activating MAPS Actions)

OR

Enabling MAPS in BNA on One or More Switches
1. Select the SAN tab and select Monitor > Fabric Vision > MAPS > Enable to bring up the Enable MAPS dialog box.
2. Select and click the right arrow button to move one or more selected devices to the Selected Switches list.
3. Click OK on the Enable MAPS dialog box and click Yes on the confirmation.
4. Select the SAN tab and select Monitor > Fabric Vision > MAPS > Configure to bring up MAPS Configuration dialog box.
5. Select the desired switch with the activated MAPS policy in the MAPS Configuration dialog box and click Actions to bring up the MAPS Policy Actions dialog box.
6. In the current MAPS Policy Actions dialog box, check the box of each action you would like to enable and click OK.
(Refer to Ref.1 Section: Using MAPS with BNA)

References:
1. SAN Fabric Resiliency and Administration Best Practice
2. Brocade Monitoring and Alerting Policy Suite Administration Guide

© 2017 BROCADE COMMUNICATIONS SYSTEMS, INC. 02/17 GA-QSG-6510-00
Fabric Vision: FPI/SDDQ*

*Fabric Performance Impact Monitoring, Slow Drain Device Quarantine


Fabric Performance Impact (FPI)
Simplified, advanced detection of any impacts to the SAN

• Fabric Performance Impact (FPI) Monitoring
– Monitors latency on E_Ports and F_Ports over different time windows and determines the performance impact to the fabric
– Detects both brief severe-latency conditions and sustained abnormal latency conditions
– Monitors hardware counters at sub-microsecond levels
• Fabric Performance Impact monitors:
   • Fabric Latency/Timeouts
   • Initiator/Target Flow Ratios
   • Port Performance
      – Receive/Transmit usage percentages
      – Port utilization percentages


How long does a frame reside in the switch?
Simplified, advanced detection of any impacts to the SAN

• IO_PERF_IMPACT
– When the Transient Queue Latency (TXQ) is greater than the high threshold (10ms)
– Calculated based on buffer credit zero and transient queue latency counters
• IO_FRAME_LOSS
– When the Transient Queue Latency (TXQ) is greater than the high threshold (80ms)
– Calculated based on TXQ and C3 Tx timeout counters
• IO_LATENCY_CLEAR
– When latencies drop to normal levels, the port state changes to IO_LATENCY_CLEAR

FOS 7.4, CONDOR 3

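The threshold logic on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function and enum names are hypothetical, and the real FPI monitor also weighs buffer-credit-zero and C3 timeout counters over several time windows, which this sketch ignores. Only the 10 ms and 80 ms thresholds come from the slide.

```python
from enum import Enum

# Thresholds quoted on the slide, in milliseconds of TXQ latency.
PERF_IMPACT_MS = 10
FRAME_LOSS_MS = 80


class FpiState(Enum):
    IO_LATENCY_CLEAR = "IO_LATENCY_CLEAR"
    IO_PERF_IMPACT = "IO_PERF_IMPACT"
    IO_FRAME_LOSS = "IO_FRAME_LOSS"


def classify_fpi(txq_latency_ms: float) -> FpiState:
    """Map one TXQ latency sample to an FPI state (hypothetical helper).

    Assumption: a single latency value decides the state; the switch
    itself combines several counters and time windows.
    """
    if txq_latency_ms > FRAME_LOSS_MS:
        return FpiState.IO_FRAME_LOSS
    if txq_latency_ms > PERF_IMPACT_MS:
        return FpiState.IO_PERF_IMPACT
    return FpiState.IO_LATENCY_CLEAR


if __name__ == "__main__":
    for sample_ms in (3, 47, 120):
        print(sample_ms, "ms ->", classify_fpi(sample_ms).value)
```

With these thresholds a 47 ms frame delay falls into IO_PERF_IMPACT, and anything above 80 ms is treated as IO_FRAME_LOSS.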
Slow Drain Device Quarantine (SDDQ)
Simplified, advanced detection and mitigation of slow drain devices in a SAN

• Fabric Performance Impact (FPI) Actions:
• Slow Drain Device Quarantine (SDDQ)
– Flows destined to the SDD are reassigned to low-priority VCs
– Buffer credits free up for regular flows sharing the same path
– Triggered by IO_PERF_IMPACT or IO_FRAME_LOSS
– FOS 8.1 introduces an un-quarantine feature
• Port Toggle
– Temporarily disables and re-enables the port
– The port can stay disabled for 2 to 3600 seconds
FOS 7.4

FPI/SDDQ Demo video
3.40 mins



Customer Problem
Poor performance on critical production applications

Use Case B: Deep Dive



Fabric Vision: Use Case
Problem Description

• Customer reporting FPI issues
• No FPI messages in RASLog
• OEM advising customer of issue with BNA bottleneck icon


Fabric Vision: Use Case
Detection

• BNA Topology Map showing Bottleneck icon
• Out of range violations widget on BNA Dashboard (FPI)
• Port Health violations widget on BNA Dashboard (C3 Tx Timeouts)




Fabric Vision: Use Case
Detection
Errdump Messages:
2017/01/05-18:45:31, [C3-1014], 656, SLOT 6 | CHASSIS, WARNING, ED_DCX8510_8B, Link Reset on Port S2,P159(35) vc_no=0 crd(s)lost=5 auto trigger.
2017/01/05-19:38:57, [C3-1014], 657, SLOT 6 | CHASSIS, WARNING, ED_DCX8510_8B, Link Reset on Port S2,P159(35) vc_no=0 crd(s)lost=5 auto trigger.
2017/01/05-21:11:17, [C3-1014], 658, SLOT 6 | CHASSIS, WARNING, ED_DCX8510_8B, Link Reset on Port S2,P159(35) vc_no=0 crd(s)lost=5 auto trigger.

Errdump Messages (not visible because the RASLog action was not set):
2016/12/13-06:04:07, [MAPS-1003], 213, SLOT 6 | FID 1, WARNING, sansw0303a_vf01, F-Port 2/31, Condition=ALL_PORTS(DEV_LATENCY_IMPACT==IO_PERF_IMPACT), Current Value:[DEV_LATENCY_IMPACT,IO_PERF_IMPACT, 47 ms Frame Delay], RuleName=defALL_PORTS_IO_PERF_IMPACT, Dashboard Category=Fabric Performance Impact.

porterrshow:
         frames        enc    crc    crc    too    too    bad    enc   disc   link   loss   loss   frjt   fbsy   c3timeout      pcs
          tx     rx     in    err  g_eof   shrt   long    eof    out     c3   fail   sync    sig                     tx     rx    err
159:  672.3m   2.0g      0      0      0      0      0      0      0 643.9k      0      0      0      0      0 643.9k      7      0

Framelog:


Fabric Vision: Use Case
Mitigation

• Quickly identify the misbehaving device and isolate it from the fabric to resolve the issue. Server (SIT_NS_DB02) on port 2/31 was taken off the SAN fabric

• Alternative Mitigation:
– Manually place in low QoS zone
– SDDQ
– Port Toggle
– Fence/Decom


Agenda

Welcome and introduction
Fabric Vision Overview
Deploy: Building a clean and stable infrastructure
Monitor and Prevent: Using MAPS, Dashboards and FPI
Troubleshoot & Support: Using FV tools for rapid diagnosis and resolution
Intro into the Analytics and Monitoring Platform - AMP
Wrap-up and Q&A


Fabric Vision: Use Case C
Problem: Slow Drain Device (with SDDQ automation)
Customer: University
Fabric Vision Tools Used: MAPS, FPI, SDDQ



Fabric Vision Use Case – University
Customer Problem
Quarantined slow-drain device identified during annual SAN Health Check.

Customer Challenges
• Quarantined slow-drain device causing performance issues on SAN.
• Previously unknown to the customer – identified from SAN Health check.
• No plan in place to reduce or remove impacts.

Recovery actions
Detection:
• MAPS identified Class 3 TX Discards on ESX host
• Severe IO performance impact identified by FPI
• BNA Out of Range Violations for Port Health and Fabric Performance Impact
Mitigation: MAPS Policy SDDQ action automated the process of moving the slow draining device to a low quality of service


Fabric Vision Use Case – University
Outcomes

Operational outcomes
• Prevented SAN performance from degrading further due to existence of slow draining device.
• Utilization of fully automated proactive monitoring and reactive actions
• Increased availability of SAN Admins' time
• PS value-add:
– Health Check
– Best-practices
– Defining policies

Business outcomes & ROI
• Significantly reduced impact to users and public escalation risk.
• Introduction of automated quarantine capability reduced potentially many hours of manual remediation.


Customer Problem
Quarantined slow-drain device identified during annual SAN Health Check.

Use Case C: Deep Dive



Fabric Vision: Use Case
Problem Description

• C3 Discards due to timeout on ESX Host (Qlogic 8G HBA)
• Severe Latency (IO_Perf_Impact/IO_Frame_Loss) in fabric


Fabric Vision: Use Case
Customer Setup

• MAPS Global Action configuration requires SDDQ (mapsconfig --actions):
mapsconfig --actions raslog,decom,fence,snmp,email,sw_critical,sw_marginal,sfp_marginal,sddq

Extract from configshow:

Enable MAPS Policy and FPI:
mapspolicy --enable dflt_conservative_policy
mapsconfig --enableFPImon (n/a Gen6)

Enable QoS on E-Ports:
portcfgqos --enable 1/15-18


Fabric Vision: Use Case
Detection

• BNA Out of Range Violations dashboard widget (panels shown: Latency, C3 Tx Timeouts)


Fabric Vision: Use Case
Detection

• Extract from mapsdb --show:



Fabric Vision: Use Case
Detection

• Extract from mapsdb --show (Rules Affecting Health section):



Fabric Vision: Use Case
Detection

• Extract from errdump:

2016/12/13-06:04:07, [MAPS-1003], 213, SLOT 6 | FID 1, WARNING, sansw0303a_vf01, F-Port 2/19, Condition=ALL_PORTS(DEV_LATENCY_IMPACT==IO_PERF_IMPACT), Current Value:[DEV_LATENCY_IMPACT,IO_PERF_IMPACT, 47 ms Frame Delay], RuleName=defALL_PORTS_IO_PERF_IMPACT, Dashboard Category=Fabric Performance Impact.

2016/12/13-06:04:07, [MAPS-1022], 214, SLOT 6 | FID 1, INFO, sansw0303a_vf01, Port 2/19 has been marked as Slow Drain Device.

2016/12/13-06:05:07, [MAPS-1004], 215, SLOT 6 | FID 1, INFO, sansw0303a_vf01, F-Port 2/19, Condition=ALL_PORTS(DEV_LATENCY_IMPACT==IO_LATENCY_CLEAR), Current Value:[DEV_LATENCY_IMPACT,IO_LATENCY_CLEAR], RuleName=defALL_PORTS_IO_LATENCY_CLEAR, Dashboard Category=Fabric Performance Impact.
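
When errdump output is collected off-switch (for example from a supportsave), messages like the ones in this extract can be filtered with a short script. The following Python sketch is an illustration only: the function names are hypothetical, and it assumes each RASLog message sits on a single line in the layout shown in the extract.

```python
import re
from datetime import datetime

# Layout assumed from the extract: timestamp, [ID], sequence, context, severity, switch, text
RASLOG_RE = re.compile(
    r"^(?P<ts>\d{4}/\d{2}/\d{2}-\d{2}:\d{2}:\d{2}), "
    r"\[(?P<msg_id>[A-Z0-9]+-\d+)\], (?P<seq>\d+), (?P<context>[^,]+), "
    r"(?P<severity>[A-Z]+), (?P<switch>[^,]+), (?P<text>.*)$"
)
SDDQ_RE = re.compile(r"Port (\S+) has been marked as Slow Drain Device")


def parse_raslog(line: str):
    """Split one errdump line into fields; return None if it does not match."""
    match = RASLOG_RE.match(line.strip())
    if match is None:
        return None
    record = match.groupdict()
    record["ts"] = datetime.strptime(record["ts"], "%Y/%m/%d-%H:%M:%S")
    record["seq"] = int(record["seq"])
    return record


def quarantined_ports(lines):
    """Return the ports named in MAPS-1022 (slow drain device) messages."""
    ports = []
    for line in lines:
        record = parse_raslog(line)
        if record and record["msg_id"] == "MAPS-1022":
            hit = SDDQ_RE.search(record["text"])
            if hit:
                ports.append(hit.group(1))
    return ports
```

Calling `quarantined_ports()` on the three messages in the extract would pick out port 2/19 from the MAPS-1022 entry.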

• Extract from nsshow:



Fabric Vision: Use Case
Detection

• Port will now utilize VC 8 and 9 (low QoS VCs); it originally used VC 5 (medium QoS)


Fabric Vision: Use Case
Detection

• Extract from logicalgroup --show:

• Extract from sddquarantine --show:
----------------------------------------------------------
Ports marked as Slow Drain Quarantined in the Local Switch:
2/19
----------------------------------------------------------
Online Quarantined Devices across the fabric
----------------------------------------------------------
Port Index | PID | PWWN
----------------------------------------------------------
147 | 21db00 | 50:01:43:80:28:ce:16:ae
----------------------------------------------------------

• Extract from sddquarantine --clear:
sddquarantine --clear 2/19
Initiated clearing port from quarantined state
sddquarantine --show
----------------------------------------------------------
Ports marked as Slow Drain Quarantined in the Local Switch:
None
----------------------------------------------------------

Configured Limit=10 Quarantined count in chassis=1


Fabric Vision: Use Case
Detection

• Set up Event Processing on BNA:
• Select SDDQ Event (MAPS-1022) from Traps list:


Fabric Vision: Use Case
Detection

• Mark as Special Event, plus any other actions required, e.g. email:
• Special Events show up in Custom Event widget:
• Special Events flagged here:


Fabric Vision: Use Case
Mitigation

• Deploy MAPS policy and FPI to proactively alert you to latency bottlenecks
• Utilize SDDQ if you wish to automate proactive mitigation action
• Or manually put device in low quality of service zone
• Or remove device from fabric

• Initiator Checks:
– No. of LUNs/devices used by adapter
– Load balancing
– Queue depth settings
– HBA driver/firmware levels (upgrade if possible)
– Application load
– etc.


Fabric Vision: COMPASS



COMPASS

• Configuration and Operational Monitoring Policy Automation Services Suite (COMPASS)
• Simplifies deployment of larger fabrics with automated switch and fabric configuration services
• Ensures settings do not drift over time with policy violation monitoring within Brocade Network Advisor dashboards
• Synchronise (push) a template to all switches in a fabric or group

Diagram: COMPASS Policy (actions: apply policy, monitor drifts) > Template > Configuration Blocks


COMPASS Configuration Policy – Set Up
Creating policy/template/configuration blocks

• Configuration block allows users to create settings such as FTP Server, Trap Recipients, Syslog Recipients
• Brocade Network Advisor will provide users with default configuration blocks
• Each configuration block can have one or more related settings (nested block)
• Summary shows how many templates/policies/switches are using this particular configuration
• Users will be allowed to populate the values from existing switches


COMPASS Monitoring and Status
Drift detection and dashboard

• Dashboard widget summarizes all the configuration drifts and policy violations
• Users can drill down to check configuration drifts and policy violations
• When a switch drifts from the configuration, synchronize it from the template
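
The drift-and-synchronize idea can be pictured with a small Python sketch. It does not use any Brocade Network Advisor API: the setting names, addresses, and helper functions are hypothetical, and a template is modelled as a plain dictionary of expected values.

```python
def find_drift(template: dict, switch_config: dict) -> dict:
    """Return {setting: (expected, actual)} for settings that differ from the template."""
    drift = {}
    for setting, expected in template.items():
        actual = switch_config.get(setting)
        if actual != expected:
            drift[setting] = (expected, actual)
    return drift


def synchronize(template: dict, switch_config: dict) -> dict:
    """Return a copy of the switch configuration with the template values pushed over it."""
    updated = dict(switch_config)
    updated.update(template)
    return updated


if __name__ == "__main__":
    # Hypothetical configuration block values, for illustration only.
    template = {"syslog_recipients": ["192.0.2.10"], "ftp_server": "192.0.2.20"}
    switch = {"syslog_recipients": [], "ftp_server": "192.0.2.20", "name": "sw01"}

    print(find_drift(template, switch))
    print(find_drift(template, synchronize(template, switch)))
```

Settings that are not part of the template (such as the switch name here) are left untouched by the synchronize step.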



COMPASS Demo video
3.30 mins



Fabric Vision: Use Case D
Problem: High Port Utilization
Customer: Government
Fabric Vision Tools Used: MAPS, FPI, Flow Vision, BNA Dashboard



Fabric Vision Use Case – Government
Customer Problem
Frequent high port utilization alerts being issued from BNA Dashboard/MAPS.

Customer Challenges
• High utilization on NPIV port – identified during FV Workshop.
• Customer very concerned over alerts and potential other flows that could impact critical applications.
• No Support ticket raised or resolution process in progress

Recovery actions
Detection:
• MAPS/FPI RX Util alerts on Host NPIV port
• Flow Vision monitor utilised to identify flows behind host NPIV
• BNA historical graph identified peak times of high util caused by a single VM/application
• All other historical workflows showing low usage.
• Both fabrics showing same utilization (balanced).
Mitigation:
• MAPS alerting set up on specific traffic flows to determine frequency going forward.
• Flexibility to move other workflows and/or add additional host connections to mitigate the issue.

Fabric Vision Use Case – Government
Outcomes

Operational outcomes
• Issues identified during PS Workshop session
• Rapid identification of the flows and application causing the high utilization at specific times.
• Avoidance of downtime and critical escalations.
• PS benefits:
– Customer could make an informed decision on NPIV workflow design going forward.
– Improved insight into traffic balancing
– Customer benefit of Brocade best-practice guidance.

Business outcomes & ROI
• Avoidance of impact to critical application.
• Reduction in effort required from SAN and Server Admin teams to identify the root cause of and resolve the High Utilization issue.
• Best-practice deployment reducing possibility of recurrence and associated costs.


Fabric Vision: Flow Vision



Flow Vision
Broad visibility and insight across the storage network

Flow Learning
• Non-disruptively discover all flows that go to or
come from a specific host or storage port, or
traverse ISLs

Flow Monitoring
• Non-disruptively monitor any flow across the fabric
• Obtain statistics on specific flows or frame types

Flow Generator
• Full-mesh flow generator for stress testing
• Pre-defined flows to minimize configuration



Flow Monitor

• Create/Define Flows
– Switch port, Initiator Port, Target Port, LUN ID
– Trunk Groups, XISL & Backbone monitors
– Source to Destination or Bi-Directional

• Learn Flows
• Monitor Flows in BNA
– Frame Monitor Flow
– E2E Monitor Flow
– Top Talker Monitor Flow
– LUN level statistics

• Save up to 1024 flows in library
• Up to 64 flows active at one time


Flow Performance Management
Easily define flows from the Brocade Network Advisor SAN fabric topology view

• Instantly see in-depth frame and throughput performance statistics for key applications
• See trends over time and gain visibility into potential problems, avoiding an impact


Flow Vision Flow In-Context “Templates”
Flow provisioning management

• Simplified deployment of Flow Monitor
• Easily accessible, pre-canned templates from right-click menu on Switch/Initiator/Target Port
• Automated new dialog with populated fields based on the pre-canned template selection


Flow Vision Menu

• To create a new flow
– Under the main menu of SAN tab, Monitor > Fabric Vision > Flow > Add
– Also available by right-clicking on a switch or port


Flow Provisioning
Launched from Monitor > Fabric Vision > Flow > Add

Select the ellipsis icon to launch the Device Picker to select addressing


End Device Picker

• Show: Products and Ports
– Allows user to select source or destination device for flow provisioning


Zoned End Device Picker

• Toggle to: Show Zones
– Allows user to select source or destination devices for flow provisioning that are zoned together


Switch Port Picker: Point of Interest Selection

• Allows users to pick source or destination switch ports
– Ingress or Egress port
• Ports can be specified in D,I or Slot/Port format


Flow Provisioning – Advanced options

• Frame Type and LUN ID are the two advanced options available
– Only one Frame type value per flow
– Only one LUN value per flow


Flow Monitoring

• All the flows defined in the context of a fabric are listed in the left column
• User can move the flow definition to the right column to monitor the measures available for the flow
• The selected flows can be plotted on a Performance Graph



Flow Monitoring – Actions

• Flow monitoring dialog also allows the user to manage the flows:
– Flow drop down list
   • Add / Reset / Delete
   • MAPS Provisioning
   • MAPS Violations
– Feature drop down list
   • Activate
   • De-activate
   • Reset


Graphing Flows

• Selected flows can be plotted on graphs
– Real-time or historical
• Individual measures can be selected to plot
• Selected measures can be published to the dashboards


Graphing Flows (cont.)

• IO Insight Flows


Flow Vision Demo Video
1.30 min



Customer Problem
Frequent high port utilization alerts being issued from BNA Dashboard/MAPS.

Use Case D: Deep Dive



Fabric Vision: Use Case
Problem description

• High Utilization FPI alerts on physical NPIV switch ports
• Which flows are causing the high utilization?
• Customer suspects other flows may cause performance issues to their critical application
• Unclear whether the application traffic flow is balanced (i.e. same utilization on both fabrics)


Fabric Vision: Use Case
Detection

• Utilize MAPS CLI dashboard to identify high utilization FPI instances



Fabric Vision: Use Case
Detection

• Check MAPS dashboard history (last 7 days):

• Customer had a moderate policy enabled (alerts >=75%)



Fabric Vision: Use Case
Detection

• Now confirmed that the physical host NPIV port had high utilization
• 7 day history from the CLI not enough to gauge the frequency of the port utilisation issue
• Historical BNA dashboards - 1 month historical data will give a clearer picture via the violations
widget



Fabric Vision: Use Case
Detection

• From the CLI, use ‘portloginshow’ to view all virtual machine login PIDs behind the physical NPIV host port reporting FPI
• Took note of the PIDs, then activated the ‘sys_mon_all_fports’ Flow Vision learning monitor template (requires FOS 7.4+)


Fabric Vision: Use Case
Detection

• Activate flow learning template ‘sys_mon_all_fports’ monitor

• CLI Filter used to filter for the PID’s associated with the host NPIV port within the output
of the ‘sys_mon_all_fports’ learning monitor to gauge the heaviest flows (see output on
next slide)
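
The filtering step amounts to "keep only the flows whose source PID sits behind the NPIV port, then rank them by throughput". A minimal Python sketch of that idea follows; the record layout, field names, and sample PIDs are hypothetical and do not reflect the actual output format of the learning monitor.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FlowSample:
    src_pid: str            # source port ID as a hex string (hypothetical field)
    dst_pid: str            # destination port ID as a hex string (hypothetical field)
    throughput_mbps: float  # observed throughput for the flow (hypothetical field)


def heaviest_flows(samples, npiv_pids, top_n=3):
    """Keep flows sourced from the given PIDs and return the top_n by throughput."""
    wanted = {pid.lower() for pid in npiv_pids}
    matching = [s for s in samples if s.src_pid.lower() in wanted]
    return sorted(matching, key=lambda s: s.throughput_mbps, reverse=True)[:top_n]


if __name__ == "__main__":
    samples = [
        FlowSample("0a1b01", "0c0100", 820.0),
        FlowSample("0a1b02", "0c0100", 35.5),
        FlowSample("0a1b03", "0c0200", 12.0),
        FlowSample("0a2f00", "0c0100", 900.0),  # different host port, filtered out
    ]
    for flow in heaviest_flows(samples, {"0a1b01", "0a1b02", "0a1b03"}):
        print(flow)
```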



Fabric Vision: Use Case
Detection

• Used BNA flow vision to graph the flows via sys_mon_all_fports as follows:



Fabric Vision: Use Case
Detection

• Filter for ‘sys_mon_all_fports’ monitor:



Fabric Vision: Use Case
Detection

• Move the monitor to right hand-side to begin filtering for the PID’s:



Fabric Vision: Use Case
Detection

• Filter for the PID’s to graph in either real-time or historic:

• Note: historical graph has been selected to show how often particular flows are
peaking.
Fabric Vision: Use Case
Detection

• Historical graph will help determine highest and busiest flows over a time period:



Fabric Vision: Use Case
Detection Summary

• Identified the same flow (application) on both fabrics as being the flow with the highest peaks over a week/fortnight
• All other flows were showing low utilization
• The results verified the following:
– Non-critical flows were having no impact on the performance of the critical application flow
– A single flow/application was causing the high front port util FPI alerts (the critical app in this case)
– The flow/application was balanced over the dual fabrics, i.e. had the same utilization with no imbalance


Fabric Vision: Use Case
Mitigation

• Deploy MAPS policy and FPI to proactively alert you to high utilization
• Utilize historical performance monitoring in BNA
• Use Flow Vision to identify NPIV Flows
• Design/plan NPIV use to account for application workloads
• Add additional physical port connectivity depending on application workload
• Set up/fine-tune host queue depth


Fabric Vision: IO Insight (Gen6)



IO Insight: Deliver More Valuable Intelligence
Enhanced visibility into the health of your storage environment

Proactively monitor IO performance and behavior through integrated network sensors to gain deeper insights into problems and ensure service levels

• Advanced IO Monitoring: Monitor host and storage device IO workloads and behaviors
• IO Performance Diagnostics: Identify and isolate the source of device or network performance degradation
• Performance Optimization: Leverage IO statistics to provision and fine-tune the infrastructure

*IO Insight is not available on the Brocade G610 switch


IO Insight Capabilities
Monitor storage IO metrics to gain insights into problems

Proactive IO monitoring: Non-disruptively monitor application flows of interest and obtain device-level IO insight

• Read/Write statistics
– IO per second
– Avg/Max IO
– Total IOs
• Latency metrics
– First Response Time
– Exchange Completion Time
– Pending IO Time

No taps, no downtime, no disruption


Extending Brocade Fabric Vision Capabilities to
Monitor Virtual Machine IO
Brocade's VM Insight capability for Fabric OS 8.1.0 and later on Gen 6 platforms allows you to gather Fibre Channel and SCSI I/O performance and I/O latency statistics about individual traffic flows for virtual machine (VM) instances originating from a single N_Port ID (PID) to destination targets.

• Enables network administrators to identify, set baselines, and monitor VM performance end-to-end
• Performance metrics for VM flows:
– Network throughput statistics
– IO Insight metrics (Brocade G620, G630 and Brocade X6)
• Set thresholds for VM flows via MAPS to receive notifications


IO Insight Monitoring Through Brocade Network Advisor
• Real-time monitoring of storage IO flows
• Alert against storage IO device performance degradation through out-of-range violations widget
• Analyze IO metrics to verify storage performance behavior


Read/Write Command Metrics Defined
First response time and exchange completion time (aka “latency”) for SCSI read/write commands

Frame sequence between Host, Switch/Fabric and Target:
1. FCP READ CMD (SCSI READ CMD)
2. FCP Data, First Response (SCSI_DATA Frame 1)
3. FCP Data 2 (optional) (SCSI_DATA Frame 2)
4. FCP Data n (optional) (SCSI_DATA Frame n)
5. FCP STATUS GOOD (SCSI End Seq)

Metric definitions:
• First Response Time: RD CMD to 1st Resp; WR CMD to 1st Xfer Ready
• Exchange Completion Time: RD CMD to Status; WR CMD to Status

*DATA FLOW SHOWN IN THIS DIAGRAM IS FOR READ OPERATIONS.

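The two latency metrics defined on this slide reduce to simple timestamp differences within one exchange. The Python sketch below illustrates the arithmetic only; the class, its field names, and the microsecond timestamps are hypothetical and say nothing about how the switch actually captures them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExchangeTimestamps:
    """Timestamps for one SCSI exchange, in microseconds (hypothetical layout)."""
    command_us: int         # read or write command seen
    first_response_us: int  # first data frame (read) or first Transfer Ready (write)
    status_us: int          # FCP status frame seen


def first_response_time(ex: ExchangeTimestamps) -> int:
    """Command to first response, per the slide's First Response Time definition."""
    return ex.first_response_us - ex.command_us


def exchange_completion_time(ex: ExchangeTimestamps) -> int:
    """Command to status, per the slide's Exchange Completion Time definition."""
    return ex.status_us - ex.command_us


if __name__ == "__main__":
    read_exchange = ExchangeTimestamps(command_us=1_000, first_response_us=1_450, status_us=2_900)
    print("FRT (us):", first_response_time(read_exchange))
    print("ECT (us):", exchange_completion_time(read_exchange))
```

Integer microseconds are used here so the subtraction stays exact; any consistent time unit would work the same way.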
IO Insight Demo Video
2.20 mins



Agenda

09:00 – 09:15 Welcome and introduction
09:15 – 09:30 Fabric Vision Overview
09:30 – 10:45 Deploy: Building a clean and stable infrastructure
10:45 – 11:45 Monitor and Prevent: Using MAPS, Dashboards and FPI
11:45 – 12:30 Troubleshoot & Support: Using FV tools for rapid diagnosis and resolution
12:30 – 12:45 Intro into the Analytics and Monitoring Platform - AMP
12:45 – 13:00 Wrap-up and Q&A


Brocade Monitoring and Analytics Technologies

Brocade Fabric Vision Technology on Gen 5
Monitoring, management, and diagnostic tools
• Predefined, threshold-based rules/policies
• Dashboard views
• Live monitoring

Brocade Fabric Vision Technology with IO Insight
Gen 6 storage IO-level monitoring
• SCSI R/W response/latency monitoring
• Alert for host and storage outliers
• Identify latency and congestion
• 100 percent non-disruptive
Use cases:
• Flow-level monitoring of a device port
• Visibility to understand storage performance
• Monitoring SLA compliance

Brocade Analytics Monitoring Platform
Gen 5 and Gen 6 end-to-end IO monitoring
• Full visibility into all SCSI commands
• Alert for host, fabric, and storage outliers/individual IO flows
• Automatic LUN/flow learning
• Direct fabric latency monitoring
• 100 percent non-disruptive
Use cases:
• Monitoring mission-critical workloads
• Extensive diagnostics and troubleshooting
• Highly scalable from device level to fabric level


Why Brocade Analytics Monitoring Platform?

Stabilize the environment by turning visibility into actionable intelligence needed to optimize application and storage device performance

Deliver IO-level visibility to maintain SLA/SLO compliance and ensure predictable performance

Enable IT staff to easily track performance history and trends, allowing more time to be spent on strategic initiatives

© 2016 BROCADE COMMUNICATIONS SYSTEMS, INC.


Brocade Analytics Monitoring Platform

• Gain deeper insight into end-to-end application performance
• Optimize infrastructure without downtime
• Automate monitoring and alerting of abnormal behaviors
• Quickly pinpoint problems, uncover issues before users are affected


Brocade Analytics Monitoring Platform
Benchmark metrics such as latency, IOPS, and pending I/O to identify anomalies and help ensure predictable performance in minutes with zero downtime to the network and applications

Analyze 20,000 data flows and millions of IOPS on a single system to optimize application performance

Track performance history and trends, enabling IT staff to proactively monitor individual IO violations and resolve issues, often before they occur

Analyze vast amounts of metrics for all the devices non-invasively to uncover the often subtle causes of infrastructure issues that diminish application performance

Analytics Monitoring Platform Metrics
Real-time and historical metrics
• Read/write latency
• Read/write IOPS transfer rate (avg/max)
• Other SCSI command latency stats (reserve, release, inquiry, test unit ready; aggregate or individual)
• Pending/outstanding IOs (indicator of queue depth)
• IO size (avg/max)
• Protocol error stat tracking:
– I/O aborts, timeouts, check conditions
• Fabric latency
• AMP tracks metrics for all flows at 10 s/5 m/all resolution
• Historical metric retention via Brocade Network Advisor (5-minute granularity)

Metrics by data size:
• <8k
• 8k–64k
• 64k–512k
• >512k
• All sizes

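
The "Metrics by data size" ranges on this slide can be illustrated with a small bucketing function. This Python sketch rests on two assumptions the slide does not state: that "k" means 1024 bytes, and that each range includes its upper boundary. The function names are hypothetical.

```python
from collections import Counter

KIB = 1024  # assumption: "k" on the slide means 1024 bytes


def size_bucket(io_size_bytes: int) -> str:
    """Return the slide's data-size range for one IO (boundaries are an assumption)."""
    if io_size_bytes < 0:
        raise ValueError("IO size cannot be negative")
    if io_size_bytes < 8 * KIB:
        return "<8k"
    if io_size_bytes <= 64 * KIB:
        return "8k-64k"
    if io_size_bytes <= 512 * KIB:
        return "64k-512k"
    return ">512k"


def size_histogram(io_sizes) -> dict:
    """Count IOs per range and add an 'All sizes' total."""
    counts = Counter(size_bucket(size) for size in io_sizes)
    result = {name: counts.get(name, 0) for name in ("<8k", "8k-64k", "64k-512k", ">512k")}
    result["All sizes"] = sum(result.values())
    return result


if __name__ == "__main__":
    print(size_histogram([4096, 8192, 65536, 131072, 1048576]))
```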
Wrap-Up



10 Steps to SAN Maturity

Stages: Initiation – Basic Support, + Health Check, + BNA, + Fabric Vision, + AMP
Maturity scale: Reactive to Proactive

#1 SAN Infrastructure

#2 Switch Health

#3 Configuration inconsistency and suboptimal fabric/device design

#4 Availability – lack of redundancy in the fabric

#5 Availability – reduced resiliency due to physical layer port errors and faulty media

#6 Availability – congestion due to misbehaving devices or lack of buffer credits

#7 Utilization – capacity and balance planning for ports, targets, initiators, and ISLs

#8 Infrastructure performance: deliver IO-level visibility on target, initiator and fabric latency

#9 Target and host performance: reporting on pending IO to optimize queue depths and application performance

#10 Application performance optimization and trend analysis
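
Step #9 ties pending IO to queue depth. One general way to reason about that link is Little's Law: average outstanding IOs are approximately IOPS multiplied by average latency. The sketch below applies that relation; it is a back-of-the-envelope aid, not a Brocade tool or formula from this deck, and the function names and numbers are hypothetical.

```python
def expected_outstanding_ios(iops, avg_latency_ms):
    """Little's Law estimate: outstanding IOs ~= arrival rate * time in system."""
    return iops * avg_latency_ms / 1000.0

def queue_depth_headroom(queue_depth, pending_ios):
    """Fraction of the configured queue depth still unused (hypothetical helper)."""
    if queue_depth <= 0:
        raise ValueError("queue_depth must be positive")
    return max(0.0, 1.0 - pending_ios / queue_depth)

# Made-up numbers: 20,000 IOPS at 2 ms average latency.
pending = expected_outstanding_ios(20000, 2.0)
print(pending)                            # 40.0 outstanding IOs on average
print(queue_depth_headroom(64, pending))  # 0.375 of a 64-deep queue unused
```

If observed pending IO sits at or near the configured queue depth, the queue itself may be the limiting factor; if it stays far below, a deeper queue is unlikely to help.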



Ongoing Education



Creating a MyBrocade Account
Register
1. Visit https://login.brocade.com/ and click “Register.”

Add Privileges
2. Add any site access code or promotion codes and your Brocade product
serial ID if you have a support contract.
3. If you are a member of the Partner Program (APN or TAPP), complete
the form.

Activate Account
4. Activate your account by going to the email account you registered
with and clicking the activation link sent to you.

Finding the Learning Portal Link on MyBrocade

1. On the MyBrocade dashboard, find the Education tab at the top of the
page (red circle) and hover over it.
2. Click the Learning Portal link (blue circle).

Searching for Courses on the Learning Portal

• On the Brocade Learning Portal homepage, search for courses by typing
search words into the search box in the upper right-hand corner of the
page.

Thank You

