Documente Academic
Documente Profesional
Documente Cultură
Table of Contents
Introduction................................................................................................................... 2 Evolving Business Needs ........................................................................................ 3 Background The limits of older approaches to data distribution......... 4 Innovation without obsolescence ..........................................................................5 Summary.......................................................................................................................11
Introduction
As market data volumes continue their inexorable rise, the need to widely distribute that data to drive business applications increases exponentially. Yet most distribution technologies still rely on an aged IP stack unable to scale with increasing data volumes to achieve low latency, determinism, throughput and scalability. In this white paper, NYSE TechnologiesSM describes establishing a foundation for the next generation of business applications through Data Fabric, a high performance, low latency, message oriented middleware platform for distributing significant volumes of application level information across the enterprise. Building on the innovation first introduced in earlier versions of NYSE Technologies messaging infrastructure and based on years of production experience, the Data Fabric architecture bypasses both the kernel and operating system, while using industry standard servers and 10 Gigabit Ethernet or 40 Gigabit InfiniBand networks to achieve unprecedented performance.
Reduced cost through hardware footprint reduction, and minimizing operational complexity. Reduced latency, utilizing direct memory access and kernel bypass methods. Flexible deployment options with multiple transports to fulfil different deployment strategies.
Systems based on legacy technology are always lagging behind and missing opportunities. Of course, there are quick fixes. Pushing older system designs to the limits by, for example, placing them on new servers and operating systems might improve latency to some extent but it will not solve jitter unpredictability that can result in data delivery delays. Also unexpected changes in system load due to external events, such as economic news, make capacity planning as much an art as a science; modern systems need to be able to deal with dramatic load changes. Undoubtedly, market data rates will continue to grow. Massive volatility in trading volumes, such as the Flash Crash make planning difficult and require designs that provide headroom in daily operations to accommodate sharp and significant spikes in market data volumes. Firms cannot afford to deliver data too slowly because the operating
In order to address the limitations of TCP, UDP multicast became the dominant technology for high performance data distribution. However, multicast is a one-size-fits-all approach where the client system is responsible for consuming all data on a multicast group and discarding data. Workarounds do exist, like partitioning the multicast by consumer demand or isolating small consumers. However, these introduce more complexity, which leads to higher maintenance costs and reduced flexibility. The ability of a client system to consume data is limited by the kernel and IP stacks capacity to process data. On a modern server, the amount of application processing capacity outstrips the ability of the IP stack to service those clients. The net result is buffer overruns with substantial data loss or excessive dwell time in the kernel, resulting in poor system performance. Performance issues in middleware and software are problematic as market data volumes grow, server hardware advances in speed and networks continue to increase capacity and speed. The IP technology stack, now over 30 years old, is showing its age and is the bottleneck in data messaging.
Publisher
App Memory
NIC Driver
TCP/IP Transport Driver
TCP/IP Sockets Provider Kernel Memory
NIC Driver
TCP/IP Transport Driver
TCP/IP Sockets Provider Kernel Memory
NIC Driver
TCP/IP Transport Driver
TCP/IP Sockets Provider Kernel Memory
Application
App Memory
Application
App Memory
Application
App Memory
Breaking new boundaries, NYSE Technologies was one of the first to market with Remote Direct Memory Access (RDMA), which writes data messages to local memory and then replicates them to other computers in microseconds.
This approach revolutionized the movement of data and clearly showed its superiority over IP-based messaging systems. However, as with any point-to-point solution, it can suffer when widespread distribution is required. The first client will receive the data very quickly but as additional clients are added, the message must be sent to each client, resulting in additional latency and network bandwidth.
HCA
Subscriber
Local Direct Memory Access (LDMA) is similar, but as the name suggests, it operates within a single physical server. As the number of cores on a server has increased, application architects have been able to collapse their designs so that market data consumption, analytics and trading can be deployed on a single server. NYSE Technologies introduced an LDMA messaging model enabling extremely high performance for messaging applications where the publisher and subscribers are located on the same server.
Figure 3. Data Fabric Local Direct Memory Access (LDMA) transport schematic.
Now with Data Fabric, NYSE Technologies is once again introducing an industry leading transport, MultiVerb. By taking advantage of the IB verbs over Infiniband or 10 gigabit Ethernet, Data Fabric utilizes full kernel bypass and zero copy semantics, and now uses multicast delivery to be able to achieve massive scalability with minimal latency.
A publisher writes a message to the Data Fabric transport, which is sent to the network once and the physical switch replicates to all interested clients. On receipt, the message is moved straight to user space memory where Data Fabric does a rapid look-up to determine if any clients are interested. At no point is the IP stack involved. The net result is that Data Fabric MultiVerb offers substantial savings in CPU utilization and latency, while delivering significant scalability.
Publisher Application
Data Fabric MultiVerb Interrupt/Copy Boundary Memory Region
HCA
Interrupt/Copy Boundary Data Fabric MultiVerb Application
HCA
Interrupt/Copy Boundary Data Fabric MultiVerb Application
HCA
Interrupt/Copy Boundary Data Fabric MultiVerb Application
HCA
Interrupt/Copy Boundary Data Fabric MultiVerb Application
Memory Region
Memory Region
Memory Region
Memory Region
Latency Distribution
5.0 4.0 3.0 2.0 1.0 0.0 0 5000 10000 15000 20000 25000
% of Messages
Figure 5. MultiVerb transport latency results with 1000 consumers on a QDR InfiniBand network.
Data Fabric also supports traditional TCP over 1 Gigabit Ethernet for less demanding applications that require firewalls or wider distribution.
Data Fabric works with NYSE Technologies Open Middleware Agnostic Messaging APITM(OpenMAMATM) which allows firms to choose the middleware, or combination of middleware tools, that best suit their current needs. Data Fabric supports multiple IPC idioms including TCP, shared memory, LDMA, RDMA and MultiVerb, without requiring specialized network knowledge of software developers.
As a firms architecture changes, they can reconfigure their systems without writing new code. Moving from one transport to another is merely a configuration change. The solution also offers zero client configuration by centralizing this to the Naming Service Daemon (NSD). The NSD also maintains detailed client statistics to ease management and control. NYSE Technologies is a trusted name in capital markets and continues to innovate by constantly working on new developments. To keep ahead of user requirements, NYSE Technologies is testing Data Fabric on Intel Sandy Bridge processors and on large multi-core boxes with 60 to 80 cores.
Summary
As a trusted name in capital markets, NYSE Technologies continues to break new ground in next generation messaging and is fundamentally changing the approach to middleware. While upgrading servers and networks improves performance, this moves the bottleneck to the software stack, which has lagged behind improvements in hardware technology. By leveraging next generation hardware and semantics, Data Fabric delivers breakthrough performance as the capital markets community faces increasing challenges. Its all about thinking outside the stack.
2012 NYSE Euronext. All rights reserved. All third party trademarks are owned by their respective owners and are used with permission. OpenMAMA is a trademark owned by the Linux Foundation. This announcement may contain forward-looking statements regarding NYSE Euronext and its affiliates. Such statements are based upon the current beliefs and expectations of management and are subject to significant risks and uncertainties. Actual results may differ from those set forth in the forward-looking statements
dc/N11605/120130