Documente Academic
Documente Profesional
Documente Cultură
Abstract— Switching systems based on the ATM principle respect to ATM-based solutions, statistical multiplexing
have outgrown the experimental stage, and are already used of packetized voice and data in combination with voice
today in private and corporate networks, as well as in public compression and intelligent call-routing features will be
wide area networks to provide regular service. ATM has the
inherent ability to provide a common basis for transmission and required.
switching functionality in both local and wide area networks. 3) Network Consolidation: Providers of existing wide-area
With the potential to support all services available today as networks introduce ATM as the basis for a future broad-
well as those envisaged for the future, ATM holds a strong band ISDN, which allows the consolidation of the ex-
promise for network operators and end customers. To fully isting networks in the long term. Such an application
exploit this potential, ATM switch architectures are required
which provide versatility and modularity in supporting services scenario requires intensive interworking with the exist-
and protocols, independent scalability of data throughput and ing narrow-band networks with adaptation functions for
control performance over a wide range, and also reliability the user channels and feature transparent interworking
features adaptable to the respective application scenario. This between the signaling systems used.
paper describes in some detail how the MainStreetXpress core 4) Entertainment-Oriented Interactive Video and Multime-
services node, which has evolved from the prototype described
earlier [5] to a mature central office ATM switch, addresses these dia Services for Residential Customers: In the medium
issues and provides a future-proof architecture incorporating all term, this application area is expected to grow sig-
of the features required in the B-ISDN era. nificantly. In this area, cost-effective solutions for the
Index Terms—Asynchronous transfer mode, broad-band com- customer access are the key issue. In the core net-
munication, redundancy, switching systems. work, high-throughput ATM switches able to support
the highly asymmetrical traffic streams in an efficient
way are required to accommodate an adequate number
I. INTRODUCTION: NETWORK CONCEPTS of subscribers per switch.
throughput guarantees for individual queues as required, e.g., One of the advantages of a cell-based flow control protocol
for ABR traffic. WFQ was first proposed in [4], and has like DBA is its independence from the ASN core realization.
been further analyzed in [11] and [12]. The algorithms studied DBA can be used with any nonblocking self-routing switching
in these references have been designed for nodes handling fabric. There are three basic types of messages passed across
variable-length packets and are difficult to implement. The the core fabric between ingress and egress SMU’s.
SMU uses a simplified algorithm which is easier to implement • Request and Cleardown messages are generated by the
and more suitable for ATM systems switching fixed size ingress SMU’s in response to changes in the fill state of
cells. For a discussion of further WFQ algorithms and the their scheduler blocks.
role of WFQ for ATM traffic management, the reader is • The Offer message is generated by egress SMU’s when
referred to [14]. In order to shape the peak cell rates of bandwidth cleared down by one SMU can be offered to
individual connections, a scheduler block can be operated in another one.
the “rate-shaping” mode. In this mode, the service rate of each In [17], the performance of the DBA protocol has been
individual queue of the scheduler block is always kept below studied. For high loads, the cell delays in a switch using input
a queue-specific maximum value. and output buffering together with the DBA protocol approach
The scheduler blocks of the SMU’s are used for different those experienced in an ideal output buffered switch. For very
purposes at the ingress and egress side, respectively. In the low loads, the delays are slightly higher than for the ideal
egress SMU, a scheduler block is associated with a physical switch due to the fact that it takes some time before a backlog
interface of the switch or with a virtual path originating in the of cells at the ingress is detected and the bandwidth of a pipe
node. In the simplest case, the rate of the scheduler block is increased.
is defined as the rate of the associated interface, e.g., 45
Mbit/s for a scheduler block feeding a DS3 interface.
In the ingress SMU, one scheduler block is provided per E. Control Concept
egress SMU in the system which contains the FIFO queues The main challenge for the control concept of broad-band
for all nonreal-time connections routed via this specific egress systems is to meet the requirements of future applications and
SMU. The service rate of the scheduler block defines the traffic mixes, where little relevant experience exists today.
capacity of an ATM pipe interconnecting the ingress SMU Other than in narrow-band switches, the applications are very
with the corresponding egress SMU. heterogeneous and have significantly different traffic profiles.
The Dynamic Bandwidth Allocation Protocol: As shown in For example, for video on demand, the bandwidth required per
Fig. 6, the DBA protocol controls the capacity (cell rate) connection is relatively high (1.5–6 Mbit/s), and long holding
of the pipes which interconnect each ingress/egress SMU times (60 min and more) occur quite often. Narrow-band
pair. The DBA protocol is a distributed resource management backbone trunking with switching of individual connections,
mechanism whereby each SMU is responsible for its own local on the other hand, requires low connection bandwidth (64
resources, i.e., DBA does not involve any central functions kbit/s or less), but due to the—on average—shorter holding
in the switch. The DBA procedures use a cell-based mes- times and the large number of individual connections, the
saging protocol to exchange bandwidth allocation information required call-processing power is dramatically higher than in
between ingress and egress SMU’s. the first case, while the required switch fabric throughput is
Each ingress SMU has to detect a backlog of cells for significantly lower. As a consequence, ATM switches need a
any pipe to an egress SMU, and interprets the backlog as an scalable processing platform with flexible task assignment and
indication that the currently allocated rate for the specific distributed application software to be future-proof. Moreover,
pipe should be increased. This causes a DBA control cell to the processing power has to be scalable independently of the
be sent to the appropriate egress SMU requesting an increase size of the switch fabric and the number of external interfaces.
in the path’s allocated bandwidth. However, the ingress SMU For MainStreetXpress core node, a distributed control plat-
only requests additional bandwidth from an egress SMU if it form concept has been developed [8]. The main functions
has bandwidth available at its own output (bottleneck at point for call processing and operation/maintenance are performed
) to send data into the ASN core. by a cluster of central control processors, the MP’s (main
Upon receiving a request cell, the egress SMU will grant processors). Starting with one processor, the processing power
the request only if the port bandwidth at point in Fig. 6 of the cluster can be easily extended by adding additional MP’s
is not already fully allocated. If an egress SMU has allocated according to the requirements.
all its inlet bandwidth, any further requests for bandwidth are The peripheral control platforms (PCP’s) located on the
queued until an ingress SMU has emptied its logical buffer various modules (LIC’s, ECC’s, AMX’s, SMU’s, ASN core)
assigned to this egress SMU, and sends a clear down message support the central processors by performing local, mainly
to release bandwidth no longer required. It is the purpose of hardware-related tasks like maintaining the connection-specific
the large input buffers to avoid cell loss in this situation. tables on the LIC’s as well as local maintenance and fault de-
This method ensures that only those cells are admitted to the tection tasks. The communication among the central processors
ASN core that have a prebooked exit from it in order to avoid and between the central and the peripheral processors is ATM
excessive cell losses in the buffers of the switching elements. based, and uses the ASN communication infrastructure already
On the other hand, the latency of the control is low enough to available. The internal control communication protocol uses a
guarantee a full utilization of the switch fabric. standard AAL. The AAL functions as well as real-time critical
802 IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 15, NO. 5, JUNE 1997
Application Software Platform: The application software fault-tolerant software implementation, special software
platform comprises the following generic services which are error treatment and a recovery strategy with several
used by the various applications of a broad-band switch. escalation steps are supported.
• Operation Administration and Maintenance (OAM) Ser- • System Utilities: Online debug utilities are provided for
vices: These comprise the Q3 communication services testing and fault evaluation purposes. The main online
and the file system. Applications use the Q3 communi- utilities are patch, trace, and dump.
cation services for the processing of Q3 commands and Applications: The applications implement the specific ser-
for the output of statistics/billing data to the operation vices of the broad-band switch visible to the user. Call-
center. Generic Q3 support functions are event forward- processing software controls the setup and release of calls
ing, notification logging, security functions, and alarming. and connections. Signaling interfaces for the NNI (e.g., CCS7)
The Q3 protocol stack comprises CMIP/CMISE, FTP, and the UNI (e.g., DSS2) are supported. With respect to the
and TCP/IP. The file system is used for the storage of implementation, supplementary features are separated from
administered data or log files for billing/statistics data the basic call handling. Hence, the realization of new call-
on disk. Access to the file system and Q3 communication processing features has a minimal impact on the existing
services is performed through a common API (application software. For billing, the relevant information including, e.g.,
programming interface). the cell counts is provided in a format suitable for pro-
• Physical Switching Server: The physical switching server cessing at an external billing center. Statistics are collected
is responsible to actually set up the paths through the for performance management (e.g., quality of connections),
ASN. For each connection setup request, the application traffic measurements, and load measurements for the MP.
specifies the endpoints of a path. The physical switching Administration is provided for a number of applications, e.g.,
server determines the modules involved and formulates call processing, CCS7 network, system configuration, and
the switching commands required to load the connection- ATM layer OAM functions (loop back, continuity check, etc.).
specific tables in the LIC’s and, for multicast connections Both the narrow-band CCS7 (MTP, ISUP) and the B-ISDN
only, in the switching elements. Thus, the physical switch- (SAAL, MTP level 3, B-ISUP) signaling protocols and the
ing server hides the details of the switching fabric from required interworking functions are implemented. Together
the applications. with the packetization/depacketization for the user traffic on
• Database Management System (DBMS): Applications use the LIC’s and the additional echo cancelation circuits, this
DBMS to store and access semipermanent and transient allows for interworking with the narrow-band ISDN.
data. DBMS is a distributed database system, i.e., data
may be located on any MP in the system. To provide
efficient access and storage, data may be partitioned or III. REDUNDANCY PRINCIPLES
replicated. For example, subscriber data are partitioned
To be able to meet the availability and reliability require-
among the CallP–MP’s, i.e., a specific CallP–MP keeps
ments for a central office switch, various redundancy concepts
the data of a particular subscriber. Routing data for
are implemented in the MainStreetXpress core node, including:
digit translation are replicated, i.e., the same data are
kept on each MP. The location of the data on a certain • full redundancy for all central components, i.e., for the
MP is completely hidden from the application. DBMS ASN, the SMU’s, and the central control;
offers generic access routines (e.g., GET, CREATE, SET, • optional redundancy for external lines and LIC boards;
DELETE) and Q3 operations (e.g., scoping, filtering), and • pool redundancy for ECC’s.
provides concurrency and transaction control to guarantee In general, the application classes for redundancy are de-
consistent data access. The implementation approach for fined as follows.
DBMS is data driven, which means that all information “ ”: The traffic protected by the redundancy is car-
on data is kept in a data dictionary (DDI). ried via one working and one protection entity
• Upgrade: The MainStreetXpress core node allows for the simultaneously. The receiver terminating the
online upgrade of the system software without loss of redundancy has to select cells from one or the
service. During upgrade, the redundant central control other entity to be able to forward one consistent
hardware is used to switch all system functions from the traffic stream.
old to the new software version. The upgrade procedure “ ”: The traffic protected by the redundancy is only
is controlled by generic platform software. carried by the protection entity if a failure oc-
• Load Control: The load state of the MP’s is constantly curs on the working entity. One protection entity
monitored. In case of overload, appropriate measures are protects one working entity. During normal con-
taken by the load control, for example, new call requests ditions, the protection entity can be used for other
are rejected. If an MP gets into overload states frequently, applications, e.g., to transport extra traffic, which
a reassignment of a service may be initiated. For example, is dropped if a failure on the working line occurs.
CCS7 links could be assigned to another or new MP–SLT. “ ”: The traffic protected by the redundancy is only
• Maintenance: Performs all hardware-related maintenance carried by the protection entity if a failure occurs
activities such as fault processing, diagnostics, reconfig- on one of the working entities. One protection
uration, and routine testing of an MP. To allow for a entity protects working entities.
804 IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 15, NO. 5, JUNE 1997
free channel out of this pool can be freely selected by the [6] W. Fischer, T. Rambold, and T. Theimer, “Internet and ATM—Prospects
application software. If, e.g., due to hardware failures, single for the Internet of the future as it integrates ATM capabilities,” Telcom
Rep. Int., vol. 19, 1996.
echo cancellation channels or complete ECC’s have to be taken [7] W. Fischer, E. Wallmeier, T. Worster, S. Davis, and A. Hayter, “Data
out of service, sufficient spare capability is provided, which communications using ATM—Architectures, protocols and resource
can then be allocated to new connections. management,” IEEE Commun. Mag., vol. 32, Aug. 1994.
[8] C. H. Hoogendoorn and A. T. Maher, “Enhanced SW architecture for
an ATM universal communication node,” in Proc. ISS’92, Yokohama,
D. Redundancy of Central Control Japan, vol. 1, paper C4.7.
[9] IETF, RFC 1577, “Classical IP and ARP over ATM,” Jan. 1994.
An MP consists of two physical processors with microsyn- [10] ITU-T, Draft Recommendation E.73X, “Methods for traffic control in
chronous operation. Due to this kind of operation, both sides B-ISDN,” Geneva, Switzerland, Dec. 1994.
[11] A. K. Parekh and R. G. Gallager, “A generalized processor sharing
always have the same information about transient states, and approach to flow control in integrated service networks: The single node
the redundancy is completely transparent to the applications case,” IEEE/ACM Trans. Networking, vol. 1, pp. 344–357, June 1993.
[12] , “A generalized processor sharing approach to flow control in
software. If one side fails, the other side takes over immedi- integrated service networks: The multiple node case,” IEEE/ACM Trans.
ately. The identification of the faulty unit is done by means of Networking, vol. 2, pp. 137–150, Apr. 1994.
the built-in self-test capabilities of the processors. Besides this [13] J. W. Roberts, “Variable bit rate traffic control in B-ISDN,” IEEE
Commun. Mag., vol. 29, pp. 50–56, Sept. 1991.
hardware redundancy, there is a logical redundancy on the [14] , “Virtual spacing for flexible traffic control,” Int. J. Commun.
service level for the central control platform. For replicated Syst., vol. 7, pp. 307–318, 1994.
services, any MP offering the service can replace a faulty MP [15] A. Romanow and S. Floyd, “Dynamics of TCP traffic over ATM
networks,” IEEE J. Select. Areas Commun., vol. 13, no. 4, pp. 633–641,
without impact on the applications. For partitioned services, 1995.
the service may be reassigned to another MP. [16] E. Wallmeier, “A connection acceptance algorithm for ATM networks
based on mean and peak bit rates,” Int. J. Digital and Analogue Commun.
Syst., vol. 3, pp. 143–153, 1990.
IV. CONCLUSION [17] E. Wallmeier, S. Schneeberger, T. Worster, and S. Davis, “Traffic control
in ATM switches with large buffers,” in Proc. 9th ITC Specialists
This paper has provided an insight into the architecture of Seminar, KPN Res., Leidschendam, The Netherlands, Nov. 1995.
the MainStreetXpress core node which has been designed for [18] T. Worster, W. Fischer, S. P. Davis, and A. T. Hayter, “Buffering and
flow control for statistical multiplexing in an ATM switch,” in Proc.
central office applications in the core of an ATM network. The
ISS’95, 1995.
highlights of this architecture are the following:
• the scalability, both in terms of throughput and processing
power, allowing to build configurations ranging from
small single-shelf solutions up to very large central office
switches; Erwin P. Rathgeb was born in Ulm, Germany,
in 1958. He received the Dipl.-Ing. and Dr.-Ing.
• the high availability and reliability which are achieved by (Ph.D.) degrees in electrical engineering from the
using redundancy mechanisms in all central components University of Stuttgart, Germany, in 1985 and 1991,
of the system core as well as in the interface units; respectively.
From 1985 to 1990, he was a member of the
• effective support of traffic management mechanisms Scientific Staff at the Institute of Communications
which lead to high link utilization for bursty data traffic, Switching and Data Techniques, University of
and which are confined within specific SMU modules Stuttgart, where he was head of a research group
on the design and analysis of distributed systems.
avoiding that these complex mechanisms have to be From 1990 to 1991, he was a Member of the
treated in every part of the system; Technical Staff within Applied Research at Bellcore, Morristown, NJ, before
• a software architecture which allows us to introduce joining Bosch Telecom, Backnang, Germany. In 1993, he joined the Public
Communication Networks Group of Siemens AG, Munich, Germany. There,
feature upgrades without a major impact on existing he contributed to the definition of the overall system architecture of the
software, and to upgrade the system without service MainStreetXpress core node. Currently he is involved in product planning for
interruption. broad-band communication systems.
Christian Hinterberger was born in Burgkirchen, Regina Wille-Fier was born in Darmstadt, Ger-
Germany, in 1961. He studied electrical engineering many, on April 25, 1956. She studied mathematics
at the Technical University of Munich, where he at the Universities of Tübingen and Münster and
received the Dipl.-Ing. degree in 1986. received the Dr. rer. nat. (Ph.D.) degree for her work
In 1987, he joined the Siemens Research and in the area of functional analysis.
Development Department, where he was involved In 1985, she joined the Research Laboratories
in studies on ATM switching architectures. In 1989, of the Public Communications Networks Group of
he joined the Siemens ATM System Development Siemens AG, Munich, Germany, where she was
Group. There, he was engaged in the development working on software architectures for ATM switches
of ATM switching systems and ATM ASIC’s. From with decentralized control. She participated in the
1994 to 1996, he was with the ATM System Engi- development of the concepts for switching network
neering Group. Since 1997, he has been with the Siemens Semiconductor control, call processing, and internal communication for the MainStreetXpress
Department, responsible for product definition of ATM layer and ATM core node. Currently, she is Director of Systems Engineering for broad-band
adaptation layer ASIC’s. communications systems.