Documente Academic
Documente Profesional
Documente Cultură
01
URGENT
CONTENTS
1
2
OmniPCX Enterprise
LOSS OF SIGNALING LINK ON IP DEVICES
AND INFORMATION ABOUT "UDP-LOST"
AND "UDP-KEEP ALIVE" TIMERS FROM QoS
CATEGORIES
1. PURPOSE OF DOCUMENT
The purpose of this document is to provide configuration recommendations when IP devices such as
GD, INTIP, IP-Phone, PCS are connected to the call server behind unstable IP links of the client
network.
The instability of the IP network is characterized by the loss of the signaling link with the call server,
followed by GD, INTIP and IP Phones reset.
If the customers complain, it is necessary to assess the quality of those links and adapt, if necessary,
the system configuration.
This document does not address the case of call server duplication.
If they are not rescued by PCS or backup signalling, GD and INTIP reboot in a loop until the link
with the call server is restored, then transmit to the call server incident 5857 giving the reason why
they rebooted "Hardware reset on loss of IP DL (DL = Data Link).
5857 = GD / GA / INTIP / RGD: Reason of reboot 2 for GD, GD3, INTIP3
5857 = GD / GA / INTIP / RGD: Reason of reboot 5 for INTIP2
The example given above corresponds to the default values of timers defined in "QoS" IP category:
− UDP Lost = 7s
− Lost UDP Reinit = 7s
− UDP-Keep-Alive = 15s
However we will see why and how these values can be changed depending on the behavior of the
client network.
5.2 Why after a short network interruption sometime all IP devices do not
reboot?
During short breaks in the network, some IP devices can be lost by the system while others are not. It
depends on when the "UDP-Keep-Alive" is positioned on each device in relation to the duration of
the outage. This example justifies increasing UDP-Lost timer in case of short network interruptions.
Network breaks less than 22 seconds (15 seconds of "Keep-Alive" + 7 seconds of "UDP-Lost) are not
necessarily detected.
5.4 Why IP devices reboot after detecting the loss of the signalling link?
Alcatel-Lucent believes that a network supporting VoIP should be stable. However, it is planned to
reboot IP devices when they lose the IP signaling link with the call server for a defense reason. For
this reason it is necessary to analyze the 5857 incidents as they give the reason for restarting GD /
INTIP "Hardware reset on loss of DL-IP. (DL = data link)
5857 = GD / GA / INTIP / RGD: Reason of reboot 2 for GD, GD3, INTIP3
5857 = GD / GA / INTIP / RGD: Reason of reboot 5 for INTIP2
6. PREVENTIVE MAINTENANCE
Statistics tool of signaling link.
It is possible to observe the behavior of signaling link. The statistics are stored in memory and are
saved in the monitor.log file stored in /mnt/flash/info during reboot of the GD. The
statistics are also available in RAM during operation of the coupler, and give an overview of the
health of the signaling link.
− To collect statistics in real time in live memory proceed as follows.
• GD / GA, GD2 / GA2
Connect with telnet then monitor
board 0 0
Other 0 0
sig
dump
− For GD3 / GA3 INTIP3:
• Connect with telnet then monitor
system
dl
dump
− For INTIP / INTIP2
• Do a cpl_online <crystal> <coupler>
remoteip
dump
− To collect statistics recorded during previous GD reboot you have to open monitor.log files
stored in /mnt/flash / info".
Interpretation
• From A to B, are recorded the last 20 events occurring on the signalling link.
• At 10.20 am as there is traffic, we can not see any "Keep-Alive" messages, but only data
messages. We can see that some data messages were not acknowledged as well on call
server side as on GD side.
• A 10:21 am: (C) the GD no longer receives data ACK, and after 7 consecutive attempts to
send the same message, GD reboots at the end of the timer UDP-Lost 7s.
In the table at the end of this edition, we can see that since the last restart of the GD
• 579 times GD had to retransmit a message once because he was not acknowledged.
• 6 times GD had to send the same message 2 times because it was not acknowledged.
• 5 times GD had to send the same message 5 times because it was not acknowledged.
• After 7 consecutive failures, the GD reboots.
Dump DL incident:
02/12 10:19:40 - LOCAL nack 1 times seq num 16146 <---------------------- A
02/12 10:19:48 - DISTANT nack 1 times seq num 9259
02/12 10:19:48 - LOCAL nack 1 times seq num 16237
02/12 10:20:18 - LOCAL nack 1 times seq num 16248
02/12 10:20:36 - Msg lost
02/12 10:20:36 - DISTANT nack 1 times seq num 9320
02/12 10:20:39 - DISTANT nack 1 times seq num 9350
02/12 10:20:42 - DISTANT nack 1 times seq num 9352
02/12 10:20:56 - LOCAL nack 1 times seq num 16280
02/12 10:20:57 - DISTANT nack 1 times seq num 9355
02/12 10:21:15 - Msg lost
02/12 10:21:15 - DISTANT nack 1 times seq num 9360
02/12 10:21:21 - Msg lost <----------------------------------------- C
02/12 10:21:22 - Msg lost
02/12 10:21:23 - Msg lost
02/12 10:21:24 - Msg lost
02/12 10:21:25 - Msg lost
02/12 10:21:26 - Msg lost
02/12 10:21:27 - Msg lost <----------------------------------------- B
02/12 10:21:27 - State of the link changed to LINK_DOWN
Dump of DL Stat :
000000AA-08BB676F: - Rx num = 26282 (Data 17510 + Ack 8772)
000000AB-08BB676F: - Rx Nack num = 698
000000AC-08BB676F: - Tx num = 9365
000000AD-08BB676F: - Tx Nack num = 1188
000000AE-08BB676F: - Tx Rp num = 0
000000AF-08BB676F: