Nooob
05.10.2012, 13:26
Доброго всем время суток!

Есть вопрос, система CS1000E 7.5
после команды SCPU проходят процедура по переключения с core0 на core1.
Но в результате активный ЦПУ - опять core0

> ld 135
CCED000
.stat cpu

cp 0 23 PASS -- ENBL

SYSTEM STATE = REDUNDANT
DISK STATE = REDUNDANT
HEALTH = 14
VERSION = Nov 16 2010, 14:42:29
Side = 0, DRAM SIZE = 1006 MBytes

CP[0] located at IPMG [0 0 23]


cp 1 23 PASS -- STDBY

SYSTEM STATE = REDUNDANT
DISK STATE = REDUNDANT
HEALTH = 14
VERSION = Nov 16 2010, 14:42:29
Side = 1, DRAM SIZE = 1006 MBytes

CP[1] located at IPMG [0 1 23]

=========================================
Summary of Local System Resource (side 0)
=========================================
File Descriptors
-----------------
alloc 88
free 1960
total 2048

Unprotected Heap (bytes)
------------------------
alloc 77449840
free 443828832
total 521278672

Protected Heap (bytes)
----------------------
alloc 1055296
free 3139008
total 4194304
.****

>ld 135
CCED000
.scpu
OK
.
SRPT048 DR: Master asked to stop updates and flush file system.

CCED763 LCS: Graceful Switchover. Call Processing Stopped at 11:12:15

SRPT118 CM: Server connection lost.

SRPT021 LCS: hsp state changed from LcsHspStUp to LcsHspStDown.

SRPT131 CM: Primary client can't connect to the other side.

CCED762 SWO: Graceful switch-over from side 0 to side 1 completed
Previous Graceful SWO: at 5/10/12 11:02:27

CCED764 LCS: Graceful Switchover. Call Processing Resumed at 11:12:18

SRPT041 LCS: Graceful switchover command successful.

SRPT027 HB: Cannot detect heartbeat from other core

SRPT2290 HB: HBWaitEtherRep - HB threshold exceeded.


TTY 12 SCH MTC BUG CTY 11:14
OVL111 IDLE 0
>
TTY 12 SCH MTC BUG CTY 11:14
OVL111 IDLE 0
>
ELAN028 The following server failed to register it's pbxLink.
IP <192.168.3.5>, host name <ss1prm>, server type <UNKN>.

TTY 12 SCH MTC BUG CTY 11:14
OVL111 IDLE 0
>
ELAN028 The following server failed to register it's pbxLink.
IP <192.168.3.6>, host name <ss2sec>, server type <UNKN>.

SRPT752 INI 0: INI completed in 4 seconds

ELAN019 ELAN Server enabled after M1 INIT

SRPT179 STARTUP: IPMG Controller task completed successfully.

SRPT091 HB: Local side IPL health change from 0 to 8.

ELAN014 ELAN 0 host IP=192.168.3.4 is enabled

SRPT017 OMM: IP link is UP between Call Server and IPMG[0 0]

ELAN014 ELAN 0 host IP=192.168.3.5 is enabled

ELAN014 ELAN 0 host IP=192.168.3.6 is enabled

ELAN014 ELAN 0 host IP=192.168.3.3 is enabled

SRPT226 Registration has been granted for IP[192.168.3.3] IPMG[0 0]


ELAN014 ELAN 16 host IP=192.168.3.13 is enabled

CSA003 16 11:14:48 5/10/2012

CSA105 16, 11:14:48 5/10/2012

SRPT017 OMM: IP link is UP between Call Server and IPMG[0 1]

SRPT226 Registration has been granted for IP[192.168.3.4] IPMG[0 1]


ELAN007 ELAN 16 host IP=192.168.3.13 disabled, read from socket
fail due to far end disconnect or Ethernet problems

CSA104 16, 11:14:50 5/10/2012

SRPT110 CM: Primary client connection established.

SRPT117 CM: Server connection established.

SRPT021 LCS: hsp state changed from LcsHspStDown to LcsHspStUp.

SRPT028 HB: Heartbeat detected from remote core.

SRPT024 CPM: CPM started protected memory sync.

SEC054 A device has connected to, or disconnected from, a pseudo tty without authentica
ting

SRPT023 CPM: CPM completed protected memory sync.

SRPT033 DR: Master is starting disk sync.

ELAN014 ELAN 16 host IP=192.168.3.13 is enabled

SRPT093 AML: local side AML connection 16 to 192.168.3.13 health change:2.

SRPT091 HB: Local side AML health change from 0 to 2.

SRPT043 LCS: Protected memory and disk synchronization complete.

SRPT050 DR: Disk sync completed.

SRPT092 AML: remote side AML connection 16 to 192.168.3.13 health change: 2.

SRPT091 HB: Remote side AML health change from 0 to 2.

TTY 12 SCH MTC BUG CTY 11:15
OVL111 IDLE 0
>logi admin2
PASS?
SEC0029 SECURITY WARNING: THIS SYSTEM CONTAINS INSECURE PASSWORDS, NOTIFY YOUR SYSTEM ADMINISTRATOR

.
TTY #12 LOGGED IN ADMIN2 11:15 5/10/2012

>
The software and data stored on this system are the property of,
or licensed to, Avaya Inc. and are lawfully available only to
authorized users for approved purposes. Unauthorized access to
any software or data on this system is strictly prohibited and
punishable under appropriate laws. If you are not an authorized
user then logout immediately. This system may be monitored for
operational purposes at any time.
ld 135
CCED000
.
SRPT4619 WARNING: Last Archive Procedure had failed
No archives were completed since Oct 04 12:20:00 2012

sat
CCED001 Invalid command.
.stat cpu

cp 0 23 PASS -- ENBL

SYSTEM STATE = REDUNDANT
DISK STATE = REDUNDANT
HEALTH = 14
VERSION = Nov 16 2010, 14:42:29
Side = 0, DRAM SIZE = 1006 MBytes

CP[0] located at IPMG [0 0 23]


cp 1 23 PASS -- STDBY

SYSTEM STATE = REDUNDANT
DISK STATE = REDUNDANT
HEALTH = 14
VERSION = Nov 16 2010, 14:42:29
Side = 1, DRAM SIZE = 1006 MBytes

CP[1] located at IPMG [0 1 23]

=========================================
Summary of Local System Resource (side 0)
=========================================
File Descriptors
-----------------
alloc 86
free 1962
total 2048

Unprotected Heap (bytes)
------------------------
alloc 78128936
free 443149736
total 521278672

Protected Heap (bytes)
----------------------
alloc 1083964
free 3110340
total 4194304


если же сделать sysload active, то core1 становится активным.. и уже в этом случае выполнение команды scpu правильно переключает управление с core1 на core0

Nooob
05.10.2012, 15:36
т.е суть такова - что переключение с 1го на 0й проходит гладко, а вот при переключение с 0го на 1й процессор вначале все идет нормально, и stat cpu показывает активность 1го процессора.. потом оп... активируется 0й после инициализации.

С_Стар
05.10.2012, 15:39
в pdt логи смотреть

Urri
05.10.2012, 15:41
После какой инициализации?

С_Стар
05.10.2012, 15:46
После какой инициализации?
SRPT752 INI 0: INI completed in 4 seconds

Nooob
05.10.2012, 16:21
>ld 135
CCED000
.stat cpu

cp 0 23 PASS -- ENBL

SYSTEM STATE = REDUNDANT
DISK STATE = REDUNDANT
HEALTH = 14
VERSION = Nov 16 2010, 14:42:29
Side = 0, DRAM SIZE = 1006 MBytes

CP[0] located at IPMG [0 0 23]


cp 1 23 PASS -- STDBY

SYSTEM STATE = REDUNDANT
DISK STATE = REDUNDANT
HEALTH = 14
VERSION = Nov 16 2010, 14:42:29
Side = 1, DRAM SIZE = 1006 MBytes

CP[1] located at IPMG [0 1 23]

=========================================
Summary of Local System Resource (side 0)
=========================================
File Descriptors
-----------------
alloc 87
free 1961
total 2048

Unprotected Heap (bytes)
------------------------
alloc 77185696
free 444092976
total 521278672

Protected Heap (bytes)
----------------------
alloc 1055296
free 3139008
total 4194304


.scpu
OK
.
SRPT048 DR: Master asked to stop updates and flush file system.

CCED763 LCS: Graceful Switchover. Call Processing Stopped at 14:30:36

SRPT118 CM: Server connection lost.

SRPT021 LCS: hsp state changed from LcsHspStUp to LcsHspStDown.

SRPT131 CM: Primary client can't connect to the other side.

CCED762 SWO: Graceful switch-over from side 0 to side 1 completed
Previous Graceful SWO: at 5/10/12 14:25:11

CCED764 LCS: Graceful Switchover. Call Processing Resumed at 14:30:41

SRPT041 LCS: Graceful switchover command successful.

SRPT027 HB: Cannot detect heartbeat from other core

SRPT2290 HB: HBWaitEtherRep - HB threshold exceeded.

CCED000
.stat cpu

cp 1 23 PASS -- ENBL

SYSTEM STATE = NOT REDUNDANT (SINGLE)
DISK STATE = NOT REDUNDANT
HEALTH = 14
VERSION = Nov 16 2010, 14:42:29
Side = 1, DRAM SIZE = 1006 MBytes

CP[1] located at IPMG [0 1 23]


cp 0 UNKNOWN PASS -- STDBY

SYSTEM STATE = NOT AVAILABLE
DISK STATE = NOT REDUNDANT
HEALTH = 0
VERSION = Nov 16 2010, 14:42:29
Side = 0, DRAM SIZE NOT AVAILABLE

CP[0] located at IPMG [UNKNOWN]

=========================================
Summary of Local System Resource (side 1)
=========================================
File Descriptors
-----------------
alloc 83
free 1965
total 2048

Unprotected Heap (bytes)
------------------------
alloc 77257608
free 444021064
total 521278672

Protected Heap (bytes)
----------------------
alloc 1055296
free 3139008
total 4194304
.
ТУТ Rlogin сессия обрывается.. восстанавливаем её.

INI010 0000006E 0000006F
ACDR ACTIVATED

TTY 12 SCH MTC BUG CTY 14:31
OVL111 BKGD 44
>
CSA003 16 14:31:36 5/10/2012

CSA105 16, 14:31:36 5/10/2012

CSA104 16, 14:31:42 5/10/2012

CDN002 16 7299000 14 31 42

CDN002 16 17700 14 31 42

CDN002 16 17709 14 31 42

CDN002 16 17710 14 31 42

CDN002 16 17711 14 31 42

TTY 12 SCH MTC BUG CTY 14:31
OVL111 BKGD 44

>
.logi admin2
PASS?
SEC0029 SECURITY WARNING: THIS SYSTEM CONTAINS INSECURE PASSWORDS, NOTIFY YOUR SYSTEM ADMINISTRATOR

.
TTY #12 LOGGED IN ADMIN2 14:31 5/10/2012

>
The software and data stored on this system are the property of,
or licensed to, Avaya Inc. and are lawfully available only to
authorized users for approved purposes. Unauthorized access to
any software or data on this system is strictly prohibited and
punishable under appropriate laws. If you are not an authorized
user then logout immediately. This system may be monitored for
operational purposes at any time.

.stat cpu

cp 0 23 PASS -- ENBL

SYSTEM STATE = NOT REDUNDANT (SINGLE)
DISK STATE = NOT REDUNDANT
HEALTH = 14
VERSION = Nov 16 2010, 14:42:29
Side = 0, DRAM SIZE = 1006 MBytes

CP[0] located at IPMG [0 0 23]


cp 1 UNKNOWN PASS -- STDBY

SYSTEM STATE = NOT AVAILABLE
DISK STATE = UNKNOWN
HEALTH = 0
VERSION =
Side = 1, DRAM SIZE NOT AVAILABLE

CP[1] located at IPMG [UNKNOWN]

=========================================
Summary of Local System Resource (side 0)
=========================================
File Descriptors
-----------------
alloc 86
free 1962
total 2048

Unprotected Heap (bytes)
------------------------
alloc 78489936
free 442788736
total 521278672

Protected Heap (bytes)
----------------------
alloc 1083964
free 3110340
total 4194304
.
CSA105 16, 14:32:00 5/10/2012

CSA002 16 14:32:00 5/10/2012 6

AUD000
****
>*
>
OVL000
>
OVL000
>
SEC105 USR=initialization, EVT=global port access state change to off, RESULT=success

SRPT107 Hardware reset reason = Software INI [register value = 0x4]

SEC119 Local authentication is being used.

SRPT026 HB: Local side health change from 0 to 14.

SRPT019 LCS: redundancy state changed from LcsUnknownRed to LcsRedundant.

SRPT131 CM: Primary client can't connect to the other side.

SRPT751 INI 0: starting INI on side 0 due to System Restart
Previous INI: side 0 at 5/10/12 13:21:19
INIs since switch-over ( 5/10/12 14:25:11): 1, 1
INIs since cold start ( 5/10/12 13:21:19): 1, 1

ELAN028 The following server failed to register it's pbxLink.
IP <192.168.3.5>, host name <ss1prm>, server type <UNKN>.

ELAN028 The following server failed to register it's pbxLink.
IP <192.168.3.6>, host name <ss2sec>, server type <UNKN>.

SRPT752 INI 0: INI completed in 3 seconds

ELAN019 ELAN Server enabled after M1 INIT

ELAN014 ELAN 16 host IP=192.168.3.13 is enabled

SEC054 A device has connected to, or disconnected from, a pseudo tty without authentica
ting

SRPT179 STARTUP: IPMG Controller task completed successfully.

SRPT091 HB: Local side IPL health change from 0 to 8.

ELAN014 ELAN 0 host IP=192.168.3.4 is enabled

ELAN014 ELAN 0 host IP=192.168.3.6 is enabled

ELAN014 ELAN 0 host IP=192.168.3.3 is enabled

ELAN014 ELAN 0 host IP=192.168.3.5 is enabled

SRPT017 OMM: IP link is UP between Call Server and IPMG[0 0]

SRPT017 OMM: IP link is UP between Call Server and IPMG[0 1]

SRPT226 Registration has been granted for IP[192.168.3.3] IPMG[0 0]


SRPT226 Registration has been granted for IP[192.168.3.4] IPMG[0 1]


SRPT4619 WARNING: Last Archive Procedure had failed
No archives were completed since Oct 05 12:41:00 2012


SRPT093 AML: local side AML connection 16 to 192.168.3.13 health change:2.

SRPT091 HB: Local side AML health change from 0 to 2.

ELAN007 ELAN 16 host IP=192.168.3.13 disabled, read from socket
fail due to far end disconnect or Ethernet problems

SRPT091 HB: Local side AML health change from 2 to 0.

SRPT110 CM: Primary client connection established.

SRPT117 CM: Server connection established.

SRPT021 LCS: hsp state changed from LcsHspStDown to LcsHspStUp.

SRPT028 HB: Heartbeat detected from remote core.

SRPT024 CPM: CPM started protected memory sync.

SRPT023 CPM: CPM completed protected memory sync.

SRPT033 DR: Master is starting disk sync.

SRPT043 LCS: Protected memory and disk synchronization complete.

SRPT050 DR: Disk sync completed.

SRPT091 HB: Remote side IPL health change from 0 to 8.

OVL000
>
OVL000
>ld 135
CCED000
.stat cpu

cp 0 23 PASS -- ENBL

SYSTEM STATE = REDUNDANT
DISK STATE = REDUNDANT
HEALTH = 14
VERSION = Nov 16 2010, 14:42:29
Side = 0, DRAM SIZE = 1006 MBytes

CP[0] located at IPMG [0 0 23]


cp 1 23 PASS -- STDBY

SYSTEM STATE = REDUNDANT
DISK STATE = REDUNDANT
HEALTH = 14
VERSION = Nov 16 2010, 14:42:29
Side = 1, DRAM SIZE = 1006 MBytes

CP[1] located at IPMG [0 1 23]

=========================================
Summary of Local System Resource (side 0)
=========================================
File Descriptors
-----------------
alloc 83
free 1965
total 2048

Unprotected Heap (bytes)
------------------------
alloc 78122920
free 443155752
total 521278672

Protected Heap (bytes)
----------------------
alloc 1083964
free 3110340
total 4194304

Urri
05.10.2012, 16:33
Чего-то я не догоняю. Переключили процессора и сделали инициализацию? Или она сама произошла? После инита станция как правило на 0-м проце стартует. потом если включено тестирование с переключением, то переключается на 1-й.

С_Стар
05.10.2012, 17:07
INI0010 xx
IGS or MGS faults seen from the standby CPU, where xx is the maintenance
display code in HEX.
This message only appears if the standby CPU may be used but with a
degradation of network or I/O access as shown by comparing INI0002 with
INI0008, INI0003 with INI0009, and INI0007 with INI0010.

Искать по солюшенам.
Процы одного винтеджа?

TheRam
05.10.2012, 17:21
Не обязательно фиджи. Ох, ещё как "чудесато" бывает, рассказывал...например про оборванный провод в межCCном кабеле...

С_Стар
05.10.2012, 17:22
Не обязательно фиджи. Ох, ещё как "чудесато" бывает, рассказывал...например про оборванный провод в межCCном кабеле...

Ешка :) в 1010

Nooob
05.10.2012, 17:35
Чего-то я не догоняю. Переключили процессора и сделали инициализацию? Или она сама произошла? После инита станция как правило на 0-м проце стартует. потом если включено тестирование с переключением, то переключается на 1-й.

При переключении с 0 на 1 - на некоторое время 1й становится активным, а потом управление переходит опять на 0й с инициализацией

Сделать активным 1й можно лишь перезагрузкой 0го.
Переключение с 1го на 0й проходит гладко

Инициализация проходит во время

Urri
05.10.2012, 17:46
Патчи на 0-м проце. Сплит. Переустановка софта, патчей и базы на 1-м проце. Джойн.

TheRam
05.10.2012, 18:02
Ешка :) в 1010
опс...и правда ешка, не обратил внимания. :eek:
А откуда видно, что 1010? :confused:

С_Стар
05.10.2012, 22:13
опс...и правда ешка, не обратил внимания. :eek:
А откуда видно, что 1010? :confused:
CP[0] located at IPMG [0 0 23]

CP[1] located at IPMG [0 1 23]

МолодЕж увидела