25
PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

Embed Size (px)

Citation preview

Page 1: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

PSEN Server BalanceEN/ICE Procedures

Jean-Charles TournierEN/ICE/SCD

09-September-2015

Page 2: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

Foreword

This procedure lists the actions needed to be completed by EN/ICE in order to allow EN/EL/CO to update PSEN smoothly. It does not cover the operations needed to create/delete/update devices as it is under EN/EL/CO responsibility and covered by their procedures.

Page 3: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

Outline

• Preparatory set up• Procedures• Common issues

Page 4: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

Preparatory Setup

• All operations on the data servers should be done as UNICRYO

• On cs-ccr-psen1 and cs-ccr-psen2, open– A terminal with the “top –d 1” command to monitor

the processes– The WinCC OA log viewer

• From cs-ccr-psen1 (but could be also cs-ccr-psen2), open the system overview panel to monitor the redundancy/split status and stop/start of manager progress

Page 5: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

PSEN SERVER BALANCE PROCEDURE

Page 6: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

Procedure Overview

1. Notify EN/ICE standby service2. Perform an online backup3. Let EN/EL/CO import the device definition4. Stop the project on cs-ccr-psen25. Backup the project on cs-ccr-psen26. Restart the project on cs-ccr-psen27. Let EN/EL/CO restart all the managers8. From psen2, go back to redu mode9. Restart all the managers on cs-ccr-psen110. Verify that the update is completed11. Notify EN/ICE standby service

Page 7: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

STEP 1 – Notify EN/ICE SBS

• Send an email to the EN/ICE standby service to ignore all PSEN alarms in Moon during the update.

From: Jean-Charles Tournier <[email protected]>Date: Thursday 2 July 2015 08:36To: "en-dep-ice-piquet (Members of the EN-ICE Standby Service)" <[email protected]>Subject: PSEN Update

Hello,

Please ignore all PSEN alarms in Moon this morning as we are starting the weekly update. I’ll send an email once the operation is over.

Thanks,jc

Page 8: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

STEP 2 – Online Backup from PSEN2Overview

• From cs-ccr-psen2, perform an online backup of the db folder.

• Make sure to configure the backup for cs-ccr-psen2 in the online backup panel

• The online backup is to be located in:– ~/PVSS_projects/PVSS_backup/psen/before_balance

• The operation takes between 1 and 2 minutes to be completed

Page 9: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

STEP 2 – Online Backup from PSEN2Details

1 - Open System Management

2 – Open online backup

3 – Click on Host 2

4 – Make sure the target path is set correctly to “…./before_balance/”

5 – Click on “Start backup”

Page 10: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

STEP 3 – Import of Devices

• This step is performed by EN/EL/CO• For a typical update, this step takes 1h30 but can

be longer if more devices need to updated/created or if problems are encountered

• To have an idea of the update size, connected to the ENSDM production DB and do:– select count(*) from items;– An update is considered small when it has less than

10,000 items (representing 10,000 DPs to be updated/created in PSEN).

Page 11: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

STEP 4 – Stop the project on PSEN2

• Once informed by EN/EL/CO that they completed the update, stop the project on PSEN2– Open the project administration console– Stop the project psen– Wait for the project to be completely stopped• “ps –eaf | grep WCC” should only give results for the

log viewer and the project administration console

Page 12: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

STEP 5 – Backup of psen2

• From cs-ccr-psen2:– cd ~/PVSS_projects/PVSS_backup/psen– ./psenBackup.sh

– Wait for the backup to complete (it takes about 5 minutes)

• The script is also available at:– https://svnweb.cern.ch/cern/wsvn/en-ice-svn/tru

nk/applications/PSEN/Development/PSEN/Misc/psenBackup.sh

Page 13: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

STEP 6 – Restart the project on PSEN2

• From cs-ccr-psen2, open the administration console and simply restart the project “psen”– Only the core managers will be restarted, i.e. data

event redundancy, etc.– It takes a bit less than 10 minutes for these

managers to restart• Once the project is restarted, notify EN/EL/CO

that they can complete the restart of the project on cs-ccr-psen2

Page 14: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

STEP 7 – Let EN/EL/CO complete the project restart

• If you are asked to do it for them:– Open a UI manager on cs-ccr-psen2 only

• WCCOAui –proj psen –m gedi –data cs-ccr-psen2 –event cs-ccr-psen2 –num 127

– Open the “Manager plan” panel and apply the nominal plan to cs-ccr-psen2 (it takes about 20mn for the plan to complete)

Page 15: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

STEP 8 – Back to redundancy mode• Wait for a green light from EN/EL/CO – they need to export masked alarms before this step• From cs-ccr-psen2 open a gedi

– WCCOAui –proj psen –m gedi– Note: it is really important to start the gedi from cs-ccr-psen2 otherwise newly created DP may

be lost• Open the system management panel• Click on the redundancy button• Make sure to select the right peer to keep as reference (normally you should select cs-

ccr-psen2)– In you are not able to access the system management panel (blank panel with error) or you don’t

have the choice to choose which server to restart check the common issues slides• cs-ccr-psen1 will be restarted and the event manager on both projects will be at 100% for a

while (1-2 minutes on cs-ccr-psen2, more on cs-ccr-psen1)• Wait for the complete restart of cs-ccr-psen1 and to have the message “recovery

completed” from the log of psen1• Copy the LASER configuration files from cs-ccr-psen2 to cs-ccr-psen1

– From cs-ccr-psen1 do• scp cs-ccr-psen2:~/PVSS_projects/psen/psen/data/LASER_CONFIG_FILES/*

~/PVSS_projects/psen/psen/data/LASER_CONFIG_FILES

Page 16: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

STEP 9 – Put PSEN1 into nominal mode

• From a Gedi open either on cs-ccr-psen1 or cs-ccr-psen2, open the manager plan panel– Set it to cs-ccr-psen1– Apply the nominal mode

• This step takes about 20 minutes

Page 17: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

STEP 10 – Verify that the update is completed

• Open a normal PSEN session from e.g. the terminal servers– Make sure that the system integrity square is green– Open the alarm screen and check that alarms are

coming/leaving– Open the DPE finder and search for all tags (about 230K tags

should be found)– Do a search from the event screen– Verify with EN/EL/CO that an alarm can be masked and

unmasked• If it can not, simply stop the PSEN_MaskedAlarm manager on PSEN2,

then on PSEN1, then restart it on PSEN1 and finally on PSEN2 (to avoid to have PSEN2 becoming the active peer while the manager is restarted)

Page 18: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

STEP 11 – Notify EN/ICE Standby service

• Send an email to [email protected] to let them know that the update is completed and they should monitor PSEN in MOON

Page 19: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

COMMON ISSUES

Page 20: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

Error during import of tags for an RTU

• Sometimes import of tags for one or more RTUs can fail with messages similar to

• The problem is fixed with acknowledging any outstanding alarms for the concerned DPE and retrying the import

WCCOAui (127), 2015.09.09 10:20:49.810, CTRL, WARNING, 5/ctrl, Location of the following log entry: Module: Import DB Panel: /opt/unicryo/PVSS_projects/psen/installed_components/panels/vision/PSEN/ENSDB/PSEN_LoadRtu.pnl [] Object: 6 named: "DBLoadButton" of type: PUSH_BUTTON Script: Clicked Library: /opt/unicryo/PVSS_projects/psen/installed_components/scripts/libs/fwConfigs/fwAlertConfig.ctl Line: 1831WCCOAui (127), 2015.09.09 10:20:49.780, PARAM,WARNING, 31, Config type change failed, DP: psen_1:psen-ETC03_slash_BE-EHI04_slash_BE-PSEN_tag_double-6144463.delay_value:_alert_hdl.._type, MAN: (SYS: 530 Ui -num 127 CONN: 2), USER: 37665

Page 21: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

DIP Manager stops/crashes

• When going back from split mode to redu mode, the DIP manager on PSEN2 may stop.– Simply restart it once the event manager of PSEN2 is

at reasonable level (it is normally at 100% for some time during this phase)

• Once PSEN1 is completely restarted and re-become the active peer, the DIP manager on PSEN1 may stop.– You have to stop the DIP manager on PSEN2, then start

DIP on PSEN1 and finally restart DIP on PSEN2

Page 22: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

Alarms are not masked

• Simply restart the PSEN_MaskedAlarm manager– Stop the manager on cs-ccr-psen2– Stop the manager on cs-ccr-psen1– Start the manager on cs-ccr-psen1– Start the manager on cs-ccr-psen1

Page 23: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

Blank System Management Panel - 1/2

Page 24: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

Blank System Management Panel - 2/2

• This is because the dpe SystemNum of _DistManager and _DistManager_2 are not equals

• Clear both of them• The dist manager is not required to run open the system overview panel

Page 25: PSEN Server Balance EN/ICE Procedures Jean-Charles Tournier EN/ICE/SCD 09-September-2015

Cannot Choose Which Server to restart

• If you get the top picture (instead of the bottom one), this is because Gedi has been started with the options “-data …. –event …” and connect to only one peer.

• To get the bottom picture, Gedi needs to be started normally, with simply “WCCOAui –proj psen –m gedi”

Do not click OK as you don’t know which peer will be restart.