3
1/10/2016 Document 2090597.1 https://support.oracle.com/epmos/faces/DocumentDisplay?_adf.ctrlstate=jp3rwim23_65&id=2090597.1 1/3 OGPG 'XPSV &RUH :LWK (UURU 0HVVDJH WLPHRXW ZDLWLQJ IRU RS +9FWOBRSBKHOOR 'RF ,' ,Q WKLV 'RFXPHQW 6\PSWRPV &DXVH 6ROXWLRQ 5HIHUHQFHV APPLIES TO: 6RODULV 63$5& 2SHUDWLQJ 6\VWHP 9HUVLRQ DQG ODWHU 2UDFOH 6RODULV RQ 63$5& ELW SYMPTOMS OGPG GXPSV FRUH HUURU GXH WR +9 UHVSRQVH WLPHRXW RI RS +9FWOBRSBKHOOR 7\SLFDO PHVVDJHV LQ YDUVYFORJOGRPVOGPGGHIDXOWORJ Oct 26 20:01:13 timeout waiting for op HVctl_op_get_bulk_res_stat Oct 26 20:01:13 fatal error: waiting for hv response timeout [ Oct 26 20:01:16 Stopping because process dumped core. ] [ Oct 26 20:01:16 Executing stop method (:kill). ] [ Oct 26 20:01:16 Executing start method ("/opt/SUNWldm/bin/ldmd_start"). ] Oct 26 20:02:17 timeout waiting for op HVctl_op_hello Oct 26 20:02:17 fatal error: waiting for hv response timeout [ Oct 26 20:02:19 Method "start" exited with status 95. ] 3RVVLEOH IPDGP IDXOW\ PHVVDJH TIME EVENTID MSGID SEVERITY Oct 26 09:48:39 0bfe9607f9b7cc19a8c88d1ec4c7b5ea SMF8000YX major Problem Status : isolated Diag Engine : softwarediagnosis / 0.1 System Manufacturer : unknown Name : ORCL,SPARCT42 Part_Number : unknown Serial_Number : 1xxxxxxxx Host_ID : 8xxxxxxx Suspect 1 of 1 : Fault class : defect.sunos.smf.svc.maintenance Certainty : 100% Affects : svc:///ldoms/ldmd:default Status : faulted and taken out of service Description : A service failed a start, stop or refresh method failed. Response : The service has been placed into the maintenance state. Impact : svc:/ldoms/ldmd:default is unavailable.

Timeout Waiting for Op HVctl_op_hello

Embed Size (px)

DESCRIPTION

Timeout Waiting for Op HVctl_op_hello

Citation preview

Page 1: Timeout Waiting for Op HVctl_op_hello

1/10/2016 Document 2090597.1

https://support.oracle.com/epmos/faces/DocumentDisplay?_adf.ctrl­state=jp3rwim23_65&id=2090597.1 1/3

ldmd Dumps Core With Error Message "timeout waiting for op HVctl_op_hello" (Doc ID2090597.1)

In this Document

SymptomsCauseSolutionReferences

APPLIES TO:

Solaris SPARC Operating System - Version 11 11/11 and laterOracle Solaris on SPARC (64-bit)

SYMPTOMS

ldmd dumps core error due to HV response timeout of op HVctl_op_hello

Typical messages in /var/svc/log/ldoms-ldmd:default.log:

Oct 26 20:01:13 timeout waiting for op HVctl_op_get_bulk_res_stat  Oct 26 20:01:13 fatal error: waiting for hv response timeout 

 [ Oct 26 20:01:16 Stopping because process dumped core. ]  [ Oct 26 20:01:16 Executing stop method (:kill). ] 

 [ Oct 26 20:01:16 Executing start method ("/opt/SUNWldm/bin/ldmd_start"). ]  Oct 26 20:02:17 timeout waiting for op HVctl_op_hello 

 Oct 26 20:02:17 fatal error: waiting for hv response timeout  [ Oct 26 20:02:19 Method "start" exited with status 95. ]

Possible fmadm faulty message:

­­­­­­­­­­­­­­­ ­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­  ­­­­­­­­­­­­­­ ­­­­­­­­­  TIME            EVENT­ID                              MSG­ID         SEVERITY  ­­­­­­­­­­­­­­­ ­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­  ­­­­­­­­­­­­­­ ­­­­­­­­­  Oct 26 09:48:39 0bfe9607­f9b7­cc19­a8c8­8d1ec4c7b5ea  SMF­8000­YX    major 

 

Problem Status    : isolated  Diag Engine       : software­diagnosis / 0.1 

 System    Manufacturer  : unknown 

   Name          : ORCL,SPARC­T4­2    Part_Number   : unknown 

   Serial_Number : 1xxxxxxxx    Host_ID       : 8xxxxxxx  

­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­­  Suspect 1 of 1 : 

  Fault class : defect.sunos.smf.svc.maintenance   Certainty   : 100% 

  Affects     : svc:///ldoms/ldmd:default   Status      : faulted and taken out of service 

 

Description : A service failed ­ a start, stop or refresh method failed.  

Response    : The service has been placed into the maintenance state.  

Impact      : svc:/ldoms/ldmd:default is unavailable.  

Page 2: Timeout Waiting for Op HVctl_op_hello

1/10/2016 Document 2090597.1

https://support.oracle.com/epmos/faces/DocumentDisplay?_adf.ctrl­state=jp3rwim23_65&id=2090597.1 2/3

Action      : Run 'svcs ­xv svc:/ldoms/ldmd:default' to determine the generic              reason why the service failed, the location of any logfiles, and              a list of other services impacted. Please refer to the associated              reference document at http://support.oracle.com/msg/SMF­8000­YX 

             for the latest service procedures and policies regarding this              diagnosis.

CAUSE

Issue is caused by encountering fatal error due to timeout for op HVctl_op_hello, HV response timeout

SOLUTION

- The short-term solution is a power-cycle of the system

!! Please ensure that all Domains and resources of the system gets stopped ( shutdown ) *gracefully* !! before performing a power-cycle of the system ( e.g. "stop /SYS" && "start /SYS" in ILOM )

- The midterm/long-term solution is to update the System Firmware to a recent version ( HypV 1.14.2 orhigher ) You might consider to use the downtime for the power-cycle to also update the System Firmware in one step

To find recent System Firmware for the T4-x and T5-x system, please check this URL or the tables belowhttp://www.oracle.com/technetwork/systems/patches/firmware/index.html

Recent System Firmware for T4-x systems:

SysFW      : MachType   : Patch      : Date       : ILOM         :  HypV       : OBP 

8.7.2.b    : T4­1       : 151743­01  : 2015­05­22 : 3.2.5.3.b    : 1.14.2.a   : 4.37.2 8.8.1.d    : T4­1       : 152059­03  : 2015­10­02 : 3.2.5.6.c    : 1.15.1.b   : 4.38.1 8.8.1.e    : T4­1       : 152059­04  : 2015­12­04 : 3.2.5.6.d    : 1.15.1.b   : 4.38.1 

8.7.2.b    : T4­2       : 151744­01  : 2015­05­22 : 3.2.5.3.b    : 1.14.2.a   : 4.37.2 8.8.1.d    : T4­2       : 152060­03  : 2015­10­02 : 3.2.5.6.c    : 1.15.1.b   : 4.38.1 

8.7.2.b    : T4­4       : 151745­01  : 2015­05­22 : 3.2.5.3.b    : 1.14.2.a   : 4.37.2 8.8.1.d    : T4­4       : 152061­03  : 2015­10­02 : 3.2.5.6.c    : 1.15.1.b   : 4.38.1 

8.7.2.b    : T4­1B      : 151746­01  : 2015­05­22 : 3.2.5.3.b    : 1.14.2.a   : 4.37.2 8.8.1.d    : T4­1B      : 152062­03  : 2015­10­02 : 3.2.5.6.c    : 1.15.1.b   : 4.38.1 

8.7.2.b    : NT4­1      : 151747­01  : 2015­05­22 : 3.2.5.3.b    : 1.14.2.a   : 4.37.2 8.8.1.d    : NT4­1      : 152063­03  : 2015­10­02 : 3.2.5.6.c    : 1.15.1.b   : 4.38.1 

8.7.2.b    : NT4­1B     : 151749­01  : 2015­05­22 : 3.2.5.3.b    : 1.14.2.a   : 4.37.2 8.8.1.d    : NT4­1B     : 152065­03  : 2015­10­02 : 3.2.5.6.c    : 1.15.1.b   : 4.38.1 

8.7.2.b    : NT4­2      : 151748­01  : 2015­05­22 : 3.2.5.3.b    : 1.14.2.a   : 4.37.2 8.8.1.d    : NT4­2      : 152064­03  : 2015­10­02 : 3.2.5.6.c    : 1.15.1.b   : 4.38.1

This issue has also been observed on T5-x systems with old SystemFirmware ( 9.0.X.X or Firmware 9.1.X.X )but not seen with newer versions. Hence, we recommend to upgrade the SystemFirmware to a recent version.e.g.:

SysFW      : MachType   : Patch      : Date       : ILOM         :  HypV       : OBP 

9.4.2.e    : T5­2       : 21342652   : 2015­06­30 : 3.2.5.3.e    : 1.14.2.a   : 4.37.2 9.5.1.b    : T5­2       : 21911663   : 2015­10­01 : 3.2.5.6.b    : 1.15.1.a   : 4.38.1 

Page 3: Timeout Waiting for Op HVctl_op_hello

1/10/2016 Document 2090597.1

https://support.oracle.com/epmos/faces/DocumentDisplay?_adf.ctrl­state=jp3rwim23_65&id=2090597.1 3/3

9.4.2.e    : T5­4       : 21342653   : 2015­06­30 : 3.2.5.3.e    : 1.14.2.a   : 4.37.2 9.5.1.b    : T5­4       : 21911664   : 2015­10­01 : 3.2.5.6.b    : 1.15.1.a   : 4.38.1 

9.4.2.e    : T5­8       : 21342653   : 2015­06­30 : 3.2.5.3.e    : 1.14.2.a   : 4.37.2 9.5.1.b    : T5­8       : 21911664   : 2015­10­01 : 3.2.5.6.b    : 1.15.1.a   : 4.38.1 

9.4.2.e    : T5­1B      : 21342654   : 2015­06­30 : 3.2.5.3.e    : 1.14.2.a   : 4.37.2 9.5.1.b    : T5­1B      : 21911665   : 2015­10­01 : 3.2.5.6.b    : 1.15.1.a   : 4.38.1 

9.4.2.e    : NT5­1B     : 21342655   : 2015­06­30 : 3.2.5.3.e    : 1.14.2.a   : 4.37.2 9.5.1.b    : NT5­1B     : 21911667   : 2015­10­01 : 3.2.5.6.b    : 1.15.1.a   : 4.38.1

Note: Please bear in mind that you cannot save the latest LDom changes to SP due to the failing ldmd ldmd SMF service use to be in maintenance state in such cases State: maintenance since <some-date>, Reason: Restarting too quickly.

Hence, you might need to restore the latest auto-saved config from /var/opt/SUNWldm/autosave* directories. MOS Doc ID 1464421.1 contains steps and examples how to perform this. configuration, save & restore setup and troubleshooting of Oracle VM Server for SPARC (LDom) Document1464421.1

- If you see this issue on a bare-metal system without ldom configuration you might considerto temporarily disable ldmd ( # svcadm disable -t ldmd ) until you can schedule a downtime for the SystemFirmware upgrade

REFERENCES

NOTE:1464421.1 - Configuration, Save & Restore Setup and Troubleshooting of Oracle VM Server for SPARC(LDom)

Didn't find what you are looking for?