Network Security Internet Technology Development Database Servers Mobile Phone Android Software Apple Software Computer Software News IT Information

In addition to Weibo, there is also WeChat

Please pay attention

WeChat public account

Shulou

Sun Solaris XSCF fault diagnosis

2025-02-22 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Servers >

Share

Shulou(Shulou.com)06/03 Report--

1 、 showhardconf

The showhardconf command can be used to display information about each FRU. The information that can be displayed is as follows:

Current configuration and status of ■

Number of FRU installed by ■

■ domain information

■ IOBOX Information

Name attribute of ■ PCI card

XSCF > showhardconf

SPARC Enterprise M4000

+ Serial:BDF1115196; Operator_Panel_Switch:Locked

+ Power_Supply_System:Single; SCF-ID:XSCF#0

+ System_Power:On; System_Phase:Cabinet Power On

Domain#0 Domain_Status:Running

MBU_A Status:Normal; Ver:4301h; Serial:BD1114008E

+ FRU-Part-Number:CF00541-4359 01 / 541-4359-01

+ Memory_Size:64 GB

+ Type:2

CPUM#0-CHIP#0 Status:Normal; Ver:0601h; Serial:PP105300QG

+ FRU-Part-Number:CA06761-D205 C1 / 371-4932-03

+ Freq:2.660 GHz; Type:48

+ Core:4; Strand:2

CPUM#0-CHIP#1 Status:Normal; Ver:0601h; Serial:PP105300QG

+ FRU-Part-Number:CA06761-D205 C1 / 371-4932-03

+ Freq:2.660 GHz; Type:48

+ Core:4; Strand:2

CPUM#1-CHIP#0 Status:Normal; Ver:0601h; Serial:PP104903Y5

+ FRU-Part-Number:CA06761-D205 C1 / 371-4932-03

+ Freq:2.660 GHz; Type:48

+ Core:4; Strand:2

CPUM#1-CHIP#1 Status:Normal; Ver:0601h; Serial:PP104903Y5

+ FRU-Part-Number:CA06761-D205 C1 / 371-4932-03

+ Freq:2.660 GHz; Type:48

+ Core:4; Strand:2

MEMB#0 Status:Normal; Ver:0101h; Serial:BF1109220C

+ FRU-Part-Number:CF00541-0545 09 / 541-0545-09

MEM#0A Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-3f244b4c

+ Type:2A; Size:2 GB

MEM#0B Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-3f83e611

+ Type:2A; Size:2 GB

MEM#1A Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-3f53e611

+ Type:2A; Size:2 GB

MEM#1B Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-3f444b4b

+ Type:2A; Size:2 GB

* MEM#2A Status:Degraded

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-3f63e609

+ Type:2A; Size:2 GB

MEM#2B Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-3f83e5fa

+ Type:2A; Size:2 GB

MEM#3A Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-3f444b4c

+ Type:2A; Size:2 GB

MEM#3B Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-3f344b4c

+ Type:2A; Size:2 GB

MEMB#1 Status:Normal; Ver:0101h; Serial:BF1036E3DX

+ FRU-Part-Number:CF00541-0545 09 / 541-0545-09

MEM#0A Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-5274b16d

+ Type:2A; Size:2 GB

MEM#0B Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-5214c262

+ Type:2A; Size:2 GB

MEM#1A Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-5234c261

+ Type:2A; Size:2 GB

MEM#1B Status:Normal

+ Code:ce0000000000000001M3 93T5660QZA-CE6 4151-481382de

+ Type:2A; Size:2 GB

MEM#2A Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-5e649f87

+ Type:2A; Size:2 GB

MEM#2B Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-5264b175

+ Type:2A; Size:2 GB

MEM#3A Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-5274b170

+ Type:2A; Size:2 GB

MEM#3B Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-5234c268

+ Type:2A; Size:2 GB

MEMB#2 Status:Normal; Ver:0101h; Serial:BF1051HK5T

+ FRU-Part-Number:CF00541-0545 09 / 541-0545-09

MEM#0A Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-4833ce5e

+ Type:2A; Size:2 GB

MEM#0B Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-4813ce45

+ Type:2A; Size:2 GB

MEM#1A Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-4843ce5f

+ Type:2A; Size:2 GB

MEM#1B Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-4833ce5c

+ Type:2A; Size:2 GB

MEM#2A Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-4813ce5e

+ Type:2A; Size:2 GB

MEM#2B Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-4883341c

+ Type:2A; Size:2 GB

MEM#3A Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-48833439

+ Type:2A; Size:2 GB

MEM#3B Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-48733428

+ Type:2A; Size:2 GB

MEMB#3 Status:Normal; Ver:0101h; Serial:BF1040EUC8

+ FRU-Part-Number:CF00541-0545 09 / 541-0545-09

MEM#0A Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-4823a1a3

+ Type:2A; Size:2 GB

MEM#0B Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-48731182

+ Type:2A; Size:2 GB

MEM#1A Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-4823a19c

+ Type:2A; Size:2 GB

MEM#1B Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-48631182

+ Type:2A; Size:2 GB

MEM#2A Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-4823a19a

+ Type:2A; Size:2 GB

MEM#2B Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-4833a19a

+ Type:2A; Size:2 GB

MEM#3A Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-48831186

+ Type:2A; Size:2 GB

MEM#3B Status:Normal

+ Code:ad0000000000000001HYMP125P72CP4-Y5 4141-4813a1a2

+ Type:2A; Size:2 GB

DDC_A#0 Status:Normal

DDC_A#1 Status:Normal

DDC_B#0 Status:Normal

IOU#0 Status:Normal; Ver:0101h; Serial:BF110617KB

+ FRU-Part-Number:CF00541-2240 05 / 541-2240-05

+ Type:1

DDC_A#0 Status:Normal

DDCR Status:Normal

DDC_B#0 Status:Normal

PCI#2 Name_Property:network; Card_Type:Other

PCI#3 Name_Property:SUNW,qlc; Card_Type:Other

PCI#4 Name_Property:SUNW,qlc; Card_Type:Other

XSCFU Status:Normal,Active; Ver:0101h; Serial:BF11071FKN

+ FRU-Part-Number:CF00541-0481 05 / 541-0481-05

OPNL Status:Normal; Ver:0101h; Serial:NN11052TLU

+ FRU-Part-Number:CF00541-0850 06 / 541-0850-06

PSU#0 Status:Normal; Serial:0017527-1108023275

+ FRU-Part-Number:CF00300-2311 0150 / 2311-01-50

+ Power_Status:On; AC:200 V

PSU#1 Status:Normal; Serial:0017527-1012024046

+ FRU-Part-Number:CF00300-2011 0250 / 2011-02-50

+ Power_Status:On; AC:200 V

FAN_A#0 Status:Normal

FAN_A#1 Status:Normal

FANBP_B Status:Normal; Ver:0401h; Serial:NN110736WD

+ FRU-Part-Number:CF00541-3098 01 / 541-3098-01

FAN_B#0 Status:Normal

FAN_B#1 Status:Normal

XSCF >

2 、 showlogs

The showlogs command can be used to display the contents of a specified log in timestamp order starting from the earliest date. Showlogs

The command displays the following logs:

■ error log

■ Power Log

■ event Log

■ temperature and humidity record

■ Monitoring message Log

■ console message Log

■ Emergency message Log

■ IPL message Log

XSCF > showlogs error

Date: May 05 15:03:27 CST 2014 Code: 80002000-c6ff0000-0104340700000000

Status: Alarm Occurred: May 05 15:03:26.996 CST 2014

FRU: / FAN_A#0

Msg: Unit disappeared unexpectedly

Date: May 05 15:04:23 CST 2014 Code: 80002000-c6ff0000-0104080100000000

Status: Alarm Occurred: May 05 15:04:23.572 CST 2014

FRU: / FAN_A#0

Msg: Unit detected unexpectedly

Date: May 05 15:06:53 CST 2014 Code: 80002000-c6ff0000-0104340700000000

Status: Alarm Occurred: May 05 15:06:53.420 CST 2014

FRU: / FAN_A#0

Msg: Unit disappeared unexpectedly

Date: May 05 15:07:34 CST 2014 Code: 80002000-c6ff0000-0104080100000000

Status: Alarm Occurred: May 05 15:07:34.836 CST 2014

FRU: / FAN_A#0

Msg: Unit detected unexpectedly

Date: Feb 07 13:20:46 CST 2016 Code: 80002000-c3ff0000-0104320100000000

Status: Alarm Occurred: Feb 07 13:20:44.966 CST 2016

FRU: / PSU#1

Msg: PSU failed

Date: Jan 23 02:36:06 CST 2018 Code: 60000000-8a2a0000-10cc000000000000

Status: Warning Occurred: Jan 23 02:36:05.765 CST 2018

FRU: / MBU_A/MEMB#1/MEM#1B

Msg: DIMM permanent correctable error

Date: Sep 06 13:11:15 CST 2018 Code: 60000000-8a2a0000-10cc000000000000

Status: Warning Occurred: Sep 06 13:11:15.396 CST 2018

FRU: / MBU_A/MEMB#0/MEM#2A

Msg: DIMM permanent correctable error

3 、 showstatus

Showstatus can be used to display information about degraded FRU on the server. The degraded unit uses an asterisk (*)

Indicates, and any of the following states are displayed:

■ Normal

■ Faulted

■ Degraded

■ Deconfigured

■ Maintenance

XSCF > showstatus

MBU_A Status:Normal

MEMB#0 Status:Normal

* MEM#2A Status:Degraded

4 、 fmadump

Bash-3.2# fmdump

TIME UUID SUNW-MSG-ID

Sep 06 13V 04U 37.2512 168620e1-a275-e9ed-bbff-d8f9da784bc8 SUN4U-8000-2S

Bash-3.2# fmdump-V-u 168620e1-a275-e9ed-bbff-d8f9da784bc8

TIME UUID SUNW-MSG-ID

Sep 06 2018 13 04VR 37.251251000 168620e1-a275-e9ed-bbff-d8f9da784bc8 SUN4U-8000-2S

Nvlist version: 0

Version = 0x0

Class = list.suspect

Uuid = 168620e1-a275-e9ed-bbff-d8f9da784bc8

Code = SUN4U-8000-2S

Diag-time = 1536210277 204244

De = (embedded nvlist)

Nvlist version: 0

Version = 0x0

Scheme = fmd

Authority = (embedded nvlist)

Nvlist version: 0

Version = 0x0

Product-id = SUNW,SPARC-Enterprise

Chassis-id = BDF1115196

Server-id = sunm4k_1

(end authority)

Mod-name = cpumem-diagnosis

Mod-version = 1.7

(end de)

Fault-list-sz = 0x1

Topo-uuid = 4ede8959-9768-eb1c-b6f5-f9f9af63c97c

Fault-list = (array of embedded nvlists)

(start fault-list [0])

Nvlist version: 0

Version = 0x0

Class = fault.memory.dimm

Certainty = 0x5f

Asru = (embedded nvlist)

Nvlist version: 0

Version = 0x0

Scheme = mem

Unum = / MBU_A/MEMB0/MEM2A

Serial = 3F63E609:HYMP125P72CP4-Y5

Authority = (embedded nvlist)

Nvlist version: 0

Product-id = SUNW,SPARC-Enterprise

Server-id = sunm4k_1

(end authority)

(end asru)

Fru = (embedded nvlist)

Nvlist version: 0

Version = 0x0

Scheme = mem

Unum = / MBU_A/MEMB0/MEM2A

Serial = 3F63E609:HYMP125P72CP4-Y5

Authority = (embedded nvlist)

Nvlist version: 0

Product-id = SUNW,SPARC-Enterprise

Server-id = sunm4k_1

(end authority)

(end fru)

(end fault-list [0])

Fault-status = 0x1

Severity = Major

_ _ ttl = 0x1

_ _ tod = 0x5b90b565 0xef9c938

Bash-3.2#

When using the-V option, the user will see at least three other lines of output:

The first line of ■ is a summary of the information previously displayed in console messages, but now includes timestamps, UUID, and

Message ID.

The second line of the ■ is a statement about the diagnostic determination. In this example, you can be confident that the fault occurred in the ASIC shown

Medium. Diagnostics may involve multiple components, where multiple lines are displayed, for example, two lines are shown here, each line description

A component.

The ■ line that begins with "FRU" declares the parts that must be replaced to bring the server back to full normal state.

The line in ■ that starts with "rsrc" indicates which component this failure caused.

Bash-3.2# fmdump-e

TIME CLASS

Jan 23 2018 02:01:14 ereport.asic.mac.mi-ce

Jan 23 2018 02:01:14 ereport.asic.mac.ptrl-ce

Jan 23 2018 02:01:24 ereport.asic.mac.mi-ce

Jan 23 2018 02:01:35 ereport.asic.mac.mi-ce

Jan 23 2018 02:01:35 ereport.asic.mac.ptrl-ce

Jan 23 2018 02:01:46 ereport.asic.mac.mi-ce

Jan 23 2018 02:01:57 ereport.asic.mac.ptrl-ce

.

5 、 fmadm faulty/config

Bash-3.2# fmadm faulty

-

TIME EVENT-ID MSG-ID SEVERITY

-

Sep 06 13:04:37 168620e1-a275-e9ed-bbff-d8f9da784bc8 SUN4U-8000-2S Major

Host: sunm4k_1

Platform: SUNW,SPARC-Enterprise Chassis_id: BDF1115196

Product_sn:

Fault class: fault.memory.dimm 95%

Affects: mem:///unum=/MBU_A/MEMB0/MEM2A

Faulted but still in service

FRU: mem:///unum=/MBU_A/MEMB0/MEM2A 95%

Faulty

Serial ID.: 3F63E609:HYMP125P72CP4-Y5

Description: The number of correctable errors associated with this memory

Module has exceeded acceptable levels.

Response: Pages of memory associated with this memory module have been

Removed from service, up to a limit which has now been reached.

Impact: Total system memory capacity has been reduced.

Action: Use 'fmadm faulty' to provide a more detailed view of this event.

Please refer to the associated reference document at

Http://sun.com/msg/SUN4U-8000-2S for the latest service

Procedures and policies regarding this diagnosis.

Bash-3.2# fmadm config

MODULE VERSION STATUS DESCRIPTION

Cpumem-diagnosis 1.7 active CPU/Memory Diagnosis

Cpumem-retire 1.1 active CPU/Memory Retire Agent

Disk-transport 1.0 active Disk Transport Agent

Eft 1.16 active eft diagnosis engine

Event-transport 2.0 active Event Transport Module

Ext-event-transport 0.1 active External FM event transport

Fabric-xlate 1.0 active Fabric Ereport Translater

Fmd-self-diagnosis 1.0 active Fault Manager Self-Diagnosis

Fps-transport 1.0 active Solaris FP-Scrubber

Io-retire 1.0 active I/O Retire Agent

Snmp-trapgen 1.0 active SNMP Trap Generation Agent

Sysevent-transport 1.0 active SysEvent Transport Agent

Syslog-msgs 1.0 active Syslog Messaging Agent

Zfs-diagnosis 1.0 active ZFS Diagnosis Engine

Zfs-retire 1.0 active ZFS Retire Agent

6 、 fmstat

XSCF > fmstat

Module ev_recv ev_acpt wait svc_t w b open solve memsz bufsz

Eft 0 0 0.0 0.0 0 0 0 3.3M 0

Event-transport 0 0 0.0 0.0 0 0 0 6.4K 0

Faultevent-post 2 0 0.0 8.9 0 0 0

Fmd-self-diagnosis 24 24 0.0 352.1 0 0 1 0 24b 0

Iox_agent 0 0 0.0 0.0 0 0 0

Reagent 0 0 0.0 0.0 0 0 0

Sysevent-transport 00 0.0 8700.4 00 00 00

Syslog-msgs 0 0 0.0 0.0 0 0 0 97b 0

Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.

Views: 0

*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.

Share To

Servers

Wechat

© 2024 shulou.com SLNews company. All rights reserved.

12
Report