Network Security Internet Technology Development Database Servers Mobile Phone Android Software Apple Software Computer Software News IT Information

In addition to Weibo, there is also WeChat

Please pay attention

WeChat public account

Shulou

You only need this step to deal with the IBM V7000 disk failure!

2025-01-18 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Servers >

Share

Shulou(Shulou.com)06/02 Report--

1. IBM V7000 brief:

The first mid-range storage 2.0 product independently developed by IBM breaks the tradition in architecture and absorbs the essence of XIV, the scale-out architecture of DS and IBM. The disk array, which integrates the three major storage efficiency of "EasyTier auto-tiering", "virtualization" and "thin provisioning" for the first time, makes IBM Storwize V7000 a killer product in the midrange storage market that pays more attention to storage efficiency. For the first time, high-end storage technologies such as DS8000's RAID technology and auto-tiering, SVC virtualization architecture and XIV's delightful management interface are applied to midrange storage, with enterprise-class array capabilities and eye-catching GUI.

Second, fault description

The customer equipment model is IBM V7000 storage, the architecture is AIX+Sybase+V7000 storage array cabinet, and the data to be recovered is mainly stored on the array cabinet, with a total of 12 SAS mechanical hard drives with 600G capacity (one of which is a hot spare).

Due to the IBM V7000 disk failure, there was a problem with another disk during the replacement of disk data synchronization, resulting in the logical disk could not be attached to the minicomputer, and the business was temporarily interrupted. From the storage management interface, two hard drives show that the failure is offline, of which the failed hard disk in slot 10 is a hot spare, and the hard disk in slot 3 is shown in the following figure:

A total of two sets of Mdisk have been created in the customer's array cabinet and added to one pool. Now the customer's main data pool cannot be loaded, and three general-purpose volumes cannot be mounted, as shown in the following figure:

3. Mirrored disk

In order to prevent secondary damage to the original disk caused by misoperation in the process of data recovery, 10 disks are mirrored by winhex software, and the failed hard disk in slot 3 is mirrored by PC3000 (there may be more bad channels). After that, all data recovery operations are carried out on the mirrored disk and will not affect the original disk.

* * IV. Recovery process

Recovery plan 1. Mandatory online operation for storage * *

The main contents are as follows: 1. Analyze the offline order of the failed hard disk in the fault storage.

2. Repair the faulty hard disk that is offline.

3. Insert the repaired hard disk back into storage for forced online operation.

Recovery plan 2. Parsing storage structure

1. Mdisk analysis and recombination

A. according to the partial configuration information given by the customer, the hard disk is classified according to the Mdisk group.

B. analyze all the hard drives in each group of Mdisk to get the relevant raid information.

C. Use professional data recovery software to virtual reorganize Mdisk.

2. Pool analysis

A. analyze all the Mdisk and get the relevant information of pool.

B. Analyze the distribution of pool on Mdisk.

3. LUN structure analysis.

A. analyze the stripe size in pool.

B. Analyze the LUN bitmap and analyze the distribution of each LUN in pool.

C, write a program to extract LUN.

Fifth, drop analysis

According to the characteristics of raid5, it allows a member disk to be offline at most, that is, it can be used normally in the case of a member disk failure. Customer storage devices have failed, and only one hard drive in each Mdisk is offline.

The log stored in V7000 is extracted, and the offline order of each failed hard disk is obtained by analyzing the log.

VI. Verify the data

The generated data is tested by random sampling and there is no problem with the data.

VII. Data transfer

The customer provides the storage device, creates the same number of LUN on the storage device as the previous environment, and copies the image file of the extracted data LUN to the LUN created on the storage by using dd, and gives it to the customer.

VIII. Recovery results

After the data is handed over, the customer reconfigures the storage environment, and everything is normal. The data recovery work has been successfully completed.

Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.

Views: 0

*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.

Share To

Servers

Wechat

© 2024 shulou.com SLNews company. All rights reserved.

12
Report