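Dell server. MegaCli reports the array state as Degraded, and one drive on the front panel is blinking amber while all the others are green.
I ran smartctl -H /dev/bus/0 -d megaraid,N against every disk (including the blinking one)
to check the health status, and they all came back OK.
Is this a disk problem or a RAID configuration problem, and how do I narrow it down? Any pointers would be appreciated. I'm just a code monkey who writes CRUD apps and have no idea how to work with this stuff.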
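For reference, the per-disk health check described above can be sketched as a small loop. This is only an illustration: the function name `smart_health_all` and the `SMARTCTL` override variable are made up for this sketch, and it assumes the MegaRAID device IDs run contiguously from 0, which is not guaranteed on a real controller.

```shell
#!/bin/sh
# Sketch: print the SMART health line for each drive behind a MegaRAID controller.
# Assumptions (not from the post): device IDs are 0..count-1, and SMARTCTL is a
# hypothetical override for the smartctl binary.
smart_health_all() {
    count="$1"                # number of physical drives to probe
    dev="${2:-/dev/bus/0}"    # device node, as used in the post
    i=0
    while [ "$i" -lt "$count" ]; do
        # -H prints only the overall health verdict for the drive
        status=$("${SMARTCTL:-smartctl}" -H "$dev" -d "megaraid,$i" 2>/dev/null |
            grep -i 'health' | head -n 1)
        echo "megaraid,$i: ${status:-no response}"
        i=$((i + 1))
    done
}
```

Something like `smart_health_all 8` would then probe IDs 0 through 7. If the IDs are not contiguous, they have to be taken from the controller's own drive listing instead.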
1
onehao28 2019-06-05 10:11:56 +08:00
Checking with smartctl -H alone isn't reliable. Run smartctl -a and look at the detailed values. Most likely the disk has too many bad sectors.
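As a rough sketch of what to look for in the `smartctl -a` output of a SAS drive, the filter below pulls out three numbers that tend to say more than the one-line verdict: the grown defect list size, the number of failed self-tests, and the total uncorrected errors. The function name `smart_red_flags` is hypothetical, and the field labels are assumed to match smartctl's usual SCSI output.

```shell
#!/bin/sh
# Sketch: summarize warning signs from `smartctl -a` output (SAS/SCSI format) on stdin.
# smart_red_flags is a made-up helper name; the field labels are assumed to match
# what smartctl prints for SCSI drives.
smart_red_flags() {
    awk '
        /Elements in grown defect list:/ { defects = $NF }      # sectors remapped in the field
        /^# *[0-9]+ .*Failed/            { failed++ }           # failed self-test log entries
        /^(read|write|verify):/          { uncorrected += $NF } # last column: uncorrected errors
        END {
            printf "grown_defects=%d failed_selftests=%d uncorrected_errors=%d\n",
                   defects, failed, uncorrected
        }'
}
```

Usage would be along the lines of `smartctl -a /dev/bus/0 -d megaraid,N | smart_red_flags`. A nonzero defect count or any failed self-test is worth following up on even when the health status says OK.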
|
2
lucifer9 2019-06-05 10:42:33 +08:00
Get a Dell engineer to deal with it. You've paid for support, so why make trouble for yourself?
|
3
qwerthhusn OP
@onehao28 Could you take a look at this for me?
```
smartctl 6.7 2018-08-04 r4747 [x86_64-linux-3.10.0-957.12.2.el7.x86_64] (daily-20180804)
Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Vendor:               SEAGATE
Product:              ST1200MM0099
Revision:             ST31
Compliance:           SPC-4
User Capacity:        1,200,243,695,616 bytes [1.20 TB]
Logical block size:   512 bytes
Formatted with type 2 protection
8 bytes of protection information per logical block
LU is fully provisioned
Rotation Rate:        10000 rpm
Form Factor:          2.5 inches
Logical Unit id:      0x5000c500bca87617
Serial number:        WFK2RM60
Device type:          disk
Transport protocol:   SAS (SPL-3)
Local Time is:        Wed Jun 5 09:10:55 2019 UTC
SMART support is:     Available - device has SMART capability.
SMART support is:     Enabled
Temperature Warning:  Disabled or Not Supported
Read Cache is:        Enabled
Writeback Cache is:   Disabled

=== START OF READ SMART DATA SECTION ===
SMART Health Status: OK

Grown defects during certification <not available>
Total blocks reassigned during format <not available>
Total new blocks reassigned = 1
Power on minutes since format <not available>
Current Drive Temperature:     34 C
Drive Trip Temperature:        60 C

Manufactured in week 49 of year 2018
Specified cycle count over device lifetime:  10000
Accumulated start-stop cycles:  20
Specified load-unload count over device lifetime:  300000
Accumulated load-unload cycles:  225
Elements in grown defect list: 39

Vendor (Seagate Cache) information
  Blocks sent to initiator = 2181055139
  Blocks received from initiator = 2453981
  Blocks read from cache and sent to initiator = 13569935
  Number of read and write commands whose size <= segment size = 35257
  Number of read and write commands whose size > segment size = 0

Vendor (Seagate/Hitachi) factory information
  number of hours powered up = 12.98
  number of minutes until next internal SMART test = 51

Error counter log:
           Errors Corrected by           Total   Correction     Gigabytes    Total
               ECC          rereads/    errors   algorithm      processed    uncorrected
           fast | delayed   rewrites  corrected  invocations   [10^9 bytes]  errors
read:   2180959242        1         0  2180959243          0       1134.149           0
write:           0        0         0           0          0          1.352           0
verify:       1237        0         0        1237         26          0.000           0

Non-medium error count:        0

SMART Self-test log
Num  Test              Status                   segment  LifeTime  LBA_first_err [SK ASC ASQ]
     Description                                number   (hours)
# 1  Background long   Failed in first segment      96        5                - [0x4 0x44 0x0]

Long (extended) Self-test duration: 6723 seconds [112.0 minutes]

scsiPrintBackgroundResults Failed [Input/output error]
Protocol Specific port log page for SAS SSP
relative target port id = 1
  generation code = 2
  number of phys = 1
  phy identifier = 0
    attached device type: SAS or SATA device
    attached reason: unknown
    reason: loss of dword synchronization
    negotiated logical link rate: phy enabled; 12 Gbps
    attached initiator port: ssp=1 stp=1 smp=1
    attached target port: ssp=0 stp=0 smp=0
    SAS address = 0x5000c500bca87615
    attached SAS address = 0x54cd98f0469eb107
    attached phy identifier = 7
    Invalid DWORD count = 0
    Running disparity error count = 0
    Loss of DWORD synchronization = 6
    Phy reset problem = 0
    Phy event descriptors:
     Invalid word count: 0
     Running disparity error count: 0
     Loss of dword synchronization count: 6
     Phy reset problem count: 0
relative target port id = 2
  generation code = 2
  number of phys = 1
  phy identifier = 1
    attached device type: no device attached
    attached reason: unknown
    reason: unknown
    negotiated logical link rate: phy enabled; unknown
    attached initiator port: ssp=0 stp=0 smp=0
    attached target port: ssp=0 stp=0 smp=0
    SAS address = 0x5000c500bca87616
    attached SAS address = 0x0
    attached phy identifier = 0
    Invalid DWORD count = 0
    Running disparity error count = 0
    Loss of DWORD synchronization = 0
    Phy reset problem = 0
    Phy event descriptors:
     Invalid word count: 0
     Running disparity error count: 0
     Loss of dword synchronization count: 0
     Phy reset problem count: 0
```
|
4
crc8 2019-06-05 17:55:37 +08:00
It's a disk problem. A status of OK only means the drive hasn't yet reached the point where it would report not OK.
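One way to cross-check this from the controller's side is to compare the per-drive error counters that MegaCli keeps, since the RAID firmware can mark a drive as bad well before SMART trips. The filter below is a minimal sketch that condenses `MegaCli -PDList -aALL` output into one line per drive. The name `pd_summary` is hypothetical, and the field labels are assumed from typical MegaCli output, so they may need adjusting for a given version.

```shell
#!/bin/sh
# Sketch: condense `MegaCli -PDList -aALL` output (on stdin) to one line per drive.
# pd_summary is a made-up helper name. Field labels are assumed from typical
# MegaCli output; "Firmware state" is assumed to come after the counters.
pd_summary() {
    awk -F': *' '
        /^Slot Number/              { slot  = $2 }
        /^Device Id/                { id    = $2 }   # the N in -d megaraid,N
        /^Media Error Count/        { media = $2 }
        /^Other Error Count/        { other = $2 }
        /^Predictive Failure Count/ { pred  = $2 }
        /^Firmware state/ {
            printf "slot=%s id=%s media=%s other=%s predictive=%s state=%s\n",
                   slot, id, media, other, pred, $2
        }'
}
```

Run as `MegaCli -PDList -aALL | pd_summary` (the binary may be installed as MegaCli64 on some systems). The drive whose state is not "Online, Spun Up", or whose counters are nonzero, is the one to line up with the amber light.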
|