Bonjour à tous,
Version courte :
Mon KS (ks380507) a eu des erreurs I/O sur le disque, des secteurs en attente d'être réalloués, et après formatage complet (pour finaliser la réallocation) finalement plus rien... Je ne fais plus confiance à ce disque, cependant il m'est difficile de prouver que celui-ci a des problèmes, mis à part l'historique des erreurs SMART. Est-il possible d'en changer quand même ?
Version longue (avec les détails) :
Mon KS a eu de gros soucis de disque dur il y a quelques jours, erreurs I/O, machine bloquée, obligé de rebooter, en rescue pour voir ce qui se passe. SMART indiquait Reallocated_Sector_Ct 0 mais Current_Pending_Sector 1. J'ai donc monté et transféré toutes mes données sur d'autres machines (avec des erreurs de lecture en prime, voir plus bas) et entrepris d'écrire des zéros sur tout le disque afin de réallouer ce secteur définitivement et ensuite faire changer le disque. Problème, à la fin du formatage, le secteur défectueux a disparu : Reallocated_Sector_Ct 0 et Current_Pending_Sector 0 ! Il ne me reste plus que quelques logs et quelques erreurs SMART pour prouver ma bonne foi :
Code:
Error 33 occurred at disk power-on lifetime: 30835 hours (1284 days + 19 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 00 ff ff ff 0f Error: UNC at LBA = 0x0fffffff = 268435455
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
60 00 08 ff ff ff 4f 00 04:44:18.397 READ FPDMA QUEUED
27 00 00 00 00 00 e0 00 04:44:18.397 READ NATIVE MAX ADDRESS EXT
ec 00 00 00 00 00 a0 00 04:44:18.395 IDENTIFY DEVICE
ef 03 46 00 00 00 a0 00 04:44:18.395 SET FEATURES [Set transfer mode]
27 00 00 00 00 00 e0 00 04:44:18.395 READ NATIVE MAX ADDRESS EXT
Error 32 occurred at disk power-on lifetime: 30835 hours (1284 days + 19 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 00 ff ff ff 0f Error: UNC at LBA = 0x0fffffff = 268435455
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
60 00 08 ff ff ff 4f 00 04:44:15.258 READ FPDMA QUEUED
27 00 00 00 00 00 e0 00 04:44:15.258 READ NATIVE MAX ADDRESS EXT
ec 00 00 00 00 00 a0 00 04:44:15.256 IDENTIFY DEVICE
ef 03 46 00 00 00 a0 00 04:44:15.256 SET FEATURES [Set transfer mode]
27 00 00 00 00 00 e0 00 04:44:15.256 READ NATIVE MAX ADDRESS EXT
Error 31 occurred at disk power-on lifetime: 30835 hours (1284 days + 19 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 00 ff ff ff 0f Error: UNC at LBA = 0x0fffffff = 268435455
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
60 00 08 ff ff ff 4f 00 04:44:12.135 READ FPDMA QUEUED
27 00 00 00 00 00 e0 00 04:44:12.135 READ NATIVE MAX ADDRESS EXT
ec 00 00 00 00 00 a0 00 04:44:12.133 IDENTIFY DEVICE
ef 03 46 00 00 00 a0 00 04:44:12.133 SET FEATURES [Set transfer mode]
27 00 00 00 00 00 e0 00 04:44:12.133 READ NATIVE MAX ADDRESS EXT
Error 30 occurred at disk power-on lifetime: 30835 hours (1284 days + 19 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 00 ff ff ff 0f Error: UNC at LBA = 0x0fffffff = 268435455
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
60 00 08 ff ff ff 4f 00 04:44:09.005 READ FPDMA QUEUED
27 00 00 00 00 00 e0 00 04:44:09.004 READ NATIVE MAX ADDRESS EXT
ec 00 00 00 00 00 a0 00 04:44:09.003 IDENTIFY DEVICE
ef 03 46 00 00 00 a0 00 04:44:09.002 SET FEATURES [Set transfer mode]
27 00 00 00 00 00 e0 00 04:44:09.002 READ NATIVE MAX ADDRESS EXT
Error 29 occurred at disk power-on lifetime: 30835 hours (1284 days + 19 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 00 ff ff ff 0f Error: UNC at LBA = 0x0fffffff = 268435455
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
60 00 08 ff ff ff 4f 00 04:44:05.865 READ FPDMA QUEUED
27 00 00 00 00 00 e0 00 04:44:05.865 READ NATIVE MAX ADDRESS EXT
ec 00 00 00 00 00 a0 00 04:44:05.863 IDENTIFY DEVICE
ef 03 46 00 00 00 a0 00 04:44:05.863 SET FEATURES [Set transfer mode]
27 00 00 00 00 00 e0 00 04:44:05.863 READ NATIVE MAX ADDRESS EXT
Code:
ata1.00: exception Emask 0x0 SAct 0x1000 SErr 0x0 action 0x0
ata1.00: irq_stat 0x40000008
ata1.00: failed command: READ FPDMA QUEUED
ata1.00: cmd 60/08:60:90:ad:b0/00:00:e8:00:00/40 tag 12 ncq 4096 in
res 41/40:08:93:ad:b0/00:00:e8:00:00/00 Emask 0x409 (media error)
ata1.00: status: { DRDY ERR }
ata1.00: error: { UNC }
ata1.00: configured for UDMA/133
sd 0:0:0:0: [sda] Unhandled sense code
sd 0:0:0:0: [sda]
Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
sd 0:0:0:0: [sda]
Sense Key : Medium Error [current] [descriptor]
Descriptor sense data with sense descriptors (in hex):
72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00
e8 b0 ad 93
sd 0:0:0:0: [sda]
Add. Sense: Unrecovered read error - auto reallocate failed
sd 0:0:0:0: [sda] CDB:
Read(10): 28 00 e8 b0 ad 90 00 00 08 00
end_request: I/O error, dev sda, sector 3903892883
ata1: EH complete
ata1.00: exception Emask 0x0 SAct 0x2000 SErr 0x0 action 0x0
ata1.00: irq_stat 0x40000008
ata1.00: failed command: READ FPDMA QUEUED
ata1.00: cmd 60/08:68:90:ad:b0/00:00:e8:00:00/40 tag 13 ncq 4096 in
res 41/40:08:93:ad:b0/00:00:e8:00:00/00 Emask 0x409 (media error)
ata1.00: status: { DRDY ERR }
ata1.00: error: { UNC }
ata1.00: configured for UDMA/133
sd 0:0:0:0: [sda] Unhandled sense code
sd 0:0:0:0: [sda]
Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
sd 0:0:0:0: [sda]
Sense Key : Medium Error [current] [descriptor]
Descriptor sense data with sense descriptors (in hex):
72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00
e8 b0 ad 93
sd 0:0:0:0: [sda]
Add. Sense: Unrecovered read error - auto reallocate failed
sd 0:0:0:0: [sda] CDB:
Read(10): 28 00 e8 b0 ad 90 00 00 08 00
end_request: I/O error, dev sda, sector 3903892883
ata1: EH complete
ata1.00: exception Emask 0x0 SAct 0x4000 SErr 0x0 action 0x0
ata1.00: irq_stat 0x40000008
ata1.00: failed command: READ FPDMA QUEUED
ata1.00: cmd 60/08:70:90:ad:b0/00:00:e8:00:00/40 tag 14 ncq 4096 in
res 41/40:08:93:ad:b0/00:00:e8:00:00/00 Emask 0x409 (media error)
ata1.00: status: { DRDY ERR }
ata1.00: error: { UNC }
ata1.00: configured for UDMA/133
sd 0:0:0:0: [sda] Unhandled sense code
sd 0:0:0:0: [sda]
Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
sd 0:0:0:0: [sda]
Sense Key : Medium Error [current] [descriptor]
Descriptor sense data with sense descriptors (in hex):
72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00
e8 b0 ad 93
sd 0:0:0:0: [sda]
Add. Sense: Unrecovered read error - auto reallocate failed
sd 0:0:0:0: [sda] CDB:
Read(10): 28 00 e8 b0 ad 90 00 00 08 00
end_request: I/O error, dev sda, sector 3903892883
ata1: EH complete
Est-ce suffisant pour demander un remplacement de disque ? Je suis en train de le formater à nouveau avec /dev/urandom cette fois-ci afin de voir si il réagit différemment avec des données aléatoires mais j'en doute... mais je n'ai pas envie de tout rebasculer sur ce disque maintenant qu'il m'a fait un coup comme ça.
Merci d'avance !
Yann