#1 Le 02/05/2017, à 10:49
- eeried
Smartmontools smartctl - interpréter les infos
Bonjour,
Je n’ai pas trouvé comment interpréter sans panique inutile les infos données par smartmontools. Je sais que tout n’est pas à prendre à la lettre.
J’ai fait le test long dont voici le simple résultat:
smartctl 6.5 2016-01-24 r4214 [i686-linux-4.4.0-75-generic] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed without error 00% 15560 -
# 2 Short offline Completed without error 00% 15559 -
Selective Self-tests/Logging not supported
Voici le résultat de la commande
smartctl -a /dev/sda
smartctl 6.5 2016-01-24 r4214 [i686-linux-4.4.0-75-generic] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Model Family: SAMSUNG SpinPoint P80
Device Model: SAMSUNG SP1604N
Serial Number: S013J10X330249
Firmware Version: TM100-24
User Capacity: 160 041 885 696 bytes [160 GB]
Sector Size: 512 bytes logical/physical
Device is: In smartctl database [for details use: -P show]
ATA Version is: ATA/ATAPI-7 T13/1532D revision 0
Local Time is: Mon May 1 17:29:19 2017 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x00) Offline data collection activity
was never started.
Auto Offline Data Collection: Disabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 5760) seconds.
Offline data collection
capabilities: (0x1b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
No Conveyance Self-test supported.
No Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
No General Purpose Logging support.
Short self-test routine
recommended polling time: ( 1) minutes.
Extended self-test routine
recommended polling time: ( 96) minutes.
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000b 100 100 051 Pre-fail Always - 10
3 Spin_Up_Time 0x0007 064 052 000 Pre-fail Always - 6208
4 Start_Stop_Count 0x0032 095 095 000 Old_age Always - 5889
5 Reallocated_Sector_Ct 0x0033 253 253 010 Pre-fail Always - 0
7 Seek_Error_Rate 0x000b 253 253 051 Pre-fail Always - 0
8 Seek_Time_Performance 0x0024 253 253 000 Old_age Offline - 0
9 Power_On_Half_Minutes 0x0032 097 097 000 Old_age Always - 15561h+12m
10 Spin_Retry_Count 0x0013 253 253 049 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 097 097 000 Old_age Always - 3123
194 Temperature_Celsius 0x0022 154 109 000 Old_age Always - 28
195 Hardware_ECC_Recovered 0x000a 100 100 000 Old_age Always - 329739738
196 Reallocated_Event_Count 0x0012 253 253 000 Old_age Always - 0
197 Current_Pending_Sector 0x0033 253 253 010 Pre-fail Always - 0
198 Offline_Uncorrectable 0x0031 253 253 010 Pre-fail Offline - 0
199 UDMA_CRC_Error_Count 0x000b 100 100 051 Pre-fail Always - 0
200 Multi_Zone_Error_Rate 0x000b 100 100 051 Pre-fail Always - 0
201 Soft_Read_Error_Rate 0x000b 100 100 051 Pre-fail Always - 0
SMART Error Log Version: 1
ATA Error Count: 349 (device log contains only the most recent five errors)
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.
Error 349 occurred at disk power-on lifetime: 15558 hours (648 days + 6 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 fe 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
ef 05 fe 00 00 00 40 00 00:00:43.250 SET FEATURES [Enable APM]
c8 00 c8 38 72 57 e0 00 00:00:43.188 READ DMA
c8 00 20 20 7a 09 e0 00 00:00:43.188 READ DMA
c8 00 20 98 dd 6d e0 00 00:00:43.188 READ DMA
c8 00 00 c0 f5 61 e0 00 00:00:43.188 READ DMA
Error 348 occurred at disk power-on lifetime: 15558 hours (648 days + 6 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 fe 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
ef 05 fe 00 00 00 40 00 00:00:36.000 SET FEATURES [Enable APM]
c8 00 08 38 2e 81 e0 00 00:00:36.000 READ DMA
c8 00 60 e0 05 46 e0 00 00:00:36.000 READ DMA
c8 00 08 a0 25 81 e0 00 00:00:36.000 READ DMA
c8 00 70 88 eb 07 e0 00 00:00:36.000 READ DMA
Error 347 occurred at disk power-on lifetime: 15558 hours (648 days + 6 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 fe 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
ef 05 fe 00 00 00 40 00 01:55:32.625 SET FEATURES [Enable APM]
c8 00 18 58 81 44 e0 00 01:55:32.625 READ DMA
c8 00 08 d0 a9 88 e0 00 01:55:32.563 READ DMA
c8 00 20 38 81 44 e0 00 01:55:32.563 READ DMA
c8 00 08 18 0f 81 e0 00 01:55:32.563 READ DMA
Error 346 occurred at disk power-on lifetime: 15557 hours (648 days + 5 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 fe 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
ef 05 fe 00 00 00 40 00 01:22:12.000 SET FEATURES [Enable APM]
ca 00 10 18 08 96 e0 00 01:22:12.000 WRITE DMA
ca 00 08 50 61 94 e0 00 01:22:12.000 WRITE DMA
ca 00 08 68 80 93 e0 00 01:22:12.000 WRITE DMA
ca 00 08 b0 3e 88 e0 00 01:22:12.000 WRITE DMA
Error 345 occurred at disk power-on lifetime: 15557 hours (648 days + 5 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
04 51 fe 00 00 00 40 Error: ABRT
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
ef 05 fe 00 00 00 40 00 01:22:03.750 SET FEATURES [Enable APM]
c8 00 08 68 3d 81 e0 00 01:22:03.750 READ DMA
c8 00 60 e0 05 46 e0 00 01:22:03.750 READ DMA
c8 00 08 f0 2a 81 e0 00 01:22:03.750 READ DMA
c8 00 88 40 06 46 e0 00 01:22:03.688 READ DMA
Voici mes questions:
*Comment lire cette ligne du premier tableau? le nombre 10 est important? ou est-ce la comparaison entre Thresh (=051) et Value (=100)?
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000b 100 100 051 Pre-fail Always - 10
* Pre-Fail veut dire que le disque va claquer ou c’est une indication générale sur la ligne 1 ou autre lignes?
*Est-ce qu’il y a d’autres lignes inquiétantes dans ce tableau?
*Les erreurs 349 à 345: ça veut dire quoi?
Merci de votre aide :-)
Libres-Ailé(e)s association pour GNU/Linux et le monde du Libre (Haute-Loire)
Hors ligne
#2 Le 02/05/2017, à 12:02
- serged
Re : Smartmontools smartctl - interpréter les infos
La signification est dans Wikipédia ou mieux le Wikipeda anglais.
Le "Pre-fail" signifie simplement que cet attribut est signe de "prefailure" : Si le nombre est raisonnable, pas de problème.
Mode bourrin :
Utiliser un utilitaire graphique (comme GSmartControl), il marquera en rouge les attributs inquiétants...
LinuxMint Vera Cinnamon et d'autres machines en MATE, XFCE... 20.x , 21.x ou 19.x
Tour : Asus F2A55 / AMD A8-5600K APU 3,6GHz / RAM 16Go / Nvidia GeForce GT610 / LM21.1 Cinnamon
Portable : LDLC Mercure MH : Celeron N3450 /RAM 4Go / Intel HD graphics 500 i915 / biboot Win 10 (sur SSD) - LM21.1 MATE (sur HDD)
Hors ligne
#3 Le 04/05/2017, à 20:16
- eeried
Re : Smartmontools smartctl - interpréter les infos
Merci serged. Il y a effectivement plein d’infos sur la page en anglais, très utile.
Si je comprends bien, il faut lire le chiffre RAW_Value (ça paraît logique):
The following chart lists some S.M.A.R.T. attributes and the typical meaning of their raw values
Je n’ai pas l’impression que la page de Wikipedia explique les erreurs 349 à 345, qui suivent plus bas après le tableau chez moi.
Je vais essayer GSmartControl.
Libres-Ailé(e)s association pour GNU/Linux et le monde du Libre (Haute-Loire)
Hors ligne