Portál AbcLinuxu, 11. května 2025 05:09
/dev/sdx
, přičemž lsscsi
ho zase iniciovalo a vidět byl. Vyměnil jsem kabel, na krátko to pomohlo, ale jen asi na dva dny. Později se ho přestal detekovat i BIOS. A za krátko se přestal detekovat i další disk, který je zapojen do přídavné karty s 2xSATA. Takže jsem očekával, že problém může být s disky a pořídil náhradní disky a v externí dvoudiskové dokině jsem provedl ddrescue starý nový
. Výsledek: Žádná chyba, vše se zkopírovalo na první průjezd. Takže předpokládám, že disky jsou v pořádku a pokud ne tak data mám stejně dvakrát. A díky výměně kabelu je už jediné místo, kde může být problém konektor na motherboardu nebo následné obvody. Jak se konektor dá vyčistit? Mohu do něj kápnout kontaktol?
Řešení dotazu:
Vždy jsme používali alkohol (kdysi se prodával v běžných potravinářský obchodech v malých placatých lahvičkách, kolem 90%), náhražkou je isopropylalkohol. U dostupných konektrů se ještě užívala tvrdá guma, u špatně dostupných seřezaná tvrdá párátka (dřevěná).
Kontaktol použít možná někde na kontaktech silnoproudých nebo u auta.
smartctl -a /dev/sdi smartctl 7.0 2018-12-30 r4883 [x86_64-linux-5.3.13-arch1-1] (local build) Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org === START OF INFORMATION SECTION === Model Family: Western Digital Red Device Model: WDC WD40EFRX-68WT0N0 Serial Number: WD-WCC4E0699883 LU WWN Device Id: 5 0014ee 2099c84f7 Firmware Version: 80.00A80 User Capacity: 4 000 787 030 016 bytes [4,00 TB] Sector Sizes: 512 bytes logical, 4096 bytes physical Rotation Rate: 5400 rpm Device is: In smartctl database [for details use: -P show] ATA Version is: ACS-2 (minor revision not indicated) SATA Version is: SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s) Local Time is: Thu Nov 28 20:19:14 2019 CET SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (54780) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 548) minutes. Conveyance self-test routine recommended polling time: ( 5) minutes. SCT capabilities: (0x703d) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0 3 Spin_Up_Time 0x0027 173 173 021 Pre-fail Always - 8341 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 97 5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0 7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 19 10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 97 192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 95 193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 14 194 Temperature_Celsius 0x0022 122 109 000 Old_age Always - 30 196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0 197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0008 100 253 000 Old_age Offline - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Completed without error 00% 19 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.Což sice vypadá hezky, ale je to lež. Nevím jak to ten disk udělal, ale jednak má najeto něco mezi 7-10 tisici hodin. A také projelo mnoho testů. jak short tak long, všechny úspěšně. Po mém dotazu jsem disk vyřadil a dnes se k problému vrátil. (Mám kompresor a tak jsem na konektory pustil tenký paprsek vzduchu asi o 7 barech, což odstraní i dost pevnou špínu) Po vyčištení vzduchem a isopropyl alkoholem úplně všech konektorů jsem systém složil a začínám testovat. Ten výpis jsem dal na to short test. Ted běží long, ale to je do rána. Uvidíme jak to dopadne. Nicméně plán je disk vyřadím z pole, otestuji, pokud bude trochu použitelný, tak na něm budou jen drobnosti z netu, o které je možné kdykoliv přijít.
btrfs scrub
nad 9,6 TB RAID 1 polem. Zatím má 2,11 TB se 6 opravenými chybami a nic více což bych klidně připsal na vrub tomu kabelu/konektoru. I nějaký zápis prošel bez chyb (asi 800 fotek do digikamu) Smart je tohle
smartctl -x /dev/sdi smartctl 7.0 2018-12-30 r4883 [x86_64-linux-5.3.13-arch1-1] (local build) Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org === START OF INFORMATION SECTION === Model Family: Western Digital Red Device Model: WDC WD40EFRX-68WT0N0 Serial Number: WD-WCC4E0699883 LU WWN Device Id: 5 0014ee 2099c84f7 Firmware Version: 80.00A80 User Capacity: 4 000 787 030 016 bytes [4,00 TB] Sector Sizes: 512 bytes logical, 4096 bytes physical Rotation Rate: 5400 rpm Device is: In smartctl database [for details use: -P show] ATA Version is: ACS-2 (minor revision not indicated) SATA Version is: SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s) Local Time is: Fri Nov 29 10:14:08 2019 CET SMART support is: Available - device has SMART capability. SMART support is: Enabled AAM feature is: Unavailable APM feature is: Unavailable Rd look-ahead is: Enabled Write cache is: Enabled DSN feature is: Unavailable ATA Security is: Disabled, NOT FROZEN [SEC1] Wt Cache Reorder: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (54780) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 548) minutes. Conveyance self-test routine recommended polling time: ( 5) minutes. SCT capabilities: (0x703d) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAGS VALUE WORST THRESH FAIL RAW_VALUE 1 Raw_Read_Error_Rate POSR-K 200 200 051 - 0 3 Spin_Up_Time POS--K 173 173 021 - 8341 4 Start_Stop_Count -O--CK 100 100 000 - 97 5 Reallocated_Sector_Ct PO--CK 200 200 140 - 0 7 Seek_Error_Rate -OSR-K 100 253 000 - 0 9 Power_On_Hours -O--CK 100 100 000 - 33 10 Spin_Retry_Count -O--CK 100 253 000 - 0 11 Calibration_Retry_Count -O--CK 100 253 000 - 0 12 Power_Cycle_Count -O--CK 100 100 000 - 97 192 Power-Off_Retract_Count -O--CK 200 200 000 - 95 193 Load_Cycle_Count -O--CK 200 200 000 - 16 194 Temperature_Celsius -O---K 107 107 000 - 45 196 Reallocated_Event_Count -O--CK 200 200 000 - 0 197 Current_Pending_Sector -O--CK 200 200 000 - 0 198 Offline_Uncorrectable ----CK 100 253 000 - 0 199 UDMA_CRC_Error_Count -O--CK 200 200 000 - 0 200 Multi_Zone_Error_Rate ---R-- 200 200 000 - 0 ||||||_ K auto-keep |||||__ C event count ||||___ R error rate |||____ S speed/performance ||_____ O updated online |______ P prefailure warning General Purpose Log Directory Version 1 SMART Log Directory Version 1 [multi-sector log support] Address Access R/W Size Description 0x00 GPL,SL R/O 1 Log Directory 0x01 SL R/O 1 Summary SMART error log 0x02 SL R/O 5 Comprehensive SMART error log 0x03 GPL R/O 6 Ext. Comprehensive SMART error log 0x06 SL R/O 1 SMART self-test log 0x07 GPL R/O 1 Extended self-test log 0x09 SL R/W 1 Selective self-test log 0x10 GPL R/O 1 NCQ Command Error log 0x11 GPL R/O 1 SATA Phy Event Counters log 0x21 GPL R/O 1 Write stream error log 0x22 GPL R/O 1 Read stream error log 0x80-0x9f GPL,SL R/W 16 Host vendor specific log 0xa0-0xa7 GPL,SL VS 16 Device vendor specific log 0xa8-0xb7 GPL,SL VS 1 Device vendor specific log 0xbd GPL,SL VS 1 Device vendor specific log 0xc0 GPL,SL VS 1 Device vendor specific log 0xc1 GPL VS 93 Device vendor specific log 0xe0 GPL,SL R/W 1 SCT Command/Status 0xe1 GPL,SL R/W 1 SCT Data Transfer SMART Extended Comprehensive Error Log Version: 1 (6 sectors) No Errors Logged SMART Extended Self-test Log Version: 1 (1 sectors) Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed without error 00% 30 - # 2 Short offline Completed without error 00% 19 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. SCT Status Version: 3 SCT Version (vendor specific): 258 (0x0102) Device State: Active (0) Current Temperature: 45 Celsius Power Cycle Min/Max Temperature: 24/45 Celsius Lifetime Min/Max Temperature: 23/45 Celsius Under/Over Temperature Limit Count: 0/0 Vendor specific: 01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 SCT Temperature History Version: 2 Temperature Sampling Period: 1 minute Temperature Logging Interval: 1 minute Min/Max recommended Temperature: 0/60 Celsius Min/Max Temperature Limit: -41/85 Celsius Temperature History Size (Index): 478 (287) Index Estimated Time Temperature Celsius 288 2019-11-29 02:17 43 ************************ ... ..(134 skipped). .. ************************ 423 2019-11-29 04:32 43 ************************ 424 2019-11-29 04:33 42 *********************** ... ..(150 skipped). .. *********************** 97 2019-11-29 07:04 42 *********************** 98 2019-11-29 07:05 43 ************************ ... ..( 4 skipped). .. ************************ 103 2019-11-29 07:10 43 ************************ 104 2019-11-29 07:11 42 *********************** ... ..( 3 skipped). .. *********************** 108 2019-11-29 07:15 42 *********************** 109 2019-11-29 07:16 41 ********************** ... ..( 5 skipped). .. ********************** 115 2019-11-29 07:22 41 ********************** 116 2019-11-29 07:23 40 ********************* ... ..( 8 skipped). .. ********************* 125 2019-11-29 07:32 40 ********************* 126 2019-11-29 07:33 39 ******************** ... ..( 32 skipped). .. ******************** 159 2019-11-29 08:06 39 ******************** 160 2019-11-29 08:07 38 ******************* ... ..( 26 skipped). .. ******************* 187 2019-11-29 08:34 38 ******************* 188 2019-11-29 08:35 39 ******************** ... ..( 4 skipped). .. ******************** 193 2019-11-29 08:40 39 ******************** 194 2019-11-29 08:41 40 ********************* ... ..( 3 skipped). .. ********************* 198 2019-11-29 08:45 40 ********************* 199 2019-11-29 08:46 41 ********************** ... ..( 4 skipped). .. ********************** 204 2019-11-29 08:51 41 ********************** 205 2019-11-29 08:52 42 *********************** ... ..( 19 skipped). .. *********************** 225 2019-11-29 09:12 42 *********************** 226 2019-11-29 09:13 43 ************************ ... ..( 18 skipped). .. ************************ 245 2019-11-29 09:32 43 ************************ 246 2019-11-29 09:33 44 ************************* ... ..( 32 skipped). .. ************************* 279 2019-11-29 10:06 44 ************************* 280 2019-11-29 10:07 45 ************************** ... ..( 6 skipped). .. ************************** 287 2019-11-29 10:14 45 ************************** SCT Error Recovery Control: Read: 70 (7,0 seconds) Write: 70 (7,0 seconds) Device Statistics (GP/SMART Log 0x04) not supported Pending Defects log (GP Log 0x0c) not supported SATA Phy Event Counters (GP Log 0x11) ID Size Value Description 0x0001 2 0 Command failed due to ICRC error 0x0002 2 0 R_ERR response for data FIS 0x0003 2 0 R_ERR response for device-to-host data FIS 0x0004 2 0 R_ERR response for host-to-device data FIS 0x0005 2 0 R_ERR response for non-data FIS 0x0006 2 0 R_ERR response for device-to-host non-data FIS 0x0007 2 0 R_ERR response for host-to-device non-data FIS 0x0008 2 0 Device-to-host non-data FIS retries 0x0009 2 3 Transition from drive PhyRdy to drive PhyNRdy 0x000a 2 2 Device-to-host register FISes sent due to a COMRESET 0x000b 2 0 CRC errors within host-to-device FIS 0x000f 2 0 R_ERR response for host-to-device data FIS, CRC 0x0012 2 0 R_ERR response for host-to-device non-data FIS, CRC 0x8000 4 51103 Vendor specificHodnota Power On = 33 a Spin Up = 8341 je divná. Pravda je to spin up. Výkonové testy na write na poli zkusím jak dojede scrub.
Tiskni
Sdílej:
ISSN 1214-1267, (c) 1999-2007 Stickfish s.r.o.