AnsweredAssumed Answered

RAID 1 continual drive disconnect (x399, Threadripper 1920x, Win 10 x64 build 1903 June 2019)

Question asked by mattfl on Jul 6, 2019
Latest reply on Jul 16, 2019 by mattfl

Can anyone help me figure out why one drive in my RAID 1 array continually disconnects?  Time between disconnects is measured in hours.  I can clean the drive and add it back as a spare and it rebuilds fine, only to "disconnect" again a few hours later.  It may be a coincidence, but the problem first popped up after running Windows Update on June 30, though the machine was less than 2 days old at this point so I cannot say with 100% certainty that the update was to blame.  I know there was a compatibility problem with the older RAID drivers and the latest Windows 10, is this perhaps a new driver issue with the latest RAID driver?  Any tips on resolving this would be greatly appreciated. 

 

System details:

  • New clean "Windows 10 Pro for Workstations" x64  install, version 1903, build 18362.207, installed June 28, fully updated (windows update) June 30
  • Two 2TB Samsung 860 EVO SSD's in the RAID-1 array
  • Ryzen Threadripper 1920x, not overclocked
  • ASRock Taichi x399 motherboard
  • AMD RAID driver version 9.2.0.120 (build date May 13, 2019)
  • SCSI Disk Device driver (disk.sys, EhStorClass.sys, partmgr.sys) version 10.0.18362.1 built June 21, 2006 (WinBuild 160101.0800)
  • Latest AMD chipset drivers as of June 28, 2019
  • 32GB ECC Ram

 

 

Here is my RAID log that includes a disconnect, rebuild and second disconnect: 

 

6/28/2019 11:13:05 AM - Initialized messages space, starting message logging.
6/28/2019 11:27:32 AM - Device disconnected on controller 0 channel 1.
6/28/2019 11:27:32 AM - Drive failed on controller 0 channel 1.
6/28/2019 11:27:32 AM - Array 1 has gone critical.
06/28/19 11:59:55 RC CGI Service Start
06/28/19 11:59:55 Dumping current configuration to log file.
06/28/19 11:59:55 BufferSize: 1923
CO: 1 GE: 21 NM: //./Core1/Route0/Device0 BN: 9.2.0-00120
CD: 0 CN: VSTOR CSN: 0x000040004b008590 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 8 VI: 0x8590 DI: 0x1022 SVI: 0xab0b SDI: 0xffff ID: 0x0000859010220000
CD: 1 CN: AMD-RAID CSN: 0x000000004b2a1d00 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 8 VI: 0x1022 DI: 0x43bd SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
CD: 2 CN: AMD-RAID CSN: 0x000000004b2a1d01 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 1 VI: 0x1022 DI: 0x7917 SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
CD: 3 CN: AMD-RAID CSN: 0x000000004b2a1d02 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 1 VI: 0x1022 DI: 0x7917 SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
PD: 0 TY: Core ST: Online ID: 0x0000859010220000 NM: //./Core1/Route0/Device65535
PD: 1 TY: Disk ST: Online ID: 0x20c398f13f50b683 NM: //./Core1/Route1/Device0 CD: 0 CH: 0 RT: 1 HD: 0 FE: 0x21ed001c GS: N SI: 0x00000000e8e00000 LS: 0x0000000000000000 TS: 0x0000000000000000 MO: ^Samsung SSD 860 EVO 2TB ^ SN: ^S3YUNB0M507279L_____^ FW: RVT03B6Q PT: 0x00100b04
VI: NONE
LD: 0 ST: CRITICAL ID: 0x0000da586e7b3a76 NM: //./Core1/Route0/Device1 OSN: NONE RT: 0 HD: 1 SI: 0x00000000e8cfb000 1ST: 1 2ND: 2 CA: RW CTS: 1 TY: RAID1 GE: 0x00000006 TPST: 0x0000000000000000 TPSI: 0 TPC: 0.0000 TP: 20 TTY: NOT_ACTIVE TST: COMPLETED DS: 0 PRE: 0 AT: 0x0001 RC: 0 CF: 0x0000000000000000 CT: 0x0000000000000000 DP: ^Boot_2TB-Samsung^
LE: 0 RT: 1 HD: 0 ID: 0x20c398f13f50b683 SO: 0x0000000000100000 SS: 0x00000000e8d00000 DO: 0x0000000000105000 DS: 0x00000000e8cfb000 ER: NoError
LE: 1 RT: 1 HD: 36864 ID: 0x7db1127b3f6385b3 SO: 0x0000000000100000 SS: 0x00000000e8d00000 DO: 0x0000000000105000 DS: 0x00000000e8cfb000 ER: Fail_Offline
PA: 0 RT: 0 HD: 65535 ID: 0x000040000000b105

6/28/2019 12:00:01 PM - Initialized messages space, starting message logging.
6/28/2019 12:00:01 PM - Array 1 has gone critical.
06/28/19 12:14:46 RC CGI Service Start
06/28/19 12:14:46 Dumping current configuration to log file.
06/28/19 12:14:46 BufferSize: 2762
CO: 1 GE: 24 NM: //./Core1/Route0/Device0 BN: 9.2.0-00120
CD: 0 CN: VSTOR CSN: 0x000040004b008590 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 8 VI: 0x8590 DI: 0x1022 SVI: 0xb685 SDI: 0xffff ID: 0x0000859010220000
CD: 1 CN: AMD-RAID CSN: 0x000000004b2a1d00 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 8 VI: 0x1022 DI: 0x43bd SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
CD: 2 CN: AMD-RAID CSN: 0x000000004b2a1d01 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 1 VI: 0x1022 DI: 0x7917 SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
CD: 3 CN: AMD-RAID CSN: 0x000000004b2a1d02 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 1 VI: 0x1022 DI: 0x7917 SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
PD: 0 TY: Core ST: Online ID: 0x0000859010220000 NM: //./Core1/Route0/Device65535
PD: 1 TY: Disk ST: Online ID: 0x20c398f13f50b683 NM: //./Core1/Route1/Device0 CD: 0 CH: 0 RT: 1 HD: 0 FE: 0x21ed001c GS: N SI: 0x00000000e8e00000 LS: 0x0000000000000000 TS: 0x0000000000000000 MO: ^Samsung SSD 860 EVO 2TB ^ SN: ^S3YUNB0M507279L_____^ FW: RVT03B6Q PT: 0x00100b04
VI: NONE
PD: 2 TY: Unknown ST: Online ID: 0x7db1127b3f6385b3 NM: //./Core1/Route1/Device1 CD: 0 CH: 1 RT: 1 HD: 1 FE: 0x21ed001c GS: N SI: 0x00000000e8e088b0 LS: 0x0000000000000000 TS: 0x0000000000000000 MO: ^Samsung SSD 860 EVO 2TB ^ SN: ^S3YUNB0M507276F_____^ FW: RVT03B6Q PT: 0x00100b04
VI: NONE
LD: 0 ST: CRITICAL ID: 0x0000da586e7b3a76 NM: //./Core1/Route0/Device1 OSN: NONE RT: 0 HD: 1 SI: 0x00000000e8cfb000 1ST: 1 2ND: 2 CA: RW CTS: 1 TY: RAID1 GE: 0x00000006 TPST: 0x0000000000000000 TPSI: 0 TPC: 0.0000 TP: 20 TTY: NOT_ACTIVE TST: COMPLETED DS: 0 PRE: 0 AT: 0x0001 RC: 0 CF: 0x0000000000000000 CT: 0x0000000000000000 DP: ^Boot_2TB-Samsung^
LE: 0 RT: 1 HD: 0 ID: 0x20c398f13f50b683 SO: 0x0000000000100000 SS: 0x00000000e8d00000 DO: 0x0000000000105000 DS: 0x00000000e8cfb000 ER: NoError
LE: 1 RT: 1 HD: 36864 ID: 0x7db1127b3f6385b3 SO: 0x0000000000100000 SS: 0x00000000e8d00000 DO: 0x0000000000105000 DS: 0x00000000e8cfb000 ER: Fail_Offline
PA: 0 RT: 0 HD: 65535 ID: 0x000040000000b105
LD: 1 ST: NORMAL ID: 0x7db1127b3f6385b3 NM: //./Core1/Route0/Device2 OSN: NONE RT: 0 HD: 2 SI: 0x00000000e8e088b0 1ST: 1 2ND: 1 CA: NC CTS: 1 TY: LEGACY GE: 0x00000000 TPST: 0x0000000000000000 TPSI: 0 TPC: 0.0000 TP: 0 TTY: NOT_ACTIVE TST: NOT_ACTIVE DS: 0 PRE: 0 AT: 0x0000 RC: 0 CF: 0x0000000000000000 CT: 0x0000000000000000 DP: NONE
LE: 0 RT: 1 HD: 1 ID: 0x7db1127b3f6385b3 SO: 0x0000000000000000 SS: 0x00000000e8e088b0 DO: 0x0000000000000000 DS: 0x00000000e8e088b0 ER: NoError
PA: 0 RT: 0 HD: 65535 ID: 0x0000859010220000

6/28/2019 12:14:51 PM - Initialized messages space, starting message logging.
6/28/2019 12:14:51 PM - A new unknown device was inserted on controller 0 channel 1.
6/28/2019 12:14:51 PM - Array 1 has gone critical.
6/28/2019 12:14:51 PM - Drive failed on controller 0 channel 1.
6/28/2019 12:15:57 PM - A user request has set identify state for controller 0, slot 1.
6/28/2019 12:16:02 PM - A user request has cleared identify state for controller 0, slot 1.
6/28/2019 12:16:42 PM - Core rescan command received.
6/28/2019 12:19:07 PM - Array 2 has been deleted.
6/28/2019 12:19:07 PM - Drive Initialized on controller 0 channel 1.
6/28/2019 12:19:17 PM - Core rescan command received.
06/28/19 12:22:42 RC CGI Service Start
06/28/19 12:22:42 Dumping current configuration to log file.
06/28/19 12:22:42 BufferSize: 2229
CO: 1 GE: 31 NM: //./Core1/Route0/Device0 BN: 9.2.0-00120
CD: 0 CN: VSTOR CSN: 0x000040004b008590 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 8 VI: 0x8590 DI: 0x1022 SVI: 0xdd07 SDI: 0xffff ID: 0x0000859010220000
CD: 1 CN: AMD-RAID CSN: 0x000000004b2a1d00 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 8 VI: 0x1022 DI: 0x43bd SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
CD: 2 CN: AMD-RAID CSN: 0x000000004b2a1d01 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 1 VI: 0x1022 DI: 0x7917 SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
CD: 3 CN: AMD-RAID CSN: 0x000000004b2a1d02 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 1 VI: 0x1022 DI: 0x7917 SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
PD: 0 TY: Core ST: Online ID: 0x0000859010220000 NM: //./Core1/Route0/Device65535
PD: 1 TY: Disk ST: Online ID: 0x20c398f13f50b683 NM: //./Core1/Route1/Device0 CD: 0 CH: 0 RT: 1 HD: 0 FE: 0x21ed001c GS: N SI: 0x00000000e8e00000 LS: 0x0000000000000000 TS: 0x0000000000000000 MO: ^Samsung SSD 860 EVO 2TB ^ SN: ^S3YUNB0M507279L_____^ FW: RVT03B6Q PT: 0x00100b04
VI: NONE
PD: 2 TY: Disk ST: Online ID: 0x7db1127b3f6385b3 NM: //./Core1/Route1/Device1 CD: 0 CH: 1 RT: 1 HD: 1 FE: 0x21ed001c GS: N SI: 0x00000000e8e00000 LS: 0x00000000e8d00000 TS: 0x00000000e8d00000 MO: ^Samsung SSD 860 EVO 2TB ^ SN: ^S3YUNB0M507276F_____^ FW: RVT03B6Q PT: 0x00100b04
VI: NONE
LD: 0 ST: CRITICAL ID: 0x0000da586e7b3a76 NM: //./Core1/Route0/Device1 OSN: NONE RT: 0 HD: 1 SI: 0x00000000e8cfb000 1ST: 1 2ND: 2 CA: RW CTS: 1 TY: RAID1 GE: 0x00000006 TPST: 0x0000000000000000 TPSI: 0 TPC: 0.0000 TP: 20 TTY: NOT_ACTIVE TST: COMPLETED DS: 0 PRE: 0 AT: 0x0001 RC: 0 CF: 0x0000000000000000 CT: 0x0000000000000000 DP: ^Boot_2TB-Samsung^
LE: 0 RT: 1 HD: 0 ID: 0x20c398f13f50b683 SO: 0x0000000000100000 SS: 0x00000000e8d00000 DO: 0x0000000000105000 DS: 0x00000000e8cfb000 ER: NoError
LE: 1 RT: 0 HD: 36864 ID: 0x0000000000000000 SO: 0x0000000000000000 SS: 0x0000000000000000 DO: 0x0000000000000000 DS: 0x0000000000000000 ER: Fail_Offline
PA: 0 RT: 0 HD: 65535 ID: 0x000040000000b105

6/28/2019 12:22:47 PM - Initialized messages space, starting message logging.
6/28/2019 12:22:47 PM - Array 1 has gone critical.
6/28/2019 12:24:27 PM - Core rescan command received.
6/28/2019 12:24:52 PM - Drive assigned as global spare on controller 0 channel 1.
6/28/2019 12:24:52 PM - A rebuild task has been initiated for Array 1.
6/28/2019 12:24:52 PM - A rebuild task has resumed on Array 1.
6/28/2019 2:16:45 PM - A rebuild task has completed on Array 1.
6/28/2019 4:59:33 PM - Device disconnected on controller 0 channel 1.
6/28/2019 4:59:33 PM - Drive failed on controller 0 channel 1.
6/28/2019 4:59:33 PM - Array 1 has gone critical.
6/28/2019 5:11:50 PM - A new unknown device was inserted on controller 0 channel 1.
6/28/2019 5:11:51 PM - Drive failed on controller 0 channel 1.
6/28/2019 5:13:46 PM - Core rescan command received.
6/28/2019 5:15:01 PM - Core rescan command received.
06/28/19 17:17:23 RC CGI Service Start
06/28/19 17:17:23 Dumping current configuration to log file.
06/28/19 17:17:23 BufferSize: 2763
CO: 1 GE: 47 NM: //./Core1/Route0/Device0 BN: 9.2.0-00120
CD: 0 CN: VSTOR CSN: 0x000040004b008590 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 8 VI: 0x8590 DI: 0x1022 SVI: 0x8a8e SDI: 0xffff ID: 0x0000859010220000
CD: 1 CN: AMD-RAID CSN: 0x000000004b2a1d00 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 8 VI: 0x1022 DI: 0x43bd SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
CD: 2 CN: AMD-RAID CSN: 0x000000004b2a1d01 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 1 VI: 0x1022 DI: 0x7917 SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
CD: 3 CN: AMD-RAID CSN: 0x000000004b2a1d02 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 1 VI: 0x1022 DI: 0x7917 SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
PD: 0 TY: Core ST: Online ID: 0x0000859010220000 NM: //./Core1/Route0/Device65535
PD: 1 TY: Disk ST: Online ID: 0x20c398f13f50b683 NM: //./Core1/Route1/Device0 CD: 0 CH: 0 RT: 1 HD: 0 FE: 0x21ed001c GS: N SI: 0x00000000e8e00000 LS: 0x0000000000000000 TS: 0x0000000000000000 MO: ^Samsung SSD 860 EVO 2TB ^ SN: ^S3YUNB0M507279L_____^ FW: RVT03B6Q PT: 0x00100b04
VI: NONE
PD: 2 TY: Unknown ST: Online ID: 0x7db1127b3f6385b3 NM: //./Core1/Route1/Device1 CD: 0 CH: 1 RT: 1 HD: 1 FE: 0x21ed001c GS: Y SI: 0x00000000e8e088b0 LS: 0x0000000000000000 TS: 0x0000000000000000 MO: ^Samsung SSD 860 EVO 2TB ^ SN: ^S3YUNB0M507276F_____^ FW: RVT03B6Q PT: 0x00100b04
VI: NONE
LD: 0 ST: CRITICAL ID: 0x0000da586e7b3a76 NM: //./Core1/Route0/Device1 OSN: NONE RT: 0 HD: 1 SI: 0x00000000e8cfb000 1ST: 1 2ND: 2 CA: RW CTS: 1 TY: RAID1 GE: 0x00000006 TPST: 0x0000000000000000 TPSI: 0 TPC: 0.0000 TP: 20 TTY: NOT_ACTIVE TST: NOT_ACTIVE DS: 0 PRE: 0 AT: 0x0001 RC: 0 CF: 0x0000000000000000 CT: 0x0000000000000000 DP: ^Boot_2TB-Samsung^
LE: 0 RT: 1 HD: 0 ID: 0x20c398f13f50b683 SO: 0x0000000000100000 SS: 0x00000000e8d00000 DO: 0x0000000000105000 DS: 0x00000000e8cfb000 ER: NoError
LE: 1 RT: 1 HD: 36864 ID: 0x7db1127b3f6385b3 SO: 0x0000000000100000 SS: 0x00000000e8d00000 DO: 0x0000000000105000 DS: 0x00000000e8cfb000 ER: Fail_Offline
PA: 0 RT: 0 HD: 65535 ID: 0x000040000000b105
LD: 1 ST: NORMAL ID: 0x7db1127b3f6385b3 NM: //./Core1/Route0/Device2 OSN: NONE RT: 0 HD: 2 SI: 0x00000000e8e088b0 1ST: 1 2ND: 1 CA: NC CTS: 1 TY: LEGACY GE: 0x00000000 TPST: 0x0000000000000000 TPSI: 0 TPC: 0.0000 TP: 0 TTY: NOT_ACTIVE TST: NOT_ACTIVE DS: 0 PRE: 0 AT: 0x0000 RC: 0 CF: 0x0000000000000000 CT: 0x0000000000000000 DP: NONE
LE: 0 RT: 1 HD: 1 ID: 0x7db1127b3f6385b3 SO: 0x0000000000000000 SS: 0x00000000e8e088b0 DO: 0x0000000000000000 DS: 0x00000000e8e088b0 ER: NoError
PA: 0 RT: 0 HD: 65535 ID: 0x0000859010220000

Outcomes