cancel
Showing results for 
Search instead for 
Did you mean: 

Drivers & Software

mattfl
Newcomer

RAID 1 continual drive disconnect (x399, Threadripper 1920x, Win 10 x64 build 1903 June 2019)

Can anyone help me figure out why one drive in my RAID 1 array continually disconnects?  Time between disconnects is measured in hours.  I can clean the drive and add it back as a spare and it rebuilds fine, only to "disconnect" again a few hours later.  It may be a coincidence, but the problem first popped up after running Windows Update on June 30, though the machine was less than 2 days old at this point so I cannot say with 100% certainty that the update was to blame.  I know there was a compatibility problem with the older RAID drivers and the latest Windows 10, is this perhaps a new driver issue with the latest RAID driver?  Any tips on resolving this would be greatly appreciated. 

System details:

  • New clean "Windows 10 Pro for Workstations" x64  install, version 1903, build 18362.207, installed June 28, fully updated (windows update) June 30
  • Two 2TB Samsung 860 EVO SSD's in the RAID-1 array
  • Ryzen Threadripper 1920x, not overclocked
  • ASRock Taichi x399 motherboard
  • AMD RAID driver version 9.2.0.120 (build date May 13, 2019)
  • SCSI Disk Device driver (disk.sys, EhStorClass.sys, partmgr.sys) version 10.0.18362.1 built June 21, 2006 (WinBuild 160101.0800)
  • Latest AMD chipset drivers as of June 28, 2019
  • 32GB ECC Ram

Here is my RAID log that includes a disconnect, rebuild and second disconnect: 

6/28/2019 11:13:05 AM - Initialized messages space, starting message logging.
6/28/2019 11:27:32 AM - Device disconnected on controller 0 channel 1.
6/28/2019 11:27:32 AM - Drive failed on controller 0 channel 1.
6/28/2019 11:27:32 AM - Array 1 has gone critical.
06/28/19 11:59:55 RC CGI Service Start
06/28/19 11:59:55 Dumping current configuration to log file.
06/28/19 11:59:55 BufferSize: 1923
CO: 1 GE: 21 NM: //./Core1/Route0/Device0 BN: 9.2.0-00120
CD: 0 CN: VSTOR CSN: 0x000040004b008590 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 8 VI: 0x8590 DI: 0x1022 SVI: 0xab0b SDI: 0xffff ID: 0x0000859010220000
CD: 1 CN: AMD-RAID CSN: 0x000000004b2a1d00 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 8 VI: 0x1022 DI: 0x43bd SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
CD: 2 CN: AMD-RAID CSN: 0x000000004b2a1d01 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 1 VI: 0x1022 DI: 0x7917 SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
CD: 3 CN: AMD-RAID CSN: 0x000000004b2a1d02 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 1 VI: 0x1022 DI: 0x7917 SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
PD: 0 TY: Core ST: Online ID: 0x0000859010220000 NM: //./Core1/Route0/Device65535
PD: 1 TY: Disk ST: Online ID: 0x20c398f13f50b683 NM: //./Core1/Route1/Device0 CD: 0 CH: 0 RT: 1 HD: 0 FE: 0x21ed001c GS: N SI: 0x00000000e8e00000 LS: 0x0000000000000000 TS: 0x0000000000000000 MO: ^Samsung SSD 860 EVO 2TB ^ SN: ^S3YUNB0M507279L_____^ FW: RVT03B6Q PT: 0x00100b04
VI: NONE
LD: 0 ST: CRITICAL ID: 0x0000da586e7b3a76 NM: //./Core1/Route0/Device1 OSN: NONE RT: 0 HD: 1 SI: 0x00000000e8cfb000 1ST: 1 2ND: 2 CA: RW CTS: 1 TY: RAID1 GE: 0x00000006 TPST: 0x0000000000000000 TPSI: 0 TPC: 0.0000 TP: 20 TTY: NOT_ACTIVE TST: COMPLETED DS: 0 PRE: 0 AT: 0x0001 RC: 0 CF: 0x0000000000000000 CT: 0x0000000000000000 DP: ^Boot_2TB-Samsung^
LE: 0 RT: 1 HD: 0 ID: 0x20c398f13f50b683 SO: 0x0000000000100000 SS: 0x00000000e8d00000 DO: 0x0000000000105000 DS: 0x00000000e8cfb000 ER: NoError
LE: 1 RT: 1 HD: 36864 ID: 0x7db1127b3f6385b3 SO: 0x0000000000100000 SS: 0x00000000e8d00000 DO: 0x0000000000105000 DS: 0x00000000e8cfb000 ER: Fail_Offline
PA: 0 RT: 0 HD: 65535 ID: 0x000040000000b105

6/28/2019 12:00:01 PM - Initialized messages space, starting message logging.
6/28/2019 12:00:01 PM - Array 1 has gone critical.
06/28/19 12:14:46 RC CGI Service Start
06/28/19 12:14:46 Dumping current configuration to log file.
06/28/19 12:14:46 BufferSize: 2762
CO: 1 GE: 24 NM: //./Core1/Route0/Device0 BN: 9.2.0-00120
CD: 0 CN: VSTOR CSN: 0x000040004b008590 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 8 VI: 0x8590 DI: 0x1022 SVI: 0xb685 SDI: 0xffff ID: 0x0000859010220000
CD: 1 CN: AMD-RAID CSN: 0x000000004b2a1d00 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 8 VI: 0x1022 DI: 0x43bd SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
CD: 2 CN: AMD-RAID CSN: 0x000000004b2a1d01 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 1 VI: 0x1022 DI: 0x7917 SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
CD: 3 CN: AMD-RAID CSN: 0x000000004b2a1d02 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 1 VI: 0x1022 DI: 0x7917 SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
PD: 0 TY: Core ST: Online ID: 0x0000859010220000 NM: //./Core1/Route0/Device65535
PD: 1 TY: Disk ST: Online ID: 0x20c398f13f50b683 NM: //./Core1/Route1/Device0 CD: 0 CH: 0 RT: 1 HD: 0 FE: 0x21ed001c GS: N SI: 0x00000000e8e00000 LS: 0x0000000000000000 TS: 0x0000000000000000 MO: ^Samsung SSD 860 EVO 2TB ^ SN: ^S3YUNB0M507279L_____^ FW: RVT03B6Q PT: 0x00100b04
VI: NONE
PD: 2 TY: Unknown ST: Online ID: 0x7db1127b3f6385b3 NM: //./Core1/Route1/Device1 CD: 0 CH: 1 RT: 1 HD: 1 FE: 0x21ed001c GS: N SI: 0x00000000e8e088b0 LS: 0x0000000000000000 TS: 0x0000000000000000 MO: ^Samsung SSD 860 EVO 2TB ^ SN: ^S3YUNB0M507276F_____^ FW: RVT03B6Q PT: 0x00100b04
VI: NONE
LD: 0 ST: CRITICAL ID: 0x0000da586e7b3a76 NM: //./Core1/Route0/Device1 OSN: NONE RT: 0 HD: 1 SI: 0x00000000e8cfb000 1ST: 1 2ND: 2 CA: RW CTS: 1 TY: RAID1 GE: 0x00000006 TPST: 0x0000000000000000 TPSI: 0 TPC: 0.0000 TP: 20 TTY: NOT_ACTIVE TST: COMPLETED DS: 0 PRE: 0 AT: 0x0001 RC: 0 CF: 0x0000000000000000 CT: 0x0000000000000000 DP: ^Boot_2TB-Samsung^
LE: 0 RT: 1 HD: 0 ID: 0x20c398f13f50b683 SO: 0x0000000000100000 SS: 0x00000000e8d00000 DO: 0x0000000000105000 DS: 0x00000000e8cfb000 ER: NoError
LE: 1 RT: 1 HD: 36864 ID: 0x7db1127b3f6385b3 SO: 0x0000000000100000 SS: 0x00000000e8d00000 DO: 0x0000000000105000 DS: 0x00000000e8cfb000 ER: Fail_Offline
PA: 0 RT: 0 HD: 65535 ID: 0x000040000000b105
LD: 1 ST: NORMAL ID: 0x7db1127b3f6385b3 NM: //./Core1/Route0/Device2 OSN: NONE RT: 0 HD: 2 SI: 0x00000000e8e088b0 1ST: 1 2ND: 1 CA: NC CTS: 1 TY: LEGACY GE: 0x00000000 TPST: 0x0000000000000000 TPSI: 0 TPC: 0.0000 TP: 0 TTY: NOT_ACTIVE TST: NOT_ACTIVE DS: 0 PRE: 0 AT: 0x0000 RC: 0 CF: 0x0000000000000000 CT: 0x0000000000000000 DP: NONE
LE: 0 RT: 1 HD: 1 ID: 0x7db1127b3f6385b3 SO: 0x0000000000000000 SS: 0x00000000e8e088b0 DO: 0x0000000000000000 DS: 0x00000000e8e088b0 ER: NoError
PA: 0 RT: 0 HD: 65535 ID: 0x0000859010220000

6/28/2019 12:14:51 PM - Initialized messages space, starting message logging.
6/28/2019 12:14:51 PM - A new unknown device was inserted on controller 0 channel 1.
6/28/2019 12:14:51 PM - Array 1 has gone critical.
6/28/2019 12:14:51 PM - Drive failed on controller 0 channel 1.
6/28/2019 12:15:57 PM - A user request has set identify state for controller 0, slot 1.
6/28/2019 12:16:02 PM - A user request has cleared identify state for controller 0, slot 1.
6/28/2019 12:16:42 PM - Core rescan command received.
6/28/2019 12:19:07 PM - Array 2 has been deleted.
6/28/2019 12:19:07 PM - Drive Initialized on controller 0 channel 1.
6/28/2019 12:19:17 PM - Core rescan command received.
06/28/19 12:22:42 RC CGI Service Start
06/28/19 12:22:42 Dumping current configuration to log file.
06/28/19 12:22:42 BufferSize: 2229
CO: 1 GE: 31 NM: //./Core1/Route0/Device0 BN: 9.2.0-00120
CD: 0 CN: VSTOR CSN: 0x000040004b008590 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 8 VI: 0x8590 DI: 0x1022 SVI: 0xdd07 SDI: 0xffff ID: 0x0000859010220000
CD: 1 CN: AMD-RAID CSN: 0x000000004b2a1d00 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 8 VI: 0x1022 DI: 0x43bd SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
CD: 2 CN: AMD-RAID CSN: 0x000000004b2a1d01 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 1 VI: 0x1022 DI: 0x7917 SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
CD: 3 CN: AMD-RAID CSN: 0x000000004b2a1d02 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 1 VI: 0x1022 DI: 0x7917 SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
PD: 0 TY: Core ST: Online ID: 0x0000859010220000 NM: //./Core1/Route0/Device65535
PD: 1 TY: Disk ST: Online ID: 0x20c398f13f50b683 NM: //./Core1/Route1/Device0 CD: 0 CH: 0 RT: 1 HD: 0 FE: 0x21ed001c GS: N SI: 0x00000000e8e00000 LS: 0x0000000000000000 TS: 0x0000000000000000 MO: ^Samsung SSD 860 EVO 2TB ^ SN: ^S3YUNB0M507279L_____^ FW: RVT03B6Q PT: 0x00100b04
VI: NONE
PD: 2 TY: Disk ST: Online ID: 0x7db1127b3f6385b3 NM: //./Core1/Route1/Device1 CD: 0 CH: 1 RT: 1 HD: 1 FE: 0x21ed001c GS: N SI: 0x00000000e8e00000 LS: 0x00000000e8d00000 TS: 0x00000000e8d00000 MO: ^Samsung SSD 860 EVO 2TB ^ SN: ^S3YUNB0M507276F_____^ FW: RVT03B6Q PT: 0x00100b04
VI: NONE
LD: 0 ST: CRITICAL ID: 0x0000da586e7b3a76 NM: //./Core1/Route0/Device1 OSN: NONE RT: 0 HD: 1 SI: 0x00000000e8cfb000 1ST: 1 2ND: 2 CA: RW CTS: 1 TY: RAID1 GE: 0x00000006 TPST: 0x0000000000000000 TPSI: 0 TPC: 0.0000 TP: 20 TTY: NOT_ACTIVE TST: COMPLETED DS: 0 PRE: 0 AT: 0x0001 RC: 0 CF: 0x0000000000000000 CT: 0x0000000000000000 DP: ^Boot_2TB-Samsung^
LE: 0 RT: 1 HD: 0 ID: 0x20c398f13f50b683 SO: 0x0000000000100000 SS: 0x00000000e8d00000 DO: 0x0000000000105000 DS: 0x00000000e8cfb000 ER: NoError
LE: 1 RT: 0 HD: 36864 ID: 0x0000000000000000 SO: 0x0000000000000000 SS: 0x0000000000000000 DO: 0x0000000000000000 DS: 0x0000000000000000 ER: Fail_Offline
PA: 0 RT: 0 HD: 65535 ID: 0x000040000000b105

6/28/2019 12:22:47 PM - Initialized messages space, starting message logging.
6/28/2019 12:22:47 PM - Array 1 has gone critical.
6/28/2019 12:24:27 PM - Core rescan command received.
6/28/2019 12:24:52 PM - Drive assigned as global spare on controller 0 channel 1.
6/28/2019 12:24:52 PM - A rebuild task has been initiated for Array 1.
6/28/2019 12:24:52 PM - A rebuild task has resumed on Array 1.
6/28/2019 2:16:45 PM - A rebuild task has completed on Array 1.
6/28/2019 4:59:33 PM - Device disconnected on controller 0 channel 1.
6/28/2019 4:59:33 PM - Drive failed on controller 0 channel 1.
6/28/2019 4:59:33 PM - Array 1 has gone critical.
6/28/2019 5:11:50 PM - A new unknown device was inserted on controller 0 channel 1.
6/28/2019 5:11:51 PM - Drive failed on controller 0 channel 1.
6/28/2019 5:13:46 PM - Core rescan command received.
6/28/2019 5:15:01 PM - Core rescan command received.
06/28/19 17:17:23 RC CGI Service Start
06/28/19 17:17:23 Dumping current configuration to log file.
06/28/19 17:17:23 BufferSize: 2763
CO: 1 GE: 47 NM: //./Core1/Route0/Device0 BN: 9.2.0-00120
CD: 0 CN: VSTOR CSN: 0x000040004b008590 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 8 VI: 0x8590 DI: 0x1022 SVI: 0x8a8e SDI: 0xffff ID: 0x0000859010220000
CD: 1 CN: AMD-RAID CSN: 0x000000004b2a1d00 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 8 VI: 0x1022 DI: 0x43bd SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
CD: 2 CN: AMD-RAID CSN: 0x000000004b2a1d01 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 1 VI: 0x1022 DI: 0x7917 SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
CD: 3 CN: AMD-RAID CSN: 0x000000004b2a1d02 KE: 00000000000000000000000000000000 BB: NONE MX: 0x0000000000000000
NP: 1 VI: 0x1022 DI: 0x7917 SVI: 0x0000 SDI: 0x0000 ID: 0x0000000000000000
PD: 0 TY: Core ST: Online ID: 0x0000859010220000 NM: //./Core1/Route0/Device65535
PD: 1 TY: Disk ST: Online ID: 0x20c398f13f50b683 NM: //./Core1/Route1/Device0 CD: 0 CH: 0 RT: 1 HD: 0 FE: 0x21ed001c GS: N SI: 0x00000000e8e00000 LS: 0x0000000000000000 TS: 0x0000000000000000 MO: ^Samsung SSD 860 EVO 2TB ^ SN: ^S3YUNB0M507279L_____^ FW: RVT03B6Q PT: 0x00100b04
VI: NONE
PD: 2 TY: Unknown ST: Online ID: 0x7db1127b3f6385b3 NM: //./Core1/Route1/Device1 CD: 0 CH: 1 RT: 1 HD: 1 FE: 0x21ed001c GS: Y SI: 0x00000000e8e088b0 LS: 0x0000000000000000 TS: 0x0000000000000000 MO: ^Samsung SSD 860 EVO 2TB ^ SN: ^S3YUNB0M507276F_____^ FW: RVT03B6Q PT: 0x00100b04
VI: NONE
LD: 0 ST: CRITICAL ID: 0x0000da586e7b3a76 NM: //./Core1/Route0/Device1 OSN: NONE RT: 0 HD: 1 SI: 0x00000000e8cfb000 1ST: 1 2ND: 2 CA: RW CTS: 1 TY: RAID1 GE: 0x00000006 TPST: 0x0000000000000000 TPSI: 0 TPC: 0.0000 TP: 20 TTY: NOT_ACTIVE TST: NOT_ACTIVE DS: 0 PRE: 0 AT: 0x0001 RC: 0 CF: 0x0000000000000000 CT: 0x0000000000000000 DP: ^Boot_2TB-Samsung^
LE: 0 RT: 1 HD: 0 ID: 0x20c398f13f50b683 SO: 0x0000000000100000 SS: 0x00000000e8d00000 DO: 0x0000000000105000 DS: 0x00000000e8cfb000 ER: NoError
LE: 1 RT: 1 HD: 36864 ID: 0x7db1127b3f6385b3 SO: 0x0000000000100000 SS: 0x00000000e8d00000 DO: 0x0000000000105000 DS: 0x00000000e8cfb000 ER: Fail_Offline
PA: 0 RT: 0 HD: 65535 ID: 0x000040000000b105
LD: 1 ST: NORMAL ID: 0x7db1127b3f6385b3 NM: //./Core1/Route0/Device2 OSN: NONE RT: 0 HD: 2 SI: 0x00000000e8e088b0 1ST: 1 2ND: 1 CA: NC CTS: 1 TY: LEGACY GE: 0x00000000 TPST: 0x0000000000000000 TPSI: 0 TPC: 0.0000 TP: 0 TTY: NOT_ACTIVE TST: NOT_ACTIVE DS: 0 PRE: 0 AT: 0x0000 RC: 0 CF: 0x0000000000000000 CT: 0x0000000000000000 DP: NONE
LE: 0 RT: 1 HD: 1 ID: 0x7db1127b3f6385b3 SO: 0x0000000000000000 SS: 0x00000000e8e088b0 DO: 0x0000000000000000 DS: 0x00000000e8e088b0 ER: NoError
PA: 0 RT: 0 HD: 65535 ID: 0x0000859010220000

Tags (4)
0 Likes
2 Replies
mattfl
Newcomer

Re: RAID 1 continual drive disconnect (x399, Threadripper 1920x, Win 10 x64 build 1903 June 2019)

Quick follow up; I replaced the drive (the original was brand new) and simultaneously moved it to a different SATA port, and now it has been a couple of days without a failure.  Too early to say it's fixed, but I'm cautiously optimistic.  

0 Likes
mattfl
Newcomer

Re: RAID 1 continual drive disconnect (x399, Threadripper 1920x, Win 10 x64 build 1903 June 2019)

Final follow up; as of today it's still error free, so it seems the problem was most likely either a bad SSD out of the box, or maybe something wrong with SATA port 1 (second port) on my motherboard.  

0 Likes