Chia crashing my SSDs and making them disappear until PC is rebooted

9 hours later and it’s still at 1%. No errors.

From the Chia GUI: Total Plot Size: 0 B. Plot queue: K-32, 101.4GiB, queue name “X to Red 2”, status Plotting at 1%.

2021-05-29T22:31:23.779 chia.plotting.create_plots : INFO Creating 1 plots of size 32, pool public key: 83b78b3d9d5731ed219e52d8715acb21cdb4df2452fd77e004202d458f5523380373258b682b55523b30330325af04d7 farmer public key: b8a88f25bc0fc8c4a4b56e9f0abbf55f5a104dacfa3828058a2d16b461b4e813427983b8287e1c8a6d79e5addf425fc9
2021-05-29T22:31:23.787 chia.plotting.create_plots : INFO Memo: 83b78b3d9d5731ed219e52d8715acb21cdb4df2452fd77e004202d458f5523380373258b682b55523b30330325af04d7b8a88f25bc0fc8c4a4b56e9f0abbf55f5a104dacfa3828058a2d16b461b4e813427983b8287e1c8a6d79e5addf425fc91e2e4defcba740bdb70871e8c680784b7b11b69bc63aa6d9098e7a7d5811db4e
2021-05-29T22:31:23.788 chia.plotting.create_plots : INFO Starting plot 1/1
Starting plotting progress into temporary dirs: X:\Plots and X:\Plots
ID: 393d79fbb69d8d629d20b0f257078ad3186251bfe9810fd8d36931801fedd31b
Plot size is: 32
Buffer size is: 1501MiB
Using 128 buckets
Using 2 threads of stripe size 65536
Starting phase 1/4: Forward Propagation into tmp files… Sat May 29 22:31:23 2021
Computing table 1
F1 complete, time: 125.29 seconds. CPU (180.71%) Sat May 29 22:33:29 2021
Computing table 2
Bucket 0 uniform sort. Ram: 1.406GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 1 uniform sort. Ram: 1.406GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 2 uniform sort. Ram: 1.406GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 3 uniform sort. Ram: 1.406GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 4 uniform sort. Ram: 1.406GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 5 uniform sort. Ram: 1.406GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 6 uniform sort. Ram: 1.406GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 7 uniform sort. Ram: 1.406GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 8 uniform sort. Ram: 1.406GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 9 uniform sort. Ram: 1.406GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 10 uniform sort. Ram: 1.406GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 11 uniform sort. Ram: 1.406GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 12 uniform sort. Ram: 1.406GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 13 uniform sort. Ram: 1.406GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 14 uniform sort. Ram: 1.406GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 15 uniform sort. Ram: 1.406GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 16 uniform sort. Ram: 1.406GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
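
To see where a stalled plot actually stopped, the log can be scanned for the last "Computing table" and "Bucket N uniform sort" entries. The following is only a rough Python sketch that assumes the log format shown above; the function name plot_progress is made up for illustration and is not part of Chia.

```python
import re

BUCKET_RE = re.compile(r"Bucket (\d+) uniform sort")
TABLE_RE = re.compile(r"Computing table (\d+)")
BUCKETS_RE = re.compile(r"Using (\d+) buckets")


def plot_progress(log_text):
    """Return (table, last_bucket, total_buckets) found in a plotter log.

    Hypothetical helper: each value is None if the log does not contain it yet.
    """
    total = BUCKETS_RE.search(log_text)
    tables = TABLE_RE.findall(log_text)
    table = int(tables[-1]) if tables else None
    last_bucket = None
    if tables:
        # Only look at buckets logged after the most recent "Computing table" entry.
        tail = log_text[log_text.rfind("Computing table"):]
        buckets = BUCKET_RE.findall(tail)
        last_bucket = int(buckets[-1]) if buckets else None
    return table, last_bucket, int(total.group(1)) if total else None


# Shortened sample in the same format as the log above.
sample = (
    "Using 128 buckets\n"
    "Computing table 1\n"
    "F1 complete, time: 125.29 seconds. CPU (180.71%)\n"
    "Computing table 2\n"
    "Bucket 0 uniform sort.\n"
    "Bucket 16 uniform sort.\n"
)
table, bucket, total = plot_progress(sample)
print(f"table {table}, bucket {bucket} of {total}")
```

Run against the full log, this would report table 2 and bucket 16 with 128 buckets in use, which matches a plot that stopped making progress early in phase 1.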

Any thoughts?

Decided to go looking through the Windows logs. Has anyone seen these before?

Event 153, disk - Warning
The IO operation at logical block address 0x57cee18 for Disk 2 (PDO name: \Device\0000003c) was retried.

Event 129 Storahci - Warning
Reset to device, \Device\RaidPort2, was issued.

Event 10010, DistributedCOM - Error
The server {E60687F7-01A1-40AA-86AC-DB1CBF673334} did not register with DCOM within the required timeout.
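
For anyone who wants to pull just these storage warnings out of the System log instead of scrolling through Event Viewer, here is a minimal Python sketch that builds a wevtutil query for event IDs 129 and 153. The helper names are hypothetical, and it assumes wevtutil is on the PATH (it ships with Windows); on other systems it only prints the command.

```python
import shutil
import subprocess

# Event IDs taken from the warnings quoted above (disk 153, storahci 129).
STORAGE_EVENT_IDS = (129, 153)


def build_xpath(event_ids):
    """Build an event log XPath filter matching any of the given event IDs."""
    clauses = " or ".join(f"EventID={i}" for i in event_ids)
    return f"*[System[({clauses})]]"


def build_command(event_ids, count=20):
    """Return a wevtutil command listing the newest matching System events."""
    return [
        "wevtutil", "qe", "System",
        f"/q:{build_xpath(event_ids)}",
        f"/c:{count}",   # limit the number of events returned
        "/rd:true",      # newest events first
        "/f:text",       # plain-text output
    ]


cmd = build_command(STORAGE_EVENT_IDS)
if shutil.which("wevtutil"):
    # Only reached on Windows, where wevtutil is available.
    result = subprocess.run(cmd, capture_output=True, text=True)
    print(result.stdout or result.stderr)
else:
    print(" ".join(cmd))
```

If the timestamps of these events line up with the moment the drives vanish, that at least points at the disk or controller rather than at Chia itself.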

Hey :smiley:

Which Windows version do you use?

It’s not much, but it may help:
Did you uncheck the MBR feature at the Disk-Format?
Did you format the SSD to NTFS?
Did you check ports in BIOS for AHCI or RAID?
Do you use the drives as RAID?
Did you check your “additional Power Setting”?

Searching for “Event 129 Storahci - Warning”, I found that your Samsung SSD may need a firmware update (through something called “Samsung Magician software”).

greetings

I had similar issues with an XPG8100 M.2 NVMe drive. It kept heating up and crashing. I replaced it with a WD Black 850 and now everything is fine. Your plotting drives may not be able to handle the workload.


Did you uncheck the MBR feature at the Disk-Format? - GPT
Did you format the SSD to NTFS? - NTFS
Did you check ports in BIOS for AHCI or RAID? - AHCI
Do you use the drives as RAID? - No
Did you check your “additional Power Setting”? - No, I didn’t think power was an issue.

Searching for “Event 129 Storahci - Warning”, I found that your Samsung SSD may need a firmware update (through something called “Samsung Magician software”). - Yup, already had it. I didn’t want to run the firmware update because the drives were working. But I did, and now all four show up as

“Disk #
Unknown
Not initialized”

and now when I try to initialize one I get “A device which does not exist was specified”.

May have to do that. I thought Samsung drives with read/write speeds over 500 MB/s should have worked.

I am not 100% sure on this, but more than just speed may be needed here. Robustness, build quality and heat management may come into play as well.

I have the same problem with a Firecuda 510 2TB and an Aorus 1TB.
Both plot for 24 hours and then suddenly throw read/write errors and the “A device which does not exist was specified” error.
If I try to delete the volume or format them in Disk Management, they disappear.
After a restart they work fine for about a day.
Any ideas?

I have a Firecuda 520 1TB that I am using and it is also crashing in a similar fashion. My OS is on that drive, so it would crash and then the computer would reset. I am thinking the drive is overheating. Tomorrow I plan to swap the Firecuda for a spare WD Black 750 to see if it fixes the issue.

Your SSDs are not suitable for Chia plotting, even if they say they are high endurance.

Not suitable and not working are 2 very different things.
Both my Firecuda 510 2TB and Aorus 1TB would plot for 24 hours, create 15 plots and then fail.
After a restart I can do the same thing again.

And I have no idea why.

So I ended up taking all 4 drives back. It crashed out last night without the Chia system even running and with the computer on for less than 5 minutes, and 2 of the 4 drives were so hot they could barely be touched. I wasn’t fighting with them anymore. So Samsung drives, don’t buy, even if they say 500MBs

“So Samsung drives, don’t buy, even if they say 500MBs”
Statements like this are unfounded and rather dumb. Many cheap consumer grade drives can’t handle Chia plotting as the behaviour is totally different than OS boot or gaming for example.

I use Samsung 980 Pro’s which are very good drives for this sort of work.

Read this: SSD speed problem / Kingston 1 TB NV1 SNVS/1000G M.2 PCI-Express 3.0 SSD - #8 by SteveTheLonelyFarmer


“And I have no idea why.”
Read this: SSD speed problem / Kingston 1 TB NV1 SNVS/1000G M.2 PCI-Express 3.0 SSD - #8 by SteveTheLonelyFarmer

Thank you very much.

Honestly, over the years I have found Samsung SSDs to be among the best, if not the very best.

I had this same issue with a Firecuda 520 1TB hooked up externally via USB to an Ubuntu machine. In my case, the drive was overheating and crashing. I put it in an external enclosure, added thermal pads/heatsinks to it, and have a fan on it, and the problem went away.

I have the same issue with my Samsung 980 Pro 2TB. Unfortunately, covering it with the heatsink did NOT solve the issue, and Samsung Magician froze and stopped refreshing the temperature reading when it happened. Windows logs say nothing. Can someone point me to what to look for in the logs? Thanks!

I would have to concur with some of the above comments regarding heat dissipation. I’ve just noticed this same issue last night for the first time and found this thread, unsurprisingly, describing the issue.

I have 3 (allegedly) enterprise grade Micron 5100 Pro 960GB SSDs. I liked the idea of several smaller drives to plot multiples in parallel at high speed, rather than one large drive with the speed bottleneck of that single drive.

Initially when plotting on the single drive that had arrived so far I noticed it was blazing hot, to the point that I’d have burned my hand holding it, and the hotter it got the slower plotting got. I half assed a case fan directed straight at the trays where all 3 drives would eventually go, and that solved the issue instantly. Once all 3 arrived and I installed them, everything was great with the fan.

I had to move some stuff around though and didn’t have the fan in there for a bit and figured it’d be fine overnight last night at least, since I’d cranked the case fans to compensate. Woke up this morning to ALL my plots frozen because the SSDs had clearly disconnected from the SATA mobo slots overnight at some point, and were reconnected now but just idling, with Chia giving the “trying again in 5 minutes” ad infinitum.

Add a fan directed at your drives, or maybe some thermal paste and a heat sink, and I bet at least some individuals’ problems will go away. Also, don’t use consumer grade drives unless you want them to pop in three weeks.
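
To check whether heat really is the trigger, it may help to log the drive temperature while plotting. Below is a small Python sketch that reads the current temperature from the JSON that smartmontools prints for `smartctl --json -A <device>`. It assumes the output contains a top-level "temperature" object with a "current" field; the 70 °C threshold and the function names are placeholders I made up, not vendor limits.

```python
import json

# Hypothetical warning threshold in degrees Celsius; check the drive's datasheet.
WARN_CELSIUS = 70


def current_temperature(smartctl_json):
    """Extract the current temperature from smartctl JSON output, or None."""
    data = json.loads(smartctl_json)
    return data.get("temperature", {}).get("current")


def is_too_hot(smartctl_json, limit=WARN_CELSIUS):
    """Return True when a temperature is reported and it is at or above limit."""
    temp = current_temperature(smartctl_json)
    return temp is not None and temp >= limit


# Trimmed-down sample of what the JSON output is assumed to look like.
sample = '{"device": {"name": "/dev/nvme0"}, "temperature": {"current": 78}}'
print(current_temperature(sample), is_too_hot(sample))
```

Running this every minute or so during a plot and noting the last value before the drive drops out should show fairly quickly whether a fan or heatsink is worth trying.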

update

So I took the SSDs back because, no matter what I did, I couldn’t get them to work. In turn I bought a WD_Black 1TB AN1500 NVMe, and I have been plotting since it got delivered without any issues.

Thank you very much for everyone’s interest.