Hey all.
I’m setting up a chia farm and wanted to log my progress, as well as hopefully get some feedback in the process. I plan to set up the farm under various conditions, i.e… windows kvm guest w/ virtio, windows kvm guest with vfio, lxc on zfs, and native linux.
So far I’m up to the first condition: windows kvm guest w/ virtio.
Server specs are:
Dual Xeon X5690 (6C/12T ea. @ 3.47 Ghz)
96 GB DDR3 1333
1 x HDD for Host and Guest OS @ 0.5 TB
2 x Intel NVMe for temp storage
8 x WD 16 TB
VM specs are:
2 Sockets 24 Cores in ‘Host’ mode
48 GB Ram
All disks passed through at block device level with virtio and writeback cache
All disks are formated ntfs with default settings and assigned a letter
Method: I will start a single plot with -b 6000 and measure the time it takes to complete phase 1 table 2, which per the wiki represents about 6% of a plot under various other conditions.
Results and Notes:
At first, I forgot to enable cache=writeback for the VM. This was before I’d resolved to test various layouts, and would have left it, had performance been decent. It was not decent. If I recall correctly, it took about 9000 seconds to get through Phase 1 Table 2. That works out to a 41 hour plot time.
Aside from enabling cache, I also tried:
-increasing from 3 threads per plot to 4
-having windows partition block size match the media
-passing cpu=host to the guest so that bitfield would work
-with and without bitfield
The biggest difference came from enabling cache. Now I can get through table 2 in about 3000 seconds. Sadly, that still works out to about 13 hours per plot.
Surprisingly, the second biggest difference came disabling bitfield with cpu=host passed to the vm. That reduces the time to get through table 2 to about 2500 seconds (table 1 takes longer but table 2 is faster). That works out to under 12 hours a plot, which is ok, but still less than I want, and it remains to be seen if this will scale to 6 or 7 parallel threads (initially it did, but it was so slow without cache).
Matching the vm’s block size to the media made no obvious difference, but I did not run a full test before reverting back to default block size…
I’ve never seen latencies like this either. The response time for the nvme array in Task Manager for the windows guest has spiked higher than 50000 ms. I’m not positive, but it seems cpu=host helped with that, but response times are still spiking as high as 5000 ms. I’ve never tried a workload like this, but that seems like it can’t be good. I’m interested to see how that changes with vfio.
./chia plots create -k 32 -b 6000 -r 4 -n 145 -e -t z:\ -d f:
Starting phase 1/4: Forward Propagation into tmp files... Wed Jun 9 17:58:12 2021
Computing table 1
F1 complete, time: 437.415 seconds. CPU (95.58%) Wed Jun 9 18:05:30 2021
Computing table 2
Bucket 0 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 1 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 2 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 3 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 4 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 5 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 6 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 7 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 8 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 9 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 10 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 11 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 12 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 13 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 14 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 15 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 16 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 17 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 18 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 19 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 20 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 21 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 22 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 23 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 24 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 25 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 26 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 27 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 28 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 29 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 30 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 31 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 32 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 33 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 34 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 35 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 36 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 37 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 38 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 39 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 40 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 41 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 42 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 43 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 44 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 45 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 46 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 47 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 48 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 49 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 50 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 51 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 52 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 53 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 54 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 55 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 56 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 57 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 58 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 59 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 60 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 61 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 62 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 63 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 64 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 65 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 66 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 67 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 68 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 69 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 70 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 71 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 72 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 73 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 74 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 75 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 76 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 77 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 78 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 79 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 80 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 81 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 82 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 83 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 84 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 85 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 86 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 87 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 88 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 89 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 90 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 91 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 92 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 93 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 94 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 95 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 96 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 97 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 98 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 99 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 100 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 101 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 102 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 103 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 104 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 105 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 106 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 107 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 108 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 109 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 110 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 111 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 112 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 113 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 114 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 115 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 116 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 117 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 118 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 119 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 120 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 121 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 122 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 123 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 124 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Bucket 125 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 126 uniform sort. Ram: 5.793GiB, u_sort min: 0.563GiB, qs min: 0.281GiB.
Bucket 127 uniform sort. Ram: 5.793GiB, u_sort min: 1.125GiB, qs min: 0.281GiB.
Total matches: 4294850723
Forward propagation table time: 2007.665 seconds. CPU (185.760%) Wed Jun 9 18:38:58 2021
Lastly, I reduced the thread count to 3, and noticed about a 7% slowdown, with stage 1 table 1 going surprisingly quick, around 316 s, and stage 2 taking longer, at 2400s,
Next, I’ll try lxc with zfs.
Thanks for reading.