Optimizing plotting on an AMD Ryzen 5950X (or any other 16c/32t CPU)

Here are my results with -u and -v = 512, -t = Corsair MP600 (non-RAID), -2 = DDR4-2666 110G:

  1. Number of Threads: 30
Phase 1 took 601.129 sec
Phase 2 took 387.489 sec
Phase 3 took 319.588 sec
Phase 4 took 37.1284 sec
Total plot creation time was 1345.38 sec (22.423 min)
  2. Number of Threads: 20
Phase 1 took 585.479 sec
Phase 2 took 392.958 sec
Phase 3 took 306.668 sec
Phase 4 took 33.5924 sec
Total plot creation time was 1318.75 sec (21.9791 min)
  3. Number of Threads: 16
Phase 1 took 584.581 sec
Phase 2 took 401.289 sec
Phase 3 took 301.141 sec
Phase 4 took 32.9146 sec
Total plot creation time was 1319.98 sec (21.9996 min)

The overall plotting times are almost identical with 16 and 20 threads (about 22.0 min each), while 30 threads is the slowest (22.4 min). The reason appears to be that phase 2 is the only phase that gets faster as more threads are added (401 sec with 16 threads vs. 387 sec with 30). Phases 1, 3 and 4 are all slower with 30 threads than with 16, so for those phases it is better to limit the thread count to the number of physical cores.
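
To make the per-phase comparison easier to check, here is a small Python sketch that takes the phase times from the three runs above and reports which thread count was fastest in each phase. The variable and function names are my own, and the "sum of best phase times" is only a hypothetical lower bound, since each run above used a single thread count for all phases.

```python
# Phase times in seconds, copied from the logs above.
# Key: number of threads, value: [phase 1, phase 2, phase 3, phase 4].
phase_times = {
    30: [601.129, 387.489, 319.588, 37.1284],
    20: [585.479, 392.958, 306.668, 33.5924],
    16: [584.581, 401.289, 301.141, 32.9146],
}


def fastest_threads_per_phase(times):
    """Return {phase_number: thread_count_with_the_lowest_time}."""
    n_phases = len(next(iter(times.values())))
    return {
        phase + 1: min(times, key=lambda threads: times[threads][phase])
        for phase in range(n_phases)
    }


best = fastest_threads_per_phase(phase_times)
for phase, threads in best.items():
    print(f"Phase {phase}: fastest with {threads} threads")

# Hypothetical lower bound: add up the best time seen for each phase,
# as if every phase could run at its own best thread count.
ideal_total = sum(
    min(run[phase] for run in phase_times.values()) for phase in range(4)
)
print(f"Sum of best phase times: {ideal_total:.2f} sec")
```

With only three runs this is a rough indication rather than a benchmark, but it shows that phase 2 is the only phase where the 30-thread run comes out ahead.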