I've heard that the ability to run this instruction set varies wildly depending on the motherboard and how it is set. If the board has a form of MCE with power limits above the Intel spec, temps hit 100C almost instantly during a stress test (even with 3x120mm AIO). You'll need to have a significant offset or a board that more closely follows Intel specifications (but the CPU runs at A LOT lower clocks, like several hundred MHz) to keep temps reasonable. Remember, the AVX boost isn't really defined/listed at ARK. Those turbo boost and TVB values are not with AVX loads.
From the article - Now, that is 10 second load and off... imagine running a stress test with 512...you'll end up thermally throttling almost immediately as there isn't a significant break between loads.
Ok, I missed that detail since I was focused on the results and the temp was in another section. I'll admit, I didn't read all the way through it.
I've only got one system with AVX-512 and how that mobo/bios deals with it confuses me. I think it also varies with specific CPU as it behaved differently with my old 7800X and current 7920X. Right now, with mobo default settings other than XMP on, the 7920X runs AVX-512 at base clock. That runs really cool, and with AVX-512 there's still a lot of performance even at the lower clock. In the past it had run at higher clocks, and as many observed, that can really suck power. When I still bothered with hwbot, getting records on y-cruncher 10b with AVX-512 was basically balancing my cooling. The CPU might run faster with better cooling but for a 10b run which mine lasted over 5 minutes, it appears I had used 3.7 GHz and 0.97v. The shorter 1b run at 24 seconds allowed me to run at 4.0 GHz, and the tiny 25m at under half a second I could get away with 4.5 GHz.
I have to wonder if there is an official recommendation for AVX offset values. I don't recall where I saw it, but they were even looking at adding additional classes of AVX offset to allow for different types of AVX workload, as not all are as power consuming as others.
I also wonder how well would an Intel system work with a power limit set. For example, set a hard cap at 156W. Would it behave more like a Ryzen system? In that a heavy load would lower clocks to keep in the same power budget? Don't need to deal with instruction set offsets. In case you're wondering, that's taking the TDP (PL1) value of 125W and multiplying it by 1.25 to give the PL2 value. So that's an implied/suggested power limit from Intel absent anything better. I could try this on my main system...