AVX2 application
Message boards :
Number crunching :
AVX2 application
Message board moderation
Author | Message |
---|---|
Send message Joined: 22 Mar 20 Posts: 3 Credit: 24,125,123 RAC: 8 |
|
Send message Joined: 16 Nov 22 Posts: 131 Credit: 144,608,061 RAC: 486,524 |
I would guess not much of any difference. TN-Grid compiled 3 different apps from the same source code with just the change in SIMD instruction flags. They offer SSE2, AVX2 and FMA applications. The AVX2 and FMA applications are about equal but the SSE2 application loses out by about 10% in performance compared to the higher level SIMD instructions sets. Without specific changes in the code optimized for each SIMD class, I doubt there would be much difference between the AVX app and the proposed AVX2 app. A proud member of the OFA (Old Farts Association) |
Send message Joined: 1 Jan 13 Posts: 90 Credit: 10,400,688 RAC: 8,061 |
The current app uses hand-written optimized code for AVX & FMA. Unfortunately, there are no major advantages AVX2 can provide for this specific algorithm in comparison to AVX. Simply enabling AVX2 in the compiler won't give any speedup. AVX512 could have some features to potentially squeeze more performance on processors that support it, but the code has to be written specifically for this instruction set. |
Message boards :
Number crunching :
AVX2 application