AVX2 application


Message boards : Number crunching : AVX2 application

CallMeFoxie

Joined: 22 Mar 20
Posts: 3
Credit: 24,125,123
RAC: 8
Message 7758 - Posted: 3 Mar 2023, 9:11:51 UTC
Hello,

Would it be beneficial to also offer an AVX2 app? Or is there no major performance advantage compared to SSE3/AVX?
Keith Myers
Joined: 16 Nov 22
Posts: 131
Credit: 144,570,485
RAC: 486,931
Message 7759 - Posted: 3 Mar 2023, 19:13:21 UTC
I would guess not much of any difference. TN-Grid compiled 3 different apps from the same source code with just the change in SIMD instruction flags.

They offer SSE2, AVX2 and FMA applications. The AVX2 and FMA applications are about equal, but the SSE2 application loses out by about 10% in performance compared to the higher-level SIMD instruction sets.

Without specific changes in the code optimized for each SIMD class, I doubt there would be much difference between the AVX app and the proposed AVX2 app.
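
One general reason a flag change alone may not matter much for floating-point code: AVX already provides 256-bit floating-point vectors, and AVX2 mainly extends 256-bit operations to integers. The sketch below is only illustrative arithmetic on nominal register widths. The table and the function name `lanes` are made up for this example, and it says nothing about how this project's code actually behaves or whether this is the reason that applies here.

```python
# Nominal vector register widths (in bits) for floating-point operations.
# These are general x86 facts, not measurements of any project's app.
FLOAT_VECTOR_BITS = {
    "SSE2": 128,
    "AVX": 256,
    "AVX2": 256,    # AVX2 widens integer operations; float width is the same as AVX
    "AVX512": 512,
}


def lanes(isa, element_bits=64):
    """Return how many elements of the given size fit in one vector register."""
    return FLOAT_VECTOR_BITS[isa] // element_bits


if __name__ == "__main__":
    for isa in FLOAT_VECTOR_BITS:
        print(f"{isa}: {lanes(isa, 64)} doubles, {lanes(isa, 32)} floats per register")
```

Under these assumptions AVX and AVX2 hold the same number of floating-point values per register, so any gain from AVX2 would have to come from code written to use its additional instructions.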

A proud member of the OFA (Old Farts Association)
ahorek's team
Volunteer developer
Volunteer tester

Joined: 1 Jan 13
Posts: 90
Credit: 10,400,371
RAC: 8,100
Message 8056 - Posted: 20 Sep 2023, 22:39:12 UTC
The current app uses hand-written optimized code for AVX & FMA. Unfortunately, there are no major advantages AVX2 can provide for this specific algorithm in comparison to AVX. Simply enabling AVX2 in the compiler won't give any speedup.
AVX512 has some features that could potentially squeeze out more performance on processors that support it, but the code would have to be written specifically for that instruction set.
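
To illustrate why host support for a newer instruction set does not help by itself, here is a minimal sketch of picking a build from the kernels that actually exist. Everything in it is hypothetical: the variant names, their preference order, the `generic` fallback and the function `pick_variant` are assumptions for illustration, not this project's real application list or selection logic.

```python
# Hypothetical sketch: choose the best build for a host from the kernels
# that have actually been written. Names and ordering are assumptions.

# Variants with hand-written kernels, most preferred first.
AVAILABLE_KERNELS = ("fma", "avx", "sse3")


def pick_variant(cpu_features, kernels=AVAILABLE_KERNELS):
    """Return the first kernel, in preference order, that the CPU supports."""
    for name in kernels:
        if name in cpu_features:
            return name
    return "generic"  # fallback when no SIMD kernel matches


if __name__ == "__main__":
    host = {"sse3", "avx", "fma", "avx2"}
    # The host reporting AVX2 changes nothing, because no AVX2 kernel exists.
    print(pick_variant(host))
    # A dedicated AVX-512 kernel is only selected once it has been written
    # and the host reports the feature.
    print(pick_variant(host | {"avx512f"}, ("avx512f",) + AVAILABLE_KERNELS))
```

In this sketch, adding `avx2` to the host's feature set never changes the result; only adding a new entry to the kernel list does, which mirrors the point that the code has to be written for the instruction set first.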
