New optimized versions for SSE3 released
Message boards :
News :
New optimized versions for SSE3 released
Message board moderation
Previous · 1 · 2
Author | Message |
---|---|
Send message Joined: 26 Jan 13 Posts: 31 Credit: 1,549,557 RAC: 239 |
Last modified: 19 Aug 2013, 10:28:36 UTC The next will be SSE2 and AVX. And I have to think over the credit, I think that we could send longer WUs which would be better. But first we try to finish the applications to knowing how much time would it take. Does the AVX version use SSSE3, SSE4.1 and SSE4.2 as well? Not that I have CPU with AVX instructions, just wondering. |
Send message Joined: 21 Dec 12 Posts: 176 Credit: 136,462,135 RAC: 11 |
Last modified: 19 Aug 2013, 11:02:32 UTC No, mix of SSEx and AVX instruction is very ineffective. Especially when SSE2 instruction follow AVX instruction. We will not release AVX version, simply because it's slower than SSE3. The final approach will be: 1. Standard app 2. Pure SSE2 app 3. SSE3 app (the fastest one) Kyong is testing SSE2 now. I'm working on standard app now (some backports from sse3 version) as preparation step for nVidia CUDA development. |
Send message Joined: 1 Jun 13 Posts: 7 Credit: 0 RAC: 0 |
Last modified: 19 Aug 2013, 11:17:57 UTC |
Send message Joined: 21 Dec 12 Posts: 176 Credit: 136,462,135 RAC: 11 |
|
Send message Joined: 21 Dec 12 Posts: 176 Credit: 136,462,135 RAC: 11 |
I have inspect this and it looks like AVX2 app for Intel Hasvel. AVX2 brings new integer instructions to 256 bit AVX world which is missing in AVX. We use them in app so our AVX app must use SSE2 instructions for integers. We use Visual studio 2010 for win builds and there is no AVX2 support. Visual studio 2012 do not support Win Vista and older OS. I have ordered one i5-4670 in our company and I will test AVX2. If tests will be succesfull we will create download section and let users download special app with app_info.xml included. |
Send message Joined: 15 Feb 13 Posts: 5 Credit: 2,128,794 RAC: 0 |
|
Send message Joined: 21 Dec 12 Posts: 176 Credit: 136,462,135 RAC: 11 |
|
Send message Joined: 26 Jan 13 Posts: 31 Credit: 1,549,557 RAC: 239 |
Last modified: 20 Aug 2013, 12:52:29 UTC I have inspect this and it looks like AVX2 app for Intel Hasvel. Boinc does not allow you to release an application which would be only available to specific Windows versions? |
Send message Joined: 21 Dec 12 Posts: 176 Credit: 136,462,135 RAC: 11 |
|
Send message Joined: 1 Apr 13 Posts: 37 Credit: 153,496,537 RAC: 0 |
|
Send message Joined: 1 Oct 12 Posts: 17 Credit: 24,549,679 RAC: 0 |
|
Send message Joined: 21 Dec 12 Posts: 176 Credit: 136,462,135 RAC: 11 |
How do I get a copy of the optimized version for sse3? how do I get a copy of the cuda? I'd love to accomplish more with my machines. Thanks in advance, Rick SSE3 application is distributed automatically. As I see in your hosts you have got SSE3 app. You have Intel Haswell so you will profit from AVX version of app which will be released soon. |
Send message Joined: 21 Dec 12 Posts: 176 Credit: 136,462,135 RAC: 11 |
Last modified: 25 Aug 2013, 20:27:38 UTC |
Send message Joined: 1 Oct 12 Posts: 17 Credit: 24,549,679 RAC: 0 |
|
Send message Joined: 20 Jul 13 Posts: 15 Credit: 5,985,840 RAC: 0 |
Thanks for the reply HA-SOFT. I thought I had to do something manually, like I do for Seti. Can't wait for the cuda, as I have lots of capability I'm willing to contribute. Rick oh my god you're not kidding...i just saw your arsenal, and your GPU power is insane! of the 5 multi-GPU machines you've got on A@H right now, might i ask what motherboards you're using in each of them? thanks, Eric |
Send message Joined: 1 Oct 12 Posts: 17 Credit: 24,549,679 RAC: 0 |
|
Send message Joined: 26 Jan 13 Posts: 31 Credit: 1,549,557 RAC: 239 |
|
Send message Joined: 21 Dec 12 Posts: 176 Credit: 136,462,135 RAC: 11 |
Is there some huge performance difference on certain CPUs between SSE2 and SSE3 versions? See: http://asteroidsathome.net/boinc/forum_thread.php?id=171&postid=1551#1551 |
Send message Joined: 18 Mar 13 Posts: 32 Credit: 2,506,320 RAC: 0 |
Last modified: 7 Sep 2013, 16:08:29 UTC Boinc does not allow you to release an application which would be only available to specific Windows versions? If you want to support a specific CPU feature, that BOINC doesn't report, you could use a method similar to the RNA-World one : They deliver only one ZIP (for each OS basic type) that has everything : common files, SSE4 version, SSE3 version ... basic version. Plus they use an unzip "wrapper", that checks the CPU features, unzips the common files, then checks the ZIP file for the best version that fits the CPU. If unavailable, it checks the next one and so on. Advantage : It will work even with ancient BOINC versions, that are much older than the date the CPU feature has been introduced - without the need for am optimized app_info.xml. |
Previous · 1 · 2
Message boards :
News :
New optimized versions for SSE3 released