New optimized versions for SSE3 released


Message boards : News : New optimized versions for SSE3 released

Message board moderation

To post messages, you must log in.
1 · 2 · Next
AuthorMessage
Profile Kyong
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 9 Jun 12
Posts: 584
Credit: 52,667,664
RAC: 0
Message 1493 - Posted: 16 Aug 2013, 17:48:14 UTC

Last modified: 16 Aug 2013, 17:50:56 UTC
There are new SSE3 optimized versions released for 32bit and 64bit linux and Windows.

Radim VanĨo (Kyong)
ID: 1493 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
biodoc

Send message
Joined: 26 Jan 13
Posts: 11
Credit: 13,143,554
RAC: 3,124
Message 1494 - Posted: 16 Aug 2013, 22:30:31 UTC
cool!
ID: 1494 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
[B^S]Beremat

Send message
Joined: 4 Sep 12
Posts: 3
Credit: 154,643
RAC: 0
Message 1495 - Posted: 17 Aug 2013, 1:19:19 UTC
So fast! Down to 22 minutes on my 3570K.
ID: 1495 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile chip
Avatar

Send message
Joined: 1 Jun 13
Posts: 7
Credit: 0
RAC: 0
Message 1496 - Posted: 17 Aug 2013, 5:21:03 UTC
Next step is AVX?
ID: 1496 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile den777

Send message
Joined: 22 Jun 13
Posts: 15
Credit: 3,118,560
RAC: 0
Message 1497 - Posted: 17 Aug 2013, 6:41:50 UTC
Twice faster again.
Still 120 credits per unit?
ID: 1497 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile HA-SOFT, s.r.o.
Project developer
Project tester

Send message
Joined: 21 Dec 12
Posts: 176
Credit: 136,462,135
RAC: 11
Message 1498 - Posted: 17 Aug 2013, 8:41:59 UTC - in response to Message 1497.  
Twice faster again.
Still 120 credits per unit?

Yes
ID: 1498 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Kyong
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 9 Jun 12
Posts: 584
Credit: 52,667,664
RAC: 0
Message 1499 - Posted: 17 Aug 2013, 8:44:10 UTC
The next will be SSE2 and AVX. And I have to think over the credit, I think that we could send longer WUs which would be better. But first we try to finish the applications to knowing how much time would it take.
ID: 1499 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
biodoc

Send message
Joined: 26 Jan 13
Posts: 11
Credit: 13,143,554
RAC: 3,124
Message 1500 - Posted: 17 Aug 2013, 15:25:18 UTC
I'm seeing a mixture of WUs. Some are for the old optimized app and some are for the new sse3 app. Is this normal? Can't the new sse3 app crunch all WUs?
ID: 1500 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Kyong
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 9 Jun 12
Posts: 584
Credit: 52,667,664
RAC: 0
Message 1501 - Posted: 17 Aug 2013, 16:13:44 UTC
There is no mixture. They are still the same WUs, so both application can compute them. The problem is, than BOINC developers made the system the way that at first scheduler send all possible version to the host and when it figure out which one is faster, it will then send the app to it. There is no option to make the prefered version so you just have to wait and then it should send you only SSE3 application. The only way is to rewrite the validator but that is more work and after BOINC server upgrade it doesn't have to work and it would be needed to fixing it.
ID: 1501 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
biodoc

Send message
Joined: 26 Jan 13
Posts: 11
Credit: 13,143,554
RAC: 3,124
Message 1502 - Posted: 17 Aug 2013, 17:40:34 UTC
Thanks Kyong. Sometimes I think Boinc has a mind of its own. :)
ID: 1502 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Ananas

Send message
Joined: 18 Mar 13
Posts: 32
Credit: 2,506,320
RAC: 0
Message 1504 - Posted: 18 Aug 2013, 0:26:25 UTC

Last modified: 18 Aug 2013, 0:27:53 UTC
I wonder if the x64 version without SSE support is even required.

For RNA we made an SSE2 version for x64 "just in case" but is there even any x64 CPU without SSE3 (only talking about 8086 based architecture of course) ?
ID: 1504 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile HA-SOFT, s.r.o.
Project developer
Project tester

Send message
Joined: 21 Dec 12
Posts: 176
Credit: 136,462,135
RAC: 11
Message 1506 - Posted: 18 Aug 2013, 10:32:46 UTC - in response to Message 1504.  

Last modified: 18 Aug 2013, 10:37:48 UTC
X86 64 bit world has SSE2 as minimal instruction set. Not sure about Atoms and SSE3..
ID: 1506 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
frankhagen

Send message
Joined: 18 Jun 12
Posts: 15
Credit: 5,027,400
RAC: 0
Message 1507 - Posted: 18 Aug 2013, 11:36:56 UTC - in response to Message 1506.  
X86 64 bit world has SSE2 as minimal instruction set. Not sure about Atoms and SSE3..


they do.


http://en.wikipedia.org/wiki/SSE3
ID: 1507 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jan Vaclavik

Send message
Joined: 26 Jan 13
Posts: 31
Credit: 1,549,661
RAC: 241
Message 1509 - Posted: 18 Aug 2013, 16:33:21 UTC - in response to Message 1504.  
I wonder if the x64 version without SSE support is even required.

For RNA we made an SSE2 version for x64 "just in case" but is there even any x64 CPU without SSE3 (only talking about 8086 based architecture of course) ?

Early Athlon 64s?
ID: 1509 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile HA-SOFT, s.r.o.
Project developer
Project tester

Send message
Joined: 21 Dec 12
Posts: 176
Credit: 136,462,135
RAC: 11
Message 1510 - Posted: 18 Aug 2013, 16:39:33 UTC - in response to Message 1509.  
If we will have SSE2 version of app (now in testing) we can remove standard app for 64bit x86 world. SSE3 will be an option.
ID: 1510 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
VictordeHollander

Send message
Joined: 15 Feb 13
Posts: 5
Credit: 2,128,794
RAC: 0
Message 1512 - Posted: 18 Aug 2013, 17:44:55 UTC
I've a WU with an error, maybe useful:

http://asteroidsathome.net/boinc/result.php?resultid=10081386
ID: 1512 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile HA-SOFT, s.r.o.
Project developer
Project tester

Send message
Joined: 21 Dec 12
Posts: 176
Credit: 136,462,135
RAC: 11
Message 1513 - Posted: 18 Aug 2013, 17:51:34 UTC - in response to Message 1512.  
Do you have some antivirus installed?
ID: 1513 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Ananas

Send message
Joined: 18 Mar 13
Posts: 32
Credit: 2,506,320
RAC: 0
Message 1514 - Posted: 18 Aug 2013, 18:04:06 UTC

Last modified: 18 Aug 2013, 18:07:17 UTC
Sounds very much like a missing DLL. VC++ runtime not static maybe? Especially as the optimized version should rather be larger (loop unrolling) than smaller than the regular version. In fact it is about half the size.
ID: 1514 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile HA-SOFT, s.r.o.
Project developer
Project tester

Send message
Joined: 21 Dec 12
Posts: 176
Credit: 136,462,135
RAC: 11
Message 1515 - Posted: 18 Aug 2013, 18:09:35 UTC - in response to Message 1514.  
I have to ask Kyong. He is building final app. I have static linking preset in my project file. Error is invalid instruction which means that operating system detected this instruction when moving app into memory before start.
ID: 1515 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Kyong
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 9 Jun 12
Posts: 584
Credit: 52,667,664
RAC: 0
Message 1517 - Posted: 18 Aug 2013, 19:47:50 UTC
I have also static linking in VS, but the application should be good. I tested them on clean installed windows where was nothing than BOINC and updates installed.
ID: 1517 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
1 · 2 · Next

Message boards : News : New optimized versions for SSE3 released