avx feature/app


Message boards : Number crunching : avx feature/app

Message board moderation

To post messages, you must log in.
1 · 2 · 3 · 4 · Next
AuthorMessage
rebirther
Avatar

Send message
Joined: 18 Jun 12
Posts: 10
Credit: 111,120
RAC: 0
Message 1826 - Posted: 29 Sep 2013, 5:53:22 UTC

Last modified: 29 Sep 2013, 6:07:52 UTC
With the last WUs I never got any avx app, only SSE2/3. I have 7.2.16 that should support this feature. Is there a problem with the scheduler?
ID: 1826 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
boinc127

Send message
Joined: 24 Jan 13
Posts: 5
Credit: 628,532
RAC: 0
Message 1827 - Posted: 29 Sep 2013, 8:40:53 UTC - in response to Message 1826.  

Last modified: 29 Sep 2013, 8:42:43 UTC
I'm not sure if I can help but...

There are a couple of possibilities concerning the avx feature. I know in Boinc version 7.0.64 when I use PrimeGrid, Windows does not report avx as a feature of the processor, even though I am running Windows 8 (when I initially start up Boinc it lists all the processor features in the log). My understanding is that Linux does not have this problem, even with Boinc 7.0.65 installed. So for PrimeGrid they devised a way to force the scheduler to give avx workunits to Windows computers. I'm not sure if that bug still applies to Boinc 7.2.16 or not. Also, Windows 7 supports avx if service pack 1 is installed. I believe it is enabled by default, but I could be wrong. I'm eager to try out the avx version of asteroid as well since the sse3 version is much faster than the stock app.
ID: 1827 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile HA-SOFT, s.r.o.
Project developer
Project tester

Send message
Joined: 21 Dec 12
Posts: 176
Credit: 136,462,135
RAC: 6
Message 1831 - Posted: 29 Sep 2013, 10:41:33 UTC - in response to Message 1827.  
Avx version is faster on intel haswell only. Detection is problem.. we are discussing it now.
ID: 1831 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
rebirther
Avatar

Send message
Joined: 18 Jun 12
Posts: 10
Credit: 111,120
RAC: 0
Message 1841 - Posted: 29 Sep 2013, 18:43:22 UTC

Last modified: 29 Sep 2013, 19:02:51 UTC
I have built an app_info file for those who want only crunch on avx on windows 64bit until a solution is found by admins. Before you use it download "period_search_10210_windows_x86_64__avx.exe" file.

<app_info>
<app>
    <name>period_search</name>
    <user_friendly_name>Period Search Application</user_friendly_name>
</app>
	<file_info>
		<name>period_search_10210_windows_x86_64__avx.exe</name>
		<executable/>
	</file_info>
	<app_version>
		<app_name>period_search</app_name>
		<version_num>1021</version_num>
  	    <avg_ncpus>1.00</avg_ncpus>
		<max_ncpus>1.00</max_ncpus> 
		<plan_class>avx</plan_class> 
		<file_ref>
			<file_name>period_search_10210_windows_x86_64__avx.exe</file_name>
			<main_program/>
		</file_ref>
	</app_version>
</app_info>


Lets see if its faster on an i5 than SSE3.

Edit:
Seems to be as fast as SSE3. Good luck if anyone has a Haswell CPU ;)
ID: 1841 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile HA-SOFT, s.r.o.
Project developer
Project tester

Send message
Joined: 21 Dec 12
Posts: 176
Credit: 136,462,135
RAC: 6
Message 1842 - Posted: 29 Sep 2013, 19:18:23 UTC - in response to Message 1841.  
Haswell can crunch one task in abou 720s with avx version.
ID: 1842 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
boinc127

Send message
Joined: 24 Jan 13
Posts: 5
Credit: 628,532
RAC: 0
Message 1844 - Posted: 29 Sep 2013, 22:58:47 UTC - in response to Message 1842.  

Last modified: 29 Sep 2013, 23:15:19 UTC
wow I definitely can't wait to try it out.

edit:

I know the admins at PrimeGrid@home allowed individual users to download the avx version by adjusting the PrimeGrid@home preferences page to allow users to force the server to download the avx program to the client computer with a checkbox so they could crunch with it, of course with the warning that you should not select the option unless your cpu has the avx function.
ID: 1844 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
k3ack3r

Send message
Joined: 31 Jul 13
Posts: 5
Credit: 21,236,337
RAC: 0
Message 1845 - Posted: 29 Sep 2013, 23:13:30 UTC - in response to Message 1842.  
So it is only for Haswell, not SB or IB? Have you tested it? Assuming you have, I guess.
ID: 1845 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
rebirther
Avatar

Send message
Joined: 18 Jun 12
Posts: 10
Credit: 111,120
RAC: 0
Message 1846 - Posted: 30 Sep 2013, 3:55:25 UTC - in response to Message 1845.  
So it is only for Haswell, not SB or IB? Have you tested it? Assuming you have, I guess.


Sandy Bridge is not affected. The temps were still normal.
ID: 1846 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Dagorath

Send message
Joined: 16 Aug 12
Posts: 293
Credit: 1,116,280
RAC: 0
Message 1847 - Posted: 30 Sep 2013, 4:27:29 UTC - in response to Message 1846.  

Last modified: 30 Sep 2013, 4:31:46 UTC
Temps were normal? Are you mixing French and English here? French "temps" translates to English "times" so I think you meant to say "the times were still normal".

@k3ack3r

It's possible to know the times would be normal even without testing. We know IB and SB don't have AVX therefore they cannot benefit from code designed to use AVX. At this time only Haswell CPUs have AVX therefore only Haswell CPUs will run the code faster.
ID: 1847 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile BilBg
Avatar

Send message
Joined: 19 Jun 12
Posts: 221
Credit: 623,640
RAC: 0
Message 1848 - Posted: 30 Sep 2013, 5:21:00 UTC - in response to Message 1847.  
We know IB and SB don't have AVX therefore they cannot benefit from code designed to use AVX.

"Upgraded features from Nehalem include:
Advanced Vector Extensions (AVX) 256-bit instruction set with wider vectors, new extensible syntax and rich functionality":
http://en.wikipedia.org/wiki/Sandy_Bridge

"Intel® AVX is a new-256 bit instruction set extension to Intel® SSE and is designed for applications that are Floating Point (FP) intensive.
It was released early 2011 as part of the Intel® microarchitecture code name Sandy Bridge processor family":
http://software.intel.com/en-us/intel-isa-extensions#pid-16007-1495





- ALF - "Find out what you don't do well ..... then don't do it!" :)
ID: 1848 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Kyong
Project administrator
Project developer
Project tester
Project scientist
Avatar

Send message
Joined: 9 Jun 12
Posts: 584
Credit: 52,667,664
RAC: 0
Message 1849 - Posted: 30 Sep 2013, 6:10:33 UTC
About the detection - scheduler works as it should. The problem is that the current version of BOINC manager doesn't detect avx on all processors that support it. Updated detection is already implemented in BOINC source codes (I don't know if it is already in branch) so when new version of BOINC manager is out it should detect avx on all processors (no related to version of os).
ID: 1849 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
terencewee*

Send message
Joined: 28 Jul 12
Posts: 17
Credit: 700,427
RAC: 0
Message 1850 - Posted: 30 Sep 2013, 8:14:41 UTC
the download URL is

http://asteroidsathome.net/boinc/download/
terencewee*
Sicituradastra.
ID: 1850 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
terencewee*

Send message
Joined: 28 Jul 12
Posts: 17
Credit: 700,427
RAC: 0
Message 1852 - Posted: 30 Sep 2013, 10:48:57 UTC

Last modified: 30 Sep 2013, 10:52:54 UTC
My friends, please beware AVX code on Hotwell = increased VID = more heat.
Mine is pulling 1.277V for 4.1GHz and temps shot up +10c (yeah, I'm lazy to optimize... :) ).

When your CPU gets too hot, thermal throttling will happen = lower performance.

If you have good CPU cooling, AVX shaves off a few mins (~3-4mins, x6 WU).

If not, it's best to run sse2/sse3 only.

As there is no way to control which version of the app gets sent to you, you can modify the following app_info.xml by removing AVX plan and stick it in your
BOINC-DATA/projects/asteroidsathome.net_boinc
folder

<app_info>
	<app>
		<name>period_search</name>
		<user_friendly_name>Asteroids</user_friendly_name>
	</app>

<!-- SSE2 -->
	<file_info>
		<name>period_search_10210_windows_x86_64__sse2.exe</name>
		<executable/>
	</file_info>
	<app_version>
		<app_name>period_search</app_name>
		<version_num>1021</version_num>
		<avg_ncpus>1.00</avg_ncpus>
		<max_ncpus>1.00</max_ncpus>
		<plan_class>sse2</plan_class>
		<file_ref>
			<file_name>period_search_10210_windows_x86_64__sse2.exe</file_name>
			<main_program/>
		</file_ref>
	</app_version>


<!-- SSE3 -->
	<file_info>
		<name>period_search_10210_windows_x86_64__sse3.exe</name>
		<executable/>
	</file_info>
	<app_version>
		<app_name>period_search</app_name>
		<version_num>1021</version_num>
		<avg_ncpus>1.00</avg_ncpus>
		<max_ncpus>1.00</max_ncpus>
		<plan_class>sse3</plan_class>
		<file_ref>
			<file_name>period_search_10210_windows_x86_64__sse3.exe</file_name>
			<main_program/>
		</file_ref>
	</app_version>


<!-- AVX -->
	<file_info>
		<name>period_search_10210_windows_x86_64__avx.exe</name>
		<executable/>
	</file_info>
	<app_version>
		<app_name>period_search</app_name>
		<version_num>1021</version_num>
		<avg_ncpus>1.00</avg_ncpus>
		<max_ncpus>1.00</max_ncpus>
		<plan_class>avx</plan_class>
		<file_ref>
			<file_name>period_search_10210_windows_x86_64__avx.exe</file_name>
			<main_program/>
		</file_ref>
	</app_version>

</app_info>




Happy crunching!


EDIT: Maybe Radim can possibly offer an option in Preferences page? "Enable AVX computing", perhaps?
terencewee*
Sicituradastra.
ID: 1852 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile HA-SOFT, s.r.o.
Project developer
Project tester

Send message
Joined: 21 Dec 12
Posts: 176
Credit: 136,462,135
RAC: 6
Message 1853 - Posted: 30 Sep 2013, 11:09:19 UTC - in response to Message 1852.  

Last modified: 30 Sep 2013, 11:09:34 UTC
If someone has Intel Haswell and run AVX version, please post your times.

Thanks
Z
ID: 1853 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
rebirther
Avatar

Send message
Joined: 18 Jun 12
Posts: 10
Credit: 111,120
RAC: 0
Message 1854 - Posted: 30 Sep 2013, 14:16:28 UTC - in response to Message 1847.  

Last modified: 30 Sep 2013, 14:16:38 UTC
Temps were normal? Are you mixing French and English here? French "temps" translates to English "times" so I think you meant to say "the times were still normal".

@k3ack3r

It's possible to know the times would be normal even without testing. We know IB and SB don't have AVX therefore they cannot benefit from code designed to use AVX. At this time only Haswell CPUs have AVX therefore only Haswell CPUs will run the code faster.


temps = temperatures ;)

SB and Ivy have of course AVX instructions and can be used very efficient with primegrid's llr.
ID: 1854 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
rebirther
Avatar

Send message
Joined: 18 Jun 12
Posts: 10
Credit: 111,120
RAC: 0
Message 1855 - Posted: 30 Sep 2013, 14:18:13 UTC - in response to Message 1849.  

Last modified: 30 Sep 2013, 14:18:29 UTC
About the detection - scheduler works as it should. The problem is that the current version of BOINC manager doesn't detect avx on all processors that support it. Updated detection is already implemented in BOINC source codes (I don't know if it is already in branch) so when new version of BOINC manager is out it should detect avx on all processors (no related to version of os).


Could be a problem, here is a i5 snippet from 7.2.16

Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 sse4_2 popcnt aes syscall nx lm vmx tm2 pbe
ID: 1855 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile HA-SOFT, s.r.o.
Project developer
Project tester

Send message
Joined: 21 Dec 12
Posts: 176
Credit: 136,462,135
RAC: 6
Message 1856 - Posted: 30 Sep 2013, 14:32:47 UTC - in response to Message 1855.  
AVX detection is in source code only, I think. No official build. The only way for now is to use app_info.

ID: 1856 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile den777

Send message
Joined: 22 Jun 13
Posts: 15
Credit: 3,118,560
RAC: 0
Message 1857 - Posted: 30 Sep 2013, 14:49:10 UTC - in response to Message 1853.  

Last modified: 30 Sep 2013, 14:52:40 UTC
If someone has Intel Haswell and run AVX version, please post your times.


i5-4670 - 966 sec.
http://asteroidsathome.net/boinc/workunit.php?wuid=7296844
http://asteroidsathome.net/boinc/workunit.php?wuid=7296827


sse3 was about 1170-1180 sec.
http://asteroidsathome.net/boinc/results.php?hostid=50173
ID: 1857 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile HA-SOFT, s.r.o.
Project developer
Project tester

Send message
Joined: 21 Dec 12
Posts: 176
Credit: 136,462,135
RAC: 6
Message 1858 - Posted: 30 Sep 2013, 14:59:25 UTC - in response to Message 1857.  
ID: 1858 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile den777

Send message
Joined: 22 Jun 13
Posts: 15
Credit: 3,118,560
RAC: 0
Message 1859 - Posted: 30 Sep 2013, 15:05:48 UTC - in response to Message 1858.  

Nice. As expected. Is it on stock clocks?

Yeah. No overclocking and TurboBoost disabled. Box cooler :(
ID: 1859 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
1 · 2 · 3 · 4 · Next

Message boards : Number crunching : avx feature/app