avx feature/app
Message boards :
Number crunching :
avx feature/app
Author | Message |
---|---|
Joined: 10 Jul 13 Posts: 21 Credit: 10,363,908 RAC: 2 |
My new AMD A10-7850K with Steamroller cores seems so far to be a little faster with the AVX version. Unfortunately, it currently gets only SSE2 WUs. Hello! If you run XP/W7/W8, you need an app_info.xml in order to receive AVX WUs. For example (this is the one I found on the forums and use with W7 / i7 4770K / HT OFF) Philippe `<app_info> <app> <name>period_search</name> <user_friendly_name>Asteroids</user_friendly_name> </app> <!-- AVX --> <file_info> <name>period_search_10210_windows_x86_64__avx.exe</name> <executable/> </file_info> <app_version> <app_name>period_search</app_name> <version_num>1021</version_num> <avg_ncpus>1.00</avg_ncpus> <max_ncpus>1.00</max_ncpus> <plan_class>avx</plan_class> <file_ref> <file_name>period_search_10210_windows_x86_64__avx.exe</file_name> <main_program/> </file_ref> </app_version> </app_info>` |
Joined: 10 Jul 13 Posts: 21 Credit: 10,363,908 RAC: 2 |
|
Joined: 13 Jan 13 Posts: 14 Credit: 149,243,286 RAC: 477 |
|
Joined: 21 Dec 12 Posts: 176 Credit: 136,462,135 RAC: 543 |
|
Joined: 18 Nov 13 Posts: 5 Credit: 200,400 RAC: 0 |
Last modified: 25 May 2015, 14:38:56 UTC Hello! Hi all, I'd like to see how the AVX version performs on my Windows 8.1 machine, so I tried the method described above (XML file in the project folder) to obtain that kind of WU, but it didn't work: I still get only SSE3 WUs, even though the AVX executable is also in the project folder. What's wrong? Features of my Haswell CPU: 25/05/2015 15:06:57 | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 fma cx16 sse4_1 sse4_2 movebe popcnt aes f16c rdrandsyscall nx lm avx avx2 vmx tm2 pbe fsgsbase bmi1 smep bmi2 |
Joined: 21 Dec 12 Posts: 176 Credit: 136,462,135 RAC: 543 |
|
Joined: 18 Nov 13 Posts: 5 Credit: 200,400 RAC: 0 |
Yes, I did. Likewise: I put it in the A@H project folder, the one containing the SSE and AVX executables. Yes, it was: I modified an app_config.xml file and renamed it, so the result is another XML file. I also suspended and then restarted the project after restarting the client and seeing no AVX WUs, but it didn't work. Still only SSE3 WUs. Anything else? Thanks for your reply. |
Joined: 18 Nov 13 Posts: 5 Credit: 200,400 RAC: 0 |
|
Joined: 18 Nov 13 Posts: 5 Credit: 200,400 RAC: 0 |
I finally managed to get AVX WUs. I copied the AVX executable to another folder, removed the project from BOINC Manager, restarted B.M., attached to the project (it immediately downloaded the SSE3 executable and a WU), suspended it, closed B.M., put the XML file and the AVX executable in the project folder, deleted the SSE executable from the project folder (leaving only app_info and the AVX executable), restarted B.M. (this time the event log recognized the presence of app_info; previously it didn't, I guess), restarted the project and finally got AVX WUs. The displayed ETA isn't reliable (2% done in 2 minutes, but the ETA is 244 h), but I assume it will get closer to reality after some WUs have completed. |
Joined: 13 Jan 13 Posts: 14 Credit: 149,243,286 RAC: 477 |
BTW, if I limit HT via the BIOS settings, this will impact other projects that benefit from HT, correct? On my old Xeon, initially with HT disabled, I observed a 5% RAC increase 30 days after enabling HT; could that be realistic? Would it be correct to say that highly optimized (double precision) apps do not benefit from HT, while less optimized ones benefit more from it? |
Joined: 21 Dec 12 Posts: 176 Credit: 136,462,135 RAC: 543 |
Last modified: 28 May 2015, 15:21:52 UTC |
Joined: 18 Nov 13 Posts: 5 Credit: 200,400 RAC: 0 |
Here are my results. Running one AVX WU at a time, the execution time drops to 66% of an SSE3 WU's execution time (I'm comparing WUs of the same length in GFLOPs) on a Windows 8.1 64-bit laptop with an Intel Core i7-4700MQ @ 2.4 GHz with Hyper-Threading on (4 cores, 8 threads) and Intel Turbo Boost on (up to 3.4 GHz on a single core, that is, at 20% max CPU usage). CPU usage wavers between 10% and 20%, but the clock can't reach its maximum because of overheating (temperatures fluctuate between 70 °C and 80 °C). Maybe with HT disabled the AVX version would become competitive on my laptop too: I usually compute three SSE3 WUs simultaneously in 2:15 hours, which is a little longer than the time needed to complete two AVX WUs in sequence with HT on, and without HT there should be an increase in performance. Unfortunately, on my Toshiba laptop it isn't possible to disable HT via the BIOS, so I can't do further tests. |
Joined: 13 Jan 13 Posts: 14 Credit: 149,243,286 RAC: 477 |
I'm trying an "AVX only" app_info on a Xeon E5-2620 (Win7 64-bit). Before, with SSE3 and ps_150618* WUs, CPU time was about 16000 sec; now, with AVX and ps_150619* WUs, CPU time is about 20000 sec. Is this due to differences between the WUs, or is SSE3 more efficient on some AVX-capable CPUs? Moreover, when using app_info the WU queue is always empty, so it seems that the CPU stays idle for many hours between WU downloads. |
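
Several posts above struggle with an app_info.xml that the client did not seem to pick up. As a rough aid, here is a minimal Python sketch that sanity-checks such a file before restarting the client. The function name `check_app_info` and the specific checks (well-formed XML, every `<file_ref>` declared in a `<file_info>`, the referenced executable actually present in the project folder) are assumptions chosen for illustration; they are not BOINC's own validation logic, and passing them does not guarantee the server will send AVX WUs.

```python
import os
import sys
import xml.etree.ElementTree as ET


def check_app_info(project_dir):
    """Return a list of problems found in project_dir/app_info.xml.

    Hypothetical helper: the checks are common-sense assumptions,
    not the BOINC client's actual rules. An empty list means none
    of these particular problems were found.
    """
    path = os.path.join(project_dir, "app_info.xml")
    if not os.path.isfile(path):
        # Also catches a file saved under another name, e.g. app_info.xml.txt
        return ["app_info.xml not found in " + project_dir]
    try:
        root = ET.parse(path).getroot()
    except ET.ParseError as exc:
        return ["app_info.xml is not well-formed XML: %s" % exc]

    problems = []
    if root.tag != "app_info":
        problems.append("root element is <%s>, expected <app_info>" % root.tag)

    # Names declared in <app> and <file_info> blocks
    app_names = {a.findtext("name", "").strip() for a in root.findall("app")}
    declared = {f.findtext("name", "").strip() for f in root.iter("file_info")}

    for version in root.iter("app_version"):
        app_name = version.findtext("app_name", "").strip()
        if app_name not in app_names:
            problems.append("app_version refers to unknown app: " + app_name)
        for ref in version.iter("file_ref"):
            fname = ref.findtext("file_name", "").strip()
            if fname not in declared:
                problems.append("file_ref has no matching <file_info>: " + fname)
            if not os.path.isfile(os.path.join(project_dir, fname)):
                problems.append("referenced file is missing from the project folder: " + fname)
    return problems


if __name__ == "__main__":
    # Usage: python check_app_info.py <path to the project folder>
    folder = sys.argv[1] if len(sys.argv) > 1 else "."
    for problem in check_app_info(folder):
        print(problem)
```

Run it with the project folder as the only argument; no output means none of the checked problems were found. It cannot tell whether the scheduler will actually hand out work for the `avx` plan class.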