Posts by cyrusNGC_224@P3D

21) (Message 2363)
Posted 4 Jan 2014 by cyrusNGC_224@P3D
Post:
New updated CUDA version has been released.


An improvement (18%) from (v100.00) ~9:40 to (v101.00) about 8h on my GT 635M!
22) (Message 2362)
Posted 4 Jan 2014 by cyrusNGC_224@P3D
Post:
2) Always cancel Astroid Cuda55 work units - you'll notice if one is computed by suddenly extreme slow mouse reactions. Tiresome, as you'll always have to press "cancel" if a Cuda55-WU starts.
Alternatively you add the line mentioned here [1] a place to remove them. That should prevent boinc from fetching NVIDIA work for asteroids.

[1] http://asteroidsathome.net/boinc/forum_thread.php?id=234&postid=2248
23) (Message 2324)
Posted 30 Dec 2013 by cyrusNGC_224@P3D
Post:
The performance on a highend card is also a bit disappointing. My GTX Titan seems to need nearly one hour for a workunit.

That's why i'm looking forward to the OpenCL version with the very much stronger AMD GPUs.
24) (Message 2268)
Posted 30 Dec 2013 by cyrusNGC_224@P3D
Post:
GTX 780Ti ~2250 s http://asteroidsathome.net/boinc/result.php?resultid=28194421
GTX Titan ~2300 s (recommended CUDA double precision disabled)
GTX 680 ~5300 s
GTX 650Ti ~8400 s
GTX 580 ~4200 s
GTX 570 ~3900-5000 s
GTX 480 ~4300-5300 s
GTX 560Ti ~5800 s
.
GTX 660M ~15000 s
GT 635M ~35000 s (Linux) http://asteroidsathome.net/boinc/result.php?resultid=27529265

Too slow than it is worth.
25) (Message 2250)
Posted 29 Dec 2013 by cyrusNGC_224@P3D
Post:
There's some trick to get wus?

Look at your boinc log.

Maybe you have the same problem as I had:
http://asteroidsathome.net/boinc/forum_thread.php?id=234
26) (Message 2248)
Posted 29 Dec 2013 by cyrusNGC_224@P3D
Post:
Problem solved!


It was, as I said, but something in the local xml files.
In the boinc directory are two almost(?) identical files: client_state_prev.xml, client_state.xml

I stopped boinc, edited the files and started boinc.
In section for asteroids i have removed this line:
    <no_rsc_apps>NVIDIA</no_rsc_apps>


Result:
So 29 Dez 2013 21:04:01 CET | Asteroids@home | update requested by user
So 29 Dez 2013 21:04:05 CET | Asteroids@home | Sending scheduler request: Requested by user.
So 29 Dez 2013 21:04:05 CET | Asteroids@home | Requesting new tasks for NVIDIA
So 29 Dez 2013 21:04:07 CET | Asteroids@home | Scheduler request completed: got 2 new tasks
So 29 Dez 2013 21:04:09 CET | Asteroids@home | Started download of period_search_10000_x86_64-pc-linux-gnu__cuda55
So 29 Dez 2013 21:04:09 CET | Asteroids@home | Started download of libcudart.so.5.5
So 29 Dez 2013 21:04:11 CET | Asteroids@home | Finished download of libcudart.so.5.5
So 29 Dez 2013 21:04:11 CET | Asteroids@home | Started download of input_63267_6
So 29 Dez 2013 21:04:12 CET | Asteroids@home | Finished download of input_63267_6
So 29 Dez 2013 21:04:12 CET | Asteroids@home | Started download of input_63268_7
So 29 Dez 2013 21:04:13 CET | Asteroids@home | Finished download of input_63268_7
So 29 Dez 2013 21:04:14 CET | Asteroids@home | Finished download of period_search_10000_x86_64-pc-linux-gnu__cuda55
So 29 Dez 2013 21:04:14 CET | Asteroids@home | Starting task ps_131225_63267_6_0 using period_search version 10000 (cuda55) in slot 6

27) (Message 2244)
Posted 29 Dec 2013 by cyrusNGC_224@P3D
Post:
I had reset A@H, but a NVIDIA app is still not retrieved.

So 29 Dez 2013 20:20:00 CET |  | Starting BOINC client version 7.0.65 for x86_64-pc-linux-gnu
So 29 Dez 2013 20:20:00 CET |  | Processor: 8 GenuineIntel Intel(R) Core(TM) i7-3632QM CPU @ 2.20GHz [Family 6 Model 58 Stepping 9]
So 29 Dez 2013 20:20:00 CET |  | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf eagerfpu pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm ida arat epb xsaveopt pln pts dtherm tpr_shadow vnmi flexpriority ept vpid fsgsbase smep erms
So 29 Dez 2013 20:20:00 CET |  | OS: Linux: 3.12.5-i7
So 29 Dez 2013 20:20:00 CET |  | Memory: 7.72 GB physical, 9.77 GB virtual
So 29 Dez 2013 20:20:00 CET |  | CUDA: NVIDIA GPU 0: GeForce GT 635M (driver version unknown, CUDA version 6.0, compute capability 2.1, 2048MB, 2032MB available, 182 GFLOPS peak)
So 29 Dez 2013 20:20:00 CET |  | OpenCL: NVIDIA GPU 0: GeForce GT 635M (driver version 331.20, device version OpenCL 1.1 CUDA, 2048MB, 2032MB available, 182 GFLOPS peak)
[...]
So 29 Dez 2013 20:30:04 CET | Asteroids@home | Resetting project
So 29 Dez 2013 20:30:17 CET | Asteroids@home | update requested by user
So 29 Dez 2013 20:30:20 CET | Asteroids@home | Master file download succeeded
So 29 Dez 2013 20:30:25 CET | Asteroids@home | Sending scheduler request: Requested by user.
So 29 Dez 2013 20:30:25 CET | Asteroids@home | Requesting new tasks for CPU
So 29 Dez 2013 20:30:32 CET | Asteroids@home | Scheduler request completed: got 26 new tasks
So 29 Dez 2013 20:30:34 CET | Asteroids@home | Started download of period_search_10210_x86_64-pc-linux-gnu__avx
[...]
28) (Message 2237)
Posted 29 Dec 2013 by cyrusNGC_224@P3D
Post:
@Pooh Bear 27 my installation is amd64 (64bit) linux.

@Cirrussc,

Did you edit your website preferences and select "Use GPU"?
Yes.
I do not know exactly how boinc works.
Boinc has probably even specified that of a@h is not to get gpu work.
So I just have to know how to force boinc, but to ask for gpu work.

I have already searched all the local xml files.

So 29 Dez 2013 19:19:45 CET | Asteroids@home | Requesting new tasks for CPU
29) (Message 2229)
Posted 29 Dec 2013 by cyrusNGC_224@P3D
Post:
After project update boinc still gets no work for the NVIDIA gpu.

In manager still displayed:
Don't fetch tasks for NVIDIA GPU: Project has no apps for NVIDIA GPU


installed: GeForce GT 635M, NVIDIA 331.20, Linux amd64

what to do?

A@H news for new apps:
http://asteroidsathome.net/boinc/forum_thread.php?id=233
30) (Message 2206)
Posted 20 Dec 2013 by cyrusNGC_224@P3D
Post:
My raspberry is now at 40% and 120 hours. The Work Unit will be finished in appoximate 180 hours, but it should be finished in 110 hour.
Such a misery - we will be too late.

Rooooobert


My raspi did longer task in 447,546.86 s. It's overclocked to 1GHz.
My RPi:
604,307.00 s @950 MHz OC.

31) (Message 2089)
Posted 23 Nov 2013 by cyrusNGC_224@P3D
Post:
Thanks for pointing on 780ti dp speed. I thought it's like titan and it is not.

We need cuda 5.0 and better because of linker. I think cuda version do not affect speed on older cards. It's more related to code than to cuda version.


Ok. So let's be honest... If we're porting asteroids to gpu, the only way to go is opencl! That will of course take into account that all nvidia gpus except the titan suck at DP...

So no need to waste energy in porting the app to cuda when we already know that asteroids will be dominated by ATI/AMD gpus because those are less crippled regarding DP performance.

To face the fact, Asteroids will be a "second" Milkyway@home regarding GPU dominance.

AMD/ATI GPUs will dominate and nvidia will be left biting the dust.

Yes. The best SP/DP (1/4) ratio is provided by the relatively cheap Tahiti GPUs (AMD) only (professional cards excluded). The Hawaii Chip of the new R200 series is here crippled again (1/8)!
My 7870 Boost Edition (Tahiti LE) is best utilized by the milkyway OpenCL apps (100%, especially under Linux!).

The strength of the double precision Tahiti GPUs shows up in the first pages (!) of the best computer here:
http://milkyway.cs.rpi.edu/milkyway/top_hosts.php
http://einstein.phys.uwm.edu/top_hosts.php


Accordingly, it is good to hear that the A&H GPU apps are making progress.
32) (Message 1937)
Posted 14 Oct 2013 by cyrusNGC_224@P3D
Post:
Great.
33) (Message 1903)
Posted 8 Oct 2013 by cyrusNGC_224@P3D
Post:
"ChertseyAl" wrote:
Machine 'L' P4 XP32

v100.00 43956
v101.00 (sse2) 6054
v102.10 12853
v102.10 (sse2) 13650 SLOWER


Machine 'B' P4HT XP32

v100.00 48963
v101.00 (sse2) 9341
v102.10 18592
v102.10 (sse2) 18779 SLOWER


Basically, the new versions are at best no faster, and in 2 cases much slower.

It seems that this slower through hyper-threading. Limit the multicore utilization to 50% (one core).
34) (Message 1837)
Posted 29 Sep 2013 by cyrusNGC_224@P3D
Post:
I too noticed a big increase in estimated completion times shown by BOINC.
It seems that the reason is an increased <rsc_fpops_est> value in the init_data.xml file for the new WUs.

<rsc_fpops_est>315010000000000.000000</rsc_fpops_est>


That results in 315010 GFLOPS. AFAIK the old value was something like 10000(?) GFLOPS.
No.
35) (Message 1548)
Posted 20 Aug 2013 by cyrusNGC_224@P3D
Post:
Kyong is testing SSE2 now. I'm working on standard app now (some backports from sse3 version) as preparation step for nVidia CUDA development.
Only CUDA, no OpenCL?
36) (Message 1475)
Posted 10 Aug 2013 by cyrusNGC_224@P3D
Post:
Any idea why the linux64 app isn't 2x faster? There's an improvement, but not a 2 fold decrease in crunch time like the windows app.
Hyper Threading.


My crunch times:
Raspbian 2.6.11+, RaspberryPI@950MHz: 55h ... 63h to 45h
Debian Squeeze 2.6.32 i686, AMD Geode LX@500MHz: 90h ... 150h to <in-progress>
Debian Squeeze 3.2.0 amd64, AMD X2 4600+@2,4GHz(SSE3): 4,3h ... 5h to 3,1h
Debian Wheezy 3.2.0 amd64, Intel Core 2 6400@2,13GHz(SSE3): 5h ... 5,8h to 2,3h
Debian Lenny 2.6.39 i686, Intel P4 M@2GHz(SSE2): 11,1h to 5,9h
Debian Wheezy 3.2.0 i486, Intel Celeron M@1,4GHz(SSE2): 7,7h to 3,8h
Debian Wheezy 3.2.41 i686, Intel P3-S@1,4GHz(SSE): 8h to 4,4h
Debian Wheezy 3.2.0 amd64, AMD X6 1090T@3,8GHz(SSE3): 2,7h to 1,22h
Debian Wheezy 3.10.3 amd64, Intel i7-3632QM@2,88GHz(AVX): 2,5h ... 4h to 1,58h
37) (Message 1442)
Posted 3 Aug 2013 by cyrusNGC_224@P3D
Post:
It would be really nice if we could get nearly as well optimized applications like this: http://asteroidsathome.net/boinc/results.php?hostid=37108
To 8 times faster than my shortest periods (~1200 sec vs. ~9800 sec)!


Previous 20