Computation errors
Message boards :
Problems and bug reports :
Computation errors
Author | Message |
---|---|
Send message Joined: 28 Sep 13 Posts: 29 Credit: 120,376,940 RAC: 12,347 |
|
Send message Joined: 17 Nov 13 Posts: 3 Credit: 8,356,053 RAC: 1,630 |
Heh, something odd happened a few hours ago: my i5 on Linux started completing WUs in 1.5 minutes instead of the normal 1.2 hours. I guess that's related. They get validated, so I guess they are OK. These are with the app "Period Search Application v102.10 (avx)" and WUs named ps_160915_input.. Very weird.. ;-P |
Send message Joined: 30 Aug 16 Posts: 2 Credit: 31,691,520 RAC: 0 |
Yesterday I added six computers to my account, and today I was very pleasantly surprised by my statistics. I was then pleasantly shocked, and later sadly so, to see the rate at which my stats kept increasing... There are two problems (maybe two different symptoms of the same issue): A. Several computation errors. On one computer that I watched, the tasks barely get started before they are marked as failures. Not all tasks fail, but I've got more than 200 tasks labeled with Error in just a day. B. Extremely fast completion times. I have several computers running through tasks in 150-300 s when they should take more like 4000-8000 s. I've seen it happen with the AVX tasks. They do get validated though...! I'm guessing the results are rubbish? What will happen to the stats? Will someone eventually clean out these short-run-time tasks? |
Send message Joined: 28 Sep 13 Posts: 29 Credit: 120,376,940 RAC: 12,347 |
|
Send message Joined: 29 Aug 16 Posts: 3 Credit: 18,415,150 RAC: 1,721 |
Yep, also a bunch of failures here. Some with SSE2, others with SSE3. But these are only a handful, most runs complete successfully. As for the short run times, I do get those occasionally. I also had one with CUDA that completed in 172 seconds! http://asteroidsathome.net/boinc/workunit.php?wuid=52894026 It was validated with AVX by someone else who did it in ~300 secs. Maybe it's just a short WU, and not a bug. |
Send message Joined: 28 Sep 13 Posts: 29 Credit: 120,376,940 RAC: 12,347 |
|
Send message Joined: 19 Sep 16 Posts: 3 Credit: 81,600 RAC: 0 |
Last modified: 24 Sep 2016, 23:32:22 UTC |
Send message Joined: 29 Aug 16 Posts: 3 Credit: 18,415,150 RAC: 1,721 |
|
Send message Joined: 19 Sep 16 Posts: 3 Credit: 81,600 RAC: 0 |
Last modified: 26 Sep 2016, 2:40:48 UTC I guess they're old. Here's one that failed: http://asteroidsathome.net/boinc/workunit.php?wuid=52226661 I'll just let them error out and see if I start getting some good ones. The problem with that, though, is that I tried it before, and one task, instead of erroring out immediately (no big deal), actually kept my GPU at 100% for over 14 hours... that is some SERIOUS electricity wasted. |
Send message Joined: 19 Sep 16 Posts: 3 Credit: 81,600 RAC: 0 |
|
Send message Joined: 28 Sep 13 Posts: 29 Credit: 120,376,940 RAC: 12,347 |
|
Send message Joined: 10 Apr 15 Posts: 1 Credit: 17,308,320 RAC: 0 |
|
Send message Joined: 1 Oct 14 Posts: 1 Credit: 8,228,849 RAC: 0 |
Last modified: 28 Sep 2016, 7:09:32 UTC I am getting 24 errors at a time, likely falsely reported as computation errors. The error occurs the moment the work unit has finished downloading, not when actual computation starts: this laptop can only process 4 work units at a time and is, at the moment, processing the last 2 error-free work units and 2 World Community Grid work units. (I suppose it is possible that BOINC stops processing the WCG units, given the high priority I have given Asteroids@Home, and that the errors occur immediately then.)
|
Send message Joined: 25 Jul 14 Posts: 64 Credit: 100,582,080 RAC: 0 |
I attached one fresh "Microsoft Windows running on an AMD x86_64 or Intel EM64T CPU" host to see how CPU tasks are doing at the moment. All tasks with the SSE2 and SSE3 applications finish with an immediate computation error. Tasks with the AVX application complete unnaturally fast, but without error. I can't see yet if they will validate successfully. |
Send message Joined: 25 Jul 14 Posts: 64 Credit: 100,582,080 RAC: 0 |
"I can't see yet if they will validate successfully." Yes, they seem to validate successfully. A task that now takes a couple of minutes to complete is treated the same as the old tasks that used to take hours to complete. So the current situation is basically damaging the long-time "badge system" by heavily distorting how much work is needed to earn a badge. |
Send message Joined: 9 Jun 12 Posts: 584 Credit: 52,667,664 RAC: 0 |
|
Send message Joined: 31 Aug 16 Posts: 1 Credit: 237,120 RAC: 0 |
Last modified: 6 Oct 2016, 3:17:13 UTC I thought replacing my old graphics card with a new one caused it, but it seems that this has been happening to me since I started back on Asteroids@home a few days ago. Before the new card had arrived. Went from an nVidia GTX 460 to an nVidia GTX 1060. GPU GTX 1060 wrote: <core_client_version>7.6.22</core_client_version> GPU GTX 460 wrote: <core_client_version>7.6.22</core_client_version> I've also had a few CPU access violations or unknown errors, but I'm primarily concerned about GPU for now since it's rejecting them all flat-out. As far as I can tell, I can still do CPU fine. Historically, the GTX 460 worked fine. Or rather, stderr said roughly the same thing but the exit status was success. The only difference was that instead of "Unsupported CC detected (CC2.0 and better supported only)" it said "05:15:13 (8272): called boinc_finish" or a similar string. Please let me know if there's anything I can do to help. |
Send message Joined: 3 Aug 16 Posts: 19 Credit: 54,746,634 RAC: 5,350 |
|
Send message Joined: 16 Jan 23 Posts: 2 Credit: 3,086,200 RAC: 3,625 |
I appear to be causing computation errors when I run and debug a C program. Asteroids seems to run fine when left alone. I recently looked at all sixteen Asteroids tasks running on my computer and all of them were doing fine. After running my C program, several of the tasks had computation errors. Am I causing those errors? If so, what can I do to fix the problem? Thank you.
|
Send message Joined: 16 Nov 22 Posts: 127 Credit: 123,185,342 RAC: 384,897 |
|
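
The symptoms reported in this thread (tasks finishing in 150-300 s instead of 4000-8000 s, and GPU tasks whose stderr contains "Unsupported CC detected") can be triaged with a small script. The following is only a rough sketch under stated assumptions: the `Task` record, its field names, the example task names, and the 10% threshold are hypothetical and are not part of BOINC or the Asteroids@home applications. Only the run-time figures and the two stderr strings come from the posts above.

```python
from dataclasses import dataclass

# Figure quoted in the thread: normal tasks take roughly 4000-8000 s.
EXPECTED_MIN_SECONDS = 4000.0

# stderr fragments quoted in the thread.
UNSUPPORTED_CC = "Unsupported CC detected"
FINISHED_OK = "called boinc_finish"


@dataclass
class Task:
    """Hypothetical record for one task; field names are illustrative."""
    name: str
    run_time_seconds: float
    stderr: str = ""


def classify(task: Task, short_fraction: float = 0.1) -> str:
    """Return a rough label for a task.

    short_fraction is an arbitrary assumption: a run shorter than 10% of
    the shortest normal run time is labelled suspiciously short.
    """
    if UNSUPPORTED_CC in task.stderr:
        return "unsupported-gpu"
    if task.run_time_seconds < EXPECTED_MIN_SECONDS * short_fraction:
        return "suspiciously-short"
    if FINISHED_OK in task.stderr:
        return "finished"
    return "unknown"


# Made-up example tasks, not real work units.
tasks = [
    Task("example-1", 5200.0, "05:15:13 (8272): called boinc_finish"),
    Task("example-2", 172.0, "called boinc_finish"),
    Task("example-3", 2.0,
         "Unsupported CC detected (CC2.0 and better supported only)"),
]
labels = [classify(t) for t in tasks]
for task, label in zip(tasks, labels):
    print(f"{task.name}: {label}")
```

A short run time is not proof of a bad result; as one poster notes, it may simply be a short WU, so the label is only a prompt to look at the task more closely.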