Download errors are back


Ted

Joined: 16 Nov 14
Posts: 1
Credit: 77,760
RAC: 0
Message 3838 - Posted: 30 Nov 2014, 20:20:21 UTC
I've been getting many of these today as well
ID: 3838 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
hoolibas

Joined: 21 Nov 14
Posts: 1
Credit: 864,480
RAC: 0
Message 3849 - Posted: 4 Dec 2014, 6:58:03 UTC
I've been getting a few 1-3 a day (that I notice). This is the event log entry of one of the recent ones:

12/3/2014 11:53:40 PM | Asteroids@home | Giving up on download of input_44151_7: permanent HTTP error
ID: 3849 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile mikey
Joined: 1 Jan 14
Posts: 300
Credit: 32,048,340
RAC: 14,845
Message 3853 - Posted: 4 Dec 2014, 12:40:20 UTC - in response to Message 3849.  
I've been getting a few 1-3 a day (that I notice). This is the event log entry of one of the recent ones:

12/3/2014 11:53:40 PM | Asteroids@home | Giving up on download of input_44151_7: permanent HTTP error


It's not really a problem, more of a pain in the neck, as the project just has to send another unit because that one didn't get through to you. Since this is a long-standing problem here, I'm thinking that the project has a standard home-type internet connection, not a metered business one. A business connection can stay with you even through bad moments, while a home one can drop the whole transfer if the connection goes bad. A business connection is also more expensive.
ID: 3853 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile cliff
Joined: 19 Nov 14
Posts: 93
Credit: 30,066,240
RAC: 0
Message 3856 - Posted: 4 Dec 2014, 15:14:38 UTC - in response to Message 3853.  
Hi Mikey,
Recent weather might have something to do with it as well.
I have a 107Mbit down & 6.1Mbit up service; today that's down to 93Mbit. My guess is that quite a lot of routine maintenance doesn't get done wrt making sure CCPs and joints are waterproofed.

I'm not on a business service, mine is domestic, but it's usually more reliable than at present. Firms, incl. ISPs, are all trying to cut costs. Some are daft enough to cut maintenance staff first, then they discover that one person can't be in five places at the same time. But by then it's too late to hire and train new staff.
I've had one ISP go under that way.

Regards,
Cliff
ID: 3856 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jeff Buck

Joined: 23 Nov 14
Posts: 2
Credit: 10,952,103
RAC: 0
Message 3860 - Posted: 5 Dec 2014, 5:00:56 UTC - in response to Message 3853.  
I've been getting a few 1-3 a day (that I notice). This is the event log entry of one of the recent ones:

12/3/2014 11:53:40 PM | Asteroids@home | Giving up on download of input_44151_7: permanent HTTP error


It's not really a problem, more of a pain in the neck, as the project just has to send another unit because that one didn't get through to you. Since this is a long-standing problem here, I'm thinking that the project has a standard home-type internet connection, not a metered business one. A business connection can stay with you even through bad moments, while a home one can drop the whole transfer if the connection goes bad. A business connection is also more expensive.

Seems to me it's more than just a pain in the neck. True, for each individual user/host it may just be a minor nuisance, but from what I've seen in the last several days, once one task for a WU gets a download error, all the resends get essentially the same error, resulting in the entire WU finally ending up in a "Too many errors (may have bug)" status. That often leaves one host, which actually successfully processed the unit, with zero credit for the work done, such as on this WU, and the project with no usable results for that WU.

Last weekend I only got a few of the download errors, but over the last few days they seem to be hitting around 40% of the tasks sent. It's not just the "permanent HTTP error" either. Many are checksum errors, like this recent one:

12/4/2014 8:10:26 PM | Asteroids@home | Started download of input_43848_5
12/4/2014 8:10:27 PM | Asteroids@home | Finished download of input_43848_5
12/4/2014 8:10:27 PM | Asteroids@home | [error] MD5 check failed for input_43848_5
12/4/2014 8:10:27 PM | Asteroids@home | [error] expected c64f0e4a6d04f542258293e8e22d10bb, got 764d56d0874e7d21f8e56fea9faa142b
12/4/2014 8:10:27 PM | Asteroids@home | [error] Checksum or signature error for input_43848_5

or this one:

12/4/2014 8:31:11 PM | Asteroids@home | Started download of input_43869_10
12/4/2014 8:31:16 PM | Asteroids@home | Finished download of input_43869_10
12/4/2014 8:31:16 PM | Asteroids@home | [error] File input_43869_10 has wrong size: expected 20889, got 20890
12/4/2014 8:31:16 PM | Asteroids@home | [error] Checksum or signature error for input_43869_10

I have no idea what ultimately happens to these WUs that max out on errors, but it doesn't seem like it would be particularly good for the project, and they're really starting to pile up.
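
For anyone who wants to reproduce the two checks behind the log lines above, here is a minimal sketch in Python. It is not the BOINC client's code: the function name `check_download` and its arguments are made up for illustration, and it only assumes that the client compares the received byte count and MD5 digest against the values the server advertised.

```python
import hashlib


def check_download(data: bytes, expected_size: int, expected_md5: str) -> list[str]:
    """Return a list of problems found; an empty list means the file looks intact."""
    problems = []

    # Size check: corresponds to "has wrong size: expected N, got M".
    if len(data) != expected_size:
        problems.append(f"wrong size: expected {expected_size}, got {len(data)}")

    # Digest check: corresponds to "MD5 check failed ... expected X, got Y".
    actual_md5 = hashlib.md5(data).hexdigest()
    if actual_md5 != expected_md5.lower():
        problems.append(f"MD5 check failed: expected {expected_md5}, got {actual_md5}")

    return problems


# Hypothetical payload standing in for a downloaded input file.
payload = b"example input file contents"
good_md5 = hashlib.md5(payload).hexdigest()

print(check_download(payload, len(payload), good_md5))         # no problems
print(check_download(payload + b"\n", len(payload), good_md5))  # one extra byte
```

A file that is one byte too long, as in the second log excerpt, fails both checks in this sketch, which would be consistent with the file on the server having changed after its size and checksum were recorded.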
ID: 3860 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile mikey
Joined: 1 Jan 14
Posts: 300
Credit: 32,048,340
RAC: 14,845
Message 3863 - Posted: 5 Dec 2014, 12:08:36 UTC - in response to Message 3860.  

I have no idea what ultimately happens to these WUs that max out on errors, but it doesn't seem like it would be particularly good for the project, and they're really starting to pile up.


The units just get recycled and resent to someone else, it's not really a problem as the Server does it all automatically.
ID: 3863 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Tobben

Joined: 17 Apr 14
Posts: 5
Credit: 10,440,960
RAC: 0
Message 3864 - Posted: 5 Dec 2014, 16:49:44 UTC - in response to Message 3863.  

Last modified: 5 Dec 2014, 16:51:48 UTC
I am getting the same error, and it is a problem because large amounts of processing time are lost/wasted.

I run 48 projects at a time on my 4 processors. When the client fails to download more than half of the WUs, I have to sit here and manually make the project re-sync 5 times to at least try to stack up a few WUs. When I get home my server will be at 0% load and I have to start over again.

I usually have about 100 projects, if not more, running/waiting, so you can probably imagine this is pretty hopeless.
ID: 3864 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Tobben

Joined: 17 Apr 14
Posts: 5
Credit: 10,440,960
RAC: 0
Message 3867 - Posted: 6 Dec 2014, 12:36:06 UTC
I am going back to folding in the meantime.
I would love to know when this issue is fixed.
Running asteroids@home currently isn't feasible.
ID: 3867 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile mikey
Joined: 1 Jan 14
Posts: 300
Credit: 32,048,340
RAC: 14,845
Message 3869 - Posted: 6 Dec 2014, 12:47:50 UTC - in response to Message 3864.  
I am getting the same error, and it is a problem because large amounts of processing time are lost/wasted.

I run 48 projects at a time on my 4 processors. When the client fails to download more than half of the WUs, I have to sit here and manually make the project re-sync 5 times to at least try to stack up a few WUs. When I get home my server will be at 0% load and I have to start over again.

I usually have about 100 projects, if not more, running/waiting, so you can probably imagine this is pretty hopeless.


If you have "100 projects" lined up then your server should NEVER be out of work: when one project doesn't give it any, the next one, or the one after that, and so on, should. If you really have that many active projects AND you are sometimes out of work, then you have a setting messed up someplace.
ID: 3869 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile BilBg
Joined: 19 Jun 12
Posts: 221
Credit: 623,640
RAC: 0
Message 3870 - Posted: 6 Dec 2014, 16:30:54 UTC - in response to Message 3863.  

Last modified: 6 Dec 2014, 16:47:53 UTC

I have no idea what ultimately happens to these WUs that max out on errors ...

The units just get recycled and resent to someone else, it's not really a problem as the Server does it all automatically.

No, they are not.

No more tasks are sent for a WU which is at "max # of error / Too many errors (may have bug)"
http://asteroidsathome.net/boinc/workunit.php?wuid=20964261

Those with Status "Completed, can't validate" are in this state (not my tasks):
http://asteroidsathome.net/boinc/results.php?hostid=93179&offset=0&show_names=0&state=5&appid=


Since one task was completed OK the WU file was on the server at that time.
I think there is some wrong setting on the server which deletes files after a period of time even if those files are still needed.
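
To make the failure mode concrete, here is a toy model in Python of how a workunit can end up abandoned with one good result stranded. It is only a sketch under assumptions: `WorkUnit`, `error_limit` and `quorum` are hypothetical names and values, not the server's actual fields or this project's settings.

```python
from dataclasses import dataclass


@dataclass
class WorkUnit:
    error_limit: int    # hypothetical per-WU cap on failed tasks
    quorum: int         # hypothetical number of good results needed to validate
    errors: int = 0
    successes: int = 0

    def record(self, ok: bool) -> None:
        """Register the outcome of one task sent out for this workunit."""
        if ok:
            self.successes += 1
        else:
            self.errors += 1

    def state(self) -> str:
        if self.successes >= self.quorum:
            return "validated"
        if self.errors >= self.error_limit:
            return "too many errors"   # no further tasks are sent
        return "needs more tasks"


wu = WorkUnit(error_limit=3, quorum=2)
wu.record(ok=True)          # one host downloaded the input and finished the task
for _ in range(3):          # every resend then hits a download error
    wu.record(ok=False)

print(wu.state(), "with", wu.successes, "unvalidated success")
```

In this model the single successful result can never be validated, because the workunit stops generating tasks before a second good result arrives.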



- ALF - "Find out what you don't do well ..... then don't do it!" :)
ID: 3870 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile BilBg
Joined: 19 Jun 12
Posts: 221
Credit: 623,640
RAC: 0
Message 3871 - Posted: 6 Dec 2014, 16:35:08 UTC - in response to Message 3869.  

Last modified: 6 Dec 2014, 16:35:52 UTC
I usually have about 100 projects if not more running/waiting ...

If you have a "100 projects" lined up then your server should NEVER be out of work ...

I'm sure you know he (obviously) doesn't know what a project is and what a task is ;)



- ALF - "Find out what you don't do well ..... then don't do it!" :)
ID: 3871 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Tobben

Joined: 17 Apr 14
Posts: 5
Credit: 10,440,960
RAC: 0
Message 3873 - Posted: 7 Dec 2014, 11:08:43 UTC - in response to Message 3871.  

Last modified: 7 Dec 2014, 11:10:21 UTC
Well, I used the wrong terminology. But as I am specifically talking about asteroids@home, I don't think it would be impossible to deduce that I did indeed mean tasks/WUs and not different individual projects.

It's just a bad habit from folding.

Either way, this is a problem for me, to the extent that I simply can't run asteroids@home until this download problem is sorted. No matter how many times I try, I can never download enough tasks to keep the server busy.
ID: 3873 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile BilBg
Joined: 19 Jun 12
Posts: 221
Credit: 623,640
RAC: 0
Message 3874 - Posted: 7 Dec 2014, 11:44:27 UTC - in response to Message 3873.  
i can never download enough tasks to keep the server busy.

Judging by 'Average turnaround time' of ~1 day:
http://asteroidsathome.net/boinc/host_app_versions.php?hostid=93179

... you have set your WU cache to 1 day.
Try setting it to e.g. 5 + 0.1 days (this is a global setting = total work to keep on board from all the projects attached to this computer).

To set this only for this computer you may use Local preferences.

I don't remember whether BOINC version 7.0.65 has bugs in deciding when and how much work to ask for; you may want to try a newer BOINC.
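
If you prefer a file over the Manager dialog, local preferences can also be written as a small XML override. The Python sketch below only builds that XML as a string. The element names `work_buf_min_days` and `work_buf_additional_days` are given as I recall them from the client's local override file (`global_prefs_override.xml`), so treat them as an assumption and check the client documentation for your BOINC version. Mapping "5 + 0.1" to minimum plus additional days is likewise an assumption.

```python
import xml.etree.ElementTree as ET


def build_override(min_days: float, extra_days: float) -> str:
    """Build a local-preferences override holding only the work-cache settings."""
    root = ET.Element("global_preferences")
    # Assumed element names; verify against your client's documentation.
    ET.SubElement(root, "work_buf_min_days").text = str(min_days)
    ET.SubElement(root, "work_buf_additional_days").text = str(extra_days)
    return ET.tostring(root, encoding="unicode")


# The "5 + 0.1 days" suggestion, read as minimum 5 days plus 0.1 additional.
xml_text = build_override(5.0, 0.1)
print(xml_text)
```

The client would need to re-read its preferences (or be restarted) before a file like this takes effect.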



- ALF - "Find out what you don't do well ..... then don't do it!" :)
ID: 3874 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Kyong
Project administrator
Project developer
Project tester
Project scientist
Joined: 9 Jun 12
Posts: 584
Credit: 52,667,664
RAC: 0
Message 3877 - Posted: 9 Dec 2014, 10:31:33 UTC
Hi, I am aware of this, but unfortunately I am dealing with another problem right now. I am changing jobs and temporarily have two jobs at once, so things are very complicated for me at the moment. From January I will have only one job, so I will be able to take much better care of the server again. I am sorry about this. I will try to solve the download errors as soon as possible.
ID: 3877 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Tobben

Joined: 17 Apr 14
Posts: 5
Credit: 10,440,960
RAC: 0
Message 3878 - Posted: 9 Dec 2014, 17:40:26 UTC - in response to Message 3877.  

Last modified: 9 Dec 2014, 17:40:56 UTC
Thanks for the update, and good luck with sorting out your issues.
Are you the only one doing maintenance/upkeep on this project though?
ID: 3878 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Kyong
Project administrator
Project developer
Project tester
Project scientist
Joined: 9 Jun 12
Posts: 584
Credit: 52,667,664
RAC: 0
Message 3883 - Posted: 12 Dec 2014, 14:43:43 UTC
Unfortunately yes.
ID: 3883 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
[AF>Le_Pommier] Jerome_C2005

Joined: 21 Apr 13
Posts: 5
Credit: 2,074,191
RAC: 862
Message 3897 - Posted: 29 Dec 2014, 13:52:49 UTC
Hello

I just realized I have 8 WUs that have been failing to upload since mid-December. I found this thread, where I can see that there are indeed issues with uploads/downloads of WUs, and also that Kyong has another kind of workload issue :)

So I hope you can solve both issues and wish you the best for the new year to come !

(and I hope that they won't be "lost" since their deadline was set on 27/12/2014...)

Cheers !
ID: 3897 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Richie

Joined: 25 Jul 14
Posts: 64
Credit: 100,582,080
RAC: 0
Message 4154 - Posted: 18 Mar 2015, 2:37:14 UTC
Download errors are producing digital waste again. Many tasks are ending up like this:

Exit status -186 (0xffffffffffffff46) ERR_RESULT_DOWNLOAD


Stderr output

<core_client_version>7.4.42</core_client_version>
<![CDATA[
<message>
WU download error: couldn't get input files:
<file_xfer_error>
<file_name>input_1417_50</file_name>
<error_code>-200 (wrong size)</error_code>
</file_xfer_error>

</message>
]]>
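
A side note on the exit status line: the long hex value is the same number as -186, shown as an unsigned 64-bit integer (two's complement). A quick Python check, with `as_unsigned64` being just an illustrative helper name:

```python
def as_unsigned64(code: int) -> str:
    """Show a signed exit code the way it looks as an unsigned 64-bit hex value."""
    return hex(code & 0xFFFFFFFFFFFFFFFF)


def as_signed64(value: int) -> int:
    """Inverse direction: interpret an unsigned 64-bit value as a signed integer."""
    return value - (1 << 64) if value >= (1 << 63) else value


hex_form = as_unsigned64(-186)
print(hex_form)                        # low byte is 0x46, as in the task report
print(as_signed64(int(hex_form, 16)))  # back to -186
```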
ID: 4154 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Kyong
Project administrator
Project developer
Project tester
Project scientist
Joined: 9 Jun 12
Posts: 584
Credit: 52,667,664
RAC: 0
Message 4157 - Posted: 18 Mar 2015, 12:21:12 UTC
These are the last of the old workunits. Just cancel or ignore them. New workunits should be fine.
ID: 4157 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
nanoprobe

Joined: 15 Jan 13
Posts: 12
Credit: 904,320
RAC: 0
Message 4163 - Posted: 19 Mar 2015, 18:31:51 UTC
222 download errors in the last 2 days on 102.10 SSE2 app.


1568 Asteroids@home 3/19/2015 8:11:09 AM [error] File input_2706_79 has wrong size: expected 68619, got 0
1569 Asteroids@home 3/19/2015 8:11:09 AM [error] Checksum or signature error for input_2706_79
1570 Asteroids@home 3/19/2015 8:11:09 AM [error] File input_2762_32 has wrong size: expected 62946, got 0
1571 Asteroids@home 3/19/2015 8:11:09 AM [error] Checksum or signature error for input_2762_32
1572 Asteroids@home 3/19/2015 8:11:09 AM [error] File input_2727_84 has wrong size: expected 63174, got 0
1573 Asteroids@home 3/19/2015 8:11:09 AM [error] Checksum or signature error for input_2727_84
1574 Asteroids@home 3/19/2015 8:11:09 AM [error] File input_2696_13 has wrong size: expected 56813, got 0
1575 Asteroids@home 3/19/2015 8:11:09 AM [error] Checksum or signature error for input_2696_13
1576 Asteroids@home 3/19/2015 8:11:09 AM [error] File input_2727_82 has wrong size: expected 63174, got 0
1577 Asteroids@home 3/19/2015 8:11:09 AM [error] Checksum or signature error for input_2727_82
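
With this many errors it helps to tally them instead of reading the log by eye. Below is a rough Python sketch that counts "wrong size" entries and separates files that arrived empty (got 0) from other size mismatches. The regular expression is written against the message text seen in this thread only; other client versions may word it differently, and the sample lines are copied from posts above.

```python
import re
from collections import Counter

# Matches e.g. "File input_2706_79 has wrong size: expected 68619, got 0"
WRONG_SIZE = re.compile(r"File (\S+) has wrong size: expected (\d+), got (\d+)")


def summarize(lines):
    """Count wrong-size errors, separating empty downloads from other mismatches."""
    counts = Counter()
    for line in lines:
        match = WRONG_SIZE.search(line)
        if match is None:
            continue  # e.g. the follow-up "Checksum or signature error" line
        got = int(match.group(3))
        counts["empty" if got == 0 else "mismatch"] += 1
    return counts


LOG = """\
1568 Asteroids@home 3/19/2015 8:11:09 AM [error] File input_2706_79 has wrong size: expected 68619, got 0
1569 Asteroids@home 3/19/2015 8:11:09 AM [error] Checksum or signature error for input_2706_79
1570 Asteroids@home 3/19/2015 8:11:09 AM [error] File input_2762_32 has wrong size: expected 62946, got 0
12/4/2014 8:31:16 PM | Asteroids@home | [error] File input_43869_10 has wrong size: expected 20889, got 20890
"""

print(summarize(LOG.splitlines()))
```

Files that arrive with zero bytes point to a different cause than files that are off by one byte, so keeping the two counts apart may be useful when reporting to the admin.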
ID: 4163 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote