Extra long WUs
Author | Message |
---|---|
Joined: 8 Jul 11 Posts: 1341 Credit: 494,193,741 RAC: 559,842 |
As some of you may be aware from the message boards, there are a small number of extra long WUs (about 6 per data set). These will eventually be fixed in the WU generator, but until then I am manually deleting them when I come across them. If you happen to be crunching one of these when I pull the plug, please let me know and I will manually grant you credit for the lost work. I can't think of a better way to handle these troublesome WUs. They are like a hot potato that keeps getting passed from one user to another. Most results fail with "no reply"; others are "aborted via the GUI" (probably because users think they are stuck). With a 10-day grace period and 8 failures before the server cancels the WU, it takes a long time for these WUs to disappear on their own. Note that these long WUs will finish normally within 2 or 3 days, so it doesn't hurt to let them continue if you think you may have one. It takes a history of "no replies" before I become aware of one, so the first few users who get a long one are safe from having it cancelled. |
Joined: 12 Oct 13 Posts: 17 Credit: 39,662,588 RAC: 4,663 |
I have no issue with long WUs. I let them finish. The project appears to be moving along swimmingly. Cheers. |
Joined: 5 Jan 13 Posts: 43 Credit: 41,024,048 RAC: 1,138 |
It looks like I have one: http://numberfields.asu.edu/NumberFields/workunit.php?wuid=11936859 Do you mean it will be finished in 2-3 days? Thanks. |
Joined: 8 Jul 11 Posts: 1341 Credit: 494,193,741 RAC: 559,842 |
"It looks like I have one:" Yes, it should finish within a few days. Obviously it will depend on how fast your machine is. I will add it to my watch list. Thanks! |
Joined: 5 Jan 13 Posts: 43 Credit: 41,024,048 RAC: 1,138 |
Is it possible to extend my task's deadline by 2-3 days? It has taken 75 hours on my machine so far and 91 percent has already been completed. The task got stuck twice: at 9% and at 91%. And I am stuck again. Thanks. |
Joined: 28 Oct 11 Posts: 180 Credit: 241,855,319 RAC: 145,314 |
My wu_12E10_SF53-0_Idx9_Grp75512of81406 is in much the same position. It's been running for 8.75 days, and is due to reach even the extended deadline (grace period) at 11:45 tomorrow morning. Like Vitaly's, it moved very slowly from 0 to ~9%, and from ~90% onwards (exact transition points not observed). But it's now reached 95.796%, and I don't think it's ever stopped completely. Barring disasters, I see no reason why it shouldn't finish here, and it would be nice to supply one of the last half-dozen for this subfield. But it would hardly be worth sending out the next replication which would be automatically generated tomorrow, only to cancel it again later. |
Joined: 8 Jul 11 Posts: 1341 Credit: 494,193,741 RAC: 559,842 |
"Is it possible to prolong my task for 2-3 days?" It looks like you have until Dec 22nd, so I think you should be good. But let me know if it starts to go beyond the 22nd. Thanks! |
Joined: 8 Jul 11 Posts: 1341 Credit: 494,193,741 RAC: 559,842 |
"My wu_12E10_SF53-0_Idx9_Grp75512of81406 is in much the same position. It's been running for 8.75 days, and is due to reach even the extended deadline (grace period) at 11:45 tomorrow morning." I believe that if I cancel that WU you will end up not getting credit, and I am not sure whether the final result still gets returned to the server once the WU has been cancelled. Since I rarely cancel WUs, I really don't know how the server behaves under these circumstances. If anyone has ideas, I am open to suggestions. |
Joined: 5 Jan 13 Posts: 43 Credit: 41,024,048 RAC: 1,138 |
This is strange, but my machine shows that I have to return the task by December 19: http://prntscr.com/9fk6vf There is a 3-day gap. I have this problem with all NumberFields tasks. I hope I really have until December 22. |
Joined: 8 Jul 11 Posts: 1341 Credit: 494,193,741 RAC: 559,842 |
"This is strange, but my machine shows that I have to return the task by December 19:" Here's my link to the result: http://numberfields.asu.edu/NumberFields/result.php?resultid=12927950 Maybe the difference is the 3-day grace period. Whatever the cause, don't worry about it. I doubt anyone will be able to get it and return it before you. And if they do, just let me know and I will manually give you credit. |
Joined: 28 Oct 11 Posts: 180 Credit: 241,855,319 RAC: 145,314 |
Well, mine didn't make it within the grace period, although it's making progress and seems to have speeded up this morning - now at 96.046%. But a new copy has gone out anyway - WU 11855997 - and the new holder of the hot (cold?) potato is anonymous. Strange computer - 16-CPU Mac with an ancient GPU. |
Joined: 28 Oct 11 Posts: 180 Credit: 241,855,319 RAC: 145,314 |
The long one is back and validated. Looking at the anonymous new wingmate, I suggest it might be safe to cancel that task - I think (s)he might have shut down for the weekend, or even the holiday - plenty of tasks in progress, and no contact since yesterday. I doubt they've even started it yet. |
Joined: 8 Jul 11 Posts: 1341 Credit: 494,193,741 RAC: 559,842 |
"The long one is back and validated. Looking at the anonymous new wingmate, I suggest it might be safe to cancel that task - I think (s)he might have shut down for the weekend, or even the holiday - plenty of tasks in progress, and no contact since yesterday. I doubt they've even started it yet." Great! Thanks Richard. It looks like it capped your credit since it was so long - I will fix that in the morning. |
Joined: 5 Jan 13 Posts: 43 Credit: 41,024,048 RAC: 1,138 |
I wonder what will happen to that cancelled task. Does it mean that it will not be calculated at all? Thanks. |
Joined: 8 Jul 11 Posts: 1341 Credit: 494,193,741 RAC: 559,842 |
"I wonder what will happen to that cancelled task." It turns out that a little while after Richard returned his result, the newer host contacted the server, at which point the server automatically cancelled that task since it hadn't been started yet. So it all worked out well. |
Joined: 28 Oct 11 Posts: 180 Credit: 241,855,319 RAC: 145,314 |
"I will fix that in the morning." Wow - thanks Eric. I was needing a new toaster. |
Joined: 11 Sep 12 Posts: 1 Credit: 40,723,221 RAC: 0 |
Seems I got two of these long-running WUs: http://numberfields.asu.edu/NumberFields/workunit.php?wuid=12420118 has been running for 683 hours and is at ~51%, and http://numberfields.asu.edu/NumberFields/workunit.php?wuid=12314856 has been running for 775 hours and is at ~43%. I'll keep crunching, maybe they'll finish one day... |
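
For anyone wondering why these WUs linger so long, the figures from the first post (a 10-day window and 8 failures before the server gives up) can be turned into a rough upper bound. This is only a back-of-the-envelope sketch, not how the server actually schedules anything: it assumes the 10 days is the full per-copy window (deadline plus grace), that every copy ends in "no reply", and that each new copy goes out only after the previous one times out. The function names are made up for illustration. The second function does a naive linear extrapolation from elapsed time and percent done; as noted earlier in the thread, progress on these WUs is not linear (slow below ~9% and above ~90%), so treat that number as a guess at best.

```python
def worst_case_days(window_days: float, max_failures: int) -> float:
    """Upper bound on how long a WU lingers if every copy times out in turn.

    Assumes copies are issued one after another and each uses its full window.
    """
    return window_days * max_failures


def naive_hours_remaining(elapsed_hours: float, fraction_done: float) -> float:
    """Linear extrapolation of remaining run time from progress so far.

    Assumes a constant progress rate, which these WUs do not really have.
    """
    if not 0 < fraction_done <= 1:
        raise ValueError("fraction_done must be in (0, 1]")
    return elapsed_hours * (1 - fraction_done) / fraction_done


if __name__ == "__main__":
    # Figures from the first post: 10-day window, 8 failures before cancellation.
    print(f"worst case: {worst_case_days(10, 8):.0f} days")
    # Figures from the last post: 683 h at ~51% and 775 h at ~43%.
    for hours, frac in [(683, 0.51), (775, 0.43)]:
        print(f"{hours} h at {frac:.0%}: ~{naive_hours_remaining(hours, frac):.0f} h left")
```

Results that are "aborted via the GUI" come back sooner than a timeout, so the real lifetime is usually shorter than this bound.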