Message boards :
Number crunching :
Long running wu_Qsqrt421_DS1x5 units - how long to let them run?
Message board moderation
Previous · 1 · 2 · 3 · 4 · 5 · Next
Author | Message |
---|---|
Send message Joined: 12 Jul 12 Posts: 9 Credit: 10,000,929 RAC: 0 |
Found one task on my server, http://numberfields.asu.edu/NumberFields/result.php?resultid=13633161 It could be worse. zombie finished it the day after you did. He got it before you and finished it after. His machine spent 1.5 million seconds working in. My two active ones are still running. Both in the mid 50's of completion with 25 days of runtime. Both have two wingmen. One of my wingmen aborted his after 1.5 million seconds. |
Send message Joined: 25 Feb 13 Posts: 216 Credit: 9,899,302 RAC: 0 |
Found one task on my server, http://numberfields.asu.edu/NumberFields/result.php?resultid=13633161 You mean http://numberfields.asu.edu/NumberFields/workunit.php?wuid=12351121 and http://numberfields.asu.edu/NumberFields/workunit.php?wuid=12350747. Over 25 days? Which you got luck by completing them. Bounded tasks are not available, I´ll switch to decic tasks right now. EDIT: I haveone task running on my server. 369 hours runtime, http://numberfields.asu.edu/NumberFields/workunit.php?wuid=12310752 |
Send message Joined: 28 Oct 11 Posts: 180 Credit: 257,294,379 RAC: 194,567 |
He's lucky. I'm celebrating, because my Christmas Eve task (WU 12347065) - wu_Qsqrt421_DS1x5_CV2_S815_N2_-55_N1_-518to462 - has just reached positive territory. N1 = 5. 26 days, 53.346% progress - and counting. (pssst - I think it might be starting to speed up) |
Send message Joined: 28 Oct 11 Posts: 180 Credit: 257,294,379 RAC: 194,567 |
WU 12347065 is back - 2,410,576.86 seconds! |
Send message Joined: 12 Jul 12 Posts: 9 Credit: 10,000,929 RAC: 0 |
I just had a long one finish so here is my current status on i7-2700k/stock wu_Qsqrt421_DS1x5_CV2_S815_N2_-61_N1_-613to551 53% after 28d03h. Two Wingmen running. wu_Qsqrt421_DS1x5_CV2_S815_N2_-63_N1_-645to581 finished after 27 days 22 hours 2 min 16 sec for 114,280.69 credits. 1 wingman running. 1 wingman aborted after a million seconds. wu_Qsqrt421_DS1x5_CV2_S815_N2_-72_N1_-775to702 finished after 15 days 20 hours 16 min 47 sec for 64,859.97 credits. Wingman finished after 11 days 7 hours 32 min 15 sec for no credit. wu_Qsqrt421_DS1x8_CV1_S815_N2_-72_N1_-8018to-3291_0 finished after 7 days 18 hours 51 min 2 sec for 31,743.02 credits. No wingman. wu_Qsqrt421_DS1x8_CV1_S815_N2_-73_N1_-8020to-3290_0 finished after 8 days 7 hours 11 min 27 sec. No longer in system. No wingman. I am currently set at "no new tasks" until I can look up the config setting to limit my machines to 2 work units. I can absorb a couple with incorrect estimates but having it too many would confuse boinc too much. |
Send message Joined: 12 Jul 12 Posts: 9 Credit: 10,000,929 RAC: 0 |
All done! Current status on i7-2700k/stock wu_Qsqrt421_DS1x5_CV2_S815_N2_-61_N1_-613to551 finished after 31 days 12 hours 4 min 37 sec for 128,955.42 credits. Two Wingmen running. wu_Qsqrt421_DS1x5_CV2_S815_N2_-63_N1_-645to581 finished after 27 days 22 hours 2 min 16 sec for 114,280.69 credits. 1 wingman running. 1 wingman aborted after a million seconds. wu_Qsqrt421_DS1x5_CV2_S815_N2_-72_N1_-775to702 finished after 15 days 20 hours 16 min 47 sec for 64,859.97 credits. Wingman finished after 11 days 7 hours 32 min 15 sec for no credit. wu_Qsqrt421_DS1x8_CV1_S815_N2_-72_N1_-8018to-3291_0 finished after 7 days 18 hours 51 min 2 sec for 31,743.02 credits. No longer in system. wu_Qsqrt421_DS1x8_CV1_S815_N2_-73_N1_-8020to-3290_0 finished after 8 days 7 hours 11 min 27 sec. No longer in system. How do the "DS1x5" work units map onto http://numberfields.asu.edu/NumberFields/batch_status.html? |
Send message Joined: 28 Oct 11 Posts: 180 Credit: 257,294,379 RAC: 194,567 |
I've still got three actively in play: wu_Qsqrt421_DS1x8_CV1_S815_N2_-115_N1_-8062to-3289 wu_Qsqrt421_DS1x8_CV1_S815_N2_-92_N1_-8049to-3280 wu_Qsqrt421_DS1x8_CV1_S815_N2_-117_N1_-8062to-3291 |
Send message Joined: 8 Jul 11 Posts: 1346 Credit: 545,365,516 RAC: 628,706 |
All done! Thanks for the update! I'm still getting caught up after my 2 week hiatus, so haven't had a chance yet to reassess the status of this special search. Right now, there is no mapping to the batch status. Originally it was hoped this search would be relatively quick and would not need a status table, but I guess that is not the case. Hopefully I will have some time this weekend to add such a table. |
Send message Joined: 10 Dec 12 Posts: 5 Credit: 22,083,545 RAC: 0 |
The pattern on these WUs I've noticed is that they speed up at around 62-63%, once they've hit that percentage they using complete within 24hrs. |
Send message Joined: 5 Nov 11 Posts: 1 Credit: 5,827,360 RAC: 0 |
Hi. Wu: http://numberfields.asu.edu/NumberFields/workunit.php?wuid=12731827 is up and running. Now 400 hours and it is going to end. |
Send message Joined: 10 Dec 12 Posts: 5 Credit: 22,083,545 RAC: 0 |
http://numberfields.asu.edu/NumberFields//workunit.php?wuid=12351452 I have just completed this Work Unit, after 2,366,625 secs (27.39 days) I get: "Completed, too late to validate" and zero credit :( It ran 24/7 since the day I got it. Edit: Is it because one above had already been completed that I got no credit? Because I've now notice that so has this one http://numberfields.asu.edu/NumberFields//workunit.php?wuid=12348078 that I'm currently 27 days into crunching, shall I just abort it as it's already been completed? shame about my 27 days wasted on it however :( |
Send message Joined: 8 Jul 11 Posts: 1346 Credit: 545,365,516 RAC: 628,706 |
http://numberfields.asu.edu/NumberFields//workunit.php?wuid=12351452 No worries. I went ahead and granted you the canonical credit on both WUs. You can abort the one still running; it sounds like it should be close to finishing but no reason to waste the cpu cycles. To answer your question... yes, someone returned the WU before you. It looks like it had originally timed out for them, the WU was reissued to you, and then they returned it after you had started it. When that happens, you won't get credit unless you return it within the grace period (which is hard to do with these really long WUs). |
Send message Joined: 10 Dec 12 Posts: 5 Credit: 22,083,545 RAC: 0 |
http://numberfields.asu.edu/NumberFields//workunit.php?wuid=12351452 Thank you very much. I have found another one http://numberfields.asu.edu/NumberFields/workunit.php?wuid=12404933 which someone else has already completed, I'm 2 weeks into this but if the grace period is 10 days after the expiry I should be OK. I still have another 3 long ones, 2 at around 30 days, one at 40 days, so far no one else has completed them but I'll give you a shout if they do if that's OK. |
Send message Joined: 8 Jul 11 Posts: 1346 Credit: 545,365,516 RAC: 628,706 |
http://numberfields.asu.edu/NumberFields//workunit.php?wuid=12351452 Sounds good to me. Thanks! |
Send message Joined: 12 Aug 12 Posts: 7 Credit: 20,464,039 RAC: 0 |
It has recently come to my attention that the Qsqrt421 cases suffer from the same problem that the Bounded app did a couple weeks ago. I am currently looking into a similar fix for these WUs. I have 2 of these WUs running at the moment. One at 271 hours and the other at 385 hours. Leave them run? http://numberfields.asu.edu/NumberFields/workunit.php?wuid=12420089 http://numberfields.asu.edu/NumberFields/workunit.php?wuid=12404866 Thanks/Ed |
Send message Joined: 8 Jul 11 Posts: 1346 Credit: 545,365,516 RAC: 628,706 |
It has recently come to my attention that the Qsqrt421 cases suffer from the same problem that the Bounded app did a couple weeks ago. I am currently looking into a similar fix for these WUs. No one else has returned them yet, so I would say let them continue, especially given the amount of time you have already spent on them. Thanks! |
Send message Joined: 20 Dec 14 Posts: 17 Credit: 12,153,123 RAC: 0 |
Someone has returned a result that was validated on the first of the two work units, so you might just as well abort it and crunch some work unit that has not yet been solved. The other one has not been solved yet, so you can continue on that one. |
Send message Joined: 8 Jul 11 Posts: 1346 Credit: 545,365,516 RAC: 628,706 |
Someone has returned a result that was validated on the first of the two work units, so you might just as well abort it and crunch some work unit that has not yet been solved. The other one has not been solved yet, so you can continue on that one. Thanks for catching that Jesse. Ed - you can go ahead and abort that WU. I gave you the canonical credit for it. Thanks! |
Send message Joined: 12 Aug 12 Posts: 7 Credit: 20,464,039 RAC: 0 |
Someone has returned a result that was validated on the first of the two work units, so you might just as well abort it and crunch some work unit that has not yet been solved. The other one has not been solved yet, so you can continue on that one. Thanks! The second one continues to run, now at 509 hours. |
Send message Joined: 3 Sep 12 Posts: 2 Credit: 16,239,835 RAC: 0 |
After 44 days my last long running wu (task #13680649) ended up in a computation error. :-( On the day before I saw the wu stopping for a few times without any reason. |