Tasks stall while running
Author | Message |
---|---|
Joined: 28 Oct 11 Posts: 181 Credit: 272,518,663 RAC: 243,493 |
I'm seeing this as a very occasional problem. The task has been running for a very long time and is still using CPU cycles, but it isn't going anywhere. boinc_task_state.xml shows that the last checkpoint was a long time ago, too: `<checkpoint_cpu_time>1117.872000</checkpoint_cpu_time>` This is task 11409701, and it's being done with the windows_x86_64 version of the app. I suspended the task manually, without leaving it in memory - from previous experience, I'm expecting it to resume from the checkpoint values and finish normally. Edit - yes, it restarted while I was typing and has now validated. I should have said that some 40 tasks were processed normally and reported by the same machine while this task was stalled and occupying one core. |
Joined: 8 Jul 11 Posts: 1361 Credit: 594,274,517 RAC: 819,191 |
Very strange. I will run that same case on my test server and see if I can reproduce this. If it's not case specific, then this will be very hard to debug. Richard - In your experience do you think this is purely an app problem or could it be partly a client problem? |
Joined: 28 Oct 11 Posts: 181 Credit: 272,518,663 RAC: 243,493 |
I've only seen it once or twice before - I think I possibly aborted the first, and worked out the 'restart from checkpoint' procedure later. This was the first time I dug into it as far as the properties dialog and boinc_task_state. I've worked on a few client bugs recently, so I'm open-minded - but my gut instinct is that this is an application problem. Or, just possibly, the API code is missing an exception that could have been caught? I mentioned it in passing on the boinc_alpha list, so possibly David might take a look. If/when I catch it again, I'll maybe take a look with Process Explorer. But my guess is that your copy will sail straight through without a hitch, and we'll be none the wiser. |
Joined: 8 Jul 11 Posts: 1361 Credit: 594,274,517 RAC: 819,191 |
I've now run it 3 times on my 64-bit Windows machine with the exact same version of the executable, and I can't get it to happen. So whatever the issue is, it's going to be very difficult to debug. I usually run the GD app and never see this, so it might be something specific to the GBD app. I've been trying to debug my Android version of the apps, and for some reason the GBD app crashes with a random segfault about 75% of the time; the GD app does not. I'm starting to wonder if this is more than a coincidence. |
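
For anyone who wants to catch a stalled task without opening the properties dialog, the manual check described in the first post (compare the last checkpoint in boinc_task_state.xml against how much CPU time the task has used) could be sketched roughly as below. Only the `<checkpoint_cpu_time>` tag comes from the thread; the function names, the one-hour threshold, and the idea of passing in the current CPU time by hand are assumptions made for illustration, not part of BOINC.

```python
import re
from typing import Optional

# The tag name is the one quoted in the thread. A regex is used rather than an
# XML parser so the sketch makes no assumption about the rest of the file.
_CHECKPOINT_RE = re.compile(
    r"<checkpoint_cpu_time>\s*([0-9]+(?:\.[0-9]+)?)\s*</checkpoint_cpu_time>"
)


def checkpoint_cpu_time(state_xml: str) -> Optional[float]:
    """Return the last checkpointed CPU time in seconds, or None if absent."""
    match = _CHECKPOINT_RE.search(state_xml)
    return float(match.group(1)) if match else None


def looks_stalled(
    state_xml: str,
    current_cpu_time: float,
    max_gap_seconds: float = 3600.0,  # hypothetical threshold, tune per app
) -> bool:
    """Flag a task whose CPU time has run far ahead of its last checkpoint."""
    last = checkpoint_cpu_time(state_xml)
    if last is None:
        # No checkpoint recorded yet: nothing to compare against.
        return False
    return current_cpu_time - last > max_gap_seconds


# Fragment copied from the first post.
sample = "<checkpoint_cpu_time>1117.872000</checkpoint_cpu_time>"
```

Calling `looks_stalled(sample, current_cpu_time=20000.0)` would flag the task from the first post, assuming its CPU time had climbed that far; the current CPU time itself has to come from somewhere else, such as the task's properties dialog.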