32 bit linux - process got signal 11

Message boards : Number crunching : 32 bit linux - process got signal 11
Message board moderation

To post messages, you must log in.

AuthorMessage
ChertseyAl

Send message
Joined: 19 Aug 11
Posts: 45
Credit: 1,014,069
RAC: 0
Message 610 - Posted: 16 Apr 2012, 15:13:44 UTC

The 32 bit linux version isn't working for me, all tasks die with "process got signal 11". An example WU:

http://stat.la.asu.edu/NumberFields/result.php?resultid=904465


<core_client_version>5.10.45</core_client_version>
<![CDATA[
<message>
process got signal 11
</message>
<stderr_txt>

</stderr_txt>
]]>


Not enough of a linux expert to offer much help. Maybe I got the 64-bit verion or something?

Cheers,

Al.
ID: 610 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1341
Credit: 493,891,636
RAC: 558,817
Message 612 - Posted: 16 Apr 2012, 16:31:12 UTC - in response to Message 610.  

I just verified that you are definitely running the 32 bit version. There is 1 other user who is also getting compute errors, but it's a different error.

If you could, try running the GetDecics app and see if you still have the problem. There are multiple users who are returning successful 32bit results for that app. For the GetBoundedDecics app, my virtual machine is the only one that's returned a successful result so far (but that might just be a coincidence).
ID: 612 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1341
Credit: 493,891,636
RAC: 558,817
Message 613 - Posted: 16 Apr 2012, 17:48:46 UTC - in response to Message 610.  

The 32 bit linux version isn't working for me, all tasks die with "process got signal 11". An example WU:

http://stat.la.asu.edu/NumberFields/result.php?resultid=904465


5.10.45

process got signal 11




]]>


Not enough of a linux expert to offer much help. Maybe I got the 64-bit verion or something?

Cheers,

Al.


This link may give some hints:

http://boincfaq.mundayweb.com/index.php?view=459&language=1

So far you are the only one with the signal 11 problem, but there are also very few users running the 32bit linux version of BoundedDecics.
ID: 613 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ChertseyAl

Send message
Joined: 19 Aug 11
Posts: 45
Credit: 1,014,069
RAC: 0
Message 614 - Posted: 16 Apr 2012, 18:16:30 UTC - in response to Message 613.  

Yeah, I was aware that it was a segmentation error - I've seen numerous threads on other project fora about it ;)

Not worth me trying to run the unbounded app on such a slow machine :(

Hopefully someone else can try out the bounded app and have more success than me, or at least be able to debug it if it fails for them.

Cheers,

Al.
ID: 614 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1341
Credit: 493,891,636
RAC: 558,817
Message 615 - Posted: 16 Apr 2012, 20:08:58 UTC - in response to Message 614.  

Yeah, I was aware that it was a segmentation error - I've seen numerous threads on other project fora about it ;)

Not worth me trying to run the unbounded app on such a slow machine :(

Hopefully someone else can try out the bounded app and have more success than me, or at least be able to debug it if it fails for them.

Cheers,

Al.



There's only about 40 of the slow unbounded WUs left and they are all currently "in progress", so you don't have to worry about getting one of those. The other unbounded Wus are much more reasonable.
ID: 615 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 28 Oct 11
Posts: 180
Credit: 241,761,954
RAC: 143,546
Message 619 - Posted: 17 Apr 2012, 10:34:02 UTC - in response to Message 615.  

There's only about 40 of the slow unbounded WUs left and they are all currently "in progress", so you don't have to worry about getting one of those. The other unbounded Wus are much more reasonable.

Not an absolute certainty, I'm afraid - I got this resend overnight.

But if somebody with a slower machine happened, by some remote unlucky chance, to pick up a task named "wu_DS-14x121_...", I don't think anybody would be offended if you aborted it in the hope of picking up a shorter one next time - it would just go back into the resend pool and hopefully be picked up by a faster machine.

I've bumped mine to run next, in the hope we can get rid of these pesky blighters once and for all.
ID: 619 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1341
Credit: 493,891,636
RAC: 558,817
Message 621 - Posted: 17 Apr 2012, 21:31:35 UTC - in response to Message 619.  

Not an absolute certainty, I'm afraid - I got this resend overnight.

But if somebody with a slower machine happened, by some remote unlucky chance, to pick up a task named "wu_DS-14x121_...", I don't think anybody would be offended if you aborted it in the hope of picking up a shorter one next time - it would just go back into the resend pool and hopefully be picked up by a faster machine.

I've bumped mine to run next, in the hope we can get rid of these pesky blighters once and for all.


LOL. Yes, it will be nice once these pesky WUs are finally gone.

The accelerating retries mechanism was turned back on a couple weeks ago, and all the remaining pesky WUs have been issued to "reliable" hosts. So hopefully it won't be much longer.
ID: 621 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1341
Credit: 493,891,636
RAC: 558,817
Message 622 - Posted: 17 Apr 2012, 21:36:29 UTC - in response to Message 620.  

Well, it works fine on my Ubuntu computer.
I had a bit of a shock when my first work unit jumped to 100% after 3 seconds and I thought it had crashed, but it had simply finished correctly, and was a valid result!
I'll do a few more to make sure everything is working properly.


WUs that fast are not very common, but I do see them occasionally. Thanks for checking it out.
ID: 622 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1341
Credit: 493,891,636
RAC: 558,817
Message 624 - Posted: 17 Apr 2012, 22:17:09 UTC - in response to Message 614.  

Yeah, I was aware that it was a segmentation error - I've seen numerous threads on other project fora about it ;)

Not worth me trying to run the unbounded app on such a slow machine :(

Hopefully someone else can try out the bounded app and have more success than me, or at least be able to debug it if it fails for them.

Cheers,

Al.


Hey Al,

I just checked and there are now 2 users who have returned successful results for the GetBoundedDecics app. So far you are the only one getting the signal 11 error.

Just for the record, there is another user with this problem:
http://numberfields.asu.edu/NumberFields/forum_thread.php?id=66&nowrap=true#600
But this should be independent of the signal 11 error.

I also read that sometimes doing a project reset can fix this. You might give that a try.
ID: 624 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : 32 bit linux - process got signal 11


Main page · Your account · Message boards


Copyright © 2024 Arizona State University