21)
Message boards :
Number crunching :
outch!
(Message 603)
Posted 11 Apr 2012 by ![]() Post: Looks like the boinc client is bombing. Can you run under strace or gdb? |
22)
Message boards :
Number crunching :
OpenCL/GPU app?
(Message 579)
Posted 23 Mar 2012 by ![]() Post:
You can link userspace apps to GPL libraries and not encumber under the GPL. I'm all for posting the source though. |
23)
Message boards :
Number crunching :
abort?
(Message 567)
Posted 19 Mar 2012 by ![]() Post: Wiki updated for the confusion over units, though I don't think that's the real underlying problem. It was report_grace_period. I changed to grace_period_hours. |
24)
Message boards :
Number crunching :
FUBAR!
(Message 450)
Posted 5 Mar 2012 by ![]() Post: i only want to run Get Bounded Decics. Sorry about that. I think the weights were messed up again and confusing the server with fractional values. I just changed them to something that looks reasonable. It should be possible to just run the Get Bounded Decics app if you want to. |
25)
Message boards :
Number crunching :
ARIZONA - we got a problem..
(Message 444)
Posted 28 Feb 2012 by ![]() Post: I won't be getting home until very late tonight, so I decided to turn the bounded app back on. This time I set the weights to 100-to-1. We will see if that has any effect. OK, I restarted. |
26)
Message boards :
Number crunching :
ETA for project
(Message 433)
Posted 23 Jan 2012 by ![]() Post: Good day everybody. According to Eric we could potentially go on forever. Check out his response in the science topics. http://numberfields.asu.edu/NumberFields/forum_thread.php?id=5#8 |
27)
Message boards :
Number crunching :
Runnitme discrepancy
(Message 333)
Posted 13 Nov 2011 by ![]() Post: Hi, Do you have a workunit number for these? The boinc manager measures the runtime and calculates the requested credit but we also print some times that we could cross check against. |
28)
Message boards :
Number crunching :
FATAL: Kernel too old
(Message 330)
Posted 10 Nov 2011 by ![]() Post: I have 2.6.32. There shouldn't be any special kernel requirements for our project. |
29)
Message boards :
Number crunching :
Had to abort some WUs
(Message 320)
Posted 4 Nov 2011 by ![]() Post: Greg, Thanks, this looks like a reasonable thing to do. I'm a little confused by the documentation though. What would I put for <reliable_on_priority> to enable: 1? If so I will add the following. <reliable_on_priority>1</reliable_on_priority> <reliable_reduced_delay_bound>1.33</reliable_reduced_delay_bound> |
30)
Message boards :
Number crunching :
Had to abort some WUs
(Message 318)
Posted 3 Nov 2011 by ![]() Post:
I'll try to hold down the fort Eric. I'll look into it tonight. |
31)
Message boards :
Number crunching :
Had to abort some WUs
(Message 311)
Posted 2 Nov 2011 by ![]() Post: wu_12E10_SF161-1_Idx3_Grp57438of59290 Looks like this wu has some problems. http://numberfields.asu.edu/NumberFields/result.php?resultid=258591 [/url] |
32)
Message boards :
News :
Only Linux and Windows currently supported
(Message 307)
Posted 1 Nov 2011 by ![]() Post: Sorry about it taking so long. If I had login access to a mac with XCode installed it probably wouldn't take that long. I had an older version running fine at one time but my mac access went away. I'm about to give up on the VM method. Anyone know where I could get login access to a mac with XCode installed? |
33)
Message boards :
Number crunching :
Process got signal 11
(Message 219)
Posted 21 Sep 2011 by ![]() Post: I am having similar problems on my RHEL5 x64 hosts, We fixed the issue that was causing some Suse distros to fail. However, it looks like we are using a syscall that was added since kernel 2.6.18. I'm not sure which one or if we can avoid it. Looking through the kernel git log for the syscall table I think the last one was added in 2008. Perhaps you could run an strace on your RHEL5 system to see which syscall fails and I can look up when it was added. |
34)
Message boards :
Number crunching :
Massive drop of credits per CPU hour
(Message 206)
Posted 16 Sep 2011 by ![]() Post: Well I can't work out why my faster computer (by only 200 MHz, an AMD Phenom II 1100T @ 3.3 GHz, my other is AMD Phenom II 955 @ 3.2 GHz), is getting consistently much lower results than my slower machine. I really don't understand how the credit system works but I gather is is all based on these benchmarks that run initially. Perhaps there was something else running during the benchmark phase which is lowering the scores on your Phenom. I believe you can force the manager to re-run them. Can you give that a try before you bail on your AMD? |
35)
Message boards :
Number crunching :
Error in the PARI system
(Message 181)
Posted 11 Sep 2011 by ![]() Post: I'm getting PARI errors too. but we should have very low disk requirements. We don't have big data sets to scan like Einsteinand the work units are tiny. I don't see how we ever triggered that. |
36)
Message boards :
Number crunching :
Comp errors on 1.05 for Gentoo Linux
(Message 177)
Posted 7 Sep 2011 by ![]() Post: Thanks to the beta testers we confirmed the fix. It was the second and hopefully last bug we found in the pari source. Thanks again guys. |
37)
Message boards :
Number crunching :
Comp errors on 1.05 for Gentoo Linux
(Message 162)
Posted 6 Sep 2011 by ![]() Post: I don't think it has anything to do with lib versions on the client. I think something early is ill conditioned or relies on a failed syscall that leads to the exception. Any chance someone can run under a debugger? You would do #gdb <app name> then r for run. After the exception it should trap and give you a line number. That would be great since we have no way to reproduce. Any takers? This is helpful thanks. Looks like the same issue as Bok is having. We need to patch the pari source again and see if we can eliminate this outdated reference. Perhaps you can beta test for us when we do? |
38)
Message boards :
Number crunching :
Comp errors on 1.05 for Gentoo Linux
(Message 161)
Posted 6 Sep 2011 by ![]() Post: Sure thing. System has 16Gb Ram with almost all of it totally free right now. Bok, this is great. Just what we need. I think we know what the problem is from this. The pari library (a collection of source for numberfields math) is using an outdated version of getpwuid(). Some systems have a workaround but others don't deal with it so well. We need to patch it so that it doesn't call this function and tell the pari people. Thanks for the effort on this. You rock Bok! --Greg |
39)
Message boards :
Number crunching :
Comp errors on 1.05 for Gentoo Linux
(Message 158)
Posted 6 Sep 2011 by ![]() Post: Tried the debug-info command but it didn't find anything to install. Looks like the debuginfo repos for centos 6 are not updated yet.. Do I need a version compiled with a -g ? OK thanks. Looks to me like a memory issue. _int_free() I believe is in the malloc path. You already have symbol information so don't worry about adding more. If you could do a backtrace (type bt at the gdb prompt) after the fail this may confirm it. Is your system heavily loaded on memory? Perhaps you could add a vmstat output. Thanks for your help on this. |
40)
Message boards :
Number crunching :
Comp errors on 1.05 for Gentoo Linux
(Message 146)
Posted 6 Sep 2011 by ![]() Post: I don't think it has anything to do with lib versions on the client. I think something early is ill conditioned or relies on a failed syscall that leads to the exception. Any chance someone can run under a debugger? You would do #gdb <app name> then r for run. After the exception it should trap and give you a line number. That would be great since we have no way to reproduce. Any takers? |