|
21)
Message boards :
Number crunching :
OpenCL/GPU app?
(Message 579)
Posted 23 Mar 2012 by Greg Tucker
Post:
You can link userspace apps to GPL libraries and not encumber under the GPL. I'm all for posting the source though. |
|
22)
Message boards :
Number crunching :
abort?
(Message 567)
Posted 19 Mar 2012 by Greg Tucker
Post: Wiki updated for the confusion over units, though I don't think that's the real underlying problem. It was report_grace_period. I changed to grace_period_hours. |
|
23)
Message boards :
Number crunching :
FUBAR!
(Message 450)
Posted 5 Mar 2012 by Greg Tucker
Post: i only want to run Get Bounded Decics. Sorry about that. I think the weights were messed up again and confusing the server with fractional values. I just changed them to something that looks reasonable. It should be possible to just run the Get Bounded Decics app if you want to. |
|
24)
Message boards :
Number crunching :
ARIZONA - we got a problem..
(Message 444)
Posted 28 Feb 2012 by Greg Tucker
Post: I won't be getting home until very late tonight, so I decided to turn the bounded app back on. This time I set the weights to 100-to-1. We will see if that has any effect. OK, I restarted. |
|
25)
Message boards :
Number crunching :
ETA for project
(Message 433)
Posted 23 Jan 2012 by Greg Tucker
Post: Good day everybody. According to Eric we could potentially go on forever. Check out his response in the science topics. http://numberfields.asu.edu/NumberFields/forum_thread.php?id=5#8 |
|
26)
Message boards :
Number crunching :
Runnitme discrepancy
(Message 333)
Posted 13 Nov 2011 by Greg Tucker
Post: Hi, Do you have a workunit number for these? The boinc manager measures the runtime and calculates the requested credit but we also print some times that we could cross check against. |
|
27)
Message boards :
Number crunching :
FATAL: Kernel too old
(Message 330)
Posted 10 Nov 2011 by Greg Tucker
Post: I have 2.6.32. There shouldn't be any special kernel requirements for our project. |
|
28)
Message boards :
Number crunching :
Had to abort some WUs
(Message 320)
Posted 4 Nov 2011 by Greg Tucker
Post: Greg, Thanks, this looks like a reasonable thing to do. I'm a little confused by the documentation though. What would I put for <reliable_on_priority> to enable: 1? If so I will add the following. <reliable_on_priority>1</reliable_on_priority> <reliable_reduced_delay_bound>1.33</reliable_reduced_delay_bound> |
|
29)
Message boards :
Number crunching :
Had to abort some WUs
(Message 318)
Posted 3 Nov 2011 by Greg Tucker
Post:
I'll try to hold down the fort Eric. I'll look into it tonight. |
|
30)
Message boards :
Number crunching :
Had to abort some WUs
(Message 311)
Posted 2 Nov 2011 by Greg Tucker
Post: wu_12E10_SF161-1_Idx3_Grp57438of59290 Looks like this wu has some problems. http://numberfields.asu.edu/NumberFields/result.php?resultid=258591 [/url] |
|
31)
Message boards :
News :
Only Linux and Windows currently supported
(Message 307)
Posted 1 Nov 2011 by Greg Tucker
Post: Sorry about it taking so long. If I had login access to a mac with XCode installed it probably wouldn't take that long. I had an older version running fine at one time but my mac access went away. I'm about to give up on the VM method. Anyone know where I could get login access to a mac with XCode installed? |
|
32)
Message boards :
Number crunching :
Process got signal 11
(Message 219)
Posted 21 Sep 2011 by Greg Tucker
Post: I am having similar problems on my RHEL5 x64 hosts, We fixed the issue that was causing some Suse distros to fail. However, it looks like we are using a syscall that was added since kernel 2.6.18. I'm not sure which one or if we can avoid it. Looking through the kernel git log for the syscall table I think the last one was added in 2008. Perhaps you could run an strace on your RHEL5 system to see which syscall fails and I can look up when it was added. |
|
33)
Message boards :
Number crunching :
Massive drop of credits per CPU hour
(Message 206)
Posted 16 Sep 2011 by Greg Tucker
Post: Well I can't work out why my faster computer (by only 200 MHz, an AMD Phenom II 1100T @ 3.3 GHz, my other is AMD Phenom II 955 @ 3.2 GHz), is getting consistently much lower results than my slower machine. I really don't understand how the credit system works but I gather is is all based on these benchmarks that run initially. Perhaps there was something else running during the benchmark phase which is lowering the scores on your Phenom. I believe you can force the manager to re-run them. Can you give that a try before you bail on your AMD? |
|
34)
Message boards :
Number crunching :
Error in the PARI system
(Message 181)
Posted 11 Sep 2011 by Greg Tucker
Post: I'm getting PARI errors too. but we should have very low disk requirements. We don't have big data sets to scan like Einsteinand the work units are tiny. I don't see how we ever triggered that. |
|
35)
Message boards :
Number crunching :
Comp errors on 1.05 for Gentoo Linux
(Message 177)
Posted 7 Sep 2011 by Greg Tucker
Post: Thanks to the beta testers we confirmed the fix. It was the second and hopefully last bug we found in the pari source. Thanks again guys. |
|
36)
Message boards :
Number crunching :
Comp errors on 1.05 for Gentoo Linux
(Message 162)
Posted 6 Sep 2011 by Greg Tucker
Post: I don't think it has anything to do with lib versions on the client. I think something early is ill conditioned or relies on a failed syscall that leads to the exception. Any chance someone can run under a debugger? You would do #gdb <app name> then r for run. After the exception it should trap and give you a line number. That would be great since we have no way to reproduce. Any takers? This is helpful thanks. Looks like the same issue as Bok is having. We need to patch the pari source again and see if we can eliminate this outdated reference. Perhaps you can beta test for us when we do? |
|
37)
Message boards :
Number crunching :
Comp errors on 1.05 for Gentoo Linux
(Message 161)
Posted 6 Sep 2011 by Greg Tucker
Post: Sure thing. System has 16Gb Ram with almost all of it totally free right now. Bok, this is great. Just what we need. I think we know what the problem is from this. The pari library (a collection of source for numberfields math) is using an outdated version of getpwuid(). Some systems have a workaround but others don't deal with it so well. We need to patch it so that it doesn't call this function and tell the pari people. Thanks for the effort on this. You rock Bok! --Greg |
|
38)
Message boards :
Number crunching :
Comp errors on 1.05 for Gentoo Linux
(Message 158)
Posted 6 Sep 2011 by Greg Tucker
Post: Tried the debug-info command but it didn't find anything to install. Looks like the debuginfo repos for centos 6 are not updated yet.. Do I need a version compiled with a -g ? OK thanks. Looks to me like a memory issue. _int_free() I believe is in the malloc path. You already have symbol information so don't worry about adding more. If you could do a backtrace (type bt at the gdb prompt) after the fail this may confirm it. Is your system heavily loaded on memory? Perhaps you could add a vmstat output. Thanks for your help on this. |
|
39)
Message boards :
Number crunching :
Comp errors on 1.05 for Gentoo Linux
(Message 146)
Posted 6 Sep 2011 by Greg Tucker
Post: I don't think it has anything to do with lib versions on the client. I think something early is ill conditioned or relies on a failed syscall that leads to the exception. Any chance someone can run under a debugger? You would do #gdb <app name> then r for run. After the exception it should trap and give you a line number. That would be great since we have no way to reproduce. Any takers? |
|
40)
Message boards :
Number crunching :
one wu at a time
(Message 94)
Posted 31 Aug 2011 by Greg Tucker
Post: Well, it looks like we may have over-adjusted and some work queues were getting short. I put the depths back to the original values and we will look into finer adjustments later. |