Message boards :
News :
GPU app - beta version for linux nvidia
Message board moderation
Previous · 1 · 2 · 3 · 4 · 5 · 6 · Next
Author | Message |
---|---|
Send message Joined: 8 Jul 11 Posts: 1344 Credit: 530,216,564 RAC: 559,997 |
Would it be possible for the Server Status page to separate the CPU/GPU WU counts out? So we can tell if there are GPU WU's available? It is probably out of date, but my version of the server status page only does apps, not app versions. But I'm sure I can modify it to do that. I will add that to my list. In the meantime, I have been watching it to make sure it doesn't run dry, so no need to worry about that. |
Send message Joined: 8 Jul 11 Posts: 1344 Credit: 530,216,564 RAC: 559,997 |
^I have seen the same behaviour on a 2-GPU system. It shows one task for each device, but it's actually crunching both on a single card while the other one is left idle. I wonder if I should be calling a function to set the device. I vaguely remember seeing something about that, but I completely forgot to follow up on it. Having only a single GPU, I was not perceptive to this "bug". |
Send message Joined: 25 Oct 18 Posts: 15 Credit: 112,744,248 RAC: 485 |
@lakewik which command did you use to see processes running on each GPU please? |
Send message Joined: 13 Mar 19 Posts: 10 Credit: 35,453,876 RAC: 19,711 |
@lakewik which command did you use to see processes running on each GPU please? nvidia-smi If you want realtime data: watch -n 1 nvidia-smi |
Send message Joined: 25 Oct 18 Posts: 15 Credit: 112,744,248 RAC: 485 |
Thank you Yes, confirms that multiple WU were running on a single GPU rather than one per GPU |
Send message Joined: 28 Oct 11 Posts: 180 Credit: 252,529,025 RAC: 182,120 |
I wonder if I should be calling a function to set the device. I vaguely remember seeing something about that, but I completely forgot to follow up on it. Having only a single GPU, I was not perceptive to this "bug".Yes, you should. https://boinc.berkeley.edu/trac/wiki/AppCoprocessor#Deviceselection Concentrate on boinc_get_init_data() - the older command line --device N is so old it can be relegated to an afterthought. |
Send message Joined: 8 Jul 11 Posts: 1344 Credit: 530,216,564 RAC: 559,997 |
I just put a new app version out there that should take care of the device selection bug. It is version 3.01 The only problem is there are 15k WUs already out there associated with version 3.00. I created another 10k with no version association. Still deciding what to do with the other 15k... |
Send message Joined: 8 Jul 11 Posts: 1344 Credit: 530,216,564 RAC: 559,997 |
I wonder if I should be calling a function to set the device. I vaguely remember seeing something about that, but I completely forgot to follow up on it. Having only a single GPU, I was not perceptive to this "bug".Yes, you should. It turns out the --device N command line option was helpful for debugging. When running in stand-alone mode there is no init data, so that helped me to get a device number into the code for testing. |
Send message Joined: 8 Jul 11 Posts: 1344 Credit: 530,216,564 RAC: 559,997 |
I deprecated the 3.00 app version so that my client would pick up the newer one. Maybe someone knows another way but this seemed to do the trick. Please test if it works as expected with multiple GPUs. Assuming it works, those who have multiple GPUs may want to abort the old WUs. |
Send message Joined: 25 Oct 18 Posts: 15 Credit: 112,744,248 RAC: 485 |
Thanks Eric - I have quite a few of the original WU to work my way through but can do this now that I know what is happening. If it looks like I'll miss the deadline, I'll abort some of them. Thanks for fixing this so quickly |
Send message Joined: 1 Feb 17 Posts: 23 Credit: 62,005,857 RAC: 2,459 |
Yesterday was triple the output of the Formula BOINC Sprint's best day. |
Send message Joined: 20 May 18 Posts: 6 Credit: 165,471,630 RAC: 0 |
I just put a new app version out there that should take care of the device selection bug. It is version 3.01 Works, both GPUs are now actually being used. |
Send message Joined: 8 Jul 11 Posts: 1344 Credit: 530,216,564 RAC: 559,997 |
I just put a new app version out there that should take care of the device selection bug. It is version 3.01 Good to hear! |
Send message Joined: 8 Jul 11 Posts: 1344 Credit: 530,216,564 RAC: 559,997 |
Yesterday was triple the output of the Formula BOINC Sprint's best day. Unfortunately, I don't think those numbers are accurate, just based on the numbers of results coming back. It's correlated to the obscenely high GPU credits being awarded. Maybe it's time I change to "Credit New", from the current "Credit based on runtime"? |
Send message Joined: 1 Feb 17 Posts: 23 Credit: 62,005,857 RAC: 2,459 |
Yesterday was triple the output of the Formula BOINC Sprint's best day. I can only see credit, not # of results, so that's what I mean by output. CreditNew provides lower credit for CPUs than the current system here so those CPU credits will be lowered too. But something will have to be done. |
Send message Joined: 25 Oct 18 Posts: 15 Credit: 112,744,248 RAC: 485 |
Why? (asked in all seriousness) Presumably the GPU app advances the project much quicker so has to pay the going rate to attract GPU owners to crunch them. By my rough reckoning, the new WU are paying approx double what GPUGrid is paying but less than half of what Collatz pays. |
Send message Joined: 7 Jan 16 Posts: 1 Credit: 42,301,682 RAC: 3,274 |
I deprecated the 3.00 app version so that my client would pick up the newer one. Maybe someone knows another way but this seemed to do the trick. Works fine for me now. WUs are now distributed over multiple GPUs in contrast to yesterdays behaviour. Thank you! |
Send message Joined: 18 Aug 13 Posts: 2 Credit: 1,003,792 RAC: 0 |
Only errors here. GTX645; newest drivers 418 Anyone can help? PG PPS sieve works fine. |
Send message Joined: 23 Aug 11 Posts: 5 Credit: 84,950,248 RAC: 0 |
Yesterday was triple the output of the Formula BOINC Sprint's best day. I do believe you need to change the credit system. On my GPUs it takes running about 3 WUs to fully load the GPU. If I run 6 WUs per GPU I get twice the credit, but the amount of work done is probably about the same. I only ran 6 WUs per GPU on 1 GPU for a couple of hours. I started to run other ones for a few minutes, but then backed off to 3 since it doesn't really seem fair. Thanks for the great job. Things do run really well. |
Send message Joined: 8 Jul 11 Posts: 1344 Credit: 530,216,564 RAC: 559,997 |
Only errors here. What version cuda do you have? If its pre 10.1 that could be the problem. I might have to update the plan_class to reflect the cuda version. |