GPU app - beta version for linux nvidia

Author	Message
PecosRiverM Send message Joined: 13 Mar 19 Posts: 2 Credit: 220,432,972 RAC: 0	Message 2214 - Posted: 23 Mar 2019, 18:39:45 UTC - in response to Message 2213. Na just keep it linux (I need to catch up to pututu and some of the others running only win products) ;-) ID: 2214 · Rating: 0 · rate: / Reply Quote

Tomas Brada Send message Joined: 16 Feb 17 Posts: 2 Credit: 671,890 RAC: 0	Message 2215 - Posted: 23 Mar 2019, 18:44:45 UTC I know that this is built with CUDA. If you start working on OpenCL, please know that OpenCL ICD relies on dynamic libraries. It is double edged sword for gpu apps. While you eliminate dependency on specific system libraries, the OpenCL icd on system may depend on version other than what you statically lined in. I know that versioned symbols of libc is supposed to prevent such conflicts, it did not for me on another project. I had to recompile to get rid of sigsegv. Good luck. And thanks for working on accelerated apps! ID: 2215 · Rating: 0 · rate: / Reply Quote

Stiwi Send message Joined: 13 Mar 19 Posts: 10 Credit: 39,759,940 RAC: 2,782	Message 2216 - Posted: 23 Mar 2019, 21:54:39 UTC Thx for the GPU App. For me it works fine so far. 75 valid tasks 0 invalid :) ID: 2216 · Rating: 0 · rate: / Reply Quote

mmonnin Send message Joined: 1 Feb 17 Posts: 23 Credit: 62,263,152 RAC: 729	Message 2217 - Posted: 23 Mar 2019, 23:25:18 UTC Just low to mid 70% GPU util so I'm running 2x. Now mid 90%. <app_config> <app> <name>GetDecics</name> <gpu_versions> <gpu_usage>0.5</gpu_usage> <cpu_usage>0.1</cpu_usage> </gpu_versions> </app> </app_config> ID: 2217 · Rating: 0 · rate: / Reply Quote

Eric Driver Project administrator Project developer Project tester Project scientist Send message Joined: 8 Jul 11 Posts: 1388 Credit: 696,162,292 RAC: 829,239	Message 2218 - Posted: 24 Mar 2019, 0:50:48 UTC - in response to Message 2217. Just low to mid 70% GPU util so I'm running 2x. Now mid 90%. <app_config> <app> <name>GetDecics</name> <gpu_versions> <gpu_usage>0.5</gpu_usage> <cpu_usage>0.1</cpu_usage> </gpu_versions> </app> </app_config> Interesting... I forgot you could control those things through the client. I just did the same thing. GPU is running a little hotter and my browser is a little sluggish... I might have to go back to running 1x. So is 70% considered low? Should I try to optimize the utilization? During development, having too many blocks or threads per block caused crashes due to resource problems. I eventually settled on some values that seemed to work well, but maybe they are not optimal. But I am pressing on with the OpenCL version first; I will worry about optimization later. ID: 2218 · Rating: 0 · rate: / Reply Quote

mmonnin Send message Joined: 1 Feb 17 Posts: 23 Credit: 62,263,152 RAC: 729	Message 2219 - Posted: 24 Mar 2019, 1:57:49 UTC - in response to Message 2218. Last modified: 24 Mar 2019, 2:17:09 UTC Just low to mid 70% GPU util so I'm running 2x. Now mid 90%. <app_config> <app> <name>GetDecics</name> <gpu_versions> <gpu_usage>0.5</gpu_usage> <cpu_usage>0.1</cpu_usage> </gpu_versions> </app> </app_config> Interesting... I forgot you could control those things through the client. I just did the same thing. GPU is running a little hotter and my browser is a little sluggish... I might have to go back to running 1x. So is 70% considered low? Should I try to optimize the utilization? During development, having too many blocks or threads per block caused crashes due to resource problems. I eventually settled on some values that seemed to work well, but maybe they are not optimal. But I am pressing on with the OpenCL version first; I will worry about optimization later. Hotter is to be expected. My Ti is in a dedicated system so slow browsing won't bother me. I'd consider it low, so I added another task even at the expense of another lost CPU thread feeding the GPU exe. Some other projects have options the user can configure, I'm guessing something like the blocks and threads per block that you mentioned. Collatz, Amicable and even SETI's user apps can have custom settings that can push GPUs harder. Amicable has a kernel size setting on BOINC's user preferences. Collatz and SETI can look at text files in the project folder. Memory usage in Nvidia X server is showing about 1.5GB with 2 tasks so plenty of resources left. 567mb per task. 0-1% PCI-E usage. Just 115-118 watts via nvidia-smi on a 1080Ti. Same OC uses 185-195 watts with E@H. ID: 2219 · Rating: 0 · rate: / Reply Quote

davidBAM Send message Joined: 25 Oct 18 Posts: 15 Credit: 112,744,248 RAC: 0	Message 2220 - Posted: 24 Mar 2019, 3:18:44 UTC Anyone know what's going on at Formula Boinc please? They don't seem to be including points from the new GPU app :-( ID: 2220 · Rating: 0 · rate: / Reply Quote

mmonnin Send message Joined: 1 Feb 17 Posts: 23 Credit: 62,263,152 RAC: 729	Message 2221 - Posted: 24 Mar 2019, 4:13:18 UTC There is no way to separate it without reading every users tasks, which is impossible with hidden computers. ID: 2221 · Rating: 0 · rate: / Reply Quote

davidBAM Send message Joined: 25 Oct 18 Posts: 15 Credit: 112,744,248 RAC: 0	Message 2222 - Posted: 24 Mar 2019, 4:18:07 UTC - in response to Message 2221. Last modified: 24 Mar 2019, 4:41:52 UTC Okay ta - maybe it is just taking a while. Nothing has updated yet in fact Formula Boinc, Boincstats and even my new badge on project site all seem to be 24hrs behind Are points from the GPU app exported please Eric? ID: 2222 · Rating: 0 · rate: / Reply Quote

mmonnin Send message Joined: 1 Feb 17 Posts: 23 Credit: 62,263,152 RAC: 729	Message 2223 - Posted: 24 Mar 2019, 4:34:32 UTC There is only once per day external update to FreeDC/BOINCSats but FB can read the team points every hour which worked for the Sprint. Marathon may just look once per day. ID: 2223 · Rating: 0 · rate: / Reply Quote

Eric Driver Project administrator Project developer Project tester Project scientist Send message Joined: 8 Jul 11 Posts: 1388 Credit: 696,162,292 RAC: 829,239	Message 2224 - Posted: 24 Mar 2019, 6:32:04 UTC - in response to Message 2222. Okay ta - maybe it is just taking a while. Nothing has updated yet in fact Formula Boinc, Boincstats and even my new badge on project site all seem to be 24hrs behind Are points from the GPU app exported please Eric? I didn't think the GPU credits were treated any different during export. Looking at it a little further, the update_stats is a periodic task run once a day and according to time stamps that occurs at about 6:00 UTC (just a little while ago). So I think that explains the delay. I also grepped for your name and you are definitely in the export file. ID: 2224 · Rating: 0 · rate: / Reply Quote

davidBAM Send message Joined: 25 Oct 18 Posts: 15 Credit: 112,744,248 RAC: 0	Message 2225 - Posted: 24 Mar 2019, 8:14:17 UTC All sorted now. Thanks Eric. If it sounds a little mercenary, the points are the only thing we get for all the money spent on hardware and electricity. Oh wait, make that points plus a warm furry feeling that we are helping to advance science. ID: 2225 · Rating: 0 · rate: / Reply Quote

Eric Driver Project administrator Project developer Project tester Project scientist Send message Joined: 8 Jul 11 Posts: 1388 Credit: 696,162,292 RAC: 829,239	Message 2226 - Posted: 24 Mar 2019, 16:07:14 UTC I just wanted to thank everyone for your comments, suggestions, etc. It's all very helpful. If I haven't responded enough, it's only because I've been too busy with life's other responsibilities (and learning about OpenCL), but know that I am definitely reading and considering all your suggestions. ID: 2226 · Rating: 0 · rate: / Reply Quote

Eric Driver Project administrator Project developer Project tester Project scientist Send message Joined: 8 Jul 11 Posts: 1388 Credit: 696,162,292 RAC: 829,239	Message 2227 - Posted: 24 Mar 2019, 16:21:45 UTC - in response to Message 2205. Hello :) I waited a long time for GPU app. Great job! I don't getting any GPU tasks. I have drivers version 418.39. Linux 4.13.16-hiveos. My machine: https://numberfields.asu.edu/NumberFields/show_host_detail.php?hostid=1465179 5: 23-Mar-2019 10:04:14 (low) [] CUDA: NVIDIA GPU 0: GeForce GTX 1060 3GB (driver version 418.39, CUDA version 10.1, compute capability 6.1, 3019MB, 2943MB available, 4053 GFLOPS peak) 6: 23-Mar-2019 10:04:14 (low) [] CUDA: NVIDIA GPU 1: GeForce GTX 1060 3GB (driver version 418.39, CUDA version 10.1, compute capability 6.1, 3019MB, 2943MB available, 4053 GFLOPS peak) 7: 23-Mar-2019 10:04:14 (low) [] CUDA: NVIDIA GPU 2: GeForce GTX 1060 3GB (driver version 418.39, CUDA version 10.1, compute capability 6.1, 3019MB, 2943MB available, 4053 GFLOPS peak) 8: 23-Mar-2019 10:04:14 (low) [] CUDA: NVIDIA GPU 3: GeForce GTX 1060 3GB (driver version 418.39, CUDA version 10.1, compute capability 6.1, 3019MB, 2943MB available, 4053 GFLOPS peak) 9: 23-Mar-2019 10:04:14 (low) [] CUDA: NVIDIA GPU 4: GeForce GTX 1060 3GB (driver version 418.39, CUDA version 10.1, compute capability 6.1, 3019MB, 2943MB available, 4053 GFLOPS peak) 10: 23-Mar-2019 10:04:14 (low) [] CUDA: NVIDIA GPU 5: GeForce GTX 1060 3GB (driver version 418.39, CUDA version 10.1, compute capability 6.1, 3019MB, 2943MB available, 4053 GFLOPS peak) 11: 23-Mar-2019 10:04:14 (low) [] CUDA: NVIDIA GPU 6: GeForce GTX 1060 3GB (driver version 418.39, CUDA version 10.1, compute capability 6.1, 3019MB, 2943MB available, 4053 GFLOPS peak) 12: 23-Mar-2019 10:04:14 (low) [] CUDA: NVIDIA GPU 7: GeForce GTX 1060 3GB (driver version 418.39, CUDA version 10.1, compute capability 6.1, 3019MB, 2943MB available, 4053 GFLOPS peak) And I getting this: 74: 23-Mar-2019 10:17:03 (low) [NumberFields@home] Sending scheduler request: Requested by user. 75: 23-Mar-2019 10:17:03 (low) [NumberFields@home] Requesting new tasks for NVIDIA GPU 76: 23-Mar-2019 10:17:05 (low) [NumberFields@home] Scheduler request completed: got 0 new tasks 77: 23-Mar-2019 10:17:05 (user notification) [NumberFields@home] This project is using an old URL. When convenient, remove the project, then add http://numberfields.asu.edu/NumberFields/ Sorry, I missed this earlier... Did you ever get it to work? Could it have been there was no work available at the time? There should be plenty of work now. I also don't think that old url message should affect anything, I have seen that before and it didn't affect downloading of work. But you could always try resetting the project anyways to see if it helps. ID: 2227 · Rating: 0 · rate: / Reply Quote

lakewik Send message Joined: 7 Apr 15 Posts: 4 Credit: 18,772,724 RAC: 0	Message 2228 - Posted: 24 Mar 2019, 16:47:14 UTC - in response to Message 2227. Yes, now it working, probably yesterday there was out of work, but I'm limited by CPU and can use only 4 GPU so i need to buy CPU with more cores :) ID: 2228 · Rating: 0 · rate: / Reply Quote

davidBAM Send message Joined: 25 Oct 18 Posts: 15 Credit: 112,744,248 RAC: 0	Message 2229 - Posted: 24 Mar 2019, 16:51:14 UTC - in response to Message 2228. Yes, it does seem to take a whole CPU core to keep each GPU WU busy ID: 2229 · Rating: 0 · rate: / Reply Quote

Eric Driver Project administrator Project developer Project tester Project scientist Send message Joined: 8 Jul 11 Posts: 1388 Credit: 696,162,292 RAC: 829,239	Message 2230 - Posted: 24 Mar 2019, 18:11:42 UTC - in response to Message 2228. Yes, now it working, probably yesterday there was out of work, but I'm limited by CPU and can use only 4 GPU so i need to buy CPU with more cores :) Does it help if you set cpu_usage to .1 in app_config.xml? ID: 2230 · Rating: 0 · rate: / Reply Quote

lakewik Send message Joined: 7 Apr 15 Posts: 4 Credit: 18,772,724 RAC: 0	Message 2231 - Posted: 24 Mar 2019, 18:59:01 UTC Last modified: 24 Mar 2019, 19:25:51 UTC Yes it helps :), but also I see one more strange behavior. On default configuration, in BOINC manager I see 5 WUs running: (0.981 CPUs + 1 NVIDIA GPU (device 0)) (0.981 CPUs + 1 NVIDIA GPU (device 1)) (0.981 CPUs + 1 NVIDIA GPU (device 2)) (0.981 CPUs + 1 NVIDIA GPU (device 3)) (0.981 CPUs + 1 NVIDIA GPU (device 4)) but in practice it working only on one GPU: +-----------------------------------------------------------------------------+ \| Processes: GPU Memory \| \| GPU PID Type Process name Usage \| \|=============================================================================\| \| 0 840 G /usr/lib/xorg/Xorg 6MiB \| \| 0 29346 C ...Decics_3.00_x86_64-pc-linux-gnu__cuda30 195MiB \| \| 0 29347 C ...Decics_3.00_x86_64-pc-linux-gnu__cuda30 195MiB \| \| 0 29348 C ...Decics_3.00_x86_64-pc-linux-gnu__cuda30 195MiB \| \| 0 29349 C ...Decics_3.00_x86_64-pc-linux-gnu__cuda30 195MiB \| \| 0 31804 C ...Decics_3.00_x86_64-pc-linux-gnu__cuda30 195MiB \| \| 1 840 G /usr/lib/xorg/Xorg 6MiB \| \| 2 840 G /usr/lib/xorg/Xorg 6MiB \| \| 3 840 G /usr/lib/xorg/Xorg 6MiB \| \| 4 840 G /usr/lib/xorg/Xorg 6MiB \| \| 5 840 G /usr/lib/xorg/Xorg 6MiB \| \| 6 840 G /usr/lib/xorg/Xorg 6MiB \| \| 7 840 G /usr/lib/xorg/Xorg 6MiB \| +-----------------------------------------------------------------------------+ ID: 2231 · Rating: 0 · rate: / Reply Quote

Azmodes Send message Joined: 20 May 18 Posts: 6 Credit: 165,471,630 RAC: 0	Message 2233 - Posted: 24 Mar 2019, 19:21:50 UTC - in response to Message 2231. Last modified: 24 Mar 2019, 20:01:40 UTC ^I have seen the same behaviour on a 2-GPU system. It shows one task for each device, but it's actually crunching both on a single card while the other one is left idle. EDIT: For now I have simply added an exclusion for the second GPU (GTX 980) and assigned it to another project. The first (GTX 1660 Ti) is now crunching two tasks at the same time with a noticeable boost in GPU utilization (and throughput? projected credit/day is 4+ million). Too bad each seems to require a full CPU thread. ID: 2233 · Rating: 0 · rate: / Reply Quote

Mumps [MM] Send message Joined: 19 Aug 11 Posts: 8 Credit: 254,440,744 RAC: 2,358	Message 2234 - Posted: 24 Mar 2019, 20:05:51 UTC Would it be possible for the Server Status page to separate the CPU/GPU WU counts out? So we can tell if there are GPU WU's available? ID: 2234 · Rating: 0 · rate: / Reply Quote