New GPU OpenCL versions available

Message boards : News : New GPU OpenCL versions available
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 · Next

AuthorMessage
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1320
Credit: 409,071,730
RAC: 243,965
Message 2738 - Posted: 8 May 2020, 15:56:38 UTC - in response to Message 2736.  

Hi Eric Driver and mg13[HWU],

If you have time and want to try version 19.9.2, just for the correctness of the tests and my curiosity, thank you.


Of course, I’m going to find some spare time to do it but I’d like Eric Driver reads what I’ve written below.

The test makes sense if the development team tell me how to enable the debug mode and to collect the trace/log file.
I explain me better.
My environment could be affected by the obsolescence of the GPU HW, not by a device driver bug.
Only the trace file could show us where the root cause was.


I am 99.9% sure the problem with Paolo's card is its age. Several volunteers tested this offline and it was found that cards produced before ~2014 had issues, no matter what driver version was used. The architecture would be a better metric than age (although they are correlated), but I don't have that information. I suggest not wasting any more time fiddling with drivers.

mg13 has a different problem. He has a newer card that requires the older driver version to work. This is because of a change that AMD made to their openCL implementation in their latest drivers.
ID: 2738 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mippi

Send message
Joined: 29 Apr 20
Posts: 4
Credit: 3,003,261
RAC: 1,328
Message 2766 - Posted: 16 May 2020, 2:08:08 UTC - in response to Message 2738.  

Hi Administrator,
I have got AMD Radeon HD 8790M (2048MB) OpenCL: 1.2 and I cannot get any jobs for it:(

Is it OK? If so, could you publish a list of supported cards or at least requirements for cards to be useful for the projct, please?

By the way, it would be great to support Intel GPUs, there are so many around the world in every core processor:)

Thanks

Adam
ID: 2766 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1320
Credit: 409,071,730
RAC: 243,965
Message 2767 - Posted: 16 May 2020, 3:51:47 UTC - in response to Message 2766.  

Hi Administrator,
I have got AMD Radeon HD 8790M (2048MB) OpenCL: 1.2 and I cannot get any jobs for it:(

Is it OK? If so, could you publish a list of supported cards or at least requirements for cards to be useful for the projct, please?

By the way, it would be great to support Intel GPUs, there are so many around the world in every core processor:)

Thanks

Adam



There should be work. Do you have the box checked to allow beta apps?

It would be difficult to publish a list of supported cards, since there are so many out there. What I can say is that cards produced after about 2014 seem to work most of the time. I haven't had the time yet to figure out a better discriminator, but my best guess is that the GCN architecture version plays a role.

It's hard to say if your card will work properly. Best to keep an eye on it the first time to make sure it doesn't hang.

In theory, the Intel GPUs should be easy to implement as long as they support openCL 1.2. I plan to look into that after I improve the AMD version.
ID: 2767 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mippi

Send message
Joined: 29 Apr 20
Posts: 4
Credit: 3,003,261
RAC: 1,328
Message 2768 - Posted: 17 May 2020, 21:09:30 UTC - in response to Message 2767.  

Thanks for your quick reply. Indeed I did not tick the beta version software box.

However, I downloaded the new software, new jobs and it does not work. It starts working, going very well up to 96% and then return to 3.56% and is frozen at that value without any changes. I work on wu_sf3_DS-15x271_Grp4189113of6553600. I have tried to restart many times, but always the same effect:( Moreover, the process takes more than 4GB of my memory which is a little bit high.

Related to the rest of your message, I think it would be good if you could publish an open ranking with user confirmation that a card works or not simply based on experience.
ID: 2768 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1320
Credit: 409,071,730
RAC: 243,965
Message 2769 - Posted: 17 May 2020, 22:16:34 UTC - in response to Message 2768.  

Thanks for your quick reply. Indeed I did not tick the beta version software box.

However, I downloaded the new software, new jobs and it does not work. It starts working, going very well up to 96% and then return to 3.56% and is frozen at that value without any changes. I work on wu_sf3_DS-15x271_Grp4189113of6553600. I have tried to restart many times, but always the same effect:( Moreover, the process takes more than 4GB of my memory which is a little bit high.

Related to the rest of your message, I think it would be good if you could publish an open ranking with user confirmation that a card works or not simply based on experience.


Sounds like your card should be added to the doesn't work column. And yes, I can publish something when I find time. I'm not sure exactly what you are suggesting, but I can start a thread that users can report on and I could post the list on a web page.

I also have some ideas for making the openCL work better. For example, breaking the current code into smaller kernels. The hope would be to get the older cards working.
ID: 2769 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mippi

Send message
Joined: 29 Apr 20
Posts: 4
Credit: 3,003,261
RAC: 1,328
Message 2770 - Posted: 17 May 2020, 22:40:29 UTC - in response to Message 2769.  

Hi Eric,
thanks for your reply which is not good for me as I need to abort 21 tasks:( I will continue with CPU support for your project, but it is not massive:( GPUs are much more effective for math projects.

And yes, I can publish something when I find time. I'm not sure exactly what you are suggesting, but I can start a thread that users can report on and I could post the list on a web page.

I think the best solution would be a simple spread sheet with GPU names and tick boxes that users could confirm if it works or not. Honestly there are not so many GPUs in the market to consider, I believe less than 100. I can imagine your time is very limited, so maybe you need some volunteers :)

I also have some ideas for making the openCL work better. For example, breaking the current code into smaller kernels. The hope would be to get the older cards working.

I think it is a great idea, there are so many people with older cards which can help and there are no many projects they can be used. CPU version takes very small amoung of resources (probably the smallest requirements among all BOINC projects) and can be run on very old machines, so it would be great to have similar GPU version. For instance I have got more than old Geforce 210 cards which I could use for your project.
And obviously Intel GPUs. Everyone who has got Intel Core process has got a GPU which is pretty powerful and at the moment just 2 BOINC projects can use that feature....

Thanks!
ID: 2770 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1320
Credit: 409,071,730
RAC: 243,965
Message 2771 - Posted: 18 May 2020, 4:45:48 UTC - in response to Message 2770.  

I think the best solution would be a simple spread sheet with GPU names and tick boxes that users could confirm if it works or not. Honestly there are not so many GPUs in the market to consider, I believe less than 100. I can imagine your time is very limited, so maybe you need some volunteers :)


I'm not sure how to implement a spread sheet with tick boxes. Do you mean a web form for users to click boxes? Integrating that into the server web code would be time consuming, and maintaining it would be a nightmare during server upgrades.
ID: 2771 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mippi

Send message
Joined: 29 Apr 20
Posts: 4
Credit: 3,003,261
RAC: 1,328
Message 2772 - Posted: 21 May 2020, 0:31:18 UTC - in response to Message 2771.  

So, you can do that in a simpler way:
Card name (type) + votes
If the card does not work, there will be more negative votes than positive, if it works more positive, simple as that.

Then, you only need to keep number of votes and nothing more.
ID: 2772 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1320
Credit: 409,071,730
RAC: 243,965
Message 2773 - Posted: 21 May 2020, 4:11:47 UTC - in response to Message 2772.  

So, you can do that in a simpler way:
Card name (type) + votes
If the card does not work, there will be more negative votes than positive, if it works more positive, simple as that.

Then, you only need to keep number of votes and nothing more.


But the issue is how to get the votes. I still think the easiest way is a forum thread where people can post whether or not their card worked. Then I can link to a summarized list.
ID: 2773 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mg13 [HWU]
Avatar

Send message
Joined: 24 May 19
Posts: 38
Credit: 1,186,123
RAC: 1,321
Message 2809 - Posted: 2 Jun 2020, 21:43:33 UTC - in response to Message 2773.  

Update.
AMD has released a new beta version of the drivers, version 20.5.1 and I tried it, it works, but finished processing the GPU remains 99% busy, after which it crashes the PC and then restarts.
This problem does so at every WU processed.
OpenCL: AMD/ATI GPU 0: AMD Radeon RX 5700 XT 50th Anniversary (driver version 3075.12 (PAL,LC), device version OpenCL 2.0 AMD-APP (3075.12), 8176MB, 8176MB available, 4685 GFLOPS peak).
Reported issue to AMD and I invite anyone with the same problem, to do the same.
Let's see if they solve the problem to the fury of reports.
I also upgraded Windows 10 to the 2004 version and the BOINC client to version 7.16.7 all 64-bit.
ID: 2809 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1320
Credit: 409,071,730
RAC: 243,965
Message 2810 - Posted: 2 Jun 2020, 22:52:30 UTC - in response to Message 2809.  

Update.
AMD has released a new beta version of the drivers, version 20.5.1 and I tried it, it works, but finished processing the GPU remains 99% busy, after which it crashes the PC and then restarts.
This problem does so at every WU processed.
OpenCL: AMD/ATI GPU 0: AMD Radeon RX 5700 XT 50th Anniversary (driver version 3075.12 (PAL,LC), device version OpenCL 2.0 AMD-APP (3075.12), 8176MB, 8176MB available, 4685 GFLOPS peak).
Reported issue to AMD and I invite anyone with the same problem, to do the same.
Let's see if they solve the problem to the fury of reports.
I also upgraded Windows 10 to the 2004 version and the BOINC client to version 7.16.7 all 64-bit.


Thanks for the update!
ID: 2810 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mg13 [HWU]
Avatar

Send message
Joined: 24 May 19
Posts: 38
Credit: 1,186,123
RAC: 1,321
Message 2862 - Posted: 13 Jul 2020, 14:42:24 UTC - in response to Message 2810.  

Double upgrade.
AMD has released a new beta version of the drivers, version 20.5.1 HWS, I tried but does not work, the percentage of processing progress of the application stops at 0.542% and if it stops or pauses the processing and then therefore the application, in driver metrics the GPU usage gets stuck at 99% and after a few minutes crashes the system and restarts.
OpenCL: AMD/ATI GPU 0: AMD Radeon RX 5700 XT 50th Anniversary (driver version 3075.12 (PAL,LC), OpenCL 2.0 AMD-APP device version (3075.12), 8176MB, 8176MB available, peak of 4685 GFLOPS).

After a few days AMD released a new beta version of the drivers, version 20.7.1 and this works strangely:
the percentage of application processing progress stops at 0.542% and if you stop or pause processing and then then pause the application, in driver metrics GPU usage remains stuck at 99% for just over a minute after that returns to 0% as it is right, without system locks or restarts.
When you restart the application, the percentage starts at 0.542% and then stays still, while the processing time starts from zero but advances.
I also noticed that by opening the Microsoft Edge application (with active in the settings "SYSTEM>uses hardware acceleration when available" ), the GPU usage driver metrics stopped between 1 and 4%, and the application's processing percentage resumed to advance until the WU was completed.
It also created the checkpoint file.
When processing a new WU starts, the processing percentage stays at zero for about one minute, then suddenly advances more than 4% and then back to 0.189% after 1-2 seconds and then stay there still.
OpenCL: AMD/ATI GPU 0: AMD Radeon RX 5700 XT 50th Anniversary (driver version 3110.7 (PAL,LC), device version OpenCL 2.0 AMD-APP (3110.7), 8176MB, 8176MB available, 9370 GFLOPS peak)

Reported both issues to AMD and I invite anyone with the same problem, to do the same.
Let's see if they solve the problem to the fury of relationships.
ID: 2862 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mg13 [HWU]
Avatar

Send message
Joined: 24 May 19
Posts: 38
Credit: 1,186,123
RAC: 1,321
Message 2880 - Posted: 24 Aug 2020, 14:36:04 UTC - in response to Message 2862.  

Double upgrade.
AMD has released a new beta version of the drivers, the 20.8.1 and after a few days released another new beta version of the drivers, the 20.8.2 and both work oddly:
the percentage of application processing progress stops at x%, and if you pause processing and then then the application, in driver metrics GPU usage gets stuck at 99% for just over a minute after that back to 0% as it is right, without system locks or restarts.
When you restart the application, the percentage starts from x% where it stopped first and then remains stuck there, while the processing time starts from scratch but advances.
The same thing happens if you pause all ongoing processing of the BOINC client and then restart.
Whereas if you close the BOINC client and therefore all the processing in progress and after about in the minute, when the GPU usage driver metrics drop from 99% to 0%, you reopen the BOINC client and the processing starts again, the percentage and processing time of the application start from scratch and after about a minute, then suddenly advances to x% and then stay there stuck, while time advances.
I also noticed that by opening the Microsoft Edge application (with GPU hardware acceleration set), the GPU usage driver metrics stopped at 1 to 4%, and the application's processing percentage resumed to advance until the WU was completed.
It also created the checkpoint file.
When processing a new WU starts, the processing percentage stays at zero for about one minute, then suddenly advances x% and then stays there stuck as time progresses.
Both versions, OpenCL: AMD/ATI GPU 0: AMD Radeon RX 5700 XT 50th Anniversary (driver version 3110.7 (PAL,LC), OpenCL 2.0 AMD-APP device version (3110.7), 8176MB, 8176MB available, 9370 GFLOPS peak)

Reported both issues to AMD and I invite anyone with the same problem to do the same.
Let's see if they solve the problem to the fury of relationships.
ID: 2880 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mg13 [HWU]
Avatar

Send message
Joined: 24 May 19
Posts: 38
Credit: 1,186,123
RAC: 1,321
Message 2881 - Posted: 3 Sep 2020, 11:49:37 UTC - in response to Message 2880.  

Update.
AMD has released a new beta version of the drivers, the 20.8.3 and works like the recent versions 20.8.1 and 20.8.2.
OpenCL: AMD/ATI GPU 0: AMD Radeon RX 5700 XT 50th Anniversary (driver version 3110.7 (PAL,LC), device version OpenCL 2.0 AMD-APP (3110.7), 8176MB, 8176MB available, 9370 GFLOPS peak).
Reported issue to AMD and I invite anyone, with the same problem, to do the same.
Let's see if they solve the problem to the fury of reports.
ID: 2881 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jim1348

Send message
Joined: 5 Oct 19
Posts: 11
Credit: 2,176,974
RAC: 0
Message 2885 - Posted: 2 Oct 2020, 15:14:53 UTC - in response to Message 2881.  

My RX 570 with the Radeon 20.9.1 drivers (Win7 64-bit) is completing them OK in about 20 minutes.
It consumes 66 watts according to GPU-Z.
https://numberfields.asu.edu/NumberFields/results.php?hostid=1918394&offset=0&show_names=0&state=4&appid=

My GTX 1060 (Ubuntu 18.04.5) is completing them about twice as fast.
And it consumes about 58 watts according to nvidia-smi -l.
https://numberfields.asu.edu/NumberFields/results.php?hostid=2726664&offset=0&show_names=0&state=4&appid=

I don't know how that compares to a CPU core for efficiency, but it would be an interesting test.
ID: 2885 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1320
Credit: 409,071,730
RAC: 243,965
Message 2886 - Posted: 2 Oct 2020, 16:49:23 UTC - in response to Message 2885.  

My RX 570 with the Radeon 20.9.1 drivers (Win7 64-bit) is completing them OK in about 20 minutes.
It consumes 66 watts according to GPU-Z.
https://numberfields.asu.edu/NumberFields/results.php?hostid=1918394&offset=0&show_names=0&state=4&appid=

My GTX 1060 (Ubuntu 18.04.5) is completing them about twice as fast.
And it consumes about 58 watts according to nvidia-smi -l.
https://numberfields.asu.edu/NumberFields/results.php?hostid=2726664&offset=0&show_names=0&state=4&appid=

I don't know how that compares to a CPU core for efficiency, but it would be an interesting test.


You will also notice the AMD version has a much smaller CPU time than the Nvidia version, meaning the AMD version uses less of the CPU. Note that Nvidia cards operate the same with both the OpenCL and Cuda versions, so the difference between AMD and Nvidia is due to their drivers (the openCL implementation is part of the driver).
ID: 2886 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jim1348

Send message
Joined: 5 Oct 19
Posts: 11
Credit: 2,176,974
RAC: 0
Message 2887 - Posted: 2 Oct 2020, 18:43:51 UTC - in response to Message 2886.  

You will also notice the AMD version has a much smaller CPU time than the Nvidia version, meaning the AMD version uses less of the CPU. Note that Nvidia cards operate the same with both the OpenCL and Cuda versions, so the difference between AMD and Nvidia is due to their drivers (the openCL implementation is part of the driver).

Yes, the RX 570 uses only 26% of a CPU core (i7-4771), whereas the GTX 1060 uses 99.6% of a Ryzen 2700 core.
CUDA used to use less CPU power, but that was a few years ago. Maybe they changed the driver, or the options for using it.
ID: 2887 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1320
Credit: 409,071,730
RAC: 243,965
Message 2888 - Posted: 2 Oct 2020, 21:12:46 UTC - in response to Message 2887.  

You will also notice the AMD version has a much smaller CPU time than the Nvidia version, meaning the AMD version uses less of the CPU. Note that Nvidia cards operate the same with both the OpenCL and Cuda versions, so the difference between AMD and Nvidia is due to their drivers (the openCL implementation is part of the driver).

Yes, the RX 570 uses only 26% of a CPU core (i7-4771), whereas the GTX 1060 uses 99.6% of a Ryzen 2700 core.
CUDA used to use less CPU power, but that was a few years ago. Maybe they changed the driver, or the options for using it.


Someone pointed out to me that CUDA has a blockingSync flag that limits the cpu usage without much hit to the performance. My first attempt at using it caused a crash and I haven't gone back to it yet. It's on my to-do list along with a hundred other things.
ID: 2888 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 28 Oct 11
Posts: 179
Credit: 223,069,734
RAC: 122,956
Message 2889 - Posted: 3 Oct 2020, 8:56:12 UTC - in response to Message 2888.  

Someone pointed out to me that CUDA has a blockingSync flag that limits the cpu usage without much hit to the performance. My first attempt at using it caused a crash and I haven't gone back to it yet. It's on my to-do list along with a hundred other things.
Synchronisation is definitely one of the black arts of GPU programming. David Anderson wrote a useful primer for BOINC application programmers 10 years ago - https://boinc.berkeley.edu/trac/wiki/GpuSync - but so far as I know has never revisited the subject since.

The native CUDA language for NVidia probably has the best support for various synchronisation methods - since David wrote, Cuda 5.0 has added stream callbacks, and I'm told they're the most efficient - but I've never seen them used in a BOINC science application. I've also been told they were not added to the OpenCL interface (though https://www.khronos.org/registry/OpenCL/sdk/1.1/docs/man/xhtml/clSetEventCallback.html suggests otherwise).

The CPU load of a GPU application varies hugely according to the architecture of the GPU, the programming language chosen, and the sync model used. I tried to start a debate on tackling those issues within BOINC a year ago - https://github.com/BOINC/boinc/issues/2949 - but interest has been, to say the least, minimal. We are now up to CUDA 11.1, with doubled FP32 pathways on the RTX 30x0 range, but still nothing moves - it's all left to individual project programmers like yourself, with hundreds of other priorities to juggle.
ID: 2889 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mg13 [HWU]
Avatar

Send message
Joined: 24 May 19
Posts: 38
Credit: 1,186,123
RAC: 1,321
Message 2890 - Posted: 3 Oct 2020, 23:06:55 UTC - in response to Message 2889.  

Update.
AMD has released a new beta version of the drivers (WIN 10 64 bit vers 2004), the 20.9.1 and the 20.9.2 and both work like the latest version 20.8.3.
In addition, AMD has also released a new Recommended (WHQL) version of the drivers, the 20.9.1 and this also works as the aforementioned 20.8.3.
OpenCL: AMD/ATI GPU 0: AMD Radeon RX 5700 XT 50th Anniversary (driver version 3110.7 (PAL,LC), device version OpenCL 2.0 AMD-APP (3110.7), 8176MB, 8176MB available, 9370 GFLOPS peak).
Reported issue to AMD and I invite anyone, with the same problem, to do the same.
Let's see if they solve the problem to the fury of reports.
ID: 2890 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 · 6 · Next

Message boards : News : New GPU OpenCL versions available


Main page · Your account · Message boards


Copyright © 2024 Arizona State University