Support for Intel GPUs

Message boards : News : Support for Intel GPUs
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · Next

AuthorMessage
poppinfresh99

Send message
Joined: 1 Mar 22
Posts: 2
Credit: 1,174
RAC: 0
Message 3256 - Posted: 7 Jun 2022, 16:59:34 UTC - in response to Message 3178.  

I just tested a few weak Intel GPUs (mostly an Intel HD Graphics 4000), and they did not work but seemed to be working. I'd like to summarize this situation (described in parts throughout this thread) for anyone else who encounters the same.

For weak Intel GPUs, the task seems to be making progress because...
- its Progress percent keeps going up, though the increase slows over time (mine reached 98% after 23 hours after an initial 6-hour-total pace)
- it fully runs a CPU core
- it uses a lot of RAM (around a GB for me depending on device) even though my AMD GPU uses only a little RAM, though I would predict that Intel GPUs that do not stall would not use as much RAM

However, there are many ways to know that a NumberFields task never really started...
- checkpoints are not being done (Progress resets after task is suspended)
- in its BOINC/slots/ folder, boinc_task_state.xml is never created
- in its BOINC/slots/ folder, the stderr.txt file only has a couple lines (for me, I only saw the usual "GPU not found in table" 6 lines)
ID: 3256 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
poppinfresh99

Send message
Joined: 1 Mar 22
Posts: 2
Credit: 1,174
RAC: 0
Message 3257 - Posted: 7 Jun 2022, 17:01:24 UTC - in response to Message 3178.  
Last modified: 7 Jun 2022, 17:01:57 UTC

Yes, we only use integer ops here.

The good news is the results are correct.


Since OpenCL makes arithmetic errors on AMD and Intel GPUs (for 64-bit integers at least), how do you know if the results are correct? I am curious since this project has single-task validation.
ID: 3257 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1344
Credit: 530,311,408
RAC: 560,936
Message 3258 - Posted: 8 Jun 2022, 4:08:36 UTC - in response to Message 3257.  

Yes, we only use integer ops here.

The good news is the results are correct.


Since OpenCL makes arithmetic errors on AMD and Intel GPUs (for 64-bit integers at least), how do you know if the results are correct? I am curious since this project has single-task validation.


What do you mean by "OpenCL makes arithmetic errors... for 64-bit integers".

That should never happen. I could possibly see 64-bit floats having some round off errors, but not integers.

Anyways, during development, I ran hundreds of GPU cases and compared against the CPU versions and they all agreed after computing billions of polynomial discriminants. So I am pretty confident when I say the OpenCL code produces correct results.
ID: 3258 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Bill F

Send message
Joined: 27 Sep 21
Posts: 11
Credit: 2,411,392
RAC: 3,547
Message 3259 - Posted: 9 Jun 2022, 3:09:33 UTC - in response to Message 3258.  

You have more months of experience to consider now. Are you leaving the App as BETA because of the issues that the weaker IGPU systems may have with inexperienced users or other reason's ?

Thanks
Bill F
ID: 3259 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1344
Credit: 530,311,408
RAC: 560,936
Message 3260 - Posted: 9 Jun 2022, 3:24:46 UTC - in response to Message 3259.  

You have more months of experience to consider now. Are you leaving the App as BETA because of the issues that the weaker IGPU systems may have with inexperienced users or other reason's ?

Thanks
Bill F


Yes, that is the primary reason. And it's easy to enable beta apps for those that want to try it out.
ID: 3260 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Bill F

Send message
Joined: 27 Sep 21
Posts: 11
Credit: 2,411,392
RAC: 3,547
Message 3261 - Posted: 10 Jun 2022, 1:26:51 UTC

I have added two systems to the project in the last 24 hours that have Intel GPU's. After reviewing this tread and seeing that some Intel GPU's have been identified as successful and some not. Can the successful models be added one or more at a time, to the look up table, for optimization ?

From my first successful task

GPU Summary String = [INTEL|Intel(R)UHDGraphics620|1|3227MB||300].
Loading GPU lookup table from file.
GPU was not found in the lookup table. Using default values:
numBlocks = 1024.
threadsPerBlock = 32.
polyBufferSize = 32768.


System information
GenuineIntel
Intel(R) Core(TM) i5-8265U CPU @ 1.60GHz [Family 6 Model 142 Stepping 11]
(8 processors) INTEL Intel(R) UHD Graphics 620 (3227MB) OpenCL: 3.0

Thank you
Bill F
ID: 3261 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1344
Credit: 530,311,408
RAC: 560,936
Message 3262 - Posted: 10 Jun 2022, 5:37:25 UTC - in response to Message 3261.  

I have added two systems to the project in the last 24 hours that have Intel GPU's. After reviewing this tread and seeing that some Intel GPU's have been identified as successful and some not. Can the successful models be added one or more at a time, to the look up table, for optimization ?

From my first successful task

GPU Summary String = [INTEL|Intel(R)UHDGraphics620|1|3227MB||300].
Loading GPU lookup table from file.
GPU was not found in the lookup table. Using default values:
numBlocks = 1024.
threadsPerBlock = 32.
polyBufferSize = 32768.


System information
GenuineIntel
Intel(R) Core(TM) i5-8265U CPU @ 1.60GHz [Family 6 Model 142 Stepping 11]
(8 processors) INTEL Intel(R) UHD Graphics 620 (3227MB) OpenCL: 3.0

Thank you
Bill F


Yes, a line could be added to the lookup table. But there is no reason to do that until someone has done the dirty work of determining what the optimal settings should be. The default settings were chosen conservatively so they would work well for most cards.

If you were interested in trying to get optimal values, one thing you could do is edit the gpuLookupTable.txt file in the project directory to add a line for your gpu. You would have to run enough WUs to get a good average in order to compare your new settings with the default settings. And it would definitely help to understand the underlying architecture of the gpu to give a good starting point for the settings. For example, Nvidia uses a "warp size" of 32, which is the number of threads to keep in lockstep, and it turns out the optimal value for threadsPerBlock was 32.
ID: 3262 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
kotenok2000

Send message
Joined: 27 Dec 19
Posts: 6
Credit: 623,103
RAC: 0
Message 3264 - Posted: 10 Jun 2022, 16:45:15 UTC - in response to Message 3262.  

Can someone create a script that would run test workunit and tune worker?
ID: 3264 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1344
Credit: 530,311,408
RAC: 560,936
Message 3265 - Posted: 10 Jun 2022, 23:10:11 UTC - in response to Message 3264.  

Can someone create a script that would run test workunit and tune worker?


This has actually been done. It took some searching but I found where this was discussed in the past:
https://numberfields.asu.edu/NumberFields/forum_thread.php?id=488&postid=3063#3063

In a nutshell, there is a public github project with all the source code and testing scripts. For the GPU testing, see the README file in the test directory. And don't hesitate to ask if you have any questions.

Also see this for a discussion of the lookup table:
https://numberfields.asu.edu/NumberFields/forum_thread.php?id=501&postid=3176#3176
ID: 3265 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
[AF>EDLS]zOU

Send message
Joined: 27 Feb 16
Posts: 11
Credit: 14,722,790
RAC: 0
Message 3406 - Posted: 5 Nov 2022, 9:17:28 UTC

Adding an Intel ARC A750 to the pool, watch these spaces => https://numberfields.asu.edu/NumberFields/show_host_detail.php?hostid=2835341 and https://numberfields.asu.edu/NumberFields/results.php?hostid=2835341
ID: 3406 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1344
Credit: 530,311,408
RAC: 560,936
Message 3408 - Posted: 5 Nov 2022, 16:17:17 UTC - in response to Message 3406.  

Adding an Intel ARC A750 to the pool, watch these spaces => https://numberfields.asu.edu/NumberFields/show_host_detail.php?hostid=2835341 and https://numberfields.asu.edu/NumberFields/results.php?hostid=2835341


It's been several hours and nothing returned yet, I hope that's not a bad sign.

I would expect the Arc A750 to do well, assuming Intel puts out a decent openCL driver.
ID: 3408 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
[AF>EDLS]zOU

Send message
Joined: 27 Feb 16
Posts: 11
Credit: 14,722,790
RAC: 0
Message 3409 - Posted: 5 Nov 2022, 16:34:56 UTC - in response to Message 3408.  
Last modified: 5 Nov 2022, 16:47:24 UTC

Adding an Intel ARC A750 to the pool, watch these spaces => https://numberfields.asu.edu/NumberFields/show_host_detail.php?hostid=2835341 and https://numberfields.asu.edu/NumberFields/results.php?hostid=2835341


It's been several hours and nothing returned yet, I hope that's not a bad sign.

I would expect the Arc A750 to do well, assuming Intel puts out a decent openCL driver.


The machine is busy with Einstein@home tasks :D
Arc A750 doesn't have FP64 though

I'll pause Einstein and let some tasks run
ID: 3409 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1344
Credit: 530,311,408
RAC: 560,936
Message 3410 - Posted: 6 Nov 2022, 19:18:58 UTC - in response to Message 3409.  

Adding an Intel ARC A750 to the pool, watch these spaces => https://numberfields.asu.edu/NumberFields/show_host_detail.php?hostid=2835341 and https://numberfields.asu.edu/NumberFields/results.php?hostid=2835341


It's been several hours and nothing returned yet, I hope that's not a bad sign.

I would expect the Arc A750 to do well, assuming Intel puts out a decent openCL driver.


The machine is busy with Einstein@home tasks :D
Arc A750 doesn't have FP64 though

I'll pause Einstein and let some tasks run


I saw some cpu tasks go through. Any problems with tasks on the Arc A750?
ID: 3410 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
[AF>EDLS]zOU

Send message
Joined: 27 Feb 16
Posts: 11
Credit: 14,722,790
RAC: 0
Message 3413 - Posted: 10 Nov 2022, 21:23:33 UTC - in response to Message 3410.  

Adding an Intel ARC A750 to the pool, watch these spaces => https://numberfields.asu.edu/NumberFields/show_host_detail.php?hostid=2835341 and https://numberfields.asu.edu/NumberFields/results.php?hostid=2835341


It's been several hours and nothing returned yet, I hope that's not a bad sign.

I would expect the Arc A750 to do well, assuming Intel puts out a decent openCL driver.


The machine is busy with Einstein@home tasks :D
Arc A750 doesn't have FP64 though

I'll pause Einstein and let some tasks run


I saw some cpu tasks go through. Any problems with tasks on the Arc A750?

I don't see any errors, I've been away for a few days and only Einstein tasks were processed even though numberfield was enabled and on the same ressource share ...
ID: 3413 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
kotenok2000

Send message
Joined: 27 Dec 19
Posts: 6
Credit: 623,103
RAC: 0
Message 3414 - Posted: 10 Nov 2022, 21:24:21 UTC - in response to Message 3413.  
Last modified: 10 Nov 2022, 21:24:40 UTC

Set Einstein to no new tasks then.
ID: 3414 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
[AF>EDLS]zOU

Send message
Joined: 27 Feb 16
Posts: 11
Credit: 14,722,790
RAC: 0
Message 3415 - Posted: 11 Nov 2022, 7:08:55 UTC - in response to Message 3414.  

bunch of WU processed :)

https://numberfields.asu.edu/NumberFields/result.php?resultid=161210884
https://numberfields.asu.edu/NumberFields/result.php?resultid=161211661

and plenty others
ID: 3415 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1344
Credit: 530,311,408
RAC: 560,936
Message 3416 - Posted: 11 Nov 2022, 15:18:02 UTC - in response to Message 3415.  

bunch of WU processed :)

https://numberfields.asu.edu/NumberFields/result.php?resultid=161210884
https://numberfields.asu.edu/NumberFields/result.php?resultid=161211661

and plenty others


Thx! Good to see it's working, and that run times are reasonable on Intel's newest GPUs.
ID: 3416 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Bill F

Send message
Joined: 27 Sep 21
Posts: 11
Credit: 2,411,392
RAC: 3,547
Message 3417 - Posted: 12 Nov 2022, 17:29:55 UTC - in response to Message 3416.  

bunch of WU processed :)

https://numberfields.asu.edu/NumberFields/result.php?resultid=161210884
https://numberfields.asu.edu/NumberFields/result.php?resultid=161211661

and plenty others


Thx! Good to see it's working, and that run times are reasonable on Intel's newest GPUs.


Run times are longer for the Intel GPU tasks. Are they doing larger amounts of science and should the credit amount be higher ?

Bill F
In October of 1969 I took an oath to support and defend the Constitution of the United States against all enemies, foreign and domestic;
There was no expiration date.


ID: 3417 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1344
Credit: 530,311,408
RAC: 560,936
Message 3418 - Posted: 13 Nov 2022, 4:46:51 UTC - in response to Message 3417.  

bunch of WU processed :)

https://numberfields.asu.edu/NumberFields/result.php?resultid=161210884
https://numberfields.asu.edu/NumberFields/result.php?resultid=161211661

and plenty others


Thx! Good to see it's working, and that run times are reasonable on Intel's newest GPUs.


Run times are longer for the Intel GPU tasks. Are they doing larger amounts of science and should the credit amount be higher ?

Bill F


Run times are longer compared to what? Compared to the previous dataset, run times are longer and credits have gone up accordingly.

If comparing to a different gpu then average times will probably be different. Each WU is a fixed amount of work and therefore has a fixed credit. Just because they run longer doesn't mean they deserve more credit. If it's running longer than expected then it could be an inefficient driver or bad OpenCL implementation.
ID: 3418 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Bill F

Send message
Joined: 27 Sep 21
Posts: 11
Credit: 2,411,392
RAC: 3,547
Message 3419 - Posted: 14 Nov 2022, 1:04:14 UTC - in response to Message 3418.  

bunch of WU processed :)

https://numberfields.asu.edu/NumberFields/result.php?resultid=161210884
https://numberfields.asu.edu/NumberFields/result.php?resultid=161211661

and plenty others


Thx! Good to see it's working, and that run times are reasonable on Intel's newest GPUs.


Run times are longer for the Intel GPU tasks. Are they doing larger amounts of science and should the credit amount be higher ?

Bill F


Run times are longer compared to what? Compared to the previous dataset, run times are longer and credits have gone up accordingly.

If comparing to a different gpu then average times will probably be different. Each WU is a fixed amount of work and therefore has a fixed credit. Just because they run longer doesn't mean they deserve more credit. If it's running longer than expected then it could be an inefficient driver or bad OpenCL implementation.


I was comparing to a default Windows Get Decic Fields v4.00 (default)windows_x86_64

You may have a point about Driver or implementation

Bill F
ID: 3419 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 · Next

Message boards : News : Support for Intel GPUs


Main page · Your account · Message boards


Copyright © 2024 Arizona State University