Posts by Eric Driver

1) Message boards : Number crunching : Consisting Crashing on GPU (Message 3618)
Posted 13 days ago by Profile Eric Driver
Post:
I suppose that NF is FP64-intense which is not friendly to NVIDIA cards. Am I right?


Actually, it's integer intensive.
2) Message boards : Number crunching : Consisting Crashing on GPU (Message 3612)
Posted 18 days ago by Profile Eric Driver
Post:
Yes. I see about 8x speedup on 4060 Laptop than the CPU when running 3 tasks in parallel, but with 60W power consumption. When on CPU, it only consumes 25W with 16 tasks in parallel. That's awkward because heterogeneous computing normally increases power efficiency by nearly a magnitude.


Where is this power measurement coming from? Is it the GPU only or the whole system?

Another thing to keep in mind is the GPU app also uses a portion of a CPU core, probably somewhere between 20% to 50% depending on the speed of the GPU. The CPU generates the list of polynomials to test and the GPU does the actual testing; when the GPU is really fast, the CPU has to work harder to keep up feeding it, hence the CPU usage goes up.
3) Message boards : Number crunching : Consisting Crashing on GPU (Message 3610)
Posted 19 days ago by Profile Eric Driver
Post:
Besides, I don't see any energy efficiency increase when switching to GPU as expected. Is that normal?


I'm not sure exactly what you mean. I see about 25x speedup on my 3070 Ti compared to a single cpu core, but it also uses a bunch more power, so not sure if it's any more energy efficient.
4) Message boards : Number crunching : Consisting Crashing on GPU (Message 3606)
Posted 20 days ago by Profile Eric Driver
Post:
Sorry for the frustration. I'm not sure what the problem is. I saw similar behavior years ago when overclocking the cpu - the system would overheat and then shut itself down. Maybe something similar is happening with the GPU?
5) Message boards : Number crunching : How can i earn septic count and ℚ(√-10) count? (Message 3602)
Posted 31 Oct 2023 by Profile Eric Driver
Post:
Also I did a little Paint.net work using layers to make the Galois Field badges much easier to distinguish apart at as low as 10% scale of the original size.

If you wish to use these, and I hope you do - feel free!

It should be relatively easy to swap out the images. But for some reason they are not loading for me. Not sure if it's my browser or the site. I will look into it later when I have a free moment.
6) Message boards : Number crunching : How can i earn septic count and ℚ(√-10) count? (Message 3598)
Posted 27 Oct 2023 by Profile Eric Driver
Post:
As title. How can i earn septic count and ℚ(√-10) count?


The septic search ended a long time ago, so no more for that one.

The ℚ(√-10) search is also known as sf7. This one is mostly being computed at Gerasim (no badges). Periodically, some cases are run here giving you the chance to earn some counts. Ironically, yesterday I dropped about 35k WUs for sf7, so that would have given you a chance.
7) Message boards : News : Support for Intel GPUs (Message 3595)
Posted 24 Oct 2023 by Profile Eric Driver
Post:
I have had 3 tasks fail after between 12 and 14 hours with the following error:
Exit status 198 (0x000000C6) EXIT_MEM_LIMIT_EXCEEDED

The tasks are https://numberfields.asu.edu/NumberFields/result.php?resultid=204294692, https://numberfields.asu.edu/NumberFields/result.php?resultid=204294798 and https://numberfields.asu.edu/NumberFields/result.php?resultid=204294071

Part of Stderr Output gives:

<core_client_version>7.24.1</core_client_version>
<![CDATA[
<message>
working set size > client RAM limit: 16385.39MB > 16309.76MB</message>
<stderr_txt>
GPU Summary String = [INTEL|Intel(R)HDGraphics4600|1|1629MB||102].
Loading GPU lookup table from file.
GPU was not found in the lookup table. Using default values:
numBlocks = 1024.
threadsPerBlock = 32.
polyBufferSize = 32768.


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Breakpoint Encountered (0x80000003) at address 0x00007ffe3287b892

Engaging BOINC Windows Runtime Debugger...

From these messages I understand that the tasks are running out of GPU memory.
Is there any way to reduce the amount of memory being used?

Thanks, Ruud


Sorry for your troubles. The application doesn't use near that much memory and I think that error code is referring to CPU memory not GPU (but I could be wrong). Also, the memory is allocated up front, so it shouldn't take 14 hours before it errors out.

Looking at the stderr, it doesn't even get to the openCL messages. My best guess for what is happening- the openCL driver is hanging during the build phase. More specifically, the openCL compiler gets stuck and is slowly chewing up system memory until it runs out (do you have 16GB of system memory?). Since the openCL compiler is part of the graphics driver, the only solution I see is to upgrade the driver, if that's even possible.
8) Message boards : News : Support for Intel GPUs (Message 3591)
Posted 7 Oct 2023 by Profile Eric Driver
Post:
Pardon me for the impertinence, but I may have a better solution for the GPU problems that this project is hampered by:

*Expand the number of GPUs in the file your project supplies! (gpuLookupTable_v402.txt)*

      GPU Name      |   numBlocks   |  threadsPerBlock
==========================================================
     GTX 1050       |      9600     |       32
     GTX 1050 Ti    |      9600     |       32
     GTX 1660       |      8192     |       32
      RX 570        |      2048     |       64
     GTX 1070       |      9600     |       32

is a bit meagre, don't you think?
There's whole generations of GPUs (and IGPs) that do not get mentioned, and the default values don't work for all those. Editing the file does not work, even when you have a line in the cc_config.xml preventing the check for changed files -at least not in my BOINC client version. It looks like the file gets overwritten at times too, because I have made the change several times -to no avail.


To remind you, the default values should work well for most cards (some older cards will have problems). The lookup table is for those who want to tweak the settings to eek out a little more performance for their specific card. I can't add new entries if I don't have access to other cards. The original hope was for some users to send me their optimal settings and then I could add them to the official lookup table.
9) Message boards : News : Batch Plan (Message 3585)
Posted 26 Sep 2023 by Profile Eric Driver
Post:
The plan is to go up to 13x11 and then move on to row 14. Rows 14 and 15 are comparable to row 13, so we should be able to get up to row 16 (the final row) with only data sets 13x12, 14x12, and 15x12 unfinished.
10) Message boards : News : Support for Intel GPUs (Message 3581)
Posted 14 Sep 2023 by Profile Eric Driver
Post:
I tried the solution
If it is truly a resource problem, you could try adding the following line to the gpuLookupTable file in your projects directory:
UHD Graphics 605     |     256     |      8

but it had no effect whatsoever -it still can't find the GPU in a certain table, apparently not the one in my drictory. I made my hosts visible now, so you can see more yourself.


It's been a while since I tried this, so I thought it might be wise to try it again. It still works for me.

For me, the file to change was: [BOINC_root]/projects/numberfields.asu.edu_NumberFields/gpuLookupTable_v402.txt

If that's the file you changed, then maybe the problem is the client version as mentioned earlier in this thread. My client/manager version was 7.20.2
11) Message boards : News : Support for Intel GPUs (Message 3576)
Posted 15 Aug 2023 by Profile Eric Driver
Post:
DKlimax - Thanks for looking into that.

So it looks like we have a working solution for anyone who wants to modify the GPU lookup table. Again, it's not necessary, but allows users the option to tweak parameters for their specific card (no guarantee, but it might improve performance by 5 to 10%).
12) Message boards : Number crunching : Older Batches (Message 3574)
Posted 13 Aug 2023 by Profile Eric Driver
Post:
I noticed on the batch status page the sf6-DS-12x9 has been marked completed. However, I am still getting a few of these tasks from time to time. Do you still need these tasks? If not, I would rather abort them and work on tasks that are needed.


When it gets very close to being done, I turn it off on the batch status page to reduce database queries. Tasks that are not needed anymore will be aborted, so no need to worry about it yourself.
13) Message boards : News : Support for Intel GPUs (Message 3571)
Posted 13 Aug 2023 by Profile Eric Driver
Post:
Thinking about this, there are only two options: Anonymous platform or removing signature and its requirement from gpuLookupTable.txt. Both require changing server-side settings. (I don't like downgrading BOINC for too many versions.)

Even if I wanted to, I don't think it's possible to change the signature requirement. That is there so the client knows it's running the true executable from the project and not some hacker's version that made its way onto your computer.

I did a quick search and I couldn't find anything about how to enable anonymous platforms on the project server. But I did find this:
<verify_files_on_app_start/>
    Before starting or restarting an app, check contents of input files and app version files by either MD5 or digital signature check. Detects user tampering with file (but doesn't really increase security, since user could also change MD5s or signatures in client state file). 

This option was not in the project config file, so I added it and set it to 0. Maybe the default value was on? We'll see if this changes anything...
14) Message boards : News : Support for Intel GPUs (Message 3567)
Posted 11 Aug 2023 by Profile Eric Driver
Post:
Have either of you considered using the configuration option

<dont_check_file_sizes>0|1</dont_check_file_sizes>
Normally, the size of application and input files are compared with the project-supplied values after the files are downloaded, and just before starting an application. If this flag is set, this check is skipped. Use it if you need to modify files locally for some reason.
(details in the User manual)


That sounds like a good explanation. I checked my cc_config and I did not have that set, so I'm not sure why it worked for me (maybe the older linux clients ignore the setting?)
15) Message boards : News : Support for Intel GPUs (Message 3565)
Posted 11 Aug 2023 by Profile Eric Driver
Post:
Sorry for the dumb question, but are we thinking this is a BOINC problem with the newer version or an error with the hardware producer itself?


If I had to guess, I would say they "fixed" the newer version of the client to make it more secure, so that the user can't change any of the application files. But again, that's only a guess, and there could be other explanations.
16) Message boards : Number crunching : Computing Preferences ??? (Message 3561)
Posted 10 Aug 2023 by Profile Eric Driver
Post:
Project doesn't follow the Computing Preferences I set the "days of work at":

Store at least 1 days of work
Store up to an additional 0.1 days of work

When I update the Computer I keep getting:

SagerNP8872T-S

2344 8/9/2023 1:26:40 PM - Store at least 2.00 days of work
2345 8/9/2023 1:26:40 PM - Store up to an additional 1.00 days of work

https://signature.statseb.fr/sig-1323.png


This sounds like a manager problem. The project server doesn't have control of that. Do you see this on other projects too?
17) Message boards : News : Support for Intel GPUs (Message 3559)
Posted 8 Aug 2023 by Profile Eric Driver
Post:
This experiment is unlikely to work. Apparently BOINC gets somewhat confused, deletes/resets file as corrupted, while somehow erroring out all GPU tasks by failing to copy over either version of file.

Maybe anonymous platform might provide a way to test this, but I haven't yet used it.

ETA: It looks like it affects only newer versions. One of my computers is still on version 7.16.20 and modification works, while all others are on 7.22.2 and failed.


You might be right. It works with all my managers, but I only run linux and my versions are 70.20.2 or older.
18) Message boards : News : Support for Intel GPUs (Message 3557)
Posted 6 Aug 2023 by Profile Eric Driver
Post:
I have seen the "Out of Resources" error when there is not enough RAM. Your card appears to have enough RAM, but is it possible something else could be using up the memory? Internet browsers are notorious for using huge amounts of GPU RAM.

The system is dedicated to running BOINC -various projects- on all the four cores of the Pentium J5005 at the same time.
But in general I think the UHD 605 should be capable of running the NumberFields OpenCL app. I am basing that on the following list of hosts that have successfully returned results:
[BOINC|7.22.2][INTEL|Intel(R) UHD Graphics 600|1|3021MB||300][opencl_gpu|Intel(R) UHD Graphics 600|1|3776MB|102]
[BOINC|7.20.2][INTEL|Intel(R) Iris(R) Xe Graphics|1|6427MB||300][vbox|6.1.34|0|1]
[BOINC|7.20.5][INTEL|Intel(R) Iris(R) Plus Graphics 655 [0x3ea5]|1|25565MB||300]
[BOINC|7.22.2][INTEL|Intel(R) UHD Graphics 630|1|6488MB||201][vbox|6.1.34|1|1]
[BOINC|7.22.2][INTEL|Intel(R) UHD Graphics 630|1|6415MB||300][vbox|7.0.10|0|1]
[BOINC|7.22.2][INTEL|Intel(R) UHD Graphics 620|1|1590MB||201]
[BOINC|7.16.20][INTEL|Intel(R) HD Graphics 530|1|6507MB||300]
[BOINC|7.20.5][INTEL|Intel(R) UHD Graphics [0x4e55]|1|3276MB||300]

I couldn't find your host in the database. Is it windows or linux?

Manjaro Linux

If it is truly a resource problem, you could try adding the following line to the gpuLookupTable file in your projects directory:
UHD Graphics 605     |     256     |      8

If it still doesn't work then the resource problem is probably related to the size of the code that the openCL generates, which could be another driver problem or the app is just too complex for your card.
19) Message boards : News : Support for Intel GPUs (Message 3555)
Posted 5 Aug 2023 by Profile Eric Driver
Post:
Eric

I am experimenting with an Intel GPU on Computer 2807553. It is a INTEL Intel(R) HD Graphics 4600 (1629MB) OpenCL: 1.2

Running on Operating System Microsoft Windows 10 Professional x64 Edition, (10.00.19045.00)
BOINC version 7.22.2

It is running a whole lot slower than a non-GPU task Get Decic Fields v4.00 (default) windows_x86_64

Is the GPU application processing a bigger chunk of data or is the GPU not efficient enough to run the GPU task types ?

Thanks
Bill F


The GPU and CPU tasks process the same amount of data, so my guess is the GPU is not efficient enough.
20) Message boards : News : Support for Intel GPUs (Message 3553)
Posted 4 Aug 2023 by Profile Eric Driver
Post:
Updated the driver, error occurs later now
<stderr_txt>
<core_client_version>7.22.1</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)</message>
<stderr_txt>
GPU Summary String = [INTEL|Intel(R)UHDGraphics605|1|3276MB||300].
Loading GPU lookup table from file.
GPU was not found in the lookup table. Using default values:
numBlocks = 1024.
threadsPerBlock = 32.
polyBufferSize = 32768.
Successfully Built Program.
Successfully Created Stage 1 Kernel: pdtKernelSubResultantInit.
Successfully Created Stage 1 Kernel: pdtKernelSubResultantDegB8.
Successfully Created Stage 1 Kernel: pdtKernelSubResultantMpInit.
Successfully Created Stage 1 Kernel: pdtKernelSubResultantDegB7DegA9.
Successfully Created Stage 1 Kernel: pdtKernelSubResultantDegB7DegA8.
Successfully Created Stage 1 Kernel: pdtKernelSubResultantDegB6DegA9.
Successfully Created Stage 1 Kernel: pdtKernelSubResultantDegB6DegA8.
Successfully Created Stage 1 Kernel: pdtKernelSubResultantDegB6DegA7.
Successfully Created Stage 1 Kernel: pdtKernelSubResultantDegB5.
Successfully Created Stage 1 Kernel: pdtKernelSubResultantDegB4.

Successfully Created Stage 2 Kernel: pdtKernelDiv2.
Successfully Created Stage 2 Kernel: pdtKernelDiv5.
Successfully Created Stage 2 Kernel: pdtKernelDivP.

Successfully Created Stage 3 Kernel.

Successfully Created Polynomial Memory Buffer.
Successfully Created Output Flag Memory Buffer.
Successfully Created Discriminant Data Buffer.
Successfully Created PolyA Data Buffer.
Successfully Created PolyB Data Buffer.
Successfully Created DegA Data Buffer.
Successfully Created DegB Data Buffer.
Successfully Created G Data Buffer.
Successfully Created H Data Buffer.
Successfully Created mpA Data Buffer.
Successfully Created mpB Data Buffer.

OpenCL initialization was successful.
CHECKPOINT_FILE = wu_sf6_DS-12x9_Grp49971of128000_checkpoint.
Checkpoint Flag = 0.
Reading file ../../projects/numberfields.asu.edu_NumberFields/sf6_DS-12x9_Grp49971of128000.dat
K = x^2 - 10
S = [2, 5]
Disc Bound = 800000000000
Skip = (P^3)*(Q^5)
Num Congruences = 10
SCALE = 1.000000
|dK| = 40
Signature = [2,0]
Opening output file ../../projects/numberfields.asu.edu_NumberFields/wu_sf6_DS-12x9_Grp49971of128000_0_r611689789_0
Now starting the targeted Martinet search:
Num Cvecs = 10.
Doing Cvec 1.
File polDiscTest_gpuOpenCL.cpp, Line 201: Error: Failed to Enqueue Kernel pdtKernelSubResultantMpInit. clEnqueueNDRangeKernel returned CL_OUT_OF_RESOURCES
polDisc Test had an error. Aborting.
</stderr_txt>

Techpowerup has the UHD 605 as
The UHD Graphics 605 Mobile is a mobile integrated graphics solution by Intel, launched on December 11th, 2017. Built on the 14 nm process, and based on the Gemini Lake GT1.5 graphics processor, the device supports DirectX 12. This ensures that all modern games will run on UHD Graphics 605 Mobile. It features 144 shading units, 18 texture mapping units, and 3 ROPs. The GPU is operating at a frequency of 200 MHz, which can be boosted up to 750 MHz.
Its power draw is rated at 5 W maximum.

I have seen the "Out of Resources" error when there is not enough RAM. Your card appears to have enough RAM, but is it possible something else could be using up the memory? Internet browsers are notorious for using huge amounts of GPU RAM.

But in general I think the UHD 605 should be capable of running the NumberFields OpenCL app. I am basing that on the following list of hosts that have successfully returned results:
[BOINC|7.22.2][INTEL|Intel(R) UHD Graphics 600|1|3021MB||300][opencl_gpu|Intel(R) UHD Graphics 600|1|3776MB|102]
[BOINC|7.20.2][INTEL|Intel(R) Iris(R) Xe Graphics|1|6427MB||300][vbox|6.1.34|0|1]
[BOINC|7.20.5][INTEL|Intel(R) Iris(R) Plus Graphics 655 [0x3ea5]|1|25565MB||300]
[BOINC|7.22.2][INTEL|Intel(R) UHD Graphics 630|1|6488MB||201][vbox|6.1.34|1|1]
[BOINC|7.22.2][INTEL|Intel(R) UHD Graphics 630|1|6415MB||300][vbox|7.0.10|0|1]
[BOINC|7.22.2][INTEL|Intel(R) UHD Graphics 620|1|1590MB||201]
[BOINC|7.16.20][INTEL|Intel(R) HD Graphics 530|1|6507MB||300]
[BOINC|7.20.5][INTEL|Intel(R) UHD Graphics [0x4e55]|1|3276MB||300]

I couldn't find your host in the database. Is it windows or linux?


Next 20


Main page · Your account · Message boards


Copyright © 2023 Arizona State University