Indefinite recycling of a Gerasim workunit.

Message boards : Number crunching : Indefinite recycling of a Gerasim workunit.
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile SerVal

Send message
Joined: 1 Jan 20
Posts: 10
Credit: 2,408,308
RAC: 2,927
Message 3751 - Posted: 4 Sep 2024, 18:26:13 UTC
Last modified: 4 Sep 2024, 19:09:15 UTC

pututu wrote:
Indefinite recycling of a Gerasim workunit.
https://gerasim.boinc.ru/users/viewWorkunit.aspx?WorkunitId=85332177
The WU above has been endlessly recycled for over hundreds of times
to hundreds of different crunchers.

Seems like there is no maximum quorum limit set in the "error/total/success tasks" on the server side handing out these tasks. Even though the task fails immediately, best to keep things clean, maybe deleting this WU and perhaps figuring out why it fails.
----
Спасибо.
Смотрю что происходит, но сначала хотелось бы узнать мнение Эрика. Не видит ли он что-нибудь необычного?

* теоретически, я могу удалить эту WU и Results,
назначить этот WU только моим компьютерам и.т.д

* investigation in progress.
p.s. I can't assign WU just for myself because I don't have a Linux machine.
ID: 3751 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile SerVal

Send message
Joined: 1 Jan 20
Posts: 10
Credit: 2,408,308
RAC: 2,927
Message 3752 - Posted: 4 Sep 2024, 19:33:43 UTC - in response to Message 3751.  

* Send tasks for application "Get Decic Fields (Linux)" suspended.
ID: 3752 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1341
Credit: 494,413,486
RAC: 558,227
Message 3753 - Posted: 4 Sep 2024, 19:56:05 UTC - in response to Message 3751.  

It looks like the WU input file got corrupted somehow. I will attempt to delete that WU from the Gerasim system.
ID: 3753 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile SerVal

Send message
Joined: 1 Jan 20
Posts: 10
Credit: 2,408,308
RAC: 2,927
Message 3754 - Posted: 4 Sep 2024, 20:06:08 UTC - in response to Message 3752.  

@Eric Driver
https://gerasim.boinc.ru/users/viewResult.aspx?resultid=118917129
Please take a look at "data_in" and "stderr_txt"
ID: 3754 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile SerVal

Send message
Joined: 1 Jan 20
Posts: 10
Credit: 2,408,308
RAC: 2,927
Message 3755 - Posted: 4 Sep 2024, 20:36:11 UTC - in response to Message 3754.  

@Eric Driver
Fine! Don't forget to enable sending tasks. :)
ID: 3755 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile SerVal

Send message
Joined: 1 Jan 20
Posts: 10
Credit: 2,408,308
RAC: 2,927
Message 3756 - Posted: 14 Sep 2024, 11:21:26 UTC

@Eric Driver
1.
It seems, the Gerasim does not see the difference between returned open_cl and broken WUs errors.
( Exit status: ERR_INVALID_APP_FUNCTION = 1).
Please suspend adding tasks.

2. Additionally, if it is more convenient for you, I can change the purpose of the WUs batch field( from int to string ).
ID: 3756 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Eric Driver
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 8 Jul 11
Posts: 1341
Credit: 494,413,486
RAC: 558,227
Message 3757 - Posted: 14 Sep 2024, 14:51:47 UTC - in response to Message 3756.  

@SerVal
1. Ok, I will not add new tasks for now. But I don't understand the problem. Is that an internal Gerasim problem?
2. I don't use the WU batch field. It is always set to "0".
ID: 3757 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile SerVal

Send message
Joined: 1 Jan 20
Posts: 10
Credit: 2,408,308
RAC: 2,927
Message 3758 - Posted: 14 Sep 2024, 20:49:19 UTC - in response to Message 3757.  

Is that an internal Gerasim problem?
Yes.
Gerasim continues to send tasks when error_mask >= max_error_results.
Or does not increment wu.error_mask on Result error.

update:
For the CPU application, you can already add tasks. For the GPU, most likely in 2-3 days.
The error has not been fixed yet. I'm working on it.
Please accept my apologies for the inconvenience.
ID: 3758 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : Indefinite recycling of a Gerasim workunit.


Main page · Your account · Message boards


Copyright © 2024 Arizona State University