all WU for version 1.1 end with error

Message boards : Number crunching : all WU for version 1.1 end with error

newman

Joined: 10 Apr 15
Posts: 2
Credit: 167,274
RAC: 0
Message 1190 - Posted: 5 Apr 2017, 21:47:21 UTC

All my WUs for the new 1.1 version end with an error after exactly 1,289 sec :( Version 1.0 was running perfectly.

Kind regards,
Marcus
ID: 1190 · Rating: 0
Bill Michael

Joined: 6 Oct 15
Posts: 1
Credit: 239,558
RAC: 0
Message 1191 - Posted: 5 Apr 2017, 22:51:15 UTC - in response to Message 1190.  

Ditto - also on Windows 10.
ID: 1191 · Rating: 0
Dr Who Fan

Joined: 8 Apr 15
Posts: 28
Credit: 116,469
RAC: 3
Message 1192 - Posted: 6 Apr 2017, 4:16:21 UTC - in response to Message 1190.  

Same here. Crashing on Win 7 x64.

DENIS Project that replicate the calculus made in Literature and fill the Markers Database (V1.01)

So far I have 19 tasks that end with the SAME message:
Outcome: Computation error
Client state: Compute error
Exit status: 197 (0xc5) EXIT_TIME_LIMIT_EXCEEDED


CONFIG END
Checkpoint file not found
Backup Checkpoint file not found


ID: 1192 · Rating: 0
Dr Who Fan

Joined: 8 Apr 15
Posts: 28
Credit: 116,469
RAC: 3
Message 1193 - Posted: 6 Apr 2017, 6:23:17 UTC

Since the tasks continued to crash/error out, I ABORTED the one remaining on my PCs and DISABLED requesting more work. No use wasting resources on bad tasks that earn no credit.

Will follow this thread to see what transpires before allowing more work on my PCs.

ID: 1193 · Rating: 0
adrianxw

Joined: 27 Dec 15
Posts: 11
Credit: 458,800
RAC: 0
Message 1194 - Posted: 6 Apr 2017, 8:28:20 UTC
Last modified: 6 Apr 2017, 8:28:40 UTC

All workunits since 19:15 UTC yesterday evening have failed after ~17:35. The failure seems to come after the 17th and before the 18th CP post. I have set No New Tasks.
ID: 1194 · Rating: 0
BlackHeart64

Joined: 12 Jul 17
Posts: 3
Credit: 0
RAC: 0
Message 1195 - Posted: 6 Apr 2017, 13:16:25 UTC

Same boat here.. All machines removed until this is fixed.



** It seems to be happening to my OSX machines mostly.
ID: 1195 · Rating: 0
jcastro

Joined: 16 Mar 15
Posts: 219
Credit: 14,859
RAC: 0
Message 1197 - Posted: 6 Apr 2017, 16:54:29 UTC

Hi! We are going to stop the delivery of the faulty WUs until we know exactly what goes wrong.

Best regards, Joel.
ID: 1197 · Rating: 0
jcastro

Joined: 16 Mar 15
Posts: 219
Credit: 14,859
RAC: 0
Message 1199 - Posted: 6 Apr 2017, 17:05:44 UTC
Last modified: 6 Apr 2017, 17:12:24 UTC

I think the problem is not related to the software version. It could be related to some parameters that are calculated from the time needed for other WUs of the same project. Those WUs were smaller simulations, and they could have changed the time-exceeded factor that multiplies the FLOPs necessary for each task. Anyway, we have cancelled the new tasks and we will relaunch them after a few tests.

Best regards, Joel.
ID: 1199 · Rating: 0
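
A minimal sketch of how such a fixed cut-off could arise, assuming the client aborts a task once its elapsed time exceeds the workunit's FLOPs bound (rsc_fpops_bound) divided by the estimated speed of the application on that host. Every number below is hypothetical, chosen only to reproduce the 1,289 sec reported at the top of the thread, and the function names are illustrative, not BOINC APIs.

```python
EXIT_TIME_LIMIT_EXCEEDED = 197  # 0xc5, the exit status reported in this thread


def max_elapsed_seconds(rsc_fpops_bound: float, estimated_flops: float) -> float:
    """Time limit in seconds: FLOPs bound divided by estimated speed (assumed rule)."""
    if estimated_flops <= 0:
        raise ValueError("estimated_flops must be positive")
    return rsc_fpops_bound / estimated_flops


def exceeds_limit(elapsed_seconds: float, limit_seconds: float) -> bool:
    """True if a task running this long would be aborted under the assumed rule."""
    return elapsed_seconds > limit_seconds


# Hypothetical values, not taken from the project's real workunits.
FPOPS_BOUND = 12_890_000_000_000   # FLOPs bound of one large simulation
realistic_speed = 1_000_000_000    # FLOPS estimate matching the real speed
inflated_speed = 10_000_000_000    # estimate skewed by short WUs finishing fast

limit_ok = max_elapsed_seconds(FPOPS_BOUND, realistic_speed)
limit_bad = max_elapsed_seconds(FPOPS_BOUND, inflated_speed)

print(f"limit with realistic estimate: {limit_ok:.0f} s")
print(f"limit with inflated estimate:  {limit_bad:.0f} s")
```

Under these assumptions, a speed estimate that is ten times too high shrinks the allowed run time by the same factor, so a long simulation that is otherwise healthy would be stopped at the same elapsed time on every attempt.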
adrianxw

Joined: 27 Dec 15
Posts: 11
Credit: 458,800
RAC: 0
Message 1204 - Posted: 7 Apr 2017, 8:17:31 UTC
Last modified: 7 Apr 2017, 8:18:27 UTC

I've re-enabled the project and the first work unit is running. It has gone past the point where they were all failing before, so looking good right now.
ID: 1204 · Rating: 0
Henk Haneveld

Joined: 31 Jul 15
Posts: 6
Credit: 85,016
RAC: 0
Message 1205 - Posted: 7 Apr 2017, 11:09:04 UTC - in response to Message 1204.  

I've re-enabled the project and the first work unit is running. It has gone past the point where they were all failing before, so looking good right now.


What you have running now is for the other subproject.
There is no work for the subproject with the problem.
ID: 1205 · Rating: 0
adrianxw

Joined: 27 Dec 15
Posts: 11
Credit: 458,800
RAC: 0
Message 1206 - Posted: 7 Apr 2017, 12:13:04 UTC

I just came back to say that all 5 of the units I got finished as expected, but I suppose nobody cares now.
ID: 1206 · Rating: 0
jcastro

Joined: 16 Mar 15
Posts: 219
Credit: 14,859
RAC: 0
Message 1207 - Posted: 16 Apr 2017, 19:38:02 UTC

WUs from the faulty subproject are working correctly now.

Best regards, Joel.
ID: 1207 · Rating: 0
Yuryi

Joined: 18 Jul 17
Posts: 1
Credit: 0
RAC: 0
Message 1208 - Posted: 19 Apr 2017, 19:49:49 UTC

I just started up the project this morning and haven't had any problems with WUs.

Jerry
ID: 1208 · Rating: 0


©2022 Universidad San Jorge