𝕏

WUs reset?

Message boards : Number crunching : WUs reset?
Message board moderation

To post messages, you must log in.

AuthorMessage
Nombus

Send message
Joined: 12 Jul 16
Posts: 2
Credit: 9,396
RAC: 0
Message 968 - Posted: 21 Jul 2016, 17:06:00 UTC

Hey there, all you benevolent heart geniuses,

I got my first 2 projects a while ago, and this is the second time they've reset back to 0% progress after getting to 89-98% completion. I would suspend them and reboot the PC, or shut down BOINC and reboot, and they would completely restart. It's just a bit annoying, since I've taken two cores and put what seems like a good 25+ hours each into the two of them.

This doesn't seem to happen for other systems like Poem or rosetta; is this to be expected for DENIS? Do I need to make sure they are all completed in one sitting? Maybe this problem won't happen if I don't take the precaution of suspending them before I reboot the PC?

Stay frosty, my CPU friends.
ID: 968 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
nanoprobe
Avatar

Send message
Joined: 2 Dec 15
Posts: 3
Credit: 85,066
RAC: 0
Message 970 - Posted: 22 Jul 2016, 0:06:26 UTC - in response to Message 968.  
Last modified: 22 Jul 2016, 0:07:29 UTC

I don't know if these tasks checkpoint but if they don't, and that sounds like what is happening, they will start from 0 if you reboot and it won't matter if you suspend them first.
ID: 970 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jacob Klein

Send message
Joined: 8 Apr 15
Posts: 20
Credit: 64,498
RAC: 0
Message 971 - Posted: 22 Jul 2016, 0:08:48 UTC
Last modified: 22 Jul 2016, 0:09:10 UTC

We're tracking a couple problems in this thread:
http://denis.usj.es/denisathome/forum_thread.php?id=105

Namely:
- Some tasks aren't checkpointing properly
- Some tasks aren't finishing gracefully, and are instead starting completely over infinitely.

:/

You might want to subscribe to that other thread, and hope the developers can fix it up. I've set "No New Tasks" on all my PCs, until it can work correctly.
ID: 971 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Nombus

Send message
Joined: 12 Jul 16
Posts: 2
Credit: 9,396
RAC: 0
Message 972 - Posted: 22 Jul 2016, 1:55:28 UTC

Thanks, you two!

I'm glad to know that I'm not alone.
ID: 972 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile sir spuddly buddly
Avatar

Send message
Joined: 30 Aug 15
Posts: 47
Credit: 1,248,591
RAC: 0
Message 986 - Posted: 24 Jul 2016, 13:39:12 UTC

Running the Beta work and they re-start from 0 after re-booting (Windows 7 64 bit OS).
LZ Loon
The Most Handsome Man on the Interweb (TM)
ID: 986 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Dr. Merkwürdigliebe

Send message
Joined: 23 Dec 15
Posts: 6
Credit: 105,476
RAC: 0
Message 987 - Posted: 24 Jul 2016, 14:44:16 UTC - in response to Message 986.  

Running the Beta work and they re-start from 0 after re-booting (Windows 7 64 bit OS).

Running the beta application and it picks up from where it was interrupted by a shutdown.

Running Linux 64-Bit.
ID: 987 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile vinn@[CNT]

Send message
Joined: 9 Apr 15
Posts: 1
Credit: 1,120,811
RAC: 0
Message 989 - Posted: 24 Jul 2016, 19:34:46 UTC

Hey guys, some feedback on running the Beta WU (v. 1.03).
I suspended the project (about 40% done already) and switched it on again, the process started from scratch (0%) = no checkpoints?
So in total to reach 100% took about 26hours and ended with this message:
24.07.2016 21:20:29 | DENIS@Home | Task BETA_23071039_8000_0401_0 exited with zero status but no 'finished' file

.... then started again from 0%
ID: 989 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile sir spuddly buddly
Avatar

Send message
Joined: 30 Aug 15
Posts: 47
Credit: 1,248,591
RAC: 0
Message 990 - Posted: 24 Jul 2016, 20:11:15 UTC

From reading these posts it seems the Windows client has the problem with checkpointing.
LZ Loon
The Most Handsome Man on the Interweb (TM)
ID: 990 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile sir spuddly buddly
Avatar

Send message
Joined: 30 Aug 15
Posts: 47
Credit: 1,248,591
RAC: 0
Message 992 - Posted: 25 Jul 2016, 7:54:45 UTC

The Linux client seems to be ok with checkpoints - rebooted without problems.
LZ Loon
The Most Handsome Man on the Interweb (TM)
ID: 992 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : WUs reset?