𝕏

Posts by Crystal Pellet

1) Message boards : Number crunching : Canceled Work Units by Server (Message 2191)
Posted 20 Sep 2023 by Crystal Pellet
Post:
I'm getting work that's been validated for days, but it's NOT getting cancelled by the server.

https://denis.usj.es/denisathome/workunit.php?wuid=10610647
22500584 223279 16 Sep 2023, 18:44:29 UTC 20 Sep 2023, 4:29:29 UTC Completed and validated 7,297.88 7,273.73 60.66 New human ventricular cell model v0.03 windows_x86_64
22500585 214760 16 Sep 2023, 18:42:40 UTC 18 Sep 2023, 9:24:34 UTC Completed and validated 4,618.68 4,615.95 60.66 New human ventricular cell model v0.03 windows_x86_64
22757401 224548 20 Sep 2023, 4:20:39 UTC 20 Sep 2023, 5:58:05 UTC Completed and validated 5,819.38 5,752.14 60.66 New human ventricular cell model v0.03 windows_x86_64


The first task was returned 9 minutes late, but meanwhile a resend was sent to your computer.
When your computer has not connected the server to report finished tasks or ask for new tasks after the first task received and your machine has already started task 3, your task will not be aborted/cancelled.
2) Message boards : Number crunching : Upload errors: server is out of disk space (Message 2183)
Posted 17 Sep 2023 by Crystal Pellet
Post:
17 Sep 16:56:45 [error] Error reported by file upload server: Server is out of disk space
3) Message boards : Number crunching : Checksum error (Message 2075)
Posted 18 Apr 2023 by Crystal Pellet
Post:
Thanks Jesús for your suggestion. I tried something similar.
I resetted the project, what also deletes all project files and that worked too.
4) Message boards : Number crunching : Checksum error (Message 2072)
Posted 16 Apr 2023 by Crystal Pellet
Post:
It's not solved:
There are still the same errors after a fresh start of BOINC:
41			16 Apr 09:02:49	Checking presence of 90 project files	
42	DENIS@home	16 Apr 09:02:49	Resetting file projects/denis.usj.es_denisathome/denis_logo_mini.png: md5 checksum failed for file	
43	DENIS@home	16 Apr 09:02:49	Resetting file projects/denis.usj.es_denisathome/Error_evolution_20230411_en.png: md5 checksum failed for file	
44	DENIS@home	16 Apr 09:02:49	Sending scheduler request: To fetch work.	
45	DENIS@home	16 Apr 09:02:49	Requesting new tasks for CPU	
46	DENIS@home	16 Apr 09:02:51	Started download of denis_logo_mini.png	
47	DENIS@home	16 Apr 09:02:51	Started download of Error_evolution_20230411_en.png	
48	DENIS@home	16 Apr 09:02:52	Finished download of denis_logo_mini.png	
49	DENIS@home	16 Apr 09:02:52	Finished download of Error_evolution_20230411_en.png	
50	DENIS@home	16 Apr 09:02:52	[error] MD5 check failed for denis_logo_mini.png	
51	DENIS@home	16 Apr 09:02:52	[error] expected 9e49b9b414ce41ac0fad7dc877fd994f18267080a78ed407169599edbefd265, got f5fa73e2c8a34a6b94d628beb60ca8c9	
52	DENIS@home	16 Apr 09:02:52	[error] Checksum or signature error for denis_logo_mini.png	
53	DENIS@home	16 Apr 09:02:52	[error] MD5 check failed for Error_evolution_20230411_en.png	
54	DENIS@home	16 Apr 09:02:52	[error] expected d54c12fbe289b35d0fdc5e400b9faf422baf42f4a6c7d82036c30de9d307ebd, got 359af274ab0ee7871407ab1e00f659f9	
55	DENIS@home	16 Apr 09:02:52	[error] Checksum or signature error for Error_evolution_20230411_en.png	
56	DENIS@home	16 Apr 09:02:52	Scheduler request completed: got 0 new tasks
5) Message boards : Number crunching : Checksum error (Message 2058)
Posted 13 Apr 2023 by Crystal Pellet
Post:
After BOINC startup on 5 machines the same error:
42 DENIS@home 13 Apr 08:43:25 Resetting file projects/denis.usj.es_denisathome/denis_logo_mini.png: md5 checksum failed for file
43 DENIS@home 13 Apr 08:43:25 Resetting file projects/denis.usj.es_denisathome/Error_evolution_20230411_en.png: md5 checksum failed for file
44 DENIS@home 13 Apr 08:43:27 Started download of denis_logo_mini.png
45 DENIS@home 13 Apr 08:43:27 Started download of Error_evolution_20230411_en.png
46 DENIS@home 13 Apr 08:43:28 Finished download of denis_logo_mini.png
47 DENIS@home 13 Apr 08:43:28 Finished download of Error_evolution_20230411_en.png
48 DENIS@home 13 Apr 08:43:28 [error] MD5 check failed for denis_logo_mini.png
49 DENIS@home 13 Apr 08:43:28 [error] expected 9e49b9b414ce41ac0fad7dc877fd994f18267080a78ed407169599edbefd265, got f5fa73e2c8a34a6b94d628beb60ca8c9
50 DENIS@home 13 Apr 08:43:28 [error] Checksum or signature error for denis_logo_mini.png
51 DENIS@home 13 Apr 08:43:28 [error] MD5 check failed for Error_evolution_20230411_en.png
52 DENIS@home 13 Apr 08:43:28 [error] expected d54c12fbe289b35d0fdc5e400b9faf422baf42f4a6c7d82036c30de9d307ebd, got 359af274ab0ee7871407ab1e00f659f9
53 DENIS@home 13 Apr 08:43:28 [error] Checksum or signature error for Error_evolution_20230411_en.png
6) Message boards : Number crunching : Myocyte beta v0.16 (Message 1822)
Posted 3 Aug 2022 by Crystal Pellet
Post:
We will continue trying to find why the checkpoints are not working in some windows machines.
Thanks Jesús for your information.

I just ran 2 of your tasks with the previous version 0.14 to understand what's the difference.
I suspended those two tasks several times and after a resume they restarted from the last checkpoint, at least did not went back to zero.
However in the result log I see several times Problem reading the states from the checkpoint file... Restarting the simulation, but
not all previous iterations are repeated. So maybe those results are not valid scientifically for you.

https://denis.usj.es/denisathome/result.php?resultid=2136910
https://denis.usj.es/denisathome/result.php?resultid=2137064
7) Message boards : Number crunching : Myocyte beta v0.16 (Message 1819)
Posted 3 Aug 2022 by Crystal Pellet
Post:
I've uploaded the beta_0.17 that should solve this issue (I hope). Issues with checkpoints are the most difficult to solve, thank you for all the information. It helps us a lot.

Best,
Jesús.

I just tested v0.17 on another machine. I suspended all 4 tasks after the 2nd checkpoint with "Leave tasks in memory while suspended" not selected.
I resumed the 4 one by one. All four started from scratch. What was wrong with v0.14?
8) Message boards : Number crunching : Myocyte beta v0.16 (Message 1816)
Posted 3 Aug 2022 by Crystal Pellet
Post:
I suspended yesterday evening 3 beta's to shutdown the PC overnight.
I watched the checkpoints were set. Progress was at about 88% after 1hr40m runtime.
This morning I resumed the three tasks at the same time.
One task resumed from the last checkpoint, the other 2 started from zero.

The two tasks that did not resumed from the last checkpoint:
https://denis.usj.es/denisathome/result.php?resultid=2110666
https://denis.usj.es/denisathome/result.php?resultid=2110975
9) Message boards : Number crunching : Long Running Task (Message 574)
Posted 19 Oct 2015 by Crystal Pellet
Post:
Hi Mr. M.

On that system with the stock application they should last about 10 hours.
It's up to you what to do with the running tasks.

For speeding up process have a look at the optimized applications: http://denis.usj.es/denisathome/forum_thread.php?id=53
10) Message boards : Number crunching : Error: Finish File Present Too Long (Message 414)
Posted 25 Aug 2015 by Crystal Pellet
Post:
Moreover, where in the release notes does it say anything about slot directory bugs getting fixed?

Sorry Dayle, that you've wasted so much cpu-cycles and in fact the jobs were ready. Sometimes it's good being ahead of the troops.

Do you really think, that every change in the source code will find its way into release notes.
Sorry, I've to disappoint you in that, but the code change was made on the 17th of June by David Anderson himself and called:

"client: fix bug that caused delay in job cleanup"

with the comment:

"If a job has an output file with <copy_file> and <optional>,
and it doesn't create the file,
then the call to boinc_rename() (to move it to the project dir) fails,
and we back off and retry."
11) Message boards : Number crunching : Error: Finish File Present Too Long (Message 411)
Posted 24 Aug 2015 by Crystal Pellet
Post:
As for the newest BOINC Beta, I would not feel comfortable switching to that if it might disrupt my other tasks.

v7.6.6 is not Beta, it's the recommended BOINC version.

Your problem is recognized as a client issue. It happens to some users more than to others and also some projects suffering more than others.

It has to do with cleaning up the slot directory after a task is ready.
Older versions of BOINC tries to rename an optional non existing file and retried that too often.

BOINC tells you "finish file present - too long", because the temporary finish file placed by BOINC during cleanup stays longer than 10 seconds.
12) Message boards : Number crunching : Error: Finish File Present Too Long (Message 407)
Posted 23 Aug 2015 by Crystal Pellet
Post:
Still getting them. A lot of these happen when I'm nowhere near the computer to close or open the application.

Upgrade to BOINC 7.6.6