Myocyte v0.15 Beta
Author | Message |
---|---|
Send message Joined: 3 Nov 15 Posts: 23 Credit: 2,254,547 RAC: 0 |
v0.15 Beta WUs have been running on my Windows system 214476 (clx10980xe-rtx3090) for 4 hours each and indicate they are going to take another 3 days. There seems to be a problem causing the long run times. |
Send message Joined: 17 Jul 22 Posts: 9 Credit: 2,213,858 RAC: 0 |
v0.15 Beta WUs: had to abort all of those. Too aggressive on CPU; they would take forever on a Ryzen 9 3950. |
Send message Joined: 2 Jan 18 Posts: 4 Credit: 6,532,293 RAC: 0 |
If you look closer and deeper into Task Manager you will find the Windows Defender Malware app going bonkers and eating up CPU and affecting these DENIS v0.15 tasks. Linux crunching remains unaffected. |
Send message Joined: 17 Jul 22 Posts: 9 Credit: 2,213,858 RAC: 0 |
Nothing to do with Windows Defender, because I'm not running it on my Win 11 machine. Anyway, the WUs I have are ending with computation errors. They would overrun the deadline! |
Send message Joined: 17 Jul 22 Posts: 9 Credit: 2,213,858 RAC: 0 |
These WUs are not behaving correctly. Had to force-terminate them after BOINC had exited; they were still running in the background! AVs will react if they look like malware or virus activity. |
Send message Joined: 9 Apr 15 Posts: 172 Credit: 1,552,856 RAC: 0 |
"AVs will react if they look like malware or virus activity." +1. I'll try stopping Windows Defender to stop the scan... |
Send message Joined: 8 Apr 15 Posts: 34 Credit: 389,238 RAC: 0 |
"These WUs are not behaving correctly. Had to force-terminate them after BOINC had exited; they were still running in the background!" Had to do the SAME thing on all my Windows PCs (Vista, Win 7, Win 8.1, all 64-bit). The tasks were still using memory and disk space, but not CPU, after I aborted them in BOINC. The Vista PC did not have any MS AV software to "interfere". The Win 7 and Win 8.1 PCs do have MS AV, but I have the BOINC data folders specifically DISABLED from AV scanning, and according to Process Explorer MS AV did not have the files "open" for scanning. I have set all PCs to NO NEW WORK ON THE DENIS PROJECT UNTIL THINGS GET FIXED (probably some time next week, since it's early Saturday AM in Spain). |
Send message Joined: 7 Jul 22 Posts: 7 Credit: 300,007 RAC: 0 |
Same problem here. Turn Defender off: https://support.microsoft.com/en-us/windows/turn-off-defender-antivirus-protection-in-windows-security-99e6004f-c54c-8509-773c-a4d776b77960 Further units are stalling after about 1.4%. Is this a particular batch of units that is affected? NNT - back to PG for me. It was going so well. |
Send message Joined: 28 Apr 15 Posts: 29 Credit: 1,426,883 RAC: 0 |
They are running OK for me under Ubuntu 20.04.4. They complete in the usual 51 minutes. https://denis.usj.es/denisathome/results.php?hostid=215226&offset=0&show_names=0&state=4&appid= |
Send message Joined: 12 Apr 15 Posts: 4 Credit: 321,711 RAC: 0 |
Good day and hello. Sorry, I now have many problems with the new WUs too. My AVIRA antivirus scanner makes big trouble about the checkpoints and reports them as a virus. Most WUs then break off after 6 hours. I am stopping project DENIS now, because I am crunching with 4 machines for nothing and only get errors and virus alarms. I am going over to project RAKE SEARCH. Greetz, SEARCHER. Member of CHARITY TEAM. Member of Team FREE TIBET/ TIBET LIBRE |
Send message Joined: 9 Apr 15 Posts: 172 Credit: 1,552,856 RAC: 0 |
I disabled the MS antivirus, but it's still very slow. After 2 hours the WUs are at 38% (other versions finished after 70 minutes). |
Send message Joined: 17 Jul 22 Posts: 9 Credit: 2,213,858 RAC: 0 |
Those (still BETA) 0.15 WUs show up with an estimated time of 24 minutes but then grow to hours on this computer. Anyway, they are terminated... next batch, please! I use Kaspersky AV (not Defender) and it did react, but that was less than 4% of the total activity, so no big deal. As for the Unix-like OSes, that would be different code and may not react the same way. I have no options selected to leave BOINC running in the background (as a service) or idling in memory, so the tasks should terminate with BOINC. |
Send message Joined: 13 Apr 15 Posts: 13 Credit: 1,590,484 RAC: 0 |
Aborted mine (Beta of DENIS-myocyte v0.15 windows_x86_64) after they had come to a standstill (estimated running times up to 97 days!) |
Send message Joined: 3 Nov 15 Posts: 23 Credit: 2,254,547 RAC: 0 |
"They are running OK for me under Ubuntu 20.04.4." Fedora 36 is working OK too. The two Windows machines are both failing. I have the BOINC data directory exempted from Norton antivirus, so I am pretty sure there is no antivirus involvement. I ran the free version of Intel VTune on my system while running multiple DENIS WUs and did not see anything obvious. I will try that again tonight and take another look. There has to be something that is different between Windows and Linux. The thing that comes to mind for me is the difference in the file systems. Linux will allow multiple opens on a file where Windows will not. |
Send message Joined: 17 Jul 22 Posts: 9 Credit: 2,213,858 RAC: 0 |
My results show large stderr files... saying "problem saving checkpoints"! A probable cause of the problems. |
Send message Joined: 18 Mar 15 Posts: 283 Credit: 2,748,608 RAC: 0 |
Hi! We are experiencing problems with the checkpoint on Windows hosts. We tried to add a temporary file for the checkpoint to avoid corrupted checkpoints, but it fails when it tries to rename it. As the checkpoint fails, it tries again in every iteration, and that is why the application runs so slowly. I will upload a new version solving it as fast as possible. Many thanks for the comments. Checking your tasks makes it easier to find the problem. Best, Jesús. Jesús Carro Universidad San Jorge @InSilicoHeart |
Send message Joined: 3 Nov 15 Posts: 23 Credit: 2,254,547 RAC: 0 |
Hi! Myocyte v0.16 Beta runs as expected, and in the expected time, on both Windows 11 and Fedora Linux on my machines. Checkpointing seems to be happening every 2 minutes, which seems a little too frequent. |
Send message Joined: 9 Apr 15 Posts: 172 Credit: 1,552,856 RAC: 0 |
"Checkpointing seems to be happening every 2 minutes which seems to be a little too frequent." And with strange behaviour. I restarted my PC with 4 WUs at 88%. After the restart, 1 WU was at 88%; the other 3 restarted from 0%. |
Send message Joined: 7 Jul 22 Posts: 7 Credit: 300,007 RAC: 0 |
"After the restart, 1 WU was at 88%, the other 3 restarted from 0%." Ouch. I'll continue to wait. |
Send message Joined: 18 Mar 15 Posts: 283 Credit: 2,748,608 RAC: 0 |
"Checkpointing seems to be happening every 2 minutes which seems to be a little too frequent." Hi! This is due to the checkpoint issue, but at least now it's detected and reset so you get credit for all the compute time. In some cases, on Windows, the checkpoint is not completely saved, and that leaves a corrupted checkpoint. In previous versions, the process continued with the erroneous data and your result was discarded during validation. Now the program detects it and restarts the simulation, so that the results you send are valid and you are given credit. We know that this behaviour is not ideal, because in some cases it will act as if there were no checkpoint, but it is an improvement over previous versions. The tasks are not very long, so it is not a big problem, but we want to improve it. In the next version we will try a double checkpoint file system (while we create a new checkpoint, the previous one is kept, to be sure the program has at least one valid checkpoint). We tried it in a very simple way in version 0.15 and it didn't work, but we will improve it to make it more robust. The frequency of checkpointing is decided by the BOINC client. We do not control it. We only control at which points in the program it can be done. Best, Jesús. Jesús Carro Universidad San Jorge @InSilicoHeart |
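
For readers wondering what the "temporary file plus rename" and "double checkpoint" ideas described in this thread could look like, here is a minimal Python sketch. It is not the DENIS code: the file names `checkpoint.dat` and `checkpoint.prev`, the functions `save_checkpoint` and `load_checkpoint`, and the JSON-plus-SHA-256 file layout are all assumptions made up for illustration. The thread does not say why the rename failed in v0.15. One well-known Windows pitfall, which may or may not apply here, is that a plain rename fails when the destination file already exists; in Python, `os.rename` raises an error in that case on Windows, whereas `os.replace` overwrites the destination on both Windows and POSIX systems.

```python
import hashlib
import json
import os
import tempfile
from pathlib import Path
from typing import Optional

# Hypothetical file names, chosen only for this sketch.
CURRENT = "checkpoint.dat"
PREVIOUS = "checkpoint.prev"


def _encode(state: dict) -> bytes:
    """Serialize the state and prefix it with a SHA-256 digest of the payload."""
    payload = json.dumps(state, sort_keys=True).encode("utf-8")
    digest = hashlib.sha256(payload).hexdigest().encode("ascii")
    return digest + b"\n" + payload


def _decode(raw: bytes) -> Optional[dict]:
    """Return the state, or None if the file is truncated or corrupted."""
    digest, sep, payload = raw.partition(b"\n")
    if not sep or hashlib.sha256(payload).hexdigest().encode("ascii") != digest:
        return None
    return json.loads(payload.decode("utf-8"))


def save_checkpoint(directory: Path, state: dict) -> None:
    """Write a new checkpoint while keeping the previous one as a fallback."""
    current = directory / CURRENT
    previous = directory / PREVIOUS
    # Write the complete checkpoint to a temporary file in the same directory,
    # so the final rename never has to cross a file-system boundary.
    fd, tmp_name = tempfile.mkstemp(dir=directory, prefix="checkpoint.", suffix=".tmp")
    try:
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(_encode(state))
            tmp.flush()
            os.fsync(tmp.fileno())  # ask the OS to push the data to disk
        # Rotate: the old checkpoint becomes the fallback copy.
        if current.exists():
            os.replace(current, previous)
        # os.replace overwrites an existing destination on Windows and POSIX.
        os.replace(tmp_name, current)
    except BaseException:
        # Do not leave a half-written temporary file behind.
        if os.path.exists(tmp_name):
            os.unlink(tmp_name)
        raise


def load_checkpoint(directory: Path) -> Optional[dict]:
    """Return the newest valid checkpoint, or None to start from scratch."""
    for name in (CURRENT, PREVIOUS):
        try:
            raw = (directory / name).read_bytes()
        except FileNotFoundError:
            continue
        state = _decode(raw)
        if state is not None:
            return state
    return None
```

With this layout, a damaged `checkpoint.dat` costs only the work done since the previous checkpoint, because `load_checkpoint` falls back to `checkpoint.prev`. Only when both files are missing or fail the checksum does the task restart from 0%, which corresponds to the v0.16 behaviour described above.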