𝕏

Posts by Col323

1) Message boards : Number crunching : All new! (Message 1069)
Posted 19 Aug 2016 by Col323
Post:
Beta seems to be too much long for me (over 35h).
I'm waiting for optimized app

Agreed. They are definitely long! I don't blame you for dropping out. Thanks for the work you did contribute. The work you completed helped lighten the load for the rest of us.

At last check this morning, there were just 16 Beta tasks left to send.
2) Message boards : Number crunching : DENIS_BETA v1.04 checkpoints are working (Message 1043)
Posted 2 Aug 2016 by Col323
Post:
Just wondering what the Beta strategy might be. It seems like there are a handful of people Beta testing. The server status page shows 30 active users in the past 24 hours. That number has remained the same the past few days. There are fewer than 1,000 tasks in progress, and over 9,500 tasks still to chew through in this Beta test. Are we going to work through it all? Any way to try to encourage folks who dropped out of the test to come back?

My cycles are yours for the using, regardless. I'm just curious by nature. :-)
3) Message boards : Number crunching : DENIS_BETA v1.04 checkpoints are working (Message 1029)
Posted 29 Jul 2016 by Col323
Post:
Checkpointing also works fine on Windows 7 x64 Professional.

ditto here after a reboot :)

Same here after a reboot! :-) Also optimistic that the looping on this machine is fixed. Long way to go in the WU before I determine that, however.
4) Message boards : Number crunching : project dont save (Message 1016)
Posted 27 Jul 2016 by Col323
Post:
but the minute we cant save that is Total RED flag.

Your passion is appreciated. However, as the project admins have said (repeatedly), the problems reported on Windows machines were not seen on their Windows machine. And as we have reported the issues, they've pulled units from production, launched additional Beta tests, even looked into granting credit for runtime instead of just a valid result.

I know it sounds fishy that they ran it on their Windows machine and all was well. But I have 2 ThinkPads, each with Win 7 SP1 (64-bit). One is a i5-5300U and the other a i5-2520M. The 5300 loops units repeatedly. The 2520 chews them up and spits them out, no looping required. It's completed 11 units from the 1.03 Beta test. The 5300 has looped 4 it received, and I'm waiting to see what happens on these last 2 from 1.03. (I'm guessing they will loop as well.) The point is, if two fairly similar machines can exhibit very different behavior, then it would be feasible for the admins to release work thinking all would go smoothly. They aren't just blindly sending work trying to clog up our machines.
5) Message boards : Number crunching : Very long wus (Message 994)
Posted 25 Jul 2016 by Col323
Post:
BETA_23071039_8000_0173_0 looped on me (on the aforementioned Win 7 laptop), so I aborted it. This machine is also running two other Beta units. One has also looped, but I'm going to see if the second loop finishes correctly. The other unit has not looped, and I'm hoping that somehow it finishes gracefully.
6) Message boards : Number crunching : Very long wus (Message 976)
Posted 22 Jul 2016 by Col323
Post:
I also just aborted WU GD_jcarro_20160714201506000000_ThirdSimulations_SteadyState3000Schmidt98_conf_1473.xml_5 which restarted from 0% after 20+ hours. This is the same machine where I aborted the 8000 WU above. Here are the system specs and other info from Boinc startup:

7/22/2016 7:33:04 AM | | Starting BOINC client version 7.2.47 for windows_x86_64
7/22/2016 7:33:04 AM | | log flags: file_xfer, sched_ops, task
7/22/2016 7:33:04 AM | | Libraries: libcurl/7.25.0 OpenSSL/1.0.1 zlib/1.2.6
7/22/2016 7:33:04 AM | | Data directory: C:\ProgramData\BOINC
7/22/2016 7:33:04 AM | | OpenCL: Intel GPU 0: Intel(R) HD Graphics 5500 (driver version 10.18.14.4029, device version OpenCL 2.0, 1298MB, 1298MB available, 58 GFLOPS peak)
7/22/2016 7:33:04 AM | | OpenCL CPU: Intel(R) Core(TM) i5-5300U CPU @ 2.30GHz (OpenCL driver vendor: Intel(R) Corporation, driver version 4.2.0.130, device version OpenCL 2.0 (Build 130))
7/22/2016 7:33:04 AM | | Processor: 4 GenuineIntel Intel(R) Core(TM) i5-5300U CPU @ 2.30GHz [Family 6 Model 61 Stepping 4]
7/22/2016 7:33:04 AM | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 fma cx16 sse4_1 sse4_2 movebe popcnt aes nx lm vmx smx tm2 pbe
7/22/2016 7:33:04 AM | | OS: Microsoft Windows 7: Professional x64 Edition, Service Pack 1, (06.01.7601.00)
7/22/2016 7:33:04 AM | | Memory: 7.69 GB physical, 15.37 GB virtual
7/22/2016 7:33:04 AM | | Disk: 238.47 GB total, 126.87 GB free
7/22/2016 7:33:04 AM | DENIS@Home | URL http://denis.usj.es/denisathome/; Computer ID 62844; resource share 0
7/22/2016 7:33:04 AM | WUProp@Home | URL http://wuprop.boinc-af.org/; Computer ID 89286; resource share 100
7/22/2016 7:33:04 AM | malariacontrol.net | URL http://www.malariacontrol.net/; Computer ID 1667973; resource share 0
7/22/2016 7:33:04 AM | World Community Grid | URL http://www.worldcommunitygrid.org/; Computer ID 3454626; resource share 0
7/22/2016 7:33:04 AM | World Community Grid | General prefs: from World Community Grid (last modified 04-Dec-2015 22:34:24)
7/22/2016 7:33:04 AM | World Community Grid | Computer location: school
7/22/2016 7:33:04 AM | | General prefs: using separate prefs for school
7/22/2016 7:33:04 AM | | Preferences:
7/22/2016 7:33:04 AM | | max memory usage when active: 5904.19MB
7/22/2016 7:33:04 AM | | max memory usage when idle: 7085.03MB
7/22/2016 7:33:04 AM | | max disk usage: 100.00GB
7/22/2016 7:33:04 AM | | (to change preferences, visit a project web site or select Preferences in the Manager)
7/22/2016 7:33:04 AM | | Not using a proxy


In good news, this machine has successfully completed and validated two WUs in the last 24 hours:
GD_jcarro_20160714201352000000_ThirdSimulations_SteadyState2000Schmidt98_conf_1329.xml_4
GD_jcarro_20160714202241000000_ThirdSimulations_SteadyState750Schmidt98_conf_697.xml_3

Not all is futile! And thank you for looking into credit for runtime. Like Jacob, points are not my main motivation for this, but it's nice to know that the project admins feel our pain. I will keep the gates open and look forward to more Beta units.
7) Message boards : Number crunching : Very long wus (Message 973)
Posted 22 Jul 2016 by Col323
Post:
Well, I've picked up GD_jcarro_20160714202359000000_ThirdSimulations_SteadyState8000Schmidt98_conf_916.xml_7 (Yes, _7.) Units 0-6 are a mixed crew of 1 pending valid after 147,000 seconds, 2 Errors while computing, and 4 aborted by user. _6 was actually aborted by mm67 after 43,863.05 seconds, so I'm guessing it's another self-restarted/failed checkpoint restart WU. Since I don't think this machine is going to reboot in the next 36 hours, I'll see what happens.

(For what it's worth, it's running Win 7.)


Well, after 34+ hours, it reset back to 0%. This WU will be aborted.
8) Message boards : Number crunching : Very long wus (Message 965)
Posted 20 Jul 2016 by Col323
Post:
I appreciate you looking into this and your responsiveness. Please don't take my multiple postings about errors as complaining; I am hoping to be helpful. I understand that failed experiments are part of science. I will keep my machines attached and hope that we can iron out all bugs. Then hopefully we can get more people on board and do more science!
9) Message boards : Number crunching : Lots of WUs "Error While Computing" (Message 962)
Posted 20 Jul 2016 by Col323
Post:
I don't think it's quite fixed. On a Linux box:

GD_jcarro_20160714201445000000_ThirdSimulations_SteadyState2000Schmidt98_conf_93.xml_2

Outcome Computation error
Client state Compute error
Exit status 193 (0xc1) EXIT_SIGNAL
Computer ID 62747
Run time 1 days 1 hours 5 min 53 sec
CPU time 1 days 0 hours 20 min 58 sec
Validate state Invalid
Credit 0.00
Device peak FLOPS 0.77 GFLOPS
Application version Carro-Rodriguez-Laguna-Pueyo Epicardial Model (Carro et al. 2011) for human ventricular cells v1.08
Peak working set size 4.36 MB
Peak swap size 14.12 MB
Peak disk usage 0.05 MB

Doing CP It:2185491915.000000
Doing CP It:2187622491.000000
Doing CP It:2189763663.000000
Doing CP It:2191921198.000000
Doing CP It:2194083377.000000
Doing CP It:2196235201.000000SIGSEGV: segmentation violation
Stack trace (8 frames):
../../projects/denis.usj.es_denisathome/CRLP2011EPI_108_x86_64-pc-linux-gnu(boinc_catch_signal+0x57)[0x4ca917]
/lib/x86_64-linux-gnu/libpthread.so.0(+0x10d10)[0x7f8fe7dadd10]
/lib/x86_64-linux-gnu/libc.so.6(_IO_vfprintf+0x24)[0x7f8fe7a1cc44]
/lib/x86_64-linux-gnu/libc.so.6(_IO_fprintf+0x87)[0x7f8fe7a27b97]
../../projects/denis.usj.es_denisathome/CRLP2011EPI_108_x86_64-pc-linux-gnu[0x41b225]
../../projects/denis.usj.es_denisathome/CRLP2011EPI_108_x86_64-pc-linux-gnu[0x462b0f]
/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf0)[0x7f8fe79f3a40]
../../projects/denis.usj.es_denisathome/CRLP2011EPI_108_x86_64-pc-linux-gnu[0x4048b9]

Exiting...

</stderr_txt>
]]>

There are 4 related units, 2 are In Progress, and 2 also errored. One of the errors was also on version 1.08.
10) Message boards : Number crunching : Very long wus (Message 960)
Posted 20 Jul 2016 by Col323
Post:
Well, I've picked up GD_jcarro_20160714202359000000_ThirdSimulations_SteadyState8000Schmidt98_conf_916.xml_7 (Yes, _7.) Units 0-6 are a mixed crew of 1 pending valid after 147,000 seconds, 2 Errors while computing, and 4 aborted by user. _6 was actually aborted by mm67 after 43,863.05 seconds, so I'm guessing it's another self-restarted/failed checkpoint restart WU. Since I don't think this machine is going to reboot in the next 36 hours, I'll see what happens.

(For what it's worth, it's running Win 7.)
11) Message boards : Number crunching : Very long wus (Message 947)
Posted 19 Jul 2016 by Col323
Post:
Not sure all bugs are squashed just yet. Last night I was running WU GD_jcarro_20160714201545000000_ThirdSimulations_SteadyState3000Schmidt98_conf_689.xml_2. It was at 8 hours and about 65% complete. This morning, it was at 15 hours and only 5% complete. The machine did not shut off or reboot overnight. This sounds like the "hit 100% and start over" bug others have reported.

I verified it was version 1.08. I have aborted the task.
12) Message boards : Number crunching : Very long wus (Message 931)
Posted 17 Jul 2016 by Col323
Post:
My laptop had an 8000 series unit run for 22 hours and was at 45% complete. Out of curiosity, I restarted Boinc. Sure enough, progress went back to 0% even though runtime was cumulative. This laptop can be shut down multiple times in a 24 hour period, making 8000 units a non-starter.

However, another machine cranked through two 8000 units at 32 hours each. They hit 100% and reported successfully. They are now awaiting validation. (Fingers crossed!)

I would still like to contribute to Denis. Is it possible to have any of the following?

1) Select the type of work you would like via profile? (e.g. default = short, home = short, medium, long)
2) Optimizations?
3) Bonus points for working a long WU?
13) Message boards : Number crunching : Very long wus (Message 909)
Posted 15 Jul 2016 by Col323
Post:
You're not alone, at least. On my i5-5300 (mobile), currently being heavily used, the WU known as GD_jcarro_20160714202312000000_ThirdSimulations_SteadyState8000Schmidt98_conf_1513.xml_0 is at 4.55% after 1 hour 30 minutes.

This laptop has also crunched through 4 other WUs today, and averaged about 2 hours 30 minutes for each. Those were all "SteadyState6XX" units.
14) Message boards : Number crunching : Lots of WUs "Error While Computing" (Message 899)
Posted 11 Jul 2016 by Col323
Post:
One of my computers successfully completed a couple units which took a couple of hours, as expected. Its stderr.txt output is over 2400 lines long. However, this computer also successfully completed units which took seconds, and their stderr.txt is only about 50 lines. Here is an example:

Name GD_jcarro_20160708110503000000_SecondSimulations_SteadyState4000_conf_917.xml_0
Workunit 18800550
Created 8 Jul 2016, 9:05:04 UTC
Sent 8 Jul 2016, 11:33:53 UTC
Report deadline 22 Jul 2016, 11:33:53 UTC
Received 8 Jul 2016, 11:34:25 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x0)
Computer ID 62844
Run time 2 sec
CPU time
Validate state Valid
Credit 0.01
Device peak FLOPS 2.60 GFLOPS
Application version Carro-Rodriguez-Laguna-Pueyo Epicardial Model (Carro et al. 2011) for human ventricular cells v1.06
Stderr output

<core_client_version>7.2.47</core_client_version>
<![CDATA[
<stderr_txt>
MName:CRLP2011_EPI
MID:0
OpT:12000000.000000
DT:0.002000
OutFreq:50
InT:11996000.000000
NumConstToChange:15
NumStatesToPrint:1
NumAlgToPrint:0
CC ID:16 NAME: G_Na in component Fast_Na_Current VALUE:13.5582
CC ID:17 NAME: G_Na_B in component Background_Na_Current VALUE:0.000690863
CC ID:23 NAME: G_Kr in component Rapidly_Activating_K_Current VALUE:0.0269904
CC ID:24 NAME: G_Ks in component Slowly_Activating_K_Current VALUE:0.00349788
CC ID:25 NAME: G_Kp in component Plateau_K_Current VALUE:0.00157695
CC ID:26 NAME: G_to in component Transient_Outward_K_Current VALUE:0.124306
CC ID:28 NAME: G_K1 in component Inward_Rectifier_K_Current VALUE:0.609042
CC ID:29 NAME: G_ClCa in component Ca_Activated_Cl_Current VALUE:0.0414283
CC ID:31 NAME: G_Cl_B in component Background_Cl_Current VALUE:0.0064337
CC ID:34 NAME: G_Ca in component L_Type_Calcium_Current VALUE:0.000175819
CC ID:47 NAME: G_Ca_B in component Background_Ca_Current VALUE:0.000676662
CC ID:19 NAME: Ibar_NaK in component Na_K_Pump_Current VALUE:0.907297
CC ID:44 NAME: Ibar_NCX in component Na_Ca_Exchanger_Current VALUE:5.70955
CC ID:46 NAME: Ibar_PMCA in component Sarcolemmal_Ca_Pump_Current VALUE:0.0681847
CC ID:7 NAME: I_Stim_CL in component membrane VALUE:4000
STP ID:0 - V in component membrane
CONFIG END
SolveModel 388
SolveModel 406 NUMC2CHANGE: 15
SolveModel 410 ITER: 0 , 16 --- 1.355820e+001
SolveModel 410 ITER: 1 , 17 --- 6.908630e-004
SolveModel 410 ITER: 2 , 23 --- 2.699040e-002
SolveModel 410 ITER: 3 , 24 --- 3.497880e-003
SolveModel 410 ITER: 4 , 25 --- 1.576950e-003
SolveModel 410 ITER: 5 , 26 --- 1.243060e-001
SolveModel 410 ITER: 6 , 28 --- 6.090420e-001
SolveModel 410 ITER: 7 , 29 --- 4.142830e-002
SolveModel 410 ITER: 8 , 31 --- 6.433700e-003
SolveModel 410 ITER: 9 , 34 --- 1.758190e-004
SolveModel 410 ITER: 10 , 47 --- 6.766620e-004
SolveModel 410 ITER: 11 , 19 --- 9.072970e-001
SolveModel 410 ITER: 12 , 44 --- 5.709550e+000
SolveModel 410 ITER: 13 , 46 --- 6.818470e-002
SolveModel 410 ITER: 14 , 7 --- 4.000000e+003
SolveModel 413
PRINTABLE_STATE ID:0
07:33:49 (5300): called boinc_finish(0)

</stderr_txt>
]]>
15) Message boards : Number crunching : Benchmarks of beta (Message 839)
Posted 6 Apr 2016 by Col323
Post:
Not the same for me.

AMD Phenom(tm) II X6 1065T Processor [Family 16 Model 10 Stepping 0]

Float 2374.07 million ops/sec
Integer 8699.13 million ops/sec

It's done 64 tasks in March & April, all in about 3:45 give or take. This is on Windows 7.
16) Message boards : Number crunching : Boinc Stats Team Challenge 31/10/15 to 7/11/15 - Is Denis Ready for Us? (Message 654)
Posted 7 Nov 2015 by Col323
Post:
I'm just a spectator of the team challenge, but I must say I have thoroughly enjoyed it. Hopefully some of the new firepower sticks around for a little while longer.

I'm also intrigued to know what sub-projects might arise in DENIS down the road. :-)
17) Message boards : Number crunching : Boinc Stats Team Challenge 31/10/15 to 7/11/15 - Is Denis Ready for Us? (Message 642)
Posted 4 Nov 2015 by Col323
Post:
Hello,

I think that the most chruncher are very disappointed ...
<snip>

One not disappointed cruncher here. Happily crunching, and thankful for the folks who create optimized apps as well as others around here who are rather helpful.

I'm sure there's a lot to do, and am looking forward to running this project long-term.