Issues with Charmm 5.07
Message boards : Number crunching : Issues with Charmm 5.07
Author | Message | |
---|---|---|
OK, I'll kick this off.
|
||
ID: 3127 | Rating: 0 | rate: / | ||
MacOS X 10.4.9, BOINC 5.8.17:
|
||
ID: 3130 | Rating: 0 | rate: / | ||
Don't believe the initial estimates of runtime.
|
||
ID: 3133 | Rating: 0 | rate: / | ||
Don't believe the initial estimates of runtime. I'm blowing through 2 workunits every hour and 20 mins >:P I'm stoked. Before it was 2 hours or more for the same result! |
||
ID: 3137 | Rating: 0 | rate: / | ||
Charmm 5.06/5.07=5h/2:48h
|
||
ID: 3140 | Rating: 0 | rate: / | ||
This is weird. Unfortunately we only have an Intel MiniMac in the lab,but our tests showed a consistent 5 hours running time for this workunit. Your computer is much more powerful than a minimac so you should be lots faster. Weird... Can you send us your input and output files (the ones ending with .inp, x_0, x_1, x_2 and x_3 in the projects/docking.utep.edu directory? An email to dockingadmin@utep.edu would suffice.
MacOS X 10.4.9, BOINC 5.8.17: ____________ D@H the greatest project in the world... a while from now! |
||
ID: 3145 | Rating: 0 | rate: / | ||
>> Despite all your hard work Docking team, the new application only improved a bit for Linux but heaps for Windows, so old Linux is getting slugged again.
|
||
ID: 3146 | Rating: 0 | rate: / | ||
mac os x (10.4.9) and macbook (2GHz Core Duo)
|
||
ID: 3148 | Rating: 0 | rate: / | ||
Wow. Maybe it's something to do with my setup, then. I'm pretty sure my prefs are set correctly, now that I enabled more than 2 procs (last month)...
|
||
ID: 3149 | Rating: 0 | rate: / | ||
First Linux 5.07 wu finished...
|
||
ID: 3151 | Rating: 0 | rate: / | ||
linux 5.07 is about 5700s, 2000s faster than 5.05/5.06. Thats good since I had about 800 worth of pending credits that was granted a big fat 0. Yes, I got the mail but doesn't help if the units were already crunched.
|
||
ID: 3158 | Rating: 0 | rate: / | ||
Computer Current Config
AuthenticAMD Memory: 1 GB PC-3200 Video: on-board S-3 Unichrome (I don't do gaming) Disk Interface: IDE - old 40 GB drive NOTE: This computer previously ran Fedora Core 3 with default gui. Also, it doesn't do anything that uses much memory or CPU other than BOINC. The screensaver is turned off. It's set to power down the display after about 10 minutes of no keyboard or mouse activity. BOINC Current Version: 5.8.17 Current General Preferences: ...Leave applications in memory while suspended: YES ...Switch between applications every: 60 Minutes ...On multiprocessors, use at most: 16 processors ...Use at most: 100 percent of CPU time ...Use at most: 30 GB disk space ...Leave at least: 5 GB disk space free ...Use at most: 50% of total disk space ...Write to disk at most every: 60 seconds ...Use at most: 75% of page file (swap space) ...Use at most: 40% of memory when computer is in use ...Use at most: 40% of memory when computer is idle Stats for WU on different versions Boinc = 5.8.17, D@H = 5.07, Approx seconds per WU = 11,250 Boinc = 5.8.17, D@H = 5.05, Approx seconds per WU = 16,250 Boinc = 5.8.17, D@H = 5.04, Approx seconds per WU = 14,900 Boinc = 5.4.11, D@H = 5.02, Approx seconds per WU = 14,800 Boinc = 5.4.9 , D@H = 5.02, Approx seconds per WU = 14,900 Boinc = 5.4.9 , D@H = 5.02, Approx seconds per WU = 7,500 - Mid October 2006 and earlier I think that jump from about 7,500 to 14,900 in October 2006 was when the project decided to double the work done per WU. Hope This Helps, -- David Ball EDIT: There's almost no disk activity except BOINC. I see the disk activity light blink every 2 - 3 seconds. ____________ The views expressed are my own. Facts are subject to memory error :-) Have you read a good science fiction novel lately? |
||
ID: 3159 | Rating: 0 | rate: / | ||
Hi,
|
||
ID: 3162 | Rating: 0 | rate: / | ||
OK, I'll kick this off. Back to my original report: Hosts ID 5 Completed 7 results for Charmm 5.07 Windows at of 2:44:48 to 2:47:52 (184s difference min-max) which is faster than 5.05 was at 4:20:00 **GOOD** --- ID 1829 - Twin1 GenuineIntel Intel(R) Pentium(R) 4 CPU 3.20GHz [x86 Family 15 Model 4 Stepping 1] [fpu tsc pae nx sse sse2 mmx] Microsoft Windows XP Professional Edition, Service Pack 2, (05.01.2600.00) ID 1932 - Twin2 GenuineIntel Intel(R) Pentium(R) 4 CPU 3.20GHz [Family 15 Model 4 Stepping 1] [fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx constant_tsc pni monitor ds_cpl cid xtpr] Linux 2.6.17-10-generic (Ubuntu 6) Twin2 could be either windows(ID:1828) or linux(ID:1932), dual boot. Both use the same BOINC venue settings. You can look at results for host 1828 (Twin2 on windows) vs host 1829 (Twin1 on windows) and see that for Charmm 5.05 the run times all were within the same run time range, average 17000s. Twin1 Completed 6 Charmm 5.07 windows tasks at 11036s to 11376s (340s difference min-max) which beats previous times of ~17000s **GOOD** I compared these times against one other host I found with same processor on docking and they were about the same. Twin2 Completed 5 Charmm 5.07 linux tasks at 16892 to 17323 (431s difference min-max) I do not have and 5.05 times for this host, sorry I just installed linux on it a few days ago. **NEUTRAL** BUT . The windows 5.07 app at on the same speed processor and identical hardware is outperforming the linux 5.07 app. approx 11206s vs 17107s difference of 5901s or 98min or 1.63 hours I could live with a small difference, say a few minutes. **Note** At this time there are no other applications installed or running on these hosts. They were both set up with the O/S, BOINC installed and run, thats it. No modifications to hardware (except memory upgrade, they both have same amount, brand and type.). Bottom line: The WindowsXP hosts can crunch more work per 24 hours for a bigger benefit to science. |
||
ID: 3164 | Rating: 0 | rate: / | ||
Thanks for all the useful posts people. From all of these results, it looks like we have to go back to some sort of variable credit scheme: there is just too much variability between results to continue with the fixed credit. In the lab we have begun doing experiments with bigger molucules and we've observed that the variability becomes even worse. Anybody that has a view on what the state of the boinc client benchmarks is right now? Are linux/windows/macs in the same league now? If they are we could probably go back to variable credits immediately; if they are not, that might not be a great solution too.
OK, I'll kick this off. ____________ D@H the greatest project in the world... a while from now! |
||
ID: 3166 | Rating: 0 | rate: / | ||
Anybody that has a view on what the state of the boinc client benchmarks is right now? Are linux/windows/macs in the same league now? I just ran a set Twin1 Windows cc5.8.16 1398 floating point MIPS (Whetstone) per CPU 1657 integer MIPS (Drystone) per CPU Twin2 Linux cc5.8.17 1427 floating point MIPS (Whetstone) per CPU 1794 integer MIPS (Drystone) per CPU I would say these are fairly close |
||
ID: 3168 | Rating: 0 | rate: / | ||
Anybody that has a view on what the state of the boinc client benchmarks is right now? Are linux/windows/macs in the same league now? If they are we could probably go back to variable credits immediately; if they are not, that might not be a great solution too. According to the alpha test mailing list, there are some funky things going on with benchmarking in the entire 5.9 series of BOINC. It hasn't quite settled yet. The official releases seem to be unaffected, and have no benchmarking issues that I know of.. |
||
ID: 3172 | Rating: 0 | rate: / | ||
MacPro 2.66GHz Xeon(four cores), MacOS X 10.4.9, BOINC 5.8.17:
|
||
ID: 3178 | Rating: 0 | rate: / | ||
John,
MacPro 2.66GHz Xeon(four cores), MacOS X 10.4.9, BOINC 5.8.17: ____________ D@H the greatest project in the world... a while from now! |
||
ID: 3179 | Rating: 0 | rate: / | ||
Yeah. I don't understand those numbers either. They should all be roughly the same.
|
||
ID: 3180 | Rating: 0 | rate: / | ||
One thing I have noticed on my linux box (single core) is that other projects like Rosetta spend about 0.3 % in system time and Docking spends about 18 % to 25 % in system time.
|
||
ID: 3181 | Rating: 0 | rate: / | ||
> I currently run Docking on 4 computers, 2 Windows ans 2 Linux.
|
||
ID: 3182 | Rating: 0 | rate: / | ||
John, Yes, that's correct. All charmm processes finish at about the same time. It's just that the time increases with the more processors I enable. Very weird... ____________ John MacPro 2 x 2.66GHz Dual-Core Xeon | 2GB RAM | ATI x1900 | BOINC 5.9.5 |
||
ID: 3194 | Rating: 0 | rate: / | ||
One thing I have noticed on my linux box (single core) is that other projects like Rosetta spend about 0.3 % in system time and Docking spends about 18 % to 25 % in system time. Linux (64-bit Ubuntu 7.04, 32-bit boinc 5.9.4, 32-bit Charmm 5.07). Noticed that the application starts four processes. Using strace on the second (next-to-lowest numbered) of these processes, it is __continuously__ issuing 'gettimeofday()' calls. Seems to me it is *this* activity that is the cause of the huge demand for Linux system time by recent Docking application versions. . |
||
ID: 3203 | Rating: 0 | rate: / | ||
Thanks Mikus,
One thing I have noticed on my linux box (single core) is that other projects like Rosetta spend about 0.3 % in system time and Docking spends about 18 % to 25 % in system time. ____________ D@H the greatest project in the world... a while from now! |
||
ID: 3205 | Rating: 0 | rate: / | ||
Are these 'gettimeofday()' calls used for the following...
Use network only between the hours of or are those calls monitoring the actual time of the unit? Just a question.. ;-) ____________ |
||
ID: 3207 | Rating: 0 | rate: / | ||
One thing I have noticed on my linux box (single core) is that other projects like Rosetta spend about 0.3 % in system time and Docking spends about 18 % to 25 % in system time. @Mikus, Does your 32-bit BOINC Manager work with your 64-bit Ubuntu? |
||
ID: 3211 | Rating: 0 | rate: / | ||
Mikus, Does your 32-bit BOINC Manager work with your 64-bit Ubuntu? Yes. I'm using only the Berkeley BOINC packages (which include boinc and boincmgr). So far, they have been available for Linux only in 32-bit. [On 64-bit Ubuntu, some versions of boincmgr (but 5.9.4 works) would "hang" (100% CPU) if the terminal from which I started boincmgr had 'ulimit -s unlimited'. On 64-bit SuSE, the ulimit setting made no difference to boincmgr - it always worked.] What is more interesting to me is that even with the 32-bit boinc client, I am able to download 64-bit applications (if available) and use app_info.xml to run them. For example, ABC@home workunits run much faster under the 64-bit application than under the 32-bit application. [Docking aims to use "homogeneous redundancy", so does not support the use of app_info.xml.] . |
||
ID: 3212 | Rating: 0 | rate: / | ||
http://docking.utep.edu/result.php?resultid=181260
<core_client_version>5.8.8</core_client_version> I'm not quite sure, but surely it's related with 'forrtl'. I search the forum for the same exit code, and I found this: http://docking.utep.edu/forum_thread.php?id=77&nowrap=true#1232 It is what I've reported as a issue with charmm 5.03, very old one. Also I noticed finally this thread should be made sticky. suguruhirahara ____________ I'm a volunteer participant; my views are not necessarily those of Docking@Home or its participating institutions. |
||
ID: 3261 | Rating: 0 | rate: / | ||
No issues up till now for me... only thing I had to alter was my Win Avast scanner.
|
||
ID: 3265 | Rating: 0 | rate: / | ||
Thanks Suguru. I'm going with the post you refer as I can read the english in the message and it seems to me that Charmm is trying to open a file that is already open by another process. Often these things occur due to a virus scanner or something scanning through files. Could you track down if this might have been happening? I've seen this error quite a bit and really suspect the virus scanners...
http://docking.utep.edu/result.php?resultid=181260 ____________ D@H the greatest project in the world... a while from now! |
||
ID: 3281 | Rating: 0 | rate: / | ||
> I currently run Docking on 4 computers, 2 Windows ans 2 Linux. > I have only been using my Windows machines for the last 1 to 2 weeks due to the slow processing times on Linux. I thought this may of changed so I downloaded 5 Wus to my AMD Opteron 275 Linux Fedora Core 6 machine. Unfortunately things have gotten worse. Where it was taking an average of about 10,500 seconds (sowewhere around 2h 50m) before 5.07 and then went to 19,500 seconds on 5.07, with a few hitting 23,000 seconds. Well now it has increased to 28,000 seconds (7h 30m) and the corrosponding granted cobblestones have dropped from around 17/h to 9-10/h now they are 6-6.5/h. What I can't understand is that some other users using Linux, paired against me still have similar run times as before mostly less than 3 hours. What gives? I have changed nothing on my machine (my other Opteron has the same problem), and other projects work as normal. ____________ |
||
ID: 3284 | Rating: 0 | rate: / | ||
|
||
ID: 3298 | Rating: 0 | rate: / | ||
I just attached my G4/733 running BOINC v5.8.17 under Mac OS 10.3.9, and the first three results it returned errored out immediately with “exit status 5 (0x5)”. The output files say: <message> P.S. I notice this host is being grouped with G5s running Mac OS 10.4.x—obviously these particular results never got that far, but could this degree of ‘heterogeneity’ affect validation? |
||
ID: 3328 | Rating: 0 | rate: / | ||
Here are the messages logged by BOINC Manager for one of those tasks:
Tue May 15 17:23:12 2007|Docking@Home|Starting 1tng_mod0011_7508_204189_1 |
||
ID: 3338 | Rating: 0 | rate: / | ||
Hmmm, we compile on 10.4 (don't have 10.3) I wonder if there are functions in the 10.4 system library that are missing in 10.3. I think you are the first Panther user and so we didn't see this before. We'll check.
I just attached my G4/733 running BOINC v5.8.17 under Mac OS 10.3.9, and the first three results it returned errored out immediately with “exit status 5 (0x5)”. The output files say:<message> ____________ D@H the greatest project in the world... a while from now! |
||
ID: 3343 | Rating: 0 | rate: / | ||
Hmmm, we compile on 10.4 (don't have 10.3) I wonder if there are functions in the 10.4 system library that are missing in 10.3. I think you are the first Panther user and so we didn't see this before. We'll check. Thanks. I was preoccupied today and forgot to stop my host from asking for more work, so I’m afraid it’s coughed up a few more failed results, and will probably add a couple more overnight. Sorry about that: I’ll try and remember to set it to NNT for D@h tomorrow morning. |
||
ID: 3348 | Rating: 0 | rate: / | ||
If you look at "top", where it shows the percent of cpu time spent in different states, Other projects spend almost all of their time in either "user" or "nice" CPU state, but Charmm is spending a LOT of time in the "system" state (running inside the OS). I’ve noticed this too on my G5 Mac (with Activity Monitor—essentially a GUI for top ): when running other projects there’s very little non-“nice” activity, but a fair bit of “system” activity appears (on both CPUs) when a Docking task is running. The only other project whose app I’ve noticed behaving this way is SzTAKI Desktop Grid. |
||
ID: 3351 | Rating: 0 | rate: / | ||
We're looking into this, but haven't found the issue causing the high system overhead yet. Thanks for the report.
If you look at "top", where it shows the percent of cpu time spent in different states, Other projects spend almost all of their time in either "user" or "nice" CPU state, but Charmm is spending a LOT of time in the "system" state (running inside the OS). ____________ D@H the greatest project in the world... a while from now! |
||
ID: 3353 | Rating: 0 | rate: / | ||
Message boards : Number crunching : Issues with Charmm 5.07
Database Error: The MySQL server is running with the --read-only option so it cannot execute this statement
array(3) { [0]=> array(7) { ["file"]=> string(47) "/boinc/projects/docking/html_v2/inc/db_conn.inc" ["line"]=> int(97) ["function"]=> string(8) "do_query" ["class"]=> string(6) "DbConn" ["object"]=> object(DbConn)#44 (2) { ["db_conn"]=> resource(144) of type (mysql link persistent) ["db_name"]=> string(7) "docking" } ["type"]=> string(2) "->" ["args"]=> array(1) { [0]=> &string(51) "update DBNAME.thread set views=views+1 where id=239" } } [1]=> array(7) { ["file"]=> string(48) "/boinc/projects/docking/html_v2/inc/forum_db.inc" ["line"]=> int(60) ["function"]=> string(6) "update" ["class"]=> string(6) "DbConn" ["object"]=> object(DbConn)#44 (2) { ["db_conn"]=> resource(144) of type (mysql link persistent) ["db_name"]=> string(7) "docking" } ["type"]=> string(2) "->" ["args"]=> array(3) { [0]=> object(BoincThread)#3 (16) { ["id"]=> string(3) "239" ["forum"]=> string(1) "2" ["owner"]=> string(1) "4" ["status"]=> string(1) "0" ["title"]=> string(23) "Issues with Charmm 5.07" ["timestamp"]=> string(10) "1179621707" ["views"]=> string(4) "2260" ["replies"]=> string(2) "38" ["activity"]=> string(22) "6.263654538361599e-120" ["sufferers"]=> string(1) "0" ["score"]=> string(1) "0" ["votes"]=> string(1) "0" ["create_time"]=> string(10) "1177645935" ["hidden"]=> string(1) "0" ["sticky"]=> string(1) "0" ["locked"]=> string(1) "0" } [1]=> &string(6) "thread" [2]=> &string(13) "views=views+1" } } [2]=> array(7) { ["file"]=> string(63) "/boinc/projects/docking/html_v2/user/community/forum/thread.php" ["line"]=> int(184) ["function"]=> string(6) "update" ["class"]=> string(11) "BoincThread" ["object"]=> object(BoincThread)#3 (16) { ["id"]=> string(3) "239" ["forum"]=> string(1) "2" ["owner"]=> string(1) "4" ["status"]=> string(1) "0" ["title"]=> string(23) "Issues with Charmm 5.07" ["timestamp"]=> string(10) "1179621707" ["views"]=> string(4) "2260" ["replies"]=> string(2) "38" ["activity"]=> string(22) "6.263654538361599e-120" ["sufferers"]=> string(1) "0" ["score"]=> string(1) "0" ["votes"]=> string(1) "0" ["create_time"]=> string(10) "1177645935" ["hidden"]=> string(1) "0" ["sticky"]=> string(1) "0" ["locked"]=> string(1) "0" } ["type"]=> string(2) "->" ["args"]=> array(1) { [0]=> &string(13) "views=views+1" } } }query: update docking.thread set views=views+1 where id=239