Doubt about linux benchmark and some docking users
Message boards : Number crunching : Doubt about linux benchmark and some docking users
Author | Message | |
---|---|---|
Hi all :)
|
||
ID: 1248 | Rating: 0 | rate: / | ||
Daniele,
Hi all :) ____________ D@H the greatest project in the world... a while from now! |
||
ID: 1251 | Rating: 0 | rate: / | ||
It is by now well known that the standard boinc benchmarks give much lower numbers than the same benchmarks on windows (on the same hardware). Yes, as I told that's not real a problem for me, I understand that it's fair if I leave credits to winsicure users, as I have a good and secure OS. I can't have all the best things for me, so I'm generous and leave my credit for them. :P What I meant is it's not normal (of course it is normal, but it shouldn't be) that an Athlon XP 2600+ with linux (nearly the same kernel as mine) claims for 2/3 of the credit for the same WU in the same CPU time. When I crunch a WU with it as a companion, I receive 6 credit per hour claiming for 8,5. And There's another Athlon XP (2700+) in the same situation, an Opteron(!!) crunching for 6 hours and claiming for 20 total credits O_O There are many cases like these, let's say one in 3 crunched WUs, and This is not only the benchmark, this could be a problem in those systems. I only wanted to let this guys know that their linux boxes aren't running properly. Maybe it's boinc's fault, maybe not. Could be many causes together. As a matter of fact, because of this inequality between platforms we are planning to stop using the boinc benchmarks all together and use other metrics for credit assignment like Rosetta and CPDN do. I've read about this, I think it's the best solution for users regarding fairness. Probably it's not the best one for being used as a term of comparison between projects with different directions and admins, unless there's a guidance by someone. ANyway I'm crunching more and more for d@h, credits are not important by themselves, and I'm rather waiting for new results from the project. Bye :) |
||
ID: 1252 | Rating: 0 | rate: / | ||
Good News Andre,
|
||
ID: 1253 | Rating: 0 | rate: / | ||
> I may as well get in early the, please don't copy Rosetta for their crediting system. I believe it to be flawed and still open to a form of unfair claiming (some call cheating), where the first result returned gets full cobblestone claim, while the later ones get a lower average.
|
||
ID: 1259 | Rating: 0 | rate: / | ||
But the real question is (I ask because today I've seen this happening! I would never have thought about this): are you sure you are not running the client itself with nice 19? This sometimes happens with Fedora and similar, and it's a totally wrong thing to do. I run BOINC on Fedora Core 5. I start BOINC at boot with the following lines added to the end of /etc/profile: ulimit -s unlimited cd /home/me/bin/BOINC ./run_manager & The /home/me/bin/BOINC/run_manager file is: cd /home/me/bin/BOINC exec ./boincmgr $@ As far as I can see I am not running BOINC with nice 19 but the benchmarks and credit claims are very low compared to what I get when that computer crunches under Windoze. I am another Linux newbie and I wonder... maybe some line in some other script is causing BOINC to run with nice 19? |
||
ID: 1261 | Rating: 0 | rate: / | ||
But the real question is (I ask because today I've seen this happening! I would never have thought about this): are you sure you are not running the client itself with nice 19? This sometimes happens with Fedora and similar, and it's a totally wrong thing to do. I don't know FC very well, but you can try this ps -eo ni,args|grep boinc The first coulumn is the NICE value, it should be 0 for the client. If it's 0, your problem should be another one. How many credits per hour do you claim for, in default conditions? |
||
ID: 1272 | Rating: 0 | rate: / | ||
... the first result returned gets full cobblestone claim, while the later ones get a lower average. I agree with you, it happened to me few days ago to receive 90+ credits for a WU lasting 3 hours. At the moment I didn't realize that I could be the first to return that WU, but probably I was. |
||
ID: 1273 | Rating: 0 | rate: / | ||
The output is... 0 ./boincmgr 0 ./boinc -redirectio -insecure 0 grep boinc OK, so boinc is running at nice 0. How many credits per hour do you claim for, in default conditions? Here at D@H I was completing WUs in about 1hr 20 min and claiming 9 credits. That's about 7 credits per hour, most of the time that machine does very little other work. |
||
ID: 1276 | Rating: 0 | rate: / | ||
I don't know Rosetta that well, so are there more details on this parameter?
Is Docking going to introduce a parameter where the user can change how long they process a Workunit like Rosetta does? ____________ D@H the greatest project in the world... a while from now! |
||
ID: 1287 | Rating: 0 | rate: / | ||
I don't know Rosetta that well, so are there more details on this parameter? They talk about it HERE It's slightly outdated in that the run time can now be selected from 1 hour to 24 hours and defaults to 3 hours. -- David |
||
ID: 1290 | Rating: 0 | rate: / | ||
Here at D@H I was completing WUs in about 1hr 20 min and claiming 9 credits. That's about 7 credits per hour, most of the time that machine does very little other work. This is the problem which we have to concentrate on. If your Athlon 64 runs for 1,5 hours and claims for 9 credits, and one core of my CoreDuo@2GHz runs for 3 hours and claims for 30 credits (for a similar WU), then our hosts are claiming for nearly the same number of credits per hour, but they shouldn't. If the work done is the same, the claimed credits should be the same. Before saying this, however, we have to know if the WU "contains" exactly the same work when running on different systems. Also your celeron, crunching for 3 hours, claims for half the credits I claim for. This could be right only if the work done is half the work I do on my hosts. If the work is the same, than your systems are claiming respectively for one third and half the credits than mine. Without going any further for now, the problem could be "distro-related". In few days, when I finish my current work, I want to set up a poll in order to know how the claimed credits are distributed "per distro". Anyway I hope that we'll soon use another way for granting credits :) |
||
ID: 1305 | Rating: 0 | rate: / | ||
I don't know Rosetta that well, so are there more details on this parameter? That's an interesting additional parameter for "fine tuning", but it's not so important at this moment. First we (you) should grant credits based on the work done (not on the benchmark results), then you can dinamically change the work per WU. edit: I read about the changes that will occur in the client (and its benchmark) for linux in few weeks, so this could be another way of solving the problem. Let's see what happens. I also read your post in another thread saying that it would be better to solve the problem at root, and make all projects use only one credits system. This would be nice too, but I don't think this will happen in short time for the biggest projects. Would be nice for small starting projects, anyway. |
||
ID: 1306 | Rating: 0 | rate: / | ||
For this alpha, the generated workunits have exactly the same number of FLOPS.
If the work done is the same, the claimed credits should be the same. ____________ D@H the greatest project in the world... a while from now! |
||
ID: 1312 | Rating: 0 | rate: / | ||
For this alpha, the generated workunits have exactly the same number of FLOPS. I don't yet have conclusive evidence, but am running some tests on my Athlon XP 3000+ which previously crunched with Windows 2000 Professional. I'm now running it with Puppy Linux. http://docking.utep.edu/show_host_detail.php?hostid=760 (Windows) http://docking.utep.edu/show_host_detail.php?hostid=1032 (Linux) With Windows, it took approx. 6.5 hours per work unit. I won't know for a few hours yet, but I have set it up with Puppy Linux and it looks like the first work unit will take under 4 hours (38.124% complete after 01:26:02). The interesting thing to me is that Linux boxes could thus get more credit (per CPU hour) than identical boxes running Windows if we go to Flops counting. But perhaps I'm missing something here. EDIT: Unfortunately the first WU errored out after 2:20 hours Docking@Home 26/11/2006 15:09:34 Unrecoverable error for result 1tng_mod0001_12704_112971_2 (process got signal 11) I'm letting it upload the huge error log in the hope it helps the project. |
||
ID: 1621 | Rating: 0 | rate: / | ||
Hi Yoda,
For this alpha, the generated workunits have exactly the same number of FLOPS. ____________ D@H the greatest project in the world... a while from now! |
||
ID: 1623 | Rating: 0 | rate: / | ||
The memory prefernces are only available in the alpha version of the BOINC software right now.
|
||
ID: 1625 | Rating: 0 | rate: / | ||
Hi Yoda, I allow BOINC to use up to 75% of total virtual memory and (apart from the tiny O/S) it was running nothing but BOINC. The WU had not finished, BOINC is set to keep applications in memory, but as Docking is the only project it's attached to, there should have been no swapping. The second WU finished without incident (in 3:45 hours). I'm aware of the credit/benchmarks issue. I just ran Docking under Linux for testing, not for the credit. But if we go to FLOPS counting or fixed credit, Linux should get its revenge :D ____________ Join the #1 Aussie Alliance on Docking@Home |
||
ID: 1627 | Rating: 0 | rate: / | ||
Hi Andre,
The question is now why this is happening? I see that your machine has 512 MB of memory. Is it swapping to disk at the moment Charmm exits? How much memory do you allow boinc (and thus the app) to use (I believe that's a preference setting)? Were any other applications that need a lot of memory running at the time? If there aren't plenty of available RAM and charmm need to use swap, does the application run so unstably, enough to cause 0x1 incorrect function? It's like science applications of climateprediction.net;-) Also I've never seen the error on linux, except the case where I forgot to set ulimit to unlimited. Regarding the difference between linux and windows: yes there will be a difference. Charmm has originally been developed for UNIX (and later Linux) and was never meant to run on Windows (who would ever run a scientific application on Windows? ;-), but of course then BOINC came around and since most volunteers run Windows (unfortunately...) the Scripps team has ported Charmm to windows. But it won't run as fast windows as on a unix-based platform (for the moment). Unfortunately to utilize the resources of PC one must optimise source code for windows or mac... please keep up nice works:) ____________ I'm a volunteer participant; my views are not necessarily those of Docking@Home or its participating institutions. |
||
ID: 1628 | Rating: 0 | rate: / | ||
If there aren't plenty of available RAM and charmm need to use swap, does the application run so unstably, enough to cause 0x1 incorrect function? It's like science applications of climateprediction.net;-) Also I've never seen the error on linux, except the case where I forgot to set ulimit to unlimited. We will!! Thanks for your good work on the forums Suguru :-) Andre ____________ D@H the greatest project in the world... a while from now! |
||
ID: 1648 | Rating: 0 | rate: / | ||
Message boards : Number crunching : Doubt about linux benchmark and some docking users
Database Error: The MySQL server is running with the --read-only option so it cannot execute this statement
array(3) { [0]=> array(7) { ["file"]=> string(47) "/boinc/projects/docking/html_v2/inc/db_conn.inc" ["line"]=> int(97) ["function"]=> string(8) "do_query" ["class"]=> string(6) "DbConn" ["object"]=> object(DbConn)#25 (2) { ["db_conn"]=> resource(102) of type (mysql link persistent) ["db_name"]=> string(7) "docking" } ["type"]=> string(2) "->" ["args"]=> array(1) { [0]=> &string(50) "update DBNAME.thread set views=views+1 where id=90" } } [1]=> array(7) { ["file"]=> string(48) "/boinc/projects/docking/html_v2/inc/forum_db.inc" ["line"]=> int(60) ["function"]=> string(6) "update" ["class"]=> string(6) "DbConn" ["object"]=> object(DbConn)#25 (2) { ["db_conn"]=> resource(102) of type (mysql link persistent) ["db_name"]=> string(7) "docking" } ["type"]=> string(2) "->" ["args"]=> array(3) { [0]=> object(BoincThread)#3 (16) { ["id"]=> string(2) "90" ["forum"]=> string(1) "2" ["owner"]=> string(3) "190" ["status"]=> string(1) "0" ["title"]=> string(50) "Doubt about linux benchmark and some docking users" ["timestamp"]=> string(10) "1164774823" ["views"]=> string(4) "1410" ["replies"]=> string(2) "19" ["activity"]=> string(20) "1.5735231723376e-127" ["sufferers"]=> string(1) "0" ["score"]=> string(1) "0" ["votes"]=> string(1) "0" ["create_time"]=> string(10) "1162564930" ["hidden"]=> string(1) "0" ["sticky"]=> string(1) "0" ["locked"]=> string(1) "0" } [1]=> &string(6) "thread" [2]=> &string(13) "views=views+1" } } [2]=> array(7) { ["file"]=> string(63) "/boinc/projects/docking/html_v2/user/community/forum/thread.php" ["line"]=> int(184) ["function"]=> string(6) "update" ["class"]=> string(11) "BoincThread" ["object"]=> object(BoincThread)#3 (16) { ["id"]=> string(2) "90" ["forum"]=> string(1) "2" ["owner"]=> string(3) "190" ["status"]=> string(1) "0" ["title"]=> string(50) "Doubt about linux benchmark and some docking users" ["timestamp"]=> string(10) "1164774823" ["views"]=> string(4) "1410" ["replies"]=> string(2) "19" ["activity"]=> string(20) "1.5735231723376e-127" ["sufferers"]=> string(1) "0" ["score"]=> string(1) "0" ["votes"]=> string(1) "0" ["create_time"]=> string(10) "1162564930" ["hidden"]=> string(1) "0" ["sticky"]=> string(1) "0" ["locked"]=> string(1) "0" } ["type"]=> string(2) "->" ["args"]=> array(1) { [0]=> &string(13) "views=views+1" } } }query: update docking.thread set views=views+1 where id=90