Howto: Dealing With 0x1 Error


Advanced search

Message boards : Unix/Linux : Howto: Dealing With 0x1 Error

Sort
Author Message
Profile suguruhirahara
Forum moderator
Volunteer tester
Avatar

Joined: Sep 13 06
Posts: 282
ID: 15
Credit: 56,614
RAC: 0
Message 1441 - Posted 16 Nov 2006 11:52:06 UTC
Last modified: 8 Feb 2007 12:13:06 UTC

Hello,

the information about 0x1 error (Incorrect function) doesn't seems to be widely known for linux crunchers (especially newcomers), and there come to appear several results which were canceled with "Too many error results". It shouldn't be expected for both participants and developers side, therefore, I'll notice 0x1 error and let you know how to avoid the issue of Charmm 5.02 for linux.

At first please have a read over the FAQ, and you'll got column named "I have been receiving a message on my Linux machine where charm exits with code 1. Is there a fix to this problem?" The answer says everything needed, but it may be difficult for you to got it, so I'll paste howto written by conan, one of the hard crunchers who've joined this project from the start.

The solution to your problem, and to nearly all testers using Linux on this project has been to

1) Stop Boinc

2) Add the line " ulimit -s unlimited " (without the " marks), as the first line in the 'run_manager' file in Boinc folder.

3) Restart Boinc

If you have a dual processor then you should also add the same line to 'run_client' file in the Boinc folder as well. Also if a dual core then you may also need to reboot (as I needed to with a dual core Opteron).
(space lines added and made bold by me. thanks conan:)

Please make sure that this issue won't be solved until the next version of charmm is published.

Also, please let us know if you couldn't solve the issue by doing in that way.

Add: there's thought to be several distributions that don't work with this option.

Thanks for reading,
suguruhirahara
____________

I'm a volunteer participant; my views are not necessarily those of Docking@Home or its participating institutions.
Profile Andre Kerstens
Forum moderator
Project tester
Volunteer tester
Avatar

Joined: Sep 11 06
Posts: 749
ID: 1
Credit: 15,199
RAC: 0
Message 1448 - Posted 16 Nov 2006 19:44:46 UTC

Thanks Suguru,

Only one remark: we can't garantee that the memory issue will be solved in the next release, since we haven't arrived at the point of finding an app-based solution yet.

Thanks

Andre
____________
D@H the greatest project in the world... a while from now!

Profile suguruhirahara
Forum moderator
Volunteer tester
Avatar

Joined: Sep 13 06
Posts: 282
ID: 15
Credit: 56,614
RAC: 0
Message 1461 - Posted 17 Nov 2006 1:36:59 UTC - in response to Message ID 1448 .
Last modified: 17 Nov 2006 1:37:57 UTC

Thanks for letting us know the point, Andre:)

Since there are so many cases without that process, could you notice this topic on the top page, in the column of News? There are probably users who haven't noticed on this point.

I'm afraid that thousands(!) of workunits are / will be wasted with this issue...:(

thanks,
suguruhirahara

Profile Conan
Volunteer tester
Avatar

Joined: Sep 13 06
Posts: 219
ID: 100
Credit: 4,256,493
RAC: 0
Message 1472 - Posted 18 Nov 2006 1:48:40 UTC - in response to Message ID 1461 .

Thanks for letting us know the point, Andre:)

Since there are so many cases without that process, could you notice this topic on the top page, in the column of News? There are probably users who haven't noticed on this point.

I'm afraid that thousands(!) of workunits are / will be wasted with this issue...:(

thanks,
suguruhirahara


>> @ suguruhirahara, thanks for your words above. I noticed in the "Distro Questionaire" thread (near the end), that Cold Shot added some more information on Ubuntu linux that will help other Linux users. It shows that not all Linux's are equal or the same.

____________
Profile Andre Kerstens
Forum moderator
Project tester
Volunteer tester
Avatar

Joined: Sep 11 06
Posts: 749
ID: 1
Credit: 15,199
RAC: 0
Message 1485 - Posted 19 Nov 2006 0:18:50 UTC - in response to Message ID 1461 .

Done. People should look at the FAQ too where this issue is mentioned.

Andre

Thanks for letting us know the point, Andre:)

Since there are so many cases without that process, could you notice this topic on the top page, in the column of News? There are probably users who haven't noticed on this point.

I'm afraid that thousands(!) of workunits are / will be wasted with this issue...:(

thanks,
suguruhirahara


____________
D@H the greatest project in the world... a while from now!
Tom Philippart
Volunteer tester
Avatar

Joined: Dec 22 06
Posts: 17
ID: 340
Credit: 44,929
RAC: 0
Message 1894 - Posted 29 Dec 2006 15:01:44 UTC
Last modified: 29 Dec 2006 15:01:58 UTC

On Ubuntu you have to add the command in the "run_manager" file, simply entering it in the terminal doesn't fix the problem.

j2satx
Volunteer tester

Joined: Dec 22 06
Posts: 183
ID: 339
Credit: 16,191,581
RAC: 0
Message 1898 - Posted 29 Dec 2006 20:00:36 UTC - in response to Message ID 1894 .

On Ubuntu you have to add the command in the "run_manager" file, simply entering it in the terminal doesn't fix the problem.


Shouldn't you also put it in the "run_client" file?
Profile Andre Kerstens
Forum moderator
Project tester
Volunteer tester
Avatar

Joined: Sep 11 06
Posts: 749
ID: 1
Credit: 15,199
RAC: 0
Message 1900 - Posted 29 Dec 2006 20:48:06 UTC - in response to Message ID 1898 .

I understand that that is only necessary for multi-core/processor machines.

AK


Shouldn't you also put it in the "run_client" file?


____________
D@H the greatest project in the world... a while from now!
j2satx
Volunteer tester

Joined: Dec 22 06
Posts: 183
ID: 339
Credit: 16,191,581
RAC: 0
Message 1901 - Posted 29 Dec 2006 20:57:31 UTC - in response to Message ID 1900 .

I understand that that is only necessary for multi-core/processor machines.

AK


Shouldn't you also put it in the "run_client" file?



I don't run any of my machines with the "run_manager" routinely. I avoid the overhead of the manager by using the "run_client".

Profile Keck_Komputers
Volunteer tester
Avatar

Joined: Sep 13 06
Posts: 14
ID: 20
Credit: 1,263,209
RAC: 0
Message 1905 - Posted 30 Dec 2006 12:19:22 UTC - in response to Message ID 1898 .

On Ubuntu you have to add the command in the "run_manager" file, simply entering it in the terminal doesn't fix the problem.


Shouldn't you also put it in the "run_client" file?

It depends on how you start the client. If you only use one way then that is the only one you need to modify. I have only changed the "run_client" file since the client is already running whenever I use the manager.
____________
BOINC WIKI

BOINCing since 2002/12/8
j2satx
Volunteer tester

Joined: Dec 22 06
Posts: 183
ID: 339
Credit: 16,191,581
RAC: 0
Message 1906 - Posted 30 Dec 2006 13:08:48 UTC - in response to Message ID 1905 .

On Ubuntu you have to add the command in the "run_manager" file, simply entering it in the terminal doesn't fix the problem.


Shouldn't you also put it in the "run_client" file?

It depends on how you start the client. If you only use one way then that is the only one you need to modify. I have only changed the "run_client" file since the client is already running whenever I use the manager.


Exactly. I modified both, so I'm covered either way.
Profile clownius
Volunteer tester
Avatar

Joined: Nov 14 06
Posts: 61
ID: 280
Credit: 2,677
RAC: 0
Message 1955 - Posted 6 Jan 2007 0:50:53 UTC

Because i didnt find it in these threads.

For people using (K)Ubuntu 6.10 (and 6.06.1 as well i think) and have used the boinc from the universe repositories for their install.

The file you need to modify is found at
/etc/init.d/boinc-client not in the boinc folder as mentioned here
____________

Profile Andre Kerstens
Forum moderator
Project tester
Volunteer tester
Avatar

Joined: Sep 11 06
Posts: 749
ID: 1
Credit: 15,199
RAC: 0
Message 1962 - Posted 6 Jan 2007 4:36:11 UTC - in response to Message ID 1955 .
Last modified: 6 Jan 2007 4:38:25 UTC

Thanks for that info! (I've added it to the FAQ too)

AK

Because i didnt find it in these threads.

For people using (K)Ubuntu 6.10 (and 6.06.1 as well i think) and have used the boinc from the universe repositories for their install.

The file you need to modify is found at
/etc/init.d/boinc-client not in the boinc folder as mentioned here


____________
D@H the greatest project in the world... a while from now!
Profile clownius
Volunteer tester
Avatar

Joined: Nov 14 06
Posts: 61
ID: 280
Credit: 2,677
RAC: 0
Message 2021 - Posted 10 Jan 2007 2:32:51 UTC

I think this applies to most if not all debian systems.
____________

j2satx
Volunteer tester

Joined: Dec 22 06
Posts: 183
ID: 339
Credit: 16,191,581
RAC: 0
Message 2173 - Posted 17 Jan 2007 12:23:23 UTC - in response to Message ID 1441 .

Hello,

the information about 0x1 error (Incorrect function) doesn't seems to be widely known for linux crunchers (especially newcomers), and there come to appear several results which were canceled with "Too many error results". It shouldn't be expected for both participants and developers side, therefore, I'll notice 0x1 error and let you know how to avoid the issue of Charmm 5.02 for linux.

At first please have a read over the FAQ, and you'll got column named "I have been receiving a message on my Linux machine where charm exits with code 1. Is there a fix to this problem?" The answer says everything needed, but it may be difficult for you to got it, so I'll paste howto written by conan, one of the hard crunchers who've joined this project from the start.
The solution to your problem, and to nearly all testers using Linux on this project has been to

1) Stop Boinc

2) Add the line " ulimit -s unlimited " (without the " marks), as the first line in the 'run_manager' file in Boinc folder.

3) Restart Boinc

If you have a dual processor then you should also add the same line to 'run_client' file in the Boinc folder as well. Also if a dual core then you may also need to reboot (as I needed to with a dual core Opteron).
(space lines added and made bold by me. thanks conan:)

Please make sure that this issue won't be solved until the next version of charmm is published.

Also, please let us know if you couldn't solve the issue by doing in that way.

Thanks for reading,
suguruhirahara


I converted a computer (1348) to Ubuntu 6.10. the "unlimit -s unlimited" is in the run_client and run_manager, but it still errors.

The computer ran D@H with no problems under W2K and runs Tanpaku and Malaria under the Linux install with no problems.

I'm going to wipe the disk and do a fresh Linux install to see if that fixes the issue.

BobCat13
Volunteer tester

Joined: Nov 14 06
Posts: 22
ID: 239
Credit: 285,322
RAC: 0
Message 2179 - Posted 17 Jan 2007 17:03:08 UTC - in response to Message ID 2173 .

I converted a computer (1348) to Ubuntu 6.10. the "unlimit -s unlimited" is in the run_client and run_manager, but it still errors.

The computer ran D@H with no problems under W2K and runs Tanpaku and Malaria under the Linux install with no problems.

I'm going to wipe the disk and do a fresh Linux install to see if that fixes the issue.


How did you install BOINC? From the Ubuntu repository or by getting the client from Berkeley's site?

If from the Ubuntu repository you need to edit the /etc/init.d/boinc-client file for "ulimit -s unlimited" to work properly.
j2satx
Volunteer tester

Joined: Dec 22 06
Posts: 183
ID: 339
Credit: 16,191,581
RAC: 0
Message 2181 - Posted 17 Jan 2007 17:10:27 UTC - in response to Message ID 2179 .

I converted a computer (1348) to Ubuntu 6.10. the "unlimit -s unlimited" is in the run_client and run_manager, but it still errors.

The computer ran D@H with no problems under W2K and runs Tanpaku and Malaria under the Linux install with no problems.

I'm going to wipe the disk and do a fresh Linux install to see if that fixes the issue.


How did you install BOINC? From the Ubuntu repository or by getting the client from Berkeley's site?

If from the Ubuntu repository you need to edit the /etc/init.d/boinc-client file for "ulimit -s unlimited" to work properly.


Installed from Berkeley's URL.

Re-installing as I type.
j2satx
Volunteer tester

Joined: Dec 22 06
Posts: 183
ID: 339
Credit: 16,191,581
RAC: 0
Message 2197 - Posted 18 Jan 2007 0:00:38 UTC
Last modified: 18 Jan 2007 0:01:29 UTC

Fresh Linux install worked.....now is 1358.

I don't know what happened the first time.

Tom Philippart
Volunteer tester
Avatar

Joined: Dec 22 06
Posts: 17
ID: 340
Credit: 44,929
RAC: 0
Message 2292 - Posted 21 Jan 2007 22:24:39 UTC

Is a new application for linux fixing this already in developement? Any update on progress?
Other projects like Rosetta crash with this ulimit setting on my ubuntu 6.10 computer :(
____________

Profile Conan
Volunteer tester
Avatar

Joined: Sep 13 06
Posts: 219
ID: 100
Credit: 4,256,493
RAC: 0
Message 2295 - Posted 22 Jan 2007 1:02:53 UTC - in response to Message ID 2292 .
Last modified: 22 Jan 2007 1:05:16 UTC

Is a new application for linux fixing this already in developement? Any update on progress?
Other projects like Rosetta crash with this ulimit setting on my ubuntu 6.10 computer :(


> Tom could there be something specific with Ubuntu 6.10 ? Such as some setting not configured?
I use Linux Fedora Core 3 on 2 computers (both AMD Opterons) and have not had this issue (I run Rosetta, Einstein, CPDN, Ralph, QMC, Docking, Predictor and LHC on these 2 machines). As my machines are dual core I had to add the 'ulimit' fix to both 'run_manager' and to 'run_client', otherwise it did not work.

Conan.

____________
Profile Trog Dog
Volunteer tester
Avatar

Joined: Nov 14 06
Posts: 16
ID: 279
Credit: 145,805
RAC: 0
Message 2302 - Posted 22 Jan 2007 12:38:50 UTC

Can't say that I've noticed this either. I have had issues with 5.8.x clients and Rosetta (and WCG) though.
____________

Profile Andre Kerstens
Forum moderator
Project tester
Volunteer tester
Avatar

Joined: Sep 11 06
Posts: 749
ID: 1
Credit: 15,199
RAC: 0
Message 2306 - Posted 22 Jan 2007 21:18:59 UTC - in response to Message ID 2292 .

No unfortunately, no server-side solution yet; we've tried a couple of things already, but none worked without the ulimit workaround.

It cannot be that rosetta doesn't work with the ulimit set to unlimited; all Suse linux boxes come with this set to unlimited by default and I know many people that run rosetta fine with that. It must be something else on your machine that breaks it.

Andre

Is a new application for linux fixing this already in developement? Any update on progress?
Other projects like Rosetta crash with this ulimit setting on my ubuntu 6.10 computer :(


____________
D@H the greatest project in the world... a while from now!
Kenneth Larsen

Joined: Feb 7 07
Posts: 12
ID: 354
Credit: 66,929
RAC: 0
Message 2422 - Posted 7 Feb 2007 19:51:41 UTC - in response to Message ID 2306 .

Where do you set the ulimit -s unlimited when using Gentoo Linux? I don't have any run_manager or run_client, Gentoo uses /etc/init.d/boinc to start/stop Boinc. I guess it's a script but have no idea where to put the ulimit code. I've tried writing it directly in the boinc script, without success.

Any other Gentoo users here who can help me?

Augustine
Volunteer tester

Joined: Sep 13 06
Posts: 46
ID: 5
Credit: 143,502
RAC: 0
Message 2425 - Posted 7 Feb 2007 22:11:01 UTC - in response to Message ID 2422 .

Any other Gentoo users here who can help me?

See http://boinc.berkeley.edu/dev/forum_thread.php?id=703&nowrap=true#7813 .

HTH

____________
Kenneth Larsen

Joined: Feb 7 07
Posts: 12
ID: 354
Credit: 66,929
RAC: 0
Message 2430 - Posted 8 Feb 2007 11:53:59 UTC - in response to Message ID 2425 .

I can't get this to work at all - setting ulimit -s unlimited on the command line as super user works, though only for a short time. When I run ulimit -s a few minutes later, it's back to the initial 8192... Causing all work units to error out.

I've searched the Gentoo forums and google without luck, does anyone here know Gentoo and can help me? I'm running kernel version 2.6.18-gentoo-r6.

Kenneth Larsen

Joined: Feb 7 07
Posts: 12
ID: 354
Credit: 66,929
RAC: 0
Message 2431 - Posted 8 Feb 2007 14:17:38 UTC - in response to Message ID 2430 .

OK, I think I solved this myself: adding the ulimit code in /root/.bashrc seems to have solved it; one workunit has run for 1.5 hours now, without erroring out.

Kenneth Larsen

Joined: Feb 7 07
Posts: 12
ID: 354
Credit: 66,929
RAC: 0
Message 2432 - Posted 8 Feb 2007 16:33:59 UTC

And then the error came... this time the work unit crunched for almost 2 hours before erroring out. Ulimit now stays at unlimited, but it still hasn't helped. I've now put this project on standby on all Linux machines, until I get it fixed.
Any ideas what I can try now?

One thought: Ulimit is set for root, but Boinc runs as special user boinc. However, I can't log in as user boinc and change anything, so is it possible to set ulimit for this user in any way? Can this even be the problem?
Running ulimit as normal user shows the old 8192 value, so root does not set it for all users.

Profile Andre Kerstens
Forum moderator
Project tester
Volunteer tester
Avatar

Joined: Sep 11 06
Posts: 749
ID: 1
Credit: 15,199
RAC: 0
Message 2433 - Posted 8 Feb 2007 17:05:15 UTC - in response to Message ID 2432 .

As an experiment, could you run boinc under the root user? Just remove the 'su ${BOINC_USER} -c ' part of the start line.

Another thing to try is to give the boinc user a shell (in /etc/passwd change /bin/false to /bin/bash or so) and a password (use the passwd command). Then create a .bashrc file for the boinc user with the ulimit -s unlimited line and store that in the boinc user's homedir.

Thanks
Andre

And then the error came... this time the work unit crunched for almost 2 hours before erroring out. Ulimit now stays at unlimited, but it still hasn't helped. I've now put this project on standby on all Linux machines, until I get it fixed.
Any ideas what I can try now?

One thought: Ulimit is set for root, but Boinc runs as special user boinc. However, I can't log in as user boinc and change anything, so is it possible to set ulimit for this user in any way? Can this even be the problem?
Running ulimit as normal user shows the old 8192 value, so root does not set it for all users.


____________
D@H the greatest project in the world... a while from now!
Kenneth Larsen

Joined: Feb 7 07
Posts: 12
ID: 354
Credit: 66,929
RAC: 0
Message 2448 - Posted 9 Feb 2007 15:23:52 UTC

Finally it works! The solution was indeed to put the code in the start script, which runs as user "boinc". I did do this before but thought it didn't work because running ulimit -s showed 8192 as both root and normal user.

Thanks for the help everyone!

Profile Andre Kerstens
Forum moderator
Project tester
Volunteer tester
Avatar

Joined: Sep 11 06
Posts: 749
ID: 1
Credit: 15,199
RAC: 0
Message 2450 - Posted 9 Feb 2007 22:59:28 UTC - in response to Message ID 2448 .

It's probably more than a workaround than a real solution, but at least it works :-)

Cheers
Andre

Finally it works! The solution was indeed to put the code in the start script, which runs as user "boinc". I did do this before but thought it didn't work because running ulimit -s showed 8192 as both root and normal user.

Thanks for the help everyone!


____________
D@H the greatest project in the world... a while from now!
Profile Keck_Komputers
Volunteer tester
Avatar

Joined: Sep 13 06
Posts: 14
ID: 20
Credit: 1,263,209
RAC: 0
Message 2467 - Posted 13 Feb 2007 11:52:07 UTC

The new 5.8.11 client seems to allow the ulimit command to work again on my FC6 host.
____________
BOINC WIKI

BOINCing since 2002/12/8

Profile clownius
Volunteer tester
Avatar

Joined: Nov 14 06
Posts: 61
ID: 280
Credit: 2,677
RAC: 0
Message 2473 - Posted 14 Feb 2007 12:16:06 UTC

Is the ulimit command still required?
Im currently upgrading everything to 5.8.11 for my return to docking so it would be nice to know
____________

Profile suguruhirahara
Forum moderator
Volunteer tester
Avatar

Joined: Sep 13 06
Posts: 282
ID: 15
Credit: 56,614
RAC: 0
Message 2474 - Posted 14 Feb 2007 12:27:51 UTC - in response to Message ID 2473 .

Is the ulimit command still required?
Im currently upgrading everything to 5.8.11 for my return to docking so it would be nice to know

As the news in yesterday shows that issue was fixed in the version. As I didn't try it on linux I'm not fully sure, but probably so.
____________

I'm a volunteer participant; my views are not necessarily those of Docking@Home or its participating institutions.
Profile clownius
Volunteer tester
Avatar

Joined: Nov 14 06
Posts: 61
ID: 280
Credit: 2,677
RAC: 0
Message 2475 - Posted 14 Feb 2007 13:04:40 UTC

I decided to play it safe for now and upgraded my little farm to 5.8.11 and put the ulimit fix into all of them just in case lol
____________

j2satx
Volunteer tester

Joined: Dec 22 06
Posts: 183
ID: 339
Credit: 16,191,581
RAC: 0
Message 2477 - Posted 14 Feb 2007 16:44:21 UTC - in response to Message ID 2474 .
Last modified: 14 Feb 2007 16:45:51 UTC

Is the ulimit command still required?
Im currently upgrading everything to 5.8.11 for my return to docking so it would be nice to know

As the news in yesterday shows that issue was fixed in the version. As I didn't try it on linux I'm not fully sure, but probably so.


I took "ulimit" out of the script. D@H crashed the WU in 23 minutes.
Kenneth Larsen

Joined: Feb 7 07
Posts: 12
ID: 354
Credit: 66,929
RAC: 0
Message 2478 - Posted 14 Feb 2007 16:55:50 UTC - in response to Message ID 2477 .


I took "ulimit" out of the script. D@H crashed the WU in 23 minutes.


Before or after removing ulimit?
j2satx
Volunteer tester

Joined: Dec 22 06
Posts: 183
ID: 339
Credit: 16,191,581
RAC: 0
Message 2480 - Posted 14 Feb 2007 17:39:40 UTC - in response to Message ID 2478 .


I took "ulimit" out of the script. D@H crashed the WU in 23 minutes.


Before or after removing ulimit?


After.

Suspended all processes, removed "ulimit" from script, restarted processes.

D@H crashed.

Suspended all processes, put "ulimit" back in script, restarted processes.

Haven't got another D@H WU yet.

Profile suguruhirahara
Forum moderator
Volunteer tester
Avatar

Joined: Sep 13 06
Posts: 282
ID: 15
Credit: 56,614
RAC: 0
Message 2481 - Posted 14 Feb 2007 17:57:32 UTC

Allright, then, we need to add ulimit option in the files yet.
____________

I'm a volunteer participant; my views are not necessarily those of Docking@Home or its participating institutions.

j2satx
Volunteer tester

Joined: Dec 22 06
Posts: 183
ID: 339
Credit: 16,191,581
RAC: 0
Message 2482 - Posted 14 Feb 2007 18:03:17 UTC - in response to Message ID 2481 .
Last modified: 14 Feb 2007 18:03:49 UTC

Allright, then, we need to add ulimit option in the files yet.


Each new version (Linux) of BOINC seems to wipe out the old scripts and install a new blank.

Can you ask the devel team to only install blank scripts if no previous scripts exist?
Profile Andre Kerstens
Forum moderator
Project tester
Volunteer tester
Avatar

Joined: Sep 11 06
Posts: 749
ID: 1
Credit: 15,199
RAC: 0
Message 2492 - Posted 15 Feb 2007 0:19:56 UTC - in response to Message ID 2481 .

Correct. We still haven't found a served-side solution for this. To go to beta phase, we will need a solution though, so if anybody has any bright ideas.. :-)

Thanks
Andre

Allright, then, we need to add ulimit option in the files yet.


____________
D@H the greatest project in the world... a while from now!
Profile Andre Kerstens
Forum moderator
Project tester
Volunteer tester
Avatar

Joined: Sep 11 06
Posts: 749
ID: 1
Credit: 15,199
RAC: 0
Message 2493 - Posted 15 Feb 2007 0:20:30 UTC - in response to Message ID 2481 .

Good to see you back Suguru! Haven't seen you for a while.
AK

Allright, then, we need to add ulimit option in the files yet.


____________
D@H the greatest project in the world... a while from now!
Profile suguruhirahara
Forum moderator
Volunteer tester
Avatar

Joined: Sep 13 06
Posts: 282
ID: 15
Credit: 56,614
RAC: 0
Message 2494 - Posted 15 Feb 2007 2:24:46 UTC - in response to Message ID 2493 .

Good to see you back Suguru! Haven't seen you for a while.
AK

Allright, then, we need to add ulimit option in the files yet.


Yeah, I'm busier in my regular occupation. I've visited boards anyway;-)
____________

I'm a volunteer participant; my views are not necessarily those of Docking@Home or its participating institutions.
Profile David Ball
Forum moderator
Volunteer tester
Avatar

Joined: Sep 18 06
Posts: 274
ID: 115
Credit: 1,634,401
RAC: 0
Message 2496 - Posted 15 Feb 2007 5:53:34 UTC


I know it can be confusing to say that that 5.8.11 has a ulimit fix, but it doesn't fix the ulimit problem.

Basically, somewhere around 5.8.8, the boinc client from Berkeley was modified in a way that resulted in lowering the stack limit from unlimited to 500 MB (IIRC) and this broke Charmm so Andre asked them to change it.

FYI, I don't think Charmm actually uses 500 MB of stack, but I do think that setting the stack limit to unlimited changes the way a segment fault is handled. I suspect that Fortran is doing something that confuses the logic for growing the stack. Of course, that's just my opinion :-) I'm going through the kernel source to try to find out for sure.

____________
The views expressed are my own.
Facts are subject to memory error :-)
Have you read a good science fiction novel lately?

Profile Andre Kerstens
Forum moderator
Project tester
Volunteer tester
Avatar

Joined: Sep 11 06
Posts: 749
ID: 1
Credit: 15,199
RAC: 0
Message 2584 - Posted 26 Feb 2007 22:36:55 UTC - in response to Message ID 2496 .

Hi all,

We have finally found and implemented a fix for the stack limit problem! It's a simple (but initially non-obvious) solution where the boinc client starts a shell script that first sets the ulimit to unlimited and then starts Charmm. David Anderson attended us on the fact the boinc client can start shell scripts without any problems and this triggered the solution. So from charmm version 5.4 for linux (32 and 64 bit) you won't need the ulimit fix anymore :-)

This is probably a solution that we won't need anymore when we start using the new charmm version (no, it still hasn't arrived) as this version does not seem to have the stack limit problem.

Thanks for your patience on this.
Andre


I know it can be confusing to say that that 5.8.11 has a ulimit fix, but it doesn't fix the ulimit problem.

Basically, somewhere around 5.8.8, the boinc client from Berkeley was modified in a way that resulted in lowering the stack limit from unlimited to 500 MB (IIRC) and this broke Charmm so Andre asked them to change it.

FYI, I don't think Charmm actually uses 500 MB of stack, but I do think that setting the stack limit to unlimited changes the way a segment fault is handled. I suspect that Fortran is doing something that confuses the logic for growing the stack. Of course, that's just my opinion :-) I'm going through the kernel source to try to find out for sure.


____________
D@H the greatest project in the world... a while from now!
j2satx
Volunteer tester

Joined: Dec 22 06
Posts: 183
ID: 339
Credit: 16,191,581
RAC: 0
Message 2589 - Posted 26 Feb 2007 23:37:46 UTC - in response to Message ID 2584 .

Hi all,

We have finally found and implemented a fix for the stack limit problem! It's a simple (but initially non-obvious) solution where the boinc client starts a shell script that first sets the ulimit to unlimited and then starts Charmm. David Anderson attended us on the fact the boinc client can start shell scripts without any problems and this triggered the solution. So from charmm version 5.4 for linux (32 and 64 bit) you won't need the ulimit fix anymore :-)

This is probably a solution that we won't need anymore when we start using the new charmm version (no, it still hasn't arrived) as this version does not seem to have the stack limit problem.

Thanks for your patience on this.
Andre


I know it can be confusing to say that that 5.8.11 has a ulimit fix, but it doesn't fix the ulimit problem.

Basically, somewhere around 5.8.8, the boinc client from Berkeley was modified in a way that resulted in lowering the stack limit from unlimited to 500 MB (IIRC) and this broke Charmm so Andre asked them to change it.

FYI, I don't think Charmm actually uses 500 MB of stack, but I do think that setting the stack limit to unlimited changes the way a segment fault is handled. I suspect that Fortran is doing something that confuses the logic for growing the stack. Of course, that's just my opinion :-) I'm going through the kernel source to try to find out for sure.



Does this make having the ulimit in the script a problem now or we just don't have to use it in the future?
Profile Andre Kerstens
Forum moderator
Project tester
Volunteer tester
Avatar

Joined: Sep 11 06
Posts: 749
ID: 1
Credit: 15,199
RAC: 0
Message 2590 - Posted 26 Feb 2007 23:42:11 UTC - in response to Message ID 2589 .

This means you don't have to set it in your scripts in the future. It doesn't matter if these ulimit commands are in your scripts or not, it will just be set to unlimited twice (twinlimited ;-)

Thanks
Andre

Hi all,

We have finally found and implemented a fix for the stack limit problem! It's a simple (but initially non-obvious) solution where the boinc client starts a shell script that first sets the ulimit to unlimited and then starts Charmm. David Anderson attended us on the fact the boinc client can start shell scripts without any problems and this triggered the solution. So from charmm version 5.4 for linux (32 and 64 bit) you won't need the ulimit fix anymore :-)

This is probably a solution that we won't need anymore when we start using the new charmm version (no, it still hasn't arrived) as this version does not seem to have the stack limit problem.

Thanks for your patience on this.
Andre


I know it can be confusing to say that that 5.8.11 has a ulimit fix, but it doesn't fix the ulimit problem.

Basically, somewhere around 5.8.8, the boinc client from Berkeley was modified in a way that resulted in lowering the stack limit from unlimited to 500 MB (IIRC) and this broke Charmm so Andre asked them to change it.

FYI, I don't think Charmm actually uses 500 MB of stack, but I do think that setting the stack limit to unlimited changes the way a segment fault is handled. I suspect that Fortran is doing something that confuses the logic for growing the stack. Of course, that's just my opinion :-) I'm going through the kernel source to try to find out for sure.



Does this make having the ulimit in the script a problem now or we just don't have to use it in the future?


____________
D@H the greatest project in the world... a while from now!

Message boards : Unix/Linux : Howto: Dealing With 0x1 Error

Database Error
: The MySQL server is running with the --read-only option so it cannot execute this statement
array(3) {
  [0]=>
  array(7) {
    ["file"]=>
    string(47) "/boinc/projects/docking/html_v2/inc/db_conn.inc"
    ["line"]=>
    int(97)
    ["function"]=>
    string(8) "do_query"
    ["class"]=>
    string(6) "DbConn"
    ["object"]=>
    object(DbConn)#51 (2) {
      ["db_conn"]=>
      resource(120) of type (mysql link persistent)
      ["db_name"]=>
      string(7) "docking"
    }
    ["type"]=>
    string(2) "->"
    ["args"]=>
    array(1) {
      [0]=>
      &string(51) "update DBNAME.thread set views=views+1 where id=102"
    }
  }
  [1]=>
  array(7) {
    ["file"]=>
    string(48) "/boinc/projects/docking/html_v2/inc/forum_db.inc"
    ["line"]=>
    int(60)
    ["function"]=>
    string(6) "update"
    ["class"]=>
    string(6) "DbConn"
    ["object"]=>
    object(DbConn)#51 (2) {
      ["db_conn"]=>
      resource(120) of type (mysql link persistent)
      ["db_name"]=>
      string(7) "docking"
    }
    ["type"]=>
    string(2) "->"
    ["args"]=>
    array(3) {
      [0]=>
      object(BoincThread)#3 (16) {
        ["id"]=>
        string(3) "102"
        ["forum"]=>
        string(1) "6"
        ["owner"]=>
        string(2) "15"
        ["status"]=>
        string(1) "0"
        ["title"]=>
        string(29) "Howto: Dealing With 0x1 Error"
        ["timestamp"]=>
        string(10) "1172533331"
        ["views"]=>
        string(4) "2407"
        ["replies"]=>
        string(2) "45"
        ["activity"]=>
        string(23) "1.4702361237874999e-123"
        ["sufferers"]=>
        string(1) "0"
        ["score"]=>
        string(1) "0"
        ["votes"]=>
        string(1) "0"
        ["create_time"]=>
        string(10) "1163677926"
        ["hidden"]=>
        string(1) "0"
        ["sticky"]=>
        string(1) "0"
        ["locked"]=>
        string(1) "0"
      }
      [1]=>
      &string(6) "thread"
      [2]=>
      &string(13) "views=views+1"
    }
  }
  [2]=>
  array(7) {
    ["file"]=>
    string(63) "/boinc/projects/docking/html_v2/user/community/forum/thread.php"
    ["line"]=>
    int(184)
    ["function"]=>
    string(6) "update"
    ["class"]=>
    string(11) "BoincThread"
    ["object"]=>
    object(BoincThread)#3 (16) {
      ["id"]=>
      string(3) "102"
      ["forum"]=>
      string(1) "6"
      ["owner"]=>
      string(2) "15"
      ["status"]=>
      string(1) "0"
      ["title"]=>
      string(29) "Howto: Dealing With 0x1 Error"
      ["timestamp"]=>
      string(10) "1172533331"
      ["views"]=>
      string(4) "2407"
      ["replies"]=>
      string(2) "45"
      ["activity"]=>
      string(23) "1.4702361237874999e-123"
      ["sufferers"]=>
      string(1) "0"
      ["score"]=>
      string(1) "0"
      ["votes"]=>
      string(1) "0"
      ["create_time"]=>
      string(10) "1163677926"
      ["hidden"]=>
      string(1) "0"
      ["sticky"]=>
      string(1) "0"
      ["locked"]=>
      string(1) "0"
    }
    ["type"]=>
    string(2) "->"
    ["args"]=>
    array(1) {
      [0]=>
      &string(13) "views=views+1"
    }
  }
}
query: update docking.thread set views=views+1 where id=102