Work Units That Never Want To End

Message boards : Number crunching : Work Units That Never Want To End

Author	Message
Conan Volunteer tester Joined: Sep 13 06 Posts: 219 ID: 100 Credit: 4,256,493 RAC: 0	Message 4946 - Posted 29 Apr 2009 11:11:23 UTC Last modified: 29 Apr 2009 11:13:30 UTC
	I have had a couple of work units that showed no progress over the past month or so but either aborted them when found or restarted Boinc and then all was ok. And all has been OK for a while now, that is until today anyway. This Result was running in High Priority but no progress was being shown, no time or percentage done. When I aborted it the time then appeared of over 2 hours, but I have no idea if it was still running ok as nothing was happening in the Manager. However This Result , which was also running in High Priority, I caught at over 51 HOURS and only at 60% with 20 HOURS to go. Not the usual run time for a Docking WU. I aborted this one as well. Can management look into these results and let me know if their is a problem with the work units please. Thanks, Conan. ____________
	ID: 4946 \| Rating: 0 \| rate: /

Aganazzar Joined: Mar 28 09 Posts: 1 ID: 8961 Credit: 17,629,027 RAC: 0	Message 4949 - Posted 30 Apr 2009 0:30:52 UTC
	I am having a similar problem with every work unit that starts with "1hpv". They are all running as if they are working, but no progress is being shown. It normally takes my computer around 2 hours to complete a work unit, and these work units were running for 3 hours and still showing 0% progress. I exited the client, and restarted, and the same thing happened - it was showing CPU time but it was not showing any progress. I aborted a couple of the units and got some new ones (all starting with "1hpv") and the same thing happened. However, a "1hps" file ran fine.
	ID: 4949 \| Rating: 0 \| rate: /

Conan Volunteer tester Joined: Sep 13 06 Posts: 219 ID: 100 Credit: 4,256,493 RAC: 0	Message 4950 - Posted 30 Apr 2009 1:16:20 UTC Last modified: 30 Apr 2009 1:24:48 UTC
	I also can confirm that '1hpv' work units show the time going up but no progress. I hadn't taken notice of the WU type before, just how long they were running. With another 4 on a different machine doing the same thing after an hour of running, I will abort them as they appear to be just wasting my time with no benefit for me or the project. Have also found this happening on my Windows machine as well, with a WU at 7 hours and no progress, it has been aborted along with a quite a few others. As all I am getting on the Windows machine is '1hpv' work units, I have set it to No New Work, till the problem work units are removed. ____________
	ID: 4950 \| Rating: 0 \| rate: /

Wang Solutions Volunteer tester Joined: Nov 14 06 Posts: 5 ID: 272 Credit: 5,326,180 RAC: 0	Message 4952 - Posted 30 Apr 2009 9:28:30 UTC
	I can also confirm the same issue on both Windows and Linux, with 1hpv units running for 8 hours or more with zero progress and either no sign of completing, or a computation error after 8 or so hours. I have suspended all work on these. ____________ Proud member of BOINC@AUSTRALIA
	ID: 4952 \| Rating: 0 \| rate: /

TPRDroid Joined: Mar 29 09 Posts: 2 ID: 9049 Credit: 368,198 RAC: 0	Message 4953 - Posted 30 Apr 2009 10:41:17 UTC
	I can confirm the same problem with 1hpv units. All sticking on zero progress Droid
	ID: 4953 \| Rating: 0 \| rate: /

ohiomike Joined: Apr 28 09 Posts: 2 ID: 10461 Credit: 76,356 RAC: 0	Message 4954 - Posted 30 Apr 2009 10:42:08 UTC Last modified: 30 Apr 2009 11:17:32 UTC
	I also am seeing the 1hpv_mod.. WUs "hanging" as described above on one of my machines. Arch Linux on a Intel Q6600. All WUs are sitting @0 % using 100% CPU for >27,732 seconds then erroring out. (Normal runtime on this machine is apx 10,000 secs). Edit: All 1hpv work units on all platforms doing this now.....
	ID: 4954 \| Rating: 0 \| rate: /

adrianxw Volunteer tester Joined: Dec 30 06 Posts: 164 ID: 343 Credit: 1,669,741 RAC: 0	Message 4955 - Posted 30 Apr 2009 10:53:28 UTC Last modified: 30 Apr 2009 11:42:08 UTC
	This wu also a 1hpv one, is "running" but the time to completion is not changing and the progress bar is 0.000%. Suspended pending advice. What is the yellow atom in the graphic, Sulphur perhaps? It's valence state seems variable from protein to protein. Sorry, posted this earlier but wrong thread really. ____________ Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
	ID: 4955 \| Rating: 0 \| rate: /

TPR_Mojo Joined: Mar 26 09 Posts: 6 ID: 8777 Credit: 7,205,188 RAC: 0	Message 4956 - Posted 30 Apr 2009 13:21:29 UTC
	Exactly the same problem here, Linux and Windows. Cancelling the units clears the problems but new work is full of 1hpv, presumably because everyone is cancelling and so they are being resent. Have set the project to no new work and am crunching the other units in cache as normal. Please sort this out, and cancel the units at server level - how many people have not even noticed?
	ID: 4956 \| Rating: 0 \| rate: /

Beyond Joined: Feb 9 09 Posts: 8 ID: 6984 Credit: 3,132,056 RAC: 0	Message 4957 - Posted 30 Apr 2009 14:41:25 UTC
	1hpv WUs are doing the same thing here. Is there an admin reading this thread?
	ID: 4957 \| Rating: 0 \| rate: /

DGG Joined: Feb 6 09 Posts: 10 ID: 6857 Credit: 1,719,735 RAC: 0	Message 4958 - Posted 30 Apr 2009 15:05:09 UTC
	Getting some taks like an 1hbv... that went to 52.975% and quit showing progress and one 1hpv...that shows 0% all the time with now change to time to complete but CPU time is clocking up. Had another one I aborted prior to these two thinking it was just a bad WU. Can we get a response from someone at docking@home regarding this problem?
	ID: 4958 \| Rating: 0 \| rate: /

adrianxw Volunteer tester Joined: Dec 30 06 Posts: 164 ID: 343 Credit: 1,669,741 RAC: 0	Message 4959 - Posted 30 Apr 2009 15:16:35 UTC Last modified: 30 Apr 2009 15:22:11 UTC
	>>> Is there an admin reading this thread? Bear in mind the time differences, what is it where the team members are, how long have they had at work? ____________ Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
	ID: 4959 \| Rating: 0 \| rate: /

fractal Joined: Sep 3 08 Posts: 10 ID: 563 Credit: 1,285,769 RAC: 0	Message 4960 - Posted 30 Apr 2009 15:29:35 UTC - in response to Message ID 4956 .
	Exactly the same problem here, Linux and Windows. Cancelling the units clears the problems but new work is full of 1hpv, presumably because everyone is cancelling and so they are being resent. Actually, they are set to 0/1/1 so are canceled the first error. That said, I just noticed a bunch of red units in boincview. All hpv's. All ran 12 1/2 hrs before exceeding cpu, claiming 150 credits and getting none. So, the good news is ... they only fail once. The bad news is they run for 12 hrs instead of 2, and you get no credit.
	ID: 4960 \| Rating: 0 \| rate: /

TPR_Mojo Joined: Mar 26 09 Posts: 6 ID: 8777 Credit: 7,205,188 RAC: 0	Message 4961 - Posted 30 Apr 2009 15:32:16 UTC - in response to Message ID 4960 .
	Exactly the same problem here, Linux and Windows. Cancelling the units clears the problems but new work is full of 1hpv, presumably because everyone is cancelling and so they are being resent. Actually, they are set to 0/1/1 so are canceled the first error. That said, I just noticed a bunch of red units in boincview. All hpv's. All ran 12 1/2 hrs before exceeding cpu, claiming 150 credits and getting none. So, the good news is ... they only fail once. The bad news is they run for 12 hrs instead of 2, and you get no credit. Sorry my mistake, must have just had bad luck on the downloads then
	ID: 4961 \| Rating: 0 \| rate: /

KWSN glidersaur Joined: Apr 29 09 Posts: 2 ID: 10519 Credit: 4,679 RAC: 0	Message 4962 - Posted 30 Apr 2009 15:41:14 UTC
	I'm seeing the same problem with 1hpv wu's running on Vista. I have tried them all and wound up aborting them all and accepting no new work until the problem is corrected. Is anyone taking a look at this problem from the admin standpoint????
	ID: 4962 \| Rating: 0 \| rate: /

Beyond Joined: Feb 9 09 Posts: 8 ID: 6984 Credit: 3,132,056 RAC: 0	Message 4963 - Posted 30 Apr 2009 15:42:08 UTC - in response to Message ID 4959 .
	>>> Is there an admin reading this thread? Bear in mind the time differences, what is it where the team members are, how long have they had at work? It's been 28 hours since this problem was first posted, I doubt people sleep that long even in Delaware :-)
	ID: 4963 \| Rating: 0 \| rate: /

zpm Joined: Mar 13 09 Posts: 13 ID: 8257 Credit: 474,576 RAC: 0	Message 4964 - Posted 30 Apr 2009 16:27:43 UTC - in response to Message ID 4963 . Last modified: 30 Apr 2009 16:28:19 UTC
	looks like all the 1hpv wu are bad..... i'm also having same problems.... i've got 5 of them setting here, i've gone ahead and suspended them, better to let rosseta and others run and be able to report.....
	ID: 4964 \| Rating: 0 \| rate: /

TPRDroid Joined: Mar 29 09 Posts: 2 ID: 9049 Credit: 368,198 RAC: 0	Message 4968 - Posted 1 May 2009 4:36:47 UTC
	Can't get any decent units & no response from admins, so no new tasks set. Downloading Sztaki instead.
	ID: 4968 \| Rating: 0 \| rate: /

Michela Forum moderator Project administrator Project developer Project tester Project scientist Joined: Sep 13 06 Posts: 163 ID: 10 Credit: 97,083 RAC: 0	Message 4969 - Posted 1 May 2009 13:00:55 UTC - in response to Message ID 4968 .
	Dear All, sorry for the delay in answering. We are at the end of the semester and the last weeks of classes are taking most of our time. We will immediately stop the distribution of the task. Thanks a lot for your support and patience. More information will follow soon, Michela PS: Stay tuned with D@H :-) ____________ If you are interested in working on Docking@Home in a great group at UDel, contact me at 'taufer at acm dot org'!
	ID: 4969 \| Rating: 0 \| rate: /

Michela Forum moderator Project administrator Project developer Project tester Project scientist Joined: Sep 13 06 Posts: 163 ID: 10 Credit: 97,083 RAC: 0	Message 4970 - Posted 1 May 2009 13:14:56 UTC - in response to Message ID 4969 .
	We suspended the generation of jobs. We are looking at the problem and we expect to restart generation of more robust jobs in the next 5 hours. All the other functionalities of D@H are on. We still collect results and assign credits. Thank you to all for the indication of the problem. Michela ____________ If you are interested in working on Docking@Home in a great group at UDel, contact me at 'taufer at acm dot org'!
	ID: 4970 \| Rating: 0 \| rate: /

Krunchin-Keith [USA] Volunteer tester Joined: Sep 13 06 Posts: 41 ID: 4 Credit: 1,539,093 RAC: 0	Message 4973 - Posted 1 May 2009 16:20:46 UTC
	You need to cancel all 1hpv tasks on the server. As soon as I abort all 1hpv... tasks on my computer, it downloads more of the same.
	ID: 4973 \| Rating: 0 \| rate: /

John Hunt Volunteer tester Joined: Nov 14 06 Posts: 40 ID: 270 Credit: 114,129 RAC: 0	Message 4974 - Posted 1 May 2009 17:13:00 UTC Last modified: 1 May 2009 17:47:55 UTC
	Aborted all 1hpv - received new work 1hsg. Hope these run OK! <edit> Seems to be running OK after 30 mins. ____________
	ID: 4974 \| Rating: 0 \| rate: /

Kevint Joined: Jun 26 08 Posts: 10 ID: 389 Credit: 2,724,494 RAC: 0	Message 4976 - Posted 1 May 2009 20:08:59 UTC - in response to Message ID 4973 .
	You need to cancel all 1hpv tasks on the server. As soon as I abort all 1hpv... tasks on my computer, it downloads more of the same. Agreed, A server side abort would be the thing to do. Please
	ID: 4976 \| Rating: 0 \| rate: /

Trilce Estrada Forum moderator Project administrator Project developer Project tester Joined: Sep 19 06 Posts: 189 ID: 119 Credit: 1,217,236 RAC: 0	Message 4977 - Posted 1 May 2009 23:54:08 UTC - in response to Message ID 4976 .
	1hsg were tested and they don't have the problem of 1hpv. All 1hpv were canceled from the server, 1hbv (note the b instead of the p) are still around but they are running fine Thank you You need to cancel all 1hpv tasks on the server. As soon as I abort all 1hpv... tasks on my computer, it downloads more of the same. Agreed, A server side abort would be the thing to do. Please
	ID: 4977 \| Rating: 0 \| rate: /

Kevint Joined: Jun 26 08 Posts: 10 ID: 389 Credit: 2,724,494 RAC: 0	Message 4978 - Posted 2 May 2009 0:06:39 UTC
	Thank you. I have just too many hosts to check each one of them and abort just the bad WU's.
	ID: 4978 \| Rating: 0 \| rate: /

KWSN glidersaur Joined: Apr 29 09 Posts: 2 ID: 10519 Credit: 4,679 RAC: 0	Message 4979 - Posted 2 May 2009 7:13:07 UTC - in response to Message ID 4977 .
	[quote]1hsg were tested and they don't have the problem of 1hpv. All 1hpv were canceled from the server, 1hbv (note the b instead of the p) are still around but they are running fine[quote] 1hsg units are hanging up at 1.000% completion. Looks like these are bad too.
	ID: 4979 \| Rating: 0 \| rate: /

adrianxw Volunteer tester Joined: Dec 30 06 Posts: 164 ID: 343 Credit: 1,669,741 RAC: 0	Message 4980 - Posted 2 May 2009 12:58:42 UTC Last modified: 2 May 2009 13:00:12 UTC
	My current 1hsg wu, this one , has been crunching about an hour and says it is 30.700% done. ____________ Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
	ID: 4980 \| Rating: 0 \| rate: /

adrianxw Volunteer tester Joined: Dec 30 06 Posts: 164 ID: 343 Credit: 1,669,741 RAC: 0	Message 4981 - Posted 2 May 2009 18:53:56 UTC Last modified: 2 May 2009 18:55:45 UTC
	It finished without issue. A later wu also 1hsg sat at 1.000% for around 5 minutes, then continued. I say "Ni". ____________ Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
	ID: 4981 \| Rating: 0 \| rate: /

DGG Joined: Feb 6 09 Posts: 10 ID: 6857 Credit: 1,719,735 RAC: 0	Message 4982 - Posted 3 May 2009 1:19:28 UTC
	I've now had 2 of 1hsg units reach 43% and 45% and quit showing any progress for hours. I've also had 1 of the 1hbv die as well. I'm guessing the whole D@H batch of work unit is corrupted. Has anyone been able to finish a WU lately? Again suspending all D@H work as it's wasting hours of CPU time reaching a point of no progress without any results. I know the folks at D@H are busy with their own lives but it looks like they need to delete all their tasks and start again.
	ID: 4982 \| Rating: 0 \| rate: /

DGG Joined: Feb 6 09 Posts: 10 ID: 6857 Credit: 1,719,735 RAC: 0	Message 4983 - Posted 3 May 2009 1:34:23 UTC - in response to Message ID 4982 .
	I've now had 2 of 1hsg units reach 43% and 45% and quit showing any progress for hours. I've also had 1 of the 1hbv die as well. I'm guessing the whole D@H batch of work unit is corrupted. Has anyone been able to finish a WU lately? Again suspending all D@H work as it's wasting hours of CPU time reaching a point of no progress without any results. I know the folks at D@H are busy with their own lives but it looks like they need to delete all their tasks and start again. I just received an email from a friend who also does D@H and has told me his 1hsg units also stopped for several hours showing progress and then kicked back in to normal speed and finished. So maybe I'm just not being patient enough? I had aborted the 1hbv one and maybe that was premature. In the last 4 hours and 22 minutes one of my current 1hsgs hasn't moved at all and one has shown 0.055 increase in % of work done. Maybe these WUs have something special about them that makes them process slowly in the middle? Anyone else still having trouble with them or is it just me?
	ID: 4983 \| Rating: 0 \| rate: /

DGG Joined: Feb 6 09 Posts: 10 ID: 6857 Credit: 1,719,735 RAC: 0	Message 4984 - Posted 3 May 2009 4:52:33 UTC
	After several hours of very little noticable progress the 1hsg units I mentioned before have slowly started to show progress again. So I guess they are indeed running, just very slowly. Just a long period of not showing progress. I hope they don't error out at the end. I can normally finish a WU in about 8 hours but these are going to be around 14-16 hours before they are done if the current estimates are correct.
	ID: 4984 \| Rating: 0 \| rate: /

Michela Forum moderator Project administrator Project developer Project tester Project scientist Joined: Sep 13 06 Posts: 163 ID: 10 Credit: 97,083 RAC: 0	Message 4985 - Posted 3 May 2009 15:44:25 UTC - in response to Message ID 4984 .
	After several hours of very little noticable progress the 1hsg units I mentioned before have slowly started to show progress again. So I guess they are indeed running, just very slowly. Just a long period of not showing progress. I hope they don't error out at the end. I can normally finish a WU in about 8 hours but these are going to be around 14-16 hours before they are done if the current estimates are correct. The docking method we are currently using is taking more time. We tried to reduce the overall time by reducing the number of docking attempts per job. Also, some jobs may start with ligand conformations that do not really make any sense (from the science point of view) and the job may seems stack but it is actually searching for good ligand conformations. We are monitoring the situation and at the same time looking for possible changes in the docking method that can reduce docking delays. We will post more about the results we are collecting for 1hsg (and their accuracy) in the next 24 hours. The calibration and execution of docking simulations is for sure very challenging when the docking methods become more accurate. Hopefully using our simulations we will be able to help scientists to identify criteria to decide when sophisticated docking methods (like the one we are currently using) are really needed and when these sophisticated methods are not needed because simpler methods (like the one we use in the past) are still accurate (i.e., provide scientists with results that are meaningful). Thanks for the patience and commitment. Michela ____________ If you are interested in working on Docking@Home in a great group at UDel, contact me at 'taufer at acm dot org'!
	ID: 4985 \| Rating: 0 \| rate: /

Trilce Estrada Forum moderator Project administrator Project developer Project tester Joined: Sep 19 06 Posts: 189 ID: 119 Credit: 1,217,236 RAC: 0	Message 4986 - Posted 3 May 2009 16:06:03 UTC
	1hsg are longer than previous workunits (same number of conformations and rotations, but they take longer). Most of them take from 3.4 to 5.8 hours. Most of the results we are getting are valid, so, this batch of work is fine, just longer. Today we will move to 1htf, they are expected to be long as well
	ID: 4986 \| Rating: 0 \| rate: /

Trilce Estrada Forum moderator Project administrator Project developer Project tester Joined: Sep 19 06 Posts: 189 ID: 119 Credit: 1,217,236 RAC: 0	Message 4987 - Posted 3 May 2009 16:08:03 UTC
	1hsg are longer than previous workunits (same number of conformations and rotations, but they take longer). Most of them take from 3.4 to 5.8 hours. Most of the results we are getting are valid, so, this batch of work is fine, just longer. Today we will move to 1htf, they are expected to be long as well
	ID: 4987 \| Rating: 0 \| rate: /

DGG Joined: Feb 6 09 Posts: 10 ID: 6857 Credit: 1,719,735 RAC: 0	Message 4989 - Posted 4 May 2009 16:18:48 UTC
	Yes my 1hsgs did complete once they got over the hump of showing no progress for a while. I've finished both those I was concerned about. Just needed to be patient and wait a little longer. It takes my old Pentium 4 3GHz a good deal more time to complete than you mention so maybe it feels the increase a bit more than the more current CPUs.
	ID: 4989 \| Rating: 0 \| rate: /

kevin Project developer Project tester Joined: Aug 12 08 Posts: 10 ID: 393 Credit: 1,507 RAC: 0	Message 4990 - Posted 4 May 2009 19:08:22 UTC - in response to Message ID 4989 .
	Yes my 1hsgs did complete once they got over the hump of showing no progress for a while. I've finished both those I was concerned about. Just needed to be patient and wait a little longer. It takes my old Pentium 4 3GHz a good deal more time to complete than you mention so maybe it feels the increase a bit more than the more current CPUs. your logic seems right. for example, i'm running a core2duo 3.0ghz with 4gigs of ram. the newest complex, the 1htf, is taking me about 3 hours to complete. hang in there on those long tasks. :o)
	ID: 4990 \| Rating: 0 \| rate: /

robertmiles Joined: Apr 16 09 Posts: 96 ID: 9967 Credit: 1,290,747 RAC: 0	Message 4994 - Posted 5 May 2009 21:26:38 UTC Last modified: 5 May 2009 21:30:07 UTC
	I often saw something similar on another BOINC project where the part of the programs for actually doing the work was working properly, but in a way that made reporting how much work was already done so difficult that there were problems with measuring progress. Could the problems be related to that?
	ID: 4994 \| Rating: 0 \| rate: /

Conan Volunteer tester Joined: Sep 13 06 Posts: 219 ID: 100 Credit: 4,256,493 RAC: 0	Message 4996 - Posted 7 May 2009 4:23:05 UTC - in response to Message ID 4987 . Last modified: 7 May 2009 4:28:18 UTC
	1hsg are longer than previous workunits (same number of conformations and rotations, but they take longer). Most of them take from 3.4 to 5.8 hours. Most of the results we are getting are valid, so, this batch of work is fine, just longer. Today we will move to 1htf, they are expected to be long as well Well I just aborted 3 of this type (1hsg) as all had reached 1 hour 6 minutes or there abouts but were not doing anything. Running High Priority, time to completion still going up (I think) but CPU time was not moving on all three, suspended, resumed still not moving so killed them. Also there seems to be a bug in the result tables, after a job is aborted and sent back it shows that it ran for ZERO time, this was on all three (that all ran for over an hour) of the ones I have just reported. And I noticed also on a previous faulty WU (from the ones that were cancelled) that had run for over 51 HOURS and it showed Zero time as well, unsure how you can fault find when this is happening. ____________
	ID: 4996 \| Rating: 0 \| rate: /

Trilce Estrada Forum moderator Project administrator Project developer Project tester Joined: Sep 19 06 Posts: 189 ID: 119 Credit: 1,217,236 RAC: 0	Message 4997 - Posted 7 May 2009 14:57:02 UTC - in response to Message ID 4996 .
	My guess is that it is the BOINC policy for aborted WUs, let me see if I can find where to change it, so in the case of 1hpv, people is not penalized Thank you 1hsg are longer than previous workunits (same number of conformations and rotations, but they take longer). Most of them take from 3.4 to 5.8 hours. Most of the results we are getting are valid, so, this batch of work is fine, just longer. Today we will move to 1htf, they are expected to be long as well Well I just aborted 3 of this type (1hsg) as all had reached 1 hour 6 minutes or there abouts but were not doing anything. Running High Priority, time to completion still going up (I think) but CPU time was not moving on all three, suspended, resumed still not moving so killed them. Also there seems to be a bug in the result tables, after a job is aborted and sent back it shows that it ran for ZERO time, this was on all three (that all ran for over an hour) of the ones I have just reported. And I noticed also on a previous faulty WU (from the ones that were cancelled) that had run for over 51 HOURS and it showed Zero time as well, unsure how you can fault find when this is happening.
	ID: 4997 \| Rating: 0 \| rate: /

shauge Joined: Oct 24 08 Posts: 1 ID: 2889 Credit: 10,183,200 RAC: 0	Message 5007 - Posted 10 May 2009 20:58:03 UTC - in response to Message ID 4946 .
	I got a "1hvk" task that had run for 104 hours and had 140 hours left before estimated finish, before I aborted it. I guess this is CPU time (electricity/money) down the drain :(
	ID: 5007 \| Rating: 0 \| rate: /

Ron Joined: Apr 28 09 Posts: 1 ID: 10476 Credit: 21,547 RAC: 0	Message 5041 - Posted 28 May 2009 1:57:00 UTC
	Having same issue now with 1ohr_mod0014_45* WU's. aborting them also.
	ID: 5041 \| Rating: 0 \| rate: /

Trilce Estrada Forum moderator Project administrator Project developer Project tester Joined: Sep 19 06 Posts: 189 ID: 119 Credit: 1,217,236 RAC: 0	Message 5043 - Posted 28 May 2009 17:19:49 UTC
	Hi Ron and Shauge, I wonder if you have the id for those workuints, so that we can repeat the experiment under controlled conditions to see if we can discover the cause of that error. Many thanks for letting us know about this issue, we will try to monitor long workunits Best wishes
	ID: 5043 \| Rating: 0 \| rate: /

Chamberlain Joined: Oct 4 09 Posts: 4 ID: 19348 Credit: 405,446 RAC: 0	Message 5567 - Posted 2 Dec 2009 4:09:51 UTC
	Not sure where to post this but I am running BOINC With SETI@HOME and DOCKING@HOME loaded and running. SETI seems to run like it should. The work unit in progress shows progress, elapsed and time to completion and it runs. DOCKING, however, shows no progress bar, and the elapsed time is way past what it should have taken to complete the work unit, 14hrs. The time to completion is sitting at what it was when the work unit started around 3:32 to complete. I even suspended work on SETI and Docking isnt getting anywhere. I first noticed this when I noted a message that said my units were overdue and probably wouldnt be counted as completed. I have already done the normal uninstall and reinstall of BOINC, etc. Whats wrong?
	ID: 5567 \| Rating: 0 \| rate: /

kd55 Joined: Sep 21 08 Posts: 3 ID: 1086 Credit: 40,624 RAC: 0	Message 5568 - Posted 2 Dec 2009 20:19:17 UTC
	Just as Chamberlain reported, I have been running D@H along with 2 other projects. For the past 4 weeks, all docking work units are running at least 12-14 hours with no progress shown. I have had to abort every one of the units so other work would continue. World Community and Rosetta are functioning properly. Windows 7, x64 / BOINC 6.10.18 / Charmm 34a2 6.23 (most recent work unit). Thanks, KJ
	ID: 5568 \| Rating: 0 \| rate: /

Chamberlain Joined: Oct 4 09 Posts: 4 ID: 19348 Credit: 405,446 RAC: 0	Message 5580 - Posted 18 Dec 2009 3:56:12 UTC
	As noted in my original post, I continue to not complete work units or show any percent of completion with time elapsed on this particular project. Since I was running BOINC on windows 7 x64 I decided to try adding Docking to my laptop running BOINC on XP x32. Surprising, the unit showed work, elapsed time, and percent completed right away. I would then surmise that it is Docking at Home having issues running on x64 version of BOINC or BOINC running on x64 Windows7. I have to say that since no one has posted any response to my issues, I feel it necessary to drop Docking at Home and run those projects that work, and that I get support for. It bothers me that I am donating my extra PC cycles to a project, have a problem that others also seem to have and I dont even get some kind of confirmation that someone acknowledges my issue. I am very disappointed. Good luck, I will be closing out my account and donating my cycel to someone who cares. Sincerely, ex participant.
	ID: 5580 \| Rating: 0 \| rate: /

skgiven Joined: Oct 10 08 Posts: 10 ID: 2331 Credit: 3,721,673 RAC: 0	Message 5582 - Posted 18 Dec 2009 9:23:15 UTC
	I had 3 tasks that ran for 21h, 19h and 17h on a Q9400 @ 3.46gHz. All tasks showed 0.000% Progress. Charmm 34a2 6.23 Applications, 1k1i_89_mod0014trypsin_18046_294560_0 1k1i_89_mod0014trypsin_18910_123518_0 1k1i_89_mod0014trypsin_19550_291072_0 I exited Boinc and started Boinc again. All task time went back to zero, and again no progress was made. Bye-bye!
	ID: 5582 \| Rating: 0 \| rate: /

Message boards : Number crunching : Work Units That Never Want To End

Database Error
: The MySQL server is running with the --read-only option so it cannot execute this statement

array(3) {
  [0]=>
  array(7) {
    ["file"]=>
    string(47) "/boinc/projects/docking/html_v2/inc/db_conn.inc"
    ["line"]=>
    int(97)
    ["function"]=>
    string(8) "do_query"
    ["class"]=>
    string(6) "DbConn"
    ["object"]=>
    object(DbConn)#50 (2) {
      ["db_conn"]=>
      resource(192) of type (mysql link persistent)
      ["db_name"]=>
      string(7) "docking"
    }
    ["type"]=>
    string(2) "->"
    ["args"]=>
    array(1) {
      [0]=>
      &string(51) "update DBNAME.thread set views=views+1 where id=424"
    }
  }
  [1]=>
  array(7) {
    ["file"]=>
    string(48) "/boinc/projects/docking/html_v2/inc/forum_db.inc"
    ["line"]=>
    int(60)
    ["function"]=>
    string(6) "update"
    ["class"]=>
    string(6) "DbConn"
    ["object"]=>
    object(DbConn)#50 (2) {
      ["db_conn"]=>
      resource(192) of type (mysql link persistent)
      ["db_name"]=>
      string(7) "docking"
    }
    ["type"]=>
    string(2) "->"
    ["args"]=>
    array(3) {
      [0]=>
      object(BoincThread)#3 (16) {
        ["id"]=>
        string(3) "424"
        ["forum"]=>
        string(1) "2"
        ["owner"]=>
        string(3) "100"
        ["status"]=>
        string(1) "0"
        ["title"]=>
        string(33) "Work Units That Never Want To End"
        ["timestamp"]=>
        string(10) "1261128195"
        ["views"]=>
        string(4) "1007"
        ["replies"]=>
        string(2) "44"
        ["activity"]=>
        string(22) "5.2375679589370004e-80"
        ["sufferers"]=>
        string(1) "0"
        ["score"]=>
        string(1) "0"
        ["votes"]=>
        string(1) "0"
        ["create_time"]=>
        string(10) "1241003483"
        ["hidden"]=>
        string(1) "0"
        ["sticky"]=>
        string(1) "0"
        ["locked"]=>
        string(1) "0"
      }
      [1]=>
      &string(6) "thread"
      [2]=>
      &string(13) "views=views+1"
    }
  }
  [2]=>
  array(7) {
    ["file"]=>
    string(63) "/boinc/projects/docking/html_v2/user/community/forum/thread.php"
    ["line"]=>
    int(184)
    ["function"]=>
    string(6) "update"
    ["class"]=>
    string(11) "BoincThread"
    ["object"]=>
    object(BoincThread)#3 (16) {
      ["id"]=>
      string(3) "424"
      ["forum"]=>
      string(1) "2"
      ["owner"]=>
      string(3) "100"
      ["status"]=>
      string(1) "0"
      ["title"]=>
      string(33) "Work Units That Never Want To End"
      ["timestamp"]=>
      string(10) "1261128195"
      ["views"]=>
      string(4) "1007"
      ["replies"]=>
      string(2) "44"
      ["activity"]=>
      string(22) "5.2375679589370004e-80"
      ["sufferers"]=>
      string(1) "0"
      ["score"]=>
      string(1) "0"
      ["votes"]=>
      string(1) "0"
      ["create_time"]=>
      string(10) "1241003483"
      ["hidden"]=>
      string(1) "0"
      ["sticky"]=>
      string(1) "0"
      ["locked"]=>
      string(1) "0"
    }
    ["type"]=>
    string(2) "->"
    ["args"]=>
    array(1) {
      [0]=>
      &string(13) "views=views+1"
    }
  }
}

query: update docking.thread set views=views+1 where id=424