Not getting work


Advanced search

Message boards : Number crunching : Not getting work

Sort
Author Message
j2satx
Volunteer tester

Joined: Dec 22 06
Posts: 183
ID: 339
Credit: 16,191,581
RAC: 0
Message 4555 - Posted 4 Nov 2008 16:29:02 UTC

Some puters running dry.

biodoc

Joined: Nov 1 08
Posts: 3
ID: 3235
Credit: 253,162
RAC: 0
Message 4556 - Posted 4 Nov 2008 17:01:30 UTC

Same here. No work for one of my computers.

Profile Trilce Estrada
Forum moderator
Project administrator
Project developer
Project tester

Joined: Sep 19 06
Posts: 189
ID: 119
Credit: 1,217,236
RAC: 0
Message 4569 - Posted 5 Nov 2008 15:18:45 UTC

I had to reduce the amount of work because we change the protein and I was expecting some nasty problems. I didn't want to distribute many of those. Now it seems to be stable and we are going back to our usual generation of work.

Thank you

j2satx
Volunteer tester

Joined: Dec 22 06
Posts: 183
ID: 339
Credit: 16,191,581
RAC: 0
Message 4579 - Posted 6 Nov 2008 16:26:33 UTC - in response to Message ID 4569 .

I had to reduce the amount of work because we change the protein and I was expecting some nasty problems. I didn't want to distribute many of those. Now it seems to be stable and we are going back to our usual generation of work.

Thank you


No work available.
Profile adrianxw
Volunteer tester
Avatar

Joined: Dec 30 06
Posts: 164
ID: 343
Credit: 1,669,741
RAC: 0
Message 4580 - Posted 6 Nov 2008 20:09:50 UTC
Last modified: 6 Nov 2008 20:18:34 UTC

Opposite problem! One of my machines has suddenly been flooded with wu's, (flooded in relative terms, a lot more then usual shall we say - couple of pages), all claiming to have a run time of 00:03:08.

The one running at the moment has been running 00:03:00 and is showing 3.356% complete and the time to completion is going up at the same rate as the CPU time.

Errrr....

*** EDIT ***

They do seem to be finishing very quickly. The percentage completion wanders up to 2-3% then jumps to 100%. The time to completion is nonsense.

Weird things are occurring. Looks to be happening to many though.
____________
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.

P . P . L .
Avatar

Joined: Oct 20 08
Posts: 69
ID: 2725
Credit: 1,000,979
RAC: 0
Message 4581 - Posted 6 Nov 2008 21:17:35 UTC

I've also had some tasks run for less than 2 min's, is there a problem

with these tasks. I haven't seen any error messages.

pete.

____________


Profile Conan
Volunteer tester
Avatar

Joined: Sep 13 06
Posts: 219
ID: 100
Credit: 4,256,493
RAC: 0
Message 4583 - Posted 7 Nov 2008 1:04:02 UTC - in response to Message ID 4581 .

I've also had some tasks run for less than 2 min's, is there a problem

with these tasks. I haven't seen any error messages.

pete.


G'Day Pete,

Doesn't appear to be a problem, just different WU type from previous (new 1c1r, old 1t7k which took over 2 hours for me).
Mine run from 100 odd seconds to over 6,000 seconds, for the same WU type.
This happens on both Windows and Linux.

The amount of credit awarded for the new work is still the same as before (quite low), so this has not been adjusted like indicated by Michela, still waiting for that change.
____________
Profile Trilce Estrada
Forum moderator
Project administrator
Project developer
Project tester

Joined: Sep 19 06
Posts: 189
ID: 119
Credit: 1,217,236
RAC: 0
Message 4585 - Posted 7 Nov 2008 2:04:39 UTC

There is no error with those workunit finishing too quickly, at least not from you. It is because the new protein tends to have more energy violations than the previous one and many conformations fall into this energy violation and return too quickly. we change the tolerance to try more times before to return this energy violation, and also we increased the threshold of the energy before to be considered a violation from 1200 to 5000, but I'm not sure it is taking effect.

Well the message is: there is no error from the BOINC clients, is the specific protein-ligand the problem, we are trying to find how to be more tolerant to those energy violations, but is gonna take some time.

P . P . L .
Avatar

Joined: Oct 20 08
Posts: 69
ID: 2725
Credit: 1,000,979
RAC: 0
Message 4586 - Posted 7 Nov 2008 2:36:38 UTC

G'day Trilce.

Thanks for that, it's good to know.

pete.

____________


P . P . L .
Avatar

Joined: Oct 20 08
Posts: 69
ID: 2725
Credit: 1,000,979
RAC: 0
Message 4662 - Posted 30 Dec 2008 4:29:24 UTC

Hi. Just to let you know if you don't already.

I seem to be getting a lot of these again, most finish quickly between

1 min & 20 min showing the same in my messages. On both rigs.

Calling BOINC init.
Starting charmm run (initial or from checkpoint)...
WARNING - Total energy change exceeded . Code101.
Resolving file charmm.out...
Calling BOINC finish.

pete.





____________


Test

Joined: Aug 5 08
Posts: 4
ID: 392
Credit: 0
RAC: 0
Message 4668 - Posted 4 Jan 2009 2:18:10 UTC - in response to Message ID 4662 .

Yes, I know =( they will continue probably. It is because of the structure of the ligands. I hope the next round will be better. Once we finish the trypsin we will use a different protein and model and hopefully the workunits will be longer

Thank you anyway

Trilce

P . P . L .
Avatar

Joined: Oct 20 08
Posts: 69
ID: 2725
Credit: 1,000,979
RAC: 0
Message 4734 - Posted 17 Jan 2009 1:58:58 UTC

Hi.

Seems like you have run out of new work!

Server says 0 tasks.

pete.

____________


P . P . L .
Avatar

Joined: Oct 20 08
Posts: 69
ID: 2725
Credit: 1,000,979
RAC: 0
Message 4807 - Posted 28 Jan 2009 2:07:43 UTC

Hi.

Looks like we have ran out of work again.

pete.

____________


P . P . L .
Avatar

Joined: Oct 20 08
Posts: 69
ID: 2725
Credit: 1,000,979
RAC: 0
Message 4893 - Posted 18 Mar 2009 4:48:59 UTC

Hi again.

Looks like you've ran out of work again.

Wed 18 Mar 2009 15:46:25 EST|Docking@Home|Sending scheduler request: To fetch work. Requesting 2617 seconds of work, reporting 0 completed tasks

Wed 18 Mar 2009 15:46:30 EST|Docking@Home|Scheduler request succeeded: got 0 new tasks

pete.


____________


Profile Trilce Estrada
Forum moderator
Project administrator
Project developer
Project tester

Joined: Sep 19 06
Posts: 189
ID: 119
Credit: 1,217,236
RAC: 0
Message 4894 - Posted 19 Mar 2009 17:23:57 UTC

Its is gonna be a low production in the following two days because we changed the model and we want to be sure everything is working as supposed to be. We don't want to discover that there is an error and that a massive amount of work ahs to be restarted. So far everything looks fine, and we will increase the work produced slowly.

Sorry for the inconvenience, but thank you for your patience =)

Message boards : Number crunching : Not getting work

Database Error
: The MySQL server is running with the --read-only option so it cannot execute this statement
array(3) {
  [0]=>
  array(7) {
    ["file"]=>
    string(47) "/boinc/projects/docking/html_v2/inc/db_conn.inc"
    ["line"]=>
    int(97)
    ["function"]=>
    string(8) "do_query"
    ["class"]=>
    string(6) "DbConn"
    ["object"]=>
    object(DbConn)#20 (2) {
      ["db_conn"]=>
      resource(90) of type (mysql link persistent)
      ["db_name"]=>
      string(7) "docking"
    }
    ["type"]=>
    string(2) "->"
    ["args"]=>
    array(1) {
      [0]=>
      &string(51) "update DBNAME.thread set views=views+1 where id=366"
    }
  }
  [1]=>
  array(7) {
    ["file"]=>
    string(48) "/boinc/projects/docking/html_v2/inc/forum_db.inc"
    ["line"]=>
    int(60)
    ["function"]=>
    string(6) "update"
    ["class"]=>
    string(6) "DbConn"
    ["object"]=>
    object(DbConn)#20 (2) {
      ["db_conn"]=>
      resource(90) of type (mysql link persistent)
      ["db_name"]=>
      string(7) "docking"
    }
    ["type"]=>
    string(2) "->"
    ["args"]=>
    array(3) {
      [0]=>
      object(BoincThread)#3 (16) {
        ["id"]=>
        string(3) "366"
        ["forum"]=>
        string(1) "2"
        ["owner"]=>
        string(3) "339"
        ["status"]=>
        string(1) "0"
        ["title"]=>
        string(16) "Not getting work"
        ["timestamp"]=>
        string(10) "1237483437"
        ["views"]=>
        string(3) "710"
        ["replies"]=>
        string(2) "14"
        ["activity"]=>
        string(18) "6.968811666444e-92"
        ["sufferers"]=>
        string(1) "0"
        ["score"]=>
        string(1) "0"
        ["votes"]=>
        string(1) "0"
        ["create_time"]=>
        string(10) "1225816142"
        ["hidden"]=>
        string(1) "0"
        ["sticky"]=>
        string(1) "0"
        ["locked"]=>
        string(1) "0"
      }
      [1]=>
      &string(6) "thread"
      [2]=>
      &string(13) "views=views+1"
    }
  }
  [2]=>
  array(7) {
    ["file"]=>
    string(63) "/boinc/projects/docking/html_v2/user/community/forum/thread.php"
    ["line"]=>
    int(184)
    ["function"]=>
    string(6) "update"
    ["class"]=>
    string(11) "BoincThread"
    ["object"]=>
    object(BoincThread)#3 (16) {
      ["id"]=>
      string(3) "366"
      ["forum"]=>
      string(1) "2"
      ["owner"]=>
      string(3) "339"
      ["status"]=>
      string(1) "0"
      ["title"]=>
      string(16) "Not getting work"
      ["timestamp"]=>
      string(10) "1237483437"
      ["views"]=>
      string(3) "710"
      ["replies"]=>
      string(2) "14"
      ["activity"]=>
      string(18) "6.968811666444e-92"
      ["sufferers"]=>
      string(1) "0"
      ["score"]=>
      string(1) "0"
      ["votes"]=>
      string(1) "0"
      ["create_time"]=>
      string(10) "1225816142"
      ["hidden"]=>
      string(1) "0"
      ["sticky"]=>
      string(1) "0"
      ["locked"]=>
      string(1) "0"
    }
    ["type"]=>
    string(2) "->"
    ["args"]=>
    array(1) {
      [0]=>
      &string(13) "views=views+1"
    }
  }
}
query: update docking.thread set views=views+1 where id=366