Also noticed we are getting lots of download errors in the log. Most results seem to fail :-(
Michela mentioned we need upgrading to the latest BOINC version this weekend as many fields the client sends back to us are not recognized. I guess lots of work to do!
Weee! What HR class were they for? I can't get them with my macs or windows boxes.
They were gone in a second :-) Mostly Windows XP boxes as far as I can see....
I was getting the "there was work, but it was committed to other platforms" on both my macs and my XP machines.
____________
D@H the greatest project in the world... a while from now!
Also noticed we are getting lots of download errors in the log. Most results seem to fail :-(
Michela mentioned we need
upgrading to the latest BOINC version this weekend
as many fields the client sends back to us are not recognized. I guess lots of work to do!
Yes!! I'll finally be able to merge some duplicate hosts. Cool beans.
____________
Dublin, CA
Team
SETI.USA
[file_xfer] Started download of file charmm_5.7_windows_intelx86
[file_xfer] Temporarily failed download of charmm_5.7_windows_intelx86: file not found
Backing off 10 min 31 sec on download of file charmm_5.7_windows_intelx86
Zombie67 and j2satx, we just sent out some new wus and we saw that you got them. Please let us know it it worked on your pcs without any errors. Thanks.
Zombie67 and j2satx, we just sent out some new wus and we saw that you got them. Please let us know it it worked on your pcs without any errors. Thanks.
1st: Thanks for upgrading the server! I have now successfully merged my duplicate machines!
2nd: I got two tasks:
82
failed downloading:
<core_client_version>5.10.41</core_client_version>
<![CDATA[
<message>
app_version download error: couldn't get input files:
<file_xfer_error>
<file_name>charmm_5.7_windows_x86_64</file_name>
<error_code>-224</error_code>
<error_message>file not found</error_message>
</file_xfer_error>
</message>
]]>
86
seems to be stuck downloading. I am getting the error message "Temporarily failed download of charmm_5.7_windows_x86_64: File not found" This is an XP64 machine, with 5.10.13. It should be getting the 32 bit app if a 64 bit app is not available. Do you want me to cancel it?
____________
Dublin, CA
Team
SETI.USA
ID:
3759 | Rating: 0
| rate:
/
Abel
Forum moderator
Project administrator
Project developer
Project tester
Zombie67 and j2satx, we just sent out some new wus and we saw that you got them. Please let us know it it worked on your pcs without any errors. Thanks.
1st: Thanks for upgrading the server! I have now successfully merged my duplicate machines!
2nd: I got two tasks:
82
failed downloading:
<core_client_version>5.10.41</core_client_version>
<![CDATA[
<message>
app_version download error: couldn't get input files:
<file_xfer_error>
<file_name>charmm_5.7_windows_x86_64</file_name>
<error_code>-224</error_code>
<error_message>file not found</error_message>
</file_xfer_error>
</message>
]]>
86
seems to be stuck downloading. I am getting the error message "Temporarily failed download of charmm_5.7_windows_x86_64: File not found" This is an XP64 machine, with 5.10.13. It should be getting the 32 bit app if a 64 bit app is not available. Do you want me to cancel it?
Zombie67 and j2satx, we just sent out some new wus and we saw that you got them. Please let us know it it worked on your pcs without any errors. Thanks.
I don't find any on my machines.
Which computer(s) got a WU?
edit: never mind, found the two that failed downloading.
Don't be wimpy, throw a thousand over the fence instead of ten.......LOL
OK. We've upgraded the version number of all apps to 5.8 (same app as 5.7) to see if it is a problem with the generation and things are still pointing to utep. I've seen that at least zombie67 got a WU again. Let us know if this one downloads...
Thanks
Andre
[edit] See that Memo and Tank_Master got one too.
____________
D@H the greatest project in the world... a while from now!
OK. We've upgraded the version number of all apps to 5.8 (same app as 5.7) to see if it is a problem with the generation and things are still pointing to utep. I've seen that at least zombie67 got a WU again. Let us know if this one downloads...
Thanks
Andre
[edit] See that Memo and Tank_Master got one too.
It D/L successfully, and is crunching away on my XP64 machine. Fingers crossed!
I've gotten WU's for XP, Vista, Ubuntu, and Centos with no problems so far. I sure am glad to see docking back :D
____________
The views expressed are my own.
Facts are subject to memory error :-)
Have you read a good science fiction novel lately?
An hour ago I got a big bunch of the new WUs under openSuse 10.2 64-Bit.
But as the very first one started some minutes ago it errored out, the complete rest did follow.
That is the text from the result files:
<core_client_version>5.10.39</core_client_version>
<![CDATA[
<message>
process exited with code 131 (0x83, -125)
</message>
<stderr_txt>
Calling BOINC init.
Starting charmm run (initial or from checkpoint)...
SIGSEGV: segmentation violation
Stack trace (3 frames):
[0x83699a5]
[0x83e0de0]
[0xffffe500]
Exiting...
</stderr_txt>
]]>
____________
Life is Science, and Science rules. To the universe and beyond
Proud member of
BOINC@Heidelberg
Hi,
22 WU picked-up with Mac-intel.
One returned, few others will be returned by tuesday.
It looks good, but application running on Mac-intel seems needed more CPU time than Windows version.
____________
The faster Mac is over twice as slow, which is very odd. On windows, old legacy processors must be supported, which means you can't take advantage of things like SSE/2/3. But with OSX/Intel, there is no need for legacy support. The oldest processors support at least SSE3. So the OSX app could be written to take advantage of those things. So it should be at least as fast as the Win app, if not faster.
____________
Dublin, CA
Team
SETI.USA
it's the same app that we used at utep, incl all the same issues. If I remember correctly, we had the same 'slow' issue there. People are working on new versions of charmm, so it wouldn't make a whole lot of sense to start hacking and fixing the old ones. Hope you'll have some patience; after all we only put a 1000 wu's out there :-)
Thanks!
Andre
It looks good, but application running on Mac-intel seems needed more CPU time than Windows version.
Agreed. The crunch time is over double with OSX/Intel, compared to Windows on a similar processor.
The faster Mac is over twice as slow, which is very odd. On windows, old legacy processors must be supported, which means you can't take advantage of things like SSE/2/3. But with OSX/Intel, there is no need for legacy support. The oldest processors support at least SSE3. So the OSX app could be written to take advantage of those things. So it should be at least as fast as the Win app, if not faster.
____________
D@H the greatest project in the world... a while from now!
Thanks for the work units, unfortunately they are appear to be faulty.
Had 9 WU's on an AMD Linux machine all error out with the same message that DoctorNow received
<core_client_version>5.10.21</core_client_version>
<![CDATA[
<message>
process exited with code 131 (0x83, -125)
</message>
<stderr_txt>
Calling BOINC init.
Starting charmm run (initial or from checkpoint)...
SIGSEGV: segmentation violation
Stack trace (3 frames):
[0x83699a5]
[0x83e0de0]
[0xffffe500]
Exiting...
Also the HR must not be turned on as I was thrown on with Intel Linux machines.
EDIT:: Just checked and none of my partners have the same technolgy as me, including being paired with Windows XP and Vista, Intel P4 and Quads.
Thanks for the work units, unfortunately they are appear to be faulty.
Had 9 WU's on an AMD Linux machine all error out with the same message that DoctorNow received
<core_client_version>5.10.21</core_client_version>
<![CDATA[
<message>
process exited with code 131 (0x83, -125)
</message>
<stderr_txt>
Calling BOINC init.
Starting charmm run (initial or from checkpoint)...
SIGSEGV: segmentation violation
Stack trace (3 frames):
[0x83699a5]
[0x83e0de0]
[0xffffe500]
Exiting...
Also the HR must not be turned on as I was thrown on with Intel Linux machines.
EDIT:: Just checked and none of my partners have the same technolgy as me, including being paired with Windows XP and Vista, Intel P4 and Quads.
BOINC changed the way HR is specified in the config.xml file. We'll have to change it and restart the server. The other stuff we will have to trouble shoot. We'll keep you posted.
My G5 has crunched three tasks without apparent problems, but I notice none of my quorum partners are even Macs, let alone PPC. Will these results be possible to validate without HR, or should I mentally write them off now to avoid future disappointment?
My G5 has crunched three tasks without apparent problems, but I notice none of my quorum partners are even Macs, let alone PPC. Will these results be possible to validate without HR, or should I mentally write them off now to avoid future disappointment?
I would go with the latter, and then be pleasantly surprised if the former happens.
____________
Dublin, CA
Team
SETI.USA
My G5 has crunched three tasks without apparent problems, but I notice none of my quorum partners are even Macs, let alone PPC. Will these results be possible to validate without HR, or should I mentally write them off now to avoid future disappointment?
I assume the number of Mac/PPC is in this phase for the project very small (it was so even when the project was active in the last year). So perhaps what you can do now is just to wait for other hosts, which belong to the class of Mac/PPC. After participants come back to the project there shall be a possibility that your task is validated.
It's of course welcomed if you would invite someone who has the same kind of machine to join this project or if you have other Macs ;-)
thanks,
suguruhirahara
____________
I'm a volunteer participant; my views are not necessarily those of Docking@Home or its participating institutions.
I am only going to try and process the work units for the purpose of checking all is working with the system.
As HR is not working, none on my results will validate, so it may seem as a waste of time from my side of the fence.
No other Linux AMD machine appears to gotten WU's with me so Windows controls my destiny in determining if I get any credit back or not, being outnumbered, Windows will win this one.
Also as said earlier by Andre or Abel, these are the same WU's from last year so are going to run as slow as the proverbial tortoise on Linux and Windows runs like the proverbial hare.
For the ones that have not crashed I am returning results slowly.
Keep Smiling.
____________
ID:
3835 | Rating: 0
| rate:
/
Abel
Forum moderator
Project administrator
Project developer
Project tester
I am only going to try and process the work units for the purpose of checking all is working with the system.
As HR is not working, none on my results will validate, so it may seem as a waste of time from my side of the fence.
No other Linux AMD machine appears to gotten WU's with me so Windows controls my destiny in determining if I get any credit back or not, being outnumbered, Windows will win this one.
Also as said earlier by Andre or Abel, these are the same WU's from last year so are going to run as slow as the proverbial tortoise on Linux and Windows runs like the proverbial hare.
For the ones that have not crashed I am returning results slowly.
Keep Smiling.
Thanks for the conviction Conan :-) At first I had thought the HR problem was just a matter of restarting the server with the new configuration file, unfortunately it's not gonna be that easy. When we updated to the new version of BOINC there where things on our project specific code that we needed to change also. We are working on that and we'll get HR working ASAP. I'm not to sure how the credits are going to play out this time, but feel free to cancel any work units you don't think will get credit, or be tough like Conan and crunch them anyway.
____________
Mmmmm, doughnut...
I am only going to try and process the work units for the purpose of checking all is working with the system.
As HR is not working, none on my results will validate, so it may seem as a waste of time from my side of the fence.
No other Linux AMD machine appears to gotten WU's with me so Windows controls my destiny in determining if I get any credit back or not, being outnumbered, Windows will win this one.
Also as said earlier by Andre or Abel, these are the same WU's from last year so are going to run as slow as the proverbial tortoise on Linux and Windows runs like the proverbial hare.
For the ones that have not crashed I am returning results slowly.
Keep Smiling.
Thanks for the conviction Conan :-) At first I had thought the HR problem was just a matter of restarting the server with the new configuration file, unfortunately it's not gonna be that easy. When we updated to the new version of BOINC there where things on our project specific code that we needed to change also. We are working on that and we'll get HR working ASAP. I'm not to sure how the credits are going to play out this time, but feel free to cancel any work units you don't think will get credit, or be tough like Conan and crunch them anyway.
Well Abel you leave me with no choice do you? I will have to crunch them.
____________
I assume the number of Mac/PPC is in this phase for the project very small (it was so even when the project was active in the last year). So perhaps what you can do now is just to wait for other hosts, which belong to the class of Mac/PPC. After participants come back to the project there shall be a possibility that your task is validated.
The problem isn’t waiting for partners, which I don’t mind much—and which didn’t seem to be an issue last year, anyway—it’s getting partners who are unlikely (in proportion to the need for HR here, I guess) to produce matching results. As Zombie67 said, without HR the odds against a quorum of PPC Macs forming will be very long.
It's of course welcomed if you would invite someone who has the same kind of machine to join this project or if you have other Macs ;-)
Most of my other hosts are at work, and I have a policy not to run test projects on servers, colleagues’ workstations, or other systems I can’t easily keep an eye on. I do have a couple of old G4s available, but charmm doesn’t seem to like them. :(
Anyway, I think I’ll set my G5 to NNT for now, and keep an eye out for news.
SIMAP is issuing work in about an hours time if you want something to turn over while you wait.
____________
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
Sorry I put the WU id in instead of the Result id, should be this
Result
.
Also have had another 2 fail with different error on results
3870
and
3918
.
<core_client_version>5.10.21</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
Calling BOINC init.
Starting charmm run (initial or from checkpoint)...
SIGSEGV: segmentation violation
Stack trace (3 frames):
[0x834f896]
[0x40000420]
[0x82a8178]
My first WU on Windows ran for over 2 hours but all since have been less than 20 minutes on both Windows and Linux.
OK, I just had another
WU
fail with the same Segmentation Violation Error as the last two results.
On closer inspection I have been teamed up with Darwin/Intel machines (zombie67 I think), so it appears that HR is not working.
My currently connected computers are all AMD and I run either Windows or Linux.
____________
At the moment there is 266 work units waiting to be sent, and has been for a few hours.
When I try and get work I am getting the message "There is work but it is committed to other platforms".
This is interesting because I am getting the same message on both Windows and Linux, so all the work must for Darwin.
This is of course if it means OS, if it is platform then all the work must be for Intel (and possibly Intel/PPC or whatever it is called for Apple/Darwin), as I have AMD and am unable to get any work at all.
____________
ID:
3994 | Rating: 0
| rate:
/
Abel
Forum moderator
Project administrator
Project developer
Project tester
Looks like we still have some issues to iron out. We are planning to update the version of CHARMM so we are not too worried about the seg fault issue since we already address some of those issues in the update. We just wanted to make sure HR and the screensaver where working. It seems the screensaver is fine but HR is still being difficult. We are working overtime to get it back in business. Thanks for the updates.
At the moment there is 266 work units waiting to be sent, and has been for a few hours.
When I try and get work I am getting the message "There is work but it is committed to other platforms".
This is interesting because I am getting the same message on both Windows and Linux, so all the work must for Darwin.
Nope. Neither my intel or PPC machines were able to pull them down.
And now I am getting this again:
Thu Jun 5 07:33:20 2008|Docking@Home|Message from server: No work sent
Thu Jun 5 07:33:20 2008|Docking@Home|Message from server: Charmm with screensaver is not available for your type of computer.
Hi, we are still submitting few, controlled WUs for crunching.
Tonight we have found a major issue in the checkpoint and we are testing it. Hopefully the solution works and tomorrow we can start distributing sufficient work for the whole weekend as well as a new complex.
Thanks for your patience!
Michela
____________
If you are interested in working on Docking@Home in a great group at UDel, contact me at 'taufer at acm dot org'!
Btw: I've tried Docking on my old (and lame *grin*) PIII 450 Mhz, running under Win2K.
Both WUs were processed without error but in my results list they show as "invalid".
On my
Host 1526
I am showing seven work units, numbers
1887
,
1904
,
1913
,
1920
,
1924
,
1935
,
1940
. I do not have these work units, they downloaded to my computer (well I could locate 4 of them in the logs), and all errored out with the error code 193 (as per my message logs), but none of this information has uploaded to your servers, and you still show them as being 'Pending'.
I have also
Host 130
showing a WU that my message logs tell me download OK, was processed in around 7 minutes without error and result uploaded from the computer (see
result 4790
).
This result also does not show as being sent back and still says 'pending'.
Also result
4022
on
Host 6710
processed in 7,847.56 seconds claiming 35.45 credits, the other quorum computer
Host 6502
took 806.54 seconds and claimed 1.97 credits. Guess what credit I got? How come?
Found another one, my
Host 1569
took 1,257.91 seconds claiming 3.78 credits, the quorum computer
Host 6954
took 0.76 seconds for a claim of 0.00 credits ??? Please explain ??
____________
ID:
4007 | Rating: 0
| rate:
/
Abel
Forum moderator
Project administrator
Project developer
Project tester
On my
Host 1526
I am showing seven work units, numbers
1887
,
1904
,
1913
,
1920
,
1924
,
1935
,
1940
. I do not have these work units, they downloaded to my computer (well I could locate 4 of them in the logs), and all errored out with the error code 193 (as per my message logs), but none of this information has uploaded to your servers, and you still show them as being 'Pending'.
I have also
Host 130
showing a WU that my message logs tell me download OK, was processed in around 7 minutes without error and result uploaded from the computer (see
result 4790
).
This result also does not show as being sent back and still says 'pending'.
Also result
4022
on
Host 6710
processed in 7,847.56 seconds claiming 35.45 credits, the other quorum computer
Host 6502
took 806.54 seconds and claimed 1.97 credits. Guess what credit I got? How come?
Found another one, my
Host 1569
took 1,257.91 seconds claiming 3.78 credits, the quorum computer
Host 6954
took 0.76 seconds for a claim of 0.00 credits ??? Please explain ??
Thanks for the updates on the buggy WUs.
As far as the points go, somehow they are not being normalized, we will take a look at this ASAP.
ID:
4008 | Rating: 0
| rate:
/
Abel
Forum moderator
Project administrator
Project developer
Project tester
Btw: I've tried Docking on my old (and lame *grin*) PIII 450 Mhz, running under Win2K.
Both WUs were processed without error but in my results list they show as "invalid".
I just saw
one WU
on my account, that was not on my puter. I seem to have found the messages for this ine (I'm UTC+2 at the moment, and I think the times on the server are UTC)
Fr 06 Jun 2008 16:14:30 CEST|Docking@Home|Sending scheduler request: To fetch work. Requesting 24417 seconds of work, reporting 0 completed tasks
Fr 06 Jun 2008 16:14:35 CEST|Docking@Home|Scheduler request succeeded: got 1 new tasks
Fr 06 Jun 2008 16:14:37 CEST|Docking@Home|Started download of 1tng_mod0011s_948_74060.inp
Fr 06 Jun 2008 16:14:41 CEST|Docking@Home|Finished download of 1tng_mod0011s_948_74060.inp
Fr 06 Jun 2008 16:14:42 CEST|Docking@Home|Starting logo.jpg
Fr 06 Jun 2008 16:14:42 CEST|Docking@Home|Starting task logo.jpg using charmmscreen version 700
Fr 06 Jun 2008 16:19:32 CEST|Docking@Home|Computation for task logo.jpg finished
Fr 06 Jun 2008 16:19:32 CEST|Docking@Home|Sending scheduler request: To fetch work. Requesting 24417 seconds of work, reporting 0 completed tasks
Fr 06 Jun 2008 16:19:34 CEST|Docking@Home|Started upload of 1tng_mod0011s_948_74060_5_0
Fr 06 Jun 2008 16:19:34 CEST|Docking@Home|Started upload of 1tng_mod0011s_948_74060_5_1
Fr 06 Jun 2008 16:19:36 CEST|Docking@Home|Finished upload of 1tng_mod0011s_948_74060_5_0
Fr 06 Jun 2008 16:19:36 CEST|Docking@Home|Finished upload of 1tng_mod0011s_948_74060_5_1
Fr 06 Jun 2008 16:19:36 CEST|Docking@Home|Started upload of 1tng_mod0011s_948_74060_5_2
Fr 06 Jun 2008 16:19:36 CEST|Docking@Home|Started upload of 1tng_mod0011s_948_74060_5_3
Fr 06 Jun 2008 16:19:37 CEST|Docking@Home|Scheduler request succeeded: got 0 new tasks
Fr 06 Jun 2008 16:19:37 CEST|Docking@Home|Message from server: No work sent
Fr 06 Jun 2008 16:19:38 CEST|Docking@Home|Finished upload of 1tng_mod0011s_948_74060_5_3
Fr 06 Jun 2008 16:19:39 CEST|Docking@Home|Finished upload of 1tng_mod0011s_948_74060_5_2
Fr 06 Jun 2008 16:20:37 CEST|Docking@Home|Sending scheduler request: To report completed tasks. Requesting 24417 seconds of work, reporting 1 completed tasks
Fr 06 Jun 2008 16:20:42 CEST|Docking@Home|Scheduler request succeeded: got 0 new tasks
Fr 06 Jun 2008 16:20:42 CEST|Docking@Home|Message from server: No work sent
According to this messages everything went fine, but your server knows nothing about it.
____________
Gruesse vom Saenger
For questions about Boinc look in the
BOINC-Wiki
Btw: I've tried Docking on my old (and lame *grin*) PIII 450 Mhz, running under Win2K.
Both WUs were processed without error but in my results list they show as "invalid".
I just got a bunch on the Mac and they still all fail instantly. I've seen that some others recently crunched successfully on their Macs and would appreciate any ideas. Stack trace etc. is reported on the failing tasks.
This
is the host's tasks page.
Your new '1abe' work units all fail with segmentation errors on my Linux AMD computers.
The only 3 that have worked ok have been sent to my Windows AMD computer.
I have one Windows box that cannot receive work. It gives the following message:
6/6/2008 6:23:46 PM|Docking@Home|Message from server: No work sent
6/6/2008 6:23:46 PM|Docking@Home|Message from server: Charmm with screensaver is not available for your type of computer.
Here is the startup info from boinc:
Starting BOINC client version 6.2.4 for windows_intelx86
log flags: task, file_xfer, sched_ops, cpu_sched, checkpoint_debug
Libraries: libcurl/7.18.0 OpenSSL/0.9.8e zlib/1.2.3
Executing as a daemon
Data directory: D:BOINC
BOINC is running as a service and as a non-system user.
Processor: 2 AuthenticAMD AMD Athlon(tm) 64 X2 Dual Core Processor 6000+ [x86 Family 15 Model 67 Stepping 3]
Processor features: fpu tsc pae nx sse sse2 3dnow mmx
OS: Microsoft Windows XP: Professional Edition, Service Pack 3, (05.01.2600.00)
Memory: 2.00 GB physical, 3.85 GB virtual
Disk: 100.00 GB total, 52.66 GB free
Local time is UTC -4 hours
No coprocessors
So exactly what is wrong with my "type of computer"? AMD processor, Windows XP SP3, boinc version 6.2.4, running as service? I can get work on an Intel Q6600 with Vista x64 using boinc 5.10.45 x64.
I tried a detach/reattach on the XP box but still it persists:
6/6/2008 6:37:04 PM|Docking@Home|Resetting project
6/6/2008 6:37:04 PM|Docking@Home|Detaching from project
6/6/2008 6:37:48 PM||Fetching configuration file from http://docking.cis.udel.edu/get_project_config.php
6/6/2008 6:38:13 PM|Docking@Home|Master file download succeeded
6/6/2008 6:38:18 PM|Docking@Home|Sending scheduler request: Project initialization. Requesting 1 seconds of work, reporting 0 completed tasks
6/6/2008 6:38:23 PM|Docking@Home|Scheduler request succeeded: got 0 new tasks
6/6/2008 6:38:23 PM|Docking@Home|Message from server: No work sent
6/6/2008 6:38:23 PM|Docking@Home|Message from server: Charmm with screensaver is not available for your type of computer.
I had three on my Linux64 Intel Quad, they behaved well.
I'm just waiting for my wingmen to deliver on two of them.
____________
Gruesse vom Saenger
For questions about Boinc look in the
BOINC-Wiki
I have one Windows box that cannot receive work. It gives the following message:
[…] Charmm with screensaver is not available for your type of computer.
My G5 Mac got the same thing last night, and then was apparently told to go away for 24 hours. (I’m guessing as to the interval: I recently installed BOINC v5.10.45 and it doesn’t report communication deferrals the way my previous version did.)
ID:
4023 | Rating: 0
| rate:
/
Abel
Forum moderator
Project administrator
Project developer
Project tester
I have one Windows box that cannot receive work. It gives the following message:
[…] Charmm with screensaver is not available for your type of computer.
My G5 Mac got the same thing last night, and then was apparently told to go away for 24 hours. (I’m guessing as to the interval: I recently installed BOINC v5.10.45 and it doesn’t report communication deferrals the way my previous version did.)
As far as Power PC architecture goes, we will no longer be supporting that architecture. We don't have a power pc in our lab and haven't been able to get one. It's getting harder and harder to find those machines.
Anyone have any suggestions as far getting a power pc binary?
As far as the windows machine not getting work, there is definitely something wrong with that. We will look into all these things on Monday.
Don't forget to vote on Monday.
ID:
4024 | Rating: 0
| rate:
/
Abel
Forum moderator
Project administrator
Project developer
Project tester
I just got a bunch on the Mac and they still all fail instantly. I've seen that some others recently crunched successfully on their Macs and would appreciate any ideas. Stack trace etc. is reported on the failing tasks.
This
is the host's tasks page.
Looks like the Xeon's are being put together with the intel core 2's. There is obviously a divergence in the two. We will reevaluate the Xeons HR.
ID:
4025 | Rating: 0
| rate:
/
Abel
Forum moderator
Project administrator
Project developer
Project tester
I have one Windows box that cannot receive work. It gives the following message:
6/6/2008 6:23:46 PM|Docking@Home|Message from server: No work sent
6/6/2008 6:23:46 PM|Docking@Home|Message from server: Charmm with screensaver is not available for your type of computer.
Here is the startup info from boinc:
Starting BOINC client version 6.2.4 for windows_intelx86
log flags: task, file_xfer, sched_ops, cpu_sched, checkpoint_debug
Libraries: libcurl/7.18.0 OpenSSL/0.9.8e zlib/1.2.3
Executing as a daemon
Data directory: D:BOINC
BOINC is running as a service and as a non-system user.
Processor: 2 AuthenticAMD AMD Athlon(tm) 64 X2 Dual Core Processor 6000+ [x86 Family 15 Model 67 Stepping 3]
Processor features: fpu tsc pae nx sse sse2 3dnow mmx
OS: Microsoft Windows XP: Professional Edition, Service Pack 3, (05.01.2600.00)
Memory: 2.00 GB physical, 3.85 GB virtual
Disk: 100.00 GB total, 52.66 GB free
Local time is UTC -4 hours
No coprocessors
So exactly what is wrong with my "type of computer"? AMD processor, Windows XP SP3, boinc version 6.2.4, running as service? I can get work on an Intel Q6600 with Vista x64 using boinc 5.10.45 x64.
I tried a detach/reattach on the XP box but still it persists:
6/6/2008 6:37:04 PM|Docking@Home|Resetting project
6/6/2008 6:37:04 PM|Docking@Home|Detaching from project
6/6/2008 6:37:48 PM||Fetching configuration file from http://docking.cis.udel.edu/get_project_config.php
6/6/2008 6:38:13 PM|Docking@Home|Master file download succeeded
6/6/2008 6:38:18 PM|Docking@Home|Sending scheduler request: Project initialization. Requesting 1 seconds of work, reporting 0 completed tasks
6/6/2008 6:38:23 PM|Docking@Home|Scheduler request succeeded: got 0 new tasks
6/6/2008 6:38:23 PM|Docking@Home|Message from server: No work sent
6/6/2008 6:38:23 PM|Docking@Home|Message from server: Charmm with screensaver is not available for your type of computer.
Nothing wrong with your cpu, we indeed support this architecture. We will address this on Monday.
Thanks!
ID:
4026 | Rating: 0
| rate:
/
Abel
Forum moderator
Project administrator
Project developer
Project tester
Your new '1abe' work units all fail with segmentation errors on my Linux AMD computers.
The only 3 that have worked ok have been sent to my Windows AMD computer.
I have had 3 work and 15 fail.
I noticed you have two computers with identical architectures yet one with a different linux kernel (one of which is the offending seg faulter). Can you confirm that both are crashing with 1abe? It looks like only one has gotten the 1abe complex, we will send some more WU's over next week and if you could check if your other AMD gets validated that would be awesome.
As far as Power PC architecture goes, we will no longer be supporting that architecture. We don't have a power pc in our lab and haven't been able to get one. It's getting harder and harder to find those machines.
Anyone have any suggestions as far getting a power pc binary?
I'm new to Macs but the current version of OS/X includes tools allowing cross-compiling for the ppc. I created intel and ppc binaries used by another project using my Mac-intel laptop
...the same one that bus-errors on all current Docking work :-(
ID:
4028 | Rating: 0
| rate:
/
Abel
Forum moderator
Project administrator
Project developer
Project tester
As far as Power PC architecture goes, we will no longer be supporting that architecture. We don't have a power pc in our lab and haven't been able to get one. It's getting harder and harder to find those machines.
Anyone have any suggestions as far getting a power pc binary?
I'm new to Macs but the current version of OS/X includes tools allowing cross-compiling for the ppc. I created intel and ppc binaries used by another project using my Mac-intel laptop
...the same one that bus-errors on all current Docking work :-(
Hmmm, we do have an intel mac. Will give it a look.
Your new '1abe' work units all fail with segmentation errors on my Linux AMD computers.
The only 3 that have worked ok have been sent to my Windows AMD computer.
I have had 3 work and 15 fail.
I noticed you have two computers with identical architectures yet one with a different linux kernel (one of which is the offending seg faulter). Can you confirm that both are crashing with 1abe? It looks like only one has gotten the 1abe complex, we will send some more WU's over next week and if you could check if your other AMD gets validated that would be awesome.
7-6-2008 12:07:11|Docking@Home|Message from server: No work sent
7-6-2008 12:07:11|Docking@Home|Message from server: (there was work but it was committed to other platforms)
We currently have 98 work units ready to send but only 1 in progress, is there no CPU type or OS type that matches these work units in order to download and work on them?
I know I can't get them going on my message logs.
____________
Hmmm, we do have an intel mac. Will give it a look.
I asked jedirock about this. He has helped build Mac apps for several projects. Here is his response:
Do they really need a PPC machine to compile PPC apps? Or can they do it on an intel machine? Perhaps they need a few tips?
No, you don't need a PPC machine to compile PPC apps, but it can require a bit of work. Assuming he has at least a bit of experience with the Terminal, point him to my BOINC library building script at http://michaelsprogramming.home.dyndns.org/boinc-build.sh and tell him to change the paths at the bottom as needed. Then maybe he can figure out how to compile PPC apps from there, but if not, see if you can get how he builds the apps.
Crosscompiling on MacOS is very easy.
You have got two different compilers installed with the Apple XCode installation. One is called i686-apple-darwin8-gcc-4.0.1 the other one powerpc-apple-darwin8-gcc-4.0.1.
There are two different methods to get it crosscompiled :
1. via the configure script
- set CC="powerpc-apple-darwin8-gcc-4.0.1" and CXX="powerpc-apple-darwin8-g++-4.0.1"
- issue the configure script with the option "--target=powerpc-apple-darwin" or "--target=powerpc-apple-darwin8".
- gmake
2. Via the Makefile
- Change all the gcc/g++... statements in the Makefile to powerpc-apple-darwin8-gcc...
- gmake
3. via XCode
- If you have a xcode project made, you can select the target platforms (Intel/PPC/Universal) by a switch
The BOINC Client , LIB and API crosscompiles very well on MacOS. Also the most science applications.
For the testing of the powerpc application on the Intel Mac you should nothing pay attention. Via Rosetta, the binary emulation of the PPC CPU, it is posible to let PPC binaries running on the Intel Mac with a los of performance.
If you have some further questions, feel free to contact me.
Crosscompiling on MacOS is very easy.
You have got two different compilers installed with the Apple XCode installation. One is called i686-apple-darwin8-gcc-4.0.1 the other one powerpc-apple-darwin8-gcc-4.0.1.
There are two different methods to get it crosscompiled :
1. via the configure script
- set CC="powerpc-apple-darwin8-gcc-4.0.1" and CXX="powerpc-apple-darwin8-g++-4.0.1"
- issue the configure script with the option "--target=powerpc-apple-darwin" or "--target=powerpc-apple-darwin8".
- gmake
2. Via the Makefile
- Change all the gcc/g++... statements in the Makefile to powerpc-apple-darwin8-gcc...
- gmake
3. via XCode
- If you have a xcode project made, you can select the target platforms (Intel/PPC/Universal) by a switch
The BOINC Client , LIB and API crosscompiles very well on MacOS. Also the most science applications.
For the testing of the powerpc application on the Intel Mac you should nothing pay attention. Via Rosetta, the binary emulation of the PPC CPU, it is posible to let PPC binaries running on the Intel Mac with a los of performance.
If you have some further questions, feel free to contact me.
Thanks for your suggestions. We will try them and will contact you if we have more questions.
Your new '1abe' work units all fail with segmentation errors on my Linux AMD computers.
The only 3 that have worked ok have been sent to my Windows AMD computer.
I have had 3 work and 15 fail.
I noticed you have two computers with identical architectures yet one with a different linux kernel (one of which is the offending seg faulter). Can you confirm that both are crashing with 1abe? It looks like only one has gotten the 1abe complex, we will send some more WU's over next week and if you could check if your other AMD gets validated that would be awesome.
Thanks!
OK will monitor it for you.
@ Arun,
I have not gotten very many work units to test this out for you but I have found that the other computer with the same architecture but with FC3 has not given the same error as the FC6 machine.
It has given, on the one WU it got, the same error most others are having, and that is 'maximum cpu time exceeded'. So it is not SEG Faulting.
My FC6 machine has just gotten 4 more WU's and they all SEG faulted so I will reset the project on that computer and see if that fixes things, if it does not then I will detach and reattach to see if that works.
This computer has no problems with the other 7 projects it also works on, just Docking.
____________
Your new '1abe' work units all fail with segmentation errors on my Linux AMD computers.
The only 3 that have worked ok have been sent to my Windows AMD computer.
I have had 3 work and 15 fail.
I noticed you have two computers with identical architectures yet one with a different linux kernel (one of which is the offending seg faulter). Can you confirm that both are crashing with 1abe? It looks like only one has gotten the 1abe complex, we will send some more WU's over next week and if you could check if your other AMD gets validated that would be awesome.
Thanks!
OK will monitor it for you.
@ Arun,
I have not gotten very many work units to test this out for you but I have found that the other computer with the same architecture but with FC3 has not given the same error as the FC6 machine.
It has given, on the one WU it got, the same error most others are having, and that is 'maximum cpu time exceeded'. So it is not SEG Faulting.
My FC6 machine has just gotten 4 more WU's and they all SEG faulted so I will reset the project on that computer and see if that fixes things, if it does not then I will detach and reattach to see if that works.
This computer has no problems with the other 7 projects it also works on, just Docking.
Hi Conan,
Thanks for your feedback. Can you please find if both the clients on FC3 and FC6 downloaded the same charmm version to execute when they got the workunits ?
We are testing a new algorithm for checkpointing on docking
Today 2 units came in, but I'm wondering if they were capable of making checkpoints.
I've got the [checkpoint_debug] option enabled, but no massages were shown in the log.
Your new '1abe' work units all fail with segmentation errors on my Linux AMD computers.
The only 3 that have worked ok have been sent to my Windows AMD computer.
I have had 3 work and 15 fail.
I noticed you have two computers with identical architectures yet one with a different linux kernel (one of which is the offending seg faulter). Can you confirm that both are crashing with 1abe? It looks like only one has gotten the 1abe complex, we will send some more WU's over next week and if you could check if your other AMD gets validated that would be awesome.
Thanks!
OK will monitor it for you.
@ Arun,
I have not gotten very many work units to test this out for you but I have found that the other computer with the same architecture but with FC3 has not given the same error as the FC6 machine.
It has given, on the one WU it got, the same error most others are having, and that is 'maximum cpu time exceeded'. So it is not SEG Faulting.
My FC6 machine has just gotten 4 more WU's and they all SEG faulted so I will reset the project on that computer and see if that fixes things, if it does not then I will detach and reattach to see if that works.
This computer has no problems with the other 7 projects it also works on, just Docking.
Hi Conan,
Thanks for your feedback. Can you please find if both the clients on FC3 and FC6 downloaded the same charmm version to execute when they got the workunits ?
Thanks
Arun
OK Arun, the FC6 machine had the '1abe_mod0011sc_' work units and the FC3 machine (the one not giving seg faults) had an '1tng_mod0011sc_' work unit.
So not the same type, will wait for more work units.
____________
OK Arun, the FC6 machine had the '1abe_mod0011sc_' work units and the FC3 machine (the one not giving seg faults) had an '1tng_mod0011sc_' work unit.
So not the same type, will wait for more work units.
Conan, thanks for your response. Actually I want to know if your client downloaded charmm_7.0_i686-pc-linux-gnu or charmm_7.0_x86_64-pc-linux-gnu when you attached to the project before downloading the wu.
OK Arun, the FC6 machine had the '1abe_mod0011sc_' work units and the FC3 machine (the one not giving seg faults) had an '1tng_mod0011sc_' work unit.
So not the same type, will wait for more work units.
Conan, thanks for your response. Actually I want to know if your client downloaded charmm_7.0_i686-pc-linux-gnu or charmm_7.0_x86_64-pc-linux-gnu when you attached to the project before downloading the wu.
Thanks
Arun
Arun, I have no idea. As I currently have no work units I am unable to locate this information. I will wait for more work and if I can catch it in time (they error out in 20 odd seconds) then I may be able to get this information.
____________
OK Arun, the FC6 machine had the '1abe_mod0011sc_' work units and the FC3 machine (the one not giving seg faults) had an '1tng_mod0011sc_' work unit.
So not the same type, will wait for more work units.
Conan, thanks for your response. Actually I want to know if your client downloaded charmm_7.0_i686-pc-linux-gnu or charmm_7.0_x86_64-pc-linux-gnu when you attached to the project before downloading the wu.
Thanks
Arun
Arun, I have no idea. As I currently have no work units I am unable to locate this information. I will wait for more work and if I can catch it in time (they error out in 20 odd seconds) then I may be able to get this information.
>> G'Day Arun,
Today I have been able to get a few work units and again I had four error out on the same machine with the same error.
I checked and found that the file
"charmm_7.2_i686-pc-linux-gnu"
and the file
"charmm_7.2_i686-pc-linux-gnu-main"
are currently in my Boinc/projects/docking folder.
Hope this can be of help as I will have to detach this machine if I can't get it to work. It worked before the move to this new university.
Perhaps I need to detach and reattach ???
Awaiting your reply, Conan.
EDIT::: I have found that a work unit has downloaded to the same spec machine running FC3 and in it's Boinc folder it not only has the
charmm_7.2_i686-pc-linux-gnu
and
charmm_7.2_i686-pc-linux-gnu-main
files but also the
charmm_5.8_i686-pc-linux-gnu
,
charmm_5.8_i686-pc-linux-gnu-main
,
charmm_7.0_i686-pc-linux-gnu
and
charmm_7.0_i686-pc-linux-gnu-main
files.
Could this be a reason why my FC6 machine is playing up, it has lost old and possibly needed files ??
____________
OK Arun, the FC6 machine had the '1abe_mod0011sc_' work units and the FC3 machine (the one not giving seg faults) had an '1tng_mod0011sc_' work unit.
So not the same type, will wait for more work units.
Conan, thanks for your response. Actually I want to know if your client downloaded charmm_7.0_i686-pc-linux-gnu or charmm_7.0_x86_64-pc-linux-gnu when you attached to the project before downloading the wu.
Thanks
Arun
Arun, I have no idea. As I currently have no work units I am unable to locate this information. I will wait for more work and if I can catch it in time (they error out in 20 odd seconds) then I may be able to get this information.
>> G'Day Arun,
Today I have been able to get a few work units and again I had four error out on the same machine with the same error.
I checked and found that the file
"charmm_7.2_i686-pc-linux-gnu"
and the file
"charmm_7.2_i686-pc-linux-gnu-main"
are currently in my Boinc/projects/docking folder.
Hope this can be of help as I will have to detach this machine if I can't get it to work. It worked before the move to this new university.
Perhaps I need to detach and reattach ???
Awaiting your reply, Conan.
EDIT::: I have found that a work unit has downloaded to the same spec machine running FC3 and in it's Boinc folder it not only has the
charmm_7.2_i686-pc-linux-gnu
and
charmm_7.2_i686-pc-linux-gnu-main
files but also the
charmm_5.8_i686-pc-linux-gnu
,
charmm_5.8_i686-pc-linux-gnu-main
,
charmm_7.0_i686-pc-linux-gnu
and
charmm_7.0_i686-pc-linux-gnu-main
files.
Could this be a reason why my FC6 machine is playing up, it has lost old and possibly needed files ??
It did it again so I have detached and reattached that computer and will await new work units (probably tomorrow as I have reached my daily limit of 4 WU's, seems a pretty low daily limit to me).
____________
OK Arun, the FC6 machine had the '1abe_mod0011sc_' work units and the FC3 machine (the one not giving seg faults) had an '1tng_mod0011sc_' work unit.
So not the same type, will wait for more work units.
Conan, thanks for your response. Actually I want to know if your client downloaded charmm_7.0_i686-pc-linux-gnu or charmm_7.0_x86_64-pc-linux-gnu when you attached to the project before downloading the wu.
Thanks
Arun
Arun, I have no idea. As I currently have no work units I am unable to locate this information. I will wait for more work and if I can catch it in time (they error out in 20 odd seconds) then I may be able to get this information.
>> G'Day Arun,
Today I have been able to get a few work units and again I had four error out on the same machine with the same error.
I checked and found that the file
"charmm_7.2_i686-pc-linux-gnu"
and the file
"charmm_7.2_i686-pc-linux-gnu-main"
are currently in my Boinc/projects/docking folder.
Hope this can be of help as I will have to detach this machine if I can't get it to work. It worked before the move to this new university.
Perhaps I need to detach and reattach ???
Awaiting your reply, Conan.
EDIT::: I have found that a work unit has downloaded to the same spec machine running FC3 and in it's Boinc folder it not only has the
charmm_7.2_i686-pc-linux-gnu
and
charmm_7.2_i686-pc-linux-gnu-main
files but also the
charmm_5.8_i686-pc-linux-gnu
,
charmm_5.8_i686-pc-linux-gnu-main
,
charmm_7.0_i686-pc-linux-gnu
and
charmm_7.0_i686-pc-linux-gnu-main
files.
Could this be a reason why my FC6 machine is playing up, it has lost old and possibly needed files ??
Hi Conan,
Thanks for your feedback. We will use the information to figure out the problem. The problem is not because of the absence of the old files in the FC6 machine.
I got a crashing WU (Unhandled Exception Detected) as I have pressed the show graphics button short after the start of the WU : http://docking.cis.udel.edu/result.php?resultid=9027
The graphics window had only a black background and I closed and opened the graphics several times...
Server status says 77 units ready to send but I get this error message:
7/17/2008 3:43:21 PM|Docking@Home|Sending scheduler request: To fetch work. Requesting 5480 seconds of work, reporting 0 completed tasks
7/17/2008 3:43:26 PM|Docking@Home|Scheduler request succeeded: got 0 new tasks
7/17/2008 3:43:26 PM|Docking@Home|Message from server: No work sent
Server status says 77 units ready to send but I get this error message:
7/17/2008 3:43:21 PM|Docking@Home|Sending scheduler request: To fetch work. Requesting 5480 seconds of work, reporting 0 completed tasks
7/17/2008 3:43:26 PM|Docking@Home|Scheduler request succeeded: got 0 new tasks
7/17/2008 3:43:26 PM|Docking@Home|Message from server: No work sent
Why?
Win XP Pro SP3 32bit, BOINC 5.10.45
Yeah, I'm getting these messages on all my OS's, too (Win XP 32 and 64 bit, Linux 64 bit).
Guess the server will not give those WUs away. ;-)
____________
Bribe me with Lasagna!! :-)