Monday, March 3, 2008

Daily Lab / POC Activities

  • Installed Globus Web Services on lab 1001
  • Troubleshot the following Reliable File Transfer (RFT) error, found in /usr/local/globus-4.0.5/var/container.log on lab 1002:

2008-03-03 14:40:44,895 ERROR service.ReliableFileTransferImpl [main,:69] Unable to setup database driver with pooling.Connection refused. Check that the hostname and port are correct and that the postmaster is accepting TCP/IP connections.
2008-03-03 14:40:45,127 WARN service.ReliableFileTransferHome [main,initialize:97] All RFT requests will fail and all GRAM jobs that require file staging will fail.Connection refused. Check that the hostname and port are correct and that the postmaster is accepting TCP/IP connections.
Starting SOAP server at: https://192.168.0.202:8443/wsrf/services/
With the following services:

[1]: https://192.168.0.202:8443/wsrf/services/AdminService
[2]: https://192.168.0.202:8443/wsrf/services/AuthzCalloutTestService
[3]: https://192.168.0.202:8443/wsrf/services/CASService
[4]: https://192.168.0.202:8443/wsrf/services/ContainerRegistryEntryService
[5]: https://192.168.0.202:8443/wsrf/services/ContainerRegistryService
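
The "Connection refused" message in the log above means that nothing accepted the TCP connection RFT tried to open to the database. One quick way to confirm this from the container host is a plain socket probe. The following is a minimal sketch: the helper name port_is_open is hypothetical, and the port to test is an assumption, since the log does not show which port RFT was configured to use (5432 is the PostgreSQL default).

```python
import socket


def port_is_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        # create_connection raises OSError on refusal, timeout, or DNS failure
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

Calling port_is_open("192.168.0.202", 5432) from the container host would show whether the postmaster is listening on TCP at all. A True result only proves that something is listening; it does not show that pg_hba.conf allows the connection.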

  • Added the following entry to the /var/lib/pgsql/data/pg_hba.conf file:
host rftDatabase "globus" 192.168.0.202/32 md5
NOTE: IP address changed for privacy reasons.
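
For reference, a host record in pg_hba.conf lists the connection type, database, user, client address, and authentication method, in that order. The sketch below assembles such a line and validates the address with Python's ipaddress module. The function name hba_host_line is hypothetical, and md5 as the default method is an assumption about the intended setting.

```python
import ipaddress


def hba_host_line(database, user, cidr, method="md5"):
    """Build a pg_hba.conf 'host' record: type, database, user, address, method."""
    # Raises ValueError if cidr is not a valid network such as 192.168.0.202/32
    network = ipaddress.ip_network(cidr, strict=True)
    return f'host {database} "{user}" {network} {method}'


print(hba_host_line("rftDatabase", "globus", "192.168.0.202/32"))
```

PostgreSQL only picks up pg_hba.conf changes after a reload or restart, so a new entry has no effect until then.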

This allowed the rft command to be run, but it produced a new error:

bubba@gump:/usr/local/globus-4.0.5/share/globus_wsrf_rft_test> rft -h gump.somedomain.net -f /tmp/rft.xfr
Number of transfers in this request: 1
Subscribed for overall status
Termination time to set: 60 minutes

Overall status of transfer:
Finished/Active/Failed/Retrying/Pending
0/0/1/0/0
Error:Error authenticating user at source/dest hostAuthentication failed [Caused by: Operation unauthorized (Mechanism level: Authorization failed. Expected "/CN=host/localhost" target but received "/O=SomeGrid/OU=Globus Toolkit/OU=Globus Names/OU=us.somegrid.us/CN=host/bubba.somedomain.net")] [Caused by: Authentication failed [Caused by: Operation unauthorized (Mechanism level: Authorization failed. Expected "/CN=host/localhost" target but received "/O=SomeGrid/OU=Globus Toolkit/OU=Globus Names/OU=us.somegrid.us/CN=host/bubba.somedomain.net")]]
All Transfers failed !

  • The issue was resolved by editing /tmp/rft.xfr and replacing the localhost entry with a fully qualified domain name. An example of the file is shown below:

####################
true
16000
16000
false
1
true
1
null
null
false
10
gsiftp://atlanta.somedomain.net:2811/home/bubba/gridtesting
gsiftp://tarrant.anotherdomain.net:2811/tmp/rftTest_Done.tmp
####################################
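
The authorization failure came from a host name mismatch: the transfer named localhost, so the expected target was "/CN=host/localhost", while the server presented a certificate issued for its real host name. A small pre-flight check can flag such URLs before a request is submitted. This is a sketch under assumptions: the helper name find_local_urls is hypothetical, and it only inspects lines that begin with gsiftp://.

```python
from urllib.parse import urlparse

LOCAL_NAMES = {"localhost", "127.0.0.1", "::1"}


def find_local_urls(transfer_text):
    """Return gsiftp URLs whose host is a loopback name or has no domain part."""
    flagged = []
    for line in transfer_text.splitlines():
        line = line.strip()
        if not line.startswith("gsiftp://"):
            continue  # skip option lines and separators
        host = urlparse(line).hostname or ""
        # Short names such as "gump" are flagged too, since they are not fully qualified
        if host in LOCAL_NAMES or "." not in host:
            flagged.append(line)
    return flagged
```

For example, find_local_urls(open("/tmp/rft.xfr").read()) returns an empty list once every source and destination URL uses a fully qualified domain name.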

  • Below are the results after the change was made:

bubba@gump:/tmp> rft -h gump.somedomain.net -f /tmp/rft.xfr
Number of transfers in this request: 1
Subscribed for overall status
Termination time to set: 60 minutes

Overall status of transfer:
Finished/Active/Failed/Retrying/Pending
0/1/0/0/0

Overall status of transfer:
Finished/Active/Failed/Retrying/Pending
1/0/0/0/0
All Transfers are completed

  • Started testing WS GRAM on lab 1002
  • Current state of the WS GRAM test: additional configuration needed. Test results below:

bubba@gump:~> globusrun-ws -submit -c /bin/touch touched_it
Submitting job...Done.
Job ID: uuid:b516d216-e96a-11dc-ac6c-9fcdae28241b
Termination time: 03/04/2008 21:42 GMT
Current job state: Failed
Destroying job...Done.
globusrun-ws: Job failed: Error code: 201
Script stderr:
bubba's password:
