[OPEN-ILS-GENERAL] problem with OpenSRF 1.2 install

Roma Matott rmatott at pls-net.org
Wed Oct 21 13:53:11 EDT 2009


I am in the process of a fresh install of Evergreen 1.6 on debian Lenny (I previously installed Evergreen on Ubuntu) and have run into a problem with opensrf.  I am unable to complete the math computation. From what I can tell the Opensrf C is not actually starting - although it says it is.

I ran through the first few steps of the troubleshooting document and all went well until I hit Step 7: start of the OpenSRF C services (router and perl started without errors).  The osrfsys.log listed the following:

opensrf at rgmevergreen:/openils/var/log$ grep ERR /openils/var/log/*.log
/openils/var/log/osrfsys.log:opensrf 2009-10-21 10:45:48 [ERR :20449:osrf_settin        gs.c:48:] No osrfMessage received from host localhost (timeout?)
/openils/var/log/osrfsys.log:opensrf 2009-10-21 10:45:48 [ERR :20449:osrf_settin        gs.c:67:] Unable to load config for host localhost
/openils/var/log/osrfsys.log:opensrf 2009-10-21 10:45:48 [ERR :20449:osrf_system        .c:105:] Unable to retrieve settings for host localhost from configuration file         /openils/conf/opensrf_core.xml
/openils/var/log/osrfsys.log:opensrf 2009-10-21 10:45:48 [ERR :20449:opensrf.c:2        3:] Server Loop returned an error condition, exiting with -1
--------------------------------------------------------------------------------
The router.log looks like this:

router 2009-10-21 10:41:44 [INFO:20398:osrf_router_main.c:115:] Router connecting as: server: private.localhost port: 5222 user: router resource: router
router 2009-10-21 10:41:44 [INFO:20398:osrf_router_main.c:137:] Router adding trusted server: private.localhost
router 2009-10-21 10:41:44 [INFO:20398:osrf_router_main.c:149:] Router adding trusted client: private.localhost
router 2009-10-21 11:56:10 [WARN:20397:transport_session.c:447:] Received <stream:error> message from Jabber server
router 2009-10-21 11:56:10 [WARN:20397:transport_session.c:663:] ERROR XML fragment: Replaced by new connection</text></stream:error></stream:stream>

router 2009-10-21 11:56:10 [INFO:21235:osrf_router_main.c:115:] Router connecting as: server: private.localhost port: 5222 user: router resource: router
router 2009-10-21 11:56:10 [INFO:21235:osrf_router_main.c:137:] Router adding trusted server: private.localhost
router 2009-10-21 11:56:10 [INFO:21235:osrf_router_main.c:149:] Router adding trusted client: private.localhost
router 2009-10-21 11:56:10 [DEBG:20399:osrf_router.c:117:] Top router socket is active: 3
router 2009-10-21 11:56:10 [WARN:20399:transport_session.c:447:] Received <stream:error> message from Jabber server
router 2009-10-21 11:56:10 [WARN:20399:transport_session.c:663:] ERROR XML fragment: Replaced by new connection</text></stream:error></stream:stream>

router 2009-10-21 11:56:10 [DEBG:20399:socket_bundle.c:394:] removing socket 3
router 2009-10-21 11:56:10 [DEBG:20399:transport_session.c:200:] socket_wait returned error code -1
router 2009-10-21 11:56:10 [DEBG:20399:transport_client.c:127:] session_wait returned failure code -1: setting error=1

router 2009-10-21 11:56:10 [DEBG:20399:osrf_router.c:117:] Top router socket is active: 3
router 2009-10-21 11:56:10 [DEBG:20399:transport_session.c:200:] socket_wait returned error code -1
router 2009-10-21 11:56:10 [DEBG:20399:transport_client.c:127:] session_wait returned failure code -1: setting error=1
--------------------------------------------------------------------------------------

And when I check the user running opensrf user error log (/tmp/srfsh.log) I see:

srfsh 2009-10-21 11:59:01 [WARN:21361:osrf_stack.c:29:] transport_client had trouble reading from the socket..
srfsh 2009-10-21 11:59:01 [WARN:21361:osrf_stack.c:29:] transport_client had trouble reading from the socket..
srfsh 2009-10-21 11:59:01 [WARN:21361:osrf_stack.c:29:] transport_client had trouble reading from the socket..
srfsh 2009-10-21 11:59:01 [WARN:21361:osrf_stack.c:29:] transport_client had trouble reading from the socket..
srfsh 2009-10-21 11:59:01 [WARN:21361:osrf_stack.c:29:] transport_client had trouble reading from the socket..
srfsh 2009-10-21 11:59:01 [WARN:21361:srfsh.c:756:] Unable to connect to remote service opensrf.math

srfsh 2009-10-21 11:59:01 [WARN:21361:osrf_stack.c:29:] transport_client had trouble reading from the socket..
srfsh 2009-10-21 11:59:01 [ERR :21361:osrf_app_session.c:619:] client_send_message failed
srfsh 2009-10-21 11:59:01 [INFO:21361:osrf_app_session.c:621:] [opensrf.math] sent 86 bytes of data to router at private.localhost/opensrf.math

-------------------------------------------------------------------

I double checked the settings my /etc/hosts, /etc/ejabberd/ejabberd.cfg, .srfsh.xml opensrf.xml and opensrf_core.xml files.  And it all appears to be correct per the instructions.  

I also looked at the opensrf_core.xml and opensrf.xml files in an xml editor to see if I had in inadvertently broken the structure or had a typo that would cause a problem.  But I did not see anything unusual.  

I poked around in the forums it looks like people have encountered these messages for various reasons, so I am not sure what to try next.  I have not as of yet finished the Evergreen install since I would like to get opensrf working first.  So, I haven't run the settings_tester.pl script as of yet.  Should I try that?  

If anyone has any suggestions, I would love to hear them!

Thanks,

Roma
Pioneer Library System
Canandaigua, NY


More information about the Open-ils-general mailing list