[OPEN-ILS-DEV] Debugging OpenSRF installation

Mike Rylander mrylander at gmail.com
Fri Jun 12 16:56:20 EDT 2009


On Fri, Jun 12, 2009 at 4:35 PM, Victoria Bush<vbush at ilstu.edu> wrote:
>
> On Jun 12, 2009, at 1:39 PM, Dan Wells wrote:
>
>> Hello Victoria,
>>
>> Though I think it is designed to test a full Evergreen install, running
>> the settings-tester.pl script may help identify problems with just OpenSRF
>> as well.  To quote the wiki:
>>
>> "As the opensrf user, run the settings-tester.pl script to see if it finds
>> any system configuration problems. The script is found at
>> Open-ILS/src/support-scripts/settings-tester.pl in the Evergreen source
>> tree."
>>
>> Feel free to send the output back to this list if it doesn't mean much to
>> you.
>
> Dan, thanks so much for your reply. I think there might be something more
> going on than just a simple configuration error, but I defer to the greater
> wisdom of the list.
>

[snip]

>
> And it looks fine. I see lots of OpenSRF processes:
>
>> opensrf   3753     1  2 Jun11 ?        00:37:07 OpenSRF Router
>> opensrf   3754     1  2 Jun11 ?        00:38:21 OpenSRF Router
>> opensrf  14159     1  0 14:32 ?        00:00:00 OpenSRF Router
>> opensrf  14165     1  0 14:32 ?        00:00:00 OpenSRF Router

Looks like you've got stale routers hanging around.  I don't think
this will cause the problem you're seeing, but just in case, let's get
rid of those.  First, stop the opensrf services (osrf_ctl.sh stop_all),
then double-check that every opensrf process has gone away.  If any
are still there, kill whatever is left running.
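
Something along these lines would do for the double-check.  This is
only a rough sketch: it assumes your ps accepts -ef and that leftover
processes show "OpenSRF" in the command column, as the routers in your
listing do.  The helper name stale_opensrf_pids is made up for
illustration; it is not part of OpenSRF.

```shell
# Hypothetical helper: read `ps -ef`-style lines on stdin and print the
# PID (second column) of every line owned by the opensrf user whose
# command mentions OpenSRF.  The [O] bracket keeps this awk process
# itself from matching when the script runs as the opensrf user.
stale_opensrf_pids() {
    awk '$1 == "opensrf" && /[O]penSRF/ { print $2 }'
}

# Usage sketch, after the services have been stopped:
#   ps -ef | stale_opensrf_pids               # list any leftovers
#   ps -ef | stale_opensrf_pids | xargs kill  # kill them, if any are listed
```

If the first command prints nothing, everything has shut down cleanly
and there is nothing to kill.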

Can you attach your opensrf_core.xml as well (with passwords blanked,
of course)?

[snip]

>> srfsh 2009-06-12 15:08:58 [DEBG:14305:osrf_stack.c:84:] We received 1
>> messages from router@public.localhost/opensrf.math
>> srfsh 2009-06-12 15:08:58 [WARN:14305:osrf_stack.c:95:]  !!! Received
>> Jabber layer error message
>> srfsh 2009-06-12 15:08:58 [WARN:14305:osrf_stack.c:105:]  * Jabber Error
>> is for top level remote  id [router@public.localhost/opensrf.math], no one
>> to send my message to!  Cutting request short...
>> srfsh 2009-06-12 15:08:58 [INFO:14305:osrf_stack.c:116:] Message
>> processing duration 0.000164
>> srfsh 2009-06-12 15:08:58 [DEBG:14305:osrf_stack.c:119:] after msg delete
>> srfsh 2009-06-12 15:08:58 [ERR :14305:osrf_app_session.c:516:] cannot
>> communicate with opensrf.math
>> srfsh 2009-06-12 15:08:58 [WARN:14305:srfsh.c:576:] Unable to connect to
>> remote service opensrf.math
>>
>> srfsh 2009-06-12 15:09:00 [DEBG:14305:socket_bundle.c:394:] removing
>> socket 3
>>
>
>
> Now why would I get an error about the top level connection, when testing
> via settings-tester.pl showed that the connections were successfully made?

That error means that opensrf.math did not register itself with the
public.localhost opensrf router when it started up.  That's why I'm
curious about your opensrf_core.xml.


-- 
Mike Rylander
 | VP, Research and Design
 | Equinox Software, Inc. / The Evergreen Experts
 | phone:  1-877-OPEN-ILS (673-6457)
 | email:  miker at esilibrary.com
 | web:  http://www.esilibrary.com