[Evergreen-general] Documentation Question - Library visibility on the web and SKOS
Chris Sharp
csharp at georgialibraries.org
Fri Sep 6 12:06:04 EDT 2024
It looks like this was authored by Jane Sandberg in 2017 (Git commit
9ee6e6ced0fc5e39ce6c7c925ffc269c6bae3522). I've attached the git diff for
reference.
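
If it helps with the "what release" question, one way to check is to ask git which tags contain a commit. Below is a rough sketch, not a tested procedure: it assumes a local clone of the Evergreen repository and release tags that sort sensibly by version (the `rel_X_Y_Z` naming is an assumption, so check `git tag` first), and `first_release_with` is just an illustrative helper name. Keep in mind that this commit only adds documentation, so the result shows when the page landed, not necessarily when the SKOS code itself was merged.

```shell
#!/bin/sh
# Hypothetical helper: print the earliest tag that contains a commit.
#   $1 = path to a local clone (assumed to have release tags fetched)
#   $2 = commit hash
# Relies on GNU sort's -V (version sort) so that 3_2 sorts before 3_10.
first_release_with() {
    git -C "$1" tag --contains "$2" | sort -V | head -n 1
}

# Example (the clone path is a placeholder):
# first_release_with ~/src/Evergreen 9ee6e6ced0fc5e39ce6c7c925ffc269c6bae3522
```

If the output is empty, the commit is not in any tagged release in that clone, which may just mean the tags have not been fetched.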
On Fri, Sep 6, 2024 at 11:00 AM Jennifer Pringle via Evergreen-general <
evergreen-general at list.evergreen-ils.org> wrote:
> Hello everyone,
>
> At our meeting yesterday DIG found an orphaned page (can only be found
> through search), the content of which none of us at the meeting know much
> about.
>
> https://docs.evergreen-ils.org/docs/latest/opac/visibility_on_the_web.html
>
> We’re looking for someone who can review this page and let us know if the
> page is valid and if so if there are any changes that need to be made to it.
>
> The SKOS Support section of the page says “This is an experimental,
> "under-the-hood" feature that will be built upon in subsequent releases.”
> but we couldn’t tell from git what release this feature was originally
> added in. We’re not sure if this is still an experimental feature or not.
>
> Thank you to anyone who can shed some light on this,
>
> Jennifer
>
> --
> Jennifer Pringle (she/her)
> Co-op Support - Training Lead
> BC Libraries Cooperative
> Toll-free: 1-888-848-9250
> Email: jennifer.pringle at bc.libraries.coop
> Website: http://bc.libraries.coop
>
> Gratefully acknowledging that I live and work in the unceded Traditional
> Territory of the St'at'yemc Nations.
--
Chris Sharp, PINES System Administrator
------------------------------
Georgia Public Library Service
2872 Woodcock Blvd, Suite 250 | Atlanta, GA 30341
(404) 235-7147 | csharp at georgialibraries.org
-------------- next part --------------
commit 9ee6e6ced0fc5e39ce6c7c925ffc269c6bae3522
Author: Jane Sandberg <sandbej at linnbenton.edu>
Date: Sun Sep 17 13:11:13 2017 -0700
Docs: adding info about TPAC microdata + linked data
Signed-off-by: Jane Sandberg <sandbej at linnbenton.edu>
diff --git a/docs/admin/sitemap_admin.adoc b/docs/admin/sitemap_admin.adoc
new file mode 100644
index 0000000000..50bcd82186
--- /dev/null
+++ b/docs/admin/sitemap_admin.adoc
@@ -0,0 +1,42 @@
+Running the sitemap generator
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+The `sitemap_generator` script must be invoked with the following argument:
+
+* `--lib-hostname`: specifies the hostname for the catalog (for example,
+ `--lib-hostname https://catalog.example.com`); all URLs will be generated
+ appended to this hostname
+
+Therefore, the following arguments are useful for generating multiple sitemaps
+per Evergreen instance:
+
+* `--lib-shortname`: limit the list of record URLs to those which have copies
+ owned by the designated library or any of its children;
+* `--prefix`: provides a prefix for the sitemap index file names
+
+Other options enable you to override the OpenSRF configuration file and the
+database connection credentials, but the default settings are generally fine.
+
+Note that on very large Evergreen instances, sitemaps can consume hundreds of
+megabytes of disk space, so ensure that your Evergreen instance has enough room
+before running the script.
+
+Sitemap details
+~~~~~~~~~~~~~~~
+
+The sitemap generator script includes located URIs as well as copies
+ listed in the `asset.opac_visible_copies` materialized view, and checks
+ the children or ancestors of the requested libraries for holdings as well.
+
+Scheduling
+~~~~~~~~~~
+To enable search engines to maintain a fresh index of your bibliographic
+records, you may want to include the script in your cron jobs on a nightly or
+weekly basis.
+
+Sitemap files are generated in the same directory from which the script is
+invoked, so a cron entry will look something like:
+
+------------------------------------------------------------------------
+12 2 * * * cd /openils/var/web && /openils/bin/sitemap_generator
+------------------------------------------------------------------------
+
diff --git a/docs/admin_initial_setup/designing_your_catalog.adoc b/docs/admin_initial_setup/designing_your_catalog.adoc
index 9a5b9d4dc4..22df88e261 100644
--- a/docs/admin_initial_setup/designing_your_catalog.adoc
+++ b/docs/admin_initial_setup/designing_your_catalog.adoc
@@ -800,74 +800,3 @@ The system doesn't need the file extension to know what kind of file it is.

 Reload the bib record summary in the web catalog and your new image will display.

-Sitemap generator
------------------
-A http://www.sitemaps.org[sitemap] directs search engines to the pages of
-interest in a web site so that the search engines can intelligently crawl
-your site. In the case of Evergreen, the primary pages of interest are the
-bibliographic record detail pages.
-
-The sitemap generator script creates sitemaps that adhere to the
-http://sitemaps.org specification, including:
-
-* limiting the number of URLs per sitemap file to no more than 50,000 URLs;
-* providing the date that the bibliographic record was last edited, so
- that once a search engine has crawled all of your sites' record detail pages,
- it only has to reindex those pages that are new or have changed since the last
- crawl;
-* generating a sitemap index file that points to each of the sitemap files.
-
-Running the sitemap generator
-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
-The `sitemap_generator` script must be invoked with the following argument:
-
-* `--lib-hostname`: specifies the hostname for the catalog (for example,
- `--lib-hostname https://catalog.example.com`); all URLs will be generated
- appended to this hostname
-
-Therefore, the following arguments are useful for generating multiple sitemaps
-per Evergreen instance:
-
-* `--lib-shortname`: limit the list of record URLs to those which have copies
- owned by the designated library or any of its children;
-* `--prefix`: provides a prefix for the sitemap index file names
-
-Other options enable you to override the OpenSRF configuration file and the
-database connection credentials, but the default settings are generally fine.
-
-Note that on very large Evergreen instances, sitemaps can consume hundreds of
-megabytes of disk space, so ensure that your Evergreen instance has enough room
-before running the script.
-
-Scheduling
-~~~~~~~~~~
-To enable search engines to maintain a fresh index of your bibliographic
-records, you may want to include the script in your cron jobs on a nightly or
-weekly basis.
-
-Sitemap files are generated in the same directory from which the script is
-invoked, so a cron entry will look something like:
-
-------------------------------------------------------------------------
-12 2 * * * cd /openils/var/web && /openils/bin/sitemap_generator
-------------------------------------------------------------------------
-
-Troubleshooting TPAC errors
----------------------------
-
-If there is a problem such as a TT syntax error, it generally shows up as an
-ugly server failure page. If you check the Apache error logs, you will probably
-find some solid clues about the reason for the failure. For example, in the
-following example, the error message identifies the file in which the problem
-occurred as well as the relevant line numbers.
-
-Example error message in Apache error logs:
-
-----
-bash# grep "template error" /var/log/apache2/error_log
-[Tue Dec 06 02:12:09 2011] [warn] [client 127.0.0.1] egweb: template error:
- file error - parse error - opac/parts/record/summary.tt2 line 112-121:
- unexpected token (!=)\n [% last_cn = 0;\n FOR copy_info IN
- ctx.copies;\n callnum = copy_info.call_number_label;\n
-----
-
diff --git a/docs/admin_initial_setup/troubleshooting_tpac.adoc b/docs/admin_initial_setup/troubleshooting_tpac.adoc
new file mode 100644
index 0000000000..583517144f
--- /dev/null
+++ b/docs/admin_initial_setup/troubleshooting_tpac.adoc
@@ -0,0 +1,19 @@
+Troubleshooting TPAC errors
+---------------------------
+
+If there is a problem such as a TT syntax error, it generally shows up as an
+ugly server failure page. If you check the Apache error logs, you will probably
+find some solid clues about the reason for the failure. For example, in the
+following example, the error message identifies the file in which the problem
+occurred as well as the relevant line numbers.
+
+Example error message in Apache error logs:
+
+----
+bash# grep "template error" /var/log/apache2/error_log
+[Tue Dec 06 02:12:09 2011] [warn] [client 127.0.0.1] egweb: template error:
+ file error - parse error - opac/parts/record/summary.tt2 line 112-121:
+ unexpected token (!=)\n [% last_cn = 0;\n FOR copy_info IN
+ ctx.copies;\n callnum = copy_info.call_number_label;\n
+----
+
diff --git a/docs/opac/sitemap.adoc b/docs/opac/sitemap.adoc
new file mode 100644
index 0000000000..e65663d8df
--- /dev/null
+++ b/docs/opac/sitemap.adoc
@@ -0,0 +1,18 @@
+Sitemap generator
+-----------------
+
+A http://www.sitemaps.org[sitemap] directs search engines to the pages of
+interest in a web site so that the search engines can intelligently crawl
+your site. In the case of Evergreen, the primary pages of interest are the
+bibliographic record detail pages.
+
+The sitemap generator script creates sitemaps that adhere to the
+http://sitemaps.org specification, including:
+
+* limiting the number of URLs per sitemap file to no more than 50,000 URLs;
+* providing the date that the bibliographic record was last edited, so
+ that once a search engine has crawled all of your sites' record detail pages,
+ it only has to reindex those pages that are new or have changed since the last
+ crawl;
+* generating a sitemap index file that points to each of the sitemap files.
+
diff --git a/docs/opac/visibility_on_the_web.adoc b/docs/opac/visibility_on_the_web.adoc
new file mode 100644
index 0000000000..0ed5c530c5
--- /dev/null
+++ b/docs/opac/visibility_on_the_web.adoc
@@ -0,0 +1,123 @@
+Library visibility on the Web
+-----------------------------
+
+Introduction
+~~~~~~~~~~~~
+
+Evergreen follows a number of best practices to
+make Library data integrate with the rest of the
+Web. Evergreen's public catalog pages are
+designed so that search engines can easily extract
+meaningful information about your library and
+collections. Evergreen is also preparing for an
+eventual shift toward linked open bibliographic
+data.
+
+Catalog data in search engines
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+
+Each record in the catalog is displayed to search
+engines using http://schema.org[schema.org] microdata.
+
+[IMPORTANT]
+Make sure your system administrator has not added
+a restrictive robots.txt file to your server.
+These files restrict search engines, up to the
+point of not allowing search engines to index your
+site at all.
+
+Details of the schema.org mapping
+^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+
+ * Each item is listed as a
+ http://schema.org/Offer[schema:Offer], which is
+ the same category that an online bookseller might
+ use to describe an item for sale. These Offers
+ are always listed with a price of $0.00.
+ * Subject headings are exposed as
+ http://schema.org/about[schema:about]
+ properties.
+ * Electronic resources are assigned a
+ http://schema.org/url[schema:url]
+ property, and any notes or link text
+ are assigned a
+ http://schema.org/description[schema:description]
+ property.
+ * Given a Library of Congress relator code for
+ 1xx and 7xx fields, Evergreen surfaces the URL
+ for that relator code along with the
+ http://schema.org/contributor[schema:contributor]
+ property to give machines a better chance
+ of understanding how the person or organization
+ actually contributed to this work.
+ * Linking out to related records:
+ ** Given an LCCN (010 field), Evergreen links to
+ the corresponding Library of Congress record
+ using http://schema.org/sameAs[schema:sameAs].
+ ** Given an OCLC number (035 field, subfield `a`
+ beginning with `(OCoLC)`), Evergreen links to
+ the corresponding WorldCat record using
+ http://schema.org/sameAs[schema:sameAs].
+ ** Given a URI (024 field, subfield 2 = `'uri'`),
+ Evergreen links to the corresponding OCLC
+ Work Entity record using
+ http://schema.org/exampleOfWork[schema:exampleOfWork].
+
+
+Viewing microdata
+^^^^^^^^^^^^^^^^^
+You can learn more about how Evergreen publicizes
+these data by viewing them directly. The
+http://linter.structured-data.org[structured data linter]
+is a helpful tool for viewing microdata.
+
+. Using your favorite Web browser, navigate to a
+ record in your public catalog.
+. Copy the URL that displays in your browser's
+ address bar.
+. Go to http://linter.structured-data.org
+. Under the _Lint by URL_ tab, paste your URL
+ into the text box.
+. Click _Submit_
+
+Other helpful features for search engines
+^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+ * Titles of catalog pages follow a
+ "Page title - Library name" pattern to provide
+ specific titles in search engine results pages,
+ browser bookmarks, and browser tabs.
+ * Links that robots should not crawl, such as search
+ result links, are marked with the
+ https://support.google.com/webmasters/answer/96569?hl=en[@rel="nofollow"]
+ property.
+ * Catalog pages for record details and for library
+ descriptions express a
+ https://support.google.com/webmasters/answer/139066?hl=en[@rel="canonical"]
+ link to simplify the number of variations of page
+ URLs that could otherwise have been derived from
+ different search parameters.
+ * Catalog pages that do not exist return a proper
+ 404 "HTTP_NOT_FOUND" HTTP status code, and record
+ detail pages for records that have been deleted
+ now return a proper 410 "HTTP_GONE" HTTP status code.
+ * Record detail and library pages include
+ http://ogp.me/[Open Graph Protocol] markup.
+ * Each library has its own page at
+ _http://localhost/eg/opac/library/LIBRARY_SHORTNAME_
+ that provides machine-readable hours and contact
+ information.
+
+SKOS support
+~~~~~~~~~~~~
+
+Some vocabularies used (or which could be used) for
+stock record attributes and coded value maps in Evergreen
+are published on the web using SKOS. The record
+attributes system can now associate Linked Data URIs
+with specific attribute values. In particular, seed data
+supplying URIs for the RDA Content Type, Media Type, and
+Carrier Type has been added.
+
+This is an experimental, "under-the-hood" feature that
+will be built upon in subsuquent releases.
+
diff --git a/docs/root_command_line_admin.adoc b/docs/root_command_line_admin.adoc
index cb046fdc02..6fea46b490 100644
--- a/docs/root_command_line_admin.adoc
+++ b/docs/root_command_line_admin.adoc
@@ -104,6 +104,12 @@ include::admin/template_toolkit.adoc[]

 include::admin_initial_setup/designing_your_catalog.adoc[]

+include::opac/sitemap.adoc[]
+
+include::admin/sitemap_admin.adoc[]
+
+include::admin_initial_setup/troubleshooting_tpac.adoc[]
+
 :leveloffset: 0

 include::admin/audio_alerts.adoc[]
diff --git a/docs/root_opac.adoc b/docs/root_opac.adoc
index d495726203..863688f9e3 100644
--- a/docs/root_opac.adoc
+++ b/docs/root_opac.adoc
@@ -16,7 +16,8 @@ workers in public services roles.
 It is organized into Parts, Chapters, and Sections addressing key
 aspects of the software.

-Copies of this guide can be accessed in PDF and HTML formats from http://docs.evergreen-ils.org/.
+Copies of this guide can be accessed in PDF and HTML formats from
+http://docs.evergreen-ils.org/.



@@ -44,8 +45,15 @@ include::opac/kids_opac.adoc[]

 include::opac/opensearch.adoc[]

+include::opac/visibility_on_the_web.adoc[]
+
 :leveloffset: 0

+include::opac/sitemap.adoc[]
+
+See the Command Line System Administration Manual for details about
+running this script.
+
 include::shared/attributions.adoc[]

 include::shared/end_matter.adoc[]