<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Aurlog &#187; libraries</title>
	<atom:link href="http://www.aurochs.org/aurlog/categories/libraries/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.aurochs.org/aurlog</link>
	<description>Cows and stuff</description>
	<lastBuildDate>Sun, 08 Jan 2012 22:41:32 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Cataloguing coding</title>
		<link>http://www.aurochs.org/aurlog/2012/01/08/cataloguing-coding/</link>
		<comments>http://www.aurochs.org/aurlog/2012/01/08/cataloguing-coding/#comments</comments>
		<pubDate>Sun, 08 Jan 2012 16:50:00 +0000</pubDate>
		<dc:creator>Orangeaurochs</dc:creator>
				<category><![CDATA[cataloguing]]></category>
		<category><![CDATA[catcode]]></category>
		<category><![CDATA[codeyear]]></category>
		<category><![CDATA[coding]]></category>
		<category><![CDATA[javascript]]></category>
		<category><![CDATA[perl]]></category>
		<category><![CDATA[php]]></category>
		<category><![CDATA[programming]]></category>

		<guid isPermaLink="false">http://www.aurochs.org/aurlog/?p=708</guid>
		<description><![CDATA[<p>I am not a trained programmer, coding is not part of my job description, and I have little direct access to cataloguing and metadata databases at work outside of normal catalogue editing and talking to the systems team, but I thought it might be worth making the point of how useful programming can be in [...]]]></description>
			<content:encoded><![CDATA[<p>I am not a trained programmer, coding is not part of my job description, and I have little direct access to cataloguing and metadata databases at work outside of normal catalogue editing and talking to the systems team, but I thought it might be worth making the point of how useful programming can be in all sorts of little ways. Of course, the most useful way is in gaining an awareness of how computers work, appreciating why some things might be more tricky than others for the systems team to implement, seeing why MARC21 is a bastard to do anything with even if editing it in a cataloguing module is not really that bad, and how the new world of FRDABRDF is going to be glued together. However, some more practical examples that I managed to cobble together include:</p>
<ul>
<li><strong><a href="http://www.aurochs.org/aurlog/2011/06/24/customizing-classification-web-with-greasemonkey/">Customizing Classification Web with Greasemonkey</a></strong>. This is a couple of short scripts using Javascript, which is what the default <a href="http://www.codecademy.com/">Codeacademy</a> lessons use. Javascript is designed for browers and is a good one to start with as you can do something powerful very quickly with a short script or even a couple of lines (think of all the 90s image rollovers). It&#8217;s also easy to have a go if you don&#8217;t have your own server, or even if you&#8217;re confined to your own PC.</li>
<li><strong>Aleph-formatted country and language codes</strong>. I wrote a small PHP script to read the XML files for the MARC21 <a href="http://www.loc.gov/standards/codelists/languages.xml">language</a> and <a href="http://www.loc.gov/standards/codelists/countries.xml">country</a> codes and convert them into an up to date list of preferred codes in a format that Aleph can read, basically a text file which needs line breaks and spaces in the right places. It is easy to tweak or run again in the event of any minor changes. I don&#8217;t have this publicly available anywhere though. PHP is not the most elegant language but is relatively easy to dip into if you ever want to go beyond Javascript and do more fancy things, although it can be harder to get access to a server running PHP.</li>
<li><strong><a href="http://www.aurochs.org/mrc_viewer/mrc_viewer.php">MARC21 .mrc file viewer</a></strong>. I occasionally need to quickly look at raw .mrc files to assess their quality and to figure out what batch changes we want to make before importing them into our catalogue. This is an attempt to create something that I could copy and paste snippets of .mrc files into for a quick look. It is written in PHP and is still under construction. There are other better tools for doing much the same thing to be honest, but coding this myself has had the advantages of forcing me to see how a MARC21 file is put together and realising how fiddly it can be. Try this with an .mrc which has some large 520 or 505 fields in it (there are some zipped ones <a href="http://www.rsc.org/Publishing/eBooks/marc21records.asp">here</a>, to pick at random) and watch the indicators mysteriously degrade thereafter. I <em>will</em> get to the bottom of this&#8230;</li>
</ul>
<p>The following examples are less useful for my own practical purposes but have been invaluable for learning about metadata and cataloguing, in particular, RDF/linked data. I was very interested in LD when I first heard about it. Being able to actually try something out with it (even if the results are not mind-blowing) rather than just read about it, has been very useful. Both are written in PHP and further details are available from the links:</p>
<ul>
<li><strong><a href="http://www.aurochs.org/aurlog/2011/07/31/lodopac-simple-linked-open-data-opac/">Lodopac: Linked Open Data OPAC</a></strong>. My entry for the UK Discovery <a href="http://discovery.ac.uk/developers/competition/">Developer Competition</a>.</li>
<li><a title="Permanent Link to In Our Time booklist" rel="bookmark" href="../2011/02/16/in-our-time-booklist/"><strong>In Our Time booklist</strong>.</a> A forerunner to Lodopac, written in response to an idea raised by <a href="http://ideas.okfn.org/users/23/psychemedia/">psychemedia</a> for the <a href="http://openbiblio.net/challenge/">Open Bibliographic Data Challenge</a>: the <a href="http://ideas.okfn.org/ideas/27/bbc-in-our-time-reading-list">BBC “In Our Time” Reading List</a>. Sadly it no longer works as the BNB service it relies upon was taken down.</li>
</ul>
<p>Nothing to do with cataloguing, but what I am most proud of is this, written in Javascript: <a href="http://www.aurochs.org/games/cowthello/">Cowthello</a>. Let me know if you beat it.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.aurochs.org/aurlog/2012/01/08/cataloguing-coding/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Lodopac example searches</title>
		<link>http://www.aurochs.org/aurlog/2011/09/15/lodopac-example-searches/</link>
		<comments>http://www.aurochs.org/aurlog/2011/09/15/lodopac-example-searches/#comments</comments>
		<pubDate>Thu, 15 Sep 2011 09:03:18 +0000</pubDate>
		<dc:creator>Orangeaurochs</dc:creator>
				<category><![CDATA[cataloguing]]></category>
		<category><![CDATA[british national bibliography]]></category>
		<category><![CDATA[cambridge university library]]></category>
		<category><![CDATA[devsci]]></category>
		<category><![CDATA[discodev]]></category>
		<category><![CDATA[libraries]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[linked open data]]></category>
		<category><![CDATA[lodopac]]></category>
		<category><![CDATA[rdf]]></category>
		<category><![CDATA[sparql]]></category>

		<guid isPermaLink="false">http://www.aurochs.org/aurlog/?p=699</guid>
		<description><![CDATA[<p>Yay, my entry for the Discovery &#38; DevCSI Developers Competition- Lodopac- was awarded a commendation for its use of the Cambridge University Library (CUL) dataset. During the judging I was asked for searches which were known to work well- the timeout issues I discussed under Limitations being not insignificant, especially with author or title searches. [...]]]></description>
			<content:encoded><![CDATA[<p>Yay, my entry for the <a href="http://discovery.ac.uk/developers/competition/">Discovery &amp; DevCSI Developers Competition</a>- <a href="http://www.aurochs.org/lodopac/lodopac.php">Lodopac</a>- was awarded a commendation for its use of the <a href="http://discovery.ac.uk/packages/api/demos/paclist.php">Cambridge University Library (CUL) dataset</a>. During the judging I was asked for searches which were known to work well- the timeout issues <a href="http://www.aurochs.org/aurlog/2011/07/31/lodopac-simple-linked-open-data-opac/">I discussed</a> under Limitations being not insignificant, especially with author or title searches. I submitted a version of the following brief general notes which I hope are helpful to anyone else who wants to play:</p>
<p>The British National Bibliography (BNB) server is generally more responsive than the Cambridge University Library one; title seems to  work better than author. The following are hopefully useful examples useful:</p>
<ul>
<li><a href="../../lodopac/lodopac.php?bnb=true&amp;cam=true&amp;author=fisher&amp;title=&amp;date=&amp;isbn=&amp;limit=5" target="_blank">author=&#8221;fisher&#8221;</a> seems to work (at least in BNB) so long as the number of hits to return is small (e.g. 5).</li>
<li><a href="../../lodopac/lodopac.php?bnb=true&amp;cam=true&amp;author=&amp;title=fisher&amp;date=&amp;isbn=&amp;limit=5" target="_blank">title=&#8221;fisher&#8221;</a> works well for both.</li>
<li><a href="../../lodopac/lodopac.php?bnb=true&amp;cam=true&amp;author=&amp;title=&amp;date=&amp;isbn=0709039425&amp;limit=3" target="_blank">ISBN=&#8221;0709039425&#8243;</a> should very quickly give you a single hit in both BNB and CUL.</li>
</ul>
<p>I would really like to try and think of ways of improving free text regular expression search times for things like author and title in Sparql* although I doubt there is one that doesn&#8217;t rely on the configuration, processing power, or indexing of the server being searched.</p>
<p>* thinking aloud, some ideas might include: downloading a larger imprecise set for further local searching (e.g. for an author/title search downloading the title matches and searching the authors locally: although this would also be slow, it would get round the timeout at least); forcing a look-up in a controlled vocab first in order to get an exact string match (esp for authors, although even if this is possible, this forces a user to do more work, which isn&#8217;t the point);  local indexing of the triple store (this is probably the best way but I&#8217;m not sure how to go about it, whether I really have the server capabilities to do it, and can be committed to the updating required).</p>
]]></content:encoded>
			<wfw:commentRss>http://www.aurochs.org/aurlog/2011/09/15/lodopac-example-searches/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Sparql recipes for bibliographic data</title>
		<link>http://www.aurochs.org/aurlog/2011/08/05/sparql-recipes-for-bibliographic-data/</link>
		<comments>http://www.aurochs.org/aurlog/2011/08/05/sparql-recipes-for-bibliographic-data/#comments</comments>
		<pubDate>Fri, 05 Aug 2011 14:06:00 +0000</pubDate>
		<dc:creator>Orangeaurochs</dc:creator>
				<category><![CDATA[cataloguing]]></category>
		<category><![CDATA[british national bibliography]]></category>
		<category><![CDATA[cambridge university library]]></category>
		<category><![CDATA[libraries]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[linked open data]]></category>
		<category><![CDATA[lodopac]]></category>
		<category><![CDATA[rdf]]></category>
		<category><![CDATA[sparql]]></category>

		<guid isPermaLink="false">http://www.aurochs.org/aurlog/?p=677</guid>
		<description><![CDATA[<p>One of the difficulties in searching RDF data is knowing what the data looks like. For instance, finding a book by its title means knowing something about what how a dataset has recorded the relationship between a book and its title. There is no real standard for publishing MARC/AACR2-style bibliographic data as RDF: it seems [...]]]></description>
			<content:encoded><![CDATA[<p>One of the difficulties in searching RDF data is knowing what the data looks like. For instance, finding a book by its title means knowing something about what how a dataset has recorded the relationship between a book and its title. There is no real standard for publishing MARC/AACR2-style bibliographic data as RDF: it seems libraries publishing RDF are approaching this largely individually, although they are using many of the same vocabularies, <a href="http://dublincore.org/">dc</a>, <a href="http://bibliontology.com/">bibo</a>, etc. This was one reason why I wanted to create <a href="http://www.aurochs.org/lodopac/lodopac.php">Lodopac</a>: to present some kind of interface so that searchers didn&#8217;t need to know these different models but could start to explore them. Below are the Sparql recipes for the different search criteria I used for the <a href="http://www.bl.uk/bibliographic/datafree.html">BNB</a> and the <a href="http://data.lib.cam.ac.uk/">Cambridge University Library</a> datasets, so they can be compared, re-used, or corrected. All examples use prefixes, which are defined anew in each example. The examples are of course fragments and don&#8217;t have all the necessary SELECT and WHERE clauses.</p>
<p>By the way, for an excellent Sparql tutorial with ample opportunity to play as you go along, do have a look at the Cambridge University Library&#8217;s <a href="http://data.lib.cam.ac.uk/sparql.php">SPARQL tutorial</a>. It also gives clues to the way their data is structured. Of use for the BNB is their <a href="http://www.bl.uk/bibliographic/pdfs/datamodelv1_01.pdf">data model</a> (PDF), which is not nearly as scary as it looks at first, and incredibly helpful.</p>
<p><strong>Author keyword search</strong></p>
<p>This would be relatively straightforward-the unavoidable regular expression being the main complication- but for the fact that the traditional author/editor/etc of bibliographic records can be found in dc:creator as well as dc:contributor which necessitates a UNION. The BNB used foaf:name:</p>
<blockquote><p>PREFIX dc: &lt;http://purl.org/dc/terms/&gt;<br />
PREFIX foaf: &lt;http://xmlns.com/foaf/0.1/&gt;</p>
<p>{?book dc:creator ?author} UNION {?book dc:contributor ?author} .<br />
?author foaf:name ?name .<br />
FILTER regex(?name, &#8220;author&#8221;, &#8220;i&#8221;) .</p></blockquote>
<p>Cambridge uses much the same recipe except that it uses rdfs:label instead of foaf:name:</p>
<blockquote><p>PREFIX dc: &lt;http://purl.org/dc/terms/&gt;<br />
PREFIX rdfs: &lt;http://www.w3.org/2000/01/rdf-schema#&gt;</p>
<p>{?book dc:creator ?author} UNION {?book dc:contributor ?author} .<br />
?author rdfs:label ?name .<br />
FILTER regex(?name, &#8220;author&#8221;, &#8220;i&#8221;) .</p></blockquote>
<p><strong>Title keyword searches</strong></p>
<p>This is more straightforward and is in fact the same for both the BNB and Cambridge University Library:</p>
<blockquote><p>PREFIX dc: &lt;http://purl.org/dc/terms/&gt;</p>
<p>?book dc:title ?title .<br />
FILTER regex(?title, &#8220;title&#8221;, &#8220;i&#8221;) .</p></blockquote>
<p><strong>Date of publication (year)</strong></p>
<p>I imagined this one being simple and for Cambridge University Library it is. However the BNB took some unravelling as they have modelled publication as an event related to a book. The various elements of publication are then related to the event. So, for the BNB we have this:</p>
<blockquote><p>PREFIX bibliographic: &lt;http://data.bl.uk/schema/bibliographic#&gt;<br />
PREFIX event: &lt;http://purl.org/NET/c4dm/event.owl#&gt;<br />
PREFIX rdfs: &lt;http://www.w3.org/2000/01/rdf-schema#&gt;</p>
<p>?book bibliographic:publication ?pub .<br />
?pub event:time ?year .<br />
?year rdfs:label &#8220;date&#8221; .</p></blockquote>
<p>By contrast, Cambridge University Library has it in one line:</p>
<blockquote><p>PREFIX dc: &lt;http://purl.org/dc/terms/&gt;</p>
<p>?book dc:created &#8220;date&#8221; .</p></blockquote>
<p><strong>ISBN</strong></p>
<p>As an identifier, ISBN is relatively straightforward in both models, although care must be taken with the BNB as 10 and 13 digit ISBNs are treated as separate properties and the following assumes that the search will cover both:</p>
<blockquote><p>PREFIX bibo: &lt;http://purl.org/ontology/bibo/&gt;</p>
<p>{?book bibo:isbn10 &#8220;isbn&#8221;} UNION {?book bibo:isbn13 &#8220;isbn&#8221;} .</p></blockquote>
<p>For Cambridge University Library, also using the bibo ontology, this is:</p>
<blockquote><p>PREFIX bibo: &lt;http://purl.org/ontology/bibo/&gt;</p>
<p>?book bibo:isbn &#8220;isbn&#8221; .</p></blockquote>
<p><strong>Conclusion</strong></p>
<p>I didn&#8217;t set up to provide ground-breaking conclusions. However, it is remarkable how different data models can be formulated for modelling the same type of data by similar organisations. The real question is whether this is a good, a bad thing, or doesn&#8217;t really mattter. Will it need to be standardised? My understanding of how this works is probably not. I think the days of monolithic library standards are probably now gone. I wonder, for instance, if there ever will be a single MARC22 (or whatever you like to call it) and doubt RDA will ever completely replace AACR2 in the way we imagine. What will emerge I suspect will be a soup of various standards and data models, some of which will be more prevalent. One thing I picked up from various linked data talks is that information has frequently been published then re-used in ways that the issuers never imagined; if that is the case, the precise modelling and format is probably not as important as the fact that it is of good quality and intelligently put together. The BNB and Cambridge University Library models are clearly quite different but quite capable of being mapped and used despite this.</p>
<p>If there are any other bibliographic data Sparql endpoints, I would like to include them in a future version of the Lodopac search. Do let me know if you come across them.</p>
<p>More mundanely, do say if there are errors in my Sparql recipes or if there are ways they could be done more efficiently.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.aurochs.org/aurlog/2011/08/05/sparql-recipes-for-bibliographic-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Lodopac : simple Linked Open Data OPAC</title>
		<link>http://www.aurochs.org/aurlog/2011/07/31/lodopac-simple-linked-open-data-opac/</link>
		<comments>http://www.aurochs.org/aurlog/2011/07/31/lodopac-simple-linked-open-data-opac/#comments</comments>
		<pubDate>Sun, 31 Jul 2011 20:34:25 +0000</pubDate>
		<dc:creator>Orangeaurochs</dc:creator>
				<category><![CDATA[cataloguing]]></category>
		<category><![CDATA[arc2]]></category>
		<category><![CDATA[british national bibliography]]></category>
		<category><![CDATA[cambridge university library]]></category>
		<category><![CDATA[devsci]]></category>
		<category><![CDATA[discodev]]></category>
		<category><![CDATA[libraries]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[linked open data]]></category>
		<category><![CDATA[lodopac]]></category>
		<category><![CDATA[rdf]]></category>
		<category><![CDATA[sparql]]></category>

		<guid isPermaLink="false">http://www.aurochs.org/aurlog/?p=673</guid>
		<description><![CDATA[<p>Lodopac is my entry to the UK Discovery Developer Competition. Aside from obvious mocking of the name, comments on Lodopac are very welcome. If anyone  installs it locally, I&#8217;d also be very interested to know.</p>
<p>About Lodopac</p>
<p>Lodopac is a simple linked open data opac using Sparql to search remote bibliographic RDF data.   By default [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.aurochs.org/lodopac/lodopac.php">Lodopac</a> is my entry to the UK Discovery <a href="http://discovery.ac.uk/developers/competition/">Developer Competition</a>. Aside from obvious mocking of the name, comments on Lodopac are very welcome. If anyone  installs it locally, I&#8217;d also be very interested to know.</p>
<p><strong>About Lodopac</strong></p>
<p>Lodopac is a simple linked open data opac using Sparql to search remote bibliographic RDF data.   By default it is set up to search the <a href="http://www.bl.uk/bibliographic/datafree.html">BNB</a> and <a href="http://data.lib.cam.ac.uk/">Cambridge University Library</a> datasets, but is designed to allow easy setup of additional datasets with Sparql endpoints (see <a href="../../lodopac/installation">Installation, source code, and configuration</a> below). It was written in response to the UK Discovery <a href="http://discovery.ac.uk/developers/competition/">Developer Competition</a>.</p>
<p>The purpose of Lodopac is to provide a simple standard OPAC-style  interface to perform searches    of various bibliographic RDF datasets without having to know how to  formulate a Sparql query and   without having to know the structure of the database. I hope it is  especially useful for people wanting   to get a grip on how bibliographic RDF is put together, what it looks  like, and what a Sparql query looks like. For example, an author search  is possible without knowing about dc:creator and dc:contributor, or how  these need to be linked together in a Sparql search. Similarly, a  searcher wouldn&#8217;t need to worry about how to construct date searches in  different datasets. For the BNB and CUL, these are very different (three  lines in the BNB, one for CUL), but in Lodopac, there is only search  box to search both. Lodopac displays the Sparql query it constructed to  perform the search, as well as the combined RDF for all results found in  XML, JSON, N3, and TTL.</p>
<p><strong>How to search Lodopac</strong></p>
<p>Select one or more of the available datasets using the checkboxes.</p>
<p><em>Author</em> and <em>Title</em> searches are free text phrase searches. In other words, a string you   search for will match with   any exact match, including spacing and punctuation, and in the middle of words. E.g. searching for &#8220;shake&#8221; will match   &#8220;Shakespeare&#8221;, &#8220;milkshakes&#8221;, and &#8220;More hits that you can shake a stick at&#8221;. Searching is case insensitive. The   following punctuation is removed from searches: <em>\&#8221;&#8216;&lt;&gt;$^%</em>.</p>
<p>You are strongly advised to keep author and title searches simple: e.g. one word of a title or a surname only.</p>
<p><em>ISBN</em> searches 10 or 13 digit ISBNs. Any dashes or other non-digit characters are stripped from the search.</p>
<p><em>Date</em> search will accept a year.</p>
<p><strong>N.B.</strong> Keep searches as simple as possible, especially with author and title searches,    to avoid them timing out. ISBN and date searches are generally quicker.</p>
<p><strong>Limitations</strong></p>
<p>A bad workman blames his tools and I&#8217;m no exception. The greatest  limitation is the time taken by Sparql endpoints to perform a Sparql  query, especially one that involves    a regular expression, such as the Author and Title searches. What is  needed is some more robust indexing or some cheat like Virtuoso&#8217;s  unorthodox bif:contains, which the old version of the linked data BNB  used. I touched on this in a <a href="../2011/02/16/in-our-time-booklist/">blog post</a> about the In Our Time Booklist script I wrote (see section 6).</p>
<p>The load and current capacity on the Sparql endpoints at the time  the query is made is another important factor. A search which times    out one minute can work fine the next.</p>
<p>The search options are obviously limited but do I hope represent the most common methods of searching     normal library catalogues aside from, of course, a general keywords search. The manipulation of results is also    rather sparse but allows click through to full data associated with a book, the structure and contents of which    can be more fully explored. The aggregation of RDF data in various formats is I hope useful illustratively    as well as having potential for further manipulation.</p>
<p><strong>Installation, source code, and configuration</strong></p>
<p><a name="installation"></a>The source code for Lodopac is available as a <a href="../../lodopac/lodopac.zip">zip file</a>, which contains all the necessary PHP, Javascript, and CSS files.  In addition, you will need to install <a href="https://github.com/semsol/arc2/wiki">ARC2</a>, which makes the Sparql queries and manipulates the resultant RDF.  Edit the first line of lodopac.php so that it points at your local installation of ARC2.</p>
<p>The programme is basically one long script- there is only one page-  but is split for convenience of editing. The key file is lodopac.php  which includes the other files as it goes along. The main core of the  script which builds the queries and does the searching is all in  lodopac.php.</p>
<p>I have attempted to make the script as easily configurable as  possible so that additional Sparql endpoints can be added. There are  probably more components hard-coded into the script that I have  overlooked, but all the setup for the endpoints is in the file  setup_endpoints.php. The first part of this file is a list of necessary  prefixes that are needed for any possible queries from any of the  endpoints and, although not ideal, all these prefixes are sent with any  Sparql query. Following that and the declaration of an array of the  endpoints, each endpoint has a dedicated block with the information  added to a hash. To add another endpoint, duplicate a block and  configure the search recipes as appropriate. The keys marked &#8220;brief_&#8221;  are used to fetch information for the brief results display. I have  conspicuously chickened out of providing an author and the attendant  main entry and multiple author headaches involved.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.aurochs.org/aurlog/2011/07/31/lodopac-simple-linked-open-data-opac/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Customizing Classification Web with Greasemonkey</title>
		<link>http://www.aurochs.org/aurlog/2011/06/24/customizing-classification-web-with-greasemonkey/</link>
		<comments>http://www.aurochs.org/aurlog/2011/06/24/customizing-classification-web-with-greasemonkey/#comments</comments>
		<pubDate>Fri, 24 Jun 2011 10:51:39 +0000</pubDate>
		<dc:creator>Orangeaurochs</dc:creator>
				<category><![CDATA[cataloguing]]></category>
		<category><![CDATA[classification web]]></category>
		<category><![CDATA[greasemonkey]]></category>
		<category><![CDATA[javascript]]></category>
		<category><![CDATA[libraries]]></category>

		<guid isPermaLink="false">http://www.aurochs.org/aurlog/?p=668</guid>
		<description><![CDATA[<p>Classification Web is ace, but there are a couple of things about the interface that annoy me and, in one colleague&#8217;s case, seriously put him off using it, in particular:</p>

The opening of a new tab/window when you click on the MARC view for a subject or name.
The confusing menu. We don&#8217;t use LCC or DDC, [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://classificationweb.net/">Classification Web</a> is ace, but there are a couple of things about the interface that annoy me and, in one colleague&#8217;s case, seriously put him off using it, in particular:</p>
<ul>
<li>The opening of a new tab/window when you click on the MARC view for a subject or name.</li>
<li>The confusing menu. We don&#8217;t use LCC or DDC, and the browse options don&#8217;t really add much, so we only really need two options: <em>Search LC Subject Headings</em> and <em>Search LC Name Headings</em>.</li>
</ul>
<p>I managed to work out a simple way of modifying how Classification Web works on Firefox using the <a href="https://addons.mozilla.org/en-US/firefox/addon/greasemonkey/">Greasemonkey</a> add-on and a couple of simple scripts, all of which is quick and easy to install:</p>
<ol>
<li>Install Greasemonkey: <a href="https://addons.mozilla.org/en-US/firefox/addon/greasemonkey/">https://addons.mozilla.org/en-US/firefox/addon/greasemonkey/</a></li>
<li>Make sure the monkey in the bottom-right corner is happy and colourful. Click on it if not.</li>
<li>If you want to prevent the MARC view opening a new window, install the <strong>classweb_no_new_ window</strong> script by going to <a href="http://www.aurochs.org/zlib/js/userjs/classweb_no_new_window.user.js">http://www.aurochs.org/zlib/js/userjs/classweb_no_new_window.user.js</a> then</li>
<li>Click on the Install button</li>
<li>If you want to reduce the main menu, install <strong>classweb_prune_menu</strong> script by going to <a href="http://www.aurochs.org/zlib/js/userjs/classweb_prune_menu.user.js">http://www.aurochs.org/zlib/js/userjs/classweb_prune_menu.user.js</a> then</li>
<li>Click on Install button</li>
<li>Reload/refresh Classweb if it&#8217;s still open and it should work.</li>
</ol>
<p>If you want to turn Greasemonkey off altogether, click on the monkey so he&#8217;s sad and grey. If you want to stop individual scripts, right click on the monkey, click on <em>Manage User Scripts</em>, select a script from the list, and un-tick the <em>Enabled</em> box in the lower left corner.</p>
<p>These instructions were tested on Firefox 3.5.3 although I imagine they would be fine on any recent version of Firefox. I would be interested to hear anything confirming or undermining that assertion.</p>
<p>If you&#8217;re happy to play around, these scripts can be further altered. In particular, you can choose which menu items appear in the pruned menu script:</p>
<ol>
<li>Right click on the monkey</li>
<li>Click on <em>Manage User Scripts</em></li>
<li>Select <em>classweb_prune_menu</em> from the list</li>
<li>Click on <em>Edit</em> (you will probably have to select a text editor at this point)</li>
<li>Edit the list of pages under the line <strong>var menu_items_to_keep = Array (</strong>. Enter each page you want to appear on the menu on a separate line in quotes, with a comma at the end of each line <strong>except</strong> the last line. The menu item must appear exactly as it does on the Classification Web menu, including capitals. E.g., the default set up looks like this:<br />
<blockquote>
<pre>var menu_items_to_keep = Array ( // end each line with a comma except the last line
  "Search LC Subject Headings",
  "Search LC Name Headings"
);</pre>
</blockquote>
</li>
<li>Save the file, and reload Classification Web.</li>
</ol>
<p>If anyone else finds this useful or can think of more customizations let me know.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.aurochs.org/aurlog/2011/06/24/customizing-classification-web-with-greasemonkey/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>RDA as a closed standard</title>
		<link>http://www.aurochs.org/aurlog/2011/03/23/rda-as-a-closed-standard/</link>
		<comments>http://www.aurochs.org/aurlog/2011/03/23/rda-as-a-closed-standard/#comments</comments>
		<pubDate>Wed, 23 Mar 2011 12:16:32 +0000</pubDate>
		<dc:creator>Orangeaurochs</dc:creator>
				<category><![CDATA[rda]]></category>
		<category><![CDATA[aacr2]]></category>
		<category><![CDATA[cataloguing]]></category>
		<category><![CDATA[open standards]]></category>
		<category><![CDATA[rda toolkit]]></category>

		<guid isPermaLink="false">http://www.aurochs.org/aurlog/?p=638</guid>
		<description><![CDATA[<p>Resource Description and Access (RDA), the new bibliographic standard to replace AACR2, was released in 2010 on the web as a closed standard sitting behind a paywall. This really worries me. I strongly believe it should be an open standard.</p>
<p>What do I mean by closed?
By closed I basically mean that you have to pay or [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.rdatoolkit.org/">Resource Description and Access</a> (RDA), the new bibliographic standard to replace <a href="http://www.aacr2.org/">AACR2</a>, was released in 2010 on the web as a closed standard sitting behind a paywall. This really worries me. I strongly believe it should be an open standard.</p>
<p><strong>What do I mean by closed?</strong><br />
By <em>closed</em> I basically mean that you have to pay or subscribe to access it. In many ways, this is not dissimilar to AARC2. For decades, libraries (and individuals) paid for various editions of AACR2, which has always been primarily a print product, as well as for various updates when it changed to looseleaf format. Recently it has also been available on the web via Cataloger&#8217;s Desktop. RDA is primarily a web product via the <a href="http://www.rdatoolkit.org/">RDA Toolkit</a>, although a concession was eventually made to release it in print as well.</p>
<p>An open standard would be one that, according to <a href="http://en.wikipedia.org/wiki/Open_standards">Wikipedia</a>, &#8220;is publicly available&#8221; even if it &#8220;has various rights to use associated with it&#8221;. This would be one which any cataloguer, librarian, or crucially, non-librarian, could see and benefit from.  A practical definition for the purposes of this post would be a standard I could go and look at right now without subscription. I can if I wish apply the standard without hindrance, I can assess it with ease, and, ideally, build on the standard without restriction. RDA is not open, although to be fair, a part of RDA has been released openly, the <a href="http://metadataregistry.org/rdabrowse.htm">element set and vocabularies</a>.</p>
<p><strong>Other open standards</strong><br />
Open (publicly available) standards are quite common. Some well known-examples:</p>
<ul>
<li> Data: <a href="http://www.w3.org/TR/REC-xml/">XML</a>, <a href="http://dublincore.org/">Dublin Core</a>, <a href="http://www.w3.org/standards/techs/rdf#w3c_all">RDF</a>, <a href="http://www.loc.gov/standards/mods/">MODS</a>, <a href="http://www.loc.gov/standards/mads/">MADS</a>, <a href="http://www.loc.gov/marc/">MARC21</a>, <a href="http://www.ifla.org/en/publications/international-standard-bibliographic-description">ISBD</a></li>
<li> Documents: <a href="http://www.w3.org/TR/html401/">HTML</a>, <a href="http://www.adobe.com/devnet/pdf/pdf_reference.html">PDF</a>, <a href="http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=43485">ODF</a>,</li>
<li> Images: <a href="http://www.w3.org/TR/PNG/">PNG</a></li>
</ul>
<p>The following are open although I&#8217;d have a lawyer to hand if you wanted to do anything with them:</p>
<ul>
<li>Documents: <a href="http://msdn.microsoft.com/en-us/library/cc313118.aspx">DOC</a>,</li>
<li>Images: <a href="http://www.w3.org/Graphics/JPEG/itu-t81.pdf">JPG</a>, <a href="http://www.w3.org/Graphics/GIF/spec-gif89a.txt">GIF</a></li>
</ul>
<p>Some closed ones for comparision:</p>
<ul>
<li> Data: <a href="http://www.aacr2.org/">AACR2</a>, <a href="http://www.rdatoolkit.org/">RDA</a></li>
</ul>
<p>These are of course more subjective lists than they look, but you get the idea. The closed list was actually bigger until upon examination I found that JPG, GIF, and even MS Office standards are publicly readable even if I&#8217;m not sure what more you could legally do with them. I&#8217;d be happy to add more to the closed list to balance things out a little.</p>
<p><strong>Why is RDA not open?</strong><br />
Money. This is a delicate matter that I don&#8217;t want to delve into too much although it is obviously central to the openness of the standard. It&#8217;s also hard to talk about without appearing to make wild assertions, and I hope I haven&#8217;t been unfair. I&#8217;ve heard Alan Danskin of the JSC explaining that they&#8217;d thought about releasing RDA openly but that they had to cover costs. I&#8217;m not exactly sure what the costs of production were, although presumably included expenses and staff costs, and production of the product itself. The last is I think unfortunate as I would like to have seen a far simpler publication of RDA without all the bells and whistles, login barriers, and the need to learn a new interface as well as a new standard. Compare with the <a href="http://www.w3.org/TR/html4/">HTML4 standard</a> which is a set of simple HTML documents with normal links. I don&#8217;t need to learn how to use that. Or, come to that, the <a href="http://www.loc.gov/marc/">MARC21</a> site. I wonder how much of the fee goes towards setting up and maintaining the RDA Toolkit platform.</p>
<p>With my <a href="http://en.wikipedia.org/wiki/Tin_foil_hat">tin foil hat</a> on, I also wonder how much the fee is needed to resume revenue to the co-publishers since AACR2 has been in unrevised abeyance. <a href="http://www.w3.org/TR/REC-xml/"></a><a href="http://www.ifla.org/en/publications/international-standard-bibliographic-description"></a></p>
<p><strong>Why does it matter?</strong><br />
It matters because RDA (and with it all the high quality traditional cataloguing techniques) will not be widely used without being open. I think you can divide the potential RDA userbase as follows:</p>
<ol>
<li>Libraries with enough money to switch to RDA</li>
<li>Libraries without enough money to switch to RDA</li>
<li>Non-libraries dealing with metadata</li>
</ol>
<p>Those in group 1 will buy RDA, but some libraries- Group 2- will not see the benefit for the costs of conversion and training, let alone the costs of subscription. For &#8216;traditional&#8217; cataloguing to thrive, therefore, we need to involve Group 3. However, those in Group 3 will not be able to even have a look at RDA to see if it meets their needs. I think RDA will be lucky to retain the same user base as AARC2, let alone break into new areas and influence the way other metadata is carried out. Those in the metadata community who, I suspect, have already been put off by AACR2, are unlikely to even try looking at RDA if it involves forking out a subscription.</p>
<p>I recently sat in a room with about 15 or so people mostly involved in metadata for institutional repositories and the like. During some discussions they flagged up two problems they were having: establishing a consistent form of name, and a standard set of data elements. I asked myself, would I recommend RDA to them to help solve these problems? Even if I thought it met their needs, could they even have a look to see if it did? No. They will either come up with their own solution or look elsewhere for it, which is already what they have been doing. I can&#8217;t see us taking more people with us, just a proportion of the people already using AACR2.</p>
<p>Openness also matters because haven&#8217;t a closed standard  doesn&#8217;t reflect terrribly well on librarianship in general. I have a friend in IT who Laughed Out Loud* when I said the new library metadata standard was behind a paywall. In the new world of openness where even Microsoft loosely adhere to web standards, traditionally closed governments are leading the charge to release more data, and the world has been transformed by the the open standards of the web, are we to follow <a href="http://www.thetimes.co.uk">The Times</a> behind a paywall? Personally I feel libraries, librarians, and library data should be at the forefront of openness, not grudgingly following behind or not following at all.</p>
<p><strong>What could be done?</strong><br />
This is the nub of the matter. I&#8217;m no marketing expert and maybe I&#8217;m naive and there <em>is</em> nothing that could be done. However, working on the assumption that all that needs to be done is to break even and pay the costs of production for RDA, I would suggest the following ideas for a start:</p>
<ul>
<li> Make a flashy web product anyway and charge lots more for it. Many more well-off libraries would pay for a product like the Toolkit if it&#8217;s good enough.</li>
<li> There is a need for a more accessible version or versions of RDA, e.g. just for books or in a more convenient format like, say, the Chan books on LCSH or the green editions of AACR2. The co-publishers could fulfill this need which I imagine would be easily done by re-using the data they already have.</li>
<li> Explanatory books. There are a number of these on the market or on the way already. The co-publishers could publish an official companion.</li>
<li>Consultancy and training. There is going to be a big demand for this soon in any case.</li>
<li>Involve more organisations in the drafting and publishing of RDA to share the costs, e.g. publishers, LMS vendors, commercial metadata suppliers, other metadata initiatives. I think it would be a positive and pragmatic move to have these parties on board anyway. They would be more likely to use the high quality standard produced and we would be more likely to be using metadata that meets all our purposes.</li>
</ul>
<p><strong>See Also</strong><br />
I notice a post covering some of these issues by <a href="http://carolslib.wordpress.com/2011/03/21/rda-draft-and-cost/">carolslib</a> from a few days ago. From the Catalogs of Babes also has a similar post, <a href="http://catalogsofbabes.wordpress.com/2010/03/08/rda-why-it-wont-work/">RDA: why it won’t work</a>, from a few weeks ago which much more succinctly makes some of the same points:</p>
<blockquote><p>Many librarians are balking at the cost of implementing RDA, I think rightfully so, although not for the same reasons. I’m not bitching about it because it’s unaffordable for smaller libraries, or because it’s a subscription rather than a one-time printed book cost (although I think those are valid points). I’m bitching because putting a dollar amount on something, now matter how low it is, will stop people from using something, especially if there’s a free alternative. In this case, I see the free alternative as ‘ignoring rules altogether and/or making you your own standards.’ Requiring a price makes adhering to standards–a key value-added service of libraries and librarians–inaccessible. Which is pretty ironic, considering that libraries are supposed to be all about access. We’re all proactive about offering access to our patrons, but we can’t extend that same philosophy to ourselves, to help us do a better job??</p></blockquote>
<p>*** He literally LOL&#8217;d, although no ROFLing took place admittedly.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.aurochs.org/aurlog/2011/03/23/rda-as-a-closed-standard/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>In Our Time booklist</title>
		<link>http://www.aurochs.org/aurlog/2011/02/16/in-our-time-booklist/</link>
		<comments>http://www.aurochs.org/aurlog/2011/02/16/in-our-time-booklist/#comments</comments>
		<pubDate>Wed, 16 Feb 2011 16:17:15 +0000</pubDate>
		<dc:creator>Orangeaurochs</dc:creator>
				<category><![CDATA[cataloguing]]></category>
		<category><![CDATA[arc2]]></category>
		<category><![CDATA[bbc]]></category>
		<category><![CDATA[bibliographica]]></category>
		<category><![CDATA[british national bibliography]]></category>
		<category><![CDATA[in our time]]></category>
		<category><![CDATA[libraries]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[open bibliographic data challenge]]></category>
		<category><![CDATA[open-biblio]]></category>
		<category><![CDATA[php]]></category>
		<category><![CDATA[rdf]]></category>
		<category><![CDATA[regular expressions]]></category>
		<category><![CDATA[sparql]]></category>

		<guid isPermaLink="false">http://www.aurochs.org/aurlog/?p=623</guid>
		<description><![CDATA[<p>I have written a script which takes an unstructured reading list on the BBC&#8217;s In Our Time website, searches the British National Bibliography (BNB) using bibliographica for the books on the list, and returns structured metadata for the records it found.</p>
<p>This script was written in response to an idea raised by psychemedia for the Open [...]]]></description>
			<content:encoded><![CDATA[<p>I have written a <a href="http://www.aurochs.org/inourtime_booklist/inourtime_booklist_v1.php">script</a> which takes an unstructured reading list on the BBC&#8217;s <a href="http://www.bbc.co.uk/radio4/features/in-our-time/">In Our Time</a> website, searches the British National Bibliography (BNB) using <a href="http://bnb.bibliographica.org/">bibliographica</a> for the books on the list, and returns structured metadata for the records it found.</p>
<p>This script was written in response to an idea raised by <a href="http://ideas.okfn.org/users/23/psychemedia/">psychemedia</a> for the <a href="http://openbiblio.net/challenge/">Open Bibliographic Data Challenge</a>: the <a href="http://ideas.okfn.org/ideas/27/bbc-in-our-time-reading-list">BBC &#8220;In Our Time&#8221; Reading List</a>:</p>
<div>
<blockquote><p>The BBC &#8220;In Our Time&#8221; radio programme publishes suggested recommending reading in the programme data in an unstructured and citation style way: author, title (publisher, year), with what looks to be conventional character string separators between references (at least on the pages I looked at).</p>
<p>The idea is to extract and link suggested readings for the In Our time programmes to open, structured bibliographic data. This would make the In Our Time archive even more useful as an informal (open-ish) educational resource, especially as academic libraries start to release data relating to books used on courses. (So for example, this approach might help provide a link from a course to a relevant In Our Time broadcast via a common book.)</p></blockquote>
<p>I was drawn to this idea as I like the idea of turning unstructured data into structured data: I have for example had some previous fun converting HTML pages into RSS feeds (e.g. <a href="http://www.aurochs.org/aurlog/2007/05/16/cilip-lisjobnet-rss-feed/">CILIP Lisjobnet</a>, <a href="http://www.aurochs.org/aurlog/2006/06/01/big-brother-2006-rss-feed-that-actually-works/">Big Brother</a>). I think something similar for any reading list (e.g. a Word document produced by a lecturer) would be an interesting idea.</p>
<p>The programme is written in PHP and is designed to be fired from a <a href="javascript:(function(){window.location='http://www.aurochs.org/inourtime_booklist/inourtime_booklist_v1.php?'+window.location;})();">Javascript bookmarklet</a> from a page on the In Our Time site, or by appending the In Our Time URL to the end of the URL for this page: <a href="../../inourtime_booklist/inourtime_booklist_v1.php?">http://www.aurochs.org/inourtime_booklist/inourtime_booklist_v1.php?</a>. For example, to use it on the page for <a href="http://www.bbc.co.uk/programmes/b00xhz8d">The Mexican Revolution</a> (which I used a lot in testing), add the URL <a href="http://www.bbc.co.uk/programmes/b00xhz8d">http://www.bbc.co.uk/programmes/b00xhz8d</a> to produce <a href="http://www.bbc.co.uk/programmes/b00xhz8d">http://www.aurochs.org/inourtime_booklist/inourtime_booklist_v1.php?http://www.bbc.co.uk/programmes/b00xhz8d</a>.</p>
<p>The script follows the following steps:</p>
<ol>
<li>Set up <a href="http://arc.semsol.org/">ARC2</a> to enable <a href="http://en.wikipedia.org/wiki/SPARQL">SPARQL</a> searching and RDF processing</li>
<li>Extract Further Reading section</li>
<li>Separate out Raw Data for each book</li>
<li>Determine pattern used in citation then extract Basic Data, e.g. author, title, article, publication, using <a href="http://en.wikipedia.org/wiki/Regular_expressions">regular expressions</a></li>
<li>Further refine elements to make searching easier, i.e. one surname for author, only title proper for titles</li>
<li>Construct a SPARQL Query using author surname and title regular expressions pre-filtered for speed by a significant word using bif:contains</li>
<li>Filter Hits by date of publication</li>
<li>Obtain and display metadata from BNB</li>
</ol>
</div>
<div>More details of these steps are below:</div>
<p><strong>1. Set up ARC2 to enable SPARQL searching and RDF processing</strong></p>
<p>ARC2 is a simple-to-use system for using RDF and SPARQL in PHP. I had previously played with it <a href="http://www.aurochs.org/sandy/page/sandy.html">here</a> when experimenting with creating my own RDF. The Sandy site uses SPARQL to populate the See Also sections.</p>
<p><strong>2. Extract Further Reading section</strong></p>
<p>A simple regular expression identifies the div in the HTML code that contains the reading list, which enables the next stage of the script to look for individual books.</p>
<div><strong>3. Separate out Raw Data for each book</strong></div>
<div><strong><br />
</strong></div>
<div>Another regular expression pulls out the paragraphs containing books and puts them in an array. You can see this by viewing the Raw Data.</div>
<div><strong>4. Determine pattern used in citation then extract Basic Data, e.g. author, title, article, publication using regular expressions</strong></div>
<div><strong><br />
</strong></div>
<div>
<div>As the In Our Time site does not use a single standard form of citation, the script has to try and determine which of several possible patterns a citation is using with regular expressions and extract the correct bits of data. This only works as well as it is possible to identify all the patterns, which effectively means looking at as many In Our Time pages as possible. This is one area that would certainly reward more work. It also points out how difficult it would be to extrapolate this into a script  that could read any citations. The In Our Time booklist currently uses five citations each identified with a number (1, 2, 3, 4, 15, 5). If you look at the Book Data for a particular book you will see the citation style number given. The regular expressions capture author, title, and publication.</div>
<div>The author information in citations on In Our Time is  unpredictable. Sometimes surnames are first, sometimes they are last.  The citation patterns take care of this if possible and try to extract one significant name.</div>
<div><strong>5. Further refine elements to make searching easier, i.e. one surname for author, only title proper for titles.</strong></div>
<p>The script removes things like &#8220;(ed.)&#8221; from the author, which would obviously throw off a catalogue search. Subtitles- everything after and including semi-colons- are also removed from titles to lessen any chance of variation and lost matches.</p>
<p><strong>6. Construct a SPARQL Query using author surname and title regular expressions pre-filtered for speed by a significant word using bif:contains</strong></p>
<div>Constructing the SPARQL query was the most tricky part. Ignoring the various standard prefixes pilfered from the standard example, the most important part is the title search. There are three unsatisfactory options:</div>
<ul>
<li>Match the extracted title directly to a dc:title. This doesn&#8217;t work as the cited title is unlikely to be <em>exactly</em> the same in all matters of words included, spacing, punctuation, etc.</li>
<li>Use bif:contains for keyword searching as used in the <a href="http://bnb.bibliographica.org/sparql">BNB SPARQL example</a>. This is certainly quick, but has a number of drawbacks: it can only be used once for a single keyword (any one of the two significant words in The history of Mexico, for example, will produce a huge number of hits). It is also not standard SPARQL. I was happy to overlook this if it worked, but ARC2 didn&#8217;t like it at all until I worked out it has to be used in angle brackets e.g. <em>?title &lt;bif:contains&gt; &#8220;Mexico&#8221;</em>.</li>
<li>Use regular expressions (e.g. <em>FILTER regex (?title, &#8220;The History of Mexico&#8221;, &#8220;i&#8221;)</em>) for keyword searching. This is extremely powerful: you can easily construct searches but it is so slow as to routinely time out, so rendering it effectively useless.</li>
</ul>
<div>The In Our Time script uses a combination of the last two techniques to get a result. First, it finds a significant word in the title, ideally the first four letter word after the first word (i.e. to avoid &#8220;The&#8221;, &#8220;A&#8221;, &#8220;That&#8221;, etc.) or, failing that, the first word. The SPARQL query then uses bif:contains to search for that word. The query then does a regular expression filter on the whole title. I don&#8217;t know if this is how SPARQL endpoints would generally work, but the BNB appears to only look for regular expression matches on the records already filtered by the bif:contains. In any case, it works.</div>
<div>In addition, the script also uses a regular expression to search by the author&#8217;s surname. It doesn&#8217;t search by date as the date of publication on BNB (<em>dc:issued</em>) is not in a standard format (e.g. &#8220;1994-01-01 00:00:00&#8243;, &#8220;2005 printing&#8221;, &#8220;c2006&#8243;). It is also not keyword searchable. You can see all the author-title hits with links to BNB records by viewing Hits.</div>
<div><strong>7. Filter Hits by date of publication</strong></div>
<div><strong><br />
</strong></div>
<div>You can, however, retrieve the date from the BNB and process it afterwards, which is what the script does. It finds the four digit year and compares this to the four digit year it found on the In Our Time site. You can see all the author-title-date hits with links to BNB records by viewing Date Hits. Perhaps rather arbitrarily, the first book in the resulting array is selected as the result.</div>
<p><strong>8. Obtain and display metadata from BNB</strong></p>
</div>
<div>When the search took place, the matching title, author (only one), and date is obtained from BNB. This title and author are displayed, as are the stripped down year of publication, and a link to the full BNB record. For records that returned no hits on the BNB, the Basic Data is simply regurgitated.</div>
<div>The script also downloads the full combined RDF for all the hits is displayed at the bottom of the page, viewable in a several formats.</div>
<div><strong>Further work</strong></div>
<div><strong><br />
</strong></div>
<div>I think a lot more work could be done on this given time, both to improve it and to extend it. In no particular order:</div>
<div>
<ul>
<li>Make it more pretty. It is currently designed to look merely acceptable while I concentrated on functionality. I have also tried to show much of my working, which a finished version would obviously hide.</li>
<li>Refinement of the detection of citation style. This is probably the most critical improvement, and ultimately decides if this approach would be useful outside of In Our Time on other reading lists. There are more patterns that need to be added, especially for older pages.</li>
<li>Further preparation of data for searching. Currently, for example, a book on the Mexico reading list doesn&#8217;t return any hits because of the exclamation mark in &#8220;Zapata!&#8221;. This could be stripped, and there are lots of similar refinements no doubt.</li>
<li>More interesting/useful output. The script&#8217;s outputs are currently quite raw or basic as I concentrated on the mechanisms for pulling information from free text for automatic catalogue searching. It might be useful to output proper standalone RDF files, references in standard reference formats (e.g. Harvard) in HTML or text files, files in standard reference management formats, perhaps even MARC, and so on. Some of these would perhaps be fairly straightforward.</li>
<li>Links to catalogues or online bookshops so you could borrow or buy the books from the reading list based on ISBNs taken from the BNB record.</li>
<li>Searching more catalogues. If a search fails on the BNB, the script could search other open catalogues, e.g. the Cambridge catalogue.</li>
<li>Greasemonkey script or plugin so that a button appears next to the Further Reading section when you view an In Our Time page. This could even appear next to individual books. Ideally (pie in the sky) such a plugin would have a stab at finding books on any web page.</li>
<li>Other ways of firing the script not requiring manual addition to the URL or use of a bookmarklet, e.g. a searchbox of some kind (either accepting a URL as input or keywords for the titles of broadcasts).</li>
</ul>
<p>Please do leave comments or questions.</p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.aurochs.org/aurlog/2011/02/16/in-our-time-booklist/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>ISKO-UK linked data day</title>
		<link>http://www.aurochs.org/aurlog/2010/12/10/isko-uk-linked-data-day/</link>
		<comments>http://www.aurochs.org/aurlog/2010/12/10/isko-uk-linked-data-day/#comments</comments>
		<pubDate>Fri, 10 Dec 2010 14:32:55 +0000</pubDate>
		<dc:creator>Orangeaurochs</dc:creator>
				<category><![CDATA[cataloguing]]></category>
		<category><![CDATA[isko]]></category>
		<category><![CDATA[lcsh]]></category>
		<category><![CDATA[libraries]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[rdf]]></category>
		<category><![CDATA[skos]]></category>

		<guid isPermaLink="false">http://www.aurochs.org/aurlog/?p=612</guid>
		<description><![CDATA[<p>On 14 September I went to the ISKO-UK one day conference on Linked Data: the Future of Knowledge Organisation on the Web.  For me, this followed on from a previous Talis session on Linked Data and Libraries I attended at the British Library in June, which I found really very interesting and informative.</p>
<p>The ISKO conference [...]]]></description>
			<content:encoded><![CDATA[<p>On 14 September I went to the ISKO-UK one day conference on <a href="http://www.iskouk.org/events/linked_data_sep2010.htm">Linked Data: the Future of Knowledge Organisation on the Web</a>.  For me, this followed on from a previous Talis session on <a href="http://blogs.talis.com/nodalities/2010/06/talis-open-day-linked-data-and-libraries.php">Linked Data and Libraries</a> I attended at the British Library in June, which I found really very interesting and informative.</p>
<p>The ISKO conference was a lot broader in scope- it was noticed by several speakers discussing the BBC&#8217;s use of linked data that there were 22 attendees from the BBC- and  included talks about local and national government, business, libraries, as well as the Ordnance Survey. The following is a brief and personal overview, pausing in more detail over aspects that interested me more. It assumes a passing acquaintance with <a href="http://linkeddata.org/">linked data</a> and basic RDF.</p>
<p>Professor Nigel Shadbolt from the University of Southampton, a colleague of Tim Berners-Lee at Southampton as well as in advising the British Government developing the <a href="http://data.gov.uk/">data.gov.uk</a> site, opened with a talk about Government Linked Data: A Tipping Point for the Semantic Web. There were two interesting points from this (there were many, but you know what I mean). First was the speedy and incredible effects of openly releasing government data. Professor Shadbolt used the analogy of the mapping by John Snow of the <a href="http://en.wikipedia.org/wiki/1854_Broad_Street_cholera_outbreak">1854 cholera epidemic</a> which identified a pump as the source and led to the realisation that water carried cholera. He mentioned the release of government bike accident data that was little used by the government but which was taken up and used by coders within days to produce maps of accident hotspots in London and guides to avoiding them.<br />
The second point was the notion of the &#8220;tipping point&#8221; for the semantic web and linked data referred to in the talk&#8217;s title. Several speakers and audience members referred to the similar idea of the &#8220;killer implementation&#8221;, a killer app for the semantic web that would push it into the mainstream. The sheer quantity of data and use it is quickly put to, often beyond the imagination of those who created and initially stored it, was quite compelling. Richard Wallis made a similar point when discussing the relative position of the semantic web compared to the old-fashioned web in the 1990s. He noted that it is now becoming popular to the extent that is nearly impossible to realistically list semantic web sites and predicts that it will explode in the next year or so. Common to both Nigel Shadbolt&#8217;s and Richard Wallis&#8217;s talks was a feeling almost of evangelism: Richard Wallis explicitly refers to himself as a technology evangelist; Nigel Shadbolt referred to open government data as &#8220;a gift&#8221;. Despite being relatively long in the tooth, RDF, linked data, and all that have not yet taken off and both seemed keen to push it: when people see the benefits, it won&#8217;t fail to take off. There were interesting dissenting voices to this. Martin Hepp, who had spent over eight years coming up with the commercial <a href="http://www.heppnetz.de/projects/goodrelations/">GoodRelations</a> ontology, was strongly of the opinion that it is not enough to merely convince people of the social or governmental benefits, but rather the linked data community should demonstrate that it can directly help commerce and make money. The fact that GoodRelations apparently accounts for 16% of all RDF triples in existence and is being used by corporations such as BestBuy and O&#8217;Reilly (IT publishers) seems to point to a different potential tipping point. Interestingly, Andreas Blumauer in a later talk said that SKOS (an RDF schema to be discussed in the next paragraph) could introduce Web 2.0 mechanisms to the &#8216;web of data&#8217;&#8221;. Perhaps, then, SKOS is the killer app for linked data (rather than government data or commercial data as suggested elsewhere), although Andreas Blumauer also agreed with Martin Hepp in saying that “If enterprises are not involved, there is no future for linked data”. In my own ignorant judgement, I would suggest government data is probably a more likely tipping point for linked data, closely followed by Martin Hepp&#8217;s commercial data. It is government data that is making people aware of linked data, and especially open data, in the first place. This is more likely to recruit and enthuse. I think the commercial data will be the one that provides the jobs based on the foregoing: it may change the web more profoundly but in ways fewer people will even notice. I suppose it all depends on how you define tipping points or killer apps, which I don&#8217;t intend to think about for much longer.</p>
<p>The second talk, and the start of a common theme, was about <a href="http://www.w3.org/2004/02/skos/">SKOS</a> and linked data, by Antoine Isaac. This was probably the most relevant talk for librarians and was for me a simple introduction to SKOS, which seems to be an increasingly common acronym. SKOS stands for Simple Knowledge Organisation System and is designed for representing (simply) things like thesauruses* and classification schemes, based around concepts. These concepts have defined properties such as preferred name (&#8220;skos:prefLabel&#8221;), non-preferred term (&#8220;skos:altLabel&#8221;), narrower term (&#8220;skos:narrower&#8221;), broader term (&#8220;skos:broader&#8221;), and related term (&#8220;skos:related&#8221;).  The example I&#8217;ve been aware of for some time is the representation of <a href="http://en.wikipedia.org/wiki/Library_of_Congress_Subject_Headings">Library of Congress Subject Headings</a> (LCSH) in SKOS, where all the SKOS ideas I&#8217;ve just mentioned will be recognisable to a lot of librarians. In the LCSH red books, for example, preferred terms are in bold, non-preferred terms not in bold preceded by UF, and the relationships between concepts is represented by the abbreviations NT, BT, and RT. In SKOS, concepts and labels are more clearly distinct. An example of SKOS using abbreviated linked data might be (stolen and adapted from the <a href="http://www.w3.org/TR/skos-primer/ ">W3C SKOS primer</a>):</p>
<blockquote><p>ex:animals rdf:type skos:Concept;<br />
skos:prefLabel &#8220;animals&#8221;;<br />
skos:altLabel &#8220;creatures&#8221;;<br />
skos:narrower ex:mammals.</p></blockquote>
<p>This means that ex:animals is a SKOS concept; that the preferred term for ex:animals is &#8220;animals&#8221;; a non-preferred term is &#8220;creatures&#8221;; and, that a narrower concept is ex:mammals. In a mock LCSH setting this might look something like this:</p>
<blockquote><p><strong>Animals</strong><br />
UF Creatures<br />
NT Mammals</p></blockquote>
<p>In the LCSH example, however, the distinction between concepts and terms is lost. One aspect of SKOS that Antoine Isaac spent some time on is the idea of equivalent concepts, especially across languages. In RDF you can bind terms to languages using an @ sign, something like this:</p>
<blockquote><p>ex:animals rdf:type skos:Concept;<br />
skos:prefLabel &#8220;animals&#8221;@en;<br />
skos:prefLabel &#8220;animaux&#8221;@fr.</p></blockquote>
<p>However, you can also link concepts more directly using skos:exactMatch, skos:closeMatch, skos:broadMatch, skos:narrowMatch, and relatedMatch to link thesauruses and schemes together. These are admittedly a bit nebulous. He mentioned work that had been done on linking LCSH to the French <a href="http://rameau.bnf.fr/">Rameau</a> and from there to the German subject thesaurus <a href="http://www.d-nb.de/eng/standardisierung/normdateien/swd.htm">SWD</a>. For example:</p>
<p>Go to <a href="http://lcsubjects.org/subjects/sh85005249">http://lcsubjects.org/subjects/sh85005249</a> which is the LCSH linked data page for &#8220;Animals&#8221;. (You can view the raw SKOS RDF using the links at the top right, although sadly not in n3 or turtle format which I have used above). At the bottom of the page there are links to &#8220;similar concepts&#8221; in other vocabularies, in this case Rameau.<br />
Go the the first one, <a href="http://lcsubjects.org/subjects/sh85005249">http://stitch.cs.vu.nl/vocabularies/rameau/ark:/12148/cb119328694</a>, and you see the Rameau linked data page for &#8220;Animaux&#8221;.</p>
<p>In the LCSH RDF you can pick out the following RDF/XML triples:</p>
<blockquote><p>&lt;rdf:Description rdf:about=&#8221;http://lcsubjects.org/subjects/sh85005249#concept&#8221;&gt;<br />
&lt;rdf:type rdf:resource=&#8221;http://www.w3.org/2004/02/skos/core#Concept&#8221;/&gt;<br />
&lt;skos:prefLabel&gt;Animals&lt;/skos:prefLabel&gt;<br />
&lt;skos:altLabel xml:lang=&#8221;en&#8221;&gt;Beasts&lt;/skos:altLabel&gt;<br />
&lt;skos:narrower rdf:resource=&#8221;http://lcsubjects.org/subjects/sh95005559#concept&#8221;/&gt;<br />
&lt;skos:closeMatch rdf:resource=&#8221;http://stitch.cs.vu.nl/vocabularies/rameau/ark:/12148/cb119328694&#8243;/&gt;</p></blockquote>
<p>which is basically saying the same as (clipping the URIs for the sake of clarity):</p>
<blockquote><p>lcsh:sh85005249#concept rdf:type skos:Concept;<br />
skos:prefLabel &#8220;Animals&#8221;@en;<br />
skos:altLabel &#8220;Beasts&#8221;@en;<br />
skos:narrower lcsh:sh95005559#concept;<br />
skos:closeMatch rameau:cb119328694.</p></blockquote>
<p>Not too far from the first example I gave, with the addition of  a mapping to a totally different scheme. Or in mock red book format again but with unrepresentable information missing:</p>
<blockquote><p><strong>Animals</strong><br />
UF Beasts<br />
NT Food animals</p></blockquote>
<p>Oh that some mapping like this were available to link LCSH and <a href="http://www.nlm.nih.gov/mesh/MBrowser.html">MeSH</a>&#8230;!</p>
<p>Several other talks touched on SKOS, such is its impact on knowledge management. Andreas Blumauer talked about it in demonstrating a service provided by <a href="http://www.punkt.at/">punkt. netServices</a>, called <a href="http://poolparty.punkt.at/">PoolParty</a>.** I don&#8217;t want to go into depth about it, but it seemed to offer a very quick and easy way to manage a thesaurus of terms without having to deal directly with SKOS or RDF. During the talk, Andraeas Blumauer briefly showed us an ontology based around breweries, then asked for suggestions for local breweries. Consequently, he added information for <a href="http://www.fullers.co.uk/">Fullers</a> and published it right away. To see linked data actually being created and published (if not hand-crafted) was certainly unusual and refreshing. Most of what I’ve read and seen has talked about converting large amounts of data from other sources, such as MARC records, EAD records, Excel files, Access databases, or Wikipedia. I’ve had a go at hand-coding RDF myself, which I intend to write about if/when I ever get this post finished.</p>
<p>I don’t want to go into detail too much about it***, but another SKOS-related talk was the final one from Bernard Vatant who drew on his experience in a multi-national situation in Europe to promote the need for systems such as SKOS to deal more rigorously with <em>terms</em>, as opposed to <em>concepts</em>. Although SKOS would appear to be about terms, in many ways it is not clear on many matters of context. For instance, using skos:altLabel “Beasts” for the concept of Animals as in the examples given above gives no real idea of what the context of the term is. Here is a theoretical made-up example of some potential altLabels for the concept of Animals which I think makes some of the right points:</p>
<blockquote><p>Animal (a singular)<br />
Beasts (synonym)<br />
Animaux (French term)<br />
Animalia (scientific taxonomic term)</p></blockquote>
<p>These could all be UF or altLabels but using UF or altLabel gives no idea about the relationship between the terms, and why one term is a non-preferred term. He gave another instance of where this might be important in a multinational and multilingual context, where the rather blunt instrument of adding @en or @fr is not enough, when a term is different in Belgian, French, or Canadian varieties of French. This has obvious parallels in English, where we often bemoan the use of American terms in LCSH. Whether embedded in LCSH or as a separate list, it might be possible to better tailor the catalogue for local conditions if non-preferred terms were given some context. Perhaps “Cellular telephones” could be chosen by a computer to be displayed in a US context, but “mobile telephones” could be chosen in a UK context if the context of those terms were known and specified in the thesaurus.</p>
<p>Moving away from SKOS, Andy Powell talked about the <a href="http://dublincore.org/">Dublin Core Metadata Initiative</a> (DCMI). I’ll admit I’ve always been slightly confused as to what the precise purpose of Dublic Core (DC) is and how one should go about using it. Andy Powell’s talk explained a lot of this confusion by detailing how much DC had changed and reshaped itself over the years. To be honest, in many ways I found it surprising how it is still active and relevant given the summary I heard. The most interesting part of his talk for me was his description of the mistakes or limitations of the DCMI caused by its library heritage. Another confession- my notes here are awful- but the most important point that stuck out for me was the library use of a record-centric approach, e.g.:</p>
<ul>
<li> each book needs a self-contained record</li>
<li>this record has all the details of the author, title, etc.</li>
<li>this record is used to ship the record from A to B (e.g. from bibliographic utility to library catalogue),</li>
<li>this record also tracks the provenance of the data within the record, such as within the 040 field: it all moves together as one unit.</li>
</ul>
<p>Contrast this with the sematic web approach where data is carried in triples. A ‘record’, such as an RDF file, might only contain a sameAs triple which relates data about a thing to a data store elsewhere; many triples from multiple sources could be merged together and information about a thing could be enriched or added to. This kind of merging is not particularly easy or encouraged by MARC records (although the RLUK database does something similar and quite tortuously when it deduplicates records). There’s a useful summary of all this at <a href="http://allthingscataloged.wordpress.com/2010/09/22/data-not-records/">all things cataloged</a> which opens thus:</p>
<blockquote><p>Despite recent efforts by libraries to release metadata as linked data, library records are still perceived as monolithic entities by many librarians. In order to open library data up to the web and to other communities, though, records should be seen as collections of chunks of data that can be separated / parsed out and modeled. Granted, the way we catalog at the moment makes us hold on to the idea of a “record” because this is how current systems represent metadata to us both on the back- and front-end. However with a bit of abstraction we can see that a library record is essentially nothing but a set of pieces of data.</p></blockquote>
<p>One problem with the linked data approach though is the issue of provenance which was referred to above as one of the roles the MARC record undertakes (ask OCLC, e.g. <a href="http://community.oclc.org/metalogue/archives/2008/11/notes-on-oclcs-updated-record.html">http://community.oclc.org/metalogue/archives/2008/11/notes-on-oclcs-updated-record.html</a>). If you take a triple out of its original context or host, how can you tell who created the triple? Is it important? Richard Wallis always makes the point that triples are merely statements: like other web content they are not necessarily true at all. Some uneasiness on the trustworthiness or quality of data turned up at various points during the day. I think it is an interesting issue, not that I know what the answer is, especially when current cataloguing practices largely rely on double checking work that has already been done by other institutions because that work cannot really be trusted. There are other issues and possible solutions that are a little outside my comfort zone at the moment, including excellent buzzwords like named graphs and bounded graphs.</p>
<p>Andy Powell also mentioned, among other things:</p>
<ul>
<li> the “broad semantics” or “fuzzy buckets” of DC which derive in large part from the library catalogue card, where, for instance, “title” or “creator” can mean all sorts of imprecise things;</li>
<li> flat world modelling where two records are needed to describe say, a painting and a digital image of the painting. This sounds to me like the kind of thing RDA/FRBR is attempting (awkwardly in my view) to deal with.</li>
<li> the use of strings instead of things, such as describing an author as “Shakespeare, William” rather than <a href="https://docs.google.com/document/edit?id=1PmYjJfBoK7rt11wmyS7qakzNAQHnMnp1m30CdaYMmCk&amp;hl=en&amp;pli=1">&lt;http://www.example/authors/data/williamshakespeare&gt;</a>. This mirrors one of the bizarre features of library catalogues where authorities matching is generally done by matching exact strings of characters rather than identifiers. See <a href="http://kcoyle.blogspot.com/2007/09/name-authority-control-aka-name.html">Karen Coyle</a> for an overview of the headaches involved.</li>
</ul>
<p>There were three other talks which I don’t propose to go into in much detail. I’ve touched on Richard Wallis’s excellent (and enthusiastic) introduction to the whole idea of linked data and RDF, a version of which I found dangerously intriguing at a previous event given by Talis. He talked about, among other things, the use of the BBC in using linked data to power its <a href="http://www.bbc.co.uk/wildlifefinder/">wildlife pages</a> (including drawing content from Wikipedia) and <a href="http://news.bbc.co.uk/sport1/hi/football/world_cup_2010/default.stm">World Cup 2010 site</a>; in fact, how linked data is making the BBC look at the whole way it thinks about data.</p>
<p>His other big message for me was to compare the take-up of the web to where the current take of linked data was in order to suggest that we are on the cusp of something big: see above for my discussion of the tipping point.</p>
<p>* I don&#8217;t like self-conscious classical plurals where I can help it, not that there&#8217;s anything wrong with them as such.<br />
** I can&#8217;t help but find this name a little odd, if not actually quite camp. I expect there&#8217;s some pun or reference I&#8217;m not getting. Anyway. Incidentally, finding information about PoolParty from the punkt. website is not easy, which I find hard to understand given that it is a company wanting to promote its products; and, more specifically, it is a knowledge management (and therefore also information retrieval) company.<br />
*** Partly because I don’t think I could do it justice, partly also because it was the most intellectual talk and took place at the end of the day.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.aurochs.org/aurlog/2010/12/10/isko-uk-linked-data-day/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CILIP (not) on Newsnight</title>
		<link>http://www.aurochs.org/aurlog/2010/06/11/cilip-not-on-newsnight/</link>
		<comments>http://www.aurochs.org/aurlog/2010/06/11/cilip-not-on-newsnight/#comments</comments>
		<pubDate>Fri, 11 Jun 2010 11:28:14 +0000</pubDate>
		<dc:creator>Orangeaurochs</dc:creator>
				<category><![CDATA[cilip]]></category>
		<category><![CDATA[bbc]]></category>
		<category><![CDATA[libraries]]></category>
		<category><![CDATA[media]]></category>
		<category><![CDATA[newsnight]]></category>

		<guid isPermaLink="false">http://www.aurochs.org/aurlog/?p=571</guid>
		<description><![CDATA[<p>I understand that CILIP was actually offered a chance to talk on Newsnight the other day. I&#8217;ve always been of the opinion that CILIP should make more effort to push itself into the media, to publicise and stand-up for the profession. The other day the BBC actually asked CILIP onto a programme, and a significant [...]]]></description>
			<content:encoded><![CDATA[<p>I understand that CILIP was actually offered a chance to talk on Newsnight the other day. I&#8217;ve always been of the opinion that CILIP should make more effort to push itself into the media, to publicise and stand-up for the profession. The other day the BBC actually asked CILIP onto a programme, and a significant and highly influential one at that, and they didn&#8217;t have anyone available. I expect there are real and genuine reasons why there wasn&#8217;t anyone- apparently there were people willing to speak by phone or video link- but I think the fact that CILIP was not prepared for such an occasion, even when libraries were unusually in the news that day, is very symbolic of CILIP&#8217;s attitude and institutional inability to promote the profession and to deal with modern media. This is especially surprising given all the recent soul-searching within CILIP and the recognised urgent need for the library profession to defend itself as it&#8217;s a soft target in any upcoming cuts. Any robust response to the now infamous <a href="http://rd.kpmg.co.uk/docs/KPMG_PFS_June2010.pdf">KPMG report</a> (see p. 19) seems to have consisted of unofficial bloggers (<a href="http://bethaninfoprof.wordpress.com/2010/06/08/could-we-all-be-replaced-by-volunteers/">e.g.</a>) and bloggers from CILIP Update (<a href="http://communities.cilip.org.uk/blogs/update/archive/2010/06/08/alan-downie-s-is-the-voice-of-elitism-but-librarians-have-failed-to-champion-the-underprivileged.aspx">e.g.</a>), all largely preaching to the choir. I hope they were also pushing a press release down the BBC&#8217;s throat, although I haven&#8217;t heard of one.</p>
<p>Update:  I should point out that, according to the <a href="http://thewikiman.org/blog/?p=764">thewikiman</a>,  CILIP apparently asked to be on Newsnight in the first place:</p>
<blockquote><p>I now understand that in fact CILIP contacted Newsnight in the first place – although they ultimately couldn’t make it happen. For me, even though the end result is the same, there’s a huge difference between those two scenarios – in one, CILIP is shirking a fabulous opportunity, but in the other (in actuality) they were tryng to MAKE an opportunity offf their own proactiveness… Which is great. Shame, obviously, that it didn’t work out, and I stand by the idea that hevean and earth should be moved in such a scenario, but the most important thing to take from this new understanding of the events is that it shows CILIP is already moving in the right direction.</p></blockquote>
<p>I have to agree with that.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.aurochs.org/aurlog/2010/06/11/cilip-not-on-newsnight/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Phil Bradley on CILIP in 2020</title>
		<link>http://www.aurochs.org/aurlog/2010/05/20/phil-bradley-on-cilip-in-2020/</link>
		<comments>http://www.aurochs.org/aurlog/2010/05/20/phil-bradley-on-cilip-in-2020/#comments</comments>
		<pubDate>Thu, 20 May 2010 15:28:43 +0000</pubDate>
		<dc:creator>Orangeaurochs</dc:creator>
				<category><![CDATA[cilip]]></category>
		<category><![CDATA[future]]></category>
		<category><![CDATA[libraries]]></category>

		<guid isPermaLink="false">http://www.aurochs.org/aurlog/?p=549</guid>
		<description><![CDATA[<p>Phil Bradley has written a long &#8220;a stream of thought&#8221; post on how he would like to see CILIP looking in ten years&#8217; time. I&#8217;m not sure how much I agree with a lot of it, but it is interesting and very positive nonetheless. What struck me (perhaps because I do agree with them) are [...]]]></description>
			<content:encoded><![CDATA[<p>Phil Bradley has written a long &#8220;<a href="http://philbradley.typepad.com/phil_bradleys_weblog/2010/05/cilip-in-2020.html">a stream of thought</a>&#8221; post on how he would like to see CILIP looking in ten years&#8217; time. I&#8217;m not sure how much I agree with a lot of it, but it is interesting and very positive nonetheless. What struck me (perhaps because I do agree with them) are the following couple of points:</p>
<blockquote><p>I want information professionals to be able to look at what CILIP does and say to their employers – this is what my professional organization is doing – why can’t I do it as well?</p></blockquote>
<p>I don&#8217;t think it is CILIP&#8217;s job to  just lead on technology (later on he gives an example of CILIP having something like an iPad that members could have a chance to play with)  or web design as this is getting beyond the remit of librarianship. Although those areas are vital I think one of CILIP&#8217;s weaknesses in fact is that it is in many respects a vanilla professional institute which, in moving away from some core of specific skills, is leaving us with nothing special to sell. For instance, trying to take ownership of the word Information rather than the word Library is dangerous as there are others, particularly computer scientists, who already own much of that ground, and have broken much of it too. Perhaps this is what Phil Bradley is driving at when he says,</p>
<blockquote><p>I want librarians, backed by the professional body, to be the ones telling the technical staff what they should be doing, not the other way around.</p></blockquote>
<p>However, I think that he talking about the role of librarians within an organisation rather than the acquirement of real technical skills that could increase our ability to adapt and increase our services.</p>
<p>Anyway, I do think it is important that CILIP leads the way as an example to its own community as well as a something to be pointed out to others as Phil Bradley suggests, something it certainly hasn&#8217;t done in the past.</p>
<p>I heartily agree with the following sentiment:</p>
<blockquote><p>I want to see CILIP mentioned in the press and the media every single time there’s reference to a library, for good or ill. CILIP needs to be the organisation that’s pulled onto programmes to talk on behalf of the profession.</p></blockquote>
<p>I think this is a must. Other issue-specific organisations are on the telly or pop up in newspapers quite often. I think if CILIP proactively offered its services and made a hue and cry on an issue, programmes like BBC Breakfast would probably listen. Incidentally, this is one area where I think changing the name from Library Association to CILIP was catastrophic: lots of people outside the profession knew the Library Association and its name is fairly self-explanatory; I don&#8217;t think the same could be said of CILIP, and I expect many people would still be mystified when the acronym is expanded.</p>
<p>In one other point I don&#8217;t think Phil Bradley goes far enough:</p>
<blockquote><p>I want CILIP to continue to run courses, and I want those courses to be held, not just in London, but at your desk, with webinars, conference calling/training and so on. <span> </span>Why should it be necessary for me to come to London in order to sit in and watch a council meeting? Why can’t I do it at my desk?</p></blockquote>
<p>I want those courses to be overhauled and more specific. In particular the monlithic MA needs to be ditched, a series of specific short courses needs to be introduced, and the CILIP courses on offer need to go beyond &#8220;An introduction to&#8230;&#8221;.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.aurochs.org/aurlog/2010/05/20/phil-bradley-on-cilip-in-2020/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

