This is a brief documentation of how I used Marcedit to import correct URLs from an Excel spreadsheet into a large file of MARC records. The name of the ebook supplier has been changed to protect the innocent. The values below worked for me on the Excel spreadsheet I used.
Problem. Ebook supplier (EBS) supplies MARC records of generally good quality for a package of 600 ebooks. However, the URLs are inconsistent: there are between one and four in each record; several ebook supplies are represented, not just EBS; many of the DOIs for EBS- the only URLs that are consistent- do not work. We do have an Excel spreadsheet listing OCLC numbers and valid URLs for all titles.
General plan. To delete all the 856 fields in the MARC file and replace them with those from the spreadsheet. To do this, convert the relevant bits of the spreadsheet to a simple MARC file and merge the two using Marcedit.
Delete the URLs from the original file
Load/convert the original file as an .mrk file. Use the Tools>Add/Delete Field option to delete all the 856 fields in the original file.
Convert the spreadsheet to MARC.
- In Marcedit (version 6), select Export Tab Delimited Text.
- Choose the spreadsheet for the Source File
- Choose a filename for the Marc text (.mrk ) file to be created
- Specify the name of the sheet for an Excel file (e.g. in my case EBS)
- Choose the delimiter that separates the data (in my case I left this alone as Tab. It worked)
- Choose options (I left the LDR/008 and character encoding alone as I don’t think they mattered)
- Next. The data snapshot shows the columns numbered Fields 0 to whatever. I needed columns A (OCLC number) and P (URL), so this meant Fields 0 and 15. The fields to select and how they work is done by using the Settings section to create Arguments. For this, I needed two arguments, one for each field:
- First Argument (OCLC control number to go into the 001 field): Select = ”Field 0”; Map to = “001” ; Indicators = “\\” ; Term. punctuation = “” ; Constant Data & Repeatable Subfield = “”
- Add Argument when done
- Second Argument (URL to go into the 856 field): Select = “Field 15”; Map to = “856$u” ; Indicator =”40” ; Term. punctuation = “” ; Constant Data & Repeatable Subfield = “”
- Add Argument when done
- Finish. This disconcertingly takes you back to the previous screen but if you open up the .mrk file in the MarcEditor it should be all done. Each record will look something like this:
=LDR 00000nam 2200000Ia 45e0
Edit the new .mrk
As the OCLC numbers in the original MARC records were in the form “ocn123456789” (rather than simply “123456789”), I needed to do a find for “=001 “ and replace it with “=001 ocn” on the new file, then save it.
- From the Tools menu of Marcedit, select Merge Records
- Choose the .mrk of the original MARC records as the Source File (I don’t know if the .mrc would work too)
- Choose the newly created .mrk file as the Merge File
- Choose a filename for the newly merged file to be created
- Leave Record identifier as 001. If you were searching on the ISBN, presumably the 020 would work but haven’t tried it. Other options are 010, 020, 022, and 035, and MARC21 (?)
- Select the Merge Selected Field option
- Specify the 856 and move it to the Merge Fields box
- “Merge Completed”
Install the bookmarklet by dragging the link to your bookmarks toolbar:
Or, create a bookmark manually, and change the Location property to the following:
To use it, go to a page which has an element with an id of “isbn” then click on the bookmark.
You can edit the bookmark if the id is called something else (change the bit in brackets and quotes from isbn to something else) or you want to search on another index (change q=isbn to q=somethingelse).
RLUK and the European Library (of which the RLUK is now a member) have just released 17 million records as linked open data. They have released three sets (via Mike Mertens), for which links to the RDF turtle versions are below:
I’ve tried to have a quick look at the last just to get an idea and I’ve isolated what I think is all the data for one book, chosen at random. The whole block of turtle prefixes from the start of the file are included:
@prefix rdaa: <http://rdaregistry.info/Elements/a/> .
@prefix rdac: <http://rdaregistry.info/Elements/c/> .
@prefix rdae: <http://rdaregistry.info/Elements/e/> .
@prefix rdam: <http://rdaregistry.info/Elements/m/> .
@prefix rdaw: <http://rdaregistry.info/Elements/w/> .
@prefix rdau: <http://rdaregistry.info/Elements/u/> .
@prefix dcterms: <http://purl.org/dc/terms/> .
@prefix edm: <http://www.europeana.eu/schemas/edm/> .
@prefix foaf: <http://xmlns.com/foaf/0.1/> .
@prefix frbrer: <http://iflastandards.info/ns/fr/frbr/frbrer/> .
@prefix ore: <http://www.openarchives.org/ore/terms/> .
@prefix owl: <http://www.w3.org/2002/07/owl#> .
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix skos: <http://www.w3.org/2004/02/skos/core#> .
@prefix wgs84pos: <http://www.w3.org/2003/01/geo/wgs84_pos#> .
<http://data.theeuropeanlibrary.org/BibliographicResource/3000084490807> a dcterms:BibliographicResource ;
rdam:P30004 "local identifier: http://data.copac.ac.uk/iid/65204626" ;
rdau:P60049 <http://rdvocab.info/termList/RDAContentType/1020> ;
rdam:P30003 "single unit"^^<http://rdvocab.info/termList/ModeIssue> ;
rdau:P60520 "Unkown"@en ;
rdam:P30004 "isbn: 0198750315" ;
rdam:P30156 "The philosophy of history" ;
rdau:P60339 "edited by Patrick Gardiner." ;
rdam:P30157 "Oxford readings in philosophy" ;
rdau:P60398 _:node18kdvnimbx4386 .
_:node18kdvnimbx4386 a rdac:C10004 ;
rdaa:P50111 "Patrick L. Gardiner" ;
rdaa:P50121 "1922" .
<http://data.theeuropeanlibrary.org/BibliographicResource/3000084490807> rdau:P60073 "1974" ;
rdau:P60099 <http://id.loc.gov/vocabulary/iso639-2/eng> ;
rdau:P60163 _:node18kdvnimbx4387 .
_:node18kdvnimbx4387 rdau:P60366 "Oxford University Press" .
<http://data.theeuropeanlibrary.org/BibliographicResource/3000084490807> rdau:P60444 _:node18kdvnimbx4388 .
_:node18kdvnimbx4388 a rdac:C10005 ;
rdaa:P50032 "London" .
<http://data.theeuropeanlibrary.org/BibliographicResource/3000084490807> rdau:P60163 <http://id.loc.gov/vocabulary/countries/uk> ;
dcterms:subject _:node18kdvnimbx4389 .
_:node18kdvnimbx4389 a frbrer:C1007 ;
rdfs:label "History, Philosophy." ;
dcterms:hasPart _:node18kdvnimbx4390 .
_:node18kdvnimbx4390 a frbrer:C1007 ;
rdfs:label "History" .
<http://data.theeuropeanlibrary.org/BibliographicResource/3000084490807> dcterms:extent "224 p. ;" , "21 cm." ;
rdau:P60470 "Includes index." ;
dcterms:description "Bibliography: p. -222." .
Some initial observations:
- Lots of RDA! And no ISBD.
- Various vocabularies used, if not all in this example, including: RDA, FRBR, DCTerms, Europeana Data Model (EDM), FOAF, OAI-ORE, OWL, SKOS, and WGS84 geodata.
- No Bibframe.
- Authors are given with RDA elements (as above) rather than headings as such and with VIAF links (although not in this example: see below for a snippet from another book).
A short snippet from another book showing a blank node asserted as being the same as a VIAF entity, having a relationship with a work using RDA, and the detailed RDA data elements for the name:
_:node18kdvnimbx245 owl:sameAs <http://viaf.org/viaf/17463572/> .
<http://data.theeuropeanlibrary.org/BibliographicResource/3000087185802> rdau:P60398 _:node18kdvnimbx245 .
_:node18kdvnimbx245 a rdac:C10004 ;
rdaa:P50111 "Niccolo Pagliarini" ;
rdaa:P50121 "1717" ;
rdaa:P50120 "1795" .
Last I managed to see six species of bumblebee in Sandy* and another one further afield in Bedfordshire**. This year I managed to spot eight in Sandy, all but one in the garden. I’m hoping to have lots more wild flowers in the garden this year so hope to attract the bees to go with them.
I’ve submitted all the following as records to Beewatch, which is also very useful in getting confirmation of IDs.
The garden bumblebee, seen in the garden. This is the first one of these I’ve seen, despite them being very common.
The tree bumblebee. We in fact had two nests in our house and garden. The bee above is coming out of one they made in an old bird box in the garden. The birds never used it but these bees did. We had a second nest in the roof too.
Bombus lapidarius (worker)
Bombus lapidarius (male)
The red-tailed bumblebee. Both in the garden.
The early bumblebee, one of the smaller species. In the garden. They seem to like flatter flower heads, like on this senetti.
Bombus lucrorum (worker)
Bombus lucorum (male)
White-tailed bumblebee. The queens and workers look practically identical to the buff-tailed bumblebee (B. terrestris) although the males are very much more yellow and quite striking. In the garden.
Common carder bee. In the garden.
Bombus terrestris (queen)
Buff-tailed bumblebee. The buff tail is more obvious in the queen especially just forward of the tail.
The southern cuckoo bumblebee. It takes over nests of B. terrestris. Sadly not seen in the garden but the bee on the bramble flower was on a piece of waste ground next to a path a stone’s throw away. The one on clover was near the railway station.
* Bombus vestalis?, B. pratorum, B. terrestris, B. rupestris, B. hypnorum, B. pascuorum
** B. campestris?
Below is described a way to add tables of contents to RDA Toolkit workflows automatically, i.e. without manually adding anchors and creating a list. You can see an example of it action on this workflow (although of course I can’t guarantee that this workflow will always be around or look like this).
Instructions follow and some caveats are below.
- Open an RDA Toolkit workflow for editing
- Click on Source
- Insert the following snippet of HTML where you want the table of contents to appear:
<div class="generate_from_h2" id="generated-toc"><a name="top"></a></div>
- If you have access to a local web server:
- At the very end of the workflow, put the following HTML snippet, changing the URL to where your copy of generate_toc_rda.js now lives:
- If you don’t have access to a local server:
- At the very end of the workflow, put the following HTML snippet:
- On another line underneath, i.e. right at the end, put the following snippet of HTML:
- Save the workflow.
- Click on the workflow in the Toolkit to refresh it.
- Buy Stuart some beer next time you see him, e.g. some gueuze, or give him some custom.
Please do let me know if you try this and how you get on. I might be amenable to making changes to it, time and circumstances allowing. Stuart released the original toc script “under an X11 licence. What this boils down to is: do what you like with it. You can use the script in commercial environments, you can use it on your intranet, you can use it anywhere you like.” Sounds good to me too.
This year I have a bee hotel attached to the house which provides somewhere for solitary bees to nest. The most commonly seen garden solitary bee is the red mason bee (Osmia bicornis) which finds cavities, such as in walls or canes in which to lay its eggs. It adds some pollen for food and seals up the cell with mud.
Yesterday I managed to catch a female red mason bee sealing off one of the tubes in the nest box and took some photos. They are not amazing photos, my main excuses being failing light and rubbish photography skills; I was also watering the garden and getting some washing in at the same time, etc etc. A selection of photos in chronological order are below, with the time taken in the caption.
In the first photo, you can see the female deep inside the tube.
Twenty minutes later the start of a wall is in place and the bee has gone off to get some more mud.
A few minutes later and she’s back constructing the wall.
Again she’s off, and there is now a complete ring.
Three minutes later and the ring is closing in.
The female is back with more mud.
Now the hole is clearly too small for her to get in or out.
…and the hole is sealed!
The bee continues to add more mud for a better seal.
The last photo is from over an hour later and shows the cap jutting out from the wall, not entirely neatly.
There are now six holes filled up in the bee hotel. I understand that each of the holes will have a number of cells, one in front of the other. Next spring, the small males, whose eggs are laid near the front, will hatch first, followed shortly by the larger females. There are hundreds of sorts of solitary bee in the UK, although only the larger ones will use the bee hotel. I saw some leaf-cutter bees, such as the one below, last year.
Leaf-cutter bee cutting a leaf
They use mashed-up leaf pulp instead of mud to do much the same thing, so I hope they might pop by too.
BIBFRAME has worked on modelling works as Works within the BIBFRAME model, similar to the RDA modelling work, itself modelled on the work on the FRBR model of Works and Expressions. A BIBFRAME Work is a creative work, perhaps a FRBR Work, or an RDA FRBR Work but it also expresses a FRBR Expression, and of course an RDA FRBR Expression. A Work may express another Work based on others’ work, not just a FRBR Work or an RDA Work. That also works. FRBR Works or RDA Works expressed as BIBFRAME Works can relate to FRBR Expressions (BIBFRAME Works or RDA Expressions). So, Works are works that can be Works but also Expressions linked to Works that really are Works.