XQuery & IRs

Institutional Repository (IR)

Open Access online collection of the university community's digital content. Aim at archiving faculty scholarship in an open access fashion as well as student work and community publications (e.g. newsletters, newspapers/magazines, podcasts, and other media).
DiscoverArchive, Vanderbilt University's IR.
Open Access

DSpace

Open Source IR solution
all data managed by SQL db (postgres
spits out XML here and there
- OAI-PMH - expose metadata records
- XMLUI - interface based on XML views transformed to HTML via XSLT

IR Workflow

identify faculty scholarship (journal articles)
collect bib records (normalize? metadata)
- MS access; Zotero
get permissions from copyright holder to deposit in IR
- agree to terms in IR license
- which version can we use - PrePrint, PostPrint, PublishedPDF
collect and prepare content (articles)
submit content we have permissions for to IR
catalogers review, edit, and accept submissions
mint handle (persistent URI to item page in IR)
add handle and add bitstream URL to bib record
- query IR metadata (SOLR?, OAI-PMH), extract this data, add to bib records

Tracking Permissions

track individual publications/articles
- export MODS records from Zotero
  - example: NewClearedList-MODS.xml
add permissions data to bib records (MODS+Permissions)
- create namespace and element set to express permissions data if there are no suitable MODS elements
Permissions data elements
- Publisher
- Journal Title
- Clearance - (Y/N)
- Version (P,P,orP)
- Embargo
- Blanket Permission (Y/N)
- Conditions
store bib records in eXist-db
use database to track permissions for individual items
use XQuery to query collection
- view articles cleared for deposit
- other administrative views

Bibliography Service

artifact of IR workflow is collection of faculty bibliographies (see bib records above)
use XQuery to transform bib records
- what form do I need to put the bibliographies on the web?
- library website
- Vanderbilt faculty website
- give to faculty for use on personal site
- what else?
transform to document - PDF?

IR Administration

produce metadata for ingest from bib records
- csv for batch import
pull XML data about existing content items in IR
- OAI-PMH
  - LawFacultyOAIMets_2014-02-19.xml
  - ReturnMods.xquery
- scrape repository pages (download stats)
  - download count example - ExtractDownloadCount.xquery
- SOLR?
use XQuery to compile information about them
- usage report example
metadata quality
parse data from existing items and add to bib records
can I do what I need to with the IR's XML data, or do I still need to access data from postgres occasionaly? -or- even with the NoSQL toolbox, I still need SQL

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Div-MODS.xquery		Div-MODS.xquery
ExtractDownloadCount.xquery		ExtractDownloadCount.xquery
LawFacultyOAIMets_2014-02-19.xml		LawFacultyOAIMets_2014-02-19.xml
MODS-CSV.xquery		MODS-CSV.xquery
MODS-JournalArticles-2014-05-21.xml		MODS-JournalArticles-2014-05-21.xml
README.md		README.md
ReturnMODS.xquery		ReturnMODS.xquery

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

XQuery & IRs

Institutional Repository (IR)

DSpace

IR Workflow

Tracking Permissions

Bibliography Service

IR Administration

About

Releases

Packages

Languages

EdWarga/IRandXQuery

Folders and files

Latest commit

History

Repository files navigation

XQuery & IRs

Institutional Repository (IR)

DSpace

IR Workflow

Tracking Permissions

Bibliography Service

IR Administration

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages