-
Open Access online collection of the university community's digital content. Aim at archiving faculty scholarship in an open access fashion as well as student work and community publications (e.g. newsletters, newspapers/magazines, podcasts, and other media).
-
DiscoverArchive, Vanderbilt University's IR.
-
Open Access
- Open Source IR solution
- all data managed by SQL db (postgres
- spits out XML here and there
- OAI-PMH - expose metadata records
- XMLUI - interface based on XML views transformed to HTML via XSLT
- identify faculty scholarship (journal articles)
- collect bib records (normalize? metadata)
- MS access; Zotero
- get permissions from copyright holder to deposit in IR
- agree to terms in IR license
- which version can we use - PrePrint, PostPrint, PublishedPDF
- collect and prepare content (articles)
- submit content we have permissions for to IR
- catalogers review, edit, and accept submissions
- mint handle (persistent URI to item page in IR)
- add handle and add bitstream URL to bib record
- query IR metadata (SOLR?, OAI-PMH), extract this data, add to bib records
- track individual publications/articles
- export MODS records from Zotero
- example: NewClearedList-MODS.xml
- export MODS records from Zotero
- add permissions data to bib records (MODS+Permissions)
- create namespace and element set to express permissions data if there are no suitable MODS elements
- Permissions data elements
- Publisher
- Journal Title
- Clearance - (Y/N)
- Version (P,P,orP)
- Embargo
- Blanket Permission (Y/N)
- Conditions
- store bib records in eXist-db
- use database to track permissions for individual items
- use XQuery to query collection
- view articles cleared for deposit
- other administrative views
- artifact of IR workflow is collection of faculty bibliographies (see bib records above)
- use XQuery to transform bib records
- what form do I need to put the bibliographies on the web?
- library website
- Vanderbilt faculty website
- give to faculty for use on personal site
- what else?
- transform to document - PDF?
- produce metadata for ingest from bib records
- csv for batch import
- pull XML data about existing content items in IR
- OAI-PMH
- LawFacultyOAIMets_2014-02-19.xml
- ReturnMods.xquery
- scrape repository pages (download stats)
- download count example - ExtractDownloadCount.xquery
- SOLR?
- OAI-PMH
- use XQuery to compile information about them
- usage report example
- metadata quality
- parse data from existing items and add to bib records
- can I do what I need to with the IR's XML data, or do I still need to access data from postgres occasionaly? -or- even with the NoSQL toolbox, I still need SQL