Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)
Validator & Data Extractor Tool


Download and evaluate XML metadata from OAI-PMH enabled digital libraries.

Example OAI-PMH URL: https://dspace.lib.uom.gr/dspace-oai/request

Download all the records from a digital library. Enter an OAI-PMH URL, select the metadata prefix, and review the results.

REST API

OAIPMH.com is completely FREE for personal or academic use.

Large-scale use of the OAIPMH.com service via its REST API, as well as technical and/or scientific support, is available for a fee.

Please contact the author, Vangelis Banos, to learn more.

About

The OAI-PMH validator was created by Vangelis Banos ( email: vbanos |at| gmail {dot} com ).

The main idea behind this project was to concentrate the knowledge I have accumulated regarding OAI-PMH harvesting and systematize the validation of OAI-PMH interfaces.

The OAI-PMH validator is used primarily in the administration and maintenance of

About OAI-PMH

OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting) is a protocol developed by the Open Archives Initiative. It is used to harvest (or collect) the metadata descriptions of the records in an archive so that services can be built using metadata from many archives. An implementation of OAI-PMH must support representing metadata in Dublin Core, but may also support additional representations.

The protocol is usually referred to simply as the OAI Protocol. See the full article on Wikipedia.
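
To make the harvesting idea concrete, the sketch below shows how an OAI-PMH request URL is typically assembled: a base URL plus a "verb" argument and any further arguments. The six verbs come from the OAI-PMH 2.0 specification; the function name build_request_url is hypothetical and is not part of OAIPMH.com.

```python
from urllib.parse import urlencode

# The six request verbs defined by OAI-PMH 2.0.
OAI_VERBS = {
    "Identify",
    "ListMetadataFormats",
    "ListSets",
    "ListIdentifiers",
    "ListRecords",
    "GetRecord",
}


def build_request_url(base_url, verb, **params):
    """Return an OAI-PMH request URL for the given verb and arguments.

    Hypothetical helper for illustration only.
    """
    if verb not in OAI_VERBS:
        raise ValueError(f"unknown OAI-PMH verb: {verb}")
    query = {"verb": verb}
    # Skip arguments that were passed as None.
    query.update({k: v for k, v in params.items() if v is not None})
    return base_url + "?" + urlencode(query)


# Ask the example repository for all records in Dublin Core (oai_dc).
print(build_request_url(
    "https://dspace.lib.uom.gr/dspace-oai/request",
    "ListRecords",
    metadataPrefix="oai_dc",
))
```

Because every implementation must support Dublin Core, metadataPrefix=oai_dc is the one prefix that should work on any compliant repository.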

What is my OAI-PMH URL?

The OAI-PMH URL of a digital library varies depending on the software it is based on. The table below lists some widely used software with sample URLs.

Software                       Sample URL                   Sample OAI-PMH URL
DSpace up to version 1.4       http://dspace.lib.uom.gr/    http://dspace.lib.uom.gr/dspace-oai/request
DSpace version 1.5 and newer   http://medusa.libver.gr/     http://medusa.libver.gr/oai/request
Open Journal Systems (OJS)     http://dl.acs.org.au/        http://dl.acs.org.au/index.php/index/oai

These URLs are just the defaults and can be changed by the library's administrator.
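
As a rough illustration of the defaults listed above, the sketch below appends each default path to a library's base URL to produce candidate OAI-PMH URLs worth trying. The dictionary keys and the function name candidate_oai_urls are hypothetical, and the paths are only the defaults from the table, so an administrator may have changed them.

```python
# Default OAI-PMH paths taken from the table above (keys are illustrative labels).
DEFAULT_OAI_PATHS = {
    "DSpace up to 1.4": "/dspace-oai/request",
    "DSpace 1.5 and newer": "/oai/request",
    "OJS": "/index.php/index/oai",
}


def candidate_oai_urls(library_url):
    """Return one candidate OAI-PMH URL per known default path."""
    base = library_url.rstrip("/")
    return [base + path for path in DEFAULT_OAI_PATHS.values()]


for url in candidate_oai_urls("http://dspace.lib.uom.gr/"):
    print(url)
```

Each candidate can then be tried with an Identify request; a repository that answers with a valid OAI-PMH response has revealed its URL.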

How does OAIPMH.com work?

OAIPMH.com is a web application. It runs in the background on our servers and presents the final output to the user. There is no need to install software on your computer; everything is done in a web browser.

What kind of validation tests are performed?

Generic Checks
  1. Check HTTP Response code.
  2. Check HTTP Response document content type.
  3. Check XML document file size.
  4. Check HTTP Request & Response Time.
  5. Check XML document compliance against the OAI-PMH XML Schema http://oaipmh.com/files/OAI-PMH.xsd.
Content-specific Checks
  1. Check OAI-PMH Protocol version.
  2. Check Administrator email address.
  3. Check ListSets command for sets.
  4. Check available Metadata formats.
  5. Check Total Records number.
  6. Check Dublin Core metadata in Records.
  7. Check ESE metadata in Records.
  8. Check XML document compliance against the ESE XML Schema http://www.europeana.eu/schemas/ese/ESE-V3.3.xsd
  9. Check various ESE XML elements for common mistakes, such as:
    • Check for invalid europeana:isShownAt URL
    • Check for invalid europeana:isShownBy URL
  10. Check various metadata elements in Records:
    • Empty dc:title
    • Empty or invalid dc:identifier
    • Empty setSpec

I want an extra feature of some kind, what can I do?

Please contact me at vbanos [at] gmail {dot} com. Suggestions for improvements and new features are always welcome.

References in scientific publications

  1. Koulouris, Alexandros, Vangelis Banos, and Emmanouel Garoufallou. "Aggregating metadata for Europeana: the Greek paradigm." Proceedings of the International Conference on Integrated Information (IC-ININFO). 2011.
  2. Garoufallou, Emmanouel, Vangelis Banos, and Alexandros Koulouris. "Solving aggregation problems of Greek cultural and educational repositories in the framework of Europeana." International Journal of Metadata, Semantics and Ontologies 8.2 (2013): 134-144.
  3. Rousidis, Dimitris, et al. "Metadata for Big Data: A preliminary investigation of metadata quality issues in research data repositories." Information Services and Use 34.3 (2014): 279-286.
  4. Ivanović, Lidija, Dragan Ivanović, and Bojana Dimić Surla. "Improving dissemination of human knowledge by exporting data from research information systems."
  5. Georgiadis, Haris, et al. "Ensuring the quality and interoperability of open cultural digital content: System architecture and scalability." Information, Intelligence, Systems and Applications, IISA 2014, The 5th International Conference on. IEEE, 2014.
  6. Rousidis, Dimitris, et al. "Data Quality Issues and Content Analysis for Research Data Repositories: The Case of Dryad." Let’s Put Data to Use: Digital Scholarship for the Next Generation, 18th International Conference on Electronic Publishing, Thessaloniki, Greece. 2014.
  7. Stathopoulou, Ioanna-Ourania, et al. "An Open Cultural Digital Content Infrastructure." Digital Libraries (JCDL), 2014 IEEE/ACM Joint Conference on. IEEE, 2014.
  8. Hirschmann, Barbara. "DOI Registration Manual." (2014).
  9. Calzolari, Nicoletta, Monica Monachini, and Valeria Quochi. "Interoperability framework: The FLaReNet action plan proposal." Language Resources, Technology and Services in the Sharing Paradigm (2011): 41.
  10. Birello, Giancarlo, et al. "Step by step installation guide of a digital preservation infrastructure." (2012).
  11. Antonius Rachmat, C. "Analisis Rancang Bangun Sistem Repositori Institusi Berbasis Metadata Dublin Core di UKDW Yogyakarta."
  12. Houssos, Nikos, et al. "Enhanced OAI-PMH services for metadata sharing in heterogeneous environments." Library Review 63.6/7 (2014): 465-489.
  13. Kapidakis, Sarantos. "Comparing metadata quality in the Europeana context." Proceedings of the 5th International Conference on PErvasive Technologies Related to Assistive Environments. ACM, 2012.
  14. Calzolari, Nicoletta, et al. "Final FLaReNet Deliverable Language Resources for the Future–The Future of Language Resources." The Strategic Language Resource Agenda. FLaReNet project (2011).

Features include:

  • View, print or download the output of all OAI-PMH supported commands.
  • Detect problems with metadata records (e.g. invalid URLs, empty titles, invalid date formats).
  • Download all records from one or more digital libraries in parallel.
  • Check compliance with OAI-PMH, Dublin Core (DC), Europeana Semantic Elements (ESE) and other standards.
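
Downloading all records from a repository generally means following the protocol's resumptionToken mechanism: each ListRecords response may carry a token that is sent back to fetch the next page, until the token is absent or empty. The sketch below is a minimal illustration under that assumption, not the service's implementation. The names harvest, http_get and fetch_page are hypothetical, and OAI-PMH error responses are not handled.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode
from urllib.request import urlopen

OAI = "{http://www.openarchives.org/OAI/2.0/}"


def http_get(url):
    """Fetch a URL and return the response body as bytes."""
    with urlopen(url, timeout=30) as resp:
        return resp.read()


def harvest(base_url, metadata_prefix="oai_dc", fetch_page=http_get):
    """Yield every <record> element, following resumption tokens page by page."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    while True:
        root = ET.fromstring(fetch_page(base_url + "?" + urlencode(params)))
        for record in root.iter(OAI + "record"):
            yield record
        token = root.find(f"{OAI}ListRecords/{OAI}resumptionToken")
        if token is None or not (token.text or "").strip():
            break  # no token, or an empty one, marks the last page
        # A resumptionToken is an exclusive argument: send it with the verb only.
        params = {"verb": "ListRecords", "resumptionToken": token.text.strip()}


# Example use (performs real HTTP requests, so it is left commented out):
# for record in harvest("https://dspace.lib.uom.gr/dspace-oai/request"):
#     print(record.find(OAI + "header/" + OAI + "identifier").text)
```

Passing the page fetcher as a parameter keeps the paging logic testable without network access, and harvesting several libraries in parallel could then be layered on top, for example with one harvest call per library in a thread pool.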

12613 digital libraries have already been tested with OAIPMH.com.

Created by Vangelis Banos, © 2011 - 2015