Jump to content
 







Main menu
   


Navigation  



Main page
Contents
Current events
Random article
About Wikipedia
Contact us
Donate
 




Contribute  



Help
Learn to edit
Community portal
Recent changes
Upload file
 








Search  

































Create account

Log in
 









Create account
 Log in
 




Pages for logged out editors learn more  



Contributions
Talk
 



















Contents

   



(Top)
 


1 History  





2 OAI workshops  





3 Uses  





4 Software  





5 Archives  





6 See also  





7 References  





8 External links  














Open Archives Initiative Protocol for Metadata Harvesting







Català
Čeština
Deutsch
Ελληνικά
Español
Français
Italiano

Nederlands

Polski
Português
ி
Türkçe
 

Edit links
 









Article
Talk
 

















Read
Edit
View history
 








Tools
   


Actions  



Read
Edit
View history
 




General  



What links here
Related changes
Upload file
Special pages
Permanent link
Page information
Cite this page
Get shortened URL
Download QR code
Wikidata item
 




Print/export  



Download as PDF
Printable version
 
















Appearance
   

 






From Wikipedia, the free encyclopedia
 


The Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) is a protocol developed for harvesting metadata descriptions of records in an archive so that services can be built using metadata from many archives. An implementation of OAI-PMH must support representing metadata in Dublin Core, but may also support additional representations.[1][2]

The protocol is usually just referred to as the OAI Protocol.

OAI-PMH uses XML over HTTP. Version 2.0 of the protocol was released in 2002; the document was last updated in 2015. It has a Creative Commons license BY-SA.

History[edit]

In the late 1990s, Herbert Van de Sompel (Ghent University) was working with researchers and librarians at Los Alamos National Laboratory (US) and called a meeting to address difficulties related to interoperability issues of e-print servers and digital repositories. The meeting was held in Santa Fe, New Mexico, in October 1999.[3] A key development from the meeting was the definition of an interface that permitted e-print servers to expose metadata for the papers it held in a structured fashion so other repositories could identify and copy papers of interest with each other. This interface/protocol was named the "Santa Fe Convention".[1][2][4]

Several workshops were held in 2000 at the ACM Digital Libraries conference,[5] at the 1st ACM/IEEE-CS joint conference on Digital libraries[6][7] and elsewhere to share the ideas from the Santa Fe Convention.[8] It was discovered at the workshops that the problems faced by the e-print community were also shared by libraries, museums, journal publishers, and others who needed to share distributed resources. To address these needs, the Coalition for Networked Information[9] and the Digital Library Federation[10] provided funding to establish an Open Archives Initiative (OAI) secretariat managed by Herbert Van de Sompel and Carl Lagoze. The OAI held a meeting at Cornell University (Ithaca, New York) in September 2000 aimed to improve the interface developed at the Santa Fe Convention.[11] The specifications were refined over e-mail.

OAI-PMH version 1.0 was introduced to the public in January 2001 at a workshop in Washington D.C.,[12] and another in February in Berlin, Germany.[13] Subsequent modifications to the XML standard by the W3C required making minor modifications to OAI-PMH resulting in version 1.1. The current version, 2.0, was released in June 2002. It contained several technical changes and enhancements and is not backward compatible.[14]

OAI workshops[edit]

From 2001 CERN, and later in collaboration with University of Geneva, has organized bi-annual OAI workshops,[15] which over time have developed to cover most aspects of open science. Since 2021 the workshop series is named the Geneva Workshop on Innovations in Scholarly Communication, with the nick name OAI reflecting its origin.[16]

Uses[edit]

Some commercial search engines use OAI-PMH to acquire more resources. Google initially included support for OAI-PMH when launching sitemaps, however decided to support only the standard XML Sitemaps format in May 2008.[17] In 2004, Yahoo! acquired content from OAIster (University of Michigan) that was obtained through metadata harvesting with OAI-PMH. Wikimedia uses an OAI-PMH repository to provide feeds of Wikipedia and related site updates for search engines and other bulk analysis/republishing endeavors.[18] Especially when dealing with thousands of files being harvested every day, OAI-PMH can help in reducing the network traffic and other resource usage by doing incremental harvesting.[19] NASA's Mercury metadata search system uses OAI-PMH to index thousands of metadata records from Global Change Master Directory (GCMD) every day.[20]

The mod_oai project is using OAI-PMH to expose content to web crawlers that is accessible from Apache Web servers.

OAI-PMH has later been applied to sharing of scientific data.[21]

Software[edit]

OAI-PMH is based on a client–server architecture, in which "harvesters" request information on updated records from "repositories". Requests for data can be based on a datestamp range, and can be restricted to named sets defined by the provider. Data providers are required to provide XML metadata in Dublin Core format, and may also provide it in other XML formats.

A number of software systems support the OAI-PMH, including Fedora, EThOS from the British Library, GNU EPrints from the University of Southampton, Open Journal Systems from the Public Knowledge Project, Desire2Learn, DSpace from MIT, HyperJournal from the University of Pisa, Digibib from Digibis, MyCoRe, Koha, Primo, DigiTool, Rosetta and MetaLib from Ex Libris, ArchivalWare from PTFS, DOOR [22] from the eLab[23] in Lugano, Switzerland, panFMP from the PANGAEA data library,[24] SimpleDL from Roaring Development, and jOAI from the National Center for Atmospheric Research.[25]

Archives[edit]

A number of large archives support the protocol including arXiv and the CERN Document Server.

See also[edit]

References[edit]

  1. ^ a b Lynch, Clifford A. (August 2001). "Metadata harvesting and the Open Archives Initiative". ARL: A Bimonthly Report (217). Archived from the original (PDF) on 25 May 2012.{{cite journal}}: CS1 maint: date and year (link)
  • ^ a b Marshall Breeding (September 2002). "Understanding the Protocol for Metadata Harvesting of the Open Archives Initiative". Computers in Libraries. 22 (8): 24–29. Retrieved 2021-02-08.
  • ^ Marshall, E. (1999). "Researchers plan free global preprint archive". Science. 286 (5441): 887a–887. doi:10.1126/science.286.5441.887a. PMID 10577235. S2CID 178990556.
  • ^ "The Santa Fe Convention by the Open Archives Initiative". Open Archives Initiative. February 15, 2000. Retrieved May 29, 2022.
  • ^ "The Santa Fe Convention of the Open Archives Initiative". dspace.library.uu.nl. Retrieved 2021-02-10.
  • ^ Edward A. Fox; Christine L. Borgman, eds. (2001). Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries. Roanoke, Virginia, United States: ACM Press. doi:10.1145/379437. ISBN 978-1-58113-345-5.{{cite book}}: CS1 maint: date and year (link)
  • ^ Lagoze, Carl; Van de Sompel, Herbert (2001). "The open archives initiative". Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries. Roanoke, Virginia, United States: ACM Press. pp. 54–62. CiteSeerX 10.1.1.161.6800. doi:10.1145/379437.379449. ISBN 978-1-58113-345-5. S2CID 1315824.{{cite book}}: CS1 maint: date and year (link)
  • ^ Van de Sompel, Herbert; Lagoze, Carl (2000). "The Santa Fe Convention of the Open Archives Initiative". D-Lib Magazine. 6 (2). doi:10.1045/february2000-vandesompel-oai. ISSN 1082-9873.
  • ^ "Homepage". Coalition for Networked Information. Retrieved May 29, 2022.
  • ^ "Homepage". Digital Library Federation. Retrieved May 29, 2022.
  • ^ "OAi-tech Meeting, Cornell University, September 7-8 2000". www.openarchives.org. Retrieved 2021-02-10.
  • ^ "The Open Archives Initiative: Open Meeting Renaissance Hotel, Washington DC January 23, 2001". www.openarchives.org. Retrieved 2021-02-10.
  • ^ "The Open Archives Initiative: Open Meeting Staatsbibliothek zu Berlin, Germany February 26, 2001". www.openarchives.org. Retrieved 2021-02-10.
  • ^ Van de Sompel, Herbert; Young, Jeffrey A.; Hickey, Thomas B. (2003). "Using the OAI-PMH ... Differently". D-Lib Magazine. 9 (7/8). doi:10.1045/july2003-young. ISSN 1082-9873.
  • ^ "Previous OAI Workshops – OAI". The Geneva Workshop on Innovations in Scholarly Communication. Retrieved 2023-01-13.
  • ^ Azwa, Adnan Siti Norfateha. "Library Guide: Open Access Guide: The Latest on OA". umlibguides.um.edu.my. Retrieved 2023-01-13.
  • ^ "Retiring Support for OAI-PMH in Sitemaps". Google Search Central Blog. April 23, 2008. Retrieved May 29, 2022.
  • ^ "Wikimedia update feed service". Wikimedia Meta-Wiki. Retrieved 14 July 2013.
  • ^ "OAI Harvesting System". DLXS. Retrieved May 29, 2022.
  • ^ R. Devarakonda; G. Palanisamy; J. Green; B. Wilson (2010). "Data sharing and retrieval uses OAI-PMH". Earth Science Informatics. 4 (1). Springer Berlin / Heidelberg: 1–5. doi:10.1007/s12145-010-0073-0. S2CID 46330319.
  • ^ Devarakonda, Ranjeet; Palanisamy, Giri; Green, James M.; Wilson, Bruce E. (2011). "Data sharing and retrieval using OAI-PMH". Earth Science Informatics. 4 (1): 1–5. doi:10.1007/s12145-010-0073-0. ISSN 1865-0473. S2CID 46330319.
  • ^ "Overview". DOOR. Retrieved May 29, 2022.
  • ^ "eLab". Universita della Svizzera italiana (in Italian). Retrieved May 29, 2022.
  • ^ "PANGAEA® Framework for Metadata Portals". panfmp.org.
  • ^ "NCAR/joai-project". Github.com. 31 May 2022.

  • External links[edit]


    Retrieved from "https://en.wikipedia.org/w/index.php?title=Open_Archives_Initiative_Protocol_for_Metadata_Harvesting&oldid=1220690143"

    Categories: 
    Online archives
    Internet protocols
    Metadata
    Open access projects
    Archival science
    Hidden categories: 
    CS1 maint: date and year
    CS1 Italian-language sources (it)
    Articles with short description
    Short description matches Wikidata
     



    This page was last edited on 25 April 2024, at 09:29 (UTC).

    Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.



    Privacy policy

    About Wikipedia

    Disclaimers

    Contact Wikipedia

    Code of Conduct

    Developers

    Statistics

    Cookie statement

    Mobile view



    Wikimedia Foundation
    Powered by MediaWiki