Instructional exercise OAI and OAI-PMH for Beginners A prologue to the Open Archives Initiative and the Protocol for Me

0
0
1832 days ago, 791 views
PowerPoint PPT Presentation

Presentation Transcript

Slide 1

Instructional exercise OAI and OAI-PMH for Beginners A prologue to the Open Archives Initiative and the Protocol for Metadata Harvesting Pete Cliff UKOLN, University of Bath, United Kingdom p.d.cliff@ukoln.ac.uk Uwe Müller Humboldt University Berlin, Germany u.mueller@cms.hu-berlin.de

Slide 2

Agenda Part I History and diagram Part II Main Ideas of the OAI-PMH/Technical presentation Short break Part III – Breakout Sessions Implementation issues – information and administration supplier Coffee Break Part IV Implementation issues – XML mapping and supporting various record groups third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners

Slide 3

Acknowledgments Some of the slides exhibited here are our own! Huge numbers of them have been sympathetically given by (taken from!) Herbert Van de Sompel Carl Lagoze Michael Nelson Simeon Warner Andy Powell (and others likely!) third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners

Slide 4

Tutorial OAI and OAI-PMH for Beginners A prologue to the Open Archives Initiative and the Protocol for Metadata Harvesting Part I: History and outline

Slide 5

A History Lesson - Roots of OAI Some early movement XXX (arXiv), CogPrints, NCSTRL, RePEc Web interfaces for individuals No machine interfaces Different interfaces for various chronicles End Users compelled to learn assorted interfaces Little or no independent metadata sharing third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 6

Santa Fe Meeting "… the joint effect of these and future activities can be considerably higher when interoperability between them [e-print archives] can be set up… " [Ginsparg, Luce, Van de Sompel, UPS Call, July 1999] third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 7

The Problems Two issues: End clients where/are confronted with numerous inquiry interfaces making asset disclosure harder. No machine based method for sharing the metadata third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 8

Cross Search? US Digital Library Experience proposes cross seeking doesn't scale - N > 100 = awful! Gathering depiction - knowing which focus to utilize Query dialect and inquiry trait variety Rank blending issue Different size and kind of target can skew results Performance - restricted to slowest target Difficult to assemble a peruse interface SOLUTION: get all the metadata records in one place third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 9

Harvest? Collect records out of chronicles into one place Universal Preprint Service Prototype So: N = 1 more often than not… One question dialect, set of hunt qualities and positioning calculation A familiarity with the information makes peruse structures less demanding to manufacture UPS was immediately changed to OAI - the Open Archives Initiative third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 10

Data and Service Providers Data Provider Creators and guardians of the metadata and vaults of assets Service Provider Harvesters of metadata with the end goal of giving an administration, for example, a pursuit interface, peer-audit framework, and so on. One "administration" can assume both parts third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 11

The Dawn of a Protocol To encourage metadata gathering there should be concession to: Transport convention - HTTP or FTP or … Metadata arrange - Dublin Core or MARC or … Metadata Quality Assurance - obligatory component set, naming and subject traditions, and so on. Licensed innovation and Usage Rights - who can do what with what ? Understanding prompted to (display): the Santa Fe Convention third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 12

The Santa Fe Convention First incarnation of the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) Drew upon: The UPS Prototype RePEc/SODA - the Service/Data supplier show the Dienst Protocol Work of the Santa Fe gathering To "enhance the disclosure of e-prints" third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 13

The OAI-PMH 1.0 Introduced Dublin Core component set Drew upon: Santa Fe Convention Digital Library Federation gatherings Work at Cornell Feedback from alpha-analyzers another center to encourage the revelation of "record like items" third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 14

The OAI-PMH 1.0 - Summary Low obstruction interoperability detail Based around metadata collecting model Focus on "report like articles" HTTP based GET/POST asks for XML reactions Uses unfit Dublin Core Not a hunt convention! Test third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 15

The OAI-PMH 1.1 An update of the 1.0 determination assessing changes to the developing XML Schema detail third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 16

The OAI-PMH 2.0 Major correction - not perfect with 1.x Drew upon: OAI-PMH 1.x Feedback from OAI Implementers List OAI tech consideration Feedback from alpha-analyzers "the repetitive trade of metadata about assets between frameworks" third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 17

The OAI-PMH 2.0 - Summary Still a low obstruction interoperability particular Based around metadata reaping model Metadata about assets HTTP based GET/POST asks for XML reactions Uses inadequate Dublin Core Not a pursuit convention! Stable - OAI has focused on making ensuing amendments of the convention in reverse perfect third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 18

nature exploratory trial stable Dienst verbs OAI-PMH OAI-PMH asks for HTTP GET/POST HTTP GET/POST HTTP GET/POST reactions XML transport HTTP unfit Dublin Core inadequate Dublin Core metadata OAMS record like items assets about eprints metadata reaping metadata gathering metadata collecting model Santa Fe tradition OAI-PMH v.1.0/1.1 OAI-PMH v.2.0 third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 19

Multiple information and administration p's Data suppliers Harvesting in light of OAI-PMH Service suppliers third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 20

Aggregators Data suppliers Aggregator Service suppliers third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 21

Can be blended with x-looking Data suppliers Harvesting in view of OAI-PMH Searching in light of Z39.50 or SRW Service suppliers third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 22

The Benefits of OAI-PMH Simple Web (thus firewall) benevolent Access-control, pressure, mistake codes, and so on in light of HTTP Many toolboxs - can conceal the convention from designers Multiple SPs can reap from different DPs guaranteeing a more extensive spread of metadata A base layer to assemble different administrations on Complements seek conventions like Z39.50 third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 23

Summary So Far Early movers growing independently Need for interoperability Santa Fe Meeting prompted to OAI advances interoperability through: OAI-PMH Low cost Harvest display Data Providers/Service Providers Simple, simple and based on existing innovation An open standard third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 24

Resources OAI Web webpage: http://www.openarchives.org/OAI-PMH particular: http://www.openarchives.org/OAI/openarchivesprotocol.html Implementation rules: http://www.openarchives.org/OAI/2.0/guidelines.htm Discussion records: http://www.openarchives.org/postal worker/listinfo/oai-general http://oaisrv.nsdl.cornell.edu/postal carrier/listinfo/oai-implementers Repository traveler: http://oai.dlib.vt.edu/cgi-canister/Explorer/oai2.0/testoai Tools: http://oai.dlib.vt.edu/cgi-container/Explorer/oai2.0/testoai third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 25

Examples of Service Providers Citation Indexing http://icite.sissa.it Search Engine http://www.ncstrl.org/Printing on Demand Service http://www.proprint-service.de Value included Search Engine http://www.myoai.com third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part I

Slide 26

Tutorial OAI and OAI-PMH for Beginners A prologue to the Open Archives Initiative and the Protocol for Metadata Harvesting Part II: Main Ideas of OAI-PMH Technical Introduction

Slide 27

Agenda Protocol Basics Protocol Details Request Types Examples third OAForum workshop - Berlin - 27th-29th March 2003 - Tutorial: OAI and OAI-PMH for Beginners - Part II

Slide 28

The Open Archives Initiative (OAI) Main thoughts overall solidification of insightful files free access on the chronicles (at any rate: metadata) predictable interfaces for documents and administration supplier low hindrance convention/easy usage in view of existing principles (e.g. HTTP, XML, DC) Basic working Requests (in light of HTTP) Metadata (Documents) „ Service" Metadata (encoded in XML) Harvester Repository

SPONSORS