Building Collections Using Greenstone

0
0
2547 days ago, 841 views
PowerPoint PPT Presentation
Greenstone. New Zealand Digital Library Project at the University of WaikatoIn collaboration with UNESCO, Human Info NGOInternational, each continentExamples:AcademicDigitization projectsClasses on computerized librariesNon-academicUNESCO compassionate documentation. Greenstone highlights. Works with existing documentsImports a few formatsSearching: full content and metadataDublin Core, custom metadataBrows

Presentation Transcript

Slide 1

Building Collections Using Greenstone Tod A. Olson <tod@uchicago.edu> Sr. Developer/Analyst Digital Library Development Center University of Chicago Library http://www.lib.uchicago.edu/dldc/talks/2003/dlf-greenstone/

Slide 2

Greenstone New Zealand Digital Library Project at the University of Waikato In participation with UNESCO, Human Info NGO International, each landmass Examples: Academic Digitization ventures Classes on advanced libraries Non-scholastic UNESCO compassionate documentation

Slide 3

Greenstone highlights Works with existing reports Imports a few configurations Searching: full content and metadata Dublin Core, custom metadata Browse Structured archives Indexing, get to Extensible & adjustable OpenSource programming (GPL)

Slide 7

Greenstone Architecture Receptionist Protocol Collection Server Collection Server Collection DB & Indexes DB & Indexes DB & Indexes Import Redrawn from Witten & Bainbridge, How to Build a Digital Library, p. 356

Slide 8

Receptionist Provides UI Accept client input Send to fitting gathering server Accept comes about Dynamic page era Collection Server Handle accumulation content Search and channel data Return comes about different accumulations Greenstone Architecture

Slide 9

Building Collections HTML DB & Indexes Import Build GSAF PDF ???

Slide 10

Building accumulations Create a gathering system or work with an old accumulation Select reports Import archives Converts to inside XML organize (GSAF) Build gathering makes look lists and peruse postings

Slide 11

GSAF: interior XML arrange < Section > <Description> <Metadata name="Title" value="… "> <Content> [Text, pictures, joins, etc.] < Section > <Description> <Metadata name="Title" … > <Content>… < Section >… < Section >… < Section >…

Slide 12

GSAF: inward XML design Section: Description Metadata fields Content Text,internal markup, pictures Section No cutoff in number or profundity Hierarchical records Sections settle, tree structure

Slide 13

Config record: collect.cfg Collection-particular setup document, collect.cfg, indicates:   record sorts to import Indexes and peruse records Document or segment level passage (content file just) show of results and peruse postings record shows

Slide 14

Chopin Early Editions Over 400 early version Chopin scores 1830's to 1880's Target group of onlookers: music researchers & performers. On web, page-turnable JPEG pictures. Online in March 2003 Currently 372 scores in online accumulation Usage: Nearly100 hits every day, > 30% of utilization is global.

Slide 22

Catalog records Scanned Images Structural metadata Build diagram Greenstone Archive Format Greenstone Dig. Library Software XSLT METS & MODS Human preparing XML-based robotized handling

Slide 23

Structural and other metadata "chopin","108","001","","1","" "chopin","108","002","","1","" "chopin","108","003","1","1","Nocturne, no.15" "chopin","108","004","2","1","" "chopin","108","005","3","1",""

Slide 24

Catalog records Scanned Images Structural metadata Build outline Greenstone Archive Format Greenstone Dig. Library Software XSLT METS & MODS Human handling XML-based computerized preparing

Slide 25

METS & MODS dmdSec MODS fileSec URL: page1.jpg URL: page2.jpg structMap div DMDID=1 div FILEID=1 div FILEID=2 Catalog record (MARC) Scanned pictures (JPEG) Structural metadata

Slide 26

METS & MODS Program utilizes auxiliary metadata to: Generate structMap Generate picture URLs for fileSec Images put away by naming tradition Structural md conveys index record no. Extricate MARC from inventory crosswalk to MODS Embed in dmdSec

Slide 27

GSAF XML organize for inward stockpiling Hierarchical record structure Nested segments: e.g. section 1, chapt. 2 METS to GSAF by means of XSLT Natural mapping from METS to GSAF Map basic progression Follow joins Descriptive metadata File content

Slide 28

METS to GSAF Section Description Metadata: Title, … Content: Title, … Section Content: Page 1 page1.jpg Section Content: Page 2 page2.jpg dmdSec MODS: Title, … fileSec page1.jpg page2.jpg structMap div: Score div: Page 1 div: Page 2

Slide 29

METS to GSAF Section Description Metadata: Title , … Content: Title , … Section Content: Page 1 page1.jpg Section Content: Page 2 page2.jpg dmdSec MODS: Title , … fileSec page1.jpg page2.jpg structMap div: Score div: Page 1 div: Page 2

Slide 30

METS to GSAF Section Description Metadata: Title , … Content: Title , … Section Content: Page 1 page1.jpg Section Content: Page 2 page2.jpg dmdSec MODS: Title , … fileSec page1.jpg page2.jpg structMap div: Score div: Page 1 div: Page 2

Slide 31

METS to GSAF Walk basic metadata to make the tree of <Section> components Descriptive metadata: <Description> Crosswalk to sought metadata names <Content>: Format metadata coveted for show File information <Content>: Inline content, connection to pictures, and so on

Slide 32

Customizing Chopin accumulation Focus on route Metadata for custom get to E.g. kind, dedicatee not in MARC/AACR2 Can bolster with METS, MODS, Greenstone Custom archive route Separate depiction from scores Custom page route Improves ease of use Branding in next stage

Slide 33

Comments on Chopin Early Editions Data made by staff utilizing commonplace instruments Structural md made in desktop application Catalog records an extravagance Catalog is DB of record Project IDs in 909 POIs point into Greenstone METS/MODS gathered by program Expect to repurpose METS for different applications Customization: route, not marking Faster to raise accumulation, get client response

Slide 34

Greenstone benefits for Chopin Robust, develop framework Recovered time in venture Fast to raise UI out of the case Dynamic page era Incremental customization XML agreeable Natural mapping from METS to GSAF

Slide 35

Future work: Chopin Add DjVu picture arrange Repurpose METS for different applications OAI Standardize new digitization generation stream Project was first for METS, MODS, GS, & 6 depts. Institutionalize accumulation of basic metadata Plug in illustrative metadata as proper Store authentic engaging metadata in METS question Repurpose by means of XSLT for conveyance

Slide 36

Other custom UI cases Lehigh Digital Bridges Extensive changes to look Washington Research Libraries Consortium (WRLC) Custom page flag Popup page turner in Perl GS as segment of DL suite

Slide 42

Ongoing work: Greenstone Librarian Interface (GLI) Greenstone 3

Slide 43

Collection administration Informed by work at GS destinations Assist accumulation fashioner Support all periods of gathering construct handle Do not determine work process Java-based GUI device Formerly called the "Gatherer" 2 yrs being developed In beta outside of lab Bangalore, different locales in flow appropriation Greenstone Librarian Interface (GLI)

Slide 44

Greenstone 3 GS2 develop, 5+ yrs., wide arrangement Constraints: bolster legacy frameworks Other advancements have developed: Java, XML GS3: revamp in Java, XML, XSLT Distributed engineering, SOAP METS as inside organization Group amassed for Greenstone METS profile(s) OAI bolster arranged 1 year in dev; alpha testing in lab

Slide 45

Conclusion Positive encounters Good course for improvement Strong client group Proven in genuine computerized library ventures

Slide 46

Links & Further Information Chopin Early Editions: http://chopin.lib.uchicago.edu/Greenstone: http://www.greenstone.org/Downloads, documentation, cases New Zealand Digital Library Project: http://www.nzdl.org/UNESCO & related accumulations, numerous demos Witten & Bainbridge. The most effective method to Build a Digital Library . Morgan Kaufman, 2003.

SPONSORS