इस ब्लॉग्स को सृजन करने में आप सभी से सादर सुझाव आमंत्रित हैं , कृपया अपने सुझाव और प्रविष्टियाँ प्रेषित करे , इसका संपूर्ण कार्य क्षेत्र विश्व ज्ञान समुदाय हैं , जो सभी प्रतियोगियों के कॅरिअर निर्माण महत्त्वपूर्ण योगदान देगा ,आप अपने सुझाव इस मेल पत्ते पर भेज सकते हैं - chandrashekhar.malav@yahoo.com
28. Case Studies : Dspace.
P- 01. Digital Libraries*
By :Jagdish Arora, Paper Coordinator
Multiple Choice Question
1 / 1 Points
Question 1: Multiple Choice
Dspace is written in?
- PERL
- JAVA programming language
- C++
- Ruby
0 / 1 Points
Question 2: Multiple Choice
What is Dspace?
- An Open source software
- A System Software
- An Automation software
- A Commercial software
0 / 1 Points
Question 3: Multiple Choice
Which metadata standard does DSpace support as native metadata format?
- MARC
- Dublin Core
- AACR
- IEEE LOM
1 / 1 Points
Question 4: Multiple Choice
Who built DSpace?
- IBM
- Microsoft
- MIT and Hewlett-Packard
- HP
0 / 1 Points
Question 5: Multiple Choice
Who contributes to DSpace development?
- Developer community across the world
- Developer team from MIT and HP
- Employees of Duraspace foundation.
- JAVA developers from INFLIBNET Centre.
2 / 5 PointsFinal Score:
True / False
1 / 1 Points
Question 1: True or False
DSpace is a free software
True
False
0 / 1 Points
Question 2: True or False
Dspace is a library automation software
True
False
0 / 1 Points
Question 3: True or False
DSpace uses Z39.50 protocol for metadata harvesting
True
False
1 / 3 PointsFinal Score:
Case Study : DSpace
Case Study : DSpace
Introduction
1.0 Key factors to DSpace’s adaption
2.0 DSpace Information Model
2.1 DSpace System Architecture
3.0 Major Features of DSpace
3.1 Metadata Registry
3.2 File Format Registry 3.3 E-Persons 3.4 Authorization 3.5 Ingestion Process and Work Flow 3.6 Search and Browse 3.7 Handle System 3.8 OAI-PMH Support 3.9 Statistics 3.10 SWORD and Open URL Support.
4.0 Customization in DSpace
5.0 Some Live Examples
References
Introduction
Digital libraries are nothing but services related to management and organization of available digital information and its retrieval with proper user interfacing. It also include archiving and preservation of digital material, social issues attached to the same along with its application and evaluation to specific focused areas. To achieve all there are certain expectation from the software or solution which helps to create digital libraries.
For a proper digital library solution, primarily it is being expected that
There is one stop solution, which meets all the expectations described above, that is DSpace, It is a platform that;
DSpace is typically being used to create digital library with three major roles ; First, it facilitates capture and ingestion of material with associated metadata; Second, DSpace provides easy access to the material with user friendly searching and listing mechanisms; Third, it facilitates long term preservation of digital material.
When initiated (in year 2000), DSpace was a joint project of Massachusetts Institute of Technology and Hewllet-Packard , DSpace project is now being handled by DuraSpace, a non-for-profit organization.
Key factors to DSpace’s adaption
DSpace has become quite popular among digital library implementers because it is open source and freely available software; it is being backed by very large worldwide user community who are ready to help.
DSpace software has been packaged in a way that it is very easy to use, It handles content in number of digital format, and the major advantage is; contents in DSpace can be made searchable through search engines like Google scholar, thus one can increase outreach of digital library without much effort.
DSpace can be used to store any type of digital material, it can store journal papers, Data Sets, Electronic Theses, Reports, Conference posters, Video’s , images. Logically speaking DSpace can be used to store any material which is available in digital format.
DSpace is basically an open source software available under Berkeley Software Distribution (BSD) licence , where in one can use and redistribute source as well as binaries or executable programme. DSpace software can be obtained from www.dspace.org or SourceForge’s dspace project site.
DSpace is having community based development model, where in there is SVN (subversion) based common source code control repository, which is having dedicated committers and contributors. This developer community welcomes every one to submit bug reports, patches, feature requests and other related things. There are number of active discussion groups and email lists are available for dspace support.
DSpace Information Model
Information Model of dspace is broadly divided in four components, that is Communities, Collections, Items and Bitstreams.
Community reflects the unit of an organization, collection in each community is distinct grouping of items, Items are logical content objects where as bitstreams are individual files.
The way data is organized in DSpace is supposed to reflect the structure of the organization and its digital collection.
Each DSpace site is divided into communities, which can be further divided into sub-communities reflecting the typical university structure of college, department, research centre, or laboratory. Communities contain collections, which are groupings of related content. A collection may appear in more than one community. Each collection is composed of items, which are the basic archival elements of the archive. Each item is owned by one collection. Additionally, an item may appear in additional collections; however every item has one and only one owning collection.
Items are further subdivided into named bundles of bitstreams. Bitstreams are, as the name suggests, streams of bits, usually ordinary computer files.
As discussed earlier, communities and collections are used to organize digital content or items in to a hierarchical form, It also contain limited set of descriptive metadata like name, description, licence and availability for that particular community and collection etc.digital library implementer can create communities based on logical grouping of digital items and then further subdivide in to the collections.
Items are logical units of content which consist Dublin core based metadata, as well as other metadata which has been encoded as bitstream. The item can be an electronic thesis, an e-book, photographs, a complete web page which can include images and style sheets associated with HTML page. Each item can contain one or multiple files along with metadata.
DSpace facilitates all three type of metadata that is Descriptive, Administrative and Structural. Descriptive metadata can by any thing which describes the item; it takes care of all elements of Dublin Core metadata set. Dspace also supports non Dublin Core metadata; these elements may not be searchable. Administrative metadata associated with an item can be access restrictions, means who can access, remove or modify and item, there is no standard format available for these type of metadata.Structural metadata describes very basic attributes about item, for example what are the bitstream that contained in an item, or under which community and collection that item belongs to.
Bitstreams are individual digital files, which are having limited set of descriptive metadata like name, size of the file, format of the file etc. A pdf file, word doument, jpeg or bmp picture, executable program etc can be considered as bitstream.
Bundles are basically nothing but group of related files, for example, when you have a HTML page, it may also contain link to other HTML documents, images, flash object etc; therefore to view that html page, you also need to have all associated file, therefore html file along with other associated files makes a bundle. Dspace doesn’t support any metadata for bundles.
2.1 DSpace System Architecture
The architecture of DSpace has been divided in to three major parts, that is DSpace public API on the top, business logic layer in the middle and Storage API at the bottom.
(Image Courtesy: http://www.dspace.org)
DSpace public API takes care of user interfacing and services, it contains components for web user interfacing, Federation services, Metadata providing services as defined by Open Archive Initiative’s protocol, interfaces for web services like SWORD which is Simple Web-service Offering Repository Deposit etc.
These public API components interacts with business logic layer in the middle, which provides search and browse components, Handle manager, History manager which takes care of logs and statistics. Business logic layer also contains components to manage ingestion process and workflow, components to manage e-persons, groups and their authorization along with content management API and administrative toolkit.
The bottom layer, the storage api has been two major components, one is Relational Database Management System wrapper which connects to RDBMS like Postgres or Oracle through Java Database Connectivity and the Bitstream storage manager which directly interact with file system to store bitstreams.
Major Features of DSpace
3.1 Metadata Registry
Dspace provides facility to create new metadata registry or manage existing dublin core metadata registry, where in digital library implementer can manage and customize metadata elements.Metadata registry has three major components that is Schema, Element and Qualifier.
3.2 File Format Registry
In addition to metadata registry dspace also features registry to handle file formats, this file formats can be managed based on three levels that is supported, known and unknown. Here dspace administrator can specify MIME type, Name, Long Description and support level of the file along with file extension.
3.3 E-Persons
Persons or users who interact with dspace are called e-people, it is basically dspace user accounts, dspace provides facility to permit these e-people to login to the site, sign up to receive notification changes to subscription, submit new items to the collections, Administer collection / communities or entire dspace site.These e-people can also be managed by forming groups.
3.4 Authorization
Authorization system in dspace enables administrators to give e-people the ability to perform add and remove operation by which an e-person can remove or add any community, collection or item.
As a collection administrator e-person can edit an item’s metadata, withdraw items or can map the items in to the collection Write permission enables e-person to add or remove bitstreams, where as read permission enables only reading of bitstreams.
3.5 Ingestion Process and Work Flow
Ingestion process is nothing but getting or putting contents in to Dspace. Dspace facilitates batch import as well as web based submisstion.
In batch import, multiple items can be submitted to dspace in one shot, this requires item to be in specific format along with metadata encoded in XML. Whereas in web based submission only one item can go each time, the item being submitted has to go through a work flow process defined for that collection. Assume that there are three steps
Step 1: May reject the submission
Step 2: Edit metadata or reject
Step 3: Edit Metadata
Image Courtesy http://www.dspace.org
A collection's workflow can have up to three steps as shown in figure above. Each collection may have an associated e-person group for performing each step; if no group is associated with a certain step, that step is skipped. If a collection has no e-person groups associated with any step, submissions to that collection are ingested straight into the main archive.
In other words, the sequence is this: The collection receives a submission. If the collection has a group assigned for workflow step 1, that step is invoked, and the group is notified. Otherwise, workflow step 1 is skipped. Likewise, workflow steps 2 and 3 are performed if and only if the collection has a group assigned to those steps.
When a step is invoked, the submission is put into the 'task pool' of the step's associated group. One member of that group takes the task from the pool, and it is then removed from the task pool, to avoid the situation where several people in the group may be performing the same task without realizing it.
3.6 Search and Browse
DSpace allows end-users to discover content in a number of ways, that is via external reference, such as a handle searching for one or more keywords in metadata or extracted full-text. It also provides option for browsing through title, author, date or subject indices, with optional image thumbnails etc.
3.7 Handle System
In present web world Universal Resource Location – URL of a digital content may change due to change in hardware or software, change in network or because of political change. This can be handled by creating a permanent URL independent of the repository. Handle system in DSpace provides a persistent handle for each item, if configured properly.
3.8 OAI-PMH Support
The Open Archives Initiative has developed a protocol for metadata harvesting. This allows sites to programmatically retrieve or 'harvest' the metadata from several sources, and offer services using that metadata, such as indexing or linking services. Such a service could allow users to access information from a large number of sites from one place. DSpace exposes the Dublin Core metadata for items that are publicly accessible. Additionally, the collection structure is also exposed via the OAI protocol's 'sets' mechanism.
3.9 Statistics
DSpace offers system statistics for administrator usage, as well as usage statistics on the level of items, communities and collections. Dspace also provide customizable general overview of activities in the archive, by default including:
3.10 SWORD and Open URL Support.
SWORD (Simple Web-service Offering Repository Deposit) is a protocol that allows the remote deposit of items into repositories. SWORD was further developed in SWORD version 2 to add the ability to retrieve, update, or delete deposits. DSpace supports the SWORD protocol via the 'sword' web application and SWord v2 via the swordv2 web application
DSpace supports the OpenURL protocol from SFX, in a rather simple fashion. If your institution has an SFX server, DSpace will display an OpenURL link on every item page, automatically using the Dublin Core metadata. Additionally, DSpace can respond to incoming OpenURLs too.
Customization in DSpace
Dspace provides varios flexibility and customization options, the areas that can be customized are;
Some Live Examples
ReferencesSelf Learning (Interactive / Video Tutorial)
Self Learning
Self Learning Part - I
Self Learning
Self Learning Part - IIInteresting Facts
Timeline
|
Web links
http://www.greenstone.org/
|
http://wiki.greenstone.org/wiki/gsdoc/others/Greenstone_history.htm
|
http://ie.archive.ubuntu.com/disk1/disk1/sourceforge/g/project/gr/greenstone/OldFiles/gsdl-manual-oct2000.pdf
|
http://drtc.isibang.ac.in/xmlui/bitstream/handle/1849/153/S_gsdltutorial.pdf?sequence=2
|
http://wiki.greenstone.org/gsdoc/tutorial/gs2-current/en/install_greenstone.htm
|
http://www.disa.ukzn.ac.za/downloads/presentations/Greenstone%20Digital%20Library%20software.pdf
|
http://www.cs.waikato.ac.nz/~ihw/papers/05-IHW-DB-CreatingDL.pdf
|
gndec.ac.in/~librarian/kk/466-752-1-SP%5B1%5D.doc
|
http://www.publications.drdo.gov.in/ojs/index.php/djlit/article/viewFile/3655/2067
|
http://eprints.rclis.org/19924/
|
Points to Ponder
Greenstone has two separate interactive interfaces, the Reader interface and the Librarian interface. End users access the digital library through the Reader interface, which operates within a web browser.
|
Greenstone is highly interoperable using contemporary standards, It incorporates a server that can serve any collection over the Open Archives Protocol for Metadata Harvesting (OAI-PMH), and Greenstone can harvest documents over OAI-PMH and include them in a collection.
|
In GSDL the Librarian interface is a Java-based graphical user interface (also available as an applet) that makes it easy to gather material for a collection (downloading it from the web where necessary), enrich it by adding metadata, design the searching and browsing facilities that the collection will offer the user, and build and serve the collection.
|
In GSDL "Plug-ins" are used to ingest externally-prepared metadata in different forms, and plug-ins exist for XML, MARC, CDS/ISIS, ProCite, BibTex, Refer, OAI, DSpace, METS.
|
The reader's interface is available in the following languages: Arabic, Armenian, Bengali, Catalan, Croatian, Czech, Chinese (both simplified and traditional), Dutch, English, Farsi, Finnish, French, Galician, Georgian, German, Greek, Hebrew, Hindi, Indonesian, Italian, Japanese, Kannada, Kazakh, Kyrgyz, Latvian, Maori, Mongolian, Portuguese (BR and PT versions), Russian, Serbian, Spanish, Thai, Turkish, Ukrainian, Vietnamese.
|