D-Lib MagazineMarch/April 2016 RAMLET: a Conceptual Model for Resource Aggregation for Learning, Education, and Training
Katrien Verbert, KU Leuven, Belgium AbstractThe Resource Aggregation Model for Learning, Education and Training (RAMLET) is a new IEEE standard, IEEE Std 1484.13.1-2012, that has been developed to facilitate interoperability of existing standards for multimedia resource aggregations. Examples of such standards include METS and MPEG-21 Digital Item Declaration. These standards are used on a large scale for describing the structure of a collection of multimedia content. In addition, multiple structures can be specified to provide different paths through the same content. A limitation lies in the fact that they cannot interoperate. For instance, content structured in MPEG-21 cannot be reused in a METS context. This article presents different use cases in which the RAMLET standard enables interoperability of resource aggregation formats. We describe the RAMLET standard and mappings to other standards. In addition, we discuss lessons learned from the development process of this standard and future research challenges. 1 IntroductionResource aggregation is both a process and the end result of that process. The process gathers resources and describes their structure, so that the resulting aggregation can be used for transmission, storage, and delivery to users [1]. The resource aggregation specifies how resources fit together into a coherent, structured whole. In addition, resources composing the resource aggregation can be structured in more than one way. As an example, a resource aggregation may collect a short video resource that explains the history of the Web, basic tutorials on HTML, CSS and Javascript and exercises related to these topics. A resource aggregation collects and structures these resources. In addition, multiple structures can be specified to provide different paths through the same content, such as starting first with a simple exercise for students with previous experience. Different communities, such as the multimedia, library, technical documentation, and learning technology communities, have created their own specifications and standards for resource aggregations. Examples include the Metadata Encoding and Transmission Standard (METS), an initiative of the Digital Library Federation [7], the IMS Content Packaging (IMS CP) specification, which is predominantly used in the educational domain, and MPEG-21 Digital Item Declaration (MPEG-21 DID), a standard for the audio-visual content industry. OASIS OpenDocument and International Digital Publishing Forum EPUB also fit into this mold. Often, a need exists to exchange resource aggregations among communities that may be using different aggregation formats. In these cases, it is necessary to map between aggregation formats, so that existing resource aggregations can be exchanged and reused in new and different contexts. To facilitate such exchange, there is a need for a common conceptual model that enables the interpretation and translation of different resource aggregation formats. The Resource Aggregation Model for Learning, Education and Training (RAMLET) is a new IEEE standard, IEEE Std 1484.13.1-2012, that defines such a conceptual model for digital aggregations of resources for learning, education, and training applications. The RAMLET standard has been developed using an ontological approach. Five aggregation formats, IMS CP, METS, MPEG-21 DID, Atom, and OAI-ORE, were analyzed, and a core ontology was constructed that represents an abstract generalization of the aggregation formats. In addition, mappings have been defined between the RAMLET model and these resource aggregation formats to enable their interoperability. The approach enables the exchange and transformation of resource aggregations that are stored in different formats and the exporting of an aggregation for use in OAI-ORE, IMS CP, METS, Atom or MPEG-21 applications. In this paper, we present our work on the development of RAMLET. Section 2 presents use cases that illustrate the need for the standard. Section 3 outlines the resource aggregation formats that were used for the development of RAMLET. Section 4 presents the RAMLET model and mappings that define crosswalks among aggregation formats. Finally, concluding remarks and critical reflections are presented. 2 Use CasesThis section presents several use cases that illustrate the need for the standard. The first use case illustrates exchange and reuse of resource aggregations among systems using different standards and specifications. Use Case 2 illustrates how the use of RAMLET can enable interoperability of systems that use their own internal format for resource aggregations. Seven other use cases have been described in [8]. Use Case 1This use case addresses exchange and reuse of resource aggregations among systems using different specifications. For example, a system using METS might import resource aggregations that use IMS CP and MPEG-21 DID, and create a new resource aggregation. Usage scenario A content author in a university is developing a new resource aggregation and wishes to include resources from different sources, including learning resources, reference materials, and research data. The author searches for appropriate materials and retrieves each resource to an authoring system. The resources are exported from their repositories in resource aggregation formats specific to their respective repositories. The authoring system interprets the incoming resource aggregation formats and converts them to its native format. The author then creates the new resource aggregation, including the imported resources, and makes the new resource aggregation available to the local learning management system (LMS) or run-time system (RTS). The new resource aggregation is in the aggregation format used by the authoring system. Figure 1 presents this scenario. Figure 1: Use Case 1 ((IEEE Std 1484.13.1-2012 Reprinted with permission from IEEE. copyright IEEE, 2012. All rights reserved.) Use case summary: retrieve-interpret-aggregate-deploy This use case addresses retrieving resource aggregations from diverse resource repositories that provide resource aggregations in different resource aggregation formats. The retrieved resource aggregations are interpreted, and converted into a single format that can be used by an authoring system and then aggregated into a new resource aggregation. The new resource aggregation can be deployed by an LMS or RTS that is limited to a single resource aggregation format. Use Case 2An LMS creates a time and context-sensitive resource aggregation and will import, store, and make available resource aggregations from systems using different specifications. For example, a system using its own internal format might import resource aggregations that use IMS CP, METS, MPEG-21 DID, and the format used by a student information store at the time they are required in the learning path. Usage scenario An LMS supports a learner by using and providing learning resources that are appropriate in the respective context of the learning situation and the individual requirements of the learner at a particular time. Such requirements may include accessibility preferences or needs in order to access the material. The LMS retrieves, provides, and aggregates required resources just in time and makes use of different sources that provide resources in different resource aggregation formats. Figure 2 presents this scenario. Figure 2: Use Case 2 ((IEEE Std 1484.13.1-2012 Reprinted with permission from IEEE. copyright IEEE, 2012. All rights reserved.) Use case summary: retrieve-interpret-internalize-deploy This use case avoids building a complete resource aggregation prior to deployment. RAMLET supports transformation and interpretation of retrieved resource aggregations in diverse aggregation formats into a single format. The delivery system is able to produce an internal representation of the resource aggregation and to render the resources. 3 Resource AggregationsThis section briefly outlines the resource aggregation formats that were used in the development process of RAMLET: IMS CP, METS, MPEG-21 DID, Atom, and OAI-ORE. The structure and properties of the resource aggregation formats are described. Simple examples are included to illustrate the different constructs. Note that these examples are incomplete and much simpler than real world applications. 3.1 IMS CPIMS CP is a specification that enables learning resources to be transported between educational environments. The specification was developed by the IMS Global Learning Consortium and plays a central role in the learning technology community. An IMS content package contains two major components: an XML document called a manifest file that describes the content structure and associated resources of the package, and the content making up the content package. The manifest file is comprised of four sections:
The organizations section describes zero, one, or multiple organizations of the resource aggregation. Multiple organizations can provide learners with a variety of alternative structures for content. Each organization within the organizations section specifies how resources fit together into a hierarchically arranged sequence of items. These items point to a resource in the resource section. Listing 1 shows an example. Listing 1: A simple IMS CP manifest document 3.2 METSMETS is a standard for encoding descriptive, administrative, and structural metadata regarding resources within a digital library [9]. The standard is maintained by the U.S. Library of Congress. A METS document consists of seven sections:
METS and IMS CP are similar. Both resource aggregation formats specify how resources fit together into a hierarchically structured whole, express metadata pertaining to the content, and provide an inventory of files. The distinction between administrative and descriptive metadata of METS is not available in IMS CP nor is behavioral information. Listing 2 shows an example of a simple METS document. The <mets> element is the root element of the aggregation. The <mets:structMap> element corresponds to the IMS CP organization element. As with the IMS CP organization element, the <mets:structMap> represents the structure of content in a hierarchically arranged sequence of <mets:div> elements. In the example, the <mets:structMap> organizes content into a sequence of two physical pages. Each page division points to a single file by a <mets:fptr> element, which corresponds to an IMS CP resources element. Listing 2: A simple METS document (adapted from [10]) 3.3 MPEG-21 DIDMPEG-21 is a framework for networked digital multimedia, designed by the Moving Picture Experts Group. MPEG-21 describes a standard that defines the description of content and also processes for accessing, searching, storing, and protecting the copyrights of content. The basic concept in MPEG-21 is the digital item. Digital items are structured resources, and include a standard representation, identification, and metadata. Digital items are defined by the MPEG-21 Digital Item Declaration (DID), a subpart of the standard, which defines the following entities:
Another set of entities contains choice, selection, condition and assertion and allows the description of a digital item to be optional or available under specific conditions. For example, a choice may represent the choice between a high- or low-bandwidth Internet connection. Based on the selection made by a user, the conditions attached to an entity of a digital item may be fulfilled and the entity may become available [11]. Such conditional behavior is not available in METS or IMS CP. Listing 3 shows an example of an MPEG-21 DID document. The DIDL element is the root element. A container element corresponds to IMS CP organization and METS structMap elements and describes the structure of the resource aggregation. In the example, the container element groups two item elements, track01 and track02. MPEG-21 items correspond to IMS CP items and METS div elements. These items bind components to descriptor elements. Such components, then, correspond to IMS CP Resource and METS file elements. Listing 3: A simple MPEG-21 DID document 3.4 ATOMAtom is an XML-based document format that describes lists of related information, known as feeds. Feeds are comprised of one or more entries, each with an extensible set of metadata. For example, each entry has a title. Atom primarily addresses the syndication of web content, such as weblogs and news headlines, to websites and users. Listing 4 shows a brief, single-entry, Atom feed document. The feed element is the root element of the aggregation. Metadata associated with the feed are represented by title, link, updated, author and id elements in the example. Feeds aggregate entries that are containers for information relating to one content element. Such entries correspond to IMS CP files, METS files, and MPEG-21 resources. Listing 4: An Atom feed document 3.5 OAI-OREOAI-ORE defines standards for the description and exchange of aggregations of web resources. The OAI-ORE Atom is comprised of the followin elements:
ORE supports serializations of Resource Maps in RDF/XML, RDFa and Atom. An example in Atom XML is shown in Listing 5. Listing 5: A simple OAI-ORE document 4 RAMLETMappings between aggregation formats have been elaborated using different methods including XSLT [10] and Perl scripts. RAMLET uses a Semantic Web approach in the form of a conceptual model to map between aggregation formats. The conceptual model describes the similarities of, and the relationships among, the components of the data models of various aggregation formats. It focuses on the structures of aggregation formats by analyzing and describing the characteristics of common structural elements, then abstracting and expressing them in ontological form as a core ontology. 4.1 Conceptual ModelTo develop the conceptual model, the five aggregation formats, METS, MPEG-21 DID, OAI-ORE, Atom and IMS CP, were analyzed. The model was refined to avoid inconsistencies and then expressed as a core ontology. Mappings were then defined between the conceptual model and the aggregation formats and expressed as ontologies. These mappings are being published as IEEE recommended practices [2, 3, 4, 5, 6]. The conceptual model was first implemented in OWL and then expressed in human-readable tables in the IEEE standard. OWL was selected for use for formally expressing the structural concepts of the conceptual model and their interrelationships for the following reasons:
A simplified version of the RAMLET model is presented in the first column of Table 1. This table presents concepts of RAMLET, and their corresponding concepts in existing standards and specifications. Relationships and constraints are not presented in the table. The ramlet:topNode class represents the root element of an aggregation. ramlet:descriptorObject is a wrapper for metadata, usually expressed using external standards. The distinction is made between administrative and descriptive metadata to represent the functional metadata segmentation of METS. Other subclasses of ramlet:descriptorObject define metadata elements that appear as XML attributes in the source representation of the resource aggregation formats. Because METS allows the most specific level of specification related to metadata, most metadata elements are derived from METS, including information regarding how the files were created and stored, intellectual property rights, and metadata regarding the original source object from which the aggregation derives. ramlet:generatingTool, ramlet:humanLanguage, ramlet:partyEmail, and ramlet:textType derive from Atom. ramlet:contentEncoding and ramlet:nodeVisibility derive respectively from MPEG-21 and IMS CP. Classes for representing identifiers are defined for the aggregation (ramlet:aggrega- tionID), local elements (ramlet:elementID), and structural nodes (ramlet:nodeID). A ramlet:nodeID is a special kind of local identifier and can be found in METS. ramlet:staticStructure represents the structure of the resource aggregation into a hierarchically arranged sequence of ramlet:structureNodes. ramlet:dynamicStructure and its subclasses describe a resource aggregation as being optional or available under conditions.Table 1: A simplified version of RAMLET and corresponding mappings
4.2 MappingsMappings have been defined between the RAMLET model and IMS CP, METS, MPEG-21 DID, OAI-ORE and Atom. Table 1 summarizes these mappings. cp:manifest, mets:mets, mpeg:DIDL, atom:feed and oai:resourceMap represent the root node of an aggregation and are mapped to ramlet:topNode. cp:metadata, mets:mdWrap, mpeg:annotation, mpeg:statement, mpeg:descriptor, and atom:category are wrappers for metadata and are mapped to the ramlet:descriptorObject class. If elements have similar semantics, such as mets:role and mets:otherRole, they are mapped to the same RAMLET class, in this case, ramlet:partyRole. Mappings of the other elements proceed analogously. All the elements and attributes of IMS CP, METS, MPEG-21 DID, OAI-ORE, and Atom have been represented in the core RAMLET model and are mapped to their equivalent classes or subclasses. These mappings will be available as IEEE recommended practices. 4.3 DiscussionOntology mappings defined by RAMLET are bidirectional, i.e., an IMS CP aggregation can be expressed using RAMLET terms and vice versa. To map resource aggregation instances from one aggregation format, such as IMS CP, to another, such as METS, two steps are required. First, a mapping is made from the IMS CP instance to the core ontology and then from the core ontology to the METS instance. This approach can scale as the number of aggregation formats increases. To include a new aggregation format, such as RSS, only a bidirectional mapping between the core ontology and the new aggregation format would need to be expressed. Without the RAMLET conceptual model, instead of a single mapping from RSS to the core ontology, individual mappings from RSS to all other aggregation formats of interest would need to be developed (e.g., RSS to Atom, RSS to METS, RSS to MPEG-21 DID, etc.). RAMLET provides an integrative conceptual model that covers IMS CP, MPEG-21 DID, METS, Atom, and OAI-ORE. Coverage of these aggregation formats implies that all data elements of these formats are represented in the conceptual model, which enables lossless transformation from the aggregation formats to the conceptual model. However, a potential loss of information exists when translating from an aggregation instance in one format to the core ontology and then to an aggregation instance in another format (e.g., METS to RAMLET to IMS CP), depending on the semantic overlap between the source and target aggregation formats. Although the transformation between resource aggregations is not always lossless, important structural relationships should be preserved, thus achieving a maximal preservation of meaning. The core ontology can also be used to develop new aggregation formats, both because it records many of the components that have already been used in content aggregation, and via the mappings because it indicates how widely those components are used in existing formats. In addition, generalizations about how the various components relate to each other are captured by the core and mapping ontologies. Consequently, it is easier for communities to follow good practice and design new formats that stay within the bounds of the semantics of existing formats as much as possible, adding new functionality only where necessary. Because the RAMLET core ontology and mappings have been formulated using standard Semantic Web technology, extensions to the ontologies can be easily accommodated by defining equivalence, refinement, inclusion, or any other relationship between the extensions and the existing ontology using OWL and other RDF vocabularies, such as RDF Schema (RDFS) and Simple Knowledge Organization Systems (SKOS). When elements and attributes from schemas external to these aggregation formats have been used within them, such as xlink attributes, they have usually taken the form of object properties for associated RAMLET classes within each of the mappings. 5 ConclusionIn this paper, we have presented the RAMLET standard for structuring and exchanging resource aggregations. The standard enables implementers to assemble and structure resources and to export the resulting aggregations for use in IMS CP, METS, Atom, OAI-ORE, and MPEG-21 applications. In addition, aggregations available in different resource aggregation formats can be imported. Such interoperability is essential to enable sharing and reuse of multimedia resources on a global scale. Previous attempts to enable such interoperability are limited to a crosswalk between IMS CP and METS, expressed in XSLT [10]. Such direct crosswalks cannot scale. If n is the number of resource aggregation formats, 2n transformation implementations are required for each new format to enable its interoperability with the other resource aggregation formats. RAMLET provides a generic approach that can scale as the number of resource aggregation formats increases, as only two transformation implementations are needed for a new format, one expanding RAMLET core, if necessary, and one mapping of the new aggregation format to RAMLET core. The effort to create the RAMLET conceptual model was quite difficult at the beginning of the process until the working group came to agreement on the underlying principles to scope the work, and then came to agreement on an ontological approach to the expression of the conceptual model. While the latter decision was controversial at the time, it proved to be a pivotal decision for a number of reasons:
Because of the evolution of open source and web-based visualization tools for ontology creation (ultimately using Knoodl by Revelytix), the group was able to make significant progress without having each person develop expertise in any given software product. The importance of a standard like RAMLET has been articulated by several researchers [13, 14, 15, 16] during the past decade. McDonough [16] indicates that: "the strategy is to accept that the need for local community control over encoding practices is a valid one, that regional 'dialects' of markup languages are inevitable, and that we must find ways to facilitate information exchange across the boundaries of different communities' markup vernacular." and "the library community needs to shift from its current singular focus on schema development to a dual focus on both schema development and translation between schemas." The RAMLET conceptual model currently covers IMS CP, METS, MPEG-21, OAI-ORE and Atom. Interesting future directions include extending the model for coverage of other aggregations formats, such as OASIS OpenDocument. Finally, it is worth noting that the work presented in this paper has been standardized, but real world uptake is still in early stages. The conceptual transformations will need to be implemented and evaluated in a real world environment. Implementation options have been discussed by Kraan [12]. Evaluation of these options constitutes an important future line of research. References
About the Authors
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|