The Organization of Knowledge Spaces for a Virtual Learning Environment Supported by a Digital Library

 

Terence R. Smith, Olga Agapova, Michael Freeston, Alex Ushakov*

University of California, Santa Barbara, CA93106 USA

smithtr@cs.ucsb.edu, olga@geog.ucsb.edu, freeston@alexandria.ucsb.edu, ushakov@alexandria.ucsb.edu

 

* This paper represents the combined work of the whole of the ADEPT Knowledge Organization Team. Other members are: Olha Buchel, Jim Frew, Linda Hill, Richard Mayer, Jian Qin, Laura Smart, and Tim Tierney.

Abstract

The Alexandria Digital Library Project is developing a concept-based learning environment for the sciences. In this paper, we briefly discuss: the rationale for the approach; the structure of the concept model and correlative relationships between concepts; the components and associated services of the concept-based learning environment; and planned and potential applications of the learning environment.

        

1. Introduction

In addition to services and collections that provide access to geospatially referenced information, the Alexandria Digital Library (ADL) Project is developing concept-based learning environments for the sciences. Such learning environments form an important component of the Alexandria Digital Earth Prototype [ADEPT 2002] and are integrated into the system's basic middleware search services  [Janee and Frew, 2002]. The principal reasons for developing these digital library (DL) services and collections is to help students acquire a ``deeper'' understanding of science.

The approach of ADEPT to concept-based learning rests on premises that:

(1)  scientific activity, understanding, and learning are based on a large core of scientific concepts and their interrelations which are used for representing scientific methods, abstract representational systems, and the phenomena under investigation.

(2) the syntactic and semantic information associated with any scientific concept may be represented in terms of a strongly-structured  model (SSM) of the concept.

(3)  a deeper understanding of any domain of science is promoted when learning environments are based explicitly  on the use of SSMs to represent important sets of concepts and their interrelationships [Smith et. al. 2002a, 2002b.]

We have developed a concept-based learning environment that embodies such principles in the context of Web-based services. This environment has three main components and nine associated services (see figure 1.) The first component consists of one or more knowledge bases (KBs)  containing SSMs of the concepts needed for representing the scientific knowledge about a given domain. In the next section we outline our current SSM of concepts. Three services associated with such KBs include:

(a)    a service for the input and editing of SSMs of concepts;

(b)    a service for searching the KB; and

(c)    a service for graphic representations of the concepts and their interrelationships.

 


Figure 1: Components and associated services of a concept-based learning environment

The second component comprises one or more (heterogeneous) collections of items for use in illustrating elements of the SSMs of a given set of concepts. Three services associated with this collection include:

(a)    a service for the input and editing of metadata records associated with such items;

(b)    a service for searching the collections, particularly in terms of the scientific concepts to which they refer; and

(c)    a service for graphical representations of the items.

The third component is a collection of lectures, self-learning modules, or laboratory sessions that provides access to the information in the KB and collection in a manner that supports efficient learning. Services associated with this collection include:

(a) graphic services for inputting and editing lectures, self-paced learning modules, or laboratory sessions;

(b) a service for searching the lecture collections; and

(c) a service for graphic displays of the contents of the collection.

 

In figure 2, we indicate how such components and services may be employed in supporting lecture-based learning environments. Other applications include self-paced learning and laboratory environments. In lecture-based learning, for example, an instructor may employ three distinct graphics displays for presenting concept-based learning materials to students:

1. a Knowledge Window for representing information about the SSMs of concepts  in the KBs;

2. a Collection Window for displaying DL objects from ADEPT collections that illustrate various aspects of concepts;

3. a Lecture Window for displaying the lecture notes that organize the presentation of information about a topic and its associated concepts.


 


Figure 2: A three-window display of concept-based learning materials

 

We have applied these ideas in teaching introductory courses in Physical Geography and have constructed KBs, Object Collections, and Lecture Collections and implemented the three corresponding sets of services. We currently support the creation of lecture materials by an instructor with the various input, edit, search, and display services integrated into the Lecture Window. This permits the instructor to search the KBs, Object Collections, and Lecture Collections and to organize the results of such search for presentation in the Knowledge and Collection Windows during the course of the lecture.

The rest of the paper is structured as follows. We first describe the SSM of concepts that forms a foundation for our learning environments. We then describe the three services input, search and display for each of the three components concept, collection and lecture of the concept-based learning environment. Finally we discuss planned and potential applications of this learning environment.

2. Strongly-Structured Models of Concepts

We briefly discuss the strongly-structured concept model (SSM) and the services for creating, editing, searching, and displaying the KBs. A basic premise, noted above, is that concepts and their interrelationships are the fundamental building blocks for representing the phenomena of science. The value of any scientific concept rests heavily on the degree to which it may be represented in a manner that: 1. is objectively-communicable; 2. supports the the derivation of scientifically-useful information; and 3. possesses a well-defined (operational) semantics. Such attributes permit scientists to communicate unambiguously about how to represent, use, and interpret scientific concepts and to derive useful information from representations of phenomena that are based on the concepts.

Such requirements suggest the feasibility of constructing ``strongly structured models'' (SSMs) of scientific concepts that incorporate  the scientific semantics of a concept. For example, one may view the semantics of a concept  relating to the measurable properties of some phenomenon (e.g. the Velocity or Depth of a Water Flow) in terms of objectively communicable and reproducible procedures for the measurement of such attributes. We note that this idea goes well beyond the typically thesauri-like definitions of concepts that have traditionally been used in library environments. Analogous ideas have been recognized and employed, albeit implicitly, for some time by various scientific groups including, for example:

1. The MatML Working Group of the National Institute of Standards and Technology (NIST), which constructed a model for representing concepts relating to substances of Materials Science and their properties [NIST 2001]. In particular, this group has created an abstract DTD for representing such concepts as XML records.

2. The Chemical Abstracts Service [CAS 2001] Registry [Weisgerber, 1997], associated with the American Chemical Society, has developed an SSM of chemical substances that includes systematic chemical names, their various representations (molecular structure diagrams, molecular formula, index of ring systems), and information about the properties and interrelations of substances.

 


 


Figure 3: Abstract model (SSM) of a concept and correlative relationships

               

 

Based on such examples of SSMs and previous work, ADEPT has developed an SSM of concepts for scientific domains in terms of a frame-based knowledge representation system with slots and attribute-value fillers. Since the SSM has been described in a previous document [Smith et. al. 2002a, 2002b] we provide only a summary outline at this point. Our current SSM of a scientific concept takes the form shown in figure 3.

We comment briefly on the various elements of this model.

1. DESCRIPTION and HISTORICAL ORIGINS are natural language representations of the concept. While they indicate the scientific semantics associated with the concept, this semantics is typically imprecise and ambiguous.

2.     CLASSIFICATIONS include, for example an ADEPT classification of concepts into abstract, methodological, and phenomenological (or concrete) concepts [Smith et.al., 2002b] as well as various subclasses, while KNOWLEDGE DOMAINS list (some of) the academic FIELDS and TOPICS in which the concept is used.

3.     TERMS are simple (linguistic) expressions denoting the concept, while the HIERARCHICAL RELATIONSHIPS  are specific, thesaurus-like relations between terms. While such terms provide bases for inference (for example: a ISA b AND b ISA c IMPLIES a ISA c) these terms and relationships are essentially without any scientific semantics.

4.     REPRESENTATIONS, DEFINING OPERATIONS, PROPERTIES and CAUSAL RELATIONS, together with the HIERARCHICAL RELATIONSHIPS, relate to relationships among concepts. The REPRESENTATIONS, DEFINING OPERATIONS, PROPERTIES and CAUSAL RELATIONS, however, constitute the heart of the concept model by providing the scientific semantics of a concept.  The Defining Operations provide scientifically objective descriptions of activities that define the concept. For example: abstract concepts are defined in terms of syntactic manipulations that are permitted on various representations of such a concept (e.g. as in the manipulation of Algebraic Equations); methodological concepts may be described in terms of recipes for activities carried out by machines and scientists, such as measurement activities; phenomenological concepts may be defined using phenomenological interpretations of the TERM representation of a concept. PROPERTIES (such as the Area property of the concept of a Polygon) are represented not only using the TERMS denoting the property, but also the activity for computing the property from a given representation.

 

3. Concept input, search, and display services

 

A knowledge base (KB) of concepts for a given domain is defined in terms of a collection of SSMs for a set of concepts. We have developed a form-based input tool to support the construction of KBs of concepts. The input tool is structured to reflect the abstract model of a concept presented above. Parts of the input process may be automated, as in the case of using authoritative glossaries in electronic form to provide input of natural language DESCRIPTIONS of the concepts. Other elements of the SSM for a concept, however, must currently be entered by knowledge domain specialists.

 

The current operational version of the learning environment employs RDBMS technology (and MySQL in particular) for storing and accessing the collections of SSMs of concepts. The concept input form is based on this instantiation. We have also developed support for XML databases of concept SSMs using an XML schema to represent the abstract concept model. We plan to convert from relational to semi-structured (XML) versions in the near future.

 

We have developed visualization tools for concepts and their interrelationships. A current visualization tool, SPROING [Ancona, 2002], supports the visualization of concept maps of subsets of concepts and their interrelationships that may be derived from the KB. This tool employs an OpenGL-based visualization with a simple binary force springs algorithm. The data for visualization are currently provided by a stand-alone extraction routine that runs over the KB and extracts peer-to-peer pairs of relationships. These are represented as a pair of nodes connected with an edge. The colors and styles for the presentation are stored in a parameter file in XML format. A simple search function is implemented on the presentation space. Nodes and edges, found by a given search, may then be stored as a visualization subset, which is essentially a substructure of the presentational graph. Extensions of SPROING support the hand-editing of the graphic outputs for the purpose of in class presentation as well as the visualization of such aspects of concepts as DESCRIPTIONS and Mathematical Representations.

 

4. Collection input, search, and display services

We may associate with any domain-specific KB one or more collections of DL Objects that illustrate various aspects of the concepts modeled in the KB. In the case of a collection of Objects that illustrate the concepts in Hydrology, for example, we may incorporate figures, photographs, or natural language instructions describing measurement techniques for determining the rate at which groundwater flows through some Aquifer. Such collections generally consist of heterogeneous and multimedia Objects, ranging from text to video with soundtrack.

A requirement of such collections is that they be searchable, not only in terms of the usual criteria of spatio-temporal footprint, subject matter, etc., but also in terms of the concepts and concept models represented in a given KB. Hence, for example, an instructor or student may search for all image items that explicitly contain information relating to the concepts of Floodwater Waves from Rivers with a footprint in the Central Valley of California.

Access to such collections within the ADEPT environment and their items is supported by:

1. metadata descriptions of each Object that contain explicit references to the domain-based concepts to which they refer;

2. standard ADEPT middleware search services [Janee and Frew, 2002];

3. ADEPT collection-level metadata for each collection.

We may access the Objects themselves using pointers obtained from the metadata records. Each metadata record takes the form prescribed by the ADEPT/DLESE/NASA (ADN) metadata content standard [ADEPT/DELESE 2001] with an extension defined in terms of an additional element containing TERM representations of a concept. Current collections are implemented as relational databases. This permits the easy construction of ADEPT search buckets using ADEPT's generic Bucket-99 driver.

A web-based metadata entry form has been constructed to support the entry of item metadata records by non-technical users. This entry tool is currently implemented as a series of PHP-driven input forms, developed to accommodate metadata entry into the respective data fields.

The display component of the collection window is functionally controlled from the lecture window. It allows not only the visualization of items selected and linked by the instructor, but also real-time (i.e., lecture-time) search for ADEPT collection items through the use of an ADEPT Browse window.

 

As an example of one of our collections, we have cataloged and stored each of the more than 400 illustrations, available in electronic form, that come with the textbook that is being used to support a physical geography course that will be taught using the learning services.

5. Lecture input, search, and display Services

In terms of concept-based learning in science, we may characterize a course of instructional material as a trajectory through a space of concepts. This characterization is general, as almost any scientific lecture or scientific text-book will prove. Instructors as well as texts largely differ from each other in terms of the order in which concepts are introduced and the different emphases with which they are modeled and related. A major strength of a concept-based approach, therefore, is its support for the construction of flexible, personalized trajectories through concept spaces from the KBs and Object Collections. Another strength is that instructors may re-use/repurpose trajectories that have been constructed previously by other instructors.

We have developed a set of Lecture Window services that may be used to support the creation, and recreation, of structured lecture notes that integrate links to the KBs and Object collections. In particular, an instructor may employ the Lecture Window for:

1. creating (structured) lecture notes;

2. searching over the KBs and Object collections;

3. embedding links from the lecture notes to displays generated by the KB and collection search services;

4. driving displays on the Knowledge, Lecture, and Collection Windows during a presentation.

 


 


Figure 4: The lecture input and editing tool

We have developed a lecture input and editing tool that allows an instructor to create a series of lecture notes. At the most basic level, these tools allow an instructor to create lecture materials in terms of rich textual outline (figure 4). They support the usual creation and editing operations, including linking concept terms to illustrative materials from the KBs and Object Collections. The links attached to an item may be represented in terms of thumbnails or icons immediately following the highlighted item. We have additionally developed services that allow an instructor to create more personalized trajectories concepts through concept space with the use of various organizing subheads. Our current lecture creation service, for example, supports the quasi-automated structuring of lecture materials in terms not only of scientific concepts but also in terms of basic sets of scientific activities, such as:

1.  identifying, observing, and characterizing phenomena;

2. representing phenomena; and

3. understanding phenomena.

The current lecture input tool allows the instructor to choose, with the help of a control panel, lecture note headings from a relatively small subset of terms that characterize each of these sets of activities. Under the category of representing phenomena, for example, we currently support:

1. the creation of subheads for TOPICS(S), CONCEPTS, and MODELS;

2. the use of TOPICS (and SUBTOPICS and SUBSUBTOPICS) subheads for the automated creation of high-level views of lecture materials;

3. the introduction of CONCEPT subheads for explicit description of SSMs of scientific concepts.

The high-level views of lecture materials are presented in a small window to provide context for both instructors and students in ways analogous to a collapsible folder tree. The results of lecture creation are saved in a Lecture Collection. The internal format of lectures follows the enriched Outline Processor Markup Language (OPML) format, which is essentially a subset of XML. Hence lectures are stored in the collection as XML records, supporting interoperability and persistence. The collection itself is implemented as an indexed file structure.

Future developments include putting the lecture collections into the format of Xindice XML databases to support input and editing inside a common browser (IE 5.5+ or NS6+ on a variety of platforms such as Win, Mac, Unix, etc.) Onscreen editing is accomplished on the client side with JavaScript DOM features, matched by Java servlet/JSP components on the server side.

 

6. Applications of the Learning Environment

The current ADEPT learning environment is being deployed in teaching an introductory course in physical geography to about 100 students in the Fall of 2002 and again in the Spring of 2003. Such applications of KBs of SSMs of concepts in learning environments build on recent studies in educational psychology (see Mayer, Smith, and Borgman, 2002.) These studies indicate that concept maps, defined in this context as ``all terms used in a lesson with labeled links among them'', play significant roles in promoting student learning and assessing student knowledge.

Educational research relating to rhetorical structures, or the ways in which materials can be organized into coherent structures for learning, suggests how the KBs and collections may be used in furthering student understanding of some domain of knowledge. In process rhetorical structures, for example, cause-and-effect systems can be represented by flow charts and maps. This corresponds to a view of the KB that extracts Causal and Hierarchical Partitive relations between a given set of concepts. Other rhetorical structures associated with different views of the KB include problem solution (problem based views), classification (hierarchical views), and compare (matrix-based views in which the attributes of different concepts are compared.) Recent research [Mayer et al, 2002] indicates that greater depths of learning occur when rhetorical structures are used to organize learning materials. We note that our services supporting access to, and use of, the resources in the KBs and associated collections enable the construction of views corresponding both to different rhetorical structures and the personal preferences of instructors.

7. Conclusion

A key issue in the development of ADEPT learning services based on SSMs of scientific concepts is an evaluation of the degree to which they deepen a student's understanding of both the conceptual basis of science and scientific knowledge in specific domains. We are currently instrumenting the teaching of courses in order to develop such evaluations. It will take time to assess the pedagogic value of using KBs of strongly-structured representation of scientific concepts, together with the associated collections  and services, in learning environments. Nevertheless, recent developments in both text publishing and in educational psychology indicate their potential value.

Acknowledgments

The work described herein was partially supported by the NSF-DARPA-NASA Digital Libraries Initiative and the NSF NSDL initiative under NSF IR94-11330, NSF IIS-9817432, DUE-0121578, and UCAR S02-36644.

References

ADEPT. (2002)  http://www.alexandria.ucsb.edu

ADEPT/DLESE. (2001) ADN Joint Metadata Content Model. http://www.dlese.org/Metadata/.

Ancona, D. (2002). Visual Explorations for the Alexandria Digital Earth Prototype. Second International Workshop on Visual Interfaces to Digital Libraries, JCDL, Portland, Oregon, 2002.

CAS. (2001). CAS Chemical Abstracts Service Homepage.  http://www.cas.org.

Janée, G. and Frew, J., (2002). The ADEPT Digital Library Architecture. Proceedings of the Second ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '02), Portland, OR, July 14-18, 2002.

Mayer, R.E. (1991). The Promise of Educational Psychology: Learning in the Content Areas. Merrill Prentice Hall, Upper Saddle River, N.J.

Mayer, R., Smith, T.R., and C.L.Borgman, (2002). Digital Libraries as Instructional Aids for Knowledge Construction. Educational Technology.

NIST. (2001). MatML: eXtensible Markup Language for Materials Property Data. MatML DTD Version 2.0.,2001. Prepared by E.F. Begley on behalf of the MatML Working Group.

http://www.ceramics.nist.gov/matml/matml.htm

Smith, T.R.  Zeng, M.L., and ADEPT Knowledge Team, (2002a). Structured models of scientific concepts as a basis for organizing, accessing, and using learning materials, UCSB Department of CS Technical Report 2002-04.

Smith, T.R., Zeng, M.L., and ADEPT Knowledge Team, (2002b). Structured models of scientific concepts for organizing learning materials, Proceedings of the Seventh International Conference on Knowledge Organization, Granada, Spain.

 

Weisgerber,D.W., (1997). Chemical Abstracts Service Chemical Registry System: history, scope, and impacts.  Journal of the American Society for Information Science, 148(4):349-360.