SeMap : a generic schema matching system

UBC Theses and Dissertations

Featured Collection

UBC Theses and Dissertations

SeMap : a generic schema matching system Wang, Ting

Abstract

The rapidly growing number of autonomous data sources on the web makes the need of effective tools of creating semantic mappings increasingly crucial. Moreover, the goal of allowing applications to have more expressive semantics requires a change in focus. While most previous work focus on creating mappings in specific data models for data transformation, they fail to capture a richer set of possible relationships between schema elements. For example, current schema matching approaches might discover that ’TA’ in one schema equals to ’grad TA’ in another one, even though the relationship can be modeled more accurately by saying that ’grad TA’ is a specialization of ’TA’. This increased semantics of the mapping in turn allows for applications involving richer semantics. In this thesis we concentrate on the following problem: given initial match (correspondence) information produced by current schema matching techniques, how to construct a complex, semantically richer mapping that can be used across data models? Specifically, we aim at detecting the relationship types of ’Has-a’, ’Is-a’, ’Associates’ and ’Equivalent’. Technically, we achieve this goal in mainly three steps: (1) exploiting various types of semantic evidence for possible matches; (2) finding a globally optimal match assignment; (3) identifying the relationship embedded in the selected matches. We implemented our semantic matching approach within a prototype system SeMap, and tested its accuracy and effectiveness.

Item Metadata

Title	SeMap : a generic schema matching system
Creator	Wang, Ting
Publisher	University of British Columbia
Date Issued	2006
Description	The rapidly growing number of autonomous data sources on the web makes the need of effective tools of creating semantic mappings increasingly crucial. Moreover, the goal of allowing applications to have more expressive semantics requires a change in focus. While most previous work focus on creating mappings in specific data models for data transformation, they fail to capture a richer set of possible relationships between schema elements. For example, current schema matching approaches might discover that ’TA’ in one schema equals to ’grad TA’ in another one, even though the relationship can be modeled more accurately by saying that ’grad TA’ is a specialization of ’TA’. This increased semantics of the mapping in turn allows for applications involving richer semantics. In this thesis we concentrate on the following problem: given initial match (correspondence) information produced by current schema matching techniques, how to construct a complex, semantically richer mapping that can be used across data models? Specifically, we aim at detecting the relationship types of ’Has-a’, ’Is-a’, ’Associates’ and ’Equivalent’. Technically, we achieve this goal in mainly three steps: (1) exploiting various types of semantic evidence for possible matches; (2) finding a globally optimal match assignment; (3) identifying the relationship embedded in the selected matches. We implemented our semantic matching approach within a prototype system SeMap, and tested its accuracy and effectiveness.
Genre	Thesis/Dissertation
Type	Text
Language	eng
Date Available	2010-01-16
Provider	Vancouver : University of British Columbia Library
Rights	For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use.
DOI	10.14288/1.0051728
URI	http://hdl.handle.net/2429/18322
Degree	Master of Science - MSc
Program	Computer Science
Affiliation	Science, Faculty of; Computer Science, Department of
Degree Grantor	University of British Columbia
Graduation Date	2006-11
Campus	UBCV
Scholarly Level	Graduate
Aggregated Source Repository	DSpace

Item Media

ubc_2006-0700.pdf -- 3.27MB

Item Citations and Data

Rights

For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use.

Open Collections

UBC Theses and Dissertations

SeMap : a generic schema matching system Wang, Ting

Abstract

Item Metadata

Item Media

Item Citations and Data

Rights