¡¡Chinese Journal of Computers   Full Text
  TitleUncertain Schema Matching in Deep Web Integration Service
  AuthorsJIANG Fang-Jiao1),2) MENG Xiao-Feng1) JIA Lin-Lin1)
  Address1)(School of Information, Renmin University of China, Beijing 100872)
2)(Xuzhou Normal University, Xuzhou, Jiangshu 221116)
  Year2008
  IssueNo.8(1412¡ª1421)
  Abstract &
  Background
Abstract With increasing of Deep Web, providing high quality data from autonomous, heterogeneous and dynamic Web databases to users is becoming a hot topic in recent research of Deep Web integration service. How to generate the reasonable schema matching between the keywords of the user request and schema of integrated interface as well as between the schema of integrated interface and that of Web database interface is essential. The related works about schema matching are generating the best schema matching which slide over its uncertainty. This paper analyzes the uncertainty of schema matching, and then proposes a series of similarity measures. To reduce the cost of execution, it proposes the type-based optimization method and schema matching pruning method of numeric data. Based on above analysis, this paper proposes the uncertain schema matching method. The experiments prove the effectiveness and efficiency of the new method.
Keywords Deep Web; integration service; similarity; schema matching; uncertainty
Background This research is partially supported by the grants from the Natural Science Foundation of China under grant No.60573091; The National High Technology Research and Development Program (863 Program) under grant No.2007AA01Z155; China National Basic Research and Development Program¡¯s Semantic Grid Project under grant 2003CB317000; Program for New Century Excellent Talents in University (NCET).
With an increasing number of Web databases, it is more and more difficult for users to get their desired information among these Web data sources manually. The purpose of those projects is to provide users an automatic approach to achieve and integrate the information in Deep Web. In recent years, more and more researchers have focused on some issues in it. In the past years, the authors have researched and developed a lot of techniques in the area of Deep Web integration service, and these works mainly focus on schema matching, query translation and database selection. This paper focuses on the uncertain schema matching method.