Web Data Management: A Warehouse Approach
by
Sourav Bhowmick, Sanjay Madria, and Wee Keong Ng
April 2003 * 472 pages * $59.95
Existence of heterogeneous autonomous Websites containing related information has given rise to the problem of retrieving the data from these Web sources effectively to provide a comprehensive and integrated source of relevant information. Also, e-commerce and the increasing ability of commercial data on the Web have made it necessary to analyze and manipulate data to support corporate decision making.
This unique book addresses the problem of efficient management of Web information by utilizing the concept of a web warehouse: a repository of web views that can be used. A web warehouse called WHOWEDA( Warehouse of Web Data ) is created for managing and manipulating heterogeneous, semi-structured Web data. The focus is on offering a complete and comprehensive treatment to the issues involved in the creation and management of warehoused data.
Topics & features:
Describes a novel technique for generating the schema of hyperlinked web data and addressing related issues for both HTML and XML data . |
Highlights two important applications of web warehouse: change management and knowledge discovery | |
Identifies several open research problems that readers can pursue in the context of the web warehouse | |
Provides a set of web algebraic operators for manipulating semi-structured warehoused data | |
Clear, well-developed diagrams simplify learning and applying core concepts and techniques |
Employing an accessible approach. Web Data Management provides a detailed presentation of relevant concepts, models and methods. The book is an authoritative and comprehensive survey and resource for database management systems developers and enterprise website developers.
www.springer-ny.com 1-800-SPRINGER
Book Contents
Chapter 1: Introduction
Motivation |
|
Architecture and Functionalities |
Chapter 2: A Survey of Web Data Management Systems
Web Query Systems |
|
Web Information Integration Systems |
|
Web Data Restructuring |
|
Semi-structured Data |
|
XML Query Languages |
|
XML Data Warehouses |
|
Summary |
Chapter 3: Node and Link Objects
Introduction |
|
Representing Metadata of Web Documents and Hyperlinks |
|
Metadata Associated with HTML and XML Documents |
|
Representing Structure and Content of Web Documents |
|
Representing Structure and Content of Hyperlinks |
|
Node and Link Objects |
|
Node and Link Structure Trees |
|
Recent Approaches in Modeling Web Data |
|
Summary |
Chapter 4: Predicate on Node and Link Objects
Introduction |
|
Components of Comparison-free Predicate |
|
Comparison Predicates |
|
Summary |
Chapter 5: Imposing Constraints on Hyperlink Structure
Introduction |
|
Components of Connectivities |
|
Types of Connectivities |
|
Transformation of Complex Connectivities |
|
Conformity Conditions |
|
Summary |
Chapter 6: Query Mechanism for the Web
Introduction |
|
Definition of Coupling Query |
|
Types of Coupling Query |
|
Examples of Coupling Queries |
|
Valid Canonical Query Generation |
|
Formulation of Coupling Queries |
|
Coupling Query Results |
|
Computability of Valid Coupling Queries |
|
Recent Approaches for Querying the Web |
|
Summary |
Chapter 7: Schemas for Warehouse Data
Recent Approaches for Modeling Schema for Web Data |
|
Web Schema |
|
Features of Web Schema |
|
Importance of Web Schema in a Web Warehouse |
|
Generation of Simple Web Schema Set from Coupling Query |
|
Web Schema Generation in Local Operations |
|
Summary |
Chapter 8: WHOM-Algebra
Types of Manipulation |
|
Global Web Coupling |
|
Web Select |
|
Web Project |
|
Web Distinct |
|
Web Join |
|
Web Correlate |
|
Web Differences |
|
Web Union |
|
Summary |
Chapter 9: Applications of Web Warehouse
Detection and Representation of Relevant Web Deltas |
|
Knowledge Discovery and Web Mining in the Web Warehouse |
Chapter 10: Concluding Remarks
Summary |
|
Future Research |
References and Index