Springer Computing & Information Science

Web Data Management: A Warehouse Approach

by

Sourav Bhowmick, Sanjay Madria, and Wee Keong Ng

April 2003 * 472 pages * $59.95

    The existence of heterogeneous, autonomous Web sites containing related information has given rise to the problem of retrieving data from these Web sources effectively to provide a comprehensive and integrated source of relevant information. In addition, e-commerce and the increasing availability of commercial data on the Web have made it necessary to analyze and manipulate this data to support corporate decision making.

    This unique book addresses the problem of efficient management of Web information by utilizing the concept of a web warehouse: a repository of web views that can be used. A web warehouse called WHOWEDA (Warehouse of Web Data) is created for managing and manipulating heterogeneous, semi-structured Web data. The focus is on offering a complete and comprehensive treatment of the issues involved in the creation and management of warehoused data.

Topics & features:

- Describes a novel technique for generating the schema of hyperlinked web data and addresses related issues for both HTML and XML data
- Highlights two important applications of the web warehouse: change management and knowledge discovery
- Identifies several open research problems that readers can pursue in the context of the web warehouse
- Provides a set of web algebraic operators for manipulating semi-structured warehoused data
- Includes clear, well-developed diagrams that simplify learning and applying core concepts and techniques

    Employing an accessible approach, Web Data Management provides a detailed presentation of relevant concepts, models, and methods. The book is an authoritative and comprehensive survey and resource for developers of database management systems and enterprise websites.

                 www.springer-ny.com                                                                       1-800-SPRINGER

Book Contents

 

Chapter 1: Introduction

- Motivation
- Architecture and Functionalities

Chapter 2: A Survey of Web Data Management Systems

- Web Query Systems
- Web Information Integration Systems
- Web Data Restructuring
- Semi-structured Data
- XML Query Languages
- XML Data Warehouses
- Summary

Chapter 3: Node and Link Objects

- Introduction
- Representing Metadata of Web Documents and Hyperlinks
- Metadata Associated with HTML and XML Documents
- Representing Structure and Content of Web Documents
- Representing Structure and Content of Hyperlinks
- Node and Link Objects
- Node and Link Structure Trees
- Recent Approaches in Modeling Web Data
- Summary

Chapter 4: Predicate on Node and Link Objects

- Introduction
- Components of Comparison-free Predicate
- Comparison Predicates
- Summary

Chapter 5: Imposing Constraints on Hyperlink Structure

- Introduction
- Components of Connectivities
- Types of Connectivities
- Transformation of Complex Connectivities
- Conformity Conditions
- Summary

Chapter 6: Query Mechanism for the Web

- Introduction
- Definition of Coupling Query
- Types of Coupling Query
- Examples of Coupling Queries
- Valid Canonical Query Generation
- Formulation of Coupling Queries
- Coupling Query Results
- Computability of Valid Coupling Queries
- Recent Approaches for Querying the Web
- Summary

Chapter 7: Schemas for Warehouse Data

- Recent Approaches for Modeling Schema for Web Data
- Web Schema
- Features of Web Schema
- Importance of Web Schema in a Web Warehouse
- Generation of Simple Web Schema Set from Coupling Query
- Web Schema Generation in Local Operations
- Summary

Chapter 8: WHOM-Algebra

- Types of Manipulation
- Global Web Coupling
- Web Select
- Web Project
- Web Distinct
- Web Join
- Web Correlate
- Web Differences
- Web Union
- Summary

Chapter 9: Applications of Web Warehouse

- Detection and Representation of Relevant Web Deltas
- Knowledge Discovery and Web Mining in the Web Warehouse

Chapter 10: Concluding Remarks

- Summary
- Future Research

References and Index