Ebook: Web Data Management: A Warehouse Approach
- Tags: Database Management, Information Systems and Communication Service, Information Storage and Retrieval, Multimedia Information Systems
- Series: Springer Professional Computing
- Year: 2004
- Publisher: Springer-Verlag New York
- Edition: 1
- Language: English
- pdf
The existence of huge data volume on the Web has fueled an unrelenting need to locate "right information at right time," as well as to effectively develop an integrated, comprehensive information source. This calls for tools for efficiently analyzing and managing web data—and for efficiently managing web information from the database perspective.
This comprehensive resource presents a data model called WHOM (WareHouse Object Model) to represent HTML and XML documents in the warehouse. The book defines a set of web-algebraic operators for building new web tables by extracting relevant data from the Web, as well as generating new tables from existing ones. Its "web-warehouse approach" incorporates modern and effective shared web data-management concepts, methods, and models.
Features & Benefits:
*Presents a simple and generic data model for representing metadata, structure, and content of Web documents and hyperlinks
*Addresses schema-related issues for both HTML and XML data, with their associated challenges of irregularity and heterogeneity
*Describes a web algebra for manipulating warehoused data
*Utilizes numerous examples to illustrate various concepts of web data management and to simplify all key issues for readers
*Highlights change management and knowledge discovery, two important applications of a web warehouse
With its accessible style and emphasis on practicality, the book delivers an excellent survey of all current principles for structured, web-based data management technologies. Database management systems developers, enterprise website developers, and applied R&D researchers will find the work an essential companion for new concepts, development strategies, and application models.
Key Topics:
>> Node and link objects
>> Comparison predicates
>> Connectivities
>> Coupling-query formulation
>> Web schema
>> WHOM (WareHouse Object Model)
>> Schema generation and pruning
>> Data visualization
>> Web deltas & web bags
>> Knowledge-discovery applications
-- Databases / Information Systems
-- Beginning / Intermediate
The existence of huge data volume on the Web has fueled an unrelenting need to locate "right information at right time," as well as to effectively develop an integrated, comprehensive information source. This calls for tools for efficiently analyzing and managing web data—and for efficiently managing web information from the database perspective.
This comprehensive resource presents a data model called WHOM (WareHouse Object Model) to represent HTML and XML documents in the warehouse. The book defines a set of web-algebraic operators for building new web tables by extracting relevant data from the Web, as well as generating new tables from existing ones. Its "web-warehouse approach" incorporates modern and effective shared web data-management concepts, methods, and models.
Features & Benefits:
*Presents a simple and generic data model for representing metadata, structure, and content of Web documents and hyperlinks
*Addresses schema-related issues for both HTML and XML data, with their associated challenges of irregularity and heterogeneity
*Describes a web algebra for manipulating warehoused data
*Utilizes numerous examples to illustrate various concepts of web data management and to simplify all key issues for readers
*Highlights change management and knowledge discovery, two important applications of a web warehouse
With its accessible style and emphasis on practicality, the book delivers an excellent survey of all current principles for structured, web-based data management technologies. Database management systems developers, enterprise website developers, and applied R&D researchers will find the work an essential companion for new concepts, development strategies, and application models.
Key Topics:
>> Node and link objects
>> Comparison predicates
>> Connectivities
>> Coupling-query formulation
>> Web schema
>> WHOM (WareHouse Object Model)
>> Schema generation and pruning
>> Data visualization
>> Web deltas & web bags
>> Knowledge-discovery applications
-- Databases / Information Systems
-- Beginning / Intermediate
The existence of huge data volume on the Web has fueled an unrelenting need to locate "right information at right time," as well as to effectively develop an integrated, comprehensive information source. This calls for tools for efficiently analyzing and managing web data—and for efficiently managing web information from the database perspective.
This comprehensive resource presents a data model called WHOM (WareHouse Object Model) to represent HTML and XML documents in the warehouse. The book defines a set of web-algebraic operators for building new web tables by extracting relevant data from the Web, as well as generating new tables from existing ones. Its "web-warehouse approach" incorporates modern and effective shared web data-management concepts, methods, and models.
Features & Benefits:
*Presents a simple and generic data model for representing metadata, structure, and content of Web documents and hyperlinks
*Addresses schema-related issues for both HTML and XML data, with their associated challenges of irregularity and heterogeneity
*Describes a web algebra for manipulating warehoused data
*Utilizes numerous examples to illustrate various concepts of web data management and to simplify all key issues for readers
*Highlights change management and knowledge discovery, two important applications of a web warehouse
With its accessible style and emphasis on practicality, the book delivers an excellent survey of all current principles for structured, web-based data management technologies. Database management systems developers, enterprise website developers, and applied R&D researchers will find the work an essential companion for new concepts, development strategies, and application models.
Key Topics:
>> Node and link objects
>> Comparison predicates
>> Connectivities
>> Coupling-query formulation
>> Web schema
>> WHOM (WareHouse Object Model)
>> Schema generation and pruning
>> Data visualization
>> Web deltas & web bags
>> Knowledge-discovery applications
-- Databases / Information Systems
-- Beginning / Intermediate
Content:
Front Matter....Pages i-xxi
Introduction....Pages 1-16
A Survey of Web Data Management Systems....Pages 17-63
Node and Link Objects....Pages 65-92
Predicates on Node and Link Objects....Pages 93-126
Imposing Constraints on Hyperlink Structures....Pages 127-143
Query Mechanism for the Web....Pages 145-206
Schemas for Warehouse Data....Pages 207-250
WHOM-Algebra....Pages 251-351
Web Data Visualization....Pages 353-366
Detecting and Representing Relevant Web Deltas....Pages 367-387
Knowledge Discovery Using Web Bags....Pages 389-416
The Road Ahead....Pages 417-448
Back Matter....Pages 449-465
The existence of huge data volume on the Web has fueled an unrelenting need to locate "right information at right time," as well as to effectively develop an integrated, comprehensive information source. This calls for tools for efficiently analyzing and managing web data—and for efficiently managing web information from the database perspective.
This comprehensive resource presents a data model called WHOM (WareHouse Object Model) to represent HTML and XML documents in the warehouse. The book defines a set of web-algebraic operators for building new web tables by extracting relevant data from the Web, as well as generating new tables from existing ones. Its "web-warehouse approach" incorporates modern and effective shared web data-management concepts, methods, and models.
Features & Benefits:
*Presents a simple and generic data model for representing metadata, structure, and content of Web documents and hyperlinks
*Addresses schema-related issues for both HTML and XML data, with their associated challenges of irregularity and heterogeneity
*Describes a web algebra for manipulating warehoused data
*Utilizes numerous examples to illustrate various concepts of web data management and to simplify all key issues for readers
*Highlights change management and knowledge discovery, two important applications of a web warehouse
With its accessible style and emphasis on practicality, the book delivers an excellent survey of all current principles for structured, web-based data management technologies. Database management systems developers, enterprise website developers, and applied R&D researchers will find the work an essential companion for new concepts, development strategies, and application models.
Key Topics:
>> Node and link objects
>> Comparison predicates
>> Connectivities
>> Coupling-query formulation
>> Web schema
>> WHOM (WareHouse Object Model)
>> Schema generation and pruning
>> Data visualization
>> Web deltas & web bags
>> Knowledge-discovery applications
-- Databases / Information Systems
-- Beginning / Intermediate
Content:
Front Matter....Pages i-xxi
Introduction....Pages 1-16
A Survey of Web Data Management Systems....Pages 17-63
Node and Link Objects....Pages 65-92
Predicates on Node and Link Objects....Pages 93-126
Imposing Constraints on Hyperlink Structures....Pages 127-143
Query Mechanism for the Web....Pages 145-206
Schemas for Warehouse Data....Pages 207-250
WHOM-Algebra....Pages 251-351
Web Data Visualization....Pages 353-366
Detecting and Representing Relevant Web Deltas....Pages 367-387
Knowledge Discovery Using Web Bags....Pages 389-416
The Road Ahead....Pages 417-448
Back Matter....Pages 449-465
....