Data mining the Web (Record no. 12890)
000 -LEADER | |
---|---|
fixed length control field | 05276cam a2200649Ma 4500 |
001 - CONTROL NUMBER | |
control field | on1317436663 |
003 - CONTROL NUMBER IDENTIFIER | |
control field | OCoLC |
005 - DATE AND TIME OF LATEST TRANSACTION | |
control field | 20240523125544.0 |
006 - FIXED-LENGTH DATA ELEMENTS--ADDITIONAL MATERIAL CHARACTERISTICS | |
fixed length control field | m o d |
007 - PHYSICAL DESCRIPTION FIXED FIELD--GENERAL INFORMATION | |
fixed length control field | cr ||||||||||| |
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION | |
fixed length control field | 060731s2007 njua ob 001 0 eng d |
010 ## - LIBRARY OF CONGRESS CONTROL NUMBER | |
Canceled/invalid LC control number | 2006025099 |
040 ## - CATALOGING SOURCE | |
Original cataloging agency | SFB |
Language of cataloging | eng |
Transcribing agency | SFB |
Modifying agency | OCLCF |
-- | OCLCQ |
-- | OCLCO |
-- | OCLCL |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
International Standard Book Number | 1280901039 |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
International Standard Book Number | 9781280901034 |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
International Standard Book Number | 9786610901036 |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
International Standard Book Number | 6610901031 |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
International Standard Book Number | 0470108096 |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
International Standard Book Number | 9780470108093 |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
International Standard Book Number | 0470108088 |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
International Standard Book Number | 9780470108086 |
035 ## - SYSTEM CONTROL NUMBER | |
System control number | (OCoLC)1317436663 |
050 #4 - LIBRARY OF CONGRESS CALL NUMBER | |
Classification number | QA76.9.D343 |
Item number | M38 2007 |
082 04 - DEWEY DECIMAL CLASSIFICATION NUMBER | |
Classification number | 005.74 |
049 ## - LOCAL HOLDINGS (OCLC) | |
Holding library | MAIN |
100 1# - MAIN ENTRY--PERSONAL NAME | |
Personal name | Markov, Zdravko, |
Dates associated with a name | 1956- |
245 10 - TITLE STATEMENT | |
Title | Data mining the Web |
Medium | [electronic resource] : |
Remainder of title | uncovering patterns in Web content, structure, and usage / |
Statement of responsibility, etc. | Zdravko Markov and Daniel T. Larose. |
260 ## - PUBLICATION, DISTRIBUTION, ETC. | |
Place of publication, distribution, etc. | Hoboken, N.J. : |
Name of publisher, distributor, etc. | Wiley-Interscience, |
Date of publication, distribution, etc. | c2007. |
300 ## - PHYSICAL DESCRIPTION | |
Extent | 1 online resource (236 p.). |
336 ## - CONTENT TYPE | |
Content type term | text |
Content type code | txt |
337 ## - MEDIA TYPE | |
Media type term | computer |
Media type code | c |
338 ## - CARRIER TYPE | |
Carrier type term | online resource |
Carrier type code | cr |
490 1# - SERIES STATEMENT | |
Series statement | Wiley series on methods and applications in data mining |
500 ## - GENERAL NOTE | |
General note | Description based upon print version of record. |
505 0# - FORMATTED CONTENTS NOTE | |
Formatted contents note | DATA MINING THE WEB; CONTENTS; PREFACE; ACKNOWLEDGMENTS; PART I WEB STRUCTURE MINING; 1 INFORMATION RETRIEVAL AND WEB SEARCH; Web Challenges; Web Search Engines; Topic Directories; Semantic Web; Crawling the Web; Web Basics; Web Crawlers; Indexing and Keyword Search; Document Representation; Implementation Considerations; Relevance Ranking; Advanced Text Search; Using the HTML Structure in Keyword Search; Evaluating Search Quality; Similarity Search; Cosine Similarity; Jaccard Similarity; Document Resemblance; References; Exercises; 2 HYPERLINK-BASED RANKING; Introduction |
505 8# - FORMATTED CONTENTS NOTE | |
Formatted contents note | Social Networks Analysis; PageRank; Authorities and Hubs; Link-Based Similarity Search; Enhanced Techniques for Page Ranking; References; Exercises; PART II WEB CONTENT MINING; 3 CLUSTERING; Introduction; Hierarchical Agglomerative Clustering; k-Means Clustering; Probability-Based Clustering; Finite Mixture Problem; Classification Problem; Clustering Problem; Collaborative Filtering (Recommender Systems); References; Exercises; 4 EVALUATING CLUSTERING; Approaches to Evaluating Clustering; Similarity-Based Criterion Functions; Probabilistic Criterion Functions |
505 8# - FORMATTED CONTENTS NOTE | |
Formatted contents note | MDL-Based Model and Feature Evaluation; Minimum Description Length Principle; MDL-Based Model Evaluation; Feature Selection; Classes-to-Clusters Evaluation; Precision, Recall, and F-Measure; Entropy; References; Exercises; 5 CLASSIFICATION; General Setting and Evaluation Techniques; Nearest-Neighbor Algorithm; Feature Selection; Naive Bayes Algorithm; Numerical Approaches; Relational Learning; References; Exercises; PART III WEB USAGE MINING; 6 INTRODUCTION TO WEB USAGE MINING; Definition of Web Usage Mining; Cross-Industry Standard Process for Data Mining; Clickstream Analysis |
505 8# - FORMATTED CONTENTS NOTE | |
Formatted contents note | Web Server Log Files; Remote Host Field; Date/Time Field; HTTP Request Field; Status Code Field; Transfer Volume (Bytes) Field; Common Log Format; Identification Field; Authuser Field; Extended Common Log Format; Referrer Field; User Agent Field; Example of a Web Log Record; Microsoft IIS Log Format; Auxiliary Information; References; Exercises; 7 PREPROCESSING FOR WEB USAGE MINING; Need for Preprocessing the Data; Data Cleaning and Filtering; Page Extension Exploration and Filtering; De-Spidering the Web Log File; User Identification; Session Identification; Path Completion |
505 8# - FORMATTED CONTENTS NOTE | |
Formatted contents note | Directories and the Basket Transformation; Further Data Preprocessing Steps; References; Exercises; 8 EXPLORATORY DATA ANALYSIS FOR WEB USAGE MINING; Introduction; Number of Visit Actions; Session Duration; Relationship between Visit Actions and Session Duration; Average Time per Page; Duration for Individual Pages; References; Exercises; 9 MODELING FOR WEB USAGE MINING: CLUSTERING, ASSOCIATION, AND CLASSIFICATION; Introduction; Modeling Methodology; Definition of Clustering; The BIRCH Clustering Algorithm; Affinity Analysis and the A Priori Algorithm |
500 ## - GENERAL NOTE | |
General note | Discretizing the Numerical Variables: Binning. |
520 ## - SUMMARY, ETC. | |
Summary, etc. | This book introduces the reader to methods of data mining on the web, including uncovering patterns in web content (classification, clustering, language processing), structure (graphs, hubs, metrics), and usage (modeling, sequence analysis, performance). |
546 ## - LANGUAGE NOTE | |
Language note | English. |
504 ## - BIBLIOGRAPHY, ETC. NOTE | |
Bibliography, etc. note | Includes bibliographical references and index. |
590 ## - LOCAL NOTE (RLIN) | |
Local note | John Wiley and Sons |
Provenance (VM) [OBSOLETE] | Wiley Online Library: Complete oBooks |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Data mining. |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Web databases. |
650 #6 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Exploration de données (Informatique) |
650 #6 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Bases de données sur le Web. |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Data mining |
Source of heading or term | fast |
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name entry element | Web databases |
Source of heading or term | fast |
700 1# - ADDED ENTRY--PERSONAL NAME | |
Personal name | Larose, Daniel T. |
758 ## - RESOURCE IDENTIFIER | |
Relationship information | has work: |
Label | Data mining the Web (Text) |
Real World Object URI | https://id.oclc.org/worldcat/entity/E39PCGmjGK3FvHrf8jGBghfrdP |
Relationship | https://id.oclc.org/worldcat/ontology/hasWork |
776 ## - ADDITIONAL PHYSICAL FORM ENTRY | |
International Standard Book Number | 0-471-66655-6 |
830 #0 - SERIES ADDED ENTRY--UNIFORM TITLE | |
Uniform title | Wiley series on methods and applications in data mining. |
856 40 - ELECTRONIC LOCATION AND ACCESS | |
Uniform Resource Identifier | https://onlinelibrary.wiley.com/doi/book/10.1002/0470108096 |
994 ## - | |
-- | 92 |
-- | INLUM |