NLU Meghalaya Library

Online Public Access Catalogue (OPAC)

Data mining the Web (Record no. 12890)

MARC details
000 -LEADER
fixed length control field 05276cam a2200649Ma 4500
001 - CONTROL NUMBER
control field on1317436663
003 - CONTROL NUMBER IDENTIFIER
control field OCoLC
005 - DATE AND TIME OF LATEST TRANSACTION
control field 20240523125544.0
006 - FIXED-LENGTH DATA ELEMENTS--ADDITIONAL MATERIAL CHARACTERISTICS
fixed length control field m o d
007 - PHYSICAL DESCRIPTION FIXED FIELD--GENERAL INFORMATION
fixed length control field cr |||||||||||
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field 060731s2007 njua ob 001 0 eng d
010 ## - LIBRARY OF CONGRESS CONTROL NUMBER
Canceled/invalid LC control number 2006025099
040 ## - CATALOGING SOURCE
Original cataloging agency SFB
Language of cataloging eng
Transcribing agency SFB
Modifying agency OCLCF
-- OCLCQ
-- OCLCO
-- OCLCL
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 1280901039
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9781280901034
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9786610901036
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 6610901031
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 0470108096
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9780470108093
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 0470108088
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9780470108086
035 ## - SYSTEM CONTROL NUMBER
System control number (OCoLC)1317436663
050 #4 - LIBRARY OF CONGRESS CALL NUMBER
Classification number QA76.9.D343
Item number M38 2007
082 04 - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number 005.74
049 ## - LOCAL HOLDINGS (OCLC)
Holding library MAIN
100 1# - MAIN ENTRY--PERSONAL NAME
Personal name Markov, Zdravko,
Dates associated with a name 1956-
245 10 - TITLE STATEMENT
Title Data mining the Web
Medium [electronic resource] :
Remainder of title uncovering patterns in Web content, structure, and usage /
Statement of responsibility, etc. Zdravko Markov and Daniel T. Larose.
260 ## - PUBLICATION, DISTRIBUTION, ETC.
Place of publication, distribution, etc. Hoboken, N.J. :
Name of publisher, distributor, etc. Wiley-Interscience,
Date of publication, distribution, etc. c2007.
300 ## - PHYSICAL DESCRIPTION
Extent 1 online resource (236 p.).
336 ## - CONTENT TYPE
Content type term text
Content type code txt
337 ## - MEDIA TYPE
Media type term computer
Media type code c
338 ## - CARRIER TYPE
Carrier type term online resource
Carrier type code cr
490 1# - SERIES STATEMENT
Series statement Wiley series on methods and applications in data mining
500 ## - GENERAL NOTE
General note Description based upon print version of record.
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note DATA MINING THE WEB; CONTENTS; PREFACE; ACKNOWLEDGMENTS; PART I WEB STRUCTURE MINING; 1 INFORMATION RETRIEVAL AND WEB SEARCH; Web Challenges; Web Search Engines; Topic Directories; Semantic Web; Crawling the Web; Web Basics; Web Crawlers; Indexing and Keyword Search; Document Representation; Implementation Considerations; Relevance Ranking; Advanced Text Search; Using the HTML Structure in Keyword Search; Evaluating Search Quality; Similarity Search; Cosine Similarity; Jaccard Similarity; Document Resemblance; References; Exercises; 2 HYPERLINK-BASED RANKING; Introduction
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note Social Networks AnalysisPageRank; Authorities and Hubs; Link-Based Similarity Search; Enhanced Techniques for Page Ranking; References; Exercises; PART II WEB CONTENT MINING; 3 CLUSTERING; Introduction; Hierarchical Agglomerative Clustering; k-Means Clustering; Probabilty-Based Clustering; Finite Mixture Problem; Classification Problem; Clustering Problem; Collaborative Filtering (Recommender Systems); References; Exercises; 4 EVALUATING CLUSTERING; Approaches to Evaluating Clustering; Similarity-Based Criterion Functions; Probabilistic Criterion Functions
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note MDL-Based Model and Feature EvaluationMinimum Description Length Principle; MDL-Based Model Evaluation; Feature Selection; Classes-to-Clusters Evaluation; Precision, Recall, and F-Measure; Entropy; References; Exercises; 5 CLASSIFICATION; General Setting and Evaluation Techniques; Nearest-Neighbor Algorithm; Feature Selection; Naive Bayes Algorithm; Numerical Approaches; Relational Learning; References; Exercises; PART III WEB USAGE MINING; 6 INTRODUCTION TO WEB USAGE MINING; Definition of Web Usage Mining; Cross-Industry Standard Process for Data Mining; Clickstream Analysis
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note Web Server Log FilesRemote Host Field; Date/Time Field; HTTP Request Field; Status Code Field; Transfer Volume (Bytes) Field; Common Log Format; Identification Field; Authuser Field; Extended Common Log Format; Referrer Field; User Agent Field; Example of a Web Log Record; Microsoft IIS Log Format; Auxiliary Information; References; Exercises; 7 PREPROCESSING FOR WEB USAGE MINING; Need for Preprocessing the Data; Data Cleaning and Filtering; Page Extension Exploration and Filtering; De-Spidering the Web Log File; User Identification; Session Identification; Path Completion
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note Directories and the Basket TransformationFurther Data Preprocessing Steps; References; Exercises; 8 EXPLORATORY DATA ANALYSIS FOR WEB USAGE MINING; Introduction; Number of Visit Actions; Session Duration; Relationship between Visit Actions and Session Duration; Average Time per Page; Duration for Individual Pages; References; Exercises; 9 MODELING FOR WEB USAGE MINING: CLUSTERING, ASSOCIATION, AND CLASSIFICATION; Introduction; Modeling Methodology; Definition of Clustering; The BIRCH Clustering Algorithm; Affinity Analysis and the A Priori Algorithm
500 ## - GENERAL NOTE
General note Discretizing the Numerical Variables: Binning.
520 ## - SUMMARY, ETC.
Summary, etc. This book introduces the reader to methods of data mining on the web, including uncovering patterns in web content (classification, clustering, language processing), structure (graphs, hubs, metrics), and usage (modeling, sequence analysis, performance).
546 ## - LANGUAGE NOTE
Language note English.
504 ## - BIBLIOGRAPHY, ETC. NOTE
Bibliography, etc. note Includes bibliographical references and index.
590 ## - LOCAL NOTE (RLIN)
Local note John Wiley and Sons
Provenance (VM) [OBSOLETE] Wiley Online Library: Complete oBooks
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element Data mining.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element Web databases.
650 #6 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element Exploration de donn�ees (Informatique)
650 #6 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element Bases de donn�ees sur le Web.
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element Data mining
Source of heading or term fast
650 #7 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element Web databases
Source of heading or term fast
700 1# - ADDED ENTRY--PERSONAL NAME
Personal name Larose, Daniel T.
758 ## - RESOURCE IDENTIFIER
Relationship information has work:
Label Data mining the Web (Text)
Real World Object URI https://id.oclc.org/worldcat/entity/E39PCGmjGK3FvHrf8jGBghfrdP
Relationship https://id.oclc.org/worldcat/ontology/hasWork
776 ## - ADDITIONAL PHYSICAL FORM ENTRY
International Standard Book Number 0-471-66655-6
830 #0 - SERIES ADDED ENTRY--UNIFORM TITLE
Uniform title Wiley series on methods and applications in data mining.
856 40 - ELECTRONIC LOCATION AND ACCESS
Uniform Resource Identifier <a href="https://onlinelibrary.wiley.com/doi/book/10.1002/0470108096">https://onlinelibrary.wiley.com/doi/book/10.1002/0470108096</a>
994 ## -
-- 92
-- INLUM

No items available.