|
|
Bimonthly Since 1986 |
ISSN 1004-9037
|
|
|
|
|
Publication Details |
Edited by: Editorial Board of Journal of Data Acquisition and Processing
P.O. Box 2704, Beijing 100190, P.R. China
Sponsored by: Institute of Computing Technology, CAS & China Computer Federation
Undertaken by: Institute of Computing Technology, CAS
Published by: SCIENCE PRESS, BEIJING, CHINA
Distributed by:
China: All Local Post Offices
|
|
|
|
|
|
|
|
|
|
Abstract
The rapid growth of the World Wide Web (www) is demanding for an automated assistance for Web page classification and categorization. In most existing Web page classification tasks, Web pages are classified into topical categories based on their content regardless of the possible relationships among them. In this paper, a comprehensive survey on classification of Web pages is presented. The features for creating tag information, classifiers and datasets used for experimentation are also discussed. It also gives comparative analysis of all Web page classification techniques. The challenges/Issues involved in developing Web page classification are also discussed. This would help researchers to take up new work on Web page classification and address most of the important challenges/issues.
Keyword
Web page Classification; Web pages; Web content mining; Text mining.
PDF Download (click here)
|
|
|
|
|