Volume 3 Number 6 (Dec. 2013)
Home > Archive > 2013 > Volume 3 Number 6 (Dec. 2013) >
IJEEEE 2013 Vol.3(6): 441-445 ISSN: 2010-3654
DOI: 10.7763/IJEEEE.2013.V3.275

Clustering Web Pages Considering the Position of Each Word and the Search Term

Ryutaro Akiyama, Katsutoshi Kanamori, and Hayato Ohwada
Abstract— Users can easily find the pages they are seeking by clustering the web pages of search results obtained using a search engine. The vector space method is often used to cluster web pages. However, the method that has been conventionally used has low clustering accuracy and high computational cost. In this study, we propose a method to solve these problems. We assume that the words that appear near the search term have a high degree of importance. We then try to solve the problems by considering the distance to the search term in the text. We conducted verification experiments in Japan for Japanese search terms. The results confirmed that the proposed method considering the distance to the search term in the text has higher clustering accuracy and lower computational cost than the conventional method.

Index Terms— Information retrieval, search engine, web clustering, web mining.

Ryutaro Akiyama was with the Department of Industrial Administration, Faculty of Science and Technology, Tokyo University of Science, Japan. He is now with the Department of Industrial and Management Systems Engineering, Creative Science and Engineering, Graduate School, Waseda University, Japan (e-mail: r-akiyama@akane.waseda.jp).
Katsutoshi Kanamori and Hayato Ohwada are with the Department of Industrial Administration, Faculty of Science and Technology, Tokyo University of Science, Japan (e-mail: katsu@rs.tus.ac.jp, ohwada@rs.tus.ac.jp).

Cite: Ryutaro Akiyama, Katsutoshi Kanamori, and Hayato Ohwada, " Clustering Web Pages Considering the Position of Each Word and the Search Term," International Journal of e-Education, e-Business, e-Management and e-Learning vol. 3, no. 6, pp. 441-445, 2013.

General Information

ISSN: 2010-3654 (Online)
Frequency: Quarterly (Since 2015)
Editor-in-Chief: Prof. Kuan-Chou Chen
Executive Editor: Ms. Nancy Lau
Abstracting/ Indexing: EBSCO, Google Scholar, Electronic Journals Library, QUALIS, ProQuest, EI (INSPEC, IET)
E-mail: ijeeee@iap.org
  • May 14, 2019 News!

    Vol.7, No.4-Vol.8, No.2 have been indexed by EI (Inspec).   [Click]

  • May 21, 2019 News!

    The papers published in Vol.9, No.2 have all received dois from Crossref.

  • May 09, 2019 News!

    IJEEEE Vol. 9, No. 2 is available online!    [Click]

  • Oct 08, 2018 News!

    The papers published in Vol.9, No.1 have all received dois from Crossref.

  • Aug 06, 2018 News!

    Vol.7, No.1-No.3 have been indexed by EI (Inspec).   [Click]

  • Read more>>