Improvement in TF-IDF scheme for Web pages based on the contents of their hyperlinked neighboring pages

Hyperlink
DOI: 10.1002/scj.20189 Publication Date: 2005-10-11T18:51:28Z
ABSTRACT
The TF-IDF scheme is widely used to characterize documents in an information retrieval (IR) system based on the vector space model. However, for having a hyperlink structure such as Web pages, page contents can be characterized more accurately by using of hyperlinked neighboring pages. Therefore, this paper, we propose several techniques pages improve and then verify effectiveness our techniques. © 2005 Wiley Periodicals, Inc. Syst Comp Jpn, 36(14): 56–68, 2005; Published online InterScience (www.interscience.wiley.com). DOI 10.1002/scj.20189
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (30)
CITATIONS (7)