Mo Chen, Jian-Tao Sun, Hua-Jun Zeng, and Kwok-Yan Lam
2005
Keyphrases help Web users grasp the main topic(s) of a Web page. We present a practical system for automatic keyphrase extraction from Web pages. In this system, a regression model is first trained on a set of human-labeled documents and then used to extract keyphrases from new pages automatically. This paper makes three contributions. First, we investigate the structural information in a Web page for the keyphrase extraction task. Second, we use query log data associated with a Web page, collected by a search engine server, to aid keyphrase extraction. Third, we propose a method for evaluating the similarity of phrases.
In CIKM '05: Proceedings of the 14th ACM International Conference on Information and Knowledge Management
Publisher ACM
Copyright is held by the author/owner(s).
CIKM’05, October 31-November 5, 2005, Bremen, Germany.
ACM 1-59593-140-6/05/0010.
| Field | Value |
| --- | --- |
| Type | Proceedings |
| URL | http://doi.acm.org/10.1145/1099554.1099625 |
| Pages | 277–278 |
| ISBN | 1-59593-140-6 |
| Address | New York, NY, USA |