How to find better index terms through citations

  • Anna Ritchie
  • Simone Teufel
  • Stephen Robertson

Proceedings of the Workshop 'Can Computational Linguistics Improve Information Retrieval?', at ACL/COLING-2006

We consider how information from the textual context of citations in scientific papers could improve indexing of the cited papers. We first present examples showing that this context should, in principle, provide both better and new index terms. We then discuss linguistic phenomena around citations and the kinds of processing that would improve automatic determination of the right context. We present a case study of the effect of combining a paper's existing index terms with additional terms taken from the papers in our corpus that cite it. Finally, we discuss the need for experiments to validate our claim in practice.
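
To make the idea of combining index terms concrete, the following is a minimal sketch and not the method used in the paper. It assumes that a citation appears as a single whitespace-delimited marker string, that the "context" is a fixed window of words on either side of that marker, and that index terms are plain lowercased tokens. The function names (`tokenize`, `citation_context`, `combined_index_terms`), the marker `[Smith99]`, and the example texts are all hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def citation_context(citing_text, marker, window=5):
    """Collect tokens within `window` words of each occurrence of `marker`.

    Assumption: the citation marker is one whitespace-delimited word,
    e.g. "[Smith99]". A fixed window is a crude stand-in for the
    linguistically informed context detection discussed in the abstract.
    """
    words = citing_text.split()
    context = []
    for i, word in enumerate(words):
        if marker in word:
            left = words[max(0, i - window):i]
            right = words[i + 1:i + 1 + window]
            context.extend(tokenize(" ".join(left + right)))
    return context


def combined_index_terms(own_text, citing_texts, marker, window=5):
    """Merge a paper's own terms with terms from its citation contexts."""
    terms = Counter(tokenize(own_text))
    for citing_text in citing_texts:
        terms.update(citation_context(citing_text, marker, window))
    return terms


# Hypothetical example: the cited paper never uses the word "parser",
# but a citing paper describes it that way.
own = "a statistical approach to parsing"
citing = ["We use the probabilistic parser of [Smith99] for syntactic analysis"]
terms = combined_index_terms(own, citing, "[Smith99]", window=3)
```

In this toy example the combined term set gains "parser" and "probabilistic", neither of which occurs in the cited paper's own text, which is the kind of new index term the abstract has in mind. A fixed window is the simplest possible choice of context; it will both miss relevant text and pick up irrelevant text, which is why the choice of context is treated as an open question.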