Modelling Anchor Text Retrieval in Book Search based on Back-of-Book Index

  • Gabriella Kazai

Published by Association for Computing Machinery, Inc.

This paper proposes a probabilistic logic abstraction for modelling tf-boosting approaches to page search in book retrieval. The back-of-book index can be viewed as a list of anchors pointing to pages. An initial strategy applies hypertext-based modelling, which turns out to be naive, since propagating anchor text from the book index does not deliver the desired tf-boosting effect. To achieve a tf-boosting effect from the book index, we propose a novel anchor-text model.
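
The idea of treating index entries as anchors can be illustrated with a toy sketch. This is not the paper's probabilistic logic model: the function names, the sample data, and the `boost` parameter are all hypothetical, and the sketch simply assumes that propagating an index entry to a page adds that entry's terms to the page's term-frequency (tf) counts.

```python
from collections import Counter


def term_frequencies(text):
    """Count whitespace-separated, lower-cased terms in a page's text."""
    return Counter(text.lower().split())


def propagate_index(pages, book_index, boost=1):
    """Add index-entry terms to the tf counts of the pages they point to.

    pages:      dict mapping page number -> page text
    book_index: dict mapping index entry -> list of page numbers
    boost:      hypothetical weight; how many occurrences each entry adds
    """
    tf = {page: term_frequencies(text) for page, text in pages.items()}
    for entry, page_numbers in book_index.items():
        for page in page_numbers:
            if page not in tf:
                continue  # the entry points to a page we do not have
            for term in entry.lower().split():
                tf[page][term] += boost
    return tf


# Hypothetical sample data.
pages = {
    1: "retrieval models for text",
    2: "anchor text in web retrieval",
}
book_index = {
    "anchor text": [2],
    "retrieval": [1, 2],
}

naive = propagate_index(pages, book_index)             # one occurrence per entry
boosted = propagate_index(pages, book_index, boost=3)  # stronger, assumed weight
```

In this sketch, plain propagation raises each indexed term's count on a page by only one, which is one way to picture why a direct hypertext-style treatment may give little boost; the `boost` weight is merely a stand-in for whatever mechanism a dedicated anchor-text model would use.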