Author2Vec: Learning Author Representations by Combining Content and Link Information

  • Ganesh J ,
  • Soumyajit Ganguly ,
  • ,
  • Vasudeva Varma ,
  • Vikram Pudi

The 25th International World Wide Web Conference (WWW 2016) |

Published by ACM - Association for Computing Machinery

In this paper, we consider the problem of learning representations for authors from bibliographic co-authorship networks. Existing methods for deep learning on graphs, such as DeepWalk, suffer from link sparsity problem as they focus on modeling the link information only. We hypothesize that capturing both the content and link information in a unified way will help mitigate the sparsity problem. To this end, we present a novel model `Author2Vec’ 1, which learns lowdimensional author representations such that authors who write similar content and share similar network structure are closer in vector space. Such embeddings are useful in a variety of applications such as link prediction, node classification, recommendation and visualization. The author embeddings we learn are empirically shown to outperform DeepWalk by 2.35% and 0.83% for link prediction and clustering task respectively.