Sentiment classification on customer feedback data: noisy data, large feature vectors, and the role of linguistic analysis

We demonstrate that it is possible to perform automatic sentiment classification in the very noisy domain of customer feedback data. We show that by using large feature vectors in combination with feature reduction, we can train linear support vector machines that achieve high classification accuracy on data that present classification challenges even for a human annotator. We also show that, surprisingly, the addition of deep linguistic analysis features to a set of surface level word n-gram features contributes consistently to classification accuracy in this domain.

coling2004_sentiment.pdf
PDF file

In  Proceeding of COLING-04, the 20th International Conference on Computational Linguistics

Publisher  International Conference on Computational Linguistics
Copyright COLING 2004

Details

TypeInproceedings
URLhttp://www.coling.org/
Pages841–847
AddressGeneva, CH
> Publications > Sentiment classification on customer feedback data: noisy data, large feature vectors, and the role of linguistic analysis