Intra-sentence Punctuation Insertion in Natural Language Generation

Zhu Zhang, Michael Gamon, Simon Corston-Oliver, and Eric Ringger

Abstract

We describe a punctuation insertion model used in the sentence realization module of a natural language generation system for English and German. The model is based on a decision tree classifier that uses linguistically sophisticated features. The classifier outperforms a word n-gram model trained on the same data.

Details

Publication type: TechReport
Number: MSR-TR-2002-58
Pages: 6
Institution: Microsoft Research