Reinforcement Learning for Dialog Management using Least-Squares Policy Iteration and Fast Feature Selection

Lihong Li, Jason D. Williams, and Suhrid Balakrishnan

Details

Publication typeInproceedings
Published inProc Interspeech, Brighton, United Kingdom
> Publications > Reinforcement Learning for Dialog Management using Least-Squares Policy Iteration and Fast Feature Selection