On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems

Pei-Hao Su | Milica Gašić | Nikola Mrkšić | Lina M. Rojas-Barahona | Stefan Ultes | David Vandyke | Tsung-Hsien Wen | Steve Young

Paper Details:

Month: August
Year: 2016
Location: Berlin, Germany
Venue: ACL
