dialogue policy task-oriented dialog

ddpt-policy-sgd

This is a DDPT model (https://aclanthology.org/2022.coling-1.21/) trained on Schema-Guided Dialog

Refer to ConvLab-3 for model description and usage.

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

Framework versions