Dialog problem overview

一 Datasets

1. Ubuntu Chat Corpus: http://daviduthus.org/UCC/
2. dialogue datasets (circa 2005)

二 Tasks

task1: next-utterance classification

the dual encoder model
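
A minimal sketch of how a dual encoder scores a (context, response) pair, under stated assumptions: the context and the response are each encoded into a vector, and the score is sigmoid(c^T M r). The averaging encoder, the random embeddings, the identity matrix M, and all function names below are illustrative stand-ins (a real model would use a trained RNN encoder and a learned M), so the printed probabilities carry no meaning beyond showing the data flow.

```python
import math
import random

random.seed(0)
DIM = 8  # embedding size, chosen arbitrarily for this illustration


def make_embeddings(vocab, dim=DIM):
    """Give each word a small random vector (stand-in for learned embeddings)."""
    return {w: [random.uniform(-0.5, 0.5) for _ in range(dim)] for w in vocab}


def encode(tokens, emb, dim=DIM):
    """Toy encoder: average the word vectors. A real dual encoder would use an RNN."""
    vecs = [emb[t] for t in tokens if t in emb]
    if not vecs:
        return [0.0] * dim
    return [sum(col) / len(vecs) for col in zip(*vecs)]


def score(context_vec, response_vec, M):
    """sigmoid(c^T M r): estimated probability that the response follows the context."""
    Mr = [sum(row[j] * response_vec[j] for j in range(len(response_vec))) for row in M]
    logit = sum(c * x for c, x in zip(context_vec, Mr))
    return 1.0 / (1.0 + math.exp(-logit))


vocab = "how do i install the driver try sudo apt-get nice weather today".split()
emb = make_embeddings(vocab)
# Identity matrix as an untrained placeholder for the learned matrix M.
M = [[1.0 if i == j else 0.0 for j in range(DIM)] for i in range(DIM)]

context = "how do i install the driver".split()
candidates = ["try sudo apt-get install".split(), "nice weather today".split()]
c = encode(context, emb)
probs = [score(c, encode(r, emb), M) for r in candidates]
print(probs)
```

For next-utterance classification, the candidate with the highest score would be chosen as the predicted next utterance.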

task2: large-corpus next-utterance retrieval
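
A rough sketch of the retrieval setting: every candidate utterance is scored against the context, the candidates are ranked, and the ranking is judged by whether the true next utterance lands in the top k (Recall@k). The Jaccard token-overlap scorer is only a placeholder for a trained model, and the function names and toy sentences are made up for illustration.

```python
def jaccard(a, b):
    """Token-set overlap between two utterances (placeholder for a learned scorer)."""
    sa, sb = set(a), set(b)
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


def rank_candidates(context, candidates, scorer=jaccard):
    """Return candidate indices sorted from best to worst score."""
    return sorted(
        range(len(candidates)),
        key=lambda i: scorer(context, candidates[i]),
        reverse=True,
    )


def recall_at_k(ranked_indices, true_index, k):
    """1.0 if the true response is among the top-k ranked candidates, else 0.0."""
    return 1.0 if true_index in ranked_indices[:k] else 0.0


context = "my wifi driver is not working".split()
candidates = [
    "the weather is nice".split(),
    "try reinstalling the wifi driver".split(),  # treated as the true next utterance
    "i like pizza".split(),
]
ranking = rank_candidates(context, candidates)
print(ranking, recall_at_k(ranking, true_index=1, k=1))
```

With a large corpus the candidate set is far bigger, so in practice the scoring step would need an index or precomputed response vectors rather than this linear scan.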

task3: natural language generation

hierarchical encoder-decoder architecture (Bengio, AAAI 2015)

三 Interesting topics 

(Interesting!) Exploring ethical and social challenges:

bias:

 bias in image classification, debiasing word embeddings (Bolukbasi et al. 2016), hate and offensive language detection (Davidson et al. 2017)

privacy

safety

value alignment

adversarial examples

四 Challenges

Challenge: how to evaluate?

evaluation for generative models:

perplexity

word error rate

word overlap metrics, BLEU

comparing responses with human responses
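
The automatic metrics listed above can be sketched in a few lines. This is a simplified illustration, not a reference implementation: perplexity is computed from per-token probabilities that a model is assumed to have assigned to the reference tokens, word error rate is word-level edit distance divided by reference length, and the word-overlap function is only the clipped unigram precision that BLEU builds on (full BLEU also combines higher-order n-grams and a brevity penalty). All function names are hypothetical, and non-empty inputs are assumed.

```python
import math
from collections import Counter


def perplexity(token_probs):
    """exp of the average negative log-probability assigned to each reference token."""
    return math.exp(-sum(math.log(p) for p in token_probs) / len(token_probs))


def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length."""
    prev = list(range(len(hypothesis) + 1))
    for i, ref_word in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hypothesis, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # delete a reference word
                            curr[j - 1] + 1,      # insert a hypothesis word
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1] / len(reference)


def clipped_unigram_precision(reference, hypothesis):
    """Share of hypothesis words found in the reference, with counts clipped as in BLEU."""
    hyp_counts = Counter(hypothesis)
    ref_counts = Counter(reference)
    overlap = sum(min(count, ref_counts[word]) for word, count in hyp_counts.items())
    return overlap / len(hypothesis)


ref = "the cat sat on the mat".split()
hyp = "the cat sit on mat".split()
print(perplexity([0.25, 0.25, 0.25, 0.25]))
print(word_error_rate(ref, hyp))
print(clipped_unigram_precision("the cat sat".split(), "the the the cat".split()))
```

Lower perplexity and word error rate are better, while higher overlap is better. None of these measures whether a response is appropriate in context, which is why comparison with human responses is listed alongside them.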

五 Adversarial examples

Original article: https://www.cnblogs.com/baiting/p/9012124.html