Towards a Human-like Open-Domain Chatbot
| dc.contributor.author | Adiwardana, Daniel | |
| dc.contributor.author | Luong, Minh-Thang | |
| dc.contributor.author | So, David R. | |
| dc.contributor.author | Hall, Jamie | |
| dc.contributor.author | Fiedel, Noah | |
| dc.contributor.author | Thoppilan, Romal | |
| dc.contributor.author | Yang, Zi | |
| dc.contributor.author | Kulshreshtha, Apoorv | |
| dc.contributor.author | Nemade, Gaurav | |
| dc.contributor.author | Lu, Yifeng | |
| dc.contributor.author | Le, Quoc V. | |
| dc.date.accessioned | 2025-06-02T13:34:32Z | |
| dc.date.available | 2025-06-02T13:34:32Z | |
| dc.date.issued | 2020-01-27 | |
| dc.description | 2.6B 파라미터 Meena 모델을 제안하며, SSA(Sensibleness and Specificity Average)라는 새로운 평가 지표를 통해 다중턴 대화 성능을 평가합니다. 최고 성능 모델은 SSA 79%를 기록하며, 인간 수준(86%)에 근접하는 가능성을 제시합니다 ©2020 Google Research | |
| dc.description.abstract | We present Meena, a multi-turn open-domain chatbot trained end-to-end on data mined and filtered from public domain social media conversations. This 2.6B parameter neural network is simply trained to minimize perplexity of the next token. We also propose a human evaluation metric called Sensibleness and Specificity Average (SSA), which captures key elements of a human-like multi-turn conversation. Our experiments show strong correlation between perplexity and SSA. The fact that the best perplexity end-to-end trained Meena scores high on SSA (72% on multi-turn evaluation) suggests that a human-level SSA of 86% is potentially within reach if we can better optimize perplexity. Additionally, the full version of Meena (with a filtering mechanism and tuned decoding) scores 79% SSA, 23% higher in absolute SSA than the existing chatbots we evaluated. | |
| dc.description.sponsorship | Google Research (Brain Team) | |
| dc.identifier.uri | https://arxiv.org/abs/2001.09977 | |
| dc.identifier.uri | http://data.inu.ac.kr/handle/123456789/1961 | |
| dc.language.iso | en_US | |
| dc.publisher | arXiv | |
| dc.subject | Meena | |
| dc.subject | Open-Domain Chatbot | |
| dc.subject | SSA | |
| dc.subject | Multi-turn Conversation | |
| dc.subject | Dialogue Systems | |
| dc.title | Towards a Human-like Open-Domain Chatbot | |
| dc.type | Article |