mT5: A Massively Multilingual Pre-trained Text-to-Text Transformer

dc.contributor.authorXue, Linting
dc.contributor.authorConstant, Noah
dc.contributor.authorRoberts, Adam
dc.contributor.authorKale, Mihir
dc.contributor.authorAl‑Rfou, Rami
dc.contributor.authorSiddhant, Aditya
dc.contributor.authorBarua, Aditya
dc.contributor.authorRaffel, Colin
dc.date.accessioned2025-06-02T13:31:54Z
dc.date.available2025-06-02T13:31:18Z
dc.date.available2025-06-02T13:31:54Z
dc.date.issued2020-10-22
dc.description영어 위주의 T5를 101개 언어로 확장한 mT5 모델을 제안하며, 다국어 벤치마크에서 우수한 결과를 보여줍니다. 특히 제로샷 번역 시 특정 언어로 전이되는 문제를 완화한 전략도 포함됩니다 ©2020 Google Research
dc.description.abstractThe recent "Text-to-Text Transfer Transformer" (T5) leveraged a unified text-to-text format and scale to attain state-of-the-art results on a wide variety of English-language NLP tasks. In this paper, we introduce mT5, a multilingual variant of T5 that was pre-trained on a new Common Crawl-based dataset covering 101 languages. We detail the design and modified training of mT5 and demonstrate its state-of-the-art performance on many multilingual benchmarks. We also describe a simple technique to prevent "accidental translation" in the zero-shot setting, where a generative model chooses to (partially) translate its prediction into the wrong language. All of the code and model checkpoints used in this work are publicly available.
dc.description.sponsorshipGoogle Research
dc.identifier.urihttps://arxiv.org/abs/2010.11934
dc.identifier.urihttp://data.inu.ac.kr/handle/123456789/1960.2
dc.language.isoen_US
dc.publisherarXiv
dc.subjectmT5
dc.subjectMultilingual NLP
dc.subjectText-to-Text
dc.subjectTransfer Learning
dc.subjectZero-shot
dc.titlemT5: A Massively Multilingual Pre-trained Text-to-Text Transformer
dc.typeArticle

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
2010.11934v3.pdf
Size:
729.13 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
97 B
Format:
Item-specific license agreed to upon submission
Description:

Version History

Now showing 1 - 2 of 2
VersionDateSummary
2*
2025-06-02 22:31:37
add subject
2025-06-02 22:31:18
* Selected version