Korean Generative Pre-trained Transformer

No Thumbnail Available

Date

2021-12-06

Journal Title

Journal ISSN

Volume Title

Publisher

arXiv

Abstract

With the advent of Transformer, which was used in translation models in 2017, attention-based architectures began to attract attention. Furthermore, after the emergence of BERT, which strengthened the NLU-specific encoder part, which is a part of the Transformer, and the GPT architecture, which strengthened the NLG-specific decoder part, various methodologies, data, and models for learning the Pretrained Language Model began to appear. Furthermore, in the past three years, various Pretrained Language Models specialized for Korean have appeared. In this paper, we intend to numerically and qualitatively compare and analyze various Korean PLMs released to the public.

Description

KoGPT는 한국어 언어 특성에 최적화된 생성형 언어모델로, 한국어 자연어처리(NLP) 태스크에서의 성능 향상을 목표로 개발되었습니다. ©2021 NAVER AI Lab

Keywords

KoGPT, Korean NLP, LLM, Language Model, Generation

Citation

Version History

Now showing 1 - 2 of 2
VersionDateSummary
2*
2025-06-02 22:40:35
add abstract
2025-06-02 22:05:39
* Selected version