October 23, 2019 · Open Access

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Abstract

Transfer learning, where a model is first pre-trained on a data-rich task before being fine-tuned on a downstream task, has emerged as a powerful technique in natural language processing (NLP). The effectiveness of transfer learning has given rise to a diversity of approaches, methodology, and practice. In this paper, we explore the landscape of transfer learning techniques for NLP by introducing a unified framework that converts all text-based language problems into a text-to-text format. Our systematic study compares pre-training objectives, architectures, unlabeled data sets, transfer approaches, and other factors on dozens of language understanding tasks. By combining the insights from our exploration with scale and our new "Colossal Clean Crawled Corpus", we achieve state-of-the-art results on many benchmarks covering summarization, question answering, text classification, and more. To facilitate future work on transfer learning for NLP, we release our data set, pre-trained models, and code.
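
To make the text-to-text idea concrete, the sketch below casts a few different tasks into the same shape: a plain input string and a plain target string. It is a minimal illustration, not the authors' released code. The function name `to_text_to_text`, the `TextToTextExample` type, the task names, and the exact prefix strings are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Mapping


@dataclass
class TextToTextExample:
    """One training pair: both sides are plain strings."""
    source: str
    target: str


def to_text_to_text(task: str, fields: Mapping[str, str], target: str) -> TextToTextExample:
    """Cast a task-specific example into a (source, target) string pair.

    The prefixes below are hypothetical; they only illustrate that a short
    textual prefix can tell a single model which task to perform.
    """
    if task == "translation":
        source = f"translate English to German: {fields['text']}"
    elif task == "summarization":
        source = f"summarize: {fields['text']}"
    elif task == "classification":
        # The class label is emitted as text, not as a class index.
        source = f"classify sentiment: {fields['text']}"
    elif task == "question_answering":
        source = f"question: {fields['question']} context: {fields['context']}"
    else:
        raise ValueError(f"unknown task: {task}")
    return TextToTextExample(source=source, target=target)


examples = [
    to_text_to_text("translation", {"text": "That is good."}, "Das ist gut."),
    to_text_to_text("classification", {"text": "A delightful film."}, "positive"),
    to_text_to_text(
        "question_answering",
        {"question": "Who wrote it?", "context": "It was written by Ada."},
        "Ada",
    ),
]
```

Because every task reduces to string in, string out, the same model, training objective, and decoding procedure can in principle be reused across all of them, which is what makes a systematic comparison across tasks practical.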

Cite This Study

Raffel et al. (2019) studied this question.

synapsesocial.com/papers/6984b6e33ee498a9db49a3e6 https://doi.org/10.48550/arxiv.1910.10683
