January 1, 2019
Open Access

HIBERT: Document Level Pre-training of Hierarchical Bidirectional Transformers for Document Summarization

Abstract

Neural extractive summarization models usually employ a hierarchical encoder for document encoding, and they are trained using sentence-level labels, which are created heuristically using rule-based methods. Training the hierarchical encoder with these inaccurate labels is challenging. Inspired by the recent work on pre-training transformer sentence encoders, we propose HIBERT for document encoding and a method to pre-train it using unlabeled data. We apply the pre-trained HIBERT to our summarization model, and it outperforms its randomly initialized counterpart by 1.25 ROUGE on the CNN/Dailymail dataset and by 2.0 ROUGE on a version of the New York Times dataset. We also achieve state-of-the-art performance on these two datasets.

Cite This Study

Zhang et al. (2019) studied this question.

synapsesocial.com/papers/6a0fdc44b6f5ee0401600947 https://doi.org/10.18653/v1/p19-1499
