Key points are not available for this paper at this time.
Despite recent progress in abstractive summarization, systems still suffer from faithfulness errors. While prior work has proposed models that improve faithfulness, it is unclear whether the improvement comes from an increased level of extractiveness of the model outputs as one naive way to improve faithfulness is to make summarization models more extractive. In this work, we present a framework for evaluating the effective faithfulness of summarization systems, by generating a faithfulnessabstractiveness trade-off curve that serves as a control at different operating points on the abstractiveness spectrum. We then show that the baseline system as well as recently proposed methods for improving faithfulness, fail to consistently improve over the control at the same level of abstractiveness. Finally, we learn a selector to identify the most faithful and abstractive summary for a given document, and show that this system can attain higher faithfulness scores in human evaluations while being more abstractive than the baseline system on two datasets. Moreover, we show that our system is able to achieve a better faithfulnessabstractiveness trade-off than the control at the same level of abstractiveness.
Building similarity graph...
Analyzing shared references across papers
Loading...
Faisal Ladhak
Esin Durmus
He He
Stanford University
Cornell University
Columbia University
Building similarity graph...
Analyzing shared references across papers
Loading...
Ladhak et al. (Sat,) studied this question.
www.synapsesocial.com/papers/6a087fd1ab15ea61dee8e377 — DOI: https://doi.org/10.18653/v1/2022.acl-long.100
Synapse has enriched one closely related paper. Consider it for comparative context: