March 6, 2024Open Access

Metric-aware LLM inference

Key Points

Key points are not available for this paper at this time.

Abstract

Large language models (LLMs) have demonstrated strong results on a range of NLP tasks. Typically, outputs are obtained via autoregressive sampling from the LLM's underlying distribution. We show that this inference strategy can be suboptimal for a range of tasks and associated evaluation metrics. As a remedy, we propose metric aware LLM inference: a decision theoretic approach optimizing for custom metrics at inference time. We report improvements over baselines on academic benchmarks and publicly available models.

Metric-aware LLM inference

Key Points

Abstract

Cite This Study

Also Consider

Also Consider