What question did this study set out to answer?

To improve log parsing accuracy and efficiency using open-source large language models while addressing privacy and cost issues.

February 11, 2026

Unsupervised, Accurate, and Efficient Log Parsing Using Smaller Open-Source Large Language Models

Key Points

To improve log parsing accuracy and efficiency using open-source large language models while addressing privacy and cost issues.
Developed LibreLog as an unsupervised log parsing approach using open-source LLMs.
Implemented a fixed-depth tree to group logs for parsing.
Utilized similarity scoring for retrieval augmented generation and self-reflection for template refinement.
LibreLog achieved GA 87.2, PA 85.4, FGA 82.3, and FTA 65.1 on LogHub-2.0.
Outperformed previous LLM-based parsers in PA and FTA by 13.7% and 6.9%, respectively.
Processed all logs in 5.94 hours, achieving a 1.7 times speedup over the fastest LLM parser.

Abstract

Log parsing transforms unstructured logs into structured templates for downstream analysis. Syntax-based parsers are fast but lose accuracy on logs that deviate from predefined rules. Recently, large language models (LLMs) based log parsers have shown superior parsing accuracy but face three issues: (1) manual labeling for fine-tuning or in-context learning, (2) high cost from large volumes and limited context size of LLMs, and (3) privacy risks with commercial models. We present LibreLog, an unsupervised approach using open-source LLMs to enhance privacy and reduce cost while achieving state-of-the-art accuracy. LibreLog groups logs with a fixed-depth tree, then parses each group via: (i) similarity scoring-based retrieval augmented generation, (ii) self-reflection to refine templates, and (iii) a template memory to reduce LLM queries. On LogHub-2.0, LibreLog achieves GA 87.2, PA 85.4, FGA 82.3, and FTA 65.1, PA and FTA outperforming prior state-of-the-art LLM-based parsers by 13.7% and 6.9%, respectively. LibreLog processes all logs in 5.94 hours, a 1.7 times speedup over the fastest LLM parser. Using a larger LLM only for self-reflection further improves PA to 86.3 and FTA to 68.3 with a moderate runtime cost increase (31%). In short, LibreLog addresses privacy and cost concerns of using commercial LLMs while achieving state-of-the-art parsing efficiency and accuracy.

Bookmark

Cite This Study

Ma et al. (Sat,) studied this question.

synapsesocial.com/papers/698c1c53267fb587c655ebdb https://doi.org/https://doi.org/10.1145/3796239

Bookmark