What does this research mean for the field?

Log-sequence representation significantly influences anomaly detection performance, with open-vocabulary semantic and hybrid embeddings improving robustness to out-of-vocabulary effects. Novelty: ClaimNovelty.NOVEL_FINDING. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

This research aims to assess how different log-sequence embeddings impact anomaly detection performance.

March 1, 2026Open Access

Investigating the Impact of Log-Sequence Embeddings on Anomaly Detection: A Systematic Study

Key Points

This research aims to assess how different log-sequence embeddings impact anomaly detection performance.
Three types of sequence embeddings: template-ID lookup, semantic, and hybrid.
Embedded sequences paired with CNN, LSTM, and Transformer models.
Controlled experiments conducted on various public datasets.
Evaluation metrics included PR–AUC, AUROC, F1, precision at high recall.
Sequence representation significantly influences anomaly detection performance.
Open-vocabulary semantic and hybrid embeddings improved robustness to OOV effects.
Transfer gains between datasets were inconsistent and showed degradation under strict conditions.

Abstract

Operational logs are a central information source for monitoring and diagnosing complex information systems, yet the effect of log-sequence representation on anomaly detection remains underexplored. This paper investigates three families of sequence embeddings, E1 (template-ID lookup), E2 (semantic), and E3 (hybrid), for log-based anomaly detection. Each embedding is paired with CNN, LSTM, and Transformer heads under a unified training protocol. We conduct controlled experiments on diverse public corpora to assess in-domain and cross-dataset generalization. We report PR–AUC (primary), AUROC, F1, and precision at recall ≥0.9, with 95% bootstrap confidence intervals. Beyond accuracy, we analyze the impact of sequence length, parser choice, and out-of-vocabulary (OOV) rates at both token and template levels within and across datasets. The results suggest that representation choice can meaningfully influence detection performance, particularly under distribution shift. Open-vocabulary semantic and hybrid embeddings can improve robustness to OOV effects, but transfer gains are inconsistent, and degradation often persists under strict cross-dataset transfer.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Musaad Alzahrani (Fri,) studied this question.

synapsesocial.com/papers/69a3d8b8ec16d51705d2fd4a https://doi.org/https://doi.org/10.3390/info17030228

Bookmark

View Full Paper