What question did this study set out to answer?

The aim is to improve named entity recognition in drilling reports using a fine-tuned BERT model.

April 4, 2026

Fine-Tuning a Robustly Optimized Encoder Representation from Transformers Model for Domain-Specific Named Entity Recognition in Drilling Reports

Key Points

The aim is to improve named entity recognition in drilling reports using a fine-tuned BERT model.
Curated a domain-specific corpus for training.
Utilized low-rank adaptation for parameter-efficient fine-tuning.
Executed iterative error analysis to enhance annotation quality.
Conducted tests on a held-out set to evaluate performance.
Achieved F1-scores over 0.97 for entity recognition tasks.
Demonstrated strong generalization across different entity types.
Highlighted the effectiveness of medium-sized models requiring fewer resources.

Abstract

Summary The application of language models in petroleum engineering, particularly for the analysis of daily drilling reports (DDRs), has become an area of increasing interest, given the need for automated information extraction. Through this study, we investigated the fine-tuning of a medium-sized language model from the bidirectional encoder representations from transformers (BERT) family (approximately 60–130 million parameters) for named entity recognition (NER) in drilling operations and mud motor reports. A domain-specific corpus was curated using a custom annotation support function to streamline the labeling process and further refined through iterative error analysis. This approach enabled the correction of inconsistencies such as mislabeling of lobe configurations, units, and contextual definitions, ultimately enhancing annotation quality and model performance. Fine-tuning was carried out using low-rank adaptation (LoRA), enabling parameter-efficient training by updating only a small subset of model weights. The fine-tuned model demonstrated strong generalization on a held-out test set, achieving F1-scores exceeding 0.97 across both frequent and infrequent entity types. These findings underscore the importance of high-quality annotations and targeted fine-tuning strategies in achieving reliable domain adaptation. Furthermore, the study highlights that fine-tuned medium-sized models BERT, robustly optimized encoder representation from transformers (RoBERTa), distilled BERT (DistilBERT) can achieve strong performance while requiring significantly fewer computational resources, suggesting their potential suitability for future deployment in offline or edge environments where computational and connectivity constraints limit the use of larger models.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Mahtab Ghoroori

Zhangxing Chen

Gerritt Hooff

Journals

SPE Journal

Actions

Institutions

Ningbo University of Technology

Cenovus Energy (Canada)

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Fine-Tuning a Robustly Optimized Encoder Representation from Transformers Model for Domain-Specific Named Entity Recognition in Drilling Reports

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study