March 15, 2024

A Distributed Training Approach on Email Spam Classification using DistilBERT

Key Points

Key points are not available for this paper at this time.

Abstract

The exponential rise of daily emails raises concerns about spam, which can be intrusive and harmful to user data. Effective email classification is crucial to address this issue. This study proposes a system using the DistilBERT model to identify spam and non-spam (ham) emails. We leverage distributed training with Hugging Face's Accelerate library to significantly reduce training time. Compared to a non-distributed approach, this method achieves a 46.39% reduction in training time while maintaining 96% accuracy. We recommend exploring multi-GPU training in future work for further efficiency gains.

AI에게 질문

Bookmark

Cite This Study

Padilla et al. (Fri,) studied this question.

synapsesocial.com/papers/68e73fd5b6db6435876b92c7 https://doi.org/https://doi.org/10.1109/icict62343.2024.00028

AI에게 질문

Bookmark