What question did this study set out to answer?

The study asks how interpretable classification rules can be extracted efficiently from high-dimensional big data, using rule-based granular computing, fuzzy clustering, and ensemble learning.

April 17, 2026 | Open Access

Knowledge Extraction From Big Data Using Feature Classification and Ensemble Learning With Granular Computing

Key Points

The aim is to optimize knowledge extraction from high-dimensional big data through advanced classification techniques.
Utilized rule-based granular computing and fuzzy clustering for preliminary rule extraction.
Employed ensemble learning to optimize Takagi–Sugeno–Kang rules.
Conducted performance comparisons with conventional algorithms such as Random Forest and standard gradient boosting.
Effective feature selection through fuzzy clustering contributed significantly to generating optimal rules.
The proposed model outperformed conventional algorithms significantly.
Experiments demonstrated notable performance enhancements in data analysis efficiency.

Abstract

Managing large‐scale data, particularly data with high‐dimensional features, poses a significant challenge in data analysis. With the rapid growth in data generation and the increasing importance of data analysis for researchers, extracting meaningful patterns from big data has become a critical concern. Data analysts use data mining techniques to discover knowledge, extract patterns, and generate rules. However, a large number of features often leads to substantial computational costs and requires advanced hardware resources. Moreover, the ability to derive interpretable classification rules in high‐dimensional big data systems is essential. This paper proposes a method for knowledge extraction from big data using rule‐based granular computing (GrC). First, the architecture extracts preliminary fuzzy rules using GrC techniques and fuzzy clustering. It then employs ensemble learning to generate optimized Takagi–Sugeno–Kang (TSK) rules. The results indicate that effective feature selection through fuzzy clustering contributes significantly to the generation of optimal rules. Furthermore, experimental outcomes demonstrate that the proposed model achieves notable performance improvements compared to conventional algorithms such as Random Forest and standard gradient boosting.
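
The two stages described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes textbook fuzzy c-means for the clustering stage, Gaussian rule antecedents centered on the cluster centers, and first-order TSK consequents fitted by ordinary least squares, which stands in for the ensemble learning step that the abstract does not specify. The function names, the shared width `sigma`, and the synthetic two-cluster data are all hypothetical choices made for illustration.

```python
import numpy as np


def fuzzy_c_means(X, n_clusters, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Textbook fuzzy c-means. Returns (centers, membership matrix U)."""
    rng = np.random.default_rng(seed)
    U = rng.random((X.shape[0], n_clusters))
    U /= U.sum(axis=1, keepdims=True)  # memberships of each sample sum to 1
    for _ in range(n_iter):
        W = U ** m
        centers = (W.T @ X) / W.sum(axis=0)[:, None]
        dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        dist = np.maximum(dist, 1e-12)  # guard against division by zero
        inv = dist ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return centers, U


def firing_strengths(X, centers, sigma):
    """Normalized Gaussian activation of each rule (one rule per cluster)."""
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    act = np.exp(-d2 / (2.0 * sigma ** 2))
    return act / np.maximum(act.sum(axis=1, keepdims=True), 1e-12)


def fit_tsk(X, y, centers, sigma):
    """Fit first-order TSK consequents (one linear model per rule).

    Assumption: plain least squares replaces the paper's ensemble step.
    """
    phi = firing_strengths(X, centers, sigma)            # (n, rules)
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])        # append bias column
    design = (phi[:, :, None] * Xb[:, None, :]).reshape(X.shape[0], -1)
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef.reshape(centers.shape[0], -1)            # (rules, d + 1)


def predict_tsk(X, centers, sigma, coef):
    """Output is the firing-strength-weighted sum of the rule consequents."""
    phi = firing_strengths(X, centers, sigma)
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])
    return (phi * (Xb @ coef.T)).sum(axis=1)


# Synthetic two-class data (hypothetical, for illustration only).
rng = np.random.default_rng(42)
X = np.vstack([rng.normal(-2.0, 0.5, (100, 2)), rng.normal(2.0, 0.5, (100, 2))])
y = np.repeat([0.0, 1.0], 100)

centers, U = fuzzy_c_means(X, n_clusters=2)
coef = fit_tsk(X, y, centers, sigma=1.0)
scores = predict_tsk(X, centers, sigma=1.0, coef=coef)
accuracy = np.mean((scores > 0.5) == (y > 0.5))
print(f"training accuracy: {accuracy:.3f}")
```

Each row of `coef` reads as one rule: "if x is near `centers[k]`, then the output is the linear function `coef[k, :-1] @ x + coef[k, -1]`". That per-rule readability is what makes a TSK system a candidate for the interpretable classification rules the abstract calls for.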
