Los puntos clave no están disponibles para este artículo en este momento.
Data is the most important component of any organisation for meeting productivity needs or increasing profits. Data over the cloud is rapidly growing and its handling and processing is challenging due to its varied nature. Rapid data generation necessitates unique database storage. Deduplication is a technology that removes duplicate data from databases while still providing data backup. Data deduplication algorithms may locate and delete redundant data, resulting in a unique copy of the contents, reducing the amount of transmitted data over the network, and saving storage space and energy. Data deduplication is categorised into different levels. At its core, data deduplication works in four stages: chunking, fingerprinting, indexing, and writing. The process of finding duplicate data takes place in the indexing stage. It is a powerful tool for organizations to improve data quality and helps in decision-making. Data deduplication is widely used in backup and archive systems, as well as primary storage settings, to cut storage costs, improve performance, and simplify data administration. Numerous authors have developed different methods of deduplicating the data. Every technique has its way of implementing deduplication and removing redundant data from the cloud. This paper provides a brief insight into the indexing techniques of deduplication in cloud storage.
Goel et al. (Wed,) studied this question.