May 29, 2020

CoClean: Collaborative Data Cleaning

Abstract

High-quality data is crucial for many applications, but real-life data is often dirty. Unfortunately, automated solutions are often not trustworthy and are thus seldom employed in practice. In real-world scenarios, it is often necessary to resort to manual cleaning to obtain pristine data. Existing human-in-the-loop solutions, such as Trifacta and OpenRefine, typically involve a single user. Relying on one user is often error-prone, is limited to a single person's expertise, and cannot scale with the ever-growing volume, variety and veracity of data.
